splink
Fast, accurate and scalable probabilistic data linkage with support for multiple SQL backends
Fast, accurate and scalable probabilistic data linkage with support for multiple SQL backends
To install this package, run one of the following:
Splink is a Python package for probabilistic record linkage (entity resolution) that allows you to deduplicate and link records from datasets without unique identifiers.
Summary
Fast, accurate and scalable probabilistic data linkage with support for multiple SQL backends
Last Updated
Apr 8, 2026 at 08:44
License
MIT
Total Downloads
75
Version Downloads
75
Supported Platforms
GitHub Repository
https://github.com/moj-analytical-services/splinkDocumentation
https://moj-analytical-services.github.io/splink