A python library for accurate and scalable fuzzy matching, record deduplication and entity-resolution.
conda install sfe1ed40::dedupe
dedupe is a python library that uses machine learning to perform fuzzy matching, deduplication and entity resolution quickly on structured data.