About Anaconda Help Download Anaconda

Efficient MinHashing

copied from cf-staging / pyminhash

Installers

  • noarch v0.1.5

conda install

To install this package run one of the following:
conda install conda-forge::pyminhash

Description

MinHashing is a very efficient way of finding similar records in a dataset based on Jaccard similarity. PyMinHash implements efficient minhashing for Pandas dataframes. See instructions below or look at the example notebook to get started.

Developed by Frits Hermans

PyPI: https://pypi.org/project/PyMinHash/


© 2024 Anaconda, Inc. All Rights Reserved. (v4.0.4) Legal | Privacy Policy