scrubadub_spacy
Use Spacy NER models to clean personally identifiable information from dirty dirty text.
Use Spacy NER models to clean personally identifiable information from dirty dirty text.
To install this package, run one of the following:
scrubadub removes personally identifiable information from text. scrubadubspacy is an extension that uses spaCy NLP models to remove personal information from text. This package contains two extra detectors: scrubadubspacy.detectors.SpacyEnityDetector - A detector that uses the spacy NER model to find locations, names, dates and other entities. scrubadub_spacy.detectors.SpacyNameDetector - A detector that uses the spacy NER model and context words to find names in text.
Summary
Use Spacy NER models to clean personally identifiable information from dirty dirty text.
Last Updated
Feb 5, 2025 at 12:08
License
Apache-2.0
Total Downloads
3.1K
Supported Platforms
GitHub Repository
https://github.com/LeapBeyond/scrubadub_spacyDocumentation
https://scrubadub.readthedocs.io/