Package Name | Access | Summary | Updated |
---|---|---|---|
simhash | public | Near-Duplicate Detection with Simhash | 2025-03-25 |
pydepta | public | A Python implementation of DEPTA | 2025-03-25 |
slybot | public | Slybot crawler | 2025-03-25 |
scrapely | public | A pure-python HTML screen-scraping library | 2025-03-25 |
scrapinghub | public | No Summary | 2025-03-25 |