frontera
|
public |
A flexible frontier for web crawlers
|
2025-03-25 |
cssselect
|
public |
cssselect parses CSS3 Selectors and translates them to XPath 1.0
|
2025-03-25 |
webstruct
|
public |
A library for creating statistical NER systems that work on HTML data
|
2025-03-25 |
retrying
|
public |
Retrying
|
2025-03-25 |
hubstorage
|
public |
Client interface for Scrapinghub HubStorage
|
2025-03-25 |
pydispatcher
|
public |
Multi-producer-multi-consumer signal dispatching mechanism
|
2025-03-25 |
parsel
|
public |
Parsel is a library to extract data from HTML and XML using XPath and CSS selectors
|
2025-03-25 |
pyasn1-modules
|
public |
A collection of ASN.1-based protocols modules.
|
2025-03-25 |
scrapy
|
None |
A high-level Web Crawling and Web Scraping framework
|
2025-03-25 |