crwlr/robots-txt

Robots Exclusion Standard/Protocol Parser for Web Crawling/Scraping

v1.1.1 2022-11-08 12:25 UTC

This package is auto-updated.

Last update: 2024-11-06 13:55:03 UTC


README

crwlr.software logo

Robots Exclusion Standard/Protocol Parser

for Web Crawling/Scraping

Use this library within crawler/scraper programs to parse robots.txt files and check if your crawler user-agent is allowed to load certain paths.

Documentation

You can find the documentation at crwlr.software.

Contributing

If you consider contributing something to this package, read the contribution guide (CONTRIBUTING.md).