Packages from crwlr

  • PHP

    crwlr/crawler

    Web crawling and scraping library.

  • PHP

    crwlr/crawler-ext-browser

    Extension for the crwlr/crawler package containing steps utilizing a headless browser.

  • PHP

    crwlr/crwl-ext-browser

    Extension configurations for integration of crwlr/crawler-ext-browser into the crwl.io app.

  • PHP

    crwlr/crwl-extension-utils

    Utils for extension packages for the crwl.io app.

  • PHP

    crwlr/html-2-text

    Convert HTML to formatted plain text.

  • PHP

    crwlr/query-string

    A library for convenient handling of query strings used in HTTP requests.

  • PHP

    crwlr/robots-txt

    Robots Exclusion Standard/Protocol parser for web crawling and scraping.

  • PHP

    crwlr/schema-org

    Extract schema.org structured data from HTML documents.

  • PHP

    crwlr/url

    Swiss Army knife for URLs.

  • PHP

    crwlr/utils

    Utilities that are needed in multiple crawler packages.