crwlr/schema-org

Extract schema.org structured data from HTML documents.

v0.3.1 2023-11-30 12:02 UTC

This package is auto-updated.

Last update: 2024-04-30 00:27:59 UTC


schema.org Structured Data Extraction

This library extracts schema.org structured data in JSON-LD format from HTML documents. It looks for script tags in the HTML that contain JSON-LD objects, reads the data, and populates instances of the corresponding classes from spatie's schema-org package. Shout-out to the awesome guys at spatie!
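
To illustrate the general idea of the extraction step, here is a minimal sketch in Python using only the standard library. It is not this package's implementation or API: the class `JsonLdExtractor` and the function `extract_json_ld` are hypothetical names made up for this example. The sketch assumes that JSON-LD is embedded in script tags with the type `application/ld+json`, and it returns plain dictionaries instead of typed schema.org objects.

```python
import json
from html.parser import HTMLParser


class JsonLdExtractor(HTMLParser):
    """Hypothetical helper: collects JSON-LD objects found in script tags."""

    def __init__(self):
        super().__init__()
        self._in_json_ld = False
        self._buffer = []
        self.objects = []

    def handle_starttag(self, tag, attrs):
        # Only script tags declared as JSON-LD are of interest.
        script_type = (dict(attrs).get("type") or "").strip().lower()
        if tag == "script" and script_type == "application/ld+json":
            self._in_json_ld = True
            self._buffer = []

    def handle_data(self, data):
        # Script content may arrive in several chunks, so buffer it.
        if self._in_json_ld:
            self._buffer.append(data)

    def handle_endtag(self, tag):
        if tag != "script" or not self._in_json_ld:
            return
        self._in_json_ld = False
        try:
            parsed = json.loads("".join(self._buffer))
        except json.JSONDecodeError:
            return  # Skip blocks that are not valid JSON.
        # A block may hold a single object or a list of objects.
        if isinstance(parsed, list):
            self.objects.extend(parsed)
        else:
            self.objects.append(parsed)


def extract_json_ld(html):
    """Return all JSON-LD objects found in the given HTML string."""
    parser = JsonLdExtractor()
    parser.feed(html)
    parser.close()
    return parser.objects
```

Given a document with one JSON-LD script tag describing an `Article`, `extract_json_ld` returns a list with one dictionary whose `@type` key is `"Article"`. Mapping such dictionaries to typed classes, as this library does with spatie's schema-org classes, is a separate step that the sketch leaves out.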

Documentation

You can find the documentation at crwlr.software.

Contributing

If you are considering contributing to this package, please read the contribution guide (CONTRIBUTING.md) first.