thingston / extractor
Collection of PHP classes to extract data from HTML pages.
0.4.0
2019-11-26 15:02 UTC
Requires
- php: ^7.1
- guzzlehttp/psr7: ^1.4
- jwage/purl: ^0.0.10
- neitanod/forceutf8: ^2.0
- patrickschur/language-detection: ^3.4
- symfony/css-selector: ^5.0
- symfony/dom-crawler: ^5.0
Requires (Dev)
- phpunit/phpunit: ^7.4
- squizlabs/php_codesniffer: ^3.3
- symfony/var-dumper: ^4.1
README
Collection of PHP classes to extract data from HTML pages.
Requirements
Thingston Extractor requires:
- PHP 7.1 or above (the Composer constraint ^7.1 covers the 7.x series).
Installation
Add Thingston Extractor to any PHP project using Composer:
composer require thingston/extractor
Getting Started
use Thingston\Extractor\Page;

$uri = 'https://google.com/';
$html = file_get_contents($uri);
$page = Page\PageExtractor::create($html, $uri)->extract();
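The Getting Started snippet passes the result of file_get_contents() straight to the extractor, but file_get_contents() returns false when the download fails. A minimal sketch of a more defensive variant follows. It assumes the only library call is the PageExtractor::create($html, $uri)->extract() chain shown above; fetchHtml() and extractPage() are hypothetical helper names, not part of Thingston Extractor.

```php
<?php

declare(strict_types=1);

use Thingston\Extractor\Page\PageExtractor;

/**
 * Hypothetical helper (not part of Thingston Extractor): fetch the raw
 * HTML for a URI or local path, throwing instead of returning false.
 */
function fetchHtml(string $uri): string
{
    // file_get_contents() returns false and emits a warning on failure;
    // the @ operator silences the warning so an exception is raised instead.
    $html = @file_get_contents($uri);

    if ($html === false) {
        throw new RuntimeException(sprintf('Unable to fetch "%s".', $uri));
    }

    return $html;
}

/**
 * Hypothetical wrapper around the extractor call shown in Getting Started.
 * The return type is left undeclared because this README does not state it.
 */
function extractPage(string $uri)
{
    return PageExtractor::create(fetchHtml($uri), $uri)->extract();
}
```

With Composer's autoloader loaded (require 'vendor/autoload.php';), calling extractPage('https://google.com/') should behave like the snippet above, except that a failed download surfaces as a RuntimeException rather than a false value reaching the extractor.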
Contributors
Open source is built on contributions. If you want to contribute to Thingston, please follow these steps:
- Fork the latest version into your own repository.
- Write your changes or additions and commit them.
- Follow PSR-2 coding style standard.
- Make sure you have unit tests that fully cover your changes.
- Go to GitHub Pull Requests at https://github.com/thingston/extractor/pulls and create a new request.
Thank you!
Changes and Versioning
All relevant changes to this code are logged in a separate log file.
Version numbers follow recommendations from Semantic Versioning.
License
Thingston code is maintained under The MIT License.