mnb / scraperkit
MNB ScraperKit V1.0.3 - enterprise-ready PHP crawling and data extraction framework with AI crawl intelligence, search discovery, authorized mail/webmail extraction connectors, publisher metadata workflows, extraction recipes, provenance, quality reports, datasets, queues, dashboards, and compliance
v1.0.8
2026-06-12 22:40 UTC
Requires
- php: >=8.2
- ext-dom: *
- ext-json: *
- ext-mbstring: *
- symfony/console: ^6.4 || ^7.0
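The platform requirements listed above can also be verified by hand before bootstrapping an application. The following is a minimal sketch using only built-in PHP functions; the helper name `missingRequirements` is hypothetical and is not part of ScraperKit.

```php
<?php
declare(strict_types=1);

/**
 * Returns the unmet platform requirements (an empty array when all are met).
 * Hypothetical helper for illustration; not part of ScraperKit.
 */
function missingRequirements(): array
{
    $missing = [];

    // php: >=8.2
    if (version_compare(PHP_VERSION, '8.2.0', '<')) {
        $missing[] = 'php >= 8.2 (found ' . PHP_VERSION . ')';
    }

    // ext-dom, ext-json, ext-mbstring
    foreach (['dom', 'json', 'mbstring'] as $ext) {
        if (!extension_loaded($ext)) {
            $missing[] = 'ext-' . $ext;
        }
    }

    return $missing;
}

$missing = missingRequirements();
echo $missing === []
    ? "All platform requirements met\n"
    : 'Missing: ' . implode(', ', $missing) . "\n";
```

Composer enforces the same constraints at install time, and `composer check-platform-reqs` reports them for an installed project, so a script like this is only useful as an early runtime guard.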
Suggests
- ext-curl: Recommended for the cURL HTTP engine. The stream/file_get_contents engine works without cURL when allow_url_fopen is enabled.
- ext-openssl: Recommended for secure HTTPS transport and checksum/signature workflows.
- ext-pdo: Needed only for database storage features.
- ext-redis: Optional Redis extension for distributed multi-worker queue mode. A file-based distributed queue fallback works without Redis.
- chrome/chromium: Needed only when using browser-assisted crawling through Panther/Chrome.
- pdo_mysql: Optional MySQL/MariaDB driver for server database storage.
- pdo_sqlite: Optional SQLite driver for local database storage.
- php-ai/php-ml: Optional machine-learning toolkit for future model training/inference. V1.0.3 includes deterministic ML-ready features, dataset annotations, evaluation reports, rule-builder assisted profile creation, training-ready exports, and export connector delivery manifests without requiring this dependency.
- symfony/panther: Optional browser-assisted crawling adapter for JavaScript-rendered pages. Normal PHP HTTP crawling works without it.
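Because every suggested package is optional, an application can probe the environment and fall back as the notes above describe. The sketch below is an assumption-laden illustration: the function names and the labels `curl`, `stream`, `redis`, and `file` are hypothetical and are not ScraperKit configuration values.

```php
<?php
declare(strict_types=1);

/**
 * Chooses an HTTP engine label from the environment.
 * Prefers cURL; falls back to the stream engine when allow_url_fopen is on.
 * Returns null when neither transport is available.
 */
function pickHttpEngine(bool $hasCurl, bool $allowUrlFopen): ?string
{
    if ($hasCurl) {
        return 'curl';
    }
    return $allowUrlFopen ? 'stream' : null;
}

/**
 * Chooses a queue backend label: Redis when the extension is loaded,
 * otherwise the file-based fallback.
 */
function pickQueueBackend(bool $hasRedis): string
{
    return $hasRedis ? 'redis' : 'file';
}

$engine = pickHttpEngine(
    extension_loaded('curl'),
    filter_var(ini_get('allow_url_fopen'), FILTER_VALIDATE_BOOLEAN)
);
$queue = pickQueueBackend(extension_loaded('redis'));

// PDO drivers (e.g. "mysql", "sqlite") matter only for database storage.
$pdoDrivers = extension_loaded('pdo') ? PDO::getAvailableDrivers() : [];

echo 'HTTP engine: ' . ($engine ?? 'none available') . "\n";
echo 'Queue backend: ' . $queue . "\n";
echo 'PDO drivers: ' . ($pdoDrivers === [] ? 'none' : implode(', ', $pdoDrivers)) . "\n";
```

Keeping the decision functions free of direct environment calls makes them easy to test with fixed inputs, while the probing stays in the few lines that call them.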