jansuchanek / nscraper
Web scraper for competitor product analysis — Heureka, Google, JSON-LD extraction
v1.0.0
2026-03-19 08:33 UTC
Requires
- php: >=8.1
- ext-curl: *
Requires (Dev)
- nette/tester: ^2.6
README
Competitor product scraping from Heureka.cz, Google search snippets, and arbitrary URLs via JSON-LD extraction. No dependencies beyond ext-curl.
Installation
composer require jansuchanek/nscraper
Usage
use NScraper\WebScraper;

$scraper = new WebScraper();

// Scrape Heureka.cz product descriptions
$data = $scraper->scrapeHeureka('Samsung Galaxy S24');
// ['descriptions' => [...], 'source' => 'heureka']

// Google search snippets
$data = $scraper->scrapeGoogle('Samsung Galaxy S24');

// Scrape any URL (JSON-LD, meta, content extraction)
$desc = $scraper->scrapeUrl('https://example.com/product/123');

// Build context for AI pipelines
$context = $scraper->buildCompetitorContext('Samsung Galaxy S24');
Extraction Strategies
- JSON-LD structured data (@type: Product)
- Meta description tag
- CSS selectors: .product-desc, .description, #description
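
To illustrate how the first two strategies can be chained, here is a minimal sketch in plain PHP. It is not NScraper's actual implementation: the function name extractDescription is hypothetical, and the regex-based parsing is an assumption chosen to avoid extra extensions. It tries JSON-LD Product data first and falls back to the meta description tag. The CSS selector step is omitted because it would need a DOM parser.

```php
<?php
declare(strict_types=1);

/**
 * Hypothetical helper (not part of NScraper's API): returns the first
 * Product description found in JSON-LD, else the meta description, else null.
 */
function extractDescription(string $html): ?string
{
    // 1. JSON-LD: inspect every <script type="application/ld+json"> block.
    $pattern = '~<script[^>]*type=["\']application/ld\+json["\'][^>]*>(.*?)</script>~is';
    if (preg_match_all($pattern, $html, $matches)) {
        foreach ($matches[1] as $json) {
            $data = json_decode(trim($json), true);
            if (!is_array($data)) {
                continue; // malformed JSON, try the next block
            }
            // A block may hold a single object or a list of objects.
            $items = array_is_list($data) ? $data : [$data];
            foreach ($items as $item) {
                if (is_array($item)
                    && ($item['@type'] ?? null) === 'Product'
                    && is_string($item['description'] ?? null)
                    && $item['description'] !== ''
                ) {
                    return $item['description'];
                }
            }
        }
    }

    // 2. Fallback: <meta name="description" content="...">.
    //    Simplification: assumes the name attribute precedes content.
    $meta = '~<meta[^>]*name=["\']description["\'][^>]*content=(["\'])(.*?)\1~is';
    if (preg_match($meta, $html, $m)) {
        return html_entity_decode($m[2], ENT_QUOTES | ENT_HTML5, 'UTF-8');
    }

    return null;
}
```

A call such as extractDescription($html) ?? 'no description found' gives a safe default when neither strategy matches. Regular expressions keep the sketch dependency-free, but they are fragile on unusual markup (for example JSON-LD nested under @graph, or @type given as an array), so a real scraper would likely need more cases.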
Requirements
- PHP >= 8.1
- ext-curl