maxlen / webcrawler
Search engine crawlers
Installs: 10
Dependents: 0
Suggesters: 0
Security: 0
Stars: 0
Watchers: 1
Forks: 0
Open Issues: 0
Type:extension
Requires
- php: >=5.4.0
- electrolinux/phpquery: dev-master
- guzzlehttp/guzzle: ~6.0
This package is not auto-updated.
Last update: 2025-02-01 22:03:01 UTC
README
Search engines crawlers
Google example:
$proxy = []; //['host' => '*.*.*.*', 'port' => '', 'login' => '', 'password' => '']
$params = ['query' => 'test search', 'page' => $page, 'proxy' => $proxy];
$crawler = new WebCrawler(['strategy' => new GoogleSearch()]);
print_r($crawler->crawl($params));
Site-parse example:
$params = ['url' => 'http://your-site.com', 'proxy' => []];
$crawler = new WebCrawler(['strategy' => new SiteSearch()]);
print_r($crawler->crawl($params));