maxlen/webcrawler

Search engine crawlers

Installs: 10

Dependents: 0

Suggesters: 0

Security: 0

Stars: 0

Watchers: 1

Forks: 0

Open Issues: 0

Type:extension

dev-master 2017-03-29 13:46 UTC

This package is not auto-updated.

Last update: 2024-04-27 17:20:47 UTC


README

Search engines crawlers

Google example:

  $proxy = []; //['host' => '*.*.*.*', 'port' => '', 'login' => '', 'password' => '']
  $params = ['query' => 'test search', 'page' => $page, 'proxy' => $proxy];
  $crawler = new WebCrawler(['strategy' => new GoogleSearch()]);
  print_r($crawler->crawl($params));

Site-parse example:

  $params = ['url' => 'http://your-site.com', 'proxy' => []];
  $crawler = new WebCrawler(['strategy' => new SiteSearch()]);
  print_r($crawler->crawl($params));