byjg / pgbrowser
A 'pretty good' mechanize-like PHP library for managing cookies and submitting forms. [Fork: totally rewritten]
Requires
- php: >=5.3.0
- electrolinux/phpquery: 0.9.*
- pguardiario/phpuri: *
Last update: 2024-11-18 18:23:35 UTC
README
A 'pretty good' mechanize-like PHP library for managing cookies and submitting forms. [Fork version] This fork was completely refactored for full PSR-0 compliance and adds persistence of cookies between connections within the same session instance (useful for crawling).
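To illustrate what cookie persistence within one session instance means, here is a minimal sketch of an in-memory cookie jar. `SimpleCookieJar` and its methods are hypothetical names invented for this example and are not part of PGBrowser. The sketch also ignores cookie attributes such as `Path`, `Domain` and `Expires`, which a real client has to honour.

```php
<?php
// Hypothetical helper, NOT part of PGBrowser: a minimal in-memory cookie jar.
class SimpleCookieJar
{
    // cookie name => cookie value
    private $cookies = array();

    // Record the name=value pair from one raw "Set-Cookie: ..." response header line.
    public function storeFromHeader($headerLine)
    {
        if (stripos($headerLine, 'Set-Cookie:') !== 0) {
            return; // not a cookie header
        }
        $value = trim(substr($headerLine, strlen('Set-Cookie:')));
        // Attributes (Path, Expires, ...) follow the first ';' and are ignored in this sketch.
        $segments = explode(';', $value, 2);
        $parts = explode('=', $segments[0], 2);
        if (count($parts) === 2 && trim($parts[0]) !== '') {
            $this->cookies[trim($parts[0])] = trim($parts[1]);
        }
    }

    // Build the "Cookie: ..." header to send with the next request.
    public function toRequestHeader()
    {
        $pairs = array();
        foreach ($this->cookies as $name => $val) {
            $pairs[] = $name . '=' . $val;
        }
        return 'Cookie: ' . implode('; ', $pairs);
    }
}

$jar = new SimpleCookieJar();
$jar->storeFromHeader('Set-Cookie: session=abc123; Path=/; HttpOnly');
$jar->storeFromHeader('Set-Cookie: lang=en');
$header = $jar->toRequestHeader();
echo $header, "\n";
```

Because the jar lives as long as the object does, every later request made through the same instance can send back the cookies received earlier, which is what makes multi-page crawling behind a login possible.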
    require 'vendor/autoload.php';

    $b = new \PGuardiario\PGBrowser();
    $page = $b->get('http://www.google.com/');

    // Fill in and submit the first form on the page
    $form = $page->form();
    $form->set('q', 'foo');
    $page = $form->submit();

    echo $page->title;
Now do something with $page->html or query it with $page->xpath->query()
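As a rough illustration of the XPath route, the sketch below runs a query with PHP's built-in `DOMDocument` and `DOMXPath` classes against an inline HTML string. It assumes that `$page->xpath` behaves like a standard `DOMXPath` object, which this README does not state explicitly. The HTML string is a stand-in for a fetched page.

```php
<?php
// Stand-in HTML; with PGBrowser this would come from a fetched page.
$html = '<html><head><title>Demo</title></head><body>'
      . '<a href="/one">First</a><a href="/two">Second</a></body></html>';

$doc = new DOMDocument();
// Suppress the warnings that real-world, non-well-formed HTML tends to trigger.
@$doc->loadHTML($html);
$xpath = new DOMXPath($doc);

// Collect href => link text for every anchor that has an href attribute.
$links = array();
foreach ($xpath->query('//a[@href]') as $a) {
    $links[$a->getAttribute('href')] = trim($a->textContent);
}

print_r($links);
```

If the assumption holds, the same `//a[@href]` expression could be passed to `$page->xpath->query()` directly.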
PGBrowser will also let you query the page with phpquery, simple-html-dom, advanced-html-dom or xpath:
    require 'vendor/autoload.php';

    $browser = new \PGuardiario\PGBrowser(\PGuardiario\PGBrowser::PHPQUERY);
    $page = $browser->get('http://www.google.com/search?q=php');

    // Print the link text of each search result
    foreach ($page->search('li.g') as $li) {
        echo $li->at('a')->text . "\n";
    }
New: PGBrowser can now cache responses to disk and reuse them on subsequent requests to save network traffic. Cached responses go into a folder called 'cache'.
    $browser->useCache = true;  // turn on caching
    $browser->useCache = false; // turn off caching
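The general idea behind a disk cache like this can be sketched as follows. This is not PGBrowser's actual implementation: `cachedGet`, the MD5-of-URL file naming and the injected `$fetch` callable are all assumptions made for illustration, and the demo writes to a temporary directory rather than to 'cache'.

```php
<?php
// Hypothetical helper, NOT PGBrowser's code: return the cached body for $url
// if one exists, otherwise call $fetch and store its result on disk.
function cachedGet($url, $fetch, $cacheDir)
{
    if (!is_dir($cacheDir)) {
        mkdir($cacheDir, 0777, true);
    }
    $file = $cacheDir . '/' . md5($url); // one cache file per URL (assumed naming scheme)
    if (is_file($file)) {
        return file_get_contents($file); // cache hit: no network traffic
    }
    $body = call_user_func($fetch, $url); // cache miss: perform the real request
    file_put_contents($file, $body);
    return $body;
}

// Fake fetcher that counts how often it is called, so the demo needs no network.
$calls = 0;
$fetch = function ($url) use (&$calls) {
    $calls++;
    return "<html>response for $url</html>";
};

$dir = sys_get_temp_dir() . '/pgbrowser-cache-demo-' . uniqid();
$first  = cachedGet('http://example.com/', $fetch, $dir);
$second = cachedGet('http://example.com/', $fetch, $dir);

echo $calls, "\n"; // the fetcher ran only for the first call
```

A cache keyed only on the URL never expires and ignores request method and headers, so it suits repeatable crawls of static pages better than pages whose content changes between requests.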