modstore/rotating-proxy-bundle

There is no license information available for the latest version (dev-master) of this package.

Rotating proxy bundle for Symfony.

Installs: 4 204

Dependents: 0

Suggesters: 0

Security: 0

Stars: 2

Watchers: 1

Forks: 1

Open Issues: 0

Type:symfony-bundle

dev-master 2019-01-03 00:43 UTC

This package is auto-updated.

Last update: 2024-04-06 11:05:18 UTC


README

This bundle rotates through your pool of http proxies when sending subsequent requests.

Installation

composer require modstore/rotating-proxy-bundle:dev-master

Add RotatingProxyBundle to your application kernel

// app/AppKernel.php
public function registerBundles()
{
    return array(
        // ...
        new Modstore\RotatingProxyBundle\ModstoreRotatingProxyBundle(),
        // ...
    );
}

Add your proxy pool

Add your proxy ip addresses to the db table: modstore_rotating_proxy

  • host: The proxy IP address
  • port: The proxy port
  • status: 1 to enable, 0 to disable

Usage example

/** @var \Symfony\Component\DomCrawler\Crawler $crawler */
$crawler = $container->get('modstore_rotating_proxy.manager')->crawlPage(
    'https://github.com/modstore/RotatingProxyBundle',
    'github',
    ['Referer' => 'https://www.google.com']
);

An instance of the Symfony dom crawler will be returned: https://symfony.com/doc/current/components/dom_crawler.html

Notes

In order to ensure requests aren't too similar and blocked as a bot, each request will set a random user agent string. It is also a good idea to provide a "Referer" string that would be a plausible organic referrer to the page you're requesting.

The second argument is a group name. Requests are rotated within a group. Generally for all requests to a particular domain, you would set the same group name.

Test command

There's a test command to see how it works and test your requests. The text from the body of the page will be output.

php bin/console modstore_rotating_proxy:test

Optionally a url can be provided:

php bin/console modstore_rotating_proxy:test --url='https://github.com/modstore/RotatingProxyBundle'

When the url argument is omitted, a request will be sent to whatsmyip.org.

Log

A log of all requests is stored in the modstore_rotating_proxy_log table.