mingfei / querylist-coroutine
QueryList Plugin: 协程插件
1.0.0
2022-02-25 06:09 UTC
Requires
- php: >=7.0
- ext-swoole: >=4.5
- tightenco/collect: >5.0
Requires (Dev)
- jaeger/querylist: dev-master
This package is auto-updated.
Last update: 2025-03-29 01:03:21 UTC
README
QueryList Plugin: Coroutine
Installation
composer require mingfei/querylist-coroutine
API
- Coroutine coroutine(int $size=0):use Coroutine QueryList.
class Coroutine:
- Coroutine size($size):set max parallel num.
- Coroutine add(string $option):add option.
- Coroutine success(callable $callable): success callback.
- Coroutine wait($post): wait for request.
Usage
- Bind Coroutine
use QL\QueryList; use QL\Ext\Coroutine; $ql = new QueryList(); $ql->use(Coroutine::class); //or Custom function name $ql->use(Coroutine::class,'coroutine');
- Your Rule like this
$rule = [ 'package' => ['.col-sm-9>h4>a','text'], 'link' => ['.col-sm-9>h4>a','href'], 'desc'=>['.col-sm-9>p:last','text'], 'language'=>['.col-sm-9>p:eq(0)','text'], 'star'=>['.col-sm-3 span:eq(1)','text'], ]; $range = "li .row";
- Use Coroutine
$coroutine = $ql->range($range)->rules($rule)->coroutine(); $coroutine->add("https://packagist.org/explore/popular"); $data = $coroutine->wait(); print_r($data->all());
- Use Coroutine with success callback
$coroutine = $ql->range($range)->rules($rule)->coroutine(); $coroutine->add("https://packagist.org/explore/popular"); $coroutine->success(function($item){ $item["star"] = preg_replace('/\D/s', '', $item["star"] ); return $item; }); $data = $coroutine->wait(); print_r($data->all());
Out:
Array
(
[0] => Array
(
[package] => symfony/polyfill-mbstring
[link] => /packages/symfony/polyfill-mbstring
[desc] => Symfony polyfill for the Mbstring extension
[language] => PHP
[star] => 7373
)
[1] => Array
(
[package] => psr/log
[link] => /packages/psr/log
[desc] => Common interface for logging libraries
[language] => PHP
[star] => 9976
)
...
)
- Use Coroutine with size
$coroutine = $ql->range($range)->rules($rule)->coroutine(1000); $range = range(1,10000); foreach($range as $i){ $coroutine->add("https://packagist.org/explore/popular?page=".$i); } $data = $coroutine->wait(); print_r($data->all()); //爬取速度受packagist.org并发限制,这里大约需要等几分钟
Out:
use 424.99214887619s