migliori / sitemap-crawler
Sitemap crawler/generator. For the given URL it will return sitemap XML file with URLs and images.
Installs: 1 232
Dependents: 0
Suggesters: 0
Security: 0
Stars: 3
Watchers: 2
Forks: 0
Open Issues: 0
Requires
- php: >=5.5
This package is auto-updated.
Last update: 2024-12-10 19:08:11 UTC
README
Sitemap crawler/generator. For the given URL it will return sitemap XML file with URLs and images.
Can be used as Standalone or with Ajax (build sitemap, submit to Search Engines & show results on a button click)
Original project: https://github.com/ivebe/sitemap-crawler
Install
composer require migliori/sitemap-crawler
Features
- crawl given URL and generate sitemap
- crawl each found URL and add images to the sitemap
- save the sitemap on your server or download it
- configure the maximum deph
- use as standalone (or CRON task)
- call with ajax on button click
- onscreen live report with urls count and urls list
- search engines auto-submission (ping) with onscreen report
Example
<?php require_once __DIR__ . '/vendor/autoload.php'; use SitemapCrawler\Crawler; use SitemapCrawler\SitemapService; use SitemapCrawler\LinkCollection; $config = require __DIR__ . '/vendor/ivebe/sitemap-crawler/src/config.php'; $url = 'http://www.google.com'; /** * $dest: * false if you want to download the generated sitemap * 'filename.xml' to save file on server */ $dest = __DIR__ . '/sitemap.xml'; /** * sitemap url for search engines submission */ $sitemap_url = 'http://www.google.com/sitemap.xml'; $crawler = new Crawler($config['crawler']); $collection = new LinkCollection(); $provider = new SitemapService($crawler, $collection, $url, $config['sitemap_service']); $links = $provider->crawl($url); $provider->export('daily', $dest); if ($config['submit_to_search_engines'] === true) { $provider->SubmitSiteMap($sitemap_url); }
Example 2 (Ajax)
Refer to ajax-demo.php