me578022 / simplehtmldom
A HTML DOM parser written in PHP5+ let you manipulate HTML in a very easy way!
v1.11
2016-10-01 09:34 UTC
Requires
- php: >=5.2.0
This package is not auto-updated.
Last update: 2025-02-01 22:13:54 UTC
README
A HTML DOM parser written in PHP5+ let you manipulate HTML in a very easy way!
Requirement & Features:
- A HTML DOM parser written in PHP5+ let you manipulate HTML in a very easy way!
- Require PHP 5+.
- Supports invalid HTML.
- Find tags on an HTML page with selectors just like jQuery.
- Extract contents from HTML in a single line.
Download & Documents:
- Download latest version form Sourceforge.
- Read Online Document.
Quick Start:
Get HTML elements
// Create DOM from URL or file $html = file_get_html('http://www.google.com/'); // Find all images foreach($html->find('img') as $element) echo $element->src . '<br>'; // Find all links foreach($html->find('a') as $element) echo $element->href . '<br>';
Modify HTML elements
// Create DOM from string $html = str_get_html('<div id="hello">Hello</div><div id="world">World</div>'); $html->find('div', 1)->class = 'bar'; $html->find('div[id=hello]', 0)->innertext = 'foo'; echo $html; // Output: <div id="hello">foo</div><div id="world" class="bar">World</div>
Extract contents from HTML
// Dump contents (without tags) from HTML echo file_get_html('http://www.google.com/')->plaintext;
Scraping Slashdot!
// Create DOM from URL $html = file_get_html('http://slashdot.org/'); // Find all article blocks foreach($html->find('div.article') as $article) { $item['title'] = $article->find('div.title', 0)->plaintext; $item['intro'] = $article->find('div.intro', 0)->plaintext; $item['details'] = $article->find('div.details', 0)->plaintext; $articles[] = $item; } print_r($articles);
Feedback:
Author:
Author: S.C. Chen (me578022@gmail.com) Original idea is from Jose Solorzano's HTML Parser for PHP 4. Contributions by: Yousuke Kumakura (Attribute Filters)