smalot / pdfparser
Pdf parser library. Can read and extract information from pdf file.
Installs: 3 843 681
Dependents: 99
Suggesters: 5
Security: 0
Stars: 1 469
Watchers: 78
Forks: 411
Open Issues: 143
Requires
- php: >=5.6
- ext-zlib: *
- symfony/polyfill-mbstring: ^1.18
Requires (Dev)
- friendsofphp/php-cs-fixer: ^2.16
- symfony/phpunit-bridge: ^5.2
- dev-master
- v0.18.2
- v0.18.1
- v0.18.0
- v0.17.1
- v0.17.0
- v0.16.2
- 0.16.1
- v0.16.0
- v0.15.1
- v0.15.0
- v0.14.0
- v0.13.3
- v0.13.2
- v0.13.1
- v0.13.0
- v0.12.0
- v0.11
- v0.10.0
- v0.9.26
- v0.9.25
- v0.9.24
- v0.9.23
- v0.9.22
- v0.9.21
- v0.9.20
- v0.9.19
- v0.9.18
- v0.9.17
- v0.9.16
- v0.9.15
- v0.9.14
- v0.9.13
- v0.9.12
- v0.9.11
- v0.9.10
- v0.9.9
- v0.9.8
- v0.9.7
- v0.9.6
- v0.9.5
- v0.9.4
- v0.9.3
- v0.9.2
- v0.9.1
- v0.9.0
- dev-fix/undefined-method-header-to-string-391
- dev-php8
This package is auto-updated.
Last update: 2021-02-25 08:12:20 UTC
README
Pdf Parser, a standalone PHP library, provides various tools to extract data from a PDF file.
Website : https://www.pdfparser.org
Test the API on our demo page.
This project is supported by Actualys.
Features
Features included :
- Load/parse objects and headers
- Extract meta data (author, description, ...)
- Extract text from ordered pages
- Support of compressed pdf
- Support of MAC OS Roman charset encoding
- Handling of hexa and octal encoding in text sections
- PSR-0 compliant (autoloader)
- PSR-1 compliant (code styling)
Currently, secured documents are not supported.
This Library is still under active development. As a result, users must expect BC breaks when using the master version.
Documentation
Read the documentation on website.
Original PDF References files can be downloaded from this url: http://www.adobe.com/devnet/pdf/pdf_reference_archive.html
Installation
Using Composer
- Obtain Composer
- Run
composer require smalot/pdfparser
Use alternate file loader
In case you can't use Composer, you can include alt_autoload.php-dist
into your project.
It will load all required files at once.
Afterwards you can use PDFParser
class and others.
License
This library is under the LGPLv3 license.