lmasforne / pdfparser
PDF parser library. Reads and extracts information from PDF files.
v0.11
2017-09-14 12:26 UTC
Requires
- php: >=5.3.0
- ext-iconv: *
- tecnickcom/tcpdf: ~6.0
Requires (Dev)
- atoum/atoum: ^2.8 | ^3.0
This package is not auto-updated.
Last update: 2024-11-10 05:24:48 UTC
README
Pdf Parser, a standalone PHP library, provides various tools to extract data from a PDF file.
Website: http://www.pdfparser.org
Test the API on our demo page.
This project is supported by Actualys.
Features
Features included:
- Load/parse objects and headers
- Extract metadata (author, description, ...)
- Extract text from ordered pages
- Support for compressed PDF files
- Support for Mac OS Roman charset encoding
- Handling of hexadecimal and octal encoding in text sections
- PSR-0 compliant (autoloader)
- PSR-1 compliant (code styling)
Currently, secured documents are not supported.
This library is still under active development. As a result, users should expect backward-compatibility (BC) breaks when using the master branch.
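To make the "hexadecimal and octal encoding" feature more concrete, here is a minimal sketch of what decoding those two forms can look like in plain PHP. It is not taken from this library's source: the function names `decodeHexString` and `decodeOctalEscapes` are hypothetical, and the sketch deliberately ignores other PDF escapes such as `\\`, `\n` or `\(`.

```php
<?php
// Illustrative helpers only; these names are hypothetical and are not
// part of the library's API.

/**
 * Decode a PDF hexadecimal string such as "<48656C6C6F>" into raw bytes.
 */
function decodeHexString($token)
{
    // Keep only the hex digits: this drops the "<" and ">" delimiters
    // and any whitespace allowed between digits.
    $hex = preg_replace('/[^0-9A-Fa-f]/', '', $token);

    // In the PDF format, a missing final digit is assumed to be 0.
    if (strlen($hex) % 2 === 1) {
        $hex .= '0';
    }

    return pack('H*', $hex);
}

/**
 * Replace octal escapes (a backslash followed by 1 to 3 octal digits)
 * with the byte they encode. Other escapes are left untouched, so an
 * escaped backslash followed by digits is NOT handled correctly here.
 */
function decodeOctalEscapes($text)
{
    return preg_replace_callback(
        '/\\\\([0-7]{1,3})/',
        function ($m) {
            // Keep only the low 8 bits of the decoded value.
            return chr(octdec($m[1]) & 0xFF);
        },
        $text
    );
}

echo decodeHexString('<48656C6C6F>'), "\n";                 // Hello
echo decodeOctalEscapes('\\110\\145\\154\\154\\157'), "\n"; // Hello
```

Both functions return raw bytes. Turning those bytes into readable text still depends on the font encoding in use (for example Mac OS Roman), which this sketch does not attempt.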
Documentation
Read the documentation on the website.
The original PDF Reference files can be downloaded from: http://www.adobe.com/devnet/pdf/pdf_reference_archive.html
License
This library is under the LGPLv3 license.