PDF parser library. It can read and extract information from PDF files.

v2.0.1 2021-11-22 11:57 UTC


Pdf Parser, a standalone PHP library, provides various tools to extract data from a PDF file.

Website :

Test the API on our demo page.

This project is supported by Actualys.


Features included:

  • Load/parse objects and headers
  • Extract metadata (author, description, ...)
  • Extract text from ordered pages
  • Support for compressed PDFs
  • Support for Mac OS Roman charset encoding
  • Handling of hexadecimal and octal encoding in text sections
  • PSR-0 compliant (autoloader)
  • PSR-1 compliant (code styling)
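
To make the "hexadecimal and octal encoding" item above more concrete, here is a minimal, self-contained sketch of what decoding such text can look like. It is not the library's own implementation: the function names `decodePdfHexString` and `decodePdfOctalEscapes` are hypothetical, and the octal helper deliberately ignores other escape sequences (such as an escaped backslash) that a full parser has to handle.

```php
<?php
declare(strict_types=1);

/**
 * Decode the body of a PDF hexadecimal string, e.g. "48656C6C6F" taken from <48656C6C6F>.
 * Hypothetical helper for illustration only.
 */
function decodePdfHexString(string $hex): string
{
    // Whitespace between hex digits is insignificant.
    $hex = (string) preg_replace('/\s+/', '', $hex);

    if ($hex !== '' && !ctype_xdigit($hex)) {
        throw new InvalidArgumentException('Not a hexadecimal string.');
    }

    // With an odd number of digits, the final digit is treated as if followed by 0.
    if (strlen($hex) % 2 === 1) {
        $hex .= '0';
    }

    return (string) hex2bin($hex);
}

/**
 * Replace octal escapes (\d, \dd or \ddd) in a PDF literal string with the bytes they encode.
 * Simplification: other escapes such as \n, \( or \\ are left untouched.
 */
function decodePdfOctalEscapes(string $text): string
{
    return (string) preg_replace_callback(
        '/\\\\([0-7]{1,3})/',
        static function (array $match): string {
            // Keep only the low-order byte, since a character code is a single byte.
            return chr(octdec($match[1]) & 0xFF);
        },
        $text
    );
}
```

For example, `decodePdfHexString('48 65 6C 6C 6F')` and `decodePdfOctalEscapes('\\110\\145llo')` both yield the string `Hello`.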

Currently, secured documents are not supported.
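
As a rough illustration of the features listed above, the sketch below collects a document's metadata and the text of each page in order. The helper name `summarizePdf` and the file name `document.pdf` are placeholders, and the code assumes the library's classes can be autoloaded (for example through Composer).

```php
<?php
declare(strict_types=1);

/**
 * Return the metadata and per-page text of a PDF file.
 * Hypothetical helper; requires the Smalot\PdfParser classes to be autoloadable.
 *
 * @return array{details: array, pages: string[]}
 */
function summarizePdf(string $path): array
{
    $parser = new \Smalot\PdfParser\Parser();
    $pdf = $parser->parseFile($path);

    $pages = [];
    // getPages() returns the pages in document order.
    foreach ($pdf->getPages() as $page) {
        $pages[] = $page->getText();
    }

    return [
        'details' => $pdf->getDetails(), // author, title, and similar properties
        'pages' => $pages,
    ];
}

// Usage, assuming a Composer installation:
//
//   require __DIR__ . '/vendor/autoload.php';
//   try {
//       print_r(summarizePdf('document.pdf'));
//   } catch (\Exception $e) {
//       // Unsupported input, such as a secured document, is expected to end up here.
//       echo 'Could not parse the file: ', $e->getMessage(), "\n";
//   }
```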

This library is under active maintenance. The author is not actively developing it at the moment, but pull requests that add or extend functionality are welcome!


Read the documentation on the wiki.

The original PDF Reference files can be downloaded from this URL:

For developers

Please read for more information about local development of the PDFParser library. Here you will also find information about how to handle Base64-encoded PDFs.
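
One possible way to handle a Base64-encoded PDF is to decode it to binary first and then pass the result to the parser's `parseContent()` method. The sketch below assumes the library's classes are autoloadable; the helper name `parseBase64Pdf` is hypothetical.

```php
<?php
declare(strict_types=1);

/**
 * Parse a PDF that arrives as a Base64 string (for example from an API payload).
 * Hypothetical helper; requires the Smalot\PdfParser classes to be autoloadable.
 */
function parseBase64Pdf(string $base64): \Smalot\PdfParser\Document
{
    // Strict mode rejects characters outside the Base64 alphabet.
    $binary = base64_decode($base64, true);
    if ($binary === false) {
        throw new InvalidArgumentException('Input is not valid Base64.');
    }

    $parser = new \Smalot\PdfParser\Parser();

    // parseContent() works on raw PDF bytes rather than on a file path.
    return $parser->parseContent($binary);
}

// Usage: echo parseBase64Pdf($encodedPdf)->getText();
```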


Using Composer

  • Obtain Composer
  • Run composer require smalot/pdfparser

Use alternate file loader

If you can't use Composer, you can include alt_autoload.php-dist in your project. It loads all required files at once. Afterwards you can use the PDFParser class and others.
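
A minimal sketch of this approach follows. It assumes a copy of the library has been placed in a directory of your project and that alt_autoload.php-dist sits in the root of that copy. The directory path and the helper name `loadPdfParser` are illustrative only.

```php
<?php
declare(strict_types=1);

/**
 * Load the library without Composer and return a ready-to-use parser.
 * Hypothetical helper; $libraryDir is wherever you copied the library.
 */
function loadPdfParser(string $libraryDir): \Smalot\PdfParser\Parser
{
    $loader = rtrim($libraryDir, '/') . '/alt_autoload.php-dist';

    if (!is_file($loader)) {
        throw new RuntimeException('Alternate loader not found at: ' . $loader);
    }

    // The alternate loader makes the library's classes available.
    require_once $loader;

    return new \Smalot\PdfParser\Parser();
}

// Usage, with a hypothetical location for the copied library:
//
//   $parser = loadPdfParser(__DIR__ . '/lib/pdfparser');
//   echo $parser->parseFile('document.pdf')->getText();
```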


This library is under the LGPLv3 license.