Pdf parser library. Can read and extract information from pdf file.

Installs: 3 843 681

Dependents: 99

Suggesters: 5

Security: 0

Stars: 1 469

Watchers: 78

Forks: 411

Open Issues: 143

v0.18.2 2021-02-25 07:40 UTC


Pdf Parser, a standalone PHP library, provides various tools to extract data from a PDF file.

CI Scrutinizer Code Quality Code Coverage License

Latest Stable Version Total Downloads Monthly Downloads Daily Downloads

Website :

Test the API on our demo page.

This project is supported by Actualys.


Features included :

  • Load/parse objects and headers
  • Extract meta data (author, description, ...)
  • Extract text from ordered pages
  • Support of compressed pdf
  • Support of MAC OS Roman charset encoding
  • Handling of hexa and octal encoding in text sections
  • PSR-0 compliant (autoloader)
  • PSR-1 compliant (code styling)

Currently, secured documents are not supported.

This Library is still under active development. As a result, users must expect BC breaks when using the master version.


Read the documentation on website.

Original PDF References files can be downloaded from this url:


Using Composer

  • Obtain Composer
  • Run composer require smalot/pdfparser

Use alternate file loader

In case you can't use Composer, you can include alt_autoload.php-dist into your project. It will load all required files at once. Afterwards you can use PDFParser class and others.


This library is under the LGPLv3 license.