wrseward/pdf-parser

PHP library to parse text from PDF files

Maintainers

Package info

github.com/wrseward/pdf-parser

pkg:composer/wrseward/pdf-parser

Statistics

Installs: 5 630

Dependents: 0

Suggesters: 0

Stars: 16

Open Issues: 3

0.1.0 2015-09-01 00:54 UTC

This package is not auto-updated.

Last update: 2026-05-10 01:58:14 UTC


README

PHP library to parse PDF files to text. A wrapper for pdftotext.

Build Status

Installation

Via Composer

composer require wrseward/pdf-parser

pdftotext binary

Debian / Ubuntu

apt-get install poppler-utils

RedHat / CentOS

yum install poppler-utils

OS X

brew install xpdf

Verify your installation / Get the path of the binary

which pdftotext

Usage

$parser = new \Wrseward\PdfParser\Pdf\PdfToTextParser('/usr/bin/pdftotext');
$parser->parse('/path/to/file.pdf');
echo $parser->text();

Running tests

./vendor/bin/phpspec run

License

MIT