wrseward/pdf-parser

PHP library to parse text from PDF files

0.1.0 2015-09-01 00:54 UTC

This package is not auto-updated.

Last update: 2024-04-27 15:31:43 UTC


README

PHP library to parse PDF files to text. A wrapper for pdftotext.

Build Status

Installation

Via Composer

composer require wrseward/pdf-parser

pdftotext binary

Debian / Ubuntu

apt-get install poppler-utils

RedHat / CentOS

yum install poppler-utils

OS X

brew install xpdf

Verify your installation / Get the path of the binary

which pdftotext

Usage

$parser = new \Wrseward\PdfParser\Pdf\PdfToTextParser('/usr/bin/pdftotext');
$parser->parse('/path/to/file.pdf');
echo $parser->text();

Running tests

./vendor/bin/phpspec run

License

MIT