webgriffe/pdftotext-bundle

This bundle integrates Symfony2 with pdftotext binary command.

Installs: 16 493

Dependents: 0

Suggesters: 0

Security: 0

Stars: 3

Watchers: 4

Forks: 3

Open Issues: 0

Type:symfony-bundle

1.1.0 2017-04-24 16:35 UTC

This package is auto-updated.

Last update: 2021-11-07 14:47:06 UTC


README

This Symfony2 bundle allows you to convert an input PDF file into plain text.

Conversion is made through pdftotext command-line utilty (http://en.wikipedia.org/wiki/Pdftotext). pdftotext is part of Xpdf software suite, is included in many Linux distributions and that should be available also for Mac OS X and Windows platforms.

Installation

Install this bundle as any other Symfony2 bundle.

Symfony >= 2.1.x

Add the following requirement to your composer.json:

"require": {
	…
	"webgriffe/pdftotext-bundle": "dev-master"
}

Install the bundle with the following command:

$ composer update webgriffe/pdftotext-bundle

Register the bundle in the AppKernel:

public function registerBundles()
{
	…
	new Webgriffe\PdfToTextBundle\WebgriffePdfToTextBundle(),
}

Symfony 2.0.x

Add the following requirement in your deps file:

…
[WebgriffePdfToTextBundle]
	git=git://github.com/webgriffe/pdftotext-bundle.git
	target=bundles/Webgriffe/PdfToTextBundle

Install the bundle with the following command:

$ bin/vendors install

Register the bundle in the AppKernel:

public function registerBundles()
{
	…
	new Webgriffe\PdfToTextBundle\WebgriffePdfToTextBundle(),
}

Usage

Simply, you can get the PdfToTextConverter from DIC and get the plain text string.

// Acme\MyBundle\Controller\MyController

public function myAction()
{
	$pdfFile = '/path/to/file.pdf';
	$pdfToTextConverter = $this->get('webgriffe_pdf_to_text.converter');
	$pdfText = $pdfToTextConverter->convert($pdfFile);
	
	return new \Symfony\Component\HttpFoundation\Response($pdfText);
}

You can also specify the output encoding (default is UTF-8).

$pdfText = $pdfToTextConverter->convert($pdfFile, 'ISO-8859-1');

Specify pdftotext binary path

You can specify the pdftotext binary path in your config.yml:

webgriffe_pdf_to_text:
    bin_path: /usr/local/bin/pdftotext

Credits

This bundle has been developed by Webgriffe®. Please, report to us any bug or suggestion by GitHub issues.