funstaff/tika

libs for tika wrapper

dev-master / 1.x-dev 2016-03-09 12:06 UTC

This package is auto-updated.

Last update: 2024-10-29 04:14:33 UTC


README

Master: Build Status

A wrapper php for Tika binary

Installation

To install tika with Composer just run:

$ php composer.phar require funstaff/tika dev-master

Usage

<?php

use Funstaff\Tika\Configuration;
use Funstaff\Tika\Document;
use Funstaff\Tika\Wrapper;

$config = new Configuration('/path/to/tika.jar');
$config
    ->setOutputFormat('html')
    ->setOutputEncoding('UTF-8');

$wrapper = new Wrapper($config);
$wrapper
    ->addDocument(new Document('doc.pdf','/path/to/document.pdf'))
    ->addDocument(new Document('doc2.pdf','/path/to/document2.pdf'))
    ->execute();

/* Get All documents */
$documents = $wrapper->getDocument();

/* or only one document */
$document = $wrapper->getDocument('doc.pdf');

/* Get Document content */
$content = $document->getContent();

/* or raw content (output of Tika) */
$content = $document->getRawContent();

/* Get All Metadata for document (only on xml and html output format) */
$metadata = $document->getMetadata();

/* Get Value for metadata */
$author = $metadata->get('author');

Credits

To all users that gave feedback and committed code https://github.com/Funstaff/Tika.

© Bertrand Zuchuat - Funstaff