contextualcode/ezplatform-search-binary-extractor

eZ Platform bundle that allows indexing the content of binary files

v2.1.0 2020-07-16 10:45 UTC


Last update: 2020-08-01 15:10:13 UTC


README

In eZ Publish it was possible to use third-party binaries to index binary files. This functionality is missing in the latest eZ Platform versions, and this bundle provides it.

It also provides an example binary extractor for PDF files, which uses the third-party pdftotext binary.
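
Because the PDF extractor depends on an external binary, it can help to confirm that pdftotext is actually available on the server before relying on the index. The following is a minimal sketch, not part of the bundle: have_binary is a hypothetical helper and sample.pdf is a placeholder file name. The flags the bundle itself passes to pdftotext are not shown here and may differ.

```shell
# Hypothetical helper (not provided by the bundle):
# succeeds if the given command can be found on PATH.
have_binary() {
    command -v "$1" >/dev/null 2>&1
}

PDF="sample.pdf"   # placeholder file name, replace with a real PDF

if ! have_binary pdftotext; then
    echo "pdftotext not found on PATH" >&2
elif [ -f "$PDF" ]; then
    # Passing "-" as the output file makes pdftotext write the text to stdout.
    pdftotext "$PDF" - | head -n 20
else
    echo "pdftotext is available; $PDF not found, skipping preview"
fi
```

On Debian-based systems pdftotext is typically shipped in the poppler-utils package; other platforms may package it differently.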

Installation

  1. Require contextualcode/ezplatform-search-binary-extractor via composer:
     composer require contextualcode/ezplatform-search-binary-extractor
    

Usage

First, make sure the "Searchable" checkbox is checked for every binary file field that needs to be searchable.

Once the bundle is installed, the content of all PDF files will be indexed. You then need to rebuild the search index by running:

php bin/console ezplatform:reindex

It is also possible to build your own custom binary extractors by following a few simple steps:

  1. Create a new service that implements BinaryExtractor. Please use BinaryExtractor\Pdf as an example.

  2. Tag your service with the ezplatform.field_type.ezbinaryfile.binary_extractor tag.
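
After completing both steps, one way to confirm that the custom extractor was registered is to list the services carrying the tag with Symfony's debug:container command. This is a sketch under the assumption that it is run from the project root of a standard eZ Platform installation; the TAG variable is introduced here only for readability.

```shell
# Tag name taken from step 2 above.
TAG="ezplatform.field_type.ezbinaryfile.binary_extractor"

if [ -f bin/console ]; then
    # Lists every service tagged as a binary extractor,
    # which should include your custom service.
    php bin/console debug:container --tag="$TAG"
else
    echo "bin/console not found; run this from the project root" >&2
fi
```

If your service does not appear in the output, check its tag definition and clear the cache before rebuilding the search index.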