matasarei/text-at-any-cost

There is no license information available for the latest version (dev-master) of this package.

Text at any cost: get text from any office document (DOC, DOCX, ODT, PDF and other).

dev-master 2019-11-11 14:46 UTC

This package is auto-updated.

Last update: 2024-12-19 11:57:03 UTC


README

Old PHP scripts to read text content from different binary formats:

  • DOC/PPT (using self-written CFB basic module)
  • PDF
  • RTF