Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/xperseguers/t3ext-extractor

TYPO3 Extension extractor
https://github.com/xperseguers/t3ext-extractor

Last synced: 1 day ago
JSON representation

TYPO3 Extension extractor

Awesome Lists containing this project

README

        

Metadata and content analysis service
=====================================

This extension detects and extracts metadata (EXIF / IPTC / XMP / ...) from
potentially thousand different file types (such as MS Word/Powerpoint/Excel
documents, PDF and images) and bring them automatically and natively to TYPO3
when uploading assets. Works with built-in PHP functions but takes advantage of
Apache Tika and other external tools for enhanced metadata extraction.

.. image:: Documentation/Images/metadata.png
:alt: Metadata for a document

Requirements
------------

For best results, `Apache Tika `__ is
required (either as standalone JAR or running as server).

Extraction of metadata from common image files (jpg, tiff, ...) is often quicker
using external tool `exiftool `__ and if not available,
it will fall back to PHP's built-in EXIF and IPTC library.

For PDF, external tool
`pdfinfo `__ will be used.

Read more in the
`manual `__.