Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/typo3-solr/ext-tika

A TYPO3 CMS extension that provides Apache Tika functionality
https://github.com/typo3-solr/ext-tika

cms cms-extension file-indexing language-detection metadata php search text-extraction tika typo3 typo3-cms-extension

Last synced: 6 days ago
JSON representation

A TYPO3 CMS extension that provides Apache Tika functionality

Awesome Lists containing this project

README

        

# Apache Tika for TYPO3

[![Build Status](https://github.com/TYPO3-Solr/ext-tika/actions/workflows/ci.yml/badge.svg?branch=main)](https://github.com/TYPO3-Solr/ext-tika/actions?query=branch:main)
[![Latest Stable Version](https://poser.pugx.org/apache-solr-for-typo3/tika/v/stable)](https://packagist.org/packages/apache-solr-for-typo3/tika)
[![License](https://poser.pugx.org/apache-solr-for-typo3/tika/license)](https://packagist.org/packages/apache-solr-for-typo3/tika)
[![Total Downloads](http://poser.pugx.org/apache-solr-for-typo3/tika/downloads)](https://packagist.org/packages/apache-solr-for-typo3/tika)
[![Monthly Downloads](https://poser.pugx.org/apache-solr-for-typo3/tika/d/monthly)](https://packagist.org/packages/apache-solr-for-typo3/tika)
[![TYPO3 13](https://img.shields.io/badge/TYPO3-13-orange.svg?style=flat-square)](https://get.typo3.org/version/13)

A TYPO3 CMS extension that provides Apache Tika functionality including

* text extraction
* meta data extraction
* language detection (from strings or files)

Tika can be used as standalone Tika app/jar, Tika server, and via SolrCell integrated in Apache Solr.

We're open for [contributions](#Contributions) !

Please find further information regarding Apache Tika on the [project's homepage](http://tika.apache.org)

---

| | URL |
|------------------|-----------------------------------------------------------------|
| **Repository:** | https://github.com/TYPO3-Solr/ext-tika |
| **Read online:** | https://docs.typo3.org/p/apache-solr-for-typo3/tika/main/en-us/ |
| **TER:** | https://extensions.typo3.org/extension/tika |
| **Homepage:** | https://www.typo3-solr.com/ |
| **Fund:** | https://shop.dkd.de/Produkte/Apache-Solr-fuer-TYPO3/ |

Powered by the TYPO3 community and

![dkd Internet Service GmbH](./Documentation/Images/dkdLogo.png)

---

## Continuous Integration

We use GitHub Actions for continuous integration.

To run the test suite locally, please use our DDEV docker environment https://github.com/TYPO3-Solr/solr-ddev-site.

**Note**:
This requires a proper combination of branches:
* solr-ddev-site on main branch
* packages/ext-solr on main
* packages/ext-tika on main
* Please refer to [version matrix](https://raw.githubusercontent.com/TYPO3-Solr/ext-solr/main/Documentation/Appendix/VersionMatrix.rst) for proper combination of branches

```shell
ddev solr:enable tika
ddev composer t3:standards:fix packages/ext-tika/
ddev composer tests:tika:phpstan
ddev composer tests:tika:unit
ddev composer tests:tika:integration
```

## Contributions

1. Fork the repository
2. Clone repository
3. Create a new branch
4. Make your changes
5. Commit your changes to your fork. In your commit message refer to the issue number if there is already one, e.g. `[BUGFIX] short description of fix (resolves #4711)`
6. Submit a Pull Request (here are some hints on [How to write the perfect pull request](https://github.com/blog/1943-how-to-write-the-perfect-pull-request))

### Keep your fork in sync with original repository

1. git remote add upstream https://github.com/TYPO3-Solr/ext-tika.git
2. git fetch upstream
3. git checkout master
4. git merge upstream/master
5. git push origin master