An open API service indexing awesome lists of open source software.

https://github.com/kevm/tikaondotnet

Use the Java Tika text extraction library on the .NET platform
https://github.com/kevm/tikaondotnet

extract-text tika

Last synced: 7 months ago
JSON representation

Use the Java Tika text extraction library on the .NET platform

Awesome Lists containing this project

README

          

Tika on .NET
============

[![Build status](https://ci.appveyor.com/api/projects/status/ofc68okbo9s75okr?svg=true)](https://ci.appveyor.com/project/KevM/tikaondotnet) [![NuGet version](https://badge.fury.io/nu/TikaOnDotNet.TextExtractor.svg)](https://badge.fury.io/nu/TikaOnDotNet.TextExtractor)

This project is a simple wrapper around the very excellent and robust
[Tika](http://tika.apache.org/) text extraction Java library. This project produces two nugets:
- TikaOnDotNet - A straight [IKVM](http://www.ikvm.net/userguide/ikvmc.html) hosted port of Java Tika project.

[![Install-Package TikaOnDotNet](https://cldup.com/H-IdGdU75T.png)](https://www.nuget.org/packages/TikaOnDotnet/)

- TikaOnDotNet.TextExtractor - Use Tika to extract text from rich documents.

[![Install-Package TikaOnDotNet.TextExtractor](https://cldup.com/_BM0b5jVjU.png)](https://www.nuget.org/packages/TikaOnDotNet.TextExtractor/)

## Getting Started

The best way to get started is to:
- Add a Nuget dependency to [TikaOnDotNet.TextExtractor](https://www.nuget.org/packages/TikaOnDotNet.TextExtractor/).
- Instantiate a new `TextExtractor` object and call one of the `Extract` methods.

### Usage
```cs
// using TikaOnDotNet.TextExtraction;

var textExtractor = new TextExtractor();

var wordDocContents = textExtractor.Extract(@".\path\to\my favorite word.docx");
var webPageContents = textExtractor.Extract(new Uri("https://google.com"));
```

Take a look at [our tests](https://github.com/KevM/tikaondotnet/tree/master/src/TikaOnDotNet.Tests) for more usage examples.

## How To Contribute

Have an idea to make this project better? Great! Start out by taking a look at our [Contributing Guide](https://github.com/KevM/tikaondotnet/blob/master/Contributing.md).

## Having A Problem?

Search in the [Issues](https://github.com/KevM/tikaondotnet/issues?q=is%3Aopen+is%3Aissue)
as your problem may be a common one. If don't find your problem please [create an
issue](https://github.com/KevM/tikaondotnet/issues/new). Contributors here will
chime in when they can.