{"id":18735204,"url":"https://github.com/irontec/elastika","last_synced_at":"2025-06-23T10:34:01.510Z","repository":{"id":32436726,"uuid":"36014835","full_name":"irontec/elastika","owner":"irontec","description":"Command line utility to extract data with Apache Tika and send them over to Elasticseach","archived":false,"fork":false,"pushed_at":"2015-05-22T08:47:36.000Z","size":40848,"stargazers_count":3,"open_issues_count":0,"forks_count":2,"subscribers_count":23,"default_branch":"master","last_synced_at":"2025-05-20T01:12:40.789Z","etag":null,"topics":[],"latest_commit_sha":null,"homepage":null,"language":"Java","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"eupl-1.1","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/irontec.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE.txt","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null}},"created_at":"2015-05-21T13:37:54.000Z","updated_at":"2016-07-13T16:35:21.000Z","dependencies_parsed_at":"2022-09-23T09:02:37.326Z","dependency_job_id":null,"html_url":"https://github.com/irontec/elastika","commit_stats":null,"previous_names":[],"tags_count":1,"template":false,"template_full_name":null,"purl":"pkg:github/irontec/elastika","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/irontec%2Felastika","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/irontec%2Felastika/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/irontec%2Felastika/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/irontec%2Felastika/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/irontec","download_url":"https://codeload.github.com/irontec/elastika/tar.gz/refs/heads/master","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/irontec%2Felastika/sbom","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":261462283,"owners_count":23161917,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":[],"created_at":"2024-11-07T15:16:04.797Z","updated_at":"2025-06-23T10:33:56.486Z","avatar_url":"https://github.com/irontec.png","language":"Java","funding_links":[],"categories":[],"sub_categories":[],"readme":"Elastika\n==============================\n\n\nCommand line utility that extracts the metadata and plain text content of files supported by Apache Tika and send them to the [Elastic](https://www.elastic.co/) server.\n\nRelases\n-------------\n\n[v0.9](https://github.com/irontec/elastika/releases/tag/v0.9)\n\nUsage\n-------------\nOnce downloaded, place the `tika-app.jar` file that it´s placed inside the `libs/` folder on the same folder that contains your `elastika.jar`. Now, you're ready to use Elastika.\n\nNote: this document assumes that the final user does have [Elastic](https://www.elastic.co/) installed and running at least on localhost or in some host that you can specify with the following options.\n\nOptions:\n\n    usage: elastika\n     -i,--indice \u003carg\u003e   (Required) Elastic indice name.\n     -t,--type \u003carg\u003e     (Required) Elastic indice type name.\n     -f,--file \u003carg\u003e     (Required) The document to be parsed and sent to\n                         Elastic.\n     \n     -h,--host \u003carg\u003e     (Optional) Elastic REST Endpoint hostname. Default\n                         http://localhost.\n     -p,--port \u003carg\u003e     (Optional) Elastic REST Endpoint port. Default 9200.\n     \n     -?,--help           Print this usage message\n     -v,--version        Display version information\n     \n\n\nUsage sample:\n\n    axier$ java -jar elastika.jar -i myIndice -t myType --file my_fancy_document.pdf\n\nOutputs:\n\n    # Extracting JSON Metadata from the file\n    Executing: java -jar tika-app.jar -j my_fancy_document.pdf\n\n\t# Extracting the plain text content from the file\n\tExecuting: java -jar tika-app.jar -T my_fancy_document.pdf\n\t\n\t# Result of the POST to Elastic\n\t{\"_type\":\"data\",\"_version\":1,\"_id\":\"AU12rlvKHYuWDiEyeqrY\",\"created\":true,\"_index\":\"ekt\"}\n\n\nBuilding\n-------------\n\nOn the first place the code into your java project [Eclipse](https://www.eclipse.org/downloads/packages/eclipse-ide-java-developers/lunasr2). Now, for generating the jar file just follow this simple steps:\n\n- Right click on your project and click on `Export`\n- Select `Java \u003e JAR File` and click `Next`\n- Enter the path of the folder where you want to leave the jar file on the `Select the export destination` section and click `Next` and `Next` again.\n- Now, on the JAR Manifest Specification part, on the `Select the class of the application entry point` select `Browse` and then select `Elastika`\n- Click `Finish` and you're done\n\nLibraries\n-------------\n\n - [Apache Commons Cli](https://commons.apache.org/proper/commons-cli/)\n - [Apache Commons IO](https://commons.apache.org/proper/commons-io/)\n - [Apache Tika 1.8](https://tika.apache.org/)\n\n\nLicense\n-------------\n[EUPL v1.1](https://github.com/irontec/elastika/blob/master/LICENSE.txt)\n\n\u003e Copyright 2015 Irontec SL\n\u003e \n\u003e Licensed under the EUPL, Version 1.1 or - as soon they will be approved by the European\n\u003e Commission - subsequent versions of the EUPL (the \"Licence\"); You may not use this work\n\u003e except in compliance with the Licence.\n\u003e \n\u003e You may obtain a copy of the Licence at:\n\u003e http://ec.europa.eu/idabc/eupl.html\n\u003e \n\u003e Unless required by applicable law or agreed to in writing, software distributed under \n\u003e the Licence is distributed on an \"AS IS\" basis, WITHOUT WARRANTIES OR CONDITIONS OF \n\u003e ANY KIND, either express or implied. See the Licence for the specific language \n\u003e governing permissions and limitations under the Licence.\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Firontec%2Felastika","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Firontec%2Felastika","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Firontec%2Felastika/lists"}