{"id":16679764,"url":"https://github.com/buren/document_parser","last_synced_at":"2025-08-31T12:33:15.444Z","repository":{"id":142732364,"uuid":"95386854","full_name":"buren/document_parser","owner":"buren","description":"Small Rails API app to parse documents.","archived":false,"fork":false,"pushed_at":"2021-11-30T01:29:11.000Z","size":31,"stargazers_count":0,"open_issues_count":0,"forks_count":1,"subscribers_count":3,"default_branch":"master","last_synced_at":"2025-01-20T05:17:09.627Z","etag":null,"topics":["document-parser","rails-api","yomu"],"latest_commit_sha":null,"homepage":"","language":"Ruby","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/buren.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2017-06-25T21:30:43.000Z","updated_at":"2017-06-26T06:14:00.000Z","dependencies_parsed_at":null,"dependency_job_id":"fded4a4a-5470-4743-a07d-10745f722cf8","html_url":"https://github.com/buren/document_parser","commit_stats":null,"previous_names":[],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/buren%2Fdocument_parser","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/buren%2Fdocument_parser/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/buren%2Fdocument_parser/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/buren%2Fdocument_parser/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/buren","download_url":"https://codeload.github.com/buren/document_parser/tar.gz/refs/heads/master","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":243374214,"owners_count":20280656,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["document-parser","rails-api","yomu"],"created_at":"2024-10-12T13:37:24.477Z","updated_at":"2025-03-13T08:41:00.908Z","avatar_url":"https://github.com/buren.png","language":"Ruby","funding_links":[],"categories":[],"sub_categories":[],"readme":"# Document Parser\n\n\u003e This app does one little thing, and does it well. Which is to wrap the [`yomu`](https://github.com/Erol/yomu) gem.\n\nSmall Rails API app to parse documents.\n\n[![Deploy](https://www.herokucdn.com/deploy/button.svg)](https://heroku.com/deploy?template=https://github.com/buren/document_parser)\n\n## API\n\n__Parse a document__:\n\nRequest\n\n`POST /document`.\n\n```json\n{\n  \"file\": \"base64-endcoded-string-goes-here...\"\n}\n```\n\nResponse\n\n```json\n{\n    \"title\": \"The title of the document\",\n    \"content_type\": \"application/pdf\",\n    \"created_at\": \"2015-08-17T22:29:23Z\",\n    \"metadata\": {\n        \"Content-Type\": \"application/pdf\",\n        \"Creation-Date\": \"2015-08-17T22:29:23Z\",\n        \"X-Parsed-By\": [\n            \"org.apache.tika.parser.DefaultParser\",\n            \"org.apache.tika.parser.pdf.PDFParser\"\n        ],\n        \"title\": \"The title of the document\",\n        \"xmpTPg:NPages\": \"1\"\n    },\n    \"text\": \"The contents of the document will be returned here.\"\n}\n```\n\n## Configuration\n\n__Configuration__\n\n`API_AUTH_TOKEN` - optional ENV-variable to protect API endpoints.\n\n## Deploy\n\nOne click deploy to Heroku.\n\n[![Deploy](https://www.herokucdn.com/deploy/button.svg)](https://heroku.com/deploy?template=https://github.com/buren/document_parser)\n\n## Dependencies\n\n* Rails 5.1\n* System dependencies\n  - Ruby 2.4\n  - java\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fburen%2Fdocument_parser","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fburen%2Fdocument_parser","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fburen%2Fdocument_parser/lists"}