Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/bridgeconn/anyfile_to_text
text extraction
https://github.com/bridgeconn/anyfile_to_text
Last synced: about 2 months ago
JSON representation
text extraction
- Host: GitHub
- URL: https://github.com/bridgeconn/anyfile_to_text
- Owner: Bridgeconn
- Created: 2016-02-10T15:05:03.000Z (almost 9 years ago)
- Default Branch: master
- Last Pushed: 2016-02-10T16:26:38.000Z (almost 9 years ago)
- Last Synced: 2023-08-18T04:10:25.509Z (over 1 year ago)
- Language: Ruby
- Homepage:
- Size: 1.95 KB
- Stars: 0
- Watchers: 3
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# AnyFile_to_Text
This script convert any file type to .txt format.
### Here are some of the formats supported:
1. Microsoft Office OLE 2 and Office Open XML Formats (.doc, .docx, .xls, .xlsx, .ppt, .pptx)
2. OpenOffice.org OpenDocument Formats (.odt, .ods, .odp)
3. Apple iWorks Formats
4. Rich Text Format (.rtf)
5. Portable Document Format (.pdf)## DEPENDENCIES
Requires a working 7 JRE for it to work. Download before run script for [JRE Download Link](http://openjdk.java.net/install/)
1. **you can run from terminal in Debian, Ubuntu, etc. $ sudo apt-get install openjdk-7-jre**
2. **Fedora, Oracle Linux, Red Hat Enterprise Linux, etc. $ su -c "yum install java-1.7.0-openjdk"**### Please follow below steps for convert file
1. Put you file in folder.
2. File may be multiple or single
3. Output will be current folder where Script will be present### Make file executable with below command from console/ Terminal
$ chmod +x doc_to_text.rb### Please check example file with below command:
$ ./doc_to_text.rbDeveloped by Uday Kumar [email protected] [Bridge Connectivity Solutions](http://bridgeconn.com/)