Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/zdavatz/ydocx

Parsing docx files with Ruby and output them as HTML
https://github.com/zdavatz/ydocx

Last synced: about 23 hours ago
JSON representation

Parsing docx files with Ruby and output them as HTML

Awesome Lists containing this project

README

        

== ydocx by ywesee GmbH

* https://github.com/zdavatz/ydocx
* Parsing docx files with Ruby and output them as HTML and XML.

== Supports

* Tables
* Uppercase letters, numbers
* Lowercase letters, numbers
* Umlaute
* bold, italic, underline
* Images
** wmf requires imagemagick and is only partially supported due to imagemagick
** png files are copied 1:1

* works on Windows as well.

== Usage

* Usage: docx2html file [options]
-f, --format Format of style and chapter {(fi|fachinfo)|(pi|patinfo)|(pl|plain)|none}, default fachinfo.
-h, --help Display this help message.
-l, --lang Language option for templates {de|fr}
-v, --version Show version.

== Using the great libraries

* rubyzip (< 1.0.0)
* nokogiri
* htmlentities
* rmagick

Install them using bundle install.

Requires Ruby >= 1.9.2.

== TODO

For unknown reason the gem fails to parse properly the Présentation for sinovial_fr.
Failing spec is: ydocx should convert sinovial_FR to xml
Failure/Error: doc.xpath('//chapters/chapter[contains(heading, "Présentation")]').size.should > 0

== License GPLv3.0

* http://www.gnu.org/licenses/gpl.html