https://github.com/bdurand/html_to_plain_text

A ruby gem that can convert HTML to formatted plain text.

= HTML To Plain Text

gem install html_to_plain_text

A simple gem that provides code to convert HTML into a plain text alternative. Line breaks from HTML block-level elements are maintained. Lists and tables also keep a little bit of formatting.

* Line breaks will be approximated using the generally established default margins for HTML tags (e.g. a <p> tag generates two line breaks, a <div> generates one)
* List items will be numbered or bulleted with an asterisk
* <br> tags will add line breaks
* <hr> tags will add a string of hyphens to serve as a horizontal rule
* <table> elements will be enclosed in "|" delimiters
* <a> tags will have the href URL appended to the text in parentheses
* Formatting tags like <strong> or <b> will be stripped
* Formatting inside <pre> or <plaintext> elements will be honored
* Code-like tags like <script> or <style> will be stripped
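
To make a few of these rules concrete, here is a rough, regex-only sketch in plain Ruby. It is not the gem's implementation; the helper name rough_plain_text, the 20-hyphen rule, and the choice of which tags to handle are all assumptions made for illustration. It only covers line breaks, horizontal rules, links, and tag stripping, and it ignores lists, tables, and preformatted text.

```ruby
require "cgi"

# Hypothetical helper, NOT the gem's implementation: a regex-only
# approximation of a handful of the conversion rules.
def rough_plain_text(html)
  text = html.dup
  # Drop code-like elements together with their contents.
  text.gsub!(%r{<(script|style)\b[^>]*>.*?</\1>}mi, "")
  # Append the href URL in parentheses after the link text.
  text.gsub!(%r{<a\b[^>]*\bhref=["']([^"']*)["'][^>]*>(.*?)</a>}mi) { "#{$2} (#{$1})" }
  # <br> becomes a line break; <hr> becomes a row of hyphens on its own line
  # (the width of 20 is an arbitrary assumption).
  text.gsub!(%r{<br\s*/?>}i, "\n")
  text.gsub!(%r{<hr\s*/?>}i, "\n#{'-' * 20}\n")
  # Closing paragraph-like tags produce a blank line, </div> a single break.
  text.gsub!(%r{</(p|h[1-6])>}i, "\n\n")
  text.gsub!(%r{</div>}i, "\n")
  # Strip every remaining tag; this covers formatting tags such as <b>.
  text.gsub!(/<[^>]+>/, "")
  # Decode entities, collapse runs of three or more newlines, trim the ends.
  CGI.unescapeHTML(text).gsub(/\n{3,}/, "\n\n").strip
end

puts rough_plain_text('Visit <a href="https://example.com">the <b>site</b></a><br>now')
```

Regular expressions cannot handle nested or malformed markup reliably, so a sketch like this is only useful for seeing the rules in action; real HTML calls for a proper parser.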

== Usage

require 'html_to_plain_text'
html = "<h1>Hello</h1><p>world!</p>"
HtmlToPlainText.plain_text(html)
# => "Hello\n\nworld!"