Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/Alir3z4/html2text
Convert HTML to Markdown-formatted text.
https://github.com/Alir3z4/html2text
markdown markdown-parser python
Last synced: about 2 months ago
JSON representation
Convert HTML to Markdown-formatted text.
- Host: GitHub
- URL: https://github.com/Alir3z4/html2text
- Owner: Alir3z4
- License: gpl-3.0
- Created: 2014-02-19T22:41:11.000Z (over 10 years ago)
- Default Branch: master
- Last Pushed: 2024-04-21T19:10:09.000Z (2 months ago)
- Last Synced: 2024-04-27T11:26:27.700Z (about 2 months ago)
- Topics: markdown, markdown-parser, python
- Language: Python
- Homepage: alir3z4.github.io/html2text/
- Size: 1.21 MB
- Stars: 1,658
- Watchers: 26
- Forks: 261
- Open Issues: 87
-
Metadata Files:
- Readme: README.md
- Changelog: ChangeLog.rst
- Contributing: docs/contributing.md
- License: COPYING
Lists
- awesome-python - html2text - Convert HTML to Markdown-formatted text. (Web Content Extracting)
- awesome-python-cn - html2text
- awesome-python - html2text - 3.0](https://api.github.com/licenses/gpl-3.0)- Convert HTML to Markdown-formatted text. (Awesome Python / Web Content Extracting)
- awesome-python - html2text - Convert HTML to Markdown-formatted text. (Web Content Extracting)
- Awesome-Python - html2text - Convert HTML to Markdown-formatted text. (Web Content Extracting)
- Python-Awesome - html2text - Convert HTML to Markdown-formatted text. (Web Content Extracting)
- awesome-python-resources - GitHub - 39% open · ⏱️ 22.02.2022): (网络)
- awesome-stars - Alir3z4/html2text - Convert HTML to Markdown-formatted text. (Python)
- python-awesome-case1 - html2text - Convert HTML to Markdown-formatted text. (Web Content Extracting)
- awesome-python - html2text - Convert HTML to Markdown-formatted text. (Web Content Extracting)
- awesome-python - html2text - Convert HTML to Markdown-formatted text. (Web Content Extracting)
- fucking-awesome-python - :octocat: html2text - :star: 1433 :fork_and_knife: 242 - Convert HTML to Markdown-formatted text. (Web Content Extracting)
- awesome-python-master - html2text - Convert HTML to Markdown-formatted text. (Web Content Extracting)
- awesome-python - html2text - Convert HTML to Markdown-formatted text. (Web Content Extracting)
- awesome-python - html2text - Convert HTML to Markdown-formatted text. (Web Content Extracting)
- awesome_python - html2text - Convert HTML to Markdown-formatted text. (Web Content Extracting)
- my-awesome-starred - Alir3z4/html2text - Convert HTML to Markdown-formatted text. (Python)
- awesome-python - html2text - Convert HTML to Markdown-formatted text. (Web Content Extracting)
- awesome-from-stars - Alir3z4/html2text - formatted text. (HarmonyOS / Windows Manager)
- awesome-python-cn - html2text
- join-awesome-python-interview-topics - html2text - Convert HTML to Markdown-formatted text. (Web Content Extracting)
- awesome-python - html2text - Convert HTML to Markdown-formatted text. (Web Content Extracting)
- awesome-python - html2text - Convert HTML to Markdown-formatted text. (Web Content Extracting)
- my-awesome-stars - Alir3z4/html2text - Convert HTML to Markdown-formatted text. (Python)
- awesome-python-cn - 官网
- awesome-python-clone - html2text - Convert HTML to Markdown-formatted text. (Web Content Extracting)
- my-awesome-stars - Alir3z4/html2text - Convert HTML to Markdown-formatted text. (Python)
- awesome-python - html2text - Convert HTML to Markdown-formatted text. (Web Content Extracting)
- awesome-python4 - html2text - Convert HTML to Markdown-formatted text. (Web Content Extracting)
- awesome-python-resources-all - html2text - Convert HTML to Markdown-formatted text. (Web Content Extracting)
- fucking-awesome-python - :octocat: html2text - :star: 1014 :fork_and_knife: 176 - Convert HTML to Markdown-formatted text. (Web Content Extracting)
- awesome-python - html2text - Convert HTML to Markdown-formatted text. (Web Content Extracting)
- awesome-python - html2text - Convert HTML to Markdown-formatted text. (Web Content Extracting)
- awesome-python - html2text - Convert HTML to Markdown-formatted text. (Web Content Extracting)
- awesome-python - html2text - Convert HTML to Markdown-formatted text. (Web Content Extracting)
- awesome-python - html2text - Convert HTML to Markdown-formatted text. (Web Content Extracting)
- awesome-python - html2text - Convert HTML to Markdown-formatted text. (Web Content Extracting)
- awesome_python_with_star - Alir3z4/html2text - formatted text.|879| (Web Content Extracting)
- awesome-python - html2text - Convert HTML to Markdown-formatted text. (Web Content Extracting)
- awesome-python-cn - 官网
- awesome-python - html2text - Convert HTML to Markdown-formatted text. (Web Content Extracting)
- git-github.com-vinta-awesome-python - html2text - Convert HTML to Markdown-formatted text. (Web Content Extracting)
- awesome-python-master - html2text - Convert HTML to Markdown-formatted text. (Web Content Extracting)
- python-awesome - html2text - Convert HTML to Markdown-formatted text. (Web Content Extracting)
- my-awesome - Alir3z4/html2text - Convert HTML to Markdown-formatted text. (Python)
- awesomePython - html2text - Convert HTML to Markdown-formatted text. (Web Content Extracting)
- awesome-python - html2text - Convert HTML to Markdown-formatted text. (Web Content Extracting)
- awesome-python-zh - html2text - 将HTML转换为Markdown格式的文本。 (Web内容提取)
- awesome-python - html2text - Convert HTML to Markdown-formatted text. (Web Content Extracting)
- awesome-python - html2text - Convert HTML to Markdown-formatted text. (Web Content Extracting)
- fucking_awesome_python - html2text - Convert HTML to Markdown-formatted text. (Web Content Extracting)
- awesome-python - html2text - Convert HTML to Markdown-formatted text. (Web Content Extracting)
- Mpaperlee-awesome-python - html2text - Convert HTML to Markdown-formatted text. (Web Content Extracting)
- awesome-python - html2text - Convert HTML to Markdown-formatted text. (Web Content Extracting)
- starred-awesome - html2text - Convert HTML to Markdown-formatted text. (Python)
- awesome-python - html2text - Convert HTML to Markdown-formatted text. (Web Content Extracting)
- awesome_python - html2text - Convert HTML to Markdown-formatted text. (Web Content Extracting)
- awesome-python - html2text - Convert HTML to Markdown-formatted text. (Web Content Extracting)
- awesome-stars - html2text - formatted text. | Alir3z4 | 1721 | (Python)
- awesome-python - html2text - Convert HTML to Markdown-formatted text. (Web Content Extracting)
- project-awesome - Alir3z4/html2text - Convert HTML to Markdown-formatted text. (Python)
- awesome-python - html2text - Convert HTML to Markdown-formatted text. (Web Content Extracting)
- awesome-python - html2text - Convert HTML to Markdown-formatted text. (Web Content Extracting)
- awesome-python - html2text - Convert HTML to Markdown-formatted text. (Web Content Extracting)
- awesome-stars - Alir3z4/html2text - `★1723` Convert HTML to Markdown-formatted text. (Python)
- awesome-python - html2text - Convert HTML to Markdown-formatted text. (Web Content Extracting)
- awesome - html2text - Convert HTML to Markdown-formatted text. (Python)
- awesome-stars - Alir3z4/html2text - Convert HTML to Markdown-formatted text. (Python)
- awesome-python - html2text - Convert HTML to Markdown-formatted text. ` 📝 14 days ago ` (Web Content Extracting [🔝](#readme))
README
# html2text
[![Build Status](https://secure.travis-ci.org/Alir3z4/html2text.png)](https://travis-ci.org/Alir3z4/html2text)
[![Coverage Status](https://coveralls.io/repos/Alir3z4/html2text/badge.png)](https://coveralls.io/r/Alir3z4/html2text)
[![Downloads](http://badge.kloud51.com/pypi/d/html2text.png)](https://pypi.org/project/html2text/)
[![Version](http://badge.kloud51.com/pypi/v/html2text.png)](https://pypi.org/project/html2text/)
[![Wheel?](http://badge.kloud51.com/pypi/wheel/html2text.png)](https://pypi.org/project/html2text/)
[![Format](http://badge.kloud51.com/pypi/format/html2text.png)](https://pypi.org/project/html2text/)
[![License](http://badge.kloud51.com/pypi/license/html2text.png)](https://pypi.org/project/html2text/)html2text is a Python script that converts a page of HTML into clean, easy-to-read plain ASCII text. Better yet, that ASCII also happens to be valid Markdown (a text-to-HTML format).
Usage: `html2text [filename [encoding]]`
| Option | Description
|--------------------------------------------------------|---------------------------------------------------
| `--version` | Show program's version number and exit
| `-h`, `--help` | Show this help message and exit
| `--ignore-links` | Don't include any formatting for links
|`--escape-all` | Escape all special characters. Output is less readable, but avoids corner case formatting issues.
| `--reference-links` | Use reference links instead of links to create markdown
| `--mark-code` | Mark preformatted and code blocks with [code]...[/code]For a complete list of options see the [docs](https://github.com/Alir3z4/html2text/blob/master/docs/usage.md)
Or you can use it from within `Python`:
```
>>> import html2text
>>>
>>> print(html2text.html2text("Zed's dead baby, Zed's dead.
"))
**Zed's** dead baby, _Zed's_ dead.```
Or with some configuration options:
```
>>> import html2text
>>>
>>> h = html2text.HTML2Text()
>>> # Ignore converting links from HTML
>>> h.ignore_links = True
>>> print h.handle("Hello, world!")
Hello, world!>>> print(h.handle("
Hello, world!"))
Hello, world!
>>> # Don't Ignore links anymore, I like links
>>> h.ignore_links = False
>>> print(h.handle("Hello, world!"))
Hello, [world](https://www.google.com/earth/)!```
*Originally written by Aaron Swartz. This code is distributed under the GPLv3.*
## How to install
`html2text` is available on pypi
https://pypi.org/project/html2text/```
$ pip install html2text
```## How to run unit tests
tox
To see the coverage results:
coverage html
then open the `./htmlcov/index.html` file in your browser.
## Documentation
Documentation lives [here](https://github.com/Alir3z4/html2text/blob/master/docs/usage.md)