https://github.com/puhoy/readability_cli
a cli tool to fetch webpages main content and print it as markdown
https://github.com/puhoy/readability_cli
fetch-webpages html-to-markdown html2text markdown python3 readability readability-cli readability-lxml
Last synced: 2 months ago
JSON representation
a cli tool to fetch webpages main content and print it as markdown
- Host: GitHub
- URL: https://github.com/puhoy/readability_cli
- Owner: puhoy
- Created: 2020-10-20T20:49:46.000Z (over 4 years ago)
- Default Branch: master
- Last Pushed: 2020-10-31T11:24:48.000Z (over 4 years ago)
- Last Synced: 2025-01-18T04:43:26.815Z (4 months ago)
- Topics: fetch-webpages, html-to-markdown, html2text, markdown, python3, readability, readability-cli, readability-lxml
- Language: Python
- Homepage:
- Size: 1.95 KB
- Stars: 1
- Watchers: 3
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# readability_cli
a really small thing to fetch webpages main content and print it as markdown
basically its just [readability-lxml](https://github.com/buriy/python-readability) and [html2text](https://github.com/Alir3z4/html2text) glued together
## installation
pip install git+https://github.com/puhoy/readability_cli --user
## usage
readcli SOMEURL
from there, you could pipe the result through [pygments](https://github.com/pygments/pygments) and a pager, for example:
readcli https://github.com/puhoy/lspace | pygmentize -l md | less -R
or append website content to a file using vim:
:read !readcli https://some/url