Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/cnseniorious000/python-readability
Python wrapper for @mozilla/readability
- Host: GitHub
- URL: https://github.com/cnseniorious000/python-readability
- Owner: CNSeniorious000
- Created: 2024-10-23T15:13:13.000Z (3 months ago)
- Default Branch: main
- Last Pushed: 2024-12-07T10:21:33.000Z (about 1 month ago)
- Last Synced: 2024-12-09T11:48:04.912Z (about 1 month ago)
- Topics: html, llm, mozilla, readability, scraping, web
- Language: Python
- Homepage: https://cnseniorious000.github.io/python-readability/
- Size: 12.7 KB
- Stars: 0
- Watchers: 2
- Forks: 0
- Open Issues: 0
Metadata Files:
- Readme: README.md
# Readability.py
This is a Python wrapper for [@mozilla/readability](https://github.com/mozilla/readability), a standalone version of the readability library used for [Firefox Reader View](https://support.mozilla.org/kb/firefox-reader-view-clutter-free-web-pages).
## Installation
```sh
pip install python-readability
```
## Usage
```py
from readability import parse

parse(html_string, **options)
```
See the original documentation and the type hints for details.
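As a rough illustration of the call shape above, the sketch below wraps `parse` in a small helper. The helper name `extract` and its `parser` parameter are hypothetical and not part of this package; they exist only so the function can be exercised without a JavaScript engine installed. The return value is passed through untouched, since its exact shape is defined by the package's type hints rather than assumed here.

```python
from typing import Any, Callable, Optional


def extract(html: str, parser: Optional[Callable[..., Any]] = None, **options: Any) -> Any:
    """Run a Readability-style parser over `html` and return its result unchanged.

    `parser` is a hypothetical injection point (an assumption of this sketch).
    When it is omitted, the package's own `parse` is imported lazily, so the
    import cost and the JavaScript engine are only needed on first real use.
    """
    if parser is None:
        # Requires `pip install python-readability`.
        from readability import parse as parser
    # Options are forwarded as-is; see the original documentation for valid names.
    return parser(html, **options)
```

With the package installed, `extract(html_string)` should behave like calling `parse(html_string)` directly.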
## Requirements
Since this package is a wrapper for the original JavaScript library, it needs a JavaScript engine to run the code.
For now, if you are running this package in a regular CPython environment, it will use [pythonmonkey](https://github.com/Distributive-Network/PythonMonkey) to interpret JavaScript, which requires Python 3.8+.
This package is also available inside [pyodide](https://github.com/pyodide/pyodide) because it can use the native JavaScript engine that `pyodide` runs on.
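The choice between the two engines described above could be sketched as follows. This is an assumption about how such a check might look, not the package's actual implementation: the function name `detect_js_backend` and the returned labels are made up for illustration. It relies on the fact that Pyodide is built with Emscripten, so `sys.platform` is `"emscripten"` there.

```python
import sys


def detect_js_backend(platform: str = sys.platform) -> str:
    """Return a label for the JavaScript engine a wrapper could use.

    Hypothetical helper: the labels are illustrative, not package constants.
    """
    # Under Pyodide, Python runs inside a JavaScript host (browser or Node.js).
    if platform == "emscripten":
        return "pyodide"
    # On regular CPython, an embedded engine such as PythonMonkey is needed.
    return "pythonmonkey"
```

The `platform` parameter defaults to the running interpreter's value and is exposed only so both branches can be checked from a single environment.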