Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/arrix/node-readability
Server side readability with node.js
https://github.com/arrix/node-readability
Last synced: 16 days ago
JSON representation
Server side readability with node.js
- Host: GitHub
- URL: https://github.com/arrix/node-readability
- Owner: arrix
- License: mit
- Created: 2010-11-16T15:17:13.000Z (almost 14 years ago)
- Default Branch: master
- Last Pushed: 2011-08-17T21:46:27.000Z (about 13 years ago)
- Last Synced: 2024-08-01T08:06:42.090Z (3 months ago)
- Language: JavaScript
- Homepage:
- Size: 618 KB
- Stars: 397
- Watchers: 22
- Forks: 86
- Open Issues: 9
-
Metadata Files:
- Readme: README.md
- License: LICENSE.txt
Awesome Lists containing this project
README
# node-readability
[Readability.js by Arc90](http://lab.arc90.com/experiments/readability/) ported to node.js.Blog post: [Server side readability with node.js](http://arrix.blogspot.com/2010/11/server-side-readability-with-nodejs.html)
## Requirements
* [node.js](http://nodejs.org/)
* [jsdom](https://github.com/tmpvar/jsdom)
* [htmlparser](https://github.com/tautologistics/node-htmlparser)## Live demo
I'm working on it...
## Examplevar readability = require('readability');
//...
// This is an very early example. The API is subject to change.
readability.parse(html, url, function(result) {
console.log(result.title, result.content);
});## Performance
In my testing of 140 pages with an average size of **58KB** collected from [digg](http://digg.com/news.rss), [delicious](http://feeds.delicious.com/v2/rss/?count=50) and [hacker news](http://news.ycombinator.com/rss), the average time taken for each page is about **1.1 seconds** on a Mac Mini (2.4G Intel Core 2 Duo).
## Limitation
* no fetching next pages
* no support for frames## Plan
* Performance optimization
* Better API, more options
* Support more readability features