https://github.com/suminb/boilerpipe
Extract the main article from a given web page
- Host: GitHub
- URL: https://github.com/suminb/boilerpipe
- Owner: suminb
- Created: 2013-04-30T01:03:42.000Z (about 13 years ago)
- Default Branch: master
- Last Pushed: 2013-06-26T07:37:54.000Z (about 13 years ago)
- Last Synced: 2025-12-27T08:24:44.769Z (6 months ago)
- Language: Java
- Size: 1.4 MB
- Stars: 1
- Watchers: 1
- Forks: 0
- Open Issues: 0
Metadata Files:
- Readme: README.md
README
Boilerpipe
============
The boilerpipe library extracts the main textual content of a web page. The original project can be found at . The primary purpose of this project is to implement a command-line frontend for it.
Usage
------
Pass either a URL (`-u`) or a path to a local file (`-f`):

    ./boilerpipe.sh -u <URL>
    ./boilerpipe.sh -f <PATH>
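
The two invocations above suggest a frontend that accepts exactly one input source. As a rough illustration only, the following self-contained Java sketch shows one way such option handling could look. The class `CliSketch`, its `parse` method, and the `Kind` and `Input` types are hypothetical names invented for this example; they are not taken from this project or from the boilerpipe library, and the sketch stops short of calling any extractor.

```java
// Hypothetical sketch: none of these names come from the project itself.
class CliSketch {
    // The two input sources suggested by the usage lines (-u and -f).
    enum Kind { URL, FILE }

    // Simple holder for the parsed option and its argument.
    static final class Input {
        final Kind kind;
        final String value;

        Input(Kind kind, String value) {
            this.kind = kind;
            this.value = value;
        }
    }

    // Accepts exactly one of "-u <url>" or "-f <path>"; anything else is rejected.
    static Input parse(String[] args) {
        if (args.length != 2) {
            throw new IllegalArgumentException("usage: -u <url> | -f <path>");
        }
        switch (args[0]) {
            case "-u":
                return new Input(Kind.URL, args[1]);
            case "-f":
                return new Input(Kind.FILE, args[1]);
            default:
                throw new IllegalArgumentException("unknown option: " + args[0]);
        }
    }

    public static void main(String[] args) {
        try {
            Input in = parse(args);
            // A real frontend would fetch or read the HTML at this point
            // and hand it to the extractor; this sketch only echoes the input.
            System.out.println(in.kind + " " + in.value);
        } catch (IllegalArgumentException e) {
            System.err.println(e.getMessage());
        }
    }
}
```

Keeping the parsing in a small static method separates it from the extraction step, so either half can be changed or tested on its own.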