Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/joyceannie/web-crawler
The crawler downloads textual content from a particular domain and removes noise using site-agnostic techniques.
Last synced: 24 days ago
- Host: GitHub
- URL: https://github.com/joyceannie/web-crawler
- Owner: joyceannie
- Created: 2017-06-23T18:11:38.000Z (over 7 years ago)
- Default Branch: master
- Last Pushed: 2017-06-23T18:37:11.000Z (over 7 years ago)
- Last Synced: 2023-03-05T11:33:10.193Z (almost 2 years ago)
- Topics: java
- Language: Java
- Homepage:
- Size: 45.9 KB
- Stars: 0
- Watchers: 2
- Forks: 0
- Open Issues: 0