Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/dalthviz/csapp
Crawler-Scrapper for the playstore
https://github.com/dalthviz/csapp
crawler csapp keyword nlp playstore rating review scrapper
Last synced: 25 days ago
JSON representation
Crawler-Scrapper for the playstore
- Host: GitHub
- URL: https://github.com/dalthviz/csapp
- Owner: dalthviz
- License: mit
- Created: 2017-08-20T20:22:19.000Z (over 7 years ago)
- Default Branch: master
- Last Pushed: 2017-08-23T03:58:47.000Z (over 7 years ago)
- Last Synced: 2024-11-13T10:51:46.518Z (3 months ago)
- Topics: crawler, csapp, keyword, nlp, playstore, rating, review, scrapper
- Language: Java
- Size: 96.7 KB
- Stars: 0
- Watchers: 3
- Forks: 1
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# CSApp
Crawler and Scrapper for the PlayStore. Also find the keywords in the reviews by rating.
Generates a `.csv` with the info extracted and .txt for the reviews by rating
(4-5 starts, 3-4 starts, 2-3 starts, 0-2 starts) by one of the categories for apps (58 options the last time I tested)
and from the top_selling apps (free or paid).In the data directory the files are generated. Also there is the script to use `RAKE` (`keywords.py`) that would take any `.txt` in the directory and find the keywords from it. A resume file is created (`resume.txt`)
## Resources:
* [Apache Commons CSV](https://github.com/apache/commons-csv)
* [Jsoup](https://github.com/jhy/jsoup)
* [RAKE](https://github.com/zelandiya/RAKE-tutorial)