Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/markzhai/erogamescape-mining
A spider and data-mining project on ErogameScape
https://github.com/markzhai/erogamescape-mining
Last synced: about 1 month ago
JSON representation
A spider and data-mining project on ErogameScape
- Host: GitHub
- URL: https://github.com/markzhai/erogamescape-mining
- Owner: markzhai
- Created: 2013-11-02T12:16:37.000Z (about 11 years ago)
- Default Branch: master
- Last Pushed: 2013-11-11T04:42:43.000Z (about 11 years ago)
- Last Synced: 2024-04-09T21:38:11.411Z (9 months ago)
- Language: Python
- Size: 152 KB
- Stars: 5
- Watchers: 3
- Forks: 1
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
ErogameScape-Mining
===================A spider and data-mining project on ErogameScape.
Plan
----
* X[] Find hidden tags.
* [o] Grab comments.
* [X] Grab game pov.
* [X] User recommendations.
* [X] Game recommendations.Redis
-----
How data stores in redis:1. Hash - game:$id -> title, brand_id
HMSET game:16506 title この大空に、翼をひろげて brand_id 689
HGETALL game:16506
HEXISTS game:16506 title3. String - brand:$id -> brand_name
SET brand:689 PULLTOP
4. Hash - uid:game_idHMSET comment:yamadayo:7062 score 65 playtime 30h date 2013年11月04日02時13分14秒 comment "個別が鈴√と来ヶ谷以外全く面白くない" netabare 1
5. Set - indexes for later mining entry: games, users, brands, $user:games
SADD games "16506"
SMEMBERS games
SADD users "yamadayo" "christia"
SMEMBERS users
SADD brands 689
SADD yamadayo:games 165066. List - new_commented_games (can use LTRIM to create a list that just remembers the lastest N elements)
LPUSH new_commented_games 16506
LRANGE new_commented_games 0 9Spider
------
* spider_comment.py - grab user comments including score, playtime, comment text, etc.
* spider_game.py - grab game pov.Versioning
----------
At version 0.0.1.