https://github.com/racinmat/myanimelist-crawler
Crawler for myanimelist
https://github.com/racinmat/myanimelist-crawler
Last synced: about 2 months ago
JSON representation
Crawler for myanimelist
- Host: GitHub
- URL: https://github.com/racinmat/myanimelist-crawler
- Owner: racinmat
- License: gpl-3.0
- Created: 2018-05-12T10:58:16.000Z (about 7 years ago)
- Default Branch: master
- Last Pushed: 2025-01-16T17:46:10.000Z (5 months ago)
- Last Synced: 2025-03-22T14:41:23.654Z (2 months ago)
- Language: Python
- Size: 22.9 MB
- Stars: 21
- Watchers: 1
- Forks: 6
- Open Issues: 2
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# myanimelist-data-set-creator
Collection of some simple python scripts to create https://myanimelist.net/ anime and user data set.# How to use Anime Dataset Generator
This script can be used to download anime dataset from [**Myanimelist**](https://myanimelist.net/) using an unofficial MyAnimeList REST API, [**Jikan**](https://jikan.me/docs).
#### Column metadata:
* animeID: id of anime as in anime url [https://myanimelist.net/anime/**1**](https://myanimelist.net/anime/1)
* name: title of anime
* premiered: premiered on. default format (season year)
* genre: list of genre
* type: type of anime (example TV, Movie etc)
* episodes: number of episodes
* studios: list of studio
* source: source of anime (example original, manga, game etc)
* scored: score of anime
* scoredBy: number of member scored the anime
* members: number of member added anime to their list***
### Using your own PC/Laptop
#### Syntax
```
python getAnime.py starting_index ending_index [output_file.csv]
```#### Demo:

***
# How to use User Dataset Generator
This script can be used to download user dataset from [**Myanimelist**](https://myanimelist.net/) using an API, [**Kuristina**](https://github.com/TimboKZ/kuristina).
#### Column metadata:
* userID: MAL user ID
* animeID: id of anime as in anime url https://myanimelist.net/anime/ID
* score: score by the use for anime with id = animeID (if user haven't score the anime then this field is 0).#### Syntax
```
python getUser.py UserList.txt [User.csv]
```**NOTE**: Make sure you have a **Userlist.txt** file containing the name of the users. If you don't have that then use the scrapper([scrap from club]() or [scrap from post]()).
#### How to create User List from forum post:
For this you need to get topic ID.
Go to [**MAL**](https://myanimelist.net/) -> [**Community** -> **Forums**](https://myanimelist.net/forum/) -> **Select a forum**For example for the following forums links their respective ID are highlighted in bold below:
[https://myanimelist.net/forum/?topicid=1699126](https://myanimelist.net/forum/?topicid=1699126) -> **1699126**
[https://myanimelist.net/forum/?topicid=1696289](https://myanimelist.net/forum/?topicid=1696289) -> **1696289**
After getting the topic ID, you can use **createUserListFromPost** script.
###### Syntax:
```
python createUserListFromPost.py topicID [UserList.txt]
```#### How to create User List from club:
For this you need to get club ID.
Go to [**MAL**](https://myanimelist.net/) -> [**Community** -> **Clubs**](https://myanimelist.net/forum/) -> **Select a club**For example for the following clubs links their respective ID are highlighted in red below:
[https://myanimelist.net/clubs.php?cid=72250](https://myanimelist.net/clubs.php?cid=72250) -> **72250**
[https://myanimelist.net/clubs.php?cid=32683](https://myanimelist.net/clubs.php?cid=32683) -> **32683**
After getting the topic ID, you can use **createUserListFromClub** script.
###### Syntax:
```
python createUserListFromClub.py clubID [UserList.txt]
```#### Demo:
###### Create User List from forum

###### Create User List from club

###### Get user dataset

***
#### TO DO LIST
* Scrapping Locally ✔
* Scrapping using Heroku ✔
* Creating Heroku Deploy Button ⌛