Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/aramrw/jpdb.io-scraper
Scrapes jdbp.io for words based on frequency, and writes them to a file.
https://github.com/aramrw/jpdb.io-scraper
japanese jpdb
Last synced: about 1 month ago
JSON representation
Scrapes jdbp.io for words based on frequency, and writes them to a file.
- Host: GitHub
- URL: https://github.com/aramrw/jpdb.io-scraper
- Owner: aramrw
- Created: 2023-08-11T20:04:11.000Z (over 1 year ago)
- Default Branch: main
- Last Pushed: 2023-09-20T01:24:09.000Z (over 1 year ago)
- Last Synced: 2024-04-24T04:47:40.704Z (8 months ago)
- Topics: japanese, jpdb
- Language: JavaScript
- Homepage: https://github.com/aramrw/jp-frequency_sorter
- Size: 23.2 MB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# jpdb.io-scraper
## What Does It Do?
### Automatically scrapes and compiles words from **anime**, **visual novels**, **web / light novels**, + more from [jpdb.io](https://jpdb.io/).
- **Frequency Insights**: Provides frequency analysis to help you identify useful words at your proficiency level before you scrape them.
- **Quickly Scrape Where You Left Off**: Tracks what anime / novels you frequently scrape + suggests similar ones.
- **Built In Frequency Sorter**: Gives you the option to sort words you scraped by using [this frequency dictionary](https://github.com/MarvNC/jpdb-freq-list) or 5 others from [here](https://drive.google.com/drive/folders/1g1drkFzokc8KNpsPHoRmDJ4OtMTWFuXi).[**Here's the standalone version of the sorter without the scraper**](https://github.com/aramrw/jp-frequency_sorter)
**Using Your Own frequency Dictonary(s)**:
+ Download one fIom [here](https://drive.google.com/drive/folders/1g1drkFzokc8KNpsPHoRmDJ4OtMTWFuXi) *or any other ones that are compatible with* [Yomichan](https://chrome.google.com/webstore/detail/yomichan/ogmnaimimemjmbakcfefmnahgdfhfami).
+ copy **index.json** into the **yomichan_dicts** folder *(Make sure to rename **index.json** to the **name of the dictionary**)*. You don't need the meta banks.
```
├── Anime & J-Drama Frequency Dictionary
│ ├── index.json
│ ├── term_meta_bank1.md
│ └── term_meta_bank2.md
...more meta_banks.md
```## Getting Started
Please note that the current version of this tool only works with **Node.js v18.17.1** in the **VsCode Terminal**.
### Install Dependencies
```
npm install puppeteer
```
```
npm install chalk
```## Additional Information
- **Max pages: infinite.
## Special Thanks To:
- [jpdb.io](https://jpdb.io/)
- [jpdb's Discord Server](https://discord.com/invite/jWwVD7D2sZ)