Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/armelvidali/derja_ninja_scraper
Get Tunisian translation, audio and sample sentence for the most common 20.000 english word
https://github.com/armelvidali/derja_ninja_scraper
anki arabic language learn python scraping tunisian
Last synced: 2 months ago
JSON representation
Get Tunisian translation, audio and sample sentence for the most common 20.000 english word
- Host: GitHub
- URL: https://github.com/armelvidali/derja_ninja_scraper
- Owner: ArmelVidali
- Created: 2023-11-26T17:22:51.000Z (about 1 year ago)
- Default Branch: main
- Last Pushed: 2024-01-20T22:33:13.000Z (about 1 year ago)
- Last Synced: 2024-01-20T23:29:25.201Z (about 1 year ago)
- Topics: anki, arabic, language, learn, python, scraping, tunisian
- Language: Python
- Homepage:
- Size: 3.47 MB
- Stars: 9
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# Build your Tunisian arabic dataset
Get Tunisian translation, audio and sample sentence for the most common 20.000 english word for the awsome website https://derja.ninja/
You can get the Anki flashcard deck i created with this script, containing 11.215 words and sentences here : https://ankiweb.net/shared/info/1875413457, looking like this :
## Install
To download the project, run
git clone https://github.com/ArmelVidali/derja_ninja_scraper
Install depencencies
pip install -r requirements.txt
Run scraper.py to start scraping :
python scraper.py
Run anki.py (need Anki app installed on your computer, with Anki connector https://foosoft.net/projects/anki-connect/, with the Anki desktop app opened)
python anki.py
## Output
You will get a `translations_results.json` with the translations and an audio url for each translated word and sample sentence provided by Derja Ninja.
If you want to download the audios, use the `download_mp3` function from `download_audios.py`.# Contribution
This project is open to any contribution, I'd be happy to add new features or make a website out of it !