Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/armelvidali/derja_ninja_scraper

Get Tunisian translation, audio and sample sentence for the most common 20.000 english word
https://github.com/armelvidali/derja_ninja_scraper

anki arabic language learn python scraping tunisian

Last synced: 2 months ago
JSON representation

Get Tunisian translation, audio and sample sentence for the most common 20.000 english word

Awesome Lists containing this project

README

        

# Build your Tunisian arabic dataset

Get Tunisian translation, audio and sample sentence for the most common 20.000 english word for the awsome website https://derja.ninja/

You can get the Anki flashcard deck i created with this script, containing 11.215 words and sentences here : https://ankiweb.net/shared/info/1875413457, looking like this :

image

## Install

To download the project, run

git clone https://github.com/ArmelVidali/derja_ninja_scraper

Install depencencies

pip install -r requirements.txt

Run scraper.py to start scraping :

python scraper.py

Run anki.py (need Anki app installed on your computer, with Anki connector https://foosoft.net/projects/anki-connect/, with the Anki desktop app opened)

python anki.py

## Output

You will get a `translations_results.json` with the translations and an audio url for each translated word and sample sentence provided by Derja Ninja.
If you want to download the audios, use the `download_mp3` function from `download_audios.py`.

# Contribution

This project is open to any contribution, I'd be happy to add new features or make a website out of it !