Projects in Awesome Lists tagged with dialect-identification
A curated list of projects in awesome lists tagged with dialect-identification .
https://github.com/instadeepai/tunbert
TunBERT is the first release of a pre-trained BERT model for the Tunisian dialect using a Tunisian Common-Crawl-based dataset. TunBERT was applied to three NLP downstream tasks: Sentiment Analysis (SA), Tunisian Dialect Identification (TDI) and Reading Comprehension Question-Answering (RCQA)
bert-models dialect-identification nlp question-answering sentiment-analysis
Last synced: 30 Jan 2025
https://github.com/qcri/arabic_speech_code_switching
The first Dialectal Arabic Code Switching - DACS corpus from broadcast speech. Annotated at the token-level, considering both the linguistic and the acoustic cues. This dataset is a potential benchmark for DCS in spontaneous speech.
acoustic arabic asr codeswitching dialect-identification egyptian evaluation lexical mordern-standard-arabic
Last synced: 19 Feb 2025
https://github.com/sinaahmadi/CORDI
Language and Speech Technology for Central Kurdish Varieties (LREC-COLING 2024)
automatic-speech-recognition dialect-identification erbil kurdish kurdish-language-processing language-identification machine-translation mahabad sanandaj sorani sulaymaniyah
Last synced: 07 May 2025
https://github.com/mohamedsebaie/arabic_dialect_identification_nlp-aim-task
Arabic_Dialect_Identification_NLP-AIM-Task
arabert bert-fine-tuning dialect-identification farasa linearsvc nlp-machine-learning preprocessing
Last synced: 20 Nov 2024