An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with dialect-identification

A curated list of projects in awesome lists tagged with dialect-identification .

https://github.com/instadeepai/tunbert

TunBERT is the first release of a pre-trained BERT model for the Tunisian dialect using a Tunisian Common-Crawl-based dataset. TunBERT was applied to three NLP downstream tasks: Sentiment Analysis (SA), Tunisian Dialect Identification (TDI) and Reading Comprehension Question-Answering (RCQA)

bert-models dialect-identification nlp question-answering sentiment-analysis

Last synced: 30 Jan 2025

https://github.com/qcri/arabic_speech_code_switching

The first Dialectal Arabic Code Switching - DACS corpus from broadcast speech. Annotated at the token-level, considering both the linguistic and the acoustic cues. This dataset is a potential benchmark for DCS in spontaneous speech.

acoustic arabic asr codeswitching dialect-identification egyptian evaluation lexical mordern-standard-arabic

Last synced: 19 Feb 2025