Projects in Awesome Lists by CAMeL-Lab
A curated list of projects in awesome lists by CAMeL-Lab .
https://github.com/camel-lab/arabic_ala-lc_romanization
Romanizing Arabic bibliographic records in the ALA-LC standard.
Last synced: 25 Jan 2026
https://github.com/camel-lab/widh_2020_arabic_text_analysis
Material for the Text Analysis of Arabic course taught at the NYU Abu Dhabi Winter Institute in Digital Humanities 2020.
Last synced: 25 Jan 2026
https://github.com/camel-lab/arabic_error_type_annotation
The Arabic Error Type Annotation tool aims to annotate Arabic error types following the ALC tagset annotation.
Last synced: 25 Jan 2026
https://github.com/camel-lab/gumar-ngrams
The complete [1 to 5]-gram Gumar Corpus in the style of Google n-grams.
Last synced: 25 Jan 2026
https://github.com/camel-lab/arafix_ocr
A tool for improving the output of generic Arabic OCR systems using an n-gram based post-correction approach.
Last synced: 25 Jan 2026
https://github.com/camel-lab/gender-reinflection
Code, models, and data for "Gender-Aware Reinflection using Linguistically Enhanced Neural Models". COLING 2020, GeBNLP.
Last synced: 25 Jan 2026
https://github.com/camel-lab/deseg
Unsupervised, De-lexical, Linguistic Segmentation
Last synced: 25 Jan 2026
https://github.com/camel-lab/camel_arabic_frequency_lists
The repository for the CAMeL Arabic Frequency Lists dataset
Last synced: 25 Jan 2026
https://github.com/camel-lab/camelbert_morphosyntactic_tagger
Code, models, and data for "Morphosyntactic Tagging with Pre-trained Language Models for Arabic and its Dialects". Findings of ACL, 2022.
Last synced: 25 Jan 2026
https://github.com/camel-lab/muddler
The Muddler derived-file sharing utility.
Last synced: 25 Jan 2026
https://github.com/camel-lab/barec-shared-task-2025
Evaluation code and data for the BAREC shared task 2025
Last synced: 25 Jan 2026
https://github.com/camel-lab/gender-rewriting
Code, models, and data for "User-Centric Gender Rewriting". NAACL 2022.
arabic-nlp deep-learning machine-translation nlp text-generation
Last synced: 25 Jan 2026
https://github.com/camel-lab/qalb
Code for "Utilizing Character and Word Embeddings for Text Normalization with Sequence-to-Sequence Models"
Last synced: 25 Jan 2026
https://github.com/camel-lab/arabic-atb-closed-class-list
A Modern Standard Arabic Closed-Class Word List
Last synced: 25 Jan 2026
https://github.com/camel-lab/gender-rewriting-shared-task
Evaluation code and data for the gender rewriting shared task
Last synced: 25 Jan 2026
https://github.com/camel-lab/codafication
Code, models, and data for "Exploiting Dialect Identification in Automatic Dialectal Text Normalization". ArabicNLP 2024, ACL.
arabic arabic-nlp deep-learning nlp text-normalization
Last synced: 25 Jan 2026
https://github.com/camel-lab/palmyra_server
A server that adds extra functionality to Palmyra
Last synced: 25 Jan 2026
https://github.com/camel-lab/camelprop
A dataset with Arabic words, English glosses, sourced from Wikimedia and annotated with maximal diacritization Resources
Last synced: 25 Jan 2026