{"id":17718596,"url":"https://github.com/linuxscout/arramooz","last_synced_at":"2026-02-26T06:53:10.284Z","repository":{"id":46118693,"uuid":"76109744","full_name":"linuxscout/arramooz","owner":"linuxscout","description":"Arabic Dictionary for Morphological analysis","archived":false,"fork":false,"pushed_at":"2024-08-26T16:45:10.000Z","size":36603,"stargazers_count":145,"open_issues_count":6,"forks_count":23,"subscribers_count":11,"default_branch":"master","last_synced_at":"2025-04-15T11:06:57.192Z","etag":null,"topics":[],"latest_commit_sha":null,"homepage":"","language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"gpl-2.0","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/linuxscout.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":"AUTHORS.md","dei":null,"publiccode":null,"codemeta":null}},"created_at":"2016-12-10T11:56:49.000Z","updated_at":"2025-04-07T18:32:07.000Z","dependencies_parsed_at":"2024-10-26T00:29:37.594Z","dependency_job_id":null,"html_url":"https://github.com/linuxscout/arramooz","commit_stats":null,"previous_names":[],"tags_count":1,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/linuxscout%2Farramooz","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/linuxscout%2Farramooz/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/linuxscout%2Farramooz/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/linuxscout%2Farramooz/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/linuxscout","download_url":"https://codeload.github.com/linuxscout/arramooz/tar.gz/refs/heads/master","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":249058370,"owners_count":21205910,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":[],"created_at":"2024-10-25T14:55:00.811Z","updated_at":"2026-02-26T06:53:05.259Z","avatar_url":"https://github.com/linuxscout.png","language":"Python","funding_links":[],"categories":[],"sub_categories":[],"readme":"# Arramooz\nArabic Dictionary for Morphological analysis\n\n[![downloads]( https://img.shields.io/sourceforge/dt/arramooz.svg)](http://sourceforge.org/projects/arramooz)\n[![downloads]( https://img.shields.io/sourceforge/dm/arramooz.svg)](http://sourceforge.org/projects/arramooz)\n\n  Developers:  Taha Zerrouki: http://tahadz.com\n    taha dot zerrouki at gmail dot com\n  Collect data manually Mohamed Kebdani, Morroco \u003c med.kebdani gmail.com\u003e\n\nFeatures |   value\n---------|---------------------------------------------------------------------------------\nAuthors  | [Authors.md](https://github.com/linuxscout/arramooz/master/AUTHORS.md)\nRelease  | 0.3\nLicense  |[GPL](https://github.com/linuxscout/arramooz/master/LICENSE)\nTracker  |[linuxscout/arramooz/Issues](https://github.com/linuxscout/arramooz/issues)\nWebsite  |[http://arramooz.sourceforge.net](http://arramooz.sourceforge.net)\nSource  |[Github](http://github.com/linuxscout/arramooz)\nDownload  |[sourceforge](http://arramooz.sourceforge.net)\nFeedbacks  |[Comments](https://github.com/linuxscout/arramooz/)\nAccounts  |[@Twitter](https://twitter.com/linuxscout)  [@Sourceforge](http://sourceforge.net/projectsarramooz/)\n# Description\n\nArramooz Alwaseet is an open source Arabic dictionary for morphological analyze,\nIt can help Natural Language processing developers.\nThis work is generated from the Ayaspell( Arabic spellchecker) brut data, which are collected manually.\n\nThis dictionary consists of three parts :\n\n- stop words\n- verbs\n- Nouns\n\nIf you would cite it in academic work, can you use this citation\n```\nT. Zerrouki‏, Arramooz Alwaseet : Arabic Dictionary for Morphological analysis,  http://arramooz.sourceforge.net/ https://github.com/linuxscout/arramooz\n```\nor in bibtex format\n```bibtex\n@misc{zerrouki2011arramooz,\n  title={Arramooz Alwaseet : Arabic Dictionary for Morphological analysis},\n  author={Zerrouki, Taha},\n  url={http://arramooz.sourceforge.net/},\n  year={2011}\n}\n```\n## API\nThe python API is available as [arramooz-pysqlite](http://github.com/linuxscout/arramooz-pysqlite)\n\n## Files formats\n\nThose files are available as :\n- Text format (tab separated)\n- SQL database\n- XML  files.\n- StarDict files\n- Python + Sqlite libray \n\n## BUILD Dictionary in multiple format\nThe source files are data folder as open document speadsheet files, then we can build dictionary with\n```\nmake\n```\nwhich will generate xml, sql and text files, and package it in releases folder.\n\n\nTo make Hunspell files only\n```\nmake spell\n```\n\nTo make SatrDict files only\n```\nmake stardict\n```\nNOTE: you must use stardict-editor to Compile releases/stardict/arramooz.sdic in babylon format\n\n\nTo modify the version, you can update $VERSION variable in Makefile file.\n\nTo clean  releases use:\n```\nmake clean\n```\nTo modify data or updating data you can open files in data/ in libreoffice calc format, clean releases, and do make.\n\n## Stopwords \nThe Stop words list is developed in an independent project (see http://arabicstopwords.sourceforge.ne)\n\n\n\n## Data Structure \n\nData Structures in multiple format (csv, sql, xml) are described in [DataStructures.md](docs/datastructures.md)\n\n* nouns and verbs are described in datastructures.md\n* Stop words ( are explained in separate project [Arabic Stopwords](http://github.com/linuxscout/arabicstopwords)\n\n## Script Files:\n\n1- generate the abstract dictionary from the brut manual dictionary:\n```shell\npython2 $SCRIPT/verbs/gen_verb_dict.py -f $DATA_DIR/verbs/verb_dic_data-net.csv \u003e $OUTPUT/verbs.aya.dic\n```\n2- generate the file format (xml, csv, sql) of  dictionary from verbs.aya.dic\n```shell\npython2 $SCRIPT/verbs/gen_verb_dict_format.py -o xml -f $OUTPUT/verbs.aya.dic \u003e $OUTPUT/verbs.xml\n```\n\n* [scripts/verbs]\n\n    1- verbdict_functions.py : functions to handle verbs dict used in the generation process\n    \n    2- verbs/gen_verb_dict.py: generate the abstract dictionary from the brut manual dictionary\n    \n    3- verbs/gen_verb_dict_format.py: generate the file format (xml, csv, sql) of  dictionary from verbs.aya.dic\n    \n* [scripts/nouns]\n\n    1- noundict_functions.py : functions to handle nouns dict used in the generation process\n    \n    2- nouns/gen_noun_dict.py: generate the file format (xml, csv, sql) of  dictionary \n    \n* [requirement]\n\n    1- libqutrub\n    \n    2- pyarabic \n\n\n\n\n\nData Files:\n=============\nThis files are used to create ayaspell dictionary for spellchecking\narramooz\\verbs\\data\n\n\nFile|Description\n----|-----------\nverb_dic_data-net.csv |     brut data made manually by Mohamed kebdani.\nar_verb_normalized.dict|    A list of arabic verbs, from Qutrub project.\ntriverbtable.py     |       A list of trilateral verbs, used by Qutrub.\nverbs.aya.dic       |       The verb dictionary in abstract format.\n\n\n\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Flinuxscout%2Farramooz","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Flinuxscout%2Farramooz","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Flinuxscout%2Farramooz/lists"}