{"id":17718547,"url":"https://github.com/linuxscout/mysam-tagmanager","last_synced_at":"2025-09-08T08:35:51.666Z","repository":{"id":138755315,"uuid":"131870569","full_name":"linuxscout/mysam-tagmanager","owner":"linuxscout","description":"Mysam: Arabic tags manager, ميسم: إدارة الوسوم العربية","archived":false,"fork":false,"pushed_at":"2025-07-21T00:30:39.000Z","size":173,"stargazers_count":16,"open_issues_count":3,"forks_count":6,"subscribers_count":5,"default_branch":"master","last_synced_at":"2025-08-25T23:14:44.860Z","etag":null,"topics":["arabic-nlp","tagging"],"latest_commit_sha":null,"homepage":null,"language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"gpl-3.0","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/linuxscout.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2018-05-02T15:28:26.000Z","updated_at":"2025-07-21T00:30:43.000Z","dependencies_parsed_at":null,"dependency_job_id":"21c72734-b7be-4bc2-9150-8ee4905c7064","html_url":"https://github.com/linuxscout/mysam-tagmanager","commit_stats":null,"previous_names":[],"tags_count":2,"template":false,"template_full_name":null,"purl":"pkg:github/linuxscout/mysam-tagmanager","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/linuxscout%2Fmysam-tagmanager","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/linuxscout%2Fmysam-tagmanager/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/linuxscout%2Fmysam-tagmanager/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/linuxscout%2Fmysam-tagmanager/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/linuxscout","download_url":"https://codeload.github.com/linuxscout/mysam-tagmanager/tar.gz/refs/heads/master","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/linuxscout%2Fmysam-tagmanager/sbom","scorecard":null,"host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":274158569,"owners_count":25232628,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","status":"online","status_checked_at":"2025-09-08T02:00:09.813Z","response_time":121,"last_error":null,"robots_txt_status":"success","robots_txt_updated_at":"2025-07-24T06:49:26.215Z","robots_txt_url":"https://github.com/robots.txt","online":true,"can_crawl_api":true,"host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["arabic-nlp","tagging"],"created_at":"2024-10-25T14:54:44.385Z","updated_at":"2025-09-08T08:35:51.214Z","avatar_url":"https://github.com/linuxscout.png","language":"Python","funding_links":[],"categories":[],"sub_categories":[],"readme":"# Mysam: Arabic tags manager, ميسم: إدارة الوسوم  العربية\n\nتسيير وسوم الكلمات العربية في مجال المعالجة الآلية للغة،  ترميز وتفكيك\nهذه المكتبة توفر سكريبت خاصا بترميز وسوم الكلمات (الخصائص الصرفية والنحوية والدلالية) في عبارة وسم مختصرة على شكل سلسلة حروف قصيرة مرمّزة نسميها سلسلة الوسوم.\nيمكن التحويل بين قائمة الوسوم وسلسة الوسوم المختص\nيمكن الاستفادة من هذه المكتبة من أجل ترميز الوسوم وفك ترميزها، سنستعملها في :\n\n *  التحليل الصرفي ([مكتبة قلصادي](https://github.com/linuxscout/qalsadi) )\n * التحليل النحوي ( [مكتبة ثعلب](https://github.com/linuxscout/thaalab-aranasyn) )\n * التشكيل ( برنامج مشكال[برنامج مشكال](https://github.com/linuxscout/mishkal) [ برنامج مشكال](http://tahadz.com/mishkal) )\n * التدقيق اللغوي النحوي (LanguageTool).\n\nكما تقدّم خدمة متميزة في  **الإعراب بالطريقة القديمة **\n\n![TAG STRINGS](doc/images/mysam_sample.png  \"Example\")\n\n* قائمة الوسوم :\n    Noun, جامد, مضاف, مجرور, متحرك, ينون\n* سلسلة الوسوم المختصرة\n    [N--;--I-;---H;----]\n* جملة الإعراب\n{اسم مجرور وهو مضاف، والضمير المتصل مبني في محل جر مضاف إليه}\n\n** هذه ليست مكتبة للتوسيم، بل لإدارة الوسوم في معالجة اللغة**\n\n\nManage arabic words tags, encode, decode\nThis library provides a script to encode POS tags (Words features : morphology, syntax, semantic), as a brief tag string  called tag string.\nWe can convert between tag list \u003c==\u003e coded tag string.\nWe plan to use it in:\n\n * Morphology analysis  ([Qalsadi library](https://github.com/linuxscout/qalsadi) )\n *  Syntactic analysis  ( [Thaalab Library](https://github.com/linuxscout/thaalab-aranasyn) )\n * Tashkeel ( [Mishkal](https://github.com/linuxscout/mishkal) [Mishkal-site](http://tahadz.com/mishkal) )\n * LanguageTool - Style and Grammar Checker [LanguageTool](https://languagetool.org/).\n\nIt provides a special feature, make **traditional Inflection**.\n\nThe conversion can be do like:\n\n * Tags list:\n    Noun, جامد, مضاف, مجرور, متحرك, ينون\n * Encoded tag string\n    [N--;--I-;---H;----]\n * Inflection phrase\n{اسم مجرور وهو مضاف، والضمير المتصل مبني في محل جر مضاف إليه}\n\n** This not a Tagger library, but, a tags mangger for NLP **\n\n\n### Tagging System description\nYou can look at tagging descripton on [doc/tagset.md](doc/tagset.md)\n\n\n  Developpers:  Taha Zerrouki: http://tahadz.com\n    taha dot zerrouki at gmail dot com\n\nFeatures |   value\n------------|-----------\nAuthors  | Taha Zerrouki: http://tahadz.com,  taha dot zerrouki at gmail dot com\nRelease  | 0.4 \nLicense  |[GPL](https://github.com/linuxscout/mysam-tagmanager/master/LICENSE)\nTracker  |[linuxscout/mysam-tagmanager/Issues](https://github.com/linuxscout/mysam-tagmanager/issues)\nWebsite  |[https://pypi.python.org/pypi/mysam-tagmanager](https://pypi.python.org/pypi/mysam-tagmanager)\nSource  |[Github](http://github.com/linuxscout/mysam-tagmanager)\nFeedbacks  |[Comments](https://github.com/linuxscout/mysam-tagmanager/issues)\nAccounts  |[@Twitter](https://twitter.com/linuxscout)  [@Sourceforge](http://sourceforge.net/projects/mysam-tagmanager/)\n\n\u003c!--Doc  |[package Documentaion](http://pythonhosted.org/mysam-tagmanager/)--\u003e\n\u003c!--Download  |[pypi.python.org](https://pypi.python.org/pypi/mysam-tagmanager)--\u003e\n\n## Citation\nIf you would cite it in academic work, can you use this citation\n```\nT. Zerrouki‏, mysam-tagmanager,  Arabic Word Tagger,\n  https://pypi.python.org/pypi/mysam-tagmanager/, 2018\n```\nor in bibtex format\n\n```bibtex\n@misc{zerrouki2018mysam,\n  title={mysam-tagmanager : Arabic Word Tagger},\n  author={Zerrouki, Taha},\n  url={https://pypi.python.org/pypi/mysam-tagmanager,\n  year={2018}\n}\n```\n\n\n## مزايا\n* ترميز المزايا إلى وسم موحد مختصر\n* تفكيك الوسم إلى خصائصه\n* توليد الإعراب حسب الطريق التقليدية\n\n## Features\n* Encode features to an unified tag string\n* Encode unified tag string to a list of features\n* Generate a traditional inflection style\n\n## Applications\n* Text summarizing.\n* Sentences identification.\n* Grammar analysis.\n* Morphological analysis.\n\n## تطبيقات \n* التنقيب عن المعلومات.\n* التعرف على الجمل.\n* التحليل النحوي.\n* تسريع التحليل الصرفي.\n\n\n\n## Demo جرّب\n\nيمكن التجربة على [موقع مشكال](http://tahadz.com/mishkal)\n، اختر تشكيل، ثم مرّر الفأرة على الكلمة لرؤية التلميح\n\nYou can test it on [Mishkal Site](http://tahadz.com/mishkal), choose: Tashkeel, and move mouse over word to get hint.\n\n![mysam-tagmanager Demo](doc/images/mysam_demo.png)\n\nInstallation\n=====\n```python\npip install mysam-tagmanager\n```\n## Usage\n\n```python\nimport mysam.tagmaker as tagmaker\n```\n## Example\n\n### Test load configuration\n\n```python\nimport pandas as pd\n\nimport mysam.tagconfig as tagconfig\nimport mysam.tag_const as tag_const\n\nconfiguer = tagconfig.tagConfig()\nconfiguer.load_config()\n# display\ndf = pd.DataFrame(configuer.tagsdict)\nprint('****tagdict ****')\nprint(df)\n*****Result *****\n****tagdict ****\n                 اسم        Noun         فعل        Verb        أداة  ...  1st person       مخاطب  2nd person        غائب  3rd person\npart               1           1           1           1           1  ...           4           4           4           4           4\npos                1           1           1           1           1  ...           4           4           4           4           4\nattr       word_type   word_type   word_type   word_type   word_type  ...      person      person      person      person      person\nar_attr   نوع الكلمة  نوع الكلمة  نوع الكلمة  نوع الكلمة  نوع الكلمة  ...         شخص         شخص         شخص         شخص         شخص\ncode               n           n           V           V           T  ...           I           Y           Y           H           H\nvalue           Noun        Noun        Verb        Verb        Tool  ...  1st person  2nd person  2nd person  3rd person  3rd person\nar_value         اسم         اسم         فعل         فعل        أداة  ...       متكلم       مخاطب       مخاطب        غائب        غائب\ninflect          اسم         اسم         فعل         فعل         حرف  ...                                                            \n\n[8 rows x 95 columns]\n....\n....\n\n```\nYou can load a specific config file by passing parameter to load_conf.\nIf the file doesn't exist or failed to be open, the default config is loaded.\n\n```python\nconfiguer = tagconfig.tagConfig()\nconfiguer.load_config(\"tag.config\")\n\n```\nIf you want to know if the input file is opened, fix 'debug' parameter to 'True'\n\n\nIf you want to know if the input file is open, fix 'debug' parameter to 'True'\n```python\nconfiguer = tagconfig.tagConfig()\nconfiguer.load_config(\"tag.config\", debug=True)\n```\n\n### Test call tagmaker\n\n```python\nimport mysam.tagcoder\n# given tags as list    \ntaglists = [[u'اسم', u'هاء', u'مجرور',],\n        u'تعريف::مرفوع:متحرك:ينون:::'.split(\":\"),\n        ]\ntgcoder = mysam.tagcoder.tagCoder()\nfor taglist in taglists:\n    # encode\n    tagcode = tgcoder.encode(taglist)\n    print(\"tags list:\", u\";\".join(taglist))\n    print(\"tagcode:\", tagcode)\n    # decode a tag code string into (key, values)\n    print(\"decode:\", tgcoder.decode(tagcode))\n\n**** result ****\ntags list: اسم;هاء;مجرور\ntagcode: N--;------I;---\ndecode: [('نوع الكلمة', 'اسم'), ('خاصية', 'لاشيء'), ('جنس', 'لاشيء'), ('عدد', 'لاشيء'), ('شخص', 'لاشيء'), ('علامة', 'لاشيء'), ('زمن', 'لاشيء'), ('بناء', 'لاشيء'), ('إعراب', 'مجرور'), ('عطف', 'لاشيء'), ('استقبال', 'لاشيء'), ('ضمير متصل', 'لاشيء')]\ntags list: تعريف;;مرفوع;متحرك;ينون;;;\ntagcode: ---;------U;--L\ndecode: [('نوع الكلمة', 'لاشيء'), ('خاصية', 'لاشيء'), ('جنس', 'لاشيء'), ('عدد', 'لاشيء'), ('شخص', 'لاشيء'), ('علامة', 'لاشيء'), ('زمن', 'لاشيء'), ('بناء', 'لاشيء'), ('إعراب', 'مرفوع'), ('عطف', 'لاشيء'), ('استقبال', 'لاشيء'), ('تعريف', 'معرفة')]\n    \n```\n\n\n### Exmaple for inflect\n```python \nimport mysam.taginflector\ntag_maker = mysam.taginflector.tagInflector()\nword = \"يَسْتَعْمِلُونَهَا\"\ntagcode = 'V-1;M3H-faU;W-H'\nprint(tag_maker.inflect(tagcode))\n**** result ****\nفعل مضارع   مرفوع وعلامة رفعه ثبوت النون لأنه من الأفعال الخمسة  والواو: ضمير متصل مبني في محل رفع فاعل والضمير المتصل مبني في محل نصب مفعول به\n```\n\n### Exmaple for add tag\n\nAdd a new tag to existing tags string code\n\n```python\nimport mysam.tagcoder\ntag_coder = mysam.tagcoder.tagCoder()\ntagcode = 'V-0;M1H-faU;W--'\ntag = \"ضمير متصل\"\nprint(tag_coder.add_tag(tag, tagcode))\n**** result ****\nV-0;M1H-faU;W-H\n```\n\n### Exmaple for remove tag\n\nRemove and  a tag from existing tags string code\n\n```python\nimport mysam.tagcoder\ntag_coder = mysam.tagcoder.tagCoder()\ntagcode = 'V-0;M1H-faU;W-H'\ntag = \"ضمير متصل\"\nprint(tag_coder.remove_tag(tag, tagcode))\n**** result ****\nV-0;M1H-faU;W--\n```\n\n### \n\n### Exmaple for has tag\n\n```python \nimport mysam.tagcoder\ntag_coder = mysam.tagcoder.tagCoder()\ntags = ['اسم', 'مجرور', 'مذكر', \"مفرد\", \"واو\"]\ntagcode = tag_coder.encode(tags)\nprint(tagcode)\ntag_search = u\"مجرور\"\nprint(tag_coder.has_tag(tag_search, tagcode))\ntag_search = u\"فعل\"\nprint(tag_coder.has_tag(tag_search, tagcode))\n**** result ****\nN--;M1----I;---\nTrue\nFalse\n```\n\n\n\n\n\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Flinuxscout%2Fmysam-tagmanager","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Flinuxscout%2Fmysam-tagmanager","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Flinuxscout%2Fmysam-tagmanager/lists"}