{"id":15675034,"url":"https://github.com/ccoreilly/spacy-catala","last_synced_at":"2025-05-06T23:41:28.744Z","repository":{"id":107871575,"uuid":"245839148","full_name":"ccoreilly/spacy-catala","owner":"ccoreilly","description":"Spacy NLP Model for the Catalan language","archived":false,"fork":false,"pushed_at":"2020-11-21T19:26:48.000Z","size":14,"stargazers_count":16,"open_issues_count":0,"forks_count":0,"subscribers_count":5,"default_branch":"master","last_synced_at":"2025-03-31T04:04:56.927Z","etag":null,"topics":["catalan","catalan-language","nlp","nlp-model","nlu","nlu-model","spacy"],"latest_commit_sha":null,"homepage":null,"language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"agpl-3.0","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/ccoreilly.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2020-03-08T15:28:27.000Z","updated_at":"2024-09-12T19:37:21.000Z","dependencies_parsed_at":null,"dependency_job_id":"462faec9-0418-46ea-a6a2-88b1b63b8ea4","html_url":"https://github.com/ccoreilly/spacy-catala","commit_stats":{"total_commits":4,"total_committers":2,"mean_commits":2.0,"dds":0.25,"last_synced_commit":"1c68093e170b0fff74396adc5cbe15086564f855"},"previous_names":[],"tags_count":6,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/ccoreilly%2Fspacy-catala","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/ccoreilly%2Fspacy-catala/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/ccoreilly%2Fspacy-catala/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/ccoreilly%2Fspacy-catala/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/ccoreilly","download_url":"https://codeload.github.com/ccoreilly/spacy-catala/tar.gz/refs/heads/master","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":252788376,"owners_count":21804280,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["catalan","catalan-language","nlp","nlp-model","nlu","nlu-model","spacy"],"created_at":"2024-10-03T15:55:06.712Z","updated_at":"2025-05-06T23:41:28.704Z","avatar_url":"https://github.com/ccoreilly.png","language":"Python","funding_links":[],"categories":[],"sub_categories":[],"readme":"# [CA] Model pel processament del llenguatge natural en Català per a spaCy\n\nModel per a [spaCy](https://spacy.io) de la llengua catalana generat a partir de:\n\n- Vectors de paraules de [fastText](https://github.com/facebookresearch/fastText/blob/master/docs/crawl-vectors.md)\n- Gramàtica, morfologia i sintaxi fent servir dades del corpus d'[AnCora](https://github.com/UniversalDependencies/UD_Catalan-AnCora)\n- Annotacions per a l'extracció d'entitats derivades de la wikipedia ([Cross-lingual Name Tagging and Linking for 282 Languages](http://nlp.cs.rpi.edu/paper/282elisa2017.pdf))\n\nEls models es poden descarregar a la secció Publicacions (Releases).\n\n## Instal·lació i ús\n\nPodeu escollir entre dos models. El model gran és més precís però com que spaCy carrega tot el model a memòria assegureu-vos de tenir-ne suficient.\n\n| Dada | Mitjà | Gran |\n|---|---|---|\n| Nom | `ca_fasttext_wiki_md` | `ca_fasttext_wiki_lg` |\n| Versió | `1.0.0` | `1.0.0` |\n| spaCy | `\u003e=2.3.2`,`\u003c2.4.0`| `\u003e=2.3.2`,`\u003c2.4.0`|\n| Mida | 62 MB| 1,16 GB |\n| Pipeline | `tagger`, `parser`, `ner` | `tagger`, `parser`, `ner` |\n| Vectors | 20.000 | 2.000.000 |\n| Llicència | `AGPL-3.0` |`AGPL-3.0` |\n| Autor | Ciaran O'Reilly |Ciaran O'Reilly |\n\nPodeu instal·lar el model i fer-lo servir amb spaCy executant les següents ordres a l'interfície de línia d'ordres:\n\n```sh\n# Per instal·lar el model mitjà\n\u003e pip install https://github.com/ccoreilly/spacy-catala/releases/download/ca_fasttext_wiki_md-1.0.0/ca_fasttext_wiki_md-1.0.0-py3-none-any.whl\n\u003e python -m spacy link ca_fasttext_wiki_md ca\n\n# Per instal·lar el model gran\n\u003e pip install https://github.com/ccoreilly/spacy-catala/releases/download/ca_fasttext_wiki_lg-1.0.0/ca_fasttext_wiki_lg-1.0.0-py3-none-any.whl\n\u003e python -m spacy link ca_fasttext_wiki_lg ca\n```\n\n# [EN] spaCy NLP Model for the Catalan language\n\nspaCy NLP model for the Catalan language generated from:\n\n- [fastText](https://github.com/facebookresearch/fastText/blob/master/docs/crawl-vectors.md) word vectors\n- The [AnCora](https://github.com/UniversalDependencies/UD_Catalan-AnCora) corpus for parts of speech, morphological features, and syntactic dependencies.\n- Wikipedia annotations for named entity extraction ([Cross-lingual Name Tagging and Linking for 282 Languages](http://nlp.cs.rpi.edu/paper/282elisa2017.pdf))\n\nModels can be found in the releases section of the repository.\n## Installing and using the model\n\nYou can choose between two models. The larger one is more accurate but make sure to have enough memory as spaCy will load the whole model into it.\n\n| Dada | Medium | Large |\n|---|---|---|\n| Name | `ca_fasttext_wiki_md` | `ca_fasttext_wiki_lg` |\n| Version | `1.0.0` | `1.0.0` |\n| spaCy | `\u003e=2.3.2`,`\u003c2.4.0`| `\u003e=2.3.2`,`\u003c2.4.0`|\n| Size | 62 MB| 1,16 GB |\n| Pipeline | `tagger`, `parser`, `ner` | `tagger`, `parser`, `ner` |\n| Vectors | 20.000 | 2.000.000 |\n| License | `AGPL-3.0` |`AGPL-3.0` |\n| Author | Ciaran O'Reilly |Ciaran O'Reilly |\n\n```sh\n# To install the medium sized model\n\u003e pip install https://github.com/ccoreilly/spacy-catala/releases/download/ca_fasttext_wiki_md-1.0.0/ca_fasttext_wiki_md-1.0.0-py3-none-any.whl\n\u003e python -m spacy link ca_fasttext_wiki_md ca\n\n# To install the larger model\n\u003e pip install https://github.com/ccoreilly/spacy-catala/releases/download/ca_fasttext_wiki_lg-1.0.0/ca_fasttext_wiki_lg-1.0.0-py3-none-any.whl\n\u003e python -m spacy link ca_fasttext_wiki_lg ca\n```","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fccoreilly%2Fspacy-catala","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fccoreilly%2Fspacy-catala","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fccoreilly%2Fspacy-catala/lists"}