{"id":21198753,"url":"https://github.com/natlibfi/fintoai-data-yso","last_synced_at":"2025-03-14T22:13:42.318Z","repository":{"id":172993226,"uuid":"522526335","full_name":"NatLibFi/FintoAI-data-YSO","owner":"NatLibFi","description":"DVC pipeline for YSO projects of Finto AI","archived":false,"fork":false,"pushed_at":"2024-12-12T07:32:52.000Z","size":2473,"stargazers_count":1,"open_issues_count":0,"forks_count":0,"subscribers_count":4,"default_branch":"main","last_synced_at":"2025-01-21T14:46:15.460Z","etag":null,"topics":["annif","dvc","dvc-pipeline","glam","subject-indexing","text-classification"],"latest_commit_sha":null,"homepage":"https://ai.finto.fi","language":"Jupyter Notebook","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"cc0-1.0","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/NatLibFi.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2022-08-08T11:45:33.000Z","updated_at":"2023-12-04T11:05:21.000Z","dependencies_parsed_at":null,"dependency_job_id":"e429cfdb-f748-496c-a1a6-ff2cf8f313f4","html_url":"https://github.com/NatLibFi/FintoAI-data-YSO","commit_stats":null,"previous_names":["natlibfi/fintoai-data-yso"],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/NatLibFi%2FFintoAI-data-YSO","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/NatLibFi%2FFintoAI-data-YSO/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/NatLibFi%2FFintoAI-data-YSO/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/NatLibFi%2FFint
oAI-data-YSO/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/NatLibFi","download_url":"https://codeload.github.com/NatLibFi/FintoAI-data-YSO/tar.gz/refs/heads/main","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":243652707,"owners_count":20325611,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["annif","dvc","dvc-pipeline","glam","subject-indexing","text-classification"],"created_at":"2024-11-20T19:53:18.901Z","updated_at":"2025-03-14T22:13:42.297Z","avatar_url":"https://github.com/NatLibFi.png","language":"Jupyter Notebook","funding_links":[],"categories":[],"sub_categories":[],"readme":"# FintoAI-data-YSO\n\nConfigurations for maintaining the Annif projects with YSO vocabulary used at the [Finto AI service](https://ai.finto.fi/) and the [analysis notebook](/repository-metrics-analysis/analyse-theseus-tietolinja.ipynb) of Annif suggestions in the [Theseus repository](https://www.theseus.fi/).\n\nThe projects are trained and evaluated using a [DVC (Data Version Control) pipeline](https://dvc.org/doc/start/data-management/data-pipelines) defined in [dvc.yaml](/dvc.yaml).\nThe public training corpora can be found in the [Annif-corpora repository](https://github.com/NatLibFi/Annif-corpora/).\n\nThe pipeline takes care of\n\n1. installing Annif in a venv,\n2. loading the YSO vocabulary,\n3. training the projects,\n4. 
evaluating the projects.\n\nWhen the necessary vocabulary and training corpora are in place, the pipeline can be run with the command\n\n    dvc repro\n\nFor more information about using DVC with Annif projects, see the [DVC exercise of the Annif tutorial](https://github.com/NatLibFi/Annif-tutorial/blob/master/exercises/OPT_dvc.md).\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fnatlibfi%2Ffintoai-data-yso","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fnatlibfi%2Ffintoai-data-yso","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fnatlibfi%2Ffintoai-data-yso/lists"}