{"id":15983133,"url":"https://github.com/zuevmaxim/itmo-ibd","last_synced_at":"2026-04-24T16:34:29.878Z","repository":{"id":37688885,"uuid":"489595946","full_name":"zuevmaxim/itmo-ibd","owner":"zuevmaxim","description":"A tool for suggesting topics related to the project on Github based on packages used in the project","archived":false,"fork":false,"pushed_at":"2022-06-27T16:39:27.000Z","size":43072,"stargazers_count":0,"open_issues_count":0,"forks_count":2,"subscribers_count":2,"default_branch":"master","last_synced_at":"2025-02-10T05:13:23.728Z","etag":null,"topics":["big-data","python","python-notebook"],"latest_commit_sha":null,"homepage":"","language":"Jupyter Notebook","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"mit","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/zuevmaxim.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null}},"created_at":"2022-05-07T06:57:34.000Z","updated_at":"2022-06-26T18:41:22.000Z","dependencies_parsed_at":"2022-09-15T10:13:07.030Z","dependency_job_id":null,"html_url":"https://github.com/zuevmaxim/itmo-ibd","commit_stats":null,"previous_names":[],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/zuevmaxim%2Fitmo-ibd","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/zuevmaxim%2Fitmo-ibd/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/zuevmaxim%2Fitmo-ibd/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/zuevmaxim%2Fitmo-ibd/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/zuevmaxim","download_url":"https://codeload.github.com/zuevmaxim/itmo-ibd/tar.gz/refs/heads/master","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":247247665,"owners_count":20908011,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["big-data","python","python-notebook"],"created_at":"2024-10-08T01:40:47.906Z","updated_at":"2026-04-24T16:34:24.857Z","avatar_url":"https://github.com/zuevmaxim.png","language":"Jupyter Notebook","funding_links":[],"categories":[],"sub_categories":[],"readme":"# Github Topic Suggester\n[![License](https://img.shields.io/badge/License-MIT%202.0-blue.svg)](https://github.com/zuevmaxim/itmo-ibd/blob/master/LICENSE)\n\nA tool for suggesting topics related to the project on Github based on packages used in the project.\n\nThe tool uses [Lupa](https://github.com/JetBrains-Research/Lupa) analyzer for extract information about the packages used in the project and \nsupports only Python and Kotlin project for now.\n\n# Demo\nThis is a demo of the Github Topic Suggester.\n\nThe user enters the owner and name of the repository on Github and clicks \"Suggest\". \nAfter a few minutes of waiting, he gets the recommended topics for his project!\n\nTake a look on it!\n\n[//]: # (Insert demo video)\n\nYou can run the demo yourself using the instructions [here](https://github.com/zuevmaxim/itmo-ibd/tree/master/app).\n\n# Pipeline suggesting topics\nThe pipeline for processing a new project and suggesting topics for it is as follows.\n 1. Clone repository from Github\n 2. Apply [Lupa](https://github.com/JetBrains-Research/Lupa) analyser\nfor extracting package imports from the project\n 3. Made some processing\n 4. Predict relative topics\n 5. Save suggested topics to file\n\nYou can find more information about the pipeline [here](https://github.com/zuevmaxim/itmo-ibd/tree/master/pipeline).\n\n# Used technologies\n * Docker - runs [Lupa](https://github.com/JetBrains-Research/Lupa) and pipeline\n * Spark - data processing and pipeline processing\n * XGBoost - builds topic predictor\n * Flask - builds demonstration app \n * Celery - runs the docker container with pipeline on a separate worker\n\n# Team\n * Dmitry Pogrebnoy\n * Maria Tigina\n * Maxim Zuev\n * Ksenia Razheva","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fzuevmaxim%2Fitmo-ibd","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fzuevmaxim%2Fitmo-ibd","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fzuevmaxim%2Fitmo-ibd/lists"}