{"id":28151793,"url":"https://github.com/0xdeval/suggestprediction","last_synced_at":"2025-05-15T04:15:08.148Z","repository":{"id":113189663,"uuid":"180157135","full_name":"0xdeval/SuggestPrediction","owner":"0xdeval","description":null,"archived":false,"fork":false,"pushed_at":"2019-05-29T08:12:44.000Z","size":102,"stargazers_count":0,"open_issues_count":0,"forks_count":0,"subscribers_count":0,"default_branch":"master","last_synced_at":"2025-04-06T07:40:41.724Z","etag":null,"topics":[],"latest_commit_sha":null,"homepage":null,"language":"Jupyter Notebook","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/0xdeval.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2019-04-08T13:38:39.000Z","updated_at":"2019-10-23T16:35:10.000Z","dependencies_parsed_at":null,"dependency_job_id":"30511b7f-500d-4cb3-bcd7-d0161506d526","html_url":"https://github.com/0xdeval/SuggestPrediction","commit_stats":null,"previous_names":["0xdeval/suggestprediction"],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/0xdeval%2FSuggestPrediction","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/0xdeval%2FSuggestPrediction/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/0xdeval%2FSuggestPrediction/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/0xdeval%2FSuggestPrediction/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/0xdeval","download_url":"https://codeload.github.com/0xdeval/SuggestPrediction/tar.gz/refs/heads/master","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":254270659,"owners_count":22042863,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":[],"created_at":"2025-05-15T04:14:17.651Z","updated_at":"2025-05-15T04:15:08.136Z","avatar_url":"https://github.com/0xdeval.png","language":"Jupyter Notebook","funding_links":[],"categories":[],"sub_categories":[],"readme":"# SuggestPrediction\n\nCurrent repository consist of 2 main files:\u003cbr\u003e\n1. \u003cb\u003e [StatisticAlgorithm](https://github.com/maikReal/SuggestPrediction/blob/master/StatisticAlgorithm.ipynb) \u003c/b\u003e \u003cbr\u003e\n2. \u003cb\u003e [Category-Suggestion Algorithm](https://github.com/maikReal/SuggestPrediction/blob/master/Category-SuggestionAlgorithm.ipynb) \u003c/b\u003e \u003cbr\u003e\n3. \u003cb\u003e [SuggestPredictionVersion2](https://github.com/maikReal/SuggestPrediction/blob/master/SuggestPredictionVersion2.ipynb) \u003c/b\u003e \u003cbr\u003e\nMoreover, there are two main \u003cb\u003e.csv\u003c/b\u003e files, which are the examples of data:\n1. \u003cb\u003e[category_data.csv](https://github.com/maikReal/SuggestPrediction/blob/master/caregory_data.csv)\u003c/b\u003e (for category prediction) \u003cbr\u003e\n2. \u003cb\u003e[suggest_data.csv](https://github.com/maikReal/SuggestPrediction/blob/master/suggest_data.csv)\u003c/b\u003e (for statistic prediction) \u003cbr\u003e\n\n## Statistic Algorithm\n\nThis algorithm use statistic data for prediction. The principle of work is next:\n1. We send get request to old version of suggest for recieving the suggest based on text score of \u003cb\u003eElasticsearch\u003c/b\u003e \u003cbr\u003e\n2. We use every suggest, which we recieved earlier, for finding the most relevant goods, using the statistic \u003cbr\u003e\n3. We recieve the TOP-n goods and return the TOP-n SKU of goods \u003cbr\u003e\n\n## Category-Suggestion Algorithm\n\nThis algorithm has two part: prediction of category and prediction of TOP-n SKU of goods in predicted category. The main plot of algorithm is next: \u003cbr\u003e\n1. We use `category_data.csv` for predicting the category of new good (one of 49 unique category) \u003cbr\u003e\n2. Ranking goods in predicted category \u003cbr\u003e\n3. Recieve TOP-n SKU of goods in predicted category\n\n## SuggestPredictionVersion2 Algorithm\n\nThis algorithm has many advantages over algorithms described above. His main feature is context searching. It means, that he can look for different queries by his prefix. So, the algorithm principle of work is next: \u003cbr\u003e\n1. We get every, for example, 3 letter of user query (it's our query) \u003cbr\u003e\n2. We find different queries with using context searching. For example: \u003cbr\u003e\n```\nPrefix - шок\nVariants:\nшоколадный заяц\nкубик шоколадный\nторт в шоколадной глазури\n```\n3. Try to find all correct variants for user prefix \u003cbr\u003e\n4. Return all variants for user query \u003cbr\u003e\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2F0xdeval%2Fsuggestprediction","html_url":"https://awesome.ecosyste.ms/projects/github.com%2F0xdeval%2Fsuggestprediction","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2F0xdeval%2Fsuggestprediction/lists"}