{"id":25531953,"url":"https://github.com/generousman/zhihurecapp","last_synced_at":"2026-01-20T13:30:19.271Z","repository":{"id":62404307,"uuid":"560323844","full_name":"GenerousMan/ZhihuRecApp","owner":"GenerousMan","description":"A Flask App For Analyzing ZhihuRec Dataset.","archived":false,"fork":false,"pushed_at":"2022-11-21T17:53:35.000Z","size":12436,"stargazers_count":3,"open_issues_count":0,"forks_count":0,"subscribers_count":2,"default_branch":"master","last_synced_at":"2023-05-23T10:41:26.878Z","etag":null,"topics":[],"latest_commit_sha":null,"homepage":null,"language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/GenerousMan.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null}},"created_at":"2022-11-01T08:26:00.000Z","updated_at":"2022-11-03T07:51:09.000Z","dependencies_parsed_at":"2023-01-22T06:02:38.401Z","dependency_job_id":null,"html_url":"https://github.com/GenerousMan/ZhihuRecApp","commit_stats":null,"previous_names":[],"tags_count":null,"template":null,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/GenerousMan%2FZhihuRecApp","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/GenerousMan%2FZhihuRecApp/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/GenerousMan%2FZhihuRecApp/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/GenerousMan%2FZhihuRecApp/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/GenerousMan","download_url":"https://codeload.github.com/GenerousMan/ZhihuRecApp/tar.gz/refs/heads/master","host":{"name":"GitHub","url":"https://github.com"
,"kind":"github","repositories_count":239763601,"owners_count":19692794,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":[],"created_at":"2025-02-20T01:42:30.113Z","updated_at":"2026-01-20T13:30:19.180Z","avatar_url":"https://github.com/GenerousMan.png","language":"Python","funding_links":[],"categories":[],"sub_categories":[],"readme":"# ZhihuRec Data-mining\nA Flask app for analyzing the ZhihuRec dataset.\n\n## Requirement\n\n``` bash\n pip install -r requirements.txt\n```\n\n## Usage\n- [Dataset] Put the ZhihuRec dataset in the root directory.\n- [Work Path] Set the work path in the root directory.\n- [Preprocess] Run io.py to convert answer_infos.txt into .csv files.\n\n`1.`First, run this command to generate the answers' csv files:\n\n``` bash\n python tools/io.py\n```\n\nOr download them from here:\n```\nBaidu NetDisk \nLink:https://pan.baidu.com/s/1Ey-R9yo6_HNuoZuhEJivjg \nCode: 8rc7\n```\nUnzip the archive and put the folder `answer_csv` into `source/`.\n\n\n`2.`Then use this command to run the Flask app:\n\n``` bash\n python app.py\n```\n\nThe Flask app will run at \"127.0.0.1:5000\".\n\n## Files\n- `[model]` The TF-IDF model is saved here.\n- `[source]` Processed files \n  - `[answer_csv]` Answers' csv files. All files are sorted.\n    - `[xxxx.csv]` Here xxxx is the starting (minimum) answer index in this file. \n- `[tools]` Tools that help you analyze the dataset.\n  - `[io.py]` Used to read/write/convert the dataset.\n  - `[tfidf.py]` TF-IDF algorithm. 
Its main functions are \n    - `train()`\n    - `load_tfidf()`\n    - `save_tfidf()`\n    - `compare_similarity()`.\n- `[zhihuRec]` The dataset. You should put the txt files here.\n- `[app.py]` The entry point of the Flask app.\n- `[preprocess.py]` Uses the code in `tools` to create the TF-IDF matrix and saves the result into `model`.","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fgenerousman%2Fzhihurecapp","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fgenerousman%2Fzhihurecapp","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fgenerousman%2Fzhihurecapp/lists"}