{"id":28854656,"url":"https://github.com/hyperplasma/olympic-visualization-analysis","last_synced_at":"2026-05-04T12:33:40.001Z","repository":{"id":296131973,"uuid":"992376560","full_name":"hyperplasma/olympic-visualization-analysis","owner":"hyperplasma","description":"Multidimensional analysis and visualization of Olympic medals, economy, and happiness index.","archived":false,"fork":false,"pushed_at":"2025-06-10T09:23:01.000Z","size":20327,"stargazers_count":0,"open_issues_count":0,"forks_count":0,"subscribers_count":0,"default_branch":"main","last_synced_at":"2025-06-19T22:51:30.291Z","etag":null,"topics":["data-analysis","data-visualization","matplotlib","numpy","pandas","python","wordcloud"],"latest_commit_sha":null,"homepage":"https://www.hyperplasma.top","language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"apache-2.0","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/hyperplasma.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null,"zenodo":null}},"created_at":"2025-05-29T04:01:55.000Z","updated_at":"2025-06-15T08:11:47.000Z","dependencies_parsed_at":"2025-05-29T05:36:52.937Z","dependency_job_id":null,"html_url":"https://github.com/hyperplasma/olympic-visualization-analysis","commit_stats":null,"previous_names":["hyperplasma/olympic-visualization-analysis"],"tags_count":0,"template":false,"template_full_name":null,"purl":"pkg:github/hyperplasma/olympic-visualization-analysis","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/hyperplasma%2Folympic-visualization-analysis","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/hyperplasma%2Folympic-visualization-analysis/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/hyperplasma%2Folympic-visualization-analysis/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/hyperplasma%2Folympic-visualization-analysis/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/hyperplasma","download_url":"https://codeload.github.com/hyperplasma/olympic-visualization-analysis/tar.gz/refs/heads/main","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/hyperplasma%2Folympic-visualization-analysis/sbom","scorecard":null,"host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":286080680,"owners_count":32607583,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2026-05-04T10:08:07.713Z","status":"ssl_error","status_checked_at":"2026-05-04T10:08:02.005Z","response_time":58,"last_error":"SSL_read: unexpected eof while reading","robots_txt_status":"success","robots_txt_updated_at":"2025-07-24T06:49:26.215Z","robots_txt_url":"https://github.com/robots.txt","online":false,"can_crawl_api":true,"host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["data-analysis","data-visualization","matplotlib","numpy","pandas","python","wordcloud"],"created_at":"2025-06-19T22:39:32.186Z","updated_at":"2026-05-04T12:33:39.995Z","avatar_url":"https://github.com/hyperplasma.png","language":"Python","funding_links":[],"categories":[],"sub_categories":[],"readme":"# Olympic Visualization Analysis\n\n[Hyperplasma](https://www.hyperplasma.top) [《奥运体育健康经济大数据可视化分析》——数据采集与可视化项目](https://www.hyperplasma.top/?p=12872)\n\n本项目通过数据清洗、分析与可视化，系统性地探索了奥运奖牌、各国GDP、人口、幸福指数等多维度数据之间的关系。项目包含多种统计图表，支持对奥运奖牌分布、经济与社会指标的多角度分析。\n\n## 主要方法\n\n- **数据清洗**：对奥运奖牌、GDP与人口、幸福指数等原始数据进行标准化、缺失值处理和格式统一。\n- **奖牌统计**：按国家/地区统计奖牌总数，并对团体项目去重，确保奖牌数准确。\n- **数据合并**：将奖牌数据与经济、人口、幸福指数等数据集按国家/地区合并。\n- **多样可视化**：实现了柱状图、堆积柱状图、散点图、气泡图、箱线图、直方图、热力图、饼图、折线图、词云等10余种统计图表，直观展示各类关系。\n\n## 使用的数据集\n\n- **奥运奖牌数据**：`data/Olympics-Dataset-master/clean-data/results.csv` 包含历届奥运会各项目奖牌获得者信息。\n- **NOC与地区映射**：`data/Olympics-Dataset-master/clean-data/noc_regions.csv` 用于将NOC代码映射为国家/地区名。\n- **全球GDP与人口数据**：data/ `2020-All Country_data_GDP, Population, Electricity-consumption and many more.csv `包含各国GDP（购买力平价）与人口等经济指标。\n- **世界幸福指数数据**：`data/Exploring-World-Happiness-main/Resources/2020.csv` 包含各国幸福指数、GDP等社会指标。\n- **健身问卷数据**：`data/fitness analysis.csv` 用于生成健身动机词云。\n- 其他可选数据集\n\n## 主要可视化图表\n\n- 奖牌总数柱状图\n- 奖牌类型堆积柱状图\n- 奖牌数与GDP散点图\n- 奖牌数与人口气泡图\n- 中国历年奖牌与GDP趋势图\n- 相关性热力图\n- 各国GDP直方图与箱线图\n- 各国奖牌数箱线图\n- 幸福度前/后20国家柱状图、幸福度直方图\n- 中国奖牌类型饼图\n- 幸福度与GDP散点图\n- 健身动机词云\n\n## 快速开始\n\n1. 克隆仓库并准备好上述数据集（保持目录结构一致）。\n2. 根据 `requirements.txt`安装依赖。\n3. 运行 `python main.py`，所有图表将输出至 `output/figures/` 目录。\n\n## 依赖库\n\n项目使用的主要Python库包括：\n\n* **matplotlib** (3.7.2): 用于创建静态、动画和交互式可视化\n* **numpy** (1.26.4): 提供数值计算支持\n* **pandas** (2.0.3): 用于数据处理和分析\n* **seaborn** (0.12.2): 基于matplotlib的统计数据可视化\n* **wordcloud** (1.9.4): 用于生成词云图\n\n可通过以下命令安装所需依赖：\n\n```bash\npip install -r requirements.txt\n```\n\n## 致谢\n\n- [Kaggle Olympics Dataset](https://www.kaggle.com/datasets/the-guardian/olympic-games)\n- [World Happiness Report](https://worldhappiness.report/)\n- 其他公开数据集\n- [Hyperplasma](www.hyperplasa.top)\n\n## LICENSE\n\n[LICENSE](LICENSE)\n\n---\n\n如有问题欢迎提issue或PR！\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fhyperplasma%2Folympic-visualization-analysis","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fhyperplasma%2Folympic-visualization-analysis","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fhyperplasma%2Folympic-visualization-analysis/lists"}