{"id":18925925,"url":"https://github.com/farfarfun/fundata","last_synced_at":"2026-02-17T23:33:27.235Z","repository":{"id":40658320,"uuid":"188334362","full_name":"farfarfun/fundata","owner":"farfarfun","description":"数据处理工具包 - 提供数据清洗、转换和分析功能","archived":false,"fork":false,"pushed_at":"2024-12-02T07:13:05.000Z","size":22,"stargazers_count":3,"open_issues_count":0,"forks_count":0,"subscribers_count":0,"default_branch":"master","last_synced_at":"2025-10-26T21:33:21.483Z","etag":null,"topics":["data-analysis","data-processing","farfarfun","numpy","pandas","python"],"latest_commit_sha":null,"homepage":"https://pypi.org/project/fundata/","language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/farfarfun.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2019-05-24T01:53:23.000Z","updated_at":"2025-09-09T13:44:16.000Z","dependencies_parsed_at":"2024-12-31T18:26:40.878Z","dependency_job_id":"058c1544-8fe2-4c1f-9f1b-d452acea6f6f","html_url":"https://github.com/farfarfun/fundata","commit_stats":null,"previous_names":["farfarfun/notedata","darkchats/notedata","farfarfun/fundata"],"tags_count":1,"template":false,"template_full_name":null,"purl":"pkg:github/farfarfun/fundata","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/farfarfun%2Ffundata","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/farfarfun%2Ffundata/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/farfarfun%2Ffundata/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/farfarfun%2Ffundata/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/farfarfun","download_url":"https://codeload.github.com/farfarfun/fundata/tar.gz/refs/heads/master","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/farfarfun%2Ffundata/sbom","scorecard":null,"host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":286080680,"owners_count":29562252,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2026-02-17T21:50:49.831Z","status":"ssl_error","status_checked_at":"2026-02-17T21:46:15.313Z","response_time":100,"last_error":"SSL_connect returned=1 errno=0 peeraddr=140.82.121.6:443 state=error: unexpected eof while reading","robots_txt_status":"success","robots_txt_updated_at":"2025-07-24T06:49:26.215Z","robots_txt_url":"https://github.com/robots.txt","online":false,"can_crawl_api":true,"host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["data-analysis","data-processing","farfarfun","numpy","pandas","python"],"created_at":"2024-11-08T11:13:52.464Z","updated_at":"2026-02-17T23:33:22.225Z","avatar_url":"https://github.com/farfarfun.png","language":"Python","funding_links":[],"categories":[],"sub_categories":[],"readme":"# notedata\n\n在学习和工作过程中，经常用到一些比较通用的数据集，很多是国外的数据集，下载很慢。\n这里整理一些数据，并转存到蓝奏云，如果后续有免费的公有云，也可以再迁移到其他云盘。\n\n\n\n|序号|分类|名称|描述|官网下载|蓝奏下载|\n|:-:|:-:|:-:|:-:|:-:|:-:|\n|0|dataset|iris|iris数据集|[官网链接](https://archive.ics.uci.edu/ml/machine-learning-databases/iris/iris.data)|暂无|\n|1|dataset|electronics-reviews|Amazon评论数据|[官网链接](http://snap.stanford.edu/data/amazon/productGraph/categoryFiles/reviews_Electronics_5.json.gz)|[蓝奏链接](https://wws.lanzous.com/b01hkzfuj)|\n|2|dataset|electronics-meta|Amazon评论数据|[官网链接](http://snap.stanford.edu/data/amazon/productGraph/categoryFiles/meta_Electronics.json.gz)|[蓝奏链接](https://wws.lanzous.com/b01hqeora)|\n|3|dataset|movielens-100k|包含用户对电影的评级数据、电影元数据信息和用户属性信息|[官网链接](http://files.grouplens.org/datasets/movielens/ml-100k.zip)|[蓝奏链接](https://wws.lanzous.com/iyykCfbi64j)|\n|4|dataset|movielens-1m|包含用户对电影的评级数据、电影元数据信息和用户属性信息|[官网链接](http://files.grouplens.org/datasets/movielens/ml-1m.zip)|[蓝奏链接](https://wws.lanzous.com/ihoSUfbi65a)|\n|5|dataset|movielens-10m|包含用户对电影的评级数据、电影元数据信息和用户属性信息|[官网链接](http://files.grouplens.org/datasets/movielens/ml-10m.zip)|[蓝奏链接](https://wws.lanzous.com/iXvEmfbi6di)|\n|6|dataset|movielens-20m|包含用户对电影的评级数据、电影元数据信息和用户属性信息|[官网链接](http://files.grouplens.org/datasets/movielens/ml-20m.zip)|[蓝奏链接](https://wws.lanzous.com/b01hkt17g)|\n|7|dataset|movielens-25m|包含用户对电影的评级数据、电影元数据信息和用户属性信息|[官网链接](http://files.grouplens.org/datasets/movielens/ml-25m.zip)|[蓝奏链接](https://wws.lanzous.com/b01hkt24j)|\n|8|dataset|adult-train||[官网链接](https://raw.githubusercontent.com/1007530194/data/master/recommendation/data/adult.data.txt)|暂无|\n|9|dataset|adult-test||[官网链接](https://raw.githubusercontent.com/1007530194/data/master/recommendation/data/adult.test.txt)|暂无|\n|10|dataset|porto-seguro-train||[官网链接](https://raw.githubusercontent.com/1007530194/data/master/recommendation/data/porto_seguro_train.csv)|暂无|\n|11|dataset|porto-seguro-test||[官网链接](https://raw.githubusercontent.com/1007530194/data/master/recommendation/data/porto_seguro_test.csv)|暂无|\n|12|dataset|bitly-usagov||[官网链接](https://raw.githubusercontent.com/1007530194/data/master/datasets/bitly_usagov/example.txt)|暂无|\n|13|dataset|coco-val2017|大型图像数据集, 用于对象检测、分割、人体关键点检测、语义分割和字幕生成|[官网链接](http://images.cocodataset.org/zips/val2017.zip)|[蓝奏链接](https://wws.lanzous.com/b01hkb8fi)|\n|14|dataset|coco-annotations_trainval2017|大型图像数据集, 用于对象检测、分割、人体关键点检测、语义分割和字幕生成|[官网链接](http://images.cocodataset.org/annotations/annotations_trainval2017.zip)|[蓝奏链接](https://wws.lanzous.com/b01hkb86j)|\n|15|model|yolov3.weight|yolov3模型的权重|暂无|[蓝奏链接](https://wws.lanzous.com/b01hjn3ih)|\n|16|model|yolov3.h5|yolov3模型的权重|暂无|[蓝奏链接](https://wws.lanzous.com/b01hjn3aj)|\n|17|model|yolov4.weight|yolov4模型的权重|暂无|[蓝奏链接](https://wws.lanzous.com/b01hjn3yd)|\n|18|model|yolov4-416.h5|yolov4模型的权重|暂无|[蓝奏链接](https://wws.lanzous.com/b01hl9lej)|\n|19|dataset|criteo-sample|criteo_sample数据集|[官网链接](https://raw.githubusercontent.com/shenweichen/DeepCTR/master/examples/criteo_sample.txt)|[蓝奏链接](https://wws.lanzous.com/ihLhrhejkxi)|\n|20|dataset|criteo-kaggle|criteo-kaggle数据集|[官网链接](https://s3-eu-west-1.amazonaws.com/kaggle-display-advertising-challenge-dataset/dac.tar.gz)|[蓝奏链接](https://wws.lanzous.com/b01hqh97i)|\n\n\n注：蓝奏无法上传大于100MB的数据，将一个数据拆分为多个文件上传，必须用[notedrive](https://github.com/notechats/notedrive) 来下载\n\n\n# 感谢\n感谢蓝奏云  \n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Ffarfarfun%2Ffundata","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Ffarfarfun%2Ffundata","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Ffarfarfun%2Ffundata/lists"}