{"id":20620198,"url":"https://github.com/stefen-taime/open-source-data","last_synced_at":"2026-02-19T18:31:58.904Z","repository":{"id":218124444,"uuid":"745670351","full_name":"Stefen-Taime/open-source-data","owner":"Stefen-Taime","description":"This repository contains structured datasets in various categories","archived":false,"fork":false,"pushed_at":"2024-07-15T01:28:06.000Z","size":3666,"stargazers_count":2,"open_issues_count":0,"forks_count":0,"subscribers_count":1,"default_branch":"main","last_synced_at":"2025-10-16T10:36:42.825Z","etag":null,"topics":["csv","data","json","python3","xml"],"latest_commit_sha":null,"homepage":"","language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/Stefen-Taime.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2024-01-19T20:52:16.000Z","updated_at":"2024-07-15T01:28:11.000Z","dependencies_parsed_at":"2024-01-19T22:06:08.047Z","dependency_job_id":"a056c47b-65be-4456-acfc-763f4f4eac57","html_url":"https://github.com/Stefen-Taime/open-source-data","commit_stats":null,"previous_names":["stefen-taime/open-source-data"],"tags_count":0,"template":false,"template_full_name":null,"purl":"pkg:github/Stefen-Taime/open-source-data","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/Stefen-Taime%2Fopen-source-data","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/Stefen-Taime%2Fopen-source-data/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/Stefen-Taime%2Fopen-source-data/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/Stefen-Taime%2Fopen-source-data/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/Stefen-Taime","download_url":"https://codeload.github.com/Stefen-Taime/open-source-data/tar.gz/refs/heads/main","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/Stefen-Taime%2Fopen-source-data/sbom","scorecard":null,"host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":286080680,"owners_count":29627112,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2026-02-19T18:02:07.722Z","status":"ssl_error","status_checked_at":"2026-02-19T18:01:46.144Z","response_time":117,"last_error":"SSL_connect returned=1 errno=0 peeraddr=140.82.121.6:443 state=error: unexpected eof while reading","robots_txt_status":"success","robots_txt_updated_at":"2025-07-24T06:49:26.215Z","robots_txt_url":"https://github.com/robots.txt","online":false,"can_crawl_api":true,"host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["csv","data","json","python3","xml"],"created_at":"2024-11-16T12:13:39.668Z","updated_at":"2026-02-19T18:31:58.871Z","avatar_url":"https://github.com/Stefen-Taime.png","language":"Python","funding_links":[],"categories":[],"sub_categories":[],"readme":"\n# Open Source Data\n\n## Description\n\nThis repository contains structured datasets in various categories such as \"bank\", \"beer\", \"coffee\", \"commerce\", \"company\", \"computer\", \"credit_card\", \"dessert\", \"device\", \"food\", \"keywords\", \"movies\", \"ratings\", \"restaurant\", \"stripe\", \"subscription\", and \"user\". Each category includes data in three different formats: CSV, JSON, and XML, with relevant and updated information as of January 16, 2024. The data is organized to facilitate access and exploitation for various analyses and developments.\n\n## Repository Structure\n\nThe repository is organized as follows:\n\n```\n.\n├── bank\n│   ├── csv\n│   │   ├── csv_bank_20240116_1.csv\n│   │   ├── csv_bank_20240116_2.csv\n│   │   ├── csv_bank_20240116_3.csv\n│   │   ├── csv_bank_20240116_4.csv\n│   │   └── csv_bank_20240116_5.csv\n│   ├── json\n│   │   ├── json_bank_20240116_1.json\n│   │   ├── json_bank_20240116_2.json\n│   │   ├── json_bank_20240116_3.json\n│   │   ├── json_bank_20240116_4.json\n│   │   └── json_bank_20240116_5.json\n│   └── xml\n│       ├── xml_bank_20240116_1.xml\n│       ├── xml_bank_20240116_2.xml\n│       ├── xml_bank_20240116_3.xml\n│       ├── xml_bank_20240116_4.xml\n│       └── xml_bank_20240116_5.xml\n├── bank.py\n├── [Other Categories]\n└── [Corresponding Files]\n```\n\n## Usage\n\nEach category comes with a Python script (e.g., `bank.py`, `beer.py`, etc.) to facilitate interaction with the data. These scripts are designed to import and process data in CSV, JSON, and XML formats. Users can leverage these scripts to develop applications or perform data analysis.\n\n## Reference Key\n\nData across all categories use a common `user_id` as the primary reference key, allowing for coherent integration and comparison across different categories.\n\n## Contribution\n\nContributions to the repository are welcome. Please follow the contribution guidelines to submit your changes or additions.\n\n\n\n## Contact\n\nFor any questions or comments, feel free to contact [Stefen Taime] at [stefentaime@gmail.com].\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fstefen-taime%2Fopen-source-data","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fstefen-taime%2Fopen-source-data","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fstefen-taime%2Fopen-source-data/lists"}