{"id":24970170,"url":"https://github.com/ff137/bitstamp-btcusd-minute-data","last_synced_at":"2025-08-09T03:49:24.800Z","repository":{"id":275335527,"uuid":"925612778","full_name":"ff137/bitstamp-btcusd-minute-data","owner":"ff137","description":"Daily updates of Bitstamp BTC/USD 1-minute OHLC data, with historical data since 2012","archived":false,"fork":false,"pushed_at":"2025-08-08T00:44:53.000Z","size":91936,"stargazers_count":19,"open_issues_count":1,"forks_count":0,"subscribers_count":1,"default_branch":"main","last_synced_at":"2025-08-08T02:36:16.635Z","etag":null,"topics":["bitcoin","bitstamp","candle","data","ohlcv-data","price-data"],"latest_commit_sha":null,"homepage":"https://ff137.github.io/bitstamp-btcusd-minute-data/","language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":"mczielinski/kaggle-bitcoin","license":"mit","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/ff137.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null,"zenodo":null}},"created_at":"2025-02-01T09:50:12.000Z","updated_at":"2025-08-08T00:44:56.000Z","dependencies_parsed_at":null,"dependency_job_id":"2309b062-00c7-48f7-87fd-bbbea0a97244","html_url":"https://github.com/ff137/bitstamp-btcusd-minute-data","commit_stats":null,"previous_names":["ff137/bitstamp_btcusd_minute_data","ff137/bitstamp-btcusd-minute-data"],"tags_count":0,"template":false,"template_full_name":null,"purl":"pkg:github/ff137/bitstamp-btcusd-minute-data","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/ff137%2Fbitstamp-btcusd-minute-data","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/ff137%2Fbitstamp-btcusd-minute-data/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/ff137%2Fbitstamp-btcusd-minute-data/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/ff137%2Fbitstamp-btcusd-minute-data/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/ff137","download_url":"https://codeload.github.com/ff137/bitstamp-btcusd-minute-data/tar.gz/refs/heads/main","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/ff137%2Fbitstamp-btcusd-minute-data/sbom","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":269527558,"owners_count":24432441,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","status":"online","status_checked_at":"2025-08-09T02:00:10.424Z","response_time":111,"last_error":null,"robots_txt_status":"success","robots_txt_updated_at":"2025-07-24T06:49:26.215Z","robots_txt_url":"https://github.com/robots.txt","online":true,"can_crawl_api":true,"host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["bitcoin","bitstamp","candle","data","ohlcv-data","price-data"],"created_at":"2025-02-03T15:56:30.141Z","updated_at":"2025-08-09T03:49:24.787Z","avatar_url":"https://github.com/ff137.png","language":"Python","funding_links":[],"categories":[],"sub_categories":[],"readme":"# Bitstamp BTC/USD 1-minute OHLC Data\n\n[![Last update](https://img.shields.io/github/last-commit/ff137/bitstamp-btcusd-minute-data/main?path=data%2Fupdates%2Fbtcusd_bitstamp_1min_latest.csv\u0026label=Last%20update)](./data/updates/)\n![GitHub repo size](https://img.shields.io/github/repo-size/ff137/bitstamp-btcusd-minute-data)\n\nThis repository provides historical and up-to-date Bitcoin (BTC/USD) 1-minute OHLC candle data from Bitstamp.\n\n## Bulk Historical Data\n\nThe historical dataset is saved in [data/historical/btcusd_bitstamp_1min_2012-2025.csv.gz](data/historical/btcusd_bitstamp_1min_2012-2025.csv.gz).\n\nSome facts about the data:\n\n- **Date Range:** From 1 January 2012 to 7 January 2025.\n- **Number of Records:** 6,846,600\n- **File Size:** Approximately 90MB zipped, 326MB unzipped.\n- **Data Integrity:** No missing minutes, no duplicates, and no null values.\n\n### Data Preview\n\nBelow is a preview of the first and last two rows of the bulk dataset:\n\n| timestamp  | open     | high     | low      | close    | volume   |\n| ---------- | -------- | -------- | -------- | -------- | -------- |\n| 1325412060 | 4.58     | 4.58     | 4.58     | 4.58     | 0.0      |\n| 1325412120 | 4.58     | 4.58     | 4.58     | 4.58     | 0.0      |\n| ...        | ...      | ...      | ...      | ...      | ...      |\n| 1736207940 | 102280.0 | 102280.0 | 102280.0 | 102280.0 | 0.007554 |\n| 1736208000 | 102278.0 | 102291.0 | 102263.0 | 102263.0 | 0.523107 |\n\n## Daily Updates\n\nA daily GitHub action runs at midnight UTC to fetch the latest data and append it to a separate, daily update file.\n\nThe daily updates (since the bulk data) are saved in [data/updates/btcusd_bitstamp_1min_latest.csv](data/updates/btcusd_bitstamp_1min_latest.csv).\n\n## How Can I Use This Data?\n\nThe simplest way to use the data is to clone the repository:\n\n```bash\ngit clone https://github.com/ff137/bitstamp-btcusd-minute-data\ncd bitstamp-btcusd-minute-data\n```\n\nIf you don't have git, you can also [download the repository as a zip file](https://github.com/ff137/bitstamp-btcusd-minute-data/archive/refs/heads/main.zip).\nOr, just download the individual datasets:\n\n- [data/historical/btcusd_bitstamp_1min_2012-2025.csv.gz](https://github.com/ff137/bitstamp-btcusd-minute-data/blob/main/data/historical/btcusd_bitstamp_1min_2012-2025.csv.gz)\n- [data/updates/btcusd_bitstamp_1min_latest.csv](https://github.com/ff137/bitstamp-btcusd-minute-data/blob/main/data/updates/btcusd_bitstamp_1min_latest.csv)\n\n### Keeping the Data Up-to-Date\n\nSome time passes and you want to fetch the new daily updates. Perform a force pull:\n\n```bash\ngit fetch upstream\ngit reset --hard upstream/main\n```\n\nThis is needed instead of `git pull`, because the daily update file gets overwritten to keep the git history clean.\n\n## Working with the Data in Python\n\nAssuming you have [Python installed](https://www.python.org/downloads/release/python-3129/),\nyou can install Poetry and the project dependencies:\n\n```bash\npython -m venv venv  # Create a new virtual environment\nsource venv/bin/activate  # On Windows use `venv\\Scripts\\activate`\n\npip install poetry  # Install Poetry\npoetry install  # Install the project dependencies\n```\n\nWe have a [sample script](scripts/inspect_data.py) for you to inspect the data integrity\n(validate that there are no missing minutes, no duplicates, no nulls, etc):\n\n```bash\npython -m scripts.inspect_data merged\n```\n\nReplace `merged` with `bulk` or `updated` to inspect the individual bulk or daily datasets.\n\n### Python Template for Loading the Data\n\nIf you need a basic template for just loading the data into a single DataFrame:\n\n```python\nimport pandas as pd\n\n# Load historical and recent data\nDATA_DIR = 'data'\ndf_hist = pd.read_csv(\n    f'{DATA_DIR}/historical/btcusd_bitstamp_1min_2012-2025.csv.gz',\n    compression='gzip'\n)\ndf_recent = pd.read_csv(\n    f'{DATA_DIR}/updates/btcusd_bitstamp_1min_latest.csv'\n)\n\n# Combine the datasets\ndf = pd.concat([df_hist, df_recent], ignore_index=True)\ndf.info()\n```\n\n## Want to Know More About this Repo?\n\n\u003e Forked from [mczielinski/kaggle-bitcoin](https://github.com/mczielinski/kaggle-bitcoin) and fixed some issues.\n\nSee [scripts/README.md](scripts/README.md) for more information on how this data was onboarded.\n\nGo to [scripts/update_data.py](scripts/update_data.py) and\n[.github/workflows/update-automation.yml](.github/workflows/update-automation.yml)\nif you are curious about how the data is processed and kept up-to-date.\n\n## Support\n\nIf you need any help or have any questions, please feel free to open an issue or contact me directly.\n\nWe hope this repo makes your life easier! If it does, please give us a star! ⭐\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fff137%2Fbitstamp-btcusd-minute-data","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fff137%2Fbitstamp-btcusd-minute-data","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fff137%2Fbitstamp-btcusd-minute-data/lists"}