https://github.com/dalelane/kafka-connect-bikesharing-source
random data generator
https://github.com/dalelane/kafka-connect-bikesharing-source
Last synced: 25 days ago
JSON representation
random data generator
- Host: GitHub
- URL: https://github.com/dalelane/kafka-connect-bikesharing-source
- Owner: dalelane
- Created: 2025-03-20T22:34:19.000Z (3 months ago)
- Default Branch: master
- Last Pushed: 2025-05-02T13:57:48.000Z (about 1 month ago)
- Last Synced: 2025-05-02T15:03:23.780Z (about 1 month ago)
- Language: Java
- Size: 294 KB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# kafka-connect-bikesharing-source
Random data generator, that generates events using the data in the `hour.csv` file from https://www.kaggle.com/datasets/lakshmi25npathi/bike-sharing-dataset
> This dataset contains the hourly and daily count of rental bikes between the years 2011 and 2012 in the Capital bike share system with the corresponding weather and seasonal information.
It uses the timestamps from the CSV file, but ignores the year value, so the emitted values look like they are current.
Because the CSV file contains two years' worth of data, this connector can run for up to two years before it runs out of events to generate.
Emitted events:
- `BIKESHARING.WEATHER` - weather "forecasts" taken from the CSV file - emitted at the start of the hour, as if they are a forecast for the next hour
- `BIKESHARING.LOCATION` - current location of bikes that are currently on a journey (emitted at the start of a journey, periodically during a journey, and at the end of a journey)