{"id":22158359,"url":"https://github.com/patricktrainer/duckdb-user-analytics","last_synced_at":"2025-08-20T10:07:51.150Z","repository":{"id":264501968,"uuid":"893548469","full_name":"patricktrainer/duckdb-user-analytics","owner":"patricktrainer","description":"POC for serving analytics to inidividual users.","archived":false,"fork":false,"pushed_at":"2024-11-24T18:19:25.000Z","size":11,"stargazers_count":0,"open_issues_count":0,"forks_count":0,"subscribers_count":1,"default_branch":"main","last_synced_at":"2025-07-28T05:33:51.085Z","etag":null,"topics":[],"latest_commit_sha":null,"homepage":null,"language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"mit","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/patricktrainer.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2024-11-24T18:14:22.000Z","updated_at":"2024-11-24T18:19:28.000Z","dependencies_parsed_at":"2024-11-24T23:49:18.646Z","dependency_job_id":null,"html_url":"https://github.com/patricktrainer/duckdb-user-analytics","commit_stats":null,"previous_names":["patricktrainer/duckdb-user-analytics"],"tags_count":0,"template":false,"template_full_name":null,"purl":"pkg:github/patricktrainer/duckdb-user-analytics","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/patricktrainer%2Fduckdb-user-analytics","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/patricktrainer%2Fduckdb-user-analytics/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/patricktrainer%2Fduckdb-user-analytics/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/patricktrainer%2Fduckdb-user-analytics/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/patricktrainer","download_url":"https://codeload.github.com/patricktrainer/duckdb-user-analytics/tar.gz/refs/heads/main","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/patricktrainer%2Fduckdb-user-analytics/sbom","scorecard":null,"host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":271300175,"owners_count":24735513,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","status":"online","status_checked_at":"2025-08-20T02:00:09.606Z","response_time":69,"last_error":null,"robots_txt_status":"success","robots_txt_updated_at":"2025-07-24T06:49:26.215Z","robots_txt_url":"https://github.com/robots.txt","online":true,"can_crawl_api":true,"host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":[],"created_at":"2024-12-02T03:22:59.898Z","updated_at":"2025-08-20T10:07:51.126Z","avatar_url":"https://github.com/patricktrainer.png","language":"Python","funding_links":[],"categories":[],"sub_categories":[],"readme":"# DuckDB User Analytics POC\n\nA simple proof of concept for user analytics using DuckDB, Flask, and Chart.js. The application demonstrates a scalable architecture where each user's data is stored in a separate DuckDB instance.\n\n```mermaid\nflowchart LR\n    Frontend[HTML/JS Frontend] --\u003e |API Calls| Server[Flask Server]\n    Server --\u003e |Query| Registry[Registry DB]\n    Server --\u003e |Query| DB1[User 1 DB]\n    Server --\u003e |Query| DB2[User 2 DB]\n    Server --\u003e |Query| DBN[User N DB...]\n```\n\n## Features\n\n- Separate DuckDB instance for each user's data\n- Central registry database to track user databases\n- Simple Flask API server\n- Basic frontend visualization using Chart.js\n- Sample data generation for testing\n\n## Directory Structure\n\n```sh\n.\n├── app.py              # Flask backend server\n├── index.html          # Frontend application\n├── data/\n│   ├── registry.db     # Central registry database\n│   └── user_dbs/       # Individual user databases\n│       ├── user_1.db\n│       ├── user_2.db\n│       └── ...\n└── README.md\n```\n\n## Prerequisites\n\n- Python 3.10+\n- pip\n\n## Installation\n\n1. Clone the repository:\n\n    ```sh\n    git clone git@github.com:patricktrainer/duckdb-user-analytics.git\n    cd duckdb-user-analytics\n    ```\n\n2. Install dependencies:\n\n    ```sh\n    pip install duckdb flask flask-cors\n    ```\n\n## Usage\n\n1. Start the server:\n\n    ```bash\n    python app.py\n    ```\n\n2. Open your browser and navigate to:\n\n    ```sh\n    http://127.0.0.1:5000\n    ```\n\n## API Endpoints\n\n- `GET /users` - Returns list of available users\n- `GET /metrics/\u003cuser_id\u003e` - Returns daily metrics for specified user\n\n## Architecture\n\n### Backend\n\n- Flask server handles API requests\n- Registry database tracks all user databases\n- Each user has their own DuckDB instance\n- Sample data generation for testing\n\n### Frontend\n\n- Simple HTML/JavaScript interface\n- Chart.js for data visualization\n- Responsive time series chart\n- User selection dropdown\n\n## Sample Data\n\nThe application generates sample data for testing:\n\n- 5 users by default\n- Daily counts from 2024-01-01 to 2024-02-01\n- Random values between 0-100\n\n## Development\n\nTo modify sample data generation, update the `generate_sample_data()` method in `app.py`:\n\n```python\ndef generate_sample_data(self):\n    for user_id in range(1, 6):  # Modify number of users\n        ...\n```\n\n## Performance Considerations\n\n- Each user's data is isolated in its own database file\n- Scales horizontally with number of users\n- Efficient querying for individual user metrics\n- Registry database provides quick user lookup\n\n## Future Improvements\n\n- Add authentication\n- Implement data ingestion API\n- Add more complex analytics\n- Enhance visualization options\n- Add data export functionality\n- Implement user management\n\n## License\n\nMIT\n\n## Contributing\n\n1. Fork the repository\n2. Create your feature branch\n3. Commit your changes\n4. Push to the branch\n5. Create a new Pull Request\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fpatricktrainer%2Fduckdb-user-analytics","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fpatricktrainer%2Fduckdb-user-analytics","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fpatricktrainer%2Fduckdb-user-analytics/lists"}