Projects in Awesome Lists tagged with data-aggregation
A curated list of projects in awesome lists tagged with data-aggregation .
https://github.com/fastverse/collapse
Advanced and Fast Data Transformation in R
cran data-aggregation data-analysis data-manipulation data-processing data-science data-transformation econometrics high-performance panel-data r rstats scientific-computing statistics time-series weighted weights
Last synced: 11 Jan 2026
https://github.com/sebkrantz/collapse
Advanced and Fast Data Transformation in R
cran data-aggregation data-analysis data-manipulation data-processing data-science data-transformation econometrics high-performance panel-data r rstats scientific-computing statistics time-series weighted weights
Last synced: 14 May 2025
https://github.com/SebKrantz/collapse
Advanced and Fast Data Transformation in R
cran data-aggregation data-analysis data-manipulation data-processing data-science data-transformation econometrics high-performance panel-data r rstats scientific-computing statistics time-series weighted weights
Last synced: 26 Apr 2025
https://github.com/djdembeck/audnexus.bundle
An Audnexus client proof of concept for Plex, providing rich author and audiobook data. Developed in Python, offering enhanced user experiences via Plex's legacy plugin agent system.
api audiobook audiobook-data audiobooks audnexus client data-aggregation plex plex-agent plex-media-server proof-of-concept python user-experience
Last synced: 04 Apr 2025
https://github.com/djdembeck/Audnexus.bundle
An Audnexus client proof of concept for Plex, providing rich author and audiobook data. Developed in Python, offering enhanced user experiences via Plex's legacy plugin agent system.
api audiobook audiobook-data audiobooks audnexus client data-aggregation plex plex-agent plex-media-server proof-of-concept python user-experience
Last synced: 15 Apr 2025
https://github.com/fastverse/fastverse
An Extensible Suite of High-Performance and Low-Dependency Packages for Statistical Computing and Data Manipulation in R
c cpp data-aggregation data-manipulation data-science data-transformation high-performance low-dependency panel-data r rstats statistical-computing time-series weights
Last synced: 12 Dec 2025
https://github.com/lvyahui8/spring-boot-data-aggregator
基于注解实现并行地依赖注入(数据聚合),可以看做 Spring Async 注解的升级版
bean-mapping concurrency data-aggregation parallel-processing spring-boot
Last synced: 06 Sep 2025
https://github.com/synacker/daggy
Daggy - Data Aggregation Utility and C/C++ developer library for data streams catching
aggregation cross-platform-app cross-platform-development data-aggregation extensible monitoring process qt serverless-framework ssh-client ssh2 stream-processing streaming
Last synced: 04 Apr 2025
https://github.com/laxamentumtech/audnexus
An audiobook data aggregation API that harmonizes data from multiple sources into a unified stream. It offers a consistent and user-friendly source of audiobook data for various applications.
api audiobooks audnexus data-aggregation docker fastify metadata mongodb nodejs papr redis typescript
Last synced: 05 Apr 2025
https://github.com/the-osint-toolbox/people-search-osint
Search tools to help you find people, focused towards UK resources.
data data-aggregation osint people privacy search searching
Last synced: 03 Oct 2025
https://github.com/pinkpixel-dev/deep-research-mcp
A Model Context Protocol (MCP) compliant server designed for comprehensive web research. It uses Tavily's Search and Crawl APIs to gather detailed information on a given topic, then structures this data in a format perfect for LLMs to create high-quality markdown documents.
ai-tools data-aggregation deep-research documentation-generation information-retrieval knowledge-base llm mcp mcp-server model-context-protocol model-context-protocol-servers nodejs research-assistant search-api tavily typescript web-crawling web-research
Last synced: 25 Dec 2025
https://github.com/karrlab/datanator
Toolkit for discovering and aggregating data for whole-cell modeling
cells data-aggregation data-discovery data-integration mathematical-modeling systems-biology
Last synced: 02 Sep 2025
https://github.com/czcorpus/wag
WaG - install your own word profile generator out of diverse data resources
corpora data-aggregation dictionaries language-resources linguistics portal react rxjs typescript visualization
Last synced: 07 Feb 2026
https://github.com/azer0s/tinygator
A tiny, Kafka based, data aggregator for storing JSON metrics in timescaledb
data-aggregation kafka postgres timescaledb
Last synced: 01 May 2025
https://github.com/9ssi7/acc
Go library for efficient data accumulation and processing.
data-aggregation data-processing go-library scheduled-tasks
Last synced: 23 Apr 2025
https://github.com/karrlab/wc_kb
Tools for building databases of experimental data for constructing whole-cell models
bioinformatics cell-biology data-aggregation data-integration genomics molecular-biology systems-biology whole-cell-modeling
Last synced: 03 Sep 2025
https://github.com/ashtonav/addressdata
AddressData.net is a website containing millions of real addresses and maps from 1,500+ cities.
address-search addressdata addresses cities city-data crowdsourced-data csv-data data-aggregation data-collection data-mining data-visualization datasette geospatial lat-long maps openstreetmap overpass-turbo world-cities-csv world-cities-database
Last synced: 22 Sep 2025
https://github.com/joeycumines/go-smartpoll
Package smartpoll offers dynamic, reactive scheduling for synchronized polling of multiple data points.
background-jobs backoff concurrency control-loop data-aggregation dynamic-scheduling go golang polling reactive-programming scheduler task-runner task-scheduler
Last synced: 30 Oct 2025
https://github.com/viclm/mgate
Lightweight gateway written in Node
data-aggregation express gateway
Last synced: 05 Apr 2025
https://github.com/kwb-r/kwb.lca
Functions to Be Used in Life Cycle Assessment (LCA) Projects
data-aggregation data-export data-import data-visualisation excel life-cycle-assessment modelling project-fakin project-smartplant questionaires r rstats spreadsheets template
Last synced: 16 May 2025
https://github.com/estnafinema0/github-trends-aggregator
A real-time GitHub trends aggregator that collects and visualizes popular repositories by language and topic, with WebSocket updates and analytics.
data-aggregation full-stack go go-microservice golang goquery gorilla-mux gorilla-websocket html-css real-time-data web-scraping websocket-updates
Last synced: 10 Apr 2025
https://github.com/shriram-vibhute/data-analysis
This repository offers a comprehensive collection of data analysis techniques using NumPy Pandas, Matplotlib and Seaborn.
data-aggregation data-analysis data-visualization data-wrangling matplotlib numpy pandas seaborn
Last synced: 02 Aug 2025
https://github.com/dmunish/reach
AI-powered disaster alert system for Pakistan that automatically processes official emergency warning documents.
ai data-aggregation disaster-preparedness disaster-risk-reduction early-warning-systems vlm
Last synced: 31 Jan 2026
https://github.com/kwb-r/kwb.pilot
Importing, Aggregating and Visualising Data From KWB Pilot Plants
data-aggregation data-import data-visualisation project-aquanes project-mbr40 project-suleman project-ultimate r r-package rstats
Last synced: 16 May 2025
https://github.com/dmitryro/graphdb-rs
Open-source graph engine for healthcare and analytics | Rust | RocksDB | Sled | FHIR | Decision Support
clinical-data data-aggregation database-proxy distributed-systems event-driven fhir graph-database healthcare hl7 key-value-store knowledge-graph medical-decision-support medical-informatics open-source persistence query-engine rocksdb rust sled telemedicine
Last synced: 20 Jan 2026
https://github.com/effet/cxusage
Codex usage analytics CLI that scans ~/.codex/sessions, aggregates tokens by day or model, and estimates cost using OpenRouter pricing.
anthropic claude cli codex command-line-tool cost-estimation daily-reports data-aggregation developer-tools jsonl log-analysis metrics nodejs observability openai openrouter reporting token-usage typescript usage-analytics
Last synced: 13 Sep 2025
https://github.com/madhurimarawat/data-warehousing
This repository contains practical examples of data warehousing concepts, including star schema and ETL processes, all implemented using MySQL.
data-aggregation data-cleaning data-cleaning-and-preprocessing data-warehousing detailed-documentation etl etl-pipeline mysql normalization olap-cube olap-data olap-database query-optimization snowflake-schema star-schema
Last synced: 25 Mar 2025
https://github.com/utmhikari/daggre
DAta-AGGREgator, a tool to handle data aggregation tasks
daggre data-aggregation data-filtering data-process game-configuration game-testing gin golang join-tables table-joining-service
Last synced: 28 Feb 2025
https://github.com/frankfmy/data-aggregator-dashboard-reactjs
Data Aggregator Dashboard — это современный, многостраничный, модульный React-проект для агрегации и визуализации данных из различных публичных API. Проект реализован с максимальным вниманием к архитектуре, UX/UI, качеству кода, тестам и автоматизации.
api-integration async async-await dashboard data-aggregation fetch frontend hooks javascript promises react spa state-management testing typescript ui-components
Last synced: 07 Jul 2025
https://github.com/akaliutau/map-redux
MapRedux is a lightweight library to aggregate data from Map-like data structures.
Last synced: 28 Feb 2025
https://github.com/kwb-r/kwb.event
Generate Events from Time Series and Work with Events
data-aggregation project-miasco r r-package rstats
Last synced: 16 May 2025
https://github.com/kwb-r/aquanes.report
Collects, aggregates and visualises operational analytical data from water suppliers (including a standardised reporting document)
automated-reporting data-aggregation data-export data-import data-visualisation pilot-plant project-aquanes r r-package rstats shiny-app
Last synced: 16 May 2025
https://github.com/thedigitalninja/personal-data-nexus
A personal data aggregation system that collects, processes, and prepares data from various sources for LLM consumption. This tool helps you aggregate your personal data from services like Fitbit, Rubber Bands, Rocket Money, and more, organizing it into LLM-friendly formats.
data-aggregation llm personal-data quantified-self
Last synced: 26 Dec 2025
https://github.com/madhurimarawat/data-wrangling
This repository contains experiments on data wrangling techniques, focusing on methods for handling missing values, filtering, aggregation, and more.
codes data-aggregation data-concatenation data-conversion data-filtering data-merging data-preprocessing data-reshaping data-sampling data-visualization data-wrangling data-wrangling-workflow date-time-processing detailed-documentation handling-missing-values jupyter-notebook markdown output python text-data-processing
Last synced: 11 Oct 2025
https://github.com/ot-code/sql-sabor-y-tradicion
A SQL-driven project that integrates menu and order data to reveal insights on dish performance, customer preferences, and spending trends. It informs pricing strategies, menu adjustments, and targeted promotions, ultimately enhancing the overall customer experience and driving business growth.
analytical-queries data data-aggregation data-analysis database-design join-queries mysql order-analytics relational-databases restaurant-data sql sql-script
Last synced: 08 Apr 2025
https://github.com/teragrep/dpf_02
Teragrep Result Aggregation for Apache Spark
aggregation data-aggregation data-science data-summarization data-summary data-visualisation data-visualization teragrep
Last synced: 10 Jan 2026
https://github.com/adiii581/pypulse
PyPulse is a polite Python scraper using Selenium and selenium-stealth to aggregate data from PyPI. It automatically handles pagination, cookie banners, and CAPTCHA-based bot detection for a seamless aggregation experience.
automation bot-detection data-aggregation data-engineering pypi python selenium selenium-stealth web-scraping
Last synced: 10 Oct 2025
https://github.com/data-forge-notebook/ohlc-aggregation-example
An example of aggregating OHLC stock data using Data-Forge Notebook
algorithmic-trading data data-aggregation data-analysis ohlc quantitative-finance share-market stock-market trading
Last synced: 30 Jan 2026
https://github.com/toluwaa-o/lite-api
A FastAPI-powered backend that gathers and returns enriched company insights, including Wikipedia-sourced details, country of origin, macroeconomic indicators from the World Bank, and recent news articles with sentiment analysis. Built to support an interactive frontend that helps users understand the economic context behind African companies.
africa api backend company-insights data-aggregation economic-data fastapi google-news macroeconomics python sentiment-analysis web-scraping wikipedia-api world-bank
Last synced: 08 May 2025
https://github.com/toluwaa-o/stears-lite-overview
Central overview repository for the Stears Lite project — documentation, resources, and links to frontend and backend repositories.
africa charts data data-aggregation data-visualization documentation fastapi nextjs project-overview
Last synced: 04 Jul 2025
https://github.com/tejbirringtm/uk-explr
An ETL pipeline (and REST API web server) implementation that ingests bulk data (such as CSV files from UK censuses) to produce a single stats lookup table with OA (output area) resolution; queryable by OA, LSOA (lower-layer super output area), MSOA (middle-layer super output area), LAD (local area district), or postal code.
backend-service data-aggregation data-query demographic-analysis england etl etl-pipeline extract-transform-load geospatial-analysis market-analysis mcp mcp-server model-context-protocol northern-ireland policy-analysis rest-api restful-api scotland united-kingdom wales
Last synced: 02 Aug 2025
https://github.com/lkethridge/sda_project
A Statistical Data Analysis project from TripleTen
binomial-distribution continuous-variables data-aggregation data-manipulation data-preparation distribution frequency-histogram hypothesis-tests law-of-large-numbers normal-approximation normal-distribution one-tail-test paired-samples probability-theory random-sampling skewed-data standard-deviation statistical-data-analysis summary-statistics two-tail-test
Last synced: 04 Jul 2025
https://github.com/sanad343/complete_sql
SQL (Structured Query Language) is a powerful programming language used to manage and manipulate relational databases. It enables users to create, read, update, and delete data within databases, making it essential for data analysis and database management.
data-aggregation data-management data-manipulation data-organization data-retrieval
Last synced: 21 Jan 2026
https://github.com/pgrlq7/pierre_mcp_server
# Pierre MCP ServerA powerful MCP server for analyzing fitness data from various sources like Strava and Fitbit. Connect with AI assistants to gain insights into your running and cycling activities. 🏃♂️🚴♀️
ai-assistant api async claude copilot data-aggregation fitness fitness-data fitness-tracker mcp oauth2 privacy rust self-hosted strava tokio
Last synced: 18 Jun 2025
https://github.com/tashi-2004/apache-flink-spark-data-streaming
This project showcases a real-time data streaming pipeline using Apache Flink, Apache Spark, and Grafana. It streams data, stores it in Parquet format, and performs aggregations for insights, with seamless visualization via Grafana dashboards.
apache-flink apache-spark data-aggregation data-analysis data-science data-streaming data-visualization flink flink-stream-processing flink-streaming grafana-dashboard grafana-plugin pyflink python3
Last synced: 09 Feb 2026
https://github.com/trayanaboykova/mysql
Tasks from my course MYSQL at SoftUni
basic-crud built-in-functions data-aggregation data-definition data-types database-programming functions-and-procedures subqueries-and-joins table-relations transactions
Last synced: 07 Apr 2025
https://github.com/mrminemeet/chiaboard
CLI tool that displays stats similarly to the GUI version of chia
chia-blockchain data-aggregation
Last synced: 13 Aug 2025
https://github.com/elrf3lipes/python_automation_projects
Scripts to automate general time-consuming tasks
api-integration automation beautifulsoup biopython-library clinical-trials data-aggregation data-parsing etl-automation json-data-handling pandas pubmed python scraper
Last synced: 05 Mar 2025
https://github.com/victory-ik/analysing-motorcycle-part-sales
This project analyses wholesale sales data to determine how much net revenue each product line generated per month per warehouse. The dataset contains sales transactions from June to August 2021, including details such as payment methods, warehouses, and client types.
data-aggregation postgresql query-optimization sql
Last synced: 07 Mar 2025
https://github.com/dreamiurg/claude-mountaineering-skills
Automates mountain route research for North American peaks. Aggregates data from 10+ mountaineering sources to generate detailed route beta reports with weather, avalanche conditions, and trip reports.
backcountry claude-code-plugin climbing data-aggregation hiking mountaineering route-planning trip-reports
Last synced: 30 Jan 2026
https://github.com/kwb-r/kwb.umberto
Helper functions for UMBERTO (https://www.ifu.com/umberto/) model output
data-aggregation data-import data-visualisation life-cycle-assessment modelling project-fakin project-smartplant r rstats
Last synced: 16 May 2025