An open API service indexing awesome lists of open source software.

data

Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)

https://github.com/sirmaxx/log_manager

log manager services for microservices

data fastapi logging microservice mongodb

Last synced: 09 Apr 2026

https://github.com/srindot/average_flightdata_collection_fwuaav

This repository is designed for collecting average data for a flapping wing UAV. The script acg_coeff_data_collection.py runs the necessary data collection, and the resulting data is saved into a CSV file called AverageFlightData.csv.

data flaping-uav

Last synced: 18 Aug 2025

https://github.com/vedikasnehil/my-data-science-projects

This repository is a comprehensive collection of resources and implementations dedicated to the field of Data Science. It serves as a platform for exploring various aspects of data science, ranging from data preprocessing and exploratory data analysis (EDA) to machine learning and deep learning.

data data-science deep-learning machine-learning matplotlib numpy python sql visualization

Last synced: 10 Apr 2026

https://github.com/progati00/marketing-mix-modeling-mmm-for-marketing-budget-optimization

A Marketing Mix Modeling (MMM) project using Python to analyze channel performance, calculate ROI, and simulate marketing budget changes for better business decisions. Includes a trained Linear Regression model, ROI analytics, and a Flask API for revenue prediction.

api budget-optimization data data-analysis data-science ecommerce eda flask jupyter-notebook linear-regression machine-learning marketing-analytics marketing-mix-modeling python roi-analysis vscode

Last synced: 14 Apr 2026

https://github.com/wittyicon29/zeotap-ds-assignment

Internship application assignment

data data-science

Last synced: 19 Aug 2025

https://github.com/hakusaro/facts

A fact based knowledge system (FBKS) experiment.

data facts hacktoberfest

Last synced: 03 Jan 2026

https://github.com/rationalprabal/book-management-app

A Node.js and Express.js application for managing books, featuring role-based authentication and authorization with JWT, file uploads for book cover pages, robust data validation, and documentation using Swagger. The project includes user roles such as Admin, Author, and Reader, each with specific permissions.

data expressjs jwt-authentication mongodb mongoose nodejs rbac-roles

Last synced: 10 Apr 2026

https://github.com/spajai/etl-sharepoint-data-uploader-pipeline

Custom Python script to pull specific data from a source and upload it to Microsoft SharePoint

data etl etl-pipeline microsoft microsoft365 python3 sharepoint sharepoint-online

Last synced: 11 Nov 2025

https://github.com/h4fide/politicalcompassbot

This Python project allows you to take a quiz and find out where you fit on the political compass. Give it a try and see where you stand!

bot data greedy-algorithms politics python python3 sql telegram

Last synced: 19 Aug 2025

https://github.com/rugwiroparfait/alx_sql

This repo is where I save my queries and learning materials from the ALX Data Science program

anaconda data data-analysis jupyter-notebook sql

Last synced: 19 Aug 2025

https://github.com/rachelresende/projeto-finan-as

This repository is for a course on data analysis for finance that I took on Udemy in 2025.

analytics data financas finance finance-management

Last synced: 19 Aug 2025

https://github.com/giscience/measures-rest-oshdb-app

A frontend for providing measures for geospatial datasets, using the OSHDB

data dggs geospatial measure openstreetmap rest

Last synced: 20 Apr 2026

https://github.com/ahmad-ali-rafique/wine-quality-dataset

Comprehensive analysis and modeling of the Wine Quality dataset, including exploratory data analysis (EDA), data preprocessing, model training, and performance evaluation using MSE and RMSE.

analytics data datacleaning decision-tree-regression exploratory-data-analysis gradient-boosting-regressor linear-regression machine-learning mean-square-error model

Last synced: 21 Aug 2025

https://github.com/paulrosset/cyclone

Network data consumption monitoring

data monitoring network networking

Last synced: 23 Aug 2025

https://github.com/anuragagarwal96/hospital-mortality-rate-sql-analysis

In this project, I took a hospital dataset from Kaggle, analysed it, and predicted the mortality rate of patients admitted to hospitals. I used a combination of SQL, Tableau, and Microsoft Excel for this project.

data data-visualization dataanalysis dataanalysisusingsql excel msexcel mssqlserver sql tableau tableau-public

Last synced: 09 Mar 2026

https://github.com/luminati-io/google-maps-dataset-samples

A sample dataset of over 1000 Google Maps businesses, extracted using the Bright Data API, ideal for competitor analysis, location-based marketing, and market strategies.

api data dataset google-maps maps web-scraping

Last synced: 03 Jan 2026

https://github.com/franckalbinet/maris-crawlers

Automated data harvesting of MARIS data sources

automation data marine-radioactivity

Last synced: 25 Aug 2025

https://github.com/0xnu/data-analyst-training

The repository contains training materials for data analysts.

data data-analysis data-analyst

Last synced: 25 Aug 2025

https://github.com/mateuszskoczek/generatorcsv

GeneratorCSV is a student and teacher data converter for Microsoft 365 Admin Center. The project was implemented for Sobolew High School.

admin converter data microsoft365 python school tkinter

Last synced: 26 Aug 2025

https://github.com/debjyotisaha/tableau-projects-phase-2

Published interactive dashboards on Tableau Public, highlighting expertise in data visualization and storytelling through analyses of transportation patterns, sales trends, and demographic studies. These projects showcase the ability to transform complex datasets into actionable, intuitive visuals for decision-making.

dashboards data data-analysis data-visualisation tableau

Last synced: 26 Aug 2025

https://github.com/ferru97/jsketchfabcrawler

jSketchfabCrawler is a Java crawler for automatically collecting model information from sketchfab.com

crawler data database java sketchfab sql

Last synced: 03 Jan 2026

https://github.com/darshjasani/insurance-claim-analysis

This dataset contains insightful information related to insurance claims, giving us an in-depth look into the demographic patterns of those receiving them.

analysis data kaggle sql

Last synced: 27 Aug 2025

https://github.com/schoolsquirrel/holiday-data

Automatically updated holiday data for SchoolSquirrel

data holidays schoolsquirrel scripts vacation

Last synced: 03 Oct 2025

https://github.com/miozilla/fraudfinder

fraudfinder: Historical Payment Transactions, Fraud Detection, EDA, Feature Store, Model Registry

analysis data exploratory feature-store fraud-detection

Last synced: 29 Aug 2025

https://github.com/lucasnbsb/data-structures-and-algorithms

Studying data structures and algorithms, mostly on leetcode

algorithms data structures

Last synced: 29 Aug 2025

https://github.com/roggersanguzu/weather-medical-expense-prediction-ml-models

This repo contains one model for determining rainfall patterns and another for predicting medical expenses

data data-analysis data-science datasets joblib machine-learning machine-learning-algorithms scikitlearn-machine-learning

Last synced: 30 Aug 2025

https://github.com/chocolateboy/data

Structured data scraped from unstructured (or semi-structured) sources

data dataset datasets json opendata scrape scraped scraper wikipedia

Last synced: 30 Aug 2025

https://github.com/ate47/playerdata

Get data about a player with a command

bukkit-plugin command data spigot-plugin

Last synced: 30 Aug 2025

https://github.com/olekscode/datageneration

Exploring the methods of data generation for different Machine Learning algorithms

data javascript machine-learning

Last synced: 05 Apr 2025

https://github.com/bilgehangecici/datatypeconverter

Converts integers and floating-point numbers to their bit-level representations.

data datatypeconverter java machine-level variables

Last synced: 30 Mar 2025

https://github.com/passly-nl/data

Source code of the data layer.

data passly ticketing typescript

Last synced: 27 May 2026

https://github.com/sungchun12/demotron

CLI to delight real people with live demos

cli data demo sqlmesh

Last synced: 26 Feb 2025

https://github.com/agdturner/ccg-data

A modularised Java library for processing data sets with classes for: data records; collections of data records; and identifiers.

data data-analysis

Last synced: 12 Jan 2026

https://github.com/mikpom/genomvar

Sequence variant analysis in Python

data genomics

Last synced: 10 Apr 2026

https://github.com/shubhammittal-data/hr_dashboard_tableau

An interactive HR Analytics Dashboard built using Tableau. Provides insights into workforce demographics, hiring trends, salary analysis, and employee records for data-driven decision-making.

chatgpt4 data data-analysis data-visualization drawio-tools faker-generator hr-analytics hr-analytics-dashboard human-resources numpy python tableau tableau-public

Last synced: 17 May 2026

https://github.com/badranalyst/data-cleaning-and-exploratory-data-analysis-project

This project uses SQL to clean and analyze a layoffs dataset. Data cleaning tasks include removing duplicates, standardizing values, and handling missing data. Exploratory analysis is performed to identify trends in layoffs across companies, industries, and time periods.

cleaning-data data database dataset mysql mysql-database sql

Last synced: 07 Apr 2025

https://github.com/aminnairi/node-decode

Check that your data meet your expectations

check data decode expectations schema

Last synced: 22 Apr 2026

https://github.com/tks18/xl-pq-handler

A Pythonic Power Query (.pq) File Manager for Excel & Power BI Automation

analytics automation data excel power-query powerbi python xlwings

Last synced: 20 Jan 2026

https://github.com/carlosrs14/parallel-data-preprocessig-system

A parallel data preprocessing system using threads and synchronization mechanisms (barrier, busy-waiting, condition variables) to clean and prepare data for AI training.

barrier-method c condition-variable data operative-systems parallel-computing posix preprocessing synchronization threads

Last synced: 24 Jul 2025

https://github.com/nmelgar/birthday_sports_dataviz

We will analyze how the Matthew Effect has influenced professional sports players.

analysis csv data data-analysis data-science data-visualization datavisualization dataviz probability research tableau

Last synced: 08 Jan 2026

https://github.com/ttozatto/sparkify

Churn Prediction for music streaming app with PySpark

analysis churn data learning machine predictive pyspark science spark

Last synced: 16 Jan 2026

https://github.com/goutamhegde002/dsa-roadmap-for-beginners

The "DSA Roadmap for Beginners" repository is a comprehensive guide designed to help beginners learn Data Structures and Algorithms (DSA) efficiently. It provides structured content covering fundamental and advanced topics in DSA, with practical examples, exercises, and coding problems.

beginner beginner-friendly beginner-guide coding-practice data data-science data-structures data-structures-and-algorithms dsa dsa-algorithm dsa-learning-series dsa-practice dsa-roadmap interview-preparation interview-resources programming-fundamentals

Last synced: 28 Feb 2026

https://github.com/fatihemres/Africa

Africa app built with SwiftUI, using AVFoundation, MapKit, data, models, animations, and stickers.

animations avfoundation data mapkit models swift swift-animations swiftui

Last synced: 31 Aug 2025

https://github.com/fatihemres/Fruits

Fruit Details app built with SwiftUI, using data, models, animation, and a practical onboarding flow.

animations data models onboarding swift swiftui

Last synced: 31 Aug 2025

https://github.com/team810/frcs

FRCS is online, international, crowdsourced data collection software written for FRC competitions. It was created by Team 810, The Mechanical Bulls.

crowdsourcing data web

Last synced: 14 Mar 2025

https://github.com/cmda-tt/course-25-26

🎓 tech track · 2025-2026 · curriculum and syllabus 📊

d3 data datavis functional javascript programming research svelte visualization

Last synced: 20 Jan 2026

https://github.com/srgchrksv/articles

My articles about coding, data etc

article coding data learning medium python

Last synced: 18 Jun 2026

https://github.com/sidneyarcidiacono/data-parser

A node module designed to make reading in large files as easy as calling one function.

data javascript node npm

Last synced: 05 May 2026

https://github.com/equinor/fmu-sumo-uploader

Upload to Sumo in the FMU context

data fmu python subsurface sumo

Last synced: 06 May 2026

https://github.com/hit07/fitgpt-hacksc

AI-Powered Fitness Coach; 🥈 Runner up at HackSC's SoCal Tech Week hackathon

data elasticsearch gpt-4o-mini llm pipeline

Last synced: 28 Feb 2025

https://github.com/mnkanout/patients_medication_prediction

The aim of the project is to create a model that can help medical professionals select the proper medication for patients based on their symptoms. The model uses historical data of other patients to predict what could be the most suitable medication based on the patient's symptoms.

data data-analysis data-science data-visualization decision-tree-classifier machine-learning python3

Last synced: 29 Jun 2025

https://github.com/ccworld1000/cccomposition

CCComposition for code style. Code style conversion work is accepted.

cccomposition composit construction data structure visual

Last synced: 04 Jan 2026

https://github.com/andrii04/ga4-gcs-to-bigquery-etl

Automated Data Pipeline that ingests daily GA4-formatted CSV files from a private Google Cloud Storage bucket, validates and loads them into BigQuery, and prepares analysis-ready views. The solution is built for deployment as a Cloud Function triggered by Cloud Scheduler and uses Python with the Google Cloud Storage and BigQuery client libraries.

automation bigquery cloud cloudfunctions data data-analysis data-engineering etl etlpipeline gcp google googlecloudplatform pipeline python sql

Last synced: 18 May 2026

https://github.com/karashiiro/lodestone-character-data-scraper

Lodestone character data scraper.

data ffxiv ffxiv-character lodestone

Last synced: 23 Apr 2026

https://github.com/checco9811/data-engineering-bootcamp-homework

Homework solutions for DataExpert.io data engineering bootcamp

apache-spark data data-engineering sql

Last synced: 14 Mar 2025

https://github.com/nicholas-owen/rdm-calendar

A small utility to manage conference and event information

calendar conference data event research

Last synced: 26 May 2026

https://github.com/open-geodata/sp_bh_pcj-2020-2035

Spatial data from the Agência das Bacias PCJ, with information presented in the 2020-2035 Basin Plan (Plano de Bacias 2020-2035)

data python

Last synced: 16 Jan 2026

https://github.com/pythoncoderunicorn/tool-discography

Music Band TOOL albums and songs dataset

data data-science metal-music music r songs

Last synced: 26 May 2026

https://github.com/tomcardoso/journalism-data-intersection

A talk on working at the intersection of journalism and data science

data data-journalism journalism

Last synced: 15 May 2025

https://github.com/reubano/devcraft-workshop

Materials for the DevCraft workshop on stream processing

data functional-programming meza python riko stream-processing tutorial

Last synced: 04 May 2026

https://github.com/mnazlukhanyan/da-projects

A portfolio of data analytics projects demonstrating my skills, abilities, and experience

data data-vizualisation hypothesis-tests matplotlib pandas plotly postgresql product-metrics python scipy seaborn sql visualization

Last synced: 11 Apr 2026

https://github.com/oniani/miniframe

Minimal data frames with relational algebra

data dataframe-library haskell haskell-library library

Last synced: 04 Mar 2025

https://github.com/illustratien/toolphd

Make your analysis simple and reproducible

academic analysis data phd publications r r-package reproducible-research scientific

Last synced: 26 Jan 2026

https://github.com/muhammadadilnaeem/student-performance-indicater-end-to-end-data-science-project

This project leverages data science techniques to build a predictive model that estimates a student's exam performance. The project follows a structured data science workflow, including data collection, preprocessing, model building, evaluation, and deployment.

data machine-learning-algorithms pandas pymysql python sql

Last synced: 11 Apr 2026

https://github.com/jigyasag18/fake-news-prediction-app

The Fake News Prediction App repository offers a machine learning project that classifies news articles as fake or real. It uses a dataset of 20,000 articles and employs methods such as TF-IDF vectorization and lemmatization, achieving ~95% classification accuracy with a random forest classifier model.

data datapreprocessing logistic-regression machine-learning machine-learning-algorithms numpy pandas prediction stemming streamlit streamlit-webapp vectorization

Last synced: 11 Apr 2026

https://github.com/quantumudit/test-store-data-analysis

This repository showcases a web scraper with a pipeline structure for efficient data extraction and transformation from websites. The tool can be tailored to support data analysis and informed decision-making.

data data-visualization dataanalytics python python-webscraping webscraper webscraping-data

Last synced: 11 Apr 2026

https://github.com/guilyx/airplane-booking

Simple airline ticket reservation program.

algorithms data linked-list

Last synced: 25 Jun 2025

https://github.com/fuzzt/location-analyzer

The Location Data Analyzer is a Spring Boot application that offers insights on location data, such as counting locations by type, calculating average ratings, and identifying the most reviewed and incomplete entries. It features a simple frontend (HTML, CSS, JavaScript) and is deployed on Render.

analysis api average css data deployment docker fetch-api frontend html javascript location maven ratings render restful-api reviews spring-boot techstack

Last synced: 11 Apr 2026

https://github.com/nisanth2004/springboot-kafka-real-world-project-wikimedia

A project that uses Apache Kafka to stream and process Wikimedia data.

async broker communication data java kafka message real-time real-time-analytics springboot wikimedia

Last synced: 14 May 2026

https://github.com/martinius96/meteostanica-odosielacie-scripty

Weather station (Meteostanica) - Arduino, ESP8266, ESP32 - sketches for sending data to be presented in a web interface.

arduino bme280 bmp280 data dht22 ds18b20 esp32 esp8266 espressif html meteo meteostanica mysel nodemcu php stanica teplota tlak vlhkost webstranka

Last synced: 11 Apr 2026

https://github.com/justinjjlee/simulation-discrete

Employing data transformations and simulations to answer random questions

analytics data data-science julia python simulation spark

Last synced: 30 Apr 2026

https://github.com/gdcmarinho/vaultchat

VaultChat is an end-to-end encrypted chat service

chat data e2ee encrypted messaging privacy

Last synced: 23 Mar 2025

https://github.com/dug22/jjournal

Jupyter-like notebook software for Java

data data-analysis data-science java jshell jshell-repl notebook swing swing-application

Last synced: 11 Apr 2026

https://github.com/g3th/fit_file_decoder

Decodes '*.fit' files and returns readable values.

bytes data decoder fit-file hex parsing

Last synced: 30 Jun 2025

https://github.com/zulfachafidz/titanic_explorer_predicting_survival_with_classification_using_knn_algorithm

Tracking Life Safety with the KNN Predictive Analysis Approach. Leveraging the Titanic Dataset, we apply classification analysis to predict the fate of passengers based on a variety of features.

algorithm algorithms data data-analysis data-mining data-science datamodeling datapreprocessing dataset knn-algorithm knn-classification machine-learning machine-learning-algorithms prediction-model

Last synced: 01 Sep 2025

https://github.com/gappeah/layoffs-exploratory-data-analysis

This project uses MySQL to perform data cleaning and exploratory data analysis (EDA) on a dataset detailing company layoffs. The primary goal is to process, clean, and explore the data to gain insights into trends and patterns related to layoffs across various sectors.

data dataanalysis eda mysql sql

Last synced: 12 Jul 2025

https://github.com/nyo16/megas_pinakas

Bigtable Elixir gRPC client

bigtable data elixir grpc

Last synced: 13 Jan 2026

https://github.com/raphaellaude/usaschooldata

Cleaned and accessible school enrollment data for US schools

data duckdb duckdb-wasm education object-storage oss wasm

Last synced: 12 May 2026