An open API service indexing awesome lists of open source software.

data

Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)

https://github.com/raulmaulidhino-dev/ml_modelling_regression

Many factors influence students' grades, and study hours are one of them. In this mini analysis project, three models learn and predict the relationship between students' study hours and their exam scores. The project identifies the best-performing ML model for the problem.

data data-analysis-python data-science eda machine-learning scikit-learn

Last synced: 28 Jan 2026

https://github.com/equinor/fmu-sumo-uploader

Upload to Sumo in the FMU context

data fmu python subsurface sumo

Last synced: 06 May 2026

https://github.com/cmda-tt/course-25-26

🎓 tech track · 2025-2026 · curriculum and syllabus 📊

d3 data datavis functional javascript programming research svelte visualization

Last synced: 20 Jan 2026

https://github.com/sahraiidle/email-spam-detector

Email/SMS spam detector with a Flask UI/API, tuned ML models (TF‑IDF + SVM/LogReg/NB), and a ready-to-run web form plus JSON endpoint for predictions.

data machine-learning numpy pandas python randomforest scikit-learn spam-classifier spam-detection svm

Last synced: 24 Jan 2026

https://github.com/jdanielgoh/cobertura-campanias

In a democracy, is there room for every voice? A project to visualize the INE's monitoring of radio and TV coverage of the 2024 presidential candidacies.

d3js data datavisualization vue

Last synced: 09 Jun 2026

https://github.com/bishtrishu/pizza_sales_analysis_dashboard_sql_bi

Welcome to the Pizza Sales Analysis Dashboard project! This repository contains a comprehensive guide to building an interactive and insightful dashboard for analyzing pizza sales data using SQL and Power BI.

data data-science dataanalyst datavisualization dax dax-query microsoft microsoft-azure microsoft-sql-server msexcel mysql powerbi powerquery project sql

Last synced: 16 Mar 2026

https://github.com/soenneker/soenneker.dtos.idpartitionpair

A minimal Record type with an Id (string), PartitionKey (string), and maximum JSON compatibility

csharp data dotnet dto id key partition

Last synced: 09 Mar 2026

https://github.com/fatihemres/Africa

Africa app built with SwiftUI, using AVFoundation, MapKit, data, models, animations, and stickers.

animations avfoundation data mapkit models swift swift-animations swiftui

Last synced: 31 Aug 2025

https://github.com/andygol/andygol.github.io

Andrii Holovin – Product & Project Manager Geospatial Expert / OpenStreetMap Consultant / DevOps practitioner

consultant data data-structures devops experience floss gis mapping navigation openstreetmap personal-site personal-website

Last synced: 13 May 2026

https://github.com/priyapuranik/data-analytics-using_python

Analyzed hotel data to find meaningful insights, including booking patterns, seasonal trends, and more.

data pandas python sql visualization

Last synced: 06 Apr 2026

https://github.com/ttozatto/sparkify

Churn Prediction for music streaming app with PySpark

analysis churn data learning machine predictive pyspark science spark

Last synced: 16 Jan 2026

https://github.com/spatialcurrent/go-pipe

go-pipe is a simple library for piping objects from iterators to writers.

big-data bigdata concurrency data

Last synced: 29 Jan 2026

https://github.com/nasa-pds/nucleus

Nucleus is a software platform used to create workflows for the Planetary Data System (PDS).

data ingestion pds planetary workflow

Last synced: 06 Feb 2026

https://github.com/als8446/tripleten-data-science-projects

Projects Overview: projects made in the Data Scientist course from TripleTen LatAm

data data-analysis hypothesis-tests machine matplotlib numpy pandas python scipy sklearn

Last synced: 10 Apr 2026

https://github.com/apoorv74/njdg-stats

Tracking data from the National Judicial Data Grid's (NJDG) district courts portal

data git-scraping judiciary law

Last synced: 29 Jan 2026

https://github.com/terracrow/tml

Easy to use data manipulation package using YAML.

data database db node npm tml yml

Last synced: 26 Feb 2025

https://github.com/dfsp-spirit/neuroimaging_testdata

Contains test data for unit tests, used in developing neuroimaging software. Ignore this. Licenses in the individual archives.

data unittesting

Last synced: 25 Feb 2026

https://github.com/peterhellberg/bugsnag-data

Dump Bugsnag data using the Data access API

bugsnag data go

Last synced: 22 Jun 2026

https://github.com/snimmagadda1/luigi-etl-example

🔍 Example of an ETL pipeline using Spotify's Luigi

data luigi luigi-pipeline python spotify

Last synced: 30 Mar 2025

https://github.com/restricted/redis-data-cache

TypeScript implementation of data cache management by class name

cache data object redis state typesript

Last synced: 30 Jan 2026

https://github.com/tks18/xl-pq-handler

A Pythonic Power Query (.pq) File Manager for Excel & Power BI Automation

analytics automation data excel power-query powerbi python xlwings

Last synced: 20 Jan 2026

https://github.com/gabya06/twitter_models

Repository used for twitter impression models

data data-science impressions machinelearning python ridge-regression sklearn twitter

Last synced: 04 May 2026

https://github.com/team-hydrogen/2025-adc-data

All files relating to the computation of the data provided

data jupyter-notebook nasa-app-development-challenge

Last synced: 11 Apr 2025

https://github.com/bubblymaps/bubblymaps

The open source bubbler map. Mapping the world's water fountains. Open Code, Open Data.

bubbler bubbly-maps data fountain map open-source water

Last synced: 31 Jan 2026

https://github.com/hlan22/2025-03-18-data-validation

(no longer useful) DSCI 310 Lecture about Data validation and code testing! Made in tandem with:

data validation

Last synced: 23 Jun 2026

https://github.com/opendatach/alds

A collaborative list of resources and ideas to enable "Amt Local Data Stewards" to manage the (open) data of their respective federal office

awesome-list data datagovernance dataliteracy datamanagement datastewardship opendata opengovernmentdata

Last synced: 31 Jan 2026

https://github.com/piyushkumar2025/india-general-elections-2024_data-analyst

Analyzed election data for 540+ constituencies and 100+ parties using SQL. Calculated state-wise seat distributions, classified 30+ parties into alliances, identified top 10 candidates by EVM votes, calculated victory margins, and analyzed voting patterns for 300+ candidates to uncover key insights.

analytics data database mysql sql statistics

Last synced: 22 May 2026

https://github.com/badranalyst/data-cleaning-and-exploratory-data-analysis-project

This project uses SQL to clean and analyze a layoffs dataset. Data cleaning tasks include removing duplicates, standardizing values, and handling missing data. Exploratory analysis is performed to identify trends in layoffs across companies, industries, and time periods.

cleaning-data data database dataset mysql mysql-database sql

Last synced: 07 Apr 2025

https://github.com/abhishekn1947/samgov-scraper

Automated Python scraper for sam.gov contracts

analytics automation aws data pandas postgresql rds selenium webscraper

Last synced: 09 Apr 2026

https://github.com/mikpom/genomvar

Sequence variant analysis in Python

data genomics

Last synced: 10 Apr 2026

https://github.com/bhar2254/sobershift

Simple attendance tracking application

data form ifc jambi java qt tracking utility

Last synced: 05 May 2026

https://github.com/agdturner/ccg-data

A modularised Java library for processing data sets with classes for: data records; collections of data records; and identifiers.

data data-analysis

Last synced: 12 Jan 2026

https://github.com/jeugregg/deeplearningpicturedogs

Classify dog pictures using deep-learning convolutional neural networks (CNNs)

classez-des-images cnn-keras data data-science ipynb neural-network vision

Last synced: 24 Jul 2025

https://github.com/passly-nl/data

Source code of the data layer.

data passly ticketing typescript

Last synced: 27 May 2026

https://github.com/chocolateboy/data

Structured data scraped from unstructured (or semi-structured) sources

data dataset datasets json opendata scrape scraped scraper wikipedia

Last synced: 30 Aug 2025

https://github.com/ymorsi7/quranicvisualization

A visual exploration tool for the Holy Quran using D3.js treemaps.

css d3 d3js data data-visualization html islam islamic javascript js quran quranic treemaps visualization

Last synced: 15 Apr 2026

https://github.com/schoolsquirrel/holiday-data

Automatically updated holiday data for SchoolSquirrel

data holidays schoolsquirrel scripts vacation

Last synced: 03 Oct 2025

https://github.com/tanyagarg25/project_covidanalysis

This repository is a project for analyzing COVID-19 data using SQL and visualizing it with Tableau. Technologies used include SQL for querying and Tableau for data visualization.

analysis dashboard data data-visualization sql tableau

Last synced: 08 Feb 2026

https://github.com/darshjasani/insurance-claim-analysis

This dataset contains insightful information related to insurance claims, giving us an in-depth look into the demographic patterns of those receiving them.

analysis data kaggle sql

Last synced: 27 Aug 2025

https://github.com/suhanyujie/ai-driving-data

some AI driving data

ai-driving-car data

Last synced: 08 Feb 2026

https://github.com/munas-git/codm-review-analysis-and-predictions

Sentiment analysis on Call of Duty Mobile Google Play Store user reviews with ML model to classify new reviews.

data flask machine-learning python sentiment-analysis

Last synced: 05 May 2026

https://github.com/ferru97/jsketchfabcrawler

jSketchfabCrawler is a Java tool for automatically crawling model information from sketchfab.com

crawler data database java sketchfab sql

Last synced: 03 Jan 2026

https://github.com/matt-dray/draytasets

🔢🥸 Miscellaneous datasets I've collected or prepared

card-games data phd pokemon

Last synced: 09 Feb 2026

https://github.com/debjyotisaha/tableau-projects-phase-2

Published interactive dashboards on Tableau Public, highlighting expertise in data visualization and storytelling through analyses of transportation patterns, sales trends, and demographic studies. These projects showcase the ability to transform complex datasets into actionable, intuitive visuals for decision-making.

dashboards data data-analysis data-visualisation tableau

Last synced: 26 Aug 2025

https://github.com/rdmurphy/deno-quaff

A port of the quaff Node.js library to Deno.

archieml csv data deno json toml yaml

Last synced: 05 May 2026

https://github.com/muthupillai1204/diwali_sales_analysis

The Diwali sales analysis reviews past data to identify trends, peak buying times, popular products, and customer demographics. It assesses sales volume, revenue growth, and promotional effectiveness, helping businesses optimize marketing and inventory for future seasons.

data datacleaning eda excel jupyter-notebook matlplotlib numpy pandas python seaborn visualization

Last synced: 05 May 2026

https://github.com/mateuszskoczek/generatorcsv

GeneratorCSV is a student and teacher data converter for Microsoft 365 Admin Center. The project was implemented for Sobolew High School.

admin converter data microsoft365 python school tkinter

Last synced: 26 Aug 2025

https://github.com/haroontrailblazer/machine_learning

A curated resource hub for learning machine learning, featuring tutorials, code examples, datasets, and hands-on projects to build foundational skills and explore real-world applications.

data data-analysis data-visualization database dataset gradient-descent machine-learning pandas python3 random-forest sklearn statistics

Last synced: 16 Apr 2026

https://github.com/rudxain/xorsum

Get XOR checksum with this command-line tool

binary checksum cli data digest file files hexadecimal rust-crate xor

Last synced: 08 Mar 2026

https://github.com/0xnu/data-analyst-training

The repository contains training materials for data analysts.

data data-analysis data-analyst

Last synced: 25 Aug 2025

https://github.com/julienmalka/shiftgenerator

ShiftGenerator WeSki 2018

data data-science latex python

Last synced: 06 May 2026

https://github.com/vatshayan/songs-datasets

Song and music datasets covering dancing, emotional, happy, and scenic-view categories

1000dataset classfication csv data datapackage datapackages dataset datasets excel free freedata freedatasets genre machine music sgenre song songs

Last synced: 18 Mar 2026

https://github.com/luminati-io/google-maps-dataset-samples

A sample dataset of over 1000 Google Maps businesses, extracted using the Bright Data API, ideal for competitor analysis, location-based marketing, and market strategies.

api data dataset google-maps maps web-scraping

Last synced: 03 Jan 2026

https://github.com/ksm26/ml-ai-data-science-jobs-in-canada

Explore the latest machine learning, artificial intelligence, and data science job opportunities in Canada. Stay informed about Canadian tech job market trends and find your next career move.

ai-canada ai-careers canada canadian-tech-companies canadian-tech-job-market data data-analysis data-engineering data-science data-science-careers machine-learning prompt-engineering robotics

Last synced: 06 May 2026

https://github.com/miozilla/snowden

snowden ⛄🎮 : VR Game # Snowflake # Data Engineering # ELT

data elt engineering snowflake sql vr-game

Last synced: 11 Feb 2026

https://github.com/anandanraju/power_bi_dashboard_projects

The goal of this project is to provide insights into consumer behavior and purchasing trends across different platforms. By analyzing data from Amazon and other sources, we aim to uncover valuable insights that can inform marketing strategies, product development, and decision-making processes.

amazon dashboard data data-visualization healthcare powerbi project

Last synced: 11 Feb 2026

https://github.com/anuragagarwal96/hospital-mortality-rate-sql-analysis

In this project, I took a hospital dataset from Kaggle, analysed it, and predicted the mortality rate of patients admitted to hospitals. I used a combination of SQL, Tableau, and Microsoft Excel for this project.

data data-visualization dataanalysis dataanalysisusingsql excel msexcel mssqlserver sql tableau tableau-public

Last synced: 09 Mar 2026

https://github.com/kunalthakur204/visualization-on-flower

🌸 Flower Dataset Visualization: visualizing patterns and relationships in flower data through charts and plots. Perfect for exploring floral characteristics and trends! 📊

data data-visualization dataanalysis flowerdataset python

Last synced: 16 Apr 2026

https://github.com/parthds02/analyzing-student-success-with-data

Discover key factors influencing student performance through data analysis and visualization. Explore gender, parental education, sports, and ethnicity impacts.

data datascience jupyter-notebook kaggle python pythonlibraries

Last synced: 06 May 2026

https://github.com/gaemapiracicaba/norma_dec_8468-76

Quality and effluent discharge standards for inland waters

data python

Last synced: 19 Apr 2026

https://github.com/jbn/vaquero

A Python library for iterative and interactive data wrangling at laptop-scale.

data data-analysis data-cleaning data-mining dirty-data elt etl etl-framework

Last synced: 10 Jun 2026

https://github.com/afeiship/next-object-operator

Object set/get/sets/gets and other operators.

data get gets next operator set sets store

Last synced: 27 Feb 2026

https://github.com/paulrosset/cyclone

Network data consumption monitoring

data monitoring network networking

Last synced: 23 Aug 2025

https://github.com/ralzz/dibimbing_datascience

This project contains an Exploratory Data Analysis (EDA) of the Estonia Passenger List dataset. I handled missing values, removed duplicate data, and created basic visualizations to find insights.

data data-science eda google-colab kaggle pandas python

Last synced: 06 May 2026

https://github.com/alexyiann/finance

This repository contains scripts for pulling and comparing data, as well as simple Python scripts for automating crypto trades and backtesting trading strategies on both crypto and stocks.

api bots data database finance option option-strategies strategy trading trading-algorithms

Last synced: 03 Jan 2026

https://github.com/namratha2301/sales-orders-analysis

Wanted to experiment with Looker. This dashboard visualizes sales trends across regions, customer segments, and product categories.

business-analytics dashboard data dataanalysis datavisualization excel looker looker-studio

Last synced: 13 Feb 2026

https://github.com/wiseql/wiseql

The wise data browser — run SQL recipes as small, observable, debuggable steps

data debugging duckdb oracle quality sql tui

Last synced: 13 Jun 2026

https://github.com/neptun-software/neptun.data.generators

Send scraped data from neptun-scraper to ChatGPT to generate training data for NEPTUN.AI.

data generator

Last synced: 30 Jul 2025

https://github.com/h4fide/politicalcompassbot

This Python project allows you to take a quiz and find out where you fit on the political compass. Give it a try and see where you stand!

bot data greedy-algorithms politics python python3 sql telegram

Last synced: 19 Aug 2025

https://github.com/spajai/etl-sharepoint-data-uploader-pipeline

Custom Python script to pull specific data from a source and upload it to Microsoft SharePoint

data etl etl-pipeline microsoft microsoft365 python3 sharepoint sharepoint-online

Last synced: 11 Nov 2025

https://github.com/wittyicon29/zeotap-ds-assignment

Internship application assignment

data data-science

Last synced: 19 Aug 2025

https://github.com/e-kotov/albofr-data-archive

Tiger Mosquito Colonisation in France data

aedes-albopictus colonisation data france tiger-mosquito

Last synced: 23 May 2026

https://github.com/sunnahboy/checkfake_true_news

Building data structures with linked lists and arrays and finding the best algorithms to implement a fake-news detection system

algorithms data level low programming structure

Last synced: 28 Feb 2026

https://github.com/progati00/marketing-mix-modeling-mmm-for-marketing-budget-optimization

A Marketing Mix Modeling (MMM) project using Python to analyze channel performance, calculate ROI, and simulate marketing budget changes for better business decisions. Includes a trained Linear Regression model, ROI analytics, and a Flask API for revenue prediction.

api budget-optimization data data-analysis data-science ecommerce eda flask jupyter-notebook linear-regression machine-learning marketing-analytics marketing-mix-modeling python roi-analysis vscode

Last synced: 14 Apr 2026

https://github.com/lab5e/loadabledata

Simple framework-agnostic wrapper around loadable data to help encapsulate and use state changes in a UI.

async data loadable state typescript ui

Last synced: 07 May 2026

https://github.com/madhuresh2011/genai-powered-data-analytics-by-tata

I recently participated in Tata iQ's job simulation on the Forage platform, and it was incredibly useful to understand what it might be like to be on a data analytics team in an AI transformation consulting role.

chatgpt data dataanalytics eda excel gemini generative-ai internships powerpoint presentation

Last synced: 14 Feb 2026

https://github.com/florianreuth/pit

pit - the private information tracker

data java passwords security vault

Last synced: 28 Feb 2026