An open API service indexing awesome lists of open source software.

data

Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)

https://github.com/csmith0651/ormy

A simple python ORM.

data database python

Last synced: 13 May 2026

https://github.com/ciyer/altair-matplotlib

Ports of examples from a Matplotlib tutorial to Altair/Vega

altair data dataviz vega vega-lite

Last synced: 29 Jul 2025

https://github.com/julienmalka/shiftgenerator

ShiftGenerator WeSki 2018

data data-science latex python

Last synced: 06 May 2026

https://github.com/domarps/grad-project-reports

Write-ups of a few key semester-long projects I have worked during my Masters

circuit data deeplearning graph-algorithms matlab question-answering

Last synced: 26 Mar 2025

https://github.com/chompfoods/stub-jaxrs-jersey

JAX-RS Jersey server stub for the Chomp Food & Recipe Database API. Use our API to get high-quality data on recipes and 875,000+ branded/grocery foods plus raw ingredients.

api branded chomp data database food grocery ingredients jax-rs jersey nutrition raw recipe-api recipes server server-stub stub stub-server

Last synced: 02 May 2026

https://github.com/poode/firebase-modeling

Get firebase/firestore entity model to migrate to mongo or any db later

data database firebase firestore modeling schema

Last synced: 06 May 2026

https://github.com/jigyasag18/credit-card-fraud-detection-using-machine-learning

This repository presents a credit card fraud detection system utilizing a Logistic Regression model trained on a dataset of 284,807 transactions with significant class imbalance. After employing under-sampling for balance, the model achieves a test accuracy of around 93.40%, showcasing the effectiveness of ML in identifying fraudulent transactions.

credit-card-fraud creditcardfrauddetection data dataset logistic-regression logisticregression machine-learning machine-learning-algorithms mlproject mlprojects

Last synced: 02 Sep 2025

https://github.com/gagolews/clustering-data-v0

Datasets for Clustering [DEPRECATED – A NEW VERSION IS AVAILABLE]

clustering data dataset machine-learning

Last synced: 15 Sep 2025

https://github.com/ntnn/dataparse

Parsing, transforming and unmarshalling data.

data data-parser data-parsing data-transformation golang golang-lib

Last synced: 30 Jun 2026

https://github.com/ressuman/csv-writer-project

CSV Writer with TypeScript. This project demonstrates my implementation of a CSV writer using plain TypeScript and JavaScript, without relying on any frameworks.

data javascript typescript

Last synced: 15 May 2026

https://github.com/badranalyst/covid-deaths-dashboard-with-tableau

This project showcases an interactive dashboard developed in Tableau to visualize COVID-19 deaths data. It provides insights into trends, geographical distributions, and key metrics related to mortality during the pandemic. The dashboard aims to enhance understanding of the data, supporting public health analysis and decision-making.

covid-19 dashboard data data-analysis data-visualization dataset tableau tableau-dashboards visualization

Last synced: 02 Mar 2026

https://github.com/ekoepplin/dbt-bigquery-core

How to get data to BigQuery (or duckDB) and setup dbt tests for SODA cloud monitoring

bigquery data data-quality dbt dlt duckdb gcp soda

Last synced: 06 May 2026

https://github.com/j2kun/terrorism-usa-post-9-11

A copy of the terror data published by NewAmerica

data politics terrorism transparency

Last synced: 02 Mar 2026

https://github.com/dms-codes/www.usu.ac.ididdirektori

Faculty and Docent Data Retrieval Script The faculty_and_docent_data_retrieval.py script is a Python script for retrieving faculty and docent data from a university website using Selenium. It includes functions to extract faculty names and docent profiles, as well as a multithreading approach to fetch data for multiple faculty-docent pairs.

data python scrape

Last synced: 26 May 2026

https://github.com/soenneker/soenneker.data.zipcode

US ZIP code data from USPS, updated daily

code csharp data dotnet usps zip

Last synced: 02 Mar 2026

https://github.com/kenjyco/mongo-helper

Helper funcs and tools for working with MongoDB

aggregation-pipeline data database kenjyco mongo mongodb python

Last synced: 28 Jan 2026

https://github.com/omari-kd/environmental-impact-on-food-production

The goal of this project is to assess the environmental impact of food production at both macro and micro levels and propose data-driven insights to mitigate the negative effects of food production on the environment.

data data-analysis data-science data-visualization environmental-impact-analysis r

Last synced: 30 Mar 2025

https://github.com/fabsdevx/file-format-converter-handout

Data Engineering project for learning purposes. Credits to itversity

csv csv-import data data-engineering database pandas python

Last synced: 06 May 2026

https://github.com/coderjolly/spotify-api-data-analysis

The project leverages Apache Airflow for automating Spotify API data analysis, focusing on user activity. Extracting, transforming, and loading data efficiently, it provides insights via PowerBI dashboards.

airflow airflow-dags data data-engineering etl etl-pipeline microsoft-sql-server power-bi python scripting sql

Last synced: 27 Mar 2026

https://github.com/nagar2nd/financial-analysis-power-bi

This project analyzes financial and credit card usage data using Power BI and DAX, focusing on customer behavior, credit risk, and financial performance. It includes insights on spending trends, delinquency rates, churn indicators, and satisfaction scores to drive better financial management and customer retention strategies.

analysis data dax dax-functions dax-query excel powerbi

Last synced: 03 Mar 2026

https://github.com/inzhenerka/scooters_data_generator

Generate data of scooter trips for analysis

data dbt generator

Last synced: 02 Jun 2026

https://github.com/questionlp/wwdtm_uniquedates

Script that lists out the unique months and days of months that Wait Wait... Don't Tell Me! shows have aired

data python python3 script wwdtm

Last synced: 17 May 2026

https://github.com/miozilla/pandas

pandas :panda_face::panda_face: : Python Library # Data Analysis # Dataframe

analysis data dataframe pandas python sqlite3

Last synced: 07 May 2026

https://github.com/shubhamsoni98/excel-practice

Excel-Practice-Questions

analysis data excel formula raw-data xlsx

Last synced: 03 Mar 2026

https://github.com/metapsy-project/data-depression-anxiety-transdiagnostic

Database of transdiagnostic treatment of depression and anxiety

data

Last synced: 01 Apr 2026

https://github.com/ngupta23/data_prep_helper

A helper package for preparing and combining data from a variety of sources

data data-science dataprep datapreparation dataprocessing helpers python

Last synced: 03 Apr 2025

https://github.com/jillmpla/kaggle_notebooks

Kaggle-based data analysis, data science, and data visualization.

data data-science data-visualization kaggle machine-learning

Last synced: 16 Apr 2026

https://github.com/agdturner/ccg-data

A modularised Java library for processing data sets with classes for: data records; collections of data records; and identifiers.

data data-analysis

Last synced: 12 Jan 2026

https://github.com/shubhamsoni98/analysis-with-sql

This project focuses on creating and managing a database for a music record company to perform various analyses on bands, albums, and songs. Using SQL, the goal is to create a structured relational database with relevant tables, insert necessary data, and perform queries that provide insights into the relationships between bands, albums, and songs.

analys analysis data data-science database dbms mysql mysqlworkbench project query schema sql

Last synced: 03 Jan 2026

https://github.com/bagustris/dataits

Web for DataITS17: Summer School on Data Science

data data-science

Last synced: 28 Jun 2025

https://github.com/mikpom/genomvar

Sequence variant analysis in Python

data genomics

Last synced: 10 Apr 2026

https://github.com/passly-nl/data

Source code of the data layer.

data passly ticketing typescript

Last synced: 27 May 2026

https://github.com/edjoukou/human_resources

A data analysis project using MySQL Server database

analysis data mysql powerbi sql visualization

Last synced: 25 Sep 2025

https://github.com/frnt-end/ts-context-items-list

⚛️ React Typescript project - Fetch data and display it as a list of 10 items in 10 (pagination) pages. click on each item leads to more details page- using axios, Context and Styled Components.

api axios context context-api data fetch list pagination router router-dom styled-components typescript

Last synced: 19 May 2026

https://github.com/caprogs/paris-events-analyzer

A project to analyze events in Paris using open source data provided by the city.

data data-analysis data-platform dbt docker ingestion python streamlit transformation vizualisation

Last synced: 04 May 2026

https://github.com/estherslabbert/sql

Using SQL working with student data

data python sql sqlite3

Last synced: 06 Apr 2025

https://github.com/badranalyst/data-cleaning-and-exploratory-data-analysis-project

This project uses SQL to clean and analyze a layoffs dataset. Data cleaning tasks include removing duplicates, standardizing values, and handling missing data. Exploratory analysis is performed to identify trends in layoffs across companies, industries, and time periods.

cleaning-data data database dataset mysql mysql-database sql

Last synced: 07 Apr 2025

https://github.com/fordinand45/bdp_a_kelompok_3

Project Big Data Python yang diadakan oleh Digitalent Kominfo. Berikut adalah yang ikut serta pada project, yaitu : Dhian Prameswari, Fordinand Pasaribu, dan Muhdad Alfaris Bachmid

data data-analytics data-science linear-regression python3

Last synced: 12 Apr 2026

https://github.com/realbxnnie/accountservice

A Simple DataStoreService wrapper with session backuping and session locking.

data lua luau roblox

Last synced: 29 Jul 2025

https://github.com/raghavendranhp/attrition-alchemy

This project uses machine learning to predict and analyze employee attrition in Company.By developing three predictive models,it identifies key factors influencing turnover,providing actionable insights to mitigate attrition challenges.The analysis focuses on enhancing job satisfaction,work-life balance and career growth opportunities.

data datawrangling decision-trees eda gradient-boosting logistic-regression macine-learning pandas preprocessing random-forest-classifier skicit-learn svm

Last synced: 18 May 2026

https://github.com/aminnairi/node-decode

Check that your data meet your expectations

check data decode expectations schema

Last synced: 22 Apr 2026

https://github.com/lancewalk87/cls-cloud-sync-ruby-on-rails

Software | SQL Database with automated Cloud Sync for mitigating lost data across dist. servers. Managed by Ruby on Rails.

cloud-computing cloud-storage data database ruby ruby-application ruby-on-rails server sql

Last synced: 24 Jul 2025

https://github.com/rid17pawar/friendscircle

Friends Circle is a console based application developed in cpp using Graph Data Structure.

cpp data graph graph-algorithms oop

Last synced: 08 Jun 2026

https://github.com/olekscode/datageneration

Exploring the methods of data generation for different Machine Learning algorithms

data javascript machine-learning

Last synced: 05 Apr 2025

https://github.com/tks18/xl-pq-handler

A Pythonic Power Query (.pq) File Manager for Excel & Power BI Automation

analytics automation data excel power-query powerbi python xlwings

Last synced: 20 Jan 2026

https://github.com/anuraganalog/onyx-data

BI Visualizations to the problems in website. All the Visualization can be found at the below link

data onyx public tableau viz

Last synced: 02 Apr 2026

https://github.com/carlosrs14/parallel-data-preprocessig-system

A parallel data preprocessing system using threads and synchronization mechanisms (barrier, busy-waiting, condition variables) to clean and prepare data for AI training.

barrier-method c condition-variable data operative-systems parallel-computing posix preprocessing synchronization threads

Last synced: 24 Jul 2025

https://github.com/cannt39t/data-mining-spider-vk

Паук который собирают всю информацию о рекламных постах в группе VK

data data-mining python3 vk vkontakte

Last synced: 05 Apr 2025

https://github.com/als8446/tripleten-data-science-projects

Projects Overview Projects made in the Data Scientist course from TripleTen LatAm

data data-analysis hypothesis-tests machine matplotlib numpy pandas python scipy sklearn

Last synced: 10 Apr 2026

https://github.com/jigyasag18/amazon-power-bi-dashboard

The Amazon Power BI Dashboard Project repository provides an interactive analytics dashboard for visualizing and analyzing sales performance across various product categories within Amazon's ecosystem. Utilizing comprehensive sales data, it empowers stakeholders with actionable insights to enhance decision-making and improve business strategies.

data data-visualization dataanalysis dataanalytics dataset datasets datavisualization-project powerbi powerbi-report powerbi-visuals powerbidashboard

Last synced: 07 Mar 2026

https://github.com/bonnevoyager/quick-storage

Simple key/value storage module with persistency.

browser data fs indexeddb javascript key-value nodejs persistence quick server storage

Last synced: 16 Apr 2026

https://github.com/rudxain/xorsum

Get XOR checksum with this command-line tool

binary checksum cli data digest file files hexadecimal rust-crate xor

Last synced: 08 Mar 2026

https://github.com/priyapuranik/data-analytics-using_python

Analyzed data of Hotels and find out meaningful insights from it including booking patterns and seasonal trends and many more.

data pandas python sql visualization

Last synced: 06 Apr 2026

https://github.com/natarizkie2/neurochain-airdrop-bot

🍋 — A smart bot designed to complete data tasks like true/false selections automatically, with multi-account support for extra convenience.

airdrop automated bot data multi-account natarizkie neurochain nodejs web3

Last synced: 10 Jun 2026

https://github.com/justinyahin/wpdf

Create, filter, sort and display users data on your WordPress site.

data filtering wordpress

Last synced: 18 Apr 2026

https://github.com/lambocreeper/spotify-visualiser

Visualise Spotify Data

data spotify visualise

Last synced: 21 Jul 2025

https://github.com/jigyasag18/airline-performance-and-passenger-satisfaction-project-using-big-data-analytics

This project analyzes 10 years of U.S. domestic airline data (~3GB) using Hadoop (Cloudera) and Hive for data processing. Power BI dashboards visualize key metrics like delays, on-time rates, air time, and diversions. The solution includes Hive queries, DAX measures, HDFS ingestion scripts, and year-wise insights with recommendations.

big-data big-data-analytics bigdata cloudera cloudera-hadoop cloudera-hadoop-framework data data-analysis data-visualization database hadoop hive power-bi powerbi powerbi-dashboard powerbi-dashboards powerbi-report powerbi-visuals powerbi-visuals-tools powerbidashboard

Last synced: 01 Aug 2025

https://github.com/fatihemres/Africa

Africa app by SwiftUI. Using AVFoundation, MapKit, data, models, animations, stickers.

animations avfoundation data mapkit models swift swift-animations swiftui

Last synced: 31 Aug 2025

https://github.com/fatihemres/Fruits

Fruit Details app by SwiftUI. Using data, models, animation and practically onboarding usage.

animations data models onboarding swift swiftui

Last synced: 31 Aug 2025

https://github.com/erickpeirson/jhb-data

Data from the forthcoming paper: Quantitative Perspectives on Fifty Years of the Journal of the History of Biology

data geolocation history-of-biology named-entity-recognition topic-modeling

Last synced: 04 Mar 2026

https://github.com/chompfoods/sdk-typescript-angular

Angular TypeScript SDK for the Chomp Food & Recipe Database API. Use our API to get high-quality data on recipes and 875,000+ branded/grocery foods plus raw ingredients.

angular api branded chomp data database food grocery ingredients nutrition raw recipe-api recipes sdk typescript

Last synced: 09 May 2026

https://github.com/greatwoman23/car_insurance_analysis

The Car Insurance Analysis project aims to provide a comprehensive examination of a car insurance portfolio using advanced data analytics tools. The analysis offers valuable insights into policy demographics, claims patterns, and financial metrics, helping stakeholders make informed decisions.

bigquery data data-science dataanalytics insurance-claims looker-studio tableau

Last synced: 03 Feb 2026

https://github.com/ashakoen/bls-data-extract

This repository contains scripts and a database schema to set up and manage a local SQLite database for storing and querying the Average Price data from the U.S. Bureau of Labor Statistics. It includes tools for downloading the latest data from the BLS website and fetching Consumer Price Index (CPI) data via the BLS API.

data government sqlite us

Last synced: 01 Apr 2026

https://github.com/soenneker/soenneker.dtos.idpartitionpair

A minimal Record type with an Id (string), PartitionKey (string), and maximum JSON compatibility

csharp data dotnet dto id key partition

Last synced: 09 Mar 2026

https://github.com/thomasjewson/cci-data-science-textbook

This is a short, interactive textbook aimed at introducing data science to non-IT university undergraduates. Funded by Erasmus+.

data data-science learning python textbook

Last synced: 16 Apr 2026

https://github.com/chocolateboy/data

Structured data scraped from unstructured (or semi-structured) sources

data dataset datasets json opendata scrape scraped scraper wikipedia

Last synced: 30 Aug 2025

https://github.com/e-kotov/albofr

alboFr: Get French Data on Tiger Mosquito Colonisation

aedes-albopictus data france tiger-mosquito

Last synced: 11 Jun 2026

https://github.com/redatargaoui/dataconverter

Data conversion functionality to integrate into the software used for autism detection research.

apache-poi data dataconversion excel java

Last synced: 06 Sep 2025

https://github.com/davidkhala/sql

Standard SQL collection

data sql

Last synced: 06 Apr 2025

https://github.com/team810/frcs

FRCS is an online international crowd sources data collection software written for the FRC Competitions. It was created by team 810, The Mechanical Bulls.

crowdsourcing data web

Last synced: 14 Mar 2025

https://github.com/yadavkaushal/datascience-e-commerce-shopping-details

This project analyzes customer purchase data including details such as location, company, credit card usage, browser info, job roles and purchase price. It explores patterns in payment methods, spending behavior and online transactions. Using Pandas, Matplotlib and Seaborn, we clean analyze and visualize key trends to derive actionable insights.

data datacleaning dataframe datapreprocessing dataset libraries matplotlib numpy pandas plots visulaization

Last synced: 06 May 2026

https://github.com/bryanhe24/data_analysis_app

A full-stack web application that allows users to upload CSV datasets, analyze the data with statistical summaries and visualizations, and interact with an AI-powered assistant for querying the dataset.

ai data data-analysis data-visualization fullstack-development javascript math python reactjs

Last synced: 07 May 2026

https://github.com/dms-codes/scrape_tripsantai

Trip Santai Tour Data Scraper This Python script is a web scraper designed to extract and collect information about tours from the Trip Santai website. It utilizes the requests library to fetch web pages, BeautifulSoup for parsing HTML, and writes the collected data to a CSV file.

beautifulsoup4 data python requests scraper webscraper

Last synced: 21 May 2026

https://github.com/cmda-tt/course-25-26

🎓 tech track · 2025-2026 · curriculum and syllabus 📊

d3 data datavis functional javascript programming research svelte visualization

Last synced: 20 Jan 2026

https://github.com/luminati-io/google-search-api

Two methods to collect real Google SERP data—a free scraper for basic use and the enterprise-grade Bright Data API for high-volume demands.

data google-scraper html python serp-api web-scraping

Last synced: 25 Jun 2025

https://github.com/fastpix/android-data-bitmovin

FastPix Video Data SDK to monitor and analyze video playback metrics within Bitmovin for android

analytics android-sdk bitmovin data fastpix metrics player sdk video

Last synced: 16 Apr 2026

https://github.com/talitalobo/statistics-with-python

Repo about statistical concepts and (not always) their python implementation.

data data-science machine-learning statistics

Last synced: 11 Jan 2026

https://github.com/mrk214/bible-data-es-spa

La Biblia en formato JSON

api bible biblia data god jesus json spanish

Last synced: 05 Apr 2025

https://github.com/mekramy/ircity

Iran province, county and city data in json format.

data iran-city json mekramy

Last synced: 05 Apr 2025

https://github.com/jigyasag18/power-bi-dashboard-project

The Ecommerce Sales Analysis Dashboard project utilizes Power BI to provide detailed insights into ecommerce sales data, enabling stakeholders to track key performance metrics and uncover trends. This interactive dashboard allows users to explore the data in real-time, offering features such as drill-down capabilities, customizable filters.

dashboard data data-visualization datacleaning datanalysis datanalytics datapreprocessing powerbi visulaization

Last synced: 04 Mar 2026

https://github.com/ksimicevic/discord-message-analyzer

Analyzing discord messages in Jupyter notebook

analysis data discord messages

Last synced: 16 Apr 2026

https://github.com/haideratgh/sql-data-analytics-project

This repository contains a collection of SQL scripts demonstrating various analytical techniques, such as changes over time, cumulative, performance, data segmentation, part-to-whole analysis

analytics business-analytics business-intelligence data data-analysis data-analyst data-analytics data-engineering data-science data-scientist database datascience query reporting sql sql-query sql-server window-functions-in-sql

Last synced: 29 Jun 2025