An open API service indexing awesome lists of open source software.

data

Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)

https://github.com/allanotieno254/powerbi-dax-filter-context

This repository contains a Power BI project that explores **DAX Filter Context**, a crucial concept in DAX calculations. The project focuses on **Bank Loan Analysis**, demonstrating how different filter contexts affect DAX formulas.

business-intelligence data data-analysis dax dax-functions powerbi powerbi-visuals visualization

Last synced: 08 Jan 2026

https://github.com/carlosrs14/parallel-data-preprocessig-system

A parallel data preprocessing system using threads and synchronization mechanisms (barrier, busy-waiting, condition variables) to clean and prepare data for AI training.

barrier-method c condition-variable data operative-systems parallel-computing posix preprocessing synchronization threads

Last synced: 24 Jul 2025

https://github.com/snimmagadda1/luigi-etl-example

πŸ” Example of an ETL pipeline using Spotify's Luigi

data luigi luigi-pipeline python spotify

Last synced: 30 Mar 2025

https://github.com/tks18/xl-pq-handler

A Pythonic Power Query (.pq) File Manager for Excel & Power BI Automation

analytics automation data excel power-query powerbi python xlwings

Last synced: 20 Jan 2026

https://github.com/shadeglare/genum

The ES Next tools to process data in a LINQ manner

data linq processing typescript

Last synced: 13 Apr 2026

https://github.com/Coko7/vegapull-records

Cards dataset for One Piece TCG

data one-piece one-piece-card-game one-piece-tcg tcg

Last synced: 28 Apr 2025

https://github.com/team-hydrogen/2025-adc-data

All files relating to the computation of the data provided

data jupyter-notebook nasa-app-development-challenge

Last synced: 11 Apr 2025

https://github.com/rishitabansal9/adult-census-income-prediction

This is a project made for data analysis and income prediction using random forest classifier with 91% accuracy.

data data-analysis data-science feature-engineering random-forest-classifier

Last synced: 25 Mar 2025

https://github.com/q-aware-labs/bias-insights

Bias detection project for the Chicago Face Database (CFD)

ai chicago-data-portal data data-science llm statistical-analysis

Last synced: 21 Jan 2026

https://github.com/lancewalk87/cls-cloud-sync-ruby-on-rails

Software | SQL Database with automated Cloud Sync for mitigating lost data across dist. servers. Managed by Ruby on Rails.

cloud-computing cloud-storage data database ruby ruby-application ruby-on-rails server sql

Last synced: 24 Jul 2025

https://github.com/meokullu/prefill

PreFill adds desired characters onto output values to increase their legibility.

alignment data data-analysis data-engineering data-science legibility

Last synced: 17 Jan 2026

https://github.com/atiqurcode/scrap-spec

Scrap data from the html to table html code / json

data html-table json-data scarp

Last synced: 05 Feb 2026

https://github.com/aminnairi/node-decode

Check that your data meet your expectations

check data decode expectations schema

Last synced: 22 Apr 2026

https://github.com/fiedsch/data_util

misc. Utilities for data files like variable name lists

data helper management php

Last synced: 14 Jun 2025

https://github.com/piyushkumar2025/india-general-elections-2024_data-analyst

Analyzed election data for 540+ constituencies and 100+ parties using SQL. Calculated state-wise seat distributions, classified 30+ parties into alliances, identified top 10 candidates by EVM votes, calculated victory margins, and analyzed voting patterns for 300+ candidates to uncover key insights.

analytics data database mysql sql statistics

Last synced: 22 May 2026

https://github.com/buffdelta/basketball_ref_webscraper

Python package to make webscraping from basketball-reference easy

basketball data python python-library webscraping

Last synced: 14 Jan 2026

https://github.com/mladen/ds-ml-and-ai-experiments

:1234: My Data Science, Machine learning and Artificial Intelligence experiments and projects

data data-mining data-science datascience dataset

Last synced: 09 Jun 2026

https://github.com/badranalyst/data-cleaning-and-exploratory-data-analysis-project

This project uses SQL to clean and analyze a layoffs dataset. Data cleaning tasks include removing duplicates, standardizing values, and handling missing data. Exploratory analysis is performed to identify trends in layoffs across companies, industries, and time periods.

cleaning-data data database dataset mysql mysql-database sql

Last synced: 07 Apr 2025

https://github.com/arunabhagit/bank-customer-churn-analysis-and-risk-tracker

This project analyzes customer churn using machine learning and visual storytelling through Power BI. A Random Forest model identifies high-risk customers, while interactive dashboards reveal key churn patterns, enabling targeted retention strategies and data-driven decision-making for business improvement.

analysis data powerbi predictive-modeling sql

Last synced: 28 Jul 2025

https://github.com/soenneker/soenneker.constants.data

A set of commonly used constants related to various types of data

constants csharp data dotnet

Last synced: 12 Mar 2026

https://github.com/nukopian/shell-flatten

Flatten a series into a single record

automation data shell

Last synced: 18 Jun 2025

https://github.com/vatshayan/pokemon-analysis

Visualization, Analysis & Predicting the accuracy of finding Pokemon power, attack & speed through Machine Learning

artificial-intelligence data data-analysis data-science data-visualization dataset machine-learning machine-learning-algorithms pokemon scikit-learn

Last synced: 30 May 2026

https://github.com/jooapa/bytebrother

Byte Brother is watching YOU

data data-analysis security

Last synced: 26 Jan 2026

https://github.com/zazza123/hamana

A python library for seamless data extraction, storage, and SQL-based analysis using pandas and SQLite.

analysis data python

Last synced: 14 Jan 2026

https://github.com/shubhammittal-data/hr_dashboard_tableau

An interactive HR Analytics Dashboard built using Tableau. Provides insights into workforce demographics, hiring trends, salary analysis, and employee records for data-driven decision-making.

chatgpt4 data data-analysis data-visualization drawio-tools faker-generator hr-analytics hr-analytics-dashboard human-resources numpy python tableau tableau-public

Last synced: 17 May 2026

https://github.com/kashifkhan7/cleaning-analysis_cli

Analyze sales data easily with our CLI app. Gain insights on revenue trends and visualize results using Python, Pandas, and Matplotlib. πŸš€πŸ“Š

conditional-statements css data datacleaning exception-handling exiftool html json matplotlib-pyplot metadata metadata-extraction pandas-python python sales-analysis seaborn-python speech-to-text transcription youtube

Last synced: 13 Apr 2026

https://github.com/2kabhishek/pybank

Data Analysis for the silliest Bank πŸ’°πŸ¦

csv data data-science learning pandas python topic1 topic2

Last synced: 12 May 2026

https://github.com/newrelic-experimental/newrelic-java-atomikos

Gives status of Atomikos Data Sources since this information is unavailable via JMX

atomikos data instrumentation java nrlabs nrlabs-data nrlabs-java-verify nrlabs-odp observability-data

Last synced: 30 May 2026

https://github.com/mikpom/genomvar

Sequence variant analysis in Python

data genomics

Last synced: 10 Apr 2026

https://github.com/koppalexander/flightdelaychallenge

This project focuses on predicting flight delays using historical data from a Tunisian airline. We analyzed patterns in airport operations and flight schedules to build a machine learning model that can forecast potential delays.

data data-science machine-learning machine-learning-algorithms machinelearning prediction predictive-modeling

Last synced: 19 Jun 2026

https://github.com/gman-au/white-knight-neo4j

Neo4j implementation of White Knight data abstraction library

abstractions data datastore dotnet neo4j repository-pattern specification-pattern

Last synced: 20 Jan 2026

https://github.com/agdturner/ccg-data

A modularised Java library for processing data sets with classes for: data records; collections of data records; and identifiers.

data data-analysis

Last synced: 12 Jan 2026

https://github.com/lefuturiste/npm-api

Search or get a npm package

api data npm php

Last synced: 14 May 2026

https://github.com/yeti-robotics/past-scouting-data

❄️ Scouting Data from Previous Events/Seasons ❄️

data first frc

Last synced: 06 Jan 2026

https://github.com/grimen/js-humanizer

A human/developer friendly value humanizer - for JavaScript/Node.

data debug debugging format formatting humanize humanizer log logging print printing value

Last synced: 13 Jun 2026

https://github.com/sungchun12/demotron

CLI to delight real people with live demos

cli data demo sqlmesh

Last synced: 26 Feb 2025

https://github.com/kunalshelke90/kunalshelke90

πŸ’» Machine Learning Enthusiast | Data Science Explorer | eager about solving problems with help of data.

data data-science dataanalysis database machine-learning mlops

Last synced: 06 Jul 2025

https://github.com/passly-nl/data

Source code of the data layer.

data passly ticketing typescript

Last synced: 27 May 2026

https://github.com/luminati-io/jupyter-notebooks-web-scraping

Perform web scraping interactively using Jupyter Notebooks, integrating coding, data analysis, and visualization into one seamless workflow.

beautifulsoup4 data jupyter jupyter-notebook pandas python requests seaborn virtual-environment web-scraper web-scraping

Last synced: 13 Apr 2026

https://github.com/bilgehangecici/datatypeconverter

Converting integer and floating numbers to appropriate bit-level representation.

data datatypeconverter java machine-level variables

Last synced: 30 Mar 2025

https://github.com/olekscode/datageneration

Exploring the methods of data generation for different Machine Learning algorithms

data javascript machine-learning

Last synced: 05 Apr 2025

https://github.com/rajkumarbestha/nsedataextractor

NSEDataExtractor

data python python3

Last synced: 26 Mar 2025

https://github.com/davitshahnazaryan3/data-management-web

Explore datasets with ease using taxonomy filtering, allowing you to quickly identify the specific experimental datasets you need and download them effortlessly

data environmental experiments filtering-data seismic taxonomy

Last synced: 17 Jan 2026

https://github.com/danielrosehill/global-value-factors-explorer-dataset

Derivative database of IFVI Global Value Factors for data analysis and visualization use cases.

data environmental-data sustainability-data

Last synced: 23 Feb 2026

https://github.com/mightymetrika/holi

holi: Higher Order Likelihood Inference Web Applications

data data-science r statistics

Last synced: 10 Feb 2026

https://github.com/boytchev/coursedataviz

Supplementary materials for "Data Visualization" course

data fmi su visualization

Last synced: 16 Mar 2025

https://github.com/welli7ngton/mysql-server-formacao-alura

repositΓ³rio para guardar cΓ³digos escritos em SQL de cursos da formaΓ§Γ£o em mysql server da alura

data database mysql

Last synced: 19 Apr 2026

https://github.com/ate47/playerdata

Get data about a player with a command

bukkit-plugin command data spigot-plugin

Last synced: 30 Aug 2025

https://github.com/chocolateboy/data

Structured data scraped from unstructured (or semi-structured) sources

data dataset datasets json opendata scrape scraped scraper wikipedia

Last synced: 30 Aug 2025

https://github.com/roggersanguzu/weather-medical-expense-prediction-ml-models

This repo contains a model for determining the rainfall patterns and another for medical expense prediction model

data data-analysis data-science datasets joblib machine-learning machine-learning-algorithms scikitlearn-machine-learning

Last synced: 30 Aug 2025

https://github.com/greedchikara/dsajs

Data Structures and Algorithms written in Javascript

algorithms data structures

Last synced: 09 Apr 2026

https://github.com/lucasnbsb/data-structures-and-algorithms

Studying data structures and algorithms, mostly on leetcode

algorithms data structures

Last synced: 29 Aug 2025

https://github.com/sandysanthosh/aspose-doc-to-pdf

Document & Browser object model

aspose build data doc java pdf

Last synced: 04 Jun 2026

https://github.com/vidushibhadana/covid19-data-exploration-using-sql

Deployed diverse SQL techniques to analyze COVID-19 data for an improved understanding of pandemic's regression.

data database database-management sql

Last synced: 19 Aug 2025

https://github.com/miozilla/fraudfinder

fraudfinder :mag_right::smiling_imp::suspect: : Historical Payment Transactions # Fraud Detection # EDA # Feature Store # Model Registry

analysis data exploratory feature-store fraud-detection

Last synced: 29 Aug 2025

https://github.com/stoyank7/football-prediction

This is my Semester 7 Project for my "AI for Society" minor at Fontys University of Applied Sciences.

ai betting data football machine-learning university-project

Last synced: 25 Mar 2025

https://github.com/arkanovicz/skorm

Simple Kotlin Object Relational Mapping

data database model orm sql

Last synced: 19 Apr 2026

https://github.com/schoolsquirrel/holiday-data

Automatically updated holiday data for SchoolSquirrel

data holidays schoolsquirrel scripts vacation

Last synced: 03 Oct 2025

https://github.com/quonverbat/ordner

A simple, customizable and cross-platform data tracker.

data datatracker javafx management

Last synced: 07 Jul 2025

https://github.com/darshjasani/insurance-claim-analysis

This dataset contains insightful information related to insurance claims, giving us an in-depth look into the demographic patterns of those receiving them.

analysis data kaggle sql

Last synced: 27 Aug 2025

https://github.com/roshaka/samplr

Samplr is a Python decorator for selecting a subset of items from a list, with options for customisation and informative console printouts.

data data-analysis data-engineering decorators list python sampling

Last synced: 14 Jan 2026

https://github.com/austinhartzheim/career-fair-backend

Backend for ECS Career Fair app

data django python

Last synced: 13 Apr 2026

https://github.com/ferru97/jsketchfabcrawler

jSketchfabCrawler is a java for the automatic crawling of model's information from sketchfab.com

crawler data database java sketchfab sql

Last synced: 03 Jan 2026

https://github.com/blueheron786/quranic-universal-library-mushaf-layouts

The Quranic Universal Library (QUL)'s Qur'an mushaf 15-line layouts (madini, uthmani)

data database layout mushaf quran sqlite uthmani uthmani-quran

Last synced: 13 Apr 2026

https://github.com/debjyotisaha/tableau-projects-phase-2

Published interactive dashboards on Tableau Public, highlighting expertise in data visualization and storytelling through analyses of transportation patterns, sales trends, and demographic studies. These projects showcase the ability to transform complex datasets into actionable, intuitive visuals for decision-making.

dashboards data data-analysis data-visualisation tableau

Last synced: 26 Aug 2025

https://github.com/stdlib-js/array-base-last-index-of-same-value

Return the index of the last element which equals a provided search element according to the same value algorithm.

array data find generic index javascript locate node node-js nodejs same scan search stdlib structure types

Last synced: 13 Apr 2026

https://github.com/henryssondaniel/teacup-service-report-mysql-java

Connect your Teacup report data to a MySQL database

data logs mysql reports teacup

Last synced: 13 Apr 2026

https://github.com/romaintailhurat/dagster-playground

Playing with Dagster πŸ™

data pipelines python3

Last synced: 14 Jun 2025

https://github.com/mateuszskoczek/generatorcsv

GeneratorCSV is a students and teachers data converter for Microsoft 365 Admin Center. The project was implemented for Sobolew High School.

admin converter data microsoft365 python school tkinter

Last synced: 26 Aug 2025

https://github.com/deliprofesor/health-score-prediction-model-the-impact-of-lifestyle-and-demographic-factors

A machine learning project predicting health scores based on lifestyle and demographic factors like age, BMI, diet, and exercise. Techniques include Random Forest, Polynomial Regression, and Linear Regression, with a focus on model performance and actionable health insights.

cross-validation data data-science data-visualization feature-engineering linear-regression machine-learning polynomial-regression random-forest

Last synced: 10 Apr 2025

https://github.com/krakozaure/pyzzy

Set of packages to simplify development in Python

configuration data formats json library logging logs python3 toml utils yaml

Last synced: 14 Jan 2026

https://github.com/0xnu/data-analyst-training

The repository contains training materials for data analysts.

data data-analysis data-analyst

Last synced: 25 Aug 2025

https://github.com/franckalbinet/maris-crawlers

Automated data harvesting of MARIS data sources

automation data marine-radioactivity

Last synced: 25 Aug 2025

https://github.com/corneliustanui/personal_quarto_website

This repo contains source files for my personal Quarto-based website.

data netlify programming quarto r rbind websites

Last synced: 02 Apr 2025

https://github.com/luminati-io/google-maps-dataset-samples

A sample dataset of over 1000 Google Maps businesses, extracted using the Bright Data API, ideal for competitor analysis, location-based marketing, and market strategies.

api data dataset google-maps maps web-scraping

Last synced: 03 Jan 2026

https://github.com/anuragagarwal96/hospital-mortality-rate-sql-analysis

In this project, I have taken a hospital dataset from Kaggle, analysed it and predicted the mortality rate of patients who have been admitted in hospitals. I have utilised a combination of SQL, Tableau and Microsoft Excel for this project.

data data-visualization dataanalysis dataanalysisusingsql excel msexcel mssqlserver sql tableau tableau-public

Last synced: 09 Mar 2026

https://github.com/nmelgar/marathons_data_viz

Data visualization project to analyze finishing times and other data.

csv csv-files data data-analysis data-insight data-visualization data-viz dataset tableau

Last synced: 15 Feb 2026

https://github.com/paulrosset/cyclone

Network data consumption monitoring

data monitoring network networking

Last synced: 23 Aug 2025

https://github.com/samhollings/nhs_data_cleansing

A repo of reusable functions for cleansing data

cleansing data data-cleaning data-cleansing preprocessing pyspark python python3

Last synced: 05 Oct 2025

https://github.com/fuadarradhi/gps_data_reset

Flutter plugin to reset and download gps data

cache data extra gps reset

Last synced: 23 Feb 2026

https://github.com/lukakerr/us-surnames

US Surname data visualisation using R. Displays top 25 US surnames and race/ethnic percentage per name.

data data-visualization r

Last synced: 05 Oct 2025

https://github.com/affan005-ai/tesla-stock-prediction

This project analyzes Tesla stock data and builds machine learning models to predict and classify stock movements. The analysis includes EDA, feature correlation, moving averages, and two models

data data-analysis data-science data-visualization-project eda machine-learning matplotlib pandas predictive-analytics predictive-modeling python scikit-learn

Last synced: 05 Oct 2025

https://github.com/ahmad-ali-rafique/wine-quality-dataset

Comprehensive analysis and modeling of the Wine Quality dataset, including exploratory data analysis (EDA), data preprocessing, model training, and performance evaluation using MSE and RMSE.

analytics data datacleaning decision-tree-regression exploratory-data-analysis gradient-boosting-regressor linear-regression machine-learning mean-square-error model

Last synced: 21 Aug 2025