An open API service indexing awesome lists of open source software.

data

Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)

https://github.com/haideratgh/sql-data-analytics-project

This repository contains a collection of SQL scripts demonstrating various analytical techniques, such as change-over-time, cumulative, performance, data segmentation, and part-to-whole analysis.

analytics business-analytics business-intelligence data data-analysis data-analyst data-analytics data-engineering data-science data-scientist database datascience query reporting sql sql-query sql-server window-functions-in-sql

Last synced: 29 Jun 2025

https://github.com/hit07/fitgpt-hacksc

AI-Powered Fitness Coach; 🥈 Runner-up at HackSC's SoCal Tech Week hackathon

data elasticsearch gpt-4o-mini llm pipeline

Last synced: 28 Feb 2025

https://github.com/maluscat/reactive-storage

[MIRROR] Register, observe and intercept deeply reactive data without the need for proxies

data javascript reactive typescript

Last synced: 10 Mar 2026

https://github.com/mnkanout/patients_medication_prediction

The aim of the project is to create a model that can help medical professionals select the proper medication for patients based on their symptoms. The model uses historical data of other patients to predict what could be the most suitable medication based on the patient's symptoms.

data data-analysis data-science data-visualization decision-tree-classifier machine-learning python3

Last synced: 29 Jun 2025

https://github.com/ccworld1000/cccomposition

CCComposition for code style. Code style conversion work accepted.

cccomposition composit construction data structure visual

Last synced: 04 Jan 2026

https://github.com/checco9811/data-engineering-bootcamp-homework

Homework solutions for DataExpert.io data engineering bootcamp

apache-spark data data-engineering sql

Last synced: 14 Mar 2025

https://github.com/sanchittechnogeek/overscripted-analysis

Geolocation and user language extraction analysis from the Mozilla Overscripted dataset

analysis data data-analysis mozilla

Last synced: 23 Mar 2025

https://github.com/pythoncoderunicorn/tool-discography

Music Band TOOL albums and songs dataset

data data-science metal-music music r songs

Last synced: 26 May 2026

https://github.com/thicclatka/tetration

New file format for tensors

cli data fileformat mmap tensors

Last synced: 26 May 2026

https://github.com/wlgs/got-dialogues-data-stats

Game of Thrones dialogues data statistics processed with R and SQLite. Project for Probability and Statistics course 21/22 at AGH UST. The project was about manipulating data and getting many pieces of information from it in addition to visualizing these results.

data game-of-thrones got r statistics stats

Last synced: 22 May 2026

https://github.com/agustinmusanti/sqlchallenge-7

Solution to an extensive SQL challenge set by Professor Diego Moisset De Espanes, who shares exercises for learning and practicing SQL Server through his YouTube channel.

challenge data learning sqlserver

Last synced: 15 Apr 2025

https://github.com/oniani/miniframe

Minimal data frames with relational algebra

data dataframe-library haskell haskell-library library

Last synced: 04 Mar 2025

https://github.com/yash-rewalia/airbnb_eda_pandas

The goal of the project is to gather and analyze detailed information about the different entries in order to provide insights about the host and price of properties in a particular area, according to your preferences, room type, and number of reviews.

data data-cleaning data-insights data-preprocessing data-visualization matplotlib numpy pandas python seaborn

Last synced: 11 Apr 2026

https://github.com/illustratien/toolphd

Make your analysis simple and reproducible

academic analysis data phd publications r r-package reproducible-research scientific

Last synced: 26 Jan 2026

https://github.com/itrauco/data-dirtying-tool

A simple command-line tool to generate dirty data and perform common data tasks in Google Cloud

data data-analysis data-engineering data-ops data-pipeline data-science data-visualization data-wrangling dirty-data google-cloud machine-learning

Last synced: 24 Feb 2025

https://github.com/muhammadadilnaeem/student-performance-indicater-end-to-end-data-science-project

This project leverages data science techniques to build a predictive model that estimates a student's exam performance. The project follows a structured data science workflow, including data collection, preprocessing, model building, evaluation, and deployment.

data machine-learning-algorithms pandas pymysql python sql

Last synced: 11 Apr 2026

https://github.com/mierune/tinybufr

[WIP] A Rust library for decoding BUFR (Binary Universal Form for the Representation of meteorological data) files.

bufr data meteorology rust weather wmo

Last synced: 15 May 2025

https://github.com/jigyasag18/fake-news-prediction-app

The Fake News Prediction App Repository offers a machine learning project that focuses on classifying news articles as fake or real. It uses a dataset of 20,000 articles and employs methods such as TF-IDF vectorization and lemmatization, achieving ~95% classification accuracy with a random forest classifier model.

data datapreprocessing logistic-regression machine-learning machine-learning-algorithms numpy pandas prediction stemming streamlit streamlit-webapp vectorization

Last synced: 11 Apr 2026

https://github.com/mnz1365/saving-record-time-text

Saving dates to a text file with Python

data python txt-files writefile

Last synced: 18 Jul 2025

https://github.com/oliver021/helppad-net

Versatile .NET Toolkit: A Comprehensive Set of Miscellaneous Helpers, Classes, and Utilities

assert async checks cryptographic-algorithms data date dotnet fluent functional functional-programming hash helpers parallel pipe pipeline pointers review supports tasks

Last synced: 15 Jun 2026

https://github.com/taeefnajib/ibm-applied-data-science-capstone

This repository is for my IBM Applied Data Science Capstone Project. All the notebooks and other files are uploaded. If this repository benefits you in any way, please feel free to "Star" it and follow me. Thanks.

advance capstone capstone-project data data-science ibm ibm-watson jupyter jupyter-notebook notebook notebook-jupyter project science spacex spacex-api

Last synced: 14 Mar 2025

https://github.com/sushmashreeps/python

This repository showcases a comprehensive Python project, demonstrating expertise in backend development, data analysis, and machine learning. Built with Python 3.x, the project utilizes popular libraries like Django, Flask, NumPy, pandas, and scikit-learn. The project features efficient data processing, robust API integration, and scalable archite

api data data-science dataanalysis datavisualization game gamedeveloment python

Last synced: 12 May 2026

https://github.com/fuzzt/location-analyzer

The Location Data Analyzer is a Spring Boot application that offers insights on location data, such as counting locations by type, calculating average ratings, and identifying the most reviewed and incomplete entries. It features a simple frontend (HTML, CSS, JavaScript) and is deployed on Render.

analysis api average css data deployment docker fetch-api frontend html javascript location maven ratings render restful-api reviews spring-boot techstack

Last synced: 11 Apr 2026

https://github.com/nisanth2004/springboot-kafka-real-world-project-wikimedia

A Wikimedia project that uses Apache Kafka to stream and process Wikimedia data.

async broker communication data java kafka message real-time real-time-analytics springboot wikimedia

Last synced: 14 May 2026

https://github.com/justinjjlee/simulation-discrete

Employing data transformations and simulations to answer random questions

analytics data data-science julia python simulation spark

Last synced: 30 Apr 2026

https://github.com/g3th/fit_file_decoder

Decodes '*.fit' files and returns readable values.

bytes data decoder fit-file hex parsing

Last synced: 30 Jun 2025

https://github.com/muhamedlabs/muhamed_onedrive

Muhamed_OneDrive is a reliable and convenient cloud file storage service, designed for secure storage and easy data sharing.

data html5 onedrive programming style

Last synced: 04 Jan 2026

https://github.com/victorowinoke/custmer-segmentation-using-rfm-python-

Customer Segmentation using the Recency, Frequency and Monetary Values

customer-segmentation data data-visualization python3 science time-series-analysis

Last synced: 26 May 2026
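
For the RFM entry above, a minimal sketch of how recency, frequency, and monetary values can be derived and turned into rank-based scores. This is not taken from the listed repository: the transactions, the `rfm_table` and `rank_scores` helpers, and the reference date are all hypothetical and chosen only for illustration.

```python
from collections import defaultdict
from datetime import date

# Hypothetical transactions: (customer_id, purchase_date, amount)
transactions = [
    ("c1", date(2024, 1, 5), 50.0),
    ("c1", date(2024, 3, 1), 70.0),
    ("c2", date(2023, 11, 20), 200.0),
    ("c3", date(2024, 2, 10), 20.0),
    ("c3", date(2024, 2, 25), 30.0),
    ("c3", date(2024, 3, 10), 25.0),
]


def rfm_table(rows, as_of):
    """Return {customer: (recency_days, frequency, monetary)}."""
    last_seen = {}
    count = defaultdict(int)
    total = defaultdict(float)
    for customer, day, amount in rows:
        last_seen[customer] = max(last_seen.get(customer, day), day)
        count[customer] += 1
        total[customer] += amount
    return {
        c: ((as_of - last_seen[c]).days, count[c], total[c])
        for c in last_seen
    }


def rank_scores(values, higher_is_better=True):
    """Map each key to a 1..n score by rank, where n is the best score."""
    ordered = sorted(values, key=values.get, reverse=not higher_is_better)
    return {key: i + 1 for i, key in enumerate(ordered)}


rfm = rfm_table(transactions, as_of=date(2024, 3, 31))

# A recent purchase is better, so a lower recency gets a higher score.
r = rank_scores({c: v[0] for c, v in rfm.items()}, higher_is_better=False)
f = rank_scores({c: v[1] for c, v in rfm.items()})
m = rank_scores({c: v[2] for c, v in rfm.items()})

# Combined score per customer; higher suggests a more valuable segment.
combined = {c: r[c] + f[c] + m[c] for c in rfm}
print(rfm)
print(combined)
```

Real projects often bucket the scores into quantiles (for example 1 to 5) rather than ranking every customer individually; the rank-based version here just keeps the sketch short.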

https://github.com/apostolissiampanis/weather-app-api

WeatherApp is a Java-based console application that retrieves and processes weather data using the wttr.in web service.

api data hibernate java json lombok objected-orientated-programing oop spring-boot spring-data-jpa sqlite webflux

Last synced: 05 May 2026

https://github.com/zulfachafidz/titanic_explorer_predicting_survival_with_classification_using_knn_algorithm

Tracking Life Safety with the KNN Predictive Analysis Approach. Leveraging the Titanic Dataset, we apply classification analysis to predict the fate of passengers based on a variety of features.

algorithm algorithms data data-analysis data-mining data-science datamodeling datapreprocessing dataset knn-algorithm knn-classification machine-learning machine-learning-algorithms prediction-model

Last synced: 01 Sep 2025

https://github.com/ersinkoc/minote

Minimal Notation for LLMs

data llm notation token

Last synced: 21 Feb 2026

https://github.com/satyam4229/iit-and-nit-college-dataset

The dataset for IITs and NITs typically includes information related to these premier engineering institutions in India, such as their names, locations, rankings, academic programs offered, faculty details, student information, admission process, infrastructure and facilities, and placements.

college-data csv data excel iit nit

Last synced: 04 Jan 2026

https://github.com/officialxviid/gloogia

👓 Make your big ideas come true by building real projects using real data 🌎

api build data gloogia projects xviid

Last synced: 05 Jan 2026

https://github.com/barbosa89/vue-table

A classical data table component in VueJS and Bootstrap 4, optimized for Laravel applications.

bootstrap4 data datatable javascript laravel php table vuejs

Last synced: 11 Apr 2026

https://github.com/sanad343/complete-data-analyst

Data analysis is the process of turning raw data into useful information for decision-making.

data data-visualization datamanipulation eda excel exploratory-data-analysis powerbi python-3 sql tableau

Last synced: 30 Jun 2025

https://github.com/csoren66/financial-budget-analysis

Financial budget for 2021

analytics data python

Last synced: 03 Mar 2025

https://github.com/ashu3291/blinkit-app-store-

Conducted a comprehensive analysis of Blinkit's sales performance, customer satisfaction, and inventory distribution to improve sales performance.

cleaning-data data dataanalysis-projects powerbi-visuals powerbidashboard sql

Last synced: 05 Jan 2026

https://github.com/simonbolivarpy/vault-decode-py

Simple tools for decoding crypto data from wallet extensions: MetaMask, Ronin, TrustWallet, TronLink (old), etc.

data decode decrypt metamask passwords python ronin salt tronlink trustwallet vault

Last synced: 15 Mar 2025

https://github.com/purarue/HPI-personal

Personal HPI modules/scripts

data history lifelogging

Last synced: 30 Mar 2025

https://github.com/fiddlydigital/anonimizer

A lib to replace and rehydrate sensitive data in text

anonimize anonymize data data-security prompt sanitize string string-manipulation text

Last synced: 15 Mar 2025

https://github.com/s-babaeizadeh/next-mini-app

Next.js mini application

css data nextjs reactjs

Last synced: 11 Apr 2026

https://github.com/cognitixe/metamask-wallet-recovery-funds-phrase-data-seed-token

This repository provides tools and guidelines for securely recovering MetaMask Wallet funds using recovery phrases, seed data, and tokens. It ensures safe and reliable methods for recovering access to your wallet and managing your cryptocurrency assets.

bitcoin blockchain cryptocurrencies cryptocurrency data ethereum funds metamask metamask-bot metamask-desktop metamask-extension metamask-plugin metamask-snap metamask-wallet phrase recovery seed token wallet wallet-security

Last synced: 13 May 2026

https://github.com/bbfh-dev/protox

Go library for (de-)serializing custom protocols

binary data format go library parsing protocol reader writer

Last synced: 01 Jul 2025

https://github.com/ehvenga/data.driven.modeling

Repository to practice data-driven modelling

data data-modeling

Last synced: 23 Mar 2025

https://github.com/jprando/mattkillua

A study of .NET Core

data dbcontext domain efcore netcore

Last synced: 23 Mar 2025

https://github.com/kalaspuff/ready

🎟 [not yet built] Take control of the event loop with simplified task management, queueing and data loading.

asyncio data dataloading event futures python python3 resolver tasks

Last synced: 10 May 2026

https://github.com/mecha-cms/x.time

Creates page time data if it does not exist.

data date extension page time

Last synced: 23 Mar 2025

https://github.com/smeltier/data-structures-c

This repository contains C language implementations of the main data structures covered in the Algorithms and Data Structures course. The implementations were developed as part of my hands-on learning process and include sequential lists, linked lists, and other fundamental structures.

algorithms algorithms-and-data-structures c c-language c-programming data data-structures data-structures-c structures-c

Last synced: 16 May 2025

https://github.com/gkannan-codes/habitableexos

With Earth’s habitability under strain, we ask: which known exoplanets could humans live on? Using NASA’s Exoplanet Archive, we score planets 0–1 (1 ≈ Earth) from five Earth-normalized features to rank top candidates.

data html kaggle matplotlib-pyplot numpy pandas plotly python seaborn visualization

Last synced: 11 Apr 2026

https://github.com/suryadev99/stream_processing_website_click_data

Stream processing of website click data using Kafka, monitored and visualised with Prometheus and Grafana

clickdata data dataengineering docker flink-kafka flink-metrics flink-stream-processing git grafana kafka kafka-streams kafka-topic prometheus psql python

Last synced: 10 Mar 2026

https://github.com/2022-04-11588/data-fakes

🔍 Generate realistic fake data for testing and development, enhancing your projects with simple, customizable data solutions.

data dataset developer-tools fake-content faker fakery groovy java mock phoenix python random ruby seeding struct swift-framework test-data testing

Last synced: 11 Apr 2026

https://github.com/halyusa16/mysql-employee-analysis

This project focuses on analyzing employee data through querying, performing table joins to connect related information, aggregating salary statistics, and using subqueries to extract meaningful insights.

data data-analytics data-exploration database mysql self-project sql

Last synced: 20 Jan 2026

https://github.com/avestura/shell-dads

❓ Show a random tip from NIST DADS (https://xlinux.nist.gov/dads) every time you open your terminal

algorithms dads data data-structures ds nist

Last synced: 23 Oct 2025

https://github.com/praxtube/dogg

CLI tool to log data manually

data data-logger log logger

Last synced: 10 Jun 2026

https://github.com/dhimmel/thinklytics

Continuous Thinklab project exports and analytics

analytics data rephetio thinklab travis-ci

Last synced: 23 Mar 2025

https://github.com/lotfiferaga/instagram-reach-analysis

The Instagram Reach Analysis project aims to develop a Python-based tool to analyze the reach and engagement metrics of Instagram posts.

analytics data data-science datavisualization python

Last synced: 18 Jun 2026

https://github.com/nel-zi/insighthire_agency

Built a web scraping solution using BeautifulSoup to extract job listings from MyJobMag, cleaned the data, and loaded it into PostgreSQL with SQLAlchemy for better job data management.

data dataloading datatransformation sql webscraping

Last synced: 16 May 2025

https://github.com/adamouization/python-machine-learning-data-science-notes

📙 Jupyter notebooks containing useful Python code and notes for general Machine Learning and Data Science projects.

data data-science data-visualization guide jupyter jupyter-notebook machine-learning matplotlib notes numpy pandas pandas-dataframe python seaborn

Last synced: 11 Apr 2026

https://github.com/asma-hachaichi/imdb-movies-rating-prediction

This project collects movie information from IMDb using web scraping, then uses this data to predict movie ratings. It combines web scraping with predictive modeling to estimate how well movies are liked.

beautifulsoup4 data data-science machine-learning movies movies-reviews prediction python scraping

Last synced: 31 Mar 2025

https://github.com/rezapace/newbash

This project involves managing various application shortcuts and configurations primarily for a Linux environment. It includes scripts for creating .desktop entries for applications, managing system configurations, and handling application processes.

automation backup bash data dekstop linux newbash ohmyzsh script testing zsh

Last synced: 11 Apr 2026

https://github.com/roovedot/unet-cnn-for-road-segmentation

(In Progress) Unet architecture with CNNs (Convolutional Neural Networks) aimed at Road Segmentation

cnn cnn-for-visual-recognition cnn-pytorch computer-vision data data-engineering data-science unet unet-image-segmentation unet-pytorch

Last synced: 01 Jul 2025

https://github.com/chaewonkong/kaggle-competitions

Kaggle competitions and lessons

ai data kaggle-competition ml

Last synced: 15 Mar 2025

https://github.com/sakan811/show-leaving-soon-tracker-website

This is a Vue.js application that displays shows that are leaving each platform soon, featuring a countdown timer for each title based on the user's local timezone.

data hbo hbomax netflix shows streaming tv-shows vue vuejs web webapp website

Last synced: 18 Mar 2025

https://github.com/gsinghjay/ywcc-307-003

Group Presentations

cloud data government

Last synced: 04 Feb 2026

https://github.com/omarcodex/data_analysis

My repository of past and present research and data-driven projects.

data ecodev ecology science sustainability yale

Last synced: 18 Jan 2026

https://github.com/jamiew/void-runners-analysis

basic data analysis for the Void Runners Genesis Fleet spaceships

analysis data nfts

Last synced: 29 Mar 2025

https://github.com/nivasharmaa/genetrack

A Java program for analyzing DNA sequences and identifying individuals based on Short Tandem Repeats (STRs). Features profile database creation, STR analysis, individual identification, and relationship detection.

data data-processing dna-analysis file-io-in-java genetic-analysis java-oop

Last synced: 25 Aug 2025

https://github.com/loosenthedark/going-for-gold

A fairer, more measured look at the Tokyo 2020 Olympic medal count. Countries are ranked in relative (per capita) instead of absolute medal-winning terms. Users can toggle between two different ranking breakdowns, search for countries, contact the site owner and enable dark mode. Mobile-first React application leveraging the REST Countries API as well as a local JSON Olympic dataset. EmailJS and React Context API integration with custom form validation and error handling.

api create-react-app css data es6 fetch-api frontend html5 interactive-front-end-development javascript mobile-first olympics react react-components react-context-api react-hooks react-router react-router-dom reactjs responsive-web-design

Last synced: 07 May 2026
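
For the medal-count entry above, a small sketch of the difference between absolute and relative (per capita) ranking. The country names and figures are hypothetical, not real Olympic or population data, and the `medals_per_million` helper is an assumption made for illustration rather than the listed project's code.

```python
# Hypothetical figures for illustration only (not real Olympic data).
countries = [
    {"name": "Country A", "medals": 100, "population": 300_000_000},
    {"name": "Country B", "medals": 10, "population": 5_000_000},
    {"name": "Country C", "medals": 3, "population": 500_000},
]


def medals_per_million(country):
    """Medals won per one million inhabitants."""
    return country["medals"] / country["population"] * 1_000_000


# Absolute ranking: most medals first.
absolute = [
    c["name"]
    for c in sorted(countries, key=lambda c: c["medals"], reverse=True)
]

# Relative ranking: most medals per million inhabitants first.
relative = [
    c["name"]
    for c in sorted(countries, key=medals_per_million, reverse=True)
]

print(absolute)
print(relative)
```

With these made-up numbers the two orderings are exact opposites, which is the kind of contrast a per capita view is meant to surface.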

https://gitlab.com/hailstorm75/Common

A collection of extension libraries for various use-cases

common core cpp csharp data extensions libraries library math matrix

Last synced: 07 May 2025

https://github.com/zoekelepiri/winedataprediction

A machine learning application in wine quality prediction

data descriptive-statistics machine-learning-algorithms

Last synced: 05 Jan 2026

https://github.com/meizuflux/cion

Python minimal data validation library

data minimal python validation

Last synced: 28 May 2026

https://github.com/stdlib-js/ndarray-vector-int8

Create a signed 8-bit integer vector (i.e., a one-dimensional ndarray).

constructor ctor data int8 javascript ndarray node node-js nodejs stdlib structure types vec vector

Last synced: 24 Apr 2026

https://github.com/nolanbconaway/rollercoaster-tycoon-data

Every roller coaster I have built in RCT2 for iPad

data roller-coaster-tycoon

Last synced: 24 Mar 2025

https://github.com/sasanthns/sql_data_warehouse_project

A comprehensive guide to building a modern data warehouse with SQL Server, including ETL processes, data modeling, and analytics.

data data-analysis data-science data-warehouse datacleaning etl etlpipeline sql sqlserver

Last synced: 24 Mar 2025

https://github.com/bertrand31/one-billion-rows-challenge

🌪️ Pushing Scala to its limits to aggregate a billion rows' worth of data in 2.42 seconds

competitive-programming competitive-programming-contests data data-engineering data-processing performance scala

Last synced: 05 Sep 2025

https://github.com/plnech/never2late

Never 2 Late - a reinterpretation of Everest Pipkin's 'i've never picked a protected flower'

dada dada-science data generative-art glitch-art installation nlp poetry spacy vector-similarity wallpaper

Last synced: 10 Jun 2025

https://github.com/docuvesta/shiseido_skincare_usa_fr_infographics

Explore the performance indicators derived from reviews of a highly regarded serum by the Japanese luxury beauty brand Shiseido. The comparison covers the US and FR websites 💯

analysis automatisation data datanalysis graphique infographie pandas plotly python skincare soins

Last synced: 11 Apr 2026

https://github.com/moeabbas6/bq_data_loader

A Python script for executing and logging batch SQL commands in Google BigQuery. Includes tracking of execution times, unique job and statement IDs, and automated logging to a specified BigQuery table.

bigquery data python

Last synced: 24 Mar 2025

https://github.com/murshidazher/client-side-data-storage

🚌 A workspace containing client-side data storage implementations

cache cache-storage client-side data indexeddb localstorage sessionstorage storage websql

Last synced: 02 Sep 2025

https://github.com/heyimsteve/solnftdatadash

This is a React-based web application that provides detailed information about NFT collections on the Solana blockchain. It uses the HelloMoon API to fetch and display data about NFT collections, including statistics, loan summaries, ownership information, and floor prices.

dashboard data hellomoon nft react solana solana-nft

Last synced: 30 Jan 2026

https://github.com/bdr-pro/graphyml

A powerful, interactive Streamlit application to explore, edit, visualize, and query a graph-based database of YAML nodes — ideal for movie metadata, research articles, or structured knowledge graphs.

data database yaml yml

Last synced: 23 Jul 2025

https://github.com/pbinkley/mfmcollections

Project to distill data about published collections of microfilms from library lists

data research retro

Last synced: 28 May 2026

https://github.com/turner-kendall/turner-kendall

Turner Kendall - dev, ops, sec.

config data github-config go rust security

Last synced: 31 Oct 2025

https://github.com/gustavonav/daily-youtube-extraction

A project that covers the full setup of an environment for extracting, storing, and processing YouTube data

airflow data minio python3 spark

Last synced: 21 Feb 2026

https://github.com/seldszar/piccha

Another tree data structure

data tree

Last synced: 16 Jul 2025