An open API service indexing awesome lists of open source software.

data

Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)

https://github.com/s-babaeizadeh/next-mini-app

nextjs mini application

css data nextjs reactjs

Last synced: 11 Apr 2026

https://github.com/lohithgsk/dynamic-qr-generator

A Python-based QR generator application was developed using the qrcode and Pillow libraries, dynamically generating QR codes for custom data inputs. Designed for a college grievance management system, the application creates QR codes containing block, floor, room, and machine numbers, allowing easy placement and identification on each floor.

data pillow python qrcode qrcode-generator

Last synced: 16 Mar 2025

https://github.com/ybelenko/openapi-data-mocker-interfaces

Package with OpenApiDataMocker interfaces.

data fake faker interface mock mocker oas oas3 openapi swagger

Last synced: 05 Jan 2026

https://github.com/charon25/weatherdata

17 000 weather measurements collected by a weather station created for a college project.

csv data dataset datasets json measurements strasbourg weather weather-data

Last synced: 16 Jan 2026

https://github.com/zulfachafidz/telco_churn_insight_customer_loss_prediction_with_random_forest_and_decision_tree-algorithms

The main problem in the business world is customer churn, or losing customers, especially in the telecommunications industry, which experiences very tight competition. To overcome this problem, an analysis was carried out to help the company understand how many customers have the potential to switch providers.

data data-science data-visualization dataanalysis dataanalyst dataanalytics datadrivenwithdataprovider decision-tree decision-tree-classifier decision-trees random-forest random-forest-classifier

Last synced: 01 May 2026

https://github.com/thingston/extractor

Collection of PHP classes to extract data from HTML pages.

data html php

Last synced: 14 Jan 2026

https://github.com/e-panourgia/big-data

Big Data Management Systems course assignments

analytics azure bigdata data hadoop json latex mrjob neo4j python redis stream

Last synced: 11 Apr 2026

https://github.com/jacoblincool/moodle-export

A streamlined library for retrieving data from Moodle.

data moodle

Last synced: 07 May 2025

https://github.com/kuanhungchen/spring-2019-data-structures

📦 Some programming assignments about basic data structures.

data data-structures

Last synced: 25 Feb 2025

https://github.com/tn3w/moviedb-json

A JSON library with 981,530 films.

data database db json movie movie-database movies

Last synced: 03 May 2026

https://github.com/prajakta1321/streetml-a-cityscape-traffic-volume-prognostication

StreetML leverages machine learning techniques to revolutionize urban traffic prediction through precise volume prognostication, aiming to enhance cityscape mobility through data-driven insights.

catboostregressor data datavisualisation exploratory-data-analysis lightgbm-regressor linearregression machine-learning machine-learning-algorithms predictive-analytics random-forest-regression xgboost-regression

Last synced: 08 Apr 2025

https://github.com/etmendz/mendz.data

Provides tools and guidance for creating data access contexts and repositories.

context data datasettings entity-framework mendz paginginfo repository resultinfo

Last synced: 11 Jun 2025

https://github.com/dms-codes/scrape-kesaintblanc-id

Kesaintblanc Data Scraper: This Python script is designed to scrape product data from the Kesaintblanc website. It collects information about products, including product name, URL, price, image URLs, status, stock, and more. The scraped data is saved to a CSV file for further analysis.

data kesaintblanc python webscraper

Last synced: 27 May 2026

https://github.com/ournet/videos-data

Ournet videos data module

data ournet video videos

Last synced: 04 Apr 2025

https://github.com/team-hydrogen/nasa-adc-data

All files relating to the computation of the data provided

data jupyter-notebook nasa-app-development-challenge

Last synced: 25 Mar 2025

https://github.com/purarue/HPI-personal

Personal HPI modules/scripts

data history lifelogging

Last synced: 30 Mar 2025

https://github.com/csoren66/financial-budget-analysis

Financial budget for 2021

analytics data python

Last synced: 03 Mar 2025

https://github.com/vlamug/ratibor

Ratibor is a service for making metrics from data

data metrics prometheus

Last synced: 10 Mar 2026

https://github.com/nyo16/megas_pinakas

Bigtable elixir grpc client

bigtable data elixir grpc

Last synced: 13 Jan 2026

https://github.com/satyam4229/iit-and-nit-college-dataset

The dataset for IITs and NITs typically includes information related to these premier engineering institutions in India, such as their names, locations, rankings, academic programs offered, faculty details, student information, admission process, infrastructure and facilities, and placements.

college-data csv data excel iit nit

Last synced: 04 Jan 2026

https://github.com/bkataru/spotigo

AI-powered local music intelligence platform with a task runner server core to retrieve and backup spotify account data to storage(s) at set periodic intervals

ai backup cron data go intelligence local-llm music ollama rag runner spotify task-runner tool-calling

Last synced: 16 Jan 2026

https://github.com/gappeah/layoffs-exploratory-data-analysis

This project uses MySQL to perform data cleaning and exploratory data analysis (EDA) on a dataset detailing company layoffs. The primary goal is to process, clean, and explore the data to gain insights into trends and patterns related to layoffs across various sectors.

data dataanalysis eda mysql sql

Last synced: 12 Jul 2025

https://github.com/ersinkoc/minote

Minimal Notation for LLMs

data llm notation token

Last synced: 21 Feb 2026

https://github.com/keminghe/osu

Unofficial and publicly-available NPM data-package about The Ohio State University.

college data majors ohio-state organizations public students university unofficial

Last synced: 06 Jan 2026

https://github.com/zulfachafidz/titanic_explorer_predicting_survival_with_classification_using_knn_algorithm

Tracking Life Safety with the KNN Predictive Analysis Approach. Leveraging the Titanic Dataset, we apply classification analysis to predict the fate of passengers based on a variety of features.

algorithm algorithms data data-analysis data-mining data-science datamodeling datapreprocessing dataset knn-algorithm knn-classification machine-learning machine-learning-algorithms prediction-model

Last synced: 01 Sep 2025

https://github.com/jneidel/nationalities

Dataset of 100 common nationalities

data dataset json nationalities nationality opendata

Last synced: 25 Mar 2025

https://github.com/victorowinoke/custmer-segmentation-using-rfm-python-

Customer Segmentation using the Recency, Frequency and Monetary Values

customer-segmentation data data-visualization python3 science time-series-analysis

Last synced: 26 May 2026

https://github.com/acovaci/orbit

ORBIT: an Open source Rust-based implementation of a data Build Tool, inspired by DBT

cargo clap-rs data data-warehouse dbt rust rust-lang tokio-rs

Last synced: 16 Mar 2025

https://github.com/4strium/data-analysis-france

🔍 Script allowing the analysis and recovery of precise data on French cities.

cities csv data france python research

Last synced: 01 Apr 2025

https://github.com/denisecase/buzzline-05-case

Kafka pipelines with data storage

consumer data kafka producer python

Last synced: 11 Apr 2025

https://github.com/elimu-ai/ml-event-simulator

🤖 Simulation of learning events and assessment events

data learning-analytics machine-learning ml

Last synced: 28 Feb 2025

https://github.com/tdjsnelling/hermes

Hermes is a real-time data framework for React + MongoDB

data docker framework mongodb nodejs react react-hooks reactjs real-time typescript websocket

Last synced: 12 Apr 2026

https://github.com/saikatharryc/motionchart-d3js

A dynamic motion chart built with D3.js.

chart d3js data data-science

Last synced: 23 Dec 2025

https://github.com/filiprokita/tobase64

This Python program encodes a file in base64 format and saves the result to a new file with a ".b64" extension. It is a command-line tool that can be used to automate file encoding tasks.

base64 command-line data data-conversion data-manipulation data-privacy data-prottection data-security encoding file file-conversion file-handling python python-script python3 tobase64

Last synced: 30 Jun 2025
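
The file-encoding step described in the tobase64 entry above can be sketched as follows. This is a minimal illustration using only Python's standard `base64` and `pathlib` modules, not the repository's actual code; the function name `encode_file_to_b64` and the choice to append ".b64" to the full file name are assumptions made for the example.

```python
import base64
from pathlib import Path


def encode_file_to_b64(source: Path) -> Path:
    """Write the base64 encoding of `source` to a sibling file ending in ".b64".

    Hypothetical helper for illustration; the real tool may name files differently.
    """
    # Append ".b64" to the complete file name, e.g. "report.pdf" -> "report.pdf.b64".
    target = source.with_name(source.name + ".b64")
    # Read the raw bytes, encode them, and write the ASCII-safe result.
    target.write_bytes(base64.b64encode(source.read_bytes()))
    return target
```

Calling `encode_file_to_b64(Path("report.pdf"))` would, under these assumptions, create `report.pdf.b64` in the same directory. The whole file is read into memory, which is fine for small files but would need chunked reads for very large ones.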

https://github.com/dug22/jjournal

Jupyter-like notebook software for Java

data data-analysis data-science java jshell jshell-repl notebook swing swing-application

Last synced: 11 Apr 2026

https://github.com/gdcmarinho/vaultchat

VaultChat is an end-to-end encrypted chat service

chat data e2ee encrypted messaging privacy

Last synced: 23 Mar 2025

https://github.com/martinius96/meteostanica-odosielacie-scripty

Weather station - Arduino, ESP8266, ESP32 - sender sketches for presenting data in a web interface.

arduino bme280 bmp280 data dht22 ds18b20 esp32 esp8266 espressif html meteo meteostanica mysel nodemcu php stanica teplota tlak vlhkost webstranka

Last synced: 11 Apr 2026

https://github.com/yash-chauhan-dev/sf_analytics

Business teams often rely on data analysts to extract insights using SQL. This tool eliminates that dependency by bridging the gap between humans and data using AI.

aiml analytics data dbt langchain llm python snowflake streamlit

Last synced: 07 May 2026

https://github.com/luciarevaliente/shell_script_data_cleaning

This project focuses on cleaning and processing datasets using Shell scripts. It is part of the Fundamentals of Informatics course (2022-23) and involves handling movie and show data to create cleaned and filtered datasets for further analysis.

data data-cleaning shell-script

Last synced: 04 Feb 2026

https://github.com/getconversio/dig-the-data

Data visualizations for the Conversio blog

d3 data data-visualization

Last synced: 12 Apr 2026

https://github.com/oliver021/helppad-net

Versatile .NET Toolkit: A Comprehensive Set of Miscellaneous Helpers, Classes, and Utilities

assert async checks cryptographic-algorithms data date dotnet fluent functional functional-programming hash helpers parallel pipe pipeline pointers review supports tasks

Last synced: 15 Jun 2026

https://github.com/wisdom-osborn/data-analytics-course-online-

🔍 Data Analytics with Python: Hands-on Course Materials. Jupyter notebooks, projects, and datasets based on the freeCodeCamp Data Analysis with Python certification. Learn NumPy, Pandas, data cleaning, and visualization through real-world examples.

data data-analysis data-science data-visualization freecodecamp numpy pandas pandas-dataframe project python

Last synced: 19 Apr 2026

https://github.com/vidushibhadana/eda-on-nyc-taxi-data

Conducting an Exploratory Data Analysis (EDA) on New York City taxi data and visualizing it through countplots, distribution plots (displot), and histograms using Python and its libraries.

data data-visualization jupyter-notebook matplotlib numpy pandas python seaborn

Last synced: 11 Apr 2026

https://github.com/matheusafonseca/deploy-ml-models-with-streamlit-udemy

This repository is dedicated to storing the code developed during the "Machine Learning Model Deployment with Streamlit" course on Udemy. The course covers basic to advanced techniques for deploying machine learning models using Streamlit.

data data-science data-visualization interface joblib layout machine-learning optimization-algorithms python python3 sklearn sklearn-datasets sklearn-library sklearn-pipeline streamlit

Last synced: 19 Apr 2026

https://github.com/rohitblaze10/netflix_analysis_using_tableau

The Netflix dashboard in Tableau provides a professional and visually captivating interface for users to explore a vast collection of TV shows and series. With seamless navigation and interactive filters, users can easily personalize their recommendations based on release year, genre, duration, and rating.

data data-analysis data-science data-visualization netflix tableau

Last synced: 04 Feb 2026

https://github.com/axetroy/stone

build data stuck like a stone, Sturdy!

axetroy data stone stuck

Last synced: 04 Jul 2025

https://github.com/trissim/polystore

Framework-agnostic multi-backend storage abstraction for ML and scientific computing

backend data io jax ml multi-framework numpy pytorch scientific-computing storage tensorflow zarr

Last synced: 12 Apr 2026

https://github.com/arnavk-09/phishing-detection

🎣 Detect Phishing URLs with Data Pre-fitted... API & Web UI

csv data fastapi flask python scikit-learn

Last synced: 03 May 2026

https://github.com/nikolatechie/spotify-playlist

Data pipeline that fetches songs played in the past 24 hours using the Spotify API and saves the data in a SQLite database. Scheduled to run daily using Apache Airflow.

apache-airflow api data data-engineering python spotify sql sqlite

Last synced: 30 Apr 2026

https://github.com/davorg/cookingvinyl

Web site with info about Cooking Vinyl records

cooking-vinyl data hacktoberfest music perl

Last synced: 02 Apr 2025

https://github.com/syed-bakhtawar-fahim/dsa_algorithm_code

Assalam o Alikum guys, this is the repo for Data Structures and Algorithms in the C programming language. I hope it will help you in learning Data Structures and Algorithms in C. I'm also learning Data Structures and Algorithms in Python in a better and easier way; you can explore that too.

algorithm algorithms-and-data-structures c data data-structures-and-algorithms dsa-algorithm dsa-learning-series dsa-practice

Last synced: 12 Apr 2025

https://github.com/kaiepi/ra-annotations

Thread-safe static buffer

data type

Last synced: 13 Jul 2025

https://github.com/jigyasag18/fake-news-prediction-app

The Fake News Prediction App Repository offers a machine learning project that focuses on identifying the authenticity of news articles as fake or real. It uses a dataset of 20,000 articles and employs methods such as TF-IDF vectorization and lemmatization, achieving ~95% classification accuracy with a random forest classifier model.

data datapreprocessing logistic-regression machine-learning machine-learning-algorithms numpy pandas prediction stemming streamlit streamlit-webapp vectorization

Last synced: 11 Apr 2026

https://github.com/code-str8/time-series-forecasting

Developing a model that effectively forecasts the unit sales of numerous items across various Favorita stores with precision.

data dataanalysis forcasting machine-learning time-series visualizations

Last synced: 31 Mar 2025

https://github.com/sakshamarora07/whatsapp-chat-analyser

This repository contains code for a WhatsApp Chat Analyzer that uses Python libraries to extract insights from chat messages.

chat data dataanalytics datascience matplotlib pandas python seaborn statistics streamlit whatsapp

Last synced: 04 Jan 2026

https://github.com/unkaktus/pktconn

wrapper around io.ReadWriteCloser that implements gopacket's 'device'

connection data gopacket packet

Last synced: 29 May 2026

https://github.com/mierune/tinybufr

[WIP] A Rust library for decoding BUFR (Binary Universal Form for the Representation of meteorological data) files.

bufr data meteorology rust weather wmo

Last synced: 15 May 2025

https://github.com/pietrapaz/bootcamp_dio_ciencia_de_dados

Potência Tech Bootcamp powered by iFood | Data Science - Dio ⚠️

cienciadedados dados data datascience python

Last synced: 09 Apr 2025

https://github.com/muhammed-fazal/student-success-and-early-intervention-analytics-system

Consolidates scattered student performance records into a unified data warehouse in SQL Server. Builds interactive Power BI dashboards that visualize academic trends and identify student performance, and implements predictive analytics.

analysis analytics dashboard data data-analysis data-engineering data-science data-visualization database etl etl-pipeline power-bi powerbi python sql sql-server

Last synced: 29 May 2026

https://github.com/muhammadadilnaeem/student-performance-indicater-end-to-end-data-science-project

This project leverages data science techniques to build a predictive model that estimates a student's exam performance. The project follows a structured data science workflow, including data collection, preprocessing, model building, evaluation, and deployment.

data machine-learning-algorithms pandas pymysql python sql

Last synced: 11 Apr 2026

https://github.com/itrauco/data-dirtying-tool

a simple command line tool to generate dirty data and do common data things in google cloud

data data-analysis data-engineering data-ops data-pipeline data-science data-visualization data-wrangling dirty-data google-cloud machine-learning

Last synced: 24 Feb 2025

https://github.com/rikiitokazu/dataprojects

Data analysis practice using SQL and Python

data python sql web-scraping

Last synced: 12 Apr 2026

https://github.com/publici/state-integrity-data

Data from a comprehensive assessment of state government accountability and transparency

data

Last synced: 04 Feb 2026

https://github.com/veronikagregorec/excel-data-analytics

Excel for data analytics from beginner to advanced

cleaning data excel formulas tables xlookup

Last synced: 21 Jan 2026

https://github.com/theanujsinha01/mcdonalds-customer-analysis

This project analyzes customer feedback data to understand what drives people to like or dislike McDonald’s. Using Python and data visualization tools in a Jupyter Notebook, we explore how different factors—such as taste, price, health, and visit frequency—affect customer satisfaction.

case-study data data-visualization dataanalysis

Last synced: 05 Sep 2025

https://github.com/goto-eof/bitmaptize

Wraps data inside a .bmp and extracts data from .bmp.

bitmap bmp convert data wrap

Last synced: 18 Jan 2026

https://github.com/yash-rewalia/airbnb_eda_pandas

The goal of the project is to gather and analyze detailed information about the different listings in order to provide insights about the host and price of a property in a particular area, according to your preferences, room type, and number of reviews.

data data-cleaning data-insights data-preprocessing data-visualization matplotlib numpy pandas python seaborn

Last synced: 11 Apr 2026

https://github.com/kahlery/my-jupyter-notebook-projects

🐊 collection of my data science analysis, actually I store most of my data science projects in my google drive because of google colab

data jupyter-notebook python

Last synced: 12 Apr 2026

https://github.com/miraclx/split-merge

Efficient, flexible data stream chunker and merger

chunk data efficient merge middleware nodejs pipeline split stream

Last synced: 07 May 2026
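
The chunk-and-merge idea behind the split-merge entry above can be sketched in a few lines. The listed library targets Node.js streams; the Python version below is only an illustrative analogue, and the names `split_stream` and `merge_stream` are hypothetical, not part of that library's API.

```python
from typing import Iterable, Iterator


def split_stream(chunks: Iterable[bytes], size: int) -> Iterator[bytes]:
    """Re-chunk an incoming byte stream into pieces of `size` bytes.

    Every piece has exactly `size` bytes except possibly the last one.
    """
    if size <= 0:
        raise ValueError("size must be positive")
    buffer = bytearray()
    for chunk in chunks:
        buffer.extend(chunk)
        # Emit as many full-size pieces as the buffer currently holds.
        while len(buffer) >= size:
            yield bytes(buffer[:size])
            del buffer[:size]
    # Flush whatever is left once the input is exhausted.
    if buffer:
        yield bytes(buffer)


def merge_stream(chunks: Iterable[bytes]) -> bytes:
    """Concatenate pieces back into the original byte sequence."""
    return b"".join(chunks)
```

Because `split_stream` is a generator, it holds at most one input chunk plus `size - 1` leftover bytes in memory at a time, and merging its output reproduces the original bytes regardless of how the input was originally chunked.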

https://github.com/doughtnerd/pod-old

Read and write Excel data

data data-analysis excel poi-library workbook

Last synced: 21 Jan 2026

https://github.com/nsandoya/python_scrp_project

This is a tool made specifically for the Dipaso ecommerce website. You can extract data from it, analyze it, and see keyword, brand, and category frequency, price distribution, and other market tendencies, all in a set of friendly statistical tables and graphics (exported from a Jupyter notebook) :)

beautifulsoup4 data data-analysis jupyter-notebook pandas python3

Last synced: 28 Apr 2026

https://github.com/ahmad-ali-rafique/linear-regression-modeling

In-depth exploration of linear regression models, including data cleaning, model building, and performance evaluation on various datasets.

artificial-intelligence data dataanalytics linear-models linear-regression model multilinear-regression regression regression-models

Last synced: 19 Apr 2026

https://github.com/tomcardoso/journalism-data-intersection

A talk on working at the intersection of journalism and data science

data data-journalism journalism

Last synced: 15 May 2025

https://github.com/dsietz/rust-daas

An example of implementing the DaaS pattern using Rust

archconf daas data kafka rust rust-lang

Last synced: 05 Sep 2025

https://github.com/jerboaburrow/uk-counties-and-unitary-authorities-may-2023-geojson

UK "Counties" Extracted from Office for National Statistics data

data geojson maps uk

Last synced: 29 Mar 2025

https://github.com/thicclatka/tetration

New file format for tensors

cli data fileformat mmap tensors

Last synced: 26 May 2026

https://github.com/rishikesh-jadhav/track_deep_learning

Data collected from the Udacity simulator comprising RGB images with steering and throttle annotations for each frame, specifically gathered for behavioral cloning purposes.

data datacollection udacity-self-driving-car

Last synced: 03 Jan 2026

https://github.com/gianlucatruda/titanic

An exhibition of my experience in data processing and visualisation. Python script to process and visualise the Titanic survivor data.

data database flask info matplotlib python science scrape server titanic visualisation web

Last synced: 10 Apr 2026

https://github.com/burythehammer/foosbot-results

Foosball results for the OpenCredo foosbot

data foosball machine-learning python

Last synced: 13 Apr 2026

https://github.com/pythoncoderunicorn/tool-discography

Music Band TOOL albums and songs dataset

data data-science metal-music music r songs

Last synced: 26 May 2026

https://github.com/open-geodata/sp_bh_pcj-2020-2035

Spatial data from the Agência das Bacias PCJ, with information presented in the 2020-2035 Basin Plan

data python

Last synced: 16 Jan 2026

https://github.com/ismailhakkii/digital_vault

This project can be used for securing data, similar to a real vault.

data digital security-data vault

Last synced: 25 Mar 2025

https://github.com/checco9811/data-engineering-bootcamp-homework

Homework solutions for DataExpert.io data engineering bootcamp

apache-spark data data-engineering sql

Last synced: 14 Mar 2025