An open API service indexing awesome lists of open source software.

data

Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)

https://github.com/andykee/aurora

A lightweight tool for indexing, cataloging, and browsing data.

catalog data data-catalog data-discovery indexing metadata metadata-extraction search-and-discovery

Last synced: 17 Jan 2026

https://github.com/aiwithqasim/project_allocation_system

Project Allocation System (PAS) automates and simplifies the process of Allocating projects to students. Teachers can simply add details on prompting for input and perform a number of operation modules including Adding Projects, Updating Projects, Searching Projects , Deleting Projects and Display All Projects

algorithms-and-data-structures algorthims c-plus-plus data data-structures linked-list

Last synced: 08 Oct 2025

https://github.com/jacob-pitsenberger/python-electronics-inventory-management-system-object-oriented-programming-project

Welcome to the Python Electronics Inventory Management System project repository! This project is a demonstration of Object-Oriented Programming (OOP) principles in Python for managing an electronic parts inventory.

data data-structures dictionary exception-handling file-io filesystem input-output inventory-management-system management-system modules oop pickle python user-interface

Last synced: 08 Oct 2025

https://github.com/djdhairya/whatsapp-chat-analysis

WhatsApp chat analysis is a multidimensional process that delves into the content, structure, and dynamics of conversations within the platform. It provides valuable insights for personal reflection, organizational decision-making, and improving communication strategies.

data data-science dataanalytics datapreprocessing machine-learning ml

Last synced: 08 Oct 2025

https://github.com/anarya22/e-commerce_analysis

E-Commerce_Analysis is a data analysis project performed on the Superstore_USA dataset. It explores various aspects of e-commerce performance, including sales trends, customer demographics, product categories, and regional performance. The analysis includes data cleaning, visualizations, and insights on factors influencing sales and profitability.

analysis analytics cleaning-data data

Last synced: 09 Oct 2025

https://github.com/preritdas/covidactnow

A wrapper for the Covid Act Now database of live COVID-19 state-based statistics.

api covid covid-19 data python python3 science wrapper

Last synced: 09 Oct 2025

https://github.com/sillyash/untappd-viz

A data visualisation page using public datasets and HTML/CSS/JS with D3.js.

beer beer-statistics data data-analysis data-visualization kaggle kaggle-dataset public-dataset school-project

Last synced: 18 May 2026

https://github.com/loaiwalid07/automation_data_overviwe

This is Streamlit app that gives an overview for a dataset you upload

automation data data-analysis data-exploration data-science data-transformation data-visualization

Last synced: 19 May 2026

https://github.com/j-sephb-lt-n/joes_giant_toolbox

A large collection of general python functions and classes that I use in my daily work

ascii browser classifier data dataviz gcp mime nlp python regex search statistics supervised web-scraping

Last synced: 10 Oct 2025

https://github.com/chowington/bg-counter-tools

A set of tools that can pull data from Biogents BG-Counter smart mosquito traps and convert them into a Darwin Core compliant format.

bg-counter biogents darwin-core data internet-of-things mosquito-prevalence population-dynamics

Last synced: 10 Oct 2025

https://github.com/ikcede/hinge-data-ts-wrapper

Typescript wrapper for exported Hinge data

data hinge typescript

Last synced: 10 Oct 2025

https://github.com/dumkydewilde/mcp-memory-layer

A template for building your own BI MCP with dbt, LLMs and multi-user corrections

bi data dbt llm mcp-server

Last synced: 13 Mar 2026

https://github.com/mr-chang95/udacity-starbucks-challenge

Data Science Project for Udacity's Data Scientist Program. Using Python in Jupyter Notebook.

data data-science data-visualization numpy pandas sklearn

Last synced: 14 Apr 2026

https://github.com/madhuresh2011/daily-sql-from-hackerrank

Welcome to my SQL Series, where I tackle SQL problems from HackerRank on a daily basis.

data dataanalysis database question-answering sql

Last synced: 19 Jan 2026

https://github.com/0xnu/nfl-picks

NFL match prediction with scores using historical data (1999-Present).

american-football data nfl prediction

Last synced: 12 Oct 2025

https://github.com/adadalshabab/data-engineering-gcp-project

An end-to-end modern data engineering project, including deployment of ETL pipeline on Google Cloud Platform, using BigQuery for data analysis and leveraging Looker to generate an insight dashboard.

bigquery data data-science data-visualization databases dataengineering-a engineering etl-pipeline looker-studio powerbi

Last synced: 19 Jan 2026

https://github.com/jhpoelen/bees

Content-based iDigBio prototype

biodiversity data ecololgical informatics provenance

Last synced: 18 Mar 2026

https://github.com/flowsta/ods-educacion-aporta

ODS para educación, iniciativa APORTA 2021

data data-visualization ods sdg

Last synced: 27 Jan 2026

https://github.com/odiegosilva1/flask-github-style

Página de login usando Jinja no Flask.

data flask jinja2-templates orm python

Last synced: 31 May 2026

https://github.com/polyee99/kaggle-titanic-data-analytics

Jupiter notebook to predict the outcome of passengers who died or not in the tragical Titanic event.

data eda jupiter-notebook matplotlib numpy pandas python regression-analysis test-train-split visualization

Last synced: 05 Feb 2026

https://github.com/mominurr/fire-gas-leak-detection-system

A real-time fire prevention system integrating IoT sensors and computer vision to trigger evacuations.

ai computer-vision data datascience machine-learning ml python yolo

Last synced: 27 Jan 2026

https://github.com/jigyasag18/project-diwali-sales-analysis

This project analyzes retail sales data during the Diwali festival using exploratory data analysis (EDA) to identify buyer demographics and product preferences. The findings reveal that the primary purchasers are married women aged 26-35 from Uttar Pradesh, Maharashtra, and Karnataka, working in IT, Healthcare, and Aviation.

analysis data datapr datapro eda jupyter-notebook python realtimedata

Last synced: 01 Jun 2026

https://github.com/rizkipragustono/extract_from_excel

Excel Contact Data Parser with Country Code Formatting

data excel extract python transform

Last synced: 18 May 2026

https://github.com/fatihilhan42/nba-players-data-1950-to-2021

In this project, the data of the NBA players between the years 1950-2021 were examined. After the NBA players' season, height, performance, averages of points, teams and positions they played were obtained through csv files, important tables and graphs were created using data cleaning and data visualization algorithms.

data data-analysis data-engineering data-science data-visualization

Last synced: 16 Oct 2025

https://github.com/saboye/sales-performance-analysis

A dashboard that presents monthly sales performance by product segment and product category to help clients identifying the segments and categories that have met or exceeded their sales targets, as well as those that have not met their sales targets.

dashboard data data-science eda tableau visualization

Last synced: 27 Jan 2026

https://github.com/mat06mat/matbot

My discord bot code

data discord-bot discord-py py-cord

Last synced: 17 Oct 2025

https://github.com/octoenergy/tentaclio-snowflake

A python project containing all the dependencies for snowflake tentaclio schema.

data

Last synced: 20 Oct 2025

https://github.com/andrewl/danelaw

Geopackage containing the boundary of the Danelaw

data geospatial medieval viking

Last synced: 23 Jan 2026

https://github.com/jigyasag18/bird-strikes-in-aviation-project

This project analyzes over a decade of U.S. bird strike data (2000–2011) to evaluate safety risks, damage trends, and cost implications in aviation. Using PostgreSQL for database management and Power BI for dashboard visualization, it uncovers critical insights into when, where, and how wildlife impacts aircraft. Key findings inform strategically.

bird-strike-prevention bird-strike-prevention-in-real-airport data data-analysis data-analysis-project data-visualisation data-visualization data-visualization-project data-visualizations database dataset dax-query postgresql postgresql-database powerbi powerbi-desktop powerbi-report powerbi-visuals sql sql-database

Last synced: 09 May 2026

https://github.com/louis-heraut/dataverseur

🫖 A dataverse API R wrapper to enhance the deposit procedure using only R variable declarations

data data-repository data-science datascience dataset dataverse dataverse-api json metadata metadata-management metadata-parser r

Last synced: 24 Oct 2025

https://github.com/theipster/property-data

Tooling to track real estate / property market events, analyse trends and generate insights.

data property real-estate

Last synced: 24 Jan 2026

https://github.com/robertoostenveld/dccn.dsc_3015055.00_583_v1

The FieldTrip-SimBio Pipeline for EEG Forward Solutions [Data set].

data datalad open-data

Last synced: 24 Jan 2026

https://github.com/semcod/code2llm

Python Code Flow Analysis Tool - Static analysis for control flow graphs (CFG), data flow graphs (DFG), and call graph extraction

ast cfg code code2data code2logic code2process data dfg diagram flow graphs llm

Last synced: 01 Jun 2026

https://github.com/dynamiatools/module-importer

DynamiaTools extension to work with excel files for import data

data dynamia excel import java zk

Last synced: 06 Feb 2026

https://github.com/spatialcurrent/go-flat

Recursively flatten a slice of slices.

big-data bigdata data

Last synced: 29 Jan 2026

https://github.com/aimin-nur/data-analyst-model-predictive

Sebuah Project data analyst yang bertujuan untuk mengindentifikasi karakteristik customer untuk menerima penawaran campaign marketing.

analyst data mechine-learning visualization

Last synced: 29 Jan 2026

https://github.com/restricted/redis-data-cache

TypeScript implementation of data cache management by class name

cache data object redis state typesript

Last synced: 30 Jan 2026

https://github.com/chompfoods/stub-scala-akka-http-server

Scala Akka HTTP server stub for the Chomp Food & Recipe Database API. Use our API to get high-quality data on recipes and 875,000+ branded/grocery foods plus raw ingredients.

akka api branded chomp data database food grocery ingredients raw recipe-api recipes scala server stub stub-server

Last synced: 15 Apr 2026

https://github.com/denisecase/dc-texter

Send a text message using Python

alerts data python sms-messages streaming

Last synced: 08 Feb 2026

https://github.com/nits2612/data-science-projects

Portfolio of data science projects completed by me during PGP AI/ML, self learning, and hobby purposes.

data data-science dataanalysis deep deep-learning keras machine-learning matplotlib numpy opencv pandas python scikit-learn seaborn surprise-python tensorflow transfer-learning

Last synced: 01 Feb 2026

https://github.com/enescidem/twitter-topic-modeling

Topic modeling is an unsupervised method to identify topics in text. This project analyzes tweets from prominent Turkish accounts to uncover underlying themes in their shared content.

data data-science machine-learning nlp topic-modeling twitter x

Last synced: 10 Feb 2026

https://github.com/os-climate/data-requests

This repo is used to track issues related to new Data Requests

data data-engineering dataset

Last synced: 27 Feb 2026

https://github.com/bastianolea/sicvir_indicadores_rurales

Sistema de Indicadores de Calidad de Vida Rural (Sicvir)

chile comunas data estado rural social

Last synced: 27 Feb 2026

https://github.com/sweta-kaundilya/power-bi-learning-projects

This repository contains completed exercises while learning Power BI

data datavisualization dax powerbi powerquery

Last synced: 27 Feb 2026

https://github.com/kunalthakur204/visualization-on-flower

🌸 Flower Dataset Visualization Visualizing patterns and relationships in flower data through charts and plots. Perfect for exploring floral characteristics and trends! 📊

data data-visualization dataanalysis flowerdataset python

Last synced: 16 Apr 2026

https://github.com/project-renard/test-data

Files for testing

data

Last synced: 27 Feb 2026

https://github.com/khalyomede/request

Function to validate request data for V.

data function request validate vlang

Last synced: 12 Feb 2026

https://github.com/beastbytes/postal-code-data-php

Implementation of PostalCodeDataInterface using PHP file storage

data php postal-code yii3

Last synced: 27 Feb 2026

https://github.com/pawamoy/keycut-data

Keyboard shortcuts data stored in YAML files

data keyboard-shortcuts

Last synced: 12 Feb 2026

https://github.com/infinitode/pywebscrapr

An open-source Python web scraping tool. Supports both image scraping and text scraping.

data data-collection data-science open-source pip scraping web-scraper

Last synced: 14 Feb 2026

https://github.com/madhuresh2011/genai-powered-data-analytics-by-tata

I recently participated in Tata iQ's job simulation on the Forage platform, and it was incredibly useful to understand what it might be like to be on a data analytics team in an AI transformation consulting role.

chatgpt data dataanalytics eda excel gemini generative-ai internships powerpoint presentation

Last synced: 14 Feb 2026

https://github.com/nmelgar/marathons_data_viz

Data visualization project to analyze finishing times and other data.

csv csv-files data data-analysis data-insight data-visualization data-viz dataset tableau

Last synced: 15 Feb 2026

https://github.com/gourab337/karnataka-health-visualizer

Visualizer for Karnataka's district-wise healthcare info built using PHP

analytics data

Last synced: 19 Mar 2026

https://github.com/arnocan/yapydata

The yapydata provides miscellaneous low-level Python data access APIs.

data datastructures ini json properties python python2 python3 xml yaml

Last synced: 16 Feb 2026

https://github.com/soenneker/soenneker.data.zipcode

US ZIP code data from USPS, updated daily

code csharp data dotnet usps zip

Last synced: 02 Mar 2026

https://github.com/nagar2nd/financial-analysis-power-bi

This project analyzes financial and credit card usage data using Power BI and DAX, focusing on customer behavior, credit risk, and financial performance. It includes insights on spending trends, delinquency rates, churn indicators, and satisfaction scores to drive better financial management and customer retention strategies.

analysis data dax dax-functions dax-query excel powerbi

Last synced: 03 Mar 2026

https://github.com/metapsy-project/data-depression-anxiety-transdiagnostic

Database of transdiagnostic treatment of depression and anxiety

data

Last synced: 01 Apr 2026

https://github.com/ashakoen/bls-data-extract

This repository contains scripts and a database schema to set up and manage a local SQLite database for storing and querying the Average Price data from the U.S. Bureau of Labor Statistics. It includes tools for downloading the latest data from the BLS website and fetching Consumer Price Index (CPI) data via the BLS API.

data government sqlite us

Last synced: 01 Apr 2026

https://github.com/fastpix/android-data-bitmovin

FastPix Video Data SDK to monitor and analyze video playback metrics within Bitmovin for android

analytics android-sdk bitmovin data fastpix metrics player sdk video

Last synced: 16 Apr 2026

https://github.com/jameshenderson12/data-lists

This respository contains lists of useful data that can be used in a variety of projects.

countries data list names scottish text

Last synced: 05 Mar 2026

https://github.com/derhuerst/uic-codes

UIC country codes.

data dataviz i18n transit

Last synced: 05 Mar 2026

https://github.com/jigyasag18/amazon-prime-power-bi-dashboard

The Amazon Prime Power BI Project is a centralized data storage system containing detailed information on movies and TV shows available on Amazon Prime Video, including metadata and analytics insights. It supports data-driven decision-making for content acquisition and viewer engagement strategies. This repo is optimized for querying & analysis.

dashboard data data-visualization dataanalysis dataanalytics datacleaning dataset powerbi powerbi-dashboards powerbi-report powerbi-visuals powerbidashboard

Last synced: 05 Mar 2026

https://github.com/evyatarmeged/mdg

Data mocking web application built with Python & Flask

csv data flask generate json mocking python sql xml

Last synced: 17 Apr 2026

https://github.com/joshuagilgallon/cam-data

Large collection of data about digital cameras

camera data

Last synced: 17 Apr 2026

https://github.com/rawdaabdelsalam42/data-cleaning-sql-python-powerbi

Data cleaning project for an e-commerce sales dataset using Python (Pandas) for preprocessing, SQL Server for queries, and Power BI for building an interactive dashboard visualization.

dashboard data data-engineering pandas powerbi python sql

Last synced: 17 Apr 2026

https://github.com/holo-nim/flue

data streaming options

data nim reader-writer streams

Last synced: 04 Apr 2026

https://github.com/bhavanachitragar/layoff_analysis

This Streamlit app is designed for Layoff Analysis. It allows users to explore and analyze layoff data from different perspectives, including overall analytics, country-specific insights, and individual company details.

data dataanalysis streamlit streamlit-webapp

Last synced: 18 Apr 2026

https://github.com/jigyasag18/iit-guhawati-final-capstone-project

Smart Dynamic Parking Price Optimization System that adjusts parking fees in real-time based on demand, traffic, and competition. It employs adaptive pricing models and rerouting logic to enhance parking utilization and reduce congestion. The system is visualized via an interactive Streamlit dashboard, enabling users to simulate dynamic pricing.

bokeh bokeh-server bokehplots capstone-project data dataset deployment machine-learning machine-learning-algorithms matplotlib matplotlib-pyplot mlproject normalisation numpy pandas pathway python streamlit

Last synced: 05 Apr 2026

https://github.com/mksingh431/free-data-science-courses

Data science is a rapidly growing tech field that’s transforming business decision-making. To break into this field, you need the right skills. Fortunately, top institutions like Harvard and IBM offer free online courses. These courses cover everything from basic programming to advanced machine learning.

course data data-analysis data-science data-visualization free freecou python

Last synced: 19 Apr 2026

https://github.com/omers/sre-devops-tools

Tools and useful sources for SRE and DevOps

awsome awsome-list data devops monitoring sre tools

Last synced: 20 Apr 2026

https://github.com/rick-does/json-razor

Reduces JSON, YAML, and NDJSON volume by collapsing repeated structures while preserving the schema, making the schema easier for you to read.

cli data devtools json logs ndjson schema yaml

Last synced: 20 Apr 2026

https://github.com/mozzo1000/web-analytics

Website analysis tools and data

analysis analytics data website

Last synced: 21 Apr 2026

https://github.com/stefen-taime/llm-rag-mtl-public-hospital

Ce projet développe un modèle de type Retrieve-Augment-Generate (RAG) pour répondre aux questions en utilisant les données publiques des avis laissés sur Google pour des hôpitaux à Montréal

data google-reviews hopital hospital hub ia llm montreal open-source quebec rag

Last synced: 21 Apr 2026

https://github.com/yuvrajsaraogi/-iris-flower-classification

Iris flower has three species; setosa, versicolor, and virginica, which differs according to their measurements. Now assume that you have the measurements of the iris flowers according to their species, and the task is to train a machine learning model that can learn from the measurements of the iris species and classify them.

classification data data-analysis data-science data-visualization flower flower-classification iris iris-classification iris-flower iris-flower-classification knn knn-classification machine-learning machine-learning-algorithms ml natural-language-processing nlp python

Last synced: 24 Apr 2026

https://github.com/marielachirinosr/cyclistic-data-analytics-project

This project explores user behavior within a fictional bike-sharing system, modeled after Cyclistic, operating in Chicago.

data data-visualization pandas powerbi-report powerbi-visuals python

Last synced: 24 Apr 2026

https://github.com/xjwllmsx/hacker-news-engagement

Analyze Hacker News data to reveal which post types and posting hours spark the most discussion, using Python and a reproducible Jupyter notebook.

data data-analysis jupyter python

Last synced: 25 Apr 2026

https://github.com/shwetajanwekar/prediction-with-regression

prediction with regression for salary_hike and delivery time dataset

data data-science datset exploratory-data-analysis matplotlib pandas plot prediction r2-score seaborn sns

Last synced: 25 Apr 2026

https://github.com/f-ssemwanga/pandas-numpy-repo

This repo has extensive work I have done on Pandas and NumPy Modules during the advanced programming Module

cleaning-data-in-python data numpy-arrays pandas visualization

Last synced: 27 Apr 2026

https://github.com/fatihemres/africa

Africa app by SwiftUI. Using AVFoundation, MapKit, data, models, animations, stickers.

animations avfoundation data mapkit models swift swift-animations swiftui

Last synced: 27 Apr 2026

https://github.com/yuweaec/project-scidatapipeline

A comprehensive toolkit for processing, simulating, and analyzing scientific data, integrating Python, Fortran, and Jupyter notebooks for seamless workflows.

analysis data pipeline processing scientific simulation

Last synced: 27 Apr 2026