An open API service indexing awesome lists of open source software.

data

Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)

https://github.com/jbn/vaquero

A Python library for iterative and interactive data wrangling at laptop-scale.

data data-analysis data-cleaning data-mining dirty-data elt etl etl-framework

Last synced: 10 Jun 2026

https://github.com/moons-14/datapot

Incorporate and serve all information.

ai aiogram api data infomation news newspaper rss video

Last synced: 04 Jan 2026

https://github.com/bala-1409/sales-forecasting-datascience-project

Develop a data science project using historical sales data to build a regression model that accurately predicts future sales. Preprocess the dataset, conduct exploratory analysis, select relevant features, and employ regression algorithms for model development. Evaluate model performance, optimize hyperparameters, and provide actionable insights.

data data-analysis data-science data-visualization datacleaning exploratory-data-analysis machine-learning-algorithms modelfitting prediction predictive-analytics predictive-modeling python3 regression-models salesforecast supervised-learning

Last synced: 26 Apr 2026

https://github.com/ralzz/dibimbing_datascience

This project contains an Exploratory Data Analysis (EDA) of the Estonia Passenger List dataset. I handled missing values, removed duplicate data, and created basic visualizations to find insights.

data data-science eda google-colab kaggle pandas python

Last synced: 06 May 2026

https://github.com/neurazum-ai-department/tumor-stages-dataset---v1

Synthetic MRI data generated by the ‘HF’ and 'Vbai' models based on real data.

brain data dataset datasets image mri neuroscience tumor tumor-segmentation

Last synced: 18 Mar 2026

https://github.com/ludreinsalvador/global-covid-19-data-analysis

Contains Power BI dashboards that visualizes and analyzes global COVID-19 cases, deaths, and vaccination trends using data from the World Health Organization (WHO). The project aims to provide insights into the pandemic’s impact and vaccination progress worldwide through dynamic reports and advanced analytics.

analytics covid-19 covid19-data data data-analysis data-collection data-transformation data-visualization

Last synced: 26 Feb 2026

https://github.com/dysnomia-studio/achieve-games-dump

Dump parts of achieve.games database to public including Steam Games List

data dump games steam steam-api steam-game steam-games

Last synced: 27 Feb 2026

https://github.com/ahmad-ali-rafique/random-forest-regressor-modeling

Detailed exploration of random forest regressors, including data cleaning, model building, and performance evaluation on various datasets.

data dataanalytics datacleaning evaluation-metrics modeling random-forest random-forest-regression regression regression-analysis

Last synced: 05 Mar 2025

https://github.com/mbagalman/lattice-doe

Python code to create experimental designs optimized to meet statistical power targets

abtesting data datascience designofexperiments experimentaldesign statistics

Last synced: 19 Jun 2026

https://github.com/krescruz/pegaso-data

Utilerías para el analisis de datos del Proveedor de Certificación de Factura Pegaso

cfdi-mexico data pac sat-gob

Last synced: 29 Apr 2026

https://github.com/vatshayan/songs-datasets

Datasets for Songs and Music for Dancing, Emotional, Happy and scenic view

1000dataset classfication csv data datapackage datapackages dataset datasets excel free freedata freedatasets genre machine music sgenre song songs

Last synced: 18 Mar 2026

https://github.com/thesfinox/fit-the-data

Data analysis using Wolfram Mathematica

analysis data data-analysis lab mathematica wolfram wolfram-mathematica

Last synced: 24 Jan 2026

https://github.com/aliasgarsogiawala/dashboards

Power BI dashboards , each folder contains a pbix file and a pdf file with explanation of the dashboard

analysis dashboards data data-visualization powerbi

Last synced: 12 Feb 2026

https://github.com/miozilla/snowden

snowden :snowman::video_game: : VR Game # Snowflake # Data Engineering # ELT

data elt engineering snowflake sql vr-game

Last synced: 11 Feb 2026

https://github.com/kenanbek/youtube-data

YouTube stats data over YouTube Data API v3 using Python.

data python youtube youtube-api

Last synced: 13 May 2026

https://github.com/entitizer/data-js

Entitizer data module

data entitizer storage

Last synced: 25 Jan 2026

https://github.com/bastianolea/mineduc_personal_academico

Datos de Personal Académico, entre los años 2008 y 2024, del sistema de Educación Superior.

chile data educacion meses tiempo

Last synced: 19 Jun 2026

https://github.com/khalyomede/request

Function to validate request data for V.

data function request validate vlang

Last synced: 12 Feb 2026

https://github.com/beastbytes/postal-code-data-php

Implementation of PostalCodeDataInterface using PHP file storage

data php postal-code yii3

Last synced: 27 Feb 2026

https://github.com/interzoid/php-examples

Provides PHP examples for consuming several of the Cloud APIs available from Interzoid, including company name matching, individual name matching, weather, page performance, email validation, currency rates/FOREX, and global telephone information.

api cloud data database php quality

Last synced: 12 Jan 2026

https://github.com/bzekeria/quran_dataset

The Holy Quran (Islam) Dataset

data islam quran religion

Last synced: 12 Feb 2026

https://github.com/jeswr/blog

My personal blog

ai blog data semantics solid web

Last synced: 13 Feb 2026

https://github.com/bryanhe24/data_analysis_app

A full-stack web application that allows users to upload CSV datasets, analyze the data with statistical summaries and visualizations, and interact with an AI-powered assistant for querying the dataset.

ai data data-analysis data-visualization fullstack-development javascript math python reactjs

Last synced: 07 May 2026

https://github.com/omari-kd/recommendation-system-analysis-and-modelling

This project aims to develop a recommendation system that leverages historical user data to provide tailored recommendations across different domains, such as product recommendations, content suggestions and service optimisation.

data data-science data-science-in-r machine-learning-algorithms recommendation-system

Last synced: 08 Jan 2026

https://github.com/tjas/postgrad-ai-ddv-plotly

Jupyter Notebook to analyze the salaries of Federal District government public servants, using Python, Pandas and Plotly Express, to solve the proposed exercise in "Data Discovery and Visualization" discipline.

analysis analytics data data-analytics data-discovery data-science data-visualization graph graphs jupyter-notebook jupyter-notebooks pandas plotly plotly-express python

Last synced: 07 May 2026

https://github.com/rajlabmssm/echodata

echoverse module: Example data.

data echoverse fine-mapping genomics gwas qtl

Last synced: 17 Jan 2026

https://github.com/purarue/hpi-personal

Personal HPI modules/scripts

data history lifelogging

Last synced: 06 Apr 2025

https://github.com/infinitode/pywebscrapr

An open-source Python web scraping tool. Supports both image scraping and text scraping.

data data-collection data-science open-source pip scraping web-scraper

Last synced: 14 Feb 2026

https://github.com/imartinezl/madrid-challenge

Madrid Route Optimization Challenge 🚚♻️🚚

challenge city data optimization routing-algorithm traffic

Last synced: 28 Feb 2026

https://github.com/murtaza-arif/all-you-need-to-know-for-data-engineer

This repository is designed to showcase various aspects of data engineering, including tools, frameworks, and end-to-end projects. It covers everything from data ingestion and transformation to data warehousing and cloud-based solutions.

cassandra data data-engineering data-science kafka kafka-consumer kafka-streams pyarrow spark

Last synced: 07 May 2026

https://github.com/4ment/aiv-rate-heterogeneity

Avian influenza virus data sets

data influenza

Last synced: 24 Jan 2026

https://github.com/sunnahboy/checkfake_true_news

Building data structures using Linked lists and arrays and find best algorithms for implementing a system for detecting Fake News

algorithms data level low programming structure

Last synced: 28 Feb 2026

https://github.com/gabboraron/datacamp_projects

Here you can find my DataCamp Projects

data datacamp datacamp-projects

Last synced: 14 Jun 2026

https://github.com/madhuresh2011/genai-powered-data-analytics-by-tata

I recently participated in Tata iQ's job simulation on the Forage platform, and it was incredibly useful to understand what it might be like to be on a data analytics team in an AI transformation consulting role.

chatgpt data dataanalytics eda excel gemini generative-ai internships powerpoint presentation

Last synced: 14 Feb 2026

https://github.com/gui-sitton/bank-loans

In this project I will prepare a report for a bank's loan division. I find out whether a customer's marital status and number of children have an impact on loan default, as well as other factors

data data-analysis data-analysis-python data-science data-visualization python

Last synced: 21 May 2026

https://github.com/soenkekluth/micromitter

minimal and performant event emitter / dispatcher

data dispatch dispatcher emit emitter event eventdriven handler on send trigger

Last synced: 02 Nov 2025

https://github.com/lijesh010/roadaccidentanalysisproject

This data analysis project was completed using MS Excel, and includes the creation of a dashboard.

data data-analytics data-exploration data-visualization msexcel

Last synced: 15 Feb 2026

https://github.com/danpoynor/data-pagination-and-filtering-project

Data pagination exercise using 'vanilla' JavaScript. This script consumes a JSON array containing any number of objects and adds buttons to a page that users can click to navigate to different pages of data.

data javascript json navigation pagination vanilla-javascript

Last synced: 20 Apr 2026

https://github.com/luminati-io/google-search-api

Two methods to collect real Google SERP data—a free scraper for basic use and the enterprise-grade Bright Data API for high-volume demands.

data google-scraper html python serp-api web-scraping

Last synced: 25 Jun 2025

https://github.com/abhash-rai/regression-car-price-prediction

This repository contains my first complete data science project from web scrapping for data to data preprocessing, cleaning, exploratory data analysis, model training and deployment.

data data-science data-visualization eda exploratory-data-analysis machine-learning neural-network prediction prediction-model regression

Last synced: 08 May 2026

https://github.com/xmen3em/kaggle-competitions

This collection contains various projects and notebooks developed to tackle a range of Kaggle competitions, showcasing different machine learning techniques, data preprocessing methods, and model optimizations.

data data-science data-visualization deep-learning deployment ensemble-learning machine-learning-algorithms python streamlit

Last synced: 09 Apr 2026

https://github.com/harrisonwelch/pythondatascience

Repo of code from the linked-in lesson "Python: Data Analysis"

data data-science matplotlib notes numpy python tutorial

Last synced: 12 Apr 2026

https://github.com/bakangmonei/is_final_assignment

My intelligent systems assignment

data data-science intelligent-systems python

Last synced: 02 May 2026

https://github.com/apigear-io/template-cpp14

C++14 technology template

conan cpp cpp14 data library

Last synced: 18 Feb 2026

https://github.com/soenneker/soenneker.data.zipcode

US ZIP code data from USPS, updated daily

code csharp data dotnet usps zip

Last synced: 02 Mar 2026

https://github.com/kunalkumar2001/coffee_sales_project_using_excel_power-bi_and_sql

Coffee Shop Sales Dashboard built using Power BI for visualization and SQL for data extraction and transformation. The project dives deep into sales performance, providing actionable insights for data-driven decisions.

analytics data dataanalytics mssql powerbi sql

Last synced: 26 Jun 2025

https://github.com/mrk214/bible-data-es-spa

La Biblia en formato JSON

api bible biblia data god jesus json spanish

Last synced: 05 Apr 2025

https://github.com/taquece/goals-per-match

basic script to calculate average football goals per match from .CSV

beginner csv data football nodejs python sports-analytics

Last synced: 09 May 2026

https://github.com/metapsy-project/data-depression-anxiety-transdiagnostic

Database of transdiagnostic treatment of depression and anxiety

data

Last synced: 01 Apr 2026

https://github.com/szc126/metadata-nnd-vocalo-twitter

ボカロ系新着動画ツイートを収集 - "new VOCALOID/UTAU videos" tweet collection

data nico-nico-douga niconico vocaloid

Last synced: 20 May 2026

https://github.com/anuraganalog/onyx-data

BI Visualizations to the problems in website. All the Visualization can be found at the below link

data onyx public tableau viz

Last synced: 02 Apr 2026

https://github.com/ethenkem/pygraphsurvey

A python base web app that provide graphical analysis on data collected from surveys and the system has its on built in form fiiling where admin can set question and sent a link for the forms to be filled and then the system provide anylysis on the collected data. Form feature include selection options, range values file inputs etc

data

Last synced: 12 Jan 2026

https://github.com/pyfig/s21_data-science-bootcamp

School21 Bootcamp Data Science

data data-science numpy pandas python school21

Last synced: 26 Jun 2025

https://github.com/circlexo/circlexo

Open-source project to seamlessly integrate and manage your business workflow, connecting Jira, GitHub, Discord, Stripe, RevenueCat, and OpenAI all in one intuitive platform.

bussiness-intelligence data discord-bot forge github google jira kpis ploi revenuecat stripe vapor

Last synced: 20 May 2026

https://github.com/jameshenderson12/data-lists

This respository contains lists of useful data that can be used in a variety of projects.

countries data list names scottish text

Last synced: 05 Mar 2026

https://github.com/dsietz/daas-workshop

Workshop for building a Data as a Service platform using the DaaS SDK.

archconf daas daas-pattern data dataprivacy nfjs rust rust-lang

Last synced: 20 May 2026

https://github.com/jigyasag18/amazon-prime-power-bi-dashboard

The Amazon Prime Power BI Project is a centralized data storage system containing detailed information on movies and TV shows available on Amazon Prime Video, including metadata and analytics insights. It supports data-driven decision-making for content acquisition and viewer engagement strategies. This repo is optimized for querying & analysis.

dashboard data data-visualization dataanalysis dataanalytics datacleaning dataset powerbi powerbi-dashboards powerbi-report powerbi-visuals powerbidashboard

Last synced: 05 Mar 2026

https://github.com/kaungkhantkyaw1997/mock-schema-generator

A tool for generating mock data and implementations based on schema definitions. Ideal for testing and development.

data generator mock schema testing

Last synced: 05 Mar 2026

https://github.com/majorcluster/clj-data-adapter

A Clojure library designed to convert data

clojure data lib library

Last synced: 12 Jul 2025

https://github.com/md-emranhossen/leetcode-practice

This repository stores my solutions to LeetCode problems, organized by problem number and title.

cpp data datastructures-algorithms leetcode-solutions

Last synced: 26 Jun 2025

https://github.com/randomgamingdev/randomgamingdev.github.io.data

The data for RandomGamingDev.github.io (feel free to build your own website off of mine :D)

blog custom data projects projects-list

Last synced: 02 Jan 2026

https://github.com/foreteternelle/pokemonstudiodataapi

The GitHub repository of the Pokémon Studio Data Api

api data fangame

Last synced: 02 Apr 2026

https://github.com/eyluldursun/data-science-project

This project involves a data science analysis conducted on the Obesity Data Set. The study explores factors influencing obesity, includes data visualization, and develops predictive models. The goal of the project is to gain insights to help prevent obesity.

data data-science obesity r rmarkdown

Last synced: 26 Jun 2025

https://github.com/flexthink/matricize

A convenience library to convert between pure Python objects and their vectorized representations

data machine-learning numpy python

Last synced: 09 May 2026

https://github.com/master-helix/ibm-data-analyst-certification-stock-analysis-project

This is a mini project repository of my IBM Certification involving stock analysis and plotting of Tesla and GameStop

analytics data data-analysis data-visualization ibm matplotlib pandas python web-scraping

Last synced: 09 May 2026

https://github.com/cloud-shuttle/drover-sqlforge

The Data Automation Engine. A blazing-fast, pure Go alternative to dbt for data transformations.

ast data drover sql transformation

Last synced: 03 Jun 2026

https://github.com/yuvrajsaraogi/sales-prediction-using-python

Sales prediction involves estimating future product sales based on factors like advertising spend, target audience, and platform. Businesses rely on data scientists to forecast sales and optimize advertising costs. Machine learning in Python can be used for this task.

data data-analysis data-science data-visualization machine-learning matplotlib natural-language-processing numpy pandas prediction python sales-prediction-using-python sql

Last synced: 19 Apr 2026

https://github.com/opdev1004/crumbdbjs

JSON files based database Javascript

data data-storage data-store database database-management nodejs

Last synced: 18 Apr 2026

https://github.com/fliplet/fliplet-widget-data-source-query

Data Source Query Provider

data provider widget

Last synced: 11 Apr 2025

https://github.com/neelamraikwar9/bookdata

This is my 1st assignment git repository. I have worked with Book Data and by using Express Js created routes and API's for Post, Update, Delete, and Get.

api books data database deployment expressjs node nodejs postman postman-api

Last synced: 05 Apr 2026

https://github.com/mipacd/holochatstats

A VTuber chat log (and general) analytics platform

data flask hololive postgresql python visualization vtuber youtube

Last synced: 05 Apr 2026

https://github.com/jormaechea/aws-firehose-producer

Easily produce data for your AWS Firehose Data Stream

aws data firehose producer stream

Last synced: 19 May 2026

https://github.com/tomwhite/misp-2017

MISP camp 2017 materials and code

bioinformatics data data-visualization hackathon

Last synced: 18 Apr 2026

https://github.com/prakashjha1/loan-eligibility-prediction

This repository contains the codebase and resources for a machine learning-based project aimed at predicting loan eligibility for individuals. The project utilizes various algorithms and data preprocessing techniques to build predictive models that assess the likelihood of an applicant being eligible for a loan based on historical data.

data data-visualization exploratory-data-analysis loan-prediction-analysis machine-learning-algorithms naive-bayes-classification parameter-tuning python random-forest

Last synced: 19 Apr 2026

https://github.com/eryks1999/data-collection-project_python

This project allowed me to practice classes, populating json files as well as extracting data.

data git json python

Last synced: 16 Apr 2026

https://github.com/jigyasag18/ibm-power-bi-dashboard-project

IBM Power BI Dashboard Project is a data-driven analysis of employees using IBM's comprehensive dataset, providing insights into key factors contributing to employee turnover and enabling organizations to strategize effectively towards improved employee retention and satisfaction.

data data-visualization dataanalysis dataanalytics dataset datavisualisation datavisualization-project powerbi powerbi-dashboards powerbi-report powerbi-visuals powerbidashboard

Last synced: 07 Mar 2026

https://github.com/ahadly/sql-data-analytics-project

This repository contains a collection of SQL scripts demonstrating various analytical techniques, such as changes over time, cumulative, performance, data segmentation, part-to-whole analysis.

analytics business-analytics business-intelligence data data-analysis data-analyst data-analytics data-engineering data-science data-scientist database datascience query reporting sql sql-queries sql-query sql-server window-functions window-functions-in-sql

Last synced: 18 May 2026