An open API service indexing awesome lists of open source software.

data

Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)

https://github.com/prateekmaj21/tableau-public-links

Tableau work as part of Data Visualization [AI&DS_205]

data data-visualization dataanalytics tableau-public

Last synced: 24 Jan 2026

https://github.com/ddeepanshu-997/support_vector_regression--svr-

In this repository i performed a support vector regression on real life data , initially i performed some data preprocessing technique in order to filter out the data flaws then undergoes the process of model building i.e SVM regression in order to make a machine learning regression model.

data data-science regression-analysis regression-models svm-model svm-regression

Last synced: 03 Aug 2025

https://github.com/bastianolea/sinim_municipal_genero

Datos comunales de género del Sistema Nacional de Información Municipal

chile comunas data genero laboral tiempo

Last synced: 23 Jun 2026

https://github.com/bishtrishu/pizza_sales_analysis_dashboard_sql_bi

Welcome to the Pizza Sales Analysis Dashboard project! This repository contains a comprehensive guide to building an interactive and insightful dashboard for analyzing pizza sales data using SQL and Power BI.

data data-science dataanalyst datavisualization dax dax-query microsoft microsoft-azure microsoft-sql-server msexcel mysql powerbi powerquery project sql

Last synced: 16 Mar 2026

https://github.com/abdullahashfaqvirk/Earth-Engine-Data-Scraper

A Python based web scraper designed to extract and organize dataset metadata from the Google Earth Engine Datasets Catalog for research, and analysis purposes.

beautifulsoup data data-science python requests scraper web-scraping

Last synced: 27 Sep 2025

https://github.com/dynamiatools/module-importer

DynamiaTools extension to work with excel files for import data

data dynamia excel import java zk

Last synced: 06 Feb 2026

https://github.com/cmdrvl/rvl

rvl reveals the smallest set of numeric changes that explain what actually changed between two datasets — or confidently tells you nothing changed.

cli csv data data-quality data-validation diff finance numerical-analysis open-source ops rust tooling

Last synced: 25 Feb 2026

https://github.com/jun-labs/jq

🧷 Let's practice jq.

data jq json json-data parse

Last synced: 27 Sep 2025

https://github.com/spatialcurrent/go-flat

Recursively flatten a slice of slices.

big-data bigdata data

Last synced: 29 Jan 2026

https://github.com/audeering/datasets

Data cards for public audb datasets

audb audio data management

Last synced: 29 Jan 2026

https://github.com/apoorv74/njdg-stats

Tracking data from the National Judicial Data Grid's (NJDG) district courts portal

data git-scraping judiciary law

Last synced: 29 Jan 2026

https://github.com/i-rzr-i/domaincommonextensions

The purpose of this repository/library is to provide the most relevant and used extension methods in the life cycle of application development that allow us to improve our code, and writing speed, and use more efficiently dev team time during this period for more complex functionality.

api class data datatype extension helper object parser type util

Last synced: 20 Sep 2025

https://github.com/apigear-io/template-qtcpp

QtC++ technology template

data plugin qml qt qt5

Last synced: 25 Feb 2026

https://github.com/rosacarla/databases

Bases de dados utilizados em atividades práticas do MBA Data Analytics do IGTI.

data data-analytics dataset

Last synced: 19 Mar 2026

https://github.com/bearaujus/bdatamatrix

Structured Tabular Data Management in Go

data go golang matrix

Last synced: 30 Jan 2026

https://github.com/rodgeraraujo/open-dataverse

OpenDataverse: ETL application to filter and import open data from https://dados.ifpb.edu.br/ save on database, and exported via a Rest API.

data dataset dataverse flask ifpb pandas python

Last synced: 05 May 2026

https://github.com/tompollard/data

Repository to hold sample datasets etc

data

Last synced: 05 Jan 2026

https://github.com/richardlitt/bird-watching

My birdwatching list and repo

birding data ebird

Last synced: 26 Jan 2026

https://github.com/entorb/analyze-ha-energy

Analyze Home Assistant Solar Production Data

data home-assistant pandas photovoltaic pv python

Last synced: 08 May 2026

https://github.com/thedevreda/jadaerospace

A Real life project showing how to improve selling aircraftparts and helping salers to focus more on effective products at JadAero

data data-analysis data-cleaning data-visualization jupyter-notebook powerbi python

Last synced: 02 Aug 2025

https://github.com/adamouization/python-machine-learning-data-science-notes

:orange_book: Jupyter notebooks containing useful Python code and notes for general Machine Learning and Data Science projects.

data data-science data-visualization guide jupyter jupyter-notebook machine-learning matplotlib notes numpy pandas pandas-dataframe python seaborn

Last synced: 11 Apr 2026

https://github.com/plurid/delog

Cloud Service for Centralized Logging

cloud data logging

Last synced: 08 Nov 2025

https://github.com/bhojpur/dlm

The Bhojpur DLM is a software-as-a-service product used for Data Lifecycle Management based on Bhojpur.NET Platform for data delivery.

data lifecycle-management

Last synced: 19 Feb 2026

https://github.com/plurid/datasign

Single Source of Truth Data Contract Specifier

data file-format

Last synced: 08 Nov 2025

https://github.com/praxtube/dogg

CLI tool to log data manually

data data-logger log logger

Last synced: 10 Jun 2026

https://github.com/ymorsi7/quranicvisualization

A visual exploration tool for the Holy Quran using D3.js treemaps.

css d3 d3js data data-visualization html islam islamic javascript js quran quranic treemaps visualization

Last synced: 15 Apr 2026

https://github.com/shibbbbs/fastapi_project

A FastAPI application that reads financial data from an Excel file (capbudg.xls) and provides API endpoints to list available tables (sheet names), fetch row names from a selected table, and calculate the sum of numerical values from a specified row. The API is accessible via a web-based interactive documentation at /docs

data dataanalysis fastapi pandas python

Last synced: 06 May 2026

https://github.com/edjoukou/human_resources

A data analysis project using MySQL Server database

analysis data mysql powerbi sql visualization

Last synced: 25 Sep 2025

https://github.com/matt-dray/draytasets

:1234::disguised_face: Miscellaneous datasets I've collected or prepared

card-games data phd pokemon

Last synced: 09 Feb 2026

https://github.com/poode/firebase-modeling

Get firebase/firestore entity model to migrate to mongo or any db later

data database firebase firestore modeling schema

Last synced: 06 May 2026

https://github.com/2022-04-11588/data-fakes

🔍 Generate realistic fake data for testing and development, enhancing your projects with simple, customizable data solutions.

data dataset developer-tools fake-content faker fakery groovy java mock phoenix python random ruby seeding struct swift-framework test-data testing

Last synced: 11 Apr 2026

https://github.com/word2vect/beijing-new-house-data-visualization

Beijing New House Data Visualization for Python Programming 2024 Fall Data Visualization Lab

data python visualization

Last synced: 13 Jun 2026

https://github.com/vatshayan/songs-datasets

Datasets for Songs and Music for Dancing, Emotional, Happy and scenic view

1000dataset classfication csv data datapackage datapackages dataset datasets excel free freedata freedatasets genre machine music sgenre song songs

Last synced: 18 Mar 2026

https://github.com/ryanga09/digitalent_fundamentaldatascience-selfpractice

A repository of hands-on projects from DigiTalent’s Fundamental Data Science training, covering web scraping, data exploration, data cleaning, and data annotation. Includes Jupyter notebooks and example code for practical learning.

data data-analysis data-science data-visualization dataset digitalent komdigi notebook-jupyter notebooks

Last synced: 02 Aug 2025

https://github.com/xljones/bugsnag-exporter

Export Bugsnag project, error, and event data easily from a command line call which automatically handles pagination, and API backoffs

bash bugsnag cmd csv data error error-capture error-handling error-reporting event export go golang json project zsh

Last synced: 06 May 2026

https://github.com/kunalthakur204/visualization-on-flower

🌸 Flower Dataset Visualization Visualizing patterns and relationships in flower data through charts and plots. Perfect for exploring floral characteristics and trends! 📊

data data-visualization dataanalysis flowerdataset python

Last synced: 16 Apr 2026

https://github.com/jigyasag18/airline-performance-and-passenger-satisfaction-project-using-big-data-analytics

This project analyzes 10 years of U.S. domestic airline data (~3GB) using Hadoop (Cloudera) and Hive for data processing. Power BI dashboards visualize key metrics like delays, on-time rates, air time, and diversions. The solution includes Hive queries, DAX measures, HDFS ingestion scripts, and year-wise insights with recommendations.

big-data big-data-analytics bigdata cloudera cloudera-hadoop cloudera-hadoop-framework data data-analysis data-visualization database hadoop hive power-bi powerbi powerbi-dashboard powerbi-dashboards powerbi-report powerbi-visuals powerbi-visuals-tools powerbidashboard

Last synced: 01 Aug 2025

https://github.com/jigyasag18/global-terrorism-1970-2017-analysis-using-big-data

This repository explores over 180,000 terrorist incidents across 205 countries using Hadoop and Power BI. The project identifies global and regional patterns in terrorism, analyzes the impact on civilians, and highlights high-risk areas. Key insights include attack trends,weapon usage,top terror groups,& country-specific risks like those in India.

big-data big-data-analytics data data-analysis data-visualization dataanalytics dataset hadoop hive hive-database hive-db hivedb power-bi powerbi powerbi-dashboards powerbi-desktop powerbi-report powerbi-report-validation powerbi-visuals powerbidashboard

Last synced: 19 Feb 2026

https://github.com/darrendavy12/azure-databricks-setup-guide-with-formula1-csv

Azure Databricks Setup Guide with Formula1 CSV - Azure Databricks, PySpark, Python, Data Lake Storage

apache azure cloud data databricks lake notebooks pyspark python spark storage

Last synced: 06 May 2026

https://github.com/word2vect/beijing-pm2.5-data-process

Beijing PM2.5 Data Process for Python Programming 2024 Fall Data Visualization Lab 2

data python visualization

Last synced: 15 Jun 2026

https://github.com/bishtrishu/super_store_sales_dashboard

This repository contains a comprehensive sales analysis dashboard for a Superstore, created using Power BI. The objective is to contribute to the success of a business by utilizing data analysis technique, specially focusing on time series analysis, to provide valuable insights and accurate sales forecasting.

analytics data data-science dataanalysis dataanalyst datacleaning datascience datavisualization-project excel microsoft-azure microsoft-excel powerbi report sql

Last synced: 28 Feb 2026

https://github.com/sumaiyyaf/british-airline-dashboard

This Tableau dashboard visualizes British Airways customer reviews, showcasing key metrics like average ratings for service, entertainment, and seat comfort. It features interactive filters for exploring ratings by aircraft type, country, and traveler type, along with trend analysis over time.

analysis dashboard data tableau visualization

Last synced: 13 Feb 2026

https://github.com/miozilla/pandas

pandas :panda_face::panda_face: : Python Library # Data Analysis # Dataframe

analysis data dataframe pandas python sqlite3

Last synced: 07 May 2026

https://github.com/xiaomingx/10000-public-apis-and-data

Public APIs are interfaces that allow developers to access various services, features, or data from external systems or platforms.

api-ecosystem api-integration data developer-friendly-apis open-api-access public-api-tools third-party-services

Last synced: 30 Jul 2025

https://github.com/hudson-newey/data-miner

A simple data miner that collects information from an API and stores it in a file

api api-client big-data bigdata data logger logging

Last synced: 10 Jun 2026

https://github.com/ciyer/altair-matplotlib

Ports of examples from a Matplotlib tutorial to Altair/Vega

altair data dataviz vega vega-lite

Last synced: 29 Jul 2025

https://github.com/imartinezl/madrid-challenge

Madrid Route Optimization Challenge 🚚♻️🚚

challenge city data optimization routing-algorithm traffic

Last synced: 28 Feb 2026

https://github.com/molinsagustin/cinedata

# CineData Trabajo práctico grupal para la materia Ingeniería de Datos I en la Universidad Argentina de la Empresa. El mismo consistió en el desarrollo de una base de datos relacional en Microsoft SQL Server Managment Studio utilizando metodología Ágil SCRUM, que se utilizó desde el relevamiento de requisitos hasta la implementación final.

agile data data-modeling database diagram entity-relationship-diagram microsoft-sql-server relational-databases relational-model scrum scrum-agile sql sqlserver

Last synced: 28 Feb 2026

https://github.com/arch-fan/pokedata

Pokemon Data in CSV format for whatever you need!

csv data dataset pokemon

Last synced: 17 Jun 2026

https://github.com/ibttf/bayborhood

Interactive map to find the ideal neighborhood in San Francisco based on data.

data data-analysis data-visualization gis mapbox react

Last synced: 18 Jun 2026

https://github.com/madhuresh2011/genai-powered-data-analytics-by-tata

I recently participated in Tata iQ's job simulation on the Forage platform, and it was incredibly useful to understand what it might be like to be on a data analytics team in an AI transformation consulting role.

chatgpt data dataanalytics eda excel gemini generative-ai internships powerpoint presentation

Last synced: 14 Feb 2026

https://github.com/jigyasag18/ai-ml-salaries-and-ai-tools-usage-trends

This repository presents an in-depth Power BI analytics report on the AI job market trends and student AI tool usage from 2020 to 2025. It combines structured datasets (job postings, salaries, surveys) with custom DAX measures to uncover key patterns in salaries, remote work, industry demand, and student engagement. 5 interaractive dashboards made.

analysis data data-analysis data-visualization dataanalysis dataanalytics dataset datavisualization power-bi powerbi powerbi-dashboards powerbi-desktop powerbi-report powerbi-visuals powerbidashboard visualization

Last synced: 16 Feb 2026

https://github.com/dushansenadheera/web_scraper

web scraper using Python along with BeautifulSoup and Selenium

beautifulsoup data python selenium web-scraping

Last synced: 19 Jun 2026

https://github.com/danyal-faheem/project-logs-analyzer

This repo contains scripts to analyze project logs and display some charts related to the data

data data-visualization matplotlib pandas python streamlit

Last synced: 07 May 2026

https://github.com/jigyasag18/iit-guhawati

Empower Sakhi is a data-driven platform that uses machine learning to identify women at risk of domestic violence in India. It offers confidential self-assessments, survivor stories, and emergency resources through a trauma-informed, privacy-focused web app. The project also provides NGOs with actionable insights via Power BI dashboard for support.

aiml data dataset datavisualization domestic-violence eda jupyter-notebook label-encoding machine-learning machine-learning-algorithms machine-learning-models machinelearning machinelearningprojects powerbi python python-app random-forest random-forest-classifier streamlit streamlit-webapp

Last synced: 08 May 2026

https://github.com/davidkhala/datasets

sample datasets

data

Last synced: 19 Mar 2026

https://github.com/aaronspindler/selfdrivingcar

Learning deep learning and making a self driving car in the process

car data deep deep-learning driving keras learning machine machine-learning python self self-driving-car

Last synced: 09 Apr 2026

https://github.com/abhishekn1947/samgov-scraper

Automated Python scraper for sam.gov contracts

analytics automation aws data pandas postgresql rds selenium webscraper

Last synced: 09 Apr 2026

https://github.com/soenneker/soenneker.attributes.mapto

A C# attribute for generic data mapping translation

attributes columns csharp data datatables dotnet mapping mapto maptoattribute object

Last synced: 02 Mar 2026

https://github.com/kayahr/datastream

Data stream classes for writing and reading all kinds of data types, even single bits

data datastream input output stream typescript

Last synced: 01 Aug 2025

https://github.com/lckylke/vizweb

Web application for data visualization:)

data expressjs nextjs web

Last synced: 08 May 2026

https://github.com/creativecuriositystudio/cruddle

(DEPRECATED) Simplifying CRUDL screen development using ModelSafe

angular2 crud data html model typescript ui web

Last synced: 09 Apr 2026

https://github.com/bastianolea/servel_elecciones_core

Resultados electorales desde Servel (2024)

chile comunas data elecciones genero

Last synced: 01 Aug 2025

https://github.com/metapsy-project/data-depression-anxiety-transdiagnostic

Database of transdiagnostic treatment of depression and anxiety

data

Last synced: 01 Apr 2026

https://github.com/yashkp1234/movie-recommendation-engine

My project on analyzing the movie data set, and creating a recommendation engine using that analysis.

analysis data notebook python recommendation-engine

Last synced: 04 May 2025

https://github.com/bonnevoyager/quick-storage

Simple key/value storage module with persistency.

browser data fs indexeddb javascript key-value nodejs persistence quick server storage

Last synced: 16 Apr 2026

https://github.com/erickpeirson/jhb-data

Data from the forthcoming paper: Quantitative Perspectives on Fifty Years of the Journal of the History of Biology

data geolocation history-of-biology named-entity-recognition topic-modeling

Last synced: 04 Mar 2026

https://github.com/jigyasag18/power-bi-dashboard-project

The Ecommerce Sales Analysis Dashboard project utilizes Power BI to provide detailed insights into ecommerce sales data, enabling stakeholders to track key performance metrics and uncover trends. This interactive dashboard allows users to explore the data in real-time, offering features such as drill-down capabilities, customizable filters.

dashboard data data-visualization datacleaning datanalysis datanalytics datapreprocessing powerbi visulaization

Last synced: 04 Mar 2026

https://github.com/mbagalman/lattice-doe

Python code to create experimental designs optimized to meet statistical power targets

abtesting data datascience designofexperiments experimentaldesign statistics

Last synced: 19 Jun 2026

https://github.com/derhuerst/uic-codes

UIC country codes.

data dataviz i18n transit

Last synced: 05 Mar 2026

https://github.com/cunfuu/network-bubbles

For Easier to manage organizations and keeping notes about them to organize events and easy access their needs

data data-visualization organizations organizations-volunteer

Last synced: 31 Jul 2025

https://github.com/michael-ljn/cirp-lce-2025

Prospective Global Warming Potential of Australian Low-Emission Hydrogen in a Net-Zero Emission Context

data publication

Last synced: 06 Mar 2026

https://github.com/taquece/goals-per-match

basic script to calculate average football goals per match from .CSV

beginner csv data football nodejs python sports-analytics

Last synced: 09 May 2026

https://github.com/mecha-cms/x.time

Creates page time data if it does not exist.

data date extension page time

Last synced: 23 Mar 2025

https://github.com/farrelfaricaf/exploratorydataanalyst---titanic

This project analyzes the Titanic dataset using exploratory data analysis (EDA) and visualization techniques to identify survival patterns. The goal is to understand how demographic factors like gender and age influenced survival rates during the 1912 disaster.

data data-analysis data-science data-visualization eda python titanic-dataset

Last synced: 31 Jul 2025

https://github.com/inist-cnrs/ws-data

Modèles et données pour les web services

data dvc models

Last synced: 03 Sep 2025

https://github.com/sushmashreeps/data-science-with-python

This repository showcases a comprehensive data science project utilizing Python, demonstrating expertise in data analysis, visualization, and machine learning. Built with Python 3.x, the project leverages popular libraries like Pandas, NumPy, Matplotlib, Seaborn, Scikit-learn, and TensorFlow. The project features data preprocessing, feature engine

cnn data dataanalysis datascience keras linear-regression matplotlib python python3 regression rnn visualization

Last synced: 14 Apr 2026

https://github.com/satyam4229/iit-and-nit-college-dataset

The dataset for IITs and NITs typically includes information related to these premier engineering institutions in India, such as their names, locations, rankings, academic programs offered, faculty details, student information, admission process, infrastructure and facilities, placements.

college-data csv data excel iit nit

Last synced: 04 Jan 2026

https://github.com/stdlib-js/wasm-base-dtype2wasm

Return the WebAssembly data type associated with a provided array data type value.

array base data dtype javascript node node-js nodejs stdlib type types util utilities utility utils wasm webassembly

Last synced: 09 May 2026

https://github.com/amethyst-php/attendance

Indicate the attendance/absence of an employee in a defined office with a range of dates

amethyst amethyst-package api attendance data laravel

Last synced: 17 Apr 2026

https://github.com/steveanik/kestra

Infinitely scalable, event-driven, language-agnostic orchestration and scheduling platform to manage millions of workflows declaratively in code.

data data-engineering data-integration data-pipeline data-quality elt etl low-code orchestration pipelines scheduler workflow workflow-engine

Last synced: 06 Jan 2026

https://github.com/purarue/scramble-history

parses rubiks cube scramble history/solve time from cstimer.net, cubers.io, twistytimer -- merges them together giving you uniform averages/data/graphs

cstimer cubing data rubiks-cube speedsolving

Last synced: 11 Jun 2025

https://github.com/abhroroy365/market_analysis

This project explores customer segmentation and market analysis in the context of online retail using an online retail dataset. By applying advanced analytics, we aim to uncover insights that can drive strategic decisions and enhance business performance.

clustering data data-analysis data-visualization kmeans-clustering machine-learning market-analysis python silhouette-analysis

Last synced: 09 May 2026

https://github.com/holo-nim/flue

data streaming options

data nim reader-writer streams

Last synced: 04 Apr 2026