An open API service indexing awesome lists of open source software.

data

Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)

https://github.com/soenneker/soenneker.dtos.requestdataoptions

A flexible request options object for paging, sorting, and filtering queryable data, similar to OData-style parameters.

controller coordinator csharp data dotnet dto dtos http manager object odata options request requestdataoptions

Last synced: 12 Mar 2026

https://github.com/foundationallm/.github

A platform accelerating delivery of secure, trustworthy enterprise copilots.

agent ai data enterprise generative-ai large-language-model llm ml tool

Last synced: 12 Feb 2026

https://github.com/quonverbat/ordner

A simple, customizable and cross-platform data tracker.

data datatracker javafx management

Last synced: 07 Jul 2025

https://github.com/miozilla/pandas

pandas :panda_face::panda_face: : Python Library # Data Analysis # Dataframe

analysis data dataframe pandas python sqlite3

Last synced: 07 May 2026

https://github.com/hackersandslackers/hackers-jupyter-posts

:red_circle: :closed_book: Our repository of Jupyter Notebooks that serve as blog posts.

blog data data-engineering gatsbyjs jupyter jupyter-notebook python python3

Last synced: 07 May 2026

https://github.com/publici/state-integrity-data

Data from a comprehensive assessment of state government accountability and transparency

data

Last synced: 04 Feb 2026

https://github.com/smac-group/smacdata

Data sets used in various packages.

data r

Last synced: 02 Apr 2025

https://github.com/pocketfullofdata/electric-vehicles-market-size-analysis

This project analyzes the growth, adoption trends, and future projections of the electric vehicle (EV) market. Using data analysis and visualization techniques, it examines key factors like sales trends and consumer adoption to understand the evolving landscape of the EV industry.

analysis data jupyter-notebook matplotlib numpy python seaborn vscode

Last synced: 07 May 2026

https://github.com/infinitode/pywebscrapr

An open-source Python web scraping tool. Supports both image scraping and text scraping.

data data-collection data-science open-source pip scraping web-scraper

Last synced: 14 Feb 2026

https://github.com/amir76717/healthai-pro

HealthAI Pro revolutionizes the healthcare experience by leveraging cutting-edge AI technologies to provide intelligent, personalized healthcare solutions to patients and medical professionals alike. This platform incorporates machine learning, natural language processing, and robust data management to enhance the quality of healthcare services.

data machine-learning nlp

Last synced: 31 Mar 2025

https://github.com/imartinezl/madrid-challenge

Madrid Route Optimization Challenge 🚚♻️🚚

challenge city data optimization routing-algorithm traffic

Last synced: 28 Feb 2026

https://github.com/safwan2003/randomforest_heart_disease_prediction

A machine learning project using Random Forest Classifier to predict heart disease. Includes data preprocessing (with binning), feature selection, and model evaluation.

binning data data-science datapipeline datapreprocessing datavisaulization deep-learning machine-learning python random-forest-classifier streamlit

Last synced: 07 May 2026

https://github.com/82luli02/sakila_dvd_rental_database_analysis

Analysis of the Sakila DVD Rental database using SQL

data data-analysis data-science data-visualization sql

Last synced: 10 Mar 2026

https://github.com/gusenov/open-data-scripts

Scripts for exploring and working with public open datasets.

charts data data-visualisation data-visualization datavisualization highcharts kazakhstan open-data opendata qazaqstan

Last synced: 28 Feb 2026

https://github.com/hemangsharma/dataanalysis

This repo contains analyses of NASDAQ data, including a dashboard and a time series forecast.

analysis data data-analysis data-visualization python

Last synced: 10 Mar 2026

https://github.com/elkingarcia11/mlb-gameday-obp-odds

Small Python script that pulls MLB team on-base percentage (OBP) for the current season, loads today’s schedule, and writes CSV files that list each team’s OBP edge against its opponent for the day. It also labels each side of a game as betting favorite, not favorite, or equal using American moneylines from ESPN’s public game data.

api csv data http https json mlb mlb-stats-api moneyline odds python rest sports urllib

Last synced: 30 May 2026

https://github.com/sadratehranian/data-collection-and-machine-learning

First, creates a logistic regression model to predict whether the fire alarm of a smoke detector should sound. Second, predicts whether an electric drive in a production plant may be faulty.

data data-analysis data-science datacollection logistic-regression machine-learning ml nn

Last synced: 05 Jan 2026

https://github.com/paezha/bsantiago

A data package with the results of a travel and well-being survey conducted in Santiago in 2016

data equity package r santiago survey travel well-being

Last synced: 18 Mar 2025

https://github.com/lijesh010/roadaccidentanalysisproject

This data analysis project was completed using MS Excel, and includes the creation of a dashboard.

data data-analytics data-exploration data-visualization msexcel

Last synced: 15 Feb 2026

https://github.com/nmelgar/marathons_data_viz

Data visualization project to analyze finishing times and other data.

csv csv-files data data-analysis data-insight data-visualization data-viz dataset tableau

Last synced: 15 Feb 2026

https://github.com/gourab337/karnataka-health-visualizer

Visualizer for Karnataka's district-wise healthcare info built using PHP

analytics data

Last synced: 19 Mar 2026

https://github.com/jigyasag18/iit-guhawati

Empower Sakhi is a data-driven platform that uses machine learning to identify women at risk of domestic violence in India. It offers confidential self-assessments, survivor stories, and emergency resources through a trauma-informed, privacy-focused web app. The project also provides NGOs with actionable insights via a Power BI dashboard.

aiml data dataset datavisualization domestic-violence eda jupyter-notebook label-encoding machine-learning machine-learning-algorithms machine-learning-models machinelearning machinelearningprojects powerbi python python-app random-forest random-forest-classifier streamlit streamlit-webapp

Last synced: 08 May 2026

https://github.com/davidkhala/datasets

sample datasets

data

Last synced: 19 Mar 2026

https://github.com/zsvoboda/olympics

Self-service analytics of 120 years of Olympics data

analytics dashboards data datavisualization dataviz olympics open-data open-datasets opendata reports

Last synced: 08 May 2026

https://github.com/omarcodex/data_analysis

My repository of past and present research and data-driven projects.

data ecodev ecology science sustainability yale

Last synced: 18 Jan 2026

https://github.com/randomfractals/unfolded-map-snippets

HTML, CSS, JavaScript, and Python 🐍 VS Code snippets ✂️ extension for Unfolded Map 🗺️ and Data SDKs

code data extension map sdk snippets template unfolded vscode

Last synced: 08 May 2026

https://github.com/soenneker/soenneker.attributes.mapto

A C# attribute for generic data mapping translation

attributes columns csharp data datatables dotnet mapping mapto maptoattribute object

Last synced: 02 Mar 2026

https://github.com/ybelenko/openapi-data-mocker-interfaces

Package with OpenApiDataMocker interfaces.

data fake faker interface mock mocker oas oas3 openapi swagger

Last synced: 05 Jan 2026

https://github.com/coderjolly/spotify-api-data-analysis

The project leverages Apache Airflow for automating Spotify API data analysis, focusing on user activity. Extracting, transforming, and loading data efficiently, it provides insights via PowerBI dashboards.

airflow airflow-dags data data-engineering etl etl-pipeline microsoft-sql-server power-bi python scripting sql

Last synced: 27 Mar 2026

https://github.com/nagar2nd/financial-analysis-power-bi

This project analyzes financial and credit card usage data using Power BI and DAX, focusing on customer behavior, credit risk, and financial performance. It includes insights on spending trends, delinquency rates, churn indicators, and satisfaction scores to drive better financial management and customer retention strategies.

analysis data dax dax-functions dax-query excel powerbi

Last synced: 03 Mar 2026

https://github.com/sakan811/show-leaving-soon-tracker-website

This is a Vue.js application that displays shows that are leaving each platform soon, featuring a countdown timer for each title based on the user's local timezone.

data hbo hbomax netflix shows streaming tv-shows vue vuejs web webapp website

Last synced: 18 Mar 2025

https://github.com/scx567888/scx-data

✨ SCX Data

data java scx

Last synced: 05 Apr 2025

https://github.com/shubhamsoni98/excel-practice

Excel-Practice-Questions

analysis data excel formula raw-data xlsx

Last synced: 03 Mar 2026

https://github.com/writetome51/page-load-access

A TypeScript/JavaScript class that loads a batch (array) of data from a larger set too big to be loaded all at once.

batch class data javascript load loader typescript

Last synced: 16 May 2026

https://github.com/vishwas-chakilam/hr-dashboard

This project involves creating an interactive HR Dashboard using Power BI for visualization and MySQL for data cleaning and analysis. It provides insights into employee performance, attrition, salary distribution, and hiring trends.

dashboard data datac datacleaning datavisualization mysql powerbi

Last synced: 23 Mar 2025

https://github.com/karo23361/toy-store-kpi-power-bi

PowerBI Portfolio Project

csv data data-visualization powerbi

Last synced: 03 Feb 2026

https://github.com/ashakoen/bls-data-extract

This repository contains scripts and a database schema to set up and manage a local SQLite database for storing and querying the Average Price data from the U.S. Bureau of Labor Statistics. It includes tools for downloading the latest data from the BLS website and fetching Consumer Price Index (CPI) data via the BLS API.

data government sqlite us

Last synced: 01 Apr 2026

https://github.com/fastpix/android-data-bitmovin

FastPix Video Data SDK to monitor and analyze video playback metrics within Bitmovin for Android

analytics android-sdk bitmovin data fastpix metrics player sdk video

Last synced: 16 Apr 2026

https://github.com/ksimicevic/discord-message-analyzer

Analyzing Discord messages in a Jupyter notebook

analysis data discord messages

Last synced: 16 Apr 2026

https://github.com/jameshenderson12/data-lists

This repository contains lists of useful data that can be used in a variety of projects.

countries data list names scottish text

Last synced: 05 Mar 2026

https://github.com/sehgal-vishal/world-population-

World Population SQL Analysis

data dataanalysis population sql

Last synced: 05 Mar 2026

https://github.com/amethyst-php/collection

As simple as the name: this package allows you to create collections of other models.

amethyst amethyst-package api collection data laravel

Last synced: 17 Apr 2026

https://github.com/chompfoods/sdk-typescript-angular

Angular TypeScript SDK for the Chomp Food & Recipe Database API. Use our API to get high-quality data on recipes and 875,000+ branded/grocery foods plus raw ingredients.

angular api branded chomp data database food grocery ingredients nutrition raw recipe-api recipes sdk typescript

Last synced: 09 May 2026

https://github.com/jwszolek/accelerated-data-generator

Ultra-fast random data generator. It can generate almost 1M rows in around a second.

bash csv data data-generator generator shell

Last synced: 02 Apr 2026

https://github.com/rezapace/newbash

This project involves managing various application shortcuts and configurations primarily for a Linux environment. It includes scripts for creating .desktop entries for applications, managing system configurations, and handling application processes.

automation backup bash data dekstop linux newbash ohmyzsh script testing zsh

Last synced: 11 Apr 2026

https://github.com/psyteachr/psyteachrdata

Datasets for psyTeachR Books

data

Last synced: 23 Mar 2025

https://github.com/lotfiferaga/instagram-reach-analysis

The Instagram Reach Analysis project aims to develop a Python-based tool to analyze the reach and engagement metrics of Instagram posts.

analytics data data-science datavisualization python

Last synced: 18 Jun 2026

https://github.com/2022-04-11588/data-fakes

🔍 Generate realistic fake data for testing and development, enhancing your projects with simple, customizable data solutions.

data dataset developer-tools fake-content faker fakery groovy java mock phoenix python random ruby seeding struct swift-framework test-data testing

Last synced: 11 Apr 2026

https://github.com/basemax/okala-product-ids

A PHP script to fetch and save product IDs from Okala's online store API across multiple categories and store branches.

crawler crawler-okala crawler-php crawlers data database ids ir iran json okala okala-crawler php php-crawler product

Last synced: 09 May 2026

https://github.com/tjpalanca/pins

Data Pins

data pins

Last synced: 05 Jan 2026

https://github.com/mecha-cms/x.time

Creates page time data if it does not exist.

data date extension page time

Last synced: 23 Mar 2025

https://github.com/snacks02/wobbling-statistics

Audio equipment statistics using Squiglink data

audio data data-visualization headphones iems speakers squiglink statistics

Last synced: 17 Apr 2026

https://github.com/rawdaabdelsalam42/data-cleaning-sql-python-powerbi

Data cleaning project for an e-commerce sales dataset using Python (Pandas) for preprocessing, SQL Server for queries, and Power BI for building an interactive dashboard visualization.

dashboard data data-engineering pandas powerbi python sql

Last synced: 17 Apr 2026

https://github.com/caiorss/julia-box-docker

A Docker-based development environment for the Julia language, Octave, Python, and R (Rlang), with Jupyter Notebook, Jupyter QtConsole, and so on.

data datascience deveops docker julia jupyter octave python rlang scientific

Last synced: 09 May 2026

https://github.com/ayush-raj8/godata

Writes data to a file. Standardizes the format for easy parsing and reading by other programs.

data golang

Last synced: 18 Jan 2026

https://github.com/anisimov-anthony/data_forest

Implementation of various types of trees

algorithms-and-data-structures data lib rust tree

Last synced: 28 Apr 2025

https://github.com/goto-eof/bitmaptize

Wraps data inside a .bmp and extracts data from .bmp.

bitmap bmp convert data wrap

Last synced: 18 Jan 2026

https://github.com/zurd46/zurdsynthdatagen

This Electron project uses the OpenAI ChatCompletion API to generate synthetic datasets in either German (DE) or English (EN).

data data-structures dataset electron json jsonl nodejs openai synthetic

Last synced: 04 Apr 2026

https://github.com/rd-uk/rduk-data-pg

PostgreSQL Data Provider implementation for rduk-data

data postgresql provider rduk

Last synced: 18 Apr 2026

https://github.com/kahlery/my-jupyter-notebook-projects

🐊 A collection of my data science analyses. I actually store most of my data science projects in my Google Drive because of Google Colab.

data jupyter-notebook python

Last synced: 12 Apr 2026

https://github.com/gunjanmimo/d3-visualization

D3.js is a JavaScript library for producing dynamic, interactive data visualizations in web browsers. It makes use of Scalable Vector Graphics, HTML5, and Cascading Style Sheets standards. It is the successor to the earlier Protovis framework.

d3js data data-science data-visualization reactjs

Last synced: 29 Apr 2026

https://github.com/stimulsoft/samples-dashboards.web-for-blazor-webassembly

Blazor WebAssembly (Wasm) samples for Reports.BLAZOR embedded components, Visual Studio C# projects, .NET 6, .NET 7, .NET 8 dashboards tool

blazor client-side converter dashboard data data-analysis data-sources database datagrid designer diagram dimension json net presentation print runtime viewer wasm webassembly

Last synced: 18 Apr 2026

https://github.com/cao7113/datalab

data lab and tools

data tool

Last synced: 18 Apr 2026

https://github.com/phelipe-sempreboni/certificates

Tutorial-style overview of the licenses and certificates I have acquired over time.

certificate certificates certification course data database datascience licences license-management marketing marketing-analytics python sql

Last synced: 16 May 2026

https://github.com/ahmad-ali-rafique/decision-tree-classifier-modeling

👏 Comprehensive exploration of decision tree classifiers, including data cleaning, model building 🏩, and performance evaluation on various datasets.

analytics classification classification-models data data-science dataanalytics datacleaning dataset decision-tree-classifier models

Last synced: 20 Apr 2026

https://github.com/anjaliwork20/moodify

Mood-based music recommendation system that considers a user's emotional state to recommend songs, genres, artists, and playlists using machine learning

artificial-intelligence cnn-keras cnn-model convolutional-neural-networks data data-analysis data-science data-structures data-visualization database deep-learning machine-learning machine-learning-algorithms python recommended song songs

Last synced: 20 Apr 2026

https://github.com/miraclx/split-merge

Efficient, flexible data stream chunker and merger

chunk data efficient merge middleware nodejs pipeline split stream

Last synced: 07 May 2026

https://github.com/ngofilho/scripts-db

Repository containing several sample database scripts.

cache data database db mariadb mongodb mysql oracle redis sql-server

Last synced: 11 Apr 2026

https://github.com/stdlib-js/array-base-symmetric-banded-filled2d-by

Create a filled two-dimensional symmetric banded nested array according to a provided callback function.

alloc allocate array callback data fill filled foreach generic javascript map matrix multidimensional node node-js nodejs stdlib strided structure types

Last synced: 20 Apr 2026

https://github.com/petermeissner/suuntor

Data from a Suunto watch extracted by R - !because!

automation data r rstats suunto windows

Last synced: 20 Apr 2026

https://github.com/brayflex/spy-sector-rotation-google-sheet

Creates a dynamic spreadsheet to visualize SPY and its 11 largest sector ETFs. See market trends and identify potential sector rotation opportunities.

data etf google-sheets index price rotation script sector spreadsheet spy stock-market

Last synced: 29 Jun 2026

https://github.com/schluppeck/2024-abdsa-notes

some notes related to DS's presentation

abdsa data python rstats science

Last synced: 21 Apr 2026

https://github.com/fastpix/android-data-kaltura

This SDK enables seamless integration with Kaltura Player, offering advanced video analytics via the FastPix Dashboard

analytics android-sdk data fastpix kaltura kaltura-player metrics sdk video video-metrics

Last synced: 21 Apr 2026

https://github.com/stefen-taime/llm-rag-mtl-public-hospital

This project develops a Retrieve-Augment-Generate (RAG) model to answer questions using public data from Google reviews of hospitals in Montreal

data google-reviews hopital hospital hub ia llm montreal open-source quebec rag

Last synced: 21 Apr 2026

https://github.com/thanh-wutan/chess-opening-comparator

Interactive web app using R to visualize and compare chess opening performance and popularity.

chess-openings data databases datavisualisation r

Last synced: 09 May 2026

https://github.com/schijioke-uche/data-analysis-with-python-an-spss-model

With this Python notebook algorithm, you can use an SPSS Model notebook to build machine learning pipelines that let you iterate rapidly during the model-building process in data analysis. Whether you're trying to find the right algorithm or experimenting with different ways of preparing your data, you can create reproducible research, with a hypothesis definition, that's easily understood by any member of your team.

anova cp4a cp4d cp4i cp4s data ibm ibm-cloud jeffrey-chijioke-uche jeffrey-solomon-chijioke-uche openshift python python3 redhat t-test

Last synced: 22 Apr 2026

https://github.com/sebastianbrzustowicz/flight-quality-overview-microservice

Go + Docker. Microservice that uses parallel computation to convert raw vehicle flight data into an overview report with visualisation.

container control csv data docker drone flight go goroutines http microservice parallel-computing pdf quadcopter raport rms sse vehicle

Last synced: 10 May 2026

https://github.com/grimen/python-humanizer

A human/developer friendly value humanizer - for Python.

data debug debugging format formatting humanize humanizer log logging print printing value

Last synced: 05 Jun 2026

https://github.com/syed-nihaal/car-price-prediction-and-performance-analysis

A data science notebook project focused on analyzing car features and building a model for car price prediction.

data data-analysis data-visualization jupyter-notebook python

Last synced: 23 Apr 2026

https://github.com/charlenry/python_data_science

My practical-exercise notebooks on Python for Data Science

analysis data dataframe jupyter kaggle matplotlib notebook numpy pandas pyplot python science seaborn visualisation

Last synced: 25 Jun 2026

https://github.com/yuvrajsaraogi/-iris-flower-classification

The iris flower has three species (setosa, versicolor, and virginica), which differ in their measurements. Now assume that you have the measurements of the iris flowers according to their species, and the task is to train a machine learning model that can learn from the measurements of the iris species and classify them.

classification data data-analysis data-science data-visualization flower flower-classification iris iris-classification iris-flower iris-flower-classification knn knn-classification machine-learning machine-learning-algorithms ml natural-language-processing nlp python

Last synced: 24 Apr 2026

https://github.com/cyberoctane29/python-for-data-analysis

A repository dedicated to learning Python for data analysis, data science, and data analytics. This collection of Jupyter notebooks covers practical exercises and concepts from the Google Advanced Data Analytics Professional Certificate program.

data data-analysis data-analytics data-science python

Last synced: 24 Apr 2026

https://github.com/fehmitahsindemirkan/web-scrapper

Professional, high-performance web scraping project.

data ecommerce emailsender fileexplorer logging python web webscraping

Last synced: 10 Jan 2026

https://github.com/mehmetkahya0/gallstone_dataset_analysis_project

Gallstone Disease (Gallstone-1) Dataset Analysis (https://archive.ics.uci.edu/dataset/1150/gallstone-1)

analysis analytics data data-analysis data-science data-visualization database graph matplotlib python

Last synced: 25 Apr 2026

https://github.com/xjwllmsx/hacker-news-engagement

Analyze Hacker News data to reveal which post types and posting hours spark the most discussion, using Python and a reproducible Jupyter notebook.

data data-analysis jupyter python

Last synced: 25 Apr 2026

https://github.com/marielachirinosr/bellabeat-wellness-data-trends

Analyzing smart device data for insights on user activity patterns to optimize interventions for better health outcomes.

data data-analysis data-visualization pandas python python3 tableau tableau-public

Last synced: 25 Apr 2026

https://github.com/shwetajanwekar/prediction-with-regression

Prediction with regression for the salary_hike and delivery time datasets

data data-science datset exploratory-data-analysis matplotlib pandas plot prediction r2-score seaborn sns

Last synced: 25 Apr 2026

https://github.com/davitshahnazaryan3/data-management-web

Explore datasets with ease using taxonomy filtering, allowing you to quickly identify the specific experimental datasets you need and download them effortlessly

data environmental experiments filtering-data seismic taxonomy

Last synced: 17 Jan 2026

https://github.com/jigyasag18/multiple-disease-detection-app

This repository contains the implementation of a Multiple Disease Detection System, which employs advanced machine learning techniques for early detection and prediction of prevalent diseases, including diabetes, heart disease, and Parkinson's disease. The system utilizes a variety of patient health metrics such as demographics and medical history.

data datapreprocessing machine-learning machine-learning-algorithms machinelearningmodel prediction python streamlit streamlit-webapp

Last synced: 07 Jun 2026