An open API service indexing awesome lists of open source software.

data

Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)

https://github.com/roggersanguzu/weather-medical-expense-prediction-ml-models

This repo contains one model for determining rainfall patterns and another for predicting medical expenses.

data data-analysis data-science datasets joblib machine-learning machine-learning-algorithms scikitlearn-machine-learning

Last synced: 30 Aug 2025

https://github.com/dataglyder/data-analysis-tools-to-get-you-started

This repository describes a few tools for a beginner Data Analyst.

analytics data python r sql

Last synced: 18 Apr 2026

https://github.com/woctezuma/hidden-gems-data

Data available to compute regional rankings of hidden gems.

data hidden-gems steam steam-reviews

Last synced: 06 Feb 2026

https://github.com/eugenedakin/des-encryption-decryption

Encrypt and Decrypt text in Xojo using DES - Written in Native Xojo Language - Cross Platform

data data-encryption-standard decryption des encryption standard xojo

Last synced: 24 Feb 2026

https://github.com/eshitakundu/disease-outbreak-predictor

Disease Outbreak Predictor: A Streamlit-based web application for predicting diabetes, heart disease, and Parkinson's disease using machine learning models.

data data-science disease-prediction healthcare-application jupyter-notebook machinelearning ml notebook prediction python streamlit streamlit-webapp

Last synced: 01 May 2026

https://github.com/karensaraimoralesmontiel/8-week-sql-challenge

Case Studies Solutions for the 8-Week-SQL-Challenge.

data database sql

Last synced: 02 Jan 2026

https://github.com/ferru97/jsketchfabcrawler

jSketchfabCrawler is a Java tool for automatically crawling model information from sketchfab.com

crawler data database java sketchfab sql

Last synced: 03 Jan 2026

https://github.com/peterhellberg/bugsnag-data

Dump Bugsnag data using the Data access API

bugsnag data go

Last synced: 22 Jun 2026

https://github.com/tasosfotiadis/time-series-forecasting-for-bitcoin

This project forecasts Bitcoin’s daily closing price using time series models. Data from Jan 2021 to Mar 2022 is processed by converting timestamps, resampling, and handling missing values. LSTM and ARIMA models are evaluated on MAE, RMSE, and MAPE, with LSTM achieving better accuracy while ARIMA is faster in training and inference.

arima bitcoin data data-analysis data-science deep-learning forecasting jupyter-notebook neural-networks python time-series

Last synced: 06 May 2026
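
For readers unfamiliar with the three error metrics named in the entry above (MAE, RMSE, MAPE), here is a minimal sketch of how they are commonly computed. It is not taken from the repository; the function names and the sample prices are hypothetical, and MAPE is shown in percent and assumes no actual value is zero.

```python
import math


def mae(actual, predicted):
    """Mean absolute error: average size of the errors, in price units."""
    return sum(abs(a - p) for a, p in zip(actual, predicted)) / len(actual)


def rmse(actual, predicted):
    """Root mean squared error: penalizes large errors more than MAE does."""
    squared = sum((a - p) ** 2 for a, p in zip(actual, predicted))
    return math.sqrt(squared / len(actual))


def mape(actual, predicted):
    """Mean absolute percentage error, in percent (actual values must be non-zero)."""
    ratios = sum(abs((a - p) / a) for a, p in zip(actual, predicted))
    return 100.0 * ratios / len(actual)


# Hypothetical closing prices and forecasts, for illustration only.
actual = [100.0, 200.0, 400.0]
predicted = [110.0, 190.0, 400.0]

print(mae(actual, predicted), rmse(actual, predicted), mape(actual, predicted))
```

MAE and RMSE are in the same units as the price, while MAPE is scale-free, which is one reason forecasting comparisons often report all three.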

https://github.com/debjyotisaha/tableau-projects-phase-2

Published interactive dashboards on Tableau Public, highlighting expertise in data visualization and storytelling through analyses of transportation patterns, sales trends, and demographic studies. These projects showcase the ability to transform complex datasets into actionable, intuitive visuals for decision-making.

dashboards data data-analysis data-visualisation tableau

Last synced: 26 Aug 2025

https://github.com/0xnu/data-analyst-training

The repository contains training materials for data analysts.

data data-analysis data-analyst

Last synced: 25 Aug 2025

https://github.com/franckalbinet/maris-crawlers

Automated data harvesting of MARIS data sources

automation data marine-radioactivity

Last synced: 25 Aug 2025

https://github.com/hafs96/prediction_consommation-de-carburant

The goal of this project is to develop a model that predicts whether a car has high or low fuel consumption based on its technical characteristics.

analysis data data-visualization machine-learning testing training

Last synced: 09 Jun 2026

https://github.com/aimin-nur/data-analyst-model-predictive

A data analyst project that aims to identify the characteristics of customers likely to accept marketing campaign offers.

analyst data mechine-learning visualization

Last synced: 29 Jan 2026

https://github.com/mubashirsidiki/olympics-data-enigeering

Worked with Azure Data Factory, Databricks, Data Lake Storage, and Synapse Analytics to build an ETL pipeline for processing and analyzing Olympic Games data from Kaggle.

analytics azure big-data data dataengineering devops pipeline

Last synced: 02 May 2026

https://github.com/paulrosset/cyclone

Network data consumption monitoring

data monitoring network networking

Last synced: 23 Aug 2025

https://github.com/s1dewalker/electric-future

Visual Analysis: Future of Automotive Industry

data data-visualization machine-learning python3 regression-analysis tableau

Last synced: 02 May 2026

https://github.com/rationalprabal/book-management-app

A Node.js and Express.js application for managing books, featuring role-based authentication and authorization with JWT, file uploads for book cover pages, robust data validation, and documentation using Swagger. The project includes user roles such as Admin, Author, and Reader, each with specific permissions.

data expressjs jwt-authentication mongodb mongoose nodejs rbac-roles

Last synced: 10 Apr 2026

https://github.com/restricted/redis-data-cache

TypeScript implementation of data cache management by class name

cache data object redis state typesript

Last synced: 30 Jan 2026

https://github.com/lut-ful/pizza-sales-report

This Pizza Sales Report provides valuable insights into sales performance through detailed analysis and visualizations. By leveraging Power BI and SQL Server

data data-wrangling microsoft-sql-server power-bi power-bi-dax python

Last synced: 30 Jan 2026

https://github.com/hakusaro/facts

A fact based knowledge system (FBKS) experiment.

data facts hacktoberfest

Last synced: 03 Jan 2026

https://github.com/vbhatsaccnt/retail-strategy-and-analytics-optimization-of-control-stores-for-sales-enhancement

In this project, we aim to optimize the performance of retail chain stores by establishing control stores based on their performance compared to selected trial stores. By leveraging data analytics and strategic insights, we seek to enhance sales revenue and drive growth within the retail chain.

customer-segmentation data data-science risk-analysis

Last synced: 13 May 2026

https://github.com/lucavallin/uppa

Unsplash Photo Performance Analyzer.

analyzer data photo python unsplash

Last synced: 05 Oct 2025

https://github.com/denisecase/dc-texter

Send a text message using Python

alerts data python sms-messages streaming

Last synced: 08 Feb 2026

https://github.com/pythoncoderunicorn/jamesbeardaward

A repo for James Beard Award data

data dataset jamesbeard

Last synced: 07 Feb 2026

https://github.com/yugsumeet17/churn-analysis-project--power-bi-sql-machine-learning

Dataset Explained, Project Goals & Metrics Required, SQL Server ETL & Data Cleaning, Power BI Data Load, Transformation, Blueprint & Measures, Power BI Visualization - Summary Page, Building Machine Learning Model - Random Forest, Power BI Visualization - Churn Prediction Page

data data-visualization dataanalytics excel postgresql powerbi python3

Last synced: 03 May 2026

https://github.com/seqeralabs/ffq-api

A minimal wrapper to make ffq searches available via a REST API.

api data fastq fetch-fastq ffq genomics

Last synced: 15 Aug 2025

https://github.com/jleung51/foundations-dags

Data ETL pipeline to clean, process, and aggregate data from Canadian housing starts.

data data-engineering etl extract housing load pipeline transform

Last synced: 04 Oct 2025

https://github.com/aaisha-nexus/sql_company_insights

A beginner-friendly SQL project for managing employee records, departments, and sales transactions. Includes table creation, optimized queries, stored procedures, and window functions to extract business insights.

business-analytics data data-analysis dataanalysis-projects dataanalytics database-schema mssql-database query relational-databases sql sql-query ssms

Last synced: 12 Aug 2025

https://github.com/fallaciousreasoning/nz-mountains

A list of mountains in NZ, scraped from https://climbnz.org.nz

alpine climbing climbnz data json json-api maps mountaineering scraping

Last synced: 04 May 2026

https://github.com/ymorsi7/quranicvisualization

A visual exploration tool for the Holy Quran using D3.js treemaps.

css d3 d3js data data-visualization html islam islamic javascript js quran quranic treemaps visualization

Last synced: 15 Apr 2026

https://github.com/interzoid/php-examples

Provides PHP examples for consuming several of the Cloud APIs available from Interzoid, including company name matching, individual name matching, weather, page performance, email validation, currency rates/FOREX, and global telephone information.

api cloud data database php quality

Last synced: 12 Jan 2026

https://github.com/michaelfromyeg/lyrics

Lyric-store and API hosted on Git.

data lyrics

Last synced: 08 Feb 2026

https://github.com/keziatbnn/supervised-regression-salaryprediction

Make salary predictions based on years of experience using supervised regression.

data data-analysis-python data-prediction data-science python

Last synced: 11 Aug 2025

https://github.com/andrii04/andreamonforte-bi-assignment

Automated Data Pipeline that ingests daily GA4-formatted CSV files from a private Google Cloud Storage bucket, validates and loads them into BigQuery, and prepares analysis-ready views. The solution is built for deployment as a Cloud Function triggered by Cloud Scheduler and uses Python with the Google Cloud Storage and BigQuery client libraries.

automation bigquery cloud cloudfunctions data data-analysis data-engineering etl etlpipeline gcp google googlecloudplatform pipeline python sql

Last synced: 09 Nov 2025

https://github.com/matt-dray/draytasets

🔢🥸 Miscellaneous datasets I've collected or prepared

card-games data phd pokemon

Last synced: 09 Feb 2026

https://github.com/srindot/fwuav-average-flight-data-collection

This repository is designed for collecting average data for a flapping wing UAV. The script acg_coeff_data_collection.py runs the necessary data collection, and the resulting data is saved into a CSV file called AverageFlightData.csv.

data flaping-uav

Last synced: 10 Aug 2025

https://github.com/myles-parfeniuk/esp32_sdlogger

C++ esp-idf driver component for SD cards interfaced via SPI. WIP

card data esp-idf esp32 logger sd sdcard sdmmc sdspi spi

Last synced: 09 Feb 2026

https://github.com/4ment/aiv-rate-heterogeneity

Avian influenza virus data sets

data influenza

Last synced: 24 Jan 2026

https://github.com/parzibyte/jsonp-php

JSONP example with PHP

data example json jsonp php request

Last synced: 04 May 2026

https://github.com/enescidem/twitter-topic-modeling

Topic modeling is an unsupervised method to identify topics in text. This project analyzes tweets from prominent Turkish accounts to uncover underlying themes in their shared content.

data data-science machine-learning nlp topic-modeling twitter x

Last synced: 10 Feb 2026

https://github.com/brayflex/spy-sector-rotation-google-sheet

Creates a dynamic spreadsheet to visualize SPY and its 11 largest sector ETFs. See market trends and identify potential sector rotation opportunities.

data etf google-sheets index price rotation script sector spreadsheet spy stock-market

Last synced: 29 Jun 2026

https://github.com/os-climate/data-requests

This repo is used to track issues related to new Data Requests

data data-engineering dataset

Last synced: 27 Feb 2026

https://github.com/fabsdevx/files-to-database-loader-handout

Data Engineering project for learning purposes. Credits to itversity

csv data data-engineering database json pandas python

Last synced: 09 Apr 2026

https://github.com/utrechtuniversity/momentum-dataflow

Repository for publishing website about data management practices of the Momentum project

data datageneration datamanagement

Last synced: 27 Feb 2026

https://github.com/chompfoods/sdk-java

Java SDK for the Chomp Food & Recipe Database API. Use our API to get high-quality data on recipes and 875,000+ branded/grocery foods plus raw ingredients.

api branded chomp data database food gradle grocery ingredients jar java java-sdk nutrition openapi raw recipe-api recipes sdk

Last synced: 09 Apr 2026

https://github.com/bastianolea/sicvir_indicadores_rurales

Rural Quality of Life Indicators System (Sistema de Indicadores de Calidad de Vida Rural, Sicvir)

chile comunas data estado rural social

Last synced: 27 Feb 2026

https://github.com/ppabam/eda-bam

Navigating data from one thing to another.

cli data eda python

Last synced: 11 Feb 2026

https://github.com/praveendecode/retail-revenue-forecasting

Designed an end-to-end ML model pipeline, from data collection to inference, that forecasts department-wide sales while accounting for holiday markdown effects.

azure collection data datapreprocessing docker exploratory-data-analysis feature-engineering featureimportance model modelbuilding modeldeployment modelselction python report tableau

Last synced: 16 Apr 2026

https://github.com/iliyasalve/cyclistic_case_study

Analysis of the Bike-Sharing System for the following question: "How do annual members and casual riders use Cyclistic bikes differently?"

bike-sharing data data-analysis data-visualisation r

Last synced: 06 Apr 2025

https://github.com/sourceduty/text_file_metadata

📄 Extract metadata from .txt files and record the metadata in .txt files.

data datascience metadata metafile practice sourceduty

Last synced: 08 Aug 2025

https://github.com/a-poor/datatransform.jl

A package for defining (and performing) tabular-data transformations with JSON.

data data-science data-transformation etl feature-engineering json julia julia-package tabular-data

Last synced: 05 May 2026

https://github.com/kirillsemyonkin/lsd

LSD (Less Syntax Data) configuration/data transfer format.

configuration data java parsing rust

Last synced: 27 Feb 2026

https://github.com/sourceduty/cults_3d

🔢 Software concept for computing additional statistics in Python from Cults design data .csv files.

3d 3d-model 3d-model-software 3d-modelling account account-management concept cults cults-3d data idea sourceduty

Last synced: 08 Aug 2025

https://github.com/pawamoy/keycut-data

Keyboard shortcuts data stored in YAML files

data keyboard-shortcuts

Last synced: 12 Feb 2026

https://github.com/foundationallm/.github

A platform accelerating delivery of secure, trustworthy enterprise copilots.

agent ai data enterprise generative-ai large-language-model llm ml tool

Last synced: 12 Feb 2026

https://github.com/rdmurphy/deno-quaff

A port of the quaff Node.js library to Deno.

archieml csv data deno json toml yaml

Last synced: 05 May 2026

https://github.com/chanchalsoorma/web-scraping

This repo aims to provide a straightforward, easy-to-use scraping code written in Python.

beautifulsoup beautifulsoup4 data python request selenium webscraping

Last synced: 05 May 2026

https://github.com/shibbbbs/fastapi_project

A FastAPI application that reads financial data from an Excel file (capbudg.xls) and provides API endpoints to list available tables (sheet names), fetch row names from a selected table, and calculate the sum of numerical values from a specified row. The API is accessible via a web-based interactive documentation at /docs

data dataanalysis fastapi pandas python

Last synced: 06 May 2026

https://github.com/0xbitx/dedsec_pastebin-cli

Allows you to manage your pastes directly from the terminal

code data paste pastebin payload

Last synced: 25 Jan 2026

https://github.com/Alpine418/DataHandler

Data handler for PHP arrays.

data data-handler php73

Last synced: 01 Oct 2025

https://github.com/sanand0/iss-location

Tracks the International Space Station position. A demo of how to use GitHub Actions to schedule commits weekly.

data

Last synced: 14 Feb 2026

https://github.com/adilsaid64/real-time-data-monitoring

Exploring what a real-time data drift monitoring solution could look like within MLOps

data datadrift grafana machine-learning mlops mlops-workflow prometheus python software-engineering

Last synced: 04 Aug 2025

https://github.com/molinsagustin/cinedata

CineData: Group practical assignment for the course Ingeniería de Datos I (Data Engineering I) at Universidad Argentina de la Empresa. It consisted of developing a relational database in Microsoft SQL Server Management Studio using the Agile Scrum methodology, which was applied from requirements gathering through final implementation.

agile data data-modeling database diagram entity-relationship-diagram microsoft-sql-server relational-databases relational-model scrum scrum-agile sql sqlserver

Last synced: 28 Feb 2026

https://github.com/sunnahboy/checkfake_true_news

Building data structures using linked lists and arrays, and finding the best algorithms for implementing a fake news detection system

algorithms data level low programming structure

Last synced: 28 Feb 2026

https://github.com/ddeepanshu-997/support_vector_regression--svr-

In this repository I performed support vector regression on real-life data. I first applied data preprocessing techniques to filter out flaws in the data, then built the model (SVM regression) to produce a machine learning regression model.

data data-science regression-analysis regression-models svm-model svm-regression

Last synced: 03 Aug 2025

https://github.com/ims94/ballerina-tsv-querying

An example Ballerina project to query TSV data using Ballerina language-integrated queries

ballerina ballerina-lang data olympics query sql

Last synced: 03 Feb 2026

https://github.com/madhuresh2011/genai-powered-data-analytics-by-tata

I recently participated in Tata iQ's job simulation on the Forage platform, and it was incredibly useful to understand what it might be like to be on a data analytics team in an AI transformation consulting role.

chatgpt data dataanalytics eda excel gemini generative-ai internships powerpoint presentation

Last synced: 14 Feb 2026

https://github.com/badranalyst/covid-deaths-and-vaccinations-sql-data-exploration

This project involves exploratory data analysis on COVID-19 deaths and vaccinations data using SQL. It aims to uncover trends, patterns, and insights related to vaccination rates and their impact on mortality. The analysis provides a clearer understanding of the pandemic's dynamics, facilitating data-driven decisions in public health.

covid-19 data data-exploration dataset sql

Last synced: 19 Feb 2026

https://github.com/nmelgar/marathons_data_viz

Data visualization project to analyze finishing times and other data.

csv csv-files data data-analysis data-insight data-visualization data-viz dataset tableau

Last synced: 15 Feb 2026

https://github.com/plurid/datasign

Single Source of Truth Data Contract Specifier

data file-format

Last synced: 08 Nov 2025

https://github.com/nagar2nd/ml-regressionmodel---cardekho-price-prediction

This repository features a machine learning model for predicting used car prices using data from CarDekho.com. The project leverages exploratory data analysis and regression techniques to empower sellers and buyers with actionable insights in the Indian used car market.

analytics cleaning-data data linear-regression machine-learning matplotlib numpy pandas python seaborn

Last synced: 16 Apr 2026

https://github.com/xljones/bugsnag-exporter

Export Bugsnag project, error, and event data easily from a command-line call that automatically handles pagination and API backoffs

bash bugsnag cmd csv data error error-capture error-handling error-reporting event export go golang json project zsh

Last synced: 06 May 2026

https://github.com/edjoukou/human_resources

A data analysis project using MySQL Server database

analysis data mysql powerbi sql visualization

Last synced: 25 Sep 2025