An open API service indexing awesome lists of open source software.

data

Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)

https://github.com/ksimicevic/discord-message-analyzer

Analyzing discord messages in Jupyter notebook

analysis data discord messages

Last synced: 16 Apr 2026

https://github.com/ahmad-ali-rafique/heart-disease-detection-model

A comprehensive project for detecting heart disease using machine learning, including data processing, model training, and evaluation metrics with AUC curve analysis.

artificial-intelligence data datascience heart-disease machine-learning modeling prediction-model

Last synced: 11 Aug 2025

https://github.com/srindot/fwuav-average-flight-data-collection

This repository is designed for collecting average data for a flapping wing UAV. The script acg_coeff_data_collection.py runs the necessary data collection, and the resulting data is saved into a CSV file called AverageFlightData.csv.

data flaping-uav

Last synced: 10 Aug 2025

https://github.com/jigyasag18/amazon-prime-power-bi-dashboard

The Amazon Prime Power BI Project is a centralized data storage system containing detailed information on movies and TV shows available on Amazon Prime Video, including metadata and analytics insights. It supports data-driven decision-making for content acquisition and viewer engagement strategies. This repo is optimized for querying & analysis.

dashboard data data-visualization dataanalysis dataanalytics datacleaning dataset powerbi powerbi-dashboards powerbi-report powerbi-visuals powerbidashboard

Last synced: 05 Mar 2026

https://github.com/ometman/vet-clinic

This is a database project for vetinary data management for animals, owners, clinic employees and visits; and applicable to any data management need. It uses Postgresql, a relational database management system. It allows storing, updating and querying.

data database normalization postgresql postgresql-database queries sql sql-server-database tables transactions

Last synced: 13 May 2026

https://github.com/0xkibh/datamining-algo

This repository consist data mining algorithm implementation example in python

apriori-algorithm data datamining fp-growth python

Last synced: 19 May 2026

https://github.com/chubek/pyramid-dashboard

A Dashboard to Show Data Made Using Plotly Dash

dash data docker ml plotly plotly-dash python

Last synced: 19 May 2026

https://github.com/chompfoods/sdk-java

Java SDK for the Chomp Food & Recipe Database API. Use our API to get high-quality data on recipes and 875,000+ branded/grocery foods plus raw ingredients.

api branded chomp data database food gradle grocery ingredients jar java java-sdk nutrition openapi raw recipe-api recipes sdk

Last synced: 09 Apr 2026

https://github.com/mapaor/horaris-rodalies

Web que utilitza la API de rodalies de Catalunya per mostrar els horaris d'una manera més divertida

adif api ave barcelona bordils catalunya dades data distancia generalitat girona horaris md r11 regional renfe rodalies sants tren viajes

Last synced: 16 May 2026

https://github.com/writetome51/public-data-container-interface

Just a TypeScript interface with 1 property: 'data'

container data interface typescript

Last synced: 15 May 2026

https://github.com/natarizkie2/neurochain-airdrop-bot

🍋 — A smart bot designed to complete data tasks like true/false selections automatically, with multi-account support for extra convenience.

airdrop automated bot data multi-account natarizkie neurochain nodejs web3

Last synced: 10 Jun 2026

https://github.com/cleanzr/cd

CD dataset for Entity Resolution

data linkage

Last synced: 10 Mar 2026

https://github.com/gaemapiracicaba/norma_dec_8468-76

Padrões de qualidade e lançamento de efluentes de águas interiores

data python

Last synced: 19 Apr 2026

https://github.com/shsiddhant/womens-wc

ML project to predict match outcomes for Women's Cricket World Cup 2025.

cricket-prediction data feature-engineering postgresql python

Last synced: 04 Apr 2026

https://github.com/ahmad-ali-rafique/decision-tree-regressor-modeling

Comprehensive exploration of decision tree regressors, including data cleaning, model building, and performance evaluation on various datasets.

artificial-intelligence data data-analysis dataanalytics decision-trees decisiontreeregressor modeling models regression-models

Last synced: 17 Apr 2026

https://github.com/mbagalman/lattice-doe

Python code to create experimental designs optimized to meet statistical power targets

abtesting data datascience designofexperiments experimentaldesign statistics

Last synced: 19 Jun 2026

https://github.com/sourceduty/text_file_metadata

📄 Extract metadata from .txt files and record the metadata in .txt files.

data datascience metadata metafile practice sourceduty

Last synced: 08 Aug 2025

https://github.com/zurd46/zurdsynthdatagen

This Electron project uses the OpenAI ChatCompletion API to generate synthetic datasets in either German (DE) or English (EN).

data data-structures dataset electron json jsonl nodejs openai synthetic

Last synced: 04 Apr 2026

https://github.com/mipacd/holochatstats

A VTuber chat log (and general) analytics platform

data flask hololive postgresql python visualization vtuber youtube

Last synced: 05 Apr 2026

https://github.com/stimulsoft/samples-dashboards.web-for-blazor-webassembly

Blazor WebAssembly (Wasm) samples for Reports.BLAZOR embedded components, Visual Studio C# projects, .NET 6, .NET 7, .NET 8 dashboards tool

blazor client-side converter dashboard data data-analysis data-sources database datagrid designer diagram dimension json net presentation print runtime viewer wasm webassembly

Last synced: 18 Apr 2026

https://github.com/huemulsolutions/huemul_sql_decode

Obtiene los campos y tablas utilizados en una sentencia SQL

bigdata chile data data-governance governance spark sql

Last synced: 19 Apr 2026

https://github.com/flexthink/matricize

A convenience library to convert between pure Python objects and their vectorized representations

data machine-learning numpy python

Last synced: 09 May 2026

https://github.com/master-helix/ibm-data-analyst-certification-stock-analysis-project

This is a mini project repository of my IBM Certification involving stock analysis and plotting of Tesla and GameStop

analytics data data-analysis data-visualization ibm matplotlib pandas python web-scraping

Last synced: 09 May 2026

https://github.com/montanaz0r/suicide-rate-analysis

Testing a significance of the correlation between a suicide rate and a number of psychiatrists and psychologists working in the mental health sector

analysis correlation data data-analysis data-science jupyter-notebook jupyter-notebooks matplotlib numpy pandas psychology python python-3 seaborn statistics suicide-rate

Last synced: 20 Apr 2026

https://github.com/arda-guler/binmotion

Convert ANY data to a video file. Sister project of binGallery.

data data-visualization proof-of-concept video

Last synced: 04 Jun 2026

https://github.com/alexyiann/finance

In this repository you can find scripts for pulling data and comparing them , but you can also find simple python scripts to automate trades on Crypto and back testing trading strategies on both crypto and stocks .

api bots data database finance option option-strategies strategy trading trading-algorithms

Last synced: 03 Jan 2026

https://github.com/rick-does/json-razor

Reduces JSON, YAML, and NDJSON volume by collapsing repeated structures while preserving the schema, making the schema easier for you to read.

cli data devtools json logs ndjson schema yaml

Last synced: 20 Apr 2026

https://github.com/neptun-software/neptun.data.generators

Send scraped data from neptun-scraper to CHATGPT to generate training data for NEPTUN.AI.

data generator

Last synced: 30 Jul 2025

https://github.com/sourceduty/data_marketer

💰 Analyze uploaded data and prepare a data marketing plan for selling data. Create data product plans.

ai ai-data ai-tool artificial-intelligence business chatgpt company custom-gpt customgpts data data-business data-market data-marketer data-marketing data-tool gpt gpt-store gpts gptstore openai

Last synced: 03 Sep 2025

https://github.com/yashkp1234/movie-recommendation-engine

My project on analyzing the movie data set, and creating a recommendation engine using that analysis.

analysis data notebook python recommendation-engine

Last synced: 04 May 2025

https://github.com/mozzo1000/web-analytics

Website analysis tools and data

analysis analytics data website

Last synced: 21 Apr 2026

https://github.com/renebentes/2806

Curso 2806 - Acesso à dados com C#, .NET 5, Dapper e SQL Server

csharp dapper data dotnet sqlserver

Last synced: 19 Apr 2026

https://github.com/jdenn0514/surveycore

Core Survey Analysis Infrastructure

data r resear survey-analysis

Last synced: 21 Apr 2026

https://github.com/dms-codes/scrape-tokoalvabet-com

Toko Alvabet Data Scraping and Price Comparator This Python script is designed to scrape data from Toko Alvabet's website and perform price comparison for the obtained products. It includes features for viewing and analyzing product data, as well as comparing prices with other sellers.

data price python scraping

Last synced: 29 Jul 2025

https://github.com/rbcavi/factorio-mod-data

The modpacke data for factorio-viewer

data factorio factorio-data factorio-mod-data

Last synced: 23 Apr 2026

https://github.com/pixlcrashr/stwhh-mensa

Better STWHH Mensa menu data / interface / notifier

api crawler data food studierendenwerk-hamburg university website

Last synced: 07 Aug 2025

https://github.com/ppatrzyk/heatmap

Display CSV as a heatmap in terminal

csv data data-visualization terminal

Last synced: 24 Apr 2026

https://github.com/howwohmm/fetchgram

era-adjusted Instagram content intelligence — scrape any public profile, OCR every image, measure what actually works. free, local, no API keys.

analytics cli content-strategy data instagram ocr python scraper

Last synced: 06 Jun 2026

https://github.com/hruth-vik/sales-analysis-report

SalesScope is a powerful sales analytics dashboard that extracts insights, reveals trends, and drives strategy from raw data.

analytics data powerbi-report powerbi-visuals python

Last synced: 24 Apr 2026

https://github.com/scjoaoantonio/trab_datascience

Este projeto tem como objetivo analisar os posts da rede social Bluesky. A aplicação interativa foi desenvolvida utilizando Streamlit e permite a coleta e visualização de dados, além de oferecer análises avançadas como previsão de engajamento, modelagem de tópicos e análise de sentimentos.

bluesky data data-science streamlit

Last synced: 09 May 2026

https://github.com/xjwllmsx/hacker-news-engagement

Analyze Hacker News data to reveal which post types and posting hours spark the most discussion, using Python and a reproducible Jupyter notebook.

data data-analysis jupyter python

Last synced: 25 Apr 2026

https://github.com/carlos-levi/twitterbots_analise_redesneurais

Projeto para a disciplina de IA - análise exploratória e aplicação de técnicas de aprendizado de máquina para detectar contas automatizadas (bots) na plataforma 𝕏 (Twitter)

data machine-learning twitter-bot

Last synced: 06 Jun 2026

https://github.com/sebastianbrzustowicz/flight-quality-overview-microservice

Go + Docker. Microservice with parallel computations to convert raw vehicle flight data into overview raport with visualisation.

container control csv data docker drone flight go goroutines http microservice parallel-computing pdf quadcopter raport rms sse vehicle

Last synced: 10 May 2026

https://github.com/marielachirinosr/hotel-data-analysis

Pandas & Matplotlib Learning Analysis. Repository featuring data analysis projects using Pandas and Matplotlib libraries

data data-analysis matplotlib pandas python

Last synced: 25 Apr 2026

https://github.com/Alpine418/DataHandler

Data handler for PHP arrays.

data data-handler php73

Last synced: 01 Oct 2025

https://github.com/svetlanam/kbl-to-csv-s3

Keboola extractor, that converts excel to CSV based on input mapping criteria and upload to S3 bucket

data data-cleaning data-transformation etl keboola s3-bucket

Last synced: 20 Jun 2026

https://github.com/tsbarr/citi-bikes-challenge

Citibikes NYC Data Analysis: Uncover insights from over a decade of ride data. Jupyter notebook for data aggregation/cleaning & Tableau dashboards for interactive visualization.

data data-visualization pandas-python python tableau

Last synced: 27 Apr 2026

https://github.com/theprodigyleague/d1g174lx534f00d

react/node bootstrapped project for a digi(company){["SEAFOOD"]}

bootstrap companies data data-conduit digital digital-seafood java javascript node project react seafood

Last synced: 01 Oct 2025

https://github.com/o-rumiantsev/exchange

Data Exchange System (Prototype)

chat css data exchange system websocket

Last synced: 27 Apr 2026

https://github.com/chompfoods/stub-inflector

Inflector server stub for the Chomp Food & Recipe Database API. Use our API to get high-quality data on recipes and 875,000+ branded/grocery foods plus raw ingredients.

api branded chomp data database food grocery inflector ingredients nutrition raw recipe-api recipes server stub stub-inflector stub-server

Last synced: 27 Apr 2026

https://github.com/elissorokin/data-analyst-portfolio

Это репозиторий, в котором я демонстрирую свои навыки, делюсь проектами и отслеживаю прогресс в области анализа данных и Data Science.

ab-testing data data-analysis datalense matplotlib numpy pandas plotly portfolio postgresql python scipy seaborn sql statistical-analysis

Last synced: 09 Apr 2026

https://github.com/hemangsharma/assignment-2---classification-models

Assignment 2 - Classification Models repository contains project for 36106 Machine Learning Algorithms and Applications

data datascience-machinelearning machine-learning ml

Last synced: 10 Jun 2026

https://github.com/e22m4u/ts-data-schema

Валидация данных и приведение типов для TypeScript

data schema typescript validation

Last synced: 05 Aug 2025

https://github.com/haimonmon/j3mify

Convert your jejemon word into a formal sentence or word

data jejemon nlp normalization python regex tagalog tokenization

Last synced: 12 Oct 2025

https://github.com/n-ce/localstorage-data-interchange-manager

Implementation of local storage data interchange using map data structure.

data export import javascript js-maps json localstorage

Last synced: 28 Apr 2026

https://github.com/brightway-lca/bw_io

IO tools for Brightway LCA framework

bw3 data life-cycle-assessment python

Last synced: 10 Jun 2026

https://github.com/kfrural/customer-churn-prediction

Customer churn prediction using machine learning. The project follows CRISP-DM and KDD methodologies, including data preprocessing, feature engineering, modeling, and evaluation. It also features an interactive dashboard for visualizing results.

crisp-dm data jupyter kdd python

Last synced: 29 Apr 2026

https://github.com/mumtaz4118/scraping-medium-and-data-analytics

The file DataExtraction.py extracts information from the json files scrapped by the scrapper medium_scrapper_post.py. To extract information from json files scrapped by medium_scrapper_tag_archive.py (scrapping from tags archive) then use Data_Extraction_Archive_Tags.py

data data-analysis data-analytics data-extraction data-preprocessing data-science data-scraping deep-learning machine-learning python

Last synced: 29 Apr 2026

https://github.com/mr-dhan/eda-sales-customer-transactions

Dalam dunia bisnis ritel yang kompetitif, pemahaman mendalam terhadap perilaku pelanggan merupakan fondasi penting untuk pengambilan keputusan strategis. Namun, data transaksi pelanggan seringkali berjumlah besar dan kompleks, sehingga memerlukan proses analisis yang efektif untuk mengungkap insight yang berharga.

dashboard data data-analysis data-analysis-python data-science data-visualization eda python

Last synced: 29 Apr 2026

https://github.com/entorb/analyze-ha-energy

Analyze Home Assistant Solar Production Data

data home-assistant pandas photovoltaic pv python

Last synced: 08 May 2026

https://github.com/gurpreet0022/airbnb-eda

EDA on Airbnb booking data to uncover valuable insights, trends, and patterns

data data-science dataanalytics insights jupyter-notebook matplotlib numy pandas projects python3 seaborn visualization

Last synced: 11 May 2026

https://github.com/musamairshad/dsa-python

This repository contains all the material related to Data Structures and Algorithms implemented in Python.

algorithms data datastructures efficiency python searching-algorithms sorting-algorithms

Last synced: 25 Mar 2025

https://github.com/living-with-machines/zoonyper

Code to make it easy to import and process Zooniverse annotations and their metadata in Python/Jupyter Notebooks

crowdsourcing data data-processing data-science python zooniverse

Last synced: 04 Jul 2025

https://github.com/deliprofesor/breast-cancer-detection-using-svm-with-smote-and-model-optimization

This project analyzes health and lifestyle factors influencing heart attack risk using statistical methods and machine learning, with Ridge Regression identified as the best predictive model.

classification data data-preprocessing data-science data-visualization gridsearchcv machine-learning python roc-curve smote svm

Last synced: 10 Apr 2025

https://github.com/grace-mengke-hu/redditpushshiftapi

This package is for collecting Reddit dataset and organize the data in Mongo Database

collection data reddit

Last synced: 13 Jun 2025

https://github.com/braiso-22/ejercicio-seguro-medico

Ejercicio de acercamiento a los datos para hacer predicciones

data data-science dataset ia insurance jupyter-notebook ml python python3

Last synced: 24 Apr 2026

https://github.com/quangandrei1003/france_air_pollution_pipeline

End-to-end air pollution data pipeline for French metropolitan cities using Airflow, Python, dbt, BigQuery.

airflow bigquery data data-analytics data-engineering data-modeling data-visualization dbt docker etl pandas python terraform

Last synced: 13 Apr 2026

https://github.com/white-gecko/lineage-dump

RDF dump of the device information from the lineage wiki

data dataset lineageos rdf

Last synced: 28 May 2026

https://github.com/smac-group/smacdata

Data sets used in various packages.

data r

Last synced: 02 Apr 2025

https://github.com/koltyakov/pgcopy

🐘 PostgreSQL data migration tool

cli data database golang migration postgresql sync

Last synced: 29 Apr 2026

https://github.com/burythehammer/foosbot-results

Foosball results for the OpenCredo foosbot

data foosball machine-learning python

Last synced: 13 Apr 2026

https://github.com/scx567888/scx-data

✨ SCX Data

data java scx

Last synced: 05 Apr 2025

https://github.com/rishikesh-jadhav/track_deep_learning

Data collected from the Udacity simulator comprising RGB images with steering and throttle annotations for each frame, specifically gathered for behavioral cloning purposes.

data datacollection udacity-self-driving-car

Last synced: 03 Jan 2026

https://github.com/jerboaburrow/uk-counties-and-unitary-authorities-may-2023-geojson

UK "Counties" Extracted from Office for National Statistics data

data geojson maps uk

Last synced: 29 Mar 2025

https://github.com/jstafford5380/provausio.testing.generators

Generate fake data for testing and/or mocking

data fake-data generator testing

Last synced: 14 Jan 2026

https://github.com/afolabi022/getting-and-cleaning-data-course-project

Tidy Dataset Creation for Human Activity Recognition" This repository contains the code and files for cleaning and transforming the Human Activity Recognition Using Smartphones dataset into a tidy format. The project demonstrates data wrangling skills in R, including merging datasets

data data-science datacleaning r

Last synced: 25 Mar 2025