An open API service indexing awesome lists of open source software.

data

Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)

https://github.com/hit07/fitgpt-hacksc

AI-Powered Fitness Coach; 🥈 Runner-up at HackSC's SoCal Tech Week hackathon

data elasticsearch gpt-4o-mini llm pipeline

Last synced: 28 Feb 2025

https://github.com/sharoonjoseph321/insurance_fraud_detection

Fraud detection using the K-Nearest Neighbors machine learning algorithm. Data exploration using PySpark and matplotlib.

analytics data data-science eda high-performance knn-algorithm knn-classification machine-learning matplotlib-pyplot pyspark python seaborn spark statistics

Last synced: 23 Mar 2025

https://github.com/ccworld1000/cccomposition

CCComposition for code style; code style conversion work accepted.

cccomposition composit construction data structure visual

Last synced: 04 Jan 2026

https://github.com/open-geodata/sp_bh_pcj-2020-2035

Spatial data from the Agência das Bacias PCJ, with information presented in the Plano de Bacias 2020-2035 (2020-2035 Basin Plan)

data python

Last synced: 16 Jan 2026

https://github.com/sanchittechnogeek/overscripted-analysis

Geolocation and user language extraction analysis from Mozilla Overscripted dataset

analysis data data-analysis mozilla

Last synced: 23 Mar 2025

https://github.com/pythoncoderunicorn/tool-discography

Music Band TOOL albums and songs dataset

data data-science metal-music music r songs

Last synced: 26 May 2026

https://github.com/thicclatka/tetration

New file format for tensors

cli data fileformat mmap tensors

Last synced: 26 May 2026

https://github.com/wlgs/got-dialogues-data-stats

Game of Thrones dialogues data statistics processed with R and SQLite. Project for Probability and Statistics course 21/22 at AGH UST. The project was about manipulating data and getting many pieces of information from it in addition to visualizing these results.

data game-of-thrones got r statistics stats

Last synced: 22 May 2026

https://github.com/mnazlukhanyan/da-projects

Portfolio of data analytics projects showcasing my skills, abilities, and experience

data data-vizualisation hypothesis-tests matplotlib pandas plotly postgresql product-metrics python scipy seaborn sql visualization

Last synced: 11 Apr 2026

https://github.com/taeefnajib/ibm-applied-data-science-capstone

This repository is for my IBM Applied Data Science Capstone Project. All the notebooks and other files are uploaded. If you benefit from this repository in any way, please feel free to "Star" it and follow me. Thanks.

advance capstone capstone-project data data-science ibm ibm-watson jupyter jupyter-notebook notebook notebook-jupyter project science spacex spacex-api

Last synced: 14 Mar 2025

https://github.com/sushmashreeps/python

This repository showcases a comprehensive Python project, demonstrating expertise in backend development, data analysis, and machine learning. Built with Python 3.x, the project utilizes popular libraries like Django, Flask, NumPy, pandas, and scikit-learn. The project features efficient data processing, robust API integration, and scalable archite

api data data-science dataanalysis datavisualization game gamedeveloment python

Last synced: 12 May 2026

https://github.com/fuzzt/location-analyzer

The Location Data Analyzer is a Spring Boot application that offers insights on location data, such as counting locations by type, calculating average ratings, and identifying the most reviewed and incomplete entries. It features a simple frontend (HTML, CSS, JavaScript) and is deployed on Render.

analysis api average css data deployment docker fetch-api frontend html javascript location maven ratings render restful-api reviews spring-boot techstack

Last synced: 11 Apr 2026

https://github.com/martinius96/meteostanica-odosielacie-scripty

Weather station - Arduino, ESP8266, ESP32 - sender sketches for presenting data in a web interface.

arduino bme280 bmp280 data dht22 ds18b20 esp32 esp8266 espressif html meteo meteostanica mysel nodemcu php stanica teplota tlak vlhkost webstranka

Last synced: 11 Apr 2026

https://github.com/justinjjlee/simulation-discrete

Employing data transformations and simulations to answer random questions

analytics data data-science julia python simulation spark

Last synced: 30 Apr 2026

https://github.com/gdcmarinho/vaultchat

VaultChat is an end-to-end encrypted chat service

chat data e2ee encrypted messaging privacy

Last synced: 23 Mar 2025

https://github.com/g3th/fit_file_decoder

Decodes '*.fit' files and returns readable values.

bytes data decoder fit-file hex parsing

Last synced: 30 Jun 2025

https://github.com/vlamug/ratibor

Ratibor is a service for making metrics from data

data metrics prometheus

Last synced: 10 Mar 2026

https://github.com/naveenk-ds/redbus_web_screaping.app.py

🚌 Red Bus Project Overview: The Red Bus Project is a web scraping and visualization tool built with Selenium to extract bus information from the RedBus website. It stores the data in a MySQL database and provides an interactive visualization interface using Streamlit. The goal is to deliver insights into bus schedules, prices, ratings, etc...

data data-science database-management pandas pyhton selenium-webdriver sql

Last synced: 11 Apr 2026

https://github.com/yorkearwaker/data

Data things; representation, transformation, pipelines, governance,

actuality data epistemology information knowledge ontology

Last synced: 07 Apr 2025

https://github.com/fiddlydigital/anonimizer

A lib to replace and rehydrate sensitive data in text

anonimize anonymize data data-security prompt sanitize string string-manipulation text

Last synced: 15 Mar 2025

https://github.com/kuanhungchen/spring-2019-data-structures

📦 Some programming assignments about basic data structures.

data data-structures

Last synced: 25 Feb 2025

https://github.com/abirsaha111/ipl-2022-analysis

The IPL 2022 Analysis project is a data-driven exploration of the Indian Premier League (IPL) 2022 cricket tournament. The analysis focuses on utilizing Python programming and various libraries to analyze and visualize the performance of teams, players, and key metrics in the IPL 2022 season.

data dataana dataanalytics datavi matplotlib python

Last synced: 07 Jun 2026

https://github.com/soenneker/soenneker.dtos.idnamepair

A minimal Record type with an Id (string), Name (string), and maximum JSON compatibility

csharp data dotnet dto id name

Last synced: 12 Mar 2026

https://github.com/charon25/weatherdata

17 000 weather measurements collected by a weather station created for a college project.

csv data dataset datasets json measurements strasbourg weather weather-data

Last synced: 16 Jan 2026

https://github.com/s-babaeizadeh/next-mini-app

Next.js mini application

css data nextjs reactjs

Last synced: 11 Apr 2026

https://github.com/cognitixe/metamask-wallet-recovery-funds-phrase-data-seed-token

This repository provides tools and guidelines for securely recovering MetaMask Wallet funds using recovery phrases, seed data, and tokens. It ensures safe and reliable methods for recovering access to your wallet and managing your cryptocurrency assets.

bitcoin blockchain cryptocurrencies cryptocurrency data ethereum funds metamask metamask-bot metamask-desktop metamask-extension metamask-plugin metamask-snap metamask-wallet phrase recovery seed token wallet wallet-security

Last synced: 13 May 2026

https://github.com/kalaspuff/ready

🎟 [not yet built] Take control of the event loop with simplified task management, queueing and data loading.

asyncio data dataloading event futures python python3 resolver tasks

Last synced: 10 May 2026

https://github.com/mecha-cms/x.route

Custom route files.

custom data extension file folder path route url

Last synced: 23 Mar 2025

https://github.com/smeltier/data-structures-c

This repository contains C language implementations of the main data structures covered in the Algorithms and Data Structures course. The implementations were developed as part of my hands-on learning process and include sequential lists, linked lists, and other fundamental structures.

algorithms algorithms-and-data-structures c c-language c-programming data data-structures data-structures-c structures-c

Last synced: 16 May 2025

https://github.com/halyusa16/mysql-employee-analysis

This project focuses on analyzing employee data through querying, performing table joins to connect related information, aggregating salary statistics, and using subqueries to extract meaningful insights.

data data-analytics data-exploration database mysql self-project sql

Last synced: 20 Jan 2026

https://github.com/praxtube/dogg

CLI tool to log data manually

data data-logger log logger

Last synced: 10 Jun 2026

https://github.com/dhimmel/thinklytics

Continuous Thinklab project exports and analytics

analytics data rephetio thinklab travis-ci

Last synced: 23 Mar 2025

https://github.com/nel-zi/nuga_bank

Developed an automated data exploration and cleaning pipeline for Nuga Bank to streamline data preparation, ensure consistent data quality, and normalize datasets into structured databases for efficient analysis and reporting.

data data-automation data-visualization datacleaning datatransformation etl-automation etl-pipeline

Last synced: 16 May 2025

https://github.com/adamouization/python-machine-learning-data-science-notes

:orange_book: Jupyter notebooks containing useful Python code and notes for general Machine Learning and Data Science projects.

data data-science data-visualization guide jupyter jupyter-notebook machine-learning matplotlib notes numpy pandas pandas-dataframe python seaborn

Last synced: 11 Apr 2026

https://github.com/sakan811/show-leaving-soon-tracker-website

This is a Vue.js application that displays shows that are leaving each platform soon, featuring a countdown timer for each title based on the user's local timezone.

data hbo hbomax netflix shows streaming tv-shows vue vuejs web webapp website

Last synced: 18 Mar 2025

https://github.com/omarcodex/data_analysis

My repository of past and present research and data-driven projects.

data ecodev ecology science sustainability yale

Last synced: 18 Jan 2026

https://github.com/lablnet/alibaba_scraper

This is a robust web scraper that extracts data from the Alibaba website. It's multi-threaded and utilizes Playwright to efficiently scrape data from the website. This script is capable of scraping the entire Alibaba site, which would take approximately 4-6 months to complete.

alibaba data ecom mit-license open-source products scraper

Last synced: 15 Mar 2025

https://github.com/d4niee/exifpy

A simple console tool to view image metadata

data exif image meta python

Last synced: 23 Mar 2025

https://github.com/wittyicon29/kritika-iit-b-2023

Selection task for the summer projects of Kritika IIT-B

data data-analysis data-science

Last synced: 15 Mar 2025

https://github.com/amir76717/healthai-pro

HealthAI Pro revolutionizes the healthcare experience by leveraging cutting-edge AI technologies to provide intelligent, personalized healthcare solutions to patients and medical professionals alike. This platform incorporates machine learning, natural language processing, and robust data management to enhance the quality of healthcare services.

data machine-learning nlp

Last synced: 31 Mar 2025

https://github.com/plnech/never2late

Never 2 Late - a reinterpretation of Everest Pipkin's 'i've never picked a protected flower'

dada dada-science data generative-art glitch-art installation nlp poetry spacy vector-similarity wallpaper

Last synced: 10 Jun 2025

https://github.com/docuvesta/shiseido_skincare_usa_fr_infographics

Explore the performance indicators for reviews of a highly regarded serum from the Japanese luxury beauty brand Shiseido. This comparison covers the US and French websites 💯

analysis automatisation data datanalysis graphique infographie pandas plotly python skincare soins

Last synced: 11 Apr 2026

https://github.com/mikeqfu/network-rail-track-fixity-layer

This project develops a data mining tool for analysing and predicting track movements using asset data, environmental factors and track design knowledge to model key parameters and generate fixity values for the GB rail network.

data data-integration data-mining data-science information-management knowledge-discovery point-cloud rail rail-alignment rail-track track-fixity

Last synced: 02 Sep 2025

https://github.com/amethyst-php/activity

Someone just did something, should we save who did this and when?

activity amethyst amethyst-package api data laravel

Last synced: 17 May 2026

https://github.com/shudhanshusaurabh001/super_market-data-analysis-using-python

This project focuses on analyzing supermarket sales data using Python. The goal is to extract meaningful insights from the dataset, such as sales trends, customer purchasing behavior, and product performance.

analysis csv data insights matplotlib numpy pandas project python seaborn

Last synced: 06 Apr 2026

https://github.com/turner-kendall/turner-kendall

Turner Kendall - dev, ops, sec.

config data github-config go rust security

Last synced: 31 Oct 2025

https://github.com/0xHericles/SpamDetector

:email: A Simple Python Spam Detector with Scikit-Learn

data ham machine-learning python sklearn spam

Last synced: 24 Mar 2025

https://github.com/noedemange/orderedheatmapanalysis

OrderedHeatMapAnalysis (OHMA) is a direct data analysis framework for simultaneously visualizing and analyzing the structure of complex datasets. An optimized seriation of the rows and columns of the input data table is performed, mapping the whole dataset into an ordered heatmap.

analysis bi-seriation data dataanalysis heatmap r rstats seriation shiny shiny-apps

Last synced: 27 Feb 2025

https://github.com/arthurdanjou/studies

💼 This is the repository containing all my projects done during my studies in Python and R.

ai data data-science data-visualization jupyter jupyter-notebook ml python r

Last synced: 08 Apr 2025

https://github.com/infinitode/pyautoplot

PyAutoPlot is an open-source Python library designed to make dataset analysis much easier by generating helpful detailed plots using matplotlib. It automatically generates appropriate plots based on the dataset you feed it.

analysis automatic csv data dataset dataset-analysis generation matplotlib pandas plots plotting-in-python plotting-library python

Last synced: 16 Mar 2025

https://github.com/beriberikix/senml-zephyr

A codec for encoding and decoding Sensor Measurement Lists (SenML) for Zephyr

codec data iot senml sensor zephyr-rtos

Last synced: 24 Mar 2025

https://github.com/natanast/euroleaguebasketball

An R package providing data on Euroleague Basketball

data data-science package r

Last synced: 01 Apr 2025

https://github.com/suchi25sathavara/data-wrangling-with-r

Analyzing Road Accidents in Victoria, Australia

data r reporting rstudio wrangling-data

Last synced: 01 Apr 2025

https://github.com/suchi25sathavara/r-projects

R projects in real-world scenarios for data analysis

data data-analysis datavisualization r

Last synced: 01 Apr 2025

https://github.com/tompollard/data

Repository to hold sample datasets etc

data

Last synced: 05 Jan 2026

https://github.com/wolfchamane/data-sandbox

Sandbox tool for front-end development.

data database front-end nodejs npm rest sandbox tool

Last synced: 28 Oct 2025

https://github.com/lohithgsk/dynamic-qr-generator

A Python-based QR generator application was developed using the qrcode and Pillow libraries, dynamically generating QR codes for custom data inputs. Designed for a college grievance management system, the application creates QR codes containing block, floor, room, and machine numbers, allowing easy placement and identification on each floor.

data pillow python qrcode qrcode-generator

Last synced: 16 Mar 2025

https://github.com/zulfachafidz/telco_churn_insight_customer_loss_prediction_with_random_forest_and_decision_tree-algorithms

The main problem in the business world is customer churn, or losing customers, especially in the telecommunications industry, which experiences very tight competition. To overcome this problem, an analysis was carried out to help the company understand how many customers have the potential to switch providers.

data data-science data-visualization dataanalysis dataanalyst dataanalytics datadrivenwithdataprovider decision-tree decision-tree-classifier decision-trees random-forest random-forest-classifier

Last synced: 01 May 2026

https://github.com/thingston/extractor

Collection of PHP classes to extract data from HTML pages.

data html php

Last synced: 14 Jan 2026

https://github.com/purarue/scramble-history

Parses Rubik's Cube scramble history and solve times from cstimer.net, cubers.io, and twistytimer, then merges them together, giving you uniform averages/data/graphs

cstimer cubing data rubiks-cube speedsolving

Last synced: 11 Jun 2025

https://github.com/etmendz/mendz.data

Provides tools and guidance for creating data access contexts and repositories.

context data datasettings entity-framework mendz paginginfo repository resultinfo

Last synced: 11 Jun 2025

https://github.com/ournet/ournet.web.data

Ournet web data module

data ournet web

Last synced: 04 Apr 2025

https://github.com/team-hydrogen/nasa-adc-data

All files relating to the computation of the data provided

data jupyter-notebook nasa-app-development-challenge

Last synced: 25 Mar 2025

https://github.com/igor-starostenko/sabre

Slice your files like a champ with **sabre**

data golang package

Last synced: 28 Mar 2025

https://github.com/filipnet/infoscreen

Arduino subscribes to values over MQTT and shows the info on an OLED I2C display

arduino data display i2c mqtt oled-display-ssd1306 visualization weather weatherstation

Last synced: 12 Apr 2026

https://github.com/entropyorg/p5-data-testimage

:notebook::camera: interface for retrieving test images

cpan data image-analysis

Last synced: 29 May 2026

https://github.com/denisecase/dc-mailer

Send an email using Python

alerts data email python streaming

Last synced: 11 Apr 2025

https://github.com/tdjsnelling/hermes

Hermes is a real-time data framework for React + MongoDB

data docker framework mongodb nodejs react react-hooks reactjs real-time typescript websocket

Last synced: 12 Apr 2026

https://github.com/tttardigrado/fq

Graphs for the MEDEA project

bokehplots data data-science dataanalysis pandas physics python3

Last synced: 12 Apr 2026

https://github.com/wisdom-osborn/data-analytics-course-online-

🔍 Data Analytics with Python: Hands-on Course Materials. Jupyter notebooks, projects, and datasets based on the freeCodeCamp Data Analysis with Python certification. Learn NumPy, Pandas, data cleaning, and visualization through real-world examples.

data data-analysis data-science data-visualization freecodecamp numpy pandas pandas-dataframe project python

Last synced: 19 Apr 2026

https://github.com/rohitblaze10/netflix_analysis_using_tableau

The Netflix dashboard in Tableau provides a professional and visually captivating interface for users to explore a vast collection of TV shows and series. With seamless navigation and interactive filters, users can easily personalize their recommendations based on release year, genre, duration, and rating.

data data-analysis data-science data-visualization netflix tableau

Last synced: 04 Feb 2026

https://github.com/astridlyre/offhand

A Random Data Generator Library for JavaScript.

data generator javascript library random typescript

Last synced: 20 May 2026

https://github.com/romtaug/scoring-stoxx

Scoring and portfolio construction for the STOXX, CAC, and DAX via Wikipedia scraping, with results sent by email - yfinance

api data emailing portfolio scoring stoxx wikipedia yfinance

Last synced: 05 Sep 2025

https://github.com/goto-eof/bitmaptize

Wraps data inside a .bmp and extracts data from .bmp.

bitmap bmp convert data wrap

Last synced: 18 Jan 2026

https://github.com/doughtnerd/pod-old

Read and write Excel data

data data-analysis excel poi-library workbook

Last synced: 21 Jan 2026

https://github.com/nsandoya/python_scrp_project

This is a tool made specifically for the Dipaso e-commerce website. You can extract data from it, analyze it, and see keyword, brand, and category frequencies, price distributions, and other market trends, all in a set of friendly statistical tables and charts (exported from a Jupyter notebook) :)

beautifulsoup4 data data-analysis jupyter-notebook pandas python3

Last synced: 28 Apr 2026

https://github.com/dsietz/rust-daas

An example of implementing the DaaS pattern using Rust

archconf daas data kafka rust rust-lang

Last synced: 05 Sep 2025

https://github.com/bcongdon/nid-data

National Inventory of Dams Data

data datasette government-data

Last synced: 21 Apr 2026

https://github.com/unknownsoup/budget_tracker

A personal budget tracker to build my knowledge of working with databases and data analysis. It uses SQL and Python for the analysis.

data data-science databases python sql

Last synced: 26 Jan 2026

https://github.com/shadmanshaikh/data-analysis-and-ml-work

All of my work in Data Analysis and Machine learning

analytics artificial-intelligence data machine-learning

Last synced: 05 Jul 2025

https://github.com/quangandrei1003/france_air_pollution_pipeline

End-to-end air pollution data pipeline for French metropolitan cities using Airflow, Python, dbt, BigQuery.

airflow bigquery data data-analytics data-engineering data-modeling data-visualization dbt docker etl pandas python terraform

Last synced: 13 Apr 2026

https://github.com/musamairshad/dsa-python

This repository contains all the material related to Data Structures and Algorithms implemented in Python.

algorithms data datastructures efficiency python searching-algorithms sorting-algorithms

Last synced: 25 Mar 2025

https://github.com/nmsud/formdata

🗃️ Data from the NMSUD Form submissions

api data json unification-day

Last synced: 16 May 2026

https://github.com/otoneko1102/roulette-base

A collection of roulette colors and numbers in JSON format. Use it when making a casino-style roulette.

casino casino-games data json require roulette

Last synced: 16 Mar 2025

https://github.com/vishwas-chakilam/twitter-sentiment-analysis

Twitter Sentiment Analysis is a Python project that analyzes the sentiment of tweets based on a user-defined keyword. It uses Tweepy to fetch tweets from the Twitter API and TextBlob for sentiment analysis. The application features a user-friendly GUI with Tkinter, displaying tweet sentiment as positive, negative, or neutral.

api data data-science dataanalysis python3 textblob-sentiment-analysis tkinter tweepy-api

Last synced: 11 Mar 2025

https://github.com/karosi12/ng-data-share

Angular communication with input and output properties

angular communication data data-binding input output sharing typescript

Last synced: 16 Jan 2026

https://github.com/jstafford5380/provausio.testing.generators

Generate fake data for testing and/or mocking

data fake-data generator testing

Last synced: 14 Jan 2026

https://github.com/remidumas/rstats

RStats weblog

data ia r science stats

Last synced: 25 Mar 2025

https://github.com/q-aware-labs/bias-insights

Bias detection project for the Chicago Face Database (CFD)

ai chicago-data-portal data data-science llm statistical-analysis

Last synced: 21 Jan 2026

https://github.com/juangesino/research-project

Course files for Research Project @ University of Amsterdam

data data-science economics stata

Last synced: 02 Jan 2026

https://github.com/fiedsch/data_util

Miscellaneous utilities for data files, such as variable name lists

data helper management php

Last synced: 14 Jun 2025

https://github.com/buffdelta/basketball_ref_webscraper

Python package to make webscraping from basketball-reference easy

basketball data python python-library webscraping

Last synced: 14 Jan 2026

https://github.com/soenneker/soenneker.constants.data

A set of commonly used constants related to various types of data

constants csharp data dotnet

Last synced: 12 Mar 2026