An open API service indexing awesome lists of open source software.

data

Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)

https://github.com/danieljdufour/fast-b64

Quickly Convert between B64 and Binary Strings

b64 base64 base64-decoding base64-encoding binary bits compression data

Last synced: 08 Oct 2025

https://github.com/hafs96/prediction_consommation-de-carburant

Dans ce projet, l'objectif est de développer un modèle permettant de prédire si une voiture a une consommation de carburant élevée ou faible en fonction de ses caractéristiques techniques.

analysis data data-visualization machine-learning testing training

Last synced: 09 Jun 2026

https://github.com/djdhairya/whatsapp-chat-analysis

WhatsApp chat analysis is a multidimensional process that delves into the content, structure, and dynamics of conversations within the platform. It provides valuable insights for personal reflection, organizational decision-making, and improving communication strategies.

data data-science dataanalytics datapreprocessing machine-learning ml

Last synced: 08 Oct 2025

https://github.com/shubhamsoni98/classification-with-random-forest-1

To classify sales into categories (Low, Moderate, High) using Random Forests to inform strategic decisions and optimize marketing strategies.

algorithms anaconda data data-science datacleaning eda jupyter-notebook machine-learning pyhton random-forest scikit-learn visualization

Last synced: 18 Jan 2026

https://github.com/leevilaukka/alkometriikka

Tool to search Alko database and see some fun stats about different beverages

data gh-pages svelte typescript xlsx

Last synced: 18 May 2026

https://github.com/satyam4229/iit-and-nit-college-dataset

The dataset for IITs and NITs typically includes information related to these premier engineering institutions in India, such as their names, locations, rankings, academic programs offered, faculty details, student information, admission process, infrastructure and facilities, placements.

college-data csv data excel iit nit

Last synced: 04 Jan 2026

https://github.com/thibautre/dataipsum

Configurable data generator (with crumbles inside)

algorithm data random-generation

Last synced: 21 Jul 2025

https://github.com/diegoperea20/datos-secuenciales-con-ia

Realizacion de procesamiento de señales unidimensionales con modelos auto regresivos, convolución 1d, convolución 2d usando el espectrograma y redes recurrentes

ai artificial-intelligence convolutional-neural-networks data ia secuential-data spectrogram uao

Last synced: 06 Feb 2026

https://github.com/g3th/fit_file_decoder

Decodes '*.fit' files and returns readable values.

bytes data decoder fit-file hex parsing

Last synced: 30 Jun 2025

https://github.com/cburmeister/disc-golf-courses

All the disc golf courses i've played at. Maintained with http://geojson.io/.

data geojson

Last synced: 21 Jan 2026

https://github.com/justinjjlee/simulation-discrete

Employing data transformations and simulations to answer random questions

analytics data data-science julia python simulation spark

Last synced: 30 Apr 2026

https://github.com/kaijagahm/2023-10-20-stlzoo

Data Carpentry workshop, hosted at the St. Louis Zoo. Beta testing the new ecology data lesson.

data data-science ecology r rstudio

Last synced: 05 Feb 2026

https://github.com/martinius96/meteostanica-odosielacie-scripty

Meteostanica - Arduino, ESP8266, ESP32 - odosielanie sketche pre reprezentáciu dát vo webovom rozhraní.

arduino bme280 bmp280 data dht22 ds18b20 esp32 esp8266 espressif html meteo meteostanica mysel nodemcu php stanica teplota tlak vlhkost webstranka

Last synced: 11 Apr 2026

https://github.com/fuzzt/location-analyzer

The Location Data Analyzer is a Spring Boot application that offers insights on location data, such as counting locations by type, calculating average ratings, and identifying the most reviewed and incomplete entries. It features a simple frontend (HTML, CSS, JavaScript) and is deployed on Render.

analysis api average css data deployment docker fetch-api frontend html javascript location maven ratings render restful-api reviews spring-boot techstack

Last synced: 11 Apr 2026

https://github.com/sushmashreeps/python

This repository showcases a comprehensive Python project, demonstrating expertise in backend development, data analysis, and machine learning. Built with Python 3.x, the project utilizes popular libraries like Django, Flask, NumPy, pandas, and scikit-learn. The project features efficient data processing, robust API integration, and scalable archite

api data data-science dataanalysis datavisualization game gamedeveloment python

Last synced: 12 May 2026

https://github.com/muhammadadilnaeem/student-performance-indicater-end-to-end-data-science-project

This project leverages data science techniques to build a predictive model that estimates a student's exam performance. The project follows a structured data science workflow, including data collection, preprocessing, model building, evaluation, and deployment.

data machine-learning-algorithms pandas pymysql python sql

Last synced: 11 Apr 2026

https://github.com/yash-rewalia/airbnb_eda_pandas

The goal of the project is to gather information and analyze the detailed information of the different entries in order to provide insights about the host and price of the property in a particular area as per your preference , type of rooms and number of reviews accordingly.

data data-cleaning data-insights data-preprocessing data-visualization matplotlib numpy pandas python seaborn

Last synced: 11 Apr 2026

https://github.com/oniani/miniframe

Minimal data frames with relational algebra

data dataframe-library haskell haskell-library library

Last synced: 04 Mar 2025

https://github.com/wlgs/got-dialogues-data-stats

Game of Thrones dialogues data statistics processed with R and SQLite. Project for Probability and Statistics course 21/22 at AGH UST. The project was about manipulating data and getting many pieces of information from it in addition to visualizing these results.

data game-of-thrones got r statistics stats

Last synced: 22 May 2026

https://github.com/j-sephb-lt-n/joes_giant_toolbox

A large collection of general python functions and classes that I use in my daily work

ascii browser classifier data dataviz gcp mime nlp python regex search statistics supervised web-scraping

Last synced: 10 Oct 2025

https://github.com/azkarmoulana/winter-of-data-2019

:snowflake: :snowman: Winter of Data is coming..... :wolf:

data data-science machine-learning mathematics

Last synced: 05 Feb 2026

https://github.com/thicclatka/tetration

New file format for tensors

cli data fileformat mmap tensors

Last synced: 26 May 2026

https://github.com/gianlucatruda/titanic

An exhibition of my experience in data processing and visualisation. Python script to process and visualise the Titanic survivor data.

data database flask info matplotlib python science scrape server titanic visualisation web

Last synced: 10 Apr 2026

https://github.com/sanchittechnogeek/overscripted-analysis

Geolocation and user language extraction analysis from Mozilla Overscripted dataset

analysis data data-analysis mozilla

Last synced: 23 Mar 2025

https://github.com/open-geodata/sp_bh_pcj-2020-2035

Dados Espaciais da Agência das Bacias PCJ, com informações apresentadas no Plano de Bacias 2020-2035

data python

Last synced: 16 Jan 2026

https://github.com/checco9811/data-engineering-bootcamp-homework

Homework solutions for DataExpert.io data engineering bootcamp

apache-spark data data-engineering sql

Last synced: 14 Mar 2025

https://github.com/nukopian/shell-series

Extract columns from tabular text

automation data shell

Last synced: 11 Oct 2025

https://github.com/karashiiro/lodestone-character-data-scraper

Lodestone character data scraper.

data ffxiv ffxiv-character lodestone

Last synced: 23 Apr 2026

https://github.com/dhruvil-26/tableau-projects

This repository contains Tableau visualization projects focused on data analysis across different domains. Projects include: 1. IPL Visualization - Insights into IPL match, Team and player statistics. 2. EV Analysis - Visualizations exploring the adoption of electric vehicles. 3. Road Accident Analysis - Analysis of road accident patterns

analysis data data-analysis data-analytics electric-vehicles ipl road-accident-analysis tableau tableau-public

Last synced: 19 Jan 2026

https://github.com/sharoonjoseph321/insurance_fraud_detection

Fraud Detection using machine learning algorithm-KN Neighbors .Data exploration using Pyspark and matplotlib.

analytics data data-science eda high-performance knn-algorithm knn-classification machine-learning matplotlib-pyplot pyspark python seaborn spark statistics

Last synced: 23 Mar 2025

https://github.com/laguer/jupyterdatascienceworkflow

Jupyter Notebook dedicated to studying Agriculture and AMI analytics

agriculture amis corn data fao jupyter maize oecd rice science soja

Last synced: 11 Oct 2025

https://github.com/maluscat/reactive-storage

[MIRROR] Register, observe and intercept deeply reactive data without the need for proxies

data javascript reactive typescript

Last synced: 10 Mar 2026

https://github.com/vidupriya/aws-glue--data-copy

The function for copying data like CSV, Parquet, avro etc., from a source S3 bucket to a destination S3 bucket using AWS Glue. It includes the necessary setup for the Glue job, logging, reading data from the source bucket, and writing it to the destination bucket

aws awsglue awss3 data data-copying glue glue-job pyspark python3 s3 s3-bucket s3-buckets s3-storage spark

Last synced: 02 May 2026

https://github.com/team810/frcs

FRCS is an online international crowd sources data collection software written for the FRC Competitions. It was created by team 810, The Mechanical Bulls.

crowdsourcing data web

Last synced: 14 Mar 2025

https://github.com/soenneker/soenneker.dtos.idpartitionpair

A minimal Record type with an Id (string), PartitionKey (string), and maximum JSON compatibility

csharp data dotnet dto id key partition

Last synced: 09 Mar 2026

https://github.com/12458/99co

99co Web Scraping

99co data property scraper website

Last synced: 02 May 2026

https://github.com/greatwoman23/car_insurance_analysis

The Car Insurance Analysis project aims to provide a comprehensive examination of a car insurance portfolio using advanced data analytics tools. The analysis offers valuable insights into policy demographics, claims patterns, and financial metrics, helping stakeholders make informed decisions.

bigquery data data-science dataanalytics insurance-claims looker-studio tableau

Last synced: 03 Feb 2026

https://github.com/thanhleviet/vietnam_antibiotics_bidding

This repo contains data of bidding for multiple drugs and antibiotics reported to Vietnam Ministry of Health in 2015, 2016, 2017.

antibiotics data vietnam

Last synced: 23 Feb 2026

https://github.com/fatihemres/Africa

Africa app by SwiftUI. Using AVFoundation, MapKit, data, models, animations, stickers.

animations avfoundation data mapkit models swift swift-animations swiftui

Last synced: 31 Aug 2025

https://github.com/viniddev/active_finance

Nesse projeto busquei solucionar um problema corriqueiro que é a dificuldade de se manter atualizado sobre as variações do mercado de ações e fundos imobiliários. Usei selenium webdriver para buscar informações e uma API do Telegram para enviar relatórios para o usuário

automation data data-analisis rpa selenium-webdriver telegram-bot

Last synced: 03 May 2026

https://github.com/justinyahin/wpdf

Create, filter, sort and display users data on your WordPress site.

data filtering wordpress

Last synced: 18 Apr 2026

https://github.com/priyapuranik/data-analytics-using_python

Analyzed data of Hotels and find out meaningful insights from it including booking patterns and seasonal trends and many more.

data pandas python sql visualization

Last synced: 06 Apr 2026

https://github.com/als8446/tripleten-data-science-projects

Projects Overview Projects made in the Data Scientist course from TripleTen LatAm

data data-analysis hypothesis-tests machine matplotlib numpy pandas python scipy sklearn

Last synced: 10 Apr 2026

https://github.com/tyriek-cloud/nyc-dca-etl

Created an ETL pipeline to merge two CSV files (converted to JSON) into a parquet file using Azure Data Factory, The data was extracted from NYC Open Data: https://opendata.cityofnewyork.us/ and I created a Blob Container within an existing storage account.

azure azure-data-factory blob-storage data data-engineering etl-pipeline

Last synced: 21 Jan 2026

https://github.com/carlosrs14/parallel-data-preprocessig-system

A parallel data preprocessing system using threads and synchronization mechanisms (barrier, busy-waiting, condition variables) to clean and prepare data for AI training.

barrier-method c condition-variable data operative-systems parallel-computing posix preprocessing synchronization threads

Last synced: 24 Jul 2025

https://github.com/snimmagadda1/luigi-etl-example

🔍 Example of an ETL pipeline using Spotify's Luigi

data luigi luigi-pipeline python spotify

Last synced: 30 Mar 2025

https://github.com/shubhammittal-data/hr_dashboard_tableau

An interactive HR Analytics Dashboard built using Tableau. Provides insights into workforce demographics, hiring trends, salary analysis, and employee records for data-driven decision-making.

chatgpt4 data data-analysis data-visualization drawio-tools faker-generator hr-analytics hr-analytics-dashboard human-resources numpy python tableau tableau-public

Last synced: 17 May 2026

https://github.com/koppalexander/flightdelaychallenge

This project focuses on predicting flight delays using historical data from a Tunisian airline. We analyzed patterns in airport operations and flight schedules to build a machine learning model that can forecast potential delays.

data data-science machine-learning machine-learning-algorithms machinelearning prediction predictive-modeling

Last synced: 19 Jun 2026

https://github.com/sungchun12/demotron

CLI to delight real people with live demos

cli data demo sqlmesh

Last synced: 26 Feb 2025

https://github.com/asacxyz/flutter_aplicando_persistencia_de_dados

Para acompanhamento do curso Flutter: aplicando persistência de dados

dart data data-storage flutter persistence persistent-storage sqflite sql sqlite

Last synced: 03 May 2026

https://github.com/ate47/playerdata

Get data about a player with a command

bukkit-plugin command data spigot-plugin

Last synced: 30 Aug 2025

https://github.com/flowsta/ods-educacion-aporta

ODS para educación, iniciativa APORTA 2021

data data-visualization ods sdg

Last synced: 27 Jan 2026

https://github.com/joshuadeguzman/xcraper

Python based stocks exchange data scraper

data pandas python stock-market

Last synced: 18 May 2026

https://github.com/schoolsquirrel/holiday-data

Automatically updated holiday data for SchoolSquirrel

data holidays schoolsquirrel scripts vacation

Last synced: 03 Oct 2025

https://github.com/digital-media/cv_data

Datasets used for courses/tutorials at the Digital Media Department

computer-vision data image-processing images

Last synced: 14 Oct 2025

https://github.com/isandyawan/simplelinearregression

A application to analyze data using simple linear regression. This application can make regression model from variable and give advice to user if the model break regression assumsion

data linear r regression rstudio shiny statistic

Last synced: 14 Oct 2025

https://github.com/yash-chauhan-dev/spark_cluster_docker

Set-up local spark cluster, hadoop (hdfs), airflow, postgresql on docker with ease, without any local installations

apache-spark data data-engineering data-engineering-pipeline deployment docker docker-compose hadoop hdfs local-development localhost pyspark python

Last synced: 04 May 2026

https://github.com/ahmad-ali-rafique/wine-quality-dataset

Comprehensive analysis and modeling of the Wine Quality dataset, including exploratory data analysis (EDA), data preprocessing, model training, and performance evaluation using MSE and RMSE.

analytics data datacleaning decision-tree-regression exploratory-data-analysis gradient-boosting-regressor linear-regression machine-learning mean-square-error model

Last synced: 21 Aug 2025

https://github.com/fallaciousreasoning/nz-mountains

A list of mountains in NZ, scraped from https://climbnz.org.nz

alpine climbing climbnz data json json-api maps mountaineering scraping

Last synced: 04 May 2026

https://github.com/e-kotov/albofr

alboFr: Get French Data on Tiger Mosquito Colonisation

aedes-albopictus data france tiger-mosquito

Last synced: 11 Jun 2026

https://github.com/rachelresende/projeto-finan-as

Este repositório é referente a um curso de análise de dados para finanças que realizei em 2025 na Udemy.

analytics data financas finance finance-management

Last synced: 19 Aug 2025

https://github.com/dimitryzub/russo-ukraine-war-prediction-losses

Highlights rusian losses with predictions based on historic data from Ministry Defence of Ukraine 🐱‍👤

data dataanalysis dataanalytics matplotlib pandas prophet python

Last synced: 04 May 2026

https://github.com/jdanielgoh/cobertura-campanias

En una democracia ¿caben todas las voces? Proyecto para visualizar el monitoreo de radio y TV que realiza el INE de las candidaturas presidenciales 2024

d3js data datavisualization vue

Last synced: 09 Jun 2026

https://github.com/davorg/towerbridge

When is Tower Bridge lifting?

data hacktoberfest london perl web-scraping

Last synced: 29 Jun 2026

https://github.com/yagoluiz/enem-analise-extracao

[PT-BR] Extração e análise de dados do desempenho da região Centro-Oeste

analysis data extraction python3 r

Last synced: 17 Apr 2026

https://github.com/h4fide/politicalcompassbot

This Python project allows you to take a quiz and find out where you fit on the political compass. Give it a try and see where you stand!

bot data greedy-algorithms politics python python3 sql telegram

Last synced: 19 Aug 2025

https://github.com/bdr-pro/streamlint

ltra-cool Streamlit app, where you can interact with widgets, see data in action, and even upload and download files

data streamlit

Last synced: 14 Apr 2026

https://github.com/octoenergy/tentaclio-s3

A python project containing all the dependencies for s3 tentaclio schema.

data

Last synced: 24 Jun 2025

https://github.com/bhemen/aave-data

Borrowing and lending data sets from the Aave protocol on Ethereum

aave borrow data ethereum lend python

Last synced: 05 Feb 2026

https://github.com/hakusaro/facts

A fact based knowledge system (FBKS) experiment.

data facts hacktoberfest

Last synced: 03 Jan 2026

https://github.com/vedikasnehil/my-data-science-projects

This repository is a comprehensive collection of resources and implementations dedicated to the field of Data Science. It serves as a platform for exploring various aspects of data science, ranging from data preprocessing and exploratory data analysis (EDA) to machine learning and deep learning.

data data-science deep-learning machine-learning matplotlib numpy python sql visualization

Last synced: 10 Apr 2026

https://github.com/ronknight/user-data-dashboard

📈 A data visualization tool for analyzing user data using an Excel-based data source.

dashboard data excel ga4 screenshot

Last synced: 17 Oct 2025

https://github.com/rijkvanzanten/ds-fa-1

The first final assignment for the data structures class

assignment data final map now parsons structures thenewschool

Last synced: 04 Oct 2025

https://github.com/jleung51/foundations-dags

Data ETL pipeline to clean, process, and aggregate data from Canadian housing starts.

data data-engineering etl extract housing load pipeline transform

Last synced: 04 Oct 2025