An open API service indexing awesome lists of open source software.

data

Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)

https://github.com/kahlery/my-jupyter-notebook-projects

🐊 collection of my data science analysis, actually I store most of my data science projects in my google drive because of google colab

data jupyter-notebook python

Last synced: 12 Apr 2026

https://github.com/prajakta1321/streetml-a-cityscape-traffic-volume-prognostication

StreetML leverages ML learning techniques to revolutionize urban traffic prediction through precise volume prognostication, aiming to enhance cityscape mobility through data-driven insights.

catboostregressor data datavisualisation exploratory-data-analysis lightgbm-regressor linearregression machine-learning machine-learning-algorithms predictive-analytics random-forest-regression xgboost-regression

Last synced: 08 Apr 2025

https://github.com/jacoblincool/moodle-export

A streamlined library for retrieving data from Moodle.

data moodle

Last synced: 07 May 2025

https://github.com/veivel/f1-sentiment-analysis

An entiment analysis project on tweets about Formula 1. To be reworked.

data f1 nlp-library nlp-machine-learning

Last synced: 04 Jul 2025

https://github.com/steveanik/kestra

Infinitely scalable, event-driven, language-agnostic orchestration and scheduling platform to manage millions of workflows declaratively in code.

data data-engineering data-integration data-pipeline data-quality elt etl low-code orchestration pipelines scheduler workflow workflow-engine

Last synced: 06 Jan 2026

https://github.com/lohithgsk/dynamic-qr-generator

A Python-based QR generator application was developed using the qrcode and Pillow libraries, dynamically generating QR codes for custom data inputs. Designed for a college grievance management system, the application creates QR codes containing block, floor, room, and machine numbers, allowing easy placement and identification on each floor.

data pillow python qrcode qrcode-generator

Last synced: 16 Mar 2025

https://github.com/lightdash/quickstart-github

Instant analytics for Github

analytics business-intelligence data dbt github

Last synced: 14 Sep 2025

https://github.com/afnanenayet/ds-a

Some interview prep I've been doing. This repo is reimplementations of algorithms and data structures in Python3

algorithms data interview prep python structures

Last synced: 05 Apr 2025

https://github.com/miraclx/split-merge

Efficient, flexible data stream chunker and merger

chunk data efficient merge middleware nodejs pipeline split stream

Last synced: 07 May 2026

https://github.com/doughtnerd/pod-old

Read and write Excel data

data data-analysis excel poi-library workbook

Last synced: 21 Jan 2026

https://github.com/axnjr/csv-parser-utils

My own Pandas in Go, Python & Rust, Utility methods for Handling CSV Files in Core Go & Rust with bindings for python.

csv data dataanalysis datatools go golang golang-application pandas python rs rust

Last synced: 29 Apr 2026

https://github.com/nsandoya/python_scrp_project

This is a tool specially made for Dipaso ecommerce website. You can extract data from there, analyze it and see keywords, brands, and categories frecuency, prices distribution and other market tendencies as well —all in a group of friendly stadistic tables and graphics (exported from a Jupyter notebook) :)

beautifulsoup4 data data-analysis jupyter-notebook pandas python3

Last synced: 28 Apr 2026

https://github.com/gvatsal60/ds-on-kaggle

A collection of data science projects, experiments, and insights from Kaggle competitions and datasets

data data-science data-visualization numpy pandas python3

Last synced: 29 Apr 2026

https://github.com/ahmad-ali-rafique/linear-regression-modeling

In-depth exploration of linear regression models, including data cleaning, model building, and performance evaluation on various datasets.

artificial-intelligence data dataanalytics linear-models linear-regression model multilinear-regression regression regression-models

Last synced: 19 Apr 2026

https://github.com/patrickdavies100/pipeline38

An application to automate the creation and execution of SQL queries.

data pandas-dataframe pipeline postgresql psycopg2 sqlalchemy

Last synced: 30 Apr 2026

https://github.com/lamouchi-bayrem/data-matrix-scanner

A dual-interface tool that leverages AI to **detect and decode QR codes and Data Matrix codes** from images using computer vision

data datamatrix-scanner decoder flask qrcode scanner tkinter-gui webapp

Last synced: 30 Apr 2026

https://github.com/shubhamsoni98/project_using_knn

This project applies the K-Nearest Neighbors (KNN) algorithm to predict iPhone purchases based on customer data. Using features like age, salary, and previous purchase behavior, the KNN model classifies customers into buyers and non-buyers.

anaconda analytics data data-science eda knn knn-classification machine-learning-algorithms predict project python scikit-learn tableau

Last synced: 03 Jan 2026

https://github.com/onekiloparsec/arcsecond-swift

The swift client for interacting with the server-side RESTful resources of arcsecond.io.

arcsecond astro-library astronomy data django swift swift-3

Last synced: 30 Apr 2026

https://github.com/wolfchamane/data-sandbox

Sandbox tool for Front-end developments.

data database front-end nodejs npm rest sandbox tool

Last synced: 28 Oct 2025

https://github.com/dsietz/rust-daas

An example of implementing the DaaS pattern using Rust

archconf daas data kafka rust rust-lang

Last synced: 05 Sep 2025

https://github.com/andygol/andygol.github.io

Andrii Holovin – Product & Project Manager Geospatial Expert / OpenStreetMap Consultant / DevOps practitioner

consultant data data-structures devops experience floss gis mapping navigation openstreetmap personal-site personal-website

Last synced: 13 May 2026

https://github.com/companyakis/financial-data

Financial Data & Python

data finance python

Last synced: 29 Jun 2025

https://github.com/fatihilhan42/olympics-data-analysis-with-python

I will examine the Data Analysis of the Olympics between 1896-2016, which we have done on Python.

data data-science dataanalysis datavisualization jupyter-notebook olympics python

Last synced: 30 Apr 2026

https://github.com/tompollard/data

Repository to hold sample datasets etc

data

Last synced: 05 Jan 2026

https://github.com/armand-sauzay/datasets

Datasets for machine learning

ai data datasets machine-learning ml

Last synced: 18 Jan 2026

https://github.com/dnut/json-match-finder

Python application used to match listings against openings via authenticated JSON API access.

data data-structures data-wrangling database json-api python-application python-modules

Last synced: 01 May 2026

https://github.com/bcongdon/nid-data

National Inventory of Dams Data

data datasette government-data

Last synced: 21 Apr 2026

https://github.com/dantetrb/diabetes-readmission-dbt

Predictive analytics on diabetic patient readmissions using dbt, DuckDB and Python – with explainability and clustering.

clustering data dataengineering dbt diabetes duckdb hdbscan healthcare jupyter lime readmission-prediction sql

Last synced: 01 May 2026

https://github.com/natanast/euroleaguebasketball

An R package providing data on Euroleague Basketball

data data-science package r

Last synced: 01 Apr 2025

https://github.com/giuleo129/dataanalysis

This folder contains two projects focused on data analysis and statistical learning using R, covering exploratory data analysis, modeling, and predictive techniques.

data data-analysis data-science statistical-learning

Last synced: 25 Jan 2026

https://github.com/lexiortiz/advanced-data-analytics

Structured learning notes, code snippets, and key takeaways from the Google Advanced Data Analytics Professional Certificate. Serves as a personal reference for reinforcing concepts and as a resource for others on a similar learning journey.

data data-analysis data-engineering google python-3 sql

Last synced: 29 May 2026

https://github.com/sysread/skewer

A priority queue for Go implemented using a skew heap

binary data go heap min minqueue priority queue skew structure

Last synced: 26 Aug 2025

https://github.com/beriberikix/senml-zephyr

A codec for encoding and decoding Sensor Measurement Lists (SenML) for Zephyr

codec data iot senml sensor zephyr-rtos

Last synced: 24 Mar 2025

https://github.com/jigyasag18/movie-recommendation-system-project

This repository features a personalized movie recommendation system that offers tailored suggestions to users. It leverages a dataset of 5,000 English-language films and utilizes data processing, feature engineering, and a cosine similarity algorithm to analyze user preferences. The system includes an intuitive user interface for easy navigation.

data datacleaning datapreprocessing machine-learning machine-learning-algorithms python streamlit streamlit-webapp

Last synced: 28 May 2026

https://github.com/lut-ful/ibm-capstone-project-stack-overflow-job-survey

IBM Data Analyst professionale certificate program final project.

cognos data data-analytics looker power-bi python sql statics

Last synced: 01 May 2026

https://github.com/merekat/flight-delay-prediction

This project focuses on predicting flight delays using historical data from a Tunisian airline. We analyzed patterns in airport operations and flight schedules to build a machine learning model that can forecast potential delays.

aviation data data-science machine-learning machine-learning-algorithms machinelearning prediction predictive-modeling

Last synced: 08 Apr 2025

https://github.com/shauryauppal/mydatatoolkit

A toolkit for data scientists to get work done faster, easier, and in a smarter way.

analytics awesome-list data data-science hacktoberfest

Last synced: 08 Jun 2026

https://github.com/shahsuvarli/election-voters-data-analysis-pandas

Educational project analyzing Azerbaijan voter demographics with pandas, focusing on data cleaning, grouping, and visualization.

cleaning data grouping matplotlib numpy pandas python visualization

Last synced: 12 Apr 2026

https://github.com/etmendz/mendz.data.oracle

Provides a generic Mendz.Data-aware context for ADO.Net-compatible access to Oracle databases.

ado-net context data database datasettings mendz oracle

Last synced: 13 Apr 2026

https://github.com/mumtaz4118/nlp-course

Programming Assignments and Lectures for Stanford's CS 224: Natural Language Processing with Deep Learning

course data data-analysis data-analytics data-science data-visualization deep-learning education machine-learning natural-language-processing neural-network transfer-learning

Last synced: 24 Nov 2025

https://github.com/cpietsch/breitband

developer repo of breitband-berlin

d3js data threejs visualization

Last synced: 02 May 2026

https://github.com/petzi53/repair

R Datasets of the Open Repair Alliance (ORA).

data r repair repair-cafe

Last synced: 19 May 2026

https://github.com/fatihilhan42/hollywood-theatrical-market-synopsis-1995-to-2021

In this project, the data of hollywood film production companies from 1995 to 2021 were examined. Significant tables and graphs were created using data visualization algorithms, with the tickets sold divided into categories.

data data-analysis data-science data-visualization

Last synced: 23 Mar 2025

https://github.com/skygenesisenterprise/aether-meet

Aether Meet is a lightweight, open-source client built for privacy, speed, and seamless integration within the Aether Office ecosystem

applications data docker javascript meeting nextjs notes typescript voip

Last synced: 01 May 2026

https://github.com/hidayathamir/get-telegram-group-data

With these project you can get data in csv file from your telegram group.

bahasa-indonesia data python3 scrape telegram telethon

Last synced: 13 Sep 2025

https://github.com/cljoly/data

📊 Data sets to populate some parts of my website (mostly https://cj.rs/open-source/).

data open-source sqlite wip

Last synced: 03 May 2026

https://github.com/ayush-raj8/godata

Write data to file. Standardizes the format for easy parsing and read by other programs.

data golang

Last synced: 18 Jan 2026

https://github.com/richardlitt/bird-watching

My birdwatching list and repo

birding data ebird

Last synced: 26 Jan 2026

https://github.com/abdiasarsene/edusight-data-driven-insights-for-smarter-education

EduSight transforms educational data into actionable insights, helping NGOs, schools, and policymakers improve academic performance, optimize resources, and evaluate learning programs for better outcomes.

data excel github powerbi

Last synced: 26 Jan 2026

https://github.com/heitang/fcu-classid

逢甲大學:學院 ID 、 系所 ID 和班級 ID

data fcu project

Last synced: 30 Mar 2025

https://github.com/white-gecko/lineage-dump

RDF dump of the device information from the lineage wiki

data dataset lineageos rdf

Last synced: 28 May 2026

https://github.com/quangandrei1003/france_air_pollution_pipeline

End-to-end air pollution data pipeline for French metropolitan cities using Airflow, Python, dbt, BigQuery.

airflow bigquery data data-analytics data-engineering data-modeling data-visualization dbt docker etl pandas python terraform

Last synced: 13 Apr 2026

https://github.com/jigyasag18/aircraft-data-management

This repository offers a comprehensive simulation of global military air deployments involving 10 countries, aircraft models, mission types, and strategic zones. It analyzes air power distribution, mission intent (offensive, defensive, support), and geopolitical positioning. The project provides structured insights into regional & zone level threat

aircraft-data aircraft-performance data data-analysis data-visualization database database-management dataset datavisualisation mysql powerbi powerbi-report powerbi-visuals sql

Last synced: 04 Feb 2026

https://gitlab.com/pommalabs/htmlark

HtmlArk packs a webpage into a single HTML file: https://htmlark-docs.pommalabs.xyz/

audios css data embed fonts html images javascript uri videos

Last synced: 03 Sep 2025

https://github.com/seldszar/piccha

Another tree data structure

data tree

Last synced: 16 Jul 2025

https://github.com/raghavendranhp/youtube_data_harvesting

The "YouTube Data Analyzer" is a versatile tool for businesses and content creators, enabling them to gather, analyze, and harness valuable insights from multiple YouTube channels. With streamlined data collection, storage in MongoDB, migration to SQL, and a user-friendly Streamlit interface, it empowers users to make data-driven decisions

apiintegration data datacollection eda googleapi googleapiclient matplotlib mongodb mysql mysqlconnector numpy oops pandas pymongo python pythonoops sql sqlalchemy streamlit youtube-api

Last synced: 13 Apr 2026

https://github.com/gustavonav/daily-youtube-extraction

Projeto que completa a criação de um ambiente para extração, armazenamento e processamento de dados do Youtube

airflow data minio python3 spark

Last synced: 21 Feb 2026

https://github.com/smaug6739/data-bit

This project is a module for converting a structured dataset into a number that can be stored in a database taking up little space.

bits data nodejs

Last synced: 14 May 2026

https://github.com/faster-games/dynamic-components

Dynamic Runtime Components for Unity3D

data framework unity3d

Last synced: 11 Apr 2026

https://github.com/awpala/udemy-my-courses-data-parser

Download Udemy lists and courses metadata for authenticated student user

data scripts udemy

Last synced: 07 May 2026

https://github.com/turner-kendall/turner-kendall

Turner Kendall - dev, opps, sec.

config data github-config go rust security

Last synced: 31 Oct 2025

https://github.com/fatihemres/fruits

Fruit Details app by SwiftUI. Using data, models, animation and practically onboarding usage.

animations data models onboarding swift swiftui

Last synced: 01 May 2026

https://github.com/jameshenderson12/chatbot-utils

Generic data and elements that can be reused or repurposed for chatbot development.

boilerplate chatbot data development elements intents template utterances

Last synced: 04 Mar 2026

https://github.com/musamairshad/dsa-python

This repository contains all the material related to Data Structures and Algorithms implemented in Python.

algorithms data datastructures efficiency python searching-algorithms sorting-algorithms

Last synced: 25 Mar 2025

https://github.com/pbinkley/mfmcollections

Project to distill data about published collections of microfilms from library lists

data research retro

Last synced: 28 May 2026

https://github.com/amethyst-php/activity

Someone just did something, should we save who did this and when?

activity amethyst amethyst-package api data laravel

Last synced: 17 May 2026

https://github.com/stdlib-js/ndarray-vector-uint32

Create an unsigned 32-bit integer vector (i.e., a one-dimensional ndarray).

constructor ctor data javascript ndarray node node-js nodejs stdlib structure types uint32 vec vector

Last synced: 25 Apr 2026

https://github.com/davidkhala/datasets

sample datasets

data

Last synced: 19 Mar 2026

https://github.com/allanotieno254/powerbi-dax-filter-context

This repository contains a Power BI project that explores **DAX Filter Context**, a crucial concept in DAX calculations. The project focuses on **Bank Loan Analysis**, demonstrating how different filter contexts affect DAX formulas.

business-intelligence data data-analysis dax dax-functions powerbi powerbi-visuals visualization

Last synced: 08 Jan 2026

https://github.com/poissonconsulting/klexdatr

An R package of data from the Kootenay Lake Exploitation Study

cran data fish kootenay-lake rstats

Last synced: 16 Oct 2025

https://github.com/basemax/okala-product-ids

A PHP script to fetch and save product IDs from Okala's online store API across multiple categories and store branches.

crawler crawler-okala crawler-php crawlers data database ids ir iran json okala okala-crawler php php-crawler product

Last synced: 09 May 2026

https://github.com/rizkipragustono/extract_from_excel

Excel Contact Data Parser with Country Code Formatting

data excel extract python transform

Last synced: 18 May 2026

https://github.com/abhroroy365/market_analysis

This project explores customer segmentation and market analysis in the context of online retail using an online retail dataset. By applying advanced analytics, we aim to uncover insights that can drive strategic decisions and enhance business performance.

clustering data data-analysis data-visualization kmeans-clustering machine-learning market-analysis python silhouette-analysis

Last synced: 09 May 2026

https://github.com/jpcurada/exploralytics

A python package for creating intermediate plotly visualizations

data eda plotly python visualization

Last synced: 05 Feb 2026

https://github.com/afolabi022/getting-and-cleaning-data-course-project

Tidy Dataset Creation for Human Activity Recognition" This repository contains the code and files for cleaning and transforming the Human Activity Recognition Using Smartphones dataset into a tidy format. The project demonstrates data wrangling skills in R, including merging datasets

data data-science datacleaning r

Last synced: 25 Mar 2025

https://github.com/soenneker/soenneker.attributes.mapto

A C# attribute for generic data mapping translation

attributes columns csharp data datatables dotnet mapping mapto maptoattribute object

Last synced: 02 Mar 2026

https://github.com/badranalyst/covid-deaths-dashboard-with-tableau

This project showcases an interactive dashboard developed in Tableau to visualize COVID-19 deaths data. It provides insights into trends, geographical distributions, and key metrics related to mortality during the pandemic. The dashboard aims to enhance understanding of the data, supporting public health analysis and decision-making.

covid-19 dashboard data data-analysis data-visualization dataset tableau tableau-dashboards visualization

Last synced: 02 Mar 2026

https://github.com/j2kun/terrorism-usa-post-9-11

A copy of the terror data published by NewAmerica

data politics terrorism transparency

Last synced: 02 Mar 2026

https://github.com/mominurr/fire-gas-leak-detection-system

A real-time fire prevention system integrating IoT sensors and computer vision to trigger evacuations.

ai computer-vision data datascience machine-learning ml python yolo

Last synced: 27 Jan 2026

https://github.com/isandyawan/simplelinearregression

A application to analyze data using simple linear regression. This application can make regression model from variable and give advice to user if the model break regression assumsion

data linear r regression rstudio shiny statistic

Last synced: 14 Oct 2025