An open API service indexing awesome lists of open source software.

data

Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)

https://github.com/laguer/jupyterdatascienceworkflow

Jupyter Notebook dedicated to studying Agriculture and AMI analytics

agriculture amis corn data fao jupyter maize oecd rice science soja

Last synced: 11 Oct 2025

https://github.com/mr-chang95/udacity-starbucks-challenge

Data Science Project for Udacity's Data Scientist Program. Using Python in Jupyter Notebook.

data data-science data-visualization numpy pandas sklearn

Last synced: 14 Apr 2026

https://github.com/sebhoss/countries-and-cities

dolt database for countries and their cities

cities countries data database dolt

Last synced: 11 Oct 2025

https://github.com/sanand0/marvel-powers

Scrapes Marvel Fandom for character powers

data

Last synced: 12 Oct 2025

https://github.com/equinor/sumo-wrapper-python

Thin python wrapper to interact with Sumo API

analytics data fmu python subsurface sumo

Last synced: 19 Jan 2026

https://github.com/thanhleviet/vietnam_antibiotics_bidding

This repo contains data of bidding for multiple drugs and antibiotics reported to Vietnam Ministry of Health in 2015, 2016, 2017.

antibiotics data vietnam

Last synced: 23 Feb 2026

https://github.com/madhuresh2011/daily-sql-from-hackerrank

Welcome to my SQL Series, where I tackle SQL problems from HackerRank on a daily basis.

data dataanalysis database question-answering sql

Last synced: 19 Jan 2026

https://github.com/0xnu/nfl-picks

NFL match prediction with scores using historical data (1999-Present).

american-football data nfl prediction

Last synced: 12 Oct 2025

https://github.com/adadalshabab/data-engineering-gcp-project

An end-to-end modern data engineering project, including deployment of ETL pipeline on Google Cloud Platform, using BigQuery for data analysis and leveraging Looker to generate an insight dashboard.

bigquery data data-science data-visualization databases dataengineering-a engineering etl-pipeline looker-studio powerbi

Last synced: 19 Jan 2026

https://github.com/tyriek-cloud/nyc-dca-etl

Created an ETL pipeline to merge two CSV files (converted to JSON) into a Parquet file using Azure Data Factory. The data was extracted from NYC Open Data (https://opendata.cityofnewyork.us/), and I created a Blob Container within an existing storage account.

azure azure-data-factory blob-storage data data-engineering etl-pipeline

Last synced: 21 Jan 2026

https://github.com/luminati-io/httpx-web-scraping

Web scraping using HTTPX in Python, covering setup, advanced features, comparisons with Requests, and more.

beautifulsoup data html httpx python web-scraper web-scraping

Last synced: 13 Oct 2025

https://github.com/mikeschinkel/go-testdata-defaulter

Simple package for Go to set table-driven test data defaults so that tables in tests only need include data that differs from defaults.

data defaults package testing tests

Last synced: 13 Oct 2025

https://github.com/donghquinn/gopandas

gopandas

data go golang

Last synced: 14 Oct 2025

https://github.com/tabarzin/dh

A collection of links to various resources on Digital Humanities

data digitalhumanities opensource

Last synced: 24 Jan 2026

https://github.com/odiegosilva1/flask-github-style

Login page using Jinja in Flask.

data flask jinja2-templates orm python

Last synced: 31 May 2026

https://github.com/digital-media/cv_data

Datasets used for courses/tutorials at the Digital Media Department

computer-vision data image-processing images

Last synced: 14 Oct 2025

https://github.com/soenneker/soenneker.data.email.disposables

Simply adds a list of compiled disposable/temporary email domains, updated daily (if available)

csharp data disposable disposables domain dotnet email mailinator

Last synced: 29 May 2026

https://github.com/brandonzylstra/essence

🧘🏼‍♂️ Relaxed Rails Modeling & Migrations

active-record data database gem hcl modeling rails ruby ruby-on-rails yaml

Last synced: 14 Apr 2026

https://github.com/yagoluiz/enem-analise-extracao

[PT-BR] Extraction and analysis of performance data for the Centro-Oeste (Central-West) region

analysis data extraction python3 r

Last synced: 17 Apr 2026

https://github.com/jigyasag18/project-diwali-sales-analysis

This project analyzes retail sales data during the Diwali festival using exploratory data analysis (EDA) to identify buyer demographics and product preferences. The findings reveal that the primary purchasers are married women aged 26-35 from Uttar Pradesh, Maharashtra, and Karnataka, working in IT, Healthcare, and Aviation.

analysis data datapr datapro eda jupyter-notebook python realtimedata

Last synced: 01 Jun 2026

https://github.com/rizkipragustono/extract_from_excel

Excel Contact Data Parser with Country Code Formatting

data excel extract python transform

Last synced: 18 May 2026

https://github.com/j-sephb-lt-n/personal-projects

A history of my personal projects and professional development

ai api auth cloud data llms personal-development web

Last synced: 24 Jan 2026

https://github.com/tyriek-cloud/statistical-work-sample

The purpose of this study is to test whether having siblings is independent of holding an opinion on whether patients with incurable diseases should be allowed to die.

analysis data spss statistics t-test

Last synced: 22 Jan 2026

https://github.com/fatihilhan42/nba-players-data-1950-to-2021

This project examines data on NBA players from 1950 to 2021. Players' seasons, heights, performance, points averages, teams, and positions played were obtained from CSV files, and key tables and graphs were then created using data cleaning and data visualization techniques.

data data-analysis data-engineering data-science data-visualization

Last synced: 16 Oct 2025

https://github.com/vanduc1102/parse-stackoverflow-data

Parse stackoverflow data

data parser stackoverflow

Last synced: 16 Oct 2025

https://github.com/saboye/sales-performance-analysis

A dashboard that presents monthly sales performance by product segment and product category to help clients identify the segments and categories that have met or exceeded their sales targets, as well as those that have not.

dashboard data data-science eda tableau visualization

Last synced: 27 Jan 2026

https://github.com/mat06mat/matbot

My discord bot code

data discord-bot discord-py py-cord

Last synced: 17 Oct 2025

https://github.com/ronknight/user-data-dashboard

📈 A data visualization tool for analyzing user data using an Excel-based data source.

dashboard data excel ga4 screenshot

Last synced: 17 Oct 2025

https://github.com/enoch208/eventmaster

A user-friendly application that helps you easily record and play back your keyboard and mouse actions. With its modern design using `tkinter` and `ttkthemes`, it provides a smooth and easy-to-use interface. The app combines reliable technical features to give you a great experience.

automation data key keylogging-python replay spy tools

Last synced: 01 Jun 2026

https://github.com/meokullu/colorizenumber

ColorizeNumber - Bodrum Papatya visualizes numeric data as colors to create an image.

color colorize colors data data-visualization visualization vizualize-data

Last synced: 01 Jun 2026

https://github.com/psgebeline/harvard-data-science

My work for the nine courses in Harvard's data science program, each with notes/assignments. Work in progress.

data linear-regression machine-learning modeling probability-theory r visualization wrangling

Last synced: 19 Oct 2025

https://github.com/parvezk/d3-fundamentals

D3 library API fundamentals

charts d3 data graphs visualization

Last synced: 19 Oct 2025

https://github.com/erencelik/binance-public-data-node

Nodejs downloader and unzipper script for Binance Public Data

binance data downloader nodejs public script

Last synced: 15 May 2026

https://github.com/octoenergy/tentaclio-snowflake

A python project containing all the dependencies for snowflake tentaclio schema.

data

Last synced: 20 Oct 2025

https://github.com/zanysoft/virtualcolumn

Laravel virtual column

data laravel virtual-column

Last synced: 12 Apr 2026

https://github.com/mohibmirza-py/email-verifier-script

Streamlit app to verify emails in bulk

ai analysis data streamlit

Last synced: 29 Apr 2026

https://github.com/robertoostenveld/dcn.dsc_62002071_01_114_v1

Simon task M/EEG data [Data set].

data datalad open-data

Last synced: 23 Jan 2026

https://github.com/andrewl/danelaw

Geopackage containing the boundary of the Danelaw

data geospatial medieval viking

Last synced: 23 Jan 2026

https://github.com/jigyasag18/bird-strikes-in-aviation-project

This project analyzes over a decade of U.S. bird strike data (2000–2011) to evaluate safety risks, damage trends, and cost implications in aviation. Using PostgreSQL for database management and Power BI for dashboard visualization, it uncovers critical insights into when, where, and how wildlife impacts aircraft. Key findings inform strategically.

bird-strike-prevention bird-strike-prevention-in-real-airport data data-analysis data-analysis-project data-visualisation data-visualization data-visualization-project data-visualizations database dataset dax-query postgresql postgresql-database powerbi powerbi-desktop powerbi-report powerbi-visuals sql sql-database

Last synced: 09 May 2026

https://github.com/brianlesko/r_data_science_stat5730

Written by Brian Lesko, this repository contains R scripts demonstrating data science topics, largely originating from study at Ohio State. Contents are written in RStudio using R Markdown files. As of 1/21/23, future projects concerning data science, statistics, and machine learning will be in Python in my machine learning repository.

data data-analysis flight-data ggplot2 olympics-data r-markdown tidyverse

Last synced: 23 Jan 2026

https://github.com/dhanish03/reliance-sales-report-dashboard

This project, Reliance Sales Report Dashboard, showcases a dynamic and interactive Power BI dashboard designed to analyze sales performance. The dashboard provides key insights into various aspects of sales data, including product-wise performance, region-based revenue, and profitability trends.

data datavisualization-project powerbi visualization

Last synced: 23 Jan 2026

https://github.com/harmanveer-2546/reducing-data-entries

A way to delete data entries from a CSV/Excel file. For an Excel file, use excel instead of csv in the code.

csv data data-entry delete-data excel numpy pandas python

Last synced: 05 May 2026

https://github.com/knowcnu12/metamask-wallet-recovery-funds-phrase-data-seed-token

This repository provides tools and guidelines for securely recovering MetaMask Wallet funds using recovery phrases, seed data, and tokens. It ensures safe and reliable methods for recovering access to your wallet and managing your cryptocurrency assets.

bitcoin blockchain cryptocurrencies cryptocurrency data ethereum funds metamask metamask-bot metamask-desktop metamask-extension metamask-plugin metamask-snap metamask-wallet phrase recovery seed token wallet wallet-security

Last synced: 08 Mar 2026

https://github.com/louis-heraut/dataverseur

🫖 A dataverse API R wrapper to enhance the deposit procedure using only R variable declarations

data data-repository data-science datascience dataset dataverse dataverse-api json metadata metadata-management metadata-parser r

Last synced: 24 Oct 2025

https://github.com/moscatellimarco/webscrap-tinydeal

"WebScrap-TinyDeal" is a Scrapy-powered 🕷️ tool for harvesting product information 🏷️ from TinyDeal. It outputs structured CSV data 📁, ready for analysis. Explore the scripts 👨‍💻 for an interactive scraping adventure or leverage the data for competitive pricing strategies 📈.

css data datascience html pandas python scrapy web webscraper webscraping

Last synced: 14 Apr 2026

https://github.com/mikeasilva/api_data

API Data makes working with open data APIs easy.

api data python

Last synced: 23 Jan 2026

https://github.com/byndyusoft/byndyusoft.data.relational

Relational abstractions for Byndyusoft.Data.Relational.

byndyusoft data dataaccess db relational-databases

Last synced: 25 Oct 2025

https://github.com/uznetdev/smoking-prediction

This project focuses on analyzing the "Smoking" dataset and building a predictive model for smoking status based on various health metrics. The goal is to identify factors influencing smoking behavior and develop a reliable model for prediction.

ai classification data data-science kaggle-competition machine-learning ml roc-auc sklearn smoking

Last synced: 17 Apr 2026

https://github.com/johndelatto/automate-your-job-search-ai-applies-to-1000-positions

Automate Your Job Search: AI Applies to 1000 Positions Overnight & Get 100+ Interviews! In today’s fast-paced and highly competitive job market, finding and securing your dream job can be both time-consuming and exhausting.

ai data non-profit open-ai open-source

Last synced: 28 Jan 2026

https://github.com/zainea-bogdan/data_engineer_project_wowcinema

WoWCinema is a project based on a fictional scenario where I stepped into the role of a Data Engineer, designing and building an end-to-end data infrastructure. An ETL pipeline ingests data from multiple sources, transforms it, and loads it into a centralized PostgreSQL data warehouse to power analytics, KPI tracking, and reporting.

analytics big-data data datawarehousing etl-pipeline postgres python sql

Last synced: 19 May 2026

https://github.com/robertoostenveld/dccn.dsc_3015055.00_583_v1

The FieldTrip-SimBio Pipeline for EEG Forward Solutions [Data set].

data datalad open-data

Last synced: 24 Jan 2026

https://github.com/semcod/code2llm

Python Code Flow Analysis Tool - Static analysis for control flow graphs (CFG), data flow graphs (DFG), and call graph extraction

ast cfg code code2data code2logic code2process data dfg diagram flow graphs llm

Last synced: 01 Jun 2026

https://github.com/eugenedakin/des-encryption-decryption

Encrypt and Decrypt text in Xojo using DES - Written in Native Xojo Language - Cross Platform

data data-encryption-standard decryption des encryption standard xojo

Last synced: 24 Feb 2026

https://github.com/bishtrishu/pizza_sales_analysis_dashboard_sql_bi

Welcome to the Pizza Sales Analysis Dashboard project! This repository contains a comprehensive guide to building an interactive and insightful dashboard for analyzing pizza sales data using SQL and Power BI.

data data-science dataanalyst datavisualization dax dax-query microsoft microsoft-azure microsoft-sql-server msexcel mysql powerbi powerquery project sql

Last synced: 16 Mar 2026

https://github.com/dynamiatools/module-importer

DynamiaTools extension for importing data from Excel files

data dynamia excel import java zk

Last synced: 06 Feb 2026

https://github.com/maxisoft/yahoo-finance-data-downloader

Automate downloading historical and recent stock data from Yahoo Finance.

data stock-market yahoo-finance

Last synced: 29 Jan 2026

https://github.com/audeering/datasets

Data cards for public audb datasets

audb audio data management

Last synced: 29 Jan 2026

https://github.com/aimin-nur/data-analyst

A data analyst (machine learning) project that analyzes used Ford car prices based on an existing dataset and identifies which features or columns affect those prices.

analytics data mechine-learing visualization

Last synced: 29 Jan 2026

https://github.com/apigear-io/template-qtcpp

QtC++ technology template

data plugin qml qt qt5

Last synced: 25 Feb 2026

https://github.com/dfsp-spirit/neuroimaging_testdata

Contains test data for unit tests, used in developing neuroimaging software. Ignore this. Licenses in the individual archives.

data unittesting

Last synced: 25 Feb 2026

https://github.com/bearaujus/bdatamatrix

Structured Tabular Data Management in Go

data go golang matrix

Last synced: 30 Jan 2026

https://github.com/mreshboboyev/elastic-search-dotnet

A powerful and easy-to-use .NET library for integrating Elasticsearch, enabling fast full-text search, scalable indexing, and advanced data analytics in your applications.

analytics c-sharp data dotnet-core elastic-search full-text indexing open-source scalable search

Last synced: 30 Jan 2026

https://github.com/lut-ful/pizza-sales-report

This Pizza Sales Report provides valuable insights into sales performance through detailed analysis and visualizations. By leveraging Power BI and SQL Server

data data-wrangling microsoft-sql-server power-bi power-bi-dax python

Last synced: 30 Jan 2026

https://github.com/denisecase/dc-texter

Send a text message using Python

alerts data python sms-messages streaming

Last synced: 08 Feb 2026

https://github.com/pythoncoderunicorn/jamesbeardaward

a repo for James Beard Award data

data dataset jamesbeard

Last synced: 07 Feb 2026

https://github.com/ms140569/loki-example-store

Test data for the loki password manager

data

Last synced: 26 Feb 2026

https://github.com/mehmetkahya0/earthquake-tracker

Earthquake Tracker: a real-time earthquake monitoring application that visualizes seismic activity worldwide using interactive maps and data visualization.

ai api css cursor data data-vizualisation earth-observation earthquake earthquake-data earthquake-visualization earthquakes html js modern-web scrape ui ui-design web

Last synced: 15 Apr 2026

https://github.com/gman-au/white-knight

Experimental .NET data abstraction using specification pattern

abstractions data datastore dotnet repository-pattern specification-pattern

Last synced: 17 Mar 2026

https://github.com/michaelfromyeg/lyrics

Lyric-store and API hosted on Git.

data lyrics

Last synced: 08 Feb 2026

https://github.com/samaalharbi2/project-recommendation-system

This project focuses on building a Recommendation System using real interaction data from IBM's Watson Studio platform.

clustering data ibm-watson kmeans nlp python rec svd udacity-nanodegree

Last synced: 09 Feb 2026

https://github.com/myles-parfeniuk/esp32_sdlogger

C++ esp-idf driver component for SD cards interfaced via SPI. WIP

card data esp-idf esp32 logger sd sdcard sdmmc sdspi spi

Last synced: 09 Feb 2026

https://github.com/metapsy-project/data-panic-psyctr

Database of psychotherapy for panic disorder compared to control conditions

data

Last synced: 18 Mar 2026

https://github.com/neurazum-ai-department/tumor-stages-dataset---v1

Synthetic MRI data generated by the 'HF' and 'Vbai' models based on real data.

brain data dataset datasets image mri neuroscience tumor tumor-segmentation

Last synced: 18 Mar 2026

https://github.com/haroontrailblazer/machine_learning

A curated resource hub for learning machine learning, featuring tutorials, code examples, datasets, and hands-on projects to build foundational skills and explore real-world applications.

data data-analysis data-visualization database dataset gradient-descent machine-learning pandas python3 random-forest sklearn statistics

Last synced: 16 Apr 2026

https://github.com/dysnomia-studio/achieve-games-dump

Public dump of parts of the achieve.games database, including the Steam games list

data dump games steam steam-api steam-game steam-games

Last synced: 27 Feb 2026

https://github.com/enescidem/twitter-topic-modeling

Topic modeling is an unsupervised method to identify topics in text. This project analyzes tweets from prominent Turkish accounts to uncover underlying themes in their shared content.

data data-science machine-learning nlp topic-modeling twitter x

Last synced: 10 Feb 2026