An open API service indexing awesome lists of open source software.

data

Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)

https://github.com/juangesino/research-project

Course files for Research Project @ University of Amsterdam

data data-science economics stata

Last synced: 02 Jan 2026

https://github.com/atiqurcode/scrap-spec

Scrap data from the html to table html code / json

data html-table json-data scarp

Last synced: 05 Feb 2026

https://github.com/buffdelta/basketball_ref_webscraper

Python package to make webscraping from basketball-reference easy

basketball data python python-library webscraping

Last synced: 14 Jan 2026

https://github.com/seqeralabs/ffq-api

A minimal wrapper to make ffq searches available via a REST API.

api data fastq fetch-fastq ffq genomics

Last synced: 15 Aug 2025

https://github.com/supremkc05/global-job-market-analytics

Scrape jobs from websites like Indeed/LinkedIn, extract skills using NLP, then visualize hiring trends.

beautifulsoup data machine-learning nlp pandas scrapping

Last synced: 14 Aug 2025

https://github.com/dahmansphi/analysis_from_start_to_end

The Big Bang of Data Science- Analysis from the Start to The End- [Book Two]

analysis data data-analytics data-mining data-science hypothesis-testing jamovi machine-learning

Last synced: 08 Jan 2026

https://github.com/jooapa/bytebrother

Byte Brother is watching YOU

data data-analysis security

Last synced: 26 Jan 2026

https://github.com/fcoagz/rate-reader-epv

pyDolarVenezuela API utilities, image processing (EnParaleloVzla) to extract currency exchange rates from specific platforms, validating content against expected patterns

data finance json processing-images pydolarvenezuela

Last synced: 14 Jun 2025

https://github.com/vasak-os/hydriam-data

Data for hydriam menu

data linux menu vasak

Last synced: 04 Oct 2025

https://github.com/2kabhishek/pybank

Data Analysis for the silliest Bank πŸ’°πŸ¦

csv data data-science learning pandas python topic1 topic2

Last synced: 12 May 2026

https://github.com/itsachrafmansari/moroccan-real-estate-analysis

Scrape, process, analyze, and visualize data from Avito.ma to uncover current trends in Morocco's real estate market.

api-scraping data data-analysis data-mining data-science data-scraping data-visualization eda exploratory-data-analysis morocco real-estate web-scraping

Last synced: 13 Aug 2025

https://github.com/rse/nebulize

Nebulize Security-Sensitive Information

data dsgvo gdpr information nebulize security sensitive

Last synced: 16 Mar 2025

https://github.com/elijah-1994/pre-process-e-commerce-dataset

Importing, Cleaning, and Pre-Processing E-Commerce Data for Analysis Using MySQL.

analytics data dataanalytics datacleaning dataprocessing mysql mysql-database sql

Last synced: 11 Mar 2025

https://github.com/jleung51/foundations-dags

Data ETL pipeline to clean, process, and aggregate data from Canadian housing starts.

data data-engineering etl extract housing load pipeline transform

Last synced: 04 Oct 2025

https://github.com/bocchilorenzo/hugginginfo

Unofficial library to retrieve information from the HuggingFace website.

api data huggingface scrape

Last synced: 03 Apr 2026

https://github.com/itsmeyogesh22/Solved-8-Weeks-SQL-Challenge-Correct-Solutions

Included in Serious SQL Virtual apprenticeship program, this repository contains solutions for all eight different case studies crafted by Danny Ma. For more information please visit: https://8weeksqlchallenge.com/

8weeksqlchallenge data dataanalytics datawithdanny postgresql sql sqlserver-2022 t-sql

Last synced: 29 Aug 2025

https://github.com/anand-sony/mttr-dashboard

Streamlit dashboard for MTTR analysis with shift-wise loss insights and machine-level downtime tracking.

analytics business-analytics dashboard data python statistical-analysis

Last synced: 30 May 2026

https://github.com/nafisalawalidris/nafisalawalidris

Configuration files for my GitHub profile. Welcome to my GitHub profile! I'm Nafisa Lawal Idris, a passionate Data Scientist with a strong interest for blockchain technology. Explore my GitHub portfolio to delve into the exciting world where data science and Bitcoin converge.

artifical-intelligence bitcoin config data data-science developer github-config github-pages machine-learning

Last synced: 16 May 2026

https://github.com/luminati-io/jupyter-notebooks-web-scraping

Perform web scraping interactively using Jupyter Notebooks, integrating coding, data analysis, and visualization into one seamless workflow.

beautifulsoup4 data jupyter jupyter-notebook pandas python requests seaborn virtual-environment web-scraper web-scraping

Last synced: 13 Apr 2026

https://github.com/rajkumarbestha/nsedataextractor

NSEDataExtractor

data python python3

Last synced: 26 Mar 2025

https://github.com/robthree/cfnreader

Provides a simple way to read FNIRSI's CFN files (*.cfn) produced by the FNIRSI UsbMeter tool

cfn csv data fnirsi usb usb-tester

Last synced: 01 Mar 2025

https://github.com/aaisha-nexus/sql_company_insights

A beginner-friendly SQL project for managing employee records, departments, and sales transactions. Includes table creation, optimized queries, stored procedures, and window functions to extract business insights.

business-analytics data data-analysis dataanalysis-projects dataanalytics database-schema mssql-database query relational-databases sql sql-query ssms

Last synced: 12 Aug 2025

https://github.com/kadirlofca/unity-csvmaker

Quick and easy way to create and export .csv files from Unity.

csharp data database unity

Last synced: 09 Apr 2026

https://github.com/sandysanthosh/aspose-doc-to-pdf

Document & Browser object model

aspose build data doc java pdf

Last synced: 04 Jun 2026

https://github.com/programmer-rd-ai/competitive-programming-solutions

A collection of my solutions to various competitive programming problems from platforms like LeetCode. This repository serves as a personal archive of my problem-solving journey, covering a range of algorithms, data structures, and problem-solving techniques.

algorithm algorithms algorithms-and-data-structures data datastructures dsa javascript pandas python structures

Last synced: 01 Mar 2025

https://github.com/keziatbnn/supervised-regression-salaryprediction

Make salary predictions based on years of experience using supervised regression.

data data-analysis-python data-prediction data-science python

Last synced: 11 Aug 2025

https://github.com/arkanovicz/skorm

Simple Kotlin Object Relational Mapping

data database model orm sql

Last synced: 19 Apr 2026

https://github.com/mcraiha/datagensharp

C# managed library for generating data

csharp data generator

Last synced: 11 Aug 2025

https://github.com/roshaka/samplr

Samplr is a Python decorator for selecting a subset of items from a list, with options for customisation and informative console printouts.

data data-analysis data-engineering decorators list python sampling

Last synced: 14 Jan 2026

https://github.com/austinhartzheim/career-fair-backend

Backend for ECS Career Fair app

data django python

Last synced: 13 Apr 2026

https://github.com/andrii04/andreamonforte-bi-assignment

Automated Data Pipeline that ingests daily GA4-formatted CSV files from a private Google Cloud Storage bucket, validates and loads them into BigQuery, and prepares analysis-ready views. The solution is built for deployment as a Cloud Function triggered by Cloud Scheduler and uses Python with the Google Cloud Storage and BigQuery client libraries.

automation bigquery cloud cloudfunctions data data-analysis data-engineering etl etlpipeline gcp google googlecloudplatform pipeline python sql

Last synced: 09 Nov 2025

https://github.com/blueheron786/quranic-universal-library-mushaf-layouts

The Quranic Universal Library (QUL)'s Qur'an mushaf 15-line layouts (madini, uthmani)

data database layout mushaf quran sqlite uthmani uthmani-quran

Last synced: 13 Apr 2026

https://github.com/0xhericles/ufcg-geojson

GeoJSON file containing the blocks and buildings of the Federal University of Campina Grande.

data data-visualization geojson map open-source ufcg university

Last synced: 09 Feb 2026

https://github.com/ashita-ai/ashita-ai.github.io

Ashita AI - The island of misfit data tools

ai data

Last synced: 19 Feb 2026

https://github.com/stupidcucumber/elephant-crawler

System for mining texts from websites.

data data-mining-python python

Last synced: 25 Apr 2026

https://github.com/ahmad-ali-rafique/heart-disease-detection-model

A comprehensive project for detecting heart disease using machine learning, including data processing, model training, and evaluation metrics with AUC curve analysis.

artificial-intelligence data datascience heart-disease machine-learning modeling prediction-model

Last synced: 11 Aug 2025

https://github.com/danielrosehill/value-factors-data-vis

Streamlit app containing visualisations of the Global Value Factors Database (GVFD) released by the IFVI in 2024

data data-visualization sustainability sustainability-data

Last synced: 29 Jul 2025

https://github.com/srindot/fwuav-average-flight-data-collection

This repository is designed for collecting average data for a flapping wing UAV. The script acg_coeff_data_collection.py runs the necessary data collection, and the resulting data is saved into a CSV file called AverageFlightData.csv.

data flaping-uav

Last synced: 10 Aug 2025

https://github.com/ometman/vet-clinic

This is a database project for vetinary data management for animals, owners, clinic employees and visits; and applicable to any data management need. It uses Postgresql, a relational database management system. It allows storing, updating and querying.

data database normalization postgresql postgresql-database queries sql sql-server-database tables transactions

Last synced: 13 May 2026

https://github.com/fabsdevx/files-to-database-loader-handout

Data Engineering project for learning purposes. Credits to itversity

csv data data-engineering database json pandas python

Last synced: 09 Apr 2026

https://github.com/0xkibh/datamining-algo

This repository consist data mining algorithm implementation example in python

apriori-algorithm data datamining fp-growth python

Last synced: 19 May 2026

https://github.com/lukakerr/us-surnames

US Surname data visualisation using R. Displays top 25 US surnames and race/ethnic percentage per name.

data data-visualization r

Last synced: 05 Oct 2025

https://github.com/pathilink/ebury_case

Technical case study in Analytics Engineering using BigQuery, focusing on dimensional modeling and SQL queries for payment and client analysis.

bigquery data modeling sql

Last synced: 05 Oct 2025

https://github.com/affan005-ai/tesla-stock-prediction

This project analyzes Tesla stock data and builds machine learning models to predict and classify stock movements. The analysis includes EDA, feature correlation, moving averages, and two models

data data-analysis data-science data-visualization-project eda machine-learning matplotlib pandas predictive-analytics predictive-modeling python scikit-learn

Last synced: 05 Oct 2025

https://github.com/rysteq/abstract-data-structures

This repository contains two programs written in C about the stack and queue ADT's

abstract-data-structures c data queue stack

Last synced: 06 Oct 2025

https://github.com/chubek/pyramid-dashboard

A Dashboard to Show Data Made Using Plotly Dash

dash data docker ml plotly plotly-dash python

Last synced: 19 May 2026

https://github.com/vim89/flowforge

Let's be honest - most data pipeline frameworks treat types as suggestions. Config files are strings. Schemas are "validated" at runtime. Data quality is an afterthought. So, let's do differently

archetype data data-contracts data-engineering data-pipelines data-quality data-science database dataengineering datapipeline etl etl-framework pipelines scala scalability spark spark-sql spark-streaming

Last synced: 14 Apr 2026

https://github.com/kolyaventuri/covid-act-now

A CovidActNow.org API client

covid data typescript

Last synced: 09 Aug 2025

https://github.com/paul-henryp/simulate-investment-strategies

This Java program simulates different investment strategies using historical stock market data. It allows users to test various strategies such as buy and hold, moving average, buying when the stock price is lower than the last purchase, and dollar-cost averaging.

data data-science investing-java java plots plotting simulated-data simulated-investments sp500 sp500-data-analysis

Last synced: 21 May 2026

https://github.com/eharshit/end-to-end-vendor-insights

End-to-end analysis of vendor performance for wholesale/retail businesses, featuring data ingestion, cleaning, insights, and interactive Power BI dashboards.

analysis analysis-algorithms analytics dashboard data data-analysis datascience jupyter jupyter-notebook pandas powerbi powerbi-report retail wholesale

Last synced: 07 Oct 2025

https://github.com/prajjwol09/sql_retail_analysis_project

This project demonstrates SQL-based data cleaning, exploration, and business analysis on a retail sales dataset. It involves setting up a database, removing null values, performing EDA, and using SQL queries to extract key insights such as top customers, best-selling categories, and monthly sales trends.

data data-analysis datacleaning dataexploration pgadmin4 sql

Last synced: 15 Feb 2026

https://github.com/iankitnegi/tableautales

"Discover my Tableau journey! Dive into data-driven stories, visualizations, and projects as I explore the power of data visualization."

data data-visualization tableau

Last synced: 21 Jan 2026

https://github.com/pythoncoderunicorn/startrek

a repo for Star Trek data from Technical Manuals

data klingon-language star-trek vulcan

Last synced: 07 Oct 2025

https://github.com/ournet/weather-data

Ournet weather data module

data forecast ournet storage weather

Last synced: 07 Oct 2025

https://github.com/rahulthedevil/metric-converter

A simple utility package for converting between metric units such as meters, kilometers, grams, kilograms, liters, and more. Simple and powerful way for Units Convert solution

convert converter data fraction imperial length mass measurements metric metrics ratio system temperature unit unit-conversion unit-converter units uom utilities weight

Last synced: 08 Oct 2025

https://github.com/jacob-pitsenberger/python-electronics-inventory-management-system-object-oriented-programming-project

Welcome to the Python Electronics Inventory Management System project repository! This project is a demonstration of Object-Oriented Programming (OOP) principles in Python for managing an electronic parts inventory.

data data-structures dictionary exception-handling file-io filesystem input-output inventory-management-system management-system modules oop pickle python user-interface

Last synced: 08 Oct 2025

https://github.com/danieljdufour/fast-b64

Quickly Convert between B64 and Binary Strings

b64 base64 base64-decoding base64-encoding binary bits compression data

Last synced: 08 Oct 2025

https://github.com/rahul1582/bank-loan-classification

Classifying whether a person is taking personal loan or not using all the Classification Algorithms.

algorithm analysis classi data

Last synced: 08 Oct 2025

https://github.com/chompfoods/sdk-java

Java SDK for the Chomp Food & Recipe Database API. Use our API to get high-quality data on recipes and 875,000+ branded/grocery foods plus raw ingredients.

api branded chomp data database food gradle grocery ingredients jar java java-sdk nutrition openapi raw recipe-api recipes sdk

Last synced: 09 Apr 2026

https://github.com/djdhairya/whatsapp-chat-analysis

WhatsApp chat analysis is a multidimensional process that delves into the content, structure, and dynamics of conversations within the platform. It provides valuable insights for personal reflection, organizational decision-making, and improving communication strategies.

data data-science dataanalytics datapreprocessing machine-learning ml

Last synced: 08 Oct 2025

https://github.com/shubhamsoni98/classification-with-random-forest-1

To classify sales into categories (Low, Moderate, High) using Random Forests to inform strategic decisions and optimize marketing strategies.

algorithms anaconda data data-science datacleaning eda jupyter-notebook machine-learning pyhton random-forest scikit-learn visualization

Last synced: 18 Jan 2026

https://github.com/mapaor/horaris-rodalies

Web que utilitza la API de rodalies de Catalunya per mostrar els horaris d'una manera mΓ©s divertida

adif api ave barcelona bordils catalunya dades data distancia generalitat girona horaris md r11 regional renfe rodalies sants tren viajes

Last synced: 16 May 2026

https://github.com/mchenryspagg/wrangle-and-analyze-data

This project which is known as 'wrangle and analyze data' involves the wrangling of WeRateDogs twitter archive data from the period of 2015 to 2017

api data dataanalysis datacollection datawrangling datetime json numpy os pandas pil python requests tweepy-api visualization

Last synced: 09 Apr 2026

https://github.com/preritdas/covidactnow

A wrapper for the Covid Act Now database of live COVID-19 state-based statistics.

api covid covid-19 data python python3 science wrapper

Last synced: 09 Oct 2025

https://github.com/jeswr/blog

My personal blog

ai blog data semantics solid web

Last synced: 13 Feb 2026

https://github.com/psyteachr/sdg-data

Data relevant to the UN Sustainable Development Goals

data

Last synced: 09 Oct 2025

https://github.com/sourceduty/clock_metadata

πŸ•’ Recording time data and statistical metadata to .csv files.

clock data data-science metadata practice python time timing

Last synced: 08 Aug 2025

https://github.com/quetz-al/quetzal-openapi-client

Autogenerated Python client for the Quetzal API

client data data-science openapi-client openapi3 python quetzal

Last synced: 10 Oct 2025

https://github.com/sillyash/untappd-viz

A data visualisation page using public datasets and HTML/CSS/JS with D3.js.

beer beer-statistics data data-analysis data-visualization kaggle kaggle-dataset public-dataset school-project

Last synced: 18 May 2026

https://github.com/sourceduty/text_file_metadata

πŸ“„ Extract metadata from .txt files and record the metadata in .txt files.

data datascience metadata metafile practice sourceduty

Last synced: 08 Aug 2025

https://github.com/loaiwalid07/automation_data_overviwe

This is Streamlit app that gives an overview for a dataset you upload

automation data data-analysis data-exploration data-science data-transformation data-visualization

Last synced: 19 May 2026

https://github.com/theopenwebjp/theopenweb-data-loader

Package for loading data to local project

data downloader import javascript typings

Last synced: 10 Oct 2025

https://github.com/j-sephb-lt-n/joes_giant_toolbox

A large collection of general python functions and classes that I use in my daily work

ascii browser classifier data dataviz gcp mime nlp python regex search statistics supervised web-scraping

Last synced: 10 Oct 2025

https://github.com/bastianolea/minsal_suicidios

Casos de intento de suicidio y suicidio consumado en Chile

chile comunas data genero salud tiempo

Last synced: 19 Jan 2026

https://github.com/azkarmoulana/winter-of-data-2019

:snowflake: :snowman: Winter of Data is coming..... :wolf:

data data-science machine-learning mathematics

Last synced: 05 Feb 2026

https://github.com/loggdme/kyro

Collection of utilities and examples for creating efficient data pipelines in go with parallel queues and, rate limitiers and much more.

data package

Last synced: 14 Jan 2026