An open API service indexing awesome lists of open source software.

data

Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)

https://github.com/bdr-pro/streamlint

ltra-cool Streamlit app, where you can interact with widgets, see data in action, and even upload and download files

data streamlit

Last synced: 14 Apr 2026

https://github.com/vanduc1102/parse-stackoverflow-data

Parse stackoverflow data

data parser stackoverflow

Last synced: 16 Oct 2025

https://github.com/bhemen/aave-data

Borrowing and lending data sets from the Aave protocol on Ethereum

aave borrow data ethereum lend python

Last synced: 05 Feb 2026

https://github.com/saboye/sales-performance-analysis

A dashboard that presents monthly sales performance by product segment and product category to help clients identifying the segments and categories that have met or exceeded their sales targets, as well as those that have not met their sales targets.

dashboard data data-science eda tableau visualization

Last synced: 27 Jan 2026

https://github.com/jleung51/foundations-dags

Data ETL pipeline to clean, process, and aggregate data from Canadian housing starts.

data data-engineering etl extract housing load pipeline transform

Last synced: 04 Oct 2025

https://github.com/bocchilorenzo/hugginginfo

Unofficial library to retrieve information from the HuggingFace website.

api data huggingface scrape

Last synced: 03 Apr 2026

https://github.com/meokullu/colorizenumber

ColorizeNumber - Bodrum Papatya, visualizes numeric data into colors which creates an image.

color colorize colors data data-visualization visualization vizualize-data

Last synced: 01 Jun 2026

https://github.com/parvezk/d3-fundamentals

D3 library API fundamentals

charts d3 data graphs visualization

Last synced: 19 Oct 2025

https://github.com/aaisha-nexus/sql_company_insights

A beginner-friendly SQL project for managing employee records, departments, and sales transactions. Includes table creation, optimized queries, stored procedures, and window functions to extract business insights.

business-analytics data data-analysis dataanalysis-projects dataanalytics database-schema mssql-database query relational-databases sql sql-query ssms

Last synced: 12 Aug 2025

https://github.com/kadirlofca/unity-csvmaker

Quick and easy way to create and export .csv files from Unity.

csharp data database unity

Last synced: 09 Apr 2026

https://github.com/erencelik/binance-public-data-node

Nodejs downloader and unzipper script for Binance Public Data

binance data downloader nodejs public script

Last synced: 15 May 2026

https://github.com/octoenergy/tentaclio-snowflake

A python project containing all the dependencies for snowflake tentaclio schema.

data

Last synced: 20 Oct 2025

https://github.com/cemc-oper/nmc-typhoon-db-client

A CLI client for NMC Typhoon Database.

data database-client nmc

Last synced: 01 Jun 2026

https://github.com/dilkushsingh/webscraping-with-selenium-and-beautifulsoup

Web Scrapped a popular tech gadgets website using Selenium and BeautifulSoup, also performed Data Analysis on scrapped data.

beautifulsoup data datacleaning datagathering eda exploratory-data-analysis python selenium webscraping

Last synced: 24 Feb 2026

https://github.com/zanysoft/virtualcolumn

Laravel virtual column

data laravel virtual-column

Last synced: 12 Apr 2026

https://github.com/vidupriya/aws-glue--data-copy

The function for copying data like CSV, Parquet, avro etc., from a source S3 bucket to a destination S3 bucket using AWS Glue. It includes the necessary setup for the Glue job, logging, reading data from the source bucket, and writing it to the destination bucket

aws awsglue awss3 data data-copying glue glue-job pyspark python3 s3 s3-bucket s3-buckets s3-storage spark

Last synced: 02 May 2026

https://github.com/mohibmirza-py/email-verifier-script

Streamlit app to verify emails in bulk

ai analysis data streamlit

Last synced: 29 Apr 2026

https://github.com/keziatbnn/supervised-regression-salaryprediction

Make salary predictions based on years of experience using supervised regression.

data data-analysis-python data-prediction data-science python

Last synced: 11 Aug 2025

https://github.com/mcraiha/datagensharp

C# managed library for generating data

csharp data generator

Last synced: 11 Aug 2025

https://github.com/politicaargentina/opinar

📈 ICG toolbox for R - Indice de Confianza en el Gobierno 🇦🇷 (Universidad Torcuato Di Tella)

argentina data political-science politics public-opinion

Last synced: 22 Oct 2025

https://github.com/robertoostenveld/dcn.dsc_62002071_01_114_v1

Simon task M/EEG data [Data set].

data datalad open-data

Last synced: 23 Jan 2026

https://github.com/andrii04/andreamonforte-bi-assignment

Automated Data Pipeline that ingests daily GA4-formatted CSV files from a private Google Cloud Storage bucket, validates and loads them into BigQuery, and prepares analysis-ready views. The solution is built for deployment as a Cloud Function triggered by Cloud Scheduler and uses Python with the Google Cloud Storage and BigQuery client libraries.

automation bigquery cloud cloudfunctions data data-analysis data-engineering etl etlpipeline gcp google googlecloudplatform pipeline python sql

Last synced: 09 Nov 2025

https://github.com/0xhericles/ufcg-geojson

GeoJSON file containing the blocks and buildings of the Federal University of Campina Grande.

data data-visualization geojson map open-source ufcg university

Last synced: 09 Feb 2026

https://github.com/andrewl/danelaw

Geopackage containing the boundary of the Danelaw

data geospatial medieval viking

Last synced: 23 Jan 2026

https://github.com/ashita-ai/ashita-ai.github.io

Ashita AI - The island of misfit data tools

ai data

Last synced: 19 Feb 2026

https://github.com/thais81/gamesbox

Another desktop app in JSE/Jswing with hangman game and tic-tac-toe game. This project was made at LDNR school with 4 friends

data database hangman-game jse tictactoe tictactoe-game

Last synced: 28 Jan 2026

https://github.com/sankooc/validatez

object validation for node

data validate

Last synced: 13 May 2026

https://github.com/12458/99co

99co Web Scraping

99co data property scraper website

Last synced: 02 May 2026

https://github.com/ethenkem/pygraphsurvey

A python base web app that provide graphical analysis on data collected from surveys and the system has its on built in form fiiling where admin can set question and sent a link for the forms to be filled and then the system provide anylysis on the collected data. Form feature include selection options, range values file inputs etc

data

Last synced: 12 Jan 2026

https://github.com/ramonrsv/f1_data

Provides consolidated access to various sources of Formula 1 information and data, including event schedules, session results, timing and telemetry data, as well as historical information about drivers, constructors, circuits, etc.

data f1 rust

Last synced: 07 Apr 2026

https://github.com/uzinfocom-org/archive

📦 | Archived projects that aren't used anymore

archive archive-data data notused

Last synced: 01 Sep 2025

https://github.com/allanotieno254/spss-nutrition-research

This repository contains the results of statistical analyses performed in IBM SPSS Statistics on a child nutrition dataset.

data data-preprocessing dataanalysis spss

Last synced: 17 Feb 2026

https://github.com/ppmim/papi4k_old2

PAPI: the PANIC data reduction pipeline

data near-infrared pipeline processing

Last synced: 23 Jun 2025

https://github.com/stdlib-js/array-base-assert-any-has-property

Test whether at least one element in a provided array has a specified property, either own or inherited.

any array assert data generic has javascript node node-js nodejs prop property stdlib structure test types validate

Last synced: 07 May 2025

https://github.com/johndelatto/-universities-to-pursue-a-master-s-degree-in-machine-learning

Best Master’s Programs in Machine Learning (ML) for 2021 These are the best universities to pursue a master’s degree in machine learning, with research rankings in AI and machine learning

ai api data education project school

Last synced: 17 Jun 2025

https://github.com/darshjasani/claims-analysis

This repository contains a comprehensive analysis of claims data, detailing the workflow from data preprocessing to model evaluation. The goal of this analysis is to build predictive models to improve claims prediction and management.

analysis data linear machine-learning python

Last synced: 16 May 2026

https://github.com/mawiegand/automatic-point-label-placement-data

Test instances for the automatic point label placement problem.

data datastructures generator javascript labeling problem ruby

Last synced: 16 May 2026

https://github.com/webianks/anotech-android

Android application which deals on various anomalous behaviour that occur on server data.

anomaly-detection data server

Last synced: 13 Apr 2025

https://github.com/stdlib-js/array-base-banded-filled2d-by

Create a filled two-dimensional banded nested array according to a provided callback function.

alloc allocate array callback data fill filled foreach generic javascript map matrix multidimensional node node-js nodejs stdlib strided structure types

Last synced: 19 May 2026

https://github.com/christopherandrewtopalian/catopalian_javascript_data_navigator

A JavaScript application that allows for easy sorting of data. Easily navigate through any amount of data using button filters.

data javascript sorting

Last synced: 13 Apr 2025

https://github.com/cmdrvl/profile

profile manages column-scoping configurations for report tools — defining which columns to include, key alignment, and normalization rules for rvl, compare, and shape.

cli configuration csv data data-quality open-source ops rust tooling

Last synced: 07 Mar 2026

https://github.com/os-climate/rmi-utility-transition-hub-ingestion-pipeline

Data ingest for RMI's Utility Transition Hub data (as of March 7, 2022)

data emissions-co2 energy-data os-climate

Last synced: 12 Apr 2025

https://github.com/priyanshubiswas-tech/farmlab-report-and-case-study-iot

This project was developed through live interviews and case studies with farmers in the year 2023 to address key agricultural challenges. The device provides real-time farm insights for better decision-making. Future plans include a digital portal, increased range, more sensors, and improved design. Open to collaboration!

arduino-ide c case case-study data data-analysis iot iot-device serialization

Last synced: 15 Jul 2025

https://github.com/germanpaul12/flights-data-sky-scraper-api

Sky Scraper - Python app for searching flight information using the Sky Scrapper API.

data flights flights-api scraping

Last synced: 15 Jul 2025

https://github.com/birjemin/wxgameod

wxgame 开放数据 weixin 微信小游戏 关系链数据

data interactive-data relation user-storage

Last synced: 16 Jul 2025

https://github.com/shoaib1522/database-systems

📚💾 Master the fundamentals of database systems with this all-in-one lab repository, featuring ERD design diagrams 🧠🗺️, Oracle SQL 🌐📝, relational schema practice, and complete PowerPoint lectures 🖥️📑. Perfect for revision, exams, or quick reference! 💡📘

data database database-management databases databases-course db dbms-project erd notes oracle oracle-database sql

Last synced: 21 Aug 2025

https://github.com/youmenomi/hydreigon

Are you looking for a Hydreigon to classify data for you? Come and catch it!

classify data hydreigon indexer items management pokemon sortable structure typescript

Last synced: 07 May 2025

https://github.com/flowsynx/plugin-sqlite

FlowSynx plugin to enables data access and manipulation on SQLite databases.

data database flowsynx sql sqlite

Last synced: 08 May 2026

https://github.com/andygeiss/pipeline-example

This is a basic example of using a pipeline in data science.

data data-pipeline data-science example go golang iris-dataset pipeline protobuf

Last synced: 17 Jul 2025

https://github.com/potlock/data

data research for other funding mechanisms and PotLock related data.

data flipsidecrypto near-protocol potlock

Last synced: 07 Mar 2026

https://github.com/srindot/average_flightdata_collection_fwuav

This repository is designed for collecting average data for a flapping wing UAV. The script acg_coeff_data_collection.py runs the necessary data collection, and the resulting data is saved into a CSV file called AverageFlightData.csv.

data flaping-uav

Last synced: 18 Sep 2025

https://github.com/dimitryzub/allrecipes-us-recipes-by-state-analysis

Personal Data Exploratory Project in Python. Data extracted from AllRecipes.

data data-visualization dataexploration dataextraction matplotlib pandas python seaborn webscraping

Last synced: 10 May 2026

https://github.com/merekat/hb-passiv-income

Ein Rechner, der basierend auf historischen Daten unterschiedlicher Assets kalkuliert, welches voraussichtliche passive Einkommen der User abhängig von seinen Eingaben zu erwarten hat.

assets data datajournalism etf passive-income treasury

Last synced: 19 Jul 2025

https://github.com/jcloh98/rental-property-finder

A web scraper that helps users find rental properties by automatically gathering and organizing listings from various websites to discover available homes and apartments.

data headless-browser node scraper scraping web

Last synced: 17 May 2026

https://github.com/joseluisq/input-verifier

Some useful functions to check common data input.

data input utils validation

Last synced: 19 Jul 2025

https://github.com/deliprofesor/cardiac-data-analysis-exploring-cholesterol-and-heart-rate

This project analyzes a heart disease dataset to explore the relationship between cholesterol, heart rate, and chest pain type. It includes normality tests, outlier detection, correlation analysis, MANOVA, post-hoc tests, and VIF analysis, with visualizations using histograms, heatmaps, and boxplots.

correlation-analysis data data-cleaning data-visualization machine-learning manova post-hoc-analysis python tukey-hsd vif

Last synced: 17 May 2026

https://github.com/zshn1248/pyfilecrypto

PyFileCrypto is a Python module for easy encryption and decryption of files using the cryptography library. It provides a simple interface to generate encryption keys, encrypt files, and decrypt files securely.

data decryption encryption file security-tools

Last synced: 07 Apr 2026

https://github.com/ashishsingh789/hr_analysis_dashboard

The HR Analyst Dashboard is an interactive Power BI tool that provides insights into HR metrics sourced from Excel. It focuses on data cleaning, transformation, and visualization, enabling stakeholders to explore key indicators like employee demographics and performance through intuitive charts.

dashboard data dataanalysis datacleaning powerbi-desktop visualization

Last synced: 06 Mar 2026

https://github.com/sharoonjoseph321/social_media_eda

Data Analysis on social media apps ,using pandas, python, matplotlib.

data data-analysis data-science data-visualization matplotlib programming-language project python pythonprojects

Last synced: 03 Mar 2025

https://github.com/halyusa16/basic-sql-employee-analysis

This project focuses on analyzing employee data through querying, performing table joins to connect related information, aggregating salary statistics, and using subqueries to extract meaningful insights.

data data-analytics data-exploration database mysql self-project sql

Last synced: 16 May 2026

https://github.com/sajjadanwar0/booking.com-scraping

Scraping booking.com using Selenium and Beautiful Soup

crawler data python scraping selenium

Last synced: 18 Oct 2025

https://github.com/vaibhavmojidra/data-structures---hashtable-using-array-and-linked-list-in-java

Hash Table is a data structure which stores data in an associative manner. In a hash table, data is stored in an array format, where each data value has its own unique index value. Access of data becomes very fast if we know the index of the desired data. Thus, it becomes a data structure in which insertion and search operations are very fast irrespective of the size of the data. Hash Table uses an array as a storage medium and uses hash technique to generate an index where an element is to be inserted or is to be located from.

arrays data data-structures hashing java linked-list mojidra vaibhav vaibhav-mojidra vaibhavmojidra

Last synced: 12 Apr 2025

https://github.com/UznetDev/Smoking-Prediction

This project focuses on analyzing the "Smoking" dataset and building a predictive model for smoking status based on various health metrics. The goal is to identify factors influencing smoking behavior and develop a reliable model for prediction.

ai classification data data-science kaggle-competition machine-learning ml roc-auc sklearn smoking

Last synced: 28 Mar 2025

https://github.com/namescode/hub_harvester

A python script to gather data on a user or organisations git repos

data github nix nix-flake python python3 sqlite

Last synced: 08 Apr 2026

https://github.com/webobite/fact-chatbot

A Fact chatbot is a project in which it read a txt file which consist all facts ahead of time and answer the user with some useful information regarding the same on the basis of facts provided in text file.

chatbot chatgpt chatgpt3 data data-visualization embedding-vectors generativeai nlp

Last synced: 04 May 2026

https://github.com/chompfoods/sdk-scala

Scala SDK for the Chomp Food & Recipe Database API. Use our API to get high-quality data on recipes and 875,000+ branded/grocery foods plus raw ingredients.

api branded chomp data database food grocery ingredients nutrition raw recipe-api recipes scala sdk

Last synced: 17 May 2026

https://github.com/reubano/pyconza-tutorial

Jupyter notebooks and data for "Data Mining and Processing for fun and profit" PyConZA16 tutorial

data functional-programming jupyter-notebook meza pycon python tutorial

Last synced: 17 May 2026

https://github.com/sibeux/redesigned-broccoli

Repositori untuk menyimpan data file musik

data data-center nasrulwahabi sibeux

Last synced: 24 Jan 2026

https://github.com/sumansuhag/prediction_model

This repository features a collection of Jupyter notebooks designed to showcase the practical applications of machine learning, data preprocessing, feature engineering, and recommendation systems. These notebooks enable users to explore, analyze, and predict business events.

algotithms artificial-intelligence data logistic-regression machine-learning-algorithms science sckiit-learn

Last synced: 28 Mar 2025

https://github.com/sumansuhag/wasserstoff-aiinterntask

Welcome to the AI Pipeline for Image Segmentation and Object Analysis project – a state-of-the-art solution designed to process, segment, identify, and analyze objects within images. This AI-powered pipeline is engineered to deliver precise insights by extracting, mapping, and summarizing data from each segmented object.

artificial-intelligence cdn data data-science modeling pipline

Last synced: 28 Mar 2025

https://github.com/1sumer/mass-mail-automation

Mass Emailer is a Python-based application designed to send bulk emails efficiently using an SMTP server. Leveraging the power of the Tkinter library for the graphical user interface (GUI), this tool provides a user-friendly platform for managing and dispatching large volumes of emails with ease.

data oops-in-python python smtp-server tkinter

Last synced: 20 Aug 2025