An open API service indexing awesome lists of open source software.

data

Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)

https://github.com/nimomach/amazon-sales-data

This is a small dataset containing Amazon sales data analysis for a few regions.

dashboards data data-analysis data-visualization

Last synced: 08 Mar 2026

https://github.com/biril/audio-test-data

Audio data to use for testing

audio data mpeg test

Last synced: 11 Jan 2026

https://github.com/kylepw/multistack

Example of multiple stacks in one array.

algorithms array data data-structures python stack

Last synced: 17 Mar 2025

https://github.com/namescode/hub_harvester

A Python script to gather data on a user's or organisation's Git repos

data github nix nix-flake python python3 sqlite

Last synced: 08 Apr 2026

https://github.com/UznetDev/Smoking-Prediction

This project focuses on analyzing the "Smoking" dataset and building a predictive model for smoking status based on various health metrics. The goal is to identify factors influencing smoking behavior and develop a reliable model for prediction.

ai classification data data-science kaggle-competition machine-learning ml roc-auc sklearn smoking

Last synced: 28 Mar 2025

https://github.com/sajjadanwar0/booking.com-scraping

Scraping booking.com using Selenium and Beautiful Soup

crawler data python scraping selenium

Last synced: 18 Oct 2025

https://github.com/samharrison7/datamapper

Making mapping between datasets as simple as possible.

data data-mapper data-mapping data-science data-structures

Last synced: 17 Mar 2025

https://github.com/sharoonjoseph321/social_media_eda

Data analysis of social media apps using pandas, Python, and matplotlib.

data data-analysis data-science data-visualization matplotlib programming-language project python pythonprojects

Last synced: 03 Mar 2025

https://github.com/zshn1248/pyfilecrypto

PyFileCrypto is a Python module for easy encryption and decryption of files using the cryptography library. It provides a simple interface to generate encryption keys, encrypt files, and decrypt files securely.

data decryption encryption file security-tools

Last synced: 07 Apr 2026

https://github.com/devbigboy/iti-database

This course covers the following topics: joins, normalization, aggregate functions, GROUP BY, ORDER BY, SELECT, ranking functions, and built-in functions

analytics data data-analytics mssql-database sql sql-server

Last synced: 03 Nov 2025

https://github.com/gabboraron/datacamp_projects

Here you can find my DataCamp Projects

data datacamp datacamp-projects

Last synced: 14 Jun 2026

https://github.com/wciesialka/top-names

A Python module for scraping the list of top first names in the United States.

data python python3

Last synced: 08 Jun 2026

https://github.com/fridex/real-estate

My machine learning in real estate

data machine-learning real-estate

Last synced: 27 Jun 2025

https://github.com/deliprofesor/cardiac-data-analysis-exploring-cholesterol-and-heart-rate

This project analyzes a heart disease dataset to explore the relationship between cholesterol, heart rate, and chest pain type. It includes normality tests, outlier detection, correlation analysis, MANOVA, post-hoc tests, and VIF analysis, with visualizations using histograms, heatmaps, and boxplots.

correlation-analysis data data-cleaning data-visualization machine-learning manova post-hoc-analysis python tukey-hsd vif

Last synced: 17 May 2026

https://github.com/vaibhavmojidra/data-structures---hashtable-using-array-and-linked-list-in-java

A hash table is a data structure that stores data in an associative manner. Data is stored in an array, and each key is mapped to an index in that array. Access is very fast when the index of the desired data is known, so insertion and search are fast on average regardless of the size of the data. A hash table uses an array as its storage medium and a hash function to generate the index at which an element is inserted or located.

arrays data data-structures hashing java linked-list mojidra vaibhav vaibhav-mojidra vaibhavmojidra

Last synced: 12 Apr 2025
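
The hash table description in the entry above can be illustrated with a minimal sketch. This is not code from the listed repository (which is written in Java); it is a Python illustration that assumes separate chaining, where each array slot holds a linked list of entries. The names `Node` and `ChainedHashTable` are hypothetical.

```python
from dataclasses import dataclass
from typing import Any, Optional


@dataclass
class Node:
    """One entry in a bucket's singly linked list (hypothetical helper)."""
    key: Any
    value: Any
    next: Optional["Node"] = None


class ChainedHashTable:
    """Hash table backed by a fixed-size array of linked-list buckets."""

    def __init__(self, capacity: int = 8) -> None:
        # The array is the storage medium; each slot starts as an empty chain.
        self.buckets = [None] * capacity

    def _index(self, key: Any) -> int:
        # The hash function turns a key into an array index.
        return hash(key) % len(self.buckets)

    def put(self, key: Any, value: Any) -> None:
        i = self._index(key)
        node = self.buckets[i]
        # Walk the chain: overwrite the value if the key is already present.
        while node is not None:
            if node.key == key:
                node.value = value
                return
            node = node.next
        # Otherwise prepend a new node to the chain in this slot.
        self.buckets[i] = Node(key, value, self.buckets[i])

    def get(self, key: Any, default: Any = None) -> Any:
        node = self.buckets[self._index(key)]
        # Only the one chain selected by the hash is searched.
        while node is not None:
            if node.key == key:
                return node.value
            node = node.next
        return default


# Usage: a tiny capacity forces collisions, which the chains absorb.
table = ChainedHashTable(capacity=2)
for number, word in enumerate(["alpha", "beta", "gamma"]):
    table.put(word, number)
table.put("alpha", 10)  # updates the existing entry
```

Lookups stay fast only while the chains stay short, so a production table would also resize the array as it fills; that step is omitted here to keep the sketch small.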

https://github.com/jeugregg/deeplearningpicturedogs

Classify dog pictures with deep-learning CNN neural networks

classez-des-images cnn-keras data data-science ipynb neural-network vision

Last synced: 24 Jul 2025

https://github.com/radekbednarik/att

Python wrapper for calling Apitalks API.

api-wrapper apitalks data python3 rest-api wrapper

Last synced: 05 Apr 2025

https://github.com/webobite/fact-chatbot

Fact chatbot is a project that reads a text file containing facts ahead of time and answers the user with useful information based on the facts provided in that file.

chatbot chatgpt chatgpt3 data data-visualization embedding-vectors generativeai nlp

Last synced: 04 May 2026

https://github.com/csmith0651/ormy

A simple python ORM.

data database python

Last synced: 13 May 2026

https://github.com/alexdonh/adonis-cache

Another cache provider for AdonisJs. Supports Object, File, Db and Redis cache. With cache dependencies!

adonis-framework adonisjs cache data dependency redis storing

Last synced: 15 May 2026

https://github.com/chrisrobertsjr/chrisrobertsjr

Welcome to my Github Profile!

data data-analysis java r sql statistics

Last synced: 03 May 2026

https://github.com/iliyasalve/cyclistic_case_study

Analysis of the Bike-Sharing System for the following question: "How do annual members and casual riders use Cyclistic bikes differently?"

bike-sharing data data-analysis data-visualisation r

Last synced: 06 Apr 2025

https://github.com/mx51/data-dictionary-action

GitHub Action for generating and checking freshness of data dictionaries

action analytics data

Last synced: 17 Jan 2026

https://github.com/joseluisq/input-verifier

Some useful functions to check common data input.

data input utils validation

Last synced: 19 Jul 2025

https://github.com/mightymetrika/mmirestriktor

Informative Hypothesis Testing Web Applications

data hypothesis infomative power r restriktor statistics testing

Last synced: 17 Mar 2025

https://github.com/antononcube/raku-data-typesystem

Data type system for different data structures.

data data-structures rakulang type-system

Last synced: 09 Jul 2025

https://github.com/sambhav/fb-insights

A tool to analyze your Facebook data dumps and generate insights

analytics data facebook graphs insights language learning machine natural personal processing

Last synced: 17 Mar 2025

https://github.com/jcloh98/rental-property-finder

A web scraper that helps users find rental properties by automatically gathering and organizing listings from various websites to discover available homes and apartments.

data headless-browser node scraper scraping web

Last synced: 17 May 2026

https://github.com/alexis-gss/games-data

Games Data is a library of information about all games, built with NuxtJs

css3 data games nuxtjs tailwindcss typescript vuejs

Last synced: 13 Mar 2025

https://github.com/afeiship/data-selection

Data structure for radio/checkbox-group.

checkbox data group radio

Last synced: 17 Jun 2025

https://github.com/noraui/noraui-datas-webservices

noraui-datas-webservices is a RESTdataProvider for NoraUi

data noraui rest-api service spring-boot-2 spring-boot-actuator

Last synced: 17 Mar 2025

https://github.com/peternaydenov/data-pool

Data layer for node apps and single page applications

cache data store

Last synced: 29 Apr 2025

https://github.com/umstek/sampler

Generate elaborate random data instantly.

data faker javascript json sample

Last synced: 20 Jul 2025

https://github.com/merekat/hb-passiv-income

A calculator that, based on historical data for various assets, estimates the passive income a user can expect depending on their inputs.

assets data datajournalism etf passive-income treasury

Last synced: 19 Jul 2025

https://github.com/4ment/aiv-rate-heterogeneity

Avian influenza virus data sets

data influenza

Last synced: 24 Jan 2026

https://github.com/sibeux/redesigned-broccoli

Repository for storing music file data

data data-center nasrulwahabi sibeux

Last synced: 24 Jan 2026

https://github.com/thesfinox/sql-simple-backup

Simple script to backup data in a MySQL database and store it in a WebDAV server.

backup bash data mysql script sql webdav

Last synced: 18 Apr 2026

https://github.com/purarue/blizzard_gdpr_parser

Parses date-related information from my blizzard GDPR export.

blizzard data gdpr webscraping

Last synced: 06 Apr 2025

https://github.com/purarue/hpi-personal

Personal HPI modules/scripts

data history lifelogging

Last synced: 06 Apr 2025

https://github.com/1sumer/mass-mail-automation

Mass Emailer is a Python-based application designed to send bulk emails efficiently using an SMTP server. Leveraging the power of the Tkinter library for the graphical user interface (GUI), this tool provides a user-friendly platform for managing and dispatching large volumes of emails with ease.

data oops-in-python python smtp-server tkinter

Last synced: 20 Aug 2025

https://github.com/goutam1511/real-time-covid-19-tracker-for-slack

This automated tracker follows the spread of Covid-19 in real time by scraping data from the Ministry of Health and Family Welfare and posts notifications to Slack

covid-19 data python slack-bot web-scraping

Last synced: 30 Aug 2025

https://github.com/renebentes/2808

Course 2808 - Entity Framework Fundamentals

course csharp data ef-core

Last synced: 27 Jun 2025

https://github.com/lakshyakumar266/jee-dpp-manager-app

DPP manager app for students preparing for JEE

data expo javascript management react-native

Last synced: 07 May 2026

https://github.com/lu-sketch/chocolate-imports-dataset

Chocolate Imports for South Africa

data eda visualization

Last synced: 18 May 2026

https://github.com/mai-space/design-concept-sharing-recipes

🖼️ Concept for a framework based on state-of-the-art technology and libraries for secure data sharing and online collaboration, with a focus on the UX and UI of said framework

concept content-map data datasharing framework hci mci mock-up navigation-map peer-to-peer screendesign userstories

Last synced: 14 May 2025

https://github.com/agusk/ilmudata-book-excel-analytics

Hallo Microsoft Excel: Mastering Data Analytics

analytics data data-analytics excel power-query-editor

Last synced: 06 Jan 2026

https://github.com/jph5396/sumomodel

Data models related to sumo wrestling.

data go sumo

Last synced: 17 Jan 2026

https://github.com/dimitryzub/allrecipes-us-recipes-by-state-analysis

Personal Data Exploratory Project in Python. Data extracted from AllRecipes.

data data-visualization dataexploration dataextraction matplotlib pandas python seaborn webscraping

Last synced: 10 May 2026

https://github.com/gagolews/clustering-data-v0

Datasets for Clustering [DEPRECATED – A NEW VERSION IS AVAILABLE]

clustering data dataset machine-learning

Last synced: 15 Sep 2025

https://github.com/citizenlabsgr/data.world

Work with data sets prior to uploading to data.world

data data-structures

Last synced: 26 Mar 2025

https://github.com/potlock/data

Data research on other funding mechanisms and PotLock-related data.

data flipsidecrypto near-protocol potlock

Last synced: 07 Mar 2026

https://github.com/miss-mhv/data-analysis-for-social-buzz

In this work, we focus on a small dataset extracted from a large enterprise dataset on social buzz.

data jupyter-notebook python

Last synced: 14 May 2026

https://github.com/canadaluke888/terminaltablebuilder

Build and edit tabular data all from the terminal.

cli data data-manipulation excel json ods rich spreadsheets sqlite3 tables

Last synced: 20 Apr 2026

https://github.com/luminati-io/linkedin-dataset-samples

Sample dataset of 1001 LinkedIn companies, extracted via Bright Data API, featuring essential data points for competitive analysis and market insights.

data database dataset linkedin linkedin-api linkedin-data linkedin-dataset linkedin-scraper sample web-scraping

Last synced: 17 Mar 2025

https://github.com/ramonmeza/mysteamstats

Visualize your stats from your favorite games on Steam!

data statistics steam steam-api videogame visualization

Last synced: 17 Mar 2025

https://github.com/nanis/unitedat

Unify data sets which consist of separate files with a common header repeated in each one.

cli data etl utility

Last synced: 12 Apr 2025

https://github.com/parmsam/rweekly.data

R package containing data on Rweekly posts

data package rweekly

Last synced: 21 May 2026

https://github.com/kwame-mintah/ml-data-copy-to-aws-s3

Automatically copy new data to an AWS S3 bucket for Machine Learning.

aws aws-actions aws-s3 data

Last synced: 14 May 2026

https://github.com/rajlabmssm/echodata

echoverse module: Example data.

data echoverse fine-mapping genomics gwas qtl

Last synced: 17 Jan 2026

https://github.com/jitsasmal/customer-purches-behavior-and-shopping-analysis

A dashboard that analyses the data by total product sales, target, revenue, state, and season to show current trends in the data.

analytics dashboard data etl powerbi

Last synced: 14 Feb 2026

https://github.com/scanthe-net/scanthenet-php

PHP API Data Fetcher.

api data php scan scanner threat

Last synced: 25 Jul 2025

https://github.com/gusgitmath/cnn_braintumor_classification

Built a CNN for MRI brain tumor classification (Glioma, Meningioma, No Tumor, Pituitary) with 99.4% accuracy. Used data augmentation, optimized learning rates (Adam), and included EarlyStopping, ReduceLROnPlateau for superior performance, averting overfitting. Boosts early, accurate diagnosis, advancing medical treatment.

classification convolutional-neural-networks data deep-learning machine-learning

Last synced: 25 Jul 2025

https://github.com/badawy403/egy.list

A Node.js package providing access to official Egyptian data including universities, governorates, cities, and more. This package makes it easy for developers to integrate Egypt-specific information into their applications.

city data egypt javascript nodejs npm package

Last synced: 08 Mar 2026

https://github.com/sam-moen/data-analyst-portfolio

This is a repository that I have created to showcase skills, share projects and track my progress in Data Analytics / Data Science related topics.

data dataanalysis matplotlib mssql pandas powerbi python seaborn sql

Last synced: 08 Mar 2026

https://github.com/erictleung/2018-new-coder-survey

:beginner: Code to wrangle data from the 2018 New Coder Survey by freeCodeCamp

data data-cleaning dataset freecodecamp new-coders-survey programmers

Last synced: 03 Apr 2025

https://github.com/andygeiss/pipeline-example

This is a basic example of using a pipeline in data science.

data data-pipeline data-science example go golang iris-dataset pipeline protobuf

Last synced: 17 Jul 2025

https://github.com/flowsynx/plugin-sqlite

FlowSynx plugin that enables data access and manipulation on SQLite databases.

data database flowsynx sql sqlite

Last synced: 08 May 2026

https://github.com/indhra/cats-ijcnn-data-2004

CATS IJCNN Data 2004 Competition of Artificial Time Series

2004 artificial cats data ijcnn time-series

Last synced: 22 Mar 2025

https://github.com/shoaib1522/database-systems

📚💾 Master the fundamentals of database systems with this all-in-one lab repository, featuring ERD design diagrams 🧠🗺️, Oracle SQL 🌐📝, relational schema practice, and complete PowerPoint lectures 🖥️📑. Perfect for revision, exams, or quick reference! 💡📘

data database database-management databases databases-course db dbms-project erd notes oracle oracle-database sql

Last synced: 21 Aug 2025

https://github.com/basemax/okala-database-crawler

A robust, UTF-8 compliant PHP-based crawler designed to extract structured product data from Okala. This tool efficiently scrapes and saves store information, category slugs, and detailed product listings into organized JSON files. Ideal for data analysis, backup, or integration into other systems.

crawler crawler-php curl data json okala okala-com okalacom php php-crawler scraper

Last synced: 01 May 2026

https://github.com/birjemin/wxgameod

wxgame open data, Weixin (WeChat) mini game, relationship-chain data

data interactive-data relation user-storage

Last synced: 16 Jul 2025

https://github.com/anti-duhring/nfl-qb-stats

Data on all NFL QB starters until 2021

data json nfl qb stats

Last synced: 05 Apr 2025

https://github.com/gsmithun4/expressjs-field-validator

Plugin for validating JSON request, middleware for expressjs

data express-js expressjs json-request middleware nodejs request rest-api validation

Last synced: 06 Mar 2026

https://github.com/ishansurdi/data-visualisation-empowering-business-with-effective-insights

The following tasks are completed for Data Visualization: Empowering Business with Effective Insights on Forage in October 2024. It is important to note that this should not be interpreted as an endorsement.

chart communicating-insights-and-analysis dashboard data data-analysis forage powerbi powerbi-visuals tableau tata tata-group virtual-internship visual visualization

Last synced: 17 Feb 2026

https://github.com/mvuorre/osfdatasette

Harvest, wrangle, and serve preprint data from OSF API with Datasette

data datasette open-science preprints

Last synced: 11 Apr 2025

https://github.com/germanpaul12/flights-data-sky-scraper-api

Sky Scraper - Python app for searching flight information using the Sky Scrapper API.

data flights flights-api scraping

Last synced: 15 Jul 2025

https://github.com/ioboi/obloc-data

Scrape guest counter of O'BLOC 🧗‍♀️

data scraping

Last synced: 04 Nov 2025

https://github.com/priyanshubiswas-tech/farmlab-report-and-case-study-iot

This project was developed through live interviews and case studies with farmers in the year 2023 to address key agricultural challenges. The device provides real-time farm insights for better decision-making. Future plans include a digital portal, increased range, more sensors, and improved design. Open to collaboration!

arduino-ide c case case-study data data-analysis iot iot-device serialization

Last synced: 15 Jul 2025

https://github.com/pooja-manjunatha/nyc_parking_violations_dbt

This project uses dbt to transform NYC parking violations data through a layered architecture: Bronze (raw ingested data), Silver (cleaned and enriched data), and Gold (aggregated tables for analytics). Using DuckDB as the warehouse backend, it ensures data quality with tests and documentation. The project enables reliable analysis of parking violations.

data data-analysis data-engineering dbt duckdb python sql

Last synced: 14 May 2026