An open API service indexing awesome lists of open source software.

data

Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)

https://github.com/aravind-selvam/bikeshare-company-analysis

Capstone project for the Google Data Analytics Professional Certificate program, analyzing a bike-sharing company

analytics business-analytics business-intelligence data data-analysis data-visualization dataanalytics google-data-analytics postgresql sql sql-server

Last synced: 22 Apr 2026

https://github.com/andygol/osm-diff-state

CLI tool to search OSM diff state files

custom data openstreetmap planet replication

Last synced: 24 Apr 2026

https://github.com/chriseaton/sample-database

A long-term supported sample dataset for file and database unit testing and validation. Simple, straightforward, raw data shared across formats.

data database examples flat-file samples schema unit-testing

Last synced: 25 Apr 2026

https://github.com/ahmad-ali-rafique/pyviznotebook

PyVizNotebook is a collection of Matplotlib visualizations demonstrating a wide range of plot types and techniques for data visualization. Whether you're a beginner looking to learn or an experienced developer seeking inspiration, this repository offers a diverse set of examples to explore.

analytics colab-notebook data data-science data-visualization dataanalytics matplotlib-python plots seaborn-python visualization

Last synced: 06 Jun 2026

https://github.com/sap-samples/security-research-codegraphsmote

Data augmentation strategy that can be applied to code graphs for learning-based vulnerability discovery.

augmentation data detection learning machine research sample security vulnerability

Last synced: 07 Jun 2026

https://github.com/iamlucianojr/laravel-api-query-handler

🔦 This Laravel package helps to handle a query request properly

api collection data eloquent handler l5x laravel query

Last synced: 28 Apr 2026

https://github.com/rdjarbeng/rdjarbeng

Richard Djarbeng's GitHub profile: computer engineer specializing in web development, machine learning, and IoT devices. New web posts have moved to the website below.

data jekyll machine-learning ruby website

Last synced: 28 Apr 2026

https://github.com/jackosheadev/databasetechproject

This is a repo for a database project that involves creating tables, populating them, viewing data with SELECT statements, and finally simulating a transaction.

data database mssql sql

Last synced: 18 May 2026

https://github.com/wu-rymd/pyobjectify

Bridging the gap between different file formats and streamlining access to ingested data via Python objects

data objects python3

Last synced: 08 Jun 2026

https://github.com/scarblase/salary-comparison

Submission for the DataCamp Salary Competition (1 level). 🏆

data data-analysis data-science data-visualization engineering python sql structured-data

Last synced: 01 May 2026

https://github.com/leomsgit/extrator-de-parametros-analise-hemograma-e-bioquimico

Python software that scans PDF files and extracts parameters directly into an Excel file

analysis data excel excel-export google-colab hemogram jupyter-notebook pdf pdf-document-processor pdf-viewer python python3

Last synced: 01 May 2026

https://github.com/liuliqiang/laueagle

YAML/JSON Lints and Converters

converter data formater json linter python serialization yaml

Last synced: 02 May 2026

https://github.com/rbruinier/mysqlbulkimportbenchmark

Benchmarking several methods of importing large data sets into MySQL tables

benchmark data database mysql php

Last synced: 02 May 2026

https://github.com/ahmad-ali-rafique/handwritten-digit-recognition-mnist

This project demonstrates a complete pipeline for recognizing handwritten digits using the MNIST dataset. The project is implemented in Python using Jupyter Notebook, and it covers data loading, preprocessing, model training, and performance evaluation of a Fully Connected Neural Network (FCNN).

ai artificial-intelligence data data-analysis datascience deep-learning deep-neural-networks fcnn fully-connected-network machine-learning machine-learning-algorithms ml modeling

Last synced: 09 Jun 2026

https://github.com/thenoim/youtubelibrary

Nils' little YouTube library :)

api browser data nodejs simple youtube

Last synced: 04 May 2026

https://github.com/montanaz0r/imdb-ratings-auto-inserter

A Python script that automatically inserts movie ratings into an IMDb profile.

data data-science dataanalysis imdb movies pandas pandas-dataframe python3 selenium selenium-webdriver webscraping

Last synced: 07 May 2026

https://github.com/favarettorm/bd_universidade

BD_UNIVERSIDADE V01 - Fictional university database for teaching purposes

data database dataset mariadb mariadb-database mariadb-mysql mysql mysql-database scripts sql university

Last synced: 08 May 2026

https://github.com/themuhd/world-cup-analysis

Analysis of the FIFA World Cup from its inception to the recently completed tournament in 2023

data data-science data-visualization dataanalysis matplotlib matplotlib-pyplot notebook python

Last synced: 08 May 2026

https://github.com/raynardj/r_notes

Notebooks for learning R

data docker guru99 jupyter learning r

Last synced: 09 May 2026

https://github.com/kouisamine/data-uri-to-image

Convert a Data URI into image (PNG, JPEG, WebP, GIF, SVG, ...) files.

conversion convert converter data datauri datauri-to-image image js online php script source-code tools uri

Last synced: 10 May 2026

https://github.com/pferreirafabricio/data-immersion

🏊🏻‍♂️ Activities and exercises from 'Imersão Dados' event

data data-analysis data-science dataset jupiter-notebook python

Last synced: 14 May 2026

https://github.com/iotchulindrarai/reactlearning

Learning React: passing data with useState and props, both from child to parent and from parent to child

data passing props react usestate-hook

Last synced: 14 May 2026

https://github.com/erwan-simon/aws-serverless-notebook-platform

A self-hosted, serverless platform offering an intuitive UI to manage, schedule, and execute Jupyter notebooks on AWS.

aws data docker notebook python serverless terraform webapp

Last synced: 13 Jun 2026

https://github.com/gsmith257-cyber/bit3434cve

BIT 3434 project on data mining CVEs and exploits

cve data data-mining exploits research-project

Last synced: 17 Jun 2026

https://github.com/DOSM-GitHub/2022-UN-Big-Data-Hackathon

Repository: 2022 UN Big Data Hackathon for DOSM Team

big data food malaysia security trade

Last synced: 18 Jun 2026

https://github.com/CentralFloridaAttorney/ComfyUI-ZMongo

An easy-to-use database framework and parameter library for ComfyUI. Centralize node presets, capture workflow logic, manage structured image collections, and build document-driven text automation pipelines on an offline Local File Store or BusinessProcessApplications.com.

api comfy comfy-ui comfyui comfyui-custom-node comfyui-custom-nodes comfyui-manager comfyui-node comfyui-nodes comfyui-workflow data database

Last synced: 21 Jun 2026

https://github.com/matusf/glasgow_wifi

Script that plots Wi-Fi access points on a map and labels them by their protection

data data-visualization folium python python3

Last synced: 24 Jun 2026

https://github.com/codeforafrica/ckanext-followy

[ARCHIVED] A CKAN extension to show the datasets a user is following.

ckan ckan-extension ckanext-followy data dataset followy-extension open-data

Last synced: 29 Jun 2026

https://github.com/ccworld1000/cccomposition

CCComposition for code style. Accepts code-style conversion work.

cccomposition composit construction data structure visual

Last synced: 04 Jan 2026

https://github.com/ahmad-ali-rafique/decision-tree-regressor-modeling

Comprehensive exploration of decision tree regressors, including data cleaning, model building, and performance evaluation on various datasets.

artificial-intelligence data data-analysis dataanalytics decision-trees decisiontreeregressor modeling models regression-models

Last synced: 17 Apr 2026

https://github.com/bastianolea/sinim_municipal_genero

Commune-level gender data from the National Municipal Information System (Sistema Nacional de Información Municipal)

chile comunas data genero laboral tiempo

Last synced: 23 Jun 2026

https://github.com/dhruvil-26/tableau-projects

This repository contains Tableau visualization projects focused on data analysis across different domains. Projects include: 1. IPL Visualization - insights into IPL match, team, and player statistics. 2. EV Analysis - visualizations exploring the adoption of electric vehicles. 3. Road Accident Analysis - analysis of road accident patterns.

analysis data data-analysis data-analytics electric-vehicles ipl road-accident-analysis tableau tableau-public

Last synced: 19 Jan 2026

https://github.com/sharoonjoseph321/insurance_fraud_detection

Fraud detection using the k-nearest neighbors (KNN) machine learning algorithm. Data exploration using PySpark and Matplotlib.

analytics data data-science eda high-performance knn-algorithm knn-classification machine-learning matplotlib-pyplot pyspark python seaborn spark statistics

Last synced: 23 Mar 2025

https://github.com/zurd46/zurdsynthdatagen

This Electron project uses the OpenAI ChatCompletion API to generate synthetic datasets in either German (DE) or English (EN).

data data-structures dataset electron json jsonl nodejs openai synthetic

Last synced: 04 Apr 2026

https://github.com/laguer/jupyterdatascienceworkflow

Jupyter Notebook dedicated to studying Agriculture and AMI analytics

agriculture amis corn data fao jupyter maize oecd rice science soja

Last synced: 11 Oct 2025

https://github.com/nanis/unitedat

Unify data sets which consist of separate files with a common header repeated in each one.

cli data etl utility

Last synced: 12 Apr 2025

https://github.com/mnkanout/patients_medication_prediction

The aim of the project is to create a model that can help medical professionals select the proper medication for patients based on their symptoms. The model uses historical data of other patients to predict what could be the most suitable medication based on the patient's symptoms.

data data-analysis data-science data-visualization decision-tree-classifier machine-learning python3

Last synced: 29 Jun 2025

https://github.com/sidneyarcidiacono/data-parser

A node module designed to make reading in large files as easy as calling one function.

data javascript node npm

Last synced: 05 May 2026

https://github.com/sebhoss/countries-and-cities

Dolt database of countries and their cities

cities countries data database dolt

Last synced: 11 Oct 2025

https://github.com/mvuorre/osfdatasette

Harvest, wrangle, and serve preprint data from the OSF API with Datasette

data datasette open-science preprints

Last synced: 11 Apr 2025

https://github.com/srgchrksv/articles

My articles about coding, data, etc.

article coding data learning medium python

Last synced: 18 Jun 2026

https://github.com/cmda-tt/course-25-26

🎓 tech track · 2025-2026 · curriculum and syllabus 📊

d3 data datavis functional javascript programming research svelte visualization

Last synced: 20 Jan 2026

https://github.com/equinor/sumo-wrapper-python

Thin Python wrapper for interacting with the Sumo API

analytics data fmu python subsurface sumo

Last synced: 19 Jan 2026

https://github.com/soenneker/soenneker.dtos.idpartitionpair

A minimal Record type with an Id (string), PartitionKey (string), and maximum JSON compatibility

csharp data dotnet dto id key partition

Last synced: 09 Mar 2026

https://github.com/prakashjha1/loan-eligibility-prediction

This repository contains the codebase and resources for a machine learning-based project aimed at predicting loan eligibility for individuals. The project utilizes various algorithms and data preprocessing techniques to build predictive models that assess the likelihood of an applicant being eligible for a loan based on historical data.

data data-visualization exploratory-data-analysis loan-prediction-analysis machine-learning-algorithms naive-bayes-classification parameter-tuning python random-forest

Last synced: 19 Apr 2026

https://github.com/justinyahin/wpdf

Create, filter, sort, and display user data on your WordPress site.

data filtering wordpress

Last synced: 18 Apr 2026

https://github.com/priyapuranik/data-analytics-using_python

Analysis of hotel data to find meaningful insights, including booking patterns, seasonal trends, and more.

data pandas python sql visualization

Last synced: 06 Apr 2026

https://github.com/thanhleviet/vietnam_antibiotics_bidding

This repo contains bidding data for multiple drugs and antibiotics reported to the Vietnam Ministry of Health in 2015, 2016, and 2017.

antibiotics data vietnam

Last synced: 23 Feb 2026

https://github.com/ttozatto/sparkify

Churn prediction for a music streaming app with PySpark

analysis churn data learning machine predictive pyspark science spark

Last synced: 16 Jan 2026

https://github.com/elimu-ai/analytics

📊 Android application which collects, provides and uploads learning event data

csv data data-science dataset edtech egma egra infrastructural learning-analytics

Last synced: 12 Oct 2025

https://github.com/madhuresh2011/daily-sql-from-hackerrank

Welcome to my SQL Series, where I tackle SQL problems from HackerRank on a daily basis.

data dataanalysis database question-answering sql

Last synced: 19 Jan 2026

https://github.com/team-hydrogen/2025-adc-data

All files relating to the computation of the data provided

data jupyter-notebook nasa-app-development-challenge

Last synced: 11 Apr 2025

https://github.com/lancewalk87/cls-cloud-sync-ruby-on-rails

Software | SQL database with automated cloud sync for mitigating data loss across distributed servers. Managed by Ruby on Rails.

cloud-computing cloud-storage data database ruby ruby-application ruby-on-rails server sql

Last synced: 24 Jul 2025

https://github.com/agdturner/ccg-data

A modularised Java library for processing data sets with classes for: data records; collections of data records; and identifiers.

data data-analysis

Last synced: 12 Jan 2026

https://github.com/0xnu/nfl-picks

NFL match prediction with scores using historical data (1999-Present).

american-football data nfl prediction

Last synced: 12 Oct 2025

https://github.com/bilgehangecici/datatypeconverter

Converting integer and floating-point numbers to their bit-level representations.

data datatypeconverter java machine-level variables

Last synced: 30 Mar 2025

https://github.com/drzax/light-up-brisbane

Where, what and why various public places in Brisbane are lit up.

brisbane data git-scraping

Last synced: 19 Jan 2026

https://github.com/basemax/okala-product-ids

A PHP script to fetch and save product IDs from Okala's online store API across multiple categories and store branches.

crawler crawler-okala crawler-php crawlers data database ids ir iran json okala okala-crawler php php-crawler product

Last synced: 09 May 2026

https://github.com/lord3008/instances-of-data-analysis

This repository shows my data analysis work across various projects. I feel data analysis is key to investigating a solution. Furthermore, it points the way toward model building.

data data-analysis

Last synced: 03 Mar 2025

https://github.com/anjaliwork20/moodify

Mood-based music recommendation system that considers a user's emotional state to recommend songs, genres, artists, and playlists using machine learning.

artificial-intelligence cnn-keras cnn-model convolutional-neural-networks data data-analysis data-science data-structures data-visualization database deep-learning machine-learning machine-learning-algorithms python recommended song songs

Last synced: 20 Apr 2026

https://github.com/olekscode/datageneration

Exploring the methods of data generation for different Machine Learning algorithms

data javascript machine-learning

Last synced: 05 Apr 2025

https://github.com/jhpoelen/bees

Content-based iDigBio prototype

biodiversity data ecololgical informatics provenance

Last synced: 18 Mar 2026

https://github.com/roggersanguzu/weather-medical-expense-prediction-ml-models

This repo contains one model for determining rainfall patterns and another for predicting medical expenses.

data data-analysis data-science datasets joblib machine-learning machine-learning-algorithms scikitlearn-machine-learning

Last synced: 30 Aug 2025

https://github.com/dimaa1608/azurecontent

AzureContent is a repository on GitHub containing documentation and resources related to Microsoft Azure services and features. It provides clear and concise information for users seeking guidance on Azure cloud computing solutions.

azure azurecontent cloud computing content data deployment integration management networking platform security service storage virtualization

Last synced: 10 Apr 2025

https://github.com/luminati-io/httpx-web-scraping

Web scraping using HTTPX in Python, covering setup, advanced features, comparisons with Requests, and more.

beautifulsoup data html httpx python web-scraper web-scraping

Last synced: 13 Oct 2025

https://github.com/0xnu/data-analyst-training

The repository contains training materials for data analysts.

data data-analysis data-analyst

Last synced: 25 Aug 2025

https://github.com/franckalbinet/maris-crawlers

Automated data harvesting of MARIS data sources

automation data marine-radioactivity

Last synced: 25 Aug 2025

https://github.com/stdlib-js/array-base-symmetric-banded-filled2d-by

Create a filled two-dimensional symmetric banded nested array according to a provided callback function.

alloc allocate array callback data fill filled foreach generic javascript map matrix multidimensional node node-js nodejs stdlib strided structure types

Last synced: 20 Apr 2026

https://github.com/master-helix/ibm-data-analyst-certification-stock-analysis-project

This is a mini project repository from my IBM certification, involving stock analysis and plotting for Tesla and GameStop.

analytics data data-analysis data-visualization ibm matplotlib pandas python web-scraping

Last synced: 09 May 2026

https://github.com/anuragagarwal96/hospital-mortality-rate-sql-analysis

In this project, I took a hospital dataset from Kaggle, analysed it, and predicted the mortality rate of patients admitted to hospitals. I utilised a combination of SQL, Tableau, and Microsoft Excel for this project.

data data-visualization dataanalysis dataanalysisusingsql excel msexcel mssqlserver sql tableau tableau-public

Last synced: 09 Mar 2026

https://github.com/nxion/sql-data-warehouse-project

Building a modern data warehouse with MS SQL Server, ETL processes, data modeling, and analytics.

data data-analysis data-analytics data-engineering data-lakehouse data-warehouse datalake datascience etl etl-job medallion-architecture ms mssql sql sql-query sql-server

Last synced: 05 Jun 2026

https://github.com/donghquinn/gopandas

gopandas

data go golang

Last synced: 14 Oct 2025

https://github.com/deepanshkhurana/facebook-birthdays

Python script to create a .csv from Facebook's Event Data to list Birthdays.

data facebook python

Last synced: 14 Oct 2025

https://github.com/spajai/etl-sharepoint-data-uploader-pipeline

Custom Python script to pull specific data from a source and upload it to Microsoft SharePoint

data etl etl-pipeline microsoft microsoft365 python3 sharepoint sharepoint-online

Last synced: 11 Nov 2025

https://github.com/wittyicon29/zeotap-ds-assignment

Internship application assignment

data data-science

Last synced: 19 Aug 2025

https://github.com/progati00/marketing-mix-modeling-mmm-for-marketing-budget-optimization

A Marketing Mix Modeling (MMM) project using Python to analyze channel performance, calculate ROI, and simulate marketing budget changes for better business decisions. Includes a trained Linear Regression model, ROI analytics, and a Flask API for revenue prediction.

api budget-optimization data data-analysis data-science ecommerce eda flask jupyter-notebook linear-regression machine-learning marketing-analytics marketing-mix-modeling python roi-analysis vscode

Last synced: 14 Apr 2026

https://github.com/rijkvanzanten/ds-fa-1

The first final assignment for the data structures class

assignment data final map now parsons structures thenewschool

Last synced: 04 Oct 2025

https://github.com/rahulpatel0615/sales-analysis-project

Sales Data Analysis Dashboard with Python, Pandas, and Matplotlib. Features 12+ visualizations and comprehensive insights.

data data-analysis data-visualization matplotlib pandas portfolio python

Last synced: 21 Apr 2026

https://github.com/the-universal-linux-society/sysreport

Bash script to give you a full system report. Just by running the script it offers insight into CPU data, disk space, temperature readings, network configuration, MAC addresses, firewall status, and system logs for error analysis.

analysis bash bash-script bash-scripting data report reporting system

Last synced: 15 May 2026

https://github.com/isandyawan/simplelinearregression

An application for analyzing data using simple linear regression. It can build a regression model from variables and advise the user if the model violates regression assumptions.

data linear r regression rstudio shiny statistic

Last synced: 14 Oct 2025

https://github.com/seqeralabs/ffq-api

A minimal wrapper to make ffq searches available via a REST API.

api data fastq fetch-fastq ffq genomics

Last synced: 15 Aug 2025