An open API service indexing awesome lists of open source software.

data

Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)

https://github.com/henryssondaniel/teacup-java-report-mysql

Report Teacup data to a MySQL database

data logs mysql reports teacup

Last synced: 20 Apr 2026

https://github.com/montanaz0r/suicide-rate-analysis

Testing the significance of the correlation between the suicide rate and the number of psychiatrists and psychologists working in the mental health sector

analysis correlation data data-analysis data-science jupyter-notebook jupyter-notebooks matplotlib numpy pandas psychology python python-3 seaborn statistics suicide-rate

Last synced: 20 Apr 2026

https://github.com/omers/sre-devops-tools

Tools and useful sources for SRE and DevOps

awsome awsome-list data devops monitoring sre tools

Last synced: 20 Apr 2026

https://github.com/arda-guler/binmotion

Convert ANY data to a video file. Sister project of binGallery.

data data-visualization proof-of-concept video

Last synced: 04 Jun 2026

https://github.com/rick-does/json-razor

Reduces JSON, YAML, and NDJSON volume by collapsing repeated structures while preserving the schema, making the schema easier for you to read.

cli data devtools json logs ndjson schema yaml

Last synced: 20 Apr 2026

https://github.com/prashhhant213/data_analysis_and_visualization-_for_streaming_platform

Data analysis and visualization for a streaming platform, providing insights and recommendations to grow its user base.

colab-notebook data datavisualization matplotlib numpy pandas python seaborn

Last synced: 20 Apr 2026

https://github.com/petermeissner/suuntor

Data from a Suunto watch extracted by R - !because!

automation data r rstats suunto windows

Last synced: 20 Apr 2026

https://github.com/nxion/sql-data-warehouse-project

Building a modern data warehouse with MS SQL Server, ETL processes, data modeling and analytics.

data data-analysis data-analytics data-engineering data-lakehouse data-warehouse datalake datascience etl etl-job medallion-architecture ms mssql sql sql-query sql-server

Last synced: 05 Jun 2026

https://github.com/fastpix/android-data-kaltura

This SDK enables seamless integration with Kaltura Player, offering advanced video analytics via the FastPix Dashboard

analytics android-sdk data fastpix kaltura kaltura-player metrics sdk video video-metrics

Last synced: 21 Apr 2026

https://github.com/vishwas-chakilam/movies-review-scraping-analysis

A project for collecting, cleaning, and analyzing movie data. Includes scripts for web scraping (deprecated) and using the OMDb API to fetch movie details. Analyze and visualize data with Python and Power BI to uncover insights and trends in movie ratings and genres.

data dataanalysis datacleaning datavisualization matplotlib-python numpy-library pandas python webscraping

Last synced: 21 Apr 2026

https://github.com/stefen-taime/llm-rag-mtl-public-hospital

This project develops a Retrieve-Augment-Generate (RAG) model to answer questions using public data from Google reviews of hospitals in Montreal

data google-reviews hopital hospital hub ia llm montreal open-source quebec rag

Last synced: 21 Apr 2026

https://github.com/jdenn0514/surveycore

Core Survey Analysis Infrastructure

data r resear survey-analysis

Last synced: 21 Apr 2026

https://github.com/wittyicon29/kritika-iit-b-2023

Selection task for the summer projects of Kritika IIT-B

data data-analysis data-science

Last synced: 15 Mar 2025

https://github.com/rbcavi/factorio-mod-data

The modpack data for factorio-viewer

data factorio factorio-data factorio-mod-data

Last synced: 23 Apr 2026

https://github.com/grimen/python-humanizer

A human/developer friendly value humanizer - for Python.

data debug debugging format formatting humanize humanizer log logging print printing value

Last synced: 05 Jun 2026

https://github.com/syed-nihaal/car-price-prediction-and-performance-analysis

A data science notebook project focused on analyzing car features and building a model for car price prediction.

data data-analysis data-visualization jupyter-notebook python

Last synced: 23 Apr 2026

https://github.com/coryson/osm-mla-finder

Python script to locate institutions employing Medical Laboratory Assistants in Germany, developed for BTZ – Berufliche Bildung Köln GmbH. It uses OpenStreetMap, SerpAPI, and web scraping to find and verify relevant labs, clinics, and diagnostic centers.

beautifulsoup data openstreetmap osm python scraping serpapi webscraping

Last synced: 24 Apr 2026

https://github.com/jun-labs/jq

🧷 Let's practice jq.

data jq json json-data parse

Last synced: 27 Sep 2025

https://github.com/hruth-vik/sales-analysis-report

SalesScope is a powerful sales analytics dashboard that extracts insights, reveals trends, and drives strategy from raw data.

analytics data powerbi-report powerbi-visuals python

Last synced: 24 Apr 2026

https://github.com/stdlib-js/ndarray-vector-bool

Create a boolean vector (i.e., a one-dimensional ndarray).

bool boolean constructor ctor data javascript ndarray node node-js nodejs stdlib structure types vec vector

Last synced: 24 Apr 2026

https://github.com/marielachirinosr/cyclistic-data-analytics-project

This project explores user behavior within a fictional bike-sharing system, modeled after Cyclistic, operating in Chicago.

data data-visualization pandas powerbi-report powerbi-visuals python

Last synced: 24 Apr 2026

https://github.com/mehmetkahya0/gallstone_dataset_analysis_project

Gallstone Disease (Gallstone-1) Dataset Analysis (https://archive.ics.uci.edu/dataset/1150/gallstone-1)

analysis analytics data data-analysis data-science data-visualization database graph matplotlib python

Last synced: 25 Apr 2026

https://github.com/rubix982/product-quality-classification

An implementation for the CIKM AnalytiCup 2017 on the topic of "Product Title Quality". The goal is to take SKUs and rank the clarity and conciseness of their titles. Referenced papers are attached to this repository. The aim is to craft ensemble models that either replicate results or find new methods for classification.

data data-analysis information-retrieval jupyter-notebook machine-learning nlp python spacy-nlp

Last synced: 25 Apr 2026

https://github.com/xjwllmsx/hacker-news-engagement

Analyze Hacker News data to reveal which post types and posting hours spark the most discussion, using Python and a reproducible Jupyter notebook.

data data-analysis jupyter python

Last synced: 25 Apr 2026

https://github.com/mlkav/tri-hita-karana

Project Tri Hita Karana - Future Knowledge G20 Bali. DTS Kominfo x Binar Academy.

bali data data-science g20 science

Last synced: 06 Jun 2026

https://github.com/carlos-levi/twitterbots_analise_redesneurais

Project for the AI course: exploratory analysis and application of machine learning techniques to detect automated accounts (bots) on the 𝕏 (Twitter) platform

data machine-learning twitter-bot

Last synced: 06 Jun 2026

https://github.com/marielachirinosr/hotel-data-analysis

Pandas & Matplotlib Learning Analysis. Repository featuring data analysis projects using Pandas and Matplotlib libraries

data data-analysis matplotlib pandas python

Last synced: 25 Apr 2026

https://github.com/anuraganalog/blog

Data Science Blog

anuraganalog blog data science

Last synced: 26 Apr 2026

https://github.com/luminati-io/seleniumbase-with-proxy

SeleniumBase with authenticated proxies to bypass restrictions, enhance web scraping, and manage rotating proxies for better data extraction.

data data-collection proxy-server python residential-proxy selenium seleniumwire web-scraping

Last synced: 27 Apr 2026

https://github.com/ioanzicu/batch_loading_one-to-many_data_model

Unesco Batch Loading One-to-Many Data using Django

batch data django sqlite3

Last synced: 27 Apr 2026

https://github.com/gurpreet0022/crop-fertilizers-recommendation-system-using-ml-

This repository is a part of AICTE - Shell Internship on 'Green Skills using AI technologies' Cycle 3.

data datapreprocessing datavisualization jupyter-notebook machine-learning python

Last synced: 27 Apr 2026

https://github.com/schenkd/tweetminer

Data Miner for Twitter Streaming API

data dataminer datamining java twitter twitter-api twitter4j

Last synced: 07 Jun 2026

https://github.com/bhumitbedse/machine-learning-projects

AI, machine learning, deep learning, computer vision, and NLP projects with code

computer-vision data data-science deep-learning machine-learning natural-language-processing python

Last synced: 27 Apr 2026

https://github.com/o-rumiantsev/exchange

Data Exchange System (Prototype)

chat css data exchange system websocket

Last synced: 27 Apr 2026

https://github.com/santiagoenriquega/custom_database

Python-based database library for database management, indexing, transactions, and constraints, showcasing foundational database concepts.

data data-engineering database database-design python

Last synced: 27 Apr 2026

https://github.com/tacticalnuclearraccoon/dataviz_with_js

Sample data visualization as part of a training course on JavaScript frameworks for dataviz

d3 data datawrapper echarts javascript visualization

Last synced: 27 Apr 2026

https://github.com/drkane/area-profiles

Produce UK area profiles based on various data sources

dash-plotly data flask statistics uk

Last synced: 27 Apr 2026

https://github.com/oguzhanfatihkucuk/data-analytics-project-kafka-spark

The data in this project was collected in a database using Apache Kafka and processed with Apache Spark Streaming. The project aims to create a forecasting model and analyze sales forecasts per customer.

big-data data data-visualization hadoop kafka ml mlpipeline plt pyhton spark

Last synced: 28 Apr 2026

https://github.com/leonardomusini/mbe-growth-nexus-converter

Python tool to convert laboratory text files into NeXus files for Molecular Beam Epitaxy (MBE) data.

data data-engineering nexus python

Last synced: 28 Apr 2026

https://github.com/priyanshubiswas-tech/e-commerce_data_analysis

Analyzes 9,994 e-commerce transactions to uncover insights on sales trends, customer behavior, profitability, and logistics using EDA and visualization. Identifies top products, customer segments, and shipping efficiencies to optimize marketing, inventory, and operations, making it valuable for retail, finance, and logistics.

data data-analysis data-visualization pandas pandas-dataframe plotly-analytics-projects plotly-express python

Last synced: 28 Apr 2026

https://github.com/entorb/analyze-ha-energy

Analyze Home Assistant Solar Production Data

data home-assistant pandas photovoltaic pv python

Last synced: 08 May 2026

https://github.com/n-ce/localstorage-data-interchange-manager

Implementation of local storage data interchange using map data structure.

data export import javascript js-maps json localstorage

Last synced: 28 Apr 2026

https://github.com/howz1t/ptypes

This package provides useful data types for use in PHP.

badges composer computer-science data data-structures data-types packagist php types

Last synced: 29 Apr 2026

https://github.com/gcoronelc/uni-epies-das-2022-2

Systems Analysis and Design course at UNI-EPIES.

dao data datos gcoronelc java jdbc mvc mvc-pattern sql sqlserver

Last synced: 29 Apr 2026

https://github.com/mtalhaofc/nutrition_system

A simple AI-powered web app built using Streamlit that provides personalized weekly meal plans and nutrition recommendations based on user demographics, health goals, and nutritional preferences.

cosine-similarity data data-science food machine-learning model nutrition pandas python streamlit

Last synced: 29 Apr 2026

https://github.com/sn0wfree/factor_table

A universal connector for all kinds of data sources that manages all kinds of data as factor types in one package

connector data database factor

Last synced: 29 Apr 2026

https://github.com/stdlib-js/array-struct-factory

Return a constructor for creating arrays having a fixed-width composite data type.

array composite data factory javascript node node-js nodejs stdlib struct structure typed typed-array types

Last synced: 29 Apr 2026

https://github.com/barkintopcu/apple-stock-prediction-edu

The purpose of this project is to demonstrate time series analysis techniques using real-world stock data, without offering any form of financial advice or investment suggestion.

data deep-learning forecasting machine-learning python

Last synced: 29 Apr 2026

https://github.com/mr-dhan/eda-sales-customer-transactions

In the competitive world of retail business, a deep understanding of customer behavior is an important foundation for strategic decision-making. However, customer transaction data is often large and complex, so an effective analysis process is needed to uncover valuable insights.

dashboard data data-analysis data-analysis-python data-science data-visualization eda python

Last synced: 29 Apr 2026

https://github.com/chandansoren/financial-budget-analysis

Financial budget for 2021

analytics data python

Last synced: 29 Apr 2026

https://github.com/koltyakov/pgcopy

🐘 PostgreSQL data migration tool

cli data database golang migration postgresql sync

Last synced: 29 Apr 2026

https://github.com/diegoperea20/pytorch-vs-tensorflow

Testing the differences between the PyTorch and TensorFlow libraries in different prediction and classification applications; each offers improvements depending on the problem or dataset it is assigned.

classification data images prediction pytorch tensorflow

Last synced: 29 Apr 2026

https://github.com/tazeenrashid/orders-analysis-using-python-sql-server-and-tableau

I sourced some Orders data through Kaggle; did EDA using Python and then fetched some insights out of cleaned data using SQL Server (SSMS). Then, I built a Tableau Dashboard for some visual insights. Have a look and share your feedback!

analytics data eda jupyter-notebook python sql tableau

Last synced: 29 Apr 2026

https://github.com/istinnew/eniac_ab_insight

Dive into a comprehensive analysis aimed at boosting iPhone 13 sales by optimizing the Click-Through Rate (CTR) of the “SHOP NOW” button. Compare different button designs and determine the most effective strategy for increasing engagement.

ab-testing data data-analysis data-engineering data-science data-visualization google googlecolab libraries python testing testing-tools visual-studio-code

Last synced: 29 Apr 2026

https://github.com/smokingplaya/gm_datastorages

💖 Data Storages like in JavaScript.

data dev gmod javascript lua

Last synced: 29 Apr 2026

https://github.com/ipstack/wizard

Wizard for creating ipstack databases

composer data geo geoip id-database info ip ipstack ipstack-wizard php wizard

Last synced: 29 Apr 2026

https://github.com/devcsrj/docparsr-jvm

JVM client for https://github.com/axa-group/Parsr

data document extraction nlp ocr pdf

Last synced: 08 Jun 2026

https://github.com/wireservice/workbench-lookup

A port of `agate-lookup` to Workbench

data journalism lookup workbench

Last synced: 08 Jun 2026

https://github.com/badranalyst/covid-deaths-and-vaccinations-sql-data-exploration

This project involves exploratory data analysis on COVID-19 deaths and vaccinations data using SQL. It aims to uncover trends, patterns, and insights related to vaccination rates and their impact on mortality. The analysis provides a clearer understanding of the pandemic's dynamics, facilitating data-driven decisions in public health.

covid-19 data data-exploration dataset sql

Last synced: 19 Feb 2026

https://github.com/gvatsal60/ds-on-kaggle

A collection of data science projects, experiments, and insights from Kaggle competitions and datasets

data data-science data-visualization numpy pandas python3

Last synced: 29 Apr 2026

https://github.com/patrickdavies100/pipeline38

An application to automate the creation and execution of SQL queries.

data pandas-dataframe pipeline postgresql psycopg2 sqlalchemy

Last synced: 30 Apr 2026

https://github.com/abhinav330/instagram-influencers-analysis

This Jupyter Notebook focuses on preprocessing and visualizing data from an Instagram profiles dataset. It includes data loading, inspection, visualization, and some data preprocessing steps.

data data-science data-visualization exploratory-data-analysis exploratory-data-visualizations influncer-products instagram scikit-learn sklearn

Last synced: 08 Jun 2026

https://github.com/samiksha29-patil/hr-employee-data-analysis-visualization-in-python

This project focuses on analyzing an HR Employee Dataset that contains details about employees such as demographics, job status, salaries, performance reviews, satisfaction levels, and attrition reasons.

csv-files data data-visualization dataanalysis matplotlib numpy pandas python seaborn

Last synced: 30 Apr 2026

https://github.com/omarsaad21/it-salary-eda

A Python EDA project on IT department salary data, with data exploration and data visualization addressing several questions about the dataset

data explotary-data-analysis juypter-notebook numpy pandas python visualization

Last synced: 30 Apr 2026

https://github.com/onekiloparsec/arcsecond-swift

The Swift client for interacting with the server-side RESTful resources of arcsecond.io.

arcsecond astro-library astronomy data django swift swift-3

Last synced: 30 Apr 2026

https://github.com/mmaithani/kaggle-projects

Collection of all the resources from the competition, kernel, and data sections, plus all the magic code I have been using to get the most out of a problem

computer-vision data data-science image-processing machine-learning python

Last synced: 30 Apr 2026

https://github.com/raphcodec/rand-org-generator

Rand-Org-Generator attempts to mimic real company structures. The dummy data generated by this project is intended to be used in analytics projects or web projects.

data duckdb factory-boy faker org-chart polars python3

Last synced: 30 Apr 2026

https://github.com/miguelmedinacastro/trabalho-dados-r

Final project for the Exploratory Data Analysis course

data data-science data-science-projects data-visualization database r rstudio

Last synced: 01 May 2026

https://github.com/dnut/json-match-finder

Python application used to match listings against openings via authenticated JSON API access.

data data-structures data-wrangling database json-api python-application python-modules

Last synced: 01 May 2026

https://github.com/benmizrahi/reactivejs

microservices event bus for async/sync communications

data microservices nodejs

Last synced: 01 May 2026

https://github.com/lut-ful/ibm-capstone-project-stack-overflow-job-survey

IBM Data Analyst Professional Certificate program final project.

cognos data data-analytics looker power-bi python sql statics

Last synced: 01 May 2026

https://github.com/dnut/associations

Python 3 library to identify high-dimensional statistical relationships in any data set.

analytics arch-linux association-rules data data-analysis data-mining data-science machine-learning python-modules

Last synced: 01 May 2026

https://github.com/skygenesisenterprise/aether-meet

Aether Meet is a lightweight, open-source client built for privacy, speed, and seamless integration within the Aether Office ecosystem

applications data docker javascript meeting nextjs notes typescript voip

Last synced: 01 May 2026

https://github.com/chompfoods/sdk-kotlin

Kotlin SDK for the Chomp Food & Recipe Database API. Use our API to get high-quality data on recipes and 875,000+ branded/grocery foods plus raw ingredients.

api branded chomp data database food foods grocery ingredients kotlin nutrition raw recipe-api recipes sdk sdk-kotlin

Last synced: 01 May 2026

https://github.com/nel-zi/climainsights

Developed an automated ETL pipeline using Apache Airflow and Python to collect, process, and store weather data from multiple cities via Weatherstack API. Implemented data cleaning, orchestration, and error handling to ensure accuracy and scalability.

airflow apache-spark data data-engineering engineering etl-pipeline

Last synced: 01 May 2026

https://github.com/rec/kson

🔑 Json with the rough edges removed 🔑

data json serialization

Last synced: 01 May 2026

https://github.com/palutz/rust_nextstep

A series of exercises to play with more advanced topics in Rust

data deltalake graphql grpc p2p protobuf rust rust-lang xml

Last synced: 01 May 2026

https://github.com/sorairolake/japanese-era-dataset

Dataset of Japanese era names (元号)

data dataset date japanese-calendar japanese-era json toml wareki yaml

Last synced: 01 May 2026

https://github.com/sebastianbrzustowicz/github-data

Java + Spring Boot. Application for sending requests to GitHub API and collecting received data.

api ci data github json junit mapping parallel repository rest-api stream

Last synced: 01 May 2026

https://github.com/thedevreda/jadaerospace

A real-life project showing how to improve aircraft parts sales and help salespeople focus on more effective products at JadAero

data data-analysis data-cleaning data-visualization jupyter-notebook powerbi python

Last synced: 02 Aug 2025