An open API service indexing awesome lists of open source software.

data

Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)

https://github.com/boettiger-lab/taxadb-cache

Cache for taxadb files

data

Last synced: 19 May 2026

https://github.com/joseluisq/input-verifier

Some useful functions to check common data input.

data input utils validation

Last synced: 19 Jul 2025

https://github.com/hamolicious/console-table

Displaying Tables in the console

console data pypi python table

Last synced: 11 Jul 2025

https://github.com/encoreshao/data-science

Data analyze examples, using Jupyter notebook and Python!!!

data dataanalysis encore jupyter-notebook

Last synced: 29 Mar 2025

https://github.com/pulgamecanica/d3examples

https://www.oreilly.com/library/view/d3-for-the/9781492046783/

d3 d3-visualization d3js d3v4 data javascript

Last synced: 19 May 2026

https://github.com/kameronbrooks/datalys2-reporting

Datalys2 Reports allows you to create rich, interactive reports by simply defining a JSON configuration embedded in your HTML. It handles the layout, data visualization, and interactivity, so you don't need to write custom React code for every report.

data data-visualization html react

Last synced: 08 Apr 2026

https://github.com/austinv11/pypeline

A simple data pipeline builder for Python 3+

data leveldb pypeline python python3 stream-processing

Last synced: 20 Aug 2025

https://github.com/rrwen/twitter2return

Module for extracting Twitter data using option objects

access api data extract geo get location media oauth object option post rest return sample social stream token tweet twitter

Last synced: 03 Apr 2025

https://github.com/theanujsinha01/data-analytics-portal-

Data Analytics Portal Built a web-based data analytics tool using Streamlit, Pandas, and Plotly. Supported CSV and Excel uploads (up to 200MB) for data exploration. Features included statistical summaries, group-by aggregation, and frequency counts. Integrated interactive charts (bar, pie, line, scatter) for visual insights. This tool is live now.

analytics data portal

Last synced: 28 Apr 2026

https://github.com/shahules786/titanic-analysis

different analysis of titanic accident (data from kaggle)

analyze data titanic-kaggle

Last synced: 26 Jun 2025

https://github.com/jigyasag18/financial-risk-analysis-project

The Credit Card Financial Risk Analysis Dashboard is a real-time Power BI tool designed to provide insights into credit card transactions and customer demographics. It features interactive visualizations, efficient data processing, and actionable insights to support decision-making. Utilizing data from SQL database, the dashboard tracks key metrics

data dataanalysis database datacleaning datapreprocessing dataprocessing datavisualization financial-analysis financialriskanalysis mysql powerbi sql statistical-analysis

Last synced: 06 Mar 2026

https://github.com/henryssondaniel/teacup-java-report-file

Report Teacup data to a file

data file logs reports teacup

Last synced: 22 Jul 2025

https://github.com/naufalbasara/superstores-pipeline

Data Pipeline on Dummy E-commerce with Apache Airflow

airflow data data-engineering data-pipeline data-warehouse postgresql

Last synced: 16 May 2026

https://github.com/deliprofesor/cardiac-data-analysis-exploring-cholesterol-and-heart-rate

This project analyzes a heart disease dataset to explore the relationship between cholesterol, heart rate, and chest pain type. It includes normality tests, outlier detection, correlation analysis, MANOVA, post-hoc tests, and VIF analysis, with visualizations using histograms, heatmaps, and boxplots.

correlation-analysis data data-cleaning data-visualization machine-learning manova post-hoc-analysis python tukey-hsd vif

Last synced: 17 May 2026

https://github.com/saksham-jain177/data-analysis

A collection of data analysis and machine learning projects across various datasets. Explore predictive modeling, data visualization, and insights from real-world data. Projects include sales predictions, disease detection, customer segmentation, and more.

api data data-analysis data-cleaning data-science data-visualization datamodeling dataset datasets exploratory-data-analysis python python3 web-scraping youtube-api

Last synced: 01 May 2026

https://github.com/amarlearning/exploring-the-evolution-of-linux

Data Analysis about the development of the Linux operating system by exploring its Git repository history.

cleaning-data data data-analysis data-wrangling datacamp first-commit git-history linux

Last synced: 12 May 2026

https://github.com/gsmithun4/expressjs-field-validator

Plugin for validating JSON request, middleware for expressjs

data express-js expressjs json-request middleware nodejs request rest-api validation

Last synced: 06 Mar 2026

https://github.com/eyluldursun/data-science-project

This project involves a data science analysis conducted on the Obesity Data Set. The study explores factors influencing obesity, includes data visualization, and develops predictive models. The goal of the project is to gain insights to help prevent obesity.

data data-science obesity r rmarkdown

Last synced: 26 Jun 2025

https://github.com/rameshaditya/dynamic-hybrid-data-grid

Facilitates faster read-and-write of large ordered collections of data.

algorithms data data-structures storage

Last synced: 30 Jun 2026

https://github.com/skygenesisenterprise/aether-calendar

Aether Calendar is a lightweight, open-source client built for privacy, speed, and seamless integration within the Aether Office ecosystem

applications calendar capacitorjs data javascript linux macos nextjs typescript windows

Last synced: 12 Apr 2026

https://github.com/sakan811/gachascope

Evaluate the cost-effectiveness of various in-app purchase bundles available in gacha games.

data data-analysis data-visualization game honkai honkai-star-rail honkai-starrail hoyoverse javascript nextjs tableau tableau-public typescript wutheringwaves

Last synced: 04 May 2026

https://github.com/rd-uk/rduk-data-sqlite

SQLite Data Provider implementation for rduk-data

data rduk sqlite

Last synced: 16 May 2026

https://github.com/tuscanicz/doctrine-data-applier

Symfony bundle for Doctrine Migrations of data using doctrine entities

data database doctrine entity migrations symfony symfony-bundle

Last synced: 02 Feb 2026

https://github.com/shailu2004/azure_big_data_project

This project demonstrates a comprehensive Azure Data Engineering workflow using multiple Azure resources to process and analyze an e-commerce dataset. The dataset consists of 8 files containing details about customers, payments, orders, and other key information

ai azure cloud data data-engineering

Last synced: 08 Jul 2025

https://github.com/skygenesisenterprise/api-service

The Official Sky Genesis Enterprise API Service Ecosystem

api-service client cryptography data dns docker javascript nextjs service stalwart typescript websocket

Last synced: 31 Dec 2025

https://github.com/akashlogics/street-data-tracking

Detect, Track and Count number of persons walking across the path(s) making use of YOLO. This Python project tracks people moving across predefined street zones

analysis data excel newdataset object-detection opencv python python3 yolo

Last synced: 19 May 2026

https://github.com/buildinamsterdam/contentful-graphql

Contentful GraphQL connection

contentful data graphql

Last synced: 05 Jan 2026

https://github.com/bho0920/crime-data-analysis-eu

Crime Data Analysis for Self-Defense Tool Market Entry in the EU.

data data-analysis sql sqlite tableau

Last synced: 21 Jun 2025

https://github.com/denisecase/cintel-04-reactive

Interactive analytics, reactive app built with Shiny for Python

analytics bokeh data flights interactive mtcars penguins python relationships shiny

Last synced: 20 Jun 2025

https://github.com/zshn1248/pyfilecrypto

PyFileCrypto is a Python module for easy encryption and decryption of files using the cryptography library. It provides a simple interface to generate encryption keys, encrypt files, and decrypt files securely.

data decryption encryption file security-tools

Last synced: 07 Apr 2026

https://github.com/ezeparziale/analisis-uso-bicicletas-caba

:biking_man: Análisis de como afecto la pandemia el uso de las bicicletas en CABA.

data data-science data-visualization

Last synced: 14 Mar 2025

https://github.com/ezeparziale/analisis-data-delitos

:gun: Analsis de delitos de CABA

data data-science

Last synced: 14 Mar 2025

https://github.com/official-imvoiid/multifetch

A high-performance web scraper for bulk image and GIF extraction from reliable sources — built for AI/ML data pipelines and large-scale media collection

aiml data dataset gifscraper imagescraper python pythontool tools webscraper windows

Last synced: 19 May 2026

https://github.com/samaalharbi2/virtual-work-experience---data-analysis-at-stc

Virtual Work Experience in Data Analysis at STC

analysis data data-visualization misk stc

Last synced: 20 Jun 2025

https://github.com/kingabzpro/5-airflow-alternatives-for-data-orchestration-tutorial

Code examples of Luigi, Prefect, Kedro, Dagster, and MageAI

dagster data data-orchestration kedro luigi mageai prefect

Last synced: 18 Apr 2026

https://github.com/randomgamingdev/randomgamingdev.github.io.data

The data for RandomGamingDev.github.io (feel free to build your own website off of mine :D)

blog custom data projects projects-list

Last synced: 02 Jan 2026

https://github.com/prcharan592/olympic-insights-historical-data-analytics-in-r

This project analyzes 120 years of Olympic history (1896–2016), uncovering trends and insights from the data

data data-analytics data-science data-visualization kaggle r-programming

Last synced: 03 Apr 2025

https://github.com/sharoonjoseph321/social_media_eda

Data Analysis on social media apps ,using pandas, python, matplotlib.

data data-analysis data-science data-visualization matplotlib programming-language project python pythonprojects

Last synced: 03 Mar 2025

https://github.com/ressuman/next-blog-1-project

Next.js with TypeScript: Fetching Data and Setting Up Routes. This project demonstrates my first experience with Next.js using TypeScript. It involves fetching posts from the JSON Placeholder dummy API, setting up pages, and linking routes.

api-rest data html-css-javascript jsx nextjs14 routing typescript

Last synced: 15 May 2026

https://github.com/lu-sketch/chocolate-imports-dataset

Chocolate Imports for South Africa

data eda visualization

Last synced: 18 May 2026

https://github.com/md-emranhossen/leetcode-practice

This repository stores my solutions to LeetCode problems, organized by problem number and title.

cpp data datastructures-algorithms leetcode-solutions

Last synced: 26 Jun 2025

https://github.com/nonsignificantp/enfermedades-inmunoprevenibles

Analisis sobre el efecto de las vacunas y la incidencia de casos de enfermedades inmunoprevenibles en la Ciudad de Buenos Aires entre los años 1995 y 2016

a analysis argentina buenosaires data hepatitis science vaccination

Last synced: 18 Jun 2026

https://github.com/jun-labs/json-handling

🔍 Json 데이터 핸들링 예제.

data gson jackson json json-object

Last synced: 15 May 2026

https://github.com/ayushman0511/data-warehouse-project1

A comprehensive guide to building a data warehouse with SQL Server, including ETL processes, data modeling, and analytics.

data data-ana data-anal data-cleaning data-enginee data-lakehou datalake datasci dataware datawarehouse datawarehousi etl etl-job etl-pipeline medallion sql sql-quer sql-query sql-server sqlserver

Last synced: 26 Jun 2025

https://github.com/majorcluster/clj-data-adapter

A Clojure library designed to convert data

clojure data lib library

Last synced: 12 Jul 2025

https://github.com/stkisengese/numpy-data-fundamentals

A comprehensive collection of NumPy exercises covering array manipulation, slicing, broadcasting, random data generation, and real-world data analysis applications.

data data-analysis numpy pre-processing

Last synced: 16 May 2026

https://github.com/xylambda/data-structures-algorithms

This repository provides implementations of popular algorithms and abstract data types using JAVA.

algorithm algorithms array arraylist avl-tree data data-structures graph heap iterative java linked list netbeans queue recursive set stack tree

Last synced: 30 Jun 2026

https://github.com/dsietz/daas-workshop

Workshop for building a Data as a Service platform using the DaaS SDK.

archconf daas daas-pattern data dataprivacy nfjs rust rust-lang

Last synced: 20 May 2026

https://github.com/kashyap-prabhat/sigma

A Scala library for probability and statistics formulas, including rules for probability calculations.

data formulas library mathematics probability scala statistics

Last synced: 30 Jun 2026

https://github.com/bcodmo/workshop_bios_oceanographic_data

Repository holding lesson on Data Management Basics. See webpage for rendered view: https://bcodmo.github.io/workshop_bios_oceanographic_data/

bco-dmo data datamanagement fair workshop

Last synced: 08 Apr 2026

https://github.com/jigyasag18/orders-sales-analysis-report-using-power-bi

This repository analyzes and visualizes office supply sales data to improve profitability. It examines sales performance by various factors, using charts to provide insights and actionable recommendations for sales optimization, market research, and product mix.

data dataanalysis dataanalytics dataset powerbi powerbi-dashboards powerbi-report powerbi-reports powerbi-visuals powerbidashboard

Last synced: 18 Feb 2026

https://github.com/codehub001/ai-driven-automation-for-data-quality-monitoring-in-cloud-data-warehouses

This project focuses on leveraging AI to automate data quality monitoring in cloud data warehouses. Traditional data validation methods often require manual intervention and fail to scale with increasing data complexity. By integrating machine learning models, this approach enables real-time anomaly detection, automated data cleansing.

csv-export csv-import dashboard data datacleaning lib modeltraining python testing-library visualization

Last synced: 13 May 2025

https://github.com/theduardomaciel/cc-pe

Conteúdos, scripts em R e datasets utilizados durante a matéria de Probabilidade e Estatística.

data probability r statistics

Last synced: 27 Mar 2025

https://github.com/wolfchamane/amjs-data-types

Data types for your OOP javascript project

cjs data javascript modules nodejs oop types

Last synced: 20 May 2026

https://github.com/circlexo/circlexo

Open-source project to seamlessly integrate and manage your business workflow, connecting Jira, GitHub, Discord, Stripe, RevenueCat, and OpenAI all in one intuitive platform.

bussiness-intelligence data discord-bot forge github google jira kpis ploi revenuecat stripe vapor

Last synced: 20 May 2026

https://github.com/shimul-zahan/all-practices-tukitaki

This is repository for all the practice tasks or learning new things. Cause environment are setup and no need to setup a new project or environments.

data data-science datapreprocessing deep-learning machine-learning neural-network practice python visualization

Last synced: 12 Jan 2026

https://github.com/furkankarakuz/turkey_earthquake

This project focuses on analyzing and visualizing earthquake data specific to Turkey. It aims to provide insightful visualizations on topics such as earthquake frequency, location, and magnitude using data obtained from Boğaziçi University Kandilli Observatory and Earthquake Research Institute.

api data data-visualization earthquake python python3 request streamlit turkey turkey-earthquake

Last synced: 20 May 2026

https://github.com/heshamalsaqqaf2/python-projects

Beginner Level Python Projects

data python3

Last synced: 22 Jul 2025

https://github.com/chompfoods/stub-jaxrs-jersey

JAX-RS Jersey server stub for the Chomp Food & Recipe Database API. Use our API to get high-quality data on recipes and 875,000+ branded/grocery foods plus raw ingredients.

api branded chomp data database food grocery ingredients jax-rs jersey nutrition raw recipe-api recipes server server-stub stub stub-server

Last synced: 02 May 2026

https://github.com/clagiordano/marketplaces-data-export

LIbrary that share the same interface and provide adapters for online marketplaces services

adapter amazon api clagiordano data ebay ebay-api export marketplaces mws mws-api rest soap

Last synced: 22 Mar 2025

https://github.com/scanthe-net/scanthenet-php

PHP API Data Fetcher.

api data php scan scanner threat

Last synced: 25 Jul 2025

https://github.com/tomasfarias/louis

Yet another challenge project

challenge data python

Last synced: 29 Mar 2025

https://github.com/jigyasag18/fake-news-prediction-project

The Fake News Prediction App Repository offers a machine learning project that focuses on identifying the authenticity of news articles as fake or real. It uses a dataset of 20,000 articles and employs methods such as TF-IDF vectorization and the Porter stemming algorithm, achieving around 97% classification accuracy with logistic regression model.

data datapreprocessing logistic-regression machine-learning machine-learning-algorithms numpy pandas prediction stemming vectorization

Last synced: 08 Jun 2026

https://github.com/vijaykumar1303/sales-data-analysis-and-dashboard-development

To analyze sales data to uncover insights into sales performance, trends, and patterns, and to develop an interactive dashboard that provides a comprehensive view of sales metrics and KPIs.

data dataanalysis datacleaning datavisualisation dax-query powerbi powerquery sql sqldataanalysis

Last synced: 11 Feb 2026

https://github.com/jigyasag18/credit-card-fraud-detection-using-machine-learning

This repository presents a credit card fraud detection system utilizing a Logistic Regression model trained on a dataset of 284,807 transactions with significant class imbalance. After employing under-sampling for balance, the model achieves a test accuracy of around 93.40%, showcasing the effectiveness of ML in identifying fraudulent transactions.

credit-card-fraud creditcardfrauddetection data dataset logistic-regression logisticregression machine-learning machine-learning-algorithms mlproject mlprojects

Last synced: 02 Sep 2025

https://github.com/pyfig/s21_data-science-bootcamp

School21 Bootcamp Data Science

data data-science numpy pandas python school21

Last synced: 26 Jun 2025

https://github.com/gusgitmath/cnn_braintumor_classification

Built a CNN for MRI brain tumor classification (Glioma, Meningioma, No Tumor, Pituitary) with 99.4% accuracy. Used data augmentation, optimized learning rates (Adam), and included EarlyStopping, ReduceLROnPlateau for superior performance, averting overfitting. Boosts early, accurate diagnosis, advancing medical treatment.

classification convolutional-neural-networks data deep-learning machine-learning

Last synced: 25 Jul 2025

https://github.com/ntnn/dataparse

Parsing, transforming and unmarshalling data.

data data-parser data-parsing data-transformation golang golang-lib

Last synced: 30 Jun 2026

https://github.com/danielrosehill/ghg-ebitda-correlations

Streamlit data visualisation examining correlation between emissions & profitability

data sustainability sustainability-data

Last synced: 14 Mar 2025

https://github.com/ressuman/csv-writer-project

CSV Writer with TypeScript. This project demonstrates my implementation of a CSV writer using plain TypeScript and JavaScript, without relying on any frameworks.

data javascript typescript

Last synced: 15 May 2026

https://github.com/dhi13man/rca_ace

RCA Ace is designed for organizations seeking to enhance their understanding and utilization of insights derived from Root Cause Analyses (RCAs).

analytics data enterprise open-source python python3 rca

Last synced: 10 Sep 2025

https://github.com/sam-moen/data-analyst-portfolio

This is a repository that I have created to showcase skills, share projects and track my progress in Data Analytics / Data Science related topics.

data dataanalysis matplotlib mssql pandas powerbi python seaborn sql

Last synced: 08 Mar 2026

https://github.com/dilkushsingh/webscraping-with-selenium-and-beautifulsoup

Web Scrapped a popular tech gadgets website using Selenium and BeautifulSoup, also performed Data Analysis on scrapped data.

beautifulsoup data datacleaning datagathering eda exploratory-data-analysis python selenium webscraping

Last synced: 24 Feb 2026

https://github.com/nrrso/ex_quickfs

A wrapper / elixir client / SDK to access the quickfs.net API.

data elixir financial financial-data

Last synced: 04 Sep 2025

https://github.com/jigyasag18/aircraft-data-management

This repository offers a comprehensive simulation of global military air deployments involving 10 countries, aircraft models, mission types, and strategic zones. It analyzes air power distribution, mission intent (offensive, defensive, support), and geopolitical positioning. The project provides structured insights into regional & zone level threat

aircraft-data aircraft-performance data data-analysis data-visualization database database-management dataset datavisualisation mysql powerbi powerbi-report powerbi-visuals sql

Last synced: 04 Feb 2026

https://github.com/ailixter/gears-dictionary

The project, which Gears Dictionary

arrays data dictionaries dictionary php struct utilities

Last synced: 19 Jul 2025

https://github.com/0xHericles/SpamDetector

:email: A Simple Python Spam Detector with Scikit-Learn

data ham machine-learning python sklearn spam

Last synced: 24 Mar 2025

https://github.com/living-with-machines/zoonyper

Code to make it easy to import and process Zooniverse annotations and their metadata in Python/Jupyter Notebooks

crowdsourcing data data-processing data-science python zooniverse

Last synced: 04 Jul 2025

https://github.com/acovaci/orbit

ORBIT: an Open source Rust-based implementation of a data Build Tool, inspired by DBT

cargo clap-rs data data-warehouse dbt rust rust-lang tokio-rs

Last synced: 16 Mar 2025

https://github.com/g3th/fit_file_decoder

Decodes '*.fit' files and returns readable values.

bytes data decoder fit-file hex parsing

Last synced: 30 Jun 2025