An open API service indexing awesome lists of open source software.

data

Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)

https://github.com/kitpymes/netcore-serialize-data

El objetivo es resguardar datos secretos encriptando y serializando archivos .json y convertirlos en archivos .dat.

csharp data decrypt encrypt json net netcore2 serialize

Last synced: 29 Apr 2026

https://github.com/howz1t/ptypes

This package provides useful data types for use in PHP.

badges composer computer-science data data-structures data-types packagist php types

Last synced: 29 Apr 2026

https://github.com/gcoronelc/uni-epies-das-2022-2

Curso de Análisis y Diseño de Sistemas en UNI-EPIES.

dao data datos gcoronelc java jdbc mvc mvc-pattern sql sqlserver

Last synced: 29 Apr 2026

https://github.com/mtalhaofc/nutrition_system

A simple AI-powered web app built using Streamlit that provides personalized weekly meal plans and nutrition recommendations based on user demographics, health goals, and nutritional preferences.

cosine-similarity data data-science food machine-learning model nutrition pandas python streamlit

Last synced: 29 Apr 2026

https://github.com/mumtaz4118/scraping-medium-and-data-analytics

The file DataExtraction.py extracts information from the json files scrapped by the scrapper medium_scrapper_post.py. To extract information from json files scrapped by medium_scrapper_tag_archive.py (scrapping from tags archive) then use Data_Extraction_Archive_Tags.py

data data-analysis data-analytics data-extraction data-preprocessing data-science data-scraping deep-learning machine-learning python

Last synced: 29 Apr 2026

https://github.com/sn0wfree/factor_table

an universal connector for all kind data source and manage all kind data as factor type by one package

connector data database factor

Last synced: 29 Apr 2026

https://github.com/apsalverda/ebird-hotspot-menu-bar-python

🪶 Retrieve recent hotspot observations using eBird API

data ebird ebird-api hotspot instructions live livedata macos menubar observations platypus

Last synced: 29 Apr 2026

https://github.com/shoaib1522/data-aggregator-tool-in-python

This all are the illustration of the things used in " Data Aggregation Tool " as a scenario of Data Science Engineer written in Document(PDF)

data data-science dataaggregation lists python-script python3 sets-python tuples

Last synced: 29 Apr 2026

https://github.com/barkintopcu/apple-stock-prediction-edu

The purpose of this project is to demonstrate time series analysis techniques using real-world stock data, without offering any form of financial advice or investment suggestion.

data deep-learning forecasting machine-learning python

Last synced: 29 Apr 2026

https://github.com/martgro/datagrabber

Tool for extracting data points from plots

data extract image plots python3

Last synced: 29 Apr 2026

https://github.com/chandansoren/financial-budget-analysis

Financial budget for 2021

analytics data python

Last synced: 29 Apr 2026

https://github.com/ayushman0511/data-analytics-project1

This repository contains a collection of SQL scripts demonstrating various analytical techniques, such as changes over time, cumulative, performance, data segmentation, part-to-whole analysis.

analytics busine data data-anal data-enginee data-sci data-scien database datascien query reporting sql sql-query sql-server window-func

Last synced: 17 Jun 2026

https://github.com/mirzayasirabdullahbaig07/advanced-sql-in-python

This repository covers advanced SQL concepts implemented using Python. It demonstrates how to interact with databases, run complex queries, perform joins, aggregations, window functions, and more using libraries like sqlite3, SQLAlchemy, or pandas. Ideal for data analysts and developers looking to integrate SQL power into Python workflows.

data databases dbms mysql nosql programing-language python sql

Last synced: 29 Apr 2026

https://github.com/diegoperea20/pytorch-vs-tensorflow

Testing the differences of the pytorch and tensorflow libraries in the different prediction and classification applications, each of them gives improvements depending on the problem they are assigned or data set assigned.

classification data images prediction pytorch tensorflow

Last synced: 29 Apr 2026

https://github.com/tazeenrashid/orders-analysis-using-python-sql-server-and-tableau

I sourced some Orders data through Kaggle; did EDA using Python and then fetched some insights out of cleaned data using SQL Server (SSMS). Then, I built a Tableau Dashboard for some visual insights. Have a look and share your feedback!

analytics data eda jupyter-notebook python sql tableau

Last synced: 29 Apr 2026

https://github.com/istinnew/eniac_ab_insight

Dive into a comprehensive analysis aimed at boosting iPhone 13 sales by optimizing the Click-Through Rate (CTR) of the “SHOP NOW” button, compare different button designs and determine the most effective strategy for increasing engagement.

ab-testing data data-analysis data-engineering data-science data-visualization google googlecolab libraries python testing testing-tools visual-studio-code

Last synced: 29 Apr 2026

https://github.com/vlamug/ratibor

Ratibor is a service for making metrics from data

data metrics prometheus

Last synced: 10 Mar 2026

https://github.com/ehvenga/data.driven.modeling

Repository to practice data driven modelling

data data-modeling

Last synced: 23 Mar 2025

https://github.com/dxtaner/graphql_events

Graphql-Events

data events graphql

Last synced: 29 Apr 2026

https://github.com/mbagalman/lattice-doe

Python code to create experimental designs optimized to meet statistical power targets

abtesting data datascience designofexperiments experimentaldesign statistics

Last synced: 19 Jun 2026

https://github.com/fs23yayan/membuatfungsidatapemrosesan

Membuat Fungsi Data Pemrosesan for Data Science in Marketing : Customer Segmentation with Python - Part 2

data function processing

Last synced: 29 Apr 2026

https://github.com/devcsrj/docparsr-jvm

JVM client for https://github.com/axa-group/Parsr

data document extraction nlp ocr pdf

Last synced: 08 Jun 2026

https://github.com/axnjr/csv-parser-utils

My own Pandas in Go, Python & Rust, Utility methods for Handling CSV Files in Core Go & Rust with bindings for python.

csv data dataanalysis datatools go golang golang-application pandas python rs rust

Last synced: 29 Apr 2026

https://github.com/gvatsal60/ds-on-kaggle

A collection of data science projects, experiments, and insights from Kaggle competitions and datasets

data data-science data-visualization numpy pandas python3

Last synced: 29 Apr 2026

https://github.com/abhinav330/instagram-influencers-analysis

This Jupyter Notebook focuses on preprocessing and visualizing data from an Instagram profiles dataset. It includes data loading, inspection, visualization, and some data preprocessing steps.

data data-science data-visualization exploratory-data-analysis exploratory-data-visualizations influncer-products instagram scikit-learn sklearn

Last synced: 08 Jun 2026

https://github.com/lamouchi-bayrem/data-matrix-scanner

A dual-interface tool that leverages AI to **detect and decode QR codes and Data Matrix codes** from images using computer vision

data datamatrix-scanner decoder flask qrcode scanner tkinter-gui webapp

Last synced: 30 Apr 2026

https://github.com/samiksha29-patil/hr-employee-data-analysis-visualization-in-python

This project focuses on analyzing an HR Employee Dataset that contains details about employees such as demographics, job status, salaries, performance reviews, satisfaction levels, and attrition reasons.

csv-files data data-visualization dataanalysis matplotlib numpy pandas python seaborn

Last synced: 30 Apr 2026

https://github.com/omarsaad21/it-salary-eda

A python EDA project implemented on IT department salaries data we made data exploration and made data visulization for some questions on dataset

data explotary-data-analysis juypter-notebook numpy pandas python visualization

Last synced: 30 Apr 2026

https://github.com/priyam-hub/covid-19-data-analysis

Explore COVID19 case numbers and deaths related to Coronavirus outbreak 2019/2020 in Pandas and in Jupyter notebook

analysis data data-visualization jupyter-notebook machine-learning python

Last synced: 08 Jun 2026

https://github.com/chompfoods/sdk-jaxrs-cxf

JAXRS-CXF SDK for the Chomp Food & Recipe Database API. Use our API to get high-quality data on recipes and 875,000+ branded/grocery foods plus raw ingredients.

apache-cxf api branded chomp cxf data database food grocery ingredients java jax-rs nutrition raw recipe-api recipes sdk

Last synced: 30 Apr 2026

https://github.com/fatihilhan42/olympics-data-analysis-with-python

I will examine the Data Analysis of the Olympics between 1896-2016, which we have done on Python.

data data-science dataanalysis datavisualization jupyter-notebook olympics python

Last synced: 30 Apr 2026

https://github.com/ddeepanshu-997/datascience-e-commerce-shopping-details-

in this project i am going to apply data preprocessing technique on the dataset in order to clean the data using libraries, etc. make some insights/analyses to findout the hotpicks of the shopping along with some data visualsation libraries to get the trends and many more aspects in order to make a small contribution to the field of data science

cleaning-data data data-science data-visualization dataframe datapreprocessing dataset libraries matplotlib-pyplot numpy pandas plots python visualization

Last synced: 30 Apr 2026

https://github.com/miguelmedinacastro/trabalho-dados-r

Trabalho final da disciplina Análise Exploratória de Dados

data data-science data-science-projects data-visualization database r rstudio

Last synced: 01 May 2026

https://github.com/dhimmel/hgnc

Extracting human gene families from HGNC

data gene-families genes hgnc hugo human

Last synced: 01 May 2026

https://github.com/dantetrb/diabetes-readmission-dbt

Predictive analytics on diabetic patient readmissions using dbt, DuckDB and Python – with explainability and clustering.

clustering data dataengineering dbt diabetes duckdb hdbscan healthcare jupyter lime readmission-prediction sql

Last synced: 01 May 2026

https://github.com/syedzaheerabbas/jamboree-education-linear-regression

Using data from Jamboree, this project explores the relationship between applicant profiles (GRE, TOEFL, GPA, etc.) and their chances of admission to Ivy League graduate programs. Linear regression, Ridge, and Lasso regression are employed to build predictive models and identify key factors.

data eda linear-regression python visualization

Last synced: 01 May 2026

https://github.com/svetlanam/kbl-to-csv-s3

Keboola extractor, that converts excel to CSV based on input mapping criteria and upload to S3 bucket

data data-cleaning data-transformation etl keboola s3-bucket

Last synced: 20 Jun 2026

https://github.com/shauryauppal/mydatatoolkit

A toolkit for data scientists to get work done faster, easier, and in a smarter way.

analytics awesome-list data data-science hacktoberfest

Last synced: 08 Jun 2026

https://github.com/skygenesisenterprise/aether-meet

Aether Meet is a lightweight, open-source client built for privacy, speed, and seamless integration within the Aether Office ecosystem

applications data docker javascript meeting nextjs notes typescript voip

Last synced: 01 May 2026

https://github.com/chompfoods/sdk-kotlin

Kotlin SDK for the Chomp Food & Recipe Database API. Use our API to get high-quality data on recipes and 875,000+ branded/grocery foods plus raw ingredients.

api branded chomp data database food foods grocery ingredients kotlin nutrition raw recipe-api recipes sdk sdk-kotlin

Last synced: 01 May 2026

https://github.com/karo23361/toy-store-kpi-power-bi

PowerBI Portfolio Project

csv data data-visualization powerbi

Last synced: 03 Feb 2026

https://github.com/fatihemres/fruits

Fruit Details app by SwiftUI. Using data, models, animation and practically onboarding usage.

animations data models onboarding swift swiftui

Last synced: 01 May 2026

https://github.com/linguini1/edueval

The BorealisAI Let's Solve It mentorship project: summarizing student feedback submissions on their professor into one cohesive paragraph for faculty consideration during performance reviews.

ai data data-analysis data-science machine-learning machinelearning nlp python pytorch sentiment-analysis

Last synced: 01 May 2026

https://github.com/svetlanam/etl-transformation

ETL data cleaning and transformation for specific use case in own Keboola project

cleaning data etl keboola python rest-api transformation

Last synced: 20 Jun 2026

https://github.com/sorairolake/japanese-era-dataset

日本の元号のデータセット / Dataset of the Japanese era

data dataset date japanese-calendar japanese-era json toml wareki yaml

Last synced: 01 May 2026

https://github.com/sebastianbrzustowicz/github-data

Java + Spring Boot. Application for sending requests to GitHub API and collecting received data.

api ci data github json junit mapping parallel repository rest-api stream

Last synced: 01 May 2026

https://github.com/muhammadadilnaeem/bcg-data-science-job-simulation-on-forage-august-2024

This repository contains all the tasks, code, and documentation completed during the BCG Data Science job simulation on The Forage platform. The simulation focused on analyzing customer churn, building predictive models, and presenting insights for a major utility company.

bcg customer-churn-prediction-with-machine-learning data data-science forage numpy pandas

Last synced: 01 May 2026

https://github.com/ngofilho/scripts-db

Repository containing several dbs scripts samples.

cache data database db mariadb mongodb mysql oracle redis sql-server

Last synced: 11 Apr 2026

https://github.com/lurenss/healthypandas

A library that takes row output from the export of the Iphone Health app and produce pandas dataframes.

data health ios pandas

Last synced: 02 May 2026

https://github.com/0xhericles/spamdetector

:email: A Simple Python Spam Detector with Scikit-Learn

data ham machine-learning python sklearn spam

Last synced: 02 May 2026

https://github.com/hafs96/prediction_consommation-de-carburant

Dans ce projet, l'objectif est de développer un modèle permettant de prédire si une voiture a une consommation de carburant élevée ou faible en fonction de ses caractéristiques techniques.

analysis data data-visualization machine-learning testing training

Last synced: 09 Jun 2026

https://github.com/mubashirsidiki/olympics-data-enigeering

Worked with Azure Data Factory, Databricks, Data Lake Storage, and Synapse Analytics to build an ETL pipeline for processing and analyzing Olympic Games data from Kaggle.

analytics azure big-data data dataengineering devops pipeline

Last synced: 02 May 2026

https://github.com/radekbednarik/covid-czech-data-api

Library to make it easy to work with REST API of official Czech Covid data.

api covid-19 data deno library typescript

Last synced: 02 May 2026

https://github.com/s1dewalker/electric-future

Visual Analysis: Future of Automotive Industry

data data-visualization machine-learning python3 regression-analysis tableau

Last synced: 02 May 2026

https://github.com/jesuscc1993/data-cleaner-extension

Clears browser data in a single click.

application-data chrome chrome-extension data

Last synced: 02 May 2026

https://github.com/badranalyst/movie-correlation-analysis-in-python

This project analyzes movie data correlations using Python libraries like Pandas, NumPy, Seaborn, and Matplotlib. It examines relationships between attributes such as ratings, genres, and box office performance to uncover trends that inform recommendations and enhance understanding of movie success factors.

data data-analysis dataset jupyter jupyter-notebook matplotlib matplotlib-pyplot numpy pandas python seaborn

Last synced: 03 May 2026

https://github.com/prakashpandey16/sql_data_warehouse_project

Building a modern data warehouse with SQL Server, including ETL Processes, data modeling, and analytics.

cleaning-data data data-engineering data-science database etl-pipeline sqlserver

Last synced: 03 May 2026

https://github.com/antoineaugusti/youtubers-tips

Collecting data about tips given to Youtubers

data economy youtube youtubers

Last synced: 03 May 2026

https://github.com/charon25/weatherdata

17 000 weather measurements collected by a weather station created for a college project.

csv data dataset datasets json measurements strasbourg weather weather-data

Last synced: 16 Jan 2026

https://github.com/charityeverett/gobackfetchit

Award Winning WebXR Data Journalism Storytelling Project

3d aframe ar css data html html-css-javascript nodejs visuzalization vr webxr xr

Last synced: 03 May 2026

https://github.com/tn3w/moviedb-json

A JSON library with 981,530 films.

data database db json movie movie-database movies

Last synced: 03 May 2026

https://github.com/yugsumeet17/churn-analysis-project--power-bi-sql-machine-learning

Dataset Explained, Project Goals & Metrics Required, SQL Server ETL & Data Cleaning, Power BI Data Load, Transformation, Blueprint & Measures, Power BI Visualization - Summary Page, Building Machine Learning Model - Random Forest, Power BI Visualization - Churn Prediction Page

data data-visualization dataanalytics excel postgresql powerbi python3

Last synced: 03 May 2026

https://github.com/yash-chauhan-dev/spark_cluster_docker

Set-up local spark cluster, hadoop (hdfs), airflow, postgresql on docker with ease, without any local installations

apache-spark data data-engineering data-engineering-pipeline deployment docker docker-compose hadoop hdfs local-development localhost pyspark python

Last synced: 04 May 2026

https://github.com/qrailibs/dataflow

✨ Data processing in Node.js made multithreaded and type-safe.

data dataprocessing multithread node

Last synced: 04 May 2026

https://github.com/fallaciousreasoning/nz-mountains

A list of mountains in NZ, scraped from https://climbnz.org.nz

alpine climbing climbnz data json json-api maps mountaineering scraping

Last synced: 04 May 2026

https://github.com/soham7998/data-analysis-projects

My Data Analysis Projects which are completed by me and gain a hands on Experience from each project. the project showcase different Concepts , Visualization and many things.

data data-analysis data-science machine-learning nlp python soham visualization

Last synced: 04 May 2026

https://github.com/maxwelllzh/gis-tutorial-

Tutorials for Columbia University GIS Club

data python

Last synced: 04 May 2026

https://github.com/soenneker/soenneker.dtos.idnamepair

A minimal Record type with an Id (string), Name (string), and maximum JSON compatibility

csharp data dotnet dto id name

Last synced: 12 Mar 2026

https://github.com/sjg/my-search-story

My Search Story is a demo application developed for the Data Portability API Workshop and the #AISprint2025 events. #BuildwithAI

data docker generative-ai google-cloud-platform google-cloud-run nodejs

Last synced: 04 May 2026

https://github.com/gabya06/twitter_models

Repository used for twitter impression models

data data-science impressions machinelearning python ridge-regression sklearn twitter

Last synced: 04 May 2026

https://github.com/srevenant/data-science-alpine

A docker container for data science, using alpine linux and python3

alpine data numpy pandas python3 science scipy xgboost

Last synced: 05 May 2026

https://github.com/kasunjayasanka/simple-backend-database-data-retrieval

Simple HTML form with inserting and retrieving data from Firebase Realtime Database

bootstrap css3 data firebase firebase-realtime-database html5 insert-data javascript retrieve-data

Last synced: 05 May 2026

https://github.com/hlan22/2025-03-18-data-validation

(no longer useful) DSCI 310 Lecture about Data validation and code testing! Made in tandem with:

data validation

Last synced: 23 Jun 2026

https://github.com/bhar2254/sobershift

Simply attendance tracking application

data form ifc jambi java qt tracking utility

Last synced: 05 May 2026

https://github.com/edjoukou/pizza-sales-report

A data analysis project using SQL with MySQL database

analysis data mysql powerbi visualization

Last synced: 05 May 2026

https://github.com/contawo/travel-journal

This is a travel journal application for storing all the places that you have visited. I was learning by doing react when creating this project. I learnt a lot with it and upgraded my reactjs skills.

data learning-by-doing props reactjs

Last synced: 05 May 2026

https://github.com/munas-git/codm-review-analysis-and-predictions

Sentiment analysis on Call of Duty Mobile Google Play Store user reviews with ML model to classify new reviews.

data flask machine-learning python sentiment-analysis

Last synced: 05 May 2026

https://github.com/rdmurphy/deno-quaff

A port of the quaff Node.js library to Deno.

archieml csv data deno json toml yaml

Last synced: 05 May 2026

https://github.com/muthupillai1204/diwali_sales_analysis

The Diwali sales analysis reviews past data to identify trends, peak buying times, popular products, and customer demographics. It assesses sales volume, revenue growth, and promotional effectiveness, helping businesses optimize marketing and inventory for future seasons.

data datacleaning eda excel jupyter-notebook matlplotlib numpy pandas python seaborn visualization

Last synced: 05 May 2026

https://github.com/welli7ngton/mysql-server-formacao-alura

repositório para guardar códigos escritos em SQL de cursos da formação em mysql server da alura

data database mysql

Last synced: 19 Apr 2026

https://github.com/mito-ds/mitosheet_helper_config

The mitosheet_helper_config package used by enterprises to configure the mitosheet package.

data data-analytics data-science data-visualization jupyter pandas python

Last synced: 05 May 2026

https://github.com/sohomm/predict-insurance-charges

A predictive model to estimate the insurance charges based on a client's attributes, such as age and health factors. It offers a practical application of ml in business, enabling more accurate pricing models and helping companies manage risk while delivering personalized pricing strategies to clients.

administration algorithm bot data decision-trees download easy finance github java machine-learning management model neural-network nlp prediction project science trading university

Last synced: 05 May 2026

https://github.com/shibbbbs/fastapi_project

A FastAPI application that reads financial data from an Excel file (capbudg.xls) and provides API endpoints to list available tables (sheet names), fetch row names from a selected table, and calculate the sum of numerical values from a specified row. The API is accessible via a web-based interactive documentation at /docs

data dataanalysis fastapi pandas python

Last synced: 06 May 2026

https://github.com/rrwen/twitter2mongodb-cli

Command line tool for extracting Twitter data to MongoDB databases

api cli cmd command data database get interface line mdb media mongo mongod mongodb post social stream tool tweet twitter

Last synced: 06 May 2026

https://github.com/iv4n-ga6l/functional-dataprocessing-pipeline

A functional data processing pipeline that accepts an input file, allows specifying both input and output formats, applies specified transformations, and produces a resulting output file.

csv data datapreprocessing excel json pandas parquet pipeline python

Last synced: 06 May 2026