An open API service indexing awesome lists of open source software.

data

Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)

https://github.com/richelbilderbeek/heyahmama

Data about the Flemish/Dutch band K3

band data k3 package r r-lang r-language

Last synced: 22 May 2026

https://github.com/uttori/uttori-data-tools

Tools for working with binary data.

binary data stream uttori

Last synced: 17 Feb 2026

https://github.com/the-universal-linux-society/sysreport

Bash script to give you a full system report. Just by running the script it offers insight into CPU data, disk space, temperature readings, network configuration, MAC addresses, firewall status, and system logs for error analysis.

analysis bash bash-script bash-scripting data report reporting system

Last synced: 15 May 2026

https://github.com/fastbolt/entity-importer

Library for importing entities from files (currently CSV and Excel) or an API into Doctrine.

data doctrine2 excel excel-import

Last synced: 17 Feb 2026

https://github.com/styd/sd_struct

Searchable Deep Struct

activesupport data gem openstruct rails ruby structure

Last synced: 18 May 2026

https://github.com/sumansuhag/wasserstoff-aiinterntask

Welcome to the AI Pipeline for Image Segmentation and Object Analysis project – a state-of-the-art solution designed to process, segment, identify, and analyze objects within images. This AI-powered pipeline is engineered to deliver precise insights by extracting, mapping, and summarizing data from each segmented object.

artificial-intelligence cdn data data-science modeling pipline

Last synced: 28 Mar 2025

https://github.com/metapsy-project/data-depression-psiloctr

Database of psilocybin-assisted therapies for adults with depression versus control conditions.

data

Last synced: 01 Mar 2026

https://github.com/sumansuhag/prediction_model

This repository features a collection of Jupyter notebooks designed to showcase the practical applications of machine learning, data preprocessing, feature engineering, and recommendation systems. These notebooks enable users to explore, analyze, and predict business events.

algotithms artificial-intelligence data logistic-regression machine-learning-algorithms science sckiit-learn

Last synced: 28 Mar 2025

https://github.com/frnt-end/ts-context-items-list

⚛️ React TypeScript project: fetches data and displays it as a list of 10 items in 10 (paginated) pages. Clicking an item leads to a details page. Built with axios, Context, and Styled Components.

api axios context context-api data fetch list pagination router router-dom styled-components typescript

Last synced: 19 May 2026

https://github.com/kirkalyn13/xyz-books-pipeline

XYZ Books Pipeline to check and update incoming ISBNs from newly added books from the CRUD UI, and record new data to a CSV file.

api csv data go http rabbitmq

Last synced: 05 Mar 2025

https://github.com/fastpix/flutter-core-data-sdk

A comprehensive Flutter SDK for video player analytics and event tracking, designed to provide detailed insights into video playback behavior and user engagement metrics.

analyt dart data flutter

Last synced: 15 May 2026

https://github.com/charlieroth/exoexplo

Exploring NASA Exoplanet Archive Data

data exoplanets julia nasa

Last synced: 03 Apr 2025

https://github.com/dscamilo/gestion-clientes-springboot

Customer management project built with Java and Spring Boot, using Lombok, interfaces, dependency injection, and the Service, Data, and RestController annotations. The API is consumed using Postman.

data interface java lombok-maven restcontroller spring-boot

Last synced: 15 May 2026

https://github.com/ebrizzzz/data-visualization-project-using-tableau

A data visualization project for the Visual Data Analysis course (Spring Term 2025) at the University of Skövde. This project explores the factors influencing national happiness scores across different global regions from 2005 to 2022.

analytics data data-analysis data-science data-visualization python regression tableau

Last synced: 16 Jun 2025

https://github.com/1sumer/mass-mail-automation

Mass Emailer is a Python-based application designed to send bulk emails efficiently using an SMTP server. Leveraging the power of the Tkinter library for the graphical user interface (GUI), this tool provides a user-friendly platform for managing and dispatching large volumes of emails with ease.

data oops-in-python python smtp-server tkinter

Last synced: 20 Aug 2025

https://github.com/eryks1999/data-collection-project_python

This project allowed me to practice working with classes, populating JSON files, and extracting data.

data git json python

Last synced: 16 Apr 2026

https://github.com/aliaksandr-master/unipipeline

A simple way to build declarative, distributed data pipelines with Python.

celery data pipeline python

Last synced: 11 Jul 2025

https://github.com/shubhamsoni98/analysis-with-sql

This project focuses on creating and managing a database for a music record company to perform various analyses on bands, albums, and songs. Using SQL, the goal is to create a structured relational database with relevant tables, insert necessary data, and perform queries that provide insights into the relationships between bands, albums, and songs.

analys analysis data data-science database dbms mysql mysqlworkbench project query schema sql

Last synced: 03 Jan 2026

https://github.com/shrutakeerti/eye-gaze-detection

This repo contains everything that I did during the IIT Jodhpur Summer Internship (May 15 - July 15).

ai aiml data eda eeg eeg-signals eye jodhpur mlflow

Last synced: 17 Mar 2025

https://github.com/henryssondaniel/teacup-service-visualization-mysql-java

Connect your Teacup visualization data to a MySQL database

data mysql service teacup visualization

Last synced: 19 May 2026

https://github.com/ezmiller/boe-election-data

CSV files containing parsed NYC Bureau of Elections data for 2009 and 2013

data elections nyc

Last synced: 18 Oct 2025

https://github.com/mksingh431/sql-complete-notes

SQL, or Structured Query Language, is a robust and specialized programming language designed for efficient management and manipulation of relational databases. With SQL, you can seamlessly interact with databases like MySQL, PostgreSQL, Microsoft SQL Server, Oracle,.

data database sql sql-server

Last synced: 21 Apr 2026

https://github.com/kenanbek/youtube-data

YouTube stats data over YouTube Data API v3 using Python.

data python youtube youtube-api

Last synced: 13 May 2026

https://github.com/e22m4u/ts-projection

A module for working with data projection in TypeScript

data projection typescript

Last synced: 12 Apr 2025

https://github.com/ashishsingh789/customer_purchase_prediction_using_decision-tree-_classifier

Decision Tree Classifier to predict customer purchases using demographic and behavioral data. Key steps: data preprocessing, EDA, model training, evaluation, and feature importance analysis.

data datascience desiciontree eda machine-learning-algorithms matplotlib numpy pandas-dataframe python seaborn

Last synced: 11 Apr 2026

https://github.com/juniorreisx/movelo-logstica

Movelo is a lightweight logistics simulator built with TypeScript that provides mock order and delivery data for developing and testing UIs, dashboards, and backend features without external APIs.

data hooks lucide-react react tailwindcss typescript

Last synced: 12 Apr 2025

https://github.com/abhishekn1947/samgov-scraper

Automated Python scraper for sam.gov contracts

analytics automation aws data pandas postgresql rds selenium webscraper

Last synced: 09 Apr 2026

https://github.com/nanvenomous/sizable

A generic interface to the MongoDB Go driver

data driver generic generics go golang mongodb

Last synced: 15 May 2026

https://github.com/davidkhala/sql

Standard SQL collection

data sql

Last synced: 06 Apr 2025

https://github.com/alex0x4b/akutils

High-level Python library for recurring data manipulation (Pandas, Python data structure, API, file manipulation, etc.).

data dataframe pandas python

Last synced: 08 Mar 2026

https://github.com/reubano/pyconza-tutorial

Jupyter notebooks and data for "Data Mining and Processing for fun and profit" PyConZA16 tutorial

data functional-programming jupyter-notebook meza pycon python tutorial

Last synced: 17 May 2026

https://github.com/jigyasag18/ibm-power-bi-dashboard-project

IBM Power BI Dashboard Project is a data-driven analysis of employees using IBM's comprehensive dataset, providing insights into key factors contributing to employee turnover and enabling organizations to strategize effectively towards improved employee retention and satisfaction.

data data-visualization dataanalysis dataanalytics dataset datavisualisation datavisualization-project powerbi powerbi-dashboards powerbi-report powerbi-visuals powerbidashboard

Last synced: 07 Mar 2026

https://github.com/theleopard65/isa-imitation

This repository contains a simple C++ implementation of a von Neumann architecture simulator. The program mimics the behavior of a basic computer architecture that uses a single memory space for both instructions and data. Users can load programs, execute them, and view the current state of the memory and registers.

32-bit 64-bit ac architecture c-plus-plus data executable explained implementation ir isa mar mdr memory pc registers simulation von-neumann x64 x86

Last synced: 18 Mar 2025

https://github.com/push-protocol/push-google-bigquery

The Power of Web3 Big Data: A Guide to Using Google BigQuery and Push Protocol for Data Communication and Analysis

bigquery data push push-notifications web3

Last synced: 26 Mar 2025

https://github.com/notthestallion/data_visualisation-examples

This repository was created to learn and practice plotting graphs and data visualization. The goal is to gain experience in creating compelling and informative visualizations.

data data-science data-visualization database learn learn-to-code learning learning-by-doing matplotlib matplotlib-figures matplotlib-pyplot visualization

Last synced: 12 May 2026

https://github.com/rajesh9943/web-scraping-analysis-of-top-us-company-revenue-growth-in-2023

Explore the landscape of US business growth in 2023 with our dynamic project, 'Web Scraping for US 2023 Revenue Growth.' Utilizing advanced web scraping techniques, we unveil insights into the top companies driving economic expansion.

cleaning-data data data-analysis data-visualization manipulation numpy pandas pre-fill

Last synced: 16 Aug 2025

https://github.com/skygenesisenterprise/aether-calendar

Aether Calendar is a lightweight, open-source client built for privacy, speed, and seamless integration within the Aether Office ecosystem

applications calendar capacitorjs data javascript linux macos nextjs typescript windows

Last synced: 12 Apr 2026

https://github.com/RedInfinityPro/ScientificSharp

Rating: (5/10) The code is a Windows Forms application for a basic scientific calculator, allowing users to perform mathematical operations like addition, subtraction, multiplication, division, trigonometric functions, and logarithms.

componentmodel cryptography data drawing forms generic linq system tasks text

Last synced: 30 Sep 2025

https://github.com/afnanenayet/academic-pinetable

A revamp of the Dartmouth academic timetable. Designed to be intuitive and make searching for classes much easier.

dartmouth data design dev python scraping ui web

Last synced: 11 Jan 2026

https://github.com/skygenesisenterprise/api-service

The Official Sky Genesis Enterprise API Service Ecosystem

api-service client cryptography data dns docker javascript nextjs service stalwart typescript websocket

Last synced: 31 Dec 2025

https://github.com/shrutakeerti/crime-filex

Crime FileX: the mission to trace crime and make this a crime-free world

ai aiml analysis crime-data css data html ics js ml

Last synced: 19 Apr 2026

https://github.com/sibeux/redesigned-broccoli

Repository for storing music file data

data data-center nasrulwahabi sibeux

Last synced: 24 Jan 2026

https://github.com/nika2811/new-york-city-taxi-fare-prediction

In this project, we use a New York City dataset to predict the fare of the next trip. The dataset can be downloaded from https://www.kaggle.com/kentonnlp/2014-new-york-city-taxi-trips. It contains 8 features along with the GPS coordinates of pickup and dropoff.

data data-preprocessing data-visualization decision-trees feature-engineering kaggle kaggle-competition linear-regression machine-learning neural-network nyc polynomial-regression ridge-regression scikit-learn taxi taxi-data tensorflow xgboost

Last synced: 06 Apr 2025

https://github.com/hidayathamir/get-telegram-group-data

With this project you can export data from your Telegram group to a CSV file.

bahasa-indonesia data python3 scrape telegram telethon

Last synced: 13 Sep 2025

https://github.com/ahadly/sql-data-analytics-project

This repository contains a collection of SQL scripts demonstrating various analytical techniques, such as change-over-time, cumulative, performance, data segmentation, and part-to-whole analysis.

analytics business-analytics business-intelligence data data-analysis data-analyst data-analytics data-engineering data-science data-scientist database datascience query reporting sql sql-queries sql-query sql-server window-functions window-functions-in-sql

Last synced: 18 May 2026

https://github.com/yugoff/ml-kaggle-regression-with-a-mohs-hardness-dataset

Your Goal: For this Episode of the Series, your task is to use regression to predict the Mohs hardness of a mineral, given its properties

data gradient-boosting kaggle kaggle-competition regression-models

Last synced: 18 May 2026

https://github.com/thetacom/byteclasses

A Python package to manage and interact with binary data in a simple and structured manner.

binary-data bytes data dataclasses package python python3

Last synced: 11 Jul 2025

https://github.com/opengeoshub/vdownload

A Powerful Geospatial Data Downloader

data geospatial opendata

Last synced: 19 May 2026

https://github.com/mkshah605/personal-brand-development

A data-driven approach to a personal brand development project.

branding data data-science growth music personal

Last synced: 12 Sep 2025

https://github.com/prasad-chavan1/bank_data_analysis_r

Bank data analysis in R language

data data-analysis data-science r

Last synced: 24 Feb 2025

https://github.com/chompfoods/sdk-scala

Scala SDK for the Chomp Food & Recipe Database API. Use our API to get high-quality data on recipes and 875,000+ branded/grocery foods plus raw ingredients.

api branded chomp data database food grocery ingredients nutrition raw recipe-api recipes scala sdk

Last synced: 17 May 2026

https://github.com/furkantosun1607/cse201-data-structure

This repository contains implementations of various data structures completed as part of the CSE201 (Data Structures) course. Each week, a different data structure was implemented during lab sessions.

array arraylist bfs-search binarytree data dfs-search java linkedlist queue stack structure tree-structure

Last synced: 26 Jun 2025

https://github.com/kammarah/studentdata

I created & deployed a Streamlit app to store, manage & analyze student data. 📊🎓

connection data data-analysis data-visualization deploy deployments libraries python streamlit streamlit-webapp webapp

Last synced: 18 May 2026

https://github.com/tkxwaweru/python_data_manipulation

Manipulating the MASSIVE dataset using Python

data dataanalysis excel python

Last synced: 11 Jan 2026

https://github.com/samridhisainii/airbnb-data-analysis

Data analysis of an Airbnb dataset

analysis data data-visualization eda models

Last synced: 16 May 2026

https://github.com/tomwhite/misp-2017

MISP camp 2017 materials and code

bioinformatics data data-visualization hackathon

Last synced: 18 Apr 2026

https://github.com/pcpp94/elexon_pipeline_gb_demand

Guidelines and code snippets for extracting and processing Elexon gross demand data on Databricks. Provides half-hourly GB demand at sectoral (Domestic, Non-domestic), GSP-area granularity, settlement demand, and embedded generation. Supports non-commodity cost calculations for CfD, RO, and FiT.

data electricity elexon gb octopusenergy power powerdata pypsa uk

Last synced: 12 Jul 2025

https://github.com/annaanastasy/mushroom-binary-classification-eda-ml

Explored and modeled a competition dataset of mushroom species, focusing on data cleaning, exploratory data analysis, and building machine learning models for accurate classification of edible and poisonous mushrooms.

binary-classification data data-cleaning-and-preprocessing data-science exploratory-data-analysis machine-learning-algorithms xgboost-classifier

Last synced: 29 Mar 2025

https://github.com/sweta-kaundilya/911-calls-capstone-project

For this capstone project we will be analyzing some 911 call data from Kaggle.

data data-analysis data-visualization jupyter-notebook matplotlib numpy pandas python seaborn

Last synced: 28 Apr 2026

https://github.com/takamoso/umami

Cross browser compatibility data.

browser compat compatibility data dataset json

Last synced: 27 Mar 2025

https://github.com/namescode/hub_harvester

A Python script to gather data on a user's or organisation's Git repos

data github nix nix-flake python python3 sqlite

Last synced: 08 Apr 2026

https://github.com/jormaechea/aws-firehose-producer

Easily produce data for your AWS Firehose Data Stream

aws data firehose producer stream

Last synced: 19 May 2026

https://github.com/haykam821/circle-tracking

A tool for generating Markdown tracking of the Circle of Trust experiment.

circle data markdown reddit subreddit tracker trust

Last synced: 19 May 2026

https://github.com/cunfuu/network-bubbles

Makes it easier to manage organizations and keep notes about them, in order to organize events and easily look up their needs

data data-visualization organizations organizations-volunteer

Last synced: 31 Jul 2025

https://github.com/christopherandrewtopalian/catopalian_javascript_data_navigator

A JavaScript application that allows for easy sorting of data. Easily navigate through any amount of data using button filters.

data javascript sorting

Last synced: 13 Apr 2025

https://github.com/phtrempe/l2a

This is a small project that shows an example of applied machine learning in Python 3, using the Keras library with its TensorFlow backend to train a neural network model to add two integers.

applied data data-science deep-learning keras machine-learning neural-network tensorboard tensorflow

Last synced: 05 May 2026

https://github.com/ahabdel/amazon-web-scraper

Amazon web scraper that tracks pricing adjustments and provides updates on a day-to-day basis

data web-scraping

Last synced: 29 Oct 2025

https://github.com/noorkhokhar99/text-to-speech-demo

Text to Speech Demo

data python roboflow

Last synced: 27 Mar 2025

https://github.com/germanpaul12/automating-hacker-news-and-weather-mails

Project for my Raspberry Pi that emails me when it rains and keeps me informed of hot tech news

beautifulsoup beautifulsoup4 data hacker-news openweather-api raspberry-pi requests

Last synced: 05 May 2026

https://github.com/patrikcze/meshtatic_data

Meshtastic Data Transfer - Trying out something silly, like transferring files over a LoRa network.

data meshtastic meshtastic-python

Last synced: 03 Feb 2026

https://github.com/echang1802/normandy

Normandy is a Python framework for data pipelines whose main objective is to standardize your team's code and provide a data treatment methodology flexible enough for your team's needs.

analytics business-intelligence data dataengineering datascience etl pipeline

Last synced: 11 Mar 2026

https://github.com/krescruz/pegaso-data

Utilities for analyzing data from the invoice certification provider Pegaso

cfdi-mexico data pac sat-gob

Last synced: 29 Apr 2026

https://github.com/UznetDev/Smoking-Prediction

This project focuses on analyzing the "Smoking" dataset and building a predictive model for smoking status based on various health metrics. The goal is to identify factors influencing smoking behavior and develop a reliable model for prediction.

ai classification data data-science kaggle-competition machine-learning ml roc-auc sklearn smoking

Last synced: 28 Mar 2025

https://github.com/8hrsk/ranger

Package for generating fake user data to work with.

data factory faker generator npm

Last synced: 30 Apr 2026

https://github.com/ahmad-ali-rafique/random-forest-classifier-modeling

Detailed exploration of random forest classifiers, including data cleaning, model building, and performance evaluation on various datasets.

classification classification-models data dataanalytics datamodel dataset model-checking models random-forest random-forest-classifier

Last synced: 01 Jun 2026

https://github.com/ahmedkhaled404/data-cleaning-and-eda-layoffs-mysql

This project involves cleaning a dataset containing information about layoffs from companies around the world.

data data-analysis data-cleaning data-preprocessing datacleaning eda exploratory-data-analysis mysql sql

Last synced: 08 Jun 2026

https://github.com/ahmad-ali-rafique/random-forest-regressor-modeling

Detailed exploration of random forest regressors, including data cleaning, model building, and performance evaluation on various datasets.

data dataanalytics datacleaning evaluation-metrics modeling random-forest random-forest-regression regression regression-analysis

Last synced: 05 Mar 2025

https://github.com/himanshub16/lekhpal

Monitor and catalog Twitter feed matching your desired keywords

analytics data data-catalog data-filtering mongodb twitter twitter-streaming-api

Last synced: 14 May 2026

https://github.com/encelo/nctracer-data

Data files for the ncTracer project

data icons ncine

Last synced: 15 Jan 2026

https://github.com/vladandreitoma/igisol_jyvaskyla_xept_experimental_campaign

A simulation toolkit together with data analysis for the Xe&Pt Exotic Nuclei Generation experiment @ Jyvaskyla, December 2022. Helping Dr. Paul Constantin with simulation development. Simulation is done using Geant4 provided by CERN. Data analysis is done using ROOT by CERN. Both are C++ based. Job distributors to run the simulation are coded in Perl.

analysis architecture-design cplusplus data oop oop-principles pearl simulations

Last synced: 05 Sep 2025

https://github.com/cemoktra/data_series

time series handling

data lazy-evaluation time-series

Last synced: 29 Oct 2025

https://github.com/webobite/fact-chatbot

Fact chatbot is a project that reads a text file containing all the facts ahead of time and answers the user with useful information based on the facts provided in that file.

chatbot chatgpt chatgpt3 data data-visualization embedding-vectors generativeai nlp

Last synced: 04 May 2026

https://github.com/axafrance/azureml-to-openshift-talk

Scale your AI development: from dev on AzureML to prod on OpenShift in one click

ai axa azureml data learn ml openshift raise-the-bar talk

Last synced: 16 Feb 2026

https://github.com/bala-1409/sales-forecasting-datascience-project

Develop a data science project using historical sales data to build a regression model that accurately predicts future sales. Preprocess the dataset, conduct exploratory analysis, select relevant features, and employ regression algorithms for model development. Evaluate model performance, optimize hyperparameters, and provide actionable insights.

data data-analysis data-science data-visualization datacleaning exploratory-data-analysis machine-learning-algorithms modelfitting prediction predictive-analytics predictive-modeling python3 regression-models salesforecast supervised-learning

Last synced: 26 Apr 2026

https://github.com/itsmeyogesh22/solved-8-weeks-sql-challenge-correct-solutions

Part of the Serious SQL virtual apprenticeship program, this repository contains solutions for all eight case studies crafted by Danny Ma. For more information, please visit: https://8weeksqlchallenge.com/

8weeksqlchallenge data dataanalytics datawithdanny postgresql sql sqlserver-2022 t-sql

Last synced: 07 Apr 2025