An open API service indexing awesome lists of open source software.

data

Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)

https://github.com/dan149/uselesscontentcreator

Useless Content Creator (UCC) is a fake content generator, text, html and pdf files.

content customizable data easy-to-use fake-data fake-data-generator faker-generator generator lightweight open-source opensource python python3

Last synced: 03 Apr 2025

https://github.com/rid17pawar/friendscircle

Friends Circle is a console based application developed in cpp using Graph Data Structure.

cpp data graph graph-algorithms oop

Last synced: 08 Jun 2026

https://github.com/brunosalerno/osm_data

Ruby objects for dealing with OSM data, and generating XML files

data openstreetmap ruby xml

Last synced: 21 Apr 2026

https://github.com/raghavendranhp/attrition-alchemy

This project uses machine learning to predict and analyze employee attrition in Company.By developing three predictive models,it identifies key factors influencing turnover,providing actionable insights to mitigate attrition challenges.The analysis focuses on enhancing job satisfaction,work-life balance and career growth opportunities.

data datawrangling decision-trees eda gradient-boosting logistic-regression macine-learning pandas preprocessing random-forest-classifier skicit-learn svm

Last synced: 18 May 2026

https://github.com/garcane/layoffs-exploratory-data-analysis

This project uses MySQL to perform data cleaning and exploratory data analysis (EDA) on a dataset detailing company layoffs. The primary goal is to process, clean, and explore the data to gain insights into trends and patterns related to layoffs across various sectors.

data dataanalysis eda mysql sql

Last synced: 29 Oct 2025

https://github.com/webdevcave/collections-php

A PHP library for managing collections of data with support for nested keys.

array collection data helper library nested-keys package php utility utility-classes

Last synced: 28 Jun 2026

https://github.com/meltymooncakes/blockdata

Minecraft Block data

api data json minecraft minecraft-data

Last synced: 13 Apr 2025

https://github.com/pbinkley/tweets-online-classes-covid19

A twarc harvest of tweets related to online classes during the COVID-19 outbreak, starting 2020-03-02

data social

Last synced: 06 Mar 2026

https://github.com/pedrozamecki/datatube

Site Open Source para análise de dados de canais do YouTube.

data estatistica statistical-analysis statistics youtube

Last synced: 18 May 2026

https://github.com/luminovrym/crawler-tools-js

Crawler Tools Js adalah sebuah aplikasi yang digunakan untuk scrapping data pada sebuah web

crawler crawler-js data js web-scraping

Last synced: 08 Sep 2025

https://github.com/inekipelov/swift-codable-advance

A library of extensions for Swift Codable protocols, simplifying the process of encoding and decoding objects.

codable data dictionary json swift

Last synced: 25 Jan 2026

https://github.com/fordinand45/bdp_a_kelompok_3

Project Big Data Python yang diadakan oleh Digitalent Kominfo. Berikut adalah yang ikut serta pada project, yaitu : Dhian Prameswari, Fordinand Pasaribu, dan Muhdad Alfaris Bachmid

data data-analytics data-science linear-regression python3

Last synced: 12 Apr 2026

https://github.com/siongui/xemaauj9k5qn34x88m4h

No source code. Only serve JSON files of Pāli words

data go json pali

Last synced: 15 May 2026

https://github.com/michaelfromyeg/data

Data set dump.

data data-set

Last synced: 16 Jan 2026

https://github.com/devprnvk/pycryptochain

A implementation of a blockchain-based cryptocurrency in Python. This project aims to provide a fundamental understanding of blockchain technology and cryptocurrency by building a basic version from scratch. Features include blockchain creation, transaction handling, mining rewards, simulation.

blockchain crypto data decryption encryption hashing processing py python salting storage

Last synced: 09 Mar 2026

https://github.com/estherslabbert/sql

Using SQL working with student data

data python sql sqlite3

Last synced: 06 Apr 2025

https://github.com/caprogs/paris-events-analyzer

A project to analyze events in Paris using open source data provided by the city.

data data-analysis data-platform dbt docker ingestion python streamlit transformation vizualisation

Last synced: 04 May 2026

https://github.com/rellyson/data-engineering-tools

This repository holds examples and documentation about the most used tools in the data engineering ecosystem.

apache-airflow apache-spark data data-engineering jupyter-notebook python tools

Last synced: 17 Jan 2026

https://github.com/bastianolea/servel_elecciones

Resultados electorales desde Servel (2024)

chile comunas data elecciones genero

Last synced: 08 Jul 2025

https://github.com/bastianolea/mineduc_matriculas_superior

Bases de datos de estudiantes matriculados en Educación Superior

chile comunas data educacion social

Last synced: 16 Jun 2026

https://github.com/heitang/fcu-courseapi

逢甲大學:課程檢索系統 API 使用說明

api data fcu project

Last synced: 27 Jul 2025

https://github.com/frnt-end/ts-context-items-list

⚛️ React Typescript project - Fetch data and display it as a list of 10 items in 10 (pagination) pages. click on each item leads to more details page- using axios, Context and Styled Components.

api axios context context-api data fetch list pagination router router-dom styled-components typescript

Last synced: 19 May 2026

https://github.com/the-universal-linux-society/sysreport

Bash script to give you a full system report. Just by running the script it offers insight into CPU data, disk space, temperature readings, network configuration, MAC addresses, firewall status, and system logs for error analysis.

analysis bash bash-script bash-scripting data report reporting system

Last synced: 15 May 2026

https://github.com/fastpix/flutter-core-data-sdk

A comprehensive Flutter SDK for video player analytics and event tracking, designed to provide detailed insights into video playback behavior and user engagement metrics.

analyt dart data flutter

Last synced: 15 May 2026

https://github.com/dscamilo/gestion-clientes-springboot

Proyecto de gestión de clientes aplicando Java y Springboot, haciendo uso de Lombok, uso de interface, inyección de dependencias, uso de anotaciones Service, Data, RestController . Consumo de API haciendo uso de Postman.

data interface java lombok-maven restcontroller spring-boot

Last synced: 15 May 2026

https://github.com/henryssondaniel/teacup-service-visualization-mysql-java

Connect your Teacup visualization data to a MySQL database

data mysql service teacup visualization

Last synced: 19 May 2026

https://github.com/shrutakeerti/eye-gaze-detection

This repo contains everything that I have done at IIT Jodhpur Summer Internship May 15 - July 15

ai aiml data eda eeg eeg-signals eye jodhpur mlflow

Last synced: 17 Mar 2025

https://github.com/ashishsingh789/customer_purchase_prediction_using_decision-tree-_classifier

Decision Tree Classifier to predict customer purchases using demographic and behavioral data. Key steps: data preprocessing, EDA, model training, evaluation, and feature importance analysis.

data datascience desiciontree eda machine-learning-algorithms matplotlib numpy pandas-dataframe python seaborn

Last synced: 11 Apr 2026

https://github.com/mksingh431/sql-complete-notes

SQL, or Structured Query Language, is a robust and specialized programming language designed for efficient management and manipulation of relational databases. With SQL, you can seamlessly interact with databases like MySQL, PostgreSQL, Microsoft SQL Server, Oracle,.

data database sql sql-server

Last synced: 21 Apr 2026

https://github.com/skygenesisenterprise/api-service

The Official Sky Genesis Enterprise API Service Ecosystem

api-service client cryptography data dns docker javascript nextjs service stalwart typescript websocket

Last synced: 31 Dec 2025

https://github.com/davidkhala/sql

Standard SQL collection

data sql

Last synced: 06 Apr 2025

https://github.com/notthestallion/data_visualisation-examples

This repository was created to learn and practice graph showing and data visualization. The goal is to gain experience in creating compelling and informative visualizations.

data data-science data-visualization database learn learn-to-code learning learning-by-doing matplotlib matplotlib-figures matplotlib-pyplot visualization

Last synced: 12 May 2026

https://github.com/mysociety/sync-ep-to-jkan

Syncs EveryPolitician data to mySociety's data portal.

data everypolitician jkan politicians

Last synced: 27 Jul 2025

https://github.com/manifoldfinance/honte

reference data and metrics for sushiswap proposal

data ethereum sushi sushiswap

Last synced: 18 May 2026

https://github.com/nika2811/new-york-city-taxi-fare-prediction

About In this project using New York dataset we will predict the fare price of next trip. The dataset can be downloaded from https://www.kaggle.com/kentonnlp/2014-new-york-city-taxi-trips The dataset contains 8 features along with GPS coordinates of pickup and dropoff

data data-preprocessing data-visualization decision-trees feature-engineering kaggle kaggle-competition linear-regression machine-learning neural-network nyc polynomial-regression ridge-regression scikit-learn taxi taxi-data tensorflow xgboost

Last synced: 06 Apr 2025

https://github.com/hidayathamir/get-telegram-group-data

With these project you can get data in csv file from your telegram group.

bahasa-indonesia data python3 scrape telegram telethon

Last synced: 13 Sep 2025

https://github.com/opengeoshub/vdownload

A Powerful Geospatial Data Downloader

data geospatial opendata

Last synced: 19 May 2026

https://github.com/prasad-chavan1/bank_data_analysis_r

Bank data analysis in R language

data data-analysis data-science r

Last synced: 24 Feb 2025

https://github.com/furkantosun1607/cse201-data-structure

This repository contains implementations of various data structures completed as part of the CSE201 (Data Structures) course. Each week, a different data structure was implemented during lab sessions.

array arraylist bfs-search binarytree data dfs-search java linkedlist queue stack structure tree-structure

Last synced: 26 Jun 2025

https://github.com/afnanenayet/academic-pinetable

A revamp of the Dartmouth academic timetable. Designed to be intuitive and make searching for classes much easier.

dartmouth data design dev python scraping ui web

Last synced: 11 Jan 2026

https://github.com/gunn/covid-19-scripts

Scripts for processing COVID-19 data - e.g. converting from absolute to per capita numbers, adding fine-grained data from more countries

covid-19 data geography typescript

Last synced: 17 May 2026

https://github.com/tomwhite/misp-2017

MISP camp 2017 materials and code

bioinformatics data data-visualization hackathon

Last synced: 18 Apr 2026

https://github.com/tkxwaweru/python_data_manipulation

Manipulating the MASSIVE dataset using python

data dataanalysis excel python

Last synced: 11 Jan 2026

https://github.com/sweta-kaundilya/911-calls-capstone-project

For this capstone project we will be analyzing some 911 call data from Kaggle.

data data-analysis data-visualization jupyter-notebook matplotlib numpy pandas python seaborn

Last synced: 28 Apr 2026

https://github.com/jormaechea/aws-firehose-producer

Easily produce data for your AWS Firehose Data Stream

aws data firehose producer stream

Last synced: 19 May 2026

https://github.com/haykam821/circle-tracking

A tool for generating Markdown tracking of the Circle of Trust experiment.

circle data markdown reddit subreddit tracker trust

Last synced: 19 May 2026

https://github.com/pcpp94/elexon_pipeline_gb_demand

Guidelines and code snippets for extracting and processing Elexon gross demand data on Databricks. Provides half-hourly GB demand at sectoral (Domestic, Non-domestic), GSP-area granularity, settlement demand, and embedded generation. Supports non-commodity cost calculations for CfD, RO, and FiT.

data electricity elexon gb octopusenergy power powerdata pypsa uk

Last synced: 12 Jul 2025

https://github.com/germanpaul12/automating-hacker-news-and-weather-mails

Project for my Raspberry Pi to send me mails when it rains and to inform with hot tech news

beautifulsoup beautifulsoup4 data hacker-news openweather-api raspberry-pi requests

Last synced: 05 May 2026

https://github.com/phtrempe/l2a

This is a small project which aims to show an example of applied machine learning in Python 3 with the Keras library and its TensorFlow backend to train a neural network model for it to learn to add two integers.

applied data data-science deep-learning keras machine-learning neural-network tensorboard tensorflow

Last synced: 05 May 2026

https://github.com/echang1802/normandy

Normandy is a python framework for data pipelines, which main objective is standardizing your team code and provide a data treatment methodology flexible to your team needs.

analytics business-intelligence data dataengineering datascience etl pipeline

Last synced: 11 Mar 2026

https://github.com/ahmedkhaled404/data-cleaning-and-eda-layoffs-mysql

This project involves cleaning a dataset containing information about layoffs from companies around the world.

data data-analysis data-cleaning data-preprocessing datacleaning eda exploratory-data-analysis mysql sql

Last synced: 08 Jun 2026

https://github.com/himanshub16/lekhpal

Monitor and catalog Twitter feed matching your desired keywords

analytics data data-catalog data-filtering mongodb twitter twitter-streaming-api

Last synced: 14 May 2026

https://github.com/yourdataarchitect/french-realestate-data-pipeline

This repository contains a fully automated data pipeline built with Apache Airflow to extract, clean, analyze, and report real estate listings from Seloger. It pushes data to MongoDB, Elasticsearch, and Google Sheets, with real-time Slack alerts for monitoring.

airlfow data datanalysis datapipeline market-intelligence real-estate

Last synced: 31 Dec 2025

https://github.com/coderooz/hr-dashboard

The goal of this project is to create a power bi dashboard to showcase the attrition data within the company.

data data-analytics power-bi

Last synced: 07 Jan 2026

https://github.com/pyrustic/jayson

Intuitive interaction with JSON files [DEPRECATED, check the project Shared]

data json pyrustic python

Last synced: 17 May 2026

https://github.com/fliplet/fliplet-widget-data-source-query

Data Source Query Provider

data provider widget

Last synced: 11 Apr 2025

https://github.com/boettiger-lab/taxadb-cache

Cache for taxadb files

data

Last synced: 19 May 2026

https://github.com/axafrance/azureml-to-openshift-talk

Scale your dev IA: From dev AzureML to prod OpenShift in one click

ai axa azureml data learn ml openshift raise-the-bar talk

Last synced: 16 Feb 2026

https://github.com/encoreshao/data-science

Data analyze examples, using Jupyter notebook and Python!!!

data dataanalysis encore jupyter-notebook

Last synced: 29 Mar 2025

https://github.com/pulgamecanica/d3examples

https://www.oreilly.com/library/view/d3-for-the/9781492046783/

d3 d3-visualization d3js d3v4 data javascript

Last synced: 19 May 2026

https://github.com/kameronbrooks/datalys2-reporting

Datalys2 Reports allows you to create rich, interactive reports by simply defining a JSON configuration embedded in your HTML. It handles the layout, data visualization, and interactivity, so you don't need to write custom React code for every report.

data data-visualization html react

Last synced: 08 Apr 2026

https://github.com/azaz9026/loan_approval_prediction

Welcome to the Loan Approval Prediction repository! This project aims to build a predictive model that can determine whether a loan application should be approved or denied based on various features. Purpose The goal of this repository is to develop a machine learning model that can accurately predict loan approval decisio

data data-analysis data-visualization eda machine-learning numpy pandas python statistics

Last synced: 06 Apr 2026

https://github.com/shahules786/titanic-analysis

different analysis of titanic accident (data from kaggle)

analyze data titanic-kaggle

Last synced: 26 Jun 2025

https://github.com/jigyasag18/financial-risk-analysis-project

The Credit Card Financial Risk Analysis Dashboard is a real-time Power BI tool designed to provide insights into credit card transactions and customer demographics. It features interactive visualizations, efficient data processing, and actionable insights to support decision-making. Utilizing data from SQL database, the dashboard tracks key metrics

data dataanalysis database datacleaning datapreprocessing dataprocessing datavisualization financial-analysis financialriskanalysis mysql powerbi sql statistical-analysis

Last synced: 06 Mar 2026

https://github.com/henryssondaniel/teacup-java-report-file

Report Teacup data to a file

data file logs reports teacup

Last synced: 22 Jul 2025

https://github.com/amarlearning/exploring-the-evolution-of-linux

Data Analysis about the development of the Linux operating system by exploring its Git repository history.

cleaning-data data data-analysis data-wrangling datacamp first-commit git-history linux

Last synced: 12 May 2026

https://github.com/lisakey/lisakey

I am passionate about Python 🐍 and SQL 🗃️ for data analysis 📊, and I actively develop projects in these languages.

analysis analyst data dataanalysis dataanalyst java python sql

Last synced: 02 May 2026

https://github.com/eyluldursun/data-science-project

This project involves a data science analysis conducted on the Obesity Data Set. The study explores factors influencing obesity, includes data visualization, and develops predictive models. The goal of the project is to gain insights to help prevent obesity.

data data-science obesity r rmarkdown

Last synced: 26 Jun 2025

https://github.com/nxank4/an-augment

A Python library for advanced and novel data augmentation, combining traditional techniques like cropping and blurring with state-of-the-art generative AI methods such as style transfer, image inpainting, and latent space interpolation. It boosts data diversity for robust machine learning applications.

computer-vision data data-augmentation data-augmentation-strategies data-augmentation-techniques generative-ai image image-processing synthetic-data

Last synced: 10 Mar 2026

https://github.com/akashlogics/street-data-tracking

Detect, Track and Count number of persons walking across the path(s) making use of YOLO. This Python project tracks people moving across predefined street zones

analysis data excel newdataset object-detection opencv python python3 yolo

Last synced: 19 May 2026

https://github.com/buildinamsterdam/contentful-graphql

Contentful GraphQL connection

contentful data graphql

Last synced: 05 Jan 2026

https://github.com/weskal/vexus_pipeline

Automated pipeline for generating, ingesting, and validating realistic data, designed to simulate real-world workflows with scheduling, data quality checks, and version control.

airflow data pipeline python sqlserver workflow

Last synced: 20 Jan 2026

https://github.com/ezeparziale/analisis-uso-bicicletas-caba

:biking_man: Análisis de como afecto la pandemia el uso de las bicicletas en CABA.

data data-science data-visualization

Last synced: 14 Mar 2025

https://github.com/ezeparziale/analisis-data-delitos

:gun: Analsis de delitos de CABA

data data-science

Last synced: 14 Mar 2025

https://github.com/official-imvoiid/multifetch

A high-performance web scraper for bulk image and GIF extraction from reliable sources — built for AI/ML data pipelines and large-scale media collection

aiml data dataset gifscraper imagescraper python pythontool tools webscraper windows

Last synced: 19 May 2026