An open API service indexing awesome lists of open source software.

data

Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)

https://github.com/bhavanachitragar/layoff_analysis

This Streamlit app is designed for Layoff Analysis. It allows users to explore and analyze layoff data from different perspectives, including overall analytics, country-specific insights, and individual company details.

data dataanalysis streamlit streamlit-webapp

Last synced: 18 Apr 2026

https://github.com/codbex/codbex-hestia-data-sample

Sample data for codbex-hestia

data module sample

Last synced: 05 Apr 2026

https://github.com/josericodata/josericodata.github.io

Welcome to my portfolio website. This site showcases my skills, experience, education, and projects as a Data Analyst.

awesine-latex big-data career-development data data-analyst data-science database dublin ireland job-seeking jose-maria-rico-leal jose-rico jose-rico-data latex latex-cv portfolio portfolio-website python sql

Last synced: 18 Apr 2026

https://github.com/prakashjha1/loan-eligibility-prediction

This repository contains the codebase and resources for a machine learning-based project aimed at predicting loan eligibility for individuals. The project utilizes various algorithms and data preprocessing techniques to build predictive models that assess the likelihood of an applicant being eligible for a loan based on historical data.

data data-visualization exploratory-data-analysis loan-prediction-analysis machine-learning-algorithms naive-bayes-classification parameter-tuning python random-forest

Last synced: 19 Apr 2026

https://github.com/hormcodes/data

Terraform configuration for public data storage hosted on data.horm.codes

aws cloudfront content-management data github-actions s3-bucket terraform

Last synced: 20 Apr 2026

https://github.com/nikoheikkila/maps

A TypeScript collection of specialized map implementations

data javascript maps typescript

Last synced: 20 Apr 2026

https://github.com/zhukovanan/stepik_

Completed tasks from various data science and computer science courses on Stepik

data statistical-learning statistics stepik-course

Last synced: 21 Apr 2026

https://github.com/vishwas-chakilam/movies-review-scraping-analysis

A project for collecting, cleaning, and analyzing movie data. Includes scripts for web scraping (deprecated) and using the OMDb API to fetch movie details. Analyze and visualize data with Python and Power BI to uncover insights and trends in movie ratings and genres.

data dataanalysis datacleaning datavisualization matplotlib-python numpy-library pandas python webscraping

Last synced: 21 Apr 2026

https://github.com/schijioke-uche/data-analysis-with-python-an-spss-model

With this Python notebook, you can use an SPSS Model notebook to build machine learning pipelines for rapid iteration during model building in data analysis. Whether you're trying to find the right algorithm or experimenting with different ways of preparing your data, you can create reproducible research, with a hypothesis definition, that is easily understood by any member of your team.

anova cp4a cp4d cp4i cp4s data ibm ibm-cloud jeffrey-chijioke-uche jeffrey-solomon-chijioke-uche openshift python python3 redhat t-test

Last synced: 22 Apr 2026

https://github.com/rbcavi/factorio-mod-data

The modpack data for factorio-viewer

data factorio factorio-data factorio-mod-data

Last synced: 23 Apr 2026

https://github.com/howwohmm/fetchgram

era-adjusted Instagram content intelligence — scrape any public profile, OCR every image, measure what actually works. free, local, no API keys.

analytics cli content-strategy data instagram ocr python scraper

Last synced: 06 Jun 2026

https://github.com/stdlib-js/ndarray-vector-bool

Create a boolean vector (i.e., a one-dimensional ndarray).

bool boolean constructor ctor data javascript ndarray node node-js nodejs stdlib structure types vec vector

Last synced: 24 Apr 2026

https://github.com/marielachirinosr/cyclistic-data-analytics-project

This project explores user behavior within a fictional bike-sharing system, modeled after Cyclistic, operating in Chicago.

data data-visualization pandas powerbi-report powerbi-visuals python

Last synced: 24 Apr 2026

https://github.com/thinkphp/my-react-tictactoeai-app

React Tic Tac Toe app component based on artificial intelligence

ai algoirthms data datastructures games javascript react

Last synced: 25 Apr 2026

https://github.com/marielachirinosr/hotel-data-analysis

Pandas & Matplotlib Learning Analysis. Repository featuring data analysis projects using Pandas and Matplotlib libraries

data data-analysis matplotlib pandas python

Last synced: 25 Apr 2026

https://github.com/tsbarr/citi-bikes-challenge

Citibikes NYC Data Analysis: Uncover insights from over a decade of ride data. Jupyter notebook for data aggregation/cleaning & Tableau dashboards for interactive visualization.

data data-visualization pandas-python python tableau

Last synced: 27 Apr 2026

https://github.com/ioanzicu/batch_loading_one-to-many_data_model

UNESCO batch loading of one-to-many data using Django

batch data django sqlite3

Last synced: 27 Apr 2026

https://github.com/schenkd/tweetminer

Data Miner for Twitter Streaming API

data dataminer datamining java twitter twitter-api twitter4j

Last synced: 07 Jun 2026

https://github.com/kitpymes/netcore-serialize-data

The goal is to protect secret data by encrypting and serializing .json files and converting them into .dat files.

csharp data decrypt encrypt json net netcore2 serialize

Last synced: 29 Apr 2026

https://github.com/kfrural/customer-churn-prediction

Customer churn prediction using machine learning. The project follows CRISP-DM and KDD methodologies, including data preprocessing, feature engineering, modeling, and evaluation. It also features an interactive dashboard for visualizing results.

crisp-dm data jupyter kdd python

Last synced: 29 Apr 2026

https://github.com/mumtaz4118/scraping-medium-and-data-analytics

The file DataExtraction.py extracts information from the JSON files scraped by the scraper medium_scrapper_post.py. To extract information from JSON files scraped by medium_scrapper_tag_archive.py (which scrapes the tags archive), use Data_Extraction_Archive_Tags.py.

data data-analysis data-analytics data-extraction data-preprocessing data-science data-scraping deep-learning machine-learning python

Last synced: 29 Apr 2026

https://github.com/martgro/datagrabber

Tool for extracting data points from plots

data extract image plots python3

Last synced: 29 Apr 2026

https://github.com/koltyakov/pgcopy

🐘 PostgreSQL data migration tool

cli data database golang migration postgresql sync

Last synced: 29 Apr 2026

https://github.com/ozgrozer/electron-store-data

A Node.js module to store Electron data on the computer

data electron store

Last synced: 29 Apr 2026

https://github.com/smokingplaya/gm_datastorages

💖 Data Storages like in JavaScript.

data dev gmod javascript lua

Last synced: 29 Apr 2026

https://github.com/devcsrj/docparsr-jvm

JVM client for https://github.com/axa-group/Parsr

data document extraction nlp ocr pdf

Last synced: 08 Jun 2026

https://github.com/wireservice/workbench-lookup

A port of `agate-lookup` to Workbench

data journalism lookup workbench

Last synced: 08 Jun 2026

https://github.com/lamouchi-bayrem/data-matrix-scanner

A dual-interface tool that leverages AI to detect and decode QR codes and Data Matrix codes from images using computer vision

data datamatrix-scanner decoder flask qrcode scanner tkinter-gui webapp

Last synced: 30 Apr 2026

https://github.com/fatihilhan42/olympics-data-analysis-with-python

An analysis of Olympic Games data from 1896 to 2016, done in Python.

data data-science dataanalysis datavisualization jupyter-notebook olympics python

Last synced: 30 Apr 2026

https://github.com/miguelmedinacastro/trabalho-dados-r

Final project for the Exploratory Data Analysis course

data data-science data-science-projects data-visualization database r rstudio

Last synced: 01 May 2026

https://github.com/benmizrahi/reactivejs

microservices event bus for async/sync communications

data microservices nodejs

Last synced: 01 May 2026

https://github.com/anandvai/ai_rag_chatbot_multi_pdf_support

RAG (Retrieval-Augmented Generation) Chatbot built with Streamlit and LangChain, powered by Groq's blazing-fast LLaMA3-8B. It allows you to upload multiple PDFs, ask questions, and get precise, context-aware answers in a conversational format.

ai data data-science data-visualization data-visualizations dataengineering fastapi langchain langgraph python sql streamlit

Last synced: 01 May 2026

https://github.com/rec/kson

🔑 Json with the rough edges removed 🔑

data json serialization

Last synced: 01 May 2026

https://github.com/lurenss/healthypandas

A library that takes the raw output of the iPhone Health app export and produces pandas DataFrames.

data health ios pandas

Last synced: 02 May 2026

https://github.com/tn3w/moviedb-json

A JSON library with 981,530 films.

data database db json movie movie-database movies

Last synced: 03 May 2026

https://github.com/parzibyte/jsonp-php

Example of JSONP with PHP

data example json jsonp php request

Last synced: 04 May 2026

https://github.com/a-poor/datatransform.jl

A package for defining (and performing) tabular-data transformations with JSON.

data data-science data-transformation etl feature-engineering json julia julia-package tabular-data

Last synced: 05 May 2026

https://github.com/edjoukou/pizza-sales-report

A data analysis project using SQL with MySQL database

analysis data mysql powerbi visualization

Last synced: 05 May 2026

https://github.com/munas-git/codm-review-analysis-and-predictions

Sentiment analysis on Call of Duty Mobile Google Play Store user reviews with ML model to classify new reviews.

data flask machine-learning python sentiment-analysis

Last synced: 05 May 2026

https://github.com/rdmurphy/deno-quaff

A port of the quaff Node.js library to Deno.

archieml csv data deno json toml yaml

Last synced: 05 May 2026

https://github.com/chanchalsoorma/web-scraping

This repo aims to provide a straightforward, easy-to-use scraping code written in Python.

beautifulsoup beautifulsoup4 data python request selenium webscraping

Last synced: 05 May 2026

https://github.com/mito-ds/mitosheet_helper_config

The mitosheet_helper_config package, used by enterprises to configure the mitosheet package.

data data-analytics data-science data-visualization jupyter pandas python

Last synced: 05 May 2026

https://github.com/shibbbbs/fastapi_project

A FastAPI application that reads financial data from an Excel file (capbudg.xls) and provides API endpoints to list available tables (sheet names), fetch row names from a selected table, and calculate the sum of numerical values from a specified row. The API is accessible via a web-based interactive documentation at /docs

data dataanalysis fastapi pandas python

Last synced: 06 May 2026

https://github.com/lexz-08/sharpdata

Easily manage DataGridViews or create one with the struct 'DataGridManager' provided.

csharp data datagridview ui user-interface windows windows-forms winforms

Last synced: 06 May 2026

https://github.com/poode/firebase-modeling

Get firebase/firestore entity model to migrate to mongo or any db later

data database firebase firestore modeling schema

Last synced: 06 May 2026

https://github.com/rrwen/twitter2mongodb-cli

Command line tool for extracting Twitter data to MongoDB databases

api cli cmd command data database get interface line mdb media mongo mongod mongodb post social stream tool tweet twitter

Last synced: 06 May 2026

https://github.com/shantanujpk/bigdatacloud

Exploration of PySpark for data processing and interview prep — demonstrates handling corrupted records, applying transformations/actions, and building efficient data pipelines with practical examples.

big-data data jupyter-notebook pipeline pyspark python spark sparksql

Last synced: 07 May 2026

https://github.com/hudson-newey/data-miner

A simple data miner that collects information from an API and stores it in a file

api api-client big-data bigdata data logger logging

Last synced: 10 Jun 2026

https://github.com/randomfractals/unfolded-map-snippets

HTML, CSS, JavaScript, and Python 🐍 VS Code snippets ✂️ extension for Unfolded Map 🗺️ and Data SDKs

code data extension map sdk snippets template unfolded vscode

Last synced: 08 May 2026

https://github.com/writetome51/page-load-access

A TypeScript/JavaScript class that loads a batch (array) of data from a larger set too big to be loaded all at once.

batch class data javascript load loader typescript

Last synced: 16 May 2026

https://github.com/natarizkie2/neurochain-airdrop-bot

🍋 — A smart bot designed to complete data tasks like true/false selections automatically, with multi-account support for extra convenience.

airdrop automated bot data multi-account natarizkie neurochain nodejs web3

Last synced: 10 Jun 2026

https://github.com/tupizz/python-data-manipulation

Data manipulation and visualization with Python 2.x

csv data pandas python

Last synced: 09 May 2026

https://github.com/pawlo77/nos_snowflake

Network Operating Systems course for DS studies in Winter 2024/25

azure data data-science snowflake snowpark streamlit

Last synced: 09 May 2026

https://github.com/stdlib-js/wasm-base-dtype2wasm

Return the WebAssembly data type associated with a provided array data type value.

array base data dtype javascript node node-js nodejs stdlib type types util utilities utility utils wasm webassembly

Last synced: 09 May 2026

https://github.com/sebastianbrzustowicz/flight-quality-overview-microservice

Go + Docker. Microservice with parallel computations to convert raw vehicle flight data into an overview report with visualisation.

container control csv data docker drone flight go goroutines http microservice parallel-computing pdf quadcopter raport rms sse vehicle

Last synced: 10 May 2026

https://github.com/datasqlsantosh/global-energy-consumption-renewable-generation-python-data-analysis-portfolio

This project focuses on analyzing global energy consumption patterns and trends in renewable energy generation using Python data analysis libraries such as Seaborn and NumPy. The analysis aims to explore energy consumption data from various regions worldwide and examine the contribution of renewable energy sources over time

data data-analysis data-visualization pandas seaborn

Last synced: 10 May 2026

https://github.com/hemangsharma/assignment-2---classification-models

The Assignment 2 - Classification Models repository contains the project for 36106 Machine Learning Algorithms and Applications

data datascience-machinelearning machine-learning ml

Last synced: 10 Jun 2026

https://github.com/brightway-lca/bw_io

IO tools for Brightway LCA framework

bw3 data life-cycle-assessment python

Last synced: 10 Jun 2026

https://github.com/sebastian-diaz-berdecia/analisis-popularidad-de-series-y-generos-de-series

SQL queries for analyzing the popularity of series and series genres in the NetflixDB database.

business-analytics bussiness-intelligence data data-analysis database mysql mysql-database sql

Last synced: 12 May 2026

https://github.com/vbhatsaccnt/retail-strategy-and-analytics-optimization-of-control-stores-for-sales-enhancement

In this project, we aim to optimize the performance of retail chain stores by establishing control stores based on their performance compared to selected trial stores. By leveraging data analytics and strategic insights, we seek to enhance sales revenue and drive growth within the retail chain.

customer-segmentation data data-science risk-analysis

Last synced: 13 May 2026

https://github.com/andygol/andygol.github.io

Andrii Holovin – Product & Project Manager Geospatial Expert / OpenStreetMap Consultant / DevOps practitioner

consultant data data-structures devops experience floss gis mapping navigation openstreetmap personal-site personal-website

Last synced: 13 May 2026

https://github.com/flexiui-labs/flexi-grid

Flexi Grid is an advanced, lightweight, and customizable Angular 19 data grid component

angular data filter grid search select sort table

Last synced: 14 May 2026

https://github.com/lulloooo/article-fromfitto55tofittoeveryone

Analysis behind an article published in the EcoSprinter 2024 annual edition, examining the EU "Fit for 55" packages from a different perspective 🔎

analysis data environment european-union

Last synced: 12 Jun 2026

https://github.com/neuro-mechatronics-interfaces/ros2_data_agent

Code for a multipurpose file explorer specializing in reading ROS2 topic data from '.bag' or '.db3' files

data python ros2

Last synced: 13 Jun 2026

https://github.com/word2vect/beijing-pm2.5-data-process

Beijing PM2.5 Data Process for Python Programming 2024 Fall Data Visualization Lab 2

data python visualization

Last synced: 15 Jun 2026

https://github.com/arch-fan/pokedata

Pokemon Data in CSV format for whatever you need!

csv data dataset pokemon

Last synced: 17 Jun 2026

https://github.com/ibttf/bayborhood

Interactive map to find the ideal neighborhood in San Francisco based on data.

data data-analysis data-visualization gis mapbox react

Last synced: 18 Jun 2026

https://github.com/dineshram0212/youtube-analysis

This YouTube Analysis Package provides tools for analyzing YouTube video data, including metrics on views, likes, comments, and engagement trends. Ideal for gaining insights into video performance and audience interaction patterns.

data data-visualization pandas python webscraping youtube-api-v3

Last synced: 19 Jun 2026

https://github.com/petzi53/repairdata

Open Repair Alliance Datasets 2021

data open-data open-datasets r repair repair-cafe repairs

Last synced: 22 Jun 2026

https://github.com/mtnzorlu/quiz-content-builder

Structured JSON quiz data builder for developers

builder data education json vue

Last synced: 23 Jun 2026

https://github.com/azkarmoulana/winter-of-data-2019

❄️ ⛄ Winter of Data is coming... 🐺

data data-science machine-learning mathematics

Last synced: 05 Feb 2026

https://github.com/chowington/bg-counter-tools

A set of tools that can pull data from Biogents BG-Counter smart mosquito traps and convert them into a Darwin Core compliant format.

bg-counter biogents darwin-core data internet-of-things mosquito-prevalence population-dynamics

Last synced: 10 Oct 2025

https://github.com/d4niee/exifpy

A simple console tool to view image metadata

data exif image meta python

Last synced: 23 Mar 2025

https://github.com/dumkydewilde/mcp-memory-layer

A template for building your own BI MCP with dbt, LLMs and multi-user corrections

bi data dbt llm mcp-server

Last synced: 13 Mar 2026