An open API service indexing awesome lists of open source software.

data

Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)

https://github.com/cainmi/data-page-project

A repository to pull code and files from, may be used to store page data links, code etc. mainly used for python for now

data html javascript python schema

Last synced: 21 Oct 2025

https://github.com/timxor/bitcoind-data-ingestion

crypto payments bitcoind data ingestion

bitcoind data ingestion

Last synced: 02 Jul 2026

https://github.com/undistraction/grid-model

A small API for creating a grid and accessing the positions of the cells, rows and columns within it.

2d calculations cells data grid layout model

Last synced: 04 Aug 2025

https://github.com/panukatan/senso

An Interface to the Philippine Census of Population and Housing Data

census data philippines r rstats

Last synced: 29 Jun 2026

https://github.com/desininja/data-engineer-interview-questions

This repository contains all the Data Engineer Interview Questions asked by interviewers.

data data-engineer-interview-questions

Last synced: 31 Mar 2025

https://github.com/vishwagauravin/screener-scraper-pro

Effortlessly scrape comprehensive financial data from screener.in and use it in your projects. No API key required.

data finance finances market-data scraper scrapers screener screener-in screener-plugin stock stock-data stock-market stocks

Last synced: 18 Feb 2026

https://github.com/hoaihuongbk/lakeops

A modern data lake operations toolkit working with multiple table formats (Delta, Iceberg, Parquet) and engines (Spark, Polars) via the same APIs.

data data-operations dataengineering datalake

Last synced: 07 Mar 2026

https://github.com/snegovoy98/data-storage

This is test version of data storage

data of storage test version

Last synced: 19 Jul 2025

https://github.com/humbertocg18/pucrs-alest-i-2.3-2023.24

Trabalhos, Projetos, Exercícios e aulas realizados em Java na cadeira de Algoritimos e estrutura de dados 1, matéria do segundo semestre.

beecrowd beecrowd-solution-in-js beecrowd-solutions-in-java data data-structures datastructures-algorithms hashmap hashtable java-8 leetcode leetcode-javascript leetcode-solutions leetcodepra pucrs sorting-algorithms

Last synced: 29 Mar 2025

https://github.com/prioritizr/prioritizrdata

Conservation planning data sets

data r spatial-data

Last synced: 19 Jul 2025

https://github.com/saboye/web-scraping-with-python

A web scraping project using Python's "Requests" and "BeautifulSoup" libraries to extract structured data from one or more websites. This project involves sending HTTP requests to the target website(s), retrieving the HTML content of the website(s), and parsing this content to extract the desired data in a usable format.

beautifulsoup csv data data-harvesting data-mining python request web webscraping

Last synced: 18 Jul 2025

https://github.com/ahmad-ali-rafique/comment-generation-tool

This repository hosts a Jupyter Notebook-based Comment Generation Tool exploring advanced NLP techniques for automated, contextually relevant comment generation from input data. Ideal for developers and researchers in NLP and automated text generation.

ai aitools artificial-intelligence content-based-recommendation data datascience jupyter-notebook machine-learning

Last synced: 07 Oct 2025

https://github.com/simranjeet97/leetcode_practice

Practicing the Leet Code Codes for Competitive Programming

algorithms amazon coding competitive-programming data data-structures facebook google leetcode python

Last synced: 03 Aug 2025

https://github.com/bredalis/datastructure

📚 Estructuras de Datos en Python

algorithms data data-structure python

Last synced: 12 Apr 2026

https://github.com/mierune/tinygrib2

(experimental) A tiny toolkit for parsing JMA's GRIB2 files.

data grib grib2 meteorology rust weather

Last synced: 27 Jun 2025

https://github.com/eve-ning/osumania_data

processed osu!mania data from osu!API

data osu rhythm-game vsrg

Last synced: 24 Feb 2026

https://github.com/e-kotov/mapineqr

Access Mapineq inequality indicators via API

data demogrpahy r rstats socio-economic-indicators

Last synced: 06 Apr 2025

https://github.com/fritzrehde/asciibar

A cli tool to print percentages as ascii bar charts

cli data percentage visualization

Last synced: 02 Jul 2026

https://github.com/tushar2704/insurance-cross-sell

This project harnesses the power of cutting-edge technologies including H2O AutoML, MLflow, FastAPI, and Streamlit to enhance cross-selling campaigns and boost efficiency.

data datascience h20automl machine-learning mlflow python streamlit-tushar2704

Last synced: 08 Oct 2025

https://github.com/diegoperea20/own_dataset_segmentation_yolov8

Segmentacion y detection de objetos con propio dataset usando YOLOV8 , en el que se utiliza un dataset propio de una moneda de 200 pesos colombianos del año 2023.

coins colombia data opencv own python segmentation tensorflow yolov8

Last synced: 12 Apr 2026

https://github.com/bredalis/matplotlib

📊 Library to create graphs in Python 📊

data graphics librery matplotlib matplotlib-pyplot python

Last synced: 30 Mar 2025

https://github.com/devlive-community/mockaroo

一个轻量级的 HTTP Mock 服务器,用于快速构建模拟数据接口,适用于前后端开发和接口测试场景。

data mock

Last synced: 08 Jul 2025

https://github.com/himel-sarder/web-scraping-it-jobs-dataset

This project is a Python-based web scraping tool that collects job listings from TimesJobs for IT-related positions. It extracts job titles, company names, locations, and experience requirements, and saves the data into a CSV file. The tool uses BeautifulSoup and Pandas for web scraping and data manipulation.

data datascience dataset kaggle-dataset machine-learning machinelearning ml web-scraping

Last synced: 22 Feb 2026

https://github.com/codenoid/webtoons.com-database

a Webtoons.com Database, collected by Hofesh Bot (Scrapper)

data database

Last synced: 28 Mar 2025

https://github.com/stdlib-js/ndarray-slice-dimension-from

Return a read-only shifted view of an input ndarray along a specific dimension.

copy data javascript matrix ndarray node node-js nodejs shift slice stdlib structure truncate types vector view

Last synced: 24 Apr 2025

https://github.com/aymane-maghouti/mobile-data-hive-insights

This project demonstrates the process of extracting data from a MySQL database, transferring it using Apache Sqoop, storing it in Hive Data warehouse (the data actually is store in Hadoop Distributed File System (HDFS)), and performing analysis using Hive Query Language (Hive QL) (it is a language close to SQL). Then visualize the data in Power BI,

apache-sqoop data data-integration data-visualization hadoop-hdfs hivedb hiveql powerbi

Last synced: 09 Mar 2026

https://github.com/bileljegham/api-sport-cli

Cli for https://api-sports.io/ Retreive data and convert to sql file

cli data database match nodejs sports sports-analytics

Last synced: 08 May 2026

https://github.com/brianali-codes/github-searcher

A website for API experimentation that users the github Api to search for different users and some of their (public) information

api data github user

Last synced: 21 May 2026

https://github.com/jessielw/parse-fel-master-data

Simple CLI to parse Dolby Vision master data via the RPU/MediaInfo and output data needed for x265

data dolby fel master mediainfo mi parse rpu vision

Last synced: 26 Aug 2025

https://github.com/dalikewara/typego

typego provides custom type that can be used to construct information (such as success data, error data, etc)

custom data golang helper type typego

Last synced: 09 Apr 2025

https://github.com/wooldoughnut310/xboxgamertag

Python module to get data from www.xboxgamertag.com

data gamertag html python3 requests xbox

Last synced: 24 Mar 2025

https://github.com/yasenstar/powerbi_tutorial

Base on "PowerBI Tutorial" book, provide step by step video demo on learning and mastering Power BI tool

analytics data microsoft powerbi tutorial visualization

Last synced: 07 Jan 2026

https://github.com/kunalshelke90/predict-bank-credit-risk-using-south-german-credit-data

This is an end-to-end ML project, which aims at developing a classification model for the problem of classifying a given customer profile into either of the risk category (safe or not safe). The final classifier used for this project is CatBoost classifier. Deployed in AWS.

aws cassandra catboost-classifier classification credit-risk data data-science dataanalysis dockerfile finance financial-analysis flask github-actions logging machine-learning mlflow numpy pandas python

Last synced: 03 Jan 2026

https://github.com/giscience/measures-rest-oshdb-docker

Scripts for starting measures for geospatial datasets in docker container, using the OSHDB

data dggs docker geospatial mesure openstreetmap rest

Last synced: 18 Apr 2026

https://github.com/sefakcmn00/tensorflow_car_price_analysis

In this project, after extracting the data sets as csv, we tried to represent the car prices graphically and schematically by using data analysis and data visualization methods. We checked the connection of the car prices we analyzed with other data, then we created a 4-layer and 12-neuron system.

data datatrain keras machine-learning matplotlib-pyplot pandas seaborn sklearn tensorflow

Last synced: 14 Apr 2026

https://github.com/incubrain/awesome-maharashtra-data

A collection of datasets specific to Maharashtra, India. WIP

ai artificial-intelligence data data-analysis data-science datasets maharashtra marathi

Last synced: 23 May 2026

https://github.com/xdrokra/road-accident-analytics

A data visualization project that maps and analyzes road accidents across major Italian municipalities in 2023

analytics data design italy javascript

Last synced: 30 Aug 2025

https://github.com/stdlib-js/ndarray-base-zeros-like

Create a zero-filled ndarray having the same shape and data type as a provided ndarray.

base data fill filled javascript matrix ndarray node node-js nodejs stdlib structure types vector zeros

Last synced: 04 Oct 2025

https://github.com/stdlib-js/array-base-last

Return the last element of an array-like object.

array data generic javascript last node node-js nodejs stdlib structure types

Last synced: 30 Aug 2025

https://github.com/tatey/list_of_baby_names

A list of baby names given to tiny humans in Ruby

data names ruby

Last synced: 11 Nov 2025

https://github.com/geo-y20/uber-rides-data-analysis

This project aims to analyze Uber ride data to understand various aspects of ride usage, such as the distribution of rides across different categories, purposes, months, days, and times.

dashboard dashboard-templates data data-analysis data-analysis-python data-analytics data-visualization pandas powerbi python recommendation-system rides uber

Last synced: 13 Apr 2026

https://github.com/miniql/miniql-express-mongodb-example

A MiniQL example for querying a MongoDB database through an Express REST API.

data database mongodb query query-language

Last synced: 19 Apr 2026

https://github.com/grkndev/twitcher

A great library that will allow you to use the Twitch API service. All you need to do is use your Token and Client Id information.

api clip clipr data javascript nodejs npm npm-package npmjs streamers streaming twitch twitch-api twitch-bot twitchtv twtich-clip user

Last synced: 09 Mar 2026

https://github.com/public-health-scotland/waiting_times_clinical_prioritisation

This repository contains the Reproducible Analytical Pipeline (RAP) to produce the quarterly statistics on clinical prioritisation, part of the Stage of Treatment (SoT) publication.

data healthcare nhs public-health scotland shiny shiny-app treatment waiting-time

Last synced: 26 Jul 2025

https://github.com/nafisalawalidris/sales-performance-dashboard

Sales Performance Dashboard: Analyze and visualize sales data using Power BI. Gain insights into trends, customer segments, product performance, and geographic distribution. Make data-driven decisions to optimize sales strategies and maximize revenue.

analytics-revenue dashboard-power-bi data data-analysis intelligence-sales optimization performance sales visualization-business

Last synced: 03 Feb 2026

https://github.com/mtingers/opacify

Opacify reads a file and builds a manifest of external sources to rebuild said file.

backup data obfuscation python

Last synced: 18 May 2026

https://github.com/patelabhi574/hotel_reservation_analysis

Analyzing data collected by hotel to make future prediction for the owner of what are the segments they are making most profit & also which are the patterns & trends which have been seen over the past years in the booking in different times throughout the year and price setting on the website in peak time as per availability index.

data data-visualization datamodeling looker-studio powerbi reporting sql-query sql-server

Last synced: 19 Feb 2026

https://github.com/n4ze3m/timezone-json

JSON file with more than 1642 cities timezone in UTC format.

data json timeszone

Last synced: 19 Jul 2025

https://github.com/petermartens98/nba-analytics-streamlit-app-with-langchain-agent

Interactive NBA Analytics app with Streamlit and a LangChain conversational agent connected to extracted data. Explore player, team, and game stats, track injuries, run simulations, visualize trends, and get AI-powered insights. Ongoing development, open to collaboration.

agentic-ai analysis data deepseek langchain nba python streamlit visualization

Last synced: 08 May 2026

https://github.com/bukalapak/bukadata

Data supplier plugin for populating design with real data.

data plugin sketch sketch-plugin

Last synced: 05 Jul 2025

https://github.com/jerryfzhang/rockets

A Node + React App that displays space launch missions around the world.

bootstrap data expressjs less momentjs nodejs react reactjs reactstrap

Last synced: 10 Apr 2026

https://github.com/stonecharioteer/renfield

Synchronize and Search through Hard Drives

catalogue data search storage synchronization

Last synced: 09 Feb 2026

https://github.com/labwhatever/leetcode

Collection of LeetCode questions to ace the coding interview!

data data-structures-and-algorithms dsa leetcode-cpp leetcode-solutions structure structure-learning

Last synced: 22 Aug 2025

https://github.com/ibz-04/data-encryption

Encrypting and Decrypting given data of hospital patients such as: audio & image files

data decryption encryption

Last synced: 23 Jul 2025

https://github.com/rayenfathallah/students_analysis

This projects contains an analysis of the different fadtors affecting students performance in their final exams. The project uses D3.js to create interactive dashboards that are compelling and easy to interpret.

analysis d3 data education javascript python students

Last synced: 12 Apr 2026

https://github.com/aadityatamrakar/futures_spread_chart

Cash Market & Futures Daily Spread Chart - NSE Stocks

data data-analysis data-mining expressjs nodejs requests

Last synced: 10 Apr 2026

https://github.com/stdlib-js/array-one-to-like

Generate a linearly spaced numeric array whose elements increment by 1 starting from one and having the same length and data type as a provided input array.

array data float32array float64array int16array int32array javascript matrix ndarray node node-js nodejs stdlib structure typed typed-array types uint32array vector

Last synced: 20 Feb 2026

https://github.com/seguradevinn/data-project

A healthcare data audit demo using CMS SynPUF and DuckDB, showing how raw claims are cleaned, validated, and transformed into a 2009 cohort with descriptives and a RADV-style chase list.

auditing cms data duckdb sql

Last synced: 02 Sep 2025

https://github.com/shysolocup/stews

Stews is a Node.JS package meant to make storing data easier by mixing parts from common data types.

aepl array arrays data datatypes html javascript js json map maps nodejs object objects package set sets stews

Last synced: 25 Jul 2025

https://github.com/freddy03h/immutable-data-structure

Normalize and Merge your application's data store using Immutable.JS objects

data immutable redux store

Last synced: 05 Oct 2025

https://github.com/carlotta94c/sql4datascientistsdemo

Demo material for Microsoft Reactor session "Getting Started with Databases: SQL and Data Visualizations"

analysis data r sqlite tidyverse visualisation

Last synced: 18 Apr 2026

https://github.com/r-mahesh45/hr---resume-text-classification

Text Classification for Resumes: Conducted Exploratory Data Analysis (EDA) on a vast collection of resumes. Organized the data using Bag of Words (BoW) and TF-IDF techniques. Built and evaluated multiple models, with Logistic Regression delivering standout performance. Created Word Clouds and Histograms.

data datacleaning extract-transform-load feature-extraction nlp nltk-tokenizer text-mining text-processing

Last synced: 12 Sep 2025

https://github.com/so-cool/uobrain

My solution to the University of Bristol PURE Data Challenge

competition data modeling

Last synced: 09 Sep 2025

https://github.com/gusenov/qazaqstan-geography-data

:world_map: Географические данные Казахстана.

data geographic-data geography json kazakhstan qazaqstan regions

Last synced: 20 Feb 2026

https://github.com/snimmagadda1/stack-exchange-dump-to-mysql

Batch pipeline to import Stack Exchange XML data dumps to relational DB

batch data mysql spring-batch stackoverflow

Last synced: 30 Mar 2025

https://github.com/gher-uliege/bluecloud-plankton

Spatial interpolation of plankton data using a neural network

data data-analysis data-visualization neural-network oceanography

Last synced: 30 Mar 2025

https://github.com/oya163/corteva

Corteva Data Ingestion Pipeline

corteva data engineering etl

Last synced: 25 Jul 2025

https://github.com/bredalis/seaborn

📊 Library to create graphics 📊

data graphics-programming librery python seaborn seaborn-plots

Last synced: 04 Mar 2025

https://github.com/adrian-pasek-prv/data-modeling-with-cassandra

Create a data model in Apache Cassandra for music streaming app

apache-cassandra data data-engineering data-modeling python

Last synced: 02 Jan 2026

https://github.com/jaldekoa/fdicapi

A Python wrapper to easily retrieve data from the BankFind Suite official API from FDIC in pandas format.

api api-wrapper banking data finance pandas python united-states

Last synced: 07 Jan 2026

https://github.com/davidgamero/gatech-covid-data-scraper

Utility for scraping GATech Exposure Alert Information into a CSV file with automated case number extraction and aggregation

covid data gatech georgia scraper

Last synced: 31 Mar 2025

https://github.com/san089/black-friday-sales-analysis

This Project gives an insight into few statistics related to black Friday Sale.

custom data dataanalysis insights sales statistics

Last synced: 13 Jul 2025

https://github.com/nikhilash45/live_ipl_report

This repository hosts the source code for an interactive IPL (Indian Premier League) Dashboard built using PowerBI. The dashboard provides real-time updates on ongoing matches, including live scores, batting and bowling statistics for both teams, and the points table.

analysts cleaning-data cricket-data dashboard data data-analysis data-visualization dax powerbi

Last synced: 19 Mar 2026

https://github.com/ngambip/priscilla

About my work and Experience

accounting analytics data finance-management

Last synced: 03 Feb 2026

https://github.com/camara94/introduction-to-data-engineering

Describe the different entities that form a modern data ecosystem. Describe and differentiate between the role and responsibilities of Data Engineers, Data Scientists, Data Analysts, Business Analysts, and Business Intelligence Analysts. Explain what Data Engineering is. List the tasks that need to be performed in a typical data engineering lifecycle. Describe what a day in the life of a Data Engineer looks like.

business-analytics business-intelligence data dataingestion dataintegration datascience machinelearning python statistical-analysis

Last synced: 09 Apr 2025

https://github.com/pharo-ai/data-imputers

This project contains transformers for missing value imputation

ai data data-science imputer pharo pharo-smalltalk smalltalk

Last synced: 18 Jan 2026

https://github.com/sksubhadeep/nashville-housing-data-cleaning-project-using-sql

SQL Data Cleaning Project on Nashville Housing Dataset

data datacleaning sql

Last synced: 19 Mar 2026

https://github.com/sambacha/yearn-finance-data

data repo for proposed YIP-DATA

cryptocurrency data erc20 ethereum exchange yearn yip yyip

Last synced: 18 May 2026

https://github.com/sixarm/sixarm_ruby_fab

SixArm.com → Ruby → Fab gem to fabricate sample data for testing

data fabrication factory fake gem mock ruby

Last synced: 24 Jul 2025

https://github.com/mascanho/ruddit

CLI to interact with Reddit's API to programatically retrieve data

cli data marketing rust rust-lang rustlang sales

Last synced: 19 Aug 2025