An open API service indexing awesome lists of open source software.

data

Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)

https://github.com/luminati-io/google-search-api

Two methods to collect real Google SERP data—a free scraper for basic use and the enterprise-grade Bright Data API for high-volume demands.

data google-scraper html python serp-api web-scraping

Last synced: 25 Jun 2025

https://github.com/henryssondaniel/teacup-service-visualization-mysql-java

Connect your Teacup visualization data to a MySQL database

data mysql service teacup visualization

Last synced: 19 May 2026

https://github.com/bakangmonei/is_final_assignment

My intelligent systems assignment

data data-science intelligent-systems python

Last synced: 02 May 2026

https://github.com/mrk214/bible-data-es-spa

La Biblia en formato JSON

api bible biblia data god jesus json spanish

Last synced: 05 Apr 2025

https://github.com/naithikjorapur/practive-tanstacktsx

Practice TanStack with React, Vite, and TypeScript to build fast, type-safe apps. Leverage tools like TanStack Query for data management and Vite for a streamlined development experience.

data exercise fetching html-css-javascript json learning-by-doing practice query router tsx

Last synced: 05 Apr 2025

https://github.com/mekramy/ircity

Iran province, county and city data in json format.

data iran-city json mekramy

Last synced: 05 Apr 2025

https://github.com/ashishsingh789/customer_purchase_prediction_using_decision-tree-_classifier

Decision Tree Classifier to predict customer purchases using demographic and behavioral data. Key steps: data preprocessing, EDA, model training, evaluation, and feature importance analysis.

data datascience desiciontree eda machine-learning-algorithms matplotlib numpy pandas-dataframe python seaborn

Last synced: 11 Apr 2026

https://github.com/fastbolt/entity-importer

Entity importing library for importing data from files (CSV and Excel currently) or API into doctrine.

data doctrine2 excel excel-import

Last synced: 17 Feb 2026

https://github.com/styd/sd_struct

Searchable Deep Struct

activesupport data gem openstruct rails ruby structure

Last synced: 18 May 2026

https://github.com/eryks1999/data-collection-project_python

This project allowed me to practice classes, populating json files as well as extracting data.

data git json python

Last synced: 16 Apr 2026

https://github.com/e22m4u/ts-projection

Модуль для работы с проекцией данных для TypeScript

data projection typescript

Last synced: 12 Apr 2025

https://github.com/davidkhala/sql

Standard SQL collection

data sql

Last synced: 06 Apr 2025

https://github.com/heitang/fcu-courseapi

逢甲大學:課程檢索系統 API 使用說明

api data fcu project

Last synced: 27 Jul 2025

https://github.com/kenjyco/mongo-helper

Helper funcs and tools for working with MongoDB

aggregation-pipeline data database kenjyco mongo mongodb python

Last synced: 28 Jan 2026

https://github.com/juniorreisx/movelo-logstica

Movelo is a lightweight logistics simulator built with TypeScript that provides mock order and delivery data for developing and testing UIs, dashboards, and backend features without external APIs.

data hooks lucide-react react tailwindcss typescript

Last synced: 12 Apr 2025

https://github.com/jigyasag18/ibm-power-bi-dashboard-project

IBM Power BI Dashboard Project is a data-driven analysis of employees using IBM's comprehensive dataset, providing insights into key factors contributing to employee turnover and enabling organizations to strategize effectively towards improved employee retention and satisfaction.

data data-visualization dataanalysis dataanalytics dataset datavisualisation datavisualization-project powerbi powerbi-dashboards powerbi-report powerbi-visuals powerbidashboard

Last synced: 07 Mar 2026

https://github.com/shubhamsoni98/classification-with-random-forest---2

Fraud detection is a critical task for financial institutions and businesses. This document outlines the end-to-end process of predicting fraudulent activities using a Random Forest model. The process includes data preparation, exploration, model training, and evaluation.

algorithms anaconda data data-science dataflow feature-engineering jupyter-notebook machine-learning model modeltraining prediction python random-forest sql visualization

Last synced: 20 Jan 2026

https://github.com/notthestallion/data_visualisation-examples

This repository was created to learn and practice graph showing and data visualization. The goal is to gain experience in creating compelling and informative visualizations.

data data-science data-visualization database learn learn-to-code learning learning-by-doing matplotlib matplotlib-figures matplotlib-pyplot visualization

Last synced: 12 May 2026

https://github.com/shrutakeerti/crime-filex

Crime FileX : The mission to trace crime and make this a crime free world

ai aiml analysis crime-data css data html ics js ml

Last synced: 19 Apr 2026

https://github.com/ahadly/sql-data-analytics-project

This repository contains a collection of SQL scripts demonstrating various analytical techniques, such as changes over time, cumulative, performance, data segmentation, part-to-whole analysis.

analytics business-analytics business-intelligence data data-analysis data-analyst data-analytics data-engineering data-science data-scientist database datascience query reporting sql sql-queries sql-query sql-server window-functions window-functions-in-sql

Last synced: 18 May 2026

https://github.com/yugoff/ml-kaggle-regression-with-a-mohs-hardness-dataset

Your Goal: For this Episode of the Series, your task is to use regression to predict the Mohs hardness of a mineral, given its properties

data gradient-boosting kaggle kaggle-competition regression-models

Last synced: 18 May 2026

https://github.com/mkshah605/personal-brand-development

A data-driven approach to a personal brand development project.

branding data data-science growth music personal

Last synced: 12 Sep 2025

https://github.com/kammarah/studentdata

I created & deployed a Streamlit app to store, manage & analyze student data. 📊🎓

connection data data-analysis data-visualization deploy deployments libraries python streamlit streamlit-webapp webapp

Last synced: 18 May 2026

https://github.com/annaanastasy/mushroom-binary-classification-eda-ml

Explored and modeled a competition dataset of mushroom species, focusing on data cleaning, exploratory data analysis, and building machine learning models for accurate classification of edible and poisonous mushrooms.

binary-classification data data-cleaning-and-preprocessing data-science exploratory-data-analysis machine-learning-algorithms xgboost-classifier

Last synced: 29 Mar 2025

https://github.com/josecsotomorales/dataform

Repository for testing dataform

cli data data-engineering data-transformation

Last synced: 27 Mar 2025

https://github.com/nika2811/new-york-city-taxi-fare-prediction

About In this project using New York dataset we will predict the fare price of next trip. The dataset can be downloaded from https://www.kaggle.com/kentonnlp/2014-new-york-city-taxi-trips The dataset contains 8 features along with GPS coordinates of pickup and dropoff

data data-preprocessing data-visualization decision-trees feature-engineering kaggle kaggle-competition linear-regression machine-learning neural-network nyc polynomial-regression ridge-regression scikit-learn taxi taxi-data tensorflow xgboost

Last synced: 06 Apr 2025

https://github.com/hidayathamir/get-telegram-group-data

With these project you can get data in csv file from your telegram group.

bahasa-indonesia data python3 scrape telegram telethon

Last synced: 13 Sep 2025

https://github.com/encelo/nctracer-data

Data files for the ncTracer project

data icons ncine

Last synced: 15 Jan 2026

https://github.com/pratik-codes/zomato_data_eda

Cleaned, analysed messy data and created a predictive model with and accuracy of 93% with tree Regressor algorithm

bengaluru data data-cleaning data-science famous-restaurants restaurants-delivering-online restraunts

Last synced: 27 Mar 2025

https://github.com/moons-14/datapot

Incorporate and serve all information.

ai aiogram api data infomation news newspaper rss video

Last synced: 04 Jan 2026

https://github.com/opengeoshub/vdownload

A Powerful Geospatial Data Downloader

data geospatial opendata

Last synced: 19 May 2026

https://github.com/prasad-chavan1/bank_data_analysis_r

Bank data analysis in R language

data data-analysis data-science r

Last synced: 24 Feb 2025

https://github.com/khansasafira19/sk-cool-storytelling

Source Code for Data Storytelling with HTML5

data html5 javascript storytelling

Last synced: 13 May 2026

https://github.com/furkantosun1607/cse201-data-structure

This repository contains implementations of various data structures completed as part of the CSE201 (Data Structures) course. Each week, a different data structure was implemented during lab sessions.

array arraylist bfs-search binarytree data dfs-search java linkedlist queue stack structure tree-structure

Last synced: 26 Jun 2025

https://github.com/gsmith257-cyber/BIT3434CVE

BI T3434 Project on data mining CVEs and Exploits

cve data data-mining exploits research-project

Last synced: 10 Mar 2025

https://github.com/hallmx/mx_utils

Utility scripts for software development in data science

colaboratory data development nbdev python science scripts software utlities

Last synced: 19 May 2026

https://github.com/yash22222/olympic-games-analytics-using-apache-spark

The "Olympic Games Analytics Using Apache Spark Databricks" project explores data from the Olympic Games (1896-2016) to identify trends and insights. Using Apache Spark for big data processing and Databricks for visualization, the project analyzes key factors like top-performing countries and athlete attributes, showcasing real-world analytics.

apache apache-kafka apache-spark big-data-analytics csv data data-analytics data-visualization databricks excel mysql olympics regions

Last synced: 03 May 2026

https://github.com/tomwhite/misp-2017

MISP camp 2017 materials and code

bioinformatics data data-visualization hackathon

Last synced: 18 Apr 2026

https://github.com/a-poor/taro

A package for repeatable rectangular data transformations in Python.

data data-science data-transformation pipeline pypi-package python

Last synced: 13 Oct 2025

https://github.com/sweta-kaundilya/911-calls-capstone-project

For this capstone project we will be analyzing some 911 call data from Kaggle.

data data-analysis data-visualization jupyter-notebook matplotlib numpy pandas python seaborn

Last synced: 28 Apr 2026

https://github.com/solrikk/vargen

VarGen (Variation Generator) is a user-friendly desktop application designed to simplify the creation of product variations from CSV files.

csv-files csv-format csv-parser data data-engineering excel excelparser python

Last synced: 29 Mar 2025

https://github.com/pedelriomarron/spanish-api-covid19

Data from Spain of COVID-19 (by Datadista) as a service

api covid-19 covid-19-spain data now spain zeit

Last synced: 12 Mar 2025

https://github.com/jormaechea/aws-firehose-producer

Easily produce data for your AWS Firehose Data Stream

aws data firehose producer stream

Last synced: 19 May 2026

https://github.com/haykam821/circle-tracking

A tool for generating Markdown tracking of the Circle of Trust experiment.

circle data markdown reddit subreddit tracker trust

Last synced: 19 May 2026

https://github.com/bodfdaf/api

api data service provider

api data detail instagram lazada shopee tiktok video

Last synced: 11 Mar 2025

https://github.com/vin20777/drone-data-layer

Drone Project Data Layer

csharp data drone layer software-design

Last synced: 18 May 2026

https://github.com/apparaomulpuri/readline

Explains you the usage of readLine function in Swift.

data fromkeyboard keyboard reading readline swift

Last synced: 29 Mar 2025

https://github.com/nathanieliskandar26/data-analysis-project

This project demonstrates my ability to clean and analyze data using Python and SQL so far. The dataset used for this analysis focuses on general customer information. Through this project, I aimed to uncover meaningful insights and trends by cleaning the data and performing structured queries.

analysis data data-cleaning jupyter-notebook mysql mysql-database python

Last synced: 19 Apr 2026

https://github.com/germanpaul12/automating-hacker-news-and-weather-mails

Project for my Raspberry Pi to send me mails when it rains and to inform with hot tech news

beautifulsoup beautifulsoup4 data hacker-news openweather-api raspberry-pi requests

Last synced: 05 May 2026

https://github.com/ember-nexus/reference-dataset

Ember Nexus API backup containing different standardized scenarios

backup data ember-nexus

Last synced: 25 Jan 2026

https://github.com/ashishsingh789/data_visualization

Data visualization project using Python to analyze categorical and continuous variables. Includes bar charts, histograms, and scatter plots. Libraries used: pandas, matplotlib, and seaborn.

analysis barchart data data-science data-visualization histogram matplotlib pandas-dataframe scatter-plot seaborn

Last synced: 07 Sep 2025

https://github.com/mysociety/sync-ep-to-jkan

Syncs EveryPolitician data to mySociety's data portal.

data everypolitician jkan politicians

Last synced: 27 Jul 2025

https://github.com/pawlo77/messenger-analyser

Repo for Data Visualization project, part of IAD study program at Faculty of Mathematics and Information Science, Warsaw University of Technology

data visualization

Last synced: 17 May 2026

https://github.com/ramtinsoltani/safe-cli

A simple Command-line Interface which encrypts and decrypts UTF-8 files using AES-256.

aes-256 cli data data-hook decryption encryption generator handlebars hooks markup partial partial-decryption password safe swap temp temporary tool

Last synced: 16 Apr 2026

https://github.com/questionlp/wwdtm_uniquedates

Script that lists out the unique months and days of months that Wait Wait... Don't Tell Me! shows have aired

data python python3 script wwdtm

Last synced: 17 May 2026

https://github.com/mightymetrika/scdtb

Single Case Design Toolbox

data math r science statistics

Last synced: 04 Jan 2026

https://github.com/lukaszkn/data-software-engineering-interview-questions

Data and Software engineering interview questions

data engineering interview-questions python

Last synced: 20 Jul 2025

https://github.com/ahmad-ali-rafique/logistic-regression-modeling

An in-depth exploration of logistic regression models, including data cleaning, model building, and performance evaluation on various datasets.

accuracy confusion-matrix data dataanalytics logistic-regression logistic-regression-classifier machine-learning-algorithms mlmodels model modelling regression-models

Last synced: 11 Sep 2025

https://github.com/gregoritsch3/project_excel_dataanalysis_carsales

An Excel Data Analysis project based on a vehicle vendor's car sales data from 2014 and 2015 showcasing data cleaning and formatting, DAX, pivot tables and charts, timelines, slicers, an interactive Dashboard, descriptive Statistics and more.

analysis dashboard data excel sales statistics

Last synced: 01 Feb 2026

https://github.com/maximkrouk/storage

Lightweight framework for storing data (beta)

cache data keychain memmory storage swift swift5-1 userdefaults

Last synced: 30 Oct 2025

https://github.com/manifoldfinance/honte

reference data and metrics for sushiswap proposal

data ethereum sushi sushiswap

Last synced: 18 May 2026

https://github.com/nagipragalathan/linkedin_backup_datas

This repository contains the backup data from my previous LinkedIn account. Unfortunately, my old LinkedIn account was compromised and subsequently blocked by LinkedIn. As a result, I created a new account, but that too got blocked for reasons unknown to me.

backup blocked data linkedin linkedin-account memory nagipragalathan recovery storage

Last synced: 18 Jan 2026

https://github.com/renebentes/2806

Curso 2806 - Acesso à dados com C#, .NET 5, Dapper e SQL Server

csharp dapper data dotnet sqlserver

Last synced: 19 Apr 2026

https://github.com/blackroad-os-inc/blackroad-portal

BlackRoad Portal — unified search routing to 30+ BlackRoad services.

blackroad cloudflare-workers data search

Last synced: 04 Apr 2026

https://github.com/v41bh4vr4jput/data-analysis-with-python

This repository is a comprehensive collection of data analysis projects and tutorials using Python's most powerful libraries: NumPy, Pandas, Seaborn, and Matplotlib. It is designed to help you explore, clean, visualize, and analyze data efficiently.

api data data-analysis data-visualization matplotlib numpy pandas python sakila-db seaborn

Last synced: 09 Apr 2026

https://github.com/sirmaxx/log_manager

log manager services for microservices

data fastapi logging microservice mongodb

Last synced: 09 Apr 2026

https://github.com/jstafford5380/provausio.testing.generators

Generate fake data for testing and/or mocking

data fake-data generator testing

Last synced: 14 Jan 2026

https://github.com/ahmadjamil888/ink-flow-share

A medium clone with all basic features such as blog generation , auth and history and user data

articles blogs cs data flow herald ink ink-flow-share journalism medium post react shad shadcn share users vite

Last synced: 09 Apr 2026

https://github.com/0xbitx/dedsec_pastebin-cli

allows you to manage your pastes directly from the terminal

code data paste pastebin payload

Last synced: 25 Jan 2026

https://github.com/lijesh010/roadaccidentanalysisproject

This data analysis project was completed using MS Excel, and includes the creation of a dashboard.

data data-analytics data-exploration data-visualization msexcel

Last synced: 15 Feb 2026

https://github.com/plnech/never2late

Never 2 Late - a reinterpretation of Everest Pipkin's 'i've never picked a protected flower'

dada dada-science data generative-art glitch-art installation nlp poetry spacy vector-similarity wallpaper

Last synced: 10 Jun 2025

https://github.com/bertrand31/one-billion-rows-challenge

🌪️ Pushing Scala to its limits to aggregate a billion rows' worth of data in 2.42 seconds

competitive-programming competitive-programming-contests data data-engineering data-processing performance scala

Last synced: 05 Sep 2025

https://github.com/elkingarcia11/mlb-gameday-obp-odds

Small Python script that pulls MLB team on-base percentage (OBP) for the current season, loads today’s schedule, and writes CSV files that list each team’s OBP edge against its opponent for the day. It also labels each side of a game as betting favorite, not favorite, or equal using American moneylines from ESPN’s public game data.

api csv data http https json mlb mlb-stats-api moneyline odds python rest sports urllib

Last synced: 30 May 2026

https://github.com/pixlcrashr/stwhh-mensa

Better STWHH Mensa menu data / interface / notifier

api crawler data food studierendenwerk-hamburg university website

Last synced: 07 Aug 2025

https://github.com/mubashirsidiki/certifications_work

his repository contains my work, projects, and solutions from various professional certification programs.

analysis coursera data data-science google ibm john-hopkins machine-learning michigan udemy

Last synced: 01 Jul 2025

https://github.com/rubyonworld/ldpath

This is a ruby implementation of LDPath, a language for selecting values linked data resources.

data ldpath resource ruby

Last synced: 12 Nov 2025

https://github.com/pranjaldhamane/social-media-sentiment-analysis

This project aims to analyze sentiment in Twitter data to understand attitudes towards specific topics or entities. It seeks to uncover positive and negative sentiment patterns, detect potential cyberbullying or hate speech, and provide insights into Twitter's overall sentiment landscape.

data dataanalysis logistic-regression nlp-machine-learning python sentiment-analysis twitter

Last synced: 18 Apr 2026

https://github.com/Alpine418/DataHandler

Data handler for PHP arrays.

data data-handler php73

Last synced: 01 Oct 2025

https://github.com/meizuflux/cion

Python minimal data validation library

data minimal python validation

Last synced: 28 May 2026