An open API service indexing awesome lists of open source software.

data

Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)

https://github.com/deliprofesor/breast-cancer-detection-using-svm-with-smote-and-model-optimization

This project analyzes health and lifestyle factors influencing heart attack risk using statistical methods and machine learning, with Ridge Regression identified as the best predictive model.

classification data data-preprocessing data-science data-visualization gridsearchcv machine-learning python roc-curve smote svm

Last synced: 10 Apr 2025

https://github.com/prishabhanot/facial_recognition_pca

A face recognition system using Principal Component Analysis (PCA) for dimensionality reduction and a Support Vector Machine (SVM) classifier for classification. PCA extracts essential features (eigenfaces) from facial images, significantly reducing computational complexity while retaining critical information for accurate recognition.

data eigenfaces facial-recognition pca python reducing-computational-complexity reducing-data-dimensions svm-classifier

Last synced: 01 Mar 2025

https://github.com/contawo/travel-journal

This is a travel journal application for storing all the places that you have visited. I was learning by doing react when creating this project. I learnt a lot with it and upgraded my reactjs skills.

data learning-by-doing props reactjs

Last synced: 05 May 2026

https://github.com/remcostoeten/github-and-vercel-api-showcase-dashboard

Showcase results of possible fetched data from the Github and Vercel API built in all vanilla js.

api-rest da data express-js github-api nodejs vercel-api

Last synced: 07 Mar 2026

https://github.com/musamairshad/dsa-python

This repository contains all the material related to Data Structures and Algorithms implemented in Python.

algorithms data datastructures efficiency python searching-algorithms sorting-algorithms

Last synced: 25 Mar 2025

https://github.com/dhanish03/reliance-sales-report-dashboard

This project, Reliance Sales Report Dashboard, showcases a dynamic and interactive Power BI dashboard designed to analyze sales performance. The dashboard provides key insights into various aspects of sales data, including product-wise performance, region-based revenue, and profitability trends.

data datavisualization-project powerbi visualization

Last synced: 23 Jan 2026

https://github.com/munas-git/codm-review-analysis-and-predictions

Sentiment analysis on Call of Duty Mobile Google Play Store user reviews with ML model to classify new reviews.

data flask machine-learning python sentiment-analysis

Last synced: 05 May 2026

https://github.com/sakshamarora07/blinkit-sales-report-power-bi

This dashboard provides Blinkit with insights to optimize its grocery delivery operations and understand customer preferences. It evaluates sales trends, outlet performance, and item categories to identify key areas for improvement. The interactive visuals allow detailed exploration of sales distribution, customer ratings, and product popularity.

data data-science dataanalytics datavisualization excel powerbi sql

Last synced: 08 Jan 2026

https://github.com/raghavendranhp/youtube_data_harvesting

The "YouTube Data Analyzer" is a versatile tool for businesses and content creators, enabling them to gather, analyze, and harness valuable insights from multiple YouTube channels. With streamlined data collection, storage in MongoDB, migration to SQL, and a user-friendly Streamlit interface, it empowers users to make data-driven decisions

apiintegration data datacollection eda googleapi googleapiclient matplotlib mongodb mysql mysqlconnector numpy oops pandas pymongo python pythonoops sql sqlalchemy streamlit youtube-api

Last synced: 13 Apr 2026

https://github.com/codegouvfr/codegouvfr-sources

🧒 Static web frontend for code.gouv.fr

bluehats codegouvfr data frontend

Last synced: 28 Feb 2025

https://github.com/rorylshanks/devdb-client

This is the repository for the official command line client for DevDB (https://devdb.cloud)

cloud data database-management development

Last synced: 29 May 2026

https://github.com/shadmanshaikh/data-analysis-and-ml-work

All of my work in Data Analysis and Machine learning

analytics artificial-intelligence data machine-learning

Last synced: 05 Jul 2025

https://github.com/louis-heraut/dataverseur

πŸ«– A dataverse API R wrapper to enhance the deposit procedure using only R variable declarations

data data-repository data-science datascience dataset dataverse dataverse-api json metadata metadata-management metadata-parser r

Last synced: 24 Oct 2025

https://github.com/gianlucatruda/titanic

An exhibition of my experience in data processing and visualisation. Python script to process and visualise the Titanic survivor data.

data database flask info matplotlib python science scrape server titanic visualisation web

Last synced: 10 Apr 2026

https://github.com/moscatellimarco/webscrap-tinydeal

"WebScrap-TinyDeal" is a Scrapy-powered πŸ•·οΈ tool for harvesting product information 🏷️ from TinyDeal. It outputs structured CSV data πŸ“, ready for analysis. Explore the scripts πŸ‘¨β€πŸ’» for an interactive scraping adventure or leverage the data for competitive pricing strategies πŸ“ˆ.

css data datascience html pandas python scrapy web webscraper webscraping

Last synced: 14 Apr 2026

https://github.com/ztgx/muvera

MUVERA: Making multi-vector retrieval as fast as single-vector search

algorithms data google muvera retrieval rust search structure vector

Last synced: 25 Oct 2025

https://github.com/prajjwol09/power-bi-project

The Data Survey Breakdown is an interactive Power BI dashboard designed to present insights gathered from a survey of professionals and enthusiasts in the data industry.

dashboard data interactive powerbi survey

Last synced: 15 Mar 2026

https://github.com/byndyusoft/byndyusoft.data.relational

Relational abstractions for Byndyusoft.Data.Relational.

byndyusoft data dataaccess db relational-databases

Last synced: 25 Oct 2025

https://github.com/ayush-raj8/godata

Write data to file. Standardizes the format for easy parsing and read by other programs.

data golang

Last synced: 18 Jan 2026

https://github.com/brayflex/spy-sector-rotation-google-sheet

Creates a dynamic spreadsheet to visualize SPY and it's 11 largest sector ETFs. See market trends and identify potential sector rotation opportunities.

data etf google-sheets index price rotation script sector spreadsheet spy stock-market

Last synced: 29 Jun 2026

https://github.com/greedchikara/dsajs

Data Structures and Algorithms written in Javascript

algorithms data structures

Last synced: 09 Apr 2026

https://github.com/solrikk/bluemoon

This project is a Go language tool designed to automatically download, process, and save product data from a remote server into a CSV file.

analyze converter data go golang xml-parser

Last synced: 31 Jul 2025

https://github.com/farrelfaricaf/exploratorydataanalyst---titanic

This project analyzes the Titanic dataset using exploratory data analysis (EDA) and visualization techniques to identify survival patterns. The goal is to understand how demographic factors like gender and age influenced survival rates during the 1912 disaster.

data data-analysis data-science data-visualization eda python titanic-dataset

Last synced: 31 Jul 2025

https://github.com/cunfuu/network-bubbles

For Easier to manage organizations and keeping notes about them to organize events and easy access their needs

data data-visualization organizations organizations-volunteer

Last synced: 31 Jul 2025

https://github.com/revolutionarybukhari/datawarehouse_meshjoin_superstore

A dataware house is generated for streaming data of a superstore using extended mesh join by Syed Husnain Haider Bukhari

data data-science data-warehousing meshjoin

Last synced: 23 May 2026

https://github.com/aaronspindler/selfdrivingcar

Learning deep learning and making a self driving car in the process

car data deep deep-learning driving keras learning machine machine-learning python self self-driving-car

Last synced: 09 Apr 2026

https://github.com/jigyasag18/ai-ml-salaries-and-ai-tools-usage-trends

This repository presents an in-depth Power BI analytics report on the AI job market trends and student AI tool usage from 2020 to 2025. It combines structured datasets (job postings, salaries, surveys) with custom DAX measures to uncover key patterns in salaries, remote work, industry demand, and student engagement. 5 interaractive dashboards made.

analysis data data-analysis data-visualization dataanalysis dataanalytics dataset datavisualization power-bi powerbi powerbi-dashboards powerbi-desktop powerbi-report powerbi-visuals powerbidashboard visualization

Last synced: 16 Feb 2026

https://github.com/jigyasag18/global-terrorism-1970-2017-analysis-using-big-data

This repository explores over 180,000 terrorist incidents across 205 countries using Hadoop and Power BI. The project identifies global and regional patterns in terrorism, analyzes the impact on civilians, and highlights high-risk areas. Key insights include attack trends,weapon usage,top terror groups,& country-specific risks like those in India.

big-data big-data-analytics data data-analysis data-visualization dataanalytics dataset hadoop hive hive-database hive-db hivedb power-bi powerbi powerbi-dashboards powerbi-desktop powerbi-report powerbi-report-validation powerbi-visuals powerbidashboard

Last synced: 19 Feb 2026

https://github.com/nushratjabenaurnima/cse_477_data_mining

A collection of labs, reports, Jupyter notebooks, and project outputs for the CSE 477 Data Mining course. This repository tracks my learning journey through data preprocessing, association rules, clustering, classification, and real-world data analysis with Python.

data data-analysis data-mining data-science google-colab-notebook jupyter-notebook machine-learning python python-3

Last synced: 09 Apr 2026

https://github.com/plurid/delog

Cloud Service for Centralized Logging

cloud data logging

Last synced: 08 Nov 2025

https://github.com/badranalyst/covid-deaths-and-vaccinations-sql-data-exploration

This project involves exploratory data analysis on COVID-19 deaths and vaccinations data using SQL. It aims to uncover trends, patterns, and insights related to vaccination rates and their impact on mortality. The analysis provides a clearer understanding of the pandemic's dynamics, facilitating data-driven decisions in public health.

covid-19 data data-exploration dataset sql

Last synced: 19 Feb 2026

https://github.com/entorb/analyze-ha-energy

Analyze Home Assistant Solar Production Data

data home-assistant pandas photovoltaic pv python

Last synced: 08 May 2026

https://github.com/jun-labs/jq

🧷 Let's practice jq.

data jq json json-data parse

Last synced: 27 Sep 2025

https://github.com/alecxcode/table-parser

Python Table Parser (data extraction)

automation data extraction python robotic-process-automation

Last synced: 04 May 2026

https://github.com/ddeepanshu-997/support_vector_regression--svr-

In this repository i performed a support vector regression on real life data , initially i performed some data preprocessing technique in order to filter out the data flaws then undergoes the process of model building i.e SVM regression in order to make a machine learning regression model.

data data-science regression-analysis regression-models svm-model svm-regression

Last synced: 03 Aug 2025

https://github.com/haimonmon/j3mify

Convert your jejemon word into a formal sentence or word

data jejemon nlp normalization python regex tagalog tokenization

Last synced: 12 Oct 2025

https://github.com/e22m4u/ts-data-schema

Валидация Π΄Π°Π½Π½Ρ‹Ρ… ΠΈ ΠΏΡ€ΠΈΠ²Π΅Π΄Π΅Π½ΠΈΠ΅ Ρ‚ΠΈΠΏΠΎΠ² для TypeScript

data schema typescript validation

Last synced: 05 Aug 2025

https://github.com/elissorokin/data-analyst-portfolio

Π­Ρ‚ΠΎ Ρ€Π΅ΠΏΠΎΠ·ΠΈΡ‚ΠΎΡ€ΠΈΠΉ, Π² ΠΊΠΎΡ‚ΠΎΡ€ΠΎΠΌ я Π΄Π΅ΠΌΠΎΠ½ΡΡ‚Ρ€ΠΈΡ€ΡƒΡŽ свои Π½Π°Π²Ρ‹ΠΊΠΈ, дСлюсь ΠΏΡ€ΠΎΠ΅ΠΊΡ‚Π°ΠΌΠΈ ΠΈ ΠΎΡ‚ΡΠ»Π΅ΠΆΠΈΠ²Π°ΡŽ прогрСсс Π² области Π°Π½Π°Π»ΠΈΠ·Π° Π΄Π°Π½Π½Ρ‹Ρ… ΠΈ Data Science.

ab-testing data data-analysis datalense matplotlib numpy pandas plotly portfolio postgresql python scipy seaborn sql statistical-analysis

Last synced: 09 Apr 2026

https://github.com/0xbitx/dedsec_pastebin-cli

allows you to manage your pastes directly from the terminal

code data paste pastebin payload

Last synced: 25 Jan 2026

https://github.com/renebentes/2806

Curso 2806 - Acesso Γ  dados com C#, .NET 5, Dapper e SQL Server

csharp dapper data dotnet sqlserver

Last synced: 19 Apr 2026

https://github.com/sourceduty/data_marketer

πŸ’° Analyze uploaded data and prepare a data marketing plan for selling data. Create data product plans.

ai ai-data ai-tool artificial-intelligence business chatgpt company custom-gpt customgpts data data-business data-market data-marketer data-marketing data-tool gpt gpt-store gpts gptstore openai

Last synced: 03 Sep 2025

https://github.com/mchenryspagg/wrangle-and-analyze-data

This project which is known as 'wrangle and analyze data' involves the wrangling of WeRateDogs twitter archive data from the period of 2015 to 2017

api data dataanalysis datacollection datawrangling datetime json numpy os pandas pil python requests tweepy-api visualization

Last synced: 09 Apr 2026

https://github.com/chompfoods/sdk-java

Java SDK for the Chomp Food & Recipe Database API. Use our API to get high-quality data on recipes and 875,000+ branded/grocery foods plus raw ingredients.

api branded chomp data database food gradle grocery ingredients jar java java-sdk nutrition openapi raw recipe-api recipes sdk

Last synced: 09 Apr 2026

https://github.com/kolyaventuri/covid-act-now

A CovidActNow.org API client

covid data typescript

Last synced: 09 Aug 2025

https://github.com/srindot/fwuav-average-flight-data-collection

This repository is designed for collecting average data for a flapping wing UAV. The script acg_coeff_data_collection.py runs the necessary data collection, and the resulting data is saved into a CSV file called AverageFlightData.csv.

data flaping-uav

Last synced: 10 Aug 2025

https://github.com/ashita-ai/ashita-ai.github.io

Ashita AI - The island of misfit data tools

ai data

Last synced: 19 Feb 2026

https://github.com/mcraiha/datagensharp

C# managed library for generating data

csharp data generator

Last synced: 11 Aug 2025

https://github.com/keziatbnn/supervised-regression-salaryprediction

Make salary predictions based on years of experience using supervised regression.

data data-analysis-python data-prediction data-science python

Last synced: 11 Aug 2025

https://github.com/kadirlofca/unity-csvmaker

Quick and easy way to create and export .csv files from Unity.

csharp data database unity

Last synced: 09 Apr 2026

https://github.com/aaisha-nexus/sql_company_insights

A beginner-friendly SQL project for managing employee records, departments, and sales transactions. Includes table creation, optimized queries, stored procedures, and window functions to extract business insights.

business-analytics data data-analysis dataanalysis-projects dataanalytics database-schema mssql-database query relational-databases sql sql-query ssms

Last synced: 12 Aug 2025

https://github.com/jleung51/foundations-dags

Data ETL pipeline to clean, process, and aggregate data from Canadian housing starts.

data data-engineering etl extract housing load pipeline transform

Last synced: 04 Oct 2025

https://github.com/seqeralabs/ffq-api

A minimal wrapper to make ffq searches available via a REST API.

api data fastq fetch-fastq ffq genomics

Last synced: 15 Aug 2025

https://github.com/rijkvanzanten/ds-fa-1

The first final assignment for the data structures class

assignment data final map now parsons structures thenewschool

Last synced: 04 Oct 2025

https://github.com/sulujulianto/population-data-retrieval-and-analysis

I created a simple program that can be used to search for global population data or population data from various countries using Python.

data population world

Last synced: 09 Mar 2026

https://github.com/srindot/average_flightdata_collection_fwuaav

This repository is designed for collecting average data for a flapping wing UAV. The script acg_coeff_data_collection.py runs the necessary data collection, and the resulting data is saved into a CSV file called AverageFlightData.csv.

data flaping-uav

Last synced: 18 Aug 2025

https://github.com/progati00/marketing-mix-modeling-mmm-for-marketing-budget-optimization

A Marketing Mix Modeling (MMM) project using Python to analyze channel performance, calculate ROI, and simulate marketing budget changes for better business decisions. Includes a trained Linear Regression model, ROI analytics, and a Flask API for revenue prediction.

api budget-optimization data data-analysis data-science ecommerce eda flask jupyter-notebook linear-regression machine-learning marketing-analytics marketing-mix-modeling python roi-analysis vscode

Last synced: 14 Apr 2026

https://github.com/hakusaro/facts

A fact based knowledge system (FBKS) experiment.

data facts hacktoberfest

Last synced: 03 Jan 2026

https://github.com/rationalprabal/book-management-app

A Node.js and Express.js application for managing books, featuring role-based authentication and authorization with JWT, file uploads for book cover pages, robust data validation and documentation using swagger. The project includes user roles such as Admin, Author, and Reader, each with specific permissions.

data expressjs jwt-authentication mongodb mongoose nodejs rbac-roles

Last synced: 10 Apr 2026

https://github.com/rugwiroparfait/alx_sql

This repo is where I save my queries and learning materials in Data Science program from ALX

anaconda data data-analysis jupyter-notebook sql

Last synced: 19 Aug 2025

https://github.com/giscience/measures-rest-oshdb-app

A frontend for providing measures for geospatial datasets, using the OSHDB

data dggs geospatial measure openstreetmap rest

Last synced: 20 Apr 2026

https://github.com/ahmad-ali-rafique/wine-quality-dataset

Comprehensive analysis and modeling of the Wine Quality dataset, including exploratory data analysis (EDA), data preprocessing, model training, and performance evaluation using MSE and RMSE.

analytics data datacleaning decision-tree-regression exploratory-data-analysis gradient-boosting-regressor linear-regression machine-learning mean-square-error model

Last synced: 21 Aug 2025

https://github.com/paulrosset/cyclone

Network data consumption monitoring

data monitoring network networking

Last synced: 23 Aug 2025

https://github.com/anuragagarwal96/hospital-mortality-rate-sql-analysis

In this project, I have taken a hospital dataset from Kaggle, analysed it and predicted the mortality rate of patients who have been admitted in hospitals. I have utilised a combination of SQL, Tableau and Microsoft Excel for this project.

data data-visualization dataanalysis dataanalysisusingsql excel msexcel mssqlserver sql tableau tableau-public

Last synced: 09 Mar 2026

https://github.com/luminati-io/google-maps-dataset-samples

A sample dataset of over 1000 Google Maps businesses, extracted using the Bright Data API, ideal for competitor analysis, location-based marketing, and market strategies.

api data dataset google-maps maps web-scraping

Last synced: 03 Jan 2026

https://github.com/franckalbinet/maris-crawlers

Automated data harvesting of MARIS data sources

automation data marine-radioactivity

Last synced: 25 Aug 2025

https://github.com/0xnu/data-analyst-training

The repository contains training materials for data analysts.

data data-analysis data-analyst

Last synced: 25 Aug 2025

https://github.com/darshjasani/insurance-claim-analysis

This dataset contains insightful information related to insurance claims, giving us an in-depth look into the demographic patterns of those receiving them.

analysis data kaggle sql

Last synced: 27 Aug 2025

https://github.com/schoolsquirrel/holiday-data

Automatically updated holiday data for SchoolSquirrel

data holidays schoolsquirrel scripts vacation

Last synced: 03 Oct 2025

https://github.com/roggersanguzu/weather-medical-expense-prediction-ml-models

This repo contains a model for determining the rainfall patterns and another for medical expense prediction model

data data-analysis data-science datasets joblib machine-learning machine-learning-algorithms scikitlearn-machine-learning

Last synced: 30 Aug 2025

https://github.com/ate47/playerdata

Get data about a player with a command

bukkit-plugin command data spigot-plugin

Last synced: 30 Aug 2025

https://github.com/olekscode/datageneration

Exploring the methods of data generation for different Machine Learning algorithms

data javascript machine-learning

Last synced: 05 Apr 2025

https://github.com/passly-nl/data

Source code of the data layer.

data passly ticketing typescript

Last synced: 27 May 2026

https://github.com/sungchun12/demotron

CLI to delight real people with live demos

cli data demo sqlmesh

Last synced: 26 Feb 2025

https://github.com/agdturner/ccg-data

A modularised Java library for processing data sets with classes for: data records; collections of data records; and identifiers.

data data-analysis

Last synced: 12 Jan 2026

https://github.com/koppalexander/flightdelaychallenge

This project focuses on predicting flight delays using historical data from a Tunisian airline. We analyzed patterns in airport operations and flight schedules to build a machine learning model that can forecast potential delays.

data data-science machine-learning machine-learning-algorithms machinelearning prediction predictive-modeling

Last synced: 19 Jun 2026

https://github.com/badranalyst/data-cleaning-and-exploratory-data-analysis-project

This project uses SQL to clean and analyze a layoffs dataset. Data cleaning tasks include removing duplicates, standardizing values, and handling missing data. Exploratory analysis is performed to identify trends in layoffs across companies, industries, and time periods.

cleaning-data data database dataset mysql mysql-database sql

Last synced: 07 Apr 2025

https://github.com/lancewalk87/cls-cloud-sync-ruby-on-rails

Software | SQL Database with automated Cloud Sync for mitigating lost data across dist. servers. Managed by Ruby on Rails.

cloud-computing cloud-storage data database ruby ruby-application ruby-on-rails server sql

Last synced: 24 Jul 2025