data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum
- Last updated: 2026-07-02 00:07:45 UTC
https://github.com/bishtrishu/super_store_sales_dashboard
This repository contains a comprehensive sales analysis dashboard for a Superstore, created using Power BI. The objective is to contribute to the success of a business by using data analysis techniques, with a particular focus on time series analysis, to provide valuable insights and accurate sales forecasts.
analytics data data-science dataanalysis dataanalyst datacleaning datascience datavisualization-project excel microsoft-azure microsoft-excel powerbi report sql
Last synced: 28 Feb 2026
https://github.com/mnkanout/patients_medication_prediction
The aim of the project is to create a model that can help medical professionals select the proper medication for patients based on their symptoms. The model uses historical data of other patients to predict what could be the most suitable medication based on the patient's symptoms.
data data-analysis data-science data-visualization decision-tree-classifier machine-learning python3
Last synced: 29 Jun 2025
https://github.com/sumaiyyaf/british-airline-dashboard
This Tableau dashboard visualizes British Airways customer reviews, showcasing key metrics like average ratings for service, entertainment, and seat comfort. It features interactive filters for exploring ratings by aircraft type, country, and traveler type, along with trend analysis over time.
analysis dashboard data tableau visualization
Last synced: 13 Feb 2026
https://github.com/dushansenadheera/web_scraper
Web scraper using Python along with BeautifulSoup and Selenium
beautifulsoup data python selenium web-scraping
Last synced: 19 Jun 2026
https://github.com/equinor/fmu-sumo-uploader
Upload to Sumo in the FMU context
data fmu python subsurface sumo
Last synced: 06 May 2026
https://github.com/beastbytes/n6l-phone-number-data-php
NationalPhoneNumerInterface implementation using PHP for storage
data itu-t0202 phone-number php yii3
Last synced: 08 Feb 2026
https://github.com/sidneyarcidiacono/data-parser
A node module designed to make reading in large files as easy as calling one function.
Last synced: 05 May 2026
https://github.com/cmda-tt/course-25-26
🎓 tech track · 2025-2026 · curriculum and syllabus 📊
d3 data datavis functional javascript programming research svelte visualization
Last synced: 20 Jan 2026
https://github.com/imartinezl/madrid-challenge
Madrid Route Optimization Challenge 🚚♻️🚚
challenge city data optimization routing-algorithm traffic
Last synced: 28 Feb 2026
https://github.com/srgchrksv/stream-crypto
Crypto trades streaming with Azure services
azure binance crypto data databricks dataengineering pyspark python streaming websocket
Last synced: 30 Apr 2026
https://github.com/molinsagustin/cinedata
CineData: group practical assignment for the course Ingeniería de Datos I (Data Engineering I) at the Universidad Argentina de la Empresa. It consisted of developing a relational database in Microsoft SQL Server Management Studio using the agile Scrum methodology, which was applied from requirements gathering through to the final implementation.
agile data data-modeling database diagram entity-relationship-diagram microsoft-sql-server relational-databases relational-model scrum scrum-agile sql sqlserver
Last synced: 28 Feb 2026
https://github.com/rylan12/apscores
A quick way to visualize how the AP score distributions have changed from year to year.
advanced-placement analysis ap-exam data scores
Last synced: 19 Jun 2026
https://github.com/sunnahboy/checkfake_true_news
Building data structures using linked lists and arrays and finding the best algorithms for implementing a fake news detection system
algorithms data level low programming structure
Last synced: 28 Feb 2026
https://github.com/team810/frcs
FRCS is an online, international, crowd-sourced data collection tool written for the FRC competitions. It was created by Team 810, The Mechanical Bulls.
Last synced: 14 Mar 2025
https://github.com/writetome51/page-load-access
A TypeScript/JavaScript class that loads a batch (array) of data from a larger set too big to be loaded all at once.
batch class data javascript load loader typescript
Last synced: 16 May 2026
https://github.com/greatwoman23/car_insurance_analysis
The Car Insurance Analysis project aims to provide a comprehensive examination of a car insurance portfolio using advanced data analytics tools. The analysis offers valuable insights into policy demographics, claims patterns, and financial metrics, helping stakeholders make informed decisions.
bigquery data data-science dataanalytics insurance-claims looker-studio tableau
Last synced: 03 Feb 2026
https://github.com/priyapuranik/data-analytics-using_python
Analyzes hotel data to find meaningful insights, including booking patterns, seasonal trends, and more.
data pandas python sql visualization
Last synced: 06 Apr 2026
https://github.com/vaxdata22/cyclistic-ride-sharing-company
This is my Google Data Analytics Certificate case study for the Cyclistic ride-sharing company
actionable-insights business-analytics business-intelligence data data-analytics data-cleaning data-mining data-visualization data-wrangling exploratory-data-analysis google-data-analytics spreadsheets sql sql-server sql-server-management-studio statistical-analysis t-sql tableau transact-sql
Last synced: 10 Jun 2026
https://github.com/nevoland/unchangeable
🧊 Tools for immutable values.
data datastructure functional immutable persistent pure stateless
Last synced: 24 Jul 2025
https://github.com/ttozatto/sparkify
Churn Prediction for music streaming app with PySpark
analysis churn data learning machine predictive pyspark science spark
Last synced: 16 Jan 2026
https://github.com/lijesh010/roadaccidentanalysisproject
This data analysis project was completed using MS Excel, and includes the creation of a dashboard.
data data-analytics data-exploration data-visualization msexcel
Last synced: 15 Feb 2026
https://github.com/dawidolko/datafusion-app-python
Project as part of the Data Warehousing subject.
academic-project data dataprocessing extraction gui loading project pysimplegui python transformation
Last synced: 15 Feb 2026
https://github.com/nmelgar/birthday_sports_dataviz
We will analyze how the Matthew Effect has influenced professional sports players.
analysis csv data data-analysis data-science data-visualization datavisualization dataviz probability research tableau
Last synced: 08 Jan 2026
https://github.com/carlosrs14/parallel-data-preprocessig-system
A parallel data preprocessing system using threads and synchronization mechanisms (barrier, busy-waiting, condition variables) to clean and prepare data for AI training.
barrier-method c condition-variable data operative-systems parallel-computing posix preprocessing synchronization threads
Last synced: 24 Jul 2025
https://github.com/arnocan/yapydata
yapydata provides miscellaneous low-level Python data access APIs.
data datastructures ini json properties python python2 python3 xml yaml
Last synced: 16 Feb 2026
https://github.com/iankitnegi/statistically_speaking
Explore diverse projects showcasing statistical techniques with real-world data, comprehensive docs, and interactive visualizations.
data excel statistical-analysis statistics
Last synced: 09 Feb 2026
https://github.com/tks18/xl-pq-handler
A Pythonic Power Query (.pq) File Manager for Excel & Power BI Automation
analytics automation data excel power-query powerbi python xlwings
Last synced: 20 Jan 2026
https://github.com/natarizkie2/neurochain-airdrop-bot
🍋 — A smart bot designed to complete data tasks like true/false selections automatically, with multi-account support for extra convenience.
airdrop automated bot data multi-account natarizkie neurochain nodejs web3
Last synced: 10 Jun 2026
https://github.com/taquece/goals-per-match
Basic script to calculate average football goals per match from a .CSV file
beginner csv data football nodejs python sports-analytics
Last synced: 09 May 2026
https://github.com/artcc/coredatademo
Demo for CoreDataGenericModule implementation
core coredata coredata-model data encrypted encrypted-data encryption persist
Last synced: 19 Jun 2026
https://github.com/lancewalk87/cls-cloud-sync-ruby-on-rails
Software | SQL database with automated cloud sync for mitigating lost data across distributed servers. Managed by Ruby on Rails.
cloud-computing cloud-storage data database ruby ruby-application ruby-on-rails server sql
Last synced: 24 Jul 2025
https://github.com/badranalyst/data-cleaning-and-exploratory-data-analysis-project
This project uses SQL to clean and analyze a layoffs dataset. Data cleaning tasks include removing duplicates, standardizing values, and handling missing data. Exploratory analysis is performed to identify trends in layoffs across companies, industries, and time periods.
cleaning-data data database dataset mysql mysql-database sql
Last synced: 07 Apr 2025
https://github.com/greedchikara/dsajs
Data Structures and Algorithms written in JavaScript
Last synced: 09 Apr 2026
https://github.com/coderjolly/spotify-api-data-analysis
The project uses Apache Airflow to automate analysis of Spotify API data, focusing on user activity. It extracts, transforms, and loads the data, then provides insights via Power BI dashboards.
airflow airflow-dags data data-engineering etl etl-pipeline microsoft-sql-server power-bi python scripting sql
Last synced: 27 Mar 2026
https://github.com/gman-au/white-knight-neo4j
Neo4j implementation of White Knight data abstraction library
abstractions data datastore dotnet neo4j repository-pattern specification-pattern
Last synced: 20 Jan 2026
https://github.com/inzhenerka/scooters_data_generator
Generate data of scooter trips for analysis
Last synced: 02 Jun 2026
https://github.com/bilgehangecici/datatypeconverter
Converting integer and floating-point numbers to their appropriate bit-level representation.
data datatypeconverter java machine-level variables
Last synced: 30 Mar 2025
https://github.com/neptun-software/neptun.data.generators
Send scraped data from neptun-scraper to ChatGPT to generate training data for NEPTUN.AI.
Last synced: 30 Jul 2025
https://github.com/ate47/playerdata
Get data about a player with a command
bukkit-plugin command data spigot-plugin
Last synced: 30 Aug 2025
https://github.com/tupizz/python-data-manipulation
Data manipulation and visualization with Python 2.x
Last synced: 09 May 2026
https://github.com/jillmpla/kaggle_notebooks
Kaggle-based data analysis, data science, and data visualization.
data data-science data-visualization kaggle machine-learning
Last synced: 16 Apr 2026
https://github.com/abhroroy365/market_analysis
This project explores customer segmentation and market analysis in the context of online retail using an online retail dataset. By applying advanced analytics, we aim to uncover insights that can drive strategic decisions and enhance business performance.
clustering data data-analysis data-visualization kmeans-clustering machine-learning market-analysis python silhouette-analysis
Last synced: 09 May 2026
https://github.com/flexthink/matricize
A convenience library to convert between pure Python objects and their vectorized representations
data machine-learning numpy python
Last synced: 09 May 2026
https://github.com/erickpeirson/jhb-data
Data from the forthcoming paper: Quantitative Perspectives on Fifty Years of the Journal of the History of Biology
data geolocation history-of-biology named-entity-recognition topic-modeling
Last synced: 04 Mar 2026
https://github.com/lucasnbsb/data-structures-and-algorithms
Studying data structures and algorithms, mostly on leetcode
Last synced: 29 Aug 2025
https://github.com/miozilla/fraudfinder
fraudfinder: Historical Payment Transactions # Fraud Detection # EDA # Feature Store # Model Registry
analysis data exploratory feature-store fraud-detection
Last synced: 29 Aug 2025
https://github.com/hubtou/adsv
Analyze delimiter-separated values files
command-line-tool csv csv-converter csv-format csv-parser csv-parsing csv-reader csv-reading data data-analysis data-engineering data-mining learning-python pnu-project python servier shell tools unix utility
Last synced: 17 Apr 2026
https://github.com/udhaya2823/microsoft---classifying-cybersecurity-incidents-with-machine_learning
🚨 Microsoft: Classifying Cybersecurity Incidents with Machine Learning 🔐 This project leverages the power of machine learning to classify cybersecurity incidents, improving the efficiency of Security Operation Centers (SOCs) at Microsoft. We train a model to predict incident grades, helping analysts prioritize threats with precision 🎯.
classification data feature-engineering iqr-method machine-learning matplotlib model-evaluation modelselection predictive-modeling python sklearn
Last synced: 17 Apr 2026
https://github.com/ssiarhei115/countryhouse-price-prediction
ML modeling for house price prediction in Belarus
big-data data data-science fullstack fullstack-development mashine-learning parsing parsing-engine
Last synced: 28 Aug 2025
https://github.com/darshjasani/insurance-claim-analysis
This dataset contains insightful information related to insurance claims, giving us an in-depth look into the demographic patterns of those receiving them.
Last synced: 27 Aug 2025
https://github.com/master-helix/ibm-data-analyst-certification-stock-analysis-project
This is a mini-project repository for my IBM certification, involving stock analysis and plotting for Tesla and GameStop
analytics data data-analysis data-visualization ibm matplotlib pandas python web-scraping
Last synced: 09 May 2026
https://github.com/jwszolek/accelerated-data-generator
Ultra-fast random data generator. It lets you generate almost 1M rows in around a second.
bash csv data data-generator generator shell
Last synced: 02 Apr 2026
https://github.com/ssiarhei115/shop-customers-segmentation
Shop customers segmentation
data data-analysis data-science data-visualization
Last synced: 24 Aug 2025
https://github.com/anuragagarwal96/hospital-mortality-rate-sql-analysis
In this project, I took a hospital dataset from Kaggle, analysed it, and predicted the mortality rate of patients admitted to hospitals. I used a combination of SQL, Tableau, and Microsoft Excel for this project.
data data-visualization dataanalysis dataanalysisusingsql excel msexcel mssqlserver sql tableau tableau-public
Last synced: 09 Mar 2026
https://github.com/canadaluke888/ttb2
TerminalTableBuilder 2
c17 csv data database datasets datautils json ncurses ods spreadsheet sqlite3 tables terminal terminaltablebuilder terminaltablebuilder2 ttb ttb2 ttbx xlsx
Last synced: 10 Apr 2026
https://github.com/cfloressuazo/academic-kickstart
This is my personal website :)
analytics blog data data-engineering data-science personal technology
Last synced: 17 Apr 2026
https://github.com/ahmad-ali-rafique/wine-quality-dataset
Comprehensive analysis and modeling of the Wine Quality dataset, including exploratory data analysis (EDA), data preprocessing, model training, and performance evaluation using MSE and RMSE.
analytics data datacleaning decision-tree-regression exploratory-data-analysis gradient-boosting-regressor linear-regression machine-learning mean-square-error model
Last synced: 21 Aug 2025
https://github.com/foreteternelle/pokemonstudiodataapi
The GitHub repository of the Pokémon Studio Data Api
Last synced: 02 Apr 2026
https://github.com/amethyst-php/company
amethyst amethyst-package api company data laravel
Last synced: 17 Apr 2026
https://github.com/rawdaabdelsalam42/data-cleaning-sql-python-powerbi
Data cleaning project for an e-commerce sales dataset using Python (Pandas) for preprocessing, SQL Server for queries, and Power BI for building an interactive dashboard visualization.
dashboard data data-engineering pandas powerbi python sql
Last synced: 17 Apr 2026
https://github.com/umrlastig/global-local
The Global-Local loop: bridging the gap between geospatial communities
challenges communities data fusion gaps geospatial perspectives
Last synced: 03 Apr 2026
https://github.com/epomatti/az-data-services
End-to-end scenario for Azure data services.
azure data data-engineering databricks datalake lake synapse terraform
Last synced: 17 Apr 2026
https://github.com/rrohitramsen/expression-evaluator
Expression Evaluator + Tree Data Structure + Postorder Traversal + REST API + Spring Boot
data data-structures design-patterns json microservice postorder problem-solving spring-boot swagger-api swagger-docs swagger-ui tree tree-structure
Last synced: 04 Apr 2026
https://github.com/mohamedbilal1800/olympic_history_data_analysis
This project delves into the 120 Years of Olympic History: Athletes and Results dataset, analyzing athlete demographics, medal achievements, and country performances across the Summer and Winter Olympics from 1896 to 2016.
analysis data eda matplotlib-pyplot pandas python seaborn visulaization
Last synced: 09 May 2026
https://github.com/ahmad-ali-rafique/decision-tree-regressor-modeling
Comprehensive exploration of decision tree regressors, including data cleaning, model building, and performance evaluation on various datasets.
artificial-intelligence data data-analysis dataanalytics decision-trees decisiontreeregressor modeling models regression-models
Last synced: 17 Apr 2026
https://github.com/awhipp/forex-api-export
API service that pulls forex data and returns a CSV file based on the parameters
data forex forex-trading oanda oanda-api-v20 trading
Last synced: 04 Jun 2026
https://github.com/bhavanachitragar/layoff_analysis
This Streamlit app is designed for Layoff Analysis. It allows users to explore and analyze layoff data from different perspectives, including overall analytics, country-specific insights, and individual company details.
data dataanalysis streamlit streamlit-webapp
Last synced: 18 Apr 2026
https://github.com/zurd46/zurdsynthdatagen
This Electron project uses the OpenAI ChatCompletion API to generate synthetic datasets in either German (DE) or English (EN).
data data-structures dataset electron json jsonl nodejs openai synthetic
Last synced: 04 Apr 2026
https://github.com/karolkrupa/javascript-orm-mapper
ORM mapping library, especially for REST APIs
api data data-mapper entity es6 javascript mapper model mongo mysql node nuxt orm relational rest typescript vue vuex
Last synced: 10 Apr 2026
https://github.com/urvish-06/seaborn-dataset
Seaborn data sets
csv csv-files data data-science data-visualization dataset example jupyter-notebook jypyternotebook python seborn vacation
Last synced: 18 May 2026
https://github.com/mipacd/holochatstats
A VTuber chat log (and general) analytics platform
data flask hololive postgresql python visualization vtuber youtube
Last synced: 05 Apr 2026
https://github.com/h4fide/politicalcompassbot
This Python project allows you to take a quiz and find out where you fit on the political compass. Give it a try and see where you stand!
bot data greedy-algorithms politics python python3 sql telegram
Last synced: 19 Aug 2025
https://github.com/stimulsoft/samples-dashboards.web-for-blazor-webassembly
Blazor WebAssembly (Wasm) samples for Reports.BLAZOR embedded components, Visual Studio C# projects, .NET 6, .NET 7, .NET 8 dashboards tool
blazor client-side converter dashboard data data-analysis data-sources database datagrid designer diagram dimension json net presentation print runtime viewer wasm webassembly
Last synced: 18 Apr 2026
https://github.com/progati00/marketing-mix-modeling-mmm-for-marketing-budget-optimization
A Marketing Mix Modeling (MMM) project using Python to analyze channel performance, calculate ROI, and simulate marketing budget changes for better business decisions. Includes a trained Linear Regression model, ROI analytics, and a Flask API for revenue prediction.
api budget-optimization data data-analysis data-science ecommerce eda flask jupyter-notebook linear-regression machine-learning marketing-analytics marketing-mix-modeling python roi-analysis vscode
Last synced: 14 Apr 2026
https://github.com/huemulsolutions/huemul_sql_decode
Gets the fields and tables used in a SQL statement
bigdata chile data data-governance governance spark sql
Last synced: 19 Apr 2026
https://github.com/sulujulianto/population-data-retrieval-and-analysis
I created a simple program that can be used to search for global population data or population data from various countries using Python.
Last synced: 09 Mar 2026
https://github.com/istinnew/etl-pipeline-ganz-project
End-to-end ETL pipeline project for collecting, transforming, and loading data into a cloud-based database using Python, MySQL, and Google Cloud Analytics
cloud cloud-engineering cloud-services data data-science dataanalytics database database-schema googlecloud mysql mysql-database python python-lambda
Last synced: 20 Apr 2026
https://github.com/omers/sre-devops-tools
Tools and useful sources for SRE and DevOps
awsome awsome-list data devops monitoring sre tools
Last synced: 20 Apr 2026
https://github.com/rijkvanzanten/ds-fa-1
The first final assignment for the data structures class
assignment data final map now parsons structures thenewschool
Last synced: 04 Oct 2025
https://github.com/jacopodl/jcollections
Common data structures for the C language
c collections data data-structures jcollections
Last synced: 30 Jul 2025
https://github.com/farhad2415/Job_Scraper
Job-site-based job scraping with Python
automation bash-script data data-scraping data-structures python selenium selenium-python
Last synced: 15 Aug 2025
https://github.com/yashkp1234/movie-recommendation-engine
My project on analyzing a movie data set and creating a recommendation engine using that analysis.
analysis data notebook python recommendation-engine
Last synced: 04 May 2025
https://github.com/andrii04/andreamonforte-bi-assignment
Automated Data Pipeline that ingests daily GA4-formatted CSV files from a private Google Cloud Storage bucket, validates and loads them into BigQuery, and prepares analysis-ready views. The solution is built for deployment as a Cloud Function triggered by Cloud Scheduler and uses Python with the Google Cloud Storage and BigQuery client libraries.
automation bigquery cloud cloudfunctions data data-analysis data-engineering etl etlpipeline gcp google googlecloudplatform pipeline python sql
Last synced: 09 Nov 2025
https://github.com/ashamethedestroyer/data-structures
Dedication of all Data Structures Creation 🛠
cpp data data-structures implementation implementation-of-data-structures structure structured-data
Last synced: 23 May 2026
https://github.com/ahmad-ali-rafique/heart-disease-detection-model
A comprehensive project for detecting heart disease using machine learning, including data processing, model training, and evaluation metrics with AUC curve analysis.
artificial-intelligence data datascience heart-disease machine-learning modeling prediction-model
Last synced: 11 Aug 2025
https://github.com/mozzo1000/web-analytics
Website analysis tools and data
analysis analytics data website
Last synced: 21 Apr 2026
https://github.com/fabsdevx/files-to-database-loader-handout
Data Engineering project for learning purposes. Credits to itversity
csv data data-engineering database json pandas python
Last synced: 09 Apr 2026
https://github.com/snickerdoodlelabs/whitepaper
LaTeX files for the protocol whitepaper.
data latex pdf self-custody snickerdoodle whitepaper zero-knowledge
Last synced: 21 Apr 2026