data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum,
- Last updated: 2026-07-02 00:07:45 UTC
- JSON Representation
https://github.com/nmelgar/birthday_sports_dataviz
We will analyze how the Matthew Effect has influenced in professional sports players.
analysis csv data data-analysis data-science data-visualization datavisualization dataviz probability research tableau
Last synced: 08 Jan 2026
https://github.com/aimin-nur/data-analyst-model-predictive
Sebuah Project data analyst yang bertujuan untuk mengindentifikasi karakteristik customer untuk menerima penawaran campaign marketing.
analyst data mechine-learning visualization
Last synced: 29 Jan 2026
https://github.com/carlosrs14/parallel-data-preprocessig-system
A parallel data preprocessing system using threads and synchronization mechanisms (barrier, busy-waiting, condition variables) to clean and prepare data for AI training.
barrier-method c condition-variable data operative-systems parallel-computing posix preprocessing synchronization threads
Last synced: 24 Jul 2025
https://github.com/jurooravec/knwldg
Datasets, scrapers, pipelines
companies crawler data dataset non-profit-organizations scraper scrapy
Last synced: 13 Jun 2026
https://github.com/snimmagadda1/luigi-etl-example
π Example of an ETL pipeline using Spotify's Luigi
data luigi luigi-pipeline python spotify
Last synced: 30 Mar 2025
https://github.com/data-forge-notebook/ohlc-aggregation-example
An example of aggregating OHLC stock data using Data-Forge Notebook
algorithmic-trading data data-aggregation data-analysis ohlc quantitative-finance share-market stock-market trading
Last synced: 30 Jan 2026
https://github.com/team-hydrogen/2025-adc-data
All files relating to the computation of the data provided
data jupyter-notebook nasa-app-development-challenge
Last synced: 11 Apr 2025
https://github.com/piyushkumar2025/india-general-elections-2024_data-analyst
Analyzed election data for 540+ constituencies and 100+ parties using SQL. Calculated state-wise seat distributions, classified 30+ parties into alliances, identified top 10 candidates by EVM votes, calculated victory margins, and analyzed voting patterns for 300+ candidates to uncover key insights.
analytics data database mysql sql statistics
Last synced: 22 May 2026
https://github.com/badranalyst/data-cleaning-and-exploratory-data-analysis-project
This project uses SQL to clean and analyze a layoffs dataset. Data cleaning tasks include removing duplicates, standardizing values, and handling missing data. Exploratory analysis is performed to identify trends in layoffs across companies, industries, and time periods.
cleaning-data data database dataset mysql mysql-database sql
Last synced: 07 Apr 2025
https://github.com/abendayan/orm
Lightweight orm
cli dao data database database-management javascript mysql node node-js nodejs orm ormius ormius-cli schema
Last synced: 25 Feb 2026
https://github.com/lut-ful/pizza-sales-report
This Pizza Sales Report provides valuable insights into sales performance through detailed analysis and visualizations. By leveraging Power BI and SQL Server
data data-wrangling microsoft-sql-server power-bi power-bi-dax python
Last synced: 30 Jan 2026
https://github.com/brianlesko/postresql-docker
Run a postgreSQL server hosted in a docker container, and start a webUI for basic querying
basics container containerization containers data data-science docker postgres postgresql sql template
Last synced: 31 Jan 2026
https://github.com/amazenmb/web-scraping
Web Scraping Methods using Python
analytics beautifulsoup data lxml pyautogui-automation python scheduling schedulingscraping selenium webdriver webscraping xpath
Last synced: 06 May 2026
https://github.com/opendatach/alds
a colaborative list of resources and ideas to enable "Amt Local Data Stewards" to manage the (open) data of their respective federal office
awesome-list data datagovernance dataliteracy datamanagement datastewardship opendata opengovernmentdata
Last synced: 31 Jan 2026
https://github.com/agdturner/ccg-data
A modularised Java library for processing data sets with classes for: data records; collections of data records; and identifiers.
Last synced: 12 Jan 2026
https://github.com/word2vect/beijing-new-house-data-visualization
Beijing New House Data Visualization for Python Programming 2024 Fall Data Visualization Lab
Last synced: 13 Jun 2026
https://github.com/bilgehangecici/datatypeconverter
Converting integer and floating numbers to appropriate bit-level representation.
data datatypeconverter java machine-level variables
Last synced: 30 Mar 2025
https://github.com/olekscode/datageneration
Exploring the methods of data generation for different Machine Learning algorithms
data javascript machine-learning
Last synced: 05 Apr 2025
https://github.com/rissh/titanicsurvivalpredictionusingml
Predicting Titanic passenger survival through machine learning. This project includes data preprocessing, exploratory data analysis, feature engineering, and model training using Python. π’
data data-analysis data-science data-visualization dataanalysis jupiter-notebook machine-learning machine-learning-algorithms machinelearning matplotlib numpy pandas prediction prediction-model python python3 seaborn tenserflow tflearn titanic
Last synced: 01 Feb 2026
https://github.com/okieraised/rke2-deployment
Single-node RKE2 deployment
data helm helm-charts helm-deployment rke2
Last synced: 17 Mar 2026
https://github.com/roggersanguzu/weather-medical-expense-prediction-ml-models
This repo contains a model for determining the rainfall patterns and another for medical expense prediction model
data data-analysis data-science datasets joblib machine-learning machine-learning-algorithms scikitlearn-machine-learning
Last synced: 30 Aug 2025
https://github.com/mehmetkahya0/earthquake-tracker
Earthquake Tracker, A real-time earthquake monitoring application that visualizes seismic activity worldwide using interactive maps and data visualization.
ai api css cursor data data-vizualisation earth-observation earthquake earthquake-data earthquake-visualization earthquakes html js modern-web scrape ui ui-design web
Last synced: 15 Apr 2026
https://github.com/mahtabranjbar/onlineshopping_analysis_dashboard
This project analyzes online shopper behavior using various machine learning models and EDA techniques.
dashboard data dataanalysis eda machine-learning streamlit
Last synced: 08 Feb 2026
https://github.com/iankitnegi/statistically_speaking
Explore diverse projects showcasing statistical techniques with real-world data, comprehensive docs, and interactive visualizations.
data excel statistical-analysis statistics
Last synced: 09 Feb 2026
https://github.com/danicaalana/wine-dataset-decision-tree
This project is developed as part of Digital Skill Fair (DSF) 35.0 - Data Science by Dibimbing. I am using Wine Recognition Dataset from scikit-learn, which is the results of a chemical analysis of wines grown in the same region in Italy by three different cultivators.
data data-analysis-python data-science decision-tree-classification machine-learning python scikit-learn wine-dataset
Last synced: 18 Apr 2026
https://github.com/matt-dray/draytasets
:1234::disguised_face: Miscellaneous datasets I've collected or prepared
Last synced: 09 Feb 2026
https://github.com/schoolsquirrel/holiday-data
Automatically updated holiday data for SchoolSquirrel
data holidays schoolsquirrel scripts vacation
Last synced: 03 Oct 2025
https://github.com/whis99/data_analysis_journey
A repositories of my data analysis projects.
data data-analysis data-analysis-python data-visualization dataset jupyter-notebook matplotlib python visualization
Last synced: 07 May 2026
https://github.com/darshjasani/insurance-claim-analysis
This dataset contains insightful information related to insurance claims, giving us an in-depth look into the demographic patterns of those receiving them.
Last synced: 27 Aug 2025
https://github.com/nicolaeiotu/dbindjs
Data Binding for Javascript
bind binding data data-binding databind dbind dbindjs javascript
Last synced: 09 Feb 2026
https://github.com/fnu-ankit/nyc_parking_violation
data dataengineering dbt githubactions python
Last synced: 16 Apr 2026
https://github.com/n4en/python-for-data-engineers
Python for data engineers
data data-engineer data-engineering dataengineering python python-notebooks python3 tutorial
Last synced: 26 Aug 2025
https://github.com/robertoostenveld/bird
BagIt Research Data
bagit data fair open-datasets repository
Last synced: 18 Mar 2026
https://github.com/os-climate/data-requests
This repo is used to track issues related to new Data Requests
Last synced: 27 Feb 2026
https://github.com/0xnu/data-analyst-training
The repository contains training materials for data analysts.
data data-analysis data-analyst
Last synced: 25 Aug 2025
https://github.com/utrechtuniversity/momentum-dataflow
Repository for publishing website about data management practices of the Momentum project
data datageneration datamanagement
Last synced: 27 Feb 2026
https://github.com/ssiarhei115/shop-customers-segmentation
Shop customers segmentation
data data-analysis data-science data-visualization
Last synced: 24 Aug 2025
https://github.com/luminati-io/google-maps-dataset-samples
A sample dataset of over 1000 Google Maps businesses, extracted using the Bright Data API, ideal for competitor analysis, location-based marketing, and market strategies.
api data dataset google-maps maps web-scraping
Last synced: 03 Jan 2026
https://github.com/ppabam/eda-bam
Navigating data from one thing to another.
Last synced: 11 Feb 2026
https://github.com/anandanraju/power_bi_dashboard_projects
The goal of this project is to provide insights into consumer behavior and purchasing trends across different platforms. By analyzing data from Amazon and other sources, we aim to uncover valuable insights that can inform marketing strategies, product development, and decision-making processes.
amazon dashboard data data-visualization healthcare powerbi project
Last synced: 11 Feb 2026
https://github.com/pbinkley/tweets-national-emergency-library
A twarc harvest of tweets related to Internet Archive's National Emergency Library (2020-03-23 to 2021-02-13)
Last synced: 11 Feb 2026
https://github.com/paulrosset/cyclone
Network data consumption monitoring
data monitoring network networking
Last synced: 23 Aug 2025
https://github.com/canadaluke888/ttb2
TerminalTableBuilder 2
c17 csv data database datasets datautils json ncurses ods spreadsheet sqlite3 tables terminal terminaltablebuilder terminaltablebuilder2 ttb ttb2 ttbx xlsx
Last synced: 10 Apr 2026
https://github.com/nouraalgohary/fifa-world-cup-data-analysis
data dataanalysis powerbi powerbi-visuals
Last synced: 19 Mar 2026
https://github.com/ahmad-ali-rafique/wine-quality-dataset
Comprehensive analysis and modeling of the Wine Quality dataset, including exploratory data analysis (EDA), data preprocessing, model training, and performance evaluation using MSE and RMSE.
analytics data datacleaning decision-tree-regression exploratory-data-analysis gradient-boosting-regressor linear-regression machine-learning mean-square-error model
Last synced: 21 Aug 2025
https://github.com/pawamoy/keycut-data
Keyboard shortcuts data stored in YAML files
Last synced: 12 Feb 2026
https://github.com/foundationallm/.github
A platform accelerating delivery of secure, trustworthy enterprise copilots.
agent ai data enterprise generative-ai large-language-model llm ml tool
Last synced: 12 Feb 2026
https://github.com/justinhennis1/hackathon24
Hofstra's Hacknology Competition 2024 - Team Null Pointers
data data-analysis data-science data-visualization data-visualization-python dataanalysis dataanalytics traveling web webapplication
Last synced: 21 Aug 2025
https://github.com/giscience/measures-rest-oshdb-app
A frontend for providing measures for geospatial datasets, using the OSHDB
data dggs geospatial measure openstreetmap rest
Last synced: 20 Apr 2026
https://github.com/anderson-andre-p/datastructuresandalgorithms
Data Structures and Algorithms to Study
data data-structures data-structures-and-algorithms dsa dsa-algorithm dsa-learning-series structured-data
Last synced: 20 Aug 2025
https://github.com/bastianolea/plebiscitos_chile
Datos de resultados electorales de los plebiscitos constitucionales de 2022 y 2023
chile comunas data elecciones politica social
Last synced: 15 Jun 2026
https://github.com/pocketfullofdata/electric-vehicles-market-size-analysis
This project analyzes the growth, adoption trends, and future projections of the electric vehicle (EV) market. Using data analysis and visualization techniques, it examines key factors like sales trends, and consumer adoption to understand the evolving landscape of the EV industry.
analysis data jupyter-notebook matplotlib numpy python seaborn vscode
Last synced: 07 May 2026
https://github.com/rachelresende/projeto-finan-as
Este repositΓ³rio Γ© referente a um curso de anΓ‘lise de dados para finanΓ§as que realizei em 2025 na Udemy.
analytics data financas finance finance-management
Last synced: 19 Aug 2025
https://github.com/rugwiroparfait/alx_sql
This repo is where I save my queries and learning materials in Data Science program from ALX
anaconda data data-analysis jupyter-notebook sql
Last synced: 19 Aug 2025
https://github.com/danyal-faheem/project-logs-analyzer
This repo contains scripts to analyze project logs and display some charts related to the data
data data-visualization matplotlib pandas python streamlit
Last synced: 07 May 2026
https://github.com/h4fide/politicalcompassbot
This Python project allows you to take a quiz and find out where you fit on the political compass. Give it a try and see where you stand!
bot data greedy-algorithms politics python python3 sql telegram
Last synced: 19 Aug 2025
https://github.com/spajai/etl-sharepoint-data-uploader-pipeline
Custom Python Script to Pull specific data from source and Upload to the Microsoft SharePoint
data etl etl-pipeline microsoft microsoft365 python3 sharepoint sharepoint-online
Last synced: 11 Nov 2025
https://github.com/KarajMiglani-DataScientist/karajmiglaniFAKE-NEWS-DETECTION
FAKE_NEWS_PREDICTION
algorithms data data-science flask machine-learning probability-statistics python statistics structure
Last synced: 19 Aug 2025
https://github.com/saritaphd/predicting-performance-of-students---complete-ml-project-with-deployment-using-aws
Student performance analysis with deployment (End to end ML project)
aws data data-science deployment jupyter-notebook machine-learning python visualization
Last synced: 10 Apr 2026
https://github.com/bhenk/msdata-d
MySql DAO
dao data data-layer database mysql mysql-database mysqli
Last synced: 07 May 2026
https://github.com/liolb/sql2csv
Export SQL Server Table data to CSV
automation csv data database export extraction powershell scripting sql sql-server sql-table
Last synced: 08 May 2026
https://github.com/guardias-eu/reasin
Interface to the European Alien Species Information Network API
api biodiversity biodiversity-data biodiversity-informatics data invasive-species oscibio r r-package
Last synced: 04 Oct 2025
https://github.com/sulujulianto/population-data-retrieval-and-analysis
I created a simple program that can be used to search for global population data or population data from various countries using Python.
Last synced: 09 Mar 2026
https://github.com/gourab337/karnataka-health-visualizer
Visualizer for Karnataka's district-wise healthcare info built using PHP
Last synced: 19 Mar 2026
https://github.com/zsvoboda/olympics
Self service analytics of 120 years of Olympics data
analytics dashboards data datavisualization dataviz olympics open-data open-datasets opendata reports
Last synced: 08 May 2026
https://github.com/juanpablo70/pgad-assignment01
Breast Cancer Coimbra data set analysis
data data-science dataframe dataset jupyter-notebook matplotlib numpy pandas python
Last synced: 08 May 2026
https://github.com/rijkvanzanten/ds-fa-1
The first final assignment for the data structures class
assignment data final map now parsons structures thenewschool
Last synced: 04 Oct 2025
https://github.com/spiraldb/spiraldb-nemo-curator
SpiralDB connectors for NVIDIA NeMo Curator
computer-vision data data-curation data-prep data-preparation data-processing data-quality datacuration datarecipes deduplication fast-data-processing multimodal multimodal-ai nvidia-nemo physical-ai python spiral vortex
Last synced: 15 Jun 2026
https://github.com/farhad2415/Job_Scraper
Job Site Based Job Scrapping with python
automation bash-script data data-scraping data-structures python selenium selenium-python
Last synced: 15 Aug 2025
https://github.com/supremkc05/global-job-market-analytics
Scrape jobs from websites like Indeed/LinkedIn, extract skills using NLP, then visualize hiring trends.
beautifulsoup data machine-learning nlp pandas scrapping
Last synced: 14 Aug 2025
https://github.com/hlan22/2025-03-18-data-validation
(no longer useful) DSCI 310 Lecture about Data validation and code testing! Made in tandem with:
Last synced: 23 Jun 2026
https://github.com/badranalyst/covid-deaths-dashboard-with-tableau
This project showcases an interactive dashboard developed in Tableau to visualize COVID-19 deaths data. It provides insights into trends, geographical distributions, and key metrics related to mortality during the pandemic. The dashboard aims to enhance understanding of the data, supporting public health analysis and decision-making.
covid-19 dashboard data data-analysis data-visualization dataset tableau tableau-dashboards visualization
Last synced: 02 Mar 2026
https://github.com/writetome51/page-load-access
A TypeScript/Javascript class that loads a batch (array) of data from a larger set too big to be loaded all at once.
batch class data javascript load loader typescript
Last synced: 16 May 2026
https://github.com/panodata/tikray
A compact data transformation engine.
data data-transformation data-transformation-pipeline data-transformer jmes jmespath jq jqlang json json-pointer json-transform json-transformation json-translate json-translator transformation transon
Last synced: 04 Oct 2025
https://github.com/aaisha-nexus/sql_company_insights
A beginner-friendly SQL project for managing employee records, departments, and sales transactions. Includes table creation, optimized queries, stored procedures, and window functions to extract business insights.
business-analytics data data-analysis dataanalysis-projects dataanalytics database-schema mssql-database query relational-databases sql sql-query ssms
Last synced: 12 Aug 2025
https://github.com/nia-cloud-official/influx-agents
Influx-CRD is a web application designed to facilitate data collection, recovery, and distribution for agents uploading data to a centralized database. It provides an intuitive interface for managing data collection from various sources, recovering lost or corrupted data.
broker collection data data- influx influx-agent
Last synced: 30 Jul 2025
https://github.com/amethyst-php/cycle
amethyst amethyst-package api cycle data laravel
Last synced: 17 May 2026
https://github.com/nagar2nd/financial-analysis-power-bi
This project analyzes financial and credit card usage data using Power BI and DAX, focusing on customer behavior, credit risk, and financial performance. It includes insights on spending trends, delinquency rates, churn indicators, and satisfaction scores to drive better financial management and customer retention strategies.
analysis data dax dax-functions dax-query excel powerbi
Last synced: 03 Mar 2026
https://github.com/keziatbnn/supervised-regression-salaryprediction
Make salary predictions based on years of experience using supervised regression.
data data-analysis-python data-prediction data-science python
Last synced: 11 Aug 2025
https://github.com/metapsy-project/data-depression-anxiety-transdiagnostic
Database of transdiagnostic treatment of depression and anxiety
Last synced: 01 Apr 2026
https://github.com/gunn/covid-19-scripts
Scripts for processing COVID-19 data - e.g. converting from absolute to per capita numbers, adding fine-grained data from more countries
covid-19 data geography typescript
Last synced: 17 May 2026
https://github.com/oroszgy/hunlp-resources
Scripts and resources for making spaCy understand Hungarian.
corpus-linguistics data hungarian hungarian-language hunlp magyarlanc model natural-language-processing nlp resources script spacy wikipedia
Last synced: 18 May 2026
https://github.com/andrii04/andreamonforte-bi-assignment
Automated Data Pipeline that ingests daily GA4-formatted CSV files from a private Google Cloud Storage bucket, validates and loads them into BigQuery, and prepares analysis-ready views. The solution is built for deployment as a Cloud Function triggered by Cloud Scheduler and uses Python with the Google Cloud Storage and BigQuery client libraries.
automation bigquery cloud cloudfunctions data data-analysis data-engineering etl etlpipeline gcp google googlecloudplatform pipeline python sql
Last synced: 09 Nov 2025
https://github.com/jigyasag18/power-bi-dashboard-project
The Ecommerce Sales Analysis Dashboard project utilizes Power BI to provide detailed insights into ecommerce sales data, enabling stakeholders to track key performance metrics and uncover trends. This interactive dashboard allows users to explore the data in real-time, offering features such as drill-down capabilities, customizable filters.
dashboard data data-visualization datacleaning datanalysis datanalytics datapreprocessing powerbi visulaization
Last synced: 04 Mar 2026
https://github.com/0xhericles/ufcg-geojson
GeoJSON file containing the blocks and buildings of the Federal University of Campina Grande.
data data-visualization geojson map open-source ufcg university
Last synced: 09 Feb 2026
https://github.com/arjunrao87/world-countries-graphql-api
GraphQL API for retrieving information about countries of the world
countries data database geographic-data geography graphql world
Last synced: 10 May 2026
https://github.com/hubtou/adsv
Analyze delimiter-separated values files
command-line-tool csv csv-converter csv-format csv-parser csv-parsing csv-reader csv-reading data data-analysis data-engineering data-mining learning-python pnu-project python servier shell tools unix utility
Last synced: 17 Apr 2026
https://github.com/ashita-ai/ashita-ai.github.io
Ashita AI - The island of misfit data tools
Last synced: 19 Feb 2026