data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum
- Last updated: 2026-06-27 00:07:33 UTC
https://github.com/deliprofesor/breast-cancer-detection-using-svm-with-smote-and-model-optimization
This project analyzes health and lifestyle factors influencing heart attack risk using statistical methods and machine learning, with Ridge Regression identified as the best predictive model.
classification data data-preprocessing data-science data-visualization gridsearchcv machine-learning python roc-curve smote svm
Last synced: 10 Apr 2025
https://github.com/stdlib-js/ndarray-vector-uint32
Create an unsigned 32-bit integer vector (i.e., a one-dimensional ndarray).
constructor ctor data javascript ndarray node node-js nodejs stdlib structure types uint32 vec vector
Last synced: 25 Apr 2026
https://github.com/soenneker/soenneker.dtos.requestdataoptions
A flexible request options object for paging, sorting, and filtering queryable data, similar to OData-style parameters.
controller coordinator csharp data dotnet dto dtos http manager object odata options request requestdataoptions
Last synced: 12 Mar 2026
https://github.com/bishtrishu/super_store_sales_dashboard
This repository contains a comprehensive sales analysis dashboard for a Superstore, created using Power BI. The objective is to contribute to the success of a business by using data analysis techniques, especially time series analysis, to provide valuable insights and accurate sales forecasting.
analytics data data-science dataanalysis dataanalyst datacleaning datascience datavisualization-project excel microsoft-azure microsoft-excel powerbi report sql
Last synced: 28 Feb 2026
https://github.com/bastianolea/plebiscitos_chile
Electoral results data from the 2022 and 2023 constitutional plebiscites
chile comunas data elecciones politica social
Last synced: 15 Jun 2026
https://github.com/vatshayan/youtube-user-analysis
Analysis of YouTube users' choices and preferences
data data-analysis data-mining data-science data-visualization dataset machine-learning machine-learning-algorithms
Last synced: 05 Feb 2026
https://github.com/namratha2301/sales-orders-analysis
Wanted to experiment with Looker. This dashboard visualizes sales trends across regions, customer segments, and product categories.
business-analytics dashboard data dataanalysis datavisualization excel looker looker-studio
Last synced: 13 Feb 2026
https://github.com/sumaiyyaf/british-airline-dashboard
This Tableau dashboard visualizes British Airways customer reviews, showcasing key metrics like average ratings for service, entertainment, and seat comfort. It features interactive filters for exploring ratings by aircraft type, country, and traveler type, along with trend analysis over time.
analysis dashboard data tableau visualization
Last synced: 13 Feb 2026
https://github.com/j0a0m4/olympics
Final Project for Data Engineering Accelerated LATAM
Last synced: 13 Feb 2026
https://github.com/ashleydavis/brisjs-web-scraping-talk
Code to accompany my talk on web scraping for the Brisbane JavaScript meeting in September 2018
cheerio data data-acquisition data-acquisiton electron headless-browsers javascript nightmare nightmarejs nodejs web-scraping
Last synced: 06 May 2026
https://github.com/fiddlydigital/anonimizer
A lib to replace and rehydrate sensitive data in text
anonimize anonymize data data-security prompt sanitize string string-manipulation text
Last synced: 15 Mar 2025
https://github.com/yashaswitir28/yashaswitir28.github.io
This is my Portfolio Website
data data-analysis-python data-analyst data-cleaning data-science data-visualization excel html-css ms office365 portfolio-website powerbi python sql
Last synced: 29 May 2026
https://github.com/dakshdeephere/bank_eda-practice
Exploratory data analysis (EDA) of the Bank.csv dataset
analysis data data-visualization dataanalysis matplotlib numpy pandas python3 seaborn
Last synced: 07 May 2026
https://github.com/shantanujpk/bigdatacloud
Exploration of PySpark for data processing and interview prep. Demonstrates handling corrupted records, applying transformations/actions, and building efficient data pipelines with practical examples.
big-data data jupyter-notebook pipeline pyspark python spark sparksql
Last synced: 07 May 2026
https://github.com/farhashaad/farhashaad98
This is a repository to showcase my skills, share projects and track my progress in Data Science related projects.
data data-visualization dataanalysis matplotlib pandas python seaborn sql tableau
Last synced: 24 Apr 2026
https://github.com/infinitode/pywebscrapr
An open-source Python web scraping tool. Supports both image scraping and text scraping.
data data-collection data-science open-source pip scraping web-scraper
Last synced: 14 Feb 2026
https://github.com/sakshamarora07/blinkit-sales-report-power-bi
This dashboard provides Blinkit with insights to optimize its grocery delivery operations and understand customer preferences. It evaluates sales trends, outlet performance, and item categories to identify key areas for improvement. The interactive visuals allow detailed exploration of sales distribution, customer ratings, and product popularity.
data data-science dataanalytics datavisualization excel powerbi sql
Last synced: 08 Jan 2026
https://github.com/sanand0/iss-location
Tracks the International Space Station position. A demo of how to use GitHub Actions to schedule commits weekly.
Last synced: 14 Feb 2026
https://github.com/halyusa16/mysql-employee-analysis
This project focuses on analyzing employee data through querying, performing table joins to connect related information, aggregating salary statistics, and using subqueries to extract meaningful insights.
data data-analytics data-exploration database mysql self-project sql
Last synced: 20 Jan 2026
https://github.com/natanast/euroleaguebasketball
An R package providing data on Euroleague Basketball
Last synced: 01 Apr 2025
https://github.com/molinsagustin/cinedata
CineData: group practical assignment for the Data Engineering I course at the Universidad Argentina de la Empresa. It consisted of developing a relational database in Microsoft SQL Server Management Studio using the Agile Scrum methodology, which was applied from requirements gathering through to final implementation.
agile data data-modeling database diagram entity-relationship-diagram microsoft-sql-server relational-databases relational-model scrum scrum-agile sql sqlserver
Last synced: 28 Feb 2026
https://github.com/lab5e/loadabledata
Simple framework-agnostic wrapper around loadable data to help encapsulate and use state changes in a UI.
async data loadable state typescript ui
Last synced: 07 May 2026
https://github.com/bryanhe24/data_analysis_app
A full-stack web application that allows users to upload CSV datasets, analyze the data with statistical summaries and visualizations, and interact with an AI-powered assistant for querying the dataset.
ai data data-analysis data-visualization fullstack-development javascript math python reactjs
Last synced: 07 May 2026
https://github.com/darkogamerz/dhis2heat
A comprehensive data management and health equity assessment and analysis platform that fetches data from DHIS2, then optimizes, calculates, cleans, and visualizes inequality data.
analytics data data-science dhis2 equality equity health heat inequality r shiny shinydashboard visualization
Last synced: 01 Apr 2025
https://github.com/gusenov/open-data-scripts
Scripts to explore public datasets and work with open data.
charts data data-visualisation data-visualization datavisualization highcharts kazakhstan open-data opendata qazaqstan
Last synced: 28 Feb 2026
https://github.com/2022-04-11588/data-fakes
🔍 Generate realistic fake data for testing and development, enhancing your projects with simple, customizable data solutions.
data dataset developer-tools fake-content faker fakery groovy java mock phoenix python random ruby seeding struct swift-framework test-data testing
Last synced: 11 Apr 2026
https://github.com/ludwing-mj/manipulacion_ej
Exercise used in section eight of the manual to illustrate the data manipulation tools provided by the tidyverse.
data manipulate-data package r
Last synced: 01 Apr 2025
https://github.com/arch-fan/pokedata
Pokemon Data in CSV format for whatever you need!
Last synced: 17 Jun 2026
https://github.com/madhuresh2011/genai-powered-data-analytics-by-tata
I recently participated in Tata iQ's job simulation on the Forage platform, and it was incredibly useful to understand what it might be like to be on a data analytics team in an AI transformation consulting role.
chatgpt data dataanalytics eda excel gemini generative-ai internships powerpoint presentation
Last synced: 14 Feb 2026
https://github.com/safwan2003/randomforest_heart_disease_prediction
A machine learning project using Random Forest Classifier to predict heart disease. Includes data preprocessing (with binning), feature selection, and model evaluation.
binning data data-science datapipeline datapreprocessing datavisaulization deep-learning machine-learning python random-forest-classifier streamlit
Last synced: 07 May 2026
https://github.com/eslamdyab21/data-visualization-using-matplotlib-and-seaborn
This is the final project in the Udacity nanodegree program. It is about data visualization.
data data-analysis data-visualization matplotlib pandas python seaborn udacity udacity-data-analyst-nanodegree
Last synced: 09 May 2026
https://github.com/lijesh010/roadaccidentanalysisproject
This data analysis project was completed using MS Excel, and includes the creation of a dashboard.
data data-analytics data-exploration data-visualization msexcel
Last synced: 15 Feb 2026
https://github.com/grace-mengke-hu/redditpushshiftapi
This package collects Reddit datasets and organizes the data in a MongoDB database.
Last synced: 13 Jun 2025
https://github.com/pdoup/enegry
Time-Series dataset combining multiple sources to explain the broader Greek energy market
data dataset day-ahead-auction energy-markets exploratory-data-analysis forecasting futures-market greek-energy-market renewable-energy time-series-data weather-data
Last synced: 07 May 2025
https://github.com/mochsyahrizal/jkfkjabar_studycase
First data analytics case study
Last synced: 15 Feb 2026
https://github.com/mustafaozvardar/selenium-eksisozluk
This project is a simple web scraper built with Python using Selenium. It extracts and prints the content of popular entries from a specific EksiSozluk page.
data python selenium selenium-python
Last synced: 29 Apr 2026
https://github.com/vuurvos1/functional-programming
HVA functional-programming
data formatting functional nodejs programming
Last synced: 03 Oct 2025
https://github.com/murtaza-arif/all-you-need-to-know-for-data-engineer
This repository is designed to showcase various aspects of data engineering, including tools, frameworks, and end-to-end projects. It covers everything from data ingestion and transformation to data warehousing and cloud-based solutions.
cassandra data data-engineering data-science kafka kafka-consumer kafka-streams pyarrow spark
Last synced: 07 May 2026
https://github.com/eva-kaushik/data-clustering
Clustering Accelerators for hard and soft clustering, including implementations of K-means, K-medoids, hierarchical clustering, fuzzy C-means, and Gaussian mixture models. Demonstrates text clustering using both hard and soft clustering algorithms.
clustering clustering-algorithm data datascience machine-learning-algorithms
Last synced: 09 Apr 2025
https://github.com/armand-sauzay/datasets
Datasets for machine learning
ai data datasets machine-learning ml
Last synced: 18 Jan 2026
https://github.com/jigyasag18/iit-guhawati
Empower Sakhi is a data-driven platform that uses machine learning to identify women at risk of domestic violence in India. It offers confidential self-assessments, survivor stories, and emergency resources through a trauma-informed, privacy-focused web app. The project also provides NGOs with actionable insights via a Power BI dashboard for support.
aiml data dataset datavisualization domestic-violence eda jupyter-notebook label-encoding machine-learning machine-learning-algorithms machine-learning-models machinelearning machinelearningprojects powerbi python python-app random-forest random-forest-classifier streamlit streamlit-webapp
Last synced: 08 May 2026
https://github.com/quangandrei1003/france_air_pollution_pipeline
End-to-end air pollution data pipeline for French metropolitan cities using Airflow, Python, dbt, BigQuery.
airflow bigquery data data-analytics data-engineering data-modeling data-visualization dbt docker etl pandas python terraform
Last synced: 13 Apr 2026
https://github.com/trollmii/bunnybase
An efficient data managing system
bunnybase data data-science data-structures database datascience python python3
Last synced: 22 Apr 2025
https://github.com/liolb/sql2csv
Export SQL Server Table data to CSV
automation csv data database export extraction powershell scripting sql sql-server sql-table
Last synced: 08 May 2026
https://github.com/soenneker/soenneker.attributes.mapto
A C# attribute for generic data mapping translation
attributes columns csharp data datatables dotnet mapping mapto maptoattribute object
Last synced: 02 Mar 2026
https://github.com/badranalyst/covid-deaths-dashboard-with-tableau
This project showcases an interactive dashboard developed in Tableau to visualize COVID-19 deaths data. It provides insights into trends, geographical distributions, and key metrics related to mortality during the pandemic. The dashboard aims to enhance understanding of the data, supporting public health analysis and decision-making.
covid-19 dashboard data data-analysis data-visualization dataset tableau tableau-dashboards visualization
Last synced: 02 Mar 2026
https://github.com/zcebeci/odetector
Outlier Detection Using Cluster Analysis
anomaly-detection cluster-analysis clustering clustering-methods data datapreparation datapreprocessing exception-handling fcm fraud-detection fuzzy-clustering novelty-detection outlier-detection outlier-removal outliers partitioning pcm r surprise-exploration
Last synced: 29 Oct 2025
https://github.com/white-gecko/lineage-dump
RDF dump of the device information from the lineage wiki
Last synced: 28 May 2026
https://github.com/shadmanshaikh/data-analysis-and-ml-work
All of my work in Data Analysis and Machine learning
analytics artificial-intelligence data machine-learning
Last synced: 05 Jul 2025
https://github.com/zsvoboda/olympics
Self-service analytics of 120 years of Olympics data
analytics dashboards data datavisualization dataviz olympics open-data open-datasets opendata reports
Last synced: 08 May 2026
https://github.com/abhash-rai/regression-car-price-prediction
This repository contains my first complete data science project, from web scraping for data to data preprocessing, cleaning, exploratory data analysis, model training, and deployment.
data data-science data-visualization eda exploratory-data-analysis machine-learning neural-network prediction prediction-model regression
Last synced: 08 May 2026
https://github.com/gunjanmimo/d3-visualization
D3.js is a JavaScript library for producing dynamic, interactive data visualizations in web browsers. It makes use of Scalable Vector Graphics, HTML5, and Cascading Style Sheets standards. It is the successor to the earlier Protovis framework.
d3js data data-science data-visualization reactjs
Last synced: 29 Apr 2026
https://github.com/wyattowalsh/proxywhirl
rotating proxy system
data data-extraction dataextraction proxy proxy-checker proxy-list proxy-scraper proxy-server proxypool python python3 rotating-proxy sqlite sqlite3 web-data-extraction
Last synced: 03 Mar 2026
https://github.com/hlan22/2025-03-18-data-validation
(no longer useful) DSCI 310 Lecture about Data validation and code testing! Made in tandem with:
Last synced: 23 Jun 2026
https://github.com/inzhenerka/scooters_data_generator
Generate data of scooter trips for analysis
Last synced: 02 Jun 2026
https://github.com/unknownsoup/budget_tracker
A personal budget tracker to build my knowledge of working with databases and data analysis, in this case using SQL and Python for the analysis.
data data-science databases python sql
Last synced: 26 Jan 2026
https://github.com/blackhatdevx/leetcode
LeetCode Solutions by Jash Gro
algorithm algorithms dart data datastructures datastructures-algorithms dsa java javascript leetcode leetcode-java leetcode-python leetcode-solutions neetcode
Last synced: 08 May 2026
https://github.com/colesmcintosh/colesmcintosh.github.io
My portfolio site :)
ai automation data llms open-source
Last synced: 04 Mar 2026
https://github.com/ibttf/bayborhood
Interactive map to find the ideal neighborhood in San Francisco based on data.
data data-analysis data-visualization gis mapbox react
Last synced: 18 Jun 2026
https://github.com/etmendz/mendz.data.oracle
Provides a generic Mendz.Data-aware context for ADO.Net-compatible access to Oracle databases.
ado-net context data database datasettings mendz oracle
Last synced: 13 Apr 2026
https://github.com/juanpablo70/pgad-assignment01
Breast Cancer Coimbra data set analysis
data data-science dataframe dataset jupyter-notebook matplotlib numpy pandas python
Last synced: 08 May 2026
https://github.com/inist-cnrs/ws-data
Models and data for the web services
Last synced: 03 Sep 2025
https://github.com/ashakoen/bls-data-extract
This repository contains scripts and a database schema to set up and manage a local SQLite database for storing and querying the Average Price data from the U.S. Bureau of Labor Statistics. It includes tools for downloading the latest data from the BLS website and fetching Consumer Price Index (CPI) data via the BLS API.
Last synced: 01 Apr 2026
https://github.com/thomasjewson/cci-data-science-textbook
This is a short, interactive textbook aimed at introducing data science to non-IT university undergraduates. Funded by Erasmus+.
data data-science learning python textbook
Last synced: 16 Apr 2026
https://github.com/fastpix/android-data-bitmovin
FastPix Video Data SDK to monitor and analyze video playback metrics within Bitmovin for Android
analytics android-sdk bitmovin data fastpix metrics player sdk video
Last synced: 16 Apr 2026
https://github.com/jigyasag18/power-bi-dashboard-project
The Ecommerce Sales Analysis Dashboard project utilizes Power BI to provide detailed insights into ecommerce sales data, enabling stakeholders to track key performance metrics and uncover trends. This interactive dashboard allows users to explore the data in real time, offering features such as drill-down capabilities and customizable filters.
dashboard data data-visualization datacleaning datanalysis datanalytics datapreprocessing powerbi visulaization
Last synced: 04 Mar 2026
https://github.com/ehvenga/data.driven.modeling
Repository to practice data-driven modelling
Last synced: 23 Mar 2025
https://github.com/arjunrao87/world-countries-graphql-api
GraphQL API for retrieving information about countries of the world
countries data database geographic-data geography graphql world
Last synced: 10 May 2026
https://github.com/equinor/fmu-sumo
Interaction with Sumo in the FMU context
analytics data fmu python subsurface sumo visualization
Last synced: 01 May 2025
https://github.com/so-cool/junction
My solution to the University of Bristol "Bristol Journey Time" Data Challenge https://So-Cool.github.io/junction
competition data modelling timeseries
Last synced: 02 Apr 2025
https://github.com/sehgal-vishal/world-population-
World Population SQL Analysis
data dataanalysis population sql
Last synced: 05 Mar 2026
https://github.com/amethyst-php/collection
As simple as the name suggests, this package allows you to create collections of other models.
amethyst amethyst-package api collection data laravel
Last synced: 17 Apr 2026
https://github.com/jigyasag18/amazon-prime-power-bi-dashboard
The Amazon Prime Power BI Project is a centralized data storage system containing detailed information on movies and TV shows available on Amazon Prime Video, including metadata and analytics insights. It supports data-driven decision-making for content acquisition and viewer engagement strategies. This repo is optimized for querying & analysis.
dashboard data data-visualization dataanalysis dataanalytics datacleaning dataset powerbi powerbi-dashboards powerbi-report powerbi-visuals powerbidashboard
Last synced: 05 Mar 2026
https://github.com/maximiliancw/completely
Measure your data completeness
data data-cleaning data-quality data-science missing-data
Last synced: 25 Jun 2025
https://github.com/stefanpietrusky/factsv2
Repository for the article in the online magazine TDS.
ai arxiv-papers beautifulsoup data flask-application gensim llama matplotlib ollama plotly pyldavis python selenium webdriver
Last synced: 09 Apr 2025
https://github.com/sushmashreeps/data-science-with-python
This repository showcases a comprehensive data science project utilizing Python, demonstrating expertise in data analysis, visualization, and machine learning. Built with Python 3.x, the project leverages popular libraries like Pandas, NumPy, Matplotlib, Seaborn, Scikit-learn, and TensorFlow. The project features data preprocessing, feature engine
cnn data dataanalysis datascience keras linear-regression matplotlib python python3 regression rnn visualization
Last synced: 14 Apr 2026
https://github.com/vanshuchaudhary/flightpriceanalysis-
The uploaded file is a Jupyter Notebook titled "Flight Analysis". It likely involves analyzing flight-related data, potentially exploring trends, patterns, or insights using data science techniques. The analysis might include data visualization, statistical analysis, or predictive modeling.
business-analytics data data-analysis data-visualization datainsights datascience matplotlib-pyplot python seaborn seaborn-plots seaborn-python sns statistical-analysis
Last synced: 08 May 2026
https://github.com/lexiortiz/advanced-data-analytics
Structured learning notes, code snippets, and key takeaways from the Google Advanced Data Analytics Professional Certificate. Serves as a personal reference for reinforcing concepts and as a resource for others on a similar learning journey.
data data-analysis data-engineering google python-3 sql
Last synced: 29 May 2026
https://github.com/squareslab/frameworkstudytranscripts
archived data human-study zackc
Last synced: 06 Mar 2026
https://github.com/ashfaqalizardariofficial/databasehelper
A C# database helper library to connect with the database server and perform actions such as insert, update, delete, select data, and select multiple data from the database.
ashfaq-ali-zardari ashfaq-ali-zardari-official data database delete helper insert ms-sql-server multiple select-data server sql-server update
Last synced: 02 Apr 2026
https://github.com/ismailhakkii/digital_vault
This project can be used for securing data, similar to a real vault.
data digital security-data vault
Last synced: 25 Mar 2025