data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum,
- Last updated: 2026-07-02 00:07:45 UTC
- JSON Representation
https://github.com/jbn/vaquero
A Python library for iterative and interactive data wrangling at laptop-scale.
data data-analysis data-cleaning data-mining dirty-data elt etl etl-framework
Last synced: 10 Jun 2026
https://github.com/moons-14/datapot
Incorporate and serve all information.
ai aiogram api data infomation news newspaper rss video
Last synced: 04 Jan 2026
https://github.com/bala-1409/sales-forecasting-datascience-project
Develop a data science project using historical sales data to build a regression model that accurately predicts future sales. Preprocess the dataset, conduct exploratory analysis, select relevant features, and employ regression algorithms for model development. Evaluate model performance, optimize hyperparameters, and provide actionable insights.
data data-analysis data-science data-visualization datacleaning exploratory-data-analysis machine-learning-algorithms modelfitting prediction predictive-analytics predictive-modeling python3 regression-models salesforecast supervised-learning
Last synced: 26 Apr 2026
https://github.com/ralzz/dibimbing_datascience
This project contains an Exploratory Data Analysis (EDA) of the Estonia Passenger List dataset. I handled missing values, removed duplicate data, and created basic visualizations to find insights.
data data-science eda google-colab kaggle pandas python
Last synced: 06 May 2026
https://github.com/neurazum-ai-department/tumor-stages-dataset---v1
Synthetic MRI data generated by the ‘HF’ and 'Vbai' models based on real data.
brain data dataset datasets image mri neuroscience tumor tumor-segmentation
Last synced: 18 Mar 2026
https://github.com/ludreinsalvador/global-covid-19-data-analysis
Contains Power BI dashboards that visualizes and analyzes global COVID-19 cases, deaths, and vaccination trends using data from the World Health Organization (WHO). The project aims to provide insights into the pandemic’s impact and vaccination progress worldwide through dynamic reports and advanced analytics.
analytics covid-19 covid19-data data data-analysis data-collection data-transformation data-visualization
Last synced: 26 Feb 2026
https://github.com/dysnomia-studio/achieve-games-dump
Dump parts of achieve.games database to public including Steam Games List
data dump games steam steam-api steam-game steam-games
Last synced: 27 Feb 2026
https://github.com/ahmad-ali-rafique/random-forest-regressor-modeling
Detailed exploration of random forest regressors, including data cleaning, model building, and performance evaluation on various datasets.
data dataanalytics datacleaning evaluation-metrics modeling random-forest random-forest-regression regression regression-analysis
Last synced: 05 Mar 2025
https://github.com/robertoostenveld/bird
BagIt Research Data
bagit data fair open-datasets repository
Last synced: 18 Mar 2026
https://github.com/mbagalman/lattice-doe
Python code to create experimental designs optimized to meet statistical power targets
abtesting data datascience designofexperiments experimentaldesign statistics
Last synced: 19 Jun 2026
https://github.com/krescruz/pegaso-data
Utilerías para el analisis de datos del Proveedor de Certificación de Factura Pegaso
Last synced: 29 Apr 2026
https://github.com/vatshayan/songs-datasets
Datasets for Songs and Music for Dancing, Emotional, Happy and scenic view
1000dataset classfication csv data datapackage datapackages dataset datasets excel free freedata freedatasets genre machine music sgenre song songs
Last synced: 18 Mar 2026
https://github.com/thesfinox/fit-the-data
Data analysis using Wolfram Mathematica
analysis data data-analysis lab mathematica wolfram wolfram-mathematica
Last synced: 24 Jan 2026
https://github.com/aliasgarsogiawala/dashboards
Power BI dashboards , each folder contains a pbix file and a pdf file with explanation of the dashboard
analysis dashboards data data-visualization powerbi
Last synced: 12 Feb 2026
https://github.com/miozilla/snowden
snowden :snowman::video_game: : VR Game # Snowflake # Data Engineering # ELT
data elt engineering snowflake sql vr-game
Last synced: 11 Feb 2026
https://github.com/kenanbek/youtube-data
YouTube stats data over YouTube Data API v3 using Python.
data python youtube youtube-api
Last synced: 13 May 2026
https://github.com/davecumin/ancir_next
analysis chronobiology circadian d3 data data-analysis data-visualization svelte timeseries
Last synced: 18 May 2026
https://github.com/amethyst-php/target
amethyst amethyst-package api data laravel target
Last synced: 22 May 2026
https://github.com/karajmiglani-datascientist/karajmiglanifake-news-detection
FAKE_NEWS_PREDICTION
algorithms data data-science flask machine-learning probability-statistics python statistics structure
Last synced: 22 May 2026
https://github.com/beastbytes/postal-code-data-php
Implementation of PostalCodeDataInterface using PHP file storage
Last synced: 27 Feb 2026
https://github.com/interzoid/php-examples
Provides PHP examples for consuming several of the Cloud APIs available from Interzoid, including company name matching, individual name matching, weather, page performance, email validation, currency rates/FOREX, and global telephone information.
api cloud data database php quality
Last synced: 12 Jan 2026
https://github.com/rubidev68/citadelai-community
Community version of citadelai.app
ai ai-assistant chatbot chatbot-framework data knowledge-management silo-digital
Last synced: 03 Feb 2026
https://github.com/whis99/data_analysis_journey
A repositories of my data analysis projects.
data data-analysis data-analysis-python data-visualization dataset jupyter-notebook matplotlib python visualization
Last synced: 07 May 2026
https://github.com/bryanhe24/data_analysis_app
A full-stack web application that allows users to upload CSV datasets, analyze the data with statistical summaries and visualizations, and interact with an AI-powered assistant for querying the dataset.
ai data data-analysis data-visualization fullstack-development javascript math python reactjs
Last synced: 07 May 2026
https://github.com/omari-kd/recommendation-system-analysis-and-modelling
This project aims to develop a recommendation system that leverages historical user data to provide tailored recommendations across different domains, such as product recommendations, content suggestions and service optimisation.
data data-science data-science-in-r machine-learning-algorithms recommendation-system
Last synced: 08 Jan 2026
https://github.com/tjas/postgrad-ai-ddv-plotly
Jupyter Notebook to analyze the salaries of Federal District government public servants, using Python, Pandas and Plotly Express, to solve the proposed exercise in "Data Discovery and Visualization" discipline.
analysis analytics data data-analytics data-discovery data-science data-visualization graph graphs jupyter-notebook jupyter-notebooks pandas plotly plotly-express python
Last synced: 07 May 2026
https://github.com/rajlabmssm/echodata
echoverse module: Example data.
data echoverse fine-mapping genomics gwas qtl
Last synced: 17 Jan 2026
https://github.com/infinitode/pywebscrapr
An open-source Python web scraping tool. Supports both image scraping and text scraping.
data data-collection data-science open-source pip scraping web-scraper
Last synced: 14 Feb 2026
https://github.com/vojtech-dobes/php-conformance
constraint data input normalization php sanitization schema validation
Last synced: 23 Jul 2025
https://github.com/imartinezl/madrid-challenge
Madrid Route Optimization Challenge 🚚♻️🚚
challenge city data optimization routing-algorithm traffic
Last synced: 28 Feb 2026
https://github.com/murtaza-arif/all-you-need-to-know-for-data-engineer
This repository is designed to showcase various aspects of data engineering, including tools, frameworks, and end-to-end projects. It covers everything from data ingestion and transformation to data warehousing and cloud-based solutions.
cassandra data data-engineering data-science kafka kafka-consumer kafka-streams pyarrow spark
Last synced: 07 May 2026
https://github.com/4ment/aiv-rate-heterogeneity
Avian influenza virus data sets
Last synced: 24 Jan 2026
https://github.com/sunnahboy/checkfake_true_news
Building data structures using Linked lists and arrays and find best algorithms for implementing a system for detecting Fake News
algorithms data level low programming structure
Last synced: 28 Feb 2026
https://github.com/kiing-dom/data-structures-algorithms
data structures and algorithms
algorithms-and-data-structures data data-structures java leetcode
Last synced: 09 Aug 2025
https://github.com/gabboraron/datacamp_projects
Here you can find my DataCamp Projects
data datacamp datacamp-projects
Last synced: 14 Jun 2026
https://github.com/madhuresh2011/genai-powered-data-analytics-by-tata
I recently participated in Tata iQ's job simulation on the Forage platform, and it was incredibly useful to understand what it might be like to be on a data analytics team in an AI transformation consulting role.
chatgpt data dataanalytics eda excel gemini generative-ai internships powerpoint presentation
Last synced: 14 Feb 2026
https://github.com/gui-sitton/bank-loans
In this project I will prepare a report for a bank's loan division. I find out whether a customer's marital status and number of children have an impact on loan default, as well as other factors
data data-analysis data-analysis-python data-science data-visualization python
Last synced: 21 May 2026
https://github.com/soenkekluth/micromitter
minimal and performant event emitter / dispatcher
data dispatch dispatcher emit emitter event eventdriven handler on send trigger
Last synced: 02 Nov 2025
https://github.com/lijesh010/roadaccidentanalysisproject
This data analysis project was completed using MS Excel, and includes the creation of a dashboard.
data data-analytics data-exploration data-visualization msexcel
Last synced: 15 Feb 2026
https://github.com/danpoynor/data-pagination-and-filtering-project
Data pagination exercise using 'vanilla' JavaScript. This script consumes a JSON array containing any number of objects and adds buttons to a page that users can click to navigate to different pages of data.
data javascript json navigation pagination vanilla-javascript
Last synced: 20 Apr 2026
https://github.com/bhenk/msdata-d
MySql DAO
dao data data-layer database mysql mysql-database mysqli
Last synced: 07 May 2026
https://github.com/luminati-io/google-search-api
Two methods to collect real Google SERP data—a free scraper for basic use and the enterprise-grade Bright Data API for high-volume demands.
data google-scraper html python serp-api web-scraping
Last synced: 25 Jun 2025
https://github.com/abhash-rai/regression-car-price-prediction
This repository contains my first complete data science project from web scrapping for data to data preprocessing, cleaning, exploratory data analysis, model training and deployment.
data data-science data-visualization eda exploratory-data-analysis machine-learning neural-network prediction prediction-model regression
Last synced: 08 May 2026
https://github.com/xmen3em/kaggle-competitions
This collection contains various projects and notebooks developed to tackle a range of Kaggle competitions, showcasing different machine learning techniques, data preprocessing methods, and model optimizations.
data data-science data-visualization deep-learning deployment ensemble-learning machine-learning-algorithms python streamlit
Last synced: 09 Apr 2026
https://github.com/harrisonwelch/pythondatascience
Repo of code from the linked-in lesson "Python: Data Analysis"
data data-science matplotlib notes numpy python tutorial
Last synced: 12 Apr 2026
https://github.com/deva-246/excel-power-query-data-cleaning-dashboard
dashboard data datacleaning excel pivottable powerquery slicer
Last synced: 22 Mar 2025
https://github.com/juanpablo70/pgad-assignment01
Breast Cancer Coimbra data set analysis
data data-science dataframe dataset jupyter-notebook matplotlib numpy pandas python
Last synced: 08 May 2026
https://github.com/reshmaaiman/liver-patient-prediction
Liver Disease Prediction
data data-science data-visualization dataanalysis jupyter-notebook numpy pandas python seaborn
Last synced: 16 Apr 2026
https://github.com/bakangmonei/is_final_assignment
My intelligent systems assignment
data data-science intelligent-systems python
Last synced: 02 May 2026
https://github.com/rileynwong/forecasting-coffee-prices
Predict coffee prices in Kenya
data data-analysis data-scraping data-visualization forecasting forecasting-models forecasting-prices jupyter-notebook prophet prophet-model
Last synced: 20 Jun 2026
https://github.com/wyattowalsh/proxywhirl
rotating proxy system
data data-extraction dataextraction proxy proxy-checker proxy-list proxy-scraper proxy-server proxypool python python3 rotating-proxy sqlite sqlite3 web-data-extraction
Last synced: 03 Mar 2026
https://github.com/kunalkumar2001/coffee_sales_project_using_excel_power-bi_and_sql
Coffee Shop Sales Dashboard built using Power BI for visualization and SQL for data extraction and transformation. The project dives deep into sales performance, providing actionable insights for data-driven decisions.
analytics data dataanalytics mssql powerbi sql
Last synced: 26 Jun 2025
https://github.com/taquece/goals-per-match
basic script to calculate average football goals per match from .CSV
beginner csv data football nodejs python sports-analytics
Last synced: 09 May 2026
https://github.com/metapsy-project/data-depression-anxiety-transdiagnostic
Database of transdiagnostic treatment of depression and anxiety
Last synced: 01 Apr 2026
https://github.com/sakan811/honkai-star-rail-characters-damage-simulation
Honkai Star Rail Characters' Damage Simulation
data data-science data-visualization honkai honkai-star-rail honkai-starrail powerbi powerbi-visuals python sqlite
Last synced: 29 Jun 2026
https://github.com/szc126/metadata-nnd-vocalo-twitter
ボカロ系新着動画ツイートを収集 - "new VOCALOID/UTAU videos" tweet collection
data nico-nico-douga niconico vocaloid
Last synced: 20 May 2026
https://github.com/ournet/embed-providers-data
Embed provides data
data embed embed-providers json providers
Last synced: 03 May 2026
https://github.com/ethenkem/pygraphsurvey
A python base web app that provide graphical analysis on data collected from surveys and the system has its on built in form fiiling where admin can set question and sent a link for the forms to be filled and then the system provide anylysis on the collected data. Form feature include selection options, range values file inputs etc
Last synced: 12 Jan 2026
https://github.com/pyfig/s21_data-science-bootcamp
School21 Bootcamp Data Science
data data-science numpy pandas python school21
Last synced: 26 Jun 2025
https://github.com/circlexo/circlexo
Open-source project to seamlessly integrate and manage your business workflow, connecting Jira, GitHub, Discord, Stripe, RevenueCat, and OpenAI all in one intuitive platform.
bussiness-intelligence data discord-bot forge github google jira kpis ploi revenuecat stripe vapor
Last synced: 20 May 2026
https://github.com/ubeydgur/car-price-prediction
Predicting the price of a used car
ai artificial-intelligence data data-science data-visualization machine-learning machine-learning-algorithms
Last synced: 08 Jun 2026
https://github.com/dsietz/daas-workshop
Workshop for building a Data as a Service platform using the DaaS SDK.
archconf daas daas-pattern data dataprivacy nfjs rust rust-lang
Last synced: 20 May 2026
https://github.com/jigyasag18/amazon-prime-power-bi-dashboard
The Amazon Prime Power BI Project is a centralized data storage system containing detailed information on movies and TV shows available on Amazon Prime Video, including metadata and analytics insights. It supports data-driven decision-making for content acquisition and viewer engagement strategies. This repo is optimized for querying & analysis.
dashboard data data-visualization dataanalysis dataanalytics datacleaning dataset powerbi powerbi-dashboards powerbi-report powerbi-visuals powerbidashboard
Last synced: 05 Mar 2026
https://github.com/majorcluster/clj-data-adapter
A Clojure library designed to convert data
Last synced: 12 Jul 2025
https://github.com/md-emranhossen/leetcode-practice
This repository stores my solutions to LeetCode problems, organized by problem number and title.
cpp data datastructures-algorithms leetcode-solutions
Last synced: 26 Jun 2025
https://github.com/randomgamingdev/randomgamingdev.github.io.data
The data for RandomGamingDev.github.io (feel free to build your own website off of mine :D)
blog custom data projects projects-list
Last synced: 02 Jan 2026
https://github.com/nitheshgoutham/sentinel-2-data-processing-for-pichavaram-mangrove-forest-using-cnn
Image Processing using CNN
cnn cnn-classification cnn-keras data deep-learning matplotlib ploty python seaborn-python visualization
Last synced: 29 Jun 2026
https://github.com/amethyst-php/recipe
amethyst amethyst-package api data laravel recipe
Last synced: 19 May 2026
https://github.com/foreteternelle/pokemonstudiodataapi
The GitHub repository of the Pokémon Studio Data Api
Last synced: 02 Apr 2026
https://github.com/eyluldursun/data-science-project
This project involves a data science analysis conducted on the Obesity Data Set. The study explores factors influencing obesity, includes data visualization, and develops predictive models. The goal of the project is to gain insights to help prevent obesity.
data data-science obesity r rmarkdown
Last synced: 26 Jun 2025
https://github.com/cnr-ibba/smarter-repository
SMARTER Data Repository
bootstrap5 data django repository smarter
Last synced: 03 Apr 2026
https://github.com/flexthink/matricize
A convenience library to convert between pure Python objects and their vectorized representations
data machine-learning numpy python
Last synced: 09 May 2026
https://github.com/master-helix/ibm-data-analyst-certification-stock-analysis-project
This is a mini project repository of my IBM Certification involving stock analysis and plotting of Tesla and GameStop
analytics data data-analysis data-visualization ibm matplotlib pandas python web-scraping
Last synced: 09 May 2026
https://github.com/cloud-shuttle/drover-sqlforge
The Data Automation Engine. A blazing-fast, pure Go alternative to dbt for data transformations.
ast data drover sql transformation
Last synced: 03 Jun 2026
https://github.com/mumtaz4118/employee-satisfaction-and-attrition
Analysis of attrition based on environmental satisfaction from a Kaggle dataset.
data data-analysis data-science data-visualization ipynb jupyter-notebook machine-learning python statistical-analysis statistical-models
Last synced: 19 May 2026
https://github.com/yuvrajsaraogi/sales-prediction-using-python
Sales prediction involves estimating future product sales based on factors like advertising spend, target audience, and platform. Businesses rely on data scientists to forecast sales and optimize advertising costs. Machine learning in Python can be used for this task.
data data-analysis data-science data-visualization machine-learning matplotlib natural-language-processing numpy pandas prediction python sales-prediction-using-python sql
Last synced: 19 Apr 2026
https://github.com/opdev1004/crumbdbjs
JSON files based database Javascript
data data-storage data-store database database-management nodejs
Last synced: 18 Apr 2026
https://github.com/fliplet/fliplet-widget-data-source-query
Data Source Query Provider
Last synced: 11 Apr 2025
https://github.com/neelamraikwar9/bookdata
This is my 1st assignment git repository. I have worked with Book Data and by using Express Js created routes and API's for Post, Update, Delete, and Get.
api books data database deployment expressjs node nodejs postman postman-api
Last synced: 05 Apr 2026
https://github.com/mipacd/holochatstats
A VTuber chat log (and general) analytics platform
data flask hololive postgresql python visualization vtuber youtube
Last synced: 05 Apr 2026
https://github.com/tomwhite/misp-2017
MISP camp 2017 materials and code
bioinformatics data data-visualization hackathon
Last synced: 18 Apr 2026
https://github.com/prakashjha1/loan-eligibility-prediction
This repository contains the codebase and resources for a machine learning-based project aimed at predicting loan eligibility for individuals. The project utilizes various algorithms and data preprocessing techniques to build predictive models that assess the likelihood of an applicant being eligible for a loan based on historical data.
data data-visualization exploratory-data-analysis loan-prediction-analysis machine-learning-algorithms naive-bayes-classification parameter-tuning python random-forest
Last synced: 19 Apr 2026
https://github.com/eryks1999/data-collection-project_python
This project allowed me to practice classes, populating json files as well as extracting data.
Last synced: 16 Apr 2026
https://github.com/jigyasag18/ibm-power-bi-dashboard-project
IBM Power BI Dashboard Project is a data-driven analysis of employees using IBM's comprehensive dataset, providing insights into key factors contributing to employee turnover and enabling organizations to strategize effectively towards improved employee retention and satisfaction.
data data-visualization dataanalysis dataanalytics dataset datavisualisation datavisualization-project powerbi powerbi-dashboards powerbi-report powerbi-visuals powerbidashboard
Last synced: 07 Mar 2026
https://github.com/ahadly/sql-data-analytics-project
This repository contains a collection of SQL scripts demonstrating various analytical techniques, such as changes over time, cumulative, performance, data segmentation, part-to-whole analysis.
analytics business-analytics business-intelligence data data-analysis data-analyst data-analytics data-engineering data-science data-scientist database datascience query reporting sql sql-queries sql-query sql-server window-functions window-functions-in-sql
Last synced: 18 May 2026