data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum
- Last updated: 2026-07-01 00:07:35 UTC
https://github.com/raulmaulidhino-dev/ml_modelling_regression
Many factors influence students' grades and scores; one of them is study hours. In this mini analysis project, 3 models learn and predict the relationship between students' study hours and their scores in an exam/test. The project identifies the best ML model for the problem.
data data-analysis-python data-science eda machine-learning scikit-learn
Last synced: 28 Jan 2026
https://github.com/equinor/fmu-sumo-uploader
Upload to Sumo in the FMU context
data fmu python subsurface sumo
Last synced: 06 May 2026
https://github.com/cmda-tt/course-25-26
🎓 tech track · 2025-2026 · curriculum and syllabus 📊
d3 data datavis functional javascript programming research svelte visualization
Last synced: 20 Jan 2026
https://github.com/sahraiidle/email-spam-detector
Email/SMS spam detector with a Flask UI/API, tuned ML models (TF‑IDF + SVM/LogReg/NB), and a ready-to-run web form plus JSON endpoint for predictions.
data machine-learning numpy pandas python randomforest scikit-learn spam-classifier spam-detection svm
Last synced: 24 Jan 2026
https://github.com/jdanielgoh/cobertura-campanias
In a democracy, is there room for every voice? A project to visualize the INE's monitoring of radio and TV coverage of the 2024 presidential candidacies.
d3js data datavisualization vue
Last synced: 09 Jun 2026
https://github.com/bishtrishu/pizza_sales_analysis_dashboard_sql_bi
Welcome to the Pizza Sales Analysis Dashboard project! This repository contains a comprehensive guide to building an interactive and insightful dashboard for analyzing pizza sales data using SQL and Power BI.
data data-science dataanalyst datavisualization dax dax-query microsoft microsoft-azure microsoft-sql-server msexcel mysql powerbi powerquery project sql
Last synced: 16 Mar 2026
https://github.com/srgchrksv/stream-crypto
Crypto trades streaming with azure services
azure binance crypto data databricks dataengineering pyspark python streaming websocket
Last synced: 30 Apr 2026
https://github.com/fatihemres/Africa
Africa app built with SwiftUI, using AVFoundation, MapKit, data, models, animations, and stickers.
animations avfoundation data mapkit models swift swift-animations swiftui
Last synced: 31 Aug 2025
https://github.com/supunlakmal/coronavirus-covid-19-status
COVID-19 case and death counts for each country in a JSON file.
coronavirus count country covid-19 covid-data covid19 data data-science data-visualization geographical geographical-information-system json
Last synced: 21 Jun 2026
https://github.com/andygol/andygol.github.io
Andrii Holovin – Product & Project Manager Geospatial Expert / OpenStreetMap Consultant / DevOps practitioner
consultant data data-structures devops experience floss gis mapping navigation openstreetmap personal-site personal-website
Last synced: 13 May 2026
https://github.com/priyapuranik/data-analytics-using_python
Analyzed hotel data to find meaningful insights, including booking patterns, seasonal trends, and more.
data pandas python sql visualization
Last synced: 06 Apr 2026
https://github.com/ttozatto/sparkify
Churn Prediction for music streaming app with PySpark
analysis churn data learning machine predictive pyspark science spark
Last synced: 16 Jan 2026
https://github.com/spatialcurrent/go-pipe
go-pipe is a simple library for piping objects from iterators to writers.
big-data bigdata concurrency data
Last synced: 29 Jan 2026
https://github.com/als8446/tripleten-data-science-projects
Projects made in the Data Scientist course from TripleTen LatAm
data data-analysis hypothesis-tests machine matplotlib numpy pandas python scipy sklearn
Last synced: 10 Apr 2026
https://github.com/apoorv74/njdg-stats
Tracking data from the National Judicial Data Grid's (NJDG) district courts portal
data git-scraping judiciary law
Last synced: 29 Jan 2026
https://github.com/thedragoncode/training-data-for-ai
Training data for the neural network
ai data flood meaningless neural-network neural-networks nn obscene politics spam toxic training
Last synced: 29 Jan 2026
https://github.com/dfsp-spirit/neuroimaging_testdata
Contains test data for unit tests, used in developing neuroimaging software. Ignore this. Licenses in the individual archives.
Last synced: 25 Feb 2026
https://github.com/peterhellberg/bugsnag-data
Dump Bugsnag data using the Data access API
Last synced: 22 Jun 2026
https://github.com/snimmagadda1/luigi-etl-example
🔍 Example of an ETL pipeline using Spotify's Luigi
data luigi luigi-pipeline python spotify
Last synced: 30 Mar 2025
https://github.com/amethyst-php/project
amethyst amethyst-package api data laravel project
Last synced: 15 Apr 2026
https://github.com/tks18/xl-pq-handler
A Pythonic Power Query (.pq) File Manager for Excel & Power BI Automation
analytics automation data excel power-query powerbi python xlwings
Last synced: 20 Jan 2026
https://github.com/gabya06/twitter_models
Repository used for twitter impression models
data data-science impressions machinelearning python ridge-regression sklearn twitter
Last synced: 04 May 2026
https://github.com/team-hydrogen/2025-adc-data
All files relating to the computation of the data provided
data jupyter-notebook nasa-app-development-challenge
Last synced: 11 Apr 2025
https://github.com/bubblymaps/bubblymaps
The open source bubbler map. Mapping the world's water fountains. Open Code, Open Data.
bubbler bubbly-maps data fountain map open-source water
Last synced: 31 Jan 2026
https://github.com/hlan22/2025-03-18-data-validation
(no longer useful) DSCI 310 Lecture about Data validation and code testing! Made in tandem with:
Last synced: 23 Jun 2026
https://github.com/opendatach/alds
A collaborative list of resources and ideas to enable "Amt Local Data Stewards" to manage the (open) data of their respective federal office
awesome-list data datagovernance dataliteracy datamanagement datastewardship opendata opengovernmentdata
Last synced: 31 Jan 2026
https://github.com/piyushkumar2025/india-general-elections-2024_data-analyst
Analyzed election data for 540+ constituencies and 100+ parties using SQL. Calculated state-wise seat distributions, classified 30+ parties into alliances, identified top 10 candidates by EVM votes, calculated victory margins, and analyzed voting patterns for 300+ candidates to uncover key insights.
analytics data database mysql sql statistics
Last synced: 22 May 2026
https://github.com/badranalyst/data-cleaning-and-exploratory-data-analysis-project
This project uses SQL to clean and analyze a layoffs dataset. Data cleaning tasks include removing duplicates, standardizing values, and handling missing data. Exploratory analysis is performed to identify trends in layoffs across companies, industries, and time periods.
cleaning-data data database dataset mysql mysql-database sql
Last synced: 07 Apr 2025
https://github.com/abhishekn1947/samgov-scraper
Automated Python scraper for sam.gov contracts
analytics automation aws data pandas postgresql rds selenium webscraper
Last synced: 09 Apr 2026
https://github.com/agdturner/ccg-data
A modularised Java library for processing data sets with classes for: data records; collections of data records; and identifiers.
Last synced: 12 Jan 2026
https://github.com/jeugregg/deeplearningpicturedogs
Classify dog pictures with deep learning convolutional neural networks (CNNs)
classez-des-images cnn-keras data data-science ipynb neural-network vision
Last synced: 24 Jul 2025
https://github.com/passly-nl/data
Source code of the data layer.
data passly ticketing typescript
Last synced: 27 May 2026
https://github.com/hess125/data-visualizations
A repository of data visualization projects
data data-analysis data-science data-visualization powerbi projects sql sqlite tableau
Last synced: 31 Aug 2025
https://github.com/mlkav/digital-talent-scholarship
Learn in Digital Talent Scholarship Program
data data-science digital-talent-scholarship dts google-cloud google-cloud-platform science
Last synced: 26 Feb 2026
https://github.com/assada/free-words
Data for/from NLP
corpus-data data nlp-machine-learning npl
Last synced: 26 Feb 2026
https://github.com/ymorsi7/quranicvisualization
A visual exploration tool for the Holy Quran using D3.js treemaps.
css d3 d3js data data-visualization html islam islamic javascript js quran quranic treemaps visualization
Last synced: 15 Apr 2026
https://github.com/schoolsquirrel/holiday-data
Automatically updated holiday data for SchoolSquirrel
data holidays schoolsquirrel scripts vacation
Last synced: 03 Oct 2025
https://github.com/tanyagarg25/project_covidanalysis
This repository is a project for analyzing COVID-19 data using SQL and visualizing it with Tableau. Technologies used include SQL for querying and Tableau for data visualization.
analysis dashboard data data-visualization sql tableau
Last synced: 08 Feb 2026
https://github.com/darshjasani/insurance-claim-analysis
This dataset contains insightful information related to insurance claims, giving us an in-depth look into the demographic patterns of those receiving them.
Last synced: 27 Aug 2025
https://github.com/munas-git/codm-review-analysis-and-predictions
Sentiment analysis on Call of Duty Mobile Google Play Store user reviews with ML model to classify new reviews.
data flask machine-learning python sentiment-analysis
Last synced: 05 May 2026
https://github.com/matt-dray/draytasets
🔢🥸 Miscellaneous datasets I've collected or prepared
Last synced: 09 Feb 2026
https://github.com/manishjanky/wrangle-weratedogs-dataset
A data wrangling project done as part of the Udacity DAND
data data-wrangling twitter udacity udacity-data-analyst-nanodegree udacity-nanodegree weratedogs
Last synced: 15 Apr 2026
https://github.com/debjyotisaha/tableau-projects-phase-2
Published interactive dashboards on Tableau Public, highlighting expertise in data visualization and storytelling through analyses of transportation patterns, sales trends, and demographic studies. These projects showcase the ability to transform complex datasets into actionable, intuitive visuals for decision-making.
dashboards data data-analysis data-visualisation tableau
Last synced: 26 Aug 2025
https://github.com/muthupillai1204/diwali_sales_analysis
The Diwali sales analysis reviews past data to identify trends, peak buying times, popular products, and customer demographics. It assesses sales volume, revenue growth, and promotional effectiveness, helping businesses optimize marketing and inventory for future seasons.
data datacleaning eda excel jupyter-notebook matlplotlib numpy pandas python seaborn visualization
Last synced: 05 May 2026
https://github.com/mateuszskoczek/generatorcsv
GeneratorCSV is a students and teachers data converter for Microsoft 365 Admin Center. The project was implemented for Sobolew High School.
admin converter data microsoft365 python school tkinter
Last synced: 26 Aug 2025
https://github.com/prakhargpt/sql-data-warehouse-project
Building a data warehouse project using SQL Server, including ETL processes, data modelling and analytics.
analytics data data-analysis data-cleaning data-engineering data-engineering-pipeline data-lakehouse data-science data-warehouse etl etl-job etl-pipeline medallion-architecture sql sql-server
Last synced: 12 Jun 2026
https://github.com/haroontrailblazer/machine_learning
A curated resource hub for learning machine learning, featuring tutorials, code examples, datasets, and hands-on projects to build foundational skills and explore real-world applications.
data data-analysis data-visualization database dataset gradient-descent machine-learning pandas python3 random-forest sklearn statistics
Last synced: 16 Apr 2026
https://github.com/rudxain/xorsum
Get XOR checksum with this command-line tool
binary checksum cli data digest file files hexadecimal rust-crate xor
Last synced: 08 Mar 2026
https://github.com/0xnu/data-analyst-training
The repository contains training materials for data analysts.
data data-analysis data-analyst
Last synced: 25 Aug 2025
https://github.com/julienmalka/shiftgenerator
ShiftGenerator WeSki 2018
data data-science latex python
Last synced: 06 May 2026
https://github.com/robertoostenveld/bird
BagIt Research Data
bagit data fair open-datasets repository
Last synced: 18 Mar 2026
https://github.com/ssiarhei115/shop-customers-segmentation
Shop customers segmentation
data data-analysis data-science data-visualization
Last synced: 24 Aug 2025
https://github.com/shreshthvashisht/instgram-user-analytics
SQL Fundamentals
data data-analysis data-science mysql social-network-analysis
Last synced: 09 Jun 2026
https://github.com/vatshayan/songs-datasets
Datasets for Songs and Music for Dancing, Emotional, Happy and scenic view
1000dataset classfication csv data datapackage datapackages dataset datasets excel free freedata freedatasets genre machine music sgenre song songs
Last synced: 18 Mar 2026
https://github.com/luminati-io/google-maps-dataset-samples
A sample dataset of over 1000 Google Maps businesses, extracted using the Bright Data API, ideal for competitor analysis, location-based marketing, and market strategies.
api data dataset google-maps maps web-scraping
Last synced: 03 Jan 2026
https://github.com/soenneker/soenneker.extensions.httprequestdatas
A collection of helpful HttpRequestData (Functions) extension methods
azure csharp data dotnet extension extensions function http httprequest httprequestdataextension httprequestdatas request
Last synced: 21 Apr 2026
https://github.com/ksm26/ml-ai-data-science-jobs-in-canada
Explore the latest machine learning, artificial intelligence, and data science job opportunities in Canada. Stay informed about Canadian tech job market trends and find your next career move.
ai-canada ai-careers canada canadian-tech-companies canadian-tech-job-market data data-analysis data-engineering data-science data-science-careers machine-learning prompt-engineering robotics
Last synced: 06 May 2026
https://github.com/miozilla/snowden
snowden ⛄🎮 : VR Game # Snowflake # Data Engineering # ELT
data elt engineering snowflake sql vr-game
Last synced: 11 Feb 2026
https://github.com/anandanraju/power_bi_dashboard_projects
The goal of this project is to provide insights into consumer behavior and purchasing trends across different platforms. By analyzing data from Amazon and other sources, we aim to uncover valuable insights that can inform marketing strategies, product development, and decision-making processes.
amazon dashboard data data-visualization healthcare powerbi project
Last synced: 11 Feb 2026
https://github.com/anuragagarwal96/hospital-mortality-rate-sql-analysis
In this project, I took a hospital dataset from Kaggle, analysed it, and predicted the mortality rate of patients admitted to hospitals. I used a combination of SQL, Tableau and Microsoft Excel for this project.
data data-visualization dataanalysis dataanalysisusingsql excel msexcel mssqlserver sql tableau tableau-public
Last synced: 09 Mar 2026
https://github.com/kunalthakur204/visualization-on-flower
🌸 Flower Dataset Visualization: visualizing patterns and relationships in flower data through charts and plots. Perfect for exploring floral characteristics and trends! 📊
data data-visualization dataanalysis flowerdataset python
Last synced: 16 Apr 2026
https://github.com/parthds02/analyzing-student-success-with-data
Discover key factors influencing student performance through data analysis and visualization. Explore gender, parental education, sports, and ethnicity impacts.
data datascience jupyter-notebook kaggle python pythonlibraries
Last synced: 06 May 2026
https://github.com/gaemapiracicaba/norma_dec_8468-76
Quality and effluent discharge standards for inland waters
Last synced: 19 Apr 2026
https://github.com/jbn/vaquero
A Python library for iterative and interactive data wrangling at laptop-scale.
data data-analysis data-cleaning data-mining dirty-data elt etl etl-framework
Last synced: 10 Jun 2026
https://github.com/nouraalgohary/fifa-world-cup-data-analysis
data dataanalysis powerbi powerbi-visuals
Last synced: 19 Mar 2026
https://github.com/paulrosset/cyclone
Network data consumption monitoring
data monitoring network networking
Last synced: 23 Aug 2025
https://github.com/canadaluke888/ttb2
TerminalTableBuilder 2
c17 csv data database datasets datautils json ncurses ods spreadsheet sqlite3 tables terminal terminaltablebuilder terminaltablebuilder2 ttb ttb2 ttbx xlsx
Last synced: 10 Apr 2026
https://github.com/ralzz/dibimbing_datascience
This project contains an Exploratory Data Analysis (EDA) of the Estonia Passenger List dataset. I handled missing values, removed duplicate data, and created basic visualizations to find insights.
data data-science eda google-colab kaggle pandas python
Last synced: 06 May 2026
https://github.com/alexyiann/finance
In this repository you can find scripts for pulling and comparing data, as well as simple Python scripts for automating crypto trades and backtesting trading strategies on both crypto and stocks.
api bots data database finance option option-strategies strategy trading trading-algorithms
Last synced: 03 Jan 2026
https://github.com/karolkrupa/javascript-orm-mapper
ORM mapping library, especially for REST APIs
api data data-mapper entity es6 javascript mapper model mongo mysql node nuxt orm relational rest typescript vue vuex
Last synced: 10 Apr 2026
https://github.com/namratha2301/sales-orders-analysis
Wanted to experiment with Looker. This dashboard visualizes sales trends across regions, customer segments, and product categories.
business-analytics dashboard data dataanalysis datavisualization excel looker looker-studio
Last synced: 13 Feb 2026
https://github.com/urvish-06/seaborn-dataset
Seaborn data sets
csv csv-files data data-science data-visualization dataset example jupyter-notebook jypyternotebook python seborn vacation
Last synced: 18 May 2026
https://github.com/neptun-software/neptun.data.generators
Send scraped data from neptun-scraper to ChatGPT to generate training data for NEPTUN.AI.
Last synced: 30 Jul 2025
https://github.com/h4fide/politicalcompassbot
This Python project allows you to take a quiz and find out where you fit on the political compass. Give it a try and see where you stand!
bot data greedy-algorithms politics python python3 sql telegram
Last synced: 19 Aug 2025
https://github.com/spajai/etl-sharepoint-data-uploader-pipeline
Custom Python script to pull specific data from a source and upload it to Microsoft SharePoint
data etl etl-pipeline microsoft microsoft365 python3 sharepoint sharepoint-online
Last synced: 11 Nov 2025
https://github.com/KarajMiglani-DataScientist/karajmiglaniFAKE-NEWS-DETECTION
FAKE_NEWS_PREDICTION
algorithms data data-science flask machine-learning probability-statistics python statistics structure
Last synced: 19 Aug 2025
https://github.com/europanite/gundam-forest
Random forest data analysis of the killed-in-action (KIA) rate of personnel in the Gundam world, modeled on the Titanic survival analysis.
data data-analysis data-science data-visualization death-rate death-rates gundam-model gundam-series gundom-forest jupyter jupyter-notebook kia kill-in-action python random-forest titanic titanic-kaggle titanic-survival titanic-survival-prediction
Last synced: 29 Jun 2026
https://github.com/wittyicon29/zeotap-ds-assignment
Internship application assignment
Last synced: 19 Aug 2025
https://github.com/e-kotov/albofr-data-archive
Tiger Mosquito Colonisation in France data
aedes-albopictus colonisation data france tiger-mosquito
Last synced: 23 May 2026
https://github.com/whis99/data_analysis_journey
A repository of my data analysis projects.
data data-analysis data-analysis-python data-visualization dataset jupyter-notebook matplotlib python visualization
Last synced: 07 May 2026
https://github.com/sunnahboy/checkfake_true_news
Building data structures using linked lists and arrays, and finding the best algorithms for implementing a fake news detection system
algorithms data level low programming structure
Last synced: 28 Feb 2026
https://github.com/progati00/marketing-mix-modeling-mmm-for-marketing-budget-optimization
A Marketing Mix Modeling (MMM) project using Python to analyze channel performance, calculate ROI, and simulate marketing budget changes for better business decisions. Includes a trained Linear Regression model, ROI analytics, and a Flask API for revenue prediction.
api budget-optimization data data-analysis data-science ecommerce eda flask jupyter-notebook linear-regression machine-learning marketing-analytics marketing-mix-modeling python roi-analysis vscode
Last synced: 14 Apr 2026
https://github.com/lab5e/loadabledata
Simple framework-agnostic wrapper around loadable data to help encapsulate and use state changes in a UI.
async data loadable state typescript ui
Last synced: 07 May 2026
https://github.com/madhuresh2011/genai-powered-data-analytics-by-tata
I recently participated in Tata iQ's job simulation on the Forage platform, and it was incredibly useful to understand what it might be like to be on a data analytics team in an AI transformation consulting role.
chatgpt data dataanalytics eda excel gemini generative-ai internships powerpoint presentation
Last synced: 14 Feb 2026