data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum
- Last updated: 2026-07-03 00:07:49 UTC
https://github.com/miguelmedinacastro/trabalho-dados-r
Final project for the Exploratory Data Analysis course
data data-science data-science-projects data-visualization database r rstudio
Last synced: 01 May 2026
https://github.com/syedzaheerabbas/jamboree-education-linear-regression
Using data from Jamboree, this project explores the relationship between applicant profiles (GRE, TOEFL, GPA, etc.) and their chances of admission to Ivy League graduate programs. Linear regression, Ridge, and Lasso regression are employed to build predictive models and identify key factors.
data eda linear-regression python visualization
Last synced: 01 May 2026
https://github.com/lut-ful/ibm-capstone-project-stack-overflow-job-survey
Final project for the IBM Data Analyst Professional Certificate program.
cognos data data-analytics looker power-bi python sql statics
Last synced: 01 May 2026
https://github.com/dnut/associations
Python 3 library to identify high-dimensional statistical relationships in any data set.
analytics arch-linux association-rules data data-analysis data-mining data-science machine-learning python-modules
Last synced: 01 May 2026
https://github.com/skygenesisenterprise/aether-meet
Aether Meet is a lightweight, open-source client built for privacy, speed, and seamless integration within the Aether Office ecosystem
applications data docker javascript meeting nextjs notes typescript voip
Last synced: 01 May 2026
https://github.com/gabrielf7/relogiohd
:watch: Clock with time and date
clock css data horario html javascript relogio relogio-hd relogio-javascript watch
Last synced: 01 May 2026
https://github.com/sorairolake/japanese-era-dataset
Dataset of Japanese eras
data dataset date japanese-calendar japanese-era json toml wareki yaml
Last synced: 01 May 2026
https://github.com/jose-mwangi/my-portfolio
my-portfolio
analytics aws data data-science excel seo-optimization vba-excel webscraping
Last synced: 28 Jul 2025
https://github.com/hafs96/prediction_consommation-de-carburant
The goal of this project is to develop a model that predicts whether a car has high or low fuel consumption based on its technical characteristics.
analysis data data-visualization machine-learning testing training
Last synced: 09 Jun 2026
https://github.com/mubashirsidiki/olympics-data-enigeering
Worked with Azure Data Factory, Databricks, Data Lake Storage, and Synapse Analytics to build an ETL pipeline for processing and analyzing Olympic Games data from Kaggle.
analytics azure big-data data dataengineering devops pipeline
Last synced: 02 May 2026
https://github.com/jesuscc1993/data-cleaner-extension
Clears browser data in a single click.
application-data chrome chrome-extension data
Last synced: 02 May 2026
https://github.com/anyantudre/associate-data-scientist-track
Materials for the Associate Data Scientist in Python track on DataCamp.
data data-science experimental-design hypothesis-testing machine-learning matplotlib-pyplot pandas python regression sampling seaborn statistics statsmodels unsupervised-learning
Last synced: 03 May 2026
https://github.com/charityeverett/gobackfetchit
Award-winning WebXR data journalism storytelling project
3d aframe ar css data html html-css-javascript nodejs visuzalization vr webxr xr
Last synced: 03 May 2026
https://github.com/arnavk-09/phishing-detection
🎣 Detect Phishing URLs with Data Pre-fitted... API & Web UI
csv data fastapi flask python scikit-learn
Last synced: 03 May 2026
https://github.com/qrailibs/dataflow
✨ Data processing in Node.js made multithreaded and type-safe.
data dataprocessing multithread node
Last synced: 04 May 2026
https://github.com/srking501/uk-groceries-images
Repository Containing UK Groceries Images
data groceries grocery images links playwright playwright-python webscraping-data webscrapper
Last synced: 04 May 2026
https://github.com/erictleung/tidytuesdays
:chart_with_upwards_trend: My attempts at #tidytuesday
data data-science data-visualization r rstats tables tidytuesday tidyverse
Last synced: 19 Sep 2025
https://github.com/rabeal21/tea
Generate random TEA wallet addresses in bulk with this simple utility. Perfect for testing and exploring the TEA blockchain. 🌱💻
bucklescript bucklescript-tea chinese-translation cli data earlgrey educators hacking ios-automation ios-test ocaml peer-evaluations php red-team teachyourselfcs test-framework translation tui
Last synced: 04 May 2026
https://github.com/sjg/my-search-story
My Search Story is a demo application developed for the Data Portability API Workshop and the #AISprint2025 events. #BuildwithAI
data docker generative-ai google-cloud-platform google-cloud-run nodejs
Last synced: 04 May 2026
https://github.com/gabya06/twitter_models
Repository used for twitter impression models
data data-science impressions machinelearning python ridge-regression sklearn twitter
Last synced: 04 May 2026
https://github.com/munas-git/codm-review-analysis-and-predictions
Sentiment analysis on Call of Duty Mobile Google Play Store user reviews with ML model to classify new reviews.
data flask machine-learning python sentiment-analysis
Last synced: 05 May 2026
https://github.com/chanchalsoorma/web-scraping
This repo aims to provide a straightforward, easy-to-use scraping code written in Python.
beautifulsoup beautifulsoup4 data python request selenium webscraping
Last synced: 05 May 2026
https://github.com/sohomm/predict-insurance-charges
A predictive model that estimates insurance charges from a client's attributes, such as age and health factors. It offers a practical application of machine learning in business, enabling more accurate pricing models and helping companies manage risk while delivering personalized pricing strategies to clients.
administration algorithm bot data decision-trees download easy finance github java machine-learning management model neural-network nlp prediction project science trading university
Last synced: 05 May 2026
https://github.com/julienmalka/shiftgenerator
ShiftGenerator WeSki 2018
data data-science latex python
Last synced: 06 May 2026
https://github.com/abhibisht89/data-visualization
data matplotlib pandas ploty python visualization
Last synced: 06 May 2026
https://github.com/tadiusfrank2001/pythonprojects
A compilation of fun introductory Python lab projects covering the fundamentals of data science, databases, and Python libraries
data data-science databases gamedesign python pythonlibrarires sorting-algorithms sqlite string-manipulation
Last synced: 06 May 2026
https://github.com/fabsdevx/file-format-converter-handout
Data Engineering project for learning purposes. Credits to itversity
csv csv-import data data-engineering database pandas python
Last synced: 06 May 2026
https://github.com/muhamedlabs/muhamed_onedrive
Muhamed_OneDrive is a reliable and convenient cloud storage service for files, designed for secure storage and easy data sharing.
data html5 onedrive programming style
Last synced: 04 Jan 2026
https://github.com/whis99/data_analysis_journey
A repository of my data analysis projects.
data data-analysis data-analysis-python data-visualization dataset jupyter-notebook matplotlib python visualization
Last synced: 07 May 2026
https://github.com/diegoperea20/datos-secuenciales-con-ia
Processing of one-dimensional signals with autoregressive models, 1D convolution, 2D convolution on the spectrogram, and recurrent networks
ai artificial-intelligence convolutional-neural-networks data ia secuential-data spectrogram uao
Last synced: 06 Feb 2026
https://github.com/jhermsmeier/node-leybold-xps
Parse & write the Leybold XPS data format
analysis data esca format leybold parser photoelectron-spectroscopy spectroscopy x-ray xps xpspeak xray
Last synced: 09 Jul 2025
https://github.com/neha-adnani/sql_music-store-analysis
SQL-based data analysis of a digital music store's sales and customer data.
business-analysis data data-analysis database follow-along-projects pgadmin4 portfolio-project postgres queries sql
Last synced: 18 Jun 2025
https://github.com/lab5e/loadabledata
Simple framework-agnostic wrapper around loadable data to help encapsulate and use state changes in a UI.
async data loadable state typescript ui
Last synced: 07 May 2026
https://github.com/filiprokita/tobase64
This Python program encodes a file in base64 format and saves the result to a new file with a ".b64" extension. It is a command-line tool that can be used to automate file encoding tasks.
base64 command-line data data-conversion data-manipulation data-privacy data-prottection data-security encoding file file-conversion file-handling python python-script python3 tobase64
Last synced: 30 Jun 2025
https://github.com/justinjjlee/simulation-discrete
Employing data transformations and simulations to answer random questions
analytics data data-science julia python simulation spark
Last synced: 30 Apr 2026
https://github.com/martinius96/meteostanica-odosielacie-scripty
Meteostanica (weather station) for Arduino, ESP8266, and ESP32: sender sketches for presenting data in a web interface.
arduino bme280 bmp280 data dht22 ds18b20 esp32 esp8266 espressif html meteo meteostanica mysel nodemcu php stanica teplota tlak vlhkost webstranka
Last synced: 11 Apr 2026
https://github.com/nisanth2004/springboot-kafka-real-world-project-wikimedia
A project that uses Apache Kafka to stream and process Wikimedia data.
async broker communication data java kafka message real-time real-time-analytics springboot wikimedia
Last synced: 14 May 2026
https://github.com/satyam4229/iit-and-nit-college-dataset
The dataset for IITs and NITs typically includes information related to these premier engineering institutions in India, such as their names, locations, rankings, academic programs offered, faculty details, student information, admission process, infrastructure and facilities, placements.
college-data csv data excel iit nit
Last synced: 04 Jan 2026
https://github.com/quantumudit/test-store-data-analysis
This repository showcases a web scraper with a pipeline structure for efficient data extraction and transformation from websites. The tool can be tailored to leverage its capabilities for insightful data analysis, providing valuable insights and informed decision-making.
data data-visualization dataanalytics python python-webscraping webscraper webscraping-data
Last synced: 11 Apr 2026
https://github.com/vidushibhadana/eda-on-nyc-taxi-data
An Exploratory Data Analysis (EDA) of New York City taxi data, visualized through countplots, distribution plots (displot), and histograms using Python and its libraries.
data data-visualization jupyter-notebook matplotlib numpy pandas python seaborn
Last synced: 11 Apr 2026
https://github.com/vaxdata22/foresight-institution
This is a Data Analysis case study done on the Foresight Institution dataset.
actionable-insights business-analytics business-intelligence data data-analytics data-cleaning data-mining data-processing data-visualization data-wrangling exploratory-data-analysis spreadsheets sql sql-server sql-server-management-studio statistical-analysis t-sql transact-sql
Last synced: 28 May 2026
https://github.com/srgchrksv/datacamp-projects
DataCamp projects
analytics data data-science dataanalysis education jupyter-notebook learning pandas projects python sql
Last synced: 06 May 2026
https://github.com/climate-resource/input4mips_validation
Validation of input4MIPs data
cmip data forcing input4mips validation
Last synced: 20 Jan 2026
https://github.com/barbosa89/vue-table
A classical data table component in VueJS and Bootstrap 4, optimized for Laravel applications.
bootstrap4 data datatable javascript laravel php table vuejs
Last synced: 11 Apr 2026
https://github.com/raphaellaude/usaschooldata
Cleaned and accessible school enrollment data for US schools
data duckdb duckdb-wasm education object-storage oss wasm
Last synced: 12 May 2026
https://github.com/csoren66/financial-budget-analysis
Financial budget for 2021
Last synced: 03 Mar 2025
https://github.com/yorkearwaker/data
Data things; representation, transformation, pipelines, governance,
actuality data epistemology information knowledge ontology
Last synced: 07 Apr 2025
https://github.com/oniani/miniframe
Minimal data frames with relational algebra
data dataframe-library haskell haskell-library library
Last synced: 04 Mar 2025
https://github.com/pocketfullofdata/electric-vehicles-market-size-analysis
This project analyzes the growth, adoption trends, and future projections of the electric vehicle (EV) market. Using data analysis and visualization techniques, it examines key factors like sales trends, and consumer adoption to understand the evolving landscape of the EV industry.
analysis data jupyter-notebook matplotlib numpy python seaborn vscode
Last synced: 07 May 2026
https://github.com/safwan2003/randomforest_heart_disease_prediction
A machine learning project using Random Forest Classifier to predict heart disease. Includes data preprocessing (with binning), feature selection, and model evaluation.
binning data data-science datapipeline datapreprocessing datavisaulization deep-learning machine-learning python random-forest-classifier streamlit
Last synced: 07 May 2026
https://github.com/kuanhungchen/spring-2019-data-structures
📦 Some programming assignments about basic data structures.
Last synced: 25 Feb 2025
https://github.com/mnazlukhanyan/da-projects
Portfolio of data analytics projects showcasing my skills, abilities, and experience
data data-vizualisation hypothesis-tests matplotlib pandas plotly postgresql product-metrics python scipy seaborn sql visualization
Last synced: 11 Apr 2026
https://github.com/agustinmusanti/sqlchallenge-7
Solution to an extensive SQL challenge set by instructor Diego Moisset De Espanes, who shares exercises for learning and practicing SQL Server on his YouTube channel.
challenge data learning sqlserver
Last synced: 15 Apr 2025
https://github.com/tomcardoso/journalism-data-intersection
A talk on working at the intersection of journalism and data science
data data-journalism journalism
Last synced: 15 May 2025
https://github.com/karo23361/toy-store-kpi-power-bi
PowerBI Portfolio Project
csv data data-visualization powerbi
Last synced: 03 Feb 2026
https://github.com/thicclatka/tetration
New file format for tensors
cli data fileformat mmap tensors
Last synced: 26 May 2026
https://github.com/gianlucatruda/titanic
An exhibition of my experience in data processing and visualisation. Python script to process and visualise the Titanic survivor data.
data database flask info matplotlib python science scrape server titanic visualisation web
Last synced: 10 Apr 2026
https://github.com/checco9811/data-engineering-bootcamp-homework
Homework solutions for DataExpert.io data engineering bootcamp
apache-spark data data-engineering sql
Last synced: 14 Mar 2025
https://github.com/karashiiro/lodestone-character-data-scraper
Lodestone character data scraper.
data ffxiv ffxiv-character lodestone
Last synced: 23 Apr 2026
https://github.com/bastianolea/cut_comunas
Updated version of the unique territorial codes (códigos únicos territoriales, CUT) for the country's communes and regions.
Last synced: 24 Jun 2026
https://github.com/diordany/spicemill
Tool for plotting Ngspice simulation results with Pyplot.
analysis data electrical-engineering electronics frontend integrated-circuit integrated-circuits ngspice plot plotting post-processing pyplot python raw simulation spice
Last synced: 13 Jan 2026
https://github.com/kalaspuff/ready
🎟 [not yet built] Take control of the event loop with simplified task management, queueing and data loading.
asyncio data dataloading event futures python python3 resolver tasks
Last synced: 10 May 2026
https://github.com/gkannan-codes/habitableexos
With Earth’s habitability under strain, we ask: which known exoplanets could humans live on? Using NASA’s Exoplanet Archive, we score planets 0–1 (1 ≈ Earth) from five Earth-normalized features to rank top candidates.
data html kaggle matplotlib-pyplot numpy pandas plotly python seaborn visualization
Last synced: 11 Apr 2026
https://github.com/halyusa16/mysql-employee-analysis
This project focuses on analyzing employee data through querying, performing table joins to connect related information, aggregating salary statistics, and using subqueries to extract meaningful insights.
data data-analytics data-exploration database mysql self-project sql
Last synced: 20 Jan 2026
https://github.com/mnkanout/patients_medication_prediction
The aim of the project is to create a model that can help medical professionals select the proper medication for patients based on their symptoms. The model uses historical data of other patients to predict what could be the most suitable medication based on the patient's symptoms.
data data-analysis data-science data-visualization decision-tree-classifier machine-learning python3
Last synced: 29 Jun 2025
https://github.com/jigyasag18/iit-guhawati
Empower Sakhi is a data-driven platform that uses machine learning to identify women at risk of domestic violence in India. It offers confidential self-assessments, survivor stories, and emergency resources through a trauma-informed, privacy-focused web app. The project also provides NGOs with actionable insights via a Power BI dashboard.
aiml data dataset datavisualization domestic-violence eda jupyter-notebook label-encoding machine-learning machine-learning-algorithms machine-learning-models machinelearning machinelearningprojects powerbi python python-app random-forest random-forest-classifier streamlit streamlit-webapp
Last synced: 08 May 2026
https://github.com/hit07/fitgpt-hacksc
AI-Powered Fitness Coach; 🥈 Runner up at HackSC's SoCal Tech Week hackathon
data elasticsearch gpt-4o-mini llm pipeline
Last synced: 28 Feb 2025
https://github.com/reshmaaiman/fifa
FIFA20
data data-science data-visualization dataanalysisusingpython github jupyter-notebook matplotlib numpy pandas python seaborn-python
Last synced: 10 Apr 2026
https://github.com/cmda-tt/course-25-26
🎓 tech track · 2025-2026 · curriculum and syllabus 📊
d3 data datavis functional javascript programming research svelte visualization
Last synced: 20 Jan 2026
https://github.com/nel-zi/nuga_bank
Developed an automated data exploration and cleaning pipeline for Nuga Bank to streamline data preparation, ensure consistent data quality, and normalize datasets into structured databases for efficient analysis and reporting.
data data-automation data-visualization datacleaning datatransformation etl-automation etl-pipeline
Last synced: 16 May 2025
https://github.com/liolb/sql2csv
Export SQL Server Table data to CSV
automation csv data database export extraction powershell scripting sql sql-server sql-table
Last synced: 08 May 2026
https://github.com/boratechlife/tensorflow-questions-datasets
A dataset of TensorFlow questions to help you practice machine learning and train models
data datapreprocessing datasets machinelearning modeltrain questions tensorflow
Last synced: 23 Mar 2025
https://github.com/roovedot/unet-cnn-for-road-segmentation
(In Progress) Unet architecture with CNNs (Convolutional Neural Networks) aimed at Road Segmentation
cnn cnn-for-visual-recognition cnn-pytorch computer-vision data data-engineering data-science unet unet-image-segmentation unet-pytorch
Last synced: 01 Jul 2025
https://github.com/thirza258/country-sdg
VOX ASTRA Submission : Country SDG
css d3 d3-visualization data django html python sdg social social-good un visualization
Last synced: 11 Apr 2026
https://github.com/miniql/miniql-inline
A MiniQL query resolver for inline data.
Last synced: 27 May 2026
https://github.com/chaewonkong/kaggle-competitions
Kaggle competitions and lessons
Last synced: 15 Mar 2025
https://github.com/justinyahin/wpdf
Create, filter, sort, and display user data on your WordPress site.
Last synced: 18 Apr 2026
https://github.com/nevoland/unchangeable
🧊 Tools for immutable values.
data datastructure functional immutable persistent pure stateless
Last synced: 24 Jul 2025
https://github.com/nmelgar/birthday_sports_dataviz
An analysis of how the Matthew effect has influenced professional sports players.
analysis csv data data-analysis data-science data-visualization datavisualization dataviz probability research tableau
Last synced: 08 Jan 2026
https://github.com/omarcodex/data_analysis
My repository of past and present research and data-driven projects.
data ecodev ecology science sustainability yale
Last synced: 18 Jan 2026
https://github.com/hemangsharma/dataanalysis
This repo contains analyses of NASDAQ data, including a dashboard and a time series forecast
analysis data data-analysis data-visualization python
Last synced: 10 Mar 2026
https://github.com/lablnet/alibaba_scraper
This is a robust web scraper that extracts data from the Alibaba website. It's multi-threaded and utilizes Playwright to efficiently scrape data from the website. This script is capable of scraping the entire Alibaba site, which would take approximately 4-6 months to complete.
alibaba data ecom mit-license open-source products scraper
Last synced: 15 Mar 2025
https://github.com/carlosrs14/parallel-data-preprocessig-system
A parallel data preprocessing system using threads and synchronization mechanisms (barrier, busy-waiting, condition variables) to clean and prepare data for AI training.
barrier-method c condition-variable data operative-systems parallel-computing posix preprocessing synchronization threads
Last synced: 24 Jul 2025
https://github.com/team-hydrogen/2025-adc-data
All files relating to the computation of the data provided
data jupyter-notebook nasa-app-development-challenge
Last synced: 11 Apr 2025
https://github.com/mwelwankuta/image-match
A multi-threaded tool for batch-renaming images based on their appearance and their match in a data source
data openai typescript worker-threads
Last synced: 09 Mar 2025