data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum,
- Last updated: 2026-07-01 00:07:35 UTC
- JSON Representation
https://github.com/code-str8/time-series-forecasting
Developing a model that effectively forecasts the unit sales of numerous items across various Favorita stores with precision.
data dataanalysis forcasting machine-learning time-series visualizations
Last synced: 31 Mar 2025
https://github.com/bryanhe24/data_analysis_app
A full-stack web application that allows users to upload CSV datasets, analyze the data with statistical summaries and visualizations, and interact with an AI-powered assistant for querying the dataset.
ai data data-analysis data-visualization fullstack-development javascript math python reactjs
Last synced: 07 May 2026
https://github.com/thingston/extractor
Collection of PHP classes to extract data from HTML pages.
Last synced: 14 Jan 2026
https://github.com/coderjolly/spotify-api-data-analysis
The project leverages Apache Airflow for automating Spotify API data analysis, focusing on user activity. Extracting, transforming, and loading data efficiently, it provides insights via PowerBI dashboards.
airflow airflow-dags data data-engineering etl etl-pipeline microsoft-sql-server power-bi python scripting sql
Last synced: 27 Mar 2026
https://github.com/mnkanout/patients_medication_prediction
The aim of the project is to create a model that can help medical professionals select the proper medication for patients based on their symptoms. The model uses historical data of other patients to predict what could be the most suitable medication based on the patient's symptoms.
data data-analysis data-science data-visualization decision-tree-classifier machine-learning python3
Last synced: 29 Jun 2025
https://github.com/muhammed-fazal/student-success-and-early-intervention-analytics-system
To consolidate scattered student performance records into a unified Data Warehouse in SQL Server. Engineer an Interactive Power BI dashboards that visualize academic trends, identifying student performance and implement predictive analytics.
analysis analytics dashboard data data-analysis data-engineering data-science data-visualization database etl etl-pipeline power-bi powerbi python sql sql-server
Last synced: 29 May 2026
https://github.com/kelvintechnical/web-scraper
Tableau Book Price Analysis
data data-analysis data-science tableau tableau-public
Last synced: 25 Jan 2026
https://github.com/reshmaaiman/fifa
FIFA20
data data-science data-visualization dataanalysisusingpython github jupyter-notebook matplotlib numpy pandas python seaborn-python
Last synced: 10 Apr 2026
https://github.com/rikiitokazu/dataprojects
Data analysis practice using SQL and Python
Last synced: 12 Apr 2026
https://github.com/rajlabmssm/echodata
echoverse module: Example data.
data echoverse fine-mapping genomics gwas qtl
Last synced: 17 Jan 2026
https://github.com/ayushverma135/dbms-labfile
Created for practical learning, this DBMS lab file offers hands-on exercises covering SQL queries, normalization, indexing, and more. With clear instructions and sample datasets, students gain invaluable experience in database design and management.
Last synced: 04 Feb 2026
https://github.com/steveanik/kestra
Infinitely scalable, event-driven, language-agnostic orchestration and scheduling platform to manage millions of workflows declaratively in code.
data data-engineering data-integration data-pipeline data-quality elt etl low-code orchestration pipelines scheduler workflow workflow-engine
Last synced: 06 Jan 2026
https://github.com/inzhenerka/scooters_data_generator
Generate data of scooter trips for analysis
Last synced: 02 Jun 2026
https://github.com/srgchrksv/stream-crypto
Crypto trades streaming with azure services
azure binance crypto data databricks dataengineering pyspark python streaming websocket
Last synced: 30 Apr 2026
https://github.com/vara-co/tech-certifications
These are the certifications that back-up some of my skills.
certificates certifications data data-analytics skills
Last synced: 07 Jan 2026
https://github.com/tjas/postgrad-ai-ddv-plotly
Jupyter Notebook to analyze the salaries of Federal District government public servants, using Python, Pandas and Plotly Express, to solve the proposed exercise in "Data Discovery and Visualization" discipline.
analysis analytics data data-analytics data-discovery data-science data-visualization graph graphs jupyter-notebook jupyter-notebooks pandas plotly plotly-express python
Last synced: 07 May 2026
https://github.com/afnanenayet/ds-a
Some interview prep I've been doing. This repo is reimplementations of algorithms and data structures in Python3
algorithms data interview prep python structures
Last synced: 05 Apr 2025
https://github.com/ineelhere/langchain-chat-with-your-data
LangChain Chat with Your Data course from DeepLearning.AI and LangChain
chatapplication chatgpt data deeplearning-ai deeplearning-notebooks jupyter-notebooks langchain langchain-python openai-api opensource personalised-learning python3
Last synced: 16 Apr 2026
https://github.com/doughtnerd/pod-old
Read and write Excel data
data data-analysis excel poi-library workbook
Last synced: 21 Jan 2026
https://github.com/nsandoya/python_scrp_project
This is a tool specially made for Dipaso ecommerce website. You can extract data from there, analyze it and see keywords, brands, and categories frecuency, prices distribution and other market tendencies as well —all in a group of friendly stadistic tables and graphics (exported from a Jupyter notebook) :)
beautifulsoup4 data data-analysis jupyter-notebook pandas python3
Last synced: 28 Apr 2026
https://github.com/illustratien/toolphd
Make your analysis simple and reproducible
academic analysis data phd publications r r-package reproducible-research scientific
Last synced: 26 Jan 2026
https://github.com/colesmcintosh/colesmcintosh.github.io
My portfolio site :)
ai automation data llms open-source
Last synced: 04 Mar 2026
https://github.com/miss-mhv/data-analysis-for-social-buzz
In this work, we focus on a small dataset extracted from a large enterprise dataset on social buzz.
Last synced: 14 May 2026
https://github.com/ompreetham/data-structures
binary-search-tree c data data-structures datastructures graph linked-list list stack structures tree
Last synced: 25 Mar 2025
https://github.com/jpcadena/palmer-penguins
Palmer Penguins
analytics csv data data-analytics data-science exploratory-data-analysis matplotlib numpy palmer-penguin pandas plotly pylint python seaborn visualization
Last synced: 11 Apr 2026
https://github.com/fatihemres/Africa
Africa app by SwiftUI. Using AVFoundation, MapKit, data, models, animations, stickers.
animations avfoundation data mapkit models swift swift-animations swiftui
Last synced: 31 Aug 2025
https://github.com/bcongdon/nid-data
National Inventory of Dams Data
data datasette government-data
Last synced: 21 Apr 2026
https://github.com/erickpeirson/jhb-data
Data from the forthcoming paper: Quantitative Perspectives on Fifty Years of the Journal of the History of Biology
data geolocation history-of-biology named-entity-recognition topic-modeling
Last synced: 04 Mar 2026
https://github.com/nadahamdy217/harvest-gaurd-plant-disease-detection-web-application
web application that help people grow healthy plants
classification-confidential cnn cnn-classification css data data-science detection html javascript keras machine-learning model plant-disease-detection supervised-learning tensorflow web-application
Last synced: 13 Apr 2026
https://github.com/nevoland/unchangeable
🧊 Tools for immutable values.
data datastructure functional immutable persistent pure stateless
Last synced: 24 Jul 2025
https://github.com/itrauco/data-dirtying-tool
a simple command line tool to generate dirty data and do common data things in google cloud
data data-analysis data-engineering data-ops data-pipeline data-science data-visualization data-wrangling dirty-data google-cloud machine-learning
Last synced: 24 Feb 2025
https://github.com/nmelgar/birthday_sports_dataviz
We will analyze how the Matthew Effect has influenced in professional sports players.
analysis csv data data-analysis data-science data-visualization datavisualization dataviz probability research tableau
Last synced: 08 Jan 2026
https://github.com/ismailhakkii/digital_vault
This project can be used for securing data, similar to a real vault.
data digital security-data vault
Last synced: 25 Mar 2025
https://github.com/carlosrs14/parallel-data-preprocessig-system
A parallel data preprocessing system using threads and synchronization mechanisms (barrier, busy-waiting, condition variables) to clean and prepare data for AI training.
barrier-method c condition-variable data operative-systems parallel-computing posix preprocessing synchronization threads
Last synced: 24 Jul 2025
https://github.com/lafkpages/minecraft-crafting-info
Scrapes https://www.minecraftcrafting.info for crafting recipes.
Last synced: 17 Jun 2026
https://github.com/muhammadadilnaeem/student-performance-indicater-end-to-end-data-science-project
This project leverages data science techniques to build a predictive model that estimates a student's exam performance. The project follows a structured data science workflow, including data collection, preprocessing, model building, evaluation, and deployment.
data machine-learning-algorithms pandas pymysql python sql
Last synced: 11 Apr 2026
https://github.com/dhimmel/adeptus
ADEPTUS -- differential gene expression signatures of disease
adeptus data differential-expression disease gene-expression genes rephetio
Last synced: 05 Jan 2026
https://github.com/maximiliancw/completely
Measure your data completeness
data data-cleaning data-quality data-science missing-data
Last synced: 25 Jun 2025
https://github.com/snimmagadda1/luigi-etl-example
🔍 Example of an ETL pipeline using Spotify's Luigi
data luigi luigi-pipeline python spotify
Last synced: 30 Mar 2025
https://github.com/lancewalk87/cls-cloud-sync-ruby-on-rails
Software | SQL Database with automated Cloud Sync for mitigating lost data across dist. servers. Managed by Ruby on Rails.
cloud-computing cloud-storage data database ruby ruby-application ruby-on-rails server sql
Last synced: 24 Jul 2025
https://github.com/etmendz/mendz.data.oracle
Provides a generic Mendz.Data-aware context for ADO.Net-compatible access to Oracle databases.
ado-net context data database datasettings mendz oracle
Last synced: 13 Apr 2026
https://github.com/ksimicevic/discord-message-analyzer
Analyzing discord messages in Jupyter notebook
analysis data discord messages
Last synced: 16 Apr 2026
https://github.com/unknownsoup/budget_tracker
A personal budget tracker to build my knowledge of working with databases and data analysis. In this case using SQL and python for the analysis.
data data-science databases python sql
Last synced: 26 Jan 2026
https://github.com/gagolews/clustering-data-v0
Datasets for Clustering [DEPRECATED – A NEW VERSION IS AVAILABLE]
clustering data dataset machine-learning
Last synced: 15 Sep 2025
https://github.com/cljoly/data
📊 Data sets to populate some parts of my website (mostly https://cj.rs/open-source/).
Last synced: 03 May 2026
https://github.com/lohithgsk/dynamic-qr-generator
A Python-based QR generator application was developed using the qrcode and Pillow libraries, dynamically generating QR codes for custom data inputs. Designed for a college grievance management system, the application creates QR codes containing block, floor, room, and machine numbers, allowing easy placement and identification on each floor.
data pillow python qrcode qrcode-generator
Last synced: 16 Mar 2025
https://github.com/udhaya2823/microsoft---classifying-cybersecurity-incidents-with-machine_learning
🚨Microsoft: Classifying Cybersecurity Incidents with Machine Learning🔐 This project leverages the power of Machine Learning to classify cybersecurity incidents 🚨, improving the efficiency of Security Operation Centers (SOCs) at Microsoft. We train a model to predict incident grades, helping analysts prioritize threats with precision🎯.
classification data feature-engineering iqr-method machine-learning matplotlib model-evaluation modelselection predictive-modeling python sklearn
Last synced: 17 Apr 2026
https://github.com/lightdash/quickstart-github
Instant analytics for Github
analytics business-intelligence data dbt github
Last synced: 14 Sep 2025
https://github.com/lananolana/test_data_generator
Generate test data with Telegram bot in one click: random users, files, texts and credit cards.
credit-card data data-generation fake-data random telegram-bot test-data test-data-generator test-file-generator testing testing-tools text-generation user-generator
Last synced: 18 Jan 2026
https://github.com/jph5396/sumomodel
A data models related to sumo wrestling.
Last synced: 17 Jan 2026
https://github.com/sungchun12/demotron
CLI to delight real people with live demos
Last synced: 26 Feb 2025
https://github.com/rorylshanks/devdb-client
This is the repository for the official command line client for DevDB (https://devdb.cloud)
cloud data database-management development
Last synced: 29 May 2026
https://github.com/mai-space/design-concept-sharing-recipes
🖼️ Concept for a framework based on state of the art technology and libaries for secure data sharing and online collaboration, as well as focus on the ux and ui of said framework
concept content-map data datasharing framework hci mci mock-up navigation-map peer-to-peer screendesign userstories
Last synced: 14 May 2025
https://github.com/irsol/udacity-data-foundations-nd
data data-analysis data-visualization exel sql udacity udacity-data udacity-nanodegree
Last synced: 05 Mar 2026
https://github.com/jigyasag18/iit-guhawati
Empower Sakhi is a data-driven platform that uses machine learning to identify women at risk of domestic violence in India. It offers confidential self-assessments, survivor stories, and emergency resources through a trauma-informed, privacy-focused web app. The project also provides NGOs with actionable insights via Power BI dashboard for support.
aiml data dataset datavisualization domestic-violence eda jupyter-notebook label-encoding machine-learning machine-learning-algorithms machine-learning-models machinelearning machinelearningprojects powerbi python python-app random-forest random-forest-classifier streamlit streamlit-webapp
Last synced: 08 May 2026
https://github.com/mustafaozvardar/selenium-eksisozluk
This project is a simple web scraper built with Python using Selenium. It extracts and prints the content of popular entries from a specific EksiSozluk page.
data python selenium selenium-python
Last synced: 29 Apr 2026
https://github.com/liolb/sql2csv
Export SQL Server Table data to CSV
automation csv data database export extraction powershell scripting sql sql-server sql-table
Last synced: 08 May 2026
https://github.com/zsvoboda/olympics
Self service analytics of 120 years of Olympics data
analytics dashboards data datavisualization dataviz olympics open-data open-datasets opendata reports
Last synced: 08 May 2026
https://github.com/heyimsteve/solnftdatadash
This a React-based web application that provides detailed information about NFT collections on the Solana blockchain. It uses the HelloMoon API to fetch and display data about NFT collections, including statistics, loan summaries, ownership information, and floor prices.
dashboard data hellomoon nft react solana solana-nft
Last synced: 30 Jan 2026
https://github.com/lucasnbsb/data-structures-and-algorithms
Studying data structures and algorithms, mostly on leetcode
Last synced: 29 Aug 2025
https://github.com/ompreetham/fylo-data-storage-component
Flyo Data Storage Component Challenge on Frontend Mentor.io.
component css data front-end front-end-development frontend frontend-mentor frontendmentor-challenge fylo html react render scss storage vite website
Last synced: 11 Apr 2026
https://github.com/inphyt/quantitative_single_neuron_modeling_competition_2007
Data for the Quantitative Single-Neuron Modeling Competition (2007).
bayesian-inference bayesian-methods bayesian-optimization bayesian-statistics challenge competition computational-neuroscience data electrophysiological-data electrophysiology model-calibration modeling neuronal-models neuroscience neuroscience-competition parameter-estimation simulation simulation-modeling single-neuron-model uncertainty-quantification
Last synced: 26 Jul 2025
https://github.com/dineshram0212/youtube-analysis
This YouTube Analysis Package provides tools for analyzing YouTube video data, including metrics on views, likes, comments, and engagement trends. Ideal for gaining insights into video performance and audience interaction patterns.
data data-visualization pandas python webscraping youtube-api-v3
Last synced: 19 Jun 2026
https://github.com/sakshamarora07/blinkit-sales-report-power-bi
This dashboard provides Blinkit with insights to optimize its grocery delivery operations and understand customer preferences. It evaluates sales trends, outlet performance, and item categories to identify key areas for improvement. The interactive visuals allow detailed exploration of sales distribution, customer ratings, and product popularity.
data data-science dataanalytics datavisualization excel powerbi sql
Last synced: 08 Jan 2026
https://github.com/living-with-machines/zoonyper
Code to make it easy to import and process Zooniverse annotations and their metadata in Python/Jupyter Notebooks
crowdsourcing data data-processing data-science python zooniverse
Last synced: 04 Jul 2025
https://github.com/radekbednarik/att
Python wrapper for calling Apitalks API.
api-wrapper apitalks data python3 rest-api wrapper
Last synced: 05 Apr 2025
https://github.com/joshuagilgallon/cam-data
Large collection of data about digital cameras
Last synced: 17 Apr 2026
https://github.com/debjyotisaha/tableau-projects-phase-2
Published interactive dashboards on Tableau Public, highlighting expertise in data visualization and storytelling through analyses of transportation patterns, sales trends, and demographic studies. These projects showcase the ability to transform complex datasets into actionable, intuitive visuals for decision-making.
dashboards data data-analysis data-visualisation tableau
Last synced: 26 Aug 2025
https://github.com/mateuszskoczek/generatorcsv
GeneratorCSV is a students and teachers data converter for Microsoft 365 Admin Center. The project was implemented for Sobolew High School.
admin converter data microsoft365 python school tkinter
Last synced: 26 Aug 2025
https://github.com/zeptosec/bpscrapper
Shows history of oil prices
data data-visualization database nodejs scraper
Last synced: 13 Apr 2026
https://github.com/prishabhanot/facial_recognition_pca
A face recognition system using Principal Component Analysis (PCA) for dimensionality reduction and a Support Vector Machine (SVM) classifier for classification. PCA extracts essential features (eigenfaces) from facial images, significantly reducing computational complexity while retaining critical information for accurate recognition.
data eigenfaces facial-recognition pca python reducing-computational-complexity reducing-data-dimensions svm-classifier
Last synced: 01 Mar 2025
https://github.com/vatshayan/youtube-user-analysis
Analysis of Youtube Users about their choice and preferences
data data-analysis data-mining data-science data-visualization dataset machine-learning machine-learning-algorithms
Last synced: 05 Feb 2026
https://github.com/stdlib-js/ndarray-vector-uint32
Create an unsigned 32-bit integer vector (i.e., a one-dimensional ndarray).
constructor ctor data javascript ndarray node node-js nodejs stdlib structure types uint32 vec vector
Last synced: 25 Apr 2026
https://github.com/0xnu/data-analyst-training
The repository contains training materials for data analysts.
data data-analysis data-analyst
Last synced: 25 Aug 2025
https://github.com/franckalbinet/maris-crawlers
Automated data harvesting of MARIS data sources
automation data marine-radioactivity
Last synced: 25 Aug 2025
https://github.com/sakshamarora07/whatsapp-chat-analyser
This repository contains code for a WhatsApp Chat Analyzer that uses Python libraries to extract insights from chat messages.
chat data dataanalytics datascience matplotlib pandas python seaborn statistics streamlit whatsapp
Last synced: 04 Jan 2026
https://github.com/amethyst-php/company
amethyst amethyst-package api company data laravel
Last synced: 17 Apr 2026
https://github.com/jigyasag18/fake-news-prediction-app
The Fake News Prediction App Repository offers a machine learning project that focuses on identifying the authenticity of news articles as fake or real. It uses a dataset of 20,000 articles and employs methods such as TF-IDF vectorization and the Lemmatization algorithm, achieving ~95% classification accuracy with random forest classifier model
data datapreprocessing logistic-regression machine-learning machine-learning-algorithms numpy pandas prediction stemming streamlit streamlit-webapp vectorization
Last synced: 11 Apr 2026
https://github.com/inist-cnrs/ws-data
Modèles et données pour les web services
Last synced: 03 Sep 2025
https://github.com/anuragagarwal96/hospital-mortality-rate-sql-analysis
In this project, I have taken a hospital dataset from Kaggle, analysed it and predicted the mortality rate of patients who have been admitted in hospitals. I have utilised a combination of SQL, Tableau and Microsoft Excel for this project.
data data-visualization dataanalysis dataanalysisusingsql excel msexcel mssqlserver sql tableau tableau-public
Last synced: 09 Mar 2026
https://github.com/paulrosset/cyclone
Network data consumption monitoring
data monitoring network networking
Last synced: 23 Aug 2025
https://github.com/desoga10/nety-form
In this tutorial, I show you how to send data from a form to the Netlify dashboard. I also show you how to create a form using Materialize.
contact-form css css3 data form forms html html5 materialize materialize-css materializecss-framework netlify
Last synced: 03 Jan 2026
https://github.com/deliprofesor/virtual-reality-in-education-impact-analysis-and-insights
This project examines the impact of Virtual Reality (VR) on education, focusing on its effects on student engagement, learning outcomes, and creativity. It uses data analysis techniques like descriptive statistics, correlation analysis, and clustering to assess VR's effectiveness in enhancing learning.
clustering data data-analysis data-science data-visualization exploratory-data-analysis hypothesis-testing machine-learning python regression-analysis virtual-reality
Last synced: 14 Jun 2025
https://github.com/cnr-ibba/smarter-repository
SMARTER Data Repository
bootstrap5 data django repository smarter
Last synced: 03 Apr 2026
https://github.com/otoneko1102/roulette-base
ルーレットの色と番号をjson形式でまとめたものです。カジノ風ルーレットを作るときにどうぞ。A collection of roulette colors and numbers in json format. Use it when making a casino-style roulette.
casino casino-games data json require roulette
Last synced: 16 Mar 2025
https://github.com/vishwas-chakilam/twitter-sentiment-analysis
Twitter Sentiment Analysis is a Python project that analyzes the sentiment of tweets based on a user-defined keyword. It uses Tweepy to fetch tweets from the Twitter API and TextBlob for sentiment analysis. The application features a user-friendly GUI with Tkinter, displaying tweet sentiment as positive, negative, or neutral.
api data data-science dataanalysis python3 textblob-sentiment-analysis tkinter tweepy-api
Last synced: 11 Mar 2025
https://github.com/sanogotech/open-source-data-stack
modern open source data stack
airbyte airflow data data-science dbt docker postgresql python
Last synced: 11 Apr 2026
https://github.com/karosi12/ng-data-share
Angular communication with input and output properties
angular communication data data-binding input output sharing typescript
Last synced: 16 Jan 2026
https://github.com/climate-resource/input4mips_validation
Validation of input4MIPs data
cmip data forcing input4mips validation
Last synced: 20 Jan 2026
https://github.com/idhruvs/angular4-smart-table-demo
Angular4 Smart Table Demo Project
angular4 data tables typescript
Last synced: 21 Apr 2026