data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum,
- Last updated: 2026-07-03 00:07:49 UTC
- JSON Representation
https://github.com/lohithgsk/dynamic-qr-generator
A Python-based QR generator application was developed using the qrcode and Pillow libraries, dynamically generating QR codes for custom data inputs. Designed for a college grievance management system, the application creates QR codes containing block, floor, room, and machine numbers, allowing easy placement and identification on each floor.
data pillow python qrcode qrcode-generator
Last synced: 16 Mar 2025
https://github.com/charon25/weatherdata
17 000 weather measurements collected by a weather station created for a college project.
csv data dataset datasets json measurements strasbourg weather weather-data
Last synced: 16 Jan 2026
https://github.com/zulfachafidz/telco_churn_insight_customer_loss_prediction_with_random_forest_and_decision_tree-algorithms
The main problem in the business world is customer churn, or losing customers, especially in the telecommunications industry, which experiences very tight competition. To overcome this problem, an analysis was carried out to help the company understand how many customers have the potential to switch providers.
data data-science data-visualization dataanalysis dataanalyst dataanalytics datadrivenwithdataprovider decision-tree decision-tree-classifier decision-trees random-forest random-forest-classifier
Last synced: 01 May 2026
https://github.com/thingston/extractor
Collection of PHP classes to extract data from HTML pages.
Last synced: 14 Jan 2026
https://github.com/jacoblincool/moodle-export
A streamlined library for retrieving data from Moodle.
Last synced: 07 May 2025
https://github.com/kuanhungchen/spring-2019-data-structures
📦 Some programming assignments about basic data structures.
Last synced: 25 Feb 2025
https://github.com/tn3w/moviedb-json
A JSON library with 981,530 films.
data database db json movie movie-database movies
Last synced: 03 May 2026
https://github.com/prajakta1321/streetml-a-cityscape-traffic-volume-prognostication
StreetML leverages ML learning techniques to revolutionize urban traffic prediction through precise volume prognostication, aiming to enhance cityscape mobility through data-driven insights.
catboostregressor data datavisualisation exploratory-data-analysis lightgbm-regressor linearregression machine-learning machine-learning-algorithms predictive-analytics random-forest-regression xgboost-regression
Last synced: 08 Apr 2025
https://github.com/filiprokita/foldertoiso
Python script that converts a specified folder into an ISO.
automation command-line-interface command-line-tool compression cross-platform data file-system folder-to-iso iso iso-image iso-tool python python-cli python-script python3 shutil utility
Last synced: 24 Mar 2025
https://github.com/etmendz/mendz.data
Provides tools and guidance for creating data access contexts and repositories.
context data datasettings entity-framework mendz paginginfo repository resultinfo
Last synced: 11 Jun 2025
https://github.com/dms-codes/scrape-kesaintblanc-id
Kesaintblanc Data Scraper This Python script is designed to scrape product data from the Kesaintblanc website. It collects information about products, including product name, URL, price, image URLs, status, stock, and more. The scraped data is saved to a CSV file for further analysis.
data kesaintblanc python webscraper
Last synced: 27 May 2026
https://github.com/team-hydrogen/nasa-adc-data
All files relating to the computation of the data provided
data jupyter-notebook nasa-app-development-challenge
Last synced: 25 Mar 2025
https://github.com/csoren66/financial-budget-analysis
Financial budget for 2021
Last synced: 03 Mar 2025
https://github.com/vlamug/ratibor
Ratibor is a service for making metrics from data
Last synced: 10 Mar 2026
https://github.com/srgchrksv/datacamp-projects
Datacamps projects
analytics data data-science dataanalysis education jupyter-notebook learning pandas projects python sql
Last synced: 06 May 2026
https://github.com/vaxdata22/foresight-institution
This is a Data Analysis case study done on the Foresight Institution dataset.
actionable-insights business-analytics business-intelligence data data-analytics data-cleaning data-mining data-processing data-visualization data-wrangling exploratory-data-analysis spreadsheets sql sql-server sql-server-management-studio statistical-analysis t-sql transact-sql
Last synced: 28 May 2026
https://github.com/satyam4229/iit-and-nit-college-dataset
The dataset for IITs and NITs typically includes information related to these premier engineering institutions in India, such as their names, locations, rankings, academic programs offered, faculty details, student information, admission process, infrastructure and facilities, placements.
college-data csv data excel iit nit
Last synced: 04 Jan 2026
https://github.com/bkataru/spotigo
AI-powered local music intelligence platform with a task runner server core to retrieve and backup spotify account data to storage(s) at set periodic intervals
ai backup cron data go intelligence local-llm music ollama rag runner spotify task-runner tool-calling
Last synced: 16 Jan 2026
https://github.com/gappeah/layoffs-exploratory-data-analysis
This project uses MySQL to perform data cleaning and exploratory data analysis (EDA) on a dataset detailing company layoffs. The primary goal is to process, clean, and explore the data to gain insights into trends and patterns related to layoffs across various sectors.
data dataanalysis eda mysql sql
Last synced: 12 Jul 2025
https://github.com/keminghe/osu
Unofficial and publicly-available NPM data-package about The Ohio State University.
college data majors ohio-state organizations public students university unofficial
Last synced: 06 Jan 2026
https://github.com/zulfachafidz/titanic_explorer_predicting_survival_with_classification_using_knn_algorithm
Tracking Life Safety with the KNN Predictive Analysis Approach. Leveraging the Titanic Dataset, we apply classification analysis to predict the fate of passengers based on a variety of features.
algorithm algorithms data data-analysis data-mining data-science datamodeling datapreprocessing dataset knn-algorithm knn-classification machine-learning machine-learning-algorithms prediction-model
Last synced: 01 Sep 2025
https://github.com/jneidel/nationalities
Dataset of 100 common nationalities
data dataset json nationalities nationality opendata
Last synced: 25 Mar 2025
https://github.com/victorowinoke/custmer-segmentation-using-rfm-python-
Customer Segmentation using the Recency, Frequency and Monetary Values
customer-segmentation data data-visualization python3 science time-series-analysis
Last synced: 26 May 2026
https://github.com/acovaci/orbit
ORBIT: an Open source Rust-based implementation of a data Build Tool, inspired by DBT
cargo clap-rs data data-warehouse dbt rust rust-lang tokio-rs
Last synced: 16 Mar 2025
https://github.com/pawal/tldmonitor-ui-go
Web UI for TLDMonitor
analysis data dns go golang mongodb statistics webapp website
Last synced: 16 Jan 2026
https://github.com/elimu-ai/ml-event-simulator
🤖 Simulation of learning events and assessment events
data learning-analytics machine-learning ml
Last synced: 28 Feb 2025
https://github.com/tdjsnelling/hermes
Hermes is a real-time data framework for React + MongoDB
data docker framework mongodb nodejs react react-hooks reactjs real-time typescript websocket
Last synced: 12 Apr 2026
https://github.com/jhermsmeier/node-leybold-xps
Parse & write the Leybold XPS data format
analysis data esca format leybold parser photoelectron-spectroscopy spectroscopy x-ray xps xpspeak xray
Last synced: 09 Jul 2025
https://github.com/saikatharryc/motionchart-d3js
A dynamic Motion chart Built with D3 js.
Last synced: 23 Dec 2025
https://github.com/filiprokita/tobase64
This Python program encodes a file in base64 format and saves the result to a new file with a ".b64" extension. It is a command-line tool that can be used to automate file encoding tasks.
base64 command-line data data-conversion data-manipulation data-privacy data-prottection data-security encoding file file-conversion file-handling python python-script python3 tobase64
Last synced: 30 Jun 2025
https://github.com/dug22/jjournal
A Jupyter like notebook software for Java
data data-analysis data-science java jshell jshell-repl notebook swing swing-application
Last synced: 11 Apr 2026
https://github.com/martinius96/meteostanica-odosielacie-scripty
Meteostanica - Arduino, ESP8266, ESP32 - odosielanie sketche pre reprezentáciu dát vo webovom rozhraní.
arduino bme280 bmp280 data dht22 ds18b20 esp32 esp8266 espressif html meteo meteostanica mysel nodemcu php stanica teplota tlak vlhkost webstranka
Last synced: 11 Apr 2026
https://github.com/yash-chauhan-dev/sf_analytics
Business teams often rely on data analysts to extract insights using SQL. This tool eliminates that dependency by bridging the gap between humans and data using AI.
aiml analytics data dbt langchain llm python snowflake streamlit
Last synced: 07 May 2026
https://github.com/luciarevaliente/shell_script_data_cleaning
This project focuses on cleaning and processing datasets using Shell scripts. It is part of the Fundamentals of Informatics course (2022-23) and involves handling movie and show data to create cleaned and filtered datasets for further analysis.
data data-cleaning shell-script
Last synced: 04 Feb 2026
https://github.com/getconversio/dig-the-data
Data visualizations for the Conversio blog
Last synced: 12 Apr 2026
https://github.com/oliver021/helppad-net
Versatile .NET Toolkit: A Comprehensive Set of Miscellaneous Helpers, Classes, and Utilities
assert async checks cryptographic-algorithms data date dotnet fluent functional functional-programming hash helpers parallel pipe pipeline pointers review supports tasks
Last synced: 15 Jun 2026
https://github.com/thewillyhuman/willyos-java
willyOS for java developers
collections data data-structures java os structures
Last synced: 12 Jun 2025
https://github.com/wisdom-osborn/data-analytics-course-online-
🔍 Data Analytics with Python — Hands-on Course Materials Jupyter notebooks, projects, and datasets based on the freeCodeCamp Data Analysis with Python certification. Learn NumPy, Pandas, data cleaning, and visualization through real-world examples
data data-analysis data-science data-visualization freecodecamp numpy pandas pandas-dataframe project python
Last synced: 19 Apr 2026
https://github.com/vidushibhadana/eda-on-nyc-taxi-data
About Conducting an Exploratory Data Analysis (EDA) on New York City taxi data and visualizing it through countplots, distribution plots (displot), and histograms using Python and it's libraries.
data data-visualization jupyter-notebook matplotlib numpy pandas python seaborn
Last synced: 11 Apr 2026
https://github.com/matheusafonseca/deploy-ml-models-with-streamlit-udemy
This repository is dedicated to storing the code developed during the "Machine Learning Model Deployment with Streamlit" course on Udemy. The course covers basic to advanced techniques for deploying machine learning models using Streamlit.
data data-science data-visualization interface joblib layout machine-learning optimization-algorithms python python3 sklearn sklearn-datasets sklearn-library sklearn-pipeline streamlit
Last synced: 19 Apr 2026
https://github.com/rohitblaze10/netflix_analysis_using_tableau
The Netflix dashboard in Tableau provides a professional and visually captivating interface for users to explore a vast collection of TV shows and series. With seamless navigation and interactive filters, users can easily personalize their recommendations based on release year, genre, duration, and rating.
data data-analysis data-science data-visualization netflix tableau
Last synced: 04 Feb 2026
https://github.com/trissim/polystore
Framework-agnostic multi-backend storage abstraction for ML and scientific computing
backend data io jax ml multi-framework numpy pytorch scientific-computing storage tensorflow zarr
Last synced: 12 Apr 2026
https://github.com/climate-resource/input4mips_validation
Validation of input4MIPs data
cmip data forcing input4mips validation
Last synced: 20 Jan 2026
https://github.com/arnavk-09/phishing-detection
🎣 Detect Phishing URLs with Data Pre-fitted... API & Web UI
csv data fastapi flask python scikit-learn
Last synced: 03 May 2026
https://github.com/nikolatechie/spotify-playlist
Data pipeline that fetches recently played songs in the past 24 hours using Spotify API and saves the data in the SQLite database. Scheduled to run daily using Apache Airflow.
apache-airflow api data data-engineering python spotify sql sqlite
Last synced: 30 Apr 2026
https://github.com/davorg/cookingvinyl
Web site with info about Cooking Vinyl records
cooking-vinyl data hacktoberfest music perl
Last synced: 02 Apr 2025
https://github.com/syed-bakhtawar-fahim/dsa_algorithm_code
Assalam o Alikum Guys, This is the repo of Data Structure and Algorithm in C programming language. I hope it will help you in learning Data Structure and Algorithm in C. I'm also learning Data Structure and algorithm in Python in better and easy way you can also explore it
algorithm algorithms-and-data-structures c data data-structures-and-algorithms dsa-algorithm dsa-learning-series dsa-practice
Last synced: 12 Apr 2025
https://github.com/sanogotech/open-source-data-stack
modern open source data stack
airbyte airflow data data-science dbt docker postgresql python
Last synced: 11 Apr 2026
https://github.com/jigyasag18/fake-news-prediction-app
The Fake News Prediction App Repository offers a machine learning project that focuses on identifying the authenticity of news articles as fake or real. It uses a dataset of 20,000 articles and employs methods such as TF-IDF vectorization and the Lemmatization algorithm, achieving ~95% classification accuracy with random forest classifier model
data datapreprocessing logistic-regression machine-learning machine-learning-algorithms numpy pandas prediction stemming streamlit streamlit-webapp vectorization
Last synced: 11 Apr 2026
https://github.com/code-str8/time-series-forecasting
Developing a model that effectively forecasts the unit sales of numerous items across various Favorita stores with precision.
data dataanalysis forcasting machine-learning time-series visualizations
Last synced: 31 Mar 2025
https://github.com/sakshamarora07/whatsapp-chat-analyser
This repository contains code for a WhatsApp Chat Analyzer that uses Python libraries to extract insights from chat messages.
chat data dataanalytics datascience matplotlib pandas python seaborn statistics streamlit whatsapp
Last synced: 04 Jan 2026
https://github.com/inphyt/quantitative_single_neuron_modeling_competition_2007
Data for the Quantitative Single-Neuron Modeling Competition (2007).
bayesian-inference bayesian-methods bayesian-optimization bayesian-statistics challenge competition computational-neuroscience data electrophysiological-data electrophysiology model-calibration modeling neuronal-models neuroscience neuroscience-competition parameter-estimation simulation simulation-modeling single-neuron-model uncertainty-quantification
Last synced: 26 Jul 2025
https://github.com/unkaktus/pktconn
wrapper around io.ReadWriteCloser that implements gopacket's 'device'
connection data gopacket packet
Last synced: 29 May 2026
https://github.com/mierune/tinybufr
[WIP] A Rust library for decoding BUFR (Binary Universal Form for the Representation of meteorological data) files.
bufr data meteorology rust weather wmo
Last synced: 15 May 2025
https://github.com/pietrapaz/bootcamp_dio_ciencia_de_dados
Bootcamp Potência Tech powered by iFood | Ciência de Dados - Dio ⚠️
cienciadedados dados data datascience python
Last synced: 09 Apr 2025
https://github.com/woctezuma/steamspy-data
Data snapshot from SteamSpy.
data data-dump data-dumps steam steam-data steamspy steamspy-api
Last synced: 07 Jan 2026
https://github.com/muhammed-fazal/student-success-and-early-intervention-analytics-system
To consolidate scattered student performance records into a unified Data Warehouse in SQL Server. Engineer an Interactive Power BI dashboards that visualize academic trends, identifying student performance and implement predictive analytics.
analysis analytics dashboard data data-analysis data-engineering data-science data-visualization database etl etl-pipeline power-bi powerbi python sql sql-server
Last synced: 29 May 2026
https://github.com/muhammadadilnaeem/student-performance-indicater-end-to-end-data-science-project
This project leverages data science techniques to build a predictive model that estimates a student's exam performance. The project follows a structured data science workflow, including data collection, preprocessing, model building, evaluation, and deployment.
data machine-learning-algorithms pandas pymysql python sql
Last synced: 11 Apr 2026
https://github.com/itrauco/data-dirtying-tool
a simple command line tool to generate dirty data and do common data things in google cloud
data data-analysis data-engineering data-ops data-pipeline data-science data-visualization data-wrangling dirty-data google-cloud machine-learning
Last synced: 24 Feb 2025
https://github.com/rikiitokazu/dataprojects
Data analysis practice using SQL and Python
Last synced: 12 Apr 2026
https://github.com/publici/state-integrity-data
Data from a comprehensive assessment of state government accountability and transparency
Last synced: 04 Feb 2026
https://github.com/jpb06/kubot-dal
data data-access-layer gulp-tasks mongodb typescript
Last synced: 12 Apr 2026
https://github.com/theanujsinha01/mcdonalds-customer-analysis
This project analyzes customer feedback data to understand what drives people to like or dislike McDonald’s. Using Python and data visualization tools in a Jupyter Notebook, we explore how different factors—such as taste, price, health, and visit frequency—affect customer satisfaction.
case-study data data-visualization dataanalysis
Last synced: 05 Sep 2025
https://github.com/yash-rewalia/airbnb_eda_pandas
The goal of the project is to gather information and analyze the detailed information of the different entries in order to provide insights about the host and price of the property in a particular area as per your preference , type of rooms and number of reviews accordingly.
data data-cleaning data-insights data-preprocessing data-visualization matplotlib numpy pandas python seaborn
Last synced: 11 Apr 2026
https://github.com/kahlery/my-jupyter-notebook-projects
🐊 collection of my data science analysis, actually I store most of my data science projects in my google drive because of google colab
Last synced: 12 Apr 2026
https://github.com/miraclx/split-merge
Efficient, flexible data stream chunker and merger
chunk data efficient merge middleware nodejs pipeline split stream
Last synced: 07 May 2026
https://github.com/doughtnerd/pod-old
Read and write Excel data
data data-analysis excel poi-library workbook
Last synced: 21 Jan 2026
https://github.com/nsandoya/python_scrp_project
This is a tool specially made for Dipaso ecommerce website. You can extract data from there, analyze it and see keywords, brands, and categories frecuency, prices distribution and other market tendencies as well —all in a group of friendly stadistic tables and graphics (exported from a Jupyter notebook) :)
beautifulsoup4 data data-analysis jupyter-notebook pandas python3
Last synced: 28 Apr 2026
https://github.com/ahmad-ali-rafique/linear-regression-modeling
In-depth exploration of linear regression models, including data cleaning, model building, and performance evaluation on various datasets.
artificial-intelligence data dataanalytics linear-models linear-regression model multilinear-regression regression regression-models
Last synced: 19 Apr 2026
https://github.com/j-sephb-lt-n/plotly-dash-dashboard-template
A data dashboard template
dash data data-visualisation data-vizualization dataviz google-cloud google-cloud-platform plotly plotly-dash python responsive-design responsive-web-design
Last synced: 18 Jun 2025
https://github.com/tomcardoso/journalism-data-intersection
A talk on working at the intersection of journalism and data science
data data-journalism journalism
Last synced: 15 May 2025
https://github.com/jerboaburrow/uk-counties-and-unitary-authorities-may-2023-geojson
UK "Counties" Extracted from Office for National Statistics data
Last synced: 29 Mar 2025
https://github.com/thicclatka/tetration
New file format for tensors
cli data fileformat mmap tensors
Last synced: 26 May 2026
https://github.com/rishikesh-jadhav/track_deep_learning
Data collected from the Udacity simulator comprising RGB images with steering and throttle annotations for each frame, specifically gathered for behavioral cloning purposes.
data datacollection udacity-self-driving-car
Last synced: 03 Jan 2026
https://github.com/nadahamdy217/harvest-gaurd-plant-disease-detection-web-application
web application that help people grow healthy plants
classification-confidential cnn cnn-classification css data data-science detection html javascript keras machine-learning model plant-disease-detection supervised-learning tensorflow web-application
Last synced: 13 Apr 2026
https://github.com/naliferopoulos/datamining
Bring your own pickaxe.
aueb aueb-students data data-mining machine-learning machine-learning-algorithms mining random-forest
Last synced: 25 Jan 2026
https://github.com/gianlucatruda/titanic
An exhibition of my experience in data processing and visualisation. Python script to process and visualise the Titanic survivor data.
data database flask info matplotlib python science scrape server titanic visualisation web
Last synced: 10 Apr 2026
https://github.com/burythehammer/foosbot-results
Foosball results for the OpenCredo foosbot
data foosball machine-learning python
Last synced: 13 Apr 2026
https://github.com/pythoncoderunicorn/tool-discography
Music Band TOOL albums and songs dataset
data data-science metal-music music r songs
Last synced: 26 May 2026
https://github.com/open-geodata/sp_bh_pcj-2020-2035
Dados Espaciais da Agência das Bacias PCJ, com informações apresentadas no Plano de Bacias 2020-2035
Last synced: 16 Jan 2026
https://github.com/didier/frontend-data
Functional Programming subject of @CMDA-TT
convenience d3 d3-visualization d3js data datavis datavisualization dataviz front-end functional-programming interactive jsdoc node nodejs parking-spots svelte sveltejs
Last synced: 13 Apr 2026
https://github.com/ismailhakkii/digital_vault
This project can be used for securing data, similar to a real vault.
data digital security-data vault
Last synced: 25 Mar 2025
https://github.com/checco9811/data-engineering-bootcamp-homework
Homework solutions for DataExpert.io data engineering bootcamp
apache-spark data data-engineering sql
Last synced: 14 Mar 2025