data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum
- Last updated: 2026-07-01 00:07:35 UTC
https://github.com/andygeiss/pipeline-example
This is a basic example of using a pipeline in data science.
data data-pipeline data-science example go golang iris-dataset pipeline protobuf
Last synced: 17 Jul 2025
https://github.com/jodus-melodus/queue
Simple Queue
data datastructures linear queue queues
Last synced: 10 Sep 2025
https://github.com/amethyst-php/value
amethyst amethyst-package api data laravel value
Last synced: 17 May 2026
https://github.com/topunix/hackerrank
📗 HackerRank Solutions
algorithm-challenges algorithms algorithms-and-data-structures data data-structures hackerrank hackerrank-algorithms-solutions hackerrank-challenges hackerrank-python hackerrank-solutions python
Last synced: 17 May 2026
https://github.com/potlock/data
Data research for other funding mechanisms and PotLock-related data.
data flipsidecrypto near-protocol potlock
Last synced: 07 Mar 2026
https://github.com/dimitryzub/allrecipes-us-recipes-by-state-analysis
Personal Data Exploratory Project in Python. Data extracted from AllRecipes.
data data-visualization dataexploration dataextraction matplotlib pandas python seaborn webscraping
Last synced: 10 May 2026
https://github.com/Vidya-Vijay/Vid2501
About me
analytics data data-science machinelearning python r spss sql statistics tableau visualization
Last synced: 19 Jul 2025
https://github.com/merekat/hb-passiv-income
A calculator that uses historical data for various assets to estimate the passive income a user can expect, depending on their inputs.
assets data datajournalism etf passive-income treasury
Last synced: 19 Jul 2025
https://github.com/jcloh98/rental-property-finder
A web scraper that helps users find rental properties by automatically gathering and organizing listings from various websites to discover available homes and apartments.
data headless-browser node scraper scraping web
Last synced: 17 May 2026
https://github.com/amethyst-php/sku
amethyst amethyst-package api data laravel sku
Last synced: 17 May 2026
https://github.com/saisurajmatta/data-warehousing-and-advanced-data-analytics
Data Analytics Project: Analyzed Promotions and Provided Tangible Insights to Sales Director
data data-analysis data-architecture data-flow-analysis data-modeling data-pipeline data-segmentation data-visualization data-warehousing docker etl etl-pipeline mssql sql tableau
Last synced: 17 May 2026
https://github.com/joseluisq/input-verifier
Some useful functions to check common data input.
Last synced: 19 Jul 2025
https://github.com/deliprofesor/cardiac-data-analysis-exploring-cholesterol-and-heart-rate
This project analyzes a heart disease dataset to explore the relationship between cholesterol, heart rate, and chest pain type. It includes normality tests, outlier detection, correlation analysis, MANOVA, post-hoc tests, and VIF analysis, with visualizations using histograms, heatmaps, and boxplots.
correlation-analysis data data-cleaning data-visualization machine-learning manova post-hoc-analysis python tukey-hsd vif
Last synced: 17 May 2026
https://github.com/zshn1248/pyfilecrypto
PyFileCrypto is a Python module for easy encryption and decryption of files using the cryptography library. It provides a simple interface to generate encryption keys, encrypt files, and decrypt files securely.
data decryption encryption file security-tools
Last synced: 07 Apr 2026
https://github.com/sharoonjoseph321/social_media_eda
Data analysis of social media apps using pandas, Python, and Matplotlib.
data data-analysis data-science data-visualization matplotlib programming-language project python pythonprojects
Last synced: 03 Mar 2025
https://github.com/UznetDev/Smoking-Prediction
This project focuses on analyzing the "Smoking" dataset and building a predictive model for smoking status based on various health metrics. The goal is to identify factors influencing smoking behavior and develop a reliable model for prediction.
ai classification data data-science kaggle-competition machine-learning ml roc-auc sklearn smoking
Last synced: 28 Mar 2025
https://github.com/chompfoods/sdk-scala
Scala SDK for the Chomp Food & Recipe Database API. Use our API to get high-quality data on recipes and 875,000+ branded/grocery foods plus raw ingredients.
api branded chomp data database food grocery ingredients nutrition raw recipe-api recipes scala sdk
Last synced: 17 May 2026
https://github.com/reubano/pyconza-tutorial
Jupyter notebooks and data for "Data Mining and Processing for fun and profit" PyConZA16 tutorial
data functional-programming jupyter-notebook meza pycon python tutorial
Last synced: 17 May 2026
https://github.com/ditikrushna/enotes
🌻 Personal learning notes
coursera-data-science cousera data datascience machine machinelearning ml notes
Last synced: 07 Mar 2026
https://github.com/sumansuhag/prediction_model
This repository features a collection of Jupyter notebooks designed to showcase the practical applications of machine learning, data preprocessing, feature engineering, and recommendation systems. These notebooks enable users to explore, analyze, and predict business events.
algotithms artificial-intelligence data logistic-regression machine-learning-algorithms science sckiit-learn
Last synced: 28 Mar 2025
https://github.com/sumansuhag/wasserstoff-aiinterntask
Welcome to the AI Pipeline for Image Segmentation and Object Analysis project – a state-of-the-art solution designed to process, segment, identify, and analyze objects within images. This AI-powered pipeline is engineered to deliver precise insights by extracting, mapping, and summarizing data from each segmented object.
artificial-intelligence cdn data data-science modeling pipline
Last synced: 28 Mar 2025
https://github.com/robsteranium/user2022-ldf-talk
Slides from my useR! 2022 talk about the Linked-Data Frames package
data data-frame linked-data r rdf
Last synced: 19 Apr 2025
https://github.com/ericgio/history-of-jazz
Data and visualizations based on Ted Gioia's "The History of Jazz"
Last synced: 28 Mar 2025
https://github.com/andreabozzo/andreabozzo
My personal Repo!
analytics data data-engineering data-visualization database datamodelling developer-profile github-pages github-profile go interactive-animation open-data portfolio python readme-profile rust
Last synced: 17 May 2026
https://github.com/meta-llama/synthetic-data-kit
Tool for generating high quality Synthetic datasets
data generation llm python synthetic
Last synced: 08 May 2025
https://github.com/saksham-jain177/data-analysis
A collection of data analysis and machine learning projects across various datasets. Explore predictive modeling, data visualization, and insights from real-world data. Projects include sales predictions, disease detection, customer segmentation, and more.
api data data-analysis data-cleaning data-science data-visualization datamodeling dataset datasets exploratory-data-analysis python python3 web-scraping youtube-api
Last synced: 01 May 2026
https://github.com/wellingtonmwadali/alx-low_level_programming
ALX sprint one C programming
c data datastructures linked-list loops pointers-and-arrays string structures
Last synced: 04 Apr 2025
https://github.com/hidayathamir/telegram-group-data
1,865,827 messages from a Telegram group: text, identity, datetime.
bahasa-indonesia data python3 scrape telegram telethon
Last synced: 17 May 2026
https://github.com/basinghse/covid19simulator
Real Time Assessment and Simulation of COVID-19 - showing current numbers of cases, deaths and treated patients globally.
coronavirus covid-19 data real-time simulation visualisation visualisation-data-ingester
Last synced: 05 Apr 2025
https://github.com/antoninpvr/battery-logger
Simple scripts to record data from my laptop battery
Last synced: 17 May 2026
https://github.com/adadalshabab/machine-predictive-maintenance-classification
This repository hosts a machine predictive maintenance classification project, aimed at predicting the maintenance needs of industrial machinery before they fail. By leveraging machine learning algorithms, this project seeks to enhance operational efficiency and reduce downtime by identifying potential maintenance requirements proactively.
data data-science datanalysis datanalytics machine-learning machine-learning-algorithms matplotlib-pyplot pandas
Last synced: 17 May 2026
https://github.com/ericmaddox/nyc-crime-analytics
Analyzes and visualizes crime data from the NYC Police Department using interactive maps and heatmaps, leveraging the NYC Open Data API.
crime-analysis crimedata data datavisualization esri folium heatmap nycopendata python python3 rtcc
Last synced: 24 Jun 2025
https://github.com/toofancodes/h1b-dashboard-insights
An interactive Tableau dashboard that visualizes H1B visa data from the USCIS Employer Data Hub, offering insights into application trends, top employers, and geographic distributions. Showcases advanced data visualization, analytics, and business intelligence skills.
analysis analytics business-intelligence dashboard data data-visualization h1b h1b-visa interactive-data tableau
Last synced: 20 Jan 2026
https://github.com/amethyst-php/post
A comment, a note, a post, a pseudo-chat. Can be really anything
amethyst amethyst-package api data laravel post
Last synced: 17 May 2026
https://github.com/weecology/updating-data
Hugo website for instructions on how to make a regularly updating data pipeline
continuous-analysis continuous-integration data gh-actions living-data netlify travis-ci
Last synced: 17 Feb 2026
https://github.com/shivamsharma32/ipl-2022-analysis
The IPL 2022 Analysis project is a data-driven exploration of the Indian Premier League (IPL) 2022 cricket tournament. The analysis focuses on utilizing Python programming and various libraries to analyze and visualize the performance of teams, players, and key metrics in the IPL 2022 season.
data dataana dataanalytics datavi matplotlib python
Last synced: 17 May 2026
https://github.com/aguven6/inmemory-data-processor
Converts tabular data to indexed columnar data. The aim is to process large datasets faster, especially in aggregation operations.
columnar-storage data data-structures parallel-computing parallel-programming processing
Last synced: 17 May 2026
https://github.com/simranjeet97/kaggle_pokemon_datset_eda-dashboard
Full EDA and Dashboard of Kaggle Pokemon Dataset with Live Streaming Data and Images
cloud data data-science dataanalytics machine-learning machine-learning-algorithms pokemon pokemon-dataset pokemon-prediction python science
Last synced: 07 May 2026
https://github.com/ciscorn/japanmesh-rs
A Rust library for handling Japanese Grid Square Code (JIS X 0410:2002 地域メッシュコード)
census data geospatial japan rust
Last synced: 11 Jan 2026
https://github.com/pulipulichen/pts-local-news-dataset
A dataset containing local news from Public Television Service.
Last synced: 27 Mar 2026
https://github.com/amethyst-php/taxonomy
amethyst amethyst-package api data laravel taxonomy
Last synced: 18 Jan 2026
https://github.com/sharmadhiraj/plot-pi
Graphical Representation of PI
data data-visualization html javascript js mathematics plot
Last synced: 28 Mar 2025
https://github.com/ellisvalentiner/legislation-embeddings
Embeddings for U.S. Congress legislation
data embeddings machine-learning nlp python
Last synced: 12 Aug 2025
https://github.com/ranjeetj06/insighthub
InsightHub is a data analytics project that helps automate the entire process of preparing, analyzing, and reporting on CSV data.
analysis begineer data springboot
Last synced: 17 May 2026
https://github.com/eloyhere/semantic-java
Semantic-Java is a modern Java stream processing framework with zero dependencies, available as a Maven dependency. It elegantly blends the fluency of Java Streams, the laziness of JavaScript generators, and intelligent index-based control inspired by database indexing, making it well suited to time-series, event streams, and high-performance data pipelines.
data functional functional-programming java pipeline stream
Last synced: 07 Apr 2026
https://github.com/eslamdyab21/apara-data-gui
Custom application for Apara's data wrangling scripts. Technologies used are Qt Designer and PyQt5 for the GUI, and pandas and NumPy for the data work.
csv data data-analysis data-wrangling gui pandas pyqt5-desktop-application qt5-gui
Last synced: 17 May 2026
https://github.com/greatwoman23/sentiment-analysis-on-amazon-products-review
Sentiment Analysis on Amazon Product Review
analysis dashboard-application data data-science datascientistproject machine-learning publication python remotejob
Last synced: 17 May 2026
https://github.com/huspacy/huspacy-resources
Resources for building and evaluating huspacy
Last synced: 21 Mar 2025
https://github.com/amethyst-php/office
amethyst amethyst-package api data laravel office
Last synced: 17 May 2026
https://github.com/octoenergy/tentaclio-gdrive
A python project containing all the dependencies for the gdrive tentaclio schema
Last synced: 24 Jun 2025
https://github.com/octoenergy/tentaclio-databricks
Module to give tentaclio support to databricks
Last synced: 24 Jun 2025
https://github.com/octoenergy/tentaclio-s3
A python project containing all the dependencies for s3 tentaclio schema.
Last synced: 24 Jun 2025
https://github.com/octoenergy/tentaclio-athena
A python project containing all the dependencies for awsathena+rest tentaclio schema.
Last synced: 24 Jun 2025
https://github.com/octoenergy/tentaclio-postgres
A python project containing all the dependencies for postgresql tentaclio schema.
Last synced: 24 Jun 2025
https://github.com/octoenergy/tentaclio-gs
A python project containing all the dependencies for gs tentaclio schema.
Last synced: 24 Jun 2025
https://github.com/kaizadp/bbwm_moisture
HOBO data for soil moisture - Bear Brook Watershed in Maine
Last synced: 17 May 2026
https://github.com/wilcotomassen/lorem-datum-core
Java based data generator for data simulation
data dataset generator java lorem-ipsum simulated-data
Last synced: 11 Jan 2026
https://github.com/maximkrouk/storage
Lightweight framework for storing data (beta)
cache data keychain memmory storage swift swift5-1 userdefaults
Last synced: 30 Oct 2025
https://github.com/ahmad-ali-rafique/logistic-regression-modeling
An in-depth exploration of logistic regression models, including data cleaning, model building, and performance evaluation on various datasets.
accuracy confusion-matrix data dataanalytics logistic-regression logistic-regression-classifier machine-learning-algorithms mlmodels model modelling regression-models
Last synced: 11 Sep 2025
https://github.com/lukaszkn/data-software-engineering-interview-questions
Data and Software engineering interview questions
data engineering interview-questions python
Last synced: 20 Jul 2025
https://github.com/mightymetrika/scdtb
Single Case Design Toolbox
data math r science statistics
Last synced: 04 Jan 2026
https://github.com/ramtinsoltani/safe-cli
A simple Command-line Interface which encrypts and decrypts UTF-8 files using AES-256.
aes-256 cli data data-hook decryption encryption generator handlebars hooks markup partial partial-decryption password safe swap temp temporary tool
Last synced: 16 Apr 2026
https://github.com/pawlo77/messenger-analyser
Repo for Data Visualization project, part of IAD study program at Faculty of Mathematics and Information Science, Warsaw University of Technology
Last synced: 17 May 2026
https://github.com/newrelic-experimental/newrelic-java-aws-kinesis
Provides instrumentation of the Amazon Kinesis Client and Producer
amazon aws client data instrumentation java kinesis nrlabs nrlabs-data nrlabs-odp observability-data producer
Last synced: 15 May 2026
https://github.com/nathanieliskandar26/data-analysis-project
This project demonstrates my ability to clean and analyze data using Python and SQL so far. The dataset used for this analysis focuses on general customer information. Through this project, I aimed to uncover meaningful insights and trends by cleaning the data and performing structured queries.
analysis data data-cleaning jupyter-notebook mysql mysql-database python
Last synced: 19 Apr 2026
https://github.com/apparaomulpuri/readline
Explains the usage of the readLine function in Swift.
data fromkeyboard keyboard reading readline swift
Last synced: 29 Mar 2025
https://github.com/vin20777/drone-data-layer
Drone Project Data Layer
csharp data drone layer software-design
Last synced: 18 May 2026
https://github.com/pedelriomarron/spanish-api-covid19
COVID-19 data for Spain (by Datadista) as a service
api covid-19 covid-19-spain data now spain zeit
Last synced: 12 Mar 2025
https://github.com/solrikk/vargen
VarGen (Variation Generator) is a user-friendly desktop application designed to simplify the creation of product variations from CSV files.
csv-files csv-format csv-parser data data-engineering excel excelparser python
Last synced: 29 Mar 2025
https://github.com/a-poor/taro
A package for repeatable rectangular data transformations in Python.
data data-science data-transformation pipeline pypi-package python
Last synced: 13 Oct 2025
https://github.com/yash22222/olympic-games-analytics-using-apache-spark
The "Olympic Games Analytics Using Apache Spark Databricks" project explores data from the Olympic Games (1896-2016) to identify trends and insights. Using Apache Spark for big data processing and Databricks for visualization, the project analyzes key factors like top-performing countries and athlete attributes, showcasing real-world analytics.
apache apache-kafka apache-spark big-data-analytics csv data data-analytics data-visualization databricks excel mysql olympics regions
Last synced: 03 May 2026
https://github.com/hallmx/mx_utils
Utility scripts for software development in data science
colaboratory data development nbdev python science scripts software utlities
Last synced: 19 May 2026
https://github.com/gsmith257-cyber/BIT3434CVE
BIT3434 Project on data mining CVEs and Exploits
cve data data-mining exploits research-project
Last synced: 10 Mar 2025
https://github.com/simranjeet97/covid-19
COVID-19 data analysis, covering important topics to understand the impact and possible solutions.
coronavirus coronavirus-analysis coronavirus-dataset coronavirus-prediction coronavirus-tracking covid-19-data-analysis covid19 covid19-data covid19-india dash dash-app dash-plotly data data-analysis data-science data-science-projects data-visualization python3
Last synced: 18 May 2026
https://github.com/shvbsle/image-augmentation
A lightweight CLI for augmenting image datasets for deep learning and ML projects
augmentation data data-augmentation data-augmentation-strategies data-augmentor data-augumentation data-science dataset deep-learning image-processing
Last synced: 12 Sep 2025
https://github.com/pratik-codes/zomato_data_eda
Cleaned and analysed messy data and created a predictive model with an accuracy of 93% using a tree regressor algorithm.
bengaluru data data-cleaning data-science famous-restaurants restaurants-delivering-online restraunts
Last synced: 27 Mar 2025
https://github.com/encelo/nctracer-data
Data files for the ncTracer project
Last synced: 15 Jan 2026
https://github.com/annaanastasy/mushroom-binary-classification-eda-ml
Explored and modeled a competition dataset of mushroom species, focusing on data cleaning, exploratory data analysis, and building machine learning models for accurate classification of edible and poisonous mushrooms.
binary-classification data data-cleaning-and-preprocessing data-science exploratory-data-analysis machine-learning-algorithms xgboost-classifier
Last synced: 29 Mar 2025
https://github.com/kammarah/studentdata
I created & deployed a Streamlit app to store, manage & analyze student data. 📊🎓
connection data data-analysis data-visualization deploy deployments libraries python streamlit streamlit-webapp webapp
Last synced: 18 May 2026
https://github.com/mkshah605/personal-brand-development
A data-driven approach to a personal brand development project.
branding data data-science growth music personal
Last synced: 12 Sep 2025
https://github.com/yugoff/ml-kaggle-regression-with-a-mohs-hardness-dataset
Your Goal: For this Episode of the Series, your task is to use regression to predict the Mohs hardness of a mineral, given its properties
data gradient-boosting kaggle kaggle-competition regression-models
Last synced: 18 May 2026
https://github.com/ahadly/sql-data-analytics-project
This repository contains a collection of SQL scripts demonstrating various analytical techniques, such as change-over-time, cumulative, performance, data segmentation, and part-to-whole analysis.
analytics business-analytics business-intelligence data data-analysis data-analyst data-analytics data-engineering data-science data-scientist database datascience query reporting sql sql-queries sql-query sql-server window-functions window-functions-in-sql
Last synced: 18 May 2026
https://github.com/byndyusoft/byndyusoft.data.relational.specifications
byndyusoft data relational specifications
Last synced: 12 Sep 2025
https://github.com/jigyasag18/ibm-power-bi-dashboard-project
IBM Power BI Dashboard Project is a data-driven analysis of employees using IBM's comprehensive dataset, providing insights into key factors contributing to employee turnover and enabling organizations to strategize effectively towards improved employee retention and satisfaction.
data data-visualization dataanalysis dataanalytics dataset datavisualisation datavisualization-project powerbi powerbi-dashboards powerbi-report powerbi-visuals powerbidashboard
Last synced: 07 Mar 2026
https://github.com/juniorreisx/movelo-logstica
Movelo is a lightweight logistics simulator built with TypeScript that provides mock order and delivery data for developing and testing UIs, dashboards, and backend features without external APIs.
data hooks lucide-react react tailwindcss typescript
Last synced: 12 Apr 2025
https://github.com/e22m4u/ts-projection
A module for working with data projection in TypeScript
Last synced: 12 Apr 2025
https://github.com/eryks1999/data-collection-project_python
This project allowed me to practice classes, populating JSON files, and extracting data.
Last synced: 16 Apr 2026
https://github.com/styd/sd_struct
Searchable Deep Struct
activesupport data gem openstruct rails ruby structure
Last synced: 18 May 2026