data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum
- Last updated: 2026-07-03 00:07:49 UTC
https://github.com/tomwhite/misp-2017
MISP camp 2017 materials and code
bioinformatics data data-visualization hackathon
Last synced: 18 Apr 2026
https://github.com/eloyhere/semantic-java
Semantic-Java is a modern Java stream processing framework with zero dependencies, available as a Maven dependency. It blends the fluency of Java Streams, the laziness of JavaScript generators, and index-based control inspired by database indexing, and is aimed at time-series, event streams, and high-performance data pipelines.
data functional functional-programming java pipeline stream
Last synced: 07 Apr 2026
https://github.com/sweta-kaundilya/911-calls-capstone-project
For this capstone project we will be analyzing some 911 call data from Kaggle.
data data-analysis data-visualization jupyter-notebook matplotlib numpy pandas python seaborn
Last synced: 28 Apr 2026
https://github.com/ranjeetj06/insighthub
InsightHub is a data analytics project that helps automate the entire process of preparing, analyzing, and reporting on CSV data.
analysis begineer data springboot
Last synced: 17 May 2026
https://github.com/ellisvalentiner/legislation-embeddings
Embeddings for U.S. Congress legislation
data embeddings machine-learning nlp python
Last synced: 12 Aug 2025
https://github.com/krescruz/pegaso-data
Utilities for analyzing data from the Pegaso invoice certification provider (Proveedor de Certificación de Factura)
Last synced: 29 Apr 2026
https://github.com/sharmadhiraj/plot-pi
Graphical Representation of PI
data data-visualization html javascript js mathematics plot
Last synced: 28 Mar 2025
https://github.com/ngupta23/data_prep_helper
A helper package for preparing and combining data from a variety of sources
data data-science dataprep datapreparation dataprocessing helpers python
Last synced: 03 Apr 2025
https://github.com/amethyst-php/taxonomy
amethyst amethyst-package api data laravel taxonomy
Last synced: 18 Jan 2026
https://github.com/pulipulichen/pts-local-news-dataset
A dataset containing local news from Public Television Service.
Last synced: 27 Mar 2026
https://github.com/ciscorn/japanmesh-rs
A Rust library for handling Japanese Grid Square Code (JIS X 0410:2002 地域メッシュコード)
census data geospatial japan rust
Last synced: 11 Jan 2026
https://github.com/germanpaul12/automating-hacker-news-and-weather-mails
Project for my Raspberry Pi that emails me when it rains and sends me trending tech news
beautifulsoup beautifulsoup4 data hacker-news openweather-api raspberry-pi requests
Last synced: 05 May 2026
https://github.com/simranjeet97/kaggle_pokemon_datset_eda-dashboard
Full EDA and Dashboard of Kaggle Pokemon Dataset with Live Streaming Data and Images
cloud data data-science dataanalytics machine-learning machine-learning-algorithms pokemon pokemon-dataset pokemon-prediction python science
Last synced: 07 May 2026
https://github.com/ashishsingh789/data_visualization
Data visualization project using Python to analyze categorical and continuous variables. Includes bar charts, histograms, and scatter plots. Libraries used: pandas, matplotlib, and seaborn.
analysis barchart data data-science data-visualization histogram matplotlib pandas-dataframe scatter-plot seaborn
Last synced: 07 Sep 2025
https://github.com/aguven6/inmemory-data-processor
Converts tabular data to indexed columnar data. The aim is to process large datasets faster, especially in aggregation operations.
columnar-storage data data-structures parallel-computing parallel-programming processing
Last synced: 17 May 2026
https://github.com/talitalobo/statistics-with-python
Repo about statistical concepts and (not always) their python implementation.
data data-science machine-learning statistics
Last synced: 11 Jan 2026
https://github.com/pyrustic/litedao
Intuitive interaction with SQLite database
auto-init dao data database database-access library lightweight pyrustic python sql sqlite
Last synced: 09 May 2026
https://github.com/shivamsharma32/ipl-2022-analysis
The IPL 2022 Analysis project is a data-driven exploration of the Indian Premier League (IPL) 2022 cricket tournament. The analysis focuses on utilizing Python programming and various libraries to analyze and visualize the performance of teams, players, and key metrics in the IPL 2022 season.
data dataana dataanalytics datavi matplotlib python
Last synced: 17 May 2026
https://github.com/weecology/updating-data
Hugo website for instructions on how to make a regularly updating data pipeline
continuous-analysis continuous-integration data gh-actions living-data netlify travis-ci
Last synced: 17 Feb 2026
https://github.com/emna-chebbi/student-performance
Predictive model for student exam scores based on student performance factors
ai computer-vision data kaggle machine-learning ml mse regression regression-models
Last synced: 15 May 2026
https://github.com/jdtzmn/cricket
Send data over sound 🔊
cricket data send sound standardjs transfer typescript typescript-library
Last synced: 03 Jul 2026
https://github.com/amethyst-php/post
A comment, a note, a post, a pseudo-chat. Can be really anything
amethyst amethyst-package api data laravel post
Last synced: 17 May 2026
https://github.com/toofancodes/h1b-dashboard-insights
An interactive Tableau dashboard that visualizes H1B visa data from the USCIS Employer Data Hub, offering insights into application trends, top employers, and geographic distributions. Showcases advanced data visualization, analytics, and business intelligence skills.
analysis analytics business-intelligence dashboard data data-visualization h1b h1b-visa interactive-data tableau
Last synced: 20 Jan 2026
https://github.com/sanand0/marvel-powers
Scrapes Marvel Fandom for character powers
Last synced: 04 Jul 2026
https://github.com/amethyst-php/consume-rule
amethyst amethyst-package api consume-rule data laravel
Last synced: 19 May 2026
https://github.com/ahmad-ali-rafique/random-forest-classifier-modeling
Detailed exploration of random forest classifiers, including data cleaning, model building, and performance evaluation on various datasets.
classification classification-models data dataanalytics datamodel dataset model-checking models random-forest random-forest-classifier
Last synced: 01 Jun 2026
https://github.com/ericmaddox/nyc-crime-analytics
Analyzes and visualizes crime data from the NYC Police Department using interactive maps and heatmaps, leveraging the NYC Open Data API.
crime-analysis crimedata data datavisualization esri folium heatmap nycopendata python python3 rtcc
Last synced: 24 Jun 2025
https://github.com/ahmad-ali-rafique/random-forest-regressor-modeling
Detailed exploration of random forest regressors, including data cleaning, model building, and performance evaluation on various datasets.
data dataanalytics datacleaning evaluation-metrics modeling random-forest random-forest-regression regression regression-analysis
Last synced: 05 Mar 2025
https://github.com/ohspc89/better_call_jin
A repository containing mentoring materials for a Ph.D. student in Neuroscience
data matlab spss-statistics visualization visualization-tools wrangling-data
Last synced: 03 Jul 2026
https://github.com/ahmad-ali-rafique/electricity-consumption-analysis-household-dataset
This repository contains analysis and predictive modeling of household electricity consumption using Python. It includes data cleaning, exploratory data analysis (EDA), time series forecasting (ARIMA, SARIMA, LSTM), and model evaluation to optimize energy usage.
arima-forecasting artificial-intelligence artificial-neural-networks data data-science dataanalytics datacleaning evaluation-metrics exploratory-data-analysis long-short-term-memory lstmmodel modeling time-series timeseries-forecasting
Last synced: 23 Jun 2025
https://github.com/moscatellimarco/webscrap-imdb
🎬 Python scraper for IMDB: Extract movie/TV details for 📊 analysis & 🗃️ storage. Easy setup, 🔧 customizable, with 🖥️ CLI.
css data datascience html movies python scrapy scrapy-crawler scrapy-spider web web-scraping webdata webscraping
Last synced: 15 May 2026
https://github.com/yourdataarchitect/french-realestate-data-pipeline
This repository contains a fully automated data pipeline built with Apache Airflow to extract, clean, analyze, and report real estate listings from Seloger. It pushes data to MongoDB, Elasticsearch, and Google Sheets, with real-time Slack alerts for monitoring.
airlfow data datanalysis datapipeline market-intelligence real-estate
Last synced: 31 Dec 2025
https://github.com/coderooz/hr-dashboard
The goal of this project is to create a Power BI dashboard that showcases attrition data within the company.
Last synced: 07 Jan 2026
https://github.com/adadalshabab/machine-predictive-maintenance-classification
This repository hosts a machine predictive maintenance classification project that aims to predict the maintenance needs of industrial machinery before it fails. Using machine learning algorithms, the project seeks to improve operational efficiency and reduce downtime by identifying potential maintenance requirements proactively.
data data-science datanalysis datanalytics machine-learning machine-learning-algorithms matplotlib-pyplot pandas
Last synced: 17 May 2026
https://github.com/antoninpvr/battery-logger
Simple scripts to record data from my laptop battery
Last synced: 17 May 2026
https://github.com/basinghse/covid19simulator
Real Time Assessment and Simulation of COVID-19 - showing current numbers of cases, deaths and treated patients globally.
coronavirus covid-19 data real-time simulation visualisation visualisation-data-ingester
Last synced: 05 Apr 2025
https://github.com/pyrustic/jayson
Intuitive interaction with JSON files [DEPRECATED, check the project Shared]
Last synced: 17 May 2026
https://github.com/hidayathamir/telegram-group-data
Data for 1,865,827 messages from a Telegram group: text, identity, and datetime.
bahasa-indonesia data python3 scrape telegram telethon
Last synced: 17 May 2026
https://github.com/xenoverseup/data-structures
Data structures in every language I know.
cpp data data-science data-structures data-structures-and-algorithms doubly-linked-list linked-list
Last synced: 14 May 2026
https://github.com/fliplet/fliplet-widget-data-source-query
Data Source Query Provider
Last synced: 11 Apr 2025
https://github.com/wellingtonmwadali/alx-low_level_programming
ALX sprint one C programming
c data datastructures linked-list loops pointers-and-arrays string structures
Last synced: 04 Apr 2025
https://github.com/shysolocup/fndt
JavaScript package allowing you to see function data like body and arguments from outside of the function
aepl data fndt functions javascript javascript-tools js js-function js-functions lightweight nodejs nodejs-modules package stews
Last synced: 30 Apr 2026
https://github.com/saksham-jain177/data-analysis
A collection of data analysis and machine learning projects across various datasets. Explore predictive modeling, data visualization, and insights from real-world data. Projects include sales predictions, disease detection, customer segmentation, and more.
api data data-analysis data-cleaning data-science data-visualization datamodeling dataset datasets exploratory-data-analysis python python3 web-scraping youtube-api
Last synced: 01 May 2026
https://github.com/meta-llama/synthetic-data-kit
Tool for generating high-quality synthetic datasets
data generation llm python synthetic
Last synced: 08 May 2025
https://github.com/encoreshao/data-science
Data analysis examples using Jupyter Notebook and Python.
data dataanalysis encore jupyter-notebook
Last synced: 29 Mar 2025
https://github.com/andreabozzo/andreabozzo
My personal Repo!
analytics data data-engineering data-visualization database datamodelling developer-profile github-pages github-profile go interactive-animation open-data portfolio python readme-profile rust
Last synced: 17 May 2026
https://github.com/rsc-labs/see-open-data
Shows www.dane.gov.pl in a user-friendly format. Generates Flourish data or other data visualizations.
data data-visualization flourish government poland
Last synced: 04 Apr 2025
https://github.com/pulgamecanica/d3examples
https://www.oreilly.com/library/view/d3-for-the/9781492046783/
d3 d3-visualization d3js d3v4 data javascript
Last synced: 19 May 2026
https://github.com/kameronbrooks/datalys2-reporting
Datalys2 Reports allows you to create rich, interactive reports by simply defining a JSON configuration embedded in your HTML. It handles the layout, data visualization, and interactivity, so you don't need to write custom React code for every report.
data data-visualization html react
Last synced: 08 Apr 2026
https://github.com/mumtaz4118/employee-satisfaction-and-attrition
Analysis of attrition based on environmental satisfaction from a Kaggle dataset.
data data-analysis data-science data-visualization ipynb jupyter-notebook machine-learning python statistical-analysis statistical-models
Last synced: 19 May 2026
https://github.com/ericgio/history-of-jazz
Data and visualizations based on Ted Gioia's "The History of Jazz"
Last synced: 28 Mar 2025
https://github.com/robsteranium/user2022-ldf-talk
Slides from my useR! 2022 talk about the Linked-Data Frames package
data data-frame linked-data r rdf
Last synced: 19 Apr 2025
https://github.com/sumansuhag/wasserstoff-aiinterntask
Welcome to the AI Pipeline for Image Segmentation and Object Analysis project – a state-of-the-art solution designed to process, segment, identify, and analyze objects within images. This AI-powered pipeline is engineered to deliver precise insights by extracting, mapping, and summarizing data from each segmented object.
artificial-intelligence cdn data data-science modeling pipline
Last synced: 28 Mar 2025
https://github.com/shahules786/titanic-analysis
Various analyses of the Titanic accident (data from Kaggle)
Last synced: 26 Jun 2025
https://github.com/bala-1409/sales-forecasting-datascience-project
Develop a data science project using historical sales data to build a regression model that accurately predicts future sales. Preprocess the dataset, conduct exploratory analysis, select relevant features, and employ regression algorithms for model development. Evaluate model performance, optimize hyperparameters, and provide actionable insights.
data data-analysis data-science data-visualization datacleaning exploratory-data-analysis machine-learning-algorithms modelfitting prediction predictive-analytics predictive-modeling python3 regression-models salesforecast supervised-learning
Last synced: 26 Apr 2026
https://github.com/jigyasag18/financial-risk-analysis-project
The Credit Card Financial Risk Analysis Dashboard is a real-time Power BI tool designed to provide insights into credit card transactions and customer demographics. It features interactive visualizations, efficient data processing, and actionable insights to support decision-making. Using data from a SQL database, the dashboard tracks key metrics
data dataanalysis database datacleaning datapreprocessing dataprocessing datavisualization financial-analysis financialriskanalysis mysql powerbi sql statistical-analysis
Last synced: 06 Mar 2026
https://github.com/sumansuhag/prediction_model
This repository features a collection of Jupyter notebooks designed to showcase the practical applications of machine learning, data preprocessing, feature engineering, and recommendation systems. These notebooks enable users to explore, analyze, and predict business events.
algotithms artificial-intelligence data logistic-regression machine-learning-algorithms science sckiit-learn
Last synced: 28 Mar 2025
https://github.com/ditikrushna/enotes
🌻 Personal learning notes
coursera-data-science cousera data datascience machine machinelearning ml notes
Last synced: 07 Mar 2026
https://github.com/reubano/pyconza-tutorial
Jupyter notebooks and data for "Data Mining and Processing for fun and profit" PyConZA16 tutorial
data functional-programming jupyter-notebook meza pycon python tutorial
Last synced: 17 May 2026
https://github.com/bala-1409/loan-classification-data-science-projects
This project uses machine learning algorithms to predict the classification of loan status. The dataset is loaded and transformed with SQL to produce a clean dataset with valid information.
data data-analysis datacleaning datascience datavisualization exploratory-data-analysis loan machine-learning machine-learning-algorithms modelfitting sql supervised-learning visualization
Last synced: 22 Mar 2025
https://github.com/chompfoods/sdk-scala
Scala SDK for the Chomp Food & Recipe Database API. Use our API to get high-quality data on recipes and 875,000+ branded/grocery foods plus raw ingredients.
api branded chomp data database food grocery ingredients nutrition raw recipe-api recipes scala sdk
Last synced: 17 May 2026
https://github.com/itsmeyogesh22/solved-8-weeks-sql-challenge-correct-solutions
Part of the Serious SQL virtual apprenticeship program, this repository contains solutions for all eight case studies crafted by Danny Ma. For more information, visit: https://8weeksqlchallenge.com/
8weeksqlchallenge data dataanalytics datawithdanny postgresql sql sqlserver-2022 t-sql
Last synced: 07 Apr 2025
https://github.com/UznetDev/Smoking-Prediction
This project focuses on analyzing the "Smoking" dataset and building a predictive model for smoking status based on various health metrics. The goal is to identify factors influencing smoking behavior and develop a reliable model for prediction.
ai classification data data-science kaggle-competition machine-learning ml roc-auc sklearn smoking
Last synced: 28 Mar 2025
https://github.com/amarlearning/exploring-the-evolution-of-linux
Data analysis of the development of the Linux operating system, based on its Git repository history.
cleaning-data data data-analysis data-wrangling datacamp first-commit git-history linux
Last synced: 12 May 2026
https://github.com/gustavonav/youtubeextractorflask
Application for extracting and processing YouTube data.
data full-stack mysql pipelines python web
Last synced: 14 Jun 2025
https://github.com/sharoonjoseph321/social_media_eda
Data analysis of social media apps using pandas, Python, and matplotlib.
data data-analysis data-science data-visualization matplotlib programming-language project python pythonprojects
Last synced: 03 Mar 2025
https://github.com/eyluldursun/data-science-project
This project involves a data science analysis conducted on the Obesity Data Set. The study explores factors influencing obesity, includes data visualization, and develops predictive models. The goal of the project is to gain insights to help prevent obesity.
data data-science obesity r rmarkdown
Last synced: 26 Jun 2025
https://github.com/zshn1248/pyfilecrypto
PyFileCrypto is a Python module for easy encryption and decryption of files using the cryptography library. It provides a simple interface to generate encryption keys, encrypt files, and decrypt files securely.
data decryption encryption file security-tools
Last synced: 07 Apr 2026
https://github.com/deliprofesor/cardiac-data-analysis-exploring-cholesterol-and-heart-rate
This project analyzes a heart disease dataset to explore the relationship between cholesterol, heart rate, and chest pain type. It includes normality tests, outlier detection, correlation analysis, MANOVA, post-hoc tests, and VIF analysis, with visualizations using histograms, heatmaps, and boxplots.
correlation-analysis data data-cleaning data-visualization machine-learning manova post-hoc-analysis python tukey-hsd vif
Last synced: 17 May 2026
https://github.com/joseluisq/input-verifier
Some useful functions to check common data input.
Last synced: 19 Jul 2025
https://github.com/hackolade/yugabytedb-ysql
Hackolade (https://hackolade.com) plugin for the cloud-native Yugabyte database with YSQL API
data data-modeling entity-relationship-diagram schema-design ysql yugabyte yugabytedb
Last synced: 30 Apr 2025
https://github.com/saisurajmatta/data-warehousing-and-advanced-data-analytics
Data Analytics Project: Analyzed Promotions and Provided Tangible Insights to Sales Director
data data-analysis data-architecture data-flow-analysis data-modeling data-pipeline data-segmentation data-visualization data-warehousing docker etl etl-pipeline mssql sql tableau
Last synced: 17 May 2026
https://github.com/amethyst-php/sku
amethyst amethyst-package api data laravel sku
Last synced: 17 May 2026
https://github.com/jcloh98/rental-property-finder
A web scraper that helps users find rental properties by automatically gathering and organizing listings from various websites to discover available homes and apartments.
data headless-browser node scraper scraping web
Last synced: 17 May 2026
https://github.com/jhwa426/database
SQL, MSSQL, MongoDB Database
data data-warehouse data-wrangling database datamodeling entity-relationship-diagram normalization sql sqlite3 ssms
Last synced: 06 Apr 2025
https://github.com/merekat/hb-passiv-income
A calculator that uses historical data for various assets to estimate the passive income a user can expect, depending on their inputs.
assets data datajournalism etf passive-income treasury
Last synced: 19 Jul 2025
https://github.com/amethyst-php/opening-hour
amethyst amethyst-package api data laravel opening-hour
Last synced: 19 May 2026
https://github.com/vidya-vijay/vid2501
About me
analytics data data-science machinelearning python r spss sql statistics tableau visualization
Last synced: 19 Jul 2025
https://github.com/dimitryzub/allrecipes-us-recipes-by-state-analysis
Personal Data Exploratory Project in Python. Data extracted from AllRecipes.
data data-visualization dataexploration dataextraction matplotlib pandas python seaborn webscraping
Last synced: 10 May 2026
https://github.com/amethyst-php/warehouse
amethyst amethyst-package api data laravel warehosue
Last synced: 19 May 2026
https://github.com/akashlogics/street-data-tracking
Detects, tracks, and counts people walking across one or more paths using YOLO. This Python project tracks people moving across predefined street zones.
analysis data excel newdataset object-detection opencv python python3 yolo
Last synced: 19 May 2026
https://github.com/amethyst-php/recipe
amethyst amethyst-package api data laravel recipe
Last synced: 19 May 2026
https://github.com/potlock/data
Data research on other funding mechanisms and PotLock-related data.
data flipsidecrypto near-protocol potlock
Last synced: 07 Mar 2026
https://github.com/buildinamsterdam/contentful-graphql
Contentful GraphQL connection
Last synced: 05 Jan 2026
https://github.com/erkylima/algorithms
Python project to refresh knowledge on algorithms and data structures. Interactive examples of Bubble, Merge, Quick Sort, along with Lists, Stacks, Queues, and Trees. Challenges included. Recycle your expertise! 🚀 #Python #Algorithms #DataStructures
algorithms algorithms-and-data-structures data data-structures
Last synced: 19 Jan 2026
https://github.com/topunix/hackerrank
📗 HackerRank Solutions
algorithm-challenges algorithms algorithms-and-data-structures data data-structures hackerrank hackerrank-algorithms-solutions hackerrank-challenges hackerrank-python hackerrank-solutions python
Last synced: 17 May 2026