data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum,
- Last updated: 2026-06-23 00:07:41 UTC
- JSON Representation
https://github.com/miraclx/split-merge
Efficient, flexible data stream chunker and merger
chunk data efficient merge middleware nodejs pipeline split stream
Last synced: 07 May 2026
https://github.com/amethyst-php/user
amethyst amethyst-package api data laravel user
Last synced: 12 Apr 2026
https://github.com/gui-sitton/prepaid
In this project I work as an analyst for the telecommunications company Megaline. The company offers its customers prepaid plans, Surf and Ultimate. The sales department wants to know which plans bring in the most revenue in order to adjust the advertising budget
data data-analysis data-analysis-python data-science data-visualization python
Last synced: 22 May 2026
https://github.com/nsandoya/python_scrp_project
This is a tool specially made for Dipaso ecommerce website. You can extract data from there, analyze it and see keywords, brands, and categories frecuency, prices distribution and other market tendencies as well —all in a group of friendly stadistic tables and graphics (exported from a Jupyter notebook) :)
beautifulsoup4 data data-analysis jupyter-notebook pandas python3
Last synced: 28 Apr 2026
https://github.com/ohspc89/better_call_jin
A repository containing mentoring materials for a Ph.D. student in Neuroscience
data matlab spss-statistics visualization visualization-tools wrangling-data
Last synced: 08 Oct 2025
https://github.com/huspacy/huspacy-resources
Resources for building and evaluating huspacy
Last synced: 21 Mar 2025
https://github.com/pulipulichen/pts-local-news-dataset
A dataset containing local news from Public Television Service.
Last synced: 27 Mar 2026
https://github.com/rahulthedevil/metric-converter
A simple utility package for converting between metric units such as meters, kilometers, grams, kilograms, liters, and more. Simple and powerful way for Units Convert solution
convert converter data fraction imperial length mass measurements metric metrics ratio system temperature unit unit-conversion unit-converter units uom utilities weight
Last synced: 08 Oct 2025
https://github.com/simranjeet97/kaggle_pokemon_datset_eda-dashboard
Full EDA and Dashboard of Kaggle Pokemon Dataset with Live Streaming Data and Images
cloud data data-science dataanalytics machine-learning machine-learning-algorithms pokemon pokemon-dataset pokemon-prediction python science
Last synced: 07 May 2026
https://github.com/aguven6/inmemory-data-processor
Convert tabular data to columnar data with index. Aim is to process huge data quicker especially in aggregation operation
columnar-storage data data-structures parallel-computing parallel-programming processing
Last synced: 17 May 2026
https://github.com/weecology/updating-data
Hugo website for instructions on how to make a regularly updating data pipeline
continuous-analysis continuous-integration data gh-actions living-data netlify travis-ci
Last synced: 17 Feb 2026
https://github.com/rameshaditya/dynamic-hybrid-data-grid
Facilitates faster read-and-write of large ordered collections of data.
algorithms data data-structures storage
Last synced: 23 Feb 2025
https://github.com/jacob-pitsenberger/python-electronics-inventory-management-system-object-oriented-programming-project
Welcome to the Python Electronics Inventory Management System project repository! This project is a demonstration of Object-Oriented Programming (OOP) principles in Python for managing an electronic parts inventory.
data data-structures dictionary exception-handling file-io filesystem input-output inventory-management-system management-system modules oop pickle python user-interface
Last synced: 08 Oct 2025
https://github.com/rishikesh-jadhav/track_deep_learning
Data collected from the Udacity simulator comprising RGB images with steering and throttle annotations for each frame, specifically gathered for behavioral cloning purposes.
data datacollection udacity-self-driving-car
Last synced: 03 Jan 2026
https://github.com/toofancodes/h1b-dashboard-insights
An interactive Tableau dashboard that visualizes H1B visa data from the USCIS Employer Data Hub, offering insights into application trends, top employers, and geographic distributions. Showcases advanced data visualization, analytics, and business intelligence skills.
analysis analytics business-intelligence dashboard data data-visualization h1b h1b-visa interactive-data tableau
Last synced: 20 Jan 2026
https://github.com/naliferopoulos/datamining
Bring your own pickaxe.
aueb aueb-students data data-mining machine-learning machine-learning-algorithms mining random-forest
Last synced: 25 Jan 2026
https://github.com/burythehammer/foosbot-results
Foosball results for the OpenCredo foosbot
data foosball machine-learning python
Last synced: 13 Apr 2026
https://github.com/ericmaddox/nyc-crime-analytics
Analyzes and visualizes crime data from the NYC Police Department using interactive maps and heatmaps, leveraging the NYC Open Data API.
crime-analysis crimedata data datavisualization esri folium heatmap nycopendata python python3 rtcc
Last synced: 24 Jun 2025
https://github.com/adadalshabab/machine-predictive-maintenance-classification
This repository hosts a machine predictive maintenance classification project, aimed at predicting the maintenance needs of industrial machinery before they fail. By leveraging machine learning algorithms, this project seeks to enhance operational efficiency and reduce downtime by identifying potential maintenance requirements proactively.
data data-science datanalysis datanalytics machine-learning machine-learning-algorithms matplotlib-pyplot pandas
Last synced: 17 May 2026
https://github.com/basinghse/covid19simulator
Real Time Assessment and Simulation of COVID-19 - showing current numbers of cases, deaths and treated patients globally.
coronavirus covid-19 data real-time simulation visualisation visualisation-data-ingester
Last synced: 05 Apr 2025
https://github.com/saksham-jain177/data-analysis
A collection of data analysis and machine learning projects across various datasets. Explore predictive modeling, data visualization, and insights from real-world data. Projects include sales predictions, disease detection, customer segmentation, and more.
api data data-analysis data-cleaning data-science data-visualization datamodeling dataset datasets exploratory-data-analysis python python3 web-scraping youtube-api
Last synced: 01 May 2026
https://github.com/stefanpietrusky/factsv2
Repository for the article in the online magazine TDS.
ai arxiv-papers beautifulsoup data flask-application gensim llama matplotlib ollama plotly pyldavis python selenium webdriver
Last synced: 09 Apr 2025
https://github.com/ericgio/history-of-jazz
Data and visualizations based on Ted Gioia's "The History of Jazz"
Last synced: 28 Mar 2025
https://github.com/domarps/grad-project-reports
Write-ups of a few key semester-long projects I have worked during my Masters
circuit data deeplearning graph-algorithms matlab question-answering
Last synced: 26 Mar 2025
https://github.com/maximiliancw/completely
Measure your data completeness
data data-cleaning data-quality data-science missing-data
Last synced: 25 Jun 2025
https://github.com/jor-/measurements
Python functions to handle, statistically analyze and plot measurement data.
Last synced: 17 Mar 2025
https://github.com/sumansuhag/prediction_model
This repository features a collection of Jupyter notebooks designed to showcase the practical applications of machine learning, data preprocessing, feature engineering, and recommendation systems. These notebooks enable users to explore, analyze, and predict business events.
algotithms artificial-intelligence data logistic-regression machine-learning-algorithms science sckiit-learn
Last synced: 28 Mar 2025
https://github.com/etmendz/mendz.data.oracle
Provides a generic Mendz.Data-aware context for ADO.Net-compatible access to Oracle databases.
ado-net context data database datasettings mendz oracle
Last synced: 13 Apr 2026
https://github.com/sakan811/honkai-star-rail-characters-damage-simulation
Honkai Star Rail Characters' Damage Simulation
data data-science data-visualization honkai honkai-star-rail honkai-starrail powerbi powerbi-visuals python sqlite
Last synced: 23 Feb 2025
https://github.com/mwiatrzyk/modelity
Data parsing and validation library for Python
data library model parsing python tool validation
Last synced: 18 Jan 2026
https://github.com/unknownsoup/budget_tracker
A personal budget tracker to build my knowledge of working with databases and data analysis. In this case using SQL and python for the analysis.
data data-science databases python sql
Last synced: 26 Jan 2026
https://github.com/sharoonjoseph321/social_media_eda
Data Analysis on social media apps ,using pandas, python, matplotlib.
data data-analysis data-science data-visualization matplotlib programming-language project python pythonprojects
Last synced: 03 Mar 2025
https://github.com/deliprofesor/cardiac-data-analysis-exploring-cholesterol-and-heart-rate
This project analyzes a heart disease dataset to explore the relationship between cholesterol, heart rate, and chest pain type. It includes normality tests, outlier detection, correlation analysis, MANOVA, post-hoc tests, and VIF analysis, with visualizations using histograms, heatmaps, and boxplots.
correlation-analysis data data-cleaning data-visualization machine-learning manova post-hoc-analysis python tukey-hsd vif
Last synced: 17 May 2026
https://github.com/jcloh98/rental-property-finder
A web scraper that helps users find rental properties by automatically gathering and organizing listings from various websites to discover available homes and apartments.
data headless-browser node scraper scraping web
Last synced: 17 May 2026
https://github.com/dimitryzub/allrecipes-us-recipes-by-state-analysis
Personal Data Exploratory Project in Python. Data extracted from AllRecipes.
data data-visualization dataexploration dataextraction matplotlib pandas python seaborn webscraping
Last synced: 10 May 2026
https://github.com/abdiasarsene/edusight-data-driven-insights-for-smarter-education
EduSight transforms educational data into actionable insights, helping NGOs, schools, and policymakers improve academic performance, optimize resources, and evaluate learning programs for better outcomes.
Last synced: 26 Jan 2026
https://github.com/rahul1582/bank-loan-classification
Classifying whether a person is taking personal loan or not using all the Classification Algorithms.
algorithm analysis classi data
Last synced: 08 Oct 2025
https://github.com/topunix/hackerrank
:green_book: HackerRank Solutions
algorithm-challenges algorithms algorithms-and-data-structures data data-structures hackerrank hackerrank-algorithms-solutions hackerrank-challenges hackerrank-python hackerrank-solutions python
Last synced: 17 May 2026
https://github.com/birjemin/wxgameod
wxgame 开放数据 weixin 微信小游戏 关系链数据
data interactive-data relation user-storage
Last synced: 16 Jul 2025
https://github.com/rorylshanks/devdb-client
This is the repository for the official command line client for DevDB (https://devdb.cloud)
cloud data database-management development
Last synced: 29 May 2026
https://github.com/codegouvfr/codegouvfr-sources
🧢 Static web frontend for code.gouv.fr
bluehats codegouvfr data frontend
Last synced: 28 Feb 2025
https://github.com/priyanshubiswas-tech/farmlab-report-and-case-study-iot
This project was developed through live interviews and case studies with farmers in the year 2023 to address key agricultural challenges. The device provides real-time farm insights for better decision-making. Future plans include a digital portal, increased range, more sensors, and improved design. Open to collaboration!
arduino-ide c case case-study data data-analysis iot iot-device serialization
Last synced: 15 Jul 2025
https://github.com/allanotieno254/spss-nutrition-research
This repository contains the results of statistical analyses performed in IBM SPSS Statistics on a child nutrition dataset.
data data-preprocessing dataanalysis spss
Last synced: 17 Feb 2026
https://github.com/uzinfocom-org/archive
📦 | Archived projects that aren't used anymore
archive archive-data data notused
Last synced: 01 Sep 2025
https://github.com/amethyst-php/setting
Give the user the ability to configure his own settings
amethyst amethyst-package api data laravel setting
Last synced: 19 May 2026
https://github.com/skygenesisenterprise/aether-calendar
Aether Calendar is a lightweight, open-source client built for privacy, speed, and seamless integration within the Aether Office ecosystem
applications calendar capacitorjs data javascript linux macos nextjs typescript windows
Last synced: 12 Apr 2026
https://github.com/fintech-lsi/fintech-credit-risk-prediction
This repository provides a machine learning model for predicting credit risk in the financial sector. The model uses borrower information, such as age, income, employment length, loan amount, and credit history, to assess the likelihood of loan repayment or default.
data fintech machine-learning model prediction risk
Last synced: 12 Oct 2025
https://github.com/amethyst-php/price
Define prices and attach them to any model
amethyst amethyst-package api data laravel price
Last synced: 17 May 2026
https://github.com/thetacom/byteclasses
A Python package to manage and interact with binary data in a simple and structured manner.
binary-data bytes data dataclasses package python python3
Last synced: 11 Jul 2025
https://github.com/uttori/uttori-data-tools
Tools for working with binary data.
Last synced: 17 Feb 2026
https://github.com/karaniwachira/baby_names_analysis
Data Analysis: Baby Names Exploration
data data-analysis quarto quartopub r rstats tidyverse-ggplot2
Last synced: 22 Jun 2025
https://github.com/shubhamsoni98/classification-with-random-forest-1
To classify sales into categories (Low, Moderate, High) using Random Forests to inform strategic decisions and optimize marketing strategies.
algorithms anaconda data data-science datacleaning eda jupyter-notebook machine-learning pyhton random-forest scikit-learn visualization
Last synced: 18 Jan 2026
https://github.com/austinv11/pypeline
A simple data pipeline builder for Python 3+
data leveldb pypeline python python3 stream-processing
Last synced: 20 Aug 2025
https://github.com/iqbalmind/learn-python-data-scientist
IqbalMind Playground for python data scientist
data data-analysis data-visualization datascience datascientist datascientisttraining python python-playground
Last synced: 16 Mar 2025
https://github.com/denisecase/cintel-04-reactive
Interactive analytics, reactive app built with Shiny for Python
analytics bokeh data flights interactive mtcars penguins python relationships shiny
Last synced: 20 Jun 2025
https://github.com/shubhamsoni98/survey-data-analysis
Surey Data Analysis
analysis dashboards data data-mining data-visualization dataanalysis datacleaning datascience datasets insights pivot-tables pivotanalysis
Last synced: 07 Mar 2026
https://github.com/the-tech-idea/beep.winform.sample
Application for Managing your Different DataSources . Still in Alpha.please be patient
application data data-science database dataset integeration mysql nosql oracle postgres sqlite sqlserver workflow-engine workflows
Last synced: 08 Jul 2025
https://github.com/ramonrsv/f1_data
Provides consolidated access to various sources of Formula 1 information and data, including event schedules, session results, timing and telemetry data, as well as historical information about drivers, constructors, circuits, etc.
Last synced: 07 Apr 2026
https://github.com/sakshamarora07/blinkit-sales-report-power-bi
This dashboard provides Blinkit with insights to optimize its grocery delivery operations and understand customer preferences. It evaluates sales trends, outlet performance, and item categories to identify key areas for improvement. The interactive visuals allow detailed exploration of sales distribution, customer ratings, and product popularity.
data data-science dataanalytics datavisualization excel powerbi sql
Last synced: 08 Jan 2026
https://github.com/farhashaad/farhashaad98
This is a repository to showcase my skills, share projects and track my progress in Data Science related projects.
data data-visualization dataanalysis matplotlib pandas python seaborn sql tableau
Last synced: 24 Apr 2026
https://github.com/amethyst-php/data-view
amethyst amethyst-package api data data-view laravel
Last synced: 19 May 2026
https://github.com/stdlib-js/array-base-assert-any-has-property
Test whether at least one element in a provided array has a specified property, either own or inherited.
any array assert data generic has javascript node node-js nodejs prop property stdlib structure test types validate
Last synced: 07 May 2025
https://github.com/webianks/anotech-android
Android application which deals on various anomalous behaviour that occur on server data.
Last synced: 13 Apr 2025
https://github.com/youmenomi/hydreigon
Are you looking for a Hydreigon to classify data for you? Come and catch it!
classify data hydreigon indexer items management pokemon sortable structure typescript
Last synced: 07 May 2025
https://github.com/nadahamdy217/Harvest-Gaurd-Plant-Disease-Detection-Web-Application
web application that help people grow healthy plants
classification-confidential cnn cnn-classification css data data-science detection html javascript keras machine-learning model plant-disease-detection supervised-learning tensorflow web-application
Last synced: 12 Apr 2025
https://github.com/yvandana/pwc-power-bi-job-simulation
Projects pursued during my Job Simulation
dashboard data dataanalysis powerbi pwc-forage-switzerland
Last synced: 06 Mar 2026
https://github.com/webobite/fact-chatbot
A Fact chatbot is a project in which it read a txt file which consist all facts ahead of time and answer the user with some useful information regarding the same on the basis of facts provided in text file.
chatbot chatgpt chatgpt3 data data-visualization embedding-vectors generativeai nlp
Last synced: 04 May 2026
https://github.com/1sumer/mass-mail-automation
Mass Emailer is a Python-based application designed to send bulk emails efficiently using an SMTP server. Leveraging the power of the Tkinter library for the graphical user interface (GUI), this tool provides a user-friendly platform for managing and dispatching large volumes of emails with ease.
data oops-in-python python smtp-server tkinter
Last synced: 20 Aug 2025
https://github.com/zeptosec/bpscrapper
Shows history of oil prices
data data-visualization database nodejs scraper
Last synced: 13 Apr 2026
https://github.com/mukhlishga/data-engineering
all about data engineering
airflow beam data data-engineering pyspark python
Last synced: 13 Apr 2026
https://github.com/vatshayan/youtube-user-analysis
Analysis of Youtube Users about their choice and preferences
data data-analysis data-mining data-science data-visualization dataset machine-learning machine-learning-algorithms
Last synced: 05 Feb 2026
https://github.com/gsmithun4/expressjs-field-validator
Plugin for validating JSON request, middleware for expressjs
data express-js expressjs json-request middleware nodejs request rest-api validation
Last synced: 06 Mar 2026
https://github.com/dimaa1608/azurecontent
AzureContent is a repository on GitHub containing documentation and resources related to Microsoft Azure services and features. It provides clear and concise information for users seeking guidance on Azure cloud computing solutions.
azure azurecontent cloud computing content data deployment integration management networking platform security service storage virtualization
Last synced: 10 Apr 2025
https://github.com/istinnew/cook-me-up
[In Progress] Welcome to Cook-Me-Up! This project aims to analyze and organize cooking recipes using data analysis (Python, BigQuery SQL, Looker Studio etc.) and machine learning techniques. The goal is to simplify meal preparation and offer users a comprehensive database of culinary delights.
bigquery clustering cookme culinary data data-science dataanalysis datavisualization looker-studio machine-learning python recipe-search recipes unsupervised-learning
Last synced: 16 May 2026
https://github.com/kobowood1/data-analysis-alpha
My first data analysis project
data data-analysis data-analytics data-science
Last synced: 06 May 2025
https://github.com/madhuresh2011/kulturehire-internship
☺️Hi folk, During my internship at KultureHire, I completed a real-world Data Analyst project. I created an interactive dashboard using pivot tables, conducted a thorough analysis, and provided actionable recommendations. I'm excited to share my work and the insights I discovered.
data data-analytics data-cleaning data-standardization data-visualization excel excel-pivot-charts excel-pivot-tables genz-aspirations my-sql
Last synced: 17 Feb 2026
https://github.com/nel-zi/zipco_foods
Developed an automated ETL pipeline using Python and Apache Airflow to consolidate fragmented CSV sales data into a normalized Azure SQL database for Zipco Foods.
airflow apache-spark data dataengineering etl pyspark wsl
Last synced: 03 May 2026
https://github.com/praveendecode/data-analysis
Implemented data analysis projects with interactive Streamlit UI for user-friendly data exploration and insights presentation
data data-science dataanalysis exploratory-data-analysis insights python streamlit-dashboard tableau tableau-public
Last synced: 04 Apr 2025
https://github.com/rd-uk/rduk-data-sqlite
SQLite Data Provider implementation for rduk-data
Last synced: 16 May 2026
https://github.com/erictleung/2018-new-coder-survey
:beginner: Code to wrangle data from the 2018 New Coder Survey by freeCodeCamp
data data-cleaning dataset freecodecamp new-coders-survey programmers
Last synced: 03 Apr 2025
https://github.com/chrisrobertsjr/chrisrobertsjr
Welcome to my Github Profile!
data data-analysis java r sql statistics
Last synced: 03 May 2026
https://github.com/danicaalana/breast-cancer-random-forest
This project is developed as part of Digital Skill Fair (DSF) 35.0 - Data Science by Dibimbing. I am using Wisconsin Breast Cancer Diagnostic Dataset from scikit-learn, which is a classic and very easy binary classification dataset.
breast-cancer-classification breast-cancer-wisconsin data eda machine-learning-algorithms python random-forest-classifier
Last synced: 16 May 2026
https://github.com/mukul273/spring-data-rest-jpa-demo
Spring Data Rest JPA Demo
data jpa rest spring spring-boot spring-mvc
Last synced: 20 Apr 2026
https://github.com/leevilaukka/alkometriikka
Tool to search Alko database and see some fun stats about different beverages
data gh-pages svelte typescript xlsx
Last synced: 18 May 2026
https://github.com/debruine/faux.jl
Julia version of faux for data simulation
Last synced: 28 Mar 2025
https://github.com/muneeb1030/webscrapper_politifact
This initiative seeks to extract and analyze fact-checking data from Politifact.com, providing valuable insights into political statements, rulings, and the evolving information landscape.
data data-collection dataanalysis python3 scrapy scrapy-spider webscraping
Last synced: 09 Sep 2025
https://github.com/mx51/data-dictionary-action
GitHub Action for generating and checking freshness of data dictionaries
Last synced: 17 Jan 2026
https://github.com/idhruvs/angular4-smart-table-demo
Angular4 Smart Table Demo Project
angular4 data tables typescript
Last synced: 21 Apr 2026
https://github.com/tuscanicz/doctrine-data-applier
Symfony bundle for Doctrine Migrations of data using doctrine entities
data database doctrine entity migrations symfony symfony-bundle
Last synced: 02 Feb 2026