data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum
- Last updated: 2026-06-23 00:07:41 UTC
https://github.com/asuozzo/medicare-data-analysis
An analysis of Medicare Part D data in Vermont
Last synced: 04 May 2026
https://github.com/kingabzpro/makefile-actions
GitHub Actions and Makefile tutorial and project for beginners.
actions analytics automation data data-science makefile
Last synced: 18 Apr 2026
https://github.com/oefenweb/python-untraceables
Randomizes IDs for a given set of tables making them untraceable across environments
anonymize data database mysql privacy python python2 python3 randomization
Last synced: 03 Feb 2026
https://github.com/mohsinali08000/myportfolio
I’m Mohsin Ali, a passionate software engineer with over 2 years of experience in developing robust software solutions. Currently transitioning into the field of data science.
Last synced: 22 Apr 2026
https://github.com/metriccoders/metriccoders_datasets
This is the Metric Coders repository containing all the datasets for machine learning.
data datasets machine-learning natural-language-processing scikit-learn
Last synced: 08 Apr 2025
https://github.com/sanskaryo/ultimate-dsa-repo
One Stop Solution for DSA Learning and Resources
data data-structures-and-algorithms dsa hacktoberfest hacktoberfest-accepted hacktoberfest2025
Last synced: 15 Oct 2025
https://github.com/ucd-cws/nitrates-cv
california centralvalley data frep groundwater model nitrates
Last synced: 16 Jan 2026
https://github.com/thiagopanini/datadelivery
An open-source Terraform module that provides a complete infrastructure toolkit so users can begin exploring Analytics services on AWS.
analytics athena aws catalog crawler data datamesh glue s3 terraform
Last synced: 29 Nov 2025
https://github.com/ultrasage-danz/scikit-learn-ml
Machine Learning with scikit-learn by Data School
ai data data-school machine-learning macos ml scikit-learn ultrasage-dan
Last synced: 13 May 2026
https://github.com/apfirebolt/data-structures-and-algorithms-in-python
Data Structure and Algorithms in Python
algorithms data data-structures python python3 tkinter-gui
Last synced: 15 Mar 2025
https://github.com/mickfrog/uace-analysis
UACE ANALYSIS FOR 2011 - 2015
data data-science data-visualization folium-maps geocoder jupyter-notebook pandas python3
Last synced: 14 Feb 2026
https://github.com/garcane/british-airways-analysis
This project focuses on analyzing and visualising travel data from British Airways using Tableau. The goal is to extract insights and present them in an interactive and visually appealing manner.
data data-analysis data-visualization tableau
Last synced: 19 Mar 2026
https://github.com/remcostoeten/github-and-vercel-api-showcase-dashboard
Showcases data fetched from the GitHub and Vercel APIs, built entirely in vanilla JS.
api-rest da data express-js github-api nodejs vercel-api
Last synced: 07 Mar 2026
https://github.com/dhanish03/reliance-sales-report-dashboard
This project, Reliance Sales Report Dashboard, showcases a dynamic and interactive Power BI dashboard designed to analyze sales performance. The dashboard provides key insights into various aspects of sales data, including product-wise performance, region-based revenue, and profitability trends.
data datavisualization-project powerbi visualization
Last synced: 23 Jan 2026
https://github.com/ztgx/muvera
MUVERA: Making multi-vector retrieval as fast as single-vector search
algorithms data google muvera retrieval rust search structure vector
Last synced: 25 Oct 2025
https://github.com/prajjwol09/power-bi-project
The Data Survey Breakdown is an interactive Power BI dashboard designed to present insights gathered from a survey of professionals and enthusiasts in the data industry.
dashboard data interactive powerbi survey
Last synced: 15 Mar 2026
https://github.com/panodata/tikray
A compact data transformation engine.
data data-transformation data-transformation-pipeline data-transformer jmes jmespath jq jqlang json json-pointer json-transform json-transformation json-translate json-translator transformation transon
Last synced: 04 Oct 2025
https://github.com/kadirlofca/unity-csvmaker
Quick and easy way to create and export .csv files from Unity.
Last synced: 09 Apr 2026
https://github.com/infinitode/crsd
A synthetic customer review sentiment dataset for sentiment analysis generated using different AI models.
ai data dataset datasets huggingface-datasets mit-license ml nlp open-source python sentiment sentiment-analysis sentiment-classification text-data
Last synced: 10 Jun 2026
https://github.com/metapsy-project/data-psychosis-psyctr
Database of psychological interventions for schizophrenia and psychosis compared to control conditions.
Last synced: 16 Mar 2026
https://github.com/amethyst-php/cycle
amethyst amethyst-package api cycle data laravel
Last synced: 17 May 2026
https://github.com/miniql/miniql-json
A MiniQL query resolver that loads data from JSON files.
data json query query-language
Last synced: 11 May 2026
https://github.com/mcraiha/datagensharp
C# managed library for generating data
Last synced: 11 Aug 2025
https://github.com/tomquirk/sunshine-coast-council-rates-data
Rates data for the Sunshine Coast, Australia
australia data property rates real-estate
Last synced: 24 Feb 2026
https://github.com/ometman/vet-clinic
This is a database project for veterinary data management covering animals, owners, clinic employees, and visits; it is applicable to any data management need. It uses PostgreSQL, a relational database management system, and supports storing, updating, and querying data.
data database normalization postgresql postgresql-database queries sql sql-server-database tables transactions
Last synced: 13 May 2026
https://github.com/chubek/pyramid-dashboard
A Dashboard to Show Data Made Using Plotly Dash
dash data docker ml plotly plotly-dash python
Last synced: 19 May 2026
https://github.com/woctezuma/epic-games-js
JavaScript on the Epic Games store.
data datamining egs epic epic-games epic-games-api epic-games-launcher epic-games-store epicgames epicgames-api epicgames-launcher epicgames-store graphql graphql-api javascript webpack
Last synced: 27 Oct 2025
https://github.com/raulmaulidhino-dev/ml_modelling_regression
Many factors influence students' grades, and one of them is study hours. In this mini analysis project, three models learn and predict the relationship between students' study hours and their scores in an exam or test. The project identifies the best ML model for the problem.
data data-analysis-python data-science eda machine-learning scikit-learn
Last synced: 28 Jan 2026
https://github.com/analyticslover/sales-python-dashboard
Japan Sales Dashboard 2023
dashboards data data-analysis jupyter-notebook python3 sales streamlit
Last synced: 09 Apr 2026
https://github.com/OneMoreDavid/python-like-a-boss
This is where I stash my Python study material.
data data-analysis data-engineering data-science data-visualization datascience ipynb ipynb-jupyter-notebook ipynb-notebook numpy pandas python python3
Last synced: 28 Oct 2025
https://github.com/paul-henryp/simulate-investment-strategies
This Java program simulates different investment strategies using historical stock market data. It allows users to test various strategies such as buy and hold, moving average, buying when the stock price is lower than the last purchase, and dollar-cost averaging.
data data-science investing-java java plots plotting simulated-data simulated-investments sp500 sp500-data-analysis
Last synced: 21 May 2026
https://github.com/prateekmaj21/tableau-public-links
Tableau work as part of Data Visualization [AI&DS_205]
data data-visualization dataanalytics tableau-public
Last synced: 24 Jan 2026
https://github.com/mfurmanczyk/wh-sales
E-commerce analytics data warehouse ETL made with Apache Spark.
airflow data data-engineering data-warehouse kotlin python spark
Last synced: 24 Jan 2026
https://github.com/sahraiidle/email-spam-detector
Email/SMS spam detector with a Flask UI/API, tuned ML models (TF‑IDF + SVM/LogReg/NB), and a ready-to-run web form plus JSON endpoint for predictions.
data machine-learning numpy pandas python randomforest scikit-learn spam-classifier spam-detection svm
Last synced: 24 Jan 2026
https://github.com/sourceduty/pyduino_metadata
🔌 Arduino and Python serial communication metadata.
arduino data datalogger log metadata pyduino python sensor sensor-data serial-communication
Last synced: 08 Aug 2025
https://github.com/atharvapathak/twitter_sentiment_analysis_project
Twitter sentiment analysis is the process of analyzing tweets posted on the Twitter platform to determine the overall sentiment expressed within them. It involves using natural language processing (NLP) and machine learning techniques to classify tweets.
api bag-of-words bert cnn data gbm nltk rnn spacy twitter
Last synced: 28 Jan 2026
https://github.com/sourceduty/cults_3d
🔢 Software concept for additional statistics from Python for Cults design data .csv files.
3d 3d-model 3d-model-software 3d-modelling account account-management concept cults cults-3d data idea sourceduty
Last synced: 08 Aug 2025
https://github.com/v41bh4vr4jput/data-analysis-with-python
This repository is a comprehensive collection of data analysis projects and tutorials using Python's most powerful libraries: NumPy, Pandas, Seaborn, and Matplotlib. It is designed to help you explore, clean, visualize, and analyze data efficiently.
api data data-analysis data-visualization matplotlib numpy pandas python sakila-db seaborn
Last synced: 09 Apr 2026
https://github.com/theprodigyleague/d1g174lx534f00d
react/node bootstrapped project for a digi(company){["SEAFOOD"]}
bootstrap companies data data-conduit digital digital-seafood java javascript node project react seafood
Last synced: 01 Oct 2025
https://github.com/spatialcurrent/go-counter
Simple library and command line program for generating frequency distributions.
Last synced: 29 Jan 2026
https://github.com/aimin-nur/data-analyst
A Data Analyst (Machine Learning) project that analyzes used Ford car prices based on an existing dataset and identifies which features or columns affect those prices.
analytics data mechine-learing visualization
Last synced: 29 Jan 2026
https://github.com/tpltnt/wir_vs_virus_hackathon_projects
A list of all projects / challenges for the WirVsVirus hackathon as CSV
coronavirus csv data hackathon raw-data
Last synced: 29 Jan 2026
https://github.com/dfsp-spirit/neuroimaging_testdata
Contains test data for unit tests, used in developing neuroimaging software. Ignore this. Licenses in the individual archives.
Last synced: 25 Feb 2026
https://github.com/rosacarla/databases
Databases used in practical activities of the IGTI Data Analytics MBA.
Last synced: 19 Mar 2026
https://github.com/themost-framework/cache
MOST Web Framework Caching Module
Last synced: 12 Feb 2026
https://github.com/bearaujus/bdatamatrix
Structured Tabular Data Management in Go
Last synced: 30 Jan 2026
https://github.com/abendayan/orm
Lightweight ORM
cli dao data database database-management javascript mysql node node-js nodejs orm ormius ormius-cli schema
Last synced: 25 Feb 2026
https://github.com/chompfoods/stub-scala-akka-http-server
Scala Akka HTTP server stub for the Chomp Food & Recipe Database API. Use our API to get high-quality data on recipes and 875,000+ branded/grocery foods plus raw ingredients.
akka api branded chomp data database food grocery ingredients raw recipe-api recipes scala server stub stub-server
Last synced: 15 Apr 2026
https://github.com/brianlesko/postresql-docker
Run a PostgreSQL server hosted in a Docker container, and start a web UI for basic querying
basics container containerization containers data data-science docker postgres postgresql sql template
Last synced: 31 Jan 2026
https://github.com/denisecase/dc-texter
Send a text message using Python
alerts data python sms-messages streaming
Last synced: 08 Feb 2026
https://github.com/azmag/spm-dashboard
System Performance Measures are a selection of criteria used by Department of Housing and Urban Development (HUD) to evaluate how local Continua of Care are performing.
Last synced: 31 Jan 2026
https://github.com/rorovic/rorovic.github.io
My GitHub blog
code data datawarehouse devops realtime
Last synced: 01 Feb 2026
https://github.com/ddeepanshu-997/support_vector_regression--svr-
In this repository I performed support vector regression on real-life data. I first applied data preprocessing techniques to filter out flaws in the data, then built the model (SVM regression) to produce a machine learning regression model.
data data-science regression-analysis regression-models svm-model svm-regression
Last synced: 03 Aug 2025
https://github.com/pythoncoderunicorn/jamesbeardaward
A repo for James Beard Award data
Last synced: 07 Feb 2026
https://github.com/jk-oster/laravel-collection-trend
Generate trends from collections. Easily generate charts or reports.
charts collections data laravel php reports trends
Last synced: 03 Aug 2025
https://github.com/alecxcode/table-parser
Python Table Parser (data extraction)
automation data extraction python robotic-process-automation
Last synced: 04 May 2026
https://github.com/abdullahashfaqvirk/Earth-Engine-Data-Scraper
A Python-based web scraper designed to extract and organize dataset metadata from the Google Earth Engine Datasets Catalog for research and analysis purposes.
beautifulsoup data data-science python requests scraper web-scraping
Last synced: 27 Sep 2025
https://github.com/mlkav/digital-talent-scholarship
Learn in Digital Talent Scholarship Program
data data-science digital-talent-scholarship dts google-cloud google-cloud-platform science
Last synced: 26 Feb 2026
https://github.com/assada/free-words
Data for/from NLP
corpus-data data nlp-machine-learning npl
Last synced: 26 Feb 2026
https://github.com/gman-au/white-knight
Experimental .NET data abstraction using specification pattern
abstractions data datastore dotnet repository-pattern specification-pattern
Last synced: 17 Mar 2026
https://github.com/danicaalana/wine-dataset-decision-tree
This project was developed as part of Digital Skill Fair (DSF) 35.0 - Data Science by Dibimbing. It uses the Wine Recognition Dataset from scikit-learn, which contains the results of a chemical analysis of wines grown in the same region in Italy by three different cultivators.
data data-analysis-python data-science decision-tree-classification machine-learning python scikit-learn wine-dataset
Last synced: 18 Apr 2026
https://github.com/thedevreda/jadaerospace
A real-life project showing how to improve aircraft parts sales and help sellers focus on more effective products at JadAero
data data-analysis data-cleaning data-visualization jupyter-notebook powerbi python
Last synced: 02 Aug 2025
https://github.com/nushratjabenaurnima/cse_477_data_mining
A collection of labs, reports, Jupyter notebooks, and project outputs for the CSE 477 Data Mining course. This repository tracks my learning journey through data preprocessing, association rules, clustering, classification, and real-world data analysis with Python.
data data-analysis data-mining data-science google-colab-notebook jupyter-notebook machine-learning python python-3
Last synced: 09 Apr 2026
https://github.com/bishtrishu/netflix_movies_dashboard
This project is a comprehensive dashboard for analyzing Netflix movies and shows. Using a combination of Power BI, Python, and Excel, this dashboard provides insights into various aspects of Netflix's content library.
ai artifical-intelligense dashboard data dataanalysis dataanalyst dataanalytics datacleaning datahandling datascience datavisualization excel machine-learning msexcel powerbi report
Last synced: 09 Feb 2026
https://github.com/jigyasag18/airline-performance-and-passenger-satisfaction-project-using-big-data-analytics
This project analyzes 10 years of U.S. domestic airline data (~3GB) using Hadoop (Cloudera) and Hive for data processing. Power BI dashboards visualize key metrics like delays, on-time rates, air time, and diversions. The solution includes Hive queries, DAX measures, HDFS ingestion scripts, and year-wise insights with recommendations.
big-data big-data-analytics bigdata cloudera cloudera-hadoop cloudera-hadoop-framework data data-analysis data-visualization database hadoop hive power-bi powerbi powerbi-dashboard powerbi-dashboards powerbi-report powerbi-visuals powerbi-visuals-tools powerbidashboard
Last synced: 01 Aug 2025
https://github.com/metapsy-project/data-panic-psyctr
Database of psychotherapy for panic disorder compared to control conditions
Last synced: 18 Mar 2026
https://github.com/ludreinsalvador/global-covid-19-data-analysis
Contains Power BI dashboards that visualize and analyze global COVID-19 cases, deaths, and vaccination trends using data from the World Health Organization (WHO). The project aims to provide insights into the pandemic’s impact and vaccination progress worldwide through dynamic reports and advanced analytics.
analytics covid-19 covid19-data data data-analysis data-collection data-transformation data-visualization
Last synced: 26 Feb 2026
https://github.com/dysnomia-studio/achieve-games-dump
Public dump of parts of the achieve.games database, including the Steam games list
data dump games steam steam-api steam-game steam-games
Last synced: 27 Feb 2026
https://github.com/lemaitre4523/old-tiktok-data-report-explorer
An explorer for TikTok data reports
data explorer extract package report simple tdre tiktok tiktok-data-explorer
Last synced: 25 Sep 2025
https://github.com/javdomgom/nifi-custom-processors
Apache NiFi custom processors
apache-nifi bigdata data data-engineering datascience flowfile nifi nifi-custom-processor
Last synced: 27 Feb 2026
https://github.com/os-climate/data-requests
This repo is used to track issues related to new Data Requests
Last synced: 27 Feb 2026
https://github.com/abhishekn1947/samgov-scraper
Automated Python scraper for sam.gov contracts
analytics automation aws data pandas postgresql rds selenium webscraper
Last synced: 09 Apr 2026
https://github.com/sweta-kaundilya/power-bi-learning-projects
This repository contains completed exercises while learning Power BI
data datavisualization dax powerbi powerquery
Last synced: 27 Feb 2026
https://github.com/revolutionarybukhari/datawarehouse_meshjoin_superstore
A data warehouse built for a superstore's streaming data using an extended mesh join, by Syed Husnain Haider Bukhari
data data-science data-warehousing meshjoin
Last synced: 23 May 2026
https://github.com/cunfuu/network-bubbles
Makes it easier to manage organizations and keep notes about them, in order to organize events and easily access their needs
data data-visualization organizations organizations-volunteer
Last synced: 31 Jul 2025
https://github.com/kunalthakur204/visualization-on-flower
🌸 Flower Dataset Visualization: visualizing patterns and relationships in flower data through charts and plots. Perfect for exploring floral characteristics and trends! 📊
data data-visualization dataanalysis flowerdataset python
Last synced: 16 Apr 2026
https://github.com/sehaj003/boston-bruins-roster-planning-mysql-nosql
Repository for Data Management project, Boston Bruins Roster Planning using MySQL and NoSQL along with data analysis using Python
data data-management mongodb mysql project-repository python
Last synced: 11 May 2026
https://github.com/beastbytes/postal-code-data-php
Implementation of PostalCodeDataInterface using PHP file storage
Last synced: 27 Feb 2026
https://github.com/vianneymi/amplifai
Amplifai is a package that allows you to transform your raw unstructured text into structured data in a few lines of code.
data data-mining extraction langchain llm pydantic
Last synced: 27 Feb 2026
https://github.com/soenneker/soenneker.dtos.requestdataoptions
A flexible request options object for paging, sorting, and filtering queryable data, similar to OData-style parameters.
controller coordinator csharp data dotnet dto dtos http manager object odata options request requestdataoptions
Last synced: 12 Mar 2026
https://github.com/amethyst-php/tax
amethyst amethyst-package api data laravel tax
Last synced: 11 May 2026
https://github.com/j0a0m4/olympics
Final Project for Data Engineering Accelerated LATAM
Last synced: 13 Feb 2026
https://github.com/sebastian-diaz-berdecia/analisis-popularidad-de-series-y-generos-de-series
SQL queries for analyzing the popularity of series and series genres in the NetflixDB database.
business-analytics bussiness-intelligence data data-analysis database mysql mysql-database sql
Last synced: 12 May 2026
https://github.com/vbhatsaccnt/retail-strategy-and-analytics-optimization-of-control-stores-for-sales-enhancement
In this project, we aim to optimize the performance of retail chain stores by establishing control stores based on their performance compared to selected trial stores. By leveraging data analytics and strategic insights, we seek to enhance sales revenue and drive growth within the retail chain.
customer-segmentation data data-science risk-analysis
Last synced: 13 May 2026
https://github.com/nia-cloud-official/influx-agents
Influx-CRD is a web application designed to facilitate data collection, recovery, and distribution for agents uploading data to a centralized database. It provides an intuitive interface for managing data collection from various sources and recovering lost or corrupted data.
broker collection data data- influx influx-agent
Last synced: 30 Jul 2025
https://github.com/molinsagustin/cinedata
CineData: group project for the Data Engineering I course at Universidad Argentina de la Empresa. It consisted of developing a relational database in Microsoft SQL Server Management Studio using the Agile Scrum methodology, which was applied from requirements gathering through to final implementation.
agile data data-modeling database diagram entity-relationship-diagram microsoft-sql-server relational-databases relational-model scrum scrum-agile sql sqlserver
Last synced: 28 Feb 2026
https://github.com/greedchikara/dsajs
Data Structures and Algorithms written in JavaScript
Last synced: 09 Apr 2026
https://github.com/ashamethedestroyer/data-structures
Dedication of all Data Structures Creation 🛠
cpp data data-structures implementation implementation-of-data-structures structure structured-data
Last synced: 23 May 2026
https://github.com/cvinicius987/projetos-bigdata
Case studies involving Big Data and Data Engineering projects.
bigdata data data-engineering spark
Last synced: 13 May 2026
https://github.com/arunabhagit/bank-customer-churn-analysis-and-risk-tracker
This project analyzes customer churn using machine learning and visual storytelling through Power BI. A Random Forest model identifies high-risk customers, while interactive dashboards reveal key churn patterns, enabling targeted retention strategies and data-driven decision-making for business improvement.
analysis data powerbi predictive-modeling sql
Last synced: 28 Jul 2025
https://github.com/erictleung/tidytuesdays
📈 My attempts at #tidytuesday
data data-science data-visualization r rstats tables tidytuesday tidyverse
Last synced: 19 Sep 2025