data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum
- Last updated: 2026-06-29 00:07:49 UTC
https://github.com/ashakoen/bls-data-extract
This repository contains scripts and a database schema to set up and manage a local SQLite database for storing and querying the Average Price data from the U.S. Bureau of Labor Statistics. It includes tools for downloading the latest data from the BLS website and fetching Consumer Price Index (CPI) data via the BLS API.
Last synced: 01 Apr 2026
https://github.com/h-sutiwas/r2de-2025
This repository is related to the Road To Data Engineer Bootcamp by DataTH. It contains all related coursework, some mini projects and other resources within the field of Data Engineering.
data data-engineering data-visualization docker gcp pipeline spark
Last synced: 30 Apr 2026
https://github.com/jigyasag18/power-bi-dashboard-project
The Ecommerce Sales Analysis Dashboard project utilizes Power BI to provide detailed insights into ecommerce sales data, enabling stakeholders to track key performance metrics and uncover trends. This interactive dashboard allows users to explore the data in real time, offering features such as drill-down capabilities and customizable filters.
dashboard data data-visualization datacleaning datanalysis datanalytics datapreprocessing powerbi visulaization
Last synced: 04 Mar 2026
https://github.com/armand-sauzay/datasets
Datasets for machine learning
ai data datasets machine-learning ml
Last synced: 18 Jan 2026
https://github.com/suchi25sathavara/r-projects
R projects in real-world scenarios for data analysis
data data-analysis datavisualization r
Last synced: 01 Apr 2025
https://github.com/suchi25sathavara/data-wrangling-with-r
Analyzing Road Accidents in Victoria, Australia
data r reporting rstudio wrangling-data
Last synced: 01 Apr 2025
https://github.com/eudesgccunha/automated-management-panel
Automated management panel using Power BI
data data-analysis data-visualization database excel powerbi
Last synced: 04 Feb 2026
https://github.com/darkogamerz/dhis2heat
A comprehensive data management and Health Equity Assessment and Analysis platform that fetches data from DHIS2, then optimizes, calculates, cleans, and visualizes inequality data.
analytics data data-science dhis2 equality equity health heat inequality r shiny shinydashboard visualization
Last synced: 01 Apr 2025
https://github.com/giuleo129/dataanalysis
This folder contains two projects focused on data analysis and statistical learning using R, covering exploratory data analysis, modeling, and predictive techniques.
data data-analysis data-science statistical-learning
Last synced: 25 Jan 2026
https://github.com/eby8zevin/android-intent
Intent & Bundle - Android Studio
android android-development android-studio bundle data intent java xml
Last synced: 03 Sep 2025
https://github.com/gappeah/layoffs-exploratory-data-analysis
This project uses MySQL to perform data cleaning and exploratory data analysis (EDA) on a dataset detailing company layoffs. The primary goal is to process, clean, and explore the data to gain insights into trends and patterns related to layoffs across various sectors.
data dataanalysis eda mysql sql
Last synced: 12 Jul 2025
https://github.com/beriberikix/senml-zephyr
A codec for encoding and decoding Sensor Measurement Lists (SenML) for Zephyr
codec data iot senml sensor zephyr-rtos
Last synced: 24 Mar 2025
https://github.com/merekat/flight-delay-prediction
This project focuses on predicting flight delays using historical data from a Tunisian airline. We analyzed patterns in airport operations and flight schedules to build a machine learning model that can forecast potential delays.
aviation data data-science machine-learning machine-learning-algorithms machinelearning prediction predictive-modeling
Last synced: 08 Apr 2025
https://github.com/pew-pew-team/hydrator
Hydrator kernel component
data deserializer dto hydrator kernel mapper mapping serializer structure
Last synced: 24 Mar 2025
https://github.com/shahsuvarli/election-voters-data-analysis-pandas
Educational project analyzing Azerbaijan voter demographics with pandas, focusing on data cleaning, grouping, and visualization.
cleaning data grouping matplotlib numpy pandas python visualization
Last synced: 12 Apr 2026
https://github.com/victorowinoke/custmer-segmentation-using-rfm-python-
Customer segmentation using Recency, Frequency, and Monetary (RFM) values
customer-segmentation data data-visualization python3 science time-series-analysis
Last synced: 26 May 2026
https://github.com/mumtaz4118/nlp-course
Programming Assignments and Lectures for Stanford's CS 224: Natural Language Processing with Deep Learning
course data data-analysis data-analytics data-science data-visualization deep-learning education machine-learning natural-language-processing neural-network transfer-learning
Last synced: 24 Nov 2025
https://github.com/sauravsrivastav/githubreposearcher
GitHub Repo Searcher 🔍 is a Streamlit web application designed to help you search for GitHub repositories based on a query and view the results in a tabular format. You can also download the results in CSV or Excel format for further analysis. 📊📈
data data-export excel github-api python repository-searcher streamlit webapp
Last synced: 20 Jan 2026
https://github.com/petzi53/repair
R Datasets of the Open Repair Alliance (ORA).
Last synced: 19 May 2026
https://github.com/diegoperea20/datos-secuenciales-con-ia
Processing of one-dimensional signals with autoregressive models, 1D convolution, 2D convolution using the spectrogram, and recurrent networks
ai artificial-intelligence convolutional-neural-networks data ia secuential-data spectrogram uao
Last synced: 06 Feb 2026
https://github.com/etmendz/mendz.data.sqlserver
Provides a generic Mendz.Data-aware context for ADO.Net-compatible access to SQL Server databases.
ado-net context data database datasettings mendz sql-server
Last synced: 10 May 2026
https://github.com/muhamedlabs/muhamed_onedrive
Muhamed_OneDrive is a reliable and convenient cloud storage service for files, designed for secure storage and easy data sharing.
data html5 onedrive programming style
Last synced: 04 Jan 2026
https://github.com/shsiddhant/womens-wc
ML project to predict match outcomes for Women's Cricket World Cup 2025.
cricket-prediction data feature-engineering postgresql python
Last synced: 04 Apr 2026
https://github.com/holo-nim/flue
data streaming options
data nim reader-writer streams
Last synced: 04 Apr 2026
https://github.com/ahmad-ali-rafique/decision-tree-regressor-modeling
Comprehensive exploration of decision tree regressors, including data cleaning, model building, and performance evaluation on various datasets.
artificial-intelligence data data-analysis dataanalytics decision-trees decisiontreeregressor modeling models regression-models
Last synced: 17 Apr 2026
https://github.com/machinecyc/lotteryinsight
Uses a crawler to collect Taiwan Lotto data and saves it to a local MySQL server.
crawler data docker lottery mysql-database python3 taiwan
Last synced: 09 May 2026
https://github.com/ragibasif/bobdylan
Bob Dylan
bob-dylan csv data data-science data-visualization lyrics music python
Last synced: 03 Sep 2025
https://github.com/bhavanachitragar/layoff_analysis
This Streamlit app is designed for Layoff Analysis. It allows users to explore and analyze layoff data from different perspectives, including overall analytics, country-specific insights, and individual company details.
data dataanalysis streamlit streamlit-webapp
Last synced: 18 Apr 2026
https://github.com/zurd46/zurdsynthdatagen
This Electron project uses the OpenAI ChatCompletion API to generate synthetic datasets in either German (DE) or English (EN).
data data-structures dataset electron json jsonl nodejs openai synthetic
Last synced: 04 Apr 2026
https://github.com/jpcadena/ventas-facturas
Sales with invoices
data data-analysis data-exploration data-extraction data-science excel feature-engineering matplotlib microsoft numpy pandas powerbi product-sales pylint python receipts sales
Last synced: 12 Apr 2026
https://github.com/rd-uk/rduk-data-pg
PostgreSQL Data Provider implementation for rduk-data
Last synced: 18 Apr 2026
https://github.com/ffatahillah7/snowflake-data-governance-warehouses
Welcome to the Powered by Tasty Bytes - Zero to Snowflake Quickstart focused on Data Governance! Within this Quickstart we will learn about Snowflake Roles, Role Based Access Control and deploy both Column and Row Level Security that can scale with your business.
data data-governance snowflake
Last synced: 06 Jan 2026
https://github.com/mipacd/holochatstats
A VTuber chat log (and general) analytics platform
data flask hololive postgresql python visualization vtuber youtube
Last synced: 05 Apr 2026
https://github.com/codbex/codbex-number-generator-data
Number Generator for Documents Module - Data
Last synced: 05 Apr 2026
https://github.com/abhijeetdasbakshi/ecommerce-insights
A Dockerized end-to-end project that combines unsupervised machine learning for customer segmentation with scalable data pipelines. It uses MongoDB for data ingestion, Scikit-learn for clustering, Airflow for orchestration, and Streamlit for interactive visualization — enabling actionable insights into e-commerce
airflow airflow-dags ci-cd-pipeline clustering dags data data-pipelines docker docker-compose docker-container dockerfile git great-expectations kafka mongodb pca-analysis postgresql pyspark t-sne umap-learn
Last synced: 04 Apr 2026
https://github.com/josericodata/josericodata.github.io
Welcome to my portfolio website. This site showcases my skills, experience, education, and projects as a Data Analyst.
awesine-latex big-data career-development data data-analyst data-science database dublin ireland job-seeking jose-maria-rico-leal jose-rico jose-rico-data latex latex-cv portfolio portfolio-website python sql
Last synced: 18 Apr 2026
https://github.com/ailixter/gears-dictionary
The project, which Gears Dictionary
arrays data dictionaries dictionary php struct utilities
Last synced: 19 Jul 2025
https://github.com/prakashjha1/loan-eligibility-prediction
This repository contains the codebase and resources for a machine learning-based project aimed at predicting loan eligibility for individuals. The project utilizes various algorithms and data preprocessing techniques to build predictive models that assess the likelihood of an applicant being eligible for a loan based on historical data.
data data-visualization exploratory-data-analysis loan-prediction-analysis machine-learning-algorithms naive-bayes-classification parameter-tuning python random-forest
Last synced: 19 Apr 2026
https://github.com/mksingh431/free-data-science-courses
Data science is a rapidly growing tech field that’s transforming business decision-making. To break into this field, you need the right skills. Fortunately, top institutions like Harvard and IBM offer free online courses. These courses cover everything from basic programming to advanced machine learning.
course data data-analysis data-science data-visualization free freecou python
Last synced: 19 Apr 2026
https://github.com/huemulsolutions/huemul_sql_decode
Gets the fields and tables used in a SQL statement
bigdata chile data data-governance governance spark sql
Last synced: 19 Apr 2026
https://github.com/scjoaoantonio/trab_datascience
This project aims to analyze posts from the Bluesky social network. The interactive application was built with Streamlit and supports data collection and visualization, as well as advanced analyses such as engagement prediction, topic modeling, and sentiment analysis.
bluesky data data-science streamlit
Last synced: 09 May 2026
https://github.com/rileynwong/forecasting-coffee-prices
Predict coffee prices in Kenya
data data-analysis data-scraping data-visualization forecasting forecasting-models forecasting-prices jupyter-notebook prophet prophet-model
Last synced: 20 Jun 2026
https://github.com/istinnew/etl-pipeline-ganz-project
End-to-end ETL pipeline project for collecting, transforming, and loading data into a cloud-based database using Python, MySQL, and Google Cloud Analytics
cloud cloud-engineering cloud-services data data-science dataanalytics database database-schema googlecloud mysql mysql-database python python-lambda
Last synced: 20 Apr 2026
https://github.com/montanaz0r/suicide-rate-analysis
Testing the significance of the correlation between the suicide rate and the number of psychiatrists and psychologists working in the mental health sector
analysis correlation data data-analysis data-science jupyter-notebook jupyter-notebooks matplotlib numpy pandas psychology python python-3 seaborn statistics suicide-rate
Last synced: 20 Apr 2026
https://github.com/haideratgh/sql-data-analytics-project
This repository contains a collection of SQL scripts demonstrating various analytical techniques, such as change-over-time, cumulative, performance, data segmentation, and part-to-whole analysis
analytics business-analytics business-intelligence data data-analysis data-analyst data-analytics data-engineering data-science data-scientist database datascience query reporting sql sql-query sql-server window-functions-in-sql
Last synced: 29 Jun 2025
https://github.com/omers/sre-devops-tools
Tools and useful sources for SRE and DevOps
awsome awsome-list data devops monitoring sre tools
Last synced: 20 Apr 2026
https://github.com/anjaliwork20/moodify
Mood-based music recommendation system that considers a user's emotional state to recommend songs, genres, artists, and playlists using machine learning
artificial-intelligence cnn-keras cnn-model convolutional-neural-networks data data-analysis data-science data-structures data-visualization database deep-learning machine-learning machine-learning-algorithms python recommended song songs
Last synced: 20 Apr 2026
https://github.com/arda-guler/binmotion
Convert ANY data to a video file. Sister project of binGallery.
data data-visualization proof-of-concept video
Last synced: 04 Jun 2026
https://github.com/smaug6739/data-bit
This project is a module for converting a structured dataset into a number that can be stored in a database taking up little space.
Last synced: 14 May 2026
https://github.com/awpala/udemy-my-courses-data-parser
Download Udemy lists and courses metadata for authenticated student user
Last synced: 07 May 2026
https://github.com/g-schumacher44/analyst_resource_hub
A collection of guidebooks, quickref, and resources for data analysis
analytics bigquery data lookerstudio machine-learning model python sql yaml-configuration
Last synced: 20 Jun 2026
https://github.com/khushi-sabarad/data_analysis
LinkedIn Learning capstone project
data data-engineering matplotlib pandas python
Last synced: 10 May 2026
https://github.com/neha-adnani/sql_music-store-analysis
SQL-based data analysis of a digital music store's sales and customer data.
business-analysis data data-analysis database follow-along-projects pgadmin4 portfolio-project postgres queries sql
Last synced: 18 Jun 2025
https://github.com/hormcodes/data
Terraform configuration for public data storage hosted on data.horm.codes
aws cloudfront content-management data github-actions s3-bucket terraform
Last synced: 20 Apr 2026
https://github.com/sdspot2034/data-lemur-solutions
Solutions to SQL Problems on DataLemur
competitive-programming data data-analytics data-science database postgresql query sql
Last synced: 20 Apr 2026
https://github.com/nikoheikkila/maps
A TypeScript collection of specialized map implementations
data javascript maps typescript
Last synced: 20 Apr 2026
https://github.com/dvaser/heart-attact-analysis-prediction
DATA ANALYSIS
classification data data-analysis data-visualization jupyter jupyter-notebook lineer-regresyon machine-learning python regression
Last synced: 20 Jan 2026
https://github.com/sidneyarcidiacono/data-parser
A node module designed to make reading in large files as easy as calling one function.
Last synced: 05 May 2026
https://github.com/aneeshmurali-n/nlp-emotion-classification-in-text
Develop machine learning models to classify emotions in text samples.
bag-of-words data emotion-classification feature-extraction machine-learning naive-bayes natural-language-processing nlp nltk preprocessing python scikit-learn svm text-classification tf-idf tokenizer vectorizer
Last synced: 10 May 2026
https://github.com/nxion/sql-data-warehouse-project
Building a modern data warehouse with MS SQL Server, ETL processes, data modeling, and analytics.
data data-analysis data-analytics data-engineering data-lakehouse data-warehouse datalake datascience etl etl-job medallion-architecture ms mssql sql sql-query sql-server
Last synced: 05 Jun 2026
https://github.com/mozzo1000/web-analytics
Website analysis tools and data
analysis analytics data website
Last synced: 21 Apr 2026
https://github.com/fastpix/android-data-kaltura
This SDK enables seamless integration with Kaltura Player, offering advanced video analytics via the FastPix Dashboard
analytics android-sdk data fastpix kaltura kaltura-player metrics sdk video video-metrics
Last synced: 21 Apr 2026
https://github.com/pbinkley/mfmcollections
Project to distill data about published collections of microfilms from library lists
Last synced: 28 May 2026
https://github.com/vishwas-chakilam/movies-review-scraping-analysis
A project for collecting, cleaning, and analyzing movie data. Includes scripts for web scraping (deprecated) and using the OMDb API to fetch movie details. Analyze and visualize data with Python and Power BI to uncover insights and trends in movie ratings and genres.
data dataanalysis datacleaning datavisualization matplotlib-python numpy-library pandas python webscraping
Last synced: 21 Apr 2026
https://github.com/dug22/jjournal
A Jupyter like notebook software for Java
data data-analysis data-science java jshell jshell-repl notebook swing swing-application
Last synced: 11 Apr 2026
https://github.com/amethyst-php/alias
alias amethyst amethyst-libary amethyst-package api data laravel library package
Last synced: 21 Apr 2026
https://github.com/stefen-taime/llm-rag-mtl-public-hospital
This project develops a Retrieve-Augment-Generate (RAG) model to answer questions using public data from Google reviews of hospitals in Montreal
data google-reviews hopital hospital hub ia llm montreal open-source quebec rag
Last synced: 21 Apr 2026
https://github.com/jdenn0514/surveycore
Core Survey Analysis Infrastructure
Last synced: 21 Apr 2026
https://github.com/vck9521/traffic-accidents
In this project, we analyze the effects of various factors that correlate to traffic fatalities in the United States. Logistic regression is used, with the y variable being Fatality Rate (coded 0 for Survived, 1 for Fatality).
analysis data fatalities r regression rstudio traffic visualization
Last synced: 05 Jun 2026
https://github.com/datasqlsantosh/global-energy-consumption-renewable-generation-python-data-analysis-portfolio
This project focuses on analyzing global energy consumption patterns and trends in renewable energy generation using Python data analysis libraries such as Seaborn and NumPy. The analysis aims to explore energy consumption data from various regions worldwide and examine the contribution of renewable energy sources over time
data data-analysis data-visualization pandas seaborn
Last synced: 10 May 2026
https://github.com/adrianoleitedasilva/adrianoleitedasilva
My name is Adriano. I am 35 years old, with 18 of those years dedicated to the fields of Information Technology and Education.
adrianoleitedasilva automation ceo cio cto data data-science dev diretor github mobile professor python readme techlead web
Last synced: 10 May 2026
https://github.com/schijioke-uche/data-analysis-with-python-an-spss-model
With this Python notebook algorithm, you can use SPSS Model notebook to build machine learning pipelines that you can use to iterate rapidly during the model building process in data analysis. Whether you're trying to find the right algorithm or experimenting with different ways of preparing your data, you can create reproducible research that's easily understood by any member of your team with Hypothesis definition.
anova cp4a cp4d cp4i cp4s data ibm ibm-cloud jeffrey-chijioke-uche jeffrey-solomon-chijioke-uche openshift python python3 redhat t-test
Last synced: 22 Apr 2026
https://github.com/hemangsharma/assignment-2---classification-models
Assignment 2 - Classification Models repository contains the project for 36106 Machine Learning Algorithms and Applications
data datascience-machinelearning machine-learning ml
Last synced: 10 Jun 2026
https://github.com/rbcavi/factorio-mod-data
The modpack data for factorio-viewer
data factorio factorio-data factorio-mod-data
Last synced: 23 Apr 2026
https://github.com/amethyst-php/owner
amethyst amethyst-package api data laravel owner
Last synced: 28 Apr 2026
https://github.com/syed-nihaal/car-price-prediction-and-performance-analysis
A data science notebook project focused on analyzing car features and building a model for car price prediction.
data data-analysis data-visualization jupyter-notebook python
Last synced: 23 Apr 2026
https://github.com/moeabbas6/bq_data_loader
A Python script for executing and logging batch SQL commands in Google BigQuery. Includes tracking of execution times, unique job and statement IDs, and automated logging to a specified BigQuery table.
Last synced: 24 Mar 2025
https://github.com/elcarrillo/structpy
StructPy is a Python-based command-line tool designed for academics and scientists to manage data projects effectively. It simplifies workflows by creating structured project directories, generating timestamped filenames, validating datasets, and backing up projects seamlessly.
command-line-tool data database file-structure organization python science-tool
Last synced: 24 Apr 2026
https://github.com/howwohmm/fetchgram
era-adjusted Instagram content intelligence — scrape any public profile, OCR every image, measure what actually works. free, local, no API keys.
analytics cli content-strategy data instagram ocr python scraper
Last synced: 06 Jun 2026
https://github.com/yuvrajsaraogi/-iris-flower-classification
The iris flower has three species: setosa, versicolor, and virginica, which differ according to their measurements. Now assume that you have the measurements of the iris flowers according to their species, and the task is to train a machine learning model that can learn from the measurements of the iris species and classify them.
classification data data-analysis data-science data-visualization flower flower-classification iris iris-classification iris-flower iris-flower-classification knn knn-classification machine-learning machine-learning-algorithms ml natural-language-processing nlp python
Last synced: 24 Apr 2026
https://github.com/bablukumarjha/startup-funding-revenue-analysis-by-sql-and-pandas
SQL project analyzing startup funding, revenue, and founder data to extract business insights using Python and MySQL.
data data-analysis data-platform data-science dataanalysisusingpython dataanalytics pandas-dataframe pandas-library python sql sql-server sqlalchemy sqldatabase
Last synced: 18 May 2026
https://github.com/hruth-vik/sales-analysis-report
SalesScope is a powerful sales analytics dashboard that extracts insights, reveals trends, and drives strategy from raw data.
analytics data powerbi-report powerbi-visuals python
Last synced: 24 Apr 2026
https://github.com/repirate/asset-recovery-tool
A simple tool for recovering undrained tokens and NFTs from a compromised wallet on the Ethereum network.
bitcoin blockchain cryptocurrencies cryptocurrency data ethereum funds metamask-desktop metamask-plugin phrase recovery seed token wallet
Last synced: 10 May 2026
https://github.com/marielachirinosr/cyclistic-data-analytics-project
This project explores user behavior within a fictional bike-sharing system, modeled after Cyclistic, operating in Chicago.
data data-visualization pandas powerbi-report powerbi-visuals python
Last synced: 24 Apr 2026
https://github.com/infinitode/crsd
A synthetic customer review sentiment dataset for sentiment analysis generated using different AI models.
ai data dataset datasets huggingface-datasets mit-license ml nlp open-source python sentiment sentiment-analysis sentiment-classification text-data
Last synced: 10 Jun 2026
https://github.com/fuzzt/location-analyzer
The Location Data Analyzer is a Spring Boot application that offers insights on location data, such as counting locations by type, calculating average ratings, and identifying the most reviewed and incomplete entries. It features a simple frontend (HTML, CSS, JavaScript) and is deployed on Render.
analysis api average css data deployment docker fetch-api frontend html javascript location maven ratings render restful-api reviews spring-boot techstack
Last synced: 11 Apr 2026