data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum
- Last updated: 2026-07-03 00:07:49 UTC
https://github.com/deliprofesor/breast-cancer-detection-using-svm-with-smote-and-model-optimization
This project analyzes health and lifestyle factors influencing heart attack risk using statistical methods and machine learning, with Ridge Regression identified as the best predictive model.
classification data data-preprocessing data-science data-visualization gridsearchcv machine-learning python roc-curve smote svm
Last synced: 10 Apr 2025
https://github.com/prishabhanot/facial_recognition_pca
A face recognition system using Principal Component Analysis (PCA) for dimensionality reduction and a Support Vector Machine (SVM) classifier for classification. PCA extracts essential features (eigenfaces) from facial images, significantly reducing computational complexity while retaining critical information for accurate recognition.
data eigenfaces facial-recognition pca python reducing-computational-complexity reducing-data-dimensions svm-classifier
Last synced: 01 Mar 2025
https://github.com/contawo/travel-journal
This is a travel journal application for storing all the places that you have visited. I built it while learning React by doing; I learnt a lot and improved my React skills.
data learning-by-doing props reactjs
Last synced: 05 May 2026
https://github.com/remcostoeten/github-and-vercel-api-showcase-dashboard
Showcases data fetched from the GitHub and Vercel APIs, built entirely in vanilla JS.
api-rest da data express-js github-api nodejs vercel-api
Last synced: 07 Mar 2026
https://github.com/musamairshad/dsa-python
This repository contains all the material related to Data Structures and Algorithms implemented in Python.
algorithms data datastructures efficiency python searching-algorithms sorting-algorithms
Last synced: 25 Mar 2025
https://github.com/dhanish03/reliance-sales-report-dashboard
This project, Reliance Sales Report Dashboard, showcases a dynamic and interactive Power BI dashboard designed to analyze sales performance. The dashboard provides key insights into various aspects of sales data, including product-wise performance, region-based revenue, and profitability trends.
data datavisualization-project powerbi visualization
Last synced: 23 Jan 2026
https://github.com/munas-git/codm-review-analysis-and-predictions
Sentiment analysis on Call of Duty Mobile Google Play Store user reviews with ML model to classify new reviews.
data flask machine-learning python sentiment-analysis
Last synced: 05 May 2026
https://github.com/sakshamarora07/blinkit-sales-report-power-bi
This dashboard provides Blinkit with insights to optimize its grocery delivery operations and understand customer preferences. It evaluates sales trends, outlet performance, and item categories to identify key areas for improvement. The interactive visuals allow detailed exploration of sales distribution, customer ratings, and product popularity.
data data-science dataanalytics datavisualization excel powerbi sql
Last synced: 08 Jan 2026
https://github.com/ompreetham/fylo-data-storage-component
Fylo Data Storage Component Challenge on Frontend Mentor.
component css data front-end front-end-development frontend frontend-mentor frontendmentor-challenge fylo html react render scss storage vite website
Last synced: 11 Apr 2026
https://github.com/raghavendranhp/youtube_data_harvesting
The "YouTube Data Analyzer" is a versatile tool for businesses and content creators, enabling them to gather, analyze, and harness valuable insights from multiple YouTube channels. With streamlined data collection, storage in MongoDB, migration to SQL, and a user-friendly Streamlit interface, it empowers users to make data-driven decisions.
apiintegration data datacollection eda googleapi googleapiclient matplotlib mongodb mysql mysqlconnector numpy oops pandas pymongo python pythonoops sql sqlalchemy streamlit youtube-api
Last synced: 13 Apr 2026
https://github.com/codegouvfr/codegouvfr-sources
Static web frontend for code.gouv.fr
bluehats codegouvfr data frontend
Last synced: 28 Feb 2025
https://github.com/rorylshanks/devdb-client
This is the repository for the official command line client for DevDB (https://devdb.cloud)
cloud data database-management development
Last synced: 29 May 2026
https://github.com/shadmanshaikh/data-analysis-and-ml-work
All of my work in Data Analysis and Machine learning
analytics artificial-intelligence data machine-learning
Last synced: 05 Jul 2025
https://github.com/louis-heraut/dataverseur
A dataverse API R wrapper to enhance the deposit procedure using only R variable declarations
data data-repository data-science datascience dataset dataverse dataverse-api json metadata metadata-management metadata-parser r
Last synced: 24 Oct 2025
https://github.com/gianlucatruda/titanic
An exhibition of my experience in data processing and visualisation. Python script to process and visualise the Titanic survivor data.
data database flask info matplotlib python science scrape server titanic visualisation web
Last synced: 10 Apr 2026
https://github.com/moscatellimarco/webscrap-tinydeal
"WebScrap-TinyDeal" is a Scrapy-powered tool for harvesting product information from TinyDeal. It outputs structured CSV data, ready for analysis. Explore the scripts for an interactive scraping adventure or leverage the data for competitive pricing strategies.
css data datascience html pandas python scrapy web webscraper webscraping
Last synced: 14 Apr 2026
https://github.com/ztgx/muvera
MUVERA: Making multi-vector retrieval as fast as single-vector search
algorithms data google muvera retrieval rust search structure vector
Last synced: 25 Oct 2025
https://github.com/prajjwol09/power-bi-project
The Data Survey Breakdown is an interactive Power BI dashboard designed to present insights gathered from a survey of professionals and enthusiasts in the data industry.
dashboard data interactive powerbi survey
Last synced: 15 Mar 2026
https://github.com/byndyusoft/byndyusoft.data.relational
Relational abstractions for Byndyusoft.Data.Relational.
byndyusoft data dataaccess db relational-databases
Last synced: 25 Oct 2025
https://github.com/ayush-raj8/godata
Write data to file. Standardizes the format for easy parsing and read by other programs.
Last synced: 18 Jan 2026
https://github.com/brayflex/spy-sector-rotation-google-sheet
Creates a dynamic spreadsheet to visualize SPY and its 11 largest sector ETFs. See market trends and identify potential sector rotation opportunities.
data etf google-sheets index price rotation script sector spreadsheet spy stock-market
Last synced: 29 Jun 2026
https://github.com/greedchikara/dsajs
Data Structures and Algorithms written in Javascript
Last synced: 09 Apr 2026
https://github.com/solrikk/bluemoon
This project is a Go language tool designed to automatically download, process, and save product data from a remote server into a CSV file.
analyze converter data go golang xml-parser
Last synced: 31 Jul 2025
https://github.com/farrelfaricaf/exploratorydataanalyst---titanic
This project analyzes the Titanic dataset using exploratory data analysis (EDA) and visualization techniques to identify survival patterns. The goal is to understand how demographic factors like gender and age influenced survival rates during the 1912 disaster.
data data-analysis data-science data-visualization eda python titanic-dataset
Last synced: 31 Jul 2025
https://github.com/cunfuu/network-bubbles
A tool that makes it easier to manage organizations and keep notes about them, in order to organize events and easily look up their needs.
data data-visualization organizations organizations-volunteer
Last synced: 31 Jul 2025
https://github.com/revolutionarybukhari/datawarehouse_meshjoin_superstore
A data warehouse built for a superstore's streaming data using an extended MESHJOIN, by Syed Husnain Haider Bukhari.
data data-science data-warehousing meshjoin
Last synced: 23 May 2026
https://github.com/aaronspindler/selfdrivingcar
Learning deep learning and making a self driving car in the process
car data deep deep-learning driving keras learning machine machine-learning python self self-driving-car
Last synced: 09 Apr 2026
https://github.com/jigyasag18/ai-ml-salaries-and-ai-tools-usage-trends
This repository presents an in-depth Power BI analytics report on the AI job market trends and student AI tool usage from 2020 to 2025. It combines structured datasets (job postings, salaries, surveys) with custom DAX measures to uncover key patterns in salaries, remote work, industry demand, and student engagement. Five interactive dashboards are included.
analysis data data-analysis data-visualization dataanalysis dataanalytics dataset datavisualization power-bi powerbi powerbi-dashboards powerbi-desktop powerbi-report powerbi-visuals powerbidashboard visualization
Last synced: 16 Feb 2026
https://github.com/jigyasag18/global-terrorism-1970-2017-analysis-using-big-data
This repository explores over 180,000 terrorist incidents across 205 countries using Hadoop and Power BI. The project identifies global and regional patterns in terrorism, analyzes the impact on civilians, and highlights high-risk areas. Key insights include attack trends, weapon usage, top terror groups, and country-specific risks such as those in India.
big-data big-data-analytics data data-analysis data-visualization dataanalytics dataset hadoop hive hive-database hive-db hivedb power-bi powerbi powerbi-dashboards powerbi-desktop powerbi-report powerbi-report-validation powerbi-visuals powerbidashboard
Last synced: 19 Feb 2026
https://github.com/nushratjabenaurnima/cse_477_data_mining
A collection of labs, reports, Jupyter notebooks, and project outputs for the CSE 477 Data Mining course. This repository tracks my learning journey through data preprocessing, association rules, clustering, classification, and real-world data analysis with Python.
data data-analysis data-mining data-science google-colab-notebook jupyter-notebook machine-learning python python-3
Last synced: 09 Apr 2026
https://github.com/badranalyst/covid-deaths-and-vaccinations-sql-data-exploration
This project involves exploratory data analysis on COVID-19 deaths and vaccinations data using SQL. It aims to uncover trends, patterns, and insights related to vaccination rates and their impact on mortality. The analysis provides a clearer understanding of the pandemic's dynamics, facilitating data-driven decisions in public health.
covid-19 data data-exploration dataset sql
Last synced: 19 Feb 2026
https://github.com/entorb/analyze-ha-energy
Analyze Home Assistant Solar Production Data
data home-assistant pandas photovoltaic pv python
Last synced: 08 May 2026
https://github.com/alecxcode/table-parser
Python Table Parser (data extraction)
automation data extraction python robotic-process-automation
Last synced: 04 May 2026
https://github.com/servierhub/adsv
Analyze delimiter-separated values files
csv csv-converter csv-format csv-parser csv-parsing csv-reader csv-reading data data-analysis data-engineering data-mining
Last synced: 28 Sep 2025
https://github.com/ddeepanshu-997/support_vector_regression--svr-
In this repository I performed support vector regression on real-life data. I first applied some data preprocessing techniques to filter out flaws in the data, then built the model, i.e. an SVM regression, to produce a machine learning regression model.
data data-science regression-analysis regression-models svm-model svm-regression
Last synced: 03 Aug 2025
https://github.com/haimonmon/j3mify
Converts jejemon words into formal words or sentences.
data jejemon nlp normalization python regex tagalog tokenization
Last synced: 12 Oct 2025
https://github.com/e22m4u/ts-data-schema
Data validation and type casting for TypeScript
data schema typescript validation
Last synced: 05 Aug 2025
https://github.com/elissorokin/data-analyst-portfolio
This is a repository where I showcase my skills, share projects, and track my progress in data analysis and Data Science.
ab-testing data data-analysis datalense matplotlib numpy pandas plotly portfolio postgresql python scipy seaborn sql statistical-analysis
Last synced: 09 Apr 2026
https://github.com/sourceduty/information_data_quality
Assess information and data quality in various formats.
ai ai-data ai-info ai-information artificial-intelligence assessment chatgpt custom-gpt data data-quality data-tool gpt info info-tool information information-quality openai quality quality-control
Last synced: 08 Aug 2025
https://github.com/sourceduty/data_marketer
Analyze uploaded data and prepare a data marketing plan for selling data. Create data product plans.
ai ai-data ai-tool artificial-intelligence business chatgpt company custom-gpt customgpts data data-business data-market data-marketer data-marketing data-tool gpt gpt-store gpts gptstore openai
Last synced: 03 Sep 2025
https://github.com/sourceduty/data_metrics
Analyzing, sorting and visualizing data.
data data-analysis data-metrics data-sci data-science data-science-projects data-sorting data-visualization database dataset metrics sorting statistics visualization
Last synced: 08 Aug 2025
https://github.com/sourceduty/language_barriers
Language barriers between the world's 7,000 languages.
communication concept data idea info information language language-barrier language-barriers languages project research
Last synced: 11 Feb 2026
https://github.com/sourceduty/data_architect
Develop, model and simulate data architecture framework.
ai artificial-intelligence chatgpt custom-gpt custom-gpts data data-architect data-design data-strategy data-structures data-systems framework framework-development gpt gpts openai openai-chatgpt
Last synced: 08 Aug 2025
https://github.com/mchenryspagg/wrangle-and-analyze-data
This project, known as 'wrangle and analyze data', involves wrangling WeRateDogs Twitter archive data from 2015 to 2017.
api data dataanalysis datacollection datawrangling datetime json numpy os pandas pil python requests tweepy-api visualization
Last synced: 09 Apr 2026
https://github.com/chompfoods/sdk-java
Java SDK for the Chomp Food & Recipe Database API. Use our API to get high-quality data on recipes and 875,000+ branded/grocery foods plus raw ingredients.
api branded chomp data database food gradle grocery ingredients jar java java-sdk nutrition openapi raw recipe-api recipes sdk
Last synced: 09 Apr 2026
https://github.com/analyticslover/sales-python-dashboard
Japan Sales Dashboard 2023
dashboards data data-analysis jupyter-notebook python3 sales streamlit
Last synced: 09 Apr 2026
https://github.com/srindot/fwuav-average-flight-data-collection
This repository is designed for collecting average data for a flapping wing UAV. The script acg_coeff_data_collection.py runs the necessary data collection, and the resulting data is saved into a CSV file called AverageFlightData.csv.
Last synced: 10 Aug 2025
https://github.com/chocoscoding/fakeapi
A fake API with nice functionalities for testing
api data express fetch fetch-api frontend javascript js json json-api json-server nodejs testing typescript
Last synced: 09 Apr 2026
https://github.com/ashita-ai/ashita-ai.github.io
Ashita AI - The island of misfit data tools
Last synced: 19 Feb 2026
https://github.com/mcraiha/datagensharp
C# managed library for generating data
Last synced: 11 Aug 2025
https://github.com/keziatbnn/supervised-regression-salaryprediction
Make salary predictions based on years of experience using supervised regression.
data data-analysis-python data-prediction data-science python
Last synced: 11 Aug 2025
https://github.com/corneliustanui/personal_blogdown_website
This repo contains source files for my personal Blogdown-based website.
analyis analytics blog blogdown blogdown-sites data data-science hugo hugo-theme netlify personal-website rbind statistics web website
Last synced: 13 Feb 2026
https://github.com/ddofer/ddofer.github.io
Dan's Blog
blog cv data data-science machine-learning
Last synced: 12 Aug 2025
https://github.com/kadirlofca/unity-csvmaker
Quick and easy way to create and export .csv files from Unity.
Last synced: 09 Apr 2026
https://github.com/aaisha-nexus/sql_company_insights
A beginner-friendly SQL project for managing employee records, departments, and sales transactions. Includes table creation, optimized queries, stored procedures, and window functions to extract business insights.
business-analytics data data-analysis dataanalysis-projects dataanalytics database-schema mssql-database query relational-databases sql sql-query ssms
Last synced: 12 Aug 2025
https://github.com/panodata/tikray
A compact data transformation engine.
data data-transformation data-transformation-pipeline data-transformer jmes jmespath jq jqlang json json-pointer json-transform json-transformation json-translate json-translator transformation transon
Last synced: 04 Oct 2025
https://github.com/jleung51/foundations-dags
Data ETL pipeline to clean, process, and aggregate data from Canadian housing starts.
data data-engineering etl extract housing load pipeline transform
Last synced: 04 Oct 2025
https://github.com/mtwn105/phonepe-pulse-plus
An API on top of PhonePe Pulse Data APIs
cors data data-science express finance hacktoberfest heroku javascript nodejs phonepe pulse
Last synced: 09 Apr 2026
https://github.com/leoBitto/CloudForge
Data foundry
airflow data data-engineering django docker docker-compose grafana postgresql prometheus
Last synced: 14 Aug 2025
https://github.com/seqeralabs/ffq-api
A minimal wrapper to make ffq searches available via a REST API.
api data fastq fetch-fastq ffq genomics
Last synced: 15 Aug 2025
https://github.com/twilighty-abhi/locust-data-visualiser
Locust Data Visualiser
Last synced: 15 Aug 2025
https://github.com/farhad2415/Job_Scraper
Job-site-based job scraping with Python
automation bash-script data data-scraping data-structures python selenium selenium-python
Last synced: 15 Aug 2025
https://github.com/rijkvanzanten/ds-fa-1
The first final assignment for the data structures class
assignment data final map now parsons structures thenewschool
Last synced: 04 Oct 2025
https://github.com/sulujulianto/population-data-retrieval-and-analysis
I created a simple program that can be used to search for global population data or population data from various countries using Python.
Last synced: 09 Mar 2026
https://github.com/srindot/average_flightdata_collection_fwuaav
This repository is designed for collecting average data for a flapping wing UAV. The script acg_coeff_data_collection.py runs the necessary data collection, and the resulting data is saved into a CSV file called AverageFlightData.csv.
Last synced: 18 Aug 2025
https://github.com/saritaphd/predicting-performance-of-students---complete-ml-project-with-deployment-using-aws
Student performance analysis with deployment (End to end ML project)
aws data data-science deployment jupyter-notebook machine-learning python visualization
Last synced: 10 Apr 2026
https://github.com/progati00/marketing-mix-modeling-mmm-for-marketing-budget-optimization
A Marketing Mix Modeling (MMM) project using Python to analyze channel performance, calculate ROI, and simulate marketing budget changes for better business decisions. Includes a trained Linear Regression model, ROI analytics, and a Flask API for revenue prediction.
api budget-optimization data data-analysis data-science ecommerce eda flask jupyter-notebook linear-regression machine-learning marketing-analytics marketing-mix-modeling python roi-analysis vscode
Last synced: 14 Apr 2026
https://github.com/hakusaro/facts
A fact based knowledge system (FBKS) experiment.
Last synced: 03 Jan 2026
https://github.com/rationalprabal/book-management-app
A Node.js and Express.js application for managing books, featuring role-based authentication and authorization with JWT, file uploads for book cover pages, robust data validation and documentation using swagger. The project includes user roles such as Admin, Author, and Reader, each with specific permissions.
data expressjs jwt-authentication mongodb mongoose nodejs rbac-roles
Last synced: 10 Apr 2026
https://github.com/rugwiroparfait/alx_sql
This repo is where I save my queries and learning materials from the ALX Data Science program.
anaconda data data-analysis jupyter-notebook sql
Last synced: 19 Aug 2025
https://github.com/anderson-andre-p/datastructuresandalgorithms
Data Structures and Algorithms to Study
data data-structures data-structures-and-algorithms dsa dsa-algorithm dsa-learning-series structured-data
Last synced: 20 Aug 2025
https://github.com/giscience/measures-rest-oshdb-app
A frontend for providing measures for geospatial datasets, using the OSHDB
data dggs geospatial measure openstreetmap rest
Last synced: 20 Apr 2026
https://github.com/karolkrupa/javascript-orm-mapper
ORM mapping library. Especially for Rest API
api data data-mapper entity es6 javascript mapper model mongo mysql node nuxt orm relational rest typescript vue vuex
Last synced: 10 Apr 2026
https://github.com/ahmad-ali-rafique/wine-quality-dataset
Comprehensive analysis and modeling of the Wine Quality dataset, including exploratory data analysis (EDA), data preprocessing, model training, and performance evaluation using MSE and RMSE.
analytics data datacleaning decision-tree-regression exploratory-data-analysis gradient-boosting-regressor linear-regression machine-learning mean-square-error model
Last synced: 21 Aug 2025
https://github.com/paulrosset/cyclone
Network data consumption monitoring
data monitoring network networking
Last synced: 23 Aug 2025
https://github.com/anuragagarwal96/hospital-mortality-rate-sql-analysis
In this project, I have taken a hospital dataset from Kaggle, analysed it and predicted the mortality rate of patients who have been admitted in hospitals. I have utilised a combination of SQL, Tableau and Microsoft Excel for this project.
data data-visualization dataanalysis dataanalysisusingsql excel msexcel mssqlserver sql tableau tableau-public
Last synced: 09 Mar 2026
https://github.com/luminati-io/google-maps-dataset-samples
A sample dataset of over 1000 Google Maps businesses, extracted using the Bright Data API, ideal for competitor analysis, location-based marketing, and market strategies.
api data dataset google-maps maps web-scraping
Last synced: 03 Jan 2026
https://github.com/franckalbinet/maris-crawlers
Automated data harvesting of MARIS data sources
automation data marine-radioactivity
Last synced: 25 Aug 2025
https://github.com/0xnu/data-analyst-training
The repository contains training materials for data analysts.
data data-analysis data-analyst
Last synced: 25 Aug 2025
https://github.com/darshjasani/insurance-claim-analysis
This dataset contains insightful information related to insurance claims, giving us an in-depth look into the demographic patterns of those receiving them.
Last synced: 27 Aug 2025
https://github.com/schoolsquirrel/holiday-data
Automatically updated holiday data for SchoolSquirrel
data holidays schoolsquirrel scripts vacation
Last synced: 03 Oct 2025
https://github.com/roggersanguzu/weather-medical-expense-prediction-ml-models
This repo contains one model for determining rainfall patterns and another for predicting medical expenses.
data data-analysis data-science datasets joblib machine-learning machine-learning-algorithms scikitlearn-machine-learning
Last synced: 30 Aug 2025
https://github.com/ate47/playerdata
Get data about a player with a command
bukkit-plugin command data spigot-plugin
Last synced: 30 Aug 2025
https://github.com/olekscode/datageneration
Exploring the methods of data generation for different Machine Learning algorithms
data javascript machine-learning
Last synced: 05 Apr 2025
https://github.com/passly-nl/data
Source code of the data layer.
data passly ticketing typescript
Last synced: 27 May 2026
https://github.com/sungchun12/demotron
CLI to delight real people with live demos
Last synced: 26 Feb 2025
https://github.com/agdturner/ccg-data
A modularised Java library for processing data sets with classes for: data records; collections of data records; and identifiers.
Last synced: 12 Jan 2026
https://github.com/koppalexander/flightdelaychallenge
This project focuses on predicting flight delays using historical data from a Tunisian airline. We analyzed patterns in airport operations and flight schedules to build a machine learning model that can forecast potential delays.
data data-science machine-learning machine-learning-algorithms machinelearning prediction predictive-modeling
Last synced: 19 Jun 2026
https://github.com/badranalyst/data-cleaning-and-exploratory-data-analysis-project
This project uses SQL to clean and analyze a layoffs dataset. Data cleaning tasks include removing duplicates, standardizing values, and handling missing data. Exploratory analysis is performed to identify trends in layoffs across companies, industries, and time periods.
cleaning-data data database dataset mysql mysql-database sql
Last synced: 07 Apr 2025
https://github.com/lancewalk87/cls-cloud-sync-ruby-on-rails
Software | SQL Database with automated Cloud Sync for mitigating lost data across distributed servers. Managed by Ruby on Rails.
cloud-computing cloud-storage data database ruby ruby-application ruby-on-rails server sql
Last synced: 24 Jul 2025