data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum
- Last updated: 2026-06-27 00:07:33 UTC
https://github.com/shrutakeerti/eye-gaze-detection
This repo contains everything I did during the IIT Jodhpur Summer Internship (May 15 - July 15).
ai aiml data eda eeg eeg-signals eye jodhpur mlflow
Last synced: 17 Mar 2025
https://github.com/mksingh431/sql-complete-notes
SQL, or Structured Query Language, is a robust and specialized programming language designed for efficient management and manipulation of relational databases. With SQL, you can seamlessly interact with databases such as MySQL, PostgreSQL, Microsoft SQL Server, and Oracle.
Last synced: 21 Apr 2026
https://github.com/meta-llama/synthetic-data-kit
Tool for generating high-quality synthetic datasets
data generation llm python synthetic
Last synced: 08 May 2025
https://github.com/vaxdata22/foresight-pharmaceutical
This is a Data Analysis case study done on the Foresight Pharmaceutical Company dataset.
actionable-insights business-analytics business-intelligence data data-analytics data-cleaning data-mining data-visualization data-wrangling exploratory-data-analysis spreadsheets sql sql-server sql-server-management-studio statistical-analysis t-sql transact-sql
Last synced: 05 Mar 2025
https://github.com/danielrosehill/ghg-ebitda-correlations
Streamlit data visualisation examining correlation between emissions & profitability
data sustainability sustainability-data
Last synced: 14 Mar 2025
https://github.com/athari22/statistics-from-stock-data
Statistics from Stock Data
cvs data data-science dataanalysis datacleaning dataframe jupyter pandas pandas-python python statistics stock table
Last synced: 16 Feb 2026
https://github.com/pyrustic/litedao
Intuitive interaction with SQLite database
auto-init dao data database database-access library lightweight pyrustic python sql sqlite
Last synced: 09 May 2026
https://github.com/dhi13man/rca_ace
RCA Ace is designed for organizations seeking to enhance their understanding and utilization of insights derived from Root Cause Analyses (RCAs).
analytics data enterprise open-source python python3 rca
Last synced: 10 Sep 2025
https://github.com/ethenkem/pygraphsurvey
A Python-based web app that provides graphical analysis of data collected from surveys. The system has its own built-in form filling: an admin can set questions and send a link for the forms to be filled in, and the system then provides analysis of the collected data. Form features include selection options, range values, file inputs, etc.
Last synced: 12 Jan 2026
https://github.com/colour-science/colour-streamlit-tm-30-18
Generates the "ANSI/IES TM-30-18 Colour Rendition Report" using Colour and Streamlit
color color-science color-space color-spaces colorspace colorspaces colour colour-science colour-space colour-spaces colourspace colourspaces data dataset datasets streamlit
Last synced: 23 Feb 2025
https://github.com/stdlib-js/dstructs-circular-buffer
Circular buffer.
buffer circular collection cyclic data data-structure data-structures fifo first-in-first-out javascript node node-js nodejs queue ring stdlib structure
Last synced: 20 May 2026
https://github.com/ournet/embed-providers-data
Embed providers data
data embed embed-providers json providers
Last synced: 03 May 2026
https://github.com/gui-sitton/carsells
In this project I am an analyst on the Crankshaft List. Hundreds of free vehicle advertisements are published on the site every day. I need to study the data collected over the last few years and determine which factors influence the price of a vehicle.
data data-analysis data-analysis-python data-science data-visualization python
Last synced: 20 May 2026
https://github.com/amethyst-php/contact
amethyst amethyst-package api contact data laravel
Last synced: 20 May 2026
https://github.com/avijeetpandey/quizzez
Implementation of the quizzez application using Kotlin
Last synced: 20 May 2026
https://github.com/amethyst-php/shipment-zone
amethyst amethyst-package api data laravel shipment-zone
Last synced: 20 May 2026
https://github.com/pooja-manjunatha/nyc_parking_violations_dbt
This project uses dbt to transform NYC parking violations data through a layered architecture: Bronze (raw ingested data), Silver (cleaned and enriched data), and Gold (aggregated tables for analytics). Using DuckDB as the warehouse backend, it ensures data quality with tests and documentation. The project enables reliable analysis of parking violations
data data-analysis data-engineering dbt duckdb python sql
Last synced: 14 May 2026
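The bronze/silver/gold layering described in this entry can be sketched in a few lines. This is only an illustration, not the repository's dbt models: it uses Python's built-in `sqlite3` as a stand-in for DuckDB, and the table names, column names, and sample rows are all hypothetical.

```python
import sqlite3

# Hypothetical raw rows: (summons_id, violation_code, fine_amount, borough)
RAW_ROWS = [
    ("1001", "21", "65", "manhattan "),
    ("1002", "21", "65", "Brooklyn"),
    ("1003", "14", "115", "MANHATTAN"),
    ("1004", "14", None, "Queens"),  # missing fine: filtered out in silver
]


def build_layers(conn, rows):
    """Create the bronze, silver, and gold tables in order."""
    cur = conn.cursor()
    # Bronze: raw data stored exactly as ingested (everything is text).
    cur.execute(
        "CREATE TABLE bronze_violations "
        "(summons_id TEXT, violation_code TEXT, fine_amount TEXT, borough TEXT)"
    )
    cur.executemany("INSERT INTO bronze_violations VALUES (?, ?, ?, ?)", rows)
    # Silver: typed columns, normalized borough names, incomplete rows removed.
    cur.execute(
        """
        CREATE TABLE silver_violations AS
        SELECT CAST(summons_id AS INTEGER) AS summons_id,
               violation_code,
               CAST(fine_amount AS INTEGER) AS fine_amount,
               UPPER(TRIM(borough)) AS borough
        FROM bronze_violations
        WHERE fine_amount IS NOT NULL
        """
    )
    # Gold: aggregated table ready for analytics.
    cur.execute(
        """
        CREATE TABLE gold_fines_by_borough AS
        SELECT borough,
               COUNT(*) AS violations,
               SUM(fine_amount) AS total_fines
        FROM silver_violations
        GROUP BY borough
        """
    )
    conn.commit()


def gold_summary(conn):
    """Return the gold table as a list of (borough, violations, total_fines)."""
    return conn.execute(
        "SELECT borough, violations, total_fines "
        "FROM gold_fines_by_borough ORDER BY borough"
    ).fetchall()


conn = sqlite3.connect(":memory:")
build_layers(conn, RAW_ROWS)
print(gold_summary(conn))
```

Each layer reads only from the one before it, so a data-quality check (for example, asserting that no silver row has a null fine) can be placed between layers.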
https://github.com/ahmad-ali-rafique/random-forest-classifier-modeling
Detailed exploration of random forest classifiers, including data cleaning, model building, and performance evaluation on various datasets.
classification classification-models data dataanalytics datamodel dataset model-checking models random-forest random-forest-classifier
Last synced: 01 Jun 2026
https://github.com/basis-company/data-player.js
In-memory data layer for fast access to plain normalized data
collection data model traversal
Last synced: 25 Feb 2025
https://github.com/szc126/metadata-nnd-vocalo-twitter
Collects tweets announcing newly posted Vocaloid videos ("new VOCALOID/UTAU videos" tweet collection)
data nico-nico-douga niconico vocaloid
Last synced: 20 May 2026
https://github.com/estherslabbert/final-capstone-unsupervised-ml
Exploration of USArrests data using unsupervised machine learning
arrests correction data data-analysis data-clustering data-visualization jupyter-notebook machine-learning pca-analysis standardised-data usa
Last synced: 26 Jun 2025
https://github.com/raruto/cockpit-sample-data
Sample data installer addon for Cockpit CMS
Last synced: 17 Mar 2025
https://github.com/lukaszkn/data-software-engineering-interview-questions
Data and Software engineering interview questions
data engineering interview-questions python
Last synced: 20 Jul 2025
https://github.com/disruptek/bloom
Bloom filters
bloom data filter hash membership nim probability set structure
Last synced: 04 Apr 2025
https://github.com/stdlib-js/array-base-any-has-property
Test whether at least one element in a provided array has a specified property, either own or inherited.
any array assert data generic has javascript node node-js nodejs prop property stdlib structure test types validate
Last synced: 20 May 2026
https://github.com/tkxwaweru/python_data_manipulation
Manipulating the MASSIVE dataset using Python
data dataanalysis excel python
Last synced: 11 Jan 2026
https://github.com/pcpp94/elexon_pipeline_gb_demand
Guidelines and code snippets for extracting and processing Elexon gross demand data on Databricks. Provides half-hourly GB demand at sectoral (Domestic, Non-domestic), GSP-area granularity, settlement demand, and embedded generation. Supports non-commodity cost calculations for CfD, RO, and FiT.
data electricity elexon gb octopusenergy power powerdata pypsa uk
Last synced: 12 Jul 2025
https://github.com/ahmad-ali-rafique/logistic-regression-modeling
An in-depth exploration of logistic regression models, including data cleaning, model building, and performance evaluation on various datasets.
accuracy confusion-matrix data dataanalytics logistic-regression logistic-regression-classifier machine-learning-algorithms mlmodels model modelling regression-models
Last synced: 11 Sep 2025
https://github.com/phtrempe/l2a
A small project showing an example of applied machine learning in Python 3: it uses the Keras library with its TensorFlow backend to train a neural network model to add two integers.
applied data data-science deep-learning keras machine-learning neural-network tensorboard tensorflow
Last synced: 05 May 2026
https://github.com/ahmad-ali-rafique/random-forest-regressor-modeling
Detailed exploration of random forest regressors, including data cleaning, model building, and performance evaluation on various datasets.
data dataanalytics datacleaning evaluation-metrics modeling random-forest random-forest-regression regression regression-analysis
Last synced: 05 Mar 2025
https://github.com/maximkrouk/storage
Lightweight framework for storing data (beta)
cache data keychain memmory storage swift swift5-1 userdefaults
Last synced: 30 Oct 2025
https://github.com/wilcotomassen/lorem-datum-core
Java based data generator for data simulation
data dataset generator java lorem-ipsum simulated-data
Last synced: 11 Jan 2026
https://github.com/ahmad-ali-rafique/electricity-consumption-analysis-household-dataset
This repository contains analysis and predictive modeling of household electricity consumption using Python. It includes data cleaning, exploratory data analysis (EDA), time series forecasting (ARIMA, SARIMA, LSTM), and model evaluation to optimize energy usage.
arima-forecasting artificial-intelligence artificial-neural-networks data data-science dataanalytics datacleaning evaluation-metrics exploratory-data-analysis long-short-term-memory lstmmodel modeling time-series timeseries-forecasting
Last synced: 23 Jun 2025
https://github.com/kunalkumar2001/coffee_sales_project_using_excel_power-bi_and_sql
Coffee Shop Sales Dashboard built using Power BI for visualization and SQL for data extraction and transformation. The project dives deep into sales performance, providing actionable insights for data-driven decisions.
analytics data dataanalytics mssql powerbi sql
Last synced: 26 Jun 2025
https://github.com/amliyanage/data-structures
arrays binary-tree data data-structures graph hashtable linked-list stack
Last synced: 06 Apr 2025
https://github.com/echang1802/normandy
Normandy is a Python framework for data pipelines whose main objective is to standardize your team's code and provide a data treatment methodology flexible to your team's needs.
analytics business-intelligence data dataengineering datascience etl pipeline
Last synced: 11 Mar 2026
https://github.com/saksham-jain177/data-analysis
A collection of data analysis and machine learning projects across various datasets. Explore predictive modeling, data visualization, and insights from real-world data. Projects include sales predictions, disease detection, customer segmentation, and more.
api data data-analysis data-cleaning data-science data-visualization datamodeling dataset datasets exploratory-data-analysis python python3 web-scraping youtube-api
Last synced: 01 May 2026
https://github.com/badr-moufad/dashboard-agriedge-data
Prepare data for dashboard. This is part of my research internship.
acquisition dashboard data data-morocco data-science data-visualisation weather weather-dashboard weather-data
Last synced: 04 Apr 2025
https://github.com/ahmedkhaled404/data-cleaning-and-eda-layoffs-mysql
This project involves cleaning a dataset containing information about layoffs from companies around the world.
data data-analysis data-cleaning data-preprocessing datacleaning eda exploratory-data-analysis mysql sql
Last synced: 08 Jun 2026
https://github.com/anthonysanalysis/bellabeat-analysis
Bellabeat Tech Case Study Capstone Project
analysis capstone case-study data data-analysis data-visualization md r rmd rstudio
Last synced: 20 Apr 2026
https://github.com/preranarao03/madhav_e-commerce_dashboard
This repository features the Madhav_E-Commerce_Dashboard built with Power BI. It provides interactive visualizations for analyzing e-commerce sales performance, product categories, customer segments, and geographic data, aiding in data-driven business decisions.
Last synced: 30 Jan 2026
https://github.com/jszafran/personal-aws-data-lake
Personal cloud-based (AWS) data lake for experimenting with cloud services.
aws cloud data data-engineering dataengineering datalake etl terraform
Last synced: 20 May 2026
https://github.com/amethyst-php/geolocation
amethyst amethyst-package api data geolocation laravel
Last synced: 20 May 2026
https://github.com/valyaevgeorgiy/r_basic
Working with the basics of the R environment, and thereby learning a new programming language directly related to data analysis and to building graphs and charts.
coding data data-analysis r rstudio
Last synced: 12 Dec 2025
https://github.com/redinfinitypro/scientificsharp
Rating: (5/10) The code is a Windows Forms application for a basic scientific calculator, allowing users to perform mathematical operations such as addition, subtraction, multiplication, division, trigonometric functions, and logarithms.
componentmodel cryptography data drawing forms generic linq system tasks text
Last synced: 06 Apr 2025
https://github.com/yvandana/brain-tumor-detection-and-classification
Bachelor's Major Project- Presented at ICMISC 2022
2d-cnn brain-tumor-classification brain-tumor-detection cnn-model data data-augmentation keras-tensorflow sklearn-metrics
Last synced: 16 Jun 2025
https://github.com/himanshub16/lekhpal
Monitor and catalog Twitter feed matching your desired keywords
analytics data data-catalog data-filtering mongodb twitter twitter-streaming-api
Last synced: 14 May 2026
https://github.com/wellingtonmwadali/alx-low_level_programming
ALX sprint one C programming
c data datastructures linked-list loops pointers-and-arrays string structures
Last synced: 04 Apr 2025
https://github.com/shubhamsoni98/survey-data-analysis
Survey Data Analysis
analysis dashboards data data-mining data-visualization dataanalysis datacleaning datascience datasets insights pivot-tables pivotanalysis
Last synced: 07 Mar 2026
https://github.com/pulipulichen/pts-local-news-dataset
A dataset containing local news from Public Television Service.
Last synced: 27 Mar 2026
https://github.com/anzerr/storage.ts
Util to store data used in a service
data nodejs storage typescript util
Last synced: 20 May 2026
https://github.com/deva-246/excel-power-query-data-cleaning-dashboard
dashboard data datacleaning excel pivottable powerquery slicer
Last synced: 22 Mar 2025
https://github.com/samaalharbi2/virtual-work-experience---data-analysis-at-stc
Virtual Work Experience in Data Analysis at STC
analysis data data-visualization misk stc
Last synced: 20 Jun 2025
https://github.com/axafrance/azureml-to-openshift-talk
Scale your AI development: from dev on AzureML to prod on OpenShift in one click
ai axa azureml data learn ml openshift raise-the-bar talk
Last synced: 16 Feb 2026
https://github.com/bala-1409/sales-forecasting-datascience-project
Develop a data science project using historical sales data to build a regression model that accurately predicts future sales. Preprocess the dataset, conduct exploratory analysis, select relevant features, and employ regression algorithms for model development. Evaluate model performance, optimize hyperparameters, and provide actionable insights.
data data-analysis data-science data-visualization datacleaning exploratory-data-analysis machine-learning-algorithms modelfitting prediction predictive-analytics predictive-modeling python3 regression-models salesforecast supervised-learning
Last synced: 26 Apr 2026
https://github.com/bala-1409/loan-classification-data-science-projects
This project uses machine learning algorithms to predict the classification of loan status. The dataset is loaded and transformed using SQL to obtain a proper dataset with valid information.
data data-analysis datacleaning datascience datavisualization exploratory-data-analysis loan machine-learning machine-learning-algorithms modelfitting sql supervised-learning visualization
Last synced: 22 Mar 2025
https://github.com/itsmeyogesh22/solved-8-weeks-sql-challenge-correct-solutions
Part of the Serious SQL virtual apprenticeship program, this repository contains solutions for all eight case studies crafted by Danny Ma. For more information, visit: https://8weeksqlchallenge.com/
8weeksqlchallenge data dataanalytics datawithdanny postgresql sql sqlserver-2022 t-sql
Last synced: 07 Apr 2025
https://github.com/amethyst-php/sku
amethyst amethyst-package api data laravel sku
Last synced: 17 May 2026
https://github.com/harrisonwelch/pythondatascience
Repo of code from the LinkedIn lesson "Python: Data Analysis"
data data-science matplotlib notes numpy python tutorial
Last synced: 12 Apr 2026
https://github.com/charlieroth/exoexplo
Exploring NASA Exoplanet Archive Data
Last synced: 03 Apr 2025
https://github.com/advisors-excel-llc/angular-datafree
angularjs data data-visualization datafree-directive
Last synced: 30 Sep 2025
https://github.com/azaz9026/loan_approval_prediction
Welcome to the Loan Approval Prediction repository! This project aims to build a predictive model that can determine whether a loan application should be approved or denied based on various features. Purpose: The goal of this repository is to develop a machine learning model that can accurately predict loan approval decisio
data data-analysis data-visualization eda machine-learning numpy pandas python statistics
Last synced: 06 Apr 2026
https://github.com/melvinjwallace/melvinjw.github.io
A portfolio of projects completed using Python and SQL.
data data-analysis data-cleaning data-loading data-mining data-preparation data-processing data-science data-transformation data-visualization dataset matplotlib microsoft-sql-server pandas-python seaborn
Last synced: 02 Apr 2026
https://github.com/mbiushelix/soilresp
Geoscience 1 (Geofag 1) fieldwork from Vg2 (second year of Norwegian upper secondary school)
data data-visualization geology global-warming norwegian-language soil-quality-testing soil-respiration
Last synced: 23 Jul 2025
https://github.com/kaizadp/bbwm_moisture
HOBO data for soil moisture - Bear Brook Watershed in Maine
Last synced: 17 May 2026
https://github.com/jcloh98/rental-property-finder
A web scraper that helps users find rental properties by automatically gathering and organizing listings from various websites to discover available homes and apartments.
data headless-browser node scraper scraping web
Last synced: 17 May 2026
https://github.com/arthurcfranklin/acervo-musical
This project consists of creating a relational database to help a DJ organize and catalog their music collection. The goal is to provide an efficient system for storing and managing information about singers, bands, songs, and their remixed versions.
data database mysql mysql-database sql
Last synced: 22 Mar 2025
https://github.com/kinshukjainn/dclue-v1
Dsainone is a highly optimized Data Structures and Algorithms (DSA) library designed to provide efficient implementations of graph algorithms, trees, hashing, and linked lists while maintaining exceptional memory efficiency. The library is designed to be as fast and optimized as possible
Last synced: 20 May 2026
https://github.com/push-protocol/push-google-bigquery
The Power of Web3 Big Data: A Guide to Using Google BigQuery and Push Protocol for Data Communication and Analysis
bigquery data push push-notifications web3
Last synced: 26 Mar 2025
https://github.com/denisecase/cintel-04-reactive
Interactive analytics, reactive app built with Shiny for Python
analytics bokeh data flights interactive mtcars penguins python relationships shiny
Last synced: 20 Jun 2025
https://github.com/shamaz332/ecomrace-data-analysis-in-datascience
data data-science matplotlib pandas
Last synced: 15 May 2026
https://github.com/piyushkumar2025/analytical-sql-project-exploring-trends-segmentation-kpis
A complete SQL analytics project using a simulated data warehouse. It analyzes sales, customer, and product data with CTEs, joins, window functions, subqueries, and views to deliver insights on trends, segmentation, and KPIs, showing how SQL enables data-driven decisions without BI tools.
advanced-sql analytics business-intelligence data data-science-projects datascience joins kpi mysql query sql window-functions-in-sql
Last synced: 02 Jul 2025
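As a rough illustration of the techniques this entry lists, the sketch below combines a CTE with a window function to rank customers by revenue. It is not taken from the repository: it runs against an in-memory SQLite database (the project itself is tagged `mysql`), and the `sales` table, its columns, and the sample rows are hypothetical. Window functions require SQLite 3.25 or newer.

```python
import sqlite3

# Hypothetical sales rows: (customer, month, amount)
SALES = [
    ("alice", "2024-01", 100),
    ("alice", "2024-02", 150),
    ("bob", "2024-01", 80),
    ("bob", "2024-02", 40),
    ("cara", "2024-02", 300),
]

QUERY = """
WITH customer_totals AS (            -- CTE: one row per customer
    SELECT customer, SUM(amount) AS total
    FROM sales
    GROUP BY customer
)
SELECT customer,
       total,
       RANK() OVER (ORDER BY total DESC) AS revenue_rank  -- window function
FROM customer_totals
ORDER BY revenue_rank
"""


def rank_customers(rows):
    """Load rows into a temporary table and return (customer, total, rank)."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE sales (customer TEXT, month TEXT, amount INTEGER)")
    conn.executemany("INSERT INTO sales VALUES (?, ?, ?)", rows)
    result = conn.execute(QUERY).fetchall()
    conn.close()
    return result


print(rank_customers(SALES))
```

The CTE keeps the aggregation separate from the ranking, so the same `customer_totals` step could feed other KPIs without repeating the `GROUP BY`.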
https://github.com/jmcph4/rpdb
rpdb
automation data database dataset db real-estate rpdata sql
Last synced: 12 Apr 2025
https://github.com/xmen3em/kaggle-competitions
This collection contains various projects and notebooks developed to tackle a range of Kaggle competitions, showcasing different machine learning techniques, data preprocessing methods, and model optimizations.
data data-science data-visualization deep-learning deployment ensemble-learning machine-learning-algorithms python streamlit
Last synced: 09 Apr 2026
https://github.com/shubhamsoni98/classification-with-random-forest---2
Fraud detection is a critical task for financial institutions and businesses. This document outlines the end-to-end process of predicting fraudulent activities using a Random Forest model. The process includes data preparation, exploration, model training, and evaluation.
algorithms anaconda data data-science dataflow feature-engineering jupyter-notebook machine-learning model modeltraining prediction python random-forest sql visualization
Last synced: 20 Jan 2026
https://github.com/rajesh9943/web-scraping-analysis-of-top-us-company-revenue-growth-in-2023
Explore the landscape of US business growth in 2023 with our dynamic project, 'Web Scraping for US 2023 Revenue Growth.' Utilizing advanced web scraping techniques, we unveil insights into the top companies driving economic expansion.
cleaning-data data data-analysis data-visualization manipulation numpy pandas pre-fill
Last synced: 16 Aug 2025
https://github.com/moons-14/datapot
Incorporate and serve all information.
ai aiogram api data infomation news newspaper rss video
Last synced: 04 Jan 2026
https://github.com/lisakey/lisakey
I am passionate about Python 🐍 and SQL 🗃️ for data analysis 📊, and I actively develop projects in these languages.
analysis analyst data dataanalysis dataanalyst java python sql
Last synced: 02 May 2026
https://github.com/khansasafira19/sk-cool-storytelling
Source Code for Data Storytelling with HTML5
data html5 javascript storytelling
Last synced: 13 May 2026
https://github.com/nxank4/an-augment
A Python library for advanced and novel data augmentation, combining traditional techniques like cropping and blurring with state-of-the-art generative AI methods such as style transfer, image inpainting, and latent space interpolation. It boosts data diversity for robust machine learning applications.
computer-vision data data-augmentation data-augmentation-strategies data-augmentation-techniques generative-ai image image-processing synthetic-data
Last synced: 10 Mar 2026
https://github.com/bho0920/crime-data-analysis-eu
Crime Data Analysis for Self-Defense Tool Market Entry in the EU.
data data-analysis sql sqlite tableau
Last synced: 21 Jun 2025
https://github.com/stuffbymax/game-dependencies-db
data database game games-list json mit-license
Last synced: 15 May 2026
https://github.com/octoenergy/tentaclio-gs
A Python project containing all the dependencies for gs tentaclio schema.
Last synced: 24 Jun 2025
https://github.com/octoenergy/tentaclio-postgres
A Python project containing all the dependencies for postgresql tentaclio schema.
Last synced: 24 Jun 2025
https://github.com/octoenergy/tentaclio-athena
A Python project containing all the dependencies for awsathena+rest tentaclio schema.
Last synced: 24 Jun 2025
https://github.com/newrelic-experimental/newrelic-java-apache-sling
Provides Java instrumentation for Apache Sling framework
apache-sling data instrumentation java nrlabs nrlabs-data nrlabs-java-verify observability-data sling
Last synced: 30 May 2026
https://github.com/octoenergy/tentaclio-s3
A Python project containing all the dependencies for s3 tentaclio schema.
Last synced: 24 Jun 2025
https://github.com/nouraalgohary/data-scientist-with-python
This repo comprises my solutions for the tasks assigned in the course.
data data-science data-visualization datacamp datacamp-course datacamp-data-science datacamp-exercises datacamp-solutions-python datascience python
Last synced: 15 Jun 2025
https://github.com/octoenergy/tentaclio-databricks
Module that adds Databricks support to tentaclio
Last synced: 24 Jun 2025
https://github.com/zeh237/superstore-data-analytics
A Flask-based data analytics project on the Superstore dataset, using Flask, pandas, SQL, and Python.
analytics data data-analysis data-science data-visualization flask python superstore
Last synced: 04 May 2025