data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum,
- Last updated: 2026-07-02 00:07:45 UTC
- JSON Representation
https://github.com/e-kotov/albofr
alboFr: Get French Data on Tiger Mosquito Colonisation
aedes-albopictus data france tiger-mosquito
Last synced: 11 Jun 2026
https://github.com/shubhamsoni98/classification-with-decision-tree
This project predicts iPhone purchases using demographic data (gender, age, salary). A Decision Tree Classifier was used, achieving 88.16% accuracy. Insights from the model can refine marketing strategies, optimize product offerings, and boost sales by targeting key customer segments.
algorithms anaconda classification data data-science descision-tree jupyter-notebook machine-learning prediction python
Last synced: 19 Jan 2026
https://github.com/pcpp94/elexon_pipeline_gb_demand
Guidelines and code snippets for extracting and processing Elexon gross demand data on Databricks. Provides half-hourly GB demand at sectoral (Domestic, Non-domestic), GSP-area granularity, settlement demand, and embedded generation. Supports non-commodity cost calculations for CfD, RO, and FiT.
data electricity elexon gb octopusenergy power powerdata pypsa uk
Last synced: 12 Jul 2025
https://github.com/canadaluke888/terminaltablebuilder
Build and edit tabular data all from the terminal.
cli data data-manipulation excel json ods rich spreadsheets sqlite3 tables
Last synced: 20 Apr 2026
https://github.com/eloyhere/semantic-java
Semantic-Java is a modern, maven Java stream processing framework with zero dependencies. It elegantly blends the fluency of Java Streams, the laziness of JavaScript generators, and intelligent index-based control inspired by database indexing β perfect for time-series, event streams, and high-performance data pipelines as a maven pendency.
data functional functional-programming java pipeline stream
Last synced: 07 Apr 2026
https://github.com/phtrempe/l2a
This is a small project which aims to show an example of applied machine learning in Python 3 with the Keras library and its TensorFlow backend to train a neural network model for it to learn to add two integers.
applied data data-science deep-learning keras machine-learning neural-network tensorboard tensorflow
Last synced: 05 May 2026
https://github.com/denisecase/cintel-03-data
Getting started with interactive data analytics in Python
analytics data interactive python shiny
Last synced: 11 Apr 2025
https://github.com/samharrison7/datamapper
Making mapping between datasets as simple as possible.
data data-mapper data-mapping data-science data-structures
Last synced: 17 Mar 2025
https://github.com/brunosalerno/osm_data
Ruby objects for dealing with OSM data, and generating XML files
Last synced: 21 Apr 2026
https://github.com/os-climate/rmi-utility-transition-hub-ingestion-pipeline
Data ingest for RMI's Utility Transition Hub data (as of March 7, 2022)
data emissions-co2 energy-data os-climate
Last synced: 12 Apr 2025
https://github.com/takamoso/umami
Cross browser compatibility data.
browser compat compatibility data dataset json
Last synced: 27 Mar 2025
https://github.com/shivamsharma32/ipl-2022-analysis
The IPL 2022 Analysis project is a data-driven exploration of the Indian Premier League (IPL) 2022 cricket tournament. The analysis focuses on utilizing Python programming and various libraries to analyze and visualize the performance of teams, players, and key metrics in the IPL 2022 season.
data dataana dataanalytics datavi matplotlib python
Last synced: 17 May 2026
https://github.com/joshuadeguzman/xcraper
Python based stocks exchange data scraper
data pandas python stock-market
Last synced: 18 May 2026
https://github.com/sumansuhag/wasserstoff-aiinterntask
Welcome to the AI Pipeline for Image Segmentation and Object Analysis project β a state-of-the-art solution designed to process, segment, identify, and analyze objects within images. This AI-powered pipeline is engineered to deliver precise insights by extracting, mapping, and summarizing data from each segmented object.
artificial-intelligence cdn data data-science modeling pipline
Last synced: 28 Mar 2025
https://github.com/sumansuhag/prediction_model
This repository features a collection of Jupyter notebooks designed to showcase the practical applications of machine learning, data preprocessing, feature engineering, and recommendation systems. These notebooks enable users to explore, analyze, and predict business events.
algotithms artificial-intelligence data logistic-regression machine-learning-algorithms science sckiit-learn
Last synced: 28 Mar 2025
https://github.com/parmsam/rweekly.data
R package containing data on Rweekly posts
Last synced: 21 May 2026
https://github.com/kylepw/multistack
Example of multiple stacks in one array.
algorithms array data data-structures python stack
Last synced: 17 Mar 2025
https://github.com/afeiship/data-selection
Data structure for radio/checkbox-group.
Last synced: 17 Jun 2025
https://github.com/echang1802/normandy
Normandy is a python framework for data pipelines, which main objective is standardizing your team code and provide a data treatment methodology flexible to your team needs.
analytics business-intelligence data dataengineering datascience etl pipeline
Last synced: 11 Mar 2026
https://github.com/rajlabmssm/echodata
echoverse module: Example data.
data echoverse fine-mapping genomics gwas qtl
Last synced: 17 Jan 2026
https://github.com/ditikrushna/enotes
π» Personal learning notes
coursera-data-science cousera data datascience machine machinelearning ml notes
Last synced: 07 Mar 2026
https://github.com/hadarsharon/grizzlys
User-friendly Python DataFrames π΅π‘ powered by Julia π΄π’π£
big-data data data-analysis data-engineering data-frame data-frames data-science dataframe dataframe-library dataframes dataframes-jl julia python
Last synced: 18 May 2026
https://github.com/jlee9503/excel-projects
Fitness tracker dashboard, displaying users workout type, calories burned, and steps taken with multiple filters (gender, age, and workout intensity). Implemented using MS Excel.
Last synced: 16 Jan 2026
https://github.com/nadahamdy217/Harvest-Gaurd-Plant-Disease-Detection-Web-Application
web application that help people grow healthy plants
classification-confidential cnn cnn-classification css data data-science detection html javascript keras machine-learning model plant-disease-detection supervised-learning tensorflow web-application
Last synced: 12 Apr 2025
https://github.com/xuender/kstats
Golang statistics library package that supports v1.18+.
algorithms analytics data go golang kstats machine-learning math rounding statistics
Last synced: 20 Jul 2025
https://github.com/thibautre/dataipsum
Configurable data generator (with crumbles inside)
algorithm data random-generation
Last synced: 21 Jul 2025
https://github.com/farovictor/mongodbloader
This project is intended to be used as a data loader to support ELT pipelines or any kind of process that requires a heavy data load into a MongoDb database.
Last synced: 15 May 2026
https://github.com/ahabdel/amazon-web-scraper
Amazon Web Scraper to scrape pricing adjustments and provide updates on a day to day basis
Last synced: 29 Oct 2025
https://github.com/ahmedkhaled404/data-cleaning-and-eda-layoffs-mysql
This project involves cleaning a dataset containing information about layoffs from companies around the world.
data data-analysis data-cleaning data-preprocessing datacleaning eda exploratory-data-analysis mysql sql
Last synced: 08 Jun 2026
https://github.com/dhi13man/rca_ace
RCA Ace is designed for organizations seeking to enhance their understanding and utilization of insights derived from Root Cause Analyses (RCAs).
analytics data enterprise open-source python python3 rca
Last synced: 10 Sep 2025
https://github.com/Axnjr/csv-parser-utils
Homework task for SWE position at Redhat.
csv data dataanalysis datatools pandas python
Last synced: 30 Oct 2025
https://github.com/anthonysanalysis/bellabeat-analysis
Bellabeat Tech Case Study Capstone Project
analysis capstone case-study data data-analysis data-visualization md r rmd rstudio
Last synced: 20 Apr 2026
https://github.com/yvandana/brain-tumor-detection-and-classification
Bachelor's Major Project- Presented at ICMISC 2022
2d-cnn brain-tumor-classification brain-tumor-detection cnn-model data data-augmentation keras-tensorflow sklearn-metrics
Last synced: 16 Jun 2025
https://github.com/stkisengese/numpy-data-fundamentals
A comprehensive collection of NumPy exercises covering array manipulation, slicing, broadcasting, random data generation, and real-world data analysis applications.
data data-analysis numpy pre-processing
Last synced: 16 May 2026
https://github.com/himanshub16/lekhpal
Monitor and catalog Twitter feed matching your desired keywords
analytics data data-catalog data-filtering mongodb twitter twitter-streaming-api
Last synced: 14 May 2026
https://github.com/reubano/pyconza-tutorial
Jupyter notebooks and data for "Data Mining and Processing for fun and profit" PyConZA16 tutorial
data functional-programming jupyter-notebook meza pycon python tutorial
Last synced: 17 May 2026
https://github.com/bagustris/dataits
Web for DataITS17: Summer School on Data Science
Last synced: 28 Jun 2025
https://github.com/chompfoods/sdk-scala
Scala SDK for the Chomp Food & Recipe Database API. Use our API to get high-quality data on recipes and 875,000+ branded/grocery foods plus raw ingredients.
api branded chomp data database food grocery ingredients nutrition raw recipe-api recipes scala sdk
Last synced: 17 May 2026
https://github.com/badawy403/egy.list
A Node.js package providing access to official Egyptian data including universities, governorates, cities, and more. This package makes it easy for developers to integrate Egypt-specific information into their applications.
city data egypt javascript nodejs npm package
Last synced: 08 Mar 2026
https://github.com/weecology/updating-data
Hugo website for instructions on how to make a regularly updating data pipeline
continuous-analysis continuous-integration data gh-actions living-data netlify travis-ci
Last synced: 17 Feb 2026
https://github.com/amethyst-php/manga
amethyst amethyst-package api data laravel manga
Last synced: 17 May 2026
https://github.com/axafrance/azureml-to-openshift-talk
Scale your dev IA: From dev AzureML to prod OpenShift in one click
ai axa azureml data learn ml openshift raise-the-bar talk
Last synced: 16 Feb 2026
https://github.com/kulgan/justobjects
It's all just objects
data json-schema justobjects objects parsing python python3 validation
Last synced: 10 Jul 2025
https://github.com/ashishsingh789/quantium_data-analysis-_virtual-internship
Completed a job simulation focused on Data Analytics and Commercial Insights for the data science team. Developed expertise in data preparation and customer analytics, utilizing transaction datasets to extract valuable insights and deliver data-driven commercial recommendations
data datawrangling matplotlib pandas pandas-dataframe presentation programming python python-library
Last synced: 07 Apr 2026
https://github.com/garcane/layoffs-exploratory-data-analysis
This project uses MySQL to perform data cleaning and exploratory data analysis (EDA) on a dataset detailing company layoffs. The primary goal is to process, clean, and explore the data to gain insights into trends and patterns related to layoffs across various sectors.
data dataanalysis eda mysql sql
Last synced: 29 Oct 2025
https://github.com/cmutel/jester
Import data from the olca-schema JSON-LD format into the HESTIA JSON-LD schema
agriculture data json-ld life-cycle-assessment ontology
Last synced: 26 Jul 2025
https://github.com/iota-pico/data
IOTA Pico Framework Data Structures and Helpers
data iota iota-pico-framework javascript typescript
Last synced: 18 May 2026
https://github.com/dolanmiu/mclaren-task
A front end assessment task for Mclaren
angular data observable observables rxjs
Last synced: 16 May 2026
https://github.com/jigyasag18/data-analysis-using-ms-excel
This project is on analyzing real-time data from Ambuvians Healthcare, a health products startup. It included data cleaning, such as removing duplicates and addressing missing values, followed by analyses to reveal insights into sales trends, customer demographics, and purchasing behaviors. Visualizations in MS-Excel including bar and pie charts.
analysis data data-visualization dataanalysis datacleaning datapreprocessing dataset msexcel visualization
Last synced: 07 Mar 2026
https://github.com/jigyasag18/amazon-power-bi-dashboard
The Amazon Power BI Dashboard Project repository provides an interactive analytics dashboard for visualizing and analyzing sales performance across various product categories within Amazon's ecosystem. Utilizing comprehensive sales data, it empowers stakeholders with actionable insights to enhance decision-making and improve business strategies.
data data-visualization dataanalysis dataanalytics dataset datasets datavisualization-project powerbi powerbi-report powerbi-visuals powerbidashboard
Last synced: 07 Mar 2026
https://github.com/yusuf4030/the-data-analyst-toolkit
π Explore essential data analysis tools organized by role and task, empowering users from students to professionals with quick access to valuable resources.
budget budget-management business-intelligence charts cookbook cureated-list data data-analysis-python data-visualization internet-of-everything internet-of-transport large-language-models nse open-source python selenium stock-market traffic-analysis
Last synced: 18 May 2026
https://github.com/cemoktra/data_series
time series handling
data lazy-evaluation time-series
Last synced: 29 Oct 2025
https://github.com/azaz9026/loan_approval_prediction
Welcome to the Loan Approval Prediction repository! This project aims to build a predictive model that can determine whether a loan application should be approved or denied based on various features. Purpose The goal of this repository is to develop a machine learning model that can accurately predict loan approval decisio
data data-analysis data-visualization eda machine-learning numpy pandas python statistics
Last synced: 06 Apr 2026
https://github.com/melvinjwallace/melvinjw.github.io
A portfolio of a host of projects completed using python and sql.
data data-analysis data-cleaning data-loading data-mining data-preparation data-processing data-science data-transformation data-visualization dataset matplotlib microsoft-sql-server pandas-python seaborn
Last synced: 02 Apr 2026
https://github.com/yourdataarchitect/abyat-scaring-
This Scrapy spider for automates the extraction of product data from the Abyat website using Hidden Backend API, supporting both Arabic and English content.
data database scraper scrapy-crawler
Last synced: 23 Apr 2026
https://github.com/cannt39t/data-mining-spider-vk
ΠΠ°ΡΠΊ ΠΊΠΎΡΠΎΡΡΠΉ ΡΠΎΠ±ΠΈΡΠ°ΡΡ Π²ΡΡ ΠΈΠ½ΡΠΎΡΠΌΠ°ΡΠΈΡ ΠΎ ΡΠ΅ΠΊΠ»Π°ΠΌΠ½ΡΡ ΠΏΠΎΡΡΠ°Ρ Π² Π³ΡΡΠΏΠΏΠ΅ VK
data data-mining python3 vk vkontakte
Last synced: 05 Apr 2025
https://github.com/shamaz332/ecomrace-data-analysis-in-datascience
data data-science matplotlib pandas
Last synced: 15 May 2026
https://github.com/jmcph4/rpdb
rpdb
automation data database dataset db real-estate rpdata sql
Last synced: 12 Apr 2025
https://github.com/lisakey/lisakey
I am passionate about Python π and SQL ποΈ for data analysis π, and I actively develop projects in these languages.
analysis analyst data dataanalysis dataanalyst java python sql
Last synced: 02 May 2026
https://github.com/michael-sebero/data-recovery-tools
This tool suite recovers sensitive data.
algiz-linux archive corruption data data-recovery linux recover recovery rust tool tool-suite tools
Last synced: 18 May 2026
https://github.com/UznetDev/Smoking-Prediction
This project focuses on analyzing the "Smoking" dataset and building a predictive model for smoking status based on various health metrics. The goal is to identify factors influencing smoking behavior and develop a reliable model for prediction.
ai classification data data-science kaggle-competition machine-learning ml roc-auc sklearn smoking
Last synced: 28 Mar 2025
https://github.com/nxank4/an-augment
A Python library for advanced and novel data augmentation, combining traditional techniques like cropping and blurring with state-of-the-art generative AI methods such as style transfer, image inpainting, and latent space interpolation. It boosts data diversity for robust machine learning applications.
computer-vision data data-augmentation data-augmentation-strategies data-augmentation-techniques generative-ai image image-processing synthetic-data
Last synced: 10 Mar 2026
https://github.com/rudxain/xorsum
Get XOR checksum with this command-line tool
binary checksum cli data digest file files hexadecimal rust-crate xor
Last synced: 08 Mar 2026
https://github.com/gui-sitton/games
Identify patterns that determine whether a game is successful or not. This will allow you to identify potential big winners and plan advertising campaigns.
data data-analysis data-analysis-python data-science data-visualization python
Last synced: 18 May 2026
https://github.com/amethyst-php/issue
amethyst amethyst-package api data issue laravel task ticket
Last synced: 18 May 2026
https://github.com/dimaa1608/azurecontent
AzureContent is a repository on GitHub containing documentation and resources related to Microsoft Azure services and features. It provides clear and concise information for users seeking guidance on Azure cloud computing solutions.
azure azurecontent cloud computing content data deployment integration management networking platform security service storage virtualization
Last synced: 10 Apr 2025
https://github.com/xjwllmsx/profitable-app-profiles
Analyzes Google Play & App Store data to recommend profitable profiles for free, ad-supported mobile apps
data data-analysis data-cleaning jupyter pandas python
Last synced: 18 May 2026
https://github.com/mvuorre/osfdatasette
Harvest, wrangle, and serve preprint data from OSF API with Datasette
data datasette open-science preprints
Last synced: 11 Apr 2025
https://github.com/stuffbymax/game-dependencies-db
data database game games-list json mit-license
Last synced: 15 May 2026
https://github.com/halyusa16/basic-sql-employee-analysis
This project focuses on analyzing employee data through querying, performing table joins to connect related information, aggregating salary statistics, and using subqueries to extract meaningful insights.
data data-analytics data-exploration database mysql self-project sql
Last synced: 16 May 2026
https://github.com/rid17pawar/friendscircle
Friends Circle is a console based application developed in cpp using Graph Data Structure.
cpp data graph graph-algorithms oop
Last synced: 08 Jun 2026
https://github.com/jankapunkt/meteor-reactive-data-structures
Collection of verious reactive data sructures for MeteorJS
data data-structures graph linked-list list meteor meteorjs queue reactive reactivity stack tree
Last synced: 17 May 2026
https://github.com/raghavendranhp/attrition-alchemy
This project uses machine learning to predict and analyze employee attrition in Company.By developing three predictive models,it identifies key factors influencing turnover,providing actionable insights to mitigate attrition challenges.The analysis focuses on enhancing job satisfaction,work-life balance and career growth opportunities.
data datawrangling decision-trees eda gradient-boosting logistic-regression macine-learning pandas preprocessing random-forest-classifier skicit-learn svm
Last synced: 18 May 2026
https://github.com/erkylima/algorithms
Python project to refresh knowledge on algorithms and data structures. Interactive examples of Bubble, Merge, Quick Sort, along with Lists, Stacks, Queues, and Trees. Challenges included. Recycle your expertise! π #Python #Algorithms #DataStructures
algorithms algorithms-and-data-structures data data-structures
Last synced: 19 Jan 2026
https://github.com/woctezuma/humble-choice-leak
Retrieve leaks for Humble Choice.
data datamining humble-bundle humble-bundle-games humble-bundle-leak humble-choice humble-choice-leak humblebundle humblebundle-leak leak leaks steam steam-games
Last synced: 27 Mar 2025
https://github.com/nouraalgohary/data-scientist-with-python
This repo comprises of my solutions for the tasks assigned in the course.
data data-science data-visualization datacamp datacamp-course datacamp-data-science datacamp-exercises datacamp-solutions-python datascience python
Last synced: 15 Jun 2025
https://github.com/indhra/cats-ijcnn-data-2004
CATS IJCNN Data 2004 Competition of Artificial Time Series
2004 artificial cats data ijcnn time-series
Last synced: 22 Mar 2025
https://github.com/manifoldfinance/honte
reference data and metrics for sushiswap proposal
Last synced: 18 May 2026
https://github.com/muneeb1030/webscrapper_politifact
This initiative seeks to extract and analyze fact-checking data from Politifact.com, providing valuable insights into political statements, rulings, and the evolving information landscape.
data data-collection dataanalysis python3 scrapy scrapy-spider webscraping
Last synced: 09 Sep 2025
https://github.com/zeh237/superstore-data-analytics
This is a Flask based data analytics project based on the superstore dataset using flask, pandas, sql and python
analytics data data-analysis data-science data-visualization flask python superstore
Last synced: 04 May 2025
https://github.com/meltymooncakes/blockdata
Minecraft Block data
api data json minecraft minecraft-data
Last synced: 13 Apr 2025
https://github.com/mapi-developer/dapo
Simple, zero-dependency tabular data manipulation and analysis for Python.
Last synced: 06 Mar 2026
https://github.com/amethyst-php/delivery-point
amethyst amethyst-package api data delivery-point laravel
Last synced: 18 May 2026
https://github.com/pedrozamecki/datatube
Site Open Source para anΓ‘lise de dados de canais do YouTube.
data estatistica statistical-analysis statistics youtube
Last synced: 18 May 2026
https://github.com/bala-1409/sales-forecasting-datascience-project
Develop a data science project using historical sales data to build a regression model that accurately predicts future sales. Preprocess the dataset, conduct exploratory analysis, select relevant features, and employ regression algorithms for model development. Evaluate model performance, optimize hyperparameters, and provide actionable insights.
data data-analysis data-science data-visualization datacleaning exploratory-data-analysis machine-learning-algorithms modelfitting prediction predictive-analytics predictive-modeling python3 regression-models salesforecast supervised-learning
Last synced: 26 Apr 2026
https://github.com/inekipelov/swift-codable-advance
A library of extensions for Swift Codable protocols, simplifying the process of encoding and decoding objects.
codable data dictionary json swift
Last synced: 25 Jan 2026
https://github.com/fordinand45/bdp_a_kelompok_3
Project Big Data Python yang diadakan oleh Digitalent Kominfo. Berikut adalah yang ikut serta pada project, yaitu : Dhian Prameswari, Fordinand Pasaribu, dan Muhdad Alfaris Bachmid
data data-analytics data-science linear-regression python3
Last synced: 12 Apr 2026
https://github.com/piazzai/chess-variants
Analysis of Lichess variant games
analysis chess chess-variant chess-variants data data-mining data-science data-visualization lichess lichess-database logistic-regression logit-model pgn r r-code r-scripts regression regression-analysis shell shell-scripting
Last synced: 15 May 2026
https://github.com/juanpablo70/pgad-assignment02
Alzheimer data set analysis
data data-science dataframe dataset jupyter-notebook r
Last synced: 18 May 2026