data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum
- Last updated: 2026-07-03 00:07:49 UTC
https://github.com/apparaomulpuri/readline
Explains the usage of the readLine function in Swift.
data fromkeyboard keyboard reading readline swift
Last synced: 29 Mar 2025
https://github.com/ahabdel/amazon-web-scraper
Amazon web scraper that tracks pricing adjustments and provides day-to-day updates.
Last synced: 04 Jul 2026
https://github.com/nathanieliskandar26/data-analysis-project
This project demonstrates my ability to clean and analyze data using Python and SQL so far. The dataset used for this analysis focuses on general customer information. Through this project, I aimed to uncover meaningful insights and trends by cleaning the data and performing structured queries.
analysis data data-cleaning jupyter-notebook mysql mysql-database python
Last synced: 19 Apr 2026
https://github.com/newrelic-experimental/newrelic-java-aws-kinesis
Provides instrumentation of the Amazon Kinesis Client and Producer
amazon aws client data instrumentation java kinesis nrlabs nrlabs-data nrlabs-odp observability-data producer
Last synced: 15 May 2026
https://github.com/ashishsingh789/customer_purchase_prediction_using_decision-tree-_classifier
Decision Tree Classifier to predict customer purchases using demographic and behavioral data. Key steps: data preprocessing, EDA, model training, evaluation, and feature importance analysis.
data datascience desiciontree eda machine-learning-algorithms matplotlib numpy pandas-dataframe python seaborn
Last synced: 11 Apr 2026
https://github.com/pawlo77/messenger-analyser
Repo for Data Visualization project, part of IAD study program at Faculty of Mathematics and Information Science, Warsaw University of Technology
Last synced: 17 May 2026
https://github.com/ramtinsoltani/safe-cli
A simple Command-line Interface which encrypts and decrypts UTF-8 files using AES-256.
aes-256 cli data data-hook decryption encryption generator handlebars hooks markup partial partial-decryption password safe swap temp temporary tool
Last synced: 16 Apr 2026
https://github.com/mightymetrika/scdtb
Single Case Design Toolbox
data math r science statistics
Last synced: 04 Jan 2026
https://github.com/lukaszkn/data-software-engineering-interview-questions
Data and Software engineering interview questions
data engineering interview-questions python
Last synced: 20 Jul 2025
https://github.com/nikashj/pizza-sales-dashboard-analysis
Pizza sales analysis using Power BI
data data-analysis data-visualization dax-expression excel powerbi
Last synced: 06 Apr 2026
https://github.com/ember-nexus/reference-dataset
Ember Nexus API backup containing different standardized scenarios
Last synced: 25 Jan 2026
https://github.com/ahmad-ali-rafique/logistic-regression-modeling
An in-depth exploration of logistic regression models, including data cleaning, model building, and performance evaluation on various datasets.
accuracy confusion-matrix data dataanalytics logistic-regression logistic-regression-classifier machine-learning-algorithms mlmodels model modelling regression-models
Last synced: 11 Sep 2025
https://github.com/wilcotomassen/lorem-datum-core
Java-based data generator for data simulation
data dataset generator java lorem-ipsum simulated-data
Last synced: 11 Jan 2026
https://github.com/ffatahillah7/snowflake-tastybytes-data-warehouses
Build Snowflake Tasty Bytes Warehouses
data data-warehouse mysql snowflake sql warehouse
Last synced: 26 Mar 2025
https://github.com/notthestallion/data_visualisation-examples
This repository was created to learn and practice plotting graphs and visualizing data. The goal is to gain experience in creating compelling and informative visualizations.
data data-science data-visualization database learn learn-to-code learning learning-by-doing matplotlib matplotlib-figures matplotlib-pyplot visualization
Last synced: 12 May 2026
https://github.com/kaizadp/bbwm_moisture
HOBO data for soil moisture - Bear Brook Watershed in Maine
Last synced: 17 May 2026
https://github.com/octoenergy/tentaclio-gs
A python project containing all the dependencies for gs tentaclio schema.
Last synced: 24 Jun 2025
https://github.com/octoenergy/tentaclio-postgres
A python project containing all the dependencies for postgresql tentaclio schema.
Last synced: 24 Jun 2025
https://github.com/octoenergy/tentaclio-athena
A python project containing all the dependencies for awsathena+rest tentaclio schema.
Last synced: 24 Jun 2025
https://github.com/octoenergy/tentaclio-s3
A python project containing all the dependencies for s3 tentaclio schema.
Last synced: 24 Jun 2025
https://github.com/octoenergy/tentaclio-databricks
Module that adds Databricks support to tentaclio
Last synced: 24 Jun 2025
https://github.com/thesfinox/fit-the-data
Data analysis using Wolfram Mathematica
analysis data data-analysis lab mathematica wolfram wolfram-mathematica
Last synced: 24 Jan 2026
https://github.com/nika2811/new-york-city-taxi-fare-prediction
This project uses a New York City taxi dataset to predict the fare of the next trip. The dataset can be downloaded from https://www.kaggle.com/kentonnlp/2014-new-york-city-taxi-trips and contains 8 features along with the GPS coordinates of pickup and dropoff.
data data-preprocessing data-visualization decision-trees feature-engineering kaggle kaggle-competition linear-regression machine-learning neural-network nyc polynomial-regression ridge-regression scikit-learn taxi taxi-data tensorflow xgboost
Last synced: 06 Apr 2025
https://github.com/hidayathamir/get-telegram-group-data
With this project you can export data from your Telegram group to a CSV file.
bahasa-indonesia data python3 scrape telegram telethon
Last synced: 13 Sep 2025
https://github.com/octoenergy/tentaclio-gdrive
A python project containing all the dependencies for the gdrive tentaclio schema
Last synced: 24 Jun 2025
https://github.com/amethyst-php/office
amethyst amethyst-package api data laravel office
Last synced: 17 May 2026
https://github.com/patrikcze/meshtatic_data
Meshtastic Data Transfer - An experiment in something silly: transferring files over a LoRa network.
data meshtastic meshtastic-python
Last synced: 03 Feb 2026
https://github.com/debjyotisaha/hands-on-sql
My Learning Path towards SQL
cte data data-analysis insert joins select sql subqueries update
Last synced: 04 Apr 2025
https://github.com/opengeoshub/vdownload
A Powerful Geospatial Data Downloader
Last synced: 19 May 2026
https://github.com/huspacy/huspacy-resources
Resources for building and evaluating huspacy
Last synced: 21 Mar 2025
https://github.com/prasad-chavan1/bank_data_analysis_r
Bank data analysis in R language
data data-analysis data-science r
Last synced: 24 Feb 2025
https://github.com/furkantosun1607/cse201-data-structure
This repository contains implementations of various data structures completed as part of the CSE201 (Data Structures) course. Each week, a different data structure was implemented during lab sessions.
array arraylist bfs-search binarytree data dfs-search java linkedlist queue stack structure tree-structure
Last synced: 26 Jun 2025
https://github.com/greatwoman23/sentiment-analysis-on-amazon-products-review
Sentiment_Analysis_On_Amazon_Product_Review
analysis dashboard-application data data-science datascientistproject machine-learning publication python remotejob
Last synced: 17 May 2026
https://github.com/brayflex/spy-sector-rotation-google-sheet
Creates a dynamic spreadsheet to visualize SPY and its 11 largest sector ETFs. See market trends and identify potential sector rotation opportunities.
data etf google-sheets index price rotation script sector spreadsheet spy stock-market
Last synced: 29 Jun 2026
https://github.com/progati00/marketing-mix-modeling-mmm-for-marketing-budget-optimization
A Marketing Mix Modeling (MMM) project using Python to analyze channel performance, calculate ROI, and simulate marketing budget changes for better business decisions. Includes a trained Linear Regression model, ROI analytics, and a Flask API for revenue prediction.
api budget-optimization data data-analysis data-science ecommerce eda flask jupyter-notebook linear-regression machine-learning marketing-analytics marketing-mix-modeling python roi-analysis vscode
Last synced: 14 Apr 2026
https://github.com/wittyicon29/zeotap-ds-assignment
Internship application assignment
Last synced: 19 Aug 2025
https://github.com/KarajMiglani-DataScientist/karajmiglaniFAKE-NEWS-DETECTION
FAKE_NEWS_PREDICTION
algorithms data data-science flask machine-learning probability-statistics python statistics structure
Last synced: 19 Aug 2025
https://github.com/urvish-06/seaborn-dataset
Seaborn data sets
csv csv-files data data-science data-visualization dataset example jupyter-notebook jypyternotebook python seborn vacation
Last synced: 18 May 2026
https://github.com/ahmad-ali-rafique/wine-quality-dataset
Comprehensive analysis and modeling of the Wine Quality dataset, including exploratory data analysis (EDA), data preprocessing, model training, and performance evaluation using MSE and RMSE.
analytics data datacleaning decision-tree-regression exploratory-data-analysis gradient-boosting-regressor linear-regression machine-learning mean-square-error model
Last synced: 21 Aug 2025
https://github.com/ssiarhei115/shop-customers-segmentation
Shop customers segmentation
data data-analysis data-science data-visualization
Last synced: 24 Aug 2025
https://github.com/mateuszskoczek/generatorcsv
GeneratorCSV is a student and teacher data converter for Microsoft 365 Admin Center. The project was implemented for Sobolew High School.
admin converter data microsoft365 python school tkinter
Last synced: 26 Aug 2025
https://github.com/lucasnbsb/data-structures-and-algorithms
Studying data structures and algorithms, mostly on leetcode
Last synced: 29 Aug 2025
https://github.com/roggersanguzu/weather-medical-expense-prediction-ml-models
This repo contains one model for determining rainfall patterns and another for predicting medical expenses.
data data-analysis data-science datasets joblib machine-learning machine-learning-algorithms scikitlearn-machine-learning
Last synced: 30 Aug 2025
https://github.com/bilgehangecici/datatypeconverter
Converts integers and floating-point numbers to their bit-level representations.
data datatypeconverter java machine-level variables
Last synced: 30 Mar 2025
https://github.com/koppalexander/flightdelaychallenge
This project focuses on predicting flight delays using historical data from a Tunisian airline. We analyzed patterns in airport operations and flight schedules to build a machine learning model that can forecast potential delays.
data data-science machine-learning machine-learning-algorithms machinelearning prediction predictive-modeling
Last synced: 19 Jun 2026
https://github.com/team-hydrogen/2025-adc-data
All files relating to the computation of the data provided
data jupyter-notebook nasa-app-development-challenge
Last synced: 11 Apr 2025
https://github.com/snimmagadda1/luigi-etl-example
🔍 Example of an ETL pipeline using Spotify's Luigi
data luigi luigi-pipeline python spotify
Last synced: 30 Mar 2025
https://github.com/nevoland/unchangeable
🧊 Tools for immutable values.
data datastructure functional immutable persistent pure stateless
Last synced: 24 Jul 2025
https://github.com/fatihemres/Fruits
Fruit details app built with SwiftUI. Uses data, models, and animation, with a practical onboarding example.
animations data models onboarding swift swiftui
Last synced: 31 Aug 2025
https://github.com/team810/frcs
FRCS is online, international, crowdsourced data collection software written for FRC competitions. It was created by Team 810, The Mechanical Bulls.
Last synced: 14 Mar 2025
https://github.com/haideratgh/sql-data-analytics-project
This repository contains a collection of SQL scripts demonstrating various analytical techniques, such as change-over-time, cumulative, performance, data segmentation, and part-to-whole analysis.
analytics business-analytics business-intelligence data data-analysis data-analyst data-analytics data-engineering data-science data-scientist database datascience query reporting sql sql-query sql-server window-functions-in-sql
Last synced: 29 Jun 2025
https://github.com/mnkanout/patients_medication_prediction
The aim of the project is to create a model that can help medical professionals select the proper medication for patients based on their symptoms. The model uses historical data of other patients to predict what could be the most suitable medication based on the patient's symptoms.
data data-analysis data-science data-visualization decision-tree-classifier machine-learning python3
Last synced: 29 Jun 2025
https://github.com/diordany/spicemill
Tool for plotting Ngspice simulation results with Pyplot.
analysis data electrical-engineering electronics frontend integrated-circuit integrated-circuits ngspice plot plotting post-processing pyplot python raw simulation spice
Last synced: 13 Jan 2026
https://github.com/checco9811/data-engineering-bootcamp-homework
Homework solutions for DataExpert.io data engineering bootcamp
apache-spark data data-engineering sql
Last synced: 14 Mar 2025
https://github.com/nicholas-owen/rdm-calendar
A small utility to manage conference and event information
calendar conference data event research
Last synced: 26 May 2026
https://github.com/open-geodata/sp_bh_pcj-2020-2035
Spatial data from the Agência das Bacias PCJ, with information presented in the 2020-2035 Basin Plan (Plano de Bacias).
Last synced: 16 Jan 2026
https://github.com/gianlucatruda/titanic
An exhibition of my experience in data processing and visualisation. Python script to process and visualise the Titanic survivor data.
data database flask info matplotlib python science scrape server titanic visualisation web
Last synced: 10 Apr 2026
https://github.com/thicclatka/tetration
New file format for tensors
cli data fileformat mmap tensors
Last synced: 26 May 2026
https://github.com/tomcardoso/journalism-data-intersection
A talk on working at the intersection of journalism and data science
data data-journalism journalism
Last synced: 15 May 2025
https://github.com/mierune/tinybufr
[WIP] A Rust library for decoding BUFR (Binary Universal Form for the Representation of meteorological data) files.
bufr data meteorology rust weather wmo
Last synced: 15 May 2025
https://github.com/mnz1365/saving-record-time-text
Saves the date to a text file with Python
data python txt-files writefile
Last synced: 18 Jul 2025
https://github.com/fuzzt/location-analyzer
The Location Data Analyzer is a Spring Boot application that offers insights on location data, such as counting locations by type, calculating average ratings, and identifying the most reviewed and incomplete entries. It features a simple frontend (HTML, CSS, JavaScript) and is deployed on Render.
analysis api average css data deployment docker fetch-api frontend html javascript location maven ratings render restful-api reviews spring-boot techstack
Last synced: 11 Apr 2026
https://github.com/nisanth2004/springboot-kafka-real-world-project-wikimedia
A Wikimedia project that uses Apache Kafka to stream and process Wikimedia data.
async broker communication data java kafka message real-time real-time-analytics springboot wikimedia
Last synced: 14 May 2026
https://github.com/diegoperea20/datos-secuenciales-con-ia
Processing of one-dimensional signals with autoregressive models, 1D convolution, 2D convolution using the spectrogram, and recurrent networks
ai artificial-intelligence convolutional-neural-networks data ia secuential-data spectrogram uao
Last synced: 06 Feb 2026
https://github.com/gappeah/layoffs-exploratory-data-analysis
This project uses MySQL to perform data cleaning and exploratory data analysis (EDA) on a dataset detailing company layoffs. The primary goal is to process, clean, and explore the data to gain insights into trends and patterns related to layoffs across various sectors.
data dataanalysis eda mysql sql
Last synced: 12 Jul 2025
https://github.com/vlamug/ratibor
Ratibor is a service for making metrics from data
Last synced: 10 Mar 2026
https://github.com/naveenk-ds/redbus_web_screaping.app.py
🚌 The Red Bus Project is a web scraping and visualization tool built with Selenium to extract bus information from the RedBus website. It stores the data in a MySQL database and provides an interactive visualization interface using Streamlit. The goal is to deliver insights into bus schedules, prices, ratings, and more.
data data-science database-management pandas pyhton selenium-webdriver sql
Last synced: 11 Apr 2026
https://github.com/dms-codes/scrape-kesaintblanc-id
Kesaintblanc Data Scraper: a Python script that scrapes product data from the Kesaintblanc website. It collects information about products, including product name, URL, price, image URLs, status, stock, and more. The scraped data is saved to a CSV file for further analysis.
data kesaintblanc python webscraper
Last synced: 27 May 2026
https://github.com/abirsaha111/ipl-2022-analysis
The IPL 2022 Analysis project is a data-driven exploration of the Indian Premier League (IPL) 2022 cricket tournament. The analysis focuses on utilizing Python programming and various libraries to analyze and visualize the performance of teams, players, and key metrics in the IPL 2022 season.
data dataana dataanalytics datavi matplotlib python
Last synced: 07 Jun 2026
https://github.com/jpcadena/palmer-penguins
Palmer Penguins
analytics csv data data-analytics data-science exploratory-data-analysis matplotlib numpy palmer-penguin pandas plotly pylint python seaborn visualization
Last synced: 11 Apr 2026
https://github.com/cognitixe/metamask-wallet-recovery-funds-phrase-data-seed-token
This repository provides tools and guidelines for securely recovering MetaMask Wallet funds using recovery phrases, seed data, and tokens. It ensures safe and reliable methods for recovering access to your wallet and managing your cryptocurrency assets.
bitcoin blockchain cryptocurrencies cryptocurrency data ethereum funds metamask metamask-bot metamask-desktop metamask-extension metamask-plugin metamask-snap metamask-wallet phrase recovery seed token wallet wallet-security
Last synced: 13 May 2026
https://github.com/equinor/fmu-sumo
Interaction with Sumo in the FMU context
analytics data fmu python subsurface sumo visualization
Last synced: 01 May 2025
https://github.com/gunjanmimo/d3-visualization
D3.js is a JavaScript library for producing dynamic, interactive data visualizations in web browsers. It makes use of Scalable Vector Graphics, HTML5, and Cascading Style Sheets standards. It is the successor to the earlier Protovis framework
d3js data data-science data-visualization reactjs
Last synced: 29 Apr 2026
https://github.com/bastianolea/cut_comunas
Updated version of the unique territorial codes (CUT, códigos únicos territoriales) of the country's communes and regions.
Last synced: 24 Jun 2026
https://github.com/eslamdyab21/data-visualization-using-matplotlib-and-seaborn
This is the final project in the Udacity nanodegree program. It is about data visualization.
data data-analysis data-visualization matplotlib pandas python seaborn udacity udacity-data-analyst-nanodegree
Last synced: 09 May 2026
https://github.com/2022-04-11588/data-fakes
🔍 Generate realistic fake data for testing and development, enhancing your projects with simple, customizable data solutions.
data dataset developer-tools fake-content faker fakery groovy java mock phoenix python random ruby seeding struct swift-framework test-data testing
Last synced: 11 Apr 2026
https://github.com/nel-zi/insighthire_agency
Built a web scraping solution using BeautifulSoup to extract job listings from MyJobMag, cleaned the data, and loaded it into PostgreSQL with SQLAlchemy for better job data management.
data dataloading datatransformation sql webscraping
Last synced: 16 May 2025
https://github.com/roovedot/unet-cnn-for-road-segmentation
(In Progress) Unet architecture with CNNs (Convolutional Neural Networks) aimed at Road Segmentation
cnn cnn-for-visual-recognition cnn-pytorch computer-vision data data-engineering data-science unet unet-image-segmentation unet-pytorch
Last synced: 01 Jul 2025
https://github.com/michaelschoenburg/rapidfiretools-computerdatacollector-automation
Automation for RapidFire Tools Computer Data Collector.
automation collector computer data fire powershell powershell-script rapid rapidfire-tools tools
Last synced: 01 Jul 2025
https://github.com/jamiew/void-runners-analysis
basic data analysis for the Void Runners Genesis Fleet spaceships
Last synced: 29 Mar 2025
https://github.com/mohammad-malik/covid-visualizations-d3
This project provides a dashboard with five different perspectives on the pandemic, from patient-infection relationships to regional trends and hierarchical distributions. This was developed as part of a project for the course Data Analysis and Visualization (DS3001).
covid-19 d3 d3-visualization d3js data data-analysis data-analytics data-science visualization
Last synced: 28 May 2026
https://github.com/loosenthedark/going-for-gold
A fairer, more measured look at the Tokyo 2020 Olympic medal count. Countries are ranked in relative (per capita) instead of absolute medal-winning terms. Users can toggle between two different ranking breakdowns, search for countries, contact the site owner and enable dark mode. Mobile-first React application leveraging the REST Countries API as well as a local JSON Olympic dataset. EmailJS and React Context API integration with custom form validation and error handling.
api create-react-app css data es6 fetch-api frontend html5 interactive-front-end-development javascript mobile-first olympics react react-components react-context-api react-hooks react-router react-router-dom reactjs responsive-web-design
Last synced: 07 May 2026
https://github.com/meizuflux/cion
Python minimal data validation library
data minimal python validation
Last synced: 28 May 2026