data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum,
- Last updated: 2026-07-03 00:07:49 UTC
- JSON Representation
https://github.com/johndelatto/automate-your-job-search-ai-applies-to-1000-positions
Automate Your Job Search: AI Applies to 1000 Positions Overnight & Get 100+ Interviews! In todayβs fast-paced and highly competitive job market, finding and securing your dream job can be both time-consuming and exhausting.
ai data non-profit open-ai open-source
Last synced: 28 Jan 2026
https://github.com/veronsheva/global_food_wastage
Global Food Wastage Analysis
analysis data data-analitics pandas predictions python scikit-learn seaborn visualization
Last synced: 18 Apr 2026
https://github.com/kunalshelke90/kunalshelke90
π» Machine Learning Enthusiast | Data Science Explorer | eager about solving problems with help of data.
data data-science dataanalysis database machine-learning mlops
Last synced: 06 Jul 2025
https://github.com/encelo/wetpaper-data
Data files for the WetPaper project
Last synced: 23 Jan 2026
https://github.com/naveenk-ds/redbus_web_screaping.app.py
π Red Bus Project Overview The Red Bus Project is a web scraping and visualization tool built with Selenium to extract bus information from the RedBus website. It stores the data in a MySQL database and provides an interactive visualization interface using Streamlit. The goal is to deliver insights into bus schedules, prices, ratings, etc...
data data-science database-management pandas pyhton selenium-webdriver sql
Last synced: 11 Apr 2026
https://github.com/itsmeyogesh22/Solved-8-Weeks-SQL-Challenge-Correct-Solutions
Included in Serious SQL Virtual apprenticeship program, this repository contains solutions for all eight different case studies crafted by Danny Ma. For more information please visit: https://8weeksqlchallenge.com/
8weeksqlchallenge data dataanalytics datawithdanny postgresql sql sqlserver-2022 t-sql
Last synced: 29 Aug 2025
https://github.com/alsult/alsult
Aliia Sultanova Portfolio
data datascience programming python
Last synced: 23 Jan 2026
https://github.com/zainea-bogdan/data_engineer_project_wowcinema
WoWCinema is a project based on a fictional scenario where I stepped into the role of a Data Engineer, designing and building an end-to-end Data Infrastructure. A ETL pipeline ingests data from multiple sources, transforms it, and loads it into a centralized PostgreSQL data warehouse to power analytics, KPI tracking, and reporting
analytics big-data data datawarehousing etl-pipeline postgres python sql
Last synced: 19 May 2026
https://github.com/woctezuma/epic-games-js
JavaScript on the Epic Games store.
data datamining egs epic epic-games epic-games-api epic-games-launcher epic-games-store epicgames epicgames-api epicgames-launcher epicgames-store graphql graphql-api javascript webpack
Last synced: 27 Oct 2025
https://github.com/elijah-1994/pre-process-e-commerce-dataset
Importing, Cleaning, and Pre-Processing E-Commerce Data for Analysis Using MySQL.
analytics data dataanalytics datacleaning dataprocessing mysql mysql-database sql
Last synced: 11 Mar 2025
https://github.com/a-poor/datatransform.jl
A package for defining (and performing) tabular-data transformations with JSON.
data data-science data-transformation etl feature-engineering json julia julia-package tabular-data
Last synced: 05 May 2026
https://github.com/raulmaulidhino-dev/ml_modelling_regression
There are many factors that influence the grades/scores of students. One of the factors is study hours. In this mini analysis project, there are 3 models that will learn and predict the relation between study hours of students and their scores in an exam/test. This project will result the best ML model to solve the problem.
data data-analysis-python data-science eda machine-learning scikit-learn
Last synced: 28 Jan 2026
https://github.com/rse/nebulize
Nebulize Security-Sensitive Information
data dsgvo gdpr information nebulize security sensitive
Last synced: 16 Mar 2025
https://github.com/kashifkhan7/cleaning-analysis_cli
Analyze sales data easily with our CLI app. Gain insights on revenue trends and visualize results using Python, Pandas, and Matplotlib. ππ
conditional-statements css data datacleaning exception-handling exiftool html json matplotlib-pyplot metadata metadata-extraction pandas-python python sales-analysis seaborn-python speech-to-text transcription youtube
Last synced: 13 Apr 2026
https://github.com/isaacmaffeis/imad-2023
Model Identification and Data Analysis (IMAD) | University course
data data-analysis data-science model model-identification
Last synced: 09 May 2026
https://github.com/zazza123/hamana
A python library for seamless data extraction, storage, and SQL-based analysis using pandas and SQLite.
Last synced: 14 Jan 2026
https://github.com/mfurmanczyk/wh-sales
E-commerce analytics data warehouse ETL made with Apache Spark.
airflow data data-engineering data-warehouse kotlin python spark
Last synced: 24 Jan 2026
https://github.com/emanoelcampos/power-bi-fundamentals
Datacamp's Power BI Fundamentals Skill Track
data data-analyst data-analyst-power-bi datacamp power-bi powerbi
Last synced: 24 Jan 2026
https://github.com/robertoostenveld/dccn.dsc_3015055.00_583_v1
The FieldTrip-SimBio Pipeline for EEG Forward Solutions [Data set].
Last synced: 24 Jan 2026
https://github.com/edjoukou/pizza-sales-report
A data analysis project using SQL with MySQL database
analysis data mysql powerbi visualization
Last synced: 05 May 2026
https://github.com/vatshayan/pokemon-analysis
Visualization, Analysis & Predicting the accuracy of finding Pokemon power, attack & speed through Machine Learning
artificial-intelligence data data-analysis data-science data-visualization dataset machine-learning machine-learning-algorithms pokemon scikit-learn
Last synced: 30 May 2026
https://github.com/semcod/code2llm
Python Code Flow Analysis Tool - Static analysis for control flow graphs (CFG), data flow graphs (DFG), and call graph extraction
ast cfg code code2data code2logic code2process data dfg diagram flow graphs llm
Last synced: 01 Jun 2026
https://github.com/contawo/travel-journal
This is a travel journal application for storing all the places that you have visited. I was learning by doing react when creating this project. I learnt a lot with it and upgraded my reactjs skills.
data learning-by-doing props reactjs
Last synced: 05 May 2026
https://github.com/soenneker/soenneker.cloudflare.origincerts.thumbprints
The current Cloudflare origin certificate thumbprints
cloudflare csharp data dotnet origincerts thumbprint thumbprints
Last synced: 23 Apr 2026
https://github.com/munas-git/codm-review-analysis-and-predictions
Sentiment analysis on Call of Duty Mobile Google Play Store user reviews with ML model to classify new reviews.
data flask machine-learning python sentiment-analysis
Last synced: 05 May 2026
https://github.com/bishtrishu/pizza_sales_analysis_dashboard_sql_bi
Welcome to the Pizza Sales Analysis Dashboard project! This repository contains a comprehensive guide to building an interactive and insightful dashboard for analyzing pizza sales data using SQL and Power BI.
data data-science dataanalyst datavisualization dax dax-query microsoft microsoft-azure microsoft-sql-server msexcel mysql powerbi powerquery project sql
Last synced: 16 Mar 2026
https://github.com/vlamug/ratibor
Ratibor is a service for making metrics from data
Last synced: 10 Mar 2026
https://github.com/raphaellaude/usaschooldata
Cleaned and accessible school enrollment data for US schools
data duckdb duckdb-wasm education object-storage oss wasm
Last synced: 12 May 2026
https://github.com/nukopian/shell-flatten
Flatten a series into a single record
Last synced: 18 Jun 2025
https://github.com/woctezuma/recent-sales-data
Data available to estimate sales of Steam games during release week.
Last synced: 05 Feb 2026
https://github.com/thedhruvish/datasciencewith
datasciencewith
coding data dataanylasis datascience learing machine-learning
Last synced: 08 Jun 2026
https://github.com/buffdelta/basketball_ref_webscraper
Python package to make webscraping from basketball-reference easy
basketball data python python-library webscraping
Last synced: 14 Jan 2026
https://github.com/muthupillai1204/diwali_sales_analysis
The Diwali sales analysis reviews past data to identify trends, peak buying times, popular products, and customer demographics. It assesses sales volume, revenue growth, and promotional effectiveness, helping businesses optimize marketing and inventory for future seasons.
data datacleaning eda excel jupyter-notebook matlplotlib numpy pandas python seaborn visualization
Last synced: 05 May 2026
https://github.com/fiedsch/data_util
misc. Utilities for data files like variable name lists
Last synced: 14 Jun 2025
https://github.com/neuro-mechatronics-interfaces/ros2_data_agent
Code for a multipurpose file explorer specializing in reading ROS2 topic data from '.bag' or '.db3' files
Last synced: 13 Jun 2026
https://github.com/renebentes/2808
Curso 2808 - Fundamentos do Entity Framework
Last synced: 27 Jun 2025
https://github.com/lakshyakumar266/jee-dpp-manager-app
DPP manager app for JEE preparing Students
data expo javascript management react-native
Last synced: 07 May 2026
https://github.com/jigyasag18/credit-card-fraud-detection-using-machine-learning
This repository presents a credit card fraud detection system utilizing a Logistic Regression model trained on a dataset of 284,807 transactions with significant class imbalance. After employing under-sampling for balance, the model achieves a test accuracy of around 93.40%, showcasing the effectiveness of ML in identifying fraudulent transactions.
credit-card-fraud creditcardfrauddetection data dataset logistic-regression logisticregression machine-learning machine-learning-algorithms mlproject mlprojects
Last synced: 02 Sep 2025
https://github.com/jph5396/sumomodel
A data models related to sumo wrestling.
Last synced: 17 Jan 2026
https://github.com/gagolews/clustering-data-v0
Datasets for Clustering [DEPRECATED β A NEW VERSION IS AVAILABLE]
clustering data dataset machine-learning
Last synced: 15 Sep 2025
https://github.com/ntnn/dataparse
Parsing, transforming and unmarshalling data.
data data-parser data-parsing data-transformation golang golang-lib
Last synced: 30 Jun 2026
https://github.com/miss-mhv/data-analysis-for-social-buzz
In this work, we focus on a small dataset extracted from a large enterprise dataset on social buzz.
Last synced: 14 May 2026
https://github.com/canadaluke888/terminaltablebuilder
Build and edit tabular data all from the terminal.
cli data data-manipulation excel json ods rich spreadsheets sqlite3 tables
Last synced: 20 Apr 2026
https://github.com/ressuman/csv-writer-project
CSV Writer with TypeScript. This project demonstrates my implementation of a CSV writer using plain TypeScript and JavaScript, without relying on any frameworks.
Last synced: 15 May 2026
https://github.com/parmsam/rweekly.data
R package containing data on Rweekly posts
Last synced: 21 May 2026
https://github.com/rajlabmssm/echodata
echoverse module: Example data.
data echoverse fine-mapping genomics gwas qtl
Last synced: 17 Jan 2026
https://github.com/rafaeljurkfitz/dbt-jaffle-shop
DBT project with postgres using best practices.
analytics analytics-engineering best-practices data data-engineering dbt dbt-core etl postgresql sql transform
Last synced: 15 Jun 2025
https://github.com/badawy403/egy.list
A Node.js package providing access to official Egyptian data including universities, governorates, cities, and more. This package makes it easy for developers to integrate Egypt-specific information into their applications.
city data egypt javascript nodejs npm package
Last synced: 08 Mar 2026
https://github.com/dms-codes/www.usu.ac.ididdirektori
Faculty and Docent Data Retrieval Script The faculty_and_docent_data_retrieval.py script is a Python script for retrieving faculty and docent data from a university website using Selenium. It includes functions to extract faculty names and docent profiles, as well as a multithreading approach to fetch data for multiple faculty-docent pairs.
Last synced: 26 May 2026
https://github.com/indhra/cats-ijcnn-data-2004
CATS IJCNN Data 2004 Competition of Artificial Time Series
2004 artificial cats data ijcnn time-series
Last synced: 22 Mar 2025
https://github.com/kenjyco/mongo-helper
Helper funcs and tools for working with MongoDB
aggregation-pipeline data database kenjyco mongo mongodb python
Last synced: 28 Jan 2026
https://github.com/ioboi/obloc-data
Scrape guest counter of O'BLOC π§ββοΈ
Last synced: 04 Nov 2025
https://github.com/josecsotomorales/dataform
Repository for testing dataform
cli data data-engineering data-transformation
Last synced: 27 Mar 2025
https://github.com/soenneker/soenneker.timezones.data
Provides TimeZone geometry
csharp data dotnet geometry lookup polygons timezone timezones timezonesdata
Last synced: 30 May 2026
https://github.com/dakostu/grabbag.h
A data structure for non-deterministic element selection in C++11
cpluscplus cpp cpp-component cpp-library cpp11 data data-structure data-structures generics non-deterministic random randomization template
Last synced: 19 Oct 2025
https://github.com/dms-codes/scrape_tripsantai
Trip Santai Tour Data Scraper This Python script is a web scraper designed to extract and collect information about tours from the Trip Santai website. It utilizes the requests library to fetch web pages, BeautifulSoup for parsing HTML, and writes the collected data to a CSV file.
beautifulsoup4 data python requests scraper webscraper
Last synced: 21 May 2026
https://github.com/bfontaine/datatools
:triangular_ruler: Some scripts I use to work with data
Last synced: 23 Jul 2025
https://github.com/nikashj/pizza-sales-dashboard-analysis
Pizza sales analysis using Power Bi
data data-analysis data-visualization dax-expression excel powerbi
Last synced: 06 Apr 2026
https://github.com/ember-nexus/reference-dataset
Ember Nexus API backup containing different standardized scenarios
Last synced: 25 Jan 2026
https://github.com/ngupta23/data_prep_helper
A helper package for preparing and combining data from a variety of sources
data data-science dataprep datapreparation dataprocessing helpers python
Last synced: 03 Apr 2025
https://github.com/omari-kd/recommendation-system-analysis-and-modelling
This project aims to develop a recommendation system that leverages historical user data to provide tailored recommendations across different domains, such as product recommendations, content suggestions and service optimisation.
data data-science data-science-in-r machine-learning-algorithms recommendation-system
Last synced: 08 Jan 2026
https://github.com/j-hagedorn/locals
:globe_with_meridians: A collection of tidied, neighborhood-level public datasets
address-dataset census-data census-tract data neighborhood social-sciences
Last synced: 03 Feb 2026
https://github.com/lut-ful/e-commerce-sales-report
This dashboard provides a visual analysis of e-commerce sales data
data data-analytics data-science data-visualization power-bi statics
Last synced: 28 Jun 2025
https://github.com/interzoid/typescript-examples
Provides TypeScript examples for consuming several of the Cloud APIs available from Interzoid, including company name matching, individual name matching, weather, page performance, email validation, currency rates/FOREX, and global telephone information.
angular api cloud data database matching nodejs quality typescript
Last synced: 12 Jan 2026
https://github.com/cody-scott/arclint
A flexible tool to validate and improve your data in ArcGIS using regex and other methods
arcgis arcgispro data lint regex validation
Last synced: 14 May 2025
https://github.com/karajmiglani-datascientist/karajmiglanifake-news-detection
FAKE_NEWS_PREDICTION
algorithms data data-science flask machine-learning probability-statistics python statistics structure
Last synced: 22 May 2026
https://github.com/rickstaa/ai-compute-visualizer
A StreamLit-based web application to visualize GPU inventory and AI capabilities on the Livepeer network.
Last synced: 28 Jun 2025
https://github.com/matheussoranco/how-to-estimate-required-sample-size-for-model-training
Modeling the relationship between training set size and model accuracy.
artificial-intelligence data jupyter-notebook machine-learning python
Last synced: 22 May 2026
https://github.com/nodamu/apache-beam-studies
Personal Apache Beam studies repository
apachebeam batch-processing data dataeng dataengineering datapipeline stream-processing
Last synced: 04 Nov 2025
https://github.com/talitalobo/statistics-with-python
Repo about statistical concepts and (not always) their python implementation.
data data-science machine-learning statistics
Last synced: 11 Jan 2026
https://github.com/The-Tech-Idea/Beep.winform.Sample
Application for Managing your Different DataSources . Still in Alpha.please be patient
application data data-science database dataset integeration mysql nosql oracle postgres sqlite sqlserver workflow-engine workflows
Last synced: 04 Nov 2025
https://github.com/amethyst-php/target
amethyst amethyst-package api data laravel target
Last synced: 22 May 2026
https://github.com/aiwithqasim/p1_explore-weather-trends
In this project, I'll analyze local and global temperature data and compare the temperature trends where I live to overall global temperature trends. Moreover i will use SQL query to extract data from the given Data base and i have to visualize the insight or Average temperature to find the findings.
data dataanalyst database datavisualization nanodegree udacity
Last synced: 22 May 2026
https://github.com/emna-chebbi/student-performance
Predictive model for student exam scores based on student performance factors
ai computer-vision data kaggle machine-learning ml mse regression regression-models
Last synced: 15 May 2026
https://github.com/iamyourdre/naive-bayes-classifier-js
Naive Bayes classifier developed with MySQL, ExpressJS, and NodeJS by @iamyourdre.
backend data data-science expressjs javascript mysql naive-bayes naive-bayes-algorithm naive-bayes-classifier nodejs
Last synced: 08 Apr 2026
https://github.com/iyashwantsaini/tweetify_
Twitter Data Collection, Analysis Tool
collection data twitter twitter-sentiment-analysis
Last synced: 08 Mar 2026
https://github.com/mobinx/easymeet-js
EasyMeetjs is a robust and versatile TypeScript library that provides a solid foundation for building WebRTC-based applications. It simplifies the complexities of WebRTC, enabling developers to easily incorporate real-time communication features into their projects.From simple audio video calling to real time peer to peer file transfer , everything
data meeting react realtime screensharing streaming-video webrtc zoom
Last synced: 03 Jan 2026
https://github.com/moscatellimarco/webscrap-imdb
π¬ Python scraper for IMDB: Extract movie/TV details for π analysis & ποΈ storage. Easy setup, π§ customizable, with π₯οΈ CLI.
css data datascience html movies python scrapy scrapy-crawler scrapy-spider web web-scraping webdata webscraping
Last synced: 15 May 2026
https://github.com/xenoverseup/data-structures
Data structures in every language I know.
cpp data data-science data-structures data-structures-and-algorithms doubly-linked-list linked-list
Last synced: 14 May 2026
https://github.com/richelbilderbeek/heyahmama
Data about the Flemish/Dutch band K3
band data k3 package r r-lang r-language
Last synced: 22 May 2026
https://github.com/shysolocup/fndt
JavaScript package allowing you to see function data like body and arguments from outside of the function
aepl data fndt functions javascript javascript-tools js js-function js-functions lightweight nodejs nodejs-modules package stews
Last synced: 30 Apr 2026
https://github.com/rsc-labs/see-open-data
Show www.dane.gov.pl in user friendly format. Generate flourish data or other data visualizations.
data data-visualization flourish government poland
Last synced: 04 Apr 2025
https://github.com/hackolade/yugabytedb-ysql
Hackolade(https://hackolade.com) plugin for the Cloud Native Yugabyte database with YSQL API
data data-modeling entity-relationship-diagram schema-design ysql yugabyte yugabytedb
Last synced: 30 Apr 2025
https://github.com/shubhamsoni98/analysis-with-sql
This project focuses on creating and managing a database for a music record company to perform various analyses on bands, albums, and songs. Using SQL, the goal is to create a structured relational database with relevant tables, insert necessary data, and perform queries that provide insights into the relationships between bands, albums, and songs.
analys analysis data data-science database dbms mysql mysqlworkbench project query schema sql
Last synced: 03 Jan 2026