data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum
- Last updated: 2026-07-03 00:07:49 UTC
https://github.com/v-mayya/python-sales-data-analysis
Two-person group project, run by CFG, that uses Python to analyze spreadsheets of fake sales data
analysis data matplotlib numpy python
Last synced: 29 Apr 2026
https://github.com/iamjuniorb/data_structures_and_algorithms
I'm taking the Data Structures and Algorithms I (C949) class at school and decided to write up these searching algorithms, sorting algorithms, structures, and so on to understand them better. They can be run against large datasets to test their space and time complexity.
data data-analysis data-science data-structures datastructures datastructures-algorithms datastructuresandalgorithm math mathematics programming python python-app python-library python3
Last synced: 08 Jun 2026
https://github.com/chrnthnkmutt/theartofstatistic_python
Python visualizations based on David Spiegelhalter's book The Art of Statistics
data data-science data-visualization machine-learning statistics
Last synced: 08 Jun 2026
https://github.com/timclicks/dataclerk
zero fuss data logging over HTTP
actix-web command-line data logging rust sqlite sqlite3 utility
Last synced: 30 Apr 2026
https://github.com/scarblase/salary-comparison
Submission for the DataCamp Salary Competition (1 level). 🏆
data data-analysis data-science data-visualization engineering python sql structured-data
Last synced: 01 May 2026
https://github.com/stonecypher/name_data.js
Name data in JSON for JavaScript
data dataset datasets english first-name generator-data javascript javascript-data json last-name name test test-data testing
Last synced: 01 May 2026
https://github.com/divanny/academixbackend
🧑‍🎓 Academix is a comprehensive academic management system designed to streamline and enhance the educational experience for both students and professors. This repository contains the backend codebase for the Academix system, responsible for handling data processing, authentication, and API endpoints.
backend csharp data net webapi
Last synced: 07 Jun 2026
https://github.com/athari22/house_sales_in_king_count_usa
The project performs a data analysis for a Real Estate Investment Trust that would like to start investing in residential real estate.
analysis data data-science data-visualization ibm ibm-watson linearregression machine-learning matplotlib numpy pandas sklearn-library
Last synced: 01 May 2026
https://github.com/ggeop/multiple-fields-management
Fields management from/to different data sources. 💡
data data-engineering data-organization data-retrieval data-science pandas python
Last synced: 01 May 2026
https://github.com/stefanbohacek/fediverse-account-analyzer
bots botsinspace data dataviz fediverse mastodon
Last synced: 02 May 2026
https://github.com/unicef/magicbox-download-shapefiles
Downloads shapefiles for each country from gadm.org and unzips them.
data data-science docker downloads-shapefiles emergency-response gadm geospatial geospatial-data humanitarian javascript magicbox nodejs shapefile unicef
Last synced: 02 May 2026
https://github.com/dominhduy09/my-links
All of the links and websites I have created, saved in one place
data database link linked-list linktree list save storage website
Last synced: 25 Jun 2026
https://github.com/perceptronv/miscellaneous
A huge variety of materials, mostly training data for AI. Not a lot of source code yet.
data gan machine-learning nlp text-generation
Last synced: 04 May 2026
https://github.com/kucingkode/dmerge
Small JavaScript library that helps you merge identically formatted data in a string
cithak data data-merge javascript library lightweight lightweight-javascript-library merge open-source
Last synced: 04 May 2026
https://github.com/nfaltir/dataxplorer
🔬 A Streamlit app that performs various data exploration operations on an uploaded dataset instantly.
data data-science python streamlit
Last synced: 05 May 2026
https://github.com/dkosarevsky/db_cp
DB course project
data database db postgres postgresql postgresql-database postgressql
Last synced: 05 May 2026
https://github.com/hasnocool/war_thunder_data_scraper
A web scraping tool designed to extract valuable data from War Thunder, a popular online game.
data database framework integration multi processing python scraper scraping scrapy sql threaded thunder war
Last synced: 06 May 2026
https://github.com/satur-io/estoraje
Estoraje is the simplest distributed system for key-value storage in less than 800 lines of code. It is temporary consistent, highly available, lightweight, scalable, and gives good performance.
data database distributed go golang key-value performance training
Last synced: 07 May 2026
https://github.com/yash22222/sync-intern-s-ml-tasks
SYNC INTERN'S Machine Learning internship lets you enhance your skills through real-life example projects. It builds your knowledge of data and algorithms so that you understand how a machine learns.
bhpp boston-house-datasets boston-house-price-prediction boston-house-pricing data data-structures machine-learning machine-learning-algorithms numpy pandas sync-intern sync-interns
Last synced: 07 May 2026
https://github.com/bastianolea/comisarias_chile
Database of police stations (comisarías), retenes, tenencias, and other Carabineros facilities
Last synced: 23 Jun 2025
https://github.com/ultrasage-danz/scikit-learn-ml
Machine Learning with scikit-learn by Data School
ai data data-school machine-learning macos ml scikit-learn ultrasage-dan
Last synced: 13 May 2026
https://github.com/masu-baumgartner/dbsync.net
A C# MySQL model sync library
Last synced: 13 May 2026
https://github.com/pferreirafabricio/data-immersion
🏊🏻‍♂️ Activities and exercises from the 'Imersão Dados' event
data data-analysis data-science dataset jupiter-notebook python
Last synced: 14 May 2026
https://github.com/yord/klp-dsv
A delimiter-separated values plugin for klp (Kelpie), the small, fast, and magical command-line data processor.
csv data deserializer dsv json kelpie klp marshaller parser serializer ssv tsv
Last synced: 14 May 2026
https://github.com/gsmith257-cyber/bit3434cve
BIT 3434 project on data mining CVEs and exploits
cve data data-mining exploits research-project
Last synced: 17 Jun 2026
https://github.com/cintia0528/data_science-ab_testing
Conduct a 5-way AB Test on Montana State University Library's website, comparing the original "Interact" button with new versions ("Learn," "Help," "Connect," "Services") to boost user engagement.
abtesting bonferroni chisquare-test data data-science datacleaning datavisualization hypothesis-testing mde statistics
Last synced: 31 Mar 2025
https://github.com/cosmos-loops/cosmos-dapper
Cosmos.Dapper is a part of Cosmos.Data, an inline project of COSMOS LOOPS PROGRAMME. This repository provides a package of StackExchange.Dapper to improve development efficiency.
dapper data mysql mysqlconnector oracle postgresql sql-query sqlite sqlkata sqlserver
Last synced: 11 Apr 2026
https://github.com/sbdk-dev/sbdk.dev
A complete reference implementation of a local-first ecosystem for AI-powered analytics. This repository contains the source code for the SBDK.dev website, the central hub for the SBDK suite of open-source tools.
ai-powered-analytics data data-engineering data-engineeringlocal-first data-pipeline-automation data-pipelines dbt dlt duckdb elt etl-pipeline llm local-first machine-learning pipeline sbdk semantic-layer
Last synced: 27 May 2026
https://github.com/omarsaad21/it-salary-eda
A Python EDA project on IT department salary data: we explored the data and built visualizations to answer some questions about the dataset
data explotary-data-analysis juypter-notebook numpy pandas python visualization
Last synced: 30 Apr 2026
https://github.com/fnu-ankit/8-week-sql-challenge
My attempt at solving case studies from #8WeeksSQLChallenge
8-week-sql-challenge 8-weeks-sql-challenge 8weeksqlchallenge case-study data data-analysis data-analysis-sql data-analytics database datawithdanny sql sqlserver
Last synced: 19 Apr 2026
https://github.com/mmaithani/kaggle-projects
Collection of resources from the competition, kernel, and data sections, plus all the magic code I have been using to get the most out of a problem
computer-vision data data-science image-processing machine-learning python
Last synced: 30 Apr 2026
https://github.com/isandyawan/simplelinearregression
An application for analyzing data using simple linear regression. It builds a regression model from variables and advises the user if the model breaks a regression assumption
data linear r regression rstudio shiny statistic
Last synced: 14 Oct 2025
https://github.com/yash-chauhan-dev/sf_analytics
Business teams often rely on data analysts to extract insights using SQL. This tool eliminates that dependency by bridging the gap between humans and data using AI.
aiml analytics data dbt langchain llm python snowflake streamlit
Last synced: 07 May 2026
https://github.com/lugolbis/data-immo
End-to-end ETL pipeline
data data-engineering dbt dremio duckdb etl-pipeline lakehouse rust
Last synced: 08 Jun 2026
https://github.com/martinius96/meteostanica-odosielacie-scripty
Weather station - Arduino, ESP8266, ESP32 - sender sketches for presenting data in a web interface.
arduino bme280 bmp280 data dht22 ds18b20 esp32 esp8266 espressif html meteo meteostanica mysel nodemcu php stanica teplota tlak vlhkost webstranka
Last synced: 11 Apr 2026
https://github.com/rafie-b/data-analytics
Data analysis activities.
apache-spark api aws business-analytics data data-analytics data-science database dataframe jupyter-notebook python scikit-learn sql
Last synced: 14 Apr 2026
https://github.com/mamskie/visdat
Google Colab
colab-notebook data visualization
Last synced: 03 Aug 2025
https://github.com/jpcurada/exploralytics
A Python package for creating intermediate Plotly visualizations
data eda plotly python visualization
Last synced: 05 Feb 2026
https://github.com/desininja/food-delivery-realtime-data-analysis
ETL Pipeline in AWS for Real Time Data Analysis
airflow data data-engineering emr-cluster etl kinesis kinesis-strea real-time redshift
Last synced: 15 Oct 2025
https://github.com/servierhub/adsv
Analyze delimiter-separated values files
csv csv-converter csv-format csv-parser csv-parsing csv-reader csv-reading data data-analysis data-engineering data-mining
Last synced: 28 Sep 2025
https://github.com/alecxcode/table-parser
Python Table Parser (data extraction)
automation data extraction python robotic-process-automation
Last synced: 04 May 2026
https://github.com/intersystems-ib/workshop-smart-data-fabric
Learn the main ideas involved in developing a Smart Data Fabric using InterSystems IRIS
analytics data datafabric interoperability smart
Last synced: 14 Apr 2026
https://github.com/benmizrahi/reactivejs
microservices event bus for async/sync communications
Last synced: 01 May 2026
https://github.com/jigyasag18/project-diwali-sales-analysis
This project analyzes retail sales data during the Diwali festival using exploratory data analysis (EDA) to identify buyer demographics and product preferences. The findings reveal that the primary purchasers are married women aged 26-35 from Uttar Pradesh, Maharashtra, and Karnataka, working in IT, Healthcare, and Aviation.
analysis data datapr datapro eda jupyter-notebook python realtimedata
Last synced: 01 Jun 2026
https://github.com/poissonconsulting/klexdatr
An R package of data from the Kootenay Lake Exploitation Study
cran data fish kootenay-lake rstats
Last synced: 16 Oct 2025
https://github.com/fatihilhan42/nba-players-data-1950-to-2021
This project examines data on NBA players from 1950 to 2021. The players' seasons, heights, performance, scoring averages, teams, and positions were loaded from CSV files, then key tables and graphs were produced using data cleaning and data visualization techniques.
data data-analysis data-engineering data-science data-visualization
Last synced: 16 Oct 2025
https://github.com/amethyst-php/tax
amethyst amethyst-package api data laravel tax
Last synced: 11 May 2026
https://github.com/suryadev99/stream_processing_website_click_data
Stream processing of website click data using Kafka, monitored and visualised with Prometheus and Grafana
clickdata data dataengineering docker flink-kafka flink-metrics flink-stream-processing git grafana kafka kafka-streams kafka-topic prometheus psql python
Last synced: 10 Mar 2026
https://github.com/cassandrajm/reddit-dashboard
INTERACTIVE DASHBOARD: Analyzing Political Discourse on Reddit: A Multi-Faceted NLP Approach to Toxicity, Bias, and Political Stance
capstone data data-analysis data-science politics python reddit
Last synced: 09 Apr 2025
https://github.com/psgebeline/harvard-data-science
My work for the nine courses in Harvard's data science program, each with notes/assignments. Work in progress.
data linear-regression machine-learning modeling probability-theory r visualization wrangling
Last synced: 19 Oct 2025
https://github.com/parvezk/d3-fundamentals
D3 library API fundamentals
charts d3 data graphs visualization
Last synced: 19 Oct 2025
https://github.com/entorb/analyze-ha-energy
Analyze Home Assistant Solar Production Data
data home-assistant pandas photovoltaic pv python
Last synced: 08 May 2026
https://github.com/badranalyst/covid-deaths-and-vaccinations-sql-data-exploration
This project involves exploratory data analysis on COVID-19 deaths and vaccinations data using SQL. It aims to uncover trends, patterns, and insights related to vaccination rates and their impact on mortality. The analysis provides a clearer understanding of the pandemic's dynamics, facilitating data-driven decisions in public health.
covid-19 data data-exploration dataset sql
Last synced: 19 Feb 2026
https://github.com/sakan811/honkai-star-rail-a-few-fun-insights-with-data-analysis
The project offers insights into the stats of all Honkai Star Rail characters available as of the given date.
data data-analysis data-science data-visualization docker flask game honkai honkai-star-rail honkai-starrail seaborn webscraping webscraping-data webscraping-selenium
Last synced: 10 Jun 2026
https://github.com/dilkushsingh/webscraping-with-selenium-and-beautifulsoup
Scraped a popular tech gadgets website using Selenium and BeautifulSoup, then performed data analysis on the scraped data.
beautifulsoup data datacleaning datagathering eda exploratory-data-analysis python selenium webscraping
Last synced: 24 Feb 2026
https://github.com/coko7/vegapull-records
Cards dataset for One Piece TCG
data dataset one-piece one-piece-card-game one-piece-tcg tcg
Last synced: 26 Feb 2025
https://github.com/jigyasag18/sql-music-store-analysis
This repository contains an analysis of sales and customer data from a fictional music store. Using SQL, we explore trends in sales, popularity of artists and genres, and customer purchasing behavior. The project aims to derive actionable insights that can guide marketing strategies and inventory management decisions.
data dataanalysis dataanalytics database database-management dataset sql sqlqueries sqlquery
Last synced: 08 Jun 2026
https://github.com/bhojpur/dlm
The Bhojpur DLM is a software-as-a-service product used for Data Lifecycle Management based on Bhojpur.NET Platform for data delivery.
Last synced: 19 Feb 2026
https://github.com/robertoostenveld/dcn.dsc_62002071_01_114_v1
Simon task M/EEG data [Data set].
Last synced: 23 Jan 2026
https://github.com/athari22/analyzing-the-yelp-dataset
SQL for Data Science
analytics data data-science data-structures er sql
Last synced: 27 Jan 2026
https://github.com/gabrieldim/complete-analysis-covid-19
Analysis of COVID-19.
analysis covid-19 covid19 data data-science science virus
Last synced: 23 Jan 2026
https://github.com/stefen-taime/mako-main
Declarative framework for real-time data pipelines. YAML in, events out.
data datapipeline declarative-config declarative-pipeline declarative-programming declarative-workflows framework open-source
Last synced: 26 Jun 2026
https://github.com/adamouization/python-machine-learning-data-science-notes
📙 Jupyter notebooks containing useful Python code and notes for general Machine Learning and Data Science projects.
data data-science data-visualization guide jupyter jupyter-notebook machine-learning matplotlib notes numpy pandas pandas-dataframe python seaborn
Last synced: 11 Apr 2026
https://github.com/nel-zi/climainsights
Developed an automated ETL pipeline using Apache Airflow and Python to collect, process, and store weather data from multiple cities via Weatherstack API. Implemented data cleaning, orchestration, and error handling to ensure accuracy and scalability.
airflow apache-spark data data-engineering engineering etl-pipeline
Last synced: 01 May 2026
https://github.com/mattjesc/ddo-semiconductor
Data-Driven Optimization of Semiconductor Processes and Forecasting
ai artificial-intelligence data data-science data-visualization deep-learning keras machine-learning manufacturing ml prophet python pytorch semiconductor semiconductor-manufacturing semiconductors tensorflow
Last synced: 23 Feb 2026
https://github.com/nushratjabenaurnima/cse_477_data_mining
A collection of labs, reports, Jupyter notebooks, and project outputs for the CSE 477 Data Mining course. This repository tracks my learning journey through data preprocessing, association rules, clustering, classification, and real-world data analysis with Python.
data data-analysis data-mining data-science google-colab-notebook jupyter-notebook machine-learning python python-3
Last synced: 09 Apr 2026
https://github.com/edjoukou/human_resources
A data analysis project using a MySQL Server database
analysis data mysql powerbi sql visualization
Last synced: 25 Sep 2025
https://github.com/eva-kaushik/data-clustering
Clustering Accelerators for hard and soft clustering, including implementations of K-means, K-medoids, hierarchical clustering, fuzzy C-means, and Gaussian mixture models. Demonstrates text clustering using both hard and soft clustering algorithms.
clustering clustering-algorithm data datascience machine-learning-algorithms
Last synced: 09 Apr 2025
https://github.com/remcostoeten/github-and-vercel-api-showcase-dashboard
Showcases the data that can be fetched from the GitHub and Vercel APIs, built entirely in vanilla JS.
api-rest da data express-js github-api nodejs vercel-api
Last synced: 07 Mar 2026
https://github.com/tdjsnelling/hermes
Hermes is a real-time data framework for React + MongoDB
data docker framework mongodb nodejs react react-hooks reactjs real-time typescript websocket
Last synced: 12 Apr 2026
https://github.com/ryanga09/digitalent_fundamentaldatascience-selfpractice
A repository of hands-on projects from DigiTalent’s Fundamental Data Science training, covering web scraping, data exploration, data cleaning, and data annotation. Includes Jupyter notebooks and example code for practical learning.
data data-analysis data-science data-visualization dataset digitalent komdigi notebook-jupyter notebooks
Last synced: 02 Aug 2025
https://github.com/lemaitre4523/old-tiktok-data-report-explorer
An explorer for TikTok data reports
data explorer extract package report simple tdre tiktok tiktok-data-explorer
Last synced: 25 Sep 2025
https://github.com/canadaluke888/speedtable
Ultra-fast terminal table renderer written in C
c data datasets fast python python-wrapper python3 tables
Last synced: 01 Mar 2026
https://github.com/prajjwol09/power-bi-project
The Data Survey Breakdown is an interactive Power BI dashboard designed to present insights gathered from a survey of professionals and enthusiasts in the data industry.
dashboard data interactive powerbi survey
Last synced: 15 Mar 2026
https://github.com/uznetdev/smoking-prediction
This project focuses on analyzing the "Smoking" dataset and building a predictive model for smoking status based on various health metrics. The goal is to identify factors influencing smoking behavior and develop a reliable model for prediction.
ai classification data data-science kaggle-competition machine-learning ml roc-auc sklearn smoking
Last synced: 17 Apr 2026
https://github.com/nivasharmaa/genetrack
A Java program for analyzing DNA sequences and identifying individuals based on Short Tandem Repeats (STRs). Features profile database creation, STR analysis, individual identification, and relationship detection.
data data-processing dna-analysis file-io-in-java genetic-analysis java-oop
Last synced: 25 Aug 2025
https://github.com/mevlutcelik/turkey-cities-data
📍 City data package for the cities of Türkiye: includes license plate code, coordinates (lat/lon), population (2024 ADNKS), and geographic region information.
cities coordinates data json nufus plaka turkey turkiye typescript
Last synced: 10 Mar 2026
https://github.com/abhishekn1947/samgov-scraper
Automated Python scraper for sam.gov contracts
analytics automation aws data pandas postgresql rds selenium webscraper
Last synced: 09 Apr 2026
https://github.com/kayahr/datastream
Data stream classes for writing and reading all kinds of data types, even single bits
data datastream input output stream typescript
Last synced: 01 Aug 2025
https://github.com/sorairolake/japanese-era-dataset
Dataset of Japanese era names
data dataset date japanese-calendar japanese-era json toml wareki yaml
Last synced: 01 May 2026
https://github.com/revolutionarybukhari/datawarehouse_meshjoin_superstore
A data warehouse built for a superstore's streaming data using extended mesh join, by Syed Husnain Haider Bukhari
data data-science data-warehousing meshjoin
Last synced: 23 May 2026
https://github.com/rodrigojunqueiradev/curso-python-3-do-basico-ao-avancado
Python 3 course from basic to advanced - with real projects
data data-analysis data-science python python-3 python-library python-script python3
Last synced: 27 Jun 2026
https://github.com/muhamedlabs/muhamed_onedrive
Muhamed_OneDrive is a reliable and convenient cloud file storage designed for secure storage and easy data sharing.
data html5 onedrive programming style
Last synced: 04 Jan 2026
https://github.com/jameshenderson12/chatbot-utils
Generic data and elements that can be reused or repurposed for chatbot development.
boilerplate chatbot data development elements intents template utterances
Last synced: 04 Mar 2026
https://github.com/cunfuu/network-bubbles
Makes it easier to manage organizations and keep notes about them, so you can organize events and easily look up their needs
data data-visualization organizations organizations-volunteer
Last synced: 31 Jul 2025
https://github.com/dineshram0212/youtube-analysis
This YouTube Analysis Package provides tools for analyzing YouTube video data, including metrics on views, likes, comments, and engagement trends. Ideal for gaining insights into video performance and audience interaction patterns.
data data-visualization pandas python webscraping youtube-api-v3
Last synced: 19 Jun 2026