data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum
- Last updated: 2026-07-02 00:07:45 UTC
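As a rough illustration of the definition above, the sketch below sorts the variables of a few records into quantitative and qualitative ones. The records, field names, and the `split_variables` helper are all hypothetical and exist only for this example.

```python
from numbers import Number

# Hypothetical records: each dict holds values of variables about one person.
records = [
    {"name": "Ada", "country": "UK", "age": 36, "height_cm": 165.0},
    {"name": "Linus", "country": "FI", "age": 54, "height_cm": 178.5},
]


def split_variables(rows):
    """Label each variable quantitative (all values numeric) or qualitative."""
    quantitative, qualitative = [], []
    for key in rows[0]:
        values = [row[key] for row in rows]
        # bool is a Number subclass in Python, so exclude it explicitly.
        if all(isinstance(v, Number) and not isinstance(v, bool) for v in values):
            quantitative.append(key)
        else:
            qualitative.append(key)
    return quantitative, qualitative


quantitative, qualitative = split_variables(records)
print("quantitative:", quantitative)
print("qualitative:", qualitative)
```

The check is deliberately crude: it treats any all-numeric column as quantitative, which would misclassify numeric codes such as postal codes.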
https://github.com/ksimicevic/discord-message-analyzer
Analyzing Discord messages in a Jupyter notebook
analysis data discord messages
Last synced: 16 Apr 2026
https://github.com/arjunrao87/world-countries-graphql-api
GraphQL API for retrieving information about countries of the world
countries data database geographic-data geography graphql world
Last synced: 10 May 2026
https://github.com/lemaitre4523/old-tiktok-data-report-explorer
An explorer for TikTok data reports
data explorer extract package report simple tdre tiktok tiktok-data-explorer
Last synced: 25 Sep 2025
https://github.com/canadaluke888/speedtable
Ultra-fast terminal table renderer written in C
c data datasets fast python python-wrapper python3 tables
Last synced: 01 Mar 2026
https://github.com/squareslab/frameworkstudytranscripts
archived data human-study zackc
Last synced: 06 Mar 2026
https://github.com/revolutionarybukhari/datawarehouse_meshjoin_superstore
A data warehouse generated for a superstore's streaming data using extended mesh join, by Syed Husnain Haider Bukhari
data data-science data-warehousing meshjoin
Last synced: 23 May 2026
https://github.com/dineshram0212/youtube-analysis
This YouTube Analysis Package provides tools for analyzing YouTube video data, including metrics on views, likes, comments, and engagement trends. Ideal for gaining insights into video performance and audience interaction patterns.
data data-visualization pandas python webscraping youtube-api-v3
Last synced: 19 Jun 2026
https://github.com/cnr-ibba/smarter-repository
SMARTER Data Repository
bootstrap5 data django repository smarter
Last synced: 03 Apr 2026
https://github.com/umrlastig/global-local
The Global-Local loop: bridging the gap between geospatial communities
challenges communities data fusion gaps geospatial perspectives
Last synced: 03 Apr 2026
https://github.com/epomatti/az-data-services
End-to-end scenario for Azure data services.
azure data data-engineering databricks datalake lake synapse terraform
Last synced: 17 Apr 2026
https://github.com/holo-nim/flue
data streaming options
data nim reader-writer streams
Last synced: 04 Apr 2026
https://github.com/bhavanachitragar/layoff_analysis
This Streamlit app is designed for Layoff Analysis. It allows users to explore and analyze layoff data from different perspectives, including overall analytics, country-specific insights, and individual company details.
data dataanalysis streamlit streamlit-webapp
Last synced: 18 Apr 2026
https://github.com/cunfuu/network-bubbles
For easier management of organizations: keep notes about them to organize events and easily access their needs
data data-visualization organizations organizations-volunteer
Last synced: 31 Jul 2025
https://github.com/neelamraikwar9/bookdata
This is my first assignment Git repository. I worked with book data and used Express.js to create routes and APIs for Post, Update, Delete, and Get operations.
api books data database deployment expressjs node nodejs postman postman-api
Last synced: 05 Apr 2026
https://github.com/mipacd/holochatstats
A VTuber chat log (and general) analytics platform
data flask hololive postgresql python visualization vtuber youtube
Last synced: 05 Apr 2026
https://github.com/codbex/codbex-hestia-data-sample
Sample data for codbex-hestia
Last synced: 05 Apr 2026
https://github.com/mksingh431/free-data-science-courses
Data science is a rapidly growing tech field that’s transforming business decision-making. To break into this field, you need the right skills. Fortunately, top institutions like Harvard and IBM offer free online courses. These courses cover everything from basic programming to advanced machine learning.
course data data-analysis data-science data-visualization free freecou python
Last synced: 19 Apr 2026
https://github.com/ahmad-ali-rafique/decision-tree-classifier-modeling
👏Comprehensive exploration of decision tree classifiers, including data cleaning, model building🏩, and performance evaluation on various datasets.
analytics classification classification-models data data-science dataanalytics datacleaning dataset decision-tree-classifier models
Last synced: 20 Apr 2026
https://github.com/istinnew/etl-pipeline-ganz-project
End-to-end ETL pipeline project for collecting, transforming, and loading data into a cloud-based database using Python, MySQL, and Google Cloud Analytics
cloud cloud-engineering cloud-services data data-science dataanalytics database database-schema googlecloud mysql mysql-database python python-lambda
Last synced: 20 Apr 2026
https://github.com/omers/sre-devops-tools
Tools and useful sources for SRE and DevOps
awsome awsome-list data devops monitoring sre tools
Last synced: 20 Apr 2026
https://github.com/farrelfaricaf/exploratorydataanalyst---titanic
This project analyzes the Titanic dataset using exploratory data analysis (EDA) and visualization techniques to identify survival patterns. The goal is to understand how demographic factors like gender and age influenced survival rates during the 1912 disaster.
data data-analysis data-science data-visualization eda python titanic-dataset
Last synced: 31 Jul 2025
https://github.com/prashhhant213/data_analysis_and_visualization-_for_streaming_platform
Data analysis and visualization for a streaming platform, providing insights and recommendations to grow its user base.
colab-notebook data datavisualization matplotlib numpy pandas python seaborn
Last synced: 20 Apr 2026
https://github.com/petermeissner/suuntor
Data from a Suunto watch extracted by R - !because!
automation data r rstats suunto windows
Last synced: 20 Apr 2026
https://github.com/zhukovanan/stepik_
Completed tasks from various data and computer science courses on Stepik
data statistical-learning statistics stepik-course
Last synced: 21 Apr 2026
https://github.com/nxion/sql-data-warehouse-project
Building a modern data warehouse with MS SQL Server, ETL processes, data modeling, and analytics.
data data-analysis data-analytics data-engineering data-lakehouse data-warehouse datalake datascience etl etl-job medallion-architecture ms mssql sql sql-query sql-server
Last synced: 05 Jun 2026
https://github.com/vck9521/traffic-accidents
In this project, we analyze the effects of various factors that correlate to traffic fatalities in the United States. Logistic regression is used, with the y variable being Fatality Rate (coded 0 for Survived, 1 for Fatality).
analysis data fatalities r regression rstudio traffic visualization
Last synced: 05 Jun 2026
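A minimal sketch of logistic regression on a binary outcome coded 0/1, as the description above mentions. This is not the repository's code (its tags indicate R); the toy data, the single hypothetical predictor `x`, and the `fit_logistic` helper are assumptions made up for illustration.

```python
import math


def sigmoid(z):
    """Map a real-valued score to a probability in (0, 1)."""
    return 1.0 / (1.0 + math.exp(-z))


def fit_logistic(xs, ys, lr=0.5, epochs=5000):
    """Fit P(y=1 | x) = sigmoid(w*x + b) by batch gradient descent."""
    w, b = 0.0, 0.0
    n = len(xs)
    for _ in range(epochs):
        # Residuals (predicted probability minus observed 0/1 label).
        errors = [sigmoid(w * x + b) - y for x, y in zip(xs, ys)]
        grad_w = sum(e * x for e, x in zip(errors, xs)) / n
        grad_b = sum(errors) / n
        w -= lr * grad_w
        b -= lr * grad_b
    return w, b


# Made-up data: x is a hypothetical risk factor, y is 0 (Survived) or 1 (Fatality).
xs = [0.5, 1.0, 1.5, 2.0, 2.5, 3.0, 3.5, 4.0]
ys = [0, 0, 0, 1, 0, 1, 1, 1]

w, b = fit_logistic(xs, ys)
print("P(fatality | x=0.5):", round(sigmoid(w * 0.5 + b), 3))
print("P(fatality | x=4.0):", round(sigmoid(w * 4.0 + b), 3))
```

The toy labels overlap on purpose (x=2.0 is a fatality, x=2.5 is not), so the classes are not perfectly separable and the fitted weights stay finite.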
https://github.com/rbcavi/factorio-mod-data
The modpack data for factorio-viewer
data factorio factorio-data factorio-mod-data
Last synced: 23 Apr 2026
https://github.com/elcarrillo/structpy
StructPy is a Python-based command-line tool designed for academics and scientists to manage data projects effectively. It simplifies workflows by creating structured project directories, generating timestamped filenames, validating datasets, and backing up projects seamlessly.
command-line-tool data database file-structure organization python science-tool
Last synced: 24 Apr 2026
https://github.com/gaemapiracicaba/norma_dec_8468-76
Water quality and effluent discharge standards for inland waters
Last synced: 19 Apr 2026
https://github.com/marielachirinosr/cyclistic-data-analytics-project
This project explores user behavior within a fictional bike-sharing system, modeled after Cyclistic, operating in Chicago.
data data-visualization pandas powerbi-report powerbi-visuals python
Last synced: 24 Apr 2026
https://github.com/carlos-levi/twitterbots_analise_redesneurais
Project for an AI course: exploratory analysis and application of machine learning techniques to detect automated accounts (bots) on the 𝕏 (Twitter) platform
data machine-learning twitter-bot
Last synced: 06 Jun 2026
https://github.com/datannur/datannur
datannur is an open source, lightweight and sovereign data catalog
catalog data data-catalog data-governance data-management dcat dcat-ap dcat-ap-ch metadata open-data open-source public-sector svelte swiss switzerland
Last synced: 07 Jun 2026
https://github.com/sagarkhese40/prediction-with-binomial-logistic-regression
bank data excel logistic-regression python
Last synced: 26 Apr 2026
https://github.com/fatihemres/africa
Africa app built with SwiftUI, using AVFoundation, MapKit, data, models, animations, and stickers.
animations avfoundation data mapkit models swift swift-animations swiftui
Last synced: 27 Apr 2026
https://github.com/gurpreet0022/crop-fertilizers-recommendation-system-using-ml-
This repository is a part of AICTE - Shell Internship on 'Green Skills using AI technologies' Cycle 3.
data datapreprocessing datavisualization jupyter-notebook machine-learning python
Last synced: 27 Apr 2026
https://github.com/gngdb/llamass
LLAMASS is an arbitrary collection of tools I've put together to deal with motion data
Last synced: 28 Apr 2026
https://github.com/greedchikara/dsajs
Data Structures and Algorithms written in JavaScript
Last synced: 09 Apr 2026
https://github.com/priyanshubiswas-tech/e-commerce_data_analysis
Analyzes 9,994 e-commerce transactions to uncover insights on sales trends, customer behavior, profitability, and logistics using EDA and visualization. Identifies top products, customer segments, and shipping efficiencies to optimize marketing, inventory, and operations, making it valuable for retail, finance, and logistics.
data data-analysis data-visualization pandas pandas-dataframe plotly-analytics-projects plotly-express python
Last synced: 28 Apr 2026
https://github.com/darrendavy12/earthquake-events-and-risks-project---azure-data-pipeline---api-connection-
Earthquake Events and Risks Project - Azure Data Pipeline - API Connection
azure blob-storage cloud cloudstorage data databricks databricks-notebooks databricks-workspace dataengineer dataengineering microsoft python
Last synced: 28 Apr 2026
https://github.com/iammahesh123/spring-annotations-demo
This project serves as a demonstration of various annotations used in the Spring Framework.
autowire bean component configuration controller data document postmapping repository requestmapping scope service spring
Last synced: 29 Apr 2026
https://github.com/barkintopcu/apple-stock-prediction-edu
The purpose of this project is to demonstrate time series analysis techniques using real-world stock data, without offering any form of financial advice or investment suggestion.
data deep-learning forecasting machine-learning python
Last synced: 29 Apr 2026
https://github.com/chandansoren/financial-budget-analysis
Financial budget for 2021
Last synced: 29 Apr 2026
https://github.com/ozgrozer/electron-store-data
A Node.js module to store Electron data in the computer
Last synced: 29 Apr 2026
https://github.com/smokingplaya/gm_datastorages
💖 Data Storages like in JavaScript.
Last synced: 29 Apr 2026
https://github.com/fs23yayan/membuatfungsidatapemrosesan
Creating Data Processing Functions for Data Science in Marketing: Customer Segmentation with Python - Part 2
Last synced: 29 Apr 2026
https://github.com/dhruvsrikanth/superconductor-regression-kaggle-challenge
Kaggle challenge based on superconductor dataset.
data data-science jupyter-notebook kaggle kaggle-challenge kaggle-competition lasso-regression linear-regression machine-learning python random-forest regression sklearn support-vector-regression
Last synced: 30 Apr 2026
https://github.com/priyam-hub/covid-19-data-analysis
Explore COVID-19 case numbers and deaths from the 2019/2020 coronavirus outbreak using pandas in a Jupyter notebook
analysis data data-visualization jupyter-notebook machine-learning python
Last synced: 08 Jun 2026
https://github.com/mmaithani/kaggle-projects
A collection of resources from the competition, kernel, and data sections, plus the "magic" code I have been using to get the most out of a problem
computer-vision data data-science image-processing machine-learning python
Last synced: 30 Apr 2026
https://github.com/ddeepanshu-997/datascience-e-commerce-shopping-details-
In this project I apply data preprocessing techniques to clean the dataset using libraries, derive insights to find the hot picks in shopping, and use data visualization libraries to reveal trends and other aspects, as a small contribution to the field of data science.
cleaning-data data data-science data-visualization dataframe datapreprocessing dataset libraries matplotlib-pyplot numpy pandas plots python visualization
Last synced: 30 Apr 2026
https://github.com/dhimmel/hgnc
Extracting human gene families from HGNC
data gene-families genes hgnc hugo human
Last synced: 01 May 2026
https://github.com/dantetrb/diabetes-readmission-dbt
Predictive analytics on diabetic patient readmissions using dbt, DuckDB and Python – with explainability and clustering.
clustering data dataengineering dbt diabetes duckdb hdbscan healthcare jupyter lime readmission-prediction sql
Last synced: 01 May 2026
https://github.com/dnut/associations
Python 3 library to identify high-dimensional statistical relationships in any data set.
analytics arch-linux association-rules data data-analysis data-mining data-science machine-learning python-modules
Last synced: 01 May 2026
https://github.com/skygenesisenterprise/aether-meet
Aether Meet is a lightweight, open-source client built for privacy, speed, and seamless integration within the Aether Office ecosystem
applications data docker javascript meeting nextjs notes typescript voip
Last synced: 01 May 2026
https://github.com/linguini1/edueval
The BorealisAI Let's Solve It mentorship project: summarizing student feedback submissions on their professor into one cohesive paragraph for faculty consideration during performance reviews.
ai data data-analysis data-science machine-learning machinelearning nlp python pytorch sentiment-analysis
Last synced: 01 May 2026
https://github.com/sorairolake/japanese-era-dataset
Dataset of Japanese era names
data dataset date japanese-calendar japanese-era json toml wareki yaml
Last synced: 01 May 2026
https://github.com/muhammadadilnaeem/bcg-data-science-job-simulation-on-forage-august-2024
This repository contains all the tasks, code, and documentation completed during the BCG Data Science job simulation on The Forage platform. The simulation focused on analyzing customer churn, building predictive models, and presenting insights for a major utility company.
bcg customer-churn-prediction-with-machine-learning data data-science forage numpy pandas
Last synced: 01 May 2026
https://github.com/gcoronelc/cepsuni-disbd-64505
Database Modeling Workshop with Gustavo Coronel
data database databases db2 db2-database modeling oracle oracle-database relational-database relational-database-design relational-databases relationships sql sql-server
Last synced: 02 May 2026
https://github.com/waseemofficial/ml-practice
ML Practice
data data-analysis jupyter-notebook machine-learning ml python
Last synced: 02 May 2026
https://github.com/gcoronelc/ucv_gdi-1_202302-a2
Data and Information Management I Workshop with Gustavo Coronel.
data data-science database databases machine-learning machinelearning oracle sql sql-server
Last synced: 02 May 2026
https://github.com/sonwaneshivani/data-science-learning
Basics of Python
css data data-science deep-learning flask gen-ai html ml nlp
Last synced: 02 May 2026
https://github.com/s1dewalker/electric-future
Visual Analysis: Future of Automotive Industry
data data-visualization machine-learning python3 regression-analysis tableau
Last synced: 02 May 2026
https://github.com/jesuscc1993/data-cleaner-extension
Clears browser data in a single click.
application-data chrome chrome-extension data
Last synced: 02 May 2026
https://github.com/badranalyst/movie-correlation-analysis-in-python
This project analyzes movie data correlations using Python libraries like Pandas, NumPy, Seaborn, and Matplotlib. It examines relationships between attributes such as ratings, genres, and box office performance to uncover trends that inform recommendations and enhance understanding of movie success factors.
data data-analysis dataset jupyter jupyter-notebook matplotlib matplotlib-pyplot numpy pandas python seaborn
Last synced: 03 May 2026
https://github.com/anyantudre/associate-data-scientist-track
Materials for the Associate Data Scientist in Python track on DataCamp.
data data-science experimental-design hypothesis-testing machine-learning matplotlib-pyplot pandas python regression sampling seaborn statistics statsmodels unsupervised-learning
Last synced: 03 May 2026
https://github.com/asacxyz/flutter_aplicando_persistencia_de_dados
Companion repository for the course "Flutter: applying data persistence"
dart data data-storage flutter persistence persistent-storage sqflite sql sqlite
Last synced: 03 May 2026
https://github.com/supunlakmal/coronavirus-covid-19-status
COVID-19 case and death counts for each country in a JSON file.
coronavirus count country covid-19 covid-data covid19 data data-science data-visualization geographical geographical-information-system json
Last synced: 21 Jun 2026
https://github.com/yash-chauhan-dev/spark_cluster_docker
Set up a local Spark cluster, Hadoop (HDFS), Airflow, and PostgreSQL on Docker with ease, without any local installations
apache-spark data data-engineering data-engineering-pipeline deployment docker docker-compose hadoop hdfs local-development localhost pyspark python
Last synced: 04 May 2026
https://github.com/srking501/uk-groceries-images
Repository Containing UK Groceries Images
data groceries grocery images links playwright playwright-python webscraping-data webscrapper
Last synced: 04 May 2026
https://github.com/neptun-software/neptun.data.generators
Send scraped data from neptun-scraper to ChatGPT to generate training data for NEPTUN.AI.
Last synced: 30 Jul 2025
https://github.com/dineshdhamodharan24/data-analysis
Probability analysis of customers and basic analysis
analysis data powerbi probability python visualization
Last synced: 23 Jun 2026
https://github.com/kasunjayasanka/simple-backend-database-data-retrieval
Simple HTML form for inserting and retrieving data from Firebase Realtime Database
bootstrap css3 data firebase firebase-realtime-database html5 insert-data javascript retrieve-data
Last synced: 05 May 2026
https://github.com/chompfoods/stub-nodejs-server
Node.js server stub for the Chomp Food & Recipe Database API. Use our API to get high-quality data on recipes and 875,000+ branded/grocery foods plus raw ingredients.
api branded chomp data database food grocery ingredients node node-js node-server nodejs nutrtion raw recipe-api recipes server server-stub stub stub-server
Last synced: 05 May 2026
https://github.com/muthupillai1204/diwali_sales_analysis
The Diwali sales analysis reviews past data to identify trends, peak buying times, popular products, and customer demographics. It assesses sales volume, revenue growth, and promotional effectiveness, helping businesses optimize marketing and inventory for future seasons.
data datacleaning eda excel jupyter-notebook matlplotlib numpy pandas python seaborn visualization
Last synced: 05 May 2026
https://github.com/julienmalka/shiftgenerator
ShiftGenerator WeSki 2018
data data-science latex python
Last synced: 06 May 2026
https://github.com/ksm26/ml-ai-data-science-jobs-in-canada
Explore the latest machine learning, artificial intelligence, and data science job opportunities in Canada. Stay informed about Canadian tech job market trends and find your next career move.
ai-canada ai-careers canada canadian-tech-companies canadian-tech-job-market data data-analysis data-engineering data-science data-science-careers machine-learning prompt-engineering robotics
Last synced: 06 May 2026
https://github.com/parthds02/analyzing-student-success-with-data
Discover key factors influencing student performance through data analysis and visualization. Explore gender, parental education, sports, and ethnicity impacts.
data datascience jupyter-notebook kaggle python pythonlibraries
Last synced: 06 May 2026
https://github.com/amazenmb/web-scraping
Web Scraping Methods using Python
analytics beautifulsoup data lxml pyautogui-automation python scheduling schedulingscraping selenium webdriver webscraping xpath
Last synced: 06 May 2026
https://github.com/ekoepplin/dbt-bigquery-core
How to load data into BigQuery (or DuckDB) and set up dbt tests for Soda Cloud monitoring
bigquery data data-quality dbt dlt duckdb gcp soda
Last synced: 06 May 2026
https://github.com/ashleydavis/brisjs-web-scraping-talk
Code to accompany my talk on web scraping for the Brisbane JavaScript meeting in September 2018
cheerio data data-acquisition data-acquisiton electron headless-browsers javascript nightmare nightmarejs nodejs web-scraping
Last synced: 06 May 2026
https://github.com/shantanujpk/bigdatacloud
Exploration of PySpark for data processing and interview prep — demonstrates handling corrupted records, applying transformations/actions, and building efficient data pipelines with practical examples.
big-data data jupyter-notebook pipeline pyspark python spark sparksql
Last synced: 07 May 2026
https://github.com/pocketfullofdata/electric-vehicles-market-size-analysis
This project analyzes the growth, adoption trends, and future projections of the electric vehicle (EV) market. Using data analysis and visualization techniques, it examines key factors such as sales trends and consumer adoption to understand the evolving landscape of the EV industry.
analysis data jupyter-notebook matplotlib numpy python seaborn vscode
Last synced: 07 May 2026
https://github.com/danyal-faheem/project-logs-analyzer
This repo contains scripts to analyze project logs and display some charts related to the data
data data-visualization matplotlib pandas python streamlit
Last synced: 07 May 2026
https://github.com/kemalcalak/python
computer-vision data data-science fastapi image-processing jupyter-notebook machine-learning python
Last synced: 08 May 2026