data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum
- Last updated: 2026-07-03 00:07:49 UTC
https://github.com/charityeverett/gobackfetchit
Award-winning WebXR data journalism storytelling project
3d aframe ar css data html html-css-javascript nodejs visuzalization vr webxr xr
Last synced: 03 May 2026
https://github.com/tn3w/moviedb-json
A JSON library with 981,530 films.
data database db json movie movie-database movies
Last synced: 03 May 2026
https://github.com/didier/functional-programming
Functional Programming subject of @CMDA-TT
convenience d3 d3-visualization d3js data datavis datavisualization front-end functional functional-programming interactive jsdoc node nodejs-modules parking-spots typescript
Last synced: 03 May 2026
https://github.com/yugsumeet17/churn-analysis-project--power-bi-sql-machine-learning
Dataset Explained, Project Goals & Metrics Required, SQL Server ETL & Data Cleaning, Power BI Data Load, Transformation, Blueprint & Measures, Power BI Visualization - Summary Page, Building Machine Learning Model - Random Forest, Power BI Visualization - Churn Prediction Page
data data-visualization dataanalytics excel postgresql powerbi python3
Last synced: 03 May 2026
https://github.com/yash-chauhan-dev/spark_cluster_docker
Set up a local Spark cluster, Hadoop (HDFS), Airflow, and PostgreSQL on Docker with ease, without any local installations
apache-spark data data-engineering data-engineering-pipeline deployment docker docker-compose hadoop hdfs local-development localhost pyspark python
Last synced: 04 May 2026
https://github.com/joelgombin/intro_r_iau
Introduction to R #WeData
data data-science dataviz gis r
Last synced: 04 May 2026
https://github.com/srking501/uk-groceries-images
Repository Containing UK Groceries Images
data groceries grocery images links playwright playwright-python webscraping-data webscrapper
Last synced: 04 May 2026
https://github.com/damisparks/become_data_analyst
Are you new to data analysis? Here you will find a simple notebook that will help you through your journey. These are personal projects that I am still working on.
data data-analysis data-visualization matplotlib numpy pandas-tutorial
Last synced: 04 May 2026
https://github.com/rabeal21/tea
Generate random TEA wallet addresses in bulk with this simple utility. Perfect for testing and exploring the TEA blockchain. 🌱💻
bucklescript bucklescript-tea chinese-translation cli data earlgrey educators hacking ios-automation ios-test ocaml peer-evaluations php red-team teachyourselfcs test-framework translation tui
Last synced: 04 May 2026
https://github.com/jdanielgoh/cobertura-campanias
In a democracy, is there room for every voice? A project to visualize the INE's radio and TV monitoring of the 2024 presidential candidacies
d3js data datavisualization vue
Last synced: 09 Jun 2026
https://github.com/gabya06/twitter_models
Repository used for twitter impression models
data data-science impressions machinelearning python ridge-regression sklearn twitter
Last synced: 04 May 2026
https://github.com/farhad2415/job_scraper
Job-site-based job scraping with Python
automation bash-script data data-scraping data-structures python selenium selenium-python
Last synced: 05 May 2026
https://github.com/rrwen/py-examples
Collection of python examples in each branch
beginner data download excel guide introduction links processing python reference spreadsheet url xls
Last synced: 05 May 2026
https://github.com/contawo/travel-journal
This is a travel journal application for storing all the places that you have visited. I was learning React by doing when I created this project. I learned a lot from it and improved my React.js skills.
data learning-by-doing props reactjs
Last synced: 05 May 2026
https://github.com/munas-git/codm-review-analysis-and-predictions
Sentiment analysis on Call of Duty Mobile Google Play Store user reviews with ML model to classify new reviews.
data flask machine-learning python sentiment-analysis
Last synced: 05 May 2026
https://github.com/mito-ds/mitosheet_helper_config
The mitosheet_helper_config package used by enterprises to configure the mitosheet package.
data data-analytics data-science data-visualization jupyter pandas python
Last synced: 05 May 2026
https://github.com/julienmalka/shiftgenerator
ShiftGenerator WeSki 2018
data data-science latex python
Last synced: 06 May 2026
https://github.com/shreshthvashisht/instgram-user-analytics
SQL Fundamentals
data data-analysis data-science mysql social-network-analysis
Last synced: 09 Jun 2026
https://github.com/ksm26/ml-ai-data-science-jobs-in-canada
Explore the latest machine learning, artificial intelligence, and data science job opportunities in Canada. Stay informed about Canadian tech job market trends and find your next career move.
ai-canada ai-careers canada canadian-tech-companies canadian-tech-job-market data data-analysis data-engineering data-science data-science-careers machine-learning prompt-engineering robotics
Last synced: 06 May 2026
https://github.com/tadiusfrank2001/pythonprojects
A compilation of fun introductory Python lab coding projects covering the fundamentals of data science, databases, and Python libraries
data data-science databases gamedesign python pythonlibrarires sorting-algorithms sqlite string-manipulation
Last synced: 06 May 2026
https://github.com/jbn/vaquero
A Python library for iterative and interactive data wrangling at laptop-scale.
data data-analysis data-cleaning data-mining dirty-data elt etl etl-framework
Last synced: 10 Jun 2026
https://github.com/ekoepplin/dbt-bigquery-core
How to get data into BigQuery (or DuckDB) and set up dbt tests for Soda Cloud monitoring
bigquery data data-quality dbt dlt duckdb gcp soda
Last synced: 06 May 2026
https://github.com/ralzz/dibimbing_datascience
This project contains an Exploratory Data Analysis (EDA) of the Estonia Passenger List dataset. I handled missing values, removed duplicate data, and created basic visualizations to find insights.
data data-science eda google-colab kaggle pandas python
Last synced: 06 May 2026
https://github.com/iv4n-ga6l/functional-dataprocessing-pipeline
A functional data processing pipeline that accepts an input file, allows specifying both input and output formats, applies specified transformations, and produces a resulting output file.
csv data datapreprocessing excel json pandas parquet pipeline python
Last synced: 06 May 2026
https://github.com/fabsdevx/file-format-converter-handout
Data Engineering project for learning purposes. Credits to itversity
csv csv-import data data-engineering database pandas python
Last synced: 06 May 2026
https://github.com/juanpablodiaz/beertv
A Next.js full-stack app that displays funny beer TV ads
api-routes data next tailwindcss
Last synced: 07 May 2026
https://github.com/shantanujpk/bigdatacloud
Exploration of PySpark for data processing and interview prep — demonstrates handling corrupted records, applying transformations/actions, and building efficient data pipelines with practical examples.
big-data data jupyter-notebook pipeline pyspark python spark sparksql
Last synced: 07 May 2026
https://github.com/bryanhe24/data_analysis_app
A full-stack web application that allows users to upload CSV datasets, analyze the data with statistical summaries and visualizations, and interact with an AI-powered assistant for querying the dataset.
ai data data-analysis data-visualization fullstack-development javascript math python reactjs
Last synced: 07 May 2026
https://github.com/hudson-newey/data-miner
A simple data miner that collects information from an API and stores it in a file
api api-client big-data bigdata data logger logging
Last synced: 10 Jun 2026
https://github.com/safwan2003/randomforest_heart_disease_prediction
A machine learning project using Random Forest Classifier to predict heart disease. Includes data preprocessing (with binning), feature selection, and model evaluation.
binning data data-science datapipeline datapreprocessing datavisaulization deep-learning machine-learning python random-forest-classifier streamlit
Last synced: 07 May 2026
https://github.com/murtaza-arif/all-you-need-to-know-for-data-engineer
This repository is designed to showcase various aspects of data engineering, including tools, frameworks, and end-to-end projects. It covers everything from data ingestion and transformation to data warehousing and cloud-based solutions.
cassandra data data-engineering data-science kafka kafka-consumer kafka-streams pyarrow spark
Last synced: 07 May 2026
https://github.com/jigyasag18/iit-guhawati
Empower Sakhi is a data-driven platform that uses machine learning to identify women at risk of domestic violence in India. It offers confidential self-assessments, survivor stories, and emergency resources through a trauma-informed, privacy-focused web app. The project also provides NGOs with actionable insights via Power BI dashboard for support.
aiml data dataset datavisualization domestic-violence eda jupyter-notebook label-encoding machine-learning machine-learning-algorithms machine-learning-models machinelearning machinelearningprojects powerbi python python-app random-forest random-forest-classifier streamlit streamlit-webapp
Last synced: 08 May 2026
https://github.com/aidan-zamfir/the-iliad
Data analysis & relationship network for the characters of Homer's Iliad
data data-analysis dataframes networks networkx python selenium spacy webscraping
Last synced: 08 May 2026
https://github.com/zsvoboda/olympics
Self service analytics of 120 years of Olympics data
analytics dashboards data datavisualization dataviz olympics open-data open-datasets opendata reports
Last synced: 08 May 2026
https://github.com/blackhatdevx/leetcode
LeetCode Solutions by Jash Gro
algorithm algorithms dart data datastructures datastructures-algorithms dsa java javascript leetcode leetcode-java leetcode-python leetcode-solutions neetcode
Last synced: 08 May 2026
https://github.com/juanpablo70/pgad-assignment01
Breast Cancer Coimbra data set analysis
data data-science dataframe dataset jupyter-notebook matplotlib numpy pandas python
Last synced: 08 May 2026
https://github.com/vanshuchaudhary/flightpriceanalysis-
The uploaded file is a Jupyter Notebook titled "Flight Analysis". It likely involves analyzing flight-related data, potentially exploring trends, patterns, or insights using data science techniques. The analysis might include data visualization, statistical analysis, or predictive modeling.
business-analytics data data-analysis data-visualization datainsights datascience matplotlib-pyplot python seaborn seaborn-plots seaborn-python sns statistical-analysis
Last synced: 08 May 2026
https://github.com/writetome51/page-load-access
A TypeScript/JavaScript class that loads a batch (array) of data from a larger set too big to be loaded all at once.
batch class data javascript load loader typescript
Last synced: 16 May 2026
https://github.com/writetome51/public-data-container-interface
Just a TypeScript interface with 1 property: 'data'
container data interface typescript
Last synced: 15 May 2026
https://github.com/miniql/miniql-csv
A MiniQL query resolver that loads data from CSV files.
comma-separated-values csv data query query-language
Last synced: 08 May 2026
https://github.com/natarizkie2/neurochain-airdrop-bot
🍋 — A smart bot designed to complete data tasks like true/false selections automatically, with multi-account support for extra convenience.
airdrop automated bot data multi-account natarizkie neurochain nodejs web3
Last synced: 10 Jun 2026
https://github.com/taquece/goals-per-match
Basic script to calculate average football goals per match from a CSV file
beginner csv data football nodejs python sports-analytics
Last synced: 09 May 2026
https://github.com/chompfoods/sdk-typescript-angular
Angular TypeScript SDK for the Chomp Food & Recipe Database API. Use our API to get high-quality data on recipes and 875,000+ branded/grocery foods plus raw ingredients.
angular api branded chomp data database food grocery ingredients nutrition raw recipe-api recipes sdk typescript
Last synced: 09 May 2026
https://github.com/basemax/okala-product-ids
A PHP script to fetch and save product IDs from Okala's online store API across multiple categories and store branches.
crawler crawler-okala crawler-php crawlers data database ids ir iran json okala okala-crawler php php-crawler product
Last synced: 09 May 2026
https://github.com/abhroroy365/market_analysis
This project explores customer segmentation and market analysis in the context of online retail using an online retail dataset. By applying advanced analytics, we aim to uncover insights that can drive strategic decisions and enhance business performance.
clustering data data-analysis data-visualization kmeans-clustering machine-learning market-analysis python silhouette-analysis
Last synced: 09 May 2026
https://github.com/flexthink/matricize
A convenience library to convert between pure Python objects and their vectorized representations
data machine-learning numpy python
Last synced: 09 May 2026
https://github.com/naitiknayak196/tech-layoffs-cleaning-sql-vs-python
This project cleans and analyzes a tech layoffs dataset using MySQL and Python (Pandas) to compare their efficiency in data processing. It provides business insights into workforce trends, industry stability, and economic impacts to support data-driven decision-making.
data datacleaning dataset jyputer-notebook layoffdata layoffs mysql python sql
Last synced: 09 May 2026
https://github.com/mohamedbilal1800/olympic_history_data_analysis
This project delves into the 120 Years of Olympic History: Athletes and Results dataset, analyzing athlete demographics, medal achievements, and country performances across the Summer and Winter Olympics from 1896 to 2016.
analysis data eda matplotlib-pyplot pandas python seaborn visulaization
Last synced: 09 May 2026
https://github.com/xiaomingx/10000-public-apis-and-data
Public APIs are interfaces that allow developers to access various services, features, or data from external systems or platforms.
api-ecosystem api-integration data developer-friendly-apis open-api-access public-api-tools third-party-services
Last synced: 30 Jul 2025
https://github.com/baranasoftware/curricular-api
The design and implementation of a REST API for student and course data for a Higher Ed institution.
aws data data-pipeline go golang lambda rest rest-api sqlite3 system-design terraform
Last synced: 09 May 2026
https://github.com/scjoaoantonio/trab_datascience
This project aims to analyze posts from the Bluesky social network. The interactive application was developed with Streamlit and supports data collection and visualization, as well as advanced analyses such as engagement prediction, topic modeling, and sentiment analysis.
bluesky data data-science streamlit
Last synced: 09 May 2026
https://github.com/khushi-sabarad/data_analysis
LinkedIn Learning capstone project
data data-engineering matplotlib pandas python
Last synced: 10 May 2026
https://github.com/beeracs/llama
Run Llama models in your web browser using JavaScript and WebAssembly. Explore light and dark modes easily. 🌐🐱👤
ai data fine-tuning framework gpt langchain large-language-models llama3 llamaindex llm lora machine-learning nlp peft qlora qwen rlhf vllm
Last synced: 10 May 2026
https://github.com/datasqlsantosh/global-energy-consumption-renewable-generation-python-data-analysis-portfolio
This project focuses on analyzing global energy consumption patterns and trends in renewable energy generation using Python data analysis libraries such as Seaborn and NumPy. The analysis aims to explore energy consumption data from various regions worldwide and examine the contribution of renewable energy sources over time
data data-analysis data-visualization pandas seaborn
Last synced: 10 May 2026
https://github.com/adrianoleitedasilva/adrianoleitedasilva
My name is Adriano. I am 35 years old, with 18 of those years dedicated to the fields of Information Technology and Education.
adrianoleitedasilva automation ceo cio cto data data-science dev diretor github mobile professor python readme techlead web
Last synced: 10 May 2026
https://github.com/hemangsharma/assignment-2---classification-models
The Assignment 2 - Classification Models repository contains the project for 36106 Machine Learning Algorithms and Applications
data datascience-machinelearning machine-learning ml
Last synced: 10 Jun 2026
https://github.com/brightway-lca/bw_io
IO tools for Brightway LCA framework
bw3 data life-cycle-assessment python
Last synced: 10 Jun 2026
https://github.com/afeiship/data-arary
Data array with some new methods.
array data data-structure js list
Last synced: 11 May 2026
https://github.com/sehaj003/boston-bruins-roster-planning-mysql-nosql
Repository for Data Management project, Boston Bruins Roster Planning using MySQL and NoSQL along with data analysis using Python
data data-management mongodb mysql project-repository python
Last synced: 11 May 2026
https://github.com/sakan811/honkai-star-rail-a-few-fun-insights-with-data-analysis
The project offers insights into the stats of all Honkai Star Rail characters available as of the given date.
data data-analysis data-science data-visualization docker flask game honkai honkai-star-rail honkai-starrail seaborn webscraping webscraping-data webscraping-selenium
Last synced: 10 Jun 2026
https://github.com/sebastian-diaz-berdecia/analisis-popularidad-de-series-y-generos-de-series
SQL queries for analyzing the popularity of series and series genres in the NetflixDB database.
business-analytics bussiness-intelligence data data-analysis database mysql mysql-database sql
Last synced: 12 May 2026
https://github.com/mateogiuffra/estrd2024s1
Coursework completed for the Data Structures (Estructura de Datos) course at the Universidad Nacional de Quilmes (UNQ)
c cpp data data-structures-and-algorithms eficiency functional-programming haskell unq
Last synced: 12 May 2026
https://github.com/vbhatsaccnt/retail-strategy-and-analytics-optimization-of-control-stores-for-sales-enhancement
In this project, we aim to optimize the performance of retail chain stores by establishing control stores based on their performance compared to selected trial stores. By leveraging data analytics and strategic insights, we seek to enhance sales revenue and drive growth within the retail chain.
customer-segmentation data data-science risk-analysis
Last synced: 13 May 2026
https://github.com/gregorybchris/pca
PCA assignment for Park Tudor
analysis component data display embedding pca principal projection teach
Last synced: 13 May 2026
https://github.com/cvinicius987/projetos-bigdata
Case studies involving Big Data and Data Engineering projects.
bigdata data data-engineering spark
Last synced: 13 May 2026
https://github.com/triboot/ultimate-playerprefs
This repository was created only for **Issue-Tracking** and the **Wiki-Documentation** (Wiki docs are coming soon) of Ultimate PlayerPrefs. The plugin source code is fully open and available after purchase on the Unity Asset Store.
anticheat assetstore data datamanagement easy manager persistent-storage playerprefs playerprefsmanager savegame storage tools unity unity3d unity3d-editor
Last synced: 11 Jun 2026
https://github.com/andygol/andygol.github.io
Andrii Holovin – Product & Project Manager Geospatial Expert / OpenStreetMap Consultant / DevOps practitioner
consultant data data-structures devops experience floss gis mapping navigation openstreetmap personal-site personal-website
Last synced: 13 May 2026
https://github.com/m0nica/datalogues-refresh
Programming blog focused on data with an emphasis on exploration in Python.
data jekyll python technical-writing
Last synced: 14 May 2026
https://github.com/prakhargpt/sql-data-warehouse-project
Building a data warehouse project using SQL Server, including ETL processes, data modelling and analytics.
analytics data data-analysis data-cleaning data-engineering data-engineering-pipeline data-lakehouse data-science data-warehouse etl etl-job etl-pipeline medallion-architecture sql sql-server
Last synced: 12 Jun 2026
https://github.com/iannil/one-data-studio
one-data-studio integrates a data governance and development platform, a cloud-native MLOps platform, and a large model application development platform. It connects the entire value chain from raw data governance to model training and deployment, and further to the construction of generative AI applications.
Last synced: 12 Jun 2026
https://github.com/shashwat9kumar/trends_in_a_country_on_twitter
Finding trending topics in each country on Twitter and visualizing them in a word cloud
data data-visualization trends tweepy twitter-api wordcloud
Last synced: 13 Jun 2026
https://github.com/asjadnaqvi/stata-tidytuesday
A Stata package for fetching Tidy Tuesday meta data and files
Last synced: 13 Jun 2026
https://github.com/word2vect/beijing-new-house-data-visualization
Beijing New House Data Visualization for Python Programming 2024 Fall Data Visualization Lab
Last synced: 13 Jun 2026
https://github.com/cannt39t/wylsacom-analysis-reflinks-datamining
data data-analysis data-mining python3 sql
Last synced: 13 Jun 2026
https://github.com/word2vect/beijing-pm2.5-data-process
Beijing PM2.5 Data Process for Python Programming 2024 Fall Data Visualization Lab 2
Last synced: 15 Jun 2026
https://github.com/marielachirinosr/nyc-taxi-trip-exploration-2019-2020
Explores passenger behavior & impact of COVID-19 on NYC taxi industry (Q1 2019-2020).
bigquery data data-analysis data-visualization python sql tableau
Last synced: 15 Jun 2026
https://github.com/lafkpages/minecraft-crafting-info
Scrapes https://www.minecraftcrafting.info for crafting recipes.
Last synced: 17 Jun 2026
https://github.com/ibttf/bayborhood
Interactive map to find the ideal neighborhood in San Francisco based on data.
data data-analysis data-visualization gis mapbox react
Last synced: 18 Jun 2026
https://github.com/dushansenadheera/web_scraper
web scraper using Python along with BeautifulSoup and Selenium
beautifulsoup data python selenium web-scraping
Last synced: 19 Jun 2026
https://github.com/rylan12/apscores
A quick way to visualize how the AP score distributions have changed from year to year.
advanced-placement analysis ap-exam data scores
Last synced: 19 Jun 2026
https://github.com/artcc/coredatademo
Demo for CoreDataGenericModule implementation
core coredata coredata-model data encrypted encrypted-data encryption persist
Last synced: 19 Jun 2026
https://github.com/svetlanam/kbl-to-csv-s3
Keboola extractor that converts Excel files to CSV based on input mapping criteria and uploads them to an S3 bucket
data data-cleaning data-transformation etl keboola s3-bucket
Last synced: 20 Jun 2026
https://github.com/g-schumacher44/analyst_resource_hub
A collection of guidebooks, quickref, and resources for data analysis
analytics bigquery data lookerstudio machine-learning model python sql yaml-configuration
Last synced: 20 Jun 2026
https://github.com/supunlakmal/coronavirus-covid-19-status
COVID-19 case and death counts for each country in a JSON file.
coronavirus count country covid-19 covid-data covid19 data data-science data-visualization geographical geographical-information-system json
Last synced: 21 Jun 2026
https://github.com/peterhellberg/bugsnag-data
Dump Bugsnag data using the Data access API
Last synced: 22 Jun 2026
https://github.com/hlan22/2025-03-18-data-validation
(no longer useful) DSCI 310 Lecture about Data validation and code testing! Made in tandem with:
Last synced: 23 Jun 2026
https://github.com/anburocky3/cbse-schools-data
Fetch CBSE Schools in seconds and use it for your data projects
cbse data data-analysis data-science grabber nextjs
Last synced: 24 Jun 2026
https://github.com/charlenry/python_data_science
My practical-work notebooks on Python for data science
analysis data dataframe jupyter kaggle matplotlib notebook numpy pandas pyplot python science seaborn visualisation
Last synced: 25 Jun 2026
https://github.com/stefen-taime/mako-main
Declarative framework for real-time data pipelines. YAML in, events out.
data datapipeline declarative-config declarative-pipeline declarative-programming declarative-workflows framework open-source
Last synced: 26 Jun 2026
https://github.com/matthewgferrari/covid-contextualizer
A Coronavirus Contextualizer for the USA
Last synced: 26 Jun 2026
https://github.com/sakan811/honkai-star-rail-characters-damage-simulation
Honkai Star Rail Characters' Damage Simulation
data data-science data-visualization honkai honkai-star-rail honkai-starrail powerbi powerbi-visuals python sqlite
Last synced: 29 Jun 2026
https://github.com/nitheshgoutham/sentinel-2-data-processing-for-pichavaram-mangrove-forest-using-cnn
Image Processing using CNN
cnn cnn-classification cnn-keras data deep-learning matplotlib ploty python seaborn-python visualization
Last synced: 29 Jun 2026