data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum
- Last updated: 2026-07-01 00:07:35 UTC
https://github.com/yashmistry-24/ytcomment-iq
YTComment-IQ is a web app for analyzing and visualizing YouTube comments, offering insights through sentiment analysis, topic modeling, and interactive charts.
analysis comments data dataanalysis dataanalytics deep-learning machine-learning nlp python streamlit training visualization webapp youtube
Last synced: 15 Feb 2026
https://github.com/abuzar-alvi/employee-data-to-info-card-generator-with-python
A Python project I made to improve my Python skills.
card data data-generator employee python
Last synced: 03 Feb 2026
https://github.com/arcticsnow/climatepy
Collection of tools to perform timeseries analysis on climate data (Observation and Downscaled)
climate data era5 meteorological-data noaa-data pandas timeseries weather wmo xarray
Last synced: 05 Feb 2026
https://github.com/tusharnankani/analysis-2.0
An Exhaustive WhatsApp Chat Data Analysis 2.0
analysis data data-science plots trends visualization
Last synced: 31 Mar 2025
https://github.com/lafayettegabe/g2m-insight-for-cab-investment-firm
📊 Exploratory Data Analysis (EDA) on multiple datasets related to the cab industry in the US, to provide actionable insights and recommendations to a private firm looking to invest in the market. The analysis includes data cleaning, transformation, visualization, and hypothesis testing.
big-data data data-analysis data-science data-visualization eda gotomarket
Last synced: 13 Jun 2025
https://github.com/simoneas02/data-science
🐍 A study plan for becoming a data scientist and improving my current skills. 🤘🏼🌻
data data-analysis data-science data-visualization deep-learning machine-learning pandas python3 r sql
Last synced: 12 Apr 2026
https://github.com/jinsyin/datalink
⚡ Data integration | DataLink is a lightweight data integration framework built on top of DataX, Spark and Flink
batch big-data bigdata cdc data data-collection data-exchange data-integration data-pipeline data-synchronization datalink etl flink flink-cdc framework integration pipeline spark streaming
Last synced: 19 Jul 2025
https://github.com/hdk101/credentials-validator
A quick way to validate credentials on the server side
backend credentials data email frontend javascript login node npm npm-install password register server-side
Last synced: 21 Sep 2025
https://github.com/husna-poyraz/artificial-intelligence-and-data-science
Some studies on Artificial Intelligence and Data Science ...
artificial-intelligence data data-analysis-python data-science matplotlib-pyplot numpy pandas python
Last synced: 05 May 2026
https://github.com/dhimmel/het.io-rep-data
Data from Project Rephetio for the het.io website
browser data datatables drug-repurposing rephetio
Last synced: 07 Feb 2026
https://github.com/sadcenter/messenger
Data messaging system between servers using popular messaging brokers
Last synced: 06 Aug 2025
https://github.com/stdlib-js/utils-compact-adjacency-matrix
Compact adjacency matrix.
adjacency dag data data-structure data-structures graph javascript matrix node node-js nodejs stdlib structure topological toposort tsort util utilities utility utils
Last synced: 15 Apr 2026
https://github.com/kocyigitkim/realtime.io
Real time data streaming & socket programming library
data realtime socket streaming
Last synced: 29 Jul 2025
https://github.com/lastancientone/amd-vs-nvda
Analyzing 2 technology stocks using Master Analyst Program (MAP).
data data-analysis data-structures data-visualization excel forecasting time-series-analysis
Last synced: 15 May 2025
https://github.com/divithraju/divith-raju-openmetadata
Open Standard for Metadata. A Single place to Discover, Collaborate and Get your data right.
automation bigdata bigdataanalytics data data-structures dataengineering datascience hacktoberfest2022 metadata metadata-extraction
Last synced: 20 Feb 2026
https://github.com/gauravkoradiya/tensorflow-data-and-deployement
This repository demonstrates the use of data and deployment pipelines in TensorFlow.
data deployment machine-learning-algorithms pipline tensorflowjs
Last synced: 06 Oct 2025
https://github.com/akhi07rx/f1-statistics-dashboard
A comprehensive command-line tool for analyzing Formula 1 race data using the FastF1 library.
akhi07rx cli cli-tools data f1 f1-score f1cli f1dashboard f1stats fastf1 formula1 opensource race race-analytics
Last synced: 23 May 2026
https://github.com/velocitatem/cellviz
Cellular Automata inspired by live-data visualization, designed to handle multidimensional and high-throughput data efficiently.
cellular-automata conways-game-of-life data economics
Last synced: 29 Jul 2025
https://github.com/lakecountryhuntclub/dnr-map-data-model
Data Model for the 2023 DNR Pheasant Stocking Property Data
data data-model documentation excel gis hunting mapping powerquery vba
Last synced: 29 Jul 2025
https://github.com/charliecm/meteorite-landings
Data visualization of meteorite landings on Earth.
astronomy d3 data data-visualization mapbox space visualization
Last synced: 18 Apr 2026
https://github.com/olamide100/capstone-project-llm-zoomcamp
Comparative Guide Assistant
argocd data dataengineering docker grafana kubernetes llm-agent mlops-workflow rag strreamlit
Last synced: 14 Feb 2026
https://github.com/joeyism/py-cifar10
This library was created to make CIFAR-10 data easy to use. It is a wrapper around the instructions given on the CIFAR-10 site.
cifar cifar-10 cifar10 data machine-learning machinelearning
Last synced: 30 Jul 2025
https://github.com/asuozzo/medicare-data-analysis
An analysis of Medicare Part D data in Vermont
Last synced: 04 May 2026
https://github.com/mouneshgouda/learn_dsa
This repository explores fundamental data structures and their implementations. Learn how to organize and manipulate data efficiently for various programming tasks. (Feel free to add your specific focus areas here, e.g., algorithms, interview prep)
c data queue sorting-algorithms stack structured-data
Last synced: 30 Jul 2025
https://github.com/visenger/prada
Profiling Datasets
cleaning data dataset profiling
Last synced: 24 Aug 2025
https://github.com/gappeah/london-housing-price-dashboard
This Excel-based Housing Visual Dashboard provides a comprehensive view of average house prices across various boroughs in London from 1996 to 2013. The dashboard is designed to offer insights into housing market trends and price variations across different areas of London over time.
data data-analysis data-visualization excel visual
Last synced: 31 Jul 2025
https://github.com/derrickbaruga7/python-data-analysis
This project analyzes ORU’s off-season sewer usage using Python, with `pandas` for data handling, histograms and line plots for exploration, and a `scipy`-based model for prediction. Pearson’s correlation and visualizations help reveal key trends and relationships.
analytics data data-science visualization
Last synced: 31 Jul 2025
https://github.com/dannyben/datamix
DSL for manipulating tabular data
csv data data-analysis data-engineering gem ruby tabular-data
Last synced: 31 Jul 2025
https://github.com/flowsynx/plugin-postgresql
FlowSynx plugin that interfaces with PostgreSQL for CRUD operations. Supports JSONB, full-text search, and advanced query features.
data database flowsynx postgresql postgresql-database sql
Last synced: 09 May 2026
https://github.com/elhariri78/case-study-a-better-smoker-detector
Case Study: A Better Smoker Detector
data dataframe evaluation kaggle matplotlib-pyplot numpy pandas pandas-dataframe pandas-python python3 seaborn sklearn
Last synced: 07 Apr 2026
https://github.com/danieljdufour/rle-serializers
Serialize and Deserialize Run Length Encoding
cloud-optimized compression csv data deserializer run-length run-length-decoding run-length-encoding serializer
Last synced: 24 Sep 2025
https://github.com/chandraprakash-bathula/keywords_prediction-machine-learning-integration
Keywords prediction model. The model was built by: data cleaning, removing stopwords, constructing Word2vec, and advancing to TF-IDF weighted Word2vec.
algori artifici data machine-learning tf-idf weighted-word2vec word2vec
Last synced: 08 Nov 2025
https://github.com/ajsalemo/python-pandas-datalib
Testing and experimenting with some simple Pandas functionality using Flask to serve the parsed data.
csv data flask json pandas pandas-dataframe pandas-series python tabular tabular-data terminal
Last synced: 09 Apr 2026
https://github.com/tonykipkemboi/ens_subgraph_data
Query On-Chain Data from Subgraphs by The Graph Protocol using Python
data subgraphs thegraphprotocol web3
Last synced: 17 Sep 2025
https://github.com/stephaniehicks/flowsorted.blood.wgbs.blueprint
A Bioconductor ExperimentHub data package for flow sorted purified whole blood cell types measured using DNA methylation on WGBS platform from BLUEPRINT
bioconductor bioconductor-package bisulfite-sequencing blood data dna-methylation flowsort wgbs
Last synced: 25 Sep 2025
https://github.com/theryston/db-mycro
A Node module providing a JSON database that saves data in a specific directory, similar to SQLite but in JSON
base crud data database db db-mycro javascript json jsondatabase nodejs nosql typescript
Last synced: 09 Apr 2026
https://github.com/woctezuma/download-steam-screenshots-data
Data consisting of Steam screenshots.
Last synced: 19 Feb 2026
https://github.com/ddeutils/ddedocs
📖 Data Developer & Engineer Documents and Hands-On
blogs data data-engineering documents hands-on
Last synced: 08 Aug 2025
https://github.com/rubenhortas/python_examples
Examples of Python code and DSA (data structures and algorithms).
algorithm algorithms data dsa examples python python-3 python3 samples snippets structures
Last synced: 03 Oct 2025
https://github.com/waqaszafar9/cricket-managment-database
Cricket website database management project
cricket-data cricket-dataset data data-structures database database-management database-management-system database-schema oracle oracle-database sql sql-query
Last synced: 10 Aug 2025
https://github.com/semibran/img-data
Easily read from and write to ImageData instances
Last synced: 11 Aug 2025
https://github.com/jorgeatgu/casa-caida-bot
Twitter bot about depopulation in Aragón
aragon bot data data-viz despoblacion twitter-bot
Last synced: 11 Aug 2025
https://github.com/r12habh/datacamp.com-micro_projects
data data-analysis data-science datascience python python3
Last synced: 23 May 2026
https://github.com/helosantosdesousa/analise-previsao-de-rotatividade-ml
Final project of the Data Girls 2025 Bootcamp, analyzing employee turnover using Machine Learning. Based on the IBM HR Analytics Attrition dataset, the project identifies the main risk factors and builds predictive models (SVC and Random Forest) with up to 89% accuracy to anticipate departures and support strategic HR decisions.
analise-de-dados analise-exploratoria bootcamp ciencia-de-dados colab-notebook dados data data-analysis data-science dataanalytics dataframe eda machine-learning machine-learning-algorithms pandas python random-forest svc
Last synced: 16 Apr 2026
https://github.com/soenneker/soenneker.quark.table
A native Blazor table component.
blazor blazorlibrary csharp data dotnet html quark quarktable table tables
Last synced: 13 Aug 2025
https://github.com/stdlib-js/array-one-to-like
Generate a linearly spaced numeric array whose elements increment by 1 starting from one and having the same length and data type as a provided input array.
array data float32array float64array int16array int32array javascript matrix ndarray node node-js nodejs stdlib structure typed typed-array types uint32array vector
Last synced: 20 Feb 2026
https://github.com/pradeep221b/turbofan_predictive_maintenance
An R project for predicting turbofan engine RUL using {targets} and {tidymodels}.
data data-science-portfolio machine-learning nasa preditive-maintaince r rstats targets-pipeline tidymodels
Last synced: 04 Oct 2025
https://github.com/zediculz/block
Block is a data structure/collection that uses blockchain principles to manage data.
Last synced: 05 Oct 2025
https://github.com/carlotta94c/sql4datascientistsdemo
Demo material for Microsoft Reactor session "Getting Started with Databases: SQL and Data Visualizations"
analysis data r sqlite tidyverse visualisation
Last synced: 18 Apr 2026
https://github.com/labwhatever/leetcode
Collection of LeetCode questions to ace the coding interview!
data data-structures-and-algorithms dsa leetcode-cpp leetcode-solutions structure structure-learning
Last synced: 22 Aug 2025
https://github.com/petermartens98/nba-analytics-streamlit-app-with-langchain-agent
Interactive NBA Analytics app with Streamlit and a LangChain conversational agent connected to extracted data. Explore player, team, and game stats, track injuries, run simulations, visualize trends, and get AI-powered insights. Ongoing development, open to collaboration.
agentic-ai analysis data deepseek langchain nba python streamlit visualization
Last synced: 08 May 2026
https://github.com/kunalshelke90/predict-bank-credit-risk-using-south-german-credit-data
This is an end-to-end ML project that develops a classification model for assigning a given customer profile to one of two risk categories (safe or not safe). The final classifier used for this project is the CatBoost classifier. Deployed on AWS.
aws cassandra catboost-classifier classification credit-risk data data-science dataanalysis dockerfile finance financial-analysis flask github-actions logging machine-learning mlflow numpy pandas python
Last synced: 03 Jan 2026
https://github.com/tatey/list_of_baby_names
A list of baby names given to tiny humans in Ruby
Last synced: 11 Nov 2025
https://github.com/horisystems/uk_ev_data_analysis
Analysis of Electric Vehicle charging infrastructure in the United Kingdom.
data data-science electric-vehicles ev python uk united-kingdom
Last synced: 12 Jan 2026
https://github.com/ngambip/priscilla
About my work and experience
accounting analytics data finance-management
Last synced: 03 Feb 2026
https://github.com/stdlib-js/array-base-to-accessor-array
Convert an array-like object to a minimal array-like object supporting the accessor protocol.
accessor accessors array array-like convert data javascript node node-js nodejs object protocol stdlib structure types wrap wrapper
Last synced: 04 Jan 2026
https://github.com/husna-poyraz/titanic-machine-learning
Use machine learning to create a model that predicts which passengers survived the Titanic shipwreck.
data data-analysis data-science data-visualization deep-learning machine-learning missing-data outlier-detection python titanic
Last synced: 10 May 2026
https://github.com/stdlib-js/array-one-to
Generate a linearly spaced numeric array whose elements increment by 1 starting from one.
array data float32array float64array int16array int32array javascript matrix ndarray node node-js nodejs stdlib structure typed typed-array types uint32array vector
Last synced: 26 Feb 2026
https://github.com/mattqdev/koalaz
Why not use koalas as mock data? With this npm package you can!
data koala lorem-ipsum meme mock placeholder
Last synced: 13 Jan 2026
https://github.com/codenoid/webtoons.com-database
A Webtoons.com database, collected by Hofesh Bot (scraper)
Last synced: 28 Mar 2025
https://github.com/nouman6093/advanced-statistical-models
In this repository I will upload everything I have learned about advanced statistical models in data science. There are over 42 statistical models, each of which works on algorithms, and there are over 32 algorithms. Each library has its own way of writing such statistical models. After learning, I will try to upload as many statistical models as possibl
data data-analysis data-science data-visualization
Last synced: 11 Jun 2026
https://github.com/ilejuxepwaduzd/structured-data-extractor
🛠️ Extract structured data from messy texts using Chain-of-Thought prompting to improve processing of customer support and technical issues.
cdp chrome-fetcher data document-extraction ecommerce golang-library headless metadata-extraction ocr open-source pdf pdf-converter pdf-extractor ruby scraper shopify spider structured-data
Last synced: 10 Apr 2026
https://github.com/makepath/medaprep
medaprep is a data preparation and feature engineering toolkit for geospatial applications.
data data-science datacleaning eda exploratory-data-analysis xarray
Last synced: 29 Jun 2025
https://github.com/exoticknight/juhe
simple way to analyze complex data in one chain call
aggregation aggregator analysis data statistic typescript
Last synced: 21 May 2026
https://github.com/cdapio/website
CDAP IO website
analytics applications cdap cdapio data data-analytics data-integration hugo integration metadata oss rules-engine
Last synced: 18 Jun 2025
https://github.com/thechibuzornwachukwu/bluesky-scraper
This is a work of art that enables you to scrape data off BlueSky.
analytics bluesky bluesky-api bluesky-client data datascraper-framework datascraping scraping social-media web webscraping
Last synced: 16 Nov 2025
https://github.com/connectomicslab/cmtklib-data
Datalad dataset that stores all data resources of the cmtklib module of Connectome Mapper 3 (https://github.com/connectomicslab/connectomemapper3).
brain data parcellation resources software
Last synced: 16 Jan 2026
https://github.com/coqui123/tradegpt
TradeGPT is a full-stack cryptocurrency trading application that combines a modern Fresh (Deno) frontend with a Python (FastAPI) backend for Coinbase integration and Azure AI Services for intelligent trading analysis. 💹
analytics automation cryptocurrency data deno fastapi fresh numpy python trading-algorithms trading-strategies tradingbot typescript
Last synced: 11 Apr 2026
https://github.com/mrbisquit/weathercollector
Open-Source weather station data collector
collector customisable data modular opensource weather weather-forecast weather-station
Last synced: 16 Jan 2026
https://github.com/astrid-project/cb-manager
APIs to interact with the Context Broker's database. Through a REST Interface, it exposes data and events stored in the internal storage system in a structured way. It provides uniform access to the capabilities of monitoring agents.
agent beats control data ebpf elasticsearch log logstash management programmability security
Last synced: 30 Jun 2025
https://github.com/mohsinali08000/myportfolio
I’m Mohsin Ali, a passionate software engineer with over 2 years of experience in developing robust software solutions. Currently transitioning into the field of data science.
Last synced: 22 Apr 2026
https://github.com/lookininward/data-formatter-demo
You have directories containing data files and specification files. The specification files describe the structure of the data files. Write an app that reads format definitions from specification files. Use these definitions to convert the parsed files to NDJSON files.
csv data demo files json ndjson python txt unittest
Last synced: 27 Apr 2026
https://github.com/emnetdegafe/allesoverfilm-backend
AllesOverFilm-backend is part of the AllesOverFilm mobile app development project and contains the database structure, server query scripts, and Sequelize-cli database structures.
backend data data-model express postgresql sequelize-cli
Last synced: 11 Apr 2026
https://github.com/spine-tools/metreload
Python application for downloading meteorological reanalysis data
Last synced: 01 Jul 2025
https://github.com/cosmos-loops/cosmos-dapper
Cosmos.Dapper is a part of Cosmos.Data, an inline project of COSMOS LOOPS PROGRAMME. This repository provides a package of StackExchange.Dapper to improve development efficiency.
dapper data mysql mysqlconnector oracle postgresql sql-query sqlite sqlkata sqlserver
Last synced: 11 Apr 2026
https://github.com/cintia0528/data_analytics_and_visualization-sql_tableau
Evaluate Magist as a strategic partner for Eniac's Brazilian expansion. Use SQL to analyze growth, tech accessory sales potential, delivery times, and customer satisfaction in Magist's database.
data dataanalysis datavisualization sql strategy tableau
Last synced: 31 Mar 2025
https://github.com/mtingers/opacify
Opacify reads a file and builds a manifest of external sources to rebuild said file.
backup data obfuscation python
Last synced: 18 May 2026
https://github.com/abdul-rafay19/youngdevinterns_machine-learning_tasks
This internship offers hands-on exposure to real-world Machine Learning applications — from data visualization and preprocessing to model development, evaluation, and deployment. It focuses on real ML workflows, problem-solving, neural networks, and hyperparameter tuning — all within a collaborative, remote, and growth-oriented environment.
ai artificial-intelligence artificial-intelligence-algorithms artificial-neural-networks data data-visualization internship machine-learning machine-learning-algorithms machinelearning ml model model-development neural-network preprocessing programming-language python task tasks youngdevintern
Last synced: 29 Apr 2026
https://github.com/fritzrehde/asciibar
A cli tool to print percentages as ascii bar charts
cli data percentage visualization
Last synced: 31 Oct 2025
https://github.com/gmersy/data-carbon
Repository accompanying the paper: Toward a Life Cycle Assessment for the Carbon Footprint of Data
carbon-emissions carbon-footprint climate-change data data-science sustainability sustainable-software
Last synced: 31 Mar 2025
https://github.com/jorgeatgu/apaga-luz
💡 How much does electricity cost? 💶
data data-visualization flat-data
Last synced: 04 Feb 2026
https://github.com/victorowinoke/after-work-data-science-project-showcase-eda
You work for Lublu as a Data Science Consultant and have been tasked with analyzing the pricing, products, and assortment of Adidas and Nike. Create a descriptive analysis report with relevant observations and recommendations that will help Lublu launch similar products.
adidas analysis data deliverables nike pythonanalysis ranges
Last synced: 28 May 2026
https://github.com/rafaelfloressouza/Covid-19-Dashboard
Python web application that displays worldwide COVID-19 data using Plotly and Dash
bootstrap covid-19 css data datavisualization plotly-dash python3
Last synced: 10 Mar 2025
https://github.com/diegoperea20/own_dataset_segmentation_yolov8
Object segmentation and detection with a custom dataset using YOLOv8, based on a custom dataset of a 2023 Colombian 200-peso coin.
coins colombia data opencv own python segmentation tensorflow yolov8
Last synced: 12 Apr 2026