data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum,
- Last updated: 2026-06-29 00:07:49 UTC
- JSON Representation
https://github.com/awesomelistsio/awesome-open-data
A curated list of high-quality open data resources, tools, platforms, and projects across domains.
awesome awesome-list awesome-lists data open open-data
Last synced: 29 Jun 2025
https://github.com/gappeah/global-shipping-analytics-dashboard
This Tableau project provides a comprehensive visual analysis of global sales, shipping costs, and quality metrics across different regions and countries.
data data-analysis data-analyst data-visualization metrics tableau
Last synced: 25 Feb 2025
https://github.com/ngambip/priscilla
About my work and Experience
accounting analytics data finance-management
Last synced: 03 Feb 2026
https://github.com/ryanjoy0000/yt-notifier
Youtube Notifier (Telegram Bot) - A real time data processing pipeline
data go kafka-streams real-time telegram-api youtube-api
Last synced: 14 Jan 2026
https://github.com/mewmix/drivehound
magic file signatures + python drive recovery magic
data disk file-signatures harddrive python recovery recovery-tool
Last synced: 08 Oct 2025
https://github.com/jakakokosar/bioinformatics-serverfiles
Knowledge base for Orange3-bioinformatics add-on
bioinformatics data dictybase gene genesets go homologene markergenes ncbi serverfiles
Last synced: 16 Apr 2026
https://github.com/scienxlab/datasets
Some small datasets for demos, courses, testing, etc.
data open-data sample-data teaching-resources
Last synced: 09 Oct 2025
https://github.com/yessasvini23/accenture_-social-buzz-data-analytics-virtual-programme-forage
Accenture Data Analytics and Visualization - Virtual Internship
accenture content data dataanalytics excel forge socialbuzz
Last synced: 18 Jan 2026
https://github.com/snimmagadda1/stack-exchange-dump-to-mysql
Batch pipeline to import Stack Exchange XML data dumps to relational DB
batch data mysql spring-batch stackoverflow
Last synced: 30 Mar 2025
https://github.com/kamal-singh22/ai-driven-emotional-sentiments-analysis
This project leverages machine learning to analyze and classify the emotional sentiment of textual data. The goal is to accurately identify and categorize emotions, aiding applications in customer feedback analysis, social media sentiment analysis, and mental health monitoring.
analysis artificial-intelligence data emotion nlp-machine-learning python sentiment-analysis streamlit text-classification
Last synced: 14 Apr 2026
https://github.com/jeanmanguy/milk-sci-fi
Census of every mention of milk in sci-fi works.
Last synced: 26 Feb 2026
https://github.com/jackokring/www
Generic www flask server with phinka module
compression data flask phinka python
Last synced: 16 Jan 2026
https://github.com/marcelo-earth/h5n8-data
🔢🦠 Confirmed cases of H5N8 in humans - Feel free to open Pull Requests with new data.
csv data h5n8 h5n8-cases h5n8-virus russia
Last synced: 19 Jan 2026
https://github.com/ajityadav2621/datadoom
Currently working on backend, and as user interaction has been done so updated also deployed for reference. will be adding up many things.
Last synced: 09 Feb 2026
https://github.com/3squared/smoulder
Smoulder is a really good data pipe
composition data facade-pattern forge-framework object-oriented
Last synced: 25 Apr 2026
https://github.com/nafisalawalidris/sales-performance-dashboard
Sales Performance Dashboard: Analyze and visualize sales data using Power BI. Gain insights into trends, customer segments, product performance, and geographic distribution. Make data-driven decisions to optimize sales strategies and maximize revenue.
analytics-revenue dashboard-power-bi data data-analysis intelligence-sales optimization performance sales visualization-business
Last synced: 03 Feb 2026
https://github.com/ukplab/pragtag2023
Code and data for the PragTag-2023 Shared Task
argument-mining data peer-review pragmatics shared-task
Last synced: 18 Jun 2025
https://github.com/tatey/list_of_baby_names
A list of baby names given to tiny humans in Ruby
Last synced: 11 Nov 2025
https://github.com/danielbello7/nosql-json-database
Simple and quick database to help development process and speed
data database json json-database models nosql nosql-database nosql-json-database schema
Last synced: 09 May 2026
https://github.com/east-empire-trading-company/eetc-data-client
Client library for retrieving data managed by EETC Data Hub.
client-library data data-science finance library python
Last synced: 31 May 2026
https://github.com/alexandregazagnes/rica-analysis
This repository contains the code to download, analyse, and modelize the RICA dataset from the french ministry of agriculture.
analysis argiculture business data data-analysis data-analytics food python
Last synced: 29 Apr 2026
https://github.com/definetlynotai/vulnscan_data
Logicytics VulnScan Module's Training Data and old model archive
ai data logicytics ml models pytorch sensitive-files text-processing tfidf-text-analysis training-data
Last synced: 11 Oct 2025
https://github.com/harmonydata/harmonyapi
This is the source code for the Harmony project REST API
anxiety data data-harmonisation data-harmonization data-science deep-learning depression first-timers-only gad-7 harmonisation harmonization harmony mental-health natural-language-processing neural-network nlp psychology research social-sciences wellcome
Last synced: 31 Aug 2025
https://github.com/strata/data
Tools to help you read data from a range of different data providers.
Last synced: 27 Jan 2026
https://github.com/famarks/grafarg
Grafarg is an interactive data analytics and graphical data visualization application. Grafarg being a progressive fork of Grafana 7.5.17 continues to be available under open source Apache 2.0 License
analytics charts data data-analysis data-science data-visualization grafana grafarg graph
Last synced: 19 Jan 2026
https://github.com/davorg/dmp
Data Munging with Perl
book data hacktoberfest munging perl
Last synced: 21 Jan 2026
https://github.com/stdlib-js/array-filled-by
Create a filled array according to a provided callback function.
array data float32array float64array int16array int32array javascript matrix ndarray node node-js nodejs stdlib structure typed typed-array types uint32array vector
Last synced: 09 Mar 2026
https://github.com/wildanmujjahid29/books-sales-analytics-python
Books Sales Analytics With Pyhton
data data-analysis data-science data-visualization
Last synced: 12 Jun 2026
https://github.com/stdlib-js/array-base-assert-is-real-floating-point-data-type
Test if an input value is a supported array real-valued floating-point data type.
array assert base check data dtype is javascript node node-js nodejs stdlib test types util utilities utility utils valid validate
Last synced: 12 Oct 2025
https://github.com/malvfr/zap
Fill your database with fake data.
cli csv data database generator hacktoberfest mock node populate populate-database seed sql
Last synced: 21 Jan 2026
https://github.com/mccarthy-m-g/alda
An R data package for the book "Applied longitudinal data analysis: Modeling change and event occurrence" by Singer and Willett (2003).
data growth-curves longitudinal-data mixed-models nonlinear-mixed-models r r-package structural-equation-modeling survival-analysis time-to-event
Last synced: 19 Jan 2026
https://github.com/saroshfarhan/kaggle-playground-s4e12
Kaggle competition first attempt
analytics data data-analysis-python data-science
Last synced: 12 Oct 2025
https://github.com/sstendahl/giscan
Simple tool to read and analyze existing GISAXS data
cbf data diffraction diffraction-analysis gisans gisaxs physics reflectivity scattering xray
Last synced: 11 Nov 2025
https://github.com/rohancyberops/r-language
R Language Projects directory. This repository contains various projects, scripts, and experiments developed using R, a powerful statistical computing and data visualization language.
caret cran data dplyr ggplot2 rlanguage rstudio shiny tidyverse
Last synced: 12 Oct 2025
https://github.com/miniql/miniql-express-mongodb-example
A MiniQL example for querying a MongoDB database through an Express REST API.
data database mongodb query query-language
Last synced: 19 Apr 2026
https://github.com/codenoid/alodokter.com-database
a Alodokter.com Database, collected by Hofesh Bot (Scrapper)
alodokter data extraction hofesh
Last synced: 18 Mar 2026
https://github.com/grkndev/twitcher
A great library that will allow you to use the Twitch API service. All you need to do is use your Token and Client Id information.
api clip clipr data javascript nodejs npm npm-package npmjs streamers streaming twitch twitch-api twitch-bot twitchtv twtich-clip user
Last synced: 09 Mar 2026
https://github.com/genert/metis
Asynchronous data sender library
analytics asynchronous data dependency-free typescript
Last synced: 27 Jan 2026
https://github.com/petermartens98/nba-analytics-streamlit-app-with-langchain-agent
Interactive NBA Analytics app with Streamlit and a LangChain conversational agent connected to extracted data. Explore player, team, and game stats, track injuries, run simulations, visualize trends, and get AI-powered insights. Ongoing development, open to collaboration.
agentic-ai analysis data deepseek langchain nba python streamlit visualization
Last synced: 08 May 2026
https://github.com/velocitatem/cellviz
Cellular Automata inspired by live-data visualization, designed to handle multidimensional and high-throughput data efficiently.
cellular-automata conways-game-of-life data economics
Last synced: 29 Jul 2025
https://github.com/anobaka/insidecollector
这是一个介于Excel和纯记录工具之间的软件,您可以自由创建各种列表,然后将其以各种规则关联起来,并且可以创建自定义视图帮助您更好地理解数据。
collection data excel-like list list-manager table
Last synced: 19 Jan 2026
https://github.com/R-Mahesh45/HR---Resume-Text-Classification
Text Classification for Resumes: Conducted Exploratory Data Analysis (EDA) on a vast collection of resumes. Organized the data using Bag of Words (BoW) and TF-IDF techniques. Built and evaluated multiple models, with Logistic Regression delivering standout performance. Created Word Clouds and Histograms.
data datacleaning extract-transform-load feature-extraction nlp nltk-tokenizer text-mining text-processing
Last synced: 13 Oct 2025
https://github.com/jerryfzhang/rockets
A Node + React App that displays space launch missions around the world.
bootstrap data expressjs less momentjs nodejs react reactjs reactstrap
Last synced: 10 Apr 2026
https://github.com/richardwarepam16/hotel_analysis_using_python
Unlocking Insights: Analyzing Hotel Reservation Data to Boost Business Performance
data data-analysis data-visualization hotel-booking hotel-cancellation-solution hotel-management-system jupyter-notebook python python3
Last synced: 22 Aug 2025
https://github.com/labwhatever/leetcode
Collection of LeetCode questions to ace the coding interview!
data data-structures-and-algorithms dsa leetcode-cpp leetcode-solutions structure structure-learning
Last synced: 22 Aug 2025
https://github.com/aadityatamrakar/futures_spread_chart
Cash Market & Futures Daily Spread Chart - NSE Stocks
data data-analysis data-mining expressjs nodejs requests
Last synced: 10 Apr 2026
https://github.com/prajwalsinha/unveiling-climate-change-dynamics-through-earth-surface-temperature-analysis
Climate change analysis through global surface temperature data. Includes data preprocessing, statistical analysis, visualizations, and forecasting. Python-based project using Pandas, Matplotlib, and Scikit-learn.
data dataanalysis dynamic-mapping pyplot python scikit-learn seaborn
Last synced: 10 Feb 2026
https://github.com/carlotta94c/sql4datascientistsdemo
Demo material for Microsoft Reactor session "Getting Started with Databases: SQL and Data Visualizations"
analysis data r sqlite tidyverse visualisation
Last synced: 18 Apr 2026
https://github.com/ompreetham/dcn-network-traffic-anomaly-detection
Data Communication Networks - Network Traffic Anomaly Detection
anomaly anomaly-detection communication data dcn keras learning machine machine-learning network pandas presentation project python scikit-learn tensorflow traffic
Last synced: 08 Apr 2026
https://github.com/athul64/powerbi
Financial Reports Dashboard This repository showcases a Financial Reporting Dashboard that visualizes key financial metrics and performance insights. The dashboard contains Monthly and Annual reports, allowing users to switch between the two views to analyze data at different intervals.
data data-an data-visualization dax dax-expression powerbi
Last synced: 23 Feb 2026
https://github.com/vincentneo/sgtidetimings
Scraped SG NEA tide timings table into machine-readable JSON files!
data github-actions github-pages gov html-tables-to-json javascript json nodejs sg singapore singapore-data-analysis tide webscraping
Last synced: 10 Apr 2026
https://github.com/lisakey/datacamp-data-analyst-python-sql-projects
Several projects completed during my Data Analyst 📊 training on the DataCamp platform with Python 🐍 and SQL 🗃️. Each project addresses real-world challenges using modern analytical tools and techniques.
analysis cleaning-data data dataanalysis dataanalyst matplotlib pandas python seaborn sql transformation visuali
Last synced: 19 Apr 2026
https://github.com/lahcenezzara/whatsapp-scraping-python
WhatsApp Scraping Python
automation data python scraping selenium whatsapp
Last synced: 05 Feb 2026
https://github.com/vvipjain/hockey-tournament-analysis
Hockey Tournament Analysis
beautifulsoup data data-analysis data-visualization databases pandas pandas-dataframe powerbi python python-library python-script requests-library-python sql sql-server sqlalchemy
Last synced: 27 Jan 2026
https://github.com/stdlib-js/ndarray-base-dtypes2signatures
Transform a list of array argument data types into a list of signatures.
api array base data dtype dtypes interface javascript multidimensional ndarray node node-js nodejs sig signatures stdlib types utilities utility utils
Last synced: 14 Apr 2026
https://github.com/gematik/app-fhir-snapshots-package-generator
The repository contains a library and a console application to generate snapshots for StructureDefinitions in FHIR-packages.
Last synced: 05 Oct 2025
https://github.com/morphaxthedeveloper/yokatlas-dataset-2025
yök atlas detaylı üniversite, bölüm, puan vb. datası..
data database liste scrape universite veri yok-atlas yok-atlas-api yok-atlas-data yokatlas yokatlas-crawler yokatlas-data
Last synced: 14 Oct 2025
https://github.com/jhpoelen/rats
self-replicating data publication related to rat (Rattus sp.) specimen.
biodiversity data natural-history-collections provenance
Last synced: 18 Mar 2026
https://github.com/arif-miad/heart-attack-risk-prediction
This dataset explores key factors influencing heart attack risk, such as age, cholesterol, blood pressure, and lifestyle habits. Using machine learning models.
classification data data-science matplotlib ml pandas-python seaborn visualization
Last synced: 18 Aug 2025
https://github.com/souvik09-tech/adventure-works-kpi-dashboard
This repository contains a complete Business Intelligence solution for AdventureWorks, a global manufacturing company specializing in cycling equipment and accessories. Built using Power BI Desktop, this project helps track KPIs, analyze product performance, compare regional data, and identify high-value customers.
analysis data kpi powerbi visualization
Last synced: 27 Jan 2026
https://github.com/freddy03h/immutable-data-structure
Normalize and Merge your application's data store using Immutable.JS objects
Last synced: 05 Oct 2025
https://github.com/ibilalkayy/covid-tracking-app
This repository contains the code of a covid tracking app that shows the data of covid-19 on Google Map.
Last synced: 14 Oct 2025
https://github.com/jhpoelen/bats
self-documenting data publication on Bat (Chiroptera) specimen
biodiversity data natural-history-collections provenance specimen
Last synced: 18 Mar 2026
https://github.com/dylanhogg/cloud-products
A package for getting cloud products and product descriptions from a cloud provider website.
aws cloud-products crawler data text-processing
Last synced: 05 Oct 2025
https://github.com/nnavales/desafios-data-engineer
En este proyecto abordaremos desafíos comunes en el rol de un Data Engineer con tecnologías modernas.
data data-engineering database dataengineering docker minio scrapping spark
Last synced: 01 Jun 2026
https://github.com/garcane/income-prediction-ml
This is a machine learning project aimed at predicting whether an individual's annual income exceeds $50,000 based on their demographic and personal information.
data data-science machine-learning ml numpy pandas python random-forest scikit-learn
Last synced: 08 Apr 2026
https://github.com/open-i18n/data-iso-15924
Git mirror for ISO 15924, Codes for the representation of names of scripts data
data iso iso-15924 iso15924 open-i18n scripts unicode unicode-data writing-systems
Last synced: 14 Mar 2026
https://github.com/sanskaryo/ultimate-dsa-repo
One Stop Solution for DSA Learning and Resources
data data-structures-and-algorithms dsa hacktoberfest hacktoberfest-accepted hacktoberfest2025
Last synced: 15 Oct 2025
https://github.com/pradeep221b/turbofan_predictive_maintenance
An R project for predicting turbofan engine RUL using {targets} and {tidymodels}.
data data-science-portfolio machine-learning nasa preditive-maintaince r rstats targets-pipeline tidymodels
Last synced: 04 Oct 2025
https://github.com/rishabh-agarwal/datastructuremachineproblem
Data Structure MP - Clemson University (Language C)
273 alogrithms clemson data ece structure university
Last synced: 26 Oct 2025
https://github.com/stdlib-js/array-one-to-like
Generate a linearly spaced numeric array whose elements increment by 1 starting from one and having the same length and data type as a provided input array.
array data float32array float64array int16array int32array javascript matrix ndarray node node-js nodejs stdlib structure typed typed-array types uint32array vector
Last synced: 20 Feb 2026
https://github.com/akv3sic/cryptocurrency-charts
Cryptocurrency API data visualizations 📈 with Matplolib.
cryptocurrency data data-visualization matplotlib python
Last synced: 16 Oct 2025
https://github.com/nxank4/loclean
⚡️ The All-in-One Local AI Data Cleaning Library. No GPU or API keys required.
automated-cleaning data data-cleaning data-engineering data-preprocessing data-science data-wrangling etl llm normalization open-source polars privacy-preserving python semantic-analysis slm structured-data
Last synced: 22 Jan 2026
https://github.com/bishtrishu/pizza_sales_data_analysis_sql
This project is a comprehensive data analysis of pizza sales, aimed at uncovering key insights and trends to inform business decisions. Using a combination of SQL, Python, and data visualization tools, the project analyzes sales data to understand customer preferences, peak sales periods, and the most popular pizza types.
cloud data data-analysis data-science data-visualization dataanalytics database mysql oracle-database
Last synced: 14 Apr 2026
https://github.com/simranjeet97/datastructures_algoritms_python
Data Structures and Algorithms using Python
algorithms arrays arrays-and-strings coding data data-science data-structures datastructures-python hashing interview-preparation interview-questions linked-list python stacks stacks-as-an-array
Last synced: 09 Apr 2026
https://github.com/data-forge-notebook/javascript-cheat-sheet
Cheat sheet that accompanies my book Data Wrangling with JavaScript
cheatsheet data data-wrangling javascript nodejs
Last synced: 15 Apr 2026
https://github.com/nicolasbizzozzero/datagenerator
Randomly generate various commonly used data
data data-generation data-generator data-science
Last synced: 18 Oct 2025
https://github.com/gematik/poc-isik-patient-merge
The repository contains a proof of concept (POC). The POC demonstrates how a FHIR subscription can be used to inform about happened merges within the ISIK context.
Last synced: 19 Oct 2025
https://github.com/helosantosdesousa/analise-previsao-de-rotatividade-ml
Projeto final do Bootcamp Data Girls 2025 que analisa a rotatividade de funcionários usando Machine Learning. Com base no dataset IBM HR Analytics Attrition, o projeto identifica os principais fatores de risco e cria modelos preditivos (SVC e Random Forest) com até 89% de acurácia para antecipar saídas e apoiar decisões estratégicas de RH.
analise-de-dados analise-exploratoria bootcamp ciencia-de-dados colab-notebook dados data data-analysis data-science dataanalytics dataframe eda machine-learning machine-learning-algorithms pandas python random-forest svc
Last synced: 16 Apr 2026
https://github.com/scottleechua/data
Public datasets under CC-BY-4.0 license.
Last synced: 18 Mar 2026
https://github.com/r12habh/datacamp.com-micro_projects
data data-analysis data-science datascience python python3
Last synced: 23 May 2026
https://github.com/sksubhadeep/airbnb-dashboard-tableau
Airbnb Dashboard Using Tableau
airbnb data tableau tableau-public visualization
Last synced: 19 Mar 2026
https://github.com/jorgeatgu/casa-caida-bot
Twitter-bot sobre la despoblación en Aragón
aragon bot data data-viz despoblacion twitter-bot
Last synced: 11 Aug 2025
https://github.com/qeeqbox/data-classification
Data classification defines and categorizes data according to its type, sensitivity, and value
classification data data-classification infosecsimplified qeeqbox
Last synced: 09 Mar 2026
https://github.com/vikjam/ui-policy
Unemployment policy at the state level
data government government-data
Last synced: 13 Feb 2026
https://github.com/semibran/img-data
Easily read from and write to ImageData instances
Last synced: 11 Aug 2025
https://github.com/lemniscate-world/stratai
This project analyzes financial assets using a Hidden Markov Model (HMM) to identify different market regimes and patterns. The analysis includes calculating daily returns, rolling volatility, and volume changes, and visualizing the hidden states identified by the HMM.
ai assets data data-science data-visualization finance financial-analysis fintech hmm-model hmmlearn machine-learning trading
Last synced: 23 Oct 2025
https://github.com/cisagov/cyhy-feeds
Tools to create and retrieve Cyber Hygiene (CyHy) data extracts
Last synced: 23 Oct 2025