data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum
- Last updated: 2026-06-29 00:07:49 UTC
https://github.com/null-none/py-fear-and-greed
Fear & Greed Index
data fear-and-greed python trading
Last synced: 16 Jul 2025
https://github.com/birjemin/wxgameod
wxgame open data: Weixin (WeChat) mini-game relationship-chain data
data interactive-data relation user-storage
Last synced: 16 Jul 2025
https://github.com/shoaib1522/database-systems
📚💾 Master the fundamentals of database systems with this all-in-one lab repository, featuring ERD design diagrams 🧠🗺️, Oracle SQL 🌐📝, relational schema practice, and complete PowerPoint lectures 🖥️📑. Perfect for revision, exams, or quick reference! 💡📘
data database database-management databases databases-course db dbms-project erd notes oracle oracle-database sql
Last synced: 21 Aug 2025
https://github.com/ayush1999/data-mining
data mining natural-language-processing
Last synced: 10 Sep 2025
https://github.com/qubitpi/wiktionary-data
Wiktionary data in simple parsable formats hosted on 🤗 Datasets
ancient-greek data german huggingface huggingface-datasets language latin natural-language-processing nlp old-persian python wiktionary wiktionary-data
Last synced: 17 Jul 2025
https://github.com/webianks/anotech-android
Android application that deals with various anomalous behaviours occurring in server data.
Last synced: 13 Apr 2025
https://github.com/andygeiss/pipeline-example
This is a basic example of using a pipeline in data science.
data data-pipeline data-science example go golang iris-dataset pipeline protobuf
Last synced: 17 Jul 2025
https://github.com/jodus-melodus/queue
Simple Queue
data datastructures linear queue queues
Last synced: 10 Sep 2025
https://github.com/amethyst-php/value
amethyst amethyst-package api data laravel value
Last synced: 17 May 2026
https://github.com/topunix/hackerrank
:green_book: HackerRank Solutions
algorithm-challenges algorithms algorithms-and-data-structures data data-structures hackerrank hackerrank-algorithms-solutions hackerrank-challenges hackerrank-python hackerrank-solutions python
Last synced: 17 May 2026
https://github.com/christopherandrewtopalian/catopalian_javascript_data_navigator
A JavaScript application that allows for easy sorting of data. Easily navigate through any amount of data using button filters.
Last synced: 13 Apr 2025
https://github.com/os-climate/rmi-utility-transition-hub-ingestion-pipeline
Data ingest for RMI's Utility Transition Hub data (as of March 7, 2022)
data emissions-co2 energy-data os-climate
Last synced: 12 Apr 2025
https://github.com/potlock/data
Data research for other funding mechanisms and PotLock-related data.
data flipsidecrypto near-protocol potlock
Last synced: 07 Mar 2026
https://github.com/dimitryzub/allrecipes-us-recipes-by-state-analysis
Personal Data Exploratory Project in Python. Data extracted from AllRecipes.
data data-visualization dataexploration dataextraction matplotlib pandas python seaborn webscraping
Last synced: 10 May 2026
https://github.com/Vidya-Vijay/Vid2501
About me
analytics data data-science machinelearning python r spss sql statistics tableau visualization
Last synced: 19 Jul 2025
https://github.com/merekat/hb-passiv-income
A calculator that uses historical data for various assets to estimate the passive income a user can expect, depending on their inputs.
assets data datajournalism etf passive-income treasury
Last synced: 19 Jul 2025
https://github.com/youmenomi/hydreigon
Are you looking for a Hydreigon to classify data for you? Come and catch it!
classify data hydreigon indexer items management pokemon sortable structure typescript
Last synced: 07 May 2025
https://github.com/jcloh98/rental-property-finder
A web scraper that helps users find rental properties by automatically gathering and organizing listings from various websites to discover available homes and apartments.
data headless-browser node scraper scraping web
Last synced: 17 May 2026
https://github.com/amethyst-php/sku
amethyst amethyst-package api data laravel sku
Last synced: 17 May 2026
https://github.com/saisurajmatta/data-warehousing-and-advanced-data-analytics
Data Analytics Project: Analyzed Promotions and Provided Tangible Insights to Sales Director
data data-analysis data-architecture data-flow-analysis data-modeling data-pipeline data-segmentation data-visualization data-warehousing docker etl etl-pipeline mssql sql tableau
Last synced: 17 May 2026
https://github.com/nadahamdy217/Harvest-Gaurd-Plant-Disease-Detection-Web-Application
Web application that helps people grow healthy plants
classification-confidential cnn cnn-classification css data data-science detection html javascript keras machine-learning model plant-disease-detection supervised-learning tensorflow web-application
Last synced: 12 Apr 2025
https://github.com/joseluisq/input-verifier
Some useful functions to check common data input.
Last synced: 19 Jul 2025
https://github.com/deliprofesor/cardiac-data-analysis-exploring-cholesterol-and-heart-rate
This project analyzes a heart disease dataset to explore the relationship between cholesterol, heart rate, and chest pain type. It includes normality tests, outlier detection, correlation analysis, MANOVA, post-hoc tests, and VIF analysis, with visualizations using histograms, heatmaps, and boxplots.
correlation-analysis data data-cleaning data-visualization machine-learning manova post-hoc-analysis python tukey-hsd vif
Last synced: 17 May 2026
https://github.com/zshn1248/pyfilecrypto
PyFileCrypto is a Python module for easy encryption and decryption of files using the cryptography library. It provides a simple interface to generate encryption keys, encrypt files, and decrypt files securely.
data decryption encryption file security-tools
Last synced: 07 Apr 2026
https://github.com/sharoonjoseph321/social_media_eda
Data analysis of social media apps using pandas, Python, and matplotlib.
data data-analysis data-science data-visualization matplotlib programming-language project python pythonprojects
Last synced: 03 Mar 2025
https://github.com/yvandana/pwc-power-bi-job-simulation
Projects pursued during my Job Simulation
dashboard data dataanalysis powerbi pwc-forage-switzerland
Last synced: 06 Mar 2026
https://github.com/plurid/defocus
Apophatic User Content Resolution [Desearch Concept]
Last synced: 08 Nov 2025
https://github.com/ashishsingh789/hr_analysis_dashboard
The HR Analyst Dashboard is an interactive Power BI tool that provides insights into HR metrics sourced from Excel. It focuses on data cleaning, transformation, and visualization, enabling stakeholders to explore key indicators like employee demographics and performance through intuitive charts.
dashboard data dataanalysis datacleaning powerbi-desktop visualization
Last synced: 06 Mar 2026
https://github.com/katahiromz/comp_decomp
data compressor/decompressor
bzip2 compress compressor cxx data decompress decompressor lzma uncompress zlib
Last synced: 10 Jul 2025
https://github.com/UznetDev/Smoking-Prediction
This project focuses on analyzing the "Smoking" dataset and building a predictive model for smoking status based on various health metrics. The goal is to identify factors influencing smoking behavior and develop a reliable model for prediction.
ai classification data data-science kaggle-competition machine-learning ml roc-auc sklearn smoking
Last synced: 28 Mar 2025
https://github.com/halyusa16/basic-sql-employee-analysis
This project focuses on analyzing employee data through querying, performing table joins to connect related information, aggregating salary statistics, and using subqueries to extract meaningful insights.
data data-analytics data-exploration database mysql self-project sql
Last synced: 16 May 2026
https://github.com/chompfoods/sdk-scala
Scala SDK for the Chomp Food & Recipe Database API. Use our API to get high-quality data on recipes and 875,000+ branded/grocery foods plus raw ingredients.
api branded chomp data database food grocery ingredients nutrition raw recipe-api recipes scala sdk
Last synced: 17 May 2026
https://github.com/vaibhavmojidra/data-structures---hashtable-using-array-and-linked-list-in-java
A hash table is a data structure that stores data associatively. Data is stored in an array, where each value is placed at an index computed from its key. Access is very fast when the index of the desired data is known, so insertion and search are, on average, very fast regardless of the size of the data. A hash table uses an array as its storage medium and a hash function to generate the index at which an element is inserted or located.
arrays data data-structures hashing java linked-list mojidra vaibhav vaibhav-mojidra vaibhavmojidra
Last synced: 12 Apr 2025
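The hash table idea described in the entry above can be sketched in a few lines. This is an illustrative Python sketch only, not code from the linked repository (which targets Java); the names `Node`, `ChainedHashTable`, `put`, and `get` are hypothetical, and it assumes collisions are resolved by chaining entries in a linked list per array slot, as the repository name suggests.

```python
class Node:
    """One key/value entry in a bucket's singly linked list."""

    def __init__(self, key, value, next_node=None):
        self.key = key
        self.value = value
        self.next = next_node


class ChainedHashTable:
    """Hash table backed by a fixed-size array of linked-list buckets."""

    def __init__(self, capacity=8):
        self.buckets = [None] * capacity  # the array used as storage
        self.size = 0

    def _index(self, key):
        # Hash step: map the key to an array index.
        return hash(key) % len(self.buckets)

    def put(self, key, value):
        i = self._index(key)
        node = self.buckets[i]
        while node is not None:  # update in place if the key already exists
            if node.key == key:
                node.value = value
                return
            node = node.next
        # Otherwise prepend a new node to this bucket's chain.
        self.buckets[i] = Node(key, value, self.buckets[i])
        self.size += 1

    def get(self, key, default=None):
        node = self.buckets[self._index(key)]
        while node is not None:  # walk the chain for this bucket
            if node.key == key:
                return node.value
            node = node.next
        return default


# Keys 1, 5, and 9 share a bucket when capacity is 4 (assuming CPython,
# where small integers hash to themselves), so the chain is exercised.
table = ChainedHashTable(capacity=4)
for k, v in [(1, "one"), (5, "five"), (9, "nine")]:
    table.put(k, v)
```

Lookups stay fast only while chains stay short; a fuller implementation would typically grow the array and rehash once the ratio of entries to buckets passes some threshold, which this sketch omits.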
https://github.com/reubano/pyconza-tutorial
Jupyter notebooks and data for "Data Mining and Processing for fun and profit" PyConZA16 tutorial
data functional-programming jupyter-notebook meza pycon python tutorial
Last synced: 17 May 2026
https://github.com/anct-cartographie-nationale/mednum-cli
✨ Command-line interface for transforming data on digital mediation venues, collected in a non-standard format, into the mednum schema and publishing it on data.gouv
anct betagouv data donnees gouvernement mediation-numerique nodejs open-data transformation
Last synced: 02 Aug 2025
https://github.com/webobite/fact-chatbot
Fact chatbot is a project that reads a text file containing all the facts ahead of time and answers the user with useful information based on the facts in that file.
chatbot chatgpt chatgpt3 data data-visualization embedding-vectors generativeai nlp
Last synced: 04 May 2026
https://github.com/ditikrushna/enotes
🌻 Personal learning notes
coursera-data-science cousera data datascience machine machinelearning ml notes
Last synced: 07 Mar 2026
https://github.com/sumansuhag/prediction_model
This repository features a collection of Jupyter notebooks designed to showcase the practical applications of machine learning, data preprocessing, feature engineering, and recommendation systems. These notebooks enable users to explore, analyze, and predict business events.
algotithms artificial-intelligence data logistic-regression machine-learning-algorithms science sckiit-learn
Last synced: 28 Mar 2025
https://github.com/sumansuhag/wasserstoff-aiinterntask
Welcome to the AI Pipeline for Image Segmentation and Object Analysis project – a state-of-the-art solution designed to process, segment, identify, and analyze objects within images. This AI-powered pipeline is engineered to deliver precise insights by extracting, mapping, and summarizing data from each segmented object.
artificial-intelligence cdn data data-science modeling pipline
Last synced: 28 Mar 2025
https://github.com/sibeux/redesigned-broccoli
Repository for storing music file data
data data-center nasrulwahabi sibeux
Last synced: 24 Jan 2026
https://github.com/1sumer/mass-mail-automation
Mass Emailer is a Python-based application designed to send bulk emails efficiently using an SMTP server. Leveraging the power of the Tkinter library for the graphical user interface (GUI), this tool provides a user-friendly platform for managing and dispatching large volumes of emails with ease.
data oops-in-python python smtp-server tkinter
Last synced: 20 Aug 2025
https://github.com/nanis/unitedat
Unify data sets which consist of separate files with a common header repeated in each one.
Last synced: 12 Apr 2025
https://github.com/robsteranium/user2022-ldf-talk
Slides from my useR! 2022 talk about the Linked-Data Frames package
data data-frame linked-data r rdf
Last synced: 19 Apr 2025
https://github.com/ericgio/history-of-jazz
Data and visualizations based on Ted Gioia's "The History of Jazz"
Last synced: 28 Mar 2025
https://github.com/thedevreda/jadaerospace
A real-life project showing how to improve aircraft parts sales and help sellers focus on the most effective products at JadAero
data data-analysis data-cleaning data-visualization jupyter-notebook powerbi python
Last synced: 02 Aug 2025
https://github.com/gsmithun4/expressjs-field-validator
Plugin for validating JSON requests; middleware for expressjs
data express-js expressjs json-request middleware nodejs request rest-api validation
Last synced: 06 Mar 2026
https://github.com/mvuorre/osfdatasette
Harvest, wrangle, and serve preprint data from OSF API with Datasette
data datasette open-science preprints
Last synced: 11 Apr 2025
https://github.com/andreabozzo/andreabozzo
My personal Repo!
analytics data data-engineering data-visualization database datamodelling developer-profile github-pages github-profile go interactive-animation open-data portfolio python readme-profile rust
Last synced: 17 May 2026
https://github.com/sap-samples/sap-bdc-explore-hyperscaler-data
The repository contains detailed steps to integrate external hyperscaler data sources into SAP Datasphere in the SAP Business Data Cloud, per the open data ecosystem integration principles.
aws azure business cloud data databricks datasphere gcp hyperscalers sap
Last synced: 16 May 2026
https://github.com/dimaa1608/azurecontent
AzureContent is a repository on GitHub containing documentation and resources related to Microsoft Azure services and features. It provides clear and concise information for users seeking guidance on Azure cloud computing solutions.
azure azurecontent cloud computing content data deployment integration management networking platform security service storage virtualization
Last synced: 10 Apr 2025
https://github.com/meta-llama/synthetic-data-kit
Tool for generating high quality Synthetic datasets
data generation llm python synthetic
Last synced: 08 May 2025
https://github.com/saksham-jain177/data-analysis
A collection of data analysis and machine learning projects across various datasets. Explore predictive modeling, data visualization, and insights from real-world data. Projects include sales predictions, disease detection, customer segmentation, and more.
api data data-analysis data-cleaning data-science data-visualization datamodeling dataset datasets exploratory-data-analysis python python3 web-scraping youtube-api
Last synced: 01 May 2026
https://github.com/wellingtonmwadali/alx-low_level_programming
ALX sprint one C programming
c data datastructures linked-list loops pointers-and-arrays string structures
Last synced: 04 Apr 2025
https://github.com/ournet/news-data
Ournet news data package
data news news-data news-storage ournet storage
Last synced: 04 Apr 2025
https://github.com/ournet/quotes-data
Ournet quotes data package
data ournet ournet-quotes quotes
Last synced: 04 Apr 2025
https://github.com/hidayathamir/telegram-group-data
Data for 1,865,827 messages in a Telegram group: text, identity, datetime.
bahasa-indonesia data python3 scrape telegram telethon
Last synced: 17 May 2026
https://github.com/badranalyst/covid-deaths-and-vaccinations-sql-data-exploration
This project involves exploratory data analysis on COVID-19 deaths and vaccinations data using SQL. It aims to uncover trends, patterns, and insights related to vaccination rates and their impact on mortality. The analysis provides a clearer understanding of the pandemic's dynamics, facilitating data-driven decisions in public health.
covid-19 data data-exploration dataset sql
Last synced: 19 Feb 2026
https://github.com/basinghse/covid19simulator
Real Time Assessment and Simulation of COVID-19 - showing current numbers of cases, deaths and treated patients globally.
coronavirus covid-19 data real-time simulation visualisation visualisation-data-ingester
Last synced: 05 Apr 2025
https://github.com/antoninpvr/battery-logger
Simple scripts to record data from my laptop battery
Last synced: 17 May 2026
https://github.com/adadalshabab/machine-predictive-maintenance-classification
This repository hosts a machine predictive maintenance classification project, aimed at predicting the maintenance needs of industrial machinery before they fail. By leveraging machine learning algorithms, this project seeks to enhance operational efficiency and reduce downtime by identifying potential maintenance requirements proactively.
data data-science datanalysis datanalytics machine-learning machine-learning-algorithms matplotlib-pyplot pandas
Last synced: 17 May 2026
https://github.com/ericmaddox/nyc-crime-analytics
Analyzes and visualizes crime data from the NYC Police Department using interactive maps and heatmaps, leveraging the NYC Open Data API.
crime-analysis crimedata data datavisualization esri folium heatmap nycopendata python python3 rtcc
Last synced: 24 Jun 2025
https://github.com/istinnew/cook-me-up
[In Progress] Welcome to Cook-Me-Up! This project aims to analyze and organize cooking recipes using data analysis (Python, BigQuery SQL, Looker Studio etc.) and machine learning techniques. The goal is to simplify meal preparation and offer users a comprehensive database of culinary delights.
bigquery clustering cookme culinary data data-science dataanalysis datavisualization looker-studio machine-learning python recipe-search recipes unsupervised-learning
Last synced: 16 May 2026
https://github.com/onemoredavid/python-like-a-boss
This is where I stash my Python study material.
data data-analysis data-engineering data-science data-visualization datascience ipynb ipynb-jupyter-notebook ipynb-notebook numpy pandas python python3
Last synced: 04 Apr 2025
https://github.com/kobowood1/data-analysis-alpha
My first data analysis project
data data-analysis data-analytics data-science
Last synced: 06 May 2025
https://github.com/madhuresh2011/kulturehire-internship
☺️ Hi folks, during my internship at KultureHire, I completed a real-world Data Analyst project. I created an interactive dashboard using pivot tables, conducted a thorough analysis, and provided actionable recommendations. I'm excited to share my work and the insights I discovered.
data data-analytics data-cleaning data-standardization data-visualization excel excel-pivot-charts excel-pivot-tables genz-aspirations my-sql
Last synced: 17 Feb 2026
https://github.com/toofancodes/h1b-dashboard-insights
An interactive Tableau dashboard that visualizes H1B visa data from the USCIS Employer Data Hub, offering insights into application trends, top employers, and geographic distributions. Showcases advanced data visualization, analytics, and business intelligence skills.
analysis analytics business-intelligence dashboard data data-visualization h1b h1b-visa interactive-data tableau
Last synced: 20 Jan 2026
https://github.com/nel-zi/zipco_foods
Developed an automated ETL pipeline using Python and Apache Airflow to consolidate fragmented CSV sales data into a normalized Azure SQL database for Zipco Foods.
airflow apache-spark data dataengineering etl pyspark wsl
Last synced: 03 May 2026
https://github.com/amethyst-php/post
A comment, a note, a post, a pseudo-chat. Can be really anything
amethyst amethyst-package api data laravel post
Last synced: 17 May 2026
https://github.com/rameshaditya/dynamic-hybrid-data-grid
Facilitates faster read-and-write of large ordered collections of data.
algorithms data data-structures storage
Last synced: 23 Feb 2025
https://github.com/weecology/updating-data
Hugo website for instructions on how to make a regularly updating data pipeline
continuous-analysis continuous-integration data gh-actions living-data netlify travis-ci
Last synced: 17 Feb 2026
https://github.com/teragrep/rsm_01
Teragrep record schema mapper library for Java
data data-mining data-science datascience java-library liblognorm log-analysis log-management schema-mapper structured-data structured-logging teragrep unstructured-data
Last synced: 09 Apr 2026
https://github.com/shivamsharma32/ipl-2022-analysis
The IPL 2022 Analysis project is a data-driven exploration of the Indian Premier League (IPL) 2022 cricket tournament. The analysis focuses on utilizing Python programming and various libraries to analyze and visualize the performance of teams, players, and key metrics in the IPL 2022 season.
data dataana dataanalytics datavi matplotlib python
Last synced: 17 May 2026
https://github.com/aguven6/inmemory-data-processor
Converts tabular data to columnar data with an index. The aim is to process huge data sets faster, especially in aggregation operations.
columnar-storage data data-structures parallel-computing parallel-programming processing
Last synced: 17 May 2026
https://github.com/kulgan/justobjects
It's all just objects
data json-schema justobjects objects parsing python python3 validation
Last synced: 10 Jul 2025
https://github.com/simranjeet97/kaggle_pokemon_datset_eda-dashboard
Full EDA and Dashboard of Kaggle Pokemon Dataset with Live Streaming Data and Images
cloud data data-science dataanalytics machine-learning machine-learning-algorithms pokemon pokemon-dataset pokemon-prediction python science
Last synced: 07 May 2026
https://github.com/denisecase/cintel-03-data
Getting started with interactive data analytics in Python
analytics data interactive python shiny
Last synced: 11 Apr 2025
https://github.com/denisecase/buzzline-04-case
Adding live visualizations to streaming data applications
animation data kafka matplotlib python streaming
Last synced: 11 Apr 2025
https://github.com/praveendecode/data-analysis
Implemented data analysis projects with interactive Streamlit UI for user-friendly data exploration and insights presentation
data data-science dataanalysis exploratory-data-analysis insights python streamlit-dashboard tableau tableau-public
Last synced: 04 Apr 2025
https://github.com/ciscorn/japanmesh-rs
A Rust library for handling Japanese Grid Square Code (JIS X 0410:2002 地域メッシュコード, "regional mesh code")
census data geospatial japan rust
Last synced: 11 Jan 2026
https://github.com/pulipulichen/pts-local-news-dataset
A dataset containing local news from Public Television Service.
Last synced: 27 Mar 2026
https://github.com/stkisengese/numpy-data-fundamentals
A comprehensive collection of NumPy exercises covering array manipulation, slicing, broadcasting, random data generation, and real-world data analysis applications.
data data-analysis numpy pre-processing
Last synced: 16 May 2026
https://github.com/rd-uk/rduk-data-sqlite
SQLite Data Provider implementation for rduk-data
Last synced: 16 May 2026
https://github.com/amethyst-php/taxonomy
amethyst amethyst-package api data laravel taxonomy
Last synced: 18 Jan 2026
https://github.com/sharmadhiraj/plot-pi
Graphical Representation of PI
data data-visualization html javascript js mathematics plot
Last synced: 28 Mar 2025
https://github.com/naufalbasara/superstores-pipeline
Data Pipeline on Dummy E-commerce with Apache Airflow
airflow data data-engineering data-pipeline data-warehouse postgresql
Last synced: 16 May 2026
https://github.com/ellisvalentiner/legislation-embeddings
Embeddings for U.S. Congress legislation
data embeddings machine-learning nlp python
Last synced: 12 Aug 2025
https://github.com/paulveillard/cybersecurity-analytics
An ongoing collection of awesome software, libraries, learning tutorials, documents and books, technical resources and cool stuff about Analytics Engineering in Cybersecurity.
analytics bigdata bigquery cybernetics cybersecurity data data-engineering data-science encryption encryption-decryption seo seo-friendly seo-optimization
Last synced: 28 Mar 2025
https://github.com/ranjeetj06/insighthub
InsightHub is a data analytics project that helps automate the entire process of preparing, analyzing, and reporting on CSV data.
analysis begineer data springboot
Last synced: 17 May 2026
https://github.com/hyfi06/unam-careers
A utility package for retrieving career information from UNAM.
Last synced: 16 May 2026
https://github.com/eloyhere/semantic-java
Semantic-Java is a modern Java stream processing framework with zero dependencies. It blends the fluency of Java Streams, the laziness of JavaScript generators, and index-based control inspired by database indexing. It is aimed at time-series, event streams, and high-performance data pipelines, and is available as a Maven dependency.
data functional functional-programming java pipeline stream
Last synced: 07 Apr 2026
https://github.com/erictleung/2018-new-coder-survey
:beginner: Code to wrangle data from the 2018 New Coder Survey by freeCodeCamp
data data-cleaning dataset freecodecamp new-coders-survey programmers
Last synced: 03 Apr 2025
https://github.com/chrisrobertsjr/chrisrobertsjr
Welcome to my Github Profile!
data data-analysis java r sql statistics
Last synced: 03 May 2026