data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum
- Last updated: 2026-07-03 00:07:49 UTC
https://github.com/elcarrillo/structpy
StructPy is a Python-based command-line tool designed for academics and scientists to manage data projects effectively. It simplifies workflows by creating structured project directories, generating timestamped filenames, validating datasets, and backing up projects seamlessly.
command-line-tool data database file-structure organization python science-tool
Last synced: 24 Apr 2026
https://github.com/coryson/osm-mla-finder
Python script to locate institutions employing Medical Laboratory Assistants in Germany, developed for BTZ – Berufliche Bildung Köln GmbH. It uses OpenStreetMap, SerpAPI, and web scraping to find and verify relevant labs, clinics, and diagnostic centers.
beautifulsoup data openstreetmap osm python scraping serpapi webscraping
Last synced: 24 Apr 2026
https://github.com/cyberoctane29/python-for-data-analysis
A repository dedicated to learning Python for data analysis, data science, and data analytics. This collection of Jupyter notebooks covers practical exercises and concepts from the Google Advanced Data Analytics Professional Certificate program.
data data-analysis data-analytics data-science python
Last synced: 24 Apr 2026
https://github.com/aneeshmurali-n/nlp-emotion-classification-in-text
Develop machine learning models to classify emotions in text samples.
bag-of-words data emotion-classification feature-extraction machine-learning naive-bayes natural-language-processing nlp nltk preprocessing python scikit-learn svm text-classification tf-idf tokenizer vectorizer
Last synced: 10 May 2026
https://github.com/marielachirinosr/cyclistic-data-analytics-project
This project explores user behavior within a fictional bike-sharing system, modeled after Cyclistic, operating in Chicago.
data data-visualization pandas powerbi-report powerbi-visuals python
Last synced: 24 Apr 2026
https://github.com/bscript07/softuni-javascript-applications
JavaScript for Applications course at SoftUni, Oct 2023
architecture-component authentication client-side-rendering-seo data lit-html-template routing
Last synced: 15 Mar 2025
https://github.com/yashaswitir28/yashaswitir28.github.io
This is my Portfolio Website
data data-analysis-python data-analyst data-cleaning data-science data-visualization excel html-css ms office365 portfolio-website powerbi python sql
Last synced: 29 May 2026
https://github.com/datasqlsantosh/global-energy-consumption-renewable-generation-python-data-analysis-portfolio
This project analyzes global energy consumption patterns and trends in renewable energy generation using Python data analysis libraries such as Seaborn and NumPy. The analysis explores energy consumption data from regions worldwide and examines the contribution of renewable energy sources over time.
data data-analysis data-visualization pandas seaborn
Last synced: 10 May 2026
https://github.com/desininja/weather-data-etl-pipeline
ETL pipeline using Apache Airflow
apache-airflow aws cicd dags data data-engineering etl glue-job mwaa pyspark redshift
Last synced: 25 Apr 2026
https://github.com/xjwllmsx/hacker-news-engagement
Analyze Hacker News data to reveal which post types and posting hours spark the most discussion, using Python and a reproducible Jupyter notebook.
data data-analysis jupyter python
Last synced: 25 Apr 2026
https://github.com/mlkav/tri-hita-karana
Project Tri Hita Karana - Future Knowledge G20 Bali. DTS Kominfo x Binar Academy.
bali data data-science g20 science
Last synced: 06 Jun 2026
https://github.com/adrianoleitedasilva/adrianoleitedasilva
My name is Adriano. I am 35 years old and have dedicated 18 of those years to the fields of Information Technology and Education.
adrianoleitedasilva automation ceo cio cto data data-science dev diretor github mobile professor python readme techlead web
Last synced: 10 May 2026
https://github.com/smeltier/data-structures-c
This repository contains C language implementations of the main data structures covered in the Algorithms and Data Structures course. The implementations were developed as part of my hands-on learning process and include sequential lists, linked lists, and other fundamental structures.
algorithms algorithms-and-data-structures c c-language c-programming data data-structures data-structures-c structures-c
Last synced: 16 May 2025
https://github.com/marielachirinosr/hotel-data-analysis
Pandas & Matplotlib Learning Analysis. Repository featuring data analysis projects using Pandas and Matplotlib libraries
data data-analysis matplotlib pandas python
Last synced: 25 Apr 2026
https://github.com/farrelfaricaf/exploratorydataanalyst---titanic
This project analyzes the Titanic dataset using exploratory data analysis (EDA) and visualization techniques to identify survival patterns. The goal is to understand how demographic factors like gender and age influenced survival rates during the 1912 disaster.
data data-analysis data-science data-visualization eda python titanic-dataset
Last synced: 31 Jul 2025
https://github.com/anuraganalog/blog
Data Science Blog
anuraganalog blog data science
Last synced: 26 Apr 2026
https://github.com/eslamdyab21/data-visualization-using-matplotlib-and-seaborn
This is the final project in the Udacity nanodegree program. It is about data visualization.
data data-analysis data-visualization matplotlib pandas python seaborn udacity udacity-data-analyst-nanodegree
Last synced: 09 May 2026
https://github.com/hemangsharma/assignment-2---classification-models
The Assignment 2 - Classification Models repository contains the project for 36106 Machine Learning Algorithms and Applications.
data datascience-machinelearning machine-learning ml
Last synced: 10 Jun 2026
https://github.com/gaemapiracicaba/norma_dec_8468-76
Quality and effluent discharge standards for inland waters
Last synced: 19 Apr 2026
https://github.com/infinitode/crsd
A synthetic customer review sentiment dataset for sentiment analysis generated using different AI models.
ai data dataset datasets huggingface-datasets mit-license ml nlp open-source python sentiment sentiment-analysis sentiment-classification text-data
Last synced: 10 Jun 2026
https://github.com/tsbarr/citi-bikes-challenge
Citibikes NYC Data Analysis: Uncover insights from over a decade of ride data. Jupyter notebook for data aggregation/cleaning & Tableau dashboards for interactive visualization.
data data-visualization pandas-python python tableau
Last synced: 27 Apr 2026
https://github.com/ioanzicu/batch_loading_one-to-many_data_model
Unesco Batch Loading One-to-Many Data using Django
Last synced: 27 Apr 2026
https://github.com/2022-04-11588/data-fakes
🔍 Generate realistic fake data for testing and development, enhancing your projects with simple, customizable data solutions.
data dataset developer-tools fake-content faker fakery groovy java mock phoenix python random ruby seeding struct swift-framework test-data testing
Last synced: 11 Apr 2026
https://github.com/amethyst-php/subscription
amethyst amethyst-package api data laravel subscription
Last synced: 27 Apr 2026
https://github.com/atiqurcode/scrap-spec
Scrape data from HTML into HTML table code or JSON
data html-table json-data scarp
Last synced: 05 Feb 2026
https://github.com/anuppm9917/super-store-sales-analysis-power-bi-project
Driven to find out which products, regions, categories, and customer segments a company should target or avoid, I searched for and selected a suitable Kaggle dataset that matches the needs of a typical superstore.
data data-analysis data-visualization datacleansing excel exploratory-data-analysis jupyter-notebook numpy pandas plotly powerbi python3
Last synced: 10 Apr 2026
https://github.com/meokullu/prefill
PreFill adds desired characters onto output values to increase their legibility.
alignment data data-analysis data-engineering data-science legibility
Last synced: 17 Jan 2026
https://github.com/juangesino/research-project
Course files for Research Project @ University of Amsterdam
data data-science economics stata
Last synced: 02 Jan 2026
https://github.com/srgchrksv/stream-crypto
Crypto trades streaming with azure services
azure binance crypto data databricks dataengineering pyspark python streaming websocket
Last synced: 30 Apr 2026
https://github.com/gurpreet0022/crop-fertilizers-recommendation-system-using-ml-
This repository is a part of AICTE - Shell Internship on 'Green Skills using AI technologies' Cycle 3.
data datapreprocessing datavisualization jupyter-notebook machine-learning python
Last synced: 27 Apr 2026
https://github.com/schenkd/tweetminer
Data Miner for Twitter Streaming API
data dataminer datamining java twitter twitter-api twitter4j
Last synced: 07 Jun 2026
https://github.com/mladen/ds-ml-and-ai-experiments
🔢 My Data Science, Machine Learning, and Artificial Intelligence experiments and projects
data data-mining data-science datascience dataset
Last synced: 09 Jun 2026
https://github.com/q-aware-labs/bias-insights
Bias detection project for the Chicago Face Database (CFD)
ai chicago-data-portal data data-science llm statistical-analysis
Last synced: 21 Jan 2026
https://github.com/fatihemres/Fruits
Fruit details app built with SwiftUI, demonstrating data, models, animation, and a practical onboarding flow.
animations data models onboarding swift swiftui
Last synced: 31 Aug 2025
https://github.com/woctezuma/recent-sales-data
Data available to estimate sales of Steam games during release week.
Last synced: 05 Feb 2026
https://github.com/soenneker/soenneker.cloudflare.origincerts.thumbprints
The current Cloudflare origin certificate thumbprints
cloudflare csharp data dotnet origincerts thumbprint thumbprints
Last synced: 23 Apr 2026
https://github.com/tacticalnuclearraccoon/dataviz_with_js
Sample data visualization as part of a training on JavaScript frameworks for dataviz
d3 data datawrapper echarts javascript visualization
Last synced: 27 Apr 2026
https://github.com/vatshayan/pokemon-analysis
Visualization, Analysis & Predicting the accuracy of finding Pokemon power, attack & speed through Machine Learning
artificial-intelligence data data-analysis data-science data-visualization dataset machine-learning machine-learning-algorithms pokemon scikit-learn
Last synced: 30 May 2026
https://github.com/soenneker/soenneker.datatables.attributes.column
A C# attribute for Datatables.js column building
attributes column columns csharp data datatablecolumnattribute datatables dotnet mapping object
Last synced: 12 Mar 2026
https://github.com/zazza123/hamana
A Python library for seamless data extraction, storage, and SQL-based analysis using pandas and SQLite.
Last synced: 14 Jan 2026
https://github.com/fcoagz/rate-reader-epv
pyDolarVenezuela API utilities, image processing (EnParaleloVzla) to extract currency exchange rates from specific platforms, validating content against expected patterns
data finance json processing-images pydolarvenezuela
Last synced: 14 Jun 2025
https://github.com/microsoftcloudessentials-learninghub/demosscenarios-techtalks
This repository showcases demonstrations and scenarios using Microsoft Cloud technologies. Please note that these demos are intended as a guide and are based on my personal experiences.
ai analytics azure copilot data data-science fabric m365 microsoft-general ml powerapps powerbi privatebot security sharepoint
Last synced: 14 Mar 2026
https://github.com/yuvrajsaraogi/car-price-prediction-with-machine-learning
The price of a car depends on a lot of factors like the goodwill of the brand of the car, features of the car, horsepower and the mileage it gives and many more. Car price prediction is one of the major research areas in machine learning. So, if you want to learn how to train a car price prediction model then this project is for you.
car-price-prediction-with-machine-learning data data-science deep-learning deep-neural-networks engineer github learning machine-learning mini-project natural-language-processing prediction predictive-modeling project python3 sql
Last synced: 15 Apr 2026
https://github.com/petzi53/repairdata
Open Repair Alliance Datasets 2021
data open-data open-datasets r repair repair-cafe repairs
Last synced: 22 Jun 2026
https://github.com/chompfoods/stub-inflector
Inflector server stub for the Chomp Food & Recipe Database API. Use our API to get high-quality data on recipes and 875,000+ branded/grocery foods plus raw ingredients.
api branded chomp data database food grocery inflector ingredients nutrition raw recipe-api recipes server stub stub-inflector stub-server
Last synced: 27 Apr 2026
https://github.com/kashifkhan7/cleaning-analysis_cli
Analyze sales data easily with our CLI app. Gain insights on revenue trends and visualize results using Python, Pandas, and Matplotlib. 🚀📊
conditional-statements css data datacleaning exception-handling exiftool html json matplotlib-pyplot metadata metadata-extraction pandas-python python sales-analysis seaborn-python speech-to-text transcription youtube
Last synced: 13 Apr 2026
https://github.com/gurpreet0022/airbnb-eda
EDA on Airbnb booking data to uncover valuable insights, trends, and patterns
data data-science dataanalytics insights jupyter-notebook matplotlib numy pandas projects python3 seaborn visualization
Last synced: 11 May 2026
https://github.com/newrelic-experimental/newrelic-java-atomikos
Gives status of Atomikos Data Sources since this information is unavailable via JMX
atomikos data instrumentation java nrlabs nrlabs-data nrlabs-java-verify nrlabs-odp observability-data
Last synced: 30 May 2026
https://github.com/justinyahin/wpdf
Create, filter, sort and display users data on your WordPress site.
Last synced: 18 Apr 2026
https://github.com/oguzhanfatihkucuk/data-analytics-project-kafka-spark
The data in this project was collected in a database using Apache Kafka and processed with Apache Spark Streaming. The project aims to create a forecasting model and analyze sales forecasts per customer.
big-data data data-visualization hadoop kafka ml mlpipeline plt pyhton spark
Last synced: 28 Apr 2026
https://github.com/elkingarcia11/mlb-gameday-obp-odds
Small Python script that pulls MLB team on-base percentage (OBP) for the current season, loads today’s schedule, and writes CSV files that list each team’s OBP edge against its opponent for the day. It also labels each side of a game as betting favorite, not favorite, or equal using American moneylines from ESPN’s public game data.
api csv data http https json mlb mlb-stats-api moneyline odds python rest sports urllib
Last synced: 30 May 2026
https://github.com/priyapuranik/data-analytics-using_python
Analyzes hotel data to find meaningful insights, including booking patterns, seasonal trends, and more.
data pandas python sql visualization
Last synced: 06 Apr 2026
https://github.com/nevoland/unchangeable
🧊 Tools for immutable values.
data datastructure functional immutable persistent pure stateless
Last synced: 24 Jul 2025
https://github.com/ttozatto/sparkify
Churn Prediction for music streaming app with PySpark
analysis churn data learning machine predictive pyspark science spark
Last synced: 16 Jan 2026
https://github.com/peterhellberg/bugsnag-data
Dump Bugsnag data using the Data access API
Last synced: 22 Jun 2026
https://github.com/sidneyarcidiacono/data-parser
A node module designed to make reading in large files as easy as calling one function.
Last synced: 05 May 2026
https://github.com/nmelgar/birthday_sports_dataviz
We analyze how the Matthew effect has influenced professional sports players.
analysis csv data data-analysis data-science data-visualization datavisualization dataviz probability research tableau
Last synced: 08 Jan 2026
https://github.com/yeti-robotics/past-scouting-data
❄️ Scouting Data from Previous Events/Seasons ❄️
Last synced: 06 Jan 2026
https://github.com/amazingandyyy/dataviz
amazingandyyy data data-visualization
Last synced: 08 Jan 2026
https://github.com/equinor/fmu-sumo-uploader
Upload to Sumo in the FMU context
data fmu python subsurface sumo
Last synced: 06 May 2026
https://github.com/rosette-api/mock-data
Mock data that is used for unit testing of the Babel Street Analytics bindings
data entity-extraction entity-level-sentiment entity-linking entity-relationship entity-resolution language-detection machine-learning mock-data morphology natural-language-processing nlp relation-extraction sentiment-analysis test-framework testing text-mining text-processing tokenization
Last synced: 04 Mar 2026
https://github.com/jstafford5380/provausio.testing.generators
Generate fake data for testing and/or mocking
data fake-data generator testing
Last synced: 14 Jan 2026
https://github.com/zoekelepiri/winedataprediction
A machine learning application in wine quality prediction
data descriptive-statistics machine-learning-algorithms
Last synced: 05 Jan 2026
https://github.com/anand-sony/mttr-dashboard
Streamlit dashboard for MTTR analysis with shift-wise loss insights and machine-level downtime tracking.
analytics business-analytics dashboard data python statistical-analysis
Last synced: 30 May 2026
https://github.com/reshmaaiman/fifa
FIFA20
data data-science data-visualization dataanalysisusingpython github jupyter-notebook matplotlib numpy pandas python seaborn-python
Last synced: 10 Apr 2026
https://github.com/nafisalawalidris/nafisalawalidris
Configuration files for my GitHub profile. Welcome to my GitHub profile! I'm Nafisa Lawal Idris, a passionate Data Scientist with a strong interest in blockchain technology. Explore my GitHub portfolio to delve into the exciting world where data science and Bitcoin converge.
artifical-intelligence bitcoin config data data-science developer github-config github-pages machine-learning
Last synced: 16 May 2026
https://github.com/carlosrs14/parallel-data-preprocessig-system
A parallel data preprocessing system using threads and synchronization mechanisms (barrier, busy-waiting, condition variables) to clean and prepare data for AI training.
barrier-method c condition-variable data operative-systems parallel-computing posix preprocessing synchronization threads
Last synced: 24 Jul 2025
https://github.com/alimghmi/bdlc
Bloomberg API integration, handling data requests, processing, and SQL database insertion.
api-client bloomberg data data-processing financial-data oauth2 python sql-database transformation
Last synced: 10 Jun 2026
https://github.com/luminati-io/jupyter-notebooks-web-scraping
Perform web scraping interactively using Jupyter Notebooks, integrating coding, data analysis, and visualization into one seamless workflow.
beautifulsoup4 data jupyter jupyter-notebook pandas python requests seaborn virtual-environment web-scraper web-scraping
Last synced: 13 Apr 2026
https://github.com/hoijui/osh-dir-std
Open Source Hardware directory standard(s)
data fchh interfacer-project-eu interfacer-project-eu-wp4-3 oseg specification standard
Last synced: 28 Apr 2026
https://github.com/tks18/xl-pq-handler
A Pythonic Power Query (.pq) File Manager for Excel & Power BI Automation
analytics automation data excel power-query powerbi python xlwings
Last synced: 20 Jan 2026
https://github.com/lancewalk87/cls-cloud-sync-ruby-on-rails
Software | SQL Database with automated Cloud Sync for mitigating lost data across dist. servers. Managed by Ruby on Rails.
cloud-computing cloud-storage data database ruby ruby-application ruby-on-rails server sql
Last synced: 24 Jul 2025
https://github.com/dcmox/algorithms
General purpose data structures and algorithms
algorithms binary data hash linked list structures tree
Last synced: 10 Jun 2026
https://github.com/davitshahnazaryan3/data-management-web
Explore datasets with ease using taxonomy filtering, allowing you to quickly identify the specific experimental datasets you need and download them effortlessly
data environmental experiments filtering-data seismic taxonomy
Last synced: 17 Jan 2026
https://github.com/aminnairi/node-decode
Check that your data meet your expectations
check data decode expectations schema
Last synced: 22 Apr 2026
https://github.com/fehmitahsindemirkan/web-scrapper
Professional, high-performance web scraping project.
data ecommerce emailsender fileexplorer logging python web webscraping
Last synced: 10 Jan 2026
https://github.com/mightymetrika/holi
holi: Higher Order Likelihood Inference Web Applications
data data-science r statistics
Last synced: 10 Feb 2026
https://github.com/shreeparab1890/indian-elections-2019-analysis-eda
This IPython notebook is an exploratory data analysis (EDA) of the 2019 Indian Lok Sabha elections.
data data-analysis data-science data-visualization eda exploratory-data-analysis matplotlib numpy pandas plotly python python3 visualization
Last synced: 28 Apr 2026
https://github.com/boytchev/coursedataviz
Supplementary materials for "Data Visualization" course
Last synced: 16 Mar 2025
https://github.com/blackroad-os-inc/blackroad-portal
BlackRoad Portal — unified search routing to 30+ BlackRoad services.
blackroad cloudflare-workers data search
Last synced: 04 Apr 2026
https://github.com/halyusa16/mysql-employee-analysis
This project focuses on analyzing employee data through querying, performing table joins to connect related information, aggregating salary statistics, and using subqueries to extract meaningful insights.
data data-analytics data-exploration database mysql self-project sql
Last synced: 20 Jan 2026
https://github.com/idhruvs/angular4-smart-table-demo
Angular4 Smart Table Demo Project
angular4 data tables typescript
Last synced: 21 Apr 2026
https://github.com/mnkanout/patients_medication_prediction
The aim of the project is to create a model that can help medical professionals select the proper medication for patients based on their symptoms. The model uses historical data of other patients to predict what could be the most suitable medication based on the patient's symptoms.
data data-analysis data-science data-visualization decision-tree-classifier machine-learning python3
Last synced: 29 Jun 2025
https://github.com/karosi12/ng-data-share
Angular communication with input and output properties
angular communication data data-binding input output sharing typescript
Last synced: 16 Jan 2026
https://github.com/gagolews/datafusion
Data Fusion (open-access research monograph, 2015)
aggregation data fusion fuzzy-logic mean multidimensional-analysis multidimensional-data spread statistics strings variance
Last synced: 16 Mar 2025