data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum
- Last updated: 2026-06-30 00:07:50 UTC
https://github.com/mehmetkahya0/earthquake-tracker
Earthquake Tracker: a real-time earthquake monitoring application that visualizes seismic activity worldwide using interactive maps and data visualization.
ai api css cursor data data-vizualisation earth-observation earthquake earthquake-data earthquake-visualization earthquakes html js modern-web scrape ui ui-design web
Last synced: 15 Apr 2026
https://github.com/ymorsi7/quranicvisualization
A visual exploration tool for the Holy Quran using D3.js treemaps.
css d3 d3js data data-visualization html islam islamic javascript js quran quranic treemaps visualization
Last synced: 15 Apr 2026
https://github.com/mahtabranjbar/onlineshopping_analysis_dashboard
This project analyzes online shopper behavior using various machine learning models and EDA techniques.
dashboard data dataanalysis eda machine-learning streamlit
Last synced: 08 Feb 2026
https://github.com/sourceduty/text_file_metadata
📄 Extract metadata from .txt files and record the metadata in .txt files.
data datascience metadata metafile practice sourceduty
Last synced: 08 Aug 2025
https://github.com/tanyagarg25/project_covidanalysis
A project that analyzes COVID-19 data with SQL queries and visualizes the results in Tableau.
analysis dashboard data data-visualization sql tableau
Last synced: 08 Feb 2026
https://github.com/ethenkem/pygraphsurvey
A Python-based web app that provides graphical analysis of data collected from surveys. The system has its own built-in form filling: an admin can set questions and send a link for the forms to be filled in, and the system then provides analysis of the collected data. Form features include selection options, range values, file inputs, etc.
Last synced: 12 Jan 2026
https://github.com/sourceduty/language_barriers
🔤 Language barriers between the world's 7,000 languages.
communication concept data idea info information language language-barrier language-barriers languages project research
Last synced: 11 Feb 2026
https://github.com/michaelfromyeg/lyrics
Lyric-store and API hosted on Git.
Last synced: 08 Feb 2026
https://github.com/danicaalana/wine-dataset-decision-tree
This project was developed as part of Digital Skill Fair (DSF) 35.0 - Data Science by Dibimbing. It uses the Wine Recognition Dataset from scikit-learn, which contains the results of a chemical analysis of wines grown in the same region in Italy by three different cultivators.
data data-analysis-python data-science decision-tree-classification machine-learning python scikit-learn wine-dataset
Last synced: 18 Apr 2026
https://github.com/chandansoren/financial-budget-analysis
Financial budget for 2021
Last synced: 29 Apr 2026
https://github.com/infinitode/crsd
A synthetic customer review sentiment dataset for sentiment analysis generated using different AI models.
ai data dataset datasets huggingface-datasets mit-license ml nlp open-source python sentiment sentiment-analysis sentiment-classification text-data
Last synced: 10 Jun 2026
https://github.com/machinecyc/lotteryinsight
Uses a crawler to collect Taiwan Lotto data and saves it to a local MySQL server.
crawler data docker lottery mysql-database python3 taiwan
Last synced: 09 May 2026
https://github.com/prajakta1321/streetml-a-cityscape-traffic-volume-prognostication
StreetML leverages machine learning techniques to revolutionize urban traffic prediction through precise volume prognostication, aiming to enhance cityscape mobility through data-driven insights.
catboostregressor data datavisualisation exploratory-data-analysis lightgbm-regressor linearregression machine-learning machine-learning-algorithms predictive-analytics random-forest-regression xgboost-regression
Last synced: 08 Apr 2025
https://github.com/bishtrishu/netflix_movies_dashboard
This project is a comprehensive dashboard for analyzing Netflix movies and shows. Using a combination of Power BI, Python, and Excel, this dashboard provides insights into various aspects of Netflix's content library.
ai artifical-intelligense dashboard data dataanalysis dataanalyst dataanalytics datacleaning datahandling datascience datavisualization excel machine-learning msexcel powerbi report
Last synced: 09 Feb 2026
https://github.com/dativo-io/dativo-ingest
big big-data data data-ingestion etl etl-framework gitops iceberg ingest nessie self-hosted
Last synced: 26 Feb 2026
https://github.com/manishjanky/wrangle-weratedogs-dataset
A data wrangling project done as part of Udacity DAND
data data-wrangling twitter udacity udacity-data-analyst-nanodegree udacity-nanodegree weratedogs
Last synced: 15 Apr 2026
https://github.com/samaalharbi2/project-recommendation-system
This project focuses on building a Recommendation System using real interaction data from IBM's Watson Studio platform.
clustering data ibm-watson kmeans nlp python rec svd udacity-nanodegree
Last synced: 09 Feb 2026
https://github.com/koltyakov/pgcopy
🐘 PostgreSQL data migration tool
cli data database golang migration postgresql sync
Last synced: 29 Apr 2026
https://github.com/diegoperea20/pytorch-vs-tensorflow
Tests the differences between the PyTorch and TensorFlow libraries across prediction and classification applications; each performs better depending on the problem or data set assigned.
classification data images prediction pytorch tensorflow
Last synced: 29 Apr 2026
https://github.com/tazeenrashid/orders-analysis-using-python-sql-server-and-tableau
I sourced some Orders data from Kaggle, did EDA using Python, and then drew insights from the cleaned data using SQL Server (SSMS). Then, I built a Tableau Dashboard for some visual insights. Have a look and share your feedback!
analytics data eda jupyter-notebook python sql tableau
Last synced: 29 Apr 2026
https://github.com/devcsrj/docparsr-jvm
JVM client for https://github.com/axa-group/Parsr
data document extraction nlp ocr pdf
Last synced: 08 Jun 2026
https://github.com/ompreetham/data-structures
binary-search-tree c data data-structures datastructures graph linked-list list stack structures tree
Last synced: 25 Mar 2025
https://github.com/neurazum-ai-department/tumor-stages-dataset---v1
Synthetic MRI data generated by the 'HF' and 'Vbai' models based on real data.
brain data dataset datasets image mri neuroscience tumor tumor-segmentation
Last synced: 18 Mar 2026
https://github.com/wireservice/workbench-lookup
A port of `agate-lookup` to Workbench
data journalism lookup workbench
Last synced: 08 Jun 2026
https://github.com/ludreinsalvador/global-covid-19-data-analysis
Contains Power BI dashboards that visualize and analyze global COVID-19 cases, deaths, and vaccination trends using data from the World Health Organization (WHO). The project aims to provide insights into the pandemic’s impact and vaccination progress worldwide through dynamic reports and advanced analytics.
analytics covid-19 covid19-data data data-analysis data-collection data-transformation data-visualization
Last synced: 26 Feb 2026
https://github.com/fnu-ankit/nyc_parking_violation
data dataengineering dbt githubactions python
Last synced: 16 Apr 2026
https://github.com/dysnomia-studio/achieve-games-dump
Public dump of parts of the achieve.games database, including the Steam games list
data dump games steam steam-api steam-game steam-games
Last synced: 27 Feb 2026
https://github.com/enescidem/twitter-topic-modeling
Topic modeling is an unsupervised method to identify topics in text. This project analyzes tweets from prominent Turkish accounts to uncover underlying themes in their shared content.
data data-science machine-learning nlp topic-modeling twitter x
Last synced: 10 Feb 2026
https://github.com/v41bh4vr4jput/data-analysis-with-python
This repository is a comprehensive collection of data analysis projects and tutorials using Python's most powerful libraries: NumPy, Pandas, Seaborn, and Matplotlib. It is designed to help you explore, clean, visualize, and analyze data efficiently.
api data data-analysis data-visualization matplotlib numpy pandas python sakila-db seaborn
Last synced: 09 Apr 2026
https://github.com/cosmos-loops/cosmos-data
Cosmos.Data is an inline project of the COSMOS LOOPS PROGRAMME that provides extensions for several SQL-Query, RMDB/ORM and No-SQL components.
connection-pool data mysql mysqlconnector oracle postgresql sqlite sqlkata sqlserver transaction uow
Last synced: 12 Apr 2026
https://github.com/miraclx/split-merge
Efficient, flexible data stream chunker and merger
chunk data efficient merge middleware nodejs pipeline split stream
Last synced: 07 May 2026
https://github.com/softloud/spunk
Nutritional interventions for male infertility: a systematic review and meta-analysis
Last synced: 18 Mar 2026
https://github.com/os-climate/data-requests
This repo is used to track issues related to new Data Requests
Last synced: 27 Feb 2026
https://github.com/paladini/aa-daily-reflections-database
Alcoholics Anonymous (AA) Daily Reflections in English, Spanish, French and Brazilian Portuguese
aa alcoholics-anonymous daily-reflections data database reflections
Last synced: 16 Apr 2026
https://github.com/syed-bakhtawar-fahim/datavisualization
Data Visualization with Python
big-data-analytics data data-analysis data-analysis-python data-science data-visualization pandas pyspark
Last synced: 30 Apr 2026
https://github.com/vatshayan/songs-datasets
Datasets for Songs and Music for Dancing, Emotional, Happy and scenic view
1000dataset classfication csv data datapackage datapackages dataset datasets excel free freedata freedatasets genre machine music sgenre song songs
Last synced: 18 Mar 2026
https://github.com/eng-gabrielscardoso/data-science-formation
Data science course walkthrough
data data-science data-visualisation google-colab google-colaboratory google-colaboratory-notebooks python r r-lang
Last synced: 28 Feb 2025
https://github.com/haimonmon/j3mify
Convert jejemon words into formal sentences or words
data jejemon nlp normalization python regex tagalog tokenization
Last synced: 12 Oct 2025
https://github.com/soenneker/soenneker.extensions.httprequestdatas
A collection of helpful HttpRequestData (Functions) extension methods
azure csharp data dotnet extension extensions function http httprequest httprequestdataextension httprequestdatas request
Last synced: 21 Apr 2026
https://github.com/samiksha29-patil/hr-employee-data-analysis-visualization-in-python
This project focuses on analyzing an HR Employee Dataset that contains details about employees such as demographics, job status, salaries, performance reviews, satisfaction levels, and attrition reasons.
csv-files data data-visualization dataanalysis matplotlib numpy pandas python seaborn
Last synced: 30 Apr 2026
https://github.com/ddeepanshu-997/support_vector_regression--svr-
In this repository I performed support vector regression on real-life data. I first applied data preprocessing techniques to filter out flaws in the data, then built the model, i.e. an SVM regression, to produce a machine learning regression model.
data data-science regression-analysis regression-models svm-model svm-regression
Last synced: 03 Aug 2025
https://github.com/omarsaad21/it-salary-eda
A Python EDA project on IT department salary data: we explored the data and built visualizations to answer some questions about the dataset.
data explotary-data-analysis juypter-notebook numpy pandas python visualization
Last synced: 30 Apr 2026
https://github.com/miozilla/snowden
snowden ⛄🎮: VR Game # Snowflake # Data Engineering # ELT
data elt engineering snowflake sql vr-game
Last synced: 11 Feb 2026
https://github.com/ppabam/eda-bam
Navigating data from one thing to another.
Last synced: 11 Feb 2026
https://github.com/amethyst-php/user
amethyst amethyst-package api data laravel user
Last synced: 12 Apr 2026
https://github.com/anandanraju/power_bi_dashboard_projects
The goal of this project is to provide insights into consumer behavior and purchasing trends across different platforms. By analyzing data from Amazon and other sources, we aim to uncover valuable insights that can inform marketing strategies, product development, and decision-making processes.
amazon dashboard data data-visualization healthcare powerbi project
Last synced: 11 Feb 2026
https://github.com/praveendecode/retail-revenue-forecasting
Designed an end-to-end ML model pipeline, forecasting department-wide sales by accounting for holiday markdown effects, spanning data collection to inferencing.
azure collection data datapreprocessing docker exploratory-data-analysis feature-engineering featureimportance model modelbuilding modeldeployment modelselction python report tableau
Last synced: 16 Apr 2026
https://github.com/jk-oster/laravel-collection-trend
Generate trends from collections. Easily generate charts or reports.
charts collections data laravel php reports trends
Last synced: 03 Aug 2025
https://github.com/pbinkley/tweets-national-emergency-library
A twarc harvest of tweets related to Internet Archive's National Emergency Library (2020-03-23 to 2021-02-13)
Last synced: 11 Feb 2026
https://github.com/kunalthakur204/visualization-on-flower
🌸 Flower Dataset Visualization: visualizing patterns and relationships in flower data through charts and plots. Perfect for exploring floral characteristics and trends! 📊
data data-visualization dataanalysis flowerdataset python
Last synced: 16 Apr 2026
https://github.com/mmaithani/kaggle-projects
Collection of all the resources from the competition, kernel, and data sections, plus all the magic code I have been using to get the most out of a problem
computer-vision data data-science image-processing machine-learning python
Last synced: 30 Apr 2026
https://github.com/raphcodec/rand-org-generator
Rand-Org-Generator attempts to mimic real company structures. The dummy data generated by this project is intended to be used in analytics projects or web projects.
data duckdb factory-boy faker org-chart polars python3
Last synced: 30 Apr 2026
https://github.com/writetome51/public-data-container-interface
Just a TypeScript interface with 1 property: 'data'
container data interface typescript
Last synced: 15 May 2026
https://github.com/teragrep/rsm_01
Teragrep record schema mapper library for Java
data data-mining data-science datascience java-library liblognorm log-analysis log-management schema-mapper structured-data structured-logging teragrep unstructured-data
Last synced: 09 Apr 2026
https://github.com/fatihilhan42/olympics-data-analysis-with-python
An examination of a data analysis of the Olympics from 1896 to 2016, done in Python.
data data-science dataanalysis datavisualization jupyter-notebook olympics python
Last synced: 30 Apr 2026
https://github.com/anct-cartographie-nationale/mednum-cli
✨ Command-line interface for transforming data on digital mediation locations collected in a non-standard format into the mednum schema and publishing it on data.gouv
anct betagouv data donnees gouvernement mediation-numerique nodejs open-data transformation
Last synced: 02 Aug 2025
https://github.com/miguelmedinacastro/trabalho-dados-r
Final project for the Exploratory Data Analysis course
data data-science data-science-projects data-visualization database r rstudio
Last synced: 01 May 2026
https://github.com/beastbytes/postal-code-data-php
Implementation of PostalCodeDataInterface using PHP file storage
Last synced: 27 Feb 2026
https://github.com/vianneymi/amplifai
Amplifai is a package that allows you to transform your raw unstructured text into structured data in a few lines of code.
data data-mining extraction langchain llm pydantic
Last synced: 27 Feb 2026
https://github.com/kirillsemyonkin/lsd
LSD (Less Syntax Data) configuration/data transfer format.
configuration data java parsing rust
Last synced: 27 Feb 2026
https://github.com/bhojpur/dlm
The Bhojpur DLM is a software-as-a-service product used for Data Lifecycle Management based on Bhojpur.NET Platform for data delivery.
Last synced: 19 Feb 2026
https://github.com/farhad2415/job_scraper
Job-site-based job scraping with Python
automation bash-script data data-scraping data-structures python selenium selenium-python
Last synced: 05 May 2026
https://github.com/benmizrahi/reactivejs
microservices event bus for async/sync communications
Last synced: 01 May 2026
https://github.com/plurid/datasign
Single Source of Truth Data Contract Specifier
Last synced: 08 Nov 2025
https://github.com/miniql/miniql-csv
A MiniQL query resolver that loads data from CSV files.
comma-separated-values csv data query query-language
Last synced: 08 May 2026
https://github.com/pawamoy/keycut-data
Keyboard shortcuts data stored in YAML files
Last synced: 12 Feb 2026
https://github.com/soenneker/soenneker.dtos.requestdataoptions
A flexible request options object for paging, sorting, and filtering queryable data, similar to OData-style parameters.
controller coordinator csharp data dotnet dto dtos http manager object odata options request requestdataoptions
Last synced: 12 Mar 2026
https://github.com/syedzaheerabbas/jamboree-education-linear-regression
Using data from Jamboree, this project explores the relationship between applicant profiles (GRE, TOEFL, GPA, etc.) and their chances of admission to Ivy League graduate programs. Linear regression, Ridge, and Lasso regression are employed to build predictive models and identify key factors.
data eda linear-regression python visualization
Last synced: 01 May 2026
https://github.com/bishtrishu/super_store_sales_dashboard
This repository contains a comprehensive sales analysis dashboard for a Superstore, created using Power BI. The objective is to contribute to the success of a business by utilizing data analysis techniques, especially time series analysis, to provide valuable insights and accurate sales forecasting.
analytics data data-science dataanalysis dataanalyst datacleaning datascience datavisualization-project excel microsoft-azure microsoft-excel powerbi report sql
Last synced: 28 Feb 2026
https://github.com/ryanga09/digitalent_fundamentaldatascience-selfpractice
A repository of hands-on projects from DigiTalent’s Fundamental Data Science training, covering web scraping, data exploration, data cleaning, and data annotation. Includes Jupyter notebooks and example code for practical learning.
data data-analysis data-science data-visualization dataset digitalent komdigi notebook-jupyter notebooks
Last synced: 02 Aug 2025
https://github.com/acovaci/orbit
ORBIT: an Open source Rust-based implementation of a data Build Tool, inspired by DBT
cargo clap-rs data data-warehouse dbt rust rust-lang tokio-rs
Last synced: 16 Mar 2025
https://github.com/jigyasag18/airline-performance-and-passenger-satisfaction-project-using-big-data-analytics
This project analyzes 10 years of U.S. domestic airline data (~3GB) using Hadoop (Cloudera) and Hive for data processing. Power BI dashboards visualize key metrics like delays, on-time rates, air time, and diversions. The solution includes Hive queries, DAX measures, HDFS ingestion scripts, and year-wise insights with recommendations.
big-data big-data-analytics bigdata cloudera cloudera-hadoop cloudera-hadoop-framework data data-analysis data-visualization database hadoop hive power-bi powerbi powerbi-dashboard powerbi-dashboards powerbi-report powerbi-visuals powerbi-visuals-tools powerbidashboard
Last synced: 01 Aug 2025
https://github.com/ahmed-naserelden/astro-success-analytics
This project analyzes key factors influencing success in the Space Race using data science techniques. It includes data collection, machine learning modeling, and insightful visualizations to predict mission outcomes.
Last synced: 01 May 2026
https://github.com/dnut/associations
Python 3 library to identify high-dimensional statistical relationships in any data set.
analytics arch-linux association-rules data data-analysis data-mining data-science machine-learning python-modules
Last synced: 01 May 2026
https://github.com/robson-python/academic-performance
Project to evaluate students' academic performance.
csv-import data data-analysis data-science jupyter-notebook machine-learning matplotlib pandas python scikit-learn seaborn vscode
Last synced: 12 Apr 2026
https://github.com/skygenesisenterprise/aether-meet
Aether Meet is a lightweight, open-source client built for privacy, speed, and seamless integration within the Aether Office ecosystem
applications data docker javascript meeting nextjs notes typescript voip
Last synced: 01 May 2026
https://github.com/j0a0m4/olympics
Final Project for Data Engineering Accelerated LATAM
Last synced: 13 Feb 2026
https://github.com/ahmad-ali-rafique/linear-regression-modeling
In-depth exploration of linear regression models, including data cleaning, model building, and performance evaluation on various datasets.
artificial-intelligence data dataanalytics linear-models linear-regression model multilinear-regression regression regression-models
Last synced: 19 Apr 2026
https://github.com/chompfoods/sdk-kotlin
Kotlin SDK for the Chomp Food & Recipe Database API. Use our API to get high-quality data on recipes and 875,000+ branded/grocery foods plus raw ingredients.
api branded chomp data database food foods grocery ingredients kotlin nutrition raw recipe-api recipes sdk sdk-kotlin
Last synced: 01 May 2026
https://github.com/furkankarakuz/turkey_earthquake
This project focuses on analyzing and visualizing earthquake data specific to Turkey. It aims to provide insightful visualizations on topics such as earthquake frequency, location, and magnitude using data obtained from Boğaziçi University Kandilli Observatory and Earthquake Research Institute.
api data data-visualization earthquake python python3 request streamlit turkey turkey-earthquake
Last synced: 20 May 2026
https://github.com/fatihemres/fruits
Fruit details app built with SwiftUI, using data, models, animation, and a practical onboarding flow.
animations data models onboarding swift swiftui
Last synced: 01 May 2026
https://github.com/gabrielf7/relogiohd
⌚ Clock with time and date
clock css data horario html javascript relogio relogio-hd relogio-javascript watch
Last synced: 01 May 2026
https://github.com/bastianolea/servel_elecciones_core
Electoral results from Servel (2024)
chile comunas data elecciones genero
Last synced: 01 Aug 2025
https://github.com/thanh-wutan/chess-opening-comparator
Interactive web app using R to visualize and compare chess opening performance and popularity.
chess-openings data databases datavisualisation r
Last synced: 09 May 2026
https://github.com/nel-zi/climainsights
Developed an automated ETL pipeline using Apache Airflow and Python to collect, process, and store weather data from multiple cities via Weatherstack API. Implemented data cleaning, orchestration, and error handling to ensure accuracy and scalability.
airflow apache-spark data data-engineering engineering etl-pipeline
Last synced: 01 May 2026
https://github.com/beastbytes/n6l-phone-number-data-php
NationalPhoneNumerInterface implementation using PHP for storage
data itu-t0202 phone-number php yii3
Last synced: 08 Feb 2026
https://github.com/alexyiann/finance
In this repository you can find scripts for pulling and comparing data, as well as simple Python scripts for automating crypto trades and backtesting trading strategies on both crypto and stocks.
api bots data database finance option option-strategies strategy trading trading-algorithms
Last synced: 03 Jan 2026