data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum
- Last updated: 2026-07-03 00:07:49 UTC
https://github.com/montanaz0r/suicide-rate-analysis
Testing the significance of the correlation between the suicide rate and the number of psychiatrists and psychologists working in the mental health sector
analysis correlation data data-analysis data-science jupyter-notebook jupyter-notebooks matplotlib numpy pandas psychology python python-3 seaborn statistics suicide-rate
Last synced: 20 Apr 2026
https://github.com/omers/sre-devops-tools
Tools and useful sources for SRE and DevOps
awsome awsome-list data devops monitoring sre tools
Last synced: 20 Apr 2026
https://github.com/crypt596-rubykz/metaai-data-explorer-scraping-tool
MetaAI data explorer tool
api-research automation data explorer html-parsing metaai playwright python rate-limiting scraping
Last synced: 20 Apr 2026
https://github.com/arda-guler/binmotion
Convert ANY data to a video file. Sister project of binGallery.
data data-visualization proof-of-concept video
Last synced: 04 Jun 2026
https://github.com/prashhhant213/data_analysis_and_visualization-_for_streaming_platform
Data analysis and visualization for a streaming platform, providing insights and recommendations to grow its user base.
colab-notebook data datavisualization matplotlib numpy pandas python seaborn
Last synced: 20 Apr 2026
https://github.com/sdspot2034/data-lemur-solutions
Solutions to SQL Problems on DataLemur
competitive-programming data data-analytics data-science database postgresql query sql
Last synced: 20 Apr 2026
https://github.com/petermeissner/suuntor
Data from a Suunto watch extracted by R - !because!
automation data r rstats suunto windows
Last synced: 20 Apr 2026
https://github.com/nxion/sql-data-warehouse-project
Building a modern data warehouse with MS SQL Server, ETL processes, data modeling, and analytics.
data data-analysis data-analytics data-engineering data-lakehouse data-warehouse datalake datascience etl etl-job medallion-architecture ms mssql sql sql-query sql-server
Last synced: 05 Jun 2026
https://github.com/vidya-vijay/vidya-vijay
About me
analytics data data-science machinelearning python r spss sql statistics tableau visualization
Last synced: 21 Apr 2026
https://github.com/fastpix/android-data-kaltura
This SDK enables seamless integration with Kaltura Player, offering advanced video analytics via the FastPix Dashboard
analytics android-sdk data fastpix kaltura kaltura-player metrics sdk video video-metrics
Last synced: 21 Apr 2026
https://github.com/vishwas-chakilam/movies-review-scraping-analysis
A project for collecting, cleaning, and analyzing movie data. Includes scripts for web scraping (deprecated) and using the OMDb API to fetch movie details. Analyze and visualize data with Python and Power BI to uncover insights and trends in movie ratings and genres.
data dataanalysis datacleaning datavisualization matplotlib-python numpy-library pandas python webscraping
Last synced: 21 Apr 2026
https://github.com/amethyst-php/alias
alias amethyst amethyst-libary amethyst-package api data laravel library package
Last synced: 21 Apr 2026
https://github.com/stefen-taime/llm-rag-mtl-public-hospital
This project develops a Retrieve-Augment-Generate (RAG) model to answer questions using public data from Google reviews of hospitals in Montreal
data google-reviews hopital hospital hub ia llm montreal open-source quebec rag
Last synced: 21 Apr 2026
https://github.com/jdenn0514/surveycore
Core Survey Analysis Infrastructure
Last synced: 21 Apr 2026
https://github.com/wittyicon29/kritika-iit-b-2023
Selection task for the summer projects of Kritika IIT-B
data data-analysis data-science
Last synced: 15 Mar 2025
https://github.com/rbcavi/factorio-mod-data
The modpack data for factorio-viewer
data factorio factorio-data factorio-mod-data
Last synced: 23 Apr 2026
https://github.com/syed-nihaal/car-price-prediction-and-performance-analysis
A data science notebook project focused on analyzing car features and building a model for car price prediction.
data data-analysis data-visualization jupyter-notebook python
Last synced: 23 Apr 2026
https://github.com/coryson/osm-mla-finder
Python script to locate institutions employing Medical Laboratory Assistants in Germany, developed for BTZ – Berufliche Bildung Köln GmbH. It uses OpenStreetMap, SerpAPI, and web scraping to find and verify relevant labs, clinics, and diagnostic centers.
beautifulsoup data openstreetmap osm python scraping serpapi webscraping
Last synced: 24 Apr 2026
https://github.com/hruth-vik/sales-analysis-report
SalesScope is a powerful sales analytics dashboard that extracts insights, reveals trends, and drives strategy from raw data.
analytics data powerbi-report powerbi-visuals python
Last synced: 24 Apr 2026
https://github.com/marielachirinosr/cyclistic-data-analytics-project
This project explores user behavior within a fictional bike-sharing system, modeled after Cyclistic, operating in Chicago.
data data-visualization pandas powerbi-report powerbi-visuals python
Last synced: 24 Apr 2026
https://github.com/mehmetkahya0/gallstone_dataset_analysis_project
Gallstone Disease (Gallstone-1) Dataset Analysis (https://archive.ics.uci.edu/dataset/1150/gallstone-1)
analysis analytics data data-analysis data-science data-visualization database graph matplotlib python
Last synced: 25 Apr 2026
https://github.com/rubix982/product-quality-classification
This is an implementation for the CIKM AnalytiCup 2017, on the topic of "Product Title Quality". The goal is to take SKUs and rank their titles' clarity and conciseness. Referenced papers are attached to this repository. The aim is to craft ensemble models that either try to replicate results or find new methods for classification.
data data-analysis information-retrieval jupyter-notebook machine-learning nlp python spacy-nlp
Last synced: 25 Apr 2026
https://github.com/xjwllmsx/hacker-news-engagement
Analyze Hacker News data to reveal which post types and posting hours spark the most discussion, using Python and a reproducible Jupyter notebook.
data data-analysis jupyter python
Last synced: 25 Apr 2026
https://github.com/mlkav/tri-hita-karana
Project Tri Hita Karana - Future Knowledge G20 Bali. DTS Kominfo x Binar Academy.
bali data data-science g20 science
Last synced: 06 Jun 2026
https://github.com/carlos-levi/twitterbots_analise_redesneurais
Project for the AI course: exploratory analysis and application of machine learning techniques to detect automated accounts (bots) on the 𝕏 (Twitter) platform
data machine-learning twitter-bot
Last synced: 06 Jun 2026
https://github.com/marielachirinosr/hotel-data-analysis
Pandas & Matplotlib Learning Analysis. Repository featuring data analysis projects using Pandas and Matplotlib libraries
data data-analysis matplotlib pandas python
Last synced: 25 Apr 2026
https://github.com/anuraganalog/blog
Data Science Blog
anuraganalog blog data science
Last synced: 26 Apr 2026
https://github.com/datannur/datannur
datannur is an open source, lightweight and sovereign data catalog
catalog data data-catalog data-governance data-management dcat dcat-ap dcat-ap-ch metadata open-data open-source public-sector svelte swiss switzerland
Last synced: 07 Jun 2026
https://github.com/sagarkhese40/prediction-with-binomial-logistic-regression
bank data excel logistic-regression python
Last synced: 26 Apr 2026
https://github.com/luminati-io/seleniumbase-with-proxy
SeleniumBase with authenticated proxies to bypass restrictions, enhance web scraping, and manage rotating proxies for better data extraction.
data data-collection proxy-server python residential-proxy selenium seleniumwire web-scraping
Last synced: 27 Apr 2026
https://github.com/ioanzicu/batch_loading_one-to-many_data_model
Unesco Batch Loading One-to-Many Data using Django
Last synced: 27 Apr 2026
https://github.com/amethyst-php/subscription
amethyst amethyst-package api data laravel subscription
Last synced: 27 Apr 2026
https://github.com/gurpreet0022/crop-fertilizers-recommendation-system-using-ml-
This repository is a part of AICTE - Shell Internship on 'Green Skills using AI technologies' Cycle 3.
data datapreprocessing datavisualization jupyter-notebook machine-learning python
Last synced: 27 Apr 2026
https://github.com/schenkd/tweetminer
Data Miner for Twitter Streaming API
data dataminer datamining java twitter twitter-api twitter4j
Last synced: 07 Jun 2026
https://github.com/bhumitbedse/machine-learning-projects
AI, machine learning, deep learning, computer vision, and NLP projects with code
computer-vision data data-science deep-learning machine-learning natural-language-processing python
Last synced: 27 Apr 2026
https://github.com/santiagoenriquega/custom_database
Python-based database library for database management, indexing, transactions, and constraints, showcasing foundational database concepts.
data data-engineering database database-design python
Last synced: 27 Apr 2026
https://github.com/tacticalnuclearraccoon/dataviz_with_js
Sample data visualization as part of a training on JavaScript frameworks for dataviz
d3 data datawrapper echarts javascript visualization
Last synced: 27 Apr 2026
https://github.com/drkane/area-profiles
Produce UK area profiles based on various data sources
dash-plotly data flask statistics uk
Last synced: 27 Apr 2026
https://github.com/mohamedezzeldeenhassanmohamed/data-mining-project
Data mining GUI project to predict laptop prices. I use most ML algorithms here
data data-mining-assignments datamining-algorithms datapreprocessing decision-trees entropy gini k-means-clustering knn-classification laptop-dataset laptop-price-prediction linear-regression logistic-regression ml mlalgotithms naive-bayes-classifier pca python svm-classifier visualization
Last synced: 27 Apr 2026
https://github.com/oguzhanfatihkucuk/data-analytics-project-kafka-spark
The data in this project was collected in a database using Apache Kafka and processed with Apache Spark Streaming. The project aims to create a forecasting model and analyze sales forecasts per customer.
big-data data data-visualization hadoop kafka ml mlpipeline plt pyhton spark
Last synced: 28 Apr 2026
https://github.com/leonardomusini/mbe-growth-nexus-converter
Python tool to convert laboratory text files into NeXus files for Molecular Beam Epitaxy (MBE) data.
data data-engineering nexus python
Last synced: 28 Apr 2026
https://github.com/delonnewman/relational
Relational programming for Ruby
csv csv-import data data-analysis database export json relational relational-algebra relational-database relational-model relational-programming reporting reports ruby yaml
Last synced: 28 Apr 2026
https://github.com/sagarkhese40/python-assginment
Python assignment
assignment data data-science data-visualization python seaborn-plots
Last synced: 28 Apr 2026
https://github.com/priyanshubiswas-tech/e-commerce_data_analysis
Analyzes 9,994 e-commerce transactions to uncover insights on sales trends, customer behavior, profitability, and logistics using EDA and visualization. Identifies top products, customer segments, and shipping efficiencies to optimize marketing, inventory, and operations, making it valuable for retail, finance, and logistics.
data data-analysis data-visualization pandas pandas-dataframe plotly-analytics-projects plotly-express python
Last synced: 28 Apr 2026
https://github.com/entorb/analyze-ha-energy
Analyze Home Assistant Solar Production Data
data home-assistant pandas photovoltaic pv python
Last synced: 08 May 2026
https://github.com/darrendavy12/earthquake-events-and-risks-project---azure-data-pipeline---api-connection-
Earthquake Events and Risks Project - Azure Data Pipeline - API Connection
azure blob-storage cloud cloudstorage data databricks databricks-notebooks databricks-workspace dataengineer dataengineering microsoft python
Last synced: 28 Apr 2026
https://github.com/n-ce/localstorage-data-interchange-manager
Implementation of local storage data interchange using map data structure.
data export import javascript js-maps json localstorage
Last synced: 28 Apr 2026
https://github.com/moderrek/periodic-table
Periodic Table with clickable elements to see details.
chemical chemistry data element elements generator html javascipt javascript json periodic-table pure-javascript table vanilla-html vanilla-javascript
Last synced: 28 Apr 2026
https://github.com/howz1t/ptypes
This package provides useful data types for use in PHP.
badges composer computer-science data data-structures data-types packagist php types
Last synced: 29 Apr 2026
https://github.com/mtalhaofc/nutrition_system
A simple AI-powered web app built using Streamlit that provides personalized weekly meal plans and nutrition recommendations based on user demographics, health goals, and nutritional preferences.
cosine-similarity data data-science food machine-learning model nutrition pandas python streamlit
Last synced: 29 Apr 2026
https://github.com/sn0wfree/factor_table
A universal connector for all kinds of data sources that manages all kinds of data as factor types in one package
connector data database factor
Last synced: 29 Apr 2026
https://github.com/stdlib-js/array-struct-factory
Return a constructor for creating arrays having a fixed-width composite data type.
array composite data factory javascript node node-js nodejs stdlib struct structure typed typed-array types
Last synced: 29 Apr 2026
https://github.com/barkintopcu/apple-stock-prediction-edu
The purpose of this project is to demonstrate time series analysis techniques using real-world stock data, without offering any form of financial advice or investment suggestion.
data deep-learning forecasting machine-learning python
Last synced: 29 Apr 2026
https://github.com/mr-dhan/eda-sales-customer-transactions
In the competitive world of retail business, a deep understanding of customer behavior is an important foundation for strategic decision-making. However, customer transaction data is often large and complex, so it requires an effective analysis process to uncover valuable insights.
dashboard data data-analysis data-analysis-python data-science data-visualization eda python
Last synced: 29 Apr 2026
https://github.com/chandansoren/financial-budget-analysis
Financial budget for 2021
Last synced: 29 Apr 2026
https://github.com/koltyakov/pgcopy
🐘 PostgreSQL data migration tool
cli data database golang migration postgresql sync
Last synced: 29 Apr 2026
https://github.com/diegoperea20/pytorch-vs-tensorflow
Testing the differences between the PyTorch and TensorFlow libraries across prediction and classification applications; each performs better depending on the problem or data set it is given.
classification data images prediction pytorch tensorflow
Last synced: 29 Apr 2026
https://github.com/tazeenrashid/orders-analysis-using-python-sql-server-and-tableau
I sourced some Orders data through Kaggle; did EDA using Python and then fetched some insights out of cleaned data using SQL Server (SSMS). Then, I built a Tableau Dashboard for some visual insights. Have a look and share your feedback!
analytics data eda jupyter-notebook python sql tableau
Last synced: 29 Apr 2026
https://github.com/istinnew/eniac_ab_insight
Dive into a comprehensive analysis aimed at boosting iPhone 13 sales by optimizing the Click-Through Rate (CTR) of the “SHOP NOW” button. Compare different button designs and determine the most effective strategy for increasing engagement.
ab-testing data data-analysis data-engineering data-science data-visualization google googlecolab libraries python testing testing-tools visual-studio-code
Last synced: 29 Apr 2026
https://github.com/smokingplaya/gm_datastorages
💖 Data Storages like in JavaScript.
Last synced: 29 Apr 2026
https://github.com/ipstack/wizard
Wizard for creating ipstack databases
composer data geo geoip id-database info ip ipstack ipstack-wizard php wizard
Last synced: 29 Apr 2026
https://github.com/devcsrj/docparsr-jvm
JVM client for https://github.com/axa-group/Parsr
data document extraction nlp ocr pdf
Last synced: 08 Jun 2026
https://github.com/wireservice/workbench-lookup
A port of `agate-lookup` to Workbench
data journalism lookup workbench
Last synced: 08 Jun 2026
https://github.com/badranalyst/covid-deaths-and-vaccinations-sql-data-exploration
This project involves exploratory data analysis on COVID-19 deaths and vaccinations data using SQL. It aims to uncover trends, patterns, and insights related to vaccination rates and their impact on mortality. The analysis provides a clearer understanding of the pandemic's dynamics, facilitating data-driven decisions in public health.
covid-19 data data-exploration dataset sql
Last synced: 19 Feb 2026
https://github.com/gvatsal60/ds-on-kaggle
A collection of data science projects, experiments, and insights from Kaggle competitions and datasets
data data-science data-visualization numpy pandas python3
Last synced: 29 Apr 2026
https://github.com/patrickdavies100/pipeline38
An application to automate the creation and execution of SQL queries.
data pandas-dataframe pipeline postgresql psycopg2 sqlalchemy
Last synced: 30 Apr 2026
https://github.com/abhinav330/instagram-influencers-analysis
This Jupyter Notebook focuses on preprocessing and visualizing data from an Instagram profiles dataset. It includes data loading, inspection, visualization, and some data preprocessing steps.
data data-science data-visualization exploratory-data-analysis exploratory-data-visualizations influncer-products instagram scikit-learn sklearn
Last synced: 08 Jun 2026
https://github.com/samiksha29-patil/hr-employee-data-analysis-visualization-in-python
This project focuses on analyzing an HR Employee Dataset that contains details about employees such as demographics, job status, salaries, performance reviews, satisfaction levels, and attrition reasons.
csv-files data data-visualization dataanalysis matplotlib numpy pandas python seaborn
Last synced: 30 Apr 2026
https://github.com/omarsaad21/it-salary-eda
A Python EDA project on IT department salary data: we explored the data and built visualizations to answer some questions about the dataset
data explotary-data-analysis juypter-notebook numpy pandas python visualization
Last synced: 30 Apr 2026
https://github.com/onekiloparsec/arcsecond-swift
The Swift client for interacting with the server-side RESTful resources of arcsecond.io.
arcsecond astro-library astronomy data django swift swift-3
Last synced: 30 Apr 2026
https://github.com/mmaithani/kaggle-projects
Collection of all the resources from the competition, kernel, and data sections, plus all the magic code I have been using to get the most out of a problem
computer-vision data data-science image-processing machine-learning python
Last synced: 30 Apr 2026
https://github.com/raphcodec/rand-org-generator
Rand-Org-Generator attempts to mimic real company structures. The dummy data generated by this project is intended to be used in analytics projects or web projects.
data duckdb factory-boy faker org-chart polars python3
Last synced: 30 Apr 2026
https://github.com/lugolbis/data-immo
End-to-end ETL pipeline
data data-engineering dbt dremio duckdb etl-pipeline lakehouse rust
Last synced: 08 Jun 2026
https://github.com/miguelmedinacastro/trabalho-dados-r
Final project for the Exploratory Data Analysis course
data data-science data-science-projects data-visualization database r rstudio
Last synced: 01 May 2026
https://github.com/dnut/json-match-finder
Python application used to match listings against openings via authenticated JSON API access.
data data-structures data-wrangling database json-api python-application python-modules
Last synced: 01 May 2026
https://github.com/benmizrahi/reactivejs
microservices event bus for async/sync communications
Last synced: 01 May 2026
https://github.com/lut-ful/ibm-capstone-project-stack-overflow-job-survey
IBM Data Analyst Professional Certificate program final project.
cognos data data-analytics looker power-bi python sql statics
Last synced: 01 May 2026
https://github.com/dnut/associations
Python 3 library to identify high-dimensional statistical relationships in any data set.
analytics arch-linux association-rules data data-analysis data-mining data-science machine-learning python-modules
Last synced: 01 May 2026
https://github.com/skygenesisenterprise/aether-meet
Aether Meet is a lightweight, open-source client built for privacy, speed, and seamless integration within the Aether Office ecosystem
applications data docker javascript meeting nextjs notes typescript voip
Last synced: 01 May 2026
https://github.com/chompfoods/sdk-kotlin
Kotlin SDK for the Chomp Food & Recipe Database API. Use our API to get high-quality data on recipes and 875,000+ branded/grocery foods plus raw ingredients.
api branded chomp data database food foods grocery ingredients kotlin nutrition raw recipe-api recipes sdk sdk-kotlin
Last synced: 01 May 2026
https://github.com/gabrielf7/relogiohd
:watch: Clock with Time and Date
clock css data horario html javascript relogio relogio-hd relogio-javascript watch
Last synced: 01 May 2026
https://github.com/nel-zi/climainsights
Developed an automated ETL pipeline using Apache Airflow and Python to collect, process, and store weather data from multiple cities via Weatherstack API. Implemented data cleaning, orchestration, and error handling to ensure accuracy and scalability.
airflow apache-spark data data-engineering engineering etl-pipeline
Last synced: 01 May 2026
https://github.com/sorairolake/japanese-era-dataset
Dataset of Japanese era names (元号)
data dataset date japanese-calendar japanese-era json toml wareki yaml
Last synced: 01 May 2026
https://github.com/thedevreda/jadaerospace
A real-life project showing how to improve aircraft parts sales and help sellers focus on the most effective products at JadAero
data data-analysis data-cleaning data-visualization jupyter-notebook powerbi python
Last synced: 02 Aug 2025
https://github.com/gcoronelc/cepsuni-disbd-64505
Database Modeling Workshop with Gustavo Coronel
data database databases db2 db2-database modeling oracle oracle-database relational-database relational-database-design relational-databases relationships sql sql-server
Last synced: 02 May 2026
https://github.com/waseemofficial/ml-practice
ML Practice
data data-analysis jupyter-notebook machine-learning ml python
Last synced: 02 May 2026