data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum,
- Last updated: 2026-07-01 00:07:35 UTC
- JSON Representation
https://github.com/anct-cartographie-nationale/mednum-cli
✨ Interface en ligne de commande pour la transformation des données de lieux de médiation numériques collectées dans un format non standard vers le schéma de la mednum et leur publication sur data.gouv
anct betagouv data donnees gouvernement mediation-numerique nodejs open-data transformation
Last synced: 02 Aug 2025
https://github.com/plurid/defocus
Apophatic User Content Resolution [Desearch Concept]
Last synced: 08 Nov 2025
https://github.com/musamairshad/dsa-python
This repository contains all the material related to Data Structures and Algorithms implemented in Python.
algorithms data datastructures efficiency python searching-algorithms sorting-algorithms
Last synced: 25 Mar 2025
https://github.com/bhojpur/dlm
The Bhojpur DLM is a software-as-a-service product used for Data Lifecycle Management based on Bhojpur.NET Platform for data delivery.
Last synced: 19 Feb 2026
https://github.com/luminati-io/jupyter-notebooks-web-scraping
Perform web scraping interactively using Jupyter Notebooks, integrating coding, data analysis, and visualization into one seamless workflow.
beautifulsoup4 data jupyter jupyter-notebook pandas python requests seaborn virtual-environment web-scraper web-scraping
Last synced: 13 Apr 2026
https://github.com/zeptosec/bpscrapper
Shows history of oil prices
data data-visualization database nodejs scraper
Last synced: 13 Apr 2026
https://github.com/plurid/datasign
Single Source of Truth Data Contract Specifier
Last synced: 08 Nov 2025
https://github.com/prishabhanot/facial_recognition_pca
A face recognition system using Principal Component Analysis (PCA) for dimensionality reduction and a Support Vector Machine (SVM) classifier for classification. PCA extracts essential features (eigenfaces) from facial images, significantly reducing computational complexity while retaining critical information for accurate recognition.
data eigenfaces facial-recognition pca python reducing-computational-complexity reducing-data-dimensions svm-classifier
Last synced: 01 Mar 2025
https://github.com/mukhlishga/data-engineering
all about data engineering
airflow beam data data-engineering pyspark python
Last synced: 13 Apr 2026
https://github.com/vatshayan/youtube-user-analysis
Analysis of Youtube Users about their choice and preferences
data data-analysis data-mining data-science data-visualization dataset machine-learning machine-learning-algorithms
Last synced: 05 Feb 2026
https://github.com/nushratjabenaurnima/cse_477_data_mining
A collection of labs, reports, Jupyter notebooks, and project outputs for the CSE 477 Data Mining course. This repository tracks my learning journey through data preprocessing, association rules, clustering, classification, and real-world data analysis with Python.
data data-analysis data-mining data-science google-colab-notebook jupyter-notebook machine-learning python python-3
Last synced: 09 Apr 2026
https://github.com/stdlib-js/ndarray-vector-uint32
Create an unsigned 32-bit integer vector (i.e., a one-dimensional ndarray).
constructor ctor data javascript ndarray node node-js nodejs stdlib structure types uint32 vec vector
Last synced: 25 Apr 2026
https://github.com/edjoukou/human_resources
A data analysis project using MySQL Server database
analysis data mysql powerbi sql visualization
Last synced: 25 Sep 2025
https://github.com/mendel5/wifi
Information about Wi-Fi (wifi, WLAN, wireless LAN)
bitrate data data-transmission ethernet internet latency speed throughput transfer transmission wi-fi wifi wireless wireless-lan wlan
Last synced: 02 Aug 2025
https://github.com/ryanga09/digitalent_fundamentaldatascience-selfpractice
A repository of hands-on projects from DigiTalent’s Fundamental Data Science training, covering web scraping, data exploration, data cleaning, and data annotation. Includes Jupyter notebooks and example code for practical learning.
data data-analysis data-science data-visualization dataset digitalent komdigi notebook-jupyter notebooks
Last synced: 02 Aug 2025
https://github.com/jigyasag18/airline-performance-and-passenger-satisfaction-project-using-big-data-analytics
This project analyzes 10 years of U.S. domestic airline data (~3GB) using Hadoop (Cloudera) and Hive for data processing. Power BI dashboards visualize key metrics like delays, on-time rates, air time, and diversions. The solution includes Hive queries, DAX measures, HDFS ingestion scripts, and year-wise insights with recommendations.
big-data big-data-analytics bigdata cloudera cloudera-hadoop cloudera-hadoop-framework data data-analysis data-visualization database hadoop hive power-bi powerbi powerbi-dashboard powerbi-dashboards powerbi-report powerbi-visuals powerbi-visuals-tools powerbidashboard
Last synced: 01 Aug 2025
https://github.com/jigyasag18/global-terrorism-1970-2017-analysis-using-big-data
This repository explores over 180,000 terrorist incidents across 205 countries using Hadoop and Power BI. The project identifies global and regional patterns in terrorism, analyzes the impact on civilians, and highlights high-risk areas. Key insights include attack trends,weapon usage,top terror groups,& country-specific risks like those in India.
big-data big-data-analytics data data-analysis data-visualization dataanalytics dataset hadoop hive hive-database hive-db hivedb power-bi powerbi powerbi-dashboards powerbi-desktop powerbi-report powerbi-report-validation powerbi-visuals powerbidashboard
Last synced: 19 Feb 2026
https://github.com/jigyasag18/ai-ml-salaries-and-ai-tools-usage-trends
This repository presents an in-depth Power BI analytics report on the AI job market trends and student AI tool usage from 2020 to 2025. It combines structured datasets (job postings, salaries, surveys) with custom DAX measures to uncover key patterns in salaries, remote work, industry demand, and student engagement. 5 interaractive dashboards made.
analysis data data-analysis data-visualization dataanalysis dataanalytics dataset datavisualization power-bi powerbi powerbi-dashboards powerbi-desktop powerbi-report powerbi-visuals powerbidashboard visualization
Last synced: 16 Feb 2026
https://github.com/deliprofesor/breast-cancer-detection-using-svm-with-smote-and-model-optimization
This project analyzes health and lifestyle factors influencing heart attack risk using statistical methods and machine learning, with Ridge Regression identified as the best predictive model.
classification data data-preprocessing data-science data-visualization gridsearchcv machine-learning python roc-curve smote svm
Last synced: 10 Apr 2025
https://github.com/luminati-io/Google-Maps-dataset-samples
A sample dataset of over 1000 Google Maps businesses, extracted using the Bright Data API, ideal for competitor analysis, location-based marketing, and market strategies.
api data dataset google-maps maps web-scraping
Last synced: 09 Apr 2025
https://github.com/luminati-io/ZoomInfo-dataset-samples
A sample dataset of over 1000 ZoomInfo companies, extracted using the Bright Data API, ideal for market growth, lead generation, and market analysis.
b2b business companies data data-extraction database dataset datasets web-scraping zoominfo
Last synced: 09 Apr 2025
https://github.com/luminati-io/LinkedIn-dataset-samples
Sample dataset of 1001 LinkedIn companies, extracted via Bright Data API, featuring essential data points for competitive analysis and market insights.
data database dataset linkedin linkedin-api linkedin-data linkedin-dataset linkedin-scraper sample web-scraping
Last synced: 09 Apr 2025
https://github.com/lemaitre4523/old-tiktok-data-report-explorer
An explorer for tiktok data report
data explorer extract package report simple tdre tiktok tiktok-data-explorer
Last synced: 25 Sep 2025
https://github.com/braiso-22/ejercicio-seguro-medico
Ejercicio de acercamiento a los datos para hacer predicciones
data data-science dataset ia insurance jupyter-notebook ml python python3
Last synced: 24 Apr 2026
https://github.com/desoga10/nety-form
In this tutorial, I show you how to send data from a form to the Netlify dashboard. I also show you how to create a form using Materialize.
contact-form css css3 data form forms html html5 materialize materialize-css materializecss-framework netlify
Last synced: 03 Jan 2026
https://github.com/deliprofesor/virtual-reality-in-education-impact-analysis-and-insights
This project examines the impact of Virtual Reality (VR) on education, focusing on its effects on student engagement, learning outcomes, and creativity. It uses data analysis techniques like descriptive statistics, correlation analysis, and clustering to assess VR's effectiveness in enhancing learning.
clustering data data-analysis data-science data-visualization exploratory-data-analysis hypothesis-testing machine-learning python regression-analysis virtual-reality
Last synced: 14 Jun 2025
https://github.com/canadaluke888/speedtable
Ultra-fast terminal table renderer written in C
c data datasets fast python python-wrapper python3 tables
Last synced: 01 Mar 2026
https://github.com/aaronspindler/selfdrivingcar
Learning deep learning and making a self driving car in the process
car data deep deep-learning driving keras learning machine machine-learning python self self-driving-car
Last synced: 09 Apr 2026
https://github.com/mukul273/spring-data-rest-jpa-demo
Spring Data Rest JPA Demo
data jpa rest spring spring-boot spring-mvc
Last synced: 20 Apr 2026
https://github.com/otoneko1102/roulette-base
ルーレットの色と番号をjson形式でまとめたものです。カジノ風ルーレットを作るときにどうぞ。A collection of roulette colors and numbers in json format. Use it when making a casino-style roulette.
casino casino-games data json require roulette
Last synced: 16 Mar 2025
https://github.com/abhishekn1947/samgov-scraper
Automated Python scraper for sam.gov contracts
analytics automation aws data pandas postgresql rds selenium webscraper
Last synced: 09 Apr 2026
https://github.com/vishwas-chakilam/twitter-sentiment-analysis
Twitter Sentiment Analysis is a Python project that analyzes the sentiment of tweets based on a user-defined keyword. It uses Tweepy to fetch tweets from the Twitter API and TextBlob for sentiment analysis. The application features a user-friendly GUI with Tkinter, displaying tweet sentiment as positive, negative, or neutral.
api data data-science dataanalysis python3 textblob-sentiment-analysis tkinter tweepy-api
Last synced: 11 Mar 2025
https://github.com/kayahr/datastream
Data stream classes for writing and reading all kinds of data types, even single bits
data datastream input output stream typescript
Last synced: 01 Aug 2025
https://github.com/creativecuriositystudio/cruddle
(DEPRECATED) Simplifying CRUDL screen development using ModelSafe
angular2 crud data html model typescript ui web
Last synced: 09 Apr 2026
https://github.com/karosi12/ng-data-share
Angular communication with input and output properties
angular communication data data-binding input output sharing typescript
Last synced: 16 Jan 2026
https://github.com/revolutionarybukhari/datawarehouse_meshjoin_superstore
A dataware house is generated for streaming data of a superstore using extended mesh join by Syed Husnain Haider Bukhari
data data-science data-warehousing meshjoin
Last synced: 23 May 2026
https://github.com/bastianolea/servel_elecciones_core
Resultados electorales desde Servel (2024)
chile comunas data elecciones genero
Last synced: 01 Aug 2025
https://github.com/nagipragalathan/linkedin_backup_datas
This repository contains the backup data from my previous LinkedIn account. Unfortunately, my old LinkedIn account was compromised and subsequently blocked by LinkedIn. As a result, I created a new account, but that too got blocked for reasons unknown to me.
backup blocked data linkedin linkedin-account memory nagipragalathan recovery storage
Last synced: 18 Jan 2026
https://github.com/idhruvs/angular4-smart-table-demo
Angular4 Smart Table Demo Project
angular4 data tables typescript
Last synced: 21 Apr 2026
https://github.com/cunfuu/network-bubbles
For Easier to manage organizations and keeping notes about them to organize events and easy access their needs
data data-visualization organizations organizations-volunteer
Last synced: 31 Jul 2025
https://github.com/brayflex/spy-sector-rotation-google-sheet
Creates a dynamic spreadsheet to visualize SPY and it's 11 largest sector ETFs. See market trends and identify potential sector rotation opportunities.
data etf google-sheets index price rotation script sector spreadsheet spy stock-market
Last synced: 29 Jun 2026
https://github.com/coderixc/rforai
Learn R Programming Language for Statistics & Data Science
artificial-neural-networks data data-science deep-neural-networks machine-learning probability quant-analyst r science
Last synced: 09 Oct 2025
https://github.com/psyteachr/sdg-data
Data relevant to the UN Sustainable Development Goals
Last synced: 09 Oct 2025
https://github.com/kaijagahm/2023-10-20-stlzoo
Data Carpentry workshop, hosted at the St. Louis Zoo. Beta testing the new ecology data lesson.
data data-science ecology r rstudio
Last synced: 05 Feb 2026
https://github.com/steventhompson6460-stack/octoparse-government-listings-scraper
Octoparse workflow for structured government data
data extraction government listings octoparse public-records python scraper scrapy structured web-crawling workflow
Last synced: 31 May 2026
https://github.com/sillyash/untappd-viz
A data visualisation page using public datasets and HTML/CSS/JS with D3.js.
beer beer-statistics data data-analysis data-visualization kaggle kaggle-dataset public-dataset school-project
Last synced: 18 May 2026
https://github.com/redgoose-dev/baguni
이미지를 보관하고 탐색하는 웹 프로그램
data explorer file management upload
Last synced: 14 Apr 2026
https://github.com/gianlucatruda/qs-analyser
A quantified self data analysis script in Python 3.
data experiment matplotlib matrix optimization productivity python quantified quantified-self science self
Last synced: 10 Oct 2025
https://github.com/theopenwebjp/theopenweb-data-loader
Package for loading data to local project
data downloader import javascript typings
Last synced: 10 Oct 2025
https://github.com/loggdme/kyro
Collection of utilities and examples for creating efficient data pipelines in go with parallel queues and, rate limitiers and much more.
Last synced: 14 Jan 2026
https://github.com/chowington/bg-counter-tools
A set of tools that can pull data from Biogents BG-Counter smart mosquito traps and convert them into a Darwin Core compliant format.
bg-counter biogents darwin-core data internet-of-things mosquito-prevalence population-dynamics
Last synced: 10 Oct 2025
https://github.com/nullmaster7/btk-pythontensorflow-ozet
data data-analysis python tensorflow-examples
Last synced: 19 Jan 2026
https://github.com/badranalyst/data-professional-survey-breakdown-power-bi-dashboard
This project presents an interactive Power BI dashboard analyzing data professionals' insights. Key focus areas include job satisfaction, challenges in entering the data field, career priorities, demographics, and more. The visualization helps uncover trends and factors impacting data professionals globally.
charts dashboard dashboards data data-cleaning data-visualization dataset dax power-bi powerbi
Last synced: 23 Feb 2026
https://github.com/jatin-mehra119/paris_housing_price-kaggle-
Paris Housing Price Kaggle Competiton
data data-visualization kaggle-competition machine-learning numpy pandas predictive-modeling scikit-learn
Last synced: 29 Apr 2026
https://github.com/writetome51/pagination-page-info
Intended to help a separate Paginator class paginate data. Specifically, this class contains the properties `itemsPerPage` and `totalPages`, which will be used by other classes
batch data javascript paginate pagination typescript
Last synced: 09 May 2026
https://github.com/aldro61/mmit-data
The data used in the Maximum Margin Interval Trees paper
data machine-learning machine-learning-algorithms reproducible-research
Last synced: 19 Feb 2026
https://github.com/ghomashudson/ao3_style_change
Style change detection dataset using AO3 fics
ao3 data dataset datasets fanfiction long-document style-change-detection
Last synced: 11 Oct 2025
https://github.com/dhruvil-26/tableau-projects
This repository contains Tableau visualization projects focused on data analysis across different domains. Projects include: 1. IPL Visualization - Insights into IPL match, Team and player statistics. 2. EV Analysis - Visualizations exploring the adoption of electric vehicles. 3. Road Accident Analysis - Analysis of road accident patterns
analysis data data-analysis data-analytics electric-vehicles ipl road-accident-analysis tableau tableau-public
Last synced: 19 Jan 2026
https://github.com/mr-chang95/udacity-starbucks-challenge
Data Science Project for Udacity's Data Scientist Program. Using Python in Jupyter Notebook.
data data-science data-visualization numpy pandas sklearn
Last synced: 14 Apr 2026
https://github.com/elimu-ai/analytics
📊 Android application which collects, provides and uploads learning event data
csv data data-science dataset edtech egma egra infrastructural learning-analytics
Last synced: 12 Oct 2025
https://github.com/0xnu/nfl-picks
NFL match prediction with scores using historical data (1999-Present).
american-football data nfl prediction
Last synced: 12 Oct 2025
https://github.com/ckongala/data-warehouse-concepts
Data Warehouse Basics
data data-engineering data-warehouse data-warehouse-architecture data-warehouse-construction data-warehousing
Last synced: 13 Oct 2025
https://github.com/drzax/light-up-brisbane
Where, what and why various public places in Brisbane are lit up.
Last synced: 19 Jan 2026
https://github.com/adadalshabab/data-engineering-gcp-project
An end-to-end modern data engineering project, including deployment of ETL pipeline on Google Cloud Platform, using BigQuery for data analysis and leveraging Looker to generate an insight dashboard.
bigquery data data-science data-visualization databases dataengineering-a engineering etl-pipeline looker-studio powerbi
Last synced: 19 Jan 2026
https://github.com/tyriek-cloud/nyc-dca-etl
Created an ETL pipeline to merge two CSV files (converted to JSON) into a parquet file using Azure Data Factory, The data was extracted from NYC Open Data: https://opendata.cityofnewyork.us/ and I created a Blob Container within an existing storage account.
azure azure-data-factory blob-storage data data-engineering etl-pipeline
Last synced: 21 Jan 2026
https://github.com/luminati-io/httpx-web-scraping
Web scraping using HTTPX in Python, covering setup, advanced features, comparisons with Requests, and more.
beautifulsoup data html httpx python web-scraper web-scraping
Last synced: 13 Oct 2025
https://github.com/amethyst-php/courier
amethyst amethyst-package api courier data laravel
Last synced: 17 May 2026
https://github.com/tabarzin/dh
A collection of links to various resources on Digital Humanities
data digitalhumanities opensource
Last synced: 24 Jan 2026
https://github.com/fnu-ankit/8-week-sql-challenge
My attempt on solving Case studies from #8WeeksSQLChallenge
8-week-sql-challenge 8-weeks-sql-challenge 8weeksqlchallenge case-study data data-analysis data-analysis-sql data-analytics database datawithdanny sql sqlserver
Last synced: 19 Apr 2026
https://github.com/odiegosilva1/flask-github-style
Página de login usando Jinja no Flask.
data flask jinja2-templates orm python
Last synced: 31 May 2026
https://github.com/digital-media/cv_data
Datasets used for courses/tutorials at the Digital Media Department
computer-vision data image-processing images
Last synced: 14 Oct 2025
https://github.com/polyee99/kaggle-titanic-data-analytics
Jupiter notebook to predict the outcome of passengers who died or not in the tragical Titanic event.
data eda jupiter-notebook matplotlib numpy pandas python regression-analysis test-train-split visualization
Last synced: 05 Feb 2026
https://github.com/isandyawan/simplelinearregression
A application to analyze data using simple linear regression. This application can make regression model from variable and give advice to user if the model break regression assumsion
data linear r regression rstudio shiny statistic
Last synced: 14 Oct 2025
https://github.com/arush-codes/lgmvip-data-science-task-1
data data-science iris-classification lgmvip virtual-internship
Last synced: 14 Oct 2025
https://github.com/mominurr/fire-gas-leak-detection-system
A real-time fire prevention system integrating IoT sensors and computer vision to trigger evacuations.
ai computer-vision data datascience machine-learning ml python yolo
Last synced: 27 Jan 2026
https://github.com/rafie-b/data-analytics
Activities of Data Analysis.
apache-spark api aws business-analytics data data-analytics data-science database dataframe jupyter-notebook python scikit-learn sql
Last synced: 14 Apr 2026
https://github.com/instagram-automations/scrape-data-from-instagram
scrape data from instagram and automation toolkit
api automation bot data doker instagram nodejs playwright procy scrape selenium toolkit
Last synced: 14 Oct 2025
https://github.com/desininja/food-delivery-realtime-data-analysis
ETL Pipeline in AWS for Real Time Data Analysis
airflow data data-engineering emr-cluster etl kinesis kinesis-strea real-time redshift
Last synced: 15 Oct 2025
https://github.com/science-analyse/clv_model
customer lifetime value prediction
banking banking-applications clv clv-analysis data data-science machine-learning
Last synced: 15 Oct 2025
https://github.com/intersystems-ib/workshop-smart-data-fabric
Learn the main ideas involved in developing a Smart Data Fabric using InterSystems IRIS
analytics data datafabric interoperability smart
Last synced: 14 Apr 2026
https://github.com/yagoluiz/enem-analise-extracao
[PT-BR] Extração e análise de dados do desempenho da região Centro-Oeste
analysis data extraction python3 r
Last synced: 17 Apr 2026
https://github.com/jigyasag18/project-diwali-sales-analysis
This project analyzes retail sales data during the Diwali festival using exploratory data analysis (EDA) to identify buyer demographics and product preferences. The findings reveal that the primary purchasers are married women aged 26-35 from Uttar Pradesh, Maharashtra, and Karnataka, working in IT, Healthcare, and Aviation.
analysis data datapr datapro eda jupyter-notebook python realtimedata
Last synced: 01 Jun 2026