data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum
- Last updated: 2026-07-03 00:07:49 UTC
https://github.com/danieljdufour/fast-b64
Quickly Convert between B64 and Binary Strings
b64 base64 base64-decoding base64-encoding binary bits compression data
Last synced: 08 Oct 2025
https://github.com/hafs96/prediction_consommation-de-carburant
The goal of this project is to develop a model that predicts whether a car has high or low fuel consumption based on its technical characteristics.
analysis data data-visualization machine-learning testing training
Last synced: 09 Jun 2026
https://github.com/ishaantek/leetcode-solutions
Collection of my solutions to LeetCode problems in Python.
algorithms coding-challenge coding-interviews competitive-programming data data-structures leetcode leetcode-python python python3
Last synced: 08 Oct 2025
https://github.com/djdhairya/whatsapp-chat-analysis
WhatsApp chat analysis is a multidimensional process that delves into the content, structure, and dynamics of conversations within the platform. It provides valuable insights for personal reflection, organizational decision-making, and improving communication strategies.
data data-science dataanalytics datapreprocessing machine-learning ml
Last synced: 08 Oct 2025
https://github.com/shubhamsoni98/classification-with-random-forest-1
To classify sales into categories (Low, Moderate, High) using Random Forests to inform strategic decisions and optimize marketing strategies.
algorithms anaconda data data-science datacleaning eda jupyter-notebook machine-learning pyhton random-forest scikit-learn visualization
Last synced: 18 Jan 2026
https://github.com/leevilaukka/alkometriikka
Tool to search Alko database and see some fun stats about different beverages
data gh-pages svelte typescript xlsx
Last synced: 18 May 2026
https://github.com/satyam4229/iit-and-nit-college-dataset
The dataset for IITs and NITs typically includes information related to these premier engineering institutions in India, such as their names, locations, rankings, academic programs offered, faculty details, student information, admission process, infrastructure and facilities, placements.
college-data csv data excel iit nit
Last synced: 04 Jan 2026
https://github.com/thibautre/dataipsum
Configurable data generator (with crumbles inside)
algorithm data random-generation
Last synced: 21 Jul 2025
https://github.com/diegoperea20/datos-secuenciales-con-ia
Processing of one-dimensional signals with autoregressive models, 1D convolution, 2D convolution using the spectrogram, and recurrent networks
ai artificial-intelligence convolutional-neural-networks data ia secuential-data spectrogram uao
Last synced: 06 Feb 2026
https://github.com/cburmeister/disc-golf-courses
All the disc golf courses I've played at. Maintained with http://geojson.io/.
Last synced: 21 Jan 2026
https://github.com/justinjjlee/simulation-discrete
Employing data transformations and simulations to answer random questions
analytics data data-science julia python simulation spark
Last synced: 30 Apr 2026
https://github.com/kaijagahm/2023-10-20-stlzoo
Data Carpentry workshop, hosted at the St. Louis Zoo. Beta testing the new ecology data lesson.
data data-science ecology r rstudio
Last synced: 05 Feb 2026
https://github.com/martinius96/meteostanica-odosielacie-scripty
Meteostanica (weather station) - Arduino, ESP8266, ESP32 - sender sketches for presenting data in a web interface.
arduino bme280 bmp280 data dht22 ds18b20 esp32 esp8266 espressif html meteo meteostanica mysel nodemcu php stanica teplota tlak vlhkost webstranka
Last synced: 11 Apr 2026
https://github.com/fuzzt/location-analyzer
The Location Data Analyzer is a Spring Boot application that offers insights on location data, such as counting locations by type, calculating average ratings, and identifying the most reviewed and incomplete entries. It features a simple frontend (HTML, CSS, JavaScript) and is deployed on Render.
analysis api average css data deployment docker fetch-api frontend html javascript location maven ratings render restful-api reviews spring-boot techstack
Last synced: 11 Apr 2026
https://github.com/sushmashreeps/python
This repository showcases a comprehensive Python project, demonstrating expertise in backend development, data analysis, and machine learning. Built with Python 3.x, the project utilizes popular libraries like Django, Flask, NumPy, pandas, and scikit-learn. The project features efficient data processing, robust API integration, and scalable archite
api data data-science dataanalysis datavisualization game gamedeveloment python
Last synced: 12 May 2026
https://github.com/muhammadadilnaeem/student-performance-indicater-end-to-end-data-science-project
This project leverages data science techniques to build a predictive model that estimates a student's exam performance. The project follows a structured data science workflow, including data collection, preprocessing, model building, evaluation, and deployment.
data machine-learning-algorithms pandas pymysql python sql
Last synced: 11 Apr 2026
https://github.com/yash-rewalia/airbnb_eda_pandas
The goal of the project is to gather and analyze detailed information about the different entries in order to provide insights about the host and price of properties in a particular area, according to your preferences, room type, and number of reviews.
data data-cleaning data-insights data-preprocessing data-visualization matplotlib numpy pandas python seaborn
Last synced: 11 Apr 2026
https://github.com/steventhompson6460-stack/octoparse-government-listings-scraper
Octoparse workflow for structured government data
data extraction government listings octoparse public-records python scraper scrapy structured web-crawling workflow
Last synced: 31 May 2026
https://github.com/oniani/miniframe
Minimal data frames with relational algebra
data dataframe-library haskell haskell-library library
Last synced: 04 Mar 2025
https://github.com/wlgs/got-dialogues-data-stats
Game of Thrones dialogues data statistics processed with R and SQLite. Project for Probability and Statistics course 21/22 at AGH UST. The project was about manipulating data and getting many pieces of information from it in addition to visualizing these results.
data game-of-thrones got r statistics stats
Last synced: 22 May 2026
https://github.com/j-sephb-lt-n/joes_giant_toolbox
A large collection of general python functions and classes that I use in my daily work
ascii browser classifier data dataviz gcp mime nlp python regex search statistics supervised web-scraping
Last synced: 10 Oct 2025
https://github.com/amirreza81/kaggle-pandas-course-solutions
Kaggle Pandas Course - exercises solved differently from the sample solutions
data data-analysis data-cleaning data-manipulation data-science dataframe jupyter-notebook kaggle machine-learning open-source pandas
Last synced: 14 Apr 2026
https://github.com/azkarmoulana/winter-of-data-2019
:snowflake: :snowman: Winter of Data is coming..... :wolf:
data data-science machine-learning mathematics
Last synced: 05 Feb 2026
https://github.com/nullmaster7/btk-pythontensorflow-ozet
data data-analysis python tensorflow-examples
Last synced: 19 Jan 2026
https://github.com/thicclatka/tetration
New file format for tensors
cli data fileformat mmap tensors
Last synced: 26 May 2026
https://github.com/gianlucatruda/titanic
An exhibition of my experience in data processing and visualisation. Python script to process and visualise the Titanic survivor data.
data database flask info matplotlib python science scrape server titanic visualisation web
Last synced: 10 Apr 2026
https://github.com/sanchittechnogeek/overscripted-analysis
Geolocation and user language extraction analysis from Mozilla Overscripted dataset
analysis data data-analysis mozilla
Last synced: 23 Mar 2025
https://github.com/open-geodata/sp_bh_pcj-2020-2035
Spatial data from the Agência das Bacias PCJ (PCJ Basins Agency), with information presented in the 2020-2035 Basin Plan
Last synced: 16 Jan 2026
https://github.com/checco9811/data-engineering-bootcamp-homework
Homework solutions for DataExpert.io data engineering bootcamp
apache-spark data data-engineering sql
Last synced: 14 Mar 2025
https://github.com/karashiiro/lodestone-character-data-scraper
Lodestone character data scraper.
data ffxiv ffxiv-character lodestone
Last synced: 23 Apr 2026
https://github.com/dhruvil-26/tableau-projects
This repository contains Tableau visualization projects focused on data analysis across different domains. Projects include: 1. IPL Visualization - Insights into IPL match, team, and player statistics. 2. EV Analysis - Visualizations exploring the adoption of electric vehicles. 3. Road Accident Analysis - Analysis of road accident patterns
analysis data data-analysis data-analytics electric-vehicles ipl road-accident-analysis tableau tableau-public
Last synced: 19 Jan 2026
https://github.com/sharoonjoseph321/insurance_fraud_detection
Fraud detection using the K-Nearest Neighbors machine learning algorithm. Data exploration using PySpark and matplotlib.
analytics data data-science eda high-performance knn-algorithm knn-classification machine-learning matplotlib-pyplot pyspark python seaborn spark statistics
Last synced: 23 Mar 2025
https://github.com/maluscat/reactive-storage
[MIRROR] Register, observe and intercept deeply reactive data without the need for proxies
data javascript reactive typescript
Last synced: 10 Mar 2026
https://github.com/reshmaaiman/fifa
FIFA20
data data-science data-visualization dataanalysisusingpython github jupyter-notebook matplotlib numpy pandas python seaborn-python
Last synced: 10 Apr 2026
https://github.com/vidupriya/aws-glue--data-copy
A function for copying data such as CSV, Parquet, and Avro files from a source S3 bucket to a destination S3 bucket using AWS Glue. It includes the necessary setup for the Glue job, logging, reading data from the source bucket, and writing it to the destination bucket.
aws awsglue awss3 data data-copying glue glue-job pyspark python3 s3 s3-bucket s3-buckets s3-storage spark
Last synced: 02 May 2026
https://github.com/team810/frcs
FRCS is an online, international, crowd-sourced data collection tool written for the FRC Competitions. It was created by Team 810, The Mechanical Bulls.
Last synced: 14 Mar 2025
https://github.com/greatwoman23/car_insurance_analysis
The Car Insurance Analysis project aims to provide a comprehensive examination of a car insurance portfolio using advanced data analytics tools. The analysis offers valuable insights into policy demographics, claims patterns, and financial metrics, helping stakeholders make informed decisions.
bigquery data data-science dataanalytics insurance-claims looker-studio tableau
Last synced: 03 Feb 2026
https://github.com/thanhleviet/vietnam_antibiotics_bidding
This repo contains bidding data for multiple drugs and antibiotics reported to the Vietnam Ministry of Health in 2015, 2016, and 2017.
Last synced: 23 Feb 2026
https://github.com/fatihemres/Africa
Africa app by SwiftUI. Using AVFoundation, MapKit, data, models, animations, stickers.
animations avfoundation data mapkit models swift swift-animations swiftui
Last synced: 31 Aug 2025
https://github.com/amethyst-php/notification
amethyst amethyst-package api data laravel notification
Last synced: 15 May 2026
https://github.com/itrauco/streaming-data-platform
skeleton streaming data platform on gcp...
big-data data data-engineering data-infrastructure data-science engineering google-cloud platform-engineering python streaming-data
Last synced: 13 Jun 2026
https://github.com/viniddev/active_finance
In this project I set out to solve a common problem: the difficulty of staying up to date on movements in the stock and real estate fund markets. I used Selenium WebDriver to fetch information and a Telegram API to send reports to the user.
automation data data-analisis rpa selenium-webdriver telegram-bot
Last synced: 03 May 2026
https://github.com/ckongala/data-warehouse-concepts
Data Warehouse Basics
data data-engineering data-warehouse data-warehouse-architecture data-warehouse-construction data-warehousing
Last synced: 13 Oct 2025
https://github.com/justinyahin/wpdf
Create, filter, sort, and display user data on your WordPress site.
Last synced: 18 Apr 2026
https://github.com/priyapuranik/data-analytics-using_python
Analyzed hotel data to find meaningful insights, including booking patterns, seasonal trends, and more.
data pandas python sql visualization
Last synced: 06 Apr 2026
https://github.com/als8446/tripleten-data-science-projects
Projects Overview: projects made in the Data Scientist course from TripleTen LatAm
data data-analysis hypothesis-tests machine matplotlib numpy pandas python scipy sklearn
Last synced: 10 Apr 2026
https://github.com/tyriek-cloud/nyc-dca-etl
Created an ETL pipeline to merge two CSV files (converted to JSON) into a Parquet file using Azure Data Factory. The data was extracted from NYC Open Data (https://opendata.cityofnewyork.us/), and I created a Blob Container within an existing storage account.
azure azure-data-factory blob-storage data data-engineering etl-pipeline
Last synced: 21 Jan 2026
https://github.com/carlosrs14/parallel-data-preprocessig-system
A parallel data preprocessing system using threads and synchronization mechanisms (barrier, busy-waiting, condition variables) to clean and prepare data for AI training.
barrier-method c condition-variable data operative-systems parallel-computing posix preprocessing synchronization threads
Last synced: 24 Jul 2025
https://github.com/snimmagadda1/luigi-etl-example
🔍 Example of an ETL pipeline using Spotify's Luigi
data luigi luigi-pipeline python spotify
Last synced: 30 Mar 2025
https://github.com/anyantudre/associate-data-scientist-track
Materials for the Associate Data Scientist in Python track on DataCamp.
data data-science experimental-design hypothesis-testing machine-learning matplotlib-pyplot pandas python regression sampling seaborn statistics statsmodels unsupervised-learning
Last synced: 03 May 2026
https://github.com/shubhammittal-data/hr_dashboard_tableau
An interactive HR Analytics Dashboard built using Tableau. Provides insights into workforce demographics, hiring trends, salary analysis, and employee records for data-driven decision-making.
chatgpt4 data data-analysis data-visualization drawio-tools faker-generator hr-analytics hr-analytics-dashboard human-resources numpy python tableau tableau-public
Last synced: 17 May 2026
https://github.com/koppalexander/flightdelaychallenge
This project focuses on predicting flight delays using historical data from a Tunisian airline. We analyzed patterns in airport operations and flight schedules to build a machine learning model that can forecast potential delays.
data data-science machine-learning machine-learning-algorithms machinelearning prediction predictive-modeling
Last synced: 19 Jun 2026
https://github.com/hadarsharon/grizzlys
User-friendly Python DataFrames 🔵🟡 powered by Julia 🔴🟢🟣
big-data data data-analysis data-engineering data-frame data-frames data-science dataframe dataframe-library dataframes dataframes-jl julia python
Last synced: 18 May 2026
https://github.com/sungchun12/demotron
CLI to delight real people with live demos
Last synced: 26 Feb 2025
https://github.com/asacxyz/flutter_aplicando_persistencia_de_dados
For following along with the course "Flutter: aplicando persistência de dados" (Flutter: applying data persistence)
dart data data-storage flutter persistence persistent-storage sqflite sql sqlite
Last synced: 03 May 2026
https://github.com/ate47/playerdata
Get data about a player with a command
bukkit-plugin command data spigot-plugin
Last synced: 30 Aug 2025
https://github.com/flowsta/ods-educacion-aporta
SDGs (ODS) for education, APORTA 2021 initiative
data data-visualization ods sdg
Last synced: 27 Jan 2026
https://github.com/amethyst-php/courier
amethyst amethyst-package api courier data laravel
Last synced: 17 May 2026
https://github.com/joshuadeguzman/xcraper
Python-based stock exchange data scraper
data pandas python stock-market
Last synced: 18 May 2026
https://github.com/schoolsquirrel/holiday-data
Automatically updated holiday data for SchoolSquirrel
data holidays schoolsquirrel scripts vacation
Last synced: 03 Oct 2025
https://github.com/n4en/python-for-data-engineers
Python for data engineers
data data-engineer data-engineering dataengineering python python-notebooks python3 tutorial
Last synced: 26 Aug 2025
https://github.com/digital-media/cv_data
Datasets used for courses/tutorials at the Digital Media Department
computer-vision data image-processing images
Last synced: 14 Oct 2025
https://github.com/isandyawan/simplelinearregression
An application to analyze data using simple linear regression. It can build a regression model from variables and advise the user if the model breaks regression assumptions.
data linear r regression rstudio shiny statistic
Last synced: 14 Oct 2025
https://github.com/ssiarhei115/shop-customers-segmentation
Shop customers segmentation
data data-analysis data-science data-visualization
Last synced: 24 Aug 2025
https://github.com/europanite/gundam-forest
Random Forest data analysis of the killed-in-action rate for personnel in the GUNDAM world, as with the Titanic dataset.
data data-analysis data-science data-visualization death-rate death-rates gundam-model gundam-series gundom-forest jupyter jupyter-notebook kia kill-in-action python random-forest titanic titanic-kaggle titanic-survival titanic-survival-prediction
Last synced: 29 Jun 2026
https://github.com/yash-chauhan-dev/spark_cluster_docker
Set up a local Spark cluster, Hadoop (HDFS), Airflow, and PostgreSQL on Docker with ease, without any local installations
apache-spark data data-engineering data-engineering-pipeline deployment docker docker-compose hadoop hdfs local-development localhost pyspark python
Last synced: 04 May 2026
https://github.com/ahmad-ali-rafique/wine-quality-dataset
Comprehensive analysis and modeling of the Wine Quality dataset, including exploratory data analysis (EDA), data preprocessing, model training, and performance evaluation using MSE and RMSE.
analytics data datacleaning decision-tree-regression exploratory-data-analysis gradient-boosting-regressor linear-regression machine-learning mean-square-error model
Last synced: 21 Aug 2025
https://github.com/fallaciousreasoning/nz-mountains
A list of mountains in NZ, scraped from https://climbnz.org.nz
alpine climbing climbnz data json json-api maps mountaineering scraping
Last synced: 04 May 2026
https://github.com/srking501/uk-groceries-images
Repository Containing UK Groceries Images
data groceries grocery images links playwright playwright-python webscraping-data webscrapper
Last synced: 04 May 2026
https://github.com/karolkrupa/javascript-orm-mapper
ORM mapping library. Especially for Rest API
api data data-mapper entity es6 javascript mapper model mongo mysql node nuxt orm relational rest typescript vue vuex
Last synced: 10 Apr 2026
https://github.com/e-kotov/albofr
alboFr: Get French Data on Tiger Mosquito Colonisation
aedes-albopictus data france tiger-mosquito
Last synced: 11 Jun 2026
https://github.com/rachelresende/projeto-finan-as
This repository relates to a course on data analysis for finance that I took on Udemy in 2025.
analytics data financas finance finance-management
Last synced: 19 Aug 2025
https://github.com/dimitryzub/russo-ukraine-war-prediction-losses
Highlights Russian losses with predictions based on historical data from the Ministry of Defence of Ukraine 🐱👤
data dataanalysis dataanalytics matplotlib pandas prophet python
Last synced: 04 May 2026
https://github.com/jdanielgoh/cobertura-campanias
In a democracy, is there room for every voice? A project to visualize the INE's radio and TV monitoring of the 2024 presidential candidacies
d3js data datavisualization vue
Last synced: 09 Jun 2026
https://github.com/davorg/towerbridge
When is Tower Bridge lifting?
data hacktoberfest london perl web-scraping
Last synced: 29 Jun 2026
https://github.com/yagoluiz/enem-analise-extracao
[PT-BR] Extraction and analysis of performance data for the Centro-Oeste (Central-West) region
analysis data extraction python3 r
Last synced: 17 Apr 2026
https://github.com/h4fide/politicalcompassbot
This Python project allows you to take a quiz and find out where you fit on the political compass. Give it a try and see where you stand!
bot data greedy-algorithms politics python python3 sql telegram
Last synced: 19 Aug 2025
https://github.com/KarajMiglani-DataScientist/karajmiglaniFAKE-NEWS-DETECTION
FAKE_NEWS_PREDICTION
algorithms data data-science flask machine-learning probability-statistics python statistics structure
Last synced: 19 Aug 2025
https://github.com/bdr-pro/streamlint
Ultra-cool Streamlit app where you can interact with widgets, see data in action, and even upload and download files
Last synced: 14 Apr 2026
https://github.com/octoenergy/tentaclio-s3
A Python project containing all the dependencies for the s3 tentaclio schema.
Last synced: 24 Jun 2025
https://github.com/hakusaro/facts
A fact-based knowledge system (FBKS) experiment.
Last synced: 03 Jan 2026
https://github.com/vedikasnehil/my-data-science-projects
This repository is a comprehensive collection of resources and implementations dedicated to the field of Data Science. It serves as a platform for exploring various aspects of data science, ranging from data preprocessing and exploratory data analysis (EDA) to machine learning and deep learning.
data data-science deep-learning machine-learning matplotlib numpy python sql visualization
Last synced: 10 Apr 2026
https://github.com/ronknight/user-data-dashboard
📈 A data visualization tool for analyzing user data using an Excel-based data source.
dashboard data excel ga4 screenshot
Last synced: 17 Oct 2025
https://github.com/stefen-taime/mako-main
Declarative real-time data pipelines Framework. YAML in, events out.
data datapipeline declarative-config declarative-pipeline declarative-programming declarative-workflows framework open-source
Last synced: 26 Jun 2026
https://github.com/rijkvanzanten/ds-fa-1
The first final assignment for the data structures class
assignment data final map now parsons structures thenewschool
Last synced: 04 Oct 2025
https://github.com/jleung51/foundations-dags
Data ETL pipeline to clean, process, and aggregate data from Canadian housing starts.
data data-engineering etl extract housing load pipeline transform
Last synced: 04 Oct 2025
https://github.com/amethyst-php/account
account amethyst amethyst-package api data laravel
Last synced: 18 May 2026