data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum,
- Last updated: 2026-06-30 00:07:50 UTC
- JSON Representation
https://github.com/denisecase/620-mod6-web-scraping
Notes on how to get started scraping content from the web
beautifulsoup4 data mining python
Last synced: 11 Apr 2025
https://github.com/codegouvfr/codegouvfr-sources
🧢 Static web frontend for code.gouv.fr
bluehats codegouvfr data frontend
Last synced: 28 Feb 2025
https://github.com/eva-kaushik/data-clustering
Clustering Accelerators for hard and soft clustering, including implementations of K-means, K-medoids, hierarchical clustering, fuzzy C-means, and Gaussian mixture models. Demonstrates text clustering using both hard and soft clustering algorithms.
clustering clustering-algorithm data datascience machine-learning-algorithms
Last synced: 09 Apr 2025
https://github.com/piyushkumar2025/india-general-elections-2024_data-analyst
Analyzed election data for 540+ constituencies and 100+ parties using SQL. Calculated state-wise seat distributions, classified 30+ parties into alliances, identified top 10 candidates by EVM votes, calculated victory margins, and analyzed voting patterns for 300+ candidates to uncover key insights.
analytics data database mysql sql statistics
Last synced: 22 May 2026
https://github.com/saikatharryc/motionchart-d3js
A dynamic Motion chart Built with D3 js.
Last synced: 23 Dec 2025
https://github.com/emanoelcampos/power-bi-fundamentals
Datacamp's Power BI Fundamentals Skill Track
data data-analyst data-analyst-power-bi datacamp power-bi powerbi
Last synced: 24 Jan 2026
https://github.com/aminnairi/node-decode
Check that your data meet your expectations
check data decode expectations schema
Last synced: 22 Apr 2026
https://github.com/tdjsnelling/hermes
Hermes is a real-time data framework for React + MongoDB
data docker framework mongodb nodejs react react-hooks reactjs real-time typescript websocket
Last synced: 12 Apr 2026
https://github.com/raghavendranhp/youtube_data_harvesting
The "YouTube Data Analyzer" is a versatile tool for businesses and content creators, enabling them to gather, analyze, and harness valuable insights from multiple YouTube channels. With streamlined data collection, storage in MongoDB, migration to SQL, and a user-friendly Streamlit interface, it empowers users to make data-driven decisions
apiintegration data datacollection eda googleapi googleapiclient matplotlib mongodb mysql mysqlconnector numpy oops pandas pymongo python pythonoops sql sqlalchemy streamlit youtube-api
Last synced: 13 Apr 2026
https://github.com/canadaluke888/ttb2
TerminalTableBuilder 2
c17 csv data database datasets datautils json ncurses ods spreadsheet sqlite3 tables terminal terminaltablebuilder terminaltablebuilder2 ttb ttb2 ttbx xlsx
Last synced: 10 Apr 2026
https://github.com/bastianolea/servel_elecciones_core
Resultados electorales desde Servel (2024)
chile comunas data elecciones genero
Last synced: 01 Aug 2025
https://github.com/semcod/code2llm
Python Code Flow Analysis Tool - Static analysis for control flow graphs (CFG), data flow graphs (DFG), and call graph extraction
ast cfg code code2data code2logic code2process data dfg diagram flow graphs llm
Last synced: 01 Jun 2026
https://github.com/ahmad-ali-rafique/wine-quality-dataset
Comprehensive analysis and modeling of the Wine Quality dataset, including exploratory data analysis (EDA), data preprocessing, model training, and performance evaluation using MSE and RMSE.
analytics data datacleaning decision-tree-regression exploratory-data-analysis gradient-boosting-regressor linear-regression machine-learning mean-square-error model
Last synced: 21 Aug 2025
https://github.com/eugenedakin/des-encryption-decryption
Encrypt and Decrypt text in Xojo using DES - Written in Native Xojo Language - Cross Platform
data data-encryption-standard decryption des encryption standard xojo
Last synced: 24 Feb 2026
https://github.com/lancewalk87/cls-cloud-sync-ruby-on-rails
Software | SQL Database with automated Cloud Sync for mitigating lost data across dist. servers. Managed by Ruby on Rails.
cloud-computing cloud-storage data database ruby ruby-application ruby-on-rails server sql
Last synced: 24 Jul 2025
https://github.com/pawal/tldmonitor-ui-go
Web UI for TLDMonitor
analysis data dns go golang mongodb statistics webapp website
Last synced: 16 Jan 2026
https://github.com/foufou-exe/finegourmet
Project Business Intelligence Architecture
business-intelligence data docker docker-compose etl mysql powerbi pyspark python workflow
Last synced: 11 Apr 2026
https://github.com/atharvapathak/twitter_sentiment_analysis_project
Twitter sentiment analysis is the process of analyzing tweets posted on the Twitter platform to determine the overall sentiment expressed within them. It involves using natural language processing (NLP) and machine learning techniques to classify tweets.
api bag-of-words bert cnn data gbm nltk rnn spacy twitter
Last synced: 28 Jan 2026
https://github.com/cosmos-loops/cosmos-data
Cosmos.Data is a inline project of COSMOS LOOPS PROGRAMME to provide several SQL-Query, RMDB/ORM and No-SQL components' extensions.
connection-pool data mysql mysqlconnector oracle postgresql sqlite sqlkata sqlserver transaction uow
Last synced: 12 Apr 2026
https://github.com/acovaci/orbit
ORBIT: an Open source Rust-based implementation of a data Build Tool, inspired by DBT
cargo clap-rs data data-warehouse dbt rust rust-lang tokio-rs
Last synced: 16 Mar 2025
https://github.com/farhashaad/farhashaad98
This is a repository to showcase my skills, share projects and track my progress in Data Science related projects.
data data-visualization dataanalysis matplotlib pandas python seaborn sql tableau
Last synced: 24 Apr 2026
https://github.com/karolkrupa/javascript-orm-mapper
ORM mapping library. Especially for Rest API
api data data-mapper entity es6 javascript mapper model mongo mysql node nuxt orm relational rest typescript vue vuex
Last synced: 10 Apr 2026
https://github.com/justinhennis1/hackathon24
Hofstra's Hacknology Competition 2024 - Team Null Pointers
data data-analysis data-science data-visualization data-visualization-python dataanalysis dataanalytics traveling web webapplication
Last synced: 21 Aug 2025
https://github.com/badranalyst/covid-deaths-dashboard-with-tableau
This project showcases an interactive dashboard developed in Tableau to visualize COVID-19 deaths data. It provides insights into trends, geographical distributions, and key metrics related to mortality during the pandemic. The dashboard aims to enhance understanding of the data, supporting public health analysis and decision-making.
covid-19 dashboard data data-analysis data-visualization dataset tableau tableau-dashboards visualization
Last synced: 02 Mar 2026
https://github.com/team-hydrogen/2025-adc-data
All files relating to the computation of the data provided
data jupyter-notebook nasa-app-development-challenge
Last synced: 11 Apr 2025
https://github.com/giscience/measures-rest-oshdb-app
A frontend for providing measures for geospatial datasets, using the OSHDB
data dggs geospatial measure openstreetmap rest
Last synced: 20 Apr 2026
https://github.com/tks18/xl-pq-handler
A Pythonic Power Query (.pq) File Manager for Excel & Power BI Automation
analytics automation data excel power-query powerbi python xlwings
Last synced: 20 Jan 2026
https://github.com/snimmagadda1/luigi-etl-example
🔍 Example of an ETL pipeline using Spotify's Luigi
data luigi luigi-pipeline python spotify
Last synced: 30 Mar 2025
https://github.com/zeptosec/bpscrapper
Shows history of oil prices
data data-visualization database nodejs scraper
Last synced: 13 Apr 2026
https://github.com/prishabhanot/facial_recognition_pca
A face recognition system using Principal Component Analysis (PCA) for dimensionality reduction and a Support Vector Machine (SVM) classifier for classification. PCA extracts essential features (eigenfaces) from facial images, significantly reducing computational complexity while retaining critical information for accurate recognition.
data eigenfaces facial-recognition pca python reducing-computational-complexity reducing-data-dimensions svm-classifier
Last synced: 01 Mar 2025
https://github.com/jneidel/nationalities
Dataset of 100 common nationalities
data dataset json nationalities nationality opendata
Last synced: 25 Mar 2025
https://github.com/anderson-andre-p/datastructuresandalgorithms
Data Structures and Algorithms to Study
data data-structures data-structures-and-algorithms dsa dsa-algorithm dsa-learning-series structured-data
Last synced: 20 Aug 2025
https://github.com/stdlib-js/ndarray-vector-uint32
Create an unsigned 32-bit integer vector (i.e., a one-dimensional ndarray).
constructor ctor data javascript ndarray node node-js nodejs stdlib structure types uint32 vec vector
Last synced: 25 Apr 2026
https://github.com/ashamethedestroyer/data-structures
Dedication of all Data Structures Creation 🛠
cpp data data-structures implementation implementation-of-data-structures structure structured-data
Last synced: 23 May 2026
https://github.com/keminghe/osu
Unofficial and publicly-available NPM data-package about The Ohio State University.
college data majors ohio-state organizations public students university unofficial
Last synced: 06 Jan 2026
https://github.com/rachelresende/projeto-finan-as
Este repositório é referente a um curso de análise de dados para finanças que realizei em 2025 na Udemy.
analytics data financas finance finance-management
Last synced: 19 Aug 2025
https://github.com/urvish-06/seaborn-dataset
Seaborn data sets
csv csv-files data data-science data-visualization dataset example jupyter-notebook jypyternotebook python seborn vacation
Last synced: 18 May 2026
https://github.com/carlosrs14/parallel-data-preprocessig-system
A parallel data preprocessing system using threads and synchronization mechanisms (barrier, busy-waiting, condition variables) to clean and prepare data for AI training.
barrier-method c condition-variable data operative-systems parallel-computing posix preprocessing synchronization threads
Last synced: 24 Jul 2025
https://github.com/rugwiroparfait/alx_sql
This repo is where I save my queries and learning materials in Data Science program from ALX
anaconda data data-analysis jupyter-notebook sql
Last synced: 19 Aug 2025
https://github.com/h4fide/politicalcompassbot
This Python project allows you to take a quiz and find out where you fit on the political compass. Give it a try and see where you stand!
bot data greedy-algorithms politics python python3 sql telegram
Last synced: 19 Aug 2025
https://github.com/nmelgar/birthday_sports_dataviz
We will analyze how the Matthew Effect has influenced in professional sports players.
analysis csv data data-analysis data-science data-visualization datavisualization dataviz probability research tableau
Last synced: 08 Jan 2026
https://github.com/justinjjlee/simulation-discrete
Employing data transformations and simulations to answer random questions
analytics data data-science julia python simulation spark
Last synced: 30 Apr 2026
https://github.com/luminati-io/Google-Maps-dataset-samples
A sample dataset of over 1000 Google Maps businesses, extracted using the Bright Data API, ideal for competitor analysis, location-based marketing, and market strategies.
api data dataset google-maps maps web-scraping
Last synced: 09 Apr 2025
https://github.com/luminati-io/ZoomInfo-dataset-samples
A sample dataset of over 1000 ZoomInfo companies, extracted using the Bright Data API, ideal for market growth, lead generation, and market analysis.
b2b business companies data data-extraction database dataset datasets web-scraping zoominfo
Last synced: 09 Apr 2025
https://github.com/als8446/tripleten-data-science-projects
Projects Overview Projects made in the Data Scientist course from TripleTen LatAm
data data-analysis hypothesis-tests machine matplotlib numpy pandas python scipy sklearn
Last synced: 10 Apr 2026
https://github.com/bertrand31/one-billion-rows-challenge
🌪️ Pushing Scala to its limits to aggregate a billion rows' worth of data in 2.42 seconds
competitive-programming competitive-programming-contests data data-engineering data-processing performance scala
Last synced: 05 Sep 2025
https://github.com/spajai/etl-sharepoint-data-uploader-pipeline
Custom Python Script to Pull specific data from source and Upload to the Microsoft SharePoint
data etl etl-pipeline microsoft microsoft365 python3 sharepoint sharepoint-online
Last synced: 11 Nov 2025
https://github.com/rubyonworld/ruby-cldr
The Unicode Consortium's Common Locale Data Repository (CLDR) contains tons of high-quality locale data such as formatting rules for dates, times, numbers, currencies as well as language, country, calendar-specific names etc.
common data locale repository ruby unicode
Last synced: 12 Nov 2025
https://github.com/KarajMiglani-DataScientist/karajmiglaniFAKE-NEWS-DETECTION
FAKE_NEWS_PREDICTION
algorithms data data-science flask machine-learning probability-statistics python statistics structure
Last synced: 19 Aug 2025
https://github.com/braiso-22/ejercicio-seguro-medico
Ejercicio de acercamiento a los datos para hacer predicciones
data data-science dataset ia insurance jupyter-notebook ml python python3
Last synced: 24 Apr 2026
https://github.com/desoga10/nety-form
In this tutorial, I show you how to send data from a form to the Netlify dashboard. I also show you how to create a form using Materialize.
contact-form css css3 data form forms html html5 materialize materialize-css materializecss-framework netlify
Last synced: 03 Jan 2026
https://github.com/entropyorg/p5-data-testimage
:notebook::camera: interface for retrieving test images
Last synced: 29 May 2026
https://github.com/rationalprabal/book-management-app
A Node.js and Express.js application for managing books, featuring role-based authentication and authorization with JWT, file uploads for book cover pages, robust data validation and documentation using swagger. The project includes user roles such as Admin, Author, and Reader, each with specific permissions.
data expressjs jwt-authentication mongodb mongoose nodejs rbac-roles
Last synced: 10 Apr 2026
https://github.com/mukul273/spring-data-rest-jpa-demo
Spring Data Rest JPA Demo
data jpa rest spring spring-boot spring-mvc
Last synced: 20 Apr 2026
https://github.com/otoneko1102/roulette-base
ルーレットの色と番号をjson形式でまとめたものです。カジノ風ルーレットを作るときにどうぞ。A collection of roulette colors and numbers in json format. Use it when making a casino-style roulette.
casino casino-games data json require roulette
Last synced: 16 Mar 2025
https://github.com/vishwas-chakilam/twitter-sentiment-analysis
Twitter Sentiment Analysis is a Python project that analyzes the sentiment of tweets based on a user-defined keyword. It uses Tweepy to fetch tweets from the Twitter API and TextBlob for sentiment analysis. The application features a user-friendly GUI with Tkinter, displaying tweet sentiment as positive, negative, or neutral.
api data data-science dataanalysis python3 textblob-sentiment-analysis tkinter tweepy-api
Last synced: 11 Mar 2025
https://github.com/ttozatto/sparkify
Churn Prediction for music streaming app with PySpark
analysis churn data learning machine predictive pyspark science spark
Last synced: 16 Jan 2026
https://github.com/hakusaro/facts
A fact based knowledge system (FBKS) experiment.
Last synced: 03 Jan 2026
https://github.com/wittyicon29/zeotap-ds-assignment
Internship application assignment
Last synced: 19 Aug 2025
https://github.com/nagipragalathan/linkedin_backup_datas
This repository contains the backup data from my previous LinkedIn account. Unfortunately, my old LinkedIn account was compromised and subsequently blocked by LinkedIn. As a result, I created a new account, but that too got blocked for reasons unknown to me.
backup blocked data linkedin linkedin-account memory nagipragalathan recovery storage
Last synced: 18 Jan 2026
https://github.com/idhruvs/angular4-smart-table-demo
Angular4 Smart Table Demo Project
angular4 data tables typescript
Last synced: 21 Apr 2026
https://github.com/nevoland/unchangeable
🧊 Tools for immutable values.
data datastructure functional immutable persistent pure stateless
Last synced: 24 Jul 2025
https://github.com/progati00/marketing-mix-modeling-mmm-for-marketing-budget-optimization
A Marketing Mix Modeling (MMM) project using Python to analyze channel performance, calculate ROI, and simulate marketing budget changes for better business decisions. Includes a trained Linear Regression model, ROI analytics, and a Flask API for revenue prediction.
api budget-optimization data data-analysis data-science ecommerce eda flask jupyter-notebook linear-regression machine-learning marketing-analytics marketing-mix-modeling python roi-analysis vscode
Last synced: 14 Apr 2026
https://github.com/goutamhegde002/dsa-roadmap-for-beginners
The "DSA Roadmap for Beginners" repository is a comprehensive guide designed to help beginners learn Data Structures and Algorithms (DSA) efficiently. It provides structured content covering fundamental and advanced topics in DSA, with practical examples, exercises, and coding problems.
beginner beginner-friendly beginner-guide coding-practice data data-science data-structures data-structures-and-algorithms dsa dsa-algorithm dsa-learning-series dsa-practice dsa-roadmap interview-preparation interview-resources programming-fundamentals
Last synced: 28 Feb 2026
https://github.com/cunfuu/network-bubbles
For Easier to manage organizations and keeping notes about them to organize events and easy access their needs
data data-visualization organizations organizations-volunteer
Last synced: 31 Jul 2025
https://github.com/jstafford5380/provausio.testing.generators
Generate fake data for testing and/or mocking
data fake-data generator testing
Last synced: 14 Jan 2026
https://github.com/rosette-api/mock-data
Mock data that is used for unit testing of the Babel Street Analytics bindings
data entity-extraction entity-level-sentiment entity-linking entity-relationship entity-resolution language-detection machine-learning mock-data morphology natural-language-processing nlp relation-extraction sentiment-analysis test-framework testing text-mining text-processing tokenization
Last synced: 04 Mar 2026
https://github.com/bmcollier/contiguous
Provides COBOL-style contiguous data structures in Python
Last synced: 14 Jan 2026
https://github.com/infinitode/pywebscrapr
An open-source Python web scraping tool. Supports both image scraping and text scraping.
data data-collection data-science open-source pip scraping web-scraper
Last synced: 14 Feb 2026
https://github.com/cmdrvl/rvl
rvl reveals the smallest set of numeric changes that explain what actually changed between two datasets — or confidently tells you nothing changed.
cli csv data data-quality data-validation diff finance numerical-analysis open-source ops rust tooling
Last synced: 25 Feb 2026
https://github.com/tasosfotiadis/time-series-forecasting-for-bitcoin
This project forecasts Bitcoin’s daily closing price using time series models. Data from Jan 2021 to Mar 2022 is processed by converting timestamps, resampling, and handling missing values. LSTM and ARIMA models are evaluated on MAE, RMSE, and MAPE, with LSTM achieving better accuracy while ARIMA is faster in training and inference.
arima bitcoin data data-analysis data-science deep-learning forecasting jupyter-notebook neural-networks python time-series
Last synced: 06 May 2026
https://github.com/vedikasnehil/my-data-science-projects
This repository is a comprehensive collection of resources and implementations dedicated to the field of Data Science. It serves as a platform for exploring various aspects of data science, ranging from data preprocessing and exploratory data analysis (EDA) to machine learning and deep learning.
data data-science deep-learning machine-learning matplotlib numpy python sql visualization
Last synced: 10 Apr 2026
https://github.com/allanotieno254/powerbi-dax-filter-context
This repository contains a Power BI project that explores **DAX Filter Context**, a crucial concept in DAX calculations. The project focuses on **Bank Loan Analysis**, demonstrating how different filter contexts affect DAX formulas.
business-intelligence data data-analysis dax dax-functions powerbi powerbi-visuals visualization
Last synced: 08 Jan 2026
https://github.com/filipnet/infoscreen
Arduino subscribes values by MQTT and view info on an OLED I2C display
arduino data display i2c mqtt oled-display-ssd1306 visualization weather weatherstation
Last synced: 12 Apr 2026
https://github.com/saritaphd/predicting-performance-of-students---complete-ml-project-with-deployment-using-aws
Student performance analysis with deployment (End to end ML project)
aws data data-science deployment jupyter-notebook machine-learning python visualization
Last synced: 10 Apr 2026
https://github.com/dms-codes/scrape-tokoalvabet-com
Toko Alvabet Data Scraping and Price Comparator This Python script is designed to scrape data from Toko Alvabet's website and perform price comparison for the obtained products. It includes features for viewing and analyzing product data, as well as comparing prices with other sellers.
Last synced: 29 Jul 2025
https://github.com/sasanthns/sql_data_warehouse_project
A comprehensive guide to building a modern data warehouse with SQL Server, including ETL processes, data modeling, and analytics.
data data-analysis data-science data-warehouse datacleaning etl etlpipeline sql sqlserver
Last synced: 24 Mar 2025
https://github.com/nolanbconaway/rollercoaster-tycoon-data
Every roller coaster I have built in RCT2 for iPad
Last synced: 24 Mar 2025