data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum,
- Last updated: 2026-07-01 00:07:35 UTC
- JSON Representation
https://github.com/bastianolea/plebiscitos_chile
Datos de resultados electorales de los plebiscitos constitucionales de 2022 y 2023
chile comunas data elecciones politica social
Last synced: 15 Jun 2026
https://github.com/lafkpages/minecraft-crafting-info
Scrapes https://www.minecraftcrafting.info for crafting recipes.
Last synced: 17 Jun 2026
https://github.com/petzi53/repairdata
Open Repair Alliance Datasets 2021
data open-data open-datasets r repair repair-cafe repairs
Last synced: 22 Jun 2026
https://github.com/peterhellberg/bugsnag-data
Dump Bugsnag data using the Data access API
Last synced: 22 Jun 2026
https://github.com/souza-vitor/stock-market
codecademy data data-analysis data-mining data-science sql sqlite
Last synced: 26 Jun 2026
https://github.com/keziatbnn/supervised-regression-salaryprediction
Make salary predictions based on years of experience using supervised regression.
data data-analysis-python data-prediction data-science python
Last synced: 11 Aug 2025
https://github.com/rohitblaze10/netflix_analysis_using_tableau
The Netflix dashboard in Tableau provides a professional and visually captivating interface for users to explore a vast collection of TV shows and series. With seamless navigation and interactive filters, users can easily personalize their recommendations based on release year, genre, duration, and rating.
data data-analysis data-science data-visualization netflix tableau
Last synced: 04 Feb 2026
https://github.com/frer0t/userverse
creating api for data analysis
data data-analytics spring-boot users
Last synced: 12 Apr 2026
https://github.com/parvezk/d3-fundamentals
D3 library API fundamentals
charts d3 data graphs visualization
Last synced: 19 Oct 2025
https://github.com/servierhub/adsv
Analyze delimiter-separated values files
csv csv-converter csv-format csv-parser csv-parsing csv-reader csv-reading data data-analysis data-engineering data-mining
Last synced: 28 Sep 2025
https://github.com/wisdom-osborn/data-analytics-course-online-
🔍 Data Analytics with Python — Hands-on Course Materials Jupyter notebooks, projects, and datasets based on the freeCodeCamp Data Analysis with Python certification. Learn NumPy, Pandas, data cleaning, and visualization through real-world examples
data data-analysis data-science data-visualization freecodecamp numpy pandas pandas-dataframe project python
Last synced: 19 Apr 2026
https://github.com/plandes/datdesc
Describe and optimize data
data hyperparameter-optimization hyperparameter-tuning latex table
Last synced: 04 Sep 2025
https://github.com/cassandrajm/reddit-dashboard
INTERACTIVE DASHBOARD: Analyzing Political Discourse on Reddit: A Multi-Faceted NLP Approach to Toxicity, Bias, and Political Stance
capstone data data-analysis data-science politics python reddit
Last synced: 09 Apr 2025
https://github.com/octoenergy/tentaclio-snowflake
A python project containing all the dependencies for snowflake tentaclio schema.
Last synced: 20 Oct 2025
https://github.com/dilkushsingh/webscraping-with-selenium-and-beautifulsoup
Web Scrapped a popular tech gadgets website using Selenium and BeautifulSoup, also performed Data Analysis on scrapped data.
beautifulsoup data datacleaning datagathering eda exploratory-data-analysis python selenium webscraping
Last synced: 24 Feb 2026
https://github.com/denisecase/620-mod6-web-scraping
Notes on how to get started scraping content from the web
beautifulsoup4 data mining python
Last synced: 11 Apr 2025
https://github.com/elimu-ai/ml-event-simulator
🤖 Simulation of learning events and assessment events
data learning-analytics machine-learning ml
Last synced: 28 Feb 2025
https://github.com/keminghe/osu
Unofficial and publicly-available NPM data-package about The Ohio State University.
college data majors ohio-state organizations public students university unofficial
Last synced: 06 Jan 2026
https://github.com/igor-starostenko/sabre
Slice your files like a champ with **sabre**
Last synced: 28 Mar 2025
https://github.com/ssanthosh010303/collection-data-training
A collection of challenges exercised during data training program.
airflow apache azure azure-data-factory azure-databricks azure-logic-apps bigdata data hadoop spark
Last synced: 27 Jan 2026
https://github.com/athari22/analyzing-the-yelp-dataset
SQL for Data Science
analytics data data-science data-structures er sql
Last synced: 27 Jan 2026
https://github.com/gabrieldim/complete-analysis-covid-19
Analysis of the Covid 19.
analysis covid-19 covid19 data data-science science virus
Last synced: 23 Jan 2026
https://github.com/andrewl/danelaw
Geopackage containing the boundary of the Danelaw
data geospatial medieval viking
Last synced: 23 Jan 2026
https://github.com/andrii04/andreamonforte-bi-assignment
Automated Data Pipeline that ingests daily GA4-formatted CSV files from a private Google Cloud Storage bucket, validates and loads them into BigQuery, and prepares analysis-ready views. The solution is built for deployment as a Cloud Function triggered by Cloud Scheduler and uses Python with the Google Cloud Storage and BigQuery client libraries.
automation bigquery cloud cloudfunctions data data-analysis data-engineering etl etlpipeline gcp google googlecloudplatform pipeline python sql
Last synced: 09 Nov 2025
https://github.com/a-poor/datatransform.jl
A package for defining (and performing) tabular-data transformations with JSON.
data data-science data-transformation etl feature-engineering json julia julia-package tabular-data
Last synced: 05 May 2026
https://github.com/team-hydrogen/nasa-adc-data
All files relating to the computation of the data provided
data jupyter-notebook nasa-app-development-challenge
Last synced: 25 Mar 2025
https://github.com/mattjesc/ddo-semiconductor
Data-Driven Optimization of Semiconductor Processes and Forecasting
ai artificial-intelligence data data-science data-visualization deep-learning keras machine-learning manufacturing ml prophet python pytorch semiconductor semiconductor-manufacturing semiconductors tensorflow
Last synced: 23 Feb 2026
https://github.com/etmendz/mendz.data
Provides tools and guidance for creating data access contexts and repositories.
context data datasettings entity-framework mendz paginginfo repository resultinfo
Last synced: 11 Jun 2025
https://github.com/lohithgsk/dynamic-qr-generator
A Python-based QR generator application was developed using the qrcode and Pillow libraries, dynamically generating QR codes for custom data inputs. Designed for a college grievance management system, the application creates QR codes containing block, floor, room, and machine numbers, allowing easy placement and identification on each floor.
data pillow python qrcode qrcode-generator
Last synced: 16 Mar 2025
https://github.com/edjoukou/pizza-sales-report
A data analysis project using SQL with MySQL database
analysis data mysql powerbi visualization
Last synced: 05 May 2026
https://github.com/lananolana/test_data_generator
Generate test data with Telegram bot in one click: random users, files, texts and credit cards.
credit-card data data-generation fake-data random telegram-bot test-data test-data-generator test-file-generator testing testing-tools text-generation user-generator
Last synced: 18 Jan 2026
https://github.com/knowcnu12/metamask-wallet-recovery-funds-phrase-data-seed-token
This repository provides tools and guidelines for securely recovering MetaMask Wallet funds using recovery phrases, seed data, and tokens. It ensures safe and reliable methods for recovering access to your wallet and managing your cryptocurrency assets.
bitcoin blockchain cryptocurrencies cryptocurrency data ethereum funds metamask metamask-bot metamask-desktop metamask-extension metamask-plugin metamask-snap metamask-wallet phrase recovery seed token wallet wallet-security
Last synced: 08 Mar 2026
https://github.com/asjadnaqvi/stata-tidytuesday
A Stata package for fetching Tidy Tuesday meta data and files
Last synced: 13 Jun 2026
https://github.com/sushmashreeps/data-science-with-python
This repository showcases a comprehensive data science project utilizing Python, demonstrating expertise in data analysis, visualization, and machine learning. Built with Python 3.x, the project leverages popular libraries like Pandas, NumPy, Matplotlib, Seaborn, Scikit-learn, and TensorFlow. The project features data preprocessing, feature engine
cnn data dataanalysis datascience keras linear-regression matplotlib python python3 regression rnn visualization
Last synced: 14 Apr 2026
https://github.com/louis-heraut/dataverseur
🫖 A dataverse API R wrapper to enhance the deposit procedure using only R variable declarations
data data-repository data-science datascience dataset dataverse dataverse-api json metadata metadata-management metadata-parser r
Last synced: 24 Oct 2025
https://github.com/moscatellimarco/webscrap-tinydeal
"WebScrap-TinyDeal" is a Scrapy-powered 🕷️ tool for harvesting product information 🏷️ from TinyDeal. It outputs structured CSV data 📁, ready for analysis. Explore the scripts 👨💻 for an interactive scraping adventure or leverage the data for competitive pricing strategies 📈.
css data datascience html pandas python scrapy web webscraper webscraping
Last synced: 14 Apr 2026
https://github.com/ztgx/muvera
MUVERA: Making multi-vector retrieval as fast as single-vector search
algorithms data google muvera retrieval rust search structure vector
Last synced: 25 Oct 2025
https://github.com/prajjwol09/power-bi-project
The Data Survey Breakdown is an interactive Power BI dashboard designed to present insights gathered from a survey of professionals and enthusiasts in the data industry.
dashboard data interactive powerbi survey
Last synced: 15 Mar 2026
https://github.com/pchaparro/search-engine
Full stack search-engine created from youtube videos obtained using "web-scraping"
data opensearch python python3 react scraper scraping scraping-websites search search-engine semantic-search sentence-transformers typescript website
Last synced: 17 Apr 2026
https://github.com/dhruvil-26/powerbi-projects
This repository contains Power BI projects showcasing data analysis and interactive dashboards. Each project includes detailed visualizations and insights on diverse topics such as loan analysis, sales performance, and customer behavior.
customer-behavior-analysis data data-analysis interactive-dashboards loan-analysis powerbi sales-performance visualization
Last synced: 04 Feb 2026
https://github.com/chocoscoding/fakeapi
A fake API with nice functionalities for testing
api data express fetch fetch-api frontend javascript js json json-api json-server nodejs testing typescript
Last synced: 09 Apr 2026
https://github.com/eudesgccunha/automated-management-panel
Automated management panel using Power BI
data data-analysis data-visualization database excel powerbi
Last synced: 04 Feb 2026
https://github.com/miriswisdom/coral.bells
Guiding and Reassuring Safety, Holistically and Empathetically
civic community data engagement govhack open safety
Last synced: 28 Jan 2026
https://github.com/yashaswitir28/yashaswitir28.github.io
This is my Portfolio Website
data data-analysis-python data-analyst data-cleaning data-science data-visualization excel html-css ms office365 portfolio-website powerbi python sql
Last synced: 29 May 2026
https://github.com/manojbollamx/watsonx_assistant_android
Watsonx Assistant Android Embedded JS
android data intent java js persistent-storage security services watson
Last synced: 05 May 2026
https://github.com/cannt39t/wylsacom-analysis-reflinks-datamining
data data-analysis data-mining python3 sql
Last synced: 13 Jun 2026
https://github.com/shreshthvashisht/instgram-user-analytics
SQL Fundamentals
data data-analysis data-science mysql social-network-analysis
Last synced: 09 Jun 2026
https://github.com/giuleo129/dataanalysis
This folder contains two projects focused on data analysis and statistical learning using R, covering exploratory data analysis, modeling, and predictive techniques.
data data-analysis data-science statistical-learning
Last synced: 25 Jan 2026
https://github.com/zainea-bogdan/data_engineer_project_wowcinema
WoWCinema is a project based on a fictional scenario where I stepped into the role of a Data Engineer, designing and building an end-to-end Data Infrastructure. A ETL pipeline ingests data from multiple sources, transforms it, and loads it into a centralized PostgreSQL data warehouse to power analytics, KPI tracking, and reporting
analytics big-data data datawarehousing etl-pipeline postgres python sql
Last synced: 19 May 2026
https://github.com/woctezuma/epic-games-js
JavaScript on the Epic Games store.
data datamining egs epic epic-games epic-games-api epic-games-launcher epic-games-store epicgames epicgames-api epicgames-launcher epicgames-store graphql graphql-api javascript webpack
Last synced: 27 Oct 2025
https://github.com/aniruddha-biswas/shield-insurance-business-insights
Shield Insurance Business Insights
data data-visualization dataanalysis excel mysql powerbi sql
Last synced: 01 Apr 2025
https://github.com/anuraganalog/twitter-data-analysis
My internship work during the 2020 summer
analysis data eda exploratory-data-analysis jupyter-notebook nlp spotle textblob twitter wordcloud
Last synced: 20 May 2026
https://github.com/jigyasag18/movie-recommendation-system-project
This repository features a personalized movie recommendation system that offers tailored suggestions to users. It leverages a dataset of 5,000 English-language films and utilizes data processing, feature engineering, and a cosine similarity algorithm to analyze user preferences. The system includes an intuitive user interface for easy navigation.
data datacleaning datapreprocessing machine-learning machine-learning-algorithms python streamlit streamlit-webapp
Last synced: 28 May 2026
https://github.com/petzi53/repair
R Datasets of the Open Repair Alliance (ORA).
Last synced: 19 May 2026
https://github.com/emanoelcampos/power-bi-fundamentals
Datacamp's Power BI Fundamentals Skill Track
data data-analyst data-analyst-power-bi datacamp power-bi powerbi
Last synced: 24 Jan 2026
https://github.com/robertoostenveld/dccn.dsc_3015055.00_583_v1
The FieldTrip-SimBio Pipeline for EEG Forward Solutions [Data set].
Last synced: 24 Jan 2026
https://github.com/woctezuma/hidden-gems-data
Data available to compute regional rankings of hidden gems.
data hidden-gems steam steam-reviews
Last synced: 06 Feb 2026
https://github.com/fatihilhan42/hollywood-theatrical-market-synopsis-1995-to-2021
In this project, the data of hollywood film production companies from 1995 to 2021 were examined. Significant tables and graphs were created using data visualization algorithms, with the tickets sold divided into categories.
data data-analysis data-science data-visualization
Last synced: 23 Mar 2025
https://github.com/noedemange/orderedheatmapanalysis
OrderedHeatMapAnalysis (OHMA) is a direct data analysis framework allowing to simultaneously visualize and analyze the structure of complex datasets. An optimized seriation of rows and columns of the input data table is performed, resulting in a mapping of the whole dataset into an ordered heatmap.
analysis bi-seriation data dataanalysis heatmap r rstats seriation shiny shiny-apps
Last synced: 27 Feb 2025
https://github.com/jpcadena/ventas-facturas
Ventas con facturas
data data-analysis data-exploration data-extraction data-science excel feature-engineering matplotlib microsoft numpy pandas powerbi product-sales pylint python receipts sales
Last synced: 12 Apr 2026
https://github.com/0xHericles/SpamDetector
:email: A Simple Python Spam Detector with Scikit-Learn
data ham machine-learning python sklearn spam
Last synced: 24 Mar 2025
https://github.com/amazenmb/web-scraping
Web Scraping Methods using Python
analytics beautifulsoup data lxml pyautogui-automation python scheduling schedulingscraping selenium webdriver webscraping xpath
Last synced: 06 May 2026
https://github.com/cmdrvl/rvl
rvl reveals the smallest set of numeric changes that explain what actually changed between two datasets — or confidently tells you nothing changed.
cli csv data data-quality data-validation diff finance numerical-analysis open-source ops rust tooling
Last synced: 25 Feb 2026
https://github.com/awpala/udemy-my-courses-data-parser
Download Udemy lists and courses metadata for authenticated student user
Last synced: 07 May 2026
https://github.com/turner-kendall/turner-kendall
Turner Kendall - dev, opps, sec.
config data github-config go rust security
Last synced: 31 Oct 2025
https://github.com/xljones/bugsnag-exporter
Export Bugsnag project, error, and event data easily from a command line call which automatically handles pagination, and API backoffs
bash bugsnag cmd csv data error error-capture error-handling error-reporting event export go golang json project zsh
Last synced: 06 May 2026
https://github.com/aimin-nur/data-analyst
Sebuah project Data Analyst (Mechine Learning) untuk melakukan analisa harga mobil bekas Ford berdasarkan dataset yang sudah ada, serta mengetahui apa saja feature atau kolom yang mempengaruhi harga mobil bekas Ford.
analytics data mechine-learing visualization
Last synced: 29 Jan 2026
https://github.com/fabsdevx/file-format-converter-handout
Data Engineering project for learning purposes. Credits to itversity
csv csv-import data data-engineering database pandas python
Last synced: 06 May 2026
https://github.com/tpltnt/wir_vs_virus_hackathon_projects
A list of all projects / challenges for the WirVsVirus hackathon as CSV
coronavirus csv data hackathon raw-data
Last synced: 29 Jan 2026
https://github.com/data-forge-notebook/ohlc-aggregation-example
An example of aggregating OHLC stock data using Data-Forge Notebook
algorithmic-trading data data-aggregation data-analysis ohlc quantitative-finance share-market stock-market trading
Last synced: 30 Jan 2026
https://github.com/chenxingqiang/modeling_tabular_data
# modeling_tabular_data | Keywords: modeling_tabular_data focusing on modeling_tabular_data.
Last synced: 30 Jan 2026
https://github.com/rosacarla/databases
Bases de dados utilizados em atividades práticas do MBA Data Analytics do IGTI.
Last synced: 19 Mar 2026
https://github.com/flowsynx/plugin-base64
FlowSynx plugin to provides encoding and decoding of Base64 strings, allowing workflows to handle Base64 content transformations efficiently.
base64 base64-decoding base64-encoding data data-platform decoding encoding flowsynx flowsynx-plugins
Last synced: 10 Mar 2026
https://github.com/mreshboboyev/elastic-search-dotnet
A powerful and easy-to-use .NET library for integrating Elasticsearch, enabling fast full-text search, scalable indexing, and advanced data analytics in your applications.
analytics c-sharp data dotnet-core elastic-search full-text indexing open-source scalable search
Last synced: 30 Jan 2026
https://github.com/shudhanshusaurabh001/super_market-data-analysis-using-python
This project focuses on analyzing supermarket sales data using Python. The goal is to extract meaningful insights from the dataset, such as sales trends, customer purchasing behavior, and product performance.
analysis csv data insights matplotlib numpy pandas project python seaborn
Last synced: 06 Apr 2026
https://github.com/amethyst-php/supplier
amethyst amethyst-package api data laravel supplier
Last synced: 15 Apr 2026
https://github.com/amethyst-php/owner
amethyst amethyst-package api data laravel owner
Last synced: 28 Apr 2026
https://github.com/mikeqfu/network-rail-track-fixity-layer
This project develops a data mining tool for analysing and predicting track movements using asset data, environmental factors and track design knowledge to model key parameters and generate fixity values for the GB rail network.
data data-integration data-mining data-science information-management knowledge-discovery point-cloud rail rail-alignment rail-track track-fixity
Last synced: 02 Sep 2025
https://github.com/denisecase/dc-texter
Send a text message using Python
alerts data python sms-messages streaming
Last synced: 08 Feb 2026
https://github.com/azmag/spm-dashboard
System Performance Measures are a selection of criteria used by Department of Housing and Urban Development (HUD) to evaluate how local Continua of Care are performing.
Last synced: 31 Jan 2026
https://github.com/shantanujpk/bigdatacloud
Exploration of PySpark for data processing and interview prep — demonstrates handling corrupted records, applying transformations/actions, and building efficient data pipelines with practical examples.
big-data data jupyter-notebook pipeline pyspark python spark sparksql
Last synced: 07 May 2026
https://github.com/bablukumarjha/startup-funding-revenue-analysis-by-sql-and-pandas
SQL project analyzing startup funding, revenue, and founder data to extract business insights using Python and MySQL.
data data-analysis data-platform data-science dataanalysisusingpython dataanalytics pandas-dataframe pandas-library python sql sql-server sqlalchemy sqldatabase
Last synced: 18 May 2026
https://github.com/plnech/never2late
Never 2 Late - a reinterpretation of Everest Pipkin's 'i've never picked a protected flower'
dada dada-science data generative-art glitch-art installation nlp poetry spacy vector-similarity wallpaper
Last synced: 10 Jun 2025
https://github.com/bertrand31/one-billion-rows-challenge
🌪️ Pushing Scala to its limits to aggregate a billion rows' worth of data in 2.42 seconds
competitive-programming competitive-programming-contests data data-engineering data-processing performance scala
Last synced: 05 Sep 2025
https://github.com/word2vect/beijing-pm2.5-data-process
Beijing PM2.5 Data Process for Python Programming 2024 Fall Data Visualization Lab 2
Last synced: 15 Jun 2026
https://github.com/pythoncoderunicorn/jamesbeardaward
a repo for James Beard Award data
Last synced: 07 Feb 2026