data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum
- Last updated: 2026-06-29 00:07:49 UTC
- JSON Representation
https://github.com/juangesino/research-project
Course files for Research Project @ University of Amsterdam
data data-science economics stata
Last synced: 02 Jan 2026
https://github.com/atiqurcode/scrap-spec
Scrape data from HTML into HTML table code or JSON
data html-table json-data scarp
Last synced: 05 Feb 2026
https://github.com/ychaaby/text-classification-chat
ChatBot Boutique USPN
classification data python pytorch
Last synced: 05 Feb 2026
https://github.com/twilighty-abhi/locust-data-visualiser
Locust Data Visualiser
Last synced: 15 Aug 2025
https://github.com/buffdelta/basketball_ref_webscraper
Python package to make webscraping from basketball-reference easy
basketball data python python-library webscraping
Last synced: 14 Jan 2026
https://github.com/seqeralabs/ffq-api
A minimal wrapper to make ffq searches available via a REST API.
api data fastq fetch-fastq ffq genomics
Last synced: 15 Aug 2025
https://github.com/supremkc05/global-job-market-analytics
Scrape jobs from websites like Indeed/LinkedIn, extract skills using NLP, then visualize hiring trends.
beautifulsoup data machine-learning nlp pandas scrapping
Last synced: 14 Aug 2025
https://github.com/thedhruvish/datasciencewith
datasciencewith
coding data dataanylasis datascience learing machine-learning
Last synced: 08 Jun 2026
https://github.com/leoBitto/CloudForge
Data foundry
airflow data data-engineering django docker docker-compose grafana postgresql prometheus
Last synced: 14 Aug 2025
https://github.com/soenneker/soenneker.cloudflare.origincerts.thumbprints
The current Cloudflare origin certificate thumbprints
cloudflare csharp data dotnet origincerts thumbprint thumbprints
Last synced: 23 Apr 2026
https://github.com/dahmansphi/analysis_from_start_to_end
The Big Bang of Data Science- Analysis from the Start to The End- [Book Two]
analysis data data-analytics data-mining data-science hypothesis-testing jamovi machine-learning
Last synced: 08 Jan 2026
https://github.com/soenneker/soenneker.datatables.attributes.column
A C# attribute for Datatables.js column building
attributes column columns csharp data datatablecolumnattribute datatables dotnet mapping object
Last synced: 12 Mar 2026
https://github.com/fcoagz/rate-reader-epv
pyDolarVenezuela API utilities, image processing (EnParaleloVzla) to extract currency exchange rates from specific platforms, validating content against expected patterns
data finance json processing-images pydolarvenezuela
Last synced: 14 Jun 2025
https://github.com/2kabhishek/pybank
Data Analysis for the silliest Bank 💰🏦
csv data data-science learning pandas python topic1 topic2
Last synced: 12 May 2026
https://github.com/itsachrafmansari/moroccan-real-estate-analysis
Scrape, process, analyze, and visualize data from Avito.ma to uncover current trends in Morocco's real estate market.
api-scraping data data-analysis data-mining data-science data-scraping data-visualization eda exploratory-data-analysis morocco real-estate web-scraping
Last synced: 13 Aug 2025
https://github.com/rse/nebulize
Nebulize Security-Sensitive Information
data dsgvo gdpr information nebulize security sensitive
Last synced: 16 Mar 2025
https://github.com/mtwn105/phonepe-pulse-plus
An API on top of PhonePe Pulse Data APIs
cors data data-science express finance hacktoberfest heroku javascript nodejs phonepe pulse
Last synced: 09 Apr 2026
https://github.com/elijah-1994/pre-process-e-commerce-dataset
Importing, Cleaning, and Pre-Processing E-Commerce Data for Analysis Using MySQL.
analytics data dataanalytics datacleaning dataprocessing mysql mysql-database sql
Last synced: 11 Mar 2025
https://github.com/jleung51/foundations-dags
Data ETL pipeline to clean, process, and aggregate data from Canadian housing starts.
data data-engineering etl extract housing load pipeline transform
Last synced: 04 Oct 2025
https://github.com/bocchilorenzo/hugginginfo
Unofficial library to retrieve information from the HuggingFace website.
Last synced: 03 Apr 2026
https://github.com/itsmeyogesh22/Solved-8-Weeks-SQL-Challenge-Correct-Solutions
Part of the Serious SQL virtual apprenticeship program, this repository contains solutions for all eight case studies crafted by Danny Ma. For more information please visit: https://8weeksqlchallenge.com/
8weeksqlchallenge data dataanalytics datawithdanny postgresql sql sqlserver-2022 t-sql
Last synced: 29 Aug 2025
https://github.com/amazingandyyy/dataviz
amazingandyyy data data-visualization
Last synced: 08 Jan 2026
https://github.com/anand-sony/mttr-dashboard
Streamlit dashboard for MTTR analysis with shift-wise loss insights and machine-level downtime tracking.
analytics business-analytics dashboard data python statistical-analysis
Last synced: 30 May 2026
https://github.com/nafisalawalidris/nafisalawalidris
Configuration files for my GitHub profile. Welcome to my GitHub profile! I'm Nafisa Lawal Idris, a passionate Data Scientist with a strong interest in blockchain technology. Explore my GitHub portfolio to delve into the exciting world where data science and Bitcoin converge.
artifical-intelligence bitcoin config data data-science developer github-config github-pages machine-learning
Last synced: 16 May 2026
https://github.com/luminati-io/jupyter-notebooks-web-scraping
Perform web scraping interactively using Jupyter Notebooks, integrating coding, data analysis, and visualization into one seamless workflow.
beautifulsoup4 data jupyter jupyter-notebook pandas python requests seaborn virtual-environment web-scraper web-scraping
Last synced: 13 Apr 2026
https://github.com/panodata/tikray
A compact data transformation engine.
data data-transformation data-transformation-pipeline data-transformer jmes jmespath jq jqlang json json-pointer json-transform json-transformation json-translate json-translator transformation transon
Last synced: 04 Oct 2025
https://github.com/adri6336/payvis-android
An app that enables people working by the hour to keep track of how much they've earned.
android android-application app clock data data-visualization database finances financial-data json money money-management monitoring paycheck-records productivity records records-management time-worked work worktime
Last synced: 09 Apr 2026
https://github.com/robthree/cfnreader
Provides a simple way to read FNIRSI's CFN files (*.cfn) produced by the FNIRSI UsbMeter tool
cfn csv data fnirsi usb usb-tester
Last synced: 01 Mar 2025
https://github.com/aaisha-nexus/sql_company_insights
A beginner-friendly SQL project for managing employee records, departments, and sales transactions. Includes table creation, optimized queries, stored procedures, and window functions to extract business insights.
business-analytics data data-analysis dataanalysis-projects dataanalytics database-schema mssql-database query relational-databases sql sql-query ssms
Last synced: 12 Aug 2025
https://github.com/kadirlofca/unity-csvmaker
Quick and easy way to create and export .csv files from Unity.
Last synced: 09 Apr 2026
https://github.com/programmer-rd-ai/competitive-programming-solutions
A collection of my solutions to various competitive programming problems from platforms like LeetCode. This repository serves as a personal archive of my problem-solving journey, covering a range of algorithms, data structures, and problem-solving techniques.
algorithm algorithms algorithms-and-data-structures data datastructures dsa javascript pandas python structures
Last synced: 01 Mar 2025
https://github.com/ddofer/ddofer.github.io
Dan's Blog
blog cv data data-science machine-learning
Last synced: 12 Aug 2025
https://github.com/corneliustanui/personal_blogdown_website
This repo contains source files for my personal Blogdown-based website.
analyis analytics blog blogdown blogdown-sites data data-science hugo hugo-theme netlify personal-website rbind statistics web website
Last synced: 13 Feb 2026
https://github.com/amethyst-php/cycle
amethyst amethyst-package api cycle data laravel
Last synced: 17 May 2026
https://github.com/keziatbnn/supervised-regression-salaryprediction
Make salary predictions based on years of experience using supervised regression.
data data-analysis-python data-prediction data-science python
Last synced: 11 Aug 2025
https://github.com/oroszgy/hunlp-resources
Scripts and resources for making spaCy understand Hungarian.
corpus-linguistics data hungarian hungarian-language hunlp magyarlanc model natural-language-processing nlp resources script spacy wikipedia
Last synced: 18 May 2026
https://github.com/mcraiha/datagensharp
C# managed library for generating data
Last synced: 11 Aug 2025
https://github.com/roshaka/samplr
Samplr is a Python decorator for selecting a subset of items from a list, with options for customisation and informative console printouts.
data data-analysis data-engineering decorators list python sampling
Last synced: 14 Jan 2026
https://github.com/austinhartzheim/career-fair-backend
Backend for ECS Career Fair app
Last synced: 13 Apr 2026
https://github.com/andrii04/andreamonforte-bi-assignment
Automated Data Pipeline that ingests daily GA4-formatted CSV files from a private Google Cloud Storage bucket, validates and loads them into BigQuery, and prepares analysis-ready views. The solution is built for deployment as a Cloud Function triggered by Cloud Scheduler and uses Python with the Google Cloud Storage and BigQuery client libraries.
automation bigquery cloud cloudfunctions data data-analysis data-engineering etl etlpipeline gcp google googlecloudplatform pipeline python sql
Last synced: 09 Nov 2025
https://github.com/blueheron786/quranic-universal-library-mushaf-layouts
The Quranic Universal Library (QUL)'s Qur'an mushaf 15-line layouts (madini, uthmani)
data database layout mushaf quran sqlite uthmani uthmani-quran
Last synced: 13 Apr 2026
https://github.com/0xhericles/ufcg-geojson
GeoJSON file containing the blocks and buildings of the Federal University of Campina Grande.
data data-visualization geojson map open-source ufcg university
Last synced: 09 Feb 2026
https://github.com/ashita-ai/ashita-ai.github.io
Ashita AI - The island of misfit data tools
Last synced: 19 Feb 2026
https://github.com/stupidcucumber/elephant-crawler
System for mining texts from websites.
data data-mining-python python
Last synced: 25 Apr 2026
https://github.com/ahmad-ali-rafique/heart-disease-detection-model
A comprehensive project for detecting heart disease using machine learning, including data processing, model training, and evaluation metrics with AUC curve analysis.
artificial-intelligence data datascience heart-disease machine-learning modeling prediction-model
Last synced: 11 Aug 2025
https://github.com/chocoscoding/fakeapi
A fake API with nice functionalities for testing
api data express fetch fetch-api frontend javascript js json json-api json-server nodejs testing typescript
Last synced: 09 Apr 2026
https://github.com/danielrosehill/value-factors-data-vis
Streamlit app containing visualisations of the Global Value Factors Database (GVFD) released by the IFVI in 2024
data data-visualization sustainability sustainability-data
Last synced: 29 Jul 2025
https://github.com/srindot/fwuav-average-flight-data-collection
This repository is designed for collecting average data for a flapping wing UAV. The script acg_coeff_data_collection.py runs the necessary data collection, and the resulting data is saved into a CSV file called AverageFlightData.csv.
Last synced: 10 Aug 2025
https://github.com/ometman/vet-clinic
This is a database project for veterinary data management covering animals, owners, clinic employees, and visits; it is applicable to any data management need. It uses PostgreSQL, a relational database management system, and supports storing, updating, and querying data.
data database normalization postgresql postgresql-database queries sql sql-server-database tables transactions
Last synced: 13 May 2026
https://github.com/fabsdevx/files-to-database-loader-handout
Data Engineering project for learning purposes. Credits to itversity
csv data data-engineering database json pandas python
Last synced: 09 Apr 2026
https://github.com/0xkibh/datamining-algo
This repository consists of example implementations of data mining algorithms in Python
apriori-algorithm data datamining fp-growth python
Last synced: 19 May 2026
https://github.com/lukakerr/us-surnames
US Surname data visualisation using R. Displays top 25 US surnames and race/ethnic percentage per name.
Last synced: 05 Oct 2025
https://github.com/pathilink/ebury_case
Technical case study in Analytics Engineering using BigQuery, focusing on dimensional modeling and SQL queries for payment and client analysis.
Last synced: 05 Oct 2025
https://github.com/affan005-ai/tesla-stock-prediction
This project analyzes Tesla stock data and builds machine learning models to predict and classify stock movements. The analysis includes EDA, feature correlation, moving averages, and two models
data data-analysis data-science data-visualization-project eda machine-learning matplotlib pandas predictive-analytics predictive-modeling python scikit-learn
Last synced: 05 Oct 2025
https://github.com/amethyst-php/catalogue-product
amethyst amethyst-catalogue-product api catalogue-product data laravel
Last synced: 20 May 2026
https://github.com/rysteq/abstract-data-structures
This repository contains two programs written in C that implement the stack and queue ADTs
abstract-data-structures c data queue stack
Last synced: 06 Oct 2025
https://github.com/chubek/pyramid-dashboard
A Dashboard to Show Data Made Using Plotly Dash
dash data docker ml plotly plotly-dash python
Last synced: 19 May 2026
https://github.com/vim89/flowforge
Let's be honest - most data pipeline frameworks treat types as suggestions. Config files are strings. Schemas are "validated" at runtime. Data quality is an afterthought. So, let's do it differently
archetype data data-contracts data-engineering data-pipelines data-quality data-science database dataengineering datapipeline etl etl-framework pipelines scala scalability spark spark-sql spark-streaming
Last synced: 14 Apr 2026
https://github.com/analyticslover/sales-python-dashboard
Japan Sales Dashboard 2023
dashboards data data-analysis jupyter-notebook python3 sales streamlit
Last synced: 09 Apr 2026
https://github.com/paul-henryp/simulate-investment-strategies
This Java program simulates different investment strategies using historical stock market data. It allows users to test various strategies such as buy and hold, moving average, buying when the stock price is lower than the last purchase, and dollar-cost averaging.
data data-science investing-java java plots plotting simulated-data simulated-investments sp500 sp500-data-analysis
Last synced: 21 May 2026
https://github.com/eharshit/end-to-end-vendor-insights
End-to-end analysis of vendor performance for wholesale/retail businesses, featuring data ingestion, cleaning, insights, and interactive Power BI dashboards.
analysis analysis-algorithms analytics dashboard data data-analysis datascience jupyter jupyter-notebook pandas powerbi powerbi-report retail wholesale
Last synced: 07 Oct 2025
https://github.com/prajjwol09/sql_retail_analysis_project
This project demonstrates SQL-based data cleaning, exploration, and business analysis on a retail sales dataset. It involves setting up a database, removing null values, performing EDA, and using SQL queries to extract key insights such as top customers, best-selling categories, and monthly sales trends.
data data-analysis datacleaning dataexploration pgadmin4 sql
Last synced: 15 Feb 2026
https://github.com/iankitnegi/tableautales
"Discover my Tableau journey! Dive into data-driven stories, visualizations, and projects as I explore the power of data visualization."
data data-visualization tableau
Last synced: 21 Jan 2026
https://github.com/pythoncoderunicorn/startrek
a repo for Star Trek data from Technical Manuals
data klingon-language star-trek vulcan
Last synced: 07 Oct 2025
https://github.com/abdellah-laassairi/thyroid-disease-analysis
Thyroid dataset visualization dashboard in R
dashboard data flexdashboard imputation-methods rshiny visualization
Last synced: 18 Jan 2026
https://github.com/rahulthedevil/metric-converter
A simple utility package for converting between metric units such as meters, kilometers, grams, kilograms, liters, and more. A simple and powerful unit conversion solution.
convert converter data fraction imperial length mass measurements metric metrics ratio system temperature unit unit-conversion unit-converter units uom utilities weight
Last synced: 08 Oct 2025
https://github.com/jacob-pitsenberger/python-electronics-inventory-management-system-object-oriented-programming-project
Welcome to the Python Electronics Inventory Management System project repository! This project is a demonstration of Object-Oriented Programming (OOP) principles in Python for managing an electronic parts inventory.
data data-structures dictionary exception-handling file-io filesystem input-output inventory-management-system management-system modules oop pickle python user-interface
Last synced: 08 Oct 2025
https://github.com/danieljdufour/fast-b64
Quickly Convert between B64 and Binary Strings
b64 base64 base64-decoding base64-encoding binary bits compression data
Last synced: 08 Oct 2025
https://github.com/rahul1582/bank-loan-classification
Classifying whether or not a person takes a personal loan, using a range of classification algorithms.
algorithm analysis classi data
Last synced: 08 Oct 2025
https://github.com/chompfoods/sdk-java
Java SDK for the Chomp Food & Recipe Database API. Use our API to get high-quality data on recipes and 875,000+ branded/grocery foods plus raw ingredients.
api branded chomp data database food gradle grocery ingredients jar java java-sdk nutrition openapi raw recipe-api recipes sdk
Last synced: 09 Apr 2026
https://github.com/djdhairya/whatsapp-chat-analysis
WhatsApp chat analysis is a multidimensional process that delves into the content, structure, and dynamics of conversations within the platform. It provides valuable insights for personal reflection, organizational decision-making, and improving communication strategies.
data data-science dataanalytics datapreprocessing machine-learning ml
Last synced: 08 Oct 2025
https://github.com/shubhamsoni98/classification-with-random-forest-1
To classify sales into categories (Low, Moderate, High) using Random Forests to inform strategic decisions and optimize marketing strategies.
algorithms anaconda data data-science datacleaning eda jupyter-notebook machine-learning pyhton random-forest scikit-learn visualization
Last synced: 18 Jan 2026
https://github.com/mapaor/horaris-rodalies
Website that uses the Rodalies de Catalunya API to display train timetables in a more fun way
adif api ave barcelona bordils catalunya dades data distancia generalitat girona horaris md r11 regional renfe rodalies sants tren viajes
Last synced: 16 May 2026
https://github.com/mchenryspagg/wrangle-and-analyze-data
This project, known as 'wrangle and analyze data', involves wrangling the WeRateDogs Twitter archive data covering 2015 to 2017
api data dataanalysis datacollection datawrangling datetime json numpy os pandas pil python requests tweepy-api visualization
Last synced: 09 Apr 2026
https://github.com/coderixc/rforai
Learn R Programming Language for Statistics & Data Science
artificial-neural-networks data data-science deep-neural-networks machine-learning probability quant-analyst r science
Last synced: 09 Oct 2025
https://github.com/psyteachr/sdg-data
Data relevant to the UN Sustainable Development Goals
Last synced: 09 Oct 2025
https://github.com/sourceduty/clock_metadata
Recording time data and statistical metadata to .csv files.
clock data data-science metadata practice python time timing
Last synced: 08 Aug 2025
https://github.com/quetz-al/quetzal-openapi-client
Autogenerated Python client for the Quetzal API
client data data-science openapi-client openapi3 python quetzal
Last synced: 10 Oct 2025
https://github.com/sillyash/untappd-viz
A data visualisation page using public datasets and HTML/CSS/JS with D3.js.
beer beer-statistics data data-analysis data-visualization kaggle kaggle-dataset public-dataset school-project
Last synced: 18 May 2026
https://github.com/sourceduty/text_file_metadata
Extract metadata from .txt files and record the metadata in .txt files.
data datascience metadata metafile practice sourceduty
Last synced: 08 Aug 2025
https://github.com/loaiwalid07/automation_data_overviwe
This is a Streamlit app that gives an overview of a dataset you upload
automation data data-analysis data-exploration data-science data-transformation data-visualization
Last synced: 19 May 2026
https://github.com/gianlucatruda/qs-analyser
A quantified self data analysis script in Python 3.
data experiment matplotlib matrix optimization productivity python quantified quantified-self science self
Last synced: 10 Oct 2025
https://github.com/theopenwebjp/theopenweb-data-loader
Package for loading data to local project
data downloader import javascript typings
Last synced: 10 Oct 2025
https://github.com/j-sephb-lt-n/joes_giant_toolbox
A large collection of general python functions and classes that I use in my daily work
ascii browser classifier data dataviz gcp mime nlp python regex search statistics supervised web-scraping
Last synced: 10 Oct 2025
https://github.com/azkarmoulana/winter-of-data-2019
:snowflake: :snowman: Winter of Data is coming..... :wolf:
data data-science machine-learning mathematics
Last synced: 05 Feb 2026
https://github.com/loggdme/kyro
Collection of utilities and examples for creating efficient data pipelines in Go with parallel queues, rate limiters, and much more.
Last synced: 14 Jan 2026
https://github.com/stefanocoretta/aelfric-relatives
data old-english research-project
Last synced: 23 Feb 2026