data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum,
- Last updated: 2026-06-23 00:07:41 UTC
- JSON Representation
https://github.com/dumkydewilde/mcp-memory-layer
A template for building your own BI MCP with dbt, LLMs and multi-user corrections
Last synced: 13 Mar 2026
https://github.com/writetome51/pagination-page-info
Intended to help a separate Paginator class paginate data. Specifically, this class contains the properties `itemsPerPage` and `totalPages`, which will be used by other classes
batch data javascript paginate pagination typescript
Last synced: 09 May 2026
https://github.com/aldro61/mmit-data
The data used in the Maximum Margin Interval Trees paper
data machine-learning machine-learning-algorithms reproducible-research
Last synced: 19 Feb 2026
https://github.com/adadalshabab/data-engineering-gcp-project
An end-to-end modern data engineering project, including deployment of ETL pipeline on Google Cloud Platform, using BigQuery for data analysis and leveraging Looker to generate an insight dashboard.
bigquery data data-science data-visualization databases dataengineering-a engineering etl-pipeline looker-studio powerbi
Last synced: 19 Jan 2026
https://github.com/tyriek-cloud/nyc-dca-etl
Created an ETL pipeline to merge two CSV files (converted to JSON) into a parquet file using Azure Data Factory, The data was extracted from NYC Open Data: https://opendata.cityofnewyork.us/ and I created a Blob Container within an existing storage account.
azure azure-data-factory blob-storage data data-engineering etl-pipeline
Last synced: 21 Jan 2026
https://github.com/amethyst-php/courier
amethyst amethyst-package api courier data laravel
Last synced: 17 May 2026
https://github.com/tabarzin/dh
A collection of links to various resources on Digital Humanities
data digitalhumanities opensource
Last synced: 24 Jan 2026
https://github.com/odiegosilva1/flask-github-style
Página de login usando Jinja no Flask.
data flask jinja2-templates orm python
Last synced: 31 May 2026
https://github.com/isandyawan/simplelinearregression
A application to analyze data using simple linear regression. This application can make regression model from variable and give advice to user if the model break regression assumsion
data linear r regression rstudio shiny statistic
Last synced: 14 Oct 2025
https://github.com/jpcurada/exploralytics
A python package for creating intermediate plotly visualizations
data eda plotly python visualization
Last synced: 05 Feb 2026
https://github.com/poissonconsulting/klexdatr
An R package of data from the Kootenay Lake Exploitation Study
cran data fish kootenay-lake rstats
Last synced: 16 Oct 2025
https://github.com/fatihilhan42/nba-players-data-1950-to-2021
In this project, the data of the NBA players between the years 1950-2021 were examined. After the NBA players' season, height, performance, averages of points, teams and positions they played were obtained through csv files, important tables and graphs were created using data cleaning and data visualization algorithms.
data data-analysis data-engineering data-science data-visualization
Last synced: 16 Oct 2025
https://github.com/vanduc1102/parse-stackoverflow-data
Parse stackoverflow data
Last synced: 16 Oct 2025
https://github.com/saboye/sales-performance-analysis
A dashboard that presents monthly sales performance by product segment and product category to help clients identifying the segments and categories that have met or exceeded their sales targets, as well as those that have not met their sales targets.
dashboard data data-science eda tableau visualization
Last synced: 27 Jan 2026
https://github.com/mat06mat/matbot
My discord bot code
data discord-bot discord-py py-cord
Last synced: 17 Oct 2025
https://github.com/analyst-amitbisht/pizza-sales-report-
Its a guided project to practice tools like SSMS + Power BI & also skills like data cleaning, data exploration, data analysis, data visualization, etc.
analytics data data-visualization powerbi sql-server
Last synced: 18 Oct 2025
https://github.com/psgebeline/harvard-data-science
My work for the nine courses in Harvard's data science program, each with notes/assignments. Work in progress.
data linear-regression machine-learning modeling probability-theory r visualization wrangling
Last synced: 19 Oct 2025
https://github.com/erencelik/binance-public-data-node
Nodejs downloader and unzipper script for Binance Public Data
binance data downloader nodejs public script
Last synced: 15 May 2026
https://github.com/dilkushsingh/webscraping-with-selenium-and-beautifulsoup
Web Scrapped a popular tech gadgets website using Selenium and BeautifulSoup, also performed Data Analysis on scrapped data.
beautifulsoup data datacleaning datagathering eda exploratory-data-analysis python selenium webscraping
Last synced: 24 Feb 2026
https://github.com/robertoostenveld/dcn.dsc_62002071_01_114_v1
Simon task M/EEG data [Data set].
Last synced: 23 Jan 2026
https://github.com/athari22/analyzing-the-yelp-dataset
SQL for Data Science
analytics data data-science data-structures er sql
Last synced: 27 Jan 2026
https://github.com/gabrieldim/complete-analysis-covid-19
Analysis of the Covid 19.
analysis covid-19 covid19 data data-science science virus
Last synced: 23 Jan 2026
https://github.com/shubhamsoni98/prediction-with-binomial-logistic-regression
To predict client subscription to term deposits and optimize marketing strategies by identifying potential subscribers.
binomial data data-science eda machine-learning matplotlib pipeline python scikit-learn seaborn sklearn sql visualization
Last synced: 06 Feb 2026
https://github.com/andrewl/danelaw
Geopackage containing the boundary of the Danelaw
data geospatial medieval viking
Last synced: 23 Jan 2026
https://github.com/j-sephb-lt-n/data-warehouse-and-etl-best-practice
A catalogue of best practices for managing data
data data-cleaning data-engineering data-validation data-warehouse etl
Last synced: 23 Jan 2026
https://github.com/mattjesc/ddo-semiconductor
Data-Driven Optimization of Semiconductor Processes and Forecasting
ai artificial-intelligence data data-science data-visualization deep-learning keras machine-learning manufacturing ml prophet python pytorch semiconductor semiconductor-manufacturing semiconductors tensorflow
Last synced: 23 Feb 2026
https://github.com/sumitkundu102022/air-quality-report
Air Quality Report using PowerBI
data data-analysis data-visualization powerbi
Last synced: 23 Jan 2026
https://github.com/remcostoeten/github-and-vercel-api-showcase-dashboard
Showcase results of possible fetched data from the Github and Vercel API built in all vanilla js.
api-rest da data express-js github-api nodejs vercel-api
Last synced: 07 Mar 2026
https://github.com/dhanish03/reliance-sales-report-dashboard
This project, Reliance Sales Report Dashboard, showcases a dynamic and interactive Power BI dashboard designed to analyze sales performance. The dashboard provides key insights into various aspects of sales data, including product-wise performance, region-based revenue, and profitability trends.
data datavisualization-project powerbi visualization
Last synced: 23 Jan 2026
https://github.com/byndyusoft/byndyusoft.data.relational
Relational abstractions for Byndyusoft.Data.Relational.
byndyusoft data dataaccess db relational-databases
Last synced: 25 Oct 2025
https://github.com/johndelatto/automate-your-job-search-ai-applies-to-1000-positions
Automate Your Job Search: AI Applies to 1000 Positions Overnight & Get 100+ Interviews! In today’s fast-paced and highly competitive job market, finding and securing your dream job can be both time-consuming and exhausting.
ai data non-profit open-ai open-source
Last synced: 28 Jan 2026
https://github.com/fatihemres/pinch
File reader app with SwiftUI. Using data and models.
Last synced: 17 May 2026
https://github.com/alsult/alsult
Aliia Sultanova Portfolio
data datascience programming python
Last synced: 23 Jan 2026
https://github.com/mfurmanczyk/wh-sales
E-commerce analytics data warehouse ETL made with Apache Spark.
airflow data data-engineering data-warehouse kotlin python spark
Last synced: 24 Jan 2026
https://github.com/sahraiidle/email-spam-detector
Email/SMS spam detector with a Flask UI/API, tuned ML models (TF‑IDF + SVM/LogReg/NB), and a ready-to-run web form plus JSON endpoint for predictions.
data machine-learning numpy pandas python randomforest scikit-learn spam-classifier spam-detection svm
Last synced: 24 Jan 2026
https://github.com/woctezuma/hidden-gems-data
Data available to compute regional rankings of hidden gems.
data hidden-gems steam steam-reviews
Last synced: 06 Feb 2026
https://github.com/semcod/code2llm
Python Code Flow Analysis Tool - Static analysis for control flow graphs (CFG), data flow graphs (DFG), and call graph extraction
ast cfg code code2data code2logic code2process data dfg diagram flow graphs llm
Last synced: 01 Jun 2026
https://github.com/maxisoft/yahoo-finance-data-downloader
Automate downloading historical and recent stock data from Yahoo Finance.
data stock-market yahoo-finance
Last synced: 29 Jan 2026
https://github.com/spatialcurrent/go-pipe
go-pipe is a simple library for piping objects from iterators to writers.
big-data bigdata concurrency data
Last synced: 29 Jan 2026
https://github.com/aimin-nur/data-analyst-model-predictive
Sebuah Project data analyst yang bertujuan untuk mengindentifikasi karakteristik customer untuk menerima penawaran campaign marketing.
analyst data mechine-learning visualization
Last synced: 29 Jan 2026
https://github.com/machinecyc/lotteryinsight
Use crawler to collect Taiwan Lotto data, and save data into local MySQL server.
crawler data docker lottery mysql-database python3 taiwan
Last synced: 09 May 2026
https://github.com/khushi-sabarad/data_analysis
linkedin learning capstone project
data data-engineering matplotlib pandas python
Last synced: 10 May 2026
https://github.com/aneeshmurali-n/nlp-emotion-classification-in-text
Develop machine learning models to classify emotions in text samples.
bag-of-words data emotion-classification feature-extraction machine-learning naive-bayes natural-language-processing nlp nltk preprocessing python scikit-learn svm text-classification tf-idf tokenizer vectorizer
Last synced: 10 May 2026
https://github.com/notthestallion/pca__3d-and-from-scratch__principal-component-analysis
In this project, I will be implementing Principal Component Analysis (PCA) from scratch on an ecological footprint consummation database for countries and a three-dimensional scale using a movie database. The goal of this project is to gain a deeper understanding of PCA and to demonstrate its capabilities in exploring complex datasets.
data data-science database pca pca-analysis principal-component-analysis principal-component-analysis-pca principle-component-analysis
Last synced: 10 May 2026
https://github.com/miniql/miniql-json
A MiniQL query resolver that loads data from JSON files.
data json query query-language
Last synced: 11 May 2026
https://github.com/amethyst-php/tax
amethyst amethyst-package api data laravel tax
Last synced: 11 May 2026
https://github.com/quarkgluant/intro_ml_udemy
cours Udemy d'Introduction au Machine Learning
anaconda3 data data-preprocessing data-regression machine-learning python-3 udemy-machine-learning
Last synced: 12 May 2026
https://github.com/gregorybchris/pca
PCA assignment for Park Tudor
analysis component data display embedding pca principal projection teach
Last synced: 13 May 2026
https://github.com/dev88jerry/cs304
Bishop's University - CS304 Data Structures
bishops bu data data-structures python structure university
Last synced: 11 Jun 2026
https://github.com/poojaharihar03/wellness-cities-case-study
A case study for dats analysis of city health centers
Last synced: 11 Jun 2026
https://github.com/prakhargpt/sql-data-warehouse-project
Building Data Warehouse project using SQL Server, including ETL processes, data modelling and analytics.
analytics data data-analysis data-cleaning data-engineering data-engineering-pipeline data-lakehouse data-science data-warehouse etl etl-job etl-pipeline medallion-architecture sql sql-server
Last synced: 12 Jun 2026
https://github.com/iannil/one-data-studio
one-data-studio integrates a data governance and development platform, a cloud-native MLOps platform, and a large model application development platform. It connects the entire value chain from raw data governance to model training and deployment, and further to the construction of generative AI applications.
Last synced: 12 Jun 2026
https://github.com/shashwat9kumar/trends_in_a_country_on_twitter
Finding trending topics in each country on twitter and visualizing them in a WordCloud
data data-visualization trends tweepy twitter-api wordcloud
Last synced: 13 Jun 2026
https://github.com/neuro-mechatronics-interfaces/ros2_data_agent
Code for a multipurpose file explorer specializing in reading ROS2 topic data from '.bag' or '.db3' files
Last synced: 13 Jun 2026
https://github.com/word2vect/beijing-pm2.5-data-process
Beijing PM2.5 Data Process for Python Programming 2024 Fall Data Visualization Lab 2
Last synced: 15 Jun 2026
https://github.com/spiraldb/spiraldb-nemo-curator
SpiralDB connectors for NVIDIA NeMo Curator
computer-vision data data-curation data-prep data-preparation data-processing data-quality datacuration datarecipes deduplication fast-data-processing multimodal multimodal-ai nvidia-nemo physical-ai python spiral vortex
Last synced: 15 Jun 2026
https://github.com/arch-fan/pokedata
Pokemon Data in CSV format for whatever you need!
Last synced: 17 Jun 2026
https://github.com/ayushman0511/data-analytics-project1
This repository contains a collection of SQL scripts demonstrating various analytical techniques, such as changes over time, cumulative, performance, data segmentation, part-to-whole analysis.
analytics busine data data-anal data-enginee data-sci data-scien database datascien query reporting sql sql-query sql-server window-func
Last synced: 17 Jun 2026
https://github.com/ibttf/bayborhood
Interactive map to find the ideal neighborhood in San Francisco based on data.
data data-analysis data-visualization gis mapbox react
Last synced: 18 Jun 2026
https://github.com/mbagalman/lattice-doe
Python code to create experimental designs optimized to meet statistical power targets
abtesting data datascience designofexperiments experimentaldesign statistics
Last synced: 19 Jun 2026
https://github.com/svetlanam/kbl-to-csv-s3
Keboola extractor, that converts excel to CSV based on input mapping criteria and upload to S3 bucket
data data-cleaning data-transformation etl keboola s3-bucket
Last synced: 20 Jun 2026
https://github.com/svetlanam/etl-transformation
ETL data cleaning and transformation for specific use case in own Keboola project
cleaning data etl keboola python rest-api transformation
Last synced: 20 Jun 2026
https://github.com/petzi53/repairdata
Open Repair Alliance Datasets 2021
data open-data open-datasets r repair repair-cafe repairs
Last synced: 22 Jun 2026
https://github.com/hlan22/2025-03-18-data-validation
(no longer useful) DSCI 310 Lecture about Data validation and code testing! Made in tandem with:
Last synced: 23 Jun 2026
https://github.com/dineshdhamodharan24/data-analysis
probability Analysis to customers and bascis analysis
analysis data powerbi probability python visualization
Last synced: 23 Jun 2026
https://github.com/charlenry/python_data_science
Mes notebooks de travaux pratiques sur Python pour la Data Science
analysis data dataframe jupyter kaggle matplotlib notebook numpy pandas pyplot python science seaborn visualisation
Last synced: 25 Jun 2026
https://github.com/stefen-taime/mako-main
Declarative real-time data pipelines Framework. YAML in, events out.
data datapipeline declarative-config declarative-pipeline declarative-programming declarative-workflows framework open-source
Last synced: 26 Jun 2026
https://github.com/weecology/updating-data
Hugo website for instructions on how to make a regularly updating data pipeline
continuous-analysis continuous-integration data gh-actions living-data netlify travis-ci
Last synced: 17 Feb 2026
https://github.com/zulfachafidz/green_horizon_forecasting_peak_organic_avocado_sales_with_the_prophet_algorithm
The Green Horizon Project leverages the Prophet algorithm to predict peak sales of organic avocados, supporting the campaign "APEAM GO ORGANIC." Using Python and Looker Studio, this analysis aims to provide deep insight into sales trends and potential, forming the basis of smarter marketing strategies.
algorithm algorithms analytics data data-analysis data-engineering data-mining data-science data-visualization forecasting machine-learning machine-learning-algorithms prophet-model python python-script
Last synced: 17 May 2026
https://github.com/yourdataarchitect/french-realestate-data-pipeline
This repository contains a fully automated data pipeline built with Apache Airflow to extract, clean, analyze, and report real estate listings from Seloger. It pushes data to MongoDB, Elasticsearch, and Google Sheets, with real-time Slack alerts for monitoring.
airlfow data datanalysis datapipeline market-intelligence real-estate
Last synced: 31 Dec 2025
https://github.com/coderooz/hr-dashboard
The goal of this project is to create a power bi dashboard to showcase the attrition data within the company.
Last synced: 07 Jan 2026
https://github.com/a-poor/taro
A package for repeatable rectangular data transformations in Python.
data data-science data-transformation pipeline pypi-package python
Last synced: 13 Oct 2025
https://github.com/alexis-gss/games-data
Games Data is a library of informations about all games, realised under NuxtJs
css3 data games nuxtjs tailwindcss typescript vuejs
Last synced: 13 Mar 2025
https://github.com/peternaydenov/data-pool
Data layer for node apps and single page applications
Last synced: 29 Apr 2025
https://github.com/pyrustic/jayson
Intuitive interaction with JSON files [DEPRECATED, check the project Shared]
Last synced: 17 May 2026
https://github.com/skygenesisenterprise/api-service
The Official Sky Genesis Enterprise API Service Ecosystem
api-service client cryptography data dns docker javascript nextjs service stalwart typescript websocket
Last synced: 31 Dec 2025
https://github.com/fliplet/fliplet-widget-data-source-query
Data Source Query Provider
Last synced: 11 Apr 2025
https://github.com/antononcube/raku-data-typesystem
Data type system for different data structures.
data data-structures rakulang type-system
Last synced: 09 Jul 2025
https://github.com/encoreshao/data-science
Data analyze examples, using Jupyter notebook and Python!!!
data dataanalysis encore jupyter-notebook
Last synced: 29 Mar 2025
https://github.com/mightymetrika/mmirestriktor
Informative Hypothesis Testing Web Applications
data hypothesis infomative power r restriktor statistics testing
Last synced: 17 Mar 2025
https://github.com/pulgamecanica/d3examples
https://www.oreilly.com/library/view/d3-for-the/9781492046783/
d3 d3-visualization d3js d3v4 data javascript
Last synced: 19 May 2026
https://github.com/kameronbrooks/datalys2-reporting
Datalys2 Reports allows you to create rich, interactive reports by simply defining a JSON configuration embedded in your HTML. It handles the layout, data visualization, and interactivity, so you don't need to write custom React code for every report.
data data-visualization html react
Last synced: 08 Apr 2026
https://github.com/mumtaz4118/employee-satisfaction-and-attrition
Analysis of attrition based on environmental satisfaction from a Kaggle dataset.
data data-analysis data-science data-visualization ipynb jupyter-notebook machine-learning python statistical-analysis statistical-models
Last synced: 19 May 2026
https://github.com/alexdonh/adonis-cache
Another cache provider for AdonisJs. Supports Object, File, Db and Redis cache. With cache dependencies!
adonis-framework adonisjs cache data dependency redis storing
Last synced: 15 May 2026
https://github.com/stdlib-js/array-base-banded-filled2d-by
Create a filled two-dimensional banded nested array according to a provided callback function.
alloc allocate array callback data fill filled foreach generic javascript map matrix multidimensional node node-js nodejs stdlib strided structure types
Last synced: 19 May 2026
https://github.com/samharrison7/datamapper
Making mapping between datasets as simple as possible.
data data-mapper data-mapping data-science data-structures
Last synced: 17 Mar 2025