data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum
- Last updated: 2026-07-03 00:07:49 UTC
https://github.com/rafaelfloressouza/Covid-19-Dashboard
Python web application to display COVID19 data from the world using Plotly and Dash
bootstrap covid-19 css data datavisualization plotly-dash python3
Last synced: 10 Mar 2025
https://github.com/mftnakrsu/crm-rfm-analysis
CRM-RFM-Analysis
ai crm data data-analysis data-science deep-learning machine-learning python rfm rfm-analysis
Last synced: 16 Mar 2025
https://github.com/danieljdufour/rle-serializers
Serialize and Deserialize Run Length Encoding
cloud-optimized compression csv data deserializer run-length run-length-decoding run-length-encoding serializer
Last synced: 24 Sep 2025
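For readers unfamiliar with run-length encoding, here is a minimal Python sketch of the general technique. It is not the API of the repository above; the function names `rle_encode` and `rle_decode` and the (count, value) pair layout are assumptions made for illustration.

```python
from itertools import groupby


def rle_encode(values):
    """Collapse each run of consecutive equal values into a (count, value) pair."""
    return [(len(list(group)), value) for value, group in groupby(values)]


def rle_decode(pairs):
    """Expand (count, value) pairs back into the original sequence."""
    out = []
    for count, value in pairs:
        out.extend([value] * count)
    return out


encoded = rle_encode([5, 5, 5, 0, 0, 7])
print(encoded)              # [(3, 5), (2, 0), (1, 7)]
print(rle_decode(encoded))  # [5, 5, 5, 0, 0, 7]
```

The encoding only saves space when the input contains long runs; data with few repeats can grow when encoded this way.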
https://github.com/elhariri78/case-study-a-better-smoker-detector
Case Study: A Better Smoker Detector
data dataframe evaluation kaggle matplotlib-pyplot numpy pandas pandas-dataframe pandas-python python3 seaborn sklearn
Last synced: 07 Apr 2026
https://github.com/bijx/firestore-data-fetcher
A simple Python script to fetch documents from a Firebase Firestore collection and save them to a local `.json` file.
automation data database downloader exporter fetcher firebase firestore open-source script
Last synced: 12 Apr 2026
https://github.com/cqllum/schema2dwh
⚡ Automatically produce a data model on your database using its information schema using GenAI.
ai data data-structures dataengineering datawarehousing dwh gemini gemini-api genai reporting reporting-tool schema-design
Last synced: 13 Mar 2025
https://github.com/nik-kusanagi/bash.sh-treinamento
A more organized version (more or less)
data database debian gnome gnome-extension gnu gnu-linux linux shell shell-script
Last synced: 05 May 2026
https://github.com/shivam1808/data-cleaning-project
We take raw housing data and transform it in SQL Server to make it more usable for analysis.
analysis data datacleaning sql sqlserver
Last synced: 29 May 2026
https://github.com/lmuffato/project-job-insights-trybe
Job Insights project - Trybe graded project for Block 32: Introduction to Python
data data-science data-transformation filter python
Last synced: 12 Jun 2025
https://github.com/azrunguraya/kabyle-corpus-dataset
In the world of Natural Language Processing, access to diverse, well-annotated datasets is essential for developing high-performing models. This project aims to fill that specific gap for Taqbaylit, a Berber language spoken mainly in Kabylia.
ber berber berber-dataset corpus data dataset ia kabyle kabyle-art kb machine-learning nlp nlp-machine-learning python taqbaylit text words
Last synced: 31 Jul 2025
https://github.com/themost-framework/jspa
JavaScript Persistent API
api data database-schema jspa object-relational-mapping orm orm-framework
Last synced: 31 Aug 2025
https://github.com/flowsynx/plugin-postgresql
FlowSynx plugin that interfaces with PostgreSQL for CRUD operations. Supports JSONB, full-text search, and advanced query features.
data database flowsynx postgresql postgresql-database sql
Last synced: 09 May 2026
https://github.com/derrickbaruga7/python-data-analysis
This project analyzes ORU’s off-season sewer usage using Python, with `pandas` for data handling, histograms and line plots for exploration, and a `scipy`-based model for prediction. Pearson’s correlation and visualizations help reveal key trends and relationships.
analytics data data-science visualization
Last synced: 31 Jul 2025
https://github.com/gappeah/london-housing-price-dashboard
This Excel-based Housing Visual Dashboard provides a comprehensive view of average house prices across various boroughs in London from 1996 to 2013. The dashboard is designed to offer insights into housing market trends and price variations across different areas of London over time.
data data-analysis data-visualization excel visual
Last synced: 31 Jul 2025
https://github.com/visenger/prada
Profiling Datasets
cleaning data dataset profiling
Last synced: 24 Aug 2025
https://github.com/mouneshgouda/learn_dsa
This repository explores fundamental data structures and their implementations. Learn how to organize and manipulate data efficiently for various programming tasks. (Feel free to add your specific focus areas here, e.g., algorithms, interview prep)
c data queue sorting-algorithms stack structured-data
Last synced: 30 Jul 2025
https://github.com/vvipjain/bike-sales-dashboard
Bike Sales Dashboard
dashboards data data-analysis data-cleaning data-normalisation data-visualization excel pivot-chart pivot-tables
Last synced: 04 Feb 2026
https://github.com/bolajiolayinka/graph-api-automation
An End to End Automation from Facebook Business to Data Visualization of Campaigns
Last synced: 07 May 2025
https://github.com/tether/tether-schema
Custom protocol buffer schema for data validation
data protocol schema validation
Last synced: 09 Apr 2025
https://github.com/whitehathackerpr/data-visualization-tool
This is a Python-based web application that allows users to upload datasets, analyze data, and create visualizations interactively. The tool is designed for ease of use and provides a simple interface to perform basic data analysis and generate visualizations
data data-analysis data-visualization python python3
Last synced: 05 Sep 2025
https://github.com/cainmi/data-page-project
A repository to pull code and files from; may be used to store page data links, code, etc. Mainly used for Python for now.
data html javascript python schema
Last synced: 21 Oct 2025
https://github.com/stdlib-js/ndarray-base-to-reversed
Return a new ndarray where the order of elements of an input ndarray is reversed along each dimension.
base data flip javascript matrix ndarray node node-js nodejs reverse slice stdlib structure to-reversed types vector view
Last synced: 12 Apr 2026
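To illustrate what "reversed along each dimension" means, here is a small Python sketch that uses nested lists in place of an ndarray. It is not the stdlib-js API: the function name `reversed_all_dims` is hypothetical, and this version builds a copy.

```python
def reversed_all_dims(x):
    """Reverse a nested list along every dimension, returning a new structure."""
    if isinstance(x, list):
        # Reverse this level, then recurse so inner dimensions are reversed too.
        return [reversed_all_dims(item) for item in reversed(x)]
    return x  # scalar leaf: nothing to reverse


matrix = [[1, 2, 3],
          [4, 5, 6]]
print(reversed_all_dims(matrix))  # [[6, 5, 4], [3, 2, 1]]
```

For a two-dimensional input this is equivalent to flipping both the row order and the column order.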
https://github.com/stdlib-js/array-float32
Float32Array.
array data float float32 float32array ieee754 javascript node node-js nodejs single single-precision stdlib structure typed typed-array types
Last synced: 14 Jan 2026
https://github.com/asuozzo/medicare-data-analysis
An analysis of Medicare Part D data in Vermont
Last synced: 04 May 2026
https://github.com/bukalapak/bukadata
Data supplier plugin for populating design with real data.
data plugin sketch sketch-plugin
Last synced: 05 Jul 2025
https://github.com/jigyasag18/gold-price-prediction-project-using-machine-learning
This repository contains a machine learning project focused on predicting gold prices (GLD) using historical stock market data, including indicators such as SPX, USO, SLV, and EUR/USD. The project implements a Random Forest Regressor for accurate price forecasting, complete with data visualization, correlation analysis, and model evaluation metrics
data dataset jupyter-notebook jupyter-notebooks machine-learning machinelearing machinelearningalgorithms machinelearningmodel machinelearningprojects matplotlib mlproject numpy pandas randomforestregressor seaborn
Last synced: 23 Jul 2025
https://github.com/joeyism/py-cifar10
This library was created to make the CIFAR 10 data easy to use. It is a wrapper around the instructions given on the CIFAR 10 site.
cifar cifar-10 cifar10 data machine-learning machinelearning
Last synced: 30 Jul 2025
https://github.com/danreynolds/data_batcher
Data batcher batches and de-dupes data fetched in the same task of the event loop.
batching data flutter hacktoberfest
Last synced: 19 May 2026
https://github.com/olamide100/capstone-project-llm-zoomcamp
Comparative Guide Assistant
argocd data dataengineering docker grafana kubernetes llm-agent mlops-workflow rag strreamlit
Last synced: 14 Feb 2026
https://github.com/cintia0528/data_cleaning_and_analytics-python
Evaluate if aggressive discounting benefits Eniac long-term, considering differing views on customer acquisition and brand positioning. Focus on data cleaning for informed decision-making.
colab-notebook data data-analysis datacleaning dataquality jupyter-notebook matplotlib pandas python seaborn
Last synced: 08 Jan 2026
https://github.com/spiceai/datasets
Spice AI curated dataset definitions for Spice.ai
ai bitcoin blockchain data ethereum polygon
Last synced: 20 Apr 2026
https://github.com/wioniqle-q/tower-modelling
Data science
data data-science ndarray-odeint ndjson science
Last synced: 16 Mar 2025
https://github.com/izaaccoding36/dados-dinamicos
This repository presents a website built with an API for creating charts that report on social media use on a global scale.
api data redes-sociais social-media website
Last synced: 26 Mar 2025
https://github.com/basemax/buskool.com-data
This repository contains the collected product data from the Buskool website (باسکول). The data is stored in 20k+ JSON files, each containing detailed information about products available on the website.
buskool buskoolcom data farsi information ir iran json persian
Last synced: 03 Apr 2025
https://github.com/stdlib-js/array-base-fancy-slice-assign
Assign element values from a broadcasted input array to corresponding elements in an output array.
array assign assignment copy data fancy generic javascript node node-js nodejs shallow slice stdlib structure subseq subsequence types
Last synced: 06 Oct 2025
https://github.com/joocer/data_expectations
Are your data meeting your expectations?
data data-engineering data-quality data-science data-unit-tests observability pipelines quality validation
Last synced: 07 Oct 2025
https://github.com/ahmad-ali-rafique/comment-generation-tool
This repository hosts a Jupyter Notebook-based Comment Generation Tool exploring advanced NLP techniques for automated, contextually relevant comment generation from input data. Ideal for developers and researchers in NLP and automated text generation.
ai aitools artificial-intelligence content-based-recommendation data datascience jupyter-notebook machine-learning
Last synced: 07 Oct 2025
https://github.com/ryanjoy0000/yt-notifier
Youtube Notifier (Telegram Bot) - A real time data processing pipeline
data go kafka-streams real-time telegram-api youtube-api
Last synced: 14 Jan 2026
https://github.com/scienxlab/datasets
Some small datasets for demos, courses, testing, etc.
data open-data sample-data teaching-resources
Last synced: 09 Oct 2025
https://github.com/Lemniscate-world/StratAI
This project analyzes financial assets using a Hidden Markov Model (HMM) to identify different market regimes and patterns. The analysis includes calculating daily returns, rolling volatility, and volume changes, and visualizing the hidden states identified by the HMM.
ai assets data data-science data-visualization finance financial-analysis fintech hmm-model hmmlearn machine-learning trading
Last synced: 13 Oct 2025
https://github.com/lakecountryhuntclub/dnr-map-data-model
Data Model for the 2023 DNR Pheasant Stocking Property Data
data data-model documentation excel gis hunting mapping powerquery vba
Last synced: 29 Jul 2025
https://github.com/vvipjain/hockey-tournament-analysis
Hockey Tournament Analysis
beautifulsoup data data-analysis data-visualization databases pandas pandas-dataframe powerbi python python-library python-script requests-library-python sql sql-server sqlalchemy
Last synced: 27 Jan 2026
https://github.com/open-i18n/data-iso-15924
Git mirror for ISO 15924, Codes for the representation of names of scripts data
data iso iso-15924 iso15924 open-i18n scripts unicode unicode-data writing-systems
Last synced: 14 Mar 2026
https://github.com/akv3sic/cryptocurrency-charts
Cryptocurrency API data visualizations 📈 with Matplotlib.
cryptocurrency data data-visualization matplotlib python
Last synced: 16 Oct 2025
https://github.com/bishtrishu/pizza_sales_data_analysis_sql
This project is a comprehensive data analysis of pizza sales, aimed at uncovering key insights and trends to inform business decisions. Using a combination of SQL, Python, and data visualization tools, the project analyzes sales data to understand customer preferences, peak sales periods, and the most popular pizza types.
cloud data data-analysis data-science data-visualization dataanalytics database mysql oracle-database
Last synced: 14 Apr 2026
https://github.com/potreic/etl-fashion-trend-analysis
✨ Automate fashion trend analysis with Apache Airflow! Extract data from X & Pinterest, transform into insights, and load into PostgreSQL. Predict seasonal styles & visualize trends. 💃📊
airflow airflow-dags data data-engineering etl etl-automation etl-pipeline fashion-trends
Last synced: 27 Jan 2026
https://github.com/data-forge-notebook/javascript-cheat-sheet
Cheat sheet that accompanies my book Data Wrangling with JavaScript
cheatsheet data data-wrangling javascript nodejs
Last synced: 15 Apr 2026
https://github.com/florianwendelborn/metatypes
Monorepo of TypeScript Metadata Definitions (e.g. HTTP Status Codes)
code-generation data datastructures enum http-status-codes jsdoc lerna metadata typescript
Last synced: 27 Jan 2026
https://github.com/divithraju/divith-aju-hadoop-pyspark-pipeline
This project demonstrates the creation of a scalable data processing pipeline for handling and analyzing log data from a hypothetical e-commerce platform. Leveraging Hadoop and PySpark, the pipeline is designed to process large volumes of log files, providing meaningful insights into user behavior, system performance, and sales metrics.
apache-hadoop-framework apache-spark bigdata client data database dataengineering dataingestionframework datapreprocessing documentation ecommerce-platform hdfs pipeline project project-repository pyspark python3 software-engineering
Last synced: 27 Jan 2026
https://github.com/jaldekoa/fiscaldataapi
A Python wrapper to easily retrieve data from the Fiscal Data (US Treasury) official API in pandas format.
api api-wrapper banking data finance pandas python united-states
Last synced: 27 Jan 2026
https://github.com/eshaagarwa/hr-analytics-project
Explore our HR Analytics Dashboard, a Power BI project designed for HR managers and leaders. It analyzes essential KPIs such as Employee Count, Attrition Rate, and Job Satisfaction across various demographics.
dashboard data data-visualization dataanylasis ms-excel ms-excel-data-analytics powerbi statistics
Last synced: 23 Jan 2026
https://github.com/garcane/Income-Prediction-ML
This is a machine learning project aimed at predicting whether an individual's annual income exceeds $50,000 based on their demographic and personal information.
data data-science machine-learning ml numpy pandas python random-forest scikit-learn
Last synced: 24 Oct 2025
https://github.com/ayushverma135/sas-health-metrics-analysis-bmi-categorization-and-gender-insights
Using SAS, this project processes Excel data on individual statistics and health metrics. It calculates BMI, categorizes health status, and visualizes distributions through pie charts.
analytics data excel sas sasprogramming statistical-analysis
Last synced: 24 Feb 2026
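The BMI calculation and categorization step described above can be sketched in Python as follows. The project itself uses SAS, so this is not its code; the function names are hypothetical, and the category cut-offs (18.5, 25, 30) are the commonly cited WHO adult thresholds, assumed here because the description does not state which ones the project applies.

```python
def bmi(weight_kg, height_m):
    """Body mass index: weight in kilograms divided by height in metres squared."""
    return weight_kg / (height_m ** 2)


def bmi_category(value):
    """Map a BMI value to a category (assumed WHO adult cut-offs)."""
    if value < 18.5:
        return "underweight"
    if value < 25:
        return "normal"
    if value < 30:
        return "overweight"
    return "obese"


value = bmi(70, 1.75)
print(round(value, 1), bmi_category(value))  # 22.9 normal
```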
https://github.com/2kabhishek/pyramen
Data Analysis for Ramen 🍜💹
csv data data-analysis fun python report
Last synced: 26 Oct 2025
https://github.com/maccccd/wsoa3029a_2444372
This website serves as an extension of my portfolio work. It focuses specifically on showcasing my understanding of D3.js, a JavaScript library used to create interactive data visualizations. The visualizations here provide insights into two types of cybersecurity attacks: phishing and ransomware.
d3js data hacking visualization
Last synced: 24 Jan 2026
https://github.com/stdlib-js/ndarray-base-output-policy-str2enum
Return the enumeration constant associated with an output ndarray data type policy string.
array data dtype dtypes enum javascript multidimensional ndarray node node-js nodejs policy stdlib types util utilities utility utils
Last synced: 15 Apr 2026
https://github.com/pablolec/sb_querydsl_criteria_builder
Complex and dynamic frontend-to-backend queries using querydsl
api data design dynamic-queries hibernate java jpa json query query-builder querydsl querydsl-generator rest-api rsql spring spring-boot sql vue web
Last synced: 07 Feb 2026
https://github.com/tee8z/noaa-oracle
NOAA data oracle that is queryable from the browser and can attest to events for a Bitcoin DLC in dlctix style
data duckdb-wasm noaa-weather parquet-files sql weather
Last synced: 17 Feb 2026
https://github.com/elissorokin/data-analyst-portfolio-rus
This is a repository where I demonstrate my skills, share projects, and track my progress in data analysis and Data Science.
ab-testing data data-analysis datalense matplotlib numpy pandas plotly portfolio postgresql python scipy seaborn sql statistical-analysis
Last synced: 25 Feb 2026
https://github.com/aniketkkajania/wassupanalyzer
WhatsAnalyzer is a powerful statistical analysis tool designed for analyzing WhatsApp chats. With the ability to process chat files exported from WhatsApp, this tool provides valuable insights by generating various plots and statistics.
data data-science datavisualization streamlit streamlit-webapp webapp whatsapp whatsapp-chat
Last synced: 25 Feb 2026
https://github.com/cworld1/novel-data
The data repository of novel analysis
Last synced: 01 Feb 2026
https://github.com/garcane/cookie-company-visual-dashboard
This Excel-based interactive dashboard provides a comprehensive overview of the Cookie Company's sales performance and key metrics.
dashboard data data-visualization excel microsoft-excel
Last synced: 09 Feb 2026
https://github.com/shuklayash02/excel_complete_vrindastore_dataanalysis
Complete analysis: data cleaning, processing, and data analysis with an interactive dashboard
analysis data data-visualization datacleaning excel excel-vba
Last synced: 19 Mar 2026
https://github.com/sakshisrivastava-2601/credit-card-fraud-detection
Credit Card Fraud Detection Project Using Machine Learning. This project focuses on leveraging advanced Machine learning techniques to identify fraudulent transactions with high accuracy.
advanced-machine data machine-learning numpy project-repository python pytorch random-forest
Last synced: 16 Apr 2026
https://github.com/obsidianplusplus/5e_play_cs-go
Python tool to analyze your 5EPlay CS:GO match data. Fetches, analyzes, filters, and exports.
5eplay analysis api automation csgo data esports excel json match pandas performance player python reporting scraping stats team
Last synced: 13 Feb 2026
https://github.com/garcane/beverage-sales-analytics
This project provides an in-depth analysis of beverage sales and delivery across different states using Power BI.
data data-visualization powerbi powerbi-report powerbi-visuals
Last synced: 19 Mar 2026
https://github.com/saisriramkamineni/e-commerce-sales-analysis-excel-
Conducted an in-depth sales analysis for an e-commerce platform, leveraging Excel for data preprocessing and Power BI for visualization. Identified key sales trends, customer purchasing behavior, and revenue growth patterns to optimize business performance.
analysis analytics data excel sales
Last synced: 14 Feb 2026
https://github.com/stdlib-js/array-base-assert-is-complex-floating-point-data-type
Test if an input value is a supported array complex-valued floating-point data type.
array assert base check data dtype is javascript node node-js nodejs stdlib test types util utilities utility utils valid validate
Last synced: 14 Feb 2026
https://github.com/mickfrog/uace-analysis
UACE ANALYSIS FOR 2011 - 2015
data data-science data-visualization folium-maps geocoder jupyter-notebook pandas python3
Last synced: 14 Feb 2026
https://github.com/discindo/natochak
Analysis of bicycle accidents in Macedonia using Rmarkdown and ggplot2
Last synced: 19 Feb 2026
https://github.com/mvicens/sporscor
TypeScript API to manage sport data getting scoreboards and statistics
api-client data score scoreboards sport statistics typescript
Last synced: 16 Feb 2026
https://github.com/linx-software/file-import-to-rest-api
Import a CSV file and make the data available via a REST API.
Last synced: 19 Mar 2026
https://github.com/stdlib-js/array-base-none-by-right
Test whether all elements in an array fail a test implemented by a predicate function, iterating from right to left.
all array data every generic javascript node node-js nodejs none predicate stdlib structure test types validate
Last synced: 01 Mar 2026
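The behaviour described above, checking from right to left that no element passes a predicate, can be sketched in Python. This is an illustration of the idea, not the stdlib-js implementation; the name `none_by_right` is borrowed from the package title and the signature is an assumption.

```python
def none_by_right(items, predicate):
    """Return True if every element fails `predicate`, scanning from the last index down."""
    for i in range(len(items) - 1, -1, -1):
        if predicate(items[i]):
            return False  # first passing element found from the right: stop early
    return True


def is_negative(v):
    return v < 0


print(none_by_right([1, 2, 3], is_negative))   # True
print(none_by_right([1, -2, 3], is_negative))  # False
```

The result is the same as a left-to-right scan; the direction only affects which elements are visited before the early exit, which matters when the predicate is expensive or has side effects.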
https://github.com/theonlybeardedbeast/exercise-data
Datasets for workout exercises
data dataset fitness health healthcare
Last synced: 20 Mar 2026
https://github.com/efler/microservice-data-bus
Data bus based on Apache Kafka and consisting of separate components [copied from own private repos]
data data-bus deduplication enrichment filtering kafka microservice mongodb postgresql redis
Last synced: 16 Apr 2026
https://github.com/docusign/extension-app-data-io-reference-implementation
Extension App for Data IO Reference Implementation for the Docusign IAM Platform
Last synced: 02 Mar 2026
https://github.com/metapsy-project/data-depression-inpatients
Database of depression psychotherapy trials in inpatient settings
Last synced: 27 Mar 2026
https://github.com/meineglock20/listtotabledisplay
The List to Table Formatter for .NET is a versatile library designed to convert lists of objects into well-formatted table displays. Ideal for web applications and console applications, including log files and Word documents.
asp-net asp-net-core console csharp data display dotnet formatter html list logging netstandard20 object-list presentation razor-pages table table-formatter text-table text-to-table utility
Last synced: 04 Mar 2026
https://github.com/chompfoods/stub-go-server
Go server stub for the Chomp Food & Recipe Database API. Use our API to get high-quality data on recipes and 875,000+ branded/grocery foods plus raw ingredients.
api branded chomp data database food go-server go-swagger grocery ingredients nutrition raw recipe-api recipes
Last synced: 17 Apr 2026
https://github.com/palewire/nyc-hpd-bronx-lead-paint-violations
Download and process housing code lead paint violations in the Bronx from NYC Open Data
bronx data data-journalism news nyc python
Last synced: 02 Apr 2026
https://github.com/sadmanca/uoft-pey-coop-job-postings
Code for parsing approximately 1.8k HTML pages of UofT PEY co-op job postings (from September 2023 to May 2024) to a single sqlite3 database file.
co-op data html python singlefile sqlite sqlite3 uoft uoft-pey
Last synced: 17 Apr 2026
https://github.com/timmymatten/spikeball-stat-tracker
Spikeball stat tracking web app built with Streamlit and Python, designed to easily log and analyze player performance over multiple games.
data data-analysis data-visualization dataset matplotlib-pyplot multipage python spikeball statistics streamlit
Last synced: 18 Apr 2026
https://github.com/aiwithqasim/recommendationengines
Recommendation Engines with IBM, a project from the Data Scientist Nanodegree on Udacity. For this project I will analyze the interactions that users have with articles on the IBM Watson Studio platform and recommend new articles I think they will like.
data data-manging data-science ibm ipython-notebook normalization python3
Last synced: 18 Apr 2026
https://github.com/csheldonhess/reporting-on-congress
What has Congress passed and not passed, lately?
civic-data congress data government government-data propublica propublica-congress-api
Last synced: 20 Apr 2026