data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum
- Last updated: 2026-07-01 00:07:35 UTC
https://github.com/roggersanguzu/weather-medical-expense-prediction-ml-models
This repo contains one model for determining rainfall patterns and another for predicting medical expenses.
data data-analysis data-science datasets joblib machine-learning machine-learning-algorithms scikitlearn-machine-learning
Last synced: 30 Aug 2025
https://github.com/woctezuma/hidden-gems-data
Data available to compute regional rankings of hidden gems.
data hidden-gems steam steam-reviews
Last synced: 06 Feb 2026
https://github.com/eugenedakin/des-encryption-decryption
Encrypt and Decrypt text in Xojo using DES - Written in Native Xojo Language - Cross Platform
data data-encryption-standard decryption des encryption standard xojo
Last synced: 24 Feb 2026
https://github.com/eshitakundu/disease-outbreak-predictor
Disease Outbreak Predictor: A Streamlit-based web application for predicting diabetes, heart disease, and Parkinson's disease using machine learning models.
data data-science disease-prediction healthcare-application jupyter-notebook machinelearning ml notebook prediction python streamlit streamlit-webapp
Last synced: 01 May 2026
https://github.com/ssiarhei115/countryhouse-price-prediction
ML modeling for house price prediction in Belarus
big-data data data-science fullstack fullstack-development mashine-learning parsing parsing-engine
Last synced: 28 Aug 2025
https://github.com/karensaraimoralesmontiel/8-week-sql-challenge
Case Studies Solutions for the 8-Week-SQL-Challenge.
Last synced: 02 Jan 2026
https://github.com/peterhellberg/bugsnag-data
Dump Bugsnag data using the Data access API
Last synced: 22 Jun 2026
https://github.com/tasosfotiadis/time-series-forecasting-for-bitcoin
This project forecasts Bitcoin’s daily closing price using time series models. Data from Jan 2021 to Mar 2022 is processed by converting timestamps, resampling, and handling missing values. LSTM and ARIMA models are evaluated on MAE, RMSE, and MAPE, with LSTM achieving better accuracy while ARIMA is faster in training and inference.
arima bitcoin data data-analysis data-science deep-learning forecasting jupyter-notebook neural-networks python time-series
Last synced: 06 May 2026
https://github.com/debjyotisaha/tableau-projects-phase-2
Published interactive dashboards on Tableau Public, highlighting expertise in data visualization and storytelling through analyses of transportation patterns, sales trends, and demographic studies. These projects showcase the ability to transform complex datasets into actionable, intuitive visuals for decision-making.
dashboards data data-analysis data-visualisation tableau
Last synced: 26 Aug 2025
https://github.com/0xnu/data-analyst-training
The repository contains training materials for data analysts.
data data-analysis data-analyst
Last synced: 25 Aug 2025
https://github.com/franckalbinet/maris-crawlers
Automated data harvesting of MARIS data sources
automation data marine-radioactivity
Last synced: 25 Aug 2025
https://github.com/nodamu/apache-beam-studies
Personal Apache Beam studies repository
apachebeam batch-processing data dataeng dataengineering datapipeline stream-processing
Last synced: 04 Nov 2025
https://github.com/hafs96/prediction_consommation-de-carburant
The goal of this project is to develop a model that predicts whether a car has high or low fuel consumption based on its technical characteristics.
analysis data data-visualization machine-learning testing training
Last synced: 09 Jun 2026
https://github.com/aimin-nur/data-analyst-model-predictive
A data analyst project that aims to identify the characteristics of customers likely to accept a marketing campaign offer.
analyst data mechine-learning visualization
Last synced: 29 Jan 2026
https://github.com/mubashirsidiki/olympics-data-enigeering
Worked with Azure Data Factory, Databricks, Data Lake Storage, and Synapse Analytics to build an ETL pipeline for processing and analyzing Olympic Games data from Kaggle.
analytics azure big-data data dataengineering devops pipeline
Last synced: 02 May 2026
https://github.com/paulrosset/cyclone
Network data consumption monitoring
data monitoring network networking
Last synced: 23 Aug 2025
https://github.com/thedragoncode/training-data-for-ai
Training data for the neural network
ai data flood meaningless neural-network neural-networks nn obscene politics spam toxic training
Last synced: 29 Jan 2026
https://github.com/s1dewalker/electric-future
Visual Analysis: Future of Automotive Industry
data data-visualization machine-learning python3 regression-analysis tableau
Last synced: 02 May 2026
https://github.com/data-forge-notebook/ohlc-aggregation-example
An example of aggregating OHLC stock data using Data-Forge Notebook
algorithmic-trading data data-aggregation data-analysis ohlc quantitative-finance share-market stock-market trading
Last synced: 30 Jan 2026
https://github.com/canadaluke888/ttb2
TerminalTableBuilder 2
c17 csv data database datasets datautils json ncurses ods spreadsheet sqlite3 tables terminal terminaltablebuilder terminaltablebuilder2 ttb ttb2 ttbx xlsx
Last synced: 10 Apr 2026
https://github.com/urvish-06/seaborn-dataset
Seaborn data sets
csv csv-files data data-science data-visualization dataset example jupyter-notebook jypyternotebook python seborn vacation
Last synced: 18 May 2026
https://github.com/rationalprabal/book-management-app
A Node.js and Express.js application for managing books, featuring role-based authentication and authorization with JWT, file uploads for book cover pages, robust data validation and documentation using swagger. The project includes user roles such as Admin, Author, and Reader, each with specific permissions.
data expressjs jwt-authentication mongodb mongoose nodejs rbac-roles
Last synced: 10 Apr 2026
https://github.com/lut-ful/pizza-sales-report
This Pizza Sales Report provides valuable insights into sales performance through detailed analysis and visualizations. By leveraging Power BI and SQL Server
data data-wrangling microsoft-sql-server power-bi power-bi-dax python
Last synced: 30 Jan 2026
https://github.com/hakusaro/facts
A fact-based knowledge system (FBKS) experiment.
Last synced: 03 Jan 2026
https://github.com/vbhatsaccnt/retail-strategy-and-analytics-optimization-of-control-stores-for-sales-enhancement
In this project, we aim to optimize the performance of retail chain stores by establishing control stores based on their performance compared to selected trial stores. By leveraging data analytics and strategic insights, we seek to enhance sales revenue and drive growth within the retail chain.
customer-segmentation data data-science risk-analysis
Last synced: 13 May 2026
https://github.com/denisecase/dc-texter
Send a text message using Python
alerts data python sms-messages streaming
Last synced: 08 Feb 2026
https://github.com/farhad2415/Job_Scraper
Job-site-based job scraping with Python
automation bash-script data data-scraping data-structures python selenium selenium-python
Last synced: 15 Aug 2025
https://github.com/anyantudre/associate-data-scientist-track
Materials for the Associate Data Scientist in Python track on DataCamp.
data data-science experimental-design hypothesis-testing machine-learning matplotlib-pyplot pandas python regression sampling seaborn statistics statsmodels unsupervised-learning
Last synced: 03 May 2026
https://github.com/meicloudie/react-practice-react-router-and-authentication
Learning React Project - @academind-maxschwarzmueller
authentication data javascript practice-project react react-router
Last synced: 13 May 2026
https://github.com/rorovic/rorovic.github.io
My GitHub blog
code data datawarehouse devops realtime
Last synced: 01 Feb 2026
https://github.com/twilighty-abhi/locust-data-visualiser
Locust Data Visualiser
Last synced: 15 Aug 2025
https://github.com/didier/functional-programming
Functional Programming subject of @CMDA-TT
convenience d3 d3-visualization d3js data datavis datavisualization front-end functional functional-programming interactive jsdoc node nodejs-modules parking-spots typescript
Last synced: 03 May 2026
https://github.com/pythoncoderunicorn/jamesbeardaward
a repo for James Beard Award data
Last synced: 07 Feb 2026
https://github.com/yugsumeet17/churn-analysis-project--power-bi-sql-machine-learning
Dataset Explained, Project Goals & Metrics Required, SQL Server ETL & Data Cleaning, Power BI Data Load, Transformation, Blueprint & Measures, Power BI Visualization - Summary Page, Building Machine Learning Model - Random Forest, Power BI Visualization - Churn Prediction Page
data data-visualization dataanalytics excel postgresql powerbi python3
Last synced: 03 May 2026
https://github.com/seqeralabs/ffq-api
A minimal wrapper to make ffq searches available via a REST API.
api data fastq fetch-fastq ffq genomics
Last synced: 15 Aug 2025
https://github.com/jleung51/foundations-dags
Data ETL pipeline to clean, process, and aggregate data from Canadian housing starts.
data data-engineering etl extract housing load pipeline transform
Last synced: 04 Oct 2025
https://github.com/aaisha-nexus/sql_company_insights
A beginner-friendly SQL project for managing employee records, departments, and sales transactions. Includes table creation, optimized queries, stored procedures, and window functions to extract business insights.
business-analytics data data-analysis dataanalysis-projects dataanalytics database-schema mssql-database query relational-databases sql sql-query ssms
Last synced: 12 Aug 2025
https://github.com/fallaciousreasoning/nz-mountains
A list of mountains in NZ, scraped from https://climbnz.org.nz
alpine climbing climbnz data json json-api maps mountaineering scraping
Last synced: 04 May 2026
https://github.com/ymorsi7/quranicvisualization
A visual exploration tool for the Holy Quran using D3.js treemaps.
css d3 d3js data data-visualization html islam islamic javascript js quran quranic treemaps visualization
Last synced: 15 Apr 2026
https://github.com/interzoid/php-examples
Provides PHP examples for consuming several of the Cloud APIs available from Interzoid, including company name matching, individual name matching, weather, page performance, email validation, currency rates/FOREX, and global telephone information.
api cloud data database php quality
Last synced: 12 Jan 2026
https://github.com/michaelfromyeg/lyrics
Lyric-store and API hosted on Git.
Last synced: 08 Feb 2026
https://github.com/keziatbnn/supervised-regression-salaryprediction
Make salary predictions based on years of experience using supervised regression.
data data-analysis-python data-prediction data-science python
Last synced: 11 Aug 2025
https://github.com/andrii04/andreamonforte-bi-assignment
Automated Data Pipeline that ingests daily GA4-formatted CSV files from a private Google Cloud Storage bucket, validates and loads them into BigQuery, and prepares analysis-ready views. The solution is built for deployment as a Cloud Function triggered by Cloud Scheduler and uses Python with the Google Cloud Storage and BigQuery client libraries.
automation bigquery cloud cloudfunctions data data-analysis data-engineering etl etlpipeline gcp google googlecloudplatform pipeline python sql
Last synced: 09 Nov 2025
https://github.com/matt-dray/draytasets
🔢🥸 Miscellaneous datasets I've collected or prepared
Last synced: 09 Feb 2026
https://github.com/srindot/fwuav-average-flight-data-collection
This repository is designed for collecting average data for a flapping wing UAV. The script acg_coeff_data_collection.py runs the necessary data collection, and the resulting data is saved into a CSV file called AverageFlightData.csv.
Last synced: 10 Aug 2025
https://github.com/4ment/aiv-rate-heterogeneity
Avian influenza virus data sets
Last synced: 24 Jan 2026
https://github.com/kiing-dom/data-structures-algorithms
data structures and algorithms
algorithms-and-data-structures data data-structures java leetcode
Last synced: 09 Aug 2025
https://github.com/enescidem/twitter-topic-modeling
Topic modeling is an unsupervised method to identify topics in text. This project analyzes tweets from prominent Turkish accounts to uncover underlying themes in their shared content.
data data-science machine-learning nlp topic-modeling twitter x
Last synced: 10 Feb 2026
https://github.com/javdomgom/nifi-custom-processors
Apache NiFi custom processors
apache-nifi bigdata data data-engineering datascience flowfile nifi nifi-custom-processor
Last synced: 27 Feb 2026
https://github.com/brayflex/spy-sector-rotation-google-sheet
Creates a dynamic spreadsheet to visualize SPY and its 11 largest sector ETFs. See market trends and identify potential sector rotation opportunities.
data etf google-sheets index price rotation script sector spreadsheet spy stock-market
Last synced: 29 Jun 2026
https://github.com/os-climate/data-requests
This repo is used to track issues related to new Data Requests
Last synced: 27 Feb 2026
https://github.com/fabsdevx/files-to-database-loader-handout
Data Engineering project for learning purposes. Credits to itversity
csv data data-engineering database json pandas python
Last synced: 09 Apr 2026
https://github.com/utrechtuniversity/momentum-dataflow
Repository for publishing website about data management practices of the Momentum project
data datageneration datamanagement
Last synced: 27 Feb 2026
https://github.com/chompfoods/sdk-java
Java SDK for the Chomp Food & Recipe Database API. Use our API to get high-quality data on recipes and 875,000+ branded/grocery foods plus raw ingredients.
api branded chomp data database food gradle grocery ingredients jar java java-sdk nutrition openapi raw recipe-api recipes sdk
Last synced: 09 Apr 2026
https://github.com/ppabam/eda-bam
Navigating data from one thing to another.
Last synced: 11 Feb 2026
https://github.com/praveendecode/retail-revenue-forecasting
Designed an end-to-end ML model pipeline, forecasting department-wide sales by accounting for holiday markdown effects, spanning data collection to inferencing.
azure collection data datapreprocessing docker exploratory-data-analysis feature-engineering featureimportance model modelbuilding modeldeployment modelselction python report tableau
Last synced: 16 Apr 2026
https://github.com/iliyasalve/cyclistic_case_study
Analysis of the Bike-Sharing System for the following question: "How do annual members and casual riders use Cyclistic bikes differently?"
bike-sharing data data-analysis data-visualisation r
Last synced: 06 Apr 2025
https://github.com/sourceduty/text_file_metadata
📄 Extract metadata from .txt files and record the metadata in .txt files.
data datascience metadata metafile practice sourceduty
Last synced: 08 Aug 2025
https://github.com/a-poor/datatransform.jl
A package for defining (and performing) tabular-data transformations with JSON.
data data-science data-transformation etl feature-engineering json julia julia-package tabular-data
Last synced: 05 May 2026
https://github.com/prakhargpt/sql-data-warehouse-project
Building Data Warehouse project using SQL Server, including ETL processes, data modelling and analytics.
analytics data data-analysis data-cleaning data-engineering data-engineering-pipeline data-lakehouse data-science data-warehouse etl etl-job etl-pipeline medallion-architecture sql sql-server
Last synced: 12 Jun 2026
https://github.com/nouraalgohary/fifa-world-cup-data-analysis
data dataanalysis powerbi powerbi-visuals
Last synced: 19 Mar 2026
https://github.com/kirillsemyonkin/lsd
LSD (Less Syntax Data) configuration/data transfer format.
configuration data java parsing rust
Last synced: 27 Feb 2026
https://github.com/sourceduty/language_barriers
🔤 Language barriers between the world's 7,000 languages.
communication concept data idea info information language language-barrier language-barriers languages project research
Last synced: 11 Feb 2026
https://github.com/sourceduty/cults_3d
🔢 Software concept for additional statistics from Python for Cults design data .csv files.
3d 3d-model 3d-model-software 3d-modelling account account-management concept cults cults-3d data idea sourceduty
Last synced: 08 Aug 2025
https://github.com/pawamoy/keycut-data
Keyboard shortcuts data stored in YAML files
Last synced: 12 Feb 2026
https://github.com/foundationallm/.github
A platform accelerating delivery of secure, trustworthy enterprise copilots.
agent ai data enterprise generative-ai large-language-model llm ml tool
Last synced: 12 Feb 2026
https://github.com/chanchalsoorma/web-scraping
This repo aims to provide a straightforward, easy-to-use scraping code written in Python.
beautifulsoup beautifulsoup4 data python request selenium webscraping
Last synced: 05 May 2026
https://github.com/shibbbbs/fastapi_project
A FastAPI application that reads financial data from an Excel file (capbudg.xls) and provides API endpoints to list available tables (sheet names), fetch row names from a selected table, and calculate the sum of numerical values from a specified row. The API is accessible via a web-based interactive documentation at /docs
data dataanalysis fastapi pandas python
Last synced: 06 May 2026
https://github.com/sourceduty/digital_brand_footprint
🔗 Expert in finding and analyzing branded websites and social media links.
analytics artificial-intelligence business business-footprint businesses chatgpt company concept data link openai social-media tool url website
Last synced: 16 Aug 2025
https://github.com/sanand0/iss-location
Tracks the International Space Station position. A demo of how to use GitHub Actions to schedule commits weekly.
Last synced: 14 Feb 2026
https://github.com/adilsaid64/real-time-data-monitoring
Exploring what a real-time data drift monitoring solution could look like within MLOps
data datadrift grafana machine-learning mlops mlops-workflow prometheus python software-engineering
Last synced: 04 Aug 2025
https://github.com/molinsagustin/cinedata
CineData: group assignment for the Data Engineering I course at Universidad Argentina de la Empresa. It consisted of developing a relational database in Microsoft SQL Server Management Studio using the Agile Scrum methodology, which was applied from requirements gathering through final implementation.
agile data data-modeling database diagram entity-relationship-diagram microsoft-sql-server relational-databases relational-model scrum scrum-agile sql sqlserver
Last synced: 28 Feb 2026
https://github.com/abhibisht89/data-visualization
data matplotlib pandas ploty python visualization
Last synced: 06 May 2026
https://github.com/sunnahboy/checkfake_true_news
Building data structures using linked lists and arrays, and finding the best algorithms for implementing a fake news detection system.
algorithms data level low programming structure
Last synced: 28 Feb 2026
https://github.com/ddeepanshu-997/support_vector_regression--svr-
In this repository I performed support vector regression on real-life data. I first applied data preprocessing techniques to filter out flaws in the data, then built an SVM regression model to produce a machine learning regression model.
data data-science regression-analysis regression-models svm-model svm-regression
Last synced: 03 Aug 2025
https://github.com/ims94/ballerina-tsv-querying
An example Ballerina project to query tsv data using Ballerina language integrated queries
ballerina ballerina-lang data olympics query sql
Last synced: 03 Feb 2026
https://github.com/madhuresh2011/genai-powered-data-analytics-by-tata
I recently participated in Tata iQ's job simulation on the Forage platform, and it was incredibly useful to understand what it might be like to be on a data analytics team in an AI transformation consulting role.
chatgpt data dataanalytics eda excel gemini generative-ai internships powerpoint presentation
Last synced: 14 Feb 2026
https://github.com/chrisabruce/scrapling-rs
Adaptive web scraping, built in Rust. A high-performance port of Python Scrapling.
ai ai-scraping automation crawler crawling crawling-rust data data-extraction mcp mcp-server playwright rust-lang scraping selectors stealth web-scraper web-scraping web-scraping-rust webscraping xpath
Last synced: 26 Jun 2026
https://github.com/servierhub/adsv
Analyze delimiter-separated values files
csv csv-converter csv-format csv-parser csv-parsing csv-reader csv-reading data data-analysis data-engineering data-mining
Last synced: 28 Sep 2025
https://github.com/badranalyst/covid-deaths-and-vaccinations-sql-data-exploration
This project involves exploratory data analysis on COVID-19 deaths and vaccinations data using SQL. It aims to uncover trends, patterns, and insights related to vaccination rates and their impact on mortality. The analysis provides a clearer understanding of the pandemic's dynamics, facilitating data-driven decisions in public health.
covid-19 data data-exploration dataset sql
Last synced: 19 Feb 2026
https://github.com/nmelgar/marathons_data_viz
Data visualization project to analyze finishing times and other data.
csv csv-files data data-analysis data-insight data-visualization data-viz dataset tableau
Last synced: 15 Feb 2026
https://github.com/plurid/datasign
Single Source of Truth Data Contract Specifier
Last synced: 08 Nov 2025
https://github.com/nagar2nd/ml-regressionmodel---cardekho-price-prediction
This repository features a machine learning model for predicting used car prices using data from CarDekho.com. The project leverages exploratory data analysis and regression techniques to empower sellers and buyers with actionable insights in the Indian used car market.
analytics cleaning-data data linear-regression machine-learning matplotlib numpy pandas python seaborn
Last synced: 16 Apr 2026
https://github.com/xljones/bugsnag-exporter
Export Bugsnag project, error, and event data easily from a command-line call that automatically handles pagination and API backoffs
bash bugsnag cmd csv data error error-capture error-handling error-reporting event export go golang json project zsh
Last synced: 06 May 2026
https://github.com/rubidev68/citadelai-community
Community version of citadelai.app
ai ai-assistant chatbot chatbot-framework data knowledge-management silo-digital
Last synced: 03 Feb 2026
https://github.com/edjoukou/human_resources
A data analysis project using MySQL Server database
analysis data mysql powerbi sql visualization
Last synced: 25 Sep 2025