data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum,
- Last updated: 2026-07-03 00:07:49 UTC
- JSON Representation
https://github.com/matheussoranco/how-to-estimate-required-sample-size-for-model-training
Modeling the relationship between training set size and model accuracy.
artificial-intelligence data jupyter-notebook machine-learning python
Last synced: 22 May 2026
https://github.com/hemangsharma/bookingdataanalysisreport
The report helps understand key trends and insights around customer bookings, pricing, and other related attributes.
analysis data data-analysis data-analytics data-visualization streamlit streamlit-dashboard
Last synced: 14 May 2026
https://github.com/sofyan48/wahoo
Data stream library with kinesis
aws data data-stream event kinesis stream
Last synced: 14 May 2026
https://github.com/toluwaa-o/stears-lite-overview
Central overview repository for the Stears Lite project — documentation, resources, and links to frontend and backend repositories.
africa charts data data-aggregation data-visualization documentation fastapi nextjs project-overview
Last synced: 14 May 2026
https://github.com/zulfachafidz/green_horizon_forecasting_peak_organic_avocado_sales_with_the_prophet_algorithm
The Green Horizon Project leverages the Prophet algorithm to predict peak sales of organic avocados, supporting the campaign "APEAM GO ORGANIC." Using Python and Looker Studio, this analysis aims to provide deep insight into sales trends and potential, forming the basis of smarter marketing strategies.
algorithm algorithms analytics data data-analysis data-engineering data-mining data-science data-visualization forecasting machine-learning machine-learning-algorithms prophet-model python python-script
Last synced: 17 May 2026
https://github.com/nodamu/apache-beam-studies
Personal Apache Beam studies repository
apachebeam batch-processing data dataeng dataengineering datapipeline stream-processing
Last synced: 04 Jul 2026
https://github.com/analyticslover/salifort-motors-turnover-project
The Salifort Motors H.R. Project serves as the capstone for the Google Advanced Analytics Program on Coursera. This project presents a business scenario and a problem on the scnario context, employee turnover. In this project, essential techniques as EDA and Data Modeling are used to analyze and predict the employee turnover rates in the company.
data data-analysis datamodeling eda machine-learning pandas python sklearn
Last synced: 10 Apr 2026
https://github.com/The-Tech-Idea/Beep.winform.Sample
Application for Managing your Different DataSources . Still in Alpha.please be patient
application data data-science database dataset integeration mysql nosql oracle postgres sqlite sqlserver workflow-engine workflows
Last synced: 04 Nov 2025
https://github.com/karensaraimoralesmontiel/8-week-sql-challenge
Case Studies Solutions for the 8-Week-SQL-Challenge.
Last synced: 02 Jan 2026
https://github.com/amethyst-php/target
amethyst amethyst-package api data laravel target
Last synced: 22 May 2026
https://github.com/aiwithqasim/p1_explore-weather-trends
In this project, I'll analyze local and global temperature data and compare the temperature trends where I live to overall global temperature trends. Moreover i will use SQL query to extract data from the given Data base and i have to visualize the insight or Average temperature to find the findings.
data dataanalyst database datavisualization nanodegree udacity
Last synced: 22 May 2026
https://github.com/afeiship/data-selection
Data structure for radio/checkbox-group.
Last synced: 17 Jun 2025
https://github.com/rickyarians/practical-statistic-car-emission
Practical Statistic Project- Car Emission in Canada - 2022
data data-science dataanalysis r rmarkdown rpubs statistics
Last synced: 22 May 2026
https://github.com/goutam1511/real-time-covid-19-tracker-for-slack
This automated tracker tracks the spread of Covid-19 in a real time basis by scraping data from Ministry of Health and Family Welfare and notifies the same at Slack
covid-19 data python slack-bot web-scraping
Last synced: 30 Aug 2025
https://github.com/iamyourdre/naive-bayes-classifier-js
Naive Bayes classifier developed with MySQL, ExpressJS, and NodeJS by @iamyourdre.
backend data data-science expressjs javascript mysql naive-bayes naive-bayes-algorithm naive-bayes-classifier nodejs
Last synced: 08 Apr 2026
https://github.com/iyashwantsaini/tweetify_
Twitter Data Collection, Analysis Tool
collection data twitter twitter-sentiment-analysis
Last synced: 08 Mar 2026
https://github.com/agusk/ilmudata-book-excel-analytics
Hallo Microsoft Excel: Mastering Data Analytics
analytics data data-analytics excel power-query-editor
Last synced: 06 Jan 2026
https://github.com/mobinx/easymeet-js
EasyMeetjs is a robust and versatile TypeScript library that provides a solid foundation for building WebRTC-based applications. It simplifies the complexities of WebRTC, enabling developers to easily incorporate real-time communication features into their projects.From simple audio video calling to real time peer to peer file transfer , everything
data meeting react realtime screensharing streaming-video webrtc zoom
Last synced: 03 Jan 2026
https://github.com/merrill007/sql-data-warehouse-project
The Data Warehouse and Analytics Project is a comprehensive initiative designed to demonstrate the end-to-end process of building a modern data warehouse and deriving actionable insights through SQL-based analytics.
architecture business-intelligence crm data data-analysis database database-management datawarehouse erp etl etl-pipeline model sql sqlserver
Last synced: 22 Mar 2025
https://github.com/rosa-lpz/data-analysis-handbook
Data Analysis base knowledge and practical applications
data data-analysis data-visualization database dax documentation power-bi python r sql tableau tableau-public
Last synced: 06 Apr 2026
https://github.com/citizenlabsgr/data.world
Work with data sets prior to uploading to data.world
Last synced: 26 Mar 2025
https://github.com/richelbilderbeek/heyahmama
Data about the Flemish/Dutch band K3
band data k3 package r r-lang r-language
Last synced: 22 May 2026
https://github.com/kwame-mintah/ml-data-copy-to-aws-s3
Automatically copy new data to an AWS S3 bucket for Machine Learning.
Last synced: 14 May 2026
https://github.com/santoshshinde2012/medallion-architecture-databrics
Medallion Architecture: Principles and Practical Exploration
data data-plat data-science databricks databricks-notebooks medallion-architecture
Last synced: 26 Jul 2025
https://github.com/athari22/statistics-from-stock-data
Statistics from Stock Data
cvs data data-science dataanalysis datacleaning dataframe jupyter pandas pandas-python python statistics stock table
Last synced: 16 Feb 2026
https://github.com/pooja-manjunatha/nyc_parking_violations_dbt
This project uses dbt to transform NYC parking violations data through a layered architecture: Bronze: Raw ingested data Silver: Cleaned and enriched data Gold: Aggregated tables for analytics Using DuckDB as the warehouse backend, it ensures data quality with tests and documentation. The project enables reliable analysis of parking violations
data data-analysis data-engineering dbt duckdb python sql
Last synced: 14 May 2026
https://github.com/realbxnnie/accountservice
A Simple DataStoreService wrapper with session backuping and session locking.
Last synced: 29 Jul 2025
https://github.com/valyaevgeorgiy/r_basic
Работа с основами среды R и тем самым изучения нового языка программирования, связанного непосредственно с анализом данных и построением графиков и диаграмм.
coding data data-analysis r rstudio
Last synced: 12 Dec 2025
https://github.com/shubhamsoni98/analysis-with-sql
This project focuses on creating and managing a database for a music record company to perform various analyses on bands, albums, and songs. Using SQL, the goal is to create a structured relational database with relevant tables, insert necessary data, and perform queries that provide insights into the relationships between bands, albums, and songs.
analys analysis data data-science database dbms mysql mysqlworkbench project query schema sql
Last synced: 03 Jan 2026
https://github.com/kenanbek/youtube-data
YouTube stats data over YouTube Data API v3 using Python.
data python youtube youtube-api
Last synced: 13 May 2026
https://github.com/charlieroth/exoexplo
Exploring NASA Exoplanet Archive Data
Last synced: 03 Apr 2025
https://github.com/alex0x4b/akutils
High-level Python library for recurring data manipulation (Pandas, Python data structure, API, file manipulation, etc.).
Last synced: 08 Mar 2026
https://github.com/advisors-excel-llc/angular-datafree
angularjs data data-visualization datafree-directive
Last synced: 30 Sep 2025
https://github.com/push-protocol/push-google-bigquery
The Power of Web3 Big Data: A Guide to Using Google BigQuery and Push Protocol for Data Communication and Analysis
bigquery data push push-notifications web3
Last synced: 26 Mar 2025
https://github.com/aliasgarsogiawala/dashboards
Power BI dashboards , each folder contains a pbix file and a pdf file with explanation of the dashboard
analysis dashboards data data-visualization powerbi
Last synced: 12 Feb 2026
https://github.com/rajesh9943/web-scraping-analysis-of-top-us-company-revenue-growth-in-2023
Explore the landscape of US business growth in 2023 with our dynamic project, 'Web Scraping for US 2023 Revenue Growth.' Utilizing advanced web scraping techniques, we unveil insights into the top companies driving economic expansion.
cleaning-data data data-analysis data-visualization manipulation numpy pandas pre-fill
Last synced: 16 Aug 2025
https://github.com/RedInfinityPro/ScientificSharp
Rating: (5/10) The code is a Windows Forms application for a basic scientific calculator, allowing users to perform mathematical operations like addition, subtraction, multiplication, division, trigonometrics, and logarithms.
componentmodel cryptography data drawing forms generic linq system tasks text
Last synced: 30 Sep 2025
https://github.com/luminati-io/linkedin-dataset-samples
Sample dataset of 1001 LinkedIn companies, extracted via Bright Data API, featuring essential data points for competitive analysis and market insights.
data database dataset linkedin linkedin-api linkedin-data linkedin-dataset linkedin-scraper sample web-scraping
Last synced: 17 Mar 2025
https://github.com/thesfinox/fit-the-data
Data analysis using Wolfram Mathematica
analysis data data-analysis lab mathematica wolfram wolfram-mathematica
Last synced: 24 Jan 2026
https://github.com/patrikcze/meshtatic_data
Meshtastic Data Transfer - Trying some stupid thing, like transferring files over LORA network.
data meshtastic meshtastic-python
Last synced: 03 Feb 2026
https://github.com/vladandreitoma/igisol_jyvaskyla_xept_experimental_campaign
A simulation toolkit together with data analysis for the Xe&Pt Exotic Nuclei Generation experiment @ Jyvaskyla December 2022. Helping dr.Paul Constantin with simulation development. Simulation is done using Geant4 provided by CERN. Data anlysis is done using ROOT by Cern. Both C++ based. Job distributors to run the sim are coded in pearl
analysis architecture-design cplusplus data oop oop-principles pearl simulations
Last synced: 05 Sep 2025
https://github.com/krescruz/pegaso-data
Utilerías para el analisis de datos del Proveedor de Certificación de Factura Pegaso
Last synced: 29 Apr 2026
https://github.com/tusharios/weatherappwithmoya
binding data moyaexampleswift mvvm-architecture swift5 weather-app
Last synced: 28 Mar 2025
https://github.com/pyrustic/litedao
Intuitive interaction with SQLite database
auto-init dao data database database-access library lightweight pyrustic python sql sqlite
Last synced: 09 May 2026
https://github.com/ethenkem/PyGraphSurvey
A python base web app that provide graphical analysis on data collected from surveys and the system has its on built in form fiiling where admin can set question and sent a link for the forms to be filled and then the system provide anylysis on the collected data. Form feature include selection options, range values file inputs etc
Last synced: 30 Apr 2025
https://github.com/ahmad-ali-rafique/random-forest-classifier-modeling
Detailed exploration of random forest classifiers, including data cleaning, model building, and performance evaluation on various datasets.
classification classification-models data dataanalytics datamodel dataset model-checking models random-forest random-forest-classifier
Last synced: 01 Jun 2026
https://github.com/ahmad-ali-rafique/random-forest-regressor-modeling
Detailed exploration of random forest regressors, including data cleaning, model building, and performance evaluation on various datasets.
data dataanalytics datacleaning evaluation-metrics modeling random-forest random-forest-regression regression regression-analysis
Last synced: 05 Mar 2025
https://github.com/ahmad-ali-rafique/electricity-consumption-analysis-household-dataset
This repository contains analysis and predictive modeling of household electricity consumption using Python. It includes data cleaning, exploratory data analysis (EDA), time series forecasting (ARIMA, SARIMA, LSTM), and model evaluation to optimize energy usage.
arima-forecasting artificial-intelligence artificial-neural-networks data data-science dataanalytics datacleaning evaluation-metrics exploratory-data-analysis long-short-term-memory lstmmodel modeling time-series timeseries-forecasting
Last synced: 23 Jun 2025
https://github.com/mattpap/pycon-2017-bokeh
Bokeh tutorial at PyCon.PL 2017
bokeh data tutorial visualization
Last synced: 17 Mar 2025
https://github.com/bala-1409/sales-forecasting-datascience-project
Develop a data science project using historical sales data to build a regression model that accurately predicts future sales. Preprocess the dataset, conduct exploratory analysis, select relevant features, and employ regression algorithms for model development. Evaluate model performance, optimize hyperparameters, and provide actionable insights.
data data-analysis data-science data-visualization datacleaning exploratory-data-analysis machine-learning-algorithms modelfitting prediction predictive-analytics predictive-modeling python3 regression-models salesforecast supervised-learning
Last synced: 26 Apr 2026
https://github.com/bala-1409/loan-classification-data-science-projects
This project uses machine learning algorithms to predict the classification of loan status. The dataset is loaded and some transformation is done using SQL for getting a proper dataset with some valid informations.
data data-analysis datacleaning datascience datavisualization exploratory-data-analysis loan machine-learning machine-learning-algorithms modelfitting sql supervised-learning visualization
Last synced: 22 Mar 2025
https://github.com/itsmeyogesh22/solved-8-weeks-sql-challenge-correct-solutions
Included in Serious SQL Virtual apprenticeship program, this repository contains solutions for all eight different case studies crafted by Danny Ma. For more information please visit: https://8weeksqlchallenge.com/
8weeksqlchallenge data dataanalytics datawithdanny postgresql sql sqlserver-2022 t-sql
Last synced: 07 Apr 2025
https://github.com/salman-khan-mohammed/youtube-data-analysis-sentiment-analysis
Analyzing the YouTube Data
data data-visualization plotly-express scrapping-data sentiment-analysis
Last synced: 26 Mar 2025
https://github.com/maulanakavaldo/tri-hita-karana
Project Tri Hita Karana - Future Knowledge G20 Bali. DTS Kominfo x Binar Academy.
bali data data-science g20 science
Last synced: 02 Mar 2025
https://github.com/gui-sitton/prepaid
In this project I work as an analyst for the telecommunications company Megaline. The company offers its customers prepaid plans, Surf and Ultimate. The sales department wants to know which plans bring in the most revenue in order to adjust the advertising budget
data data-analysis data-analysis-python data-science data-visualization python
Last synced: 22 May 2026
https://github.com/shubhamsoni98/classification-with-random-forest---2
Fraud detection is a critical task for financial institutions and businesses. This document outlines the end-to-end process of predicting fraudulent activities using a Random Forest model. The process includes data preparation, exploration, model training, and evaluation.
algorithms anaconda data data-science dataflow feature-engineering jupyter-notebook machine-learning model modeltraining prediction python random-forest sql visualization
Last synced: 20 Jan 2026
https://github.com/domarps/grad-project-reports
Write-ups of a few key semester-long projects I have worked during my Masters
circuit data deeplearning graph-algorithms matlab question-answering
Last synced: 26 Mar 2025
https://github.com/moons-14/datapot
Incorporate and serve all information.
ai aiogram api data infomation news newspaper rss video
Last synced: 04 Jan 2026
https://github.com/khansasafira19/sk-cool-storytelling
Source Code for Data Storytelling with HTML5
data html5 javascript storytelling
Last synced: 13 May 2026
https://github.com/jor-/measurements
Python functions to handle, statistically analyze and plot measurement data.
Last synced: 17 Mar 2025
https://github.com/amethyst-php/setting
Give the user the ability to configure his own settings
amethyst amethyst-package api data laravel setting
Last synced: 19 May 2026
https://github.com/gregoritsch3/project_excel_dataanalysis_carsales
An Excel Data Analysis project based on a vehicle vendor's car sales data from 2014 and 2015 showcasing data cleaning and formatting, DAX, pivot tables and charts, timelines, slicers, an interactive Dashboard, descriptive Statistics and more.
analysis dashboard data excel sales statistics
Last synced: 01 Feb 2026
https://github.com/amethyst-php/price
Define prices and attach them to any model
amethyst amethyst-package api data laravel price
Last synced: 17 May 2026
https://github.com/lordzintick/spellcaster-api-1.21.4
A server-side Fabric mod to load JSON spell files from datapacks.
api api-server data fabric fabric-mod fabricmc json magic minecraft-mod server-side small spells
Last synced: 08 May 2026
https://github.com/ayresgneto/use-case-gcp-etl
ELT pipeline GCP. Tecnologias utilizadas: Postgresql, GCP Storage, Airflow (local), Pyspark (local), BigQuery
airflow big-data bigquery data data-engineering etl gcp pipeline postgresql programming-oriented-object pyspark python spark
Last synced: 03 Jan 2026
https://github.com/amethyst-php/data-view
amethyst amethyst-package api data data-view laravel
Last synced: 19 May 2026
https://github.com/amethyst-php/source
The source of information. It can be used to save the origin of whatever information (news, books, etc.. )
amethyst amethyst-package api data laravel source
Last synced: 27 Apr 2026
https://github.com/srindot/average_flightdata_collection_fwuav
This repository is designed for collecting average data for a flapping wing UAV. The script acg_coeff_data_collection.py runs the necessary data collection, and the resulting data is saved into a CSV file called AverageFlightData.csv.
Last synced: 18 Sep 2025
https://github.com/bagustris/dataits
Web for DataITS17: Summer School on Data Science
Last synced: 28 Jun 2025
https://github.com/yourdataarchitect/abyat-scaring-
This Scrapy spider for automates the extraction of product data from the Abyat website using Hidden Backend API, supporting both Arabic and English content.
data database scraper scrapy-crawler
Last synced: 23 Apr 2026
https://github.com/rudxain/xorsum
Get XOR checksum with this command-line tool
binary checksum cli data digest file files hexadecimal rust-crate xor
Last synced: 08 Mar 2026
https://github.com/ffatahillah7/snowflake-tastybytes-data-warehouses
Build Snowflake Tasty Bytes Warehouses
data data-warehouse mysql snowflake sql warehouse
Last synced: 26 Mar 2025
https://github.com/ashishsingh789/data_visualization
Data visualization project using Python to analyze categorical and continuous variables. Includes bar charts, histograms, and scatter plots. Libraries used: pandas, matplotlib, and seaborn.
analysis barchart data data-science data-visualization histogram matplotlib pandas-dataframe scatter-plot seaborn
Last synced: 07 Sep 2025
https://github.com/deliprofesor/cardiac-data-analysis-exploring-cholesterol-and-heart-rate
This project analyzes a heart disease dataset to explore the relationship between cholesterol, heart rate, and chest pain type. It includes normality tests, outlier detection, correlation analysis, MANOVA, post-hoc tests, and VIF analysis, with visualizations using histograms, heatmaps, and boxplots.
correlation-analysis data data-cleaning data-visualization machine-learning manova post-hoc-analysis python tukey-hsd vif
Last synced: 17 May 2026
https://github.com/joseluisq/input-verifier
Some useful functions to check common data input.
Last synced: 19 Jul 2025
https://github.com/zshn1248/pyfilecrypto
PyFileCrypto is a Python module for easy encryption and decryption of files using the cryptography library. It provides a simple interface to generate encryption keys, encrypt files, and decrypt files securely.
data decryption encryption file security-tools
Last synced: 07 Apr 2026
https://github.com/saisurajmatta/data-warehousing-and-advanced-data-analytics
Data Analytics Project: Analyzed Promotions and Provided Tangible Insights to Sales Director
data data-analysis data-architecture data-flow-analysis data-modeling data-pipeline data-segmentation data-visualization data-warehousing docker etl etl-pipeline mssql sql tableau
Last synced: 17 May 2026
https://github.com/amethyst-php/sku
amethyst amethyst-package api data laravel sku
Last synced: 17 May 2026
https://github.com/sharoonjoseph321/social_media_eda
Data Analysis on social media apps ,using pandas, python, matplotlib.
data data-analysis data-science data-visualization matplotlib programming-language project python pythonprojects
Last synced: 03 Mar 2025
https://github.com/jcloh98/rental-property-finder
A web scraper that helps users find rental properties by automatically gathering and organizing listings from various websites to discover available homes and apartments.
data headless-browser node scraper scraping web
Last synced: 17 May 2026
https://github.com/merekat/hb-passiv-income
Ein Rechner, der basierend auf historischen Daten unterschiedlicher Assets kalkuliert, welches voraussichtliche passive Einkommen der User abhängig von seinen Eingaben zu erwarten hat.
assets data datajournalism etf passive-income treasury
Last synced: 19 Jul 2025
https://github.com/vidya-vijay/vid2501
About me
analytics data data-science machinelearning python r spss sql statistics tableau visualization
Last synced: 19 Jul 2025
https://github.com/Vidya-Vijay/Vid2501
About me
analytics data data-science machinelearning python r spss sql statistics tableau visualization
Last synced: 19 Jul 2025
https://github.com/UznetDev/Smoking-Prediction
This project focuses on analyzing the "Smoking" dataset and building a predictive model for smoking status based on various health metrics. The goal is to identify factors influencing smoking behavior and develop a reliable model for prediction.
ai classification data data-science kaggle-competition machine-learning ml roc-auc sklearn smoking
Last synced: 28 Mar 2025
https://github.com/dimitryzub/allrecipes-us-recipes-by-state-analysis
Personal Data Exploratory Project in Python. Data extracted from AllRecipes.
data data-visualization dataexploration dataextraction matplotlib pandas python seaborn webscraping
Last synced: 10 May 2026
https://github.com/potlock/data
data research for other funding mechanisms and PotLock related data.
data flipsidecrypto near-protocol potlock
Last synced: 07 Mar 2026
https://github.com/chompfoods/sdk-scala
Scala SDK for the Chomp Food & Recipe Database API. Use our API to get high-quality data on recipes and 875,000+ branded/grocery foods plus raw ingredients.
api branded chomp data database food grocery ingredients nutrition raw recipe-api recipes scala sdk
Last synced: 17 May 2026
https://github.com/reubano/pyconza-tutorial
Jupyter notebooks and data for "Data Mining and Processing for fun and profit" PyConZA16 tutorial
data functional-programming jupyter-notebook meza pycon python tutorial
Last synced: 17 May 2026
https://github.com/topunix/hackerrank
:green_book: HackerRank Solutions
algorithm-challenges algorithms algorithms-and-data-structures data data-structures hackerrank hackerrank-algorithms-solutions hackerrank-challenges hackerrank-python hackerrank-solutions python
Last synced: 17 May 2026