Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2026-07-01 00:07:23 UTC
- JSON Representation
https://github.com/prajjwol09/data-cleaning-project
This project is dedicated to cleaning, standardizing a dataset, dealing with null values from a CSV file named "layoffs" using MySQL, with MySQL Workbench as the workspace environment. The goal is to prepare the data for analysis.
cleaning-data columns data-analysis database duplicates mysql rows standard
Last synced: 20 Apr 2026
https://github.com/phomint/udacity_free_datawragling_with_mongodb
Udacity Free course to use MongoDB
data-analysis mongodb udacity-course
Last synced: 11 Jun 2025
https://github.com/jnnjenga/video-game-analysis
Performs exploratory data analysis on a video game dataset, examining titles, release dates, teams, ratings, genres, and user engagement metrics to uncover trends, popular genres, and developer insights.
data-analysis data-science dataset eda exploratory-data-analysis game-analysis games genres jupyter-notebook pandas python trends user-engagement video-game visualization
Last synced: 13 Apr 2026
https://github.com/ilke-kas/multivariate-data-analysis
A curated collection of R-based data analysis projects applying regression modeling, clustering, dimensionality reduction, multivariate statistics, and classification. Each project showcases practical data science techniques, interpretability, and domain insights using real-world and academic datasets.
classification data-analysis data-visualization dimensionality-reduction machine-learning multivariate-analysis r regression statistics
Last synced: 05 Oct 2025
https://github.com/bitcoin-apps-suite/bitcoin-spreadsheet
Open source Bitcoin-powered spreadsheet application with blockchain data integration, smart contract calculations, and collaborative financial modeling | By THE BITCOIN CORPORATION LTD
bitcoin bitcoin-sv blockchain bsv cryptocurrency dapp data-analysis decentralized excel-alternative nextjs spreadsheet typescript web3-spreadsheet
Last synced: 05 May 2026
https://github.com/elcarrillo/computational_bootcamp_material
Material for a Computational Bootcamp
bootcamp-project computational-physics data-analysis data-visualization jupyter-notebooks
Last synced: 05 Oct 2025
https://github.com/harkishen/Agriculture-DS
An Agricultural based Mtech project, on Data Science, which predicts the growth of crops based on previous year records.
Last synced: 11 Dec 2025
https://github.com/shrunga92/restaurant_order_analysis_sql
This project is a structured SQL-based analysis of restaurant orders, aimed at deriving key insights from transactional data.
Last synced: 03 Jul 2025
https://github.com/egbe34/sql-portfolio
SQL portfolio showcasing business-focused queries for KPIs, retention, churn, RFM, and Pareto analysis. Built with sample commerce data for analytics and BI use cases.
bigquery business-intelligence churn-analysis cohort-analysis data-analysis kpi postgresql rfmsegmentation sql windowfunction
Last synced: 19 May 2026
https://github.com/josepablodmg/python--linear-regression---housing-exercise
A predictive analysis exploring the relationship between household characteristics and median income in California. Using linear regression, the project investigates whether blocks with fewer households correspond to higher median incomes.
california data-analysis data-science exploratory-data-analysis housing-data linear-regression machine-learning python regression scikit-learn statistics visualization
Last synced: 05 Oct 2025
https://github.com/jimartskenya/ai-code-context
🤖 Automate code documentation with AI to enhance understanding and streamline your workflow, saving time on unfamiliar codebases and projects.
ai claude-code codebase-analysis context-management data-analysis dependency-analysis gemini intellij-plugin jupyterlab-extension llm-integration machine-learning mcp-server open-source pandas prompt-engineering streamlit-component token-reduction vibe-coding
Last synced: 08 May 2026
https://github.com/anushkundu/crime-pattern-analysis
Analyzing Crime Patterns in Montgomery County, USA: An Inclusive Study Based on NIBRS Data (2016-2022)
data-analysis data-visualization descriptive-statistics matplotlib numpy pandas python seaborn
Last synced: 05 May 2026
https://github.com/andersoncrs/prediccion-del-precio-de-vehiculos-un-enfoque-con-regresion-lineal-y-regularizacion
Este proyecto tiene como objetivo predecir el precio de vehículos usados utilizando técnicas de regresión lineal y regularización Lasso. A través del análisis y procesamiento de datos, se construye un modelo predictivo preciso e interpretable basado en las características más relevantes de cada vehículo.
data-analysis data-exploration lasso-regression machine-learning polinomial-regression regularization-methods
Last synced: 03 Jul 2025
https://github.com/aryar-06/linear-regression
A Python project demonstrating basic linear regression with gradient descent and matrix operations, alongside scikit-learn comparison.
data-analysis data-preprocessing educational-project gradient-descent linear-regression machine-learning python regression-algorithms scikit-learn
Last synced: 05 May 2026
https://github.com/mumtaz4118/covid-19-data-visualization
covid-19-data-visulaization
covid-19 covid-data data-analysis data-analytics data-science machine-learning
Last synced: 23 Nov 2025
https://github.com/data-edd/mastering_sql
This is a repo documenting me mastering sql
data-analysis mysql mysql-database sql
Last synced: 06 Oct 2025