Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2025-02-15 00:07:13 UTC
https://github.com/jinkogule/multi-analyst
Multi Analyst is an easy-to-use data analysis tool that uses artificial intelligence to interpret the results of the analyses performed, returning useful insights to users.
apriori-algorithm bootstrap css data-analysis django html numpy open-ai pandas python web-application
Last synced: 03 Jan 2025
https://github.com/cjunwon/youtube-data-analysis
End-to-end YouTube data analysis project using the YouTube Data API, MySQL, AWS, and Flask
aws-rds data-analysis datapipeline flask nlp pandas python shell sql vader-sentiment-analysis youtube youtube-api
Last synced: 08 Feb 2025
https://github.com/namratha2301/best-selling-books
Comprehensive examination of best-selling books, focusing on understanding sales patterns, genre distributions, and the impact of various features on book performance. This project aims to predict book sales and classify genres, providing valuable insights for authors, publishers, and readers.
data-analysis data-visualization matplotlib pandas sckiit-learn seaborn
Last synced: 28 Jan 2025
https://github.com/gracysapra/r-in-data-science
This repository contains essential guides for data analysis using R, covering topics like data preparation, data reshaping, and data visualization. Each file focuses on fundamental techniques to manipulate, clean, and visualize data effectively using R programming.
data-analysis data-preparation data-reshaping data-science data-visualization data-visualizations ggplot r r-for-data-science
Last synced: 08 Feb 2025
https://github.com/ronaldkanyepi/python-sreamlit-duplicate-records-finder-remover
A duplicate-record remover for CSV, Excel, or TXT files, matching on one or more columns
css data-analysis data-visualization datascience python streamlit
Last synced: 04 Jan 2025
https://github.com/ronaldkanyepi/python-streamlit-covid-19-dashboard
A responsive Streamlit COVID-19 dashboard
analytics data data-analysis data-visualization datascience python streamlit
Last synced: 04 Jan 2025
https://github.com/wiseaidev/truth-guard
Analyzing a 79k Dataset of Misinformation and Fake News
data-analysis fastapi lstm machine-learning python supervised-learning
Last synced: 13 Feb 2025
https://github.com/bishtrishu/pizza_sales_data_analysis_sql
This project is a comprehensive data analysis of pizza sales, aimed at uncovering key insights and trends to inform business decisions. Using a combination of SQL, Python, and data visualization tools, the project analyzes sales data to understand customer preferences, peak sales periods, and the most popular pizza types.
cloud data data-analysis data-science data-visualization dataanalytics database mysql oracle-database
Last synced: 04 Jan 2025
https://github.com/nelsonkariuki/dataanalysis
This project involves data analysis of video game sales from https://www.kaggle.com/gregorut/videogamesales/download
data-analysis data-visualization python
Last synced: 10 Jan 2025
https://github.com/dr-saad-la/r-distilled
R Programming Language distilled
data data-analysis learning programming-language r rlanguage rprogramming statistical-analysis
Last synced: 04 Jan 2025
https://github.com/antonio-f/big-data-analysis-with-scala-and-spark
Coding assignments from the course "Big Data Analysis with Scala and Spark" (Coursera).
big-data bigdata coursera data-analysis scala spark
Last synced: 06 Feb 2025
https://github.com/oguzgn/budget-checker-for-campaign-budget-allocation
This project focuses on modeling campaign performance data for Looker, helping determine which campaigns to scale up or cut back. It aggregates metrics over the last 7 and 30 days, providing actionable insights for budget optimization and performance improvement.
budget-allocation budget-controller budget-management calculated-fields campaign-analytics data-analysis data-modeling looker-studio sql
Last synced: 07 Feb 2025
https://github.com/kaz-yos/distributed
Comparison of Privacy-Protecting Analytic and Data-sharing Methods: a Simulation Study (Pharmacoepidemiol Drug Saf 2018)
data-analysis epidemiology statistics
Last synced: 11 Jan 2025
https://github.com/bilal-belli/personalacademicdocuments
This repository contains some personal academic assignments, maybe it will help someone!
compilation computer-architecture data-analysis data-structures-and-algorithms database front-end hpc networking operating-systems signal-processing
Last synced: 17 Jan 2025
https://github.com/suhas-005/power-bi-dashboard
Power BI Dashboard Projects
data-analysis data-visualization dataset power-bi-project powerbi
Last synced: 07 Feb 2025
https://github.com/alexandrelamarre/fission
Data analytics & Structured streaming optimized for the Edge
data-analysis data-engineering rust structured-data unstructured-data
Last synced: 11 Jan 2025
https://github.com/moscarde/pyproductivity
Application uptime tracker that monitors active windows, automatically generating daily usage reports.
daily-report data-analysis python tracker
Last synced: 06 Feb 2025
https://github.com/ganesh2409/cricket-player-performance
This repository contains a comprehensive project focused on analyzing cricket player performance using various datasets, including batting, bowling, and match results. The project involves data preprocessing, feature engineering, and model training to predict and evaluate player performance scores. It includes detailed scripts for data analysis
cricket-performance-analysis data-analysis machine-learning sports-analytics
Last synced: 11 Jan 2025
https://github.com/leosimoes/uerj-tcc-analisador-dados-texto
Text of the final-year project (TCC) in computer engineering. Web application for data analysis.
data-analysis data-science data-visualization python streamlit
Last synced: 30 Jan 2025
https://github.com/ajimaulana123/e-commerce-data-analis
Analysis of an e-commerce dataset to determine which products customers buy most
Last synced: 28 Jan 2025
https://github.com/maskedsyntax/budget-pie
Android app to manage monthly budgets
android dart data-analysis data-visualization finance-management firebase flutter
Last synced: 12 Feb 2025
https://github.com/michenriksen/inspectra
A simple web app for data inspection.
data-analysis decoding web-tool
Last synced: 14 Jan 2025
https://github.com/metalwarrior665/actor-results-checker
apify data-analysis json-schema-checker
Last synced: 04 Jan 2025
https://github.com/agungbudiwirawan/sales-analysis-using-excel-formulas
The objective of this project is to analyze supermarket sales data using formulas in Microsoft Excel.
data-analysis excel excel-formulas microsoft-excel spreadsheet
Last synced: 06 Feb 2025
https://github.com/cano1998/eda-survival-of-the-titanic
This project focuses on Exploratory Data Analysis (EDA) to identify the key determinants that influenced survival during the infamous Titanic accident.
data-analysis data-cleaning data-preprocessing data-visualization exploratory-data-analysis jupyter-notebook titanic-survival-exploration
Last synced: 04 Jan 2025
https://github.com/walidalsafadi/titanic-disaster
In this challenge, we ask you to build a predictive model that answers the question: “what sorts of people were more likely to survive?” using passenger data (ie name, age, gender, socio-economic class, etc).
data-analysis data-science decision-trees eda gradient-boosting knearest-neighbors machine-learning-algorithms naive-bayes random-forest titanic-kaggle titanic-survival-prediction
Last synced: 22 Jan 2025
https://github.com/carlosvinimsouza/dataanalysiswithpython
Learn data analysis using Python libraries (NumPy, pandas, Matplotlib, and Seaborn)
data-analysis data-science free-code-camp matplotlib numpy pandas python python3 seaborn
Last synced: 11 Jan 2025
https://github.com/heiderjeffer/enhancing-digital-maturity-and-analytical-capabilities-of-smes
Research Proposals RP
analytics data-analysis data-driven digital framework jupyter modeling-and-simulation pyrhon quantative smes statistical-analysis stochastic-processes
Last synced: 08 Feb 2025
https://github.com/moindalvs/learn_eda_house_price_dataset
Dataset: House Prices: Advanced Regression Techniques. Exploratory data analysis on more than 80 features
cardinality data-analysis data-science data-structures data-visualization missing-values
Last synced: 18 Jan 2025
https://github.com/ehtisham-sadiq/building-an-ml-based-heart-disease-diagnosis-system-with-flask
It is an end-to-end project that combines machine learning to create a user-friendly Heart Disease Diagnosis System, powered by Flask.
data-analysis exploratory-data-analysis feature-engineering flask machine-learning model-building model-evaluation pipelines python3 rest-api
Last synced: 11 Jan 2025
https://github.com/nikhilash45/live_ipl_report
This repository hosts the source code for an interactive IPL (Indian Premier League) Dashboard built using PowerBI. The dashboard provides real-time updates on ongoing matches, including live scores, batting and bowling statistics for both teams, and the points table.
analysts cleaning-data cricket-data dashboard data data-analysis data-visualization dax powerbi
Last synced: 11 Jan 2025
https://github.com/thomascenni/anfavea-data-analysis
Data analysis with Pandas and Datapane.
Last synced: 30 Jan 2025
https://github.com/t-mohamed-shafeek/data-analysis-on-tamil-nadu-road-accidents
"Data Analysis on Tamil Nadu Road Accidents" is a project that analyzes data on road accidents in Tamil Nadu (one of the states of India) in 2020 and 2021. The dataset itself was created more recently (on February 15, 2023, sourced from the TN Police).
dashboard data-analysis data-science data-visualization jupyter-notebook tableau
Last synced: 07 Feb 2025
https://github.com/muneeb1030/eda-of-physionets-ecg
EDA of the PhysioNet dataset "A Large Scale 12 Lead Electrocardiogram Database for Arrhythmia Study 1.0.0". This project focuses on the preprocessing of electrocardiogram (ECG) signals and uses Principal Component Analysis (PCA) for dimensionality reduction.
12-lead-ecg data-analysis ecg-signal eda pca python3 wfdb
Last synced: 11 Jan 2025
https://github.com/codebyaadi/whatsapp-chat-analyzer
WhatsApp Chat Analyzer is a web app built with Streamlit and Python to analyze your WhatsApp conversations. Upload your chat data and gain valuable insights on message frequency, active participants, and more. Visualize your conversations with word clouds and charts. Explore and understand your chats effortlessly.
data-analysis data-science data-visualization numpy pandas pycharm-ide python python3
Last synced: 12 Feb 2025
https://github.com/brunomontezano/benzocovid
💊 Data Analysis Project of Benzodiazepines during COVID-19 Pandemic.
benzodiazepines covid-19 data-analysis
Last synced: 11 Jan 2025
https://github.com/elkronos/stat_py
Statistics functions for python
assumption-check data-analysis data-visualization python regression statistical-analysis statistical-inference statistical-models statistical-tests statistics
Last synced: 24 Jan 2025
https://github.com/jakubkorytko/data-graphs
Transform raw data into captivating visual stories with this app, and effortlessly craft stunning data charts that unveil insights and trends
charts data-analysis mit-license open-source
Last synced: 11 Jan 2025
https://github.com/phomint/udacity_dataanalysis
All projects and activities
data-analysis python udacity-nanodegree
Last synced: 15 Jan 2025
https://github.com/sufiyanahmed4566/sql-musicmaven
This Music Store Database Project showcases SQL skills through comprehensive database design, query optimization, and data analysis. Includes ER diagram, database file, query questions (Easy, Medium, Hard), answered queries, and CSV table data. Ideal for recruiters seeking skilled SQL developers for music store management and data analysis.
data-analysis database insights mysql-database oracle-database relational-databases sql
Last synced: 24 Jan 2025
https://github.com/ayu-hack/ayu-hack
Enthusiastic learner passionate about building software and exploring the world of technology. Eager to contribute to open-source projects and collaborate with the developer community. Continuously developing my skills in Python, SQL, HTML, CSS, Power BI, and macOS. Always open to feedback and excited to keep growing!
config css data-analysis github-config html powerbi-desktop python3 sql
Last synced: 06 Feb 2025
https://github.com/subhojit45/python3-iphones-x-flipkart-sales-analysis
Six simple questions, and the insights derived from them, on a dataset of iPhone sales on Flipkart.
data-analysis jupyter-notebook python3 visual-studio-code visualization
Last synced: 24 Jan 2025
https://github.com/gab-182/market-analysis-report-for-national-clothing-chain
Using custom M and DAX code in Power BI, I conducted a thorough market analysis for a national clothing chain. The insights gathered from customer data and US Census Bureau statistics led to the formulation of a targeted marketing strategy, contributing to enhanced sales and customer satisfaction.
Last synced: 18 Jan 2025
https://github.com/derrickbaruga7/mapping-median-age-europe
An R project that creates an interactive map of the median age across European regions using Eurostat data and spatial visualization packages.
data-analysis data-science data-visualization datascience european-union mapping r
Last synced: 30 Jan 2025
https://github.com/aekanshd/crazytics-suicidesindia
Basic interpretation of the Suicides in India data-set using R.
data-analysis data-science graph india r suicides
Last synced: 15 Jan 2025
https://github.com/is-leeroy-jenkins/sherpa
A budget execution & data analysis tool for EPA analysts, built on WinForms and .NET 6 and written in C#
budget-management data-analysis data-science data-visualization federal-government
Last synced: 12 Jan 2025
https://github.com/shliakhovai/sales-analysis-project
This project involves analyzing sales data to gain insights into sales trends, performance metrics, and product categories. The analysis includes data cleaning, exploratory data analysis (EDA), sales trend analysis, profit dependency analysis, and ABC analysis.
abc-analysis data-analysis data-science data-visualization eda exploratory-data-analysis jupyter-notebook python
Last synced: 12 Jan 2025
https://github.com/christianrcanlas/christianrcanlas.github.io
e-Portfolio showcasing my personal projects.
arima classification-algorithims crostons-method data-analysis data-visualization data-warehousing etl-pipelines hierarchical-forecasting holt-winters long-short-term-memory machine-learrning ms-sql-server predictive-analytics python r-markdown support-vector-regression t-sql tableau time-series-decomposition time-series-forecasting
Last synced: 18 Jan 2025
https://github.com/shrawans007/data_science_students
Customer Engagement Analysis in Excel for 365datascience.com
2021-2022 365datascience case-study case-study-analysis case-study-project customer-engagement-analysis data-analysis data-analytics data-science data-science-students free-plan indian-students microsoft-excel ms-excel ms-excel-addin ms-excel-data-analytics ms-excel-task paid-plan us-students
Last synced: 17 Jan 2025
https://github.com/mdaffailhami/customer-data-analysis
This repository contains code and analysis for exploring customer data, focusing on profiling and contact preferences. The project includes various stages of data processing, from raw data preparation to final cleaned datasets, and employs Python and popular data analysis libraries to uncover insights and trends.
data-analysis data-cleaning data-science data-visualization jupyter jupyter-notebook pandas plotly python
Last synced: 12 Jan 2025
https://github.com/abdelhakim-gh/machine-learning_data-analysis_project
Recognizing handwritten numbers and comparing life expectancy vs. fertility across regions in 1960 and 2013
data-analysis jupyter-notebook machine-learning python r r-studio
Last synced: 30 Jan 2025
https://github.com/chengkangzai/malaysia-pandemic-dashboard
covid-19 data-analysis pandemic-dashboard
Last synced: 03 Feb 2025
https://github.com/johnsesana/eda-video-game-sales
Exploratory Data Analysis on Public Datasets
data-analysis data-visualization excel
Last synced: 17 Jan 2025
https://github.com/roland045/road_quality_measurement_analysis
Novel ML-based road quality measurement system for cost-effective pavement monitoring
azure data-analysis data-engineering data-science machine-learning mlops model-deployment python sql unsupervised-learning
Last synced: 24 Jan 2025
https://github.com/sevdanurgenc/python-for-data-science-lecture-notes
This repo holds the course contents of the Python for Data Science training given to Siemens in cooperation with Academy Peak Information Technologies Training and Consultancy between 28 June and 1 July 2022.
data-analysis data-mining data-modeling data-science data-structure data-visualization matplotlib-tutorial numpy-tutorial pandas-tutorial
Last synced: 29 Jan 2025
https://github.com/rishabhraj43/diwali-sales-analysis
A Data Analysis project made in Python
Last synced: 12 Jan 2025
https://github.com/hayatiyrtgl/data_analysis_project
Financial data analysis: preprocess, visualize, calculate technical indicators.
data-analysis data-analysis-python data-science dataframe numpy pandas python python3 stock-price-prediction talib trade-analysis
Last synced: 14 Feb 2025
https://github.com/ituvtu/datamining-ab-testing
This project focuses on conducting A/B testing to evaluate the effectiveness of two marketing campaigns. Using statistical analysis and hypothesis testing, we determine which campaign is more effective in improving conversion rates.
a-b-testing data-analysis data-analysis-python data-mining ipynb jupyter jupyter-notebook python
Last synced: 16 Jan 2025
https://github.com/codingprivacy/feedback-portal-system
AI-based Feedback Portal System that collects periodic feedback from users via a highly human-friendly chatbot, analyzes the responses through NLP and sentiment analysis, and visualizes the analysis on the portal website.
artificial-intelligence bokeh chatbot data-analysis flask mysql-database nlp portal python sentiment-analysis visualization website
Last synced: 12 Jan 2025
https://github.com/programmer-rd-ai/dimensionality-reduction
DimRed is a comprehensive Python toolkit for advanced dimensionality reduction, integrating with major machine learning libraries and featuring real-time performance monitoring to enhance data analysis and model efficiency.
analytics data-analysis data-science lightgbm machine-learning matplotlib numpy pandas programming python python3 sklearn university xgboost
Last synced: 12 Jan 2025
https://github.com/prankshaw/election-analytica
Analyzing previous Haryana Vidhan Sabha election results and other factors, and comparing them against various parameters to draw conclusions.
anaconda collection data-analysis data-science data-visualization elections jupyter-notebook python python-3 wrangling
Last synced: 30 Jan 2025
https://github.com/arielle0222/data_analysis
📊 Data analysis projects for autonomous driving and smart mobility engineering using Python and SQL.
autonomous-driving composite data-analysis electric-vehicles environmental-data python visualizatoin
Last synced: 06 Feb 2025
https://github.com/umutsevdi/hr-management
HR Management, Analytics and Salary Determination System
analytics data-analysis java java17 postgresql python spring spring-boot vaadin vaadin-flow
Last synced: 18 Jan 2025
https://github.com/miroslav-reiter/kurz_jazyk_sql_analytici_datovi_vedci
Materials for the course SQL Language 1 for Analysts and Data Scientists
analysis analytics data data-analysis data-science database mysql reiter sql
Last synced: 22 Dec 2024
https://github.com/jakobzmrzlikar/fake-news-analysis
An analysis of the FakeNewsNet dataset using NLP techniques.
data-analysis fake-news ipynb-jupyter-notebook nlp-machine-learning
Last synced: 12 Jan 2025
https://github.com/alejo1630/sport_stats
Data analysis of information from the summer and winter Olympic games over the years. UC Davis SQL Specialization Final Project
data-analysis jupyter-notebook olympics-dataset plotly python seaborn sql
Last synced: 31 Dec 2024
https://github.com/prernarohra/mental-health-prediction
This project focuses on predicting mental health outcomes using machine learning algorithms. By analyzing various psychological, social, and lifestyle factors, the model aims to identify individuals at risk, enabling early intervention and support.
data-analysis data-science data-visualization machine-learning mental-health python
Last synced: 23 Jan 2025
https://github.com/mohd-faizy/08p_covid19_data_analysis_using_python
Data analysis of the COVID-19 dataset published by Johns Hopkins University
covid19-data data-analysis data-analysis-python data-visualization happiness-report-dataset pandas python seaborn statistics
Last synced: 12 Jan 2025
https://github.com/prernarohra/quakeguard
QuakeGuard is an innovative project for reducing earthquake intensity and structural damage. It takes a proactive approach to seismic activity, by using complex algorithms and real-time data to improve safety and resilience for people in earthquake-prone areas.
artificial-intelligence backend data-analysis data-science earthquake-intensity final-year-project front-end geology machine-learning open-source python visualization
Last synced: 23 Jan 2025
https://github.com/shivshah19/movie-recommendation-system
This Movie Recommendation System is designed to provide personalized movie recommendations based on user preferences.
cosine-similarity data-analysis machine-learning pandas python streamlit
Last synced: 12 Feb 2025
https://github.com/alejo1630/chicago_crimes
A Jupyter Notebook with the data analysis and data visualization of crimes in Chicago from 2017 to 2023 using libraries such as seaborn and folium
data-analysis data-visualization folium pandas python seaborn
Last synced: 31 Dec 2024
https://github.com/denizkarya1999/investor_data
Analyzing investor data (CIS 422 Term Project)
academic-project data-analysis database-management investments money research young-investors
Last synced: 06 Feb 2025
https://github.com/graphieros/data-visualisation
data visualisation solutions in vanilla js
data-analysis data-visualization pure-javascript svg-manipulating
Last synced: 07 Feb 2025
https://github.com/shuklayash02/data_analysis_using_r
COVID-19 data cleaning and analysis, in which age at death and deaths by gender are cleaned and analyzed
analysis cleaning-data data-analysis data-visualization rprogramming
Last synced: 23 Dec 2024
https://github.com/karthikmprakash/911-call-dataanalysis
Data Analysis of Emergency (911) Calls: Fire, Traffic, EMS for Montgomery County, PA
911-call-analysis data-analysis data-visualization python3 united-states-data
Last synced: 09 Jan 2025
https://github.com/phillbertnevinemmanuel/automotivesalesdataanalysis
This marks my inaugural venture into personal data analysis, employing SQL and Python for Correlation Analysis. I've sourced the dataset from Kaggle, specifically focusing on automotive sales. You can find the dataset linked on my website below. I'm excited to share that I've independently managed the majority of tasks involved in this project.
data-analysis dataset microsoft-sql-server python python-lambda sql ssms tsql
Last synced: 18 Jan 2025
https://github.com/dhairyac/customer-churn-prediction
Analyze, visualize and predict customer churn using Machine Learning
data-analysis data-visualization ensemble-classifier machine-learning performance-metrics python-3 random-forest-classifier softmax-regression svm-classifier
Last synced: 22 Jan 2025
https://github.com/17bit0216/machine-learning
All of my data analysis and Machine learning Projects.
analysis data-analysis linearr logistic logisticregression machine-learning python3 random-forest
Last synced: 18 Jan 2025
https://github.com/anilkumarteegala/aspiration.ai-ml-internship
This repo contains the internship project by Career Launcher.
data-analysis data-science financial internship machine-learning python3 stock-analysis stock-market visualization
Last synced: 13 Nov 2024
https://github.com/mafda/seattle_airbnb_data_analysis
This repository contains a comprehensive analysis of the Seattle Airbnb dataset, conducted using the CRISP-DM (Cross Industry Standard Process for Data Mining) methodology.
crisp-dm data-analysis data-science jupyter-notebook pandas-python seattle-data
Last synced: 17 Jan 2025
https://github.com/pranavarora1895/proteintypeprediction
Data Analysis on Protein Type Prediction
bioinformatics data-analysis supervised-learning
Last synced: 18 Jan 2025
https://github.com/bretsw/beds
Bookdown project for an open education resource (OER) book: Becoming Educational Data Scientists
analytics data-analysis data-analytics data-science
Last synced: 06 Feb 2025
https://github.com/ansh420/mcdonald_case-study
A McDonald's case study based on market segment analysis.
algorithms-implemented data-analysis python3 segmentation
Last synced: 19 Jan 2025
https://github.com/robcyberlab/linear-regression-application
🔢Linear Regression Application💻
artificial-intelligence data-analysis data-science data-visualization linear-regression machine-learning python python-programming regression-analysis statistics
Last synced: 06 Feb 2025
https://github.com/robcyberlab/machine-learning-classifier
🤖Machine Learning Classifier⚙️
ai artificial-intelligence classifiers data-analysis data-science deep-learning digit-recognition machine-learning pca-algorithm python svm-classifier
Last synced: 06 Feb 2025
https://github.com/nhsdigital/sde_summary_notebooks
Notebooks provided by the Wranglers for users to quickly gain insights on datasets inside the Secure Data Environment (SDE)
data-analysis data-linkage data-quality data-summary metrics statistics
Last synced: 23 Dec 2024
https://github.com/ankit21111/filmilytics
This repository contains data and analysis on RSVP Movie House Production, focusing on past performance metrics and audience trends. Our goal is to derive actionable insights that can guide future productions for greater success. Explore the data, analysis scripts, and recommendations to understand how RSVP can thrive in the film industry.
data-analysis database database-design database-schema erdiagram sql
Last synced: 19 Jan 2025
https://github.com/aisurjyasamantaray/sales-perfomance-analysis-dashboard
A comprehensive sales performance analysis dashboard built using Python, and visualization tools. This project includes data cleaning, descriptive statistics, correlation analysis, and insights into sales trends, profitability, and the impact of discounts. Key features include interactive visualizations using Seaborn, and Matplot
analytics annova data data-analysis data-visualization-project dataproject eda hypothesis-testing pandas-dataframe python sales-performance-analysis statistics
Last synced: 19 Jan 2025
https://github.com/aryansharma5/data-visualization-and-thorough-analysis
comprehensive guide for data analysis and visualization
data-analysis data-visualization
Last synced: 24 Jan 2025
https://github.com/salman-khan-mohammed/predicting-the-intent-of-online-shoppers
This project aims to predict online shoppers' purchase intentions using browsing history and user data from e-commerce sites. By analyzing clickstream and session information, the goal is to create a machine learning model that accurately forecasts customers' likelihood of making a purchase.
cluster-analysis data-analysis data-pre eda outliers prediction
Last synced: 31 Jan 2025
https://github.com/mrankitgupta/titanic-survival-prediction-93-xgboost
Titanic Survival Prediction Project (93% Accuracy) 🛳️ In this notebook, the goal is to correctly predict whether someone survived the Titanic shipwreck using different machine learning models and hyperparameter tuning.
classification data-analysis data-science data-visualization gradient-boosting kaggle-competition linear-regression logistic-regression machine-learning machine-learning-algorithms ml ml-models nlp prediction predictive-modeling random-forest titanic titanic-kaggle titanic-survival-prediction xgboost
Last synced: 17 Jan 2025
https://github.com/mafesan/2021-tfm-code
Revelio: Machine-Learning classifier to identify Bots integrable with GrimoireLab
bot-accounts data-analysis data-analytics data-science grimoirelab machine-learning metrics open-source open-source-community project-health python scikit-learn
Last synced: 14 Feb 2025
https://github.com/prangonghose/analysis_of_bangladesh_economic_complexity
In this project, our team presents a brief analysis of the export economy of Bangladesh over the past three decades.
data-analysis data-science data-visualization inequalipy matplotlib pandas plotly
Last synced: 19 Jan 2025
https://github.com/zen204/airbnb-availability
A machine learning model that predicts Airbnb listing availability, utilizing feature engineering and supervised learning techniques to improve guest experience and optimize host management.
binary-classification data-analysis data-preprocessing data-visualization feature-engineering machine-learning matplotlib model-evaluation nlp pandas predictive-modeling python scikit-learn seaborn supervised-learning
Last synced: 13 Feb 2025
https://github.com/olgapavlova/agile-health-hackathon
Visualizing the health of development sprints from raw data
data-analysis data-visualization figma google-sheets matplotlib pandas python sql
Last synced: 19 Jan 2025
https://github.com/john-science/data_science_by_example
Examples of Data Science Tools & Libraries
data-analysis data-science ipython pandas
Last synced: 18 Nov 2024
https://github.com/datawithbaraa/sql-modern-warehouse-and-analytics
A comprehensive guide to building a modern data warehouse with SQL Server, including ETL processes, data modeling, and analytics.
data-analysis data-analytics data-cleaning data-engineering data-lake data-lakehouse data-science data-warehouse data-warehousing database datalake datascience datawarehouse datawarehousing etl medallion-architecture pipeline sql sql-query sql-server
Last synced: 23 Dec 2024