Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2025-02-20 00:07:05 UTC
- JSON Representation
https://github.com/gkn-tech/brisecheck_website
Web Crawler, Visualizations and Game
choropleth-map contact-form data-analysis data-visualization game-development pygame python-flask scatter-plot web-crawler web-scraping
Last synced: 08 Jan 2025
https://github.com/virajbhutada/creditcard_complaints_analysis
A comprehensive repository dedicated to the Credit Card Complaints Analysis Dashboard in Tableau. Explore insightful visualizations, trends, and analytics related to credit card complaints. Stay informed and make data-driven decisions with this powerful Tableau analysis tool.
analytics complaints credit-card dashboard data-analysis data-visualization insights tableau visualizations
Last synced: 10 Jan 2025
https://github.com/projects-developer/full-stack-network-intrusion-detection-system-using-machine-learning
The project aims to design and develop a full-stack network intrusion detection system using machine learning techniques.
algorithms cybersecurity data-analysis full-stack-development intrusion-detection-system machine-learning network-intrusion-detection network-security web-development
Last synced: 05 Jan 2025
https://github.com/josericodata/josericodata
Adding a cool README file
big-data data-analysis data-science dublin hadoop hadoop-mapreduce hadoop-spark ireland jobsearch jobseeker portfolio portfolio-data-science portfolio-website python sql
Last synced: 08 Jan 2025
https://github.com/leeway64/lwwordcounter
C++ application that analyzes the frequency of words in a text file
bson cmake conan cpp data-analysis json json-schema text-analysis ubjson
Last synced: 02 Jan 2025
https://github.com/quocduyenanhnguyen/human-trafficking-analysis
I analyzed human trafficking data
data-analysis data-analytics data-visualization human-trafficking mysql mysql-database mysql-workbench query sql tableau tableau-dashboards tableau-public
Last synced: 02 Jan 2025
https://github.com/ireneflorez/exploration_r
Data exploration on the 'White Wine Quality' dataset using R
data-analysis data-visualization r
Last synced: 11 Jan 2025
https://github.com/edwinrlambert/investigating-netflix-movies
Demonstrates data analysis and visualization techniques for Netflix movies using Python in a Jupyter notebook. This is a DataCamp project.
data-analysis data-analysis-python netflix python
Last synced: 18 Jan 2025
https://github.com/ireneflorez/nypd-mvc
Analysis of NYPD Motor Vehicle Collisions
basemap data-analysis folium jupyter-notebook matplot pandas python
Last synced: 11 Jan 2025
https://github.com/virajbhutada/telecom-customer-churn-prediction
Predict and prevent customer churn in the telecom industry with this project. Harness the power of advanced analytics and Machine Learning on a diverse dataset to develop a robust classification model. Gain deep insights into customer behavior and identify critical factors influencing churn using interactive Power BI visualizations.
churn-prediction classification-models customer-attrition-analysis customer-churn-prediction data-analysis data-science decision-tree-classifier eda logistic-regression machine-learning machine-learning-algorithms machine-learning-models pandas powerbi powerbi-desktop python random-forest-classifier roc-curve xgboost-classifier
Last synced: 10 Jan 2025
https://github.com/ct83/become-a-data-analyst-udacity
This repository contains all of the code, projects and reports that I wrote as I pursued my Udacity - Data Analyst NanoDegree.
data-analysis data-analysis-python data-analyst data-visualisation data-visualization-project datascience python udacity udacity-data-analyst-nanodegree
Last synced: 13 Jan 2025
https://github.com/luminati-io/Target-dataset-samples
A sample dataset of over 1000 target products, extracted using the Bright Data API, ideal for brand reputation, tracking inventory, and optimizing prices.
api data-analysis data-mining datasets target web-scraper web-scraping
Last synced: 06 Nov 2024
https://github.com/luminati-io/Amazon-dataset-samples
A sample dataset of over 1,000 Amazon product listings, extracted using the Bright Data API, perfect for competitive analysis, market trends, and eCommerce insights.
amazon api data-analysis data-science dataset ecommerce products web-scraping
Last synced: 06 Nov 2024
https://github.com/luminati-io/Indeed-dataset-samples
A sample dataset of over 1000 Indeed job listings, extracted using the Bright Data API, ideal for market analysis and growth.
api data-analysis datasets indeed jobs web-scraping
Last synced: 06 Nov 2024
https://github.com/luminati-io/Walmart-dataset-samples
A sample dataset of over 1000 Walmart products, extracted using the Bright Data API, ideal for consumer market insights and competitor analysis.
api data-analysis dataset walmart walmart-scraper web-scraping
Last synced: 06 Nov 2024
https://github.com/luminati-io/Shopee-dataset-samples
A sample dataset of over 1000 Shopee products, extracted using the Bright Data API, ideal for pricing optimization, gap analysis, and market strategy refinement..
api data-analysis data-mining datasets products shopee web-scraping
Last synced: 06 Nov 2024
https://github.com/luminati-io/Airbnb-dataset-samples
A sample dataset of over 1000 Airbnb listings, extracted using the Bright Data API, ideal for competitor tracking, brand reputation, and market analysis.
airbnb airbnb-listings api data-analysis datasets web-scraper web-scraper-api web-scraping
Last synced: 06 Nov 2024
https://github.com/edwinrlambert/exploring-airbnb-market-trends
Dive into NYC's Airbnb market trends through detailed analysis of listings data, including prices, types, and review dates. This is a DataCamp project.
airbnb data-analysis jupyter-notebook market-trends python
Last synced: 18 Jan 2025
https://github.com/lunarwhite/lake-george-viz
Geroge Lake data analysis and visualization, ANU COMP1730/6730
Last synced: 17 Feb 2025
https://github.com/cnoret/retail-data-analysis
Let's analyze historical sales data from a large retail chain and predict weekly sales using machine learning on a Streamlit web app
data-analysis data-analyst data-science data-vizualisation pandas python streamlit streamlit-webapp
Last synced: 09 Feb 2025
https://github.com/cnoret/ibm-data-analyst-professional
WIP Final project for the IBM Data Analyst Professional Certificate.
analyzing-data data-analysis data-analyst data-manipulation data-science data-visualization ibm-certificate ibm-data-analyst-professional ibm-datascience-certification pandas python
Last synced: 09 Feb 2025
https://github.com/fanisgl/video-games-sales-data
Data Analysis of Sales Dataset using Python.
data-analysis data-science data-visualization dataset jupyter-notebooks matplotlib numpy pandas poisson-distribution python python3 sales statistics
Last synced: 08 Jan 2025
https://github.com/ttwag/p9_pandas
Problems that Introduce the DataFrame Object in Python's Pandas Library
data-analysis pandas-dataframe python
Last synced: 17 Feb 2025
https://github.com/kheriberto/pandas_and_seabron_project
In this project I showcase my ability using pandas and seaborn to mold, transform and plot data.
data-analysis pandas python seaborn
Last synced: 08 Jan 2025
https://github.com/kheriberto/bedu_dc
Ejercicios del curso de "python desde 0" de la plataforma BEDU
Last synced: 08 Jan 2025
https://github.com/sharoonjoseph321/indian-liver-diseases
Indian Liver Disease Analysis and Prediction This project leverages the Indian Liver Patient Dataset (ILPD) to analyze liver disease trends and develop predictive models for early diagnosis. Through data preprocessing, exploratory analysis, and machine learning, it identifies key risk factors and builds classification models
data-analysis data-science data-visualization logistic-regression machine-learning pandas python seaborn
Last synced: 21 Jan 2025
https://github.com/chitranjan806/predicting-on-time-premium-deposits
A Predictive analysis project to predict the success rate of On-Time deposits of Premiums by Policy Holders.
analytics-vidhya analytics-vidhya-competition catboostregressor data-analysis data-science linear-regression logistic-regression python3
Last synced: 08 Jan 2025
https://github.com/farizsidki/proyek-analisis-data
Submission Dicoding Indonesia "Proyek Analisis Data"
data-analysis data-visualization exploratory-data-analysis jupyter-notebook python streamlit
Last synced: 31 Oct 2024
https://github.com/vvipjain/hockey-tournament-analysis
Hockey Tournament Analysis
beautifulsoup data data-analysis data-visualization databases pandas pandas-dataframe powerbi python python-library python-script requests-library-python sql sql-server sqlalchemy
Last synced: 11 Feb 2025
https://github.com/andimashkulli/vpms
Vehicle Parking Management System for Gjon Buzuku Gymnasium
backend-api data-analysis databases frontend-react mongodb nodejs software
Last synced: 11 Feb 2025
https://github.com/panoschatzi/healthcare_and_bioinformatics_analyses
Healthcare and Bioinformatics data analysis projects with Python and SQL.
data-analysis data-cleaning data-visualisation jupyter matplotlib mysql pandas plotly python seaborn sql
Last synced: 16 Jan 2025
https://github.com/samuelsoaress/wkd-default-reduction
reduction of default from 35% to 25% or less with machine learning techniques
data-analysis data-exploration data-science machine-learning-algorithms
Last synced: 08 Jan 2025
https://github.com/samuelsoaress/predict-future-sales
Machine Learning applied to sales forecast
data-analysis data-mining data-science data-visualization forecasting-models
Last synced: 08 Jan 2025
https://github.com/risdorn/restaurant-delivery-platforms-analysis-bdm-project
This project analyzes restaurant delivery platforms to understand customer preferences, industry competition, and expansion opportunities. Conducted as part of the BDM project from IITM, it includes descriptive stats, distribution, correlation, regression, and geospatial analysis using multiple datasets.
data-analysis data-visualization jupyter-notebook kaggle
Last synced: 31 Oct 2024
https://github.com/thesfinox/fit-the-data
Data analysis using Wolfram Mathematica
analysis data data-analysis lab mathematica wolfram wolfram-mathematica
Last synced: 17 Feb 2025
https://github.com/swapnanildutta/prediction-with-python
The projects are made using Jupyter Notebook
data-analysis jupyter-notebook machine-learning prediction python regression-models
Last synced: 08 Jan 2025
https://github.com/nouman6093/advanced-statistical-models
in this repository i will upload everything i have learned about data science advanced statistical models. there are over 42 statistical models. each of them work on algorithms. and there are over 32 algorithms. each library has its own way of writing such statistical models. after learning i will try to upload as much statistical models as possibl
data data-analysis data-science data-visualization
Last synced: 14 Jan 2025
https://github.com/thesfinox/mltools
A collection of simple tools for data science and machine learning projects.
ai data-analysis data-science data-visualization logging machine-learning matplotlib neural-network python toolbox
Last synced: 17 Feb 2025
https://github.com/athari22/applied-data-science-capstone
Applied-Data-Science-Capstone
api classification data-analysis data-cleaning data-collection data-science data-scraping data-visualization data-wrangling knn machine-learning sql
Last synced: 30 Dec 2024
https://github.com/athari22/investigating-netflix-movies-and-guest-stars-in-the-office
Apply basic Python skills in Introduction to Python and Intermediate Python by processing and visualizing film and television data.
data-analysis data-science data-visualization loop loops matplotlib matplotlib-pyplot netflix numpy office pandas python
Last synced: 30 Dec 2024
https://github.com/siddharthbadal/youtubeapi-dataanalysis
YoutubeAPI-Data Analysis
data-analysis jupyter-notebook matplotlib pandas python seaborn
Last synced: 26 Jan 2025
https://github.com/siddharthbadal/nba-data-analysis
Basketball refernece web scraping and data analysis
data-analysis data-visualization pandas python
Last synced: 26 Jan 2025
https://github.com/ddsuhaimi/turkiye-student-evaluation-eda
A little bit of exploration of well-known Turkiye Student Evaluation dataset
data-analysis data-science data-visualization-project exploratory-data-analysis exploratory-data-visualizations
Last synced: 05 Jan 2025
https://github.com/navp7/roadaccident_powerbi
An interactive Power BI dashboard designed to analyze road accident data
dashboards data-analysis data-visualization powerbi
Last synced: 17 Feb 2025
https://github.com/hecatops/ad_libs
A real time advertisement data analytics platforming, displaying important metrics in easy to understand language.
dashboard data-analysis data-visualization kpi plotly-dash python
Last synced: 18 Feb 2025
https://github.com/tanaybhadula/twitter-trends-dashboard
An interactive dashboard to visualizes data on current Twitter trends by country and globally. Collects data of over 60 countries using the python Tweepy library, processed it,and visualized it in the form of bar chart and pie chart using the Plotly Dash framework.
dash dashboard data-analysis data-visualization plotly python trends twitter
Last synced: 10 Jan 2025
https://github.com/siddharthbadal/kpmgdataanalysisproject
Data Analytics Consulting Virtual Internship
data-analysis data-cleaning data-visualization googlestudio msexcel powerpoint
Last synced: 26 Jan 2025
https://github.com/cosmoduende/r-ggcats
StrangeR things: Adding… Cats? to your plots on R. How to analyze and visualize data with the help of funny cats with the ”ggcat” package.
data-analysis data-analytics data-science data-visualisation data-visualization data-viz dataviz ggcats r-language r-library r-package r-programming r-scripts r-studio rstats rstudio
Last synced: 18 Feb 2025
https://github.com/cosmoduende/r-earthquakes
Análisis y visualización de datos de actividad sísmica en México con R. Cómo analizar y visualizar la historia sísmica de México con datos del SSN (Servicio Sismológico Nacional)
data-analysis data-analytics data-science dataviz earthquakes r-code r-programming r-studio rstudio sismo sismologia sismos ssn ssnmx terremoto terremotos
Last synced: 18 Feb 2025
https://github.com/ntaraujo/cleo
Contact data processor for Cléo
contacts-manager data-analysis data-visualization whatsapp whatsapp-web
Last synced: 27 Jan 2025
https://github.com/nsandoya/python_scrp_project
This is a tool specially made for Dipaso ecommerce website. You can extract data from there, analyze it and see keywords, brands, and categories frecuency, prices distribution and other market tendencies as well —all in a group of friendly stadistic tables and graphics (exported from a Jupyter notebook) :)
beautifulsoup4 data data-analysis jupyter-notebook pandas python3
Last synced: 14 Jan 2025
https://github.com/whis99/data_analysis_journey
A repositories of my data analysis projects.
data data-analysis data-analysis-python data-visualization dataset jupyter-notebook matplotlib python visualization
Last synced: 13 Jan 2025
https://github.com/ianfelps/jornada_python
Projetos realizados durante a Jornada Python da Hashtag Treinamentos em maio de 2024.
artificial-intelligence automation data-analysis python
Last synced: 21 Jan 2025
https://github.com/itrauco/data-dirtying-tool
a simple command line tool to generate dirty data and do common data things in google cloud
data data-analysis data-engineering data-ops data-pipeline data-science data-visualization data-wrangling dirty-data google-cloud machine-learning
Last synced: 06 Jan 2025
https://github.com/antoniszks/music-category-identifier
A 'Data-Science & Machine Learning' project where we are training a neural network to identify what kind of music we give to it. Based on a university project.
ai artificial-intelligence data-analysis data-science jupyter-notebook machine-learning ml notebook python
Last synced: 08 Jan 2025
https://github.com/antoniszks/data-analysis-problems
A repository containing real world data analysis python notebooks
data-analysis data-analysis-python data-science jupyter-notebook python real-world-data statistics
Last synced: 08 Jan 2025
https://github.com/puspacempaka/superstore-analysis-with-sql
This repository showcases various data analyses on the popular Superstore dataset using SQL queries. The analyses cover a range of business insights, including sales performance, customer segmentation, and product profitability. Each analysis is documented with the SQL queries used and explanations of the steps involved.
business-intelligence data-analysis sales-analysis sql superstore-dataset
Last synced: 16 Jan 2025
https://github.com/weybsonalves/prevendo-o-atrito-de-clientes
Projeto em que percorro as etapas que compõem o ciclo de vida da ciência de dados a fim de prever o atrito de clientes do serviço de cartões de crédito de um banco.
data-analysis data-science data-visualization machine-learning python
Last synced: 16 Jan 2025
https://github.com/shubham200137/icc-women-s-t20-world-cup-data-analytics
Created a Power BI report to identify top 11 players for a T20 cricket team by scraping data from espncricinfo with Python, cleaning and transforming the data with pandas, and evaluating various player performance metrics.
beautifulsoup4 data-analysis data-visualization numpy-python pandas-python powerbi web-scraping
Last synced: 08 Jan 2025
https://github.com/shubham200137/customer-churn-analysis
In this case study, we analyze customer churn for a telecom company serving Southern California. The company faces increased competition and wants to retain customers by understanding the reasons for churn. Our objectives include improving service quality, identifying churn factors, pinpointing attractive services, and retaining high LTV customers.
data-analysis data-visualization numpy-python pandas-python sqlite tableau
Last synced: 08 Jan 2025
https://github.com/shubham200137/cyclistic-case-study
This repository contains a case study for Google's Data Analytics Professional Certificate, focusing on Cyclistic, a fictional bike sharing company in Chicago. The case study aims to drive growth by converting casual riders into members through a marketing strategy.
data-analysis data-visualization numpy-python pandas-python presentation-slides sql tableau
Last synced: 08 Jan 2025
https://github.com/faisal-fida/box-office-mojo-analysis
Analyzed box office data from Box Office Mojo, exploring relationships between worldwide revenue, release year, and a combined score that considers both factors. It includes visualizations like scatter plots, bar charts, and identifies top and bottom performing movies.
box-office data-analysis data-science python revenue-prediction visualization
Last synced: 08 Jan 2025
https://github.com/blackcub3s/msc-finalthesis
The most important programming files, code functions and data processing pipelines for the Machine learning final thesis of my Master's degree. Also, the LaTeX code of the thesis.
data-analysis latex machine-learning numpy python sklearn
Last synced: 05 Jan 2025
https://github.com/madeiradata/microsoft-data-analysts-club
Open-source Repository of Useful Scripts and Solutions for Microsoft Data Analysts
data-analysis data-visualization microsoft-data-analysis powerbi powerbi-report
Last synced: 16 Feb 2025
https://github.com/r13i/cheapest-phone-call
Small challenge to find the best phone operator to use based on call price
big-data big-data-analytics cheapest data-analysis data-cruncher pandas phone-number pricelist
Last synced: 02 Feb 2025
https://github.com/jofaval/80-cereals
Data Analysis into almost 80 USA cereals user rating in 1993
cereals classification data-analysis data-science data-visualization google-colab kaggle linear-regression logistic-regression machine-learning matplotlib python regression scikit-learn seaborn
Last synced: 04 Feb 2025
https://github.com/jofaval/california-housing-pricing
Data Analysis about the California Housing Pricing in 1997
data-analysis data-science data-visualization deep deep-learning deep-neural-networks google-colab keras machine-learning matplotlib python regression scikit-learn seaborn tensorflow
Last synced: 04 Feb 2025
https://github.com/sanveed-adnan/supermarket-sales-sql-project
SQL-based data analysis project on supermarket sales performance using SQLite and Power BI.
business-intelligence data-analysis data-science data-science-projects data-visualization power-bi sales-data sql sqlite
Last synced: 18 Feb 2025
https://github.com/loginchik/mid_contracts
Анализ контрактов государственных закупок МИДа РФ
data-analysis dataset pandas python
Last synced: 08 Nov 2024
https://github.com/chanmeng666/mnist-handwritten-digit-recognition-project
A comprehensive implementation and analysis of handwritten digit recognition using multiple neural network architectures on the MNIST dataset. Features basic MLP, optimized feature-selected model, and deep CNN approaches with detailed performance comparisons and visualizations.
cnn computer-vision data-analysis data-visualization deep-learning feature-analysis handwritten-digit-recognition keras machine-learning mlp mnist model-optimization neural-networks python scikit-learn tensorflow
Last synced: 02 Jan 2025
https://github.com/geo-y20/uber-rides-data-analysis
This project aims to analyze Uber ride data to understand various aspects of ride usage, such as the distribution of rides across different categories, purposes, months, days, and times.
dashboard dashboard-templates data data-analysis data-analysis-python data-analytics data-visualization pandas powerbi python recommendation-system rides uber
Last synced: 08 Jan 2025
https://github.com/prasad-chavan1/bank_data_analysis_r
Bank data analysis in R language
data data-analysis data-science r
Last synced: 07 Jan 2025
https://github.com/yutkin/gotohack-2-solution
Solutions of the selection stage
data-analysis data-science hackathon machine-learning
Last synced: 15 Feb 2025
https://github.com/beaprogrammer02345/python_data_analysis
Sales Analysis using Python
data-analysis data-visualization python
Last synced: 16 Feb 2025
https://github.com/michaelcurrin/yahoo-finance-reports
Use the Yahoo Finance API to get info on shares of interest and report on them
data-analysis data-science python reporting shares stock-market yahoo-finance yahoo-finance-api
Last synced: 14 Jan 2025
https://github.com/nurulashraf/linear-regression-spotify
Data Science - Spotify Linear Regression Analysis
data-analysis data-preprocessing data-visualization dataset-exploration feature-selection linear-regression machine-learning matplotlib mean-squared-error model-evaluation multiple-regression music-analytics numpy predictive-modeling python regression-analysis root-mean-squared-error scikit-learn seaborn spotify-data
Last synced: 12 Feb 2025
https://github.com/praveingk/lipidanalysis
data-analysis data-visualisation
Last synced: 24 Jan 2025
https://github.com/srvcl/lung-cancer-survival-analysis
Data Cleaning of a dataset and Survival Analysis in R Language
data-analysis data-science data-visualization r survival-analysis
Last synced: 14 Jan 2025
https://github.com/neerajpokala/malasian-housing-data
This repository contains the code, visualizations, and documentation for a comprehensive analysis of Malaysian condominium prices. The project explores factors influencing property prices, such as property size, number of bedrooms, facilities, and proximity to key amenities. Using statistical techniques and data visualization
data-analysis hypothesis-testing r
Last synced: 14 Jan 2025
https://github.com/jofaval/daily-california-births
Data Analysis of the Daily AFAB (Assigned Female At Birth) Births in California, 1959
california data-analysis data-science data-visualization deep-learning google-colab machine-learning python tensorflow timeseries timeseries-analysis
Last synced: 04 Feb 2025
https://github.com/extwiii/datascience-jhu
Ask the right questions, manipulate data sets, and create visualizations to communicate results - Coursera
biostatistics data-analysis data-science linear-regression multivariate-regression r r-programming toolbox visualization
Last synced: 28 Jan 2025
https://github.com/jofaval/game-of-thrones
Data Analysis and Predictions of the Game of Thrones' character's survivance from 2016
classification data-analysis data-science data-visualization deep-learning game-of-thrones google-colab kaggle keras machine-learning matplotlib python scikit-learn seaborn tensorflow xgboost
Last synced: 04 Feb 2025
https://github.com/jofaval/ionosphere
Binary Classification of Ionosphere signals at Goose Bay, Labrador in 1988
data-analysis data-science data-visualization deep-learning google-colab keras machine-learning python scikit-learn tensorflow uci xgboost
Last synced: 04 Feb 2025
https://github.com/mysto-007/world_population_growth_analysis
World Population Growth Analysis
data-analysis data-science data-visualization kaggle matplotlib
Last synced: 20 Feb 2025
https://github.com/rohitblaze10/-excel-_seller_store_analysis
A collection of data analysis projects showcasing data cleaning, exploration, visualization, and machine learning. Using "Excel" and more to uncover insights and drive data-driven decision-making. Feel free to explore, contribute, or collaborate!
data-analysis data-visualization excel excel-export
Last synced: 18 Feb 2025
https://github.com/saksham-joshi/sentiment_analyzer
Analyze the sentiment of a text stored in a string or file and understand the reason why your blogs and posts are not ranking up.
data-analysis data-analytics python sentiment-analyser sentiment-analysis sentiment-analysis-without-nltk
Last synced: 13 Jan 2025
https://github.com/jofaval/boston-housing
Regression Analysis into the Boston Housing in-demand pricing in 1978
boston-housing data-analysis data-science data-visualization machine-learning python regression
Last synced: 04 Feb 2025
https://github.com/chardyb/prob-and-stats-bmi6106
A repository for Spring 2025 BMI 6106: Statistics and Probability. This repository contains coursework, code examples, and projects exploring statistical methods and probabilistic models in biomedical informatics.
biomedical-informatics data-analysis data-science probability r statistical-modeling
Last synced: 14 Jan 2025
https://github.com/12danielll/neurogenomics_project
This project focuses on analyzing sequencing data to understand molecular mechanisms of neurological diseases and predict the effectiveness of immunotherapy in breast cancer patients. It integrates Python and R scripts for data processing, statistical analysis, and visualization, alongside a comprehensive report detailing methods and findings.
bioinformatics biostatistics clustering clustering-algorithms data-analysis data-visualization deseq2 differential-gene-expression functional-analysis immune-therapy machine-learning neurological-disease neuroscience pca-analysis python r seurat single-cell-analysis
Last synced: 14 Jan 2025
https://github.com/andremenezesds/house_rocket_sales_insights
Data analysis for House Rocket Sales Company database, including insights for sales optimization.
data-analysis data-visualization geopandas git github jupyter-notebook linux numpy pandas powerbi python seaborn seaborn-python streamlit streamlit-webapp ubuntu vscode windows10
Last synced: 14 Feb 2025
https://github.com/stefagnone/movies-dataset-analysis-project
Comprehensive analysis of the Movies dataset, exploring genre trends, comparisons, and qualitative insights using Python, Pandas, and visualizations. Designed to uncover actionable findings for stakeholders.
data-analysis data-visualization exploratory-data-analysis matplotlib movies-analysis pandas python seaborn storytelling-with-data
Last synced: 11 Feb 2025
https://github.com/stefagnone/unsupervised-analysis-project
This project investigates the impact of video content on social media engagement using advanced analytics techniques like PCA, k-means clustering, and logistic regression. It provides actionable insights for optimizing social media strategies for Thai fashion and cosmetics retailers.
Last synced: 11 Feb 2025
https://github.com/stefagnone/-employee-salary-analysis-and-insights
Predictive analysis of employee salary determinants for an anonymized dataset, highlighting key factors influencing salary and providing insights for salary policy improvements.
business-intelligence data-analysis data-science employee-salary-analysis excel gender-pay-gap predictive-insights regression-modeling spss statistical-analysis
Last synced: 11 Feb 2025
https://github.com/stefagnone/-wedding-vendor-pricing-and-customer-satisfaction-analysis
Data-driven analysis of wedding vendor pricing and customer satisfaction, with database design, SQL optimization, and cost breakdown generation.
business-intelligence cost-optimization customer-satisfaction data-analysis database-design python sql vision-board-analysis wedding-planning wedding-vendor-pricing
Last synced: 11 Feb 2025
https://github.com/stefagnone/data_storyboarding_visualization
Data Storyboarding and Visualization Techniques for Effective Communication
data-analysis data-visualization ggplot2-analysis r tableau-dashboards
Last synced: 11 Feb 2025
https://github.com/stefagnone/moneyball_project
Data-driven analysis inspired by the Moneyball approach, identifying affordable replacements for key Oakland A's players using R and sabermetrics to support cost-effective recruitment.
baseball-statistics data-analysis data-driven-decision-making player-replacement-strategy r-programming sabermetrics sports-analytics
Last synced: 11 Feb 2025