Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2024-11-18 00:06:50 UTC
- JSON Representation
https://github.com/jakobzmrzlikar/pca-on-genomes
An analysis of human genome mutations from different populations.
data-analysis genome-analysis pca-analysis
Last synced: 07 Nov 2024
https://github.com/iguptashubham/ev-market-exploration
So, market size analysis is a crucial aspect of market research that determines the potential sales volume within a given market
data-analysis data-analysis-projects data-science-project forecast projects python
Last synced: 14 Nov 2024
https://github.com/jakobzmrzlikar/trg-dela
Data analysis of student job offers.
data-analysis ipython-notebook web-scraping
Last synced: 07 Nov 2024
https://github.com/navp7/hr_analysis_excel
This project utilizes Microsoft Excel to conduct a comprehensive analysis of HR data, focusing on identifying the various reasons for employee attrition and evaluating job satisfaction
dashboards data-analysis excel visualization
Last synced: 07 Nov 2024
https://github.com/valyaevgeorgiy/r_basic
Работа с основами среды R и тем самым изучения нового языка программирования, связанного непосредственно с анализом данных и построением графиков и диаграмм.
coding data data-analysis r rstudio
Last synced: 07 Nov 2024
https://github.com/harkishen-singh/agriculture-ds
An Agricultural based Mtech project, on Data Science, which predicts the growth of crops based on previous year records.
Last synced: 07 Nov 2024
https://github.com/netesf13d/expt-sequence-analysis
Data processing, analysis and visualization package for atomic physics experiments in the single-atom regime.
cold-atoms data-analysis data-visualization optical-tweezers
Last synced: 11 Nov 2024
https://github.com/kariemseiam/geoegy
An innovative and responsive dashboard to discover, filter, and analyze places across Egypt. Featuring advanced search, interactive maps with Leaflet.js, real-time analytics, dark mode, and seamless data export—all wrapped in a sleek, modern design with RTL support.
accessibility data-analysis data-visualization es6-modules geojson javascript leaflet mapping openstreetmap places-data responsive-design web-development
Last synced: 12 Nov 2024
https://github.com/lucs1590/agidatatest
This is a repository with data analysis and data science tests.
data-analysis data-science python test
Last synced: 13 Nov 2024
https://github.com/karencofre/riesgorelativo-lookerstudio
proyecto de análisis de datos y análisis perdicitvo en looker studio y google colab
bigquery data-analysis data-science machine-learning matplotlib python sklearn sql
Last synced: 13 Oct 2024
https://github.com/xre22zax/roller-coaster
Explore award-winning wood and steel coasters from 2013-2018 Golden Ticket Awards & Captain Coaster, all powered by Python and interactive visualizations.
analytics data-analysis data-visualization pandas python python-lambda python3 visualization
Last synced: 06 Nov 2024
https://github.com/bhaskaracharjee/student-results-analysis
Analyzing student results to uncover insights
Last synced: 06 Nov 2024
https://github.com/jayita11/exploring-most-streamed-songs-for-last-four-decades-eda
Perform EDA to uncover trends in streaming patterns, likes, and artists over the last four decades.
data-analysis eda hypothesis-testing matplotlib most-streamed-songs pandas python seaborn
Last synced: 13 Nov 2024
https://github.com/thyripian/ibm_data_science_capstone
data-analysis data-science data-visualization python python3
Last synced: 14 Nov 2024
https://github.com/yandexdataschool/ml-sweights-experiments
Experiments for the "Machine Learning on data with sPlot background subtraction" paper
data-analysis high-energy-physics machine-learning statistics
Last synced: 06 Nov 2024
https://github.com/jayita11/eda-student-exam-performance
This project performs Exploratory Data Analysis (EDA) and hypothesis testing on student performance data. It explores trends based on attributes like gender, race/ethnicity, parental education, lunch type, and test preparation course completion.
data-analysis eda hypothesis-testing matplotlib pandas python seaborn statsmodels student-performance-analysis
Last synced: 13 Nov 2024
https://github.com/jayita11/atliqo-bank-credit-card-launch-eda
This project involves exploratory data analysis and statistical testing for AtliQo Bank's new credit card launch. Key insights include targeting high-income occupations and the 18-25 age group. Recommendations focus on tailored marketing campaigns, education, and incentives to enhance credit card adoption and usage among young adults.
data-analysis hypothesis-testing matplotlib p-value pandas python seaborn statistics z-test
Last synced: 13 Nov 2024
https://github.com/pseudomanifold/pump
A generic data flow program
c-plus-plus-11 cplusplus data-analysis data-flow small
Last synced: 06 Nov 2024
https://github.com/pseudomanifold/us-inauguration-speeches
Data & feature extraction for U.S. inauguration speeches
data-analysis data-science inauguration politics speech speeches
Last synced: 06 Nov 2024
https://github.com/hyperentangledqubit/shellplot
shellplot -- Generate plot(s) directly from terminal via matplotlib or ggplot2 (plotnine)!
data-analysis ggplot2 graphics matplotlib plotnine plotting pyplot terminal
Last synced: 11 Nov 2024
https://github.com/amyanchen/sf-airbnb
Exploratory Data Analysis of San Francisco Airbnb's
data-analysis data-science data-visualization r rmarkdown statistics
Last synced: 07 Nov 2024
https://github.com/ahmetzamanis/projectcatalog
Catalog of my data science projects.
classification clustering data-analysis data-science data-science-portfolio data-visualization machine-learning python quarto r regression rmarkdown statistics time-series time-series-analysis
Last synced: 07 Nov 2024
https://github.com/andremenezesds/pa004_health_insurance
Health Insurance Cross-Sell(Learning to Rank Machine Learning Project)
backend backend-api data-analysis data-science data-visualization dataviz lgbm machine-learning matplotlib numpy optuna pandas python scikit-learn shell-script sql webapi xgboost
Last synced: 11 Nov 2024
https://github.com/edumoraes1/republicacao-produtos
SQL Query realizada para criação de automação de disparo de push via salesforce
bq data-analysis salesforce sql
Last synced: 12 Oct 2024
https://github.com/silianpan/python-data-analysis-course
python data analysis course of drotion-lega
data-analysis jupyter-notebook panda
Last synced: 06 Nov 2024
https://github.com/auliannee/new-york-uber-pickups-analysis
This repository contains the projects related to data collecting, quality check, manipulation, analyzing, and visualizations.
data-analysis data-science ipython-notebook jupyter-notebook python
Last synced: 13 Nov 2024
https://github.com/motapinto/agent-based-simulation-conquest
Agent-based simulation modelation of the conquest Battlefield gamemode
agent-based-simulation data-analysis jade java sajas swing
Last synced: 06 Nov 2024
https://github.com/dsarceno/portfolio
Portafolio de Científico de Datos. Proyectos realizados por Diego Sarceño.
computer-vision data-analysis data-science deep-learning docker graph-algorithms investing keras-tensorflow machine-learning markdown neural-networks optimization-algorithms pipelines python sentiment-analysis sklearn tensorflow voice-recognition
Last synced: 10 Oct 2024
https://github.com/edumoraes1/journey_active_users
Segmentação de base via SQL para jornada de vendedores ativos
bq data-analysis salesforce sql
Last synced: 12 Oct 2024
https://github.com/cannt39t/wylsacom-analysis-reflinks-datamining
data data-analysis data-mining python3 sql
Last synced: 09 Nov 2024
https://github.com/quantumudit/groceries-basket-analysis
This project performs market basket analysis using Power BI and Python to reveal associations between grocery items. It involves transforming raw transaction data into a processed dataset, creating interactive Power BI reports, and generating key insights through Python, enabling data-driven decision-making.
data-analysis data-visualization pandas powerbi python
Last synced: 06 Nov 2024
https://github.com/mirokeimioniemi/classifying-software-pirates
Exploring the factors driving people into software piracy by training two machine learning models to predict whether a person with certain characteristics and sentiments is likely to possess any pirated software or not using a dataset collected via a survey targeting users of music production software.
data-analysis data-science decision-tree-classifier logistic-regression machine-learning piracy python software-piracy survey
Last synced: 12 Nov 2024
https://github.com/chrispsang/customerchurnanalysis
Predicting customer churn using a RandomForestClassifier with detailed EDA, model evaluation, and visualization. Includes a Tableau dashboard for interactive insights.
customerchurn data-analysis data-visualization datapreprocessing machine-learning python scikit-learn tableau
Last synced: 10 Oct 2024
https://github.com/szymon-budziak/real_estate_house_prices_prediction
Predicting real estate house prices using various machine learning algorithms, including data exploration, preprocessing, model training, and evaluation.
data-analysis data-preprocessing data-science eda jupyter-notebook machine-learning matplotlib numpy optuna pandas predictive-modeling price-prediction python random-forest regression scikit-learn seaborn
Last synced: 10 Oct 2024
https://github.com/lunarwhite/lake-george-viz
Simple data analysis and visualization of Lake Geroge with Python.
Last synced: 06 Nov 2024
https://github.com/luminati-io/Airbnb-dataset-samples
A sample dataset of over 1000 Airbnb listings, extracted using the Bright Data API, ideal for competitor tracking, brand reputation, and market analysis.
airbnb airbnb-listings api data-analysis datasets web-scraper web-scraper-api web-scraping
Last synced: 06 Nov 2024
https://github.com/luminati-io/Shopee-dataset-samples
A sample dataset of over 1000 Shopee products, extracted using the Bright Data API, ideal for pricing optimization, gap analysis, and market strategy refinement..
api data-analysis data-mining datasets products shopee web-scraping
Last synced: 06 Nov 2024
https://github.com/luminati-io/Walmart-dataset-samples
A sample dataset of over 1000 Walmart products, extracted using the Bright Data API, ideal for consumer market insights and competitor analysis.
api data-analysis dataset walmart walmart-scraper web-scraping
Last synced: 06 Nov 2024
https://github.com/khanovico/python-stock-analyzer
This is a Webapp implemented by python and several data science frameworks, enabling online stock trend analyzing.
amcharts-js-charts data-analysis data-visualization flask javascript pandas python scikit-learn
Last synced: 03 Nov 2024
https://github.com/blakeziegler/binary-classification-competition
Binary Classification of Insurance Crosselling Kaggle Competition
data-analysis data-science database kaggle kaggle-competition machine-learning python rstudio scikit-learn xgboost
Last synced: 10 Oct 2024
https://github.com/luminati-io/Indeed-dataset-samples
A sample dataset of over 1000 Indeed job listings, extracted using the Bright Data API, ideal for market analysis and growth.
api data-analysis datasets indeed jobs web-scraping
Last synced: 06 Nov 2024
https://github.com/balajimohan18/peerloankart-loan-fraud-detection-datascience-project
This project uses machine learning to predict whether a loan applicant will repay their loan. The project uses a dataset of historical loan data from PeerLoanKart, a peer-to-peer lending platform.
classification-model data-analysis data-analytics data-cleaning data-science data-visualization dimensional-analysis eda exploratory-data-analysis feature-engineering gradient-boosting-classifier hyperparameter-tuning jupyter-notebook maachine-learning machine-learning-algorithms predictive-modeling python supervised-learning
Last synced: 14 Nov 2024
https://github.com/luminati-io/Amazon-dataset-samples
A sample dataset of over 1,000 Amazon product listings, extracted using the Bright Data API, perfect for competitive analysis, market trends, and eCommerce insights.
amazon api data-analysis data-science dataset ecommerce products web-scraping
Last synced: 06 Nov 2024
https://github.com/luminati-io/Target-dataset-samples
A sample dataset of over 1000 target products, extracted using the Bright Data API, ideal for brand reputation, tracking inventory, and optimizing prices.
api data-analysis data-mining datasets target web-scraper web-scraping
Last synced: 06 Nov 2024
https://github.com/lopez86/datascienceexamples
Examples of various data science & data analysis topics using various sources of data.
data-analysis data-science pandas scikit-learn tutorial visualization
Last synced: 06 Nov 2024
https://github.com/balajimohan18/foreign-exchange-rate-time-series-datascience-project
This project will use time series analysis to forecast the exchange rate between the euro and the US dollar. The project will use a variety of statistical techniques, such as ARIMA to model the data and forecast the exchange rate.
data-analysis data-analytics data-preprocessing data-science data-transformation data-visualization eda exploratory-data-analysis foreign-exchange-rates machine-learning model-fitting predictive-modeling python3 time-series time-series-analysis
Last synced: 14 Nov 2024
https://github.com/rickcontreras/modelos1
Modelo de clasificación para predecir el desempeño de estudiantes en las Pruebas Saber Pro en Colombia. Incluye análisis exploratorio de datos, preprocesamiento y modelos de machine learning.
classification colombia data-analysis data-science education educational-assessment exploratory-data-analysis jupyter-notebook machine-learning python saber-pro scikit-learn student-performance
Last synced: 10 Oct 2024
https://github.com/lopez86/rust-mlearn
Machine Learning Tools in Rust
data-analysis data-science machine-learning rust
Last synced: 06 Nov 2024
https://github.com/samuelsoaress/wkd-default-reduction
reduction of default from 35% to 25% or less with machine learning techniques
data-analysis data-exploration data-science machine-learning-algorithms
Last synced: 10 Nov 2024
https://github.com/aisurjyasamantaray/-optimizing-target-s-brazilian-operations-insights-from-order-processing-pricing-and-payment-trends-
This project offers an in-depth analysis of consumer behavior, logistical performance, and payment preferences within the e-commerce sector. By examining order costs, delivery times, and payment methods, businesses can uncover valuable insights into operational efficiency and customer preferences.
bigquery consumer-insights data-analysis database sql target
Last synced: 12 Oct 2024
https://github.com/rnddave/python-like-a-boss
This is where I stash my Python study material.
data data-analysis data-engineering data-science data-visualization datascience ipynb ipynb-jupyter-notebook ipynb-notebook numpy pandas python python3
Last synced: 11 Oct 2024
https://github.com/shuddha2021/stellar-candidate-selector
A sophisticated candidate selection algorithm leveraging multi-criteria analysis and machine learning to identify top software engineering candidates. This tool features flexible filtering, score adjustment, and detailed visualizations to streamline the recruitment process.
candidate-selection data-analysis data-visualization machine-learning pandas plotting-in-python python python-data-analysis recruitment scikit-learn
Last synced: 06 Nov 2024
https://github.com/m0saan/python-for-data-analysis
Materials and IPython notebooks for "Python for Data Analysis" by Wes McKinney,
data-analysis data-science ipython-notebook machine-learning matplotlib numpy pandas python
Last synced: 16 Nov 2024
https://github.com/quocduyenanhnguyen/roi-modeling-and-analysis-of-sports-dataset
In this project, you will find my ROI model for retirement savings and PowerPoint presentation of my ROI model, as well as my data analysis/visualization of Sports Ticket Sales dataset that I concluded with a PDF group written report
data-analysis data-visualization microsoft-excel rate-of-return-modeling sports-ticket-sales-dataset
Last synced: 08 Nov 2024
https://github.com/quocduyenanhnguyen/human-trafficking-analysis
I analyzed human trafficking data
data-analysis data-analytics data-visualization human-trafficking mysql mysql-database mysql-workbench query sql tableau tableau-dashboards tableau-public
Last synced: 08 Nov 2024
https://github.com/quocduyenanhnguyen/california-crime-data-analysis
I analyzed crime incident-based data in California in the year 2022. I used SQL for analysis and Tableau for visualization.
2022 california crime-data dashboard data-analysis data-analytics data-manipulation data-modeling data-visualisation data-visualization database fbi mysql mysql-workbench nibrs sql tableau tableau-dashboards tableau-public
Last synced: 08 Nov 2024
https://github.com/leeway64/lwwordcounter
C++ application that analyzes the frequency of words in a text file
bson cmake conan cpp data-analysis json json-schema text-analysis ubjson
Last synced: 08 Nov 2024
https://github.com/leeway64/lwtaiwancurrencyconverter
Converts US dollars (USD) to New Taiwan Dollars (TWD) and vice versa
cfg currency currency-converter data-analysis data-visualization docker exchange-rates federal-reserve julia taiwan twd united-states usd
Last synced: 08 Nov 2024
https://github.com/dataforgeopenaihub/steam-sales-analysis
This repository features an ETL pipeline for retrieving, processing, validating, and ingesting game metadata and sales data from SteamSpy and Steam APIs. Data is stored in a MySQL database on Aiven Cloud and visualized using Tableau dashboards for insightful analysis of gaming trends and sales performance.
data-analysis data-engineering data-pipepline data-warehousing games mysql-database python steam-api tableau typer-cli
Last synced: 10 Oct 2024
https://github.com/anjalikumari021/sports_data_analysis_using_excel
Analyzed Sports data and prepared advanced dashboard using MS Excel.
data-analysis data-cleaning excel-dashboard ms-excel pivot-tables reporting
Last synced: 14 Nov 2024
https://github.com/themihirmathur/uber-data-analytics
The goal of this project is to perform comprehensive data analytics on Uber trip data using a modern data engineering stack on Google Cloud Platform (GCP).
bigquery data-analysis data-engineering etl-pipeline google-cloud-platform looker python
Last synced: 12 Oct 2024
https://github.com/mysftz/statistical-analysis
A in-depth review of statistical analysis in Python from datasets.
data-analysis python python3 statistics university university-project
Last synced: 06 Nov 2024
https://github.com/mysftz/statistics-analysis
A python statistical analysis of a dataset and probability.
data-analysis matplotlib python python3 statistical-analysis
Last synced: 06 Nov 2024
https://github.com/mysftz/numerical-methods-in-matlab
Multiple MatLab scripts over multiple data analysis assignments.
data-analysis data-science matlab university university-assignment
Last synced: 06 Nov 2024
https://github.com/saitoxu/data-analysis-workspace
Docker image for data analysis
Last synced: 10 Nov 2024
https://github.com/abdoomohamedd/python-data-analysis-projects
Welcome to the Python Data Analysis Projects repository! This repository contains a collection of data analysis projects that I have worked on. Each project involves cleaning data, performing exploratory data analysis (EDA), and creating visualizations to extract insights. The projects utilize various Python libraries, including pandas, numpy, matp
data-analysis data-analysis-python data-cleaning data-visualization matplotlib matplotlib-pyplot numpy pandas python
Last synced: 06 Nov 2024
https://github.com/manjit-baishya-datascience/flipkart-laptop-listing-eda
This project analyzes laptop price data from Flipkart using AutoScraper for web scraping. It includes data loading, EDA, cleaning, statistical analysis, and visualization. The goal is to derive insights for pricing strategies and market positioning. Explore the repository for detailed documentation and code.
data-analysis ecommerce-platform flipkart laptop python
Last synced: 13 Nov 2024
https://github.com/jimohola/movielens_data_analysis
Movielens Data Analysis
data-analysis data-visualization exploratory-data-analysis pyhton3
Last synced: 16 Nov 2024
https://github.com/mrsamsonn/monolithic-polylithic-crystal-segmentation
A grid segmentation algorithm for clustering crystal structures using diffraction patterns. Useful in material science and nanotechnology, this code enables detailed analysis of crystals for research and industrial applications.
clustering crystal-structure crystallography data-analysis diffraction-patterns grid-segmentation image-processing k-means machine-learning matertial-science nanotechnology python research-project research-tools scientific-computing
Last synced: 02 Nov 2024
https://github.com/steviecurran/gbt-scripts
IDL scripts for the reduction of Green Bank Telescope data
data-analysis data-compression data-visualization radio-astronomy spectroscopy
Last synced: 14 Nov 2024
https://github.com/vidyadnina/cyclistic-sql-tableau-project
Trip data analysis for a bike-sharing service company using SQL and Tableau.
bigquery dashboard data-analysis data-analytics-sql data-cleaning data-visualization sql
Last synced: 12 Oct 2024
https://github.com/steviecurran/prediction-plot
C (pgplot) code to plot predicted versus measured values (generated from Tensorflow)
astrophysics data-analysis high-redshift machine-learning statistics tensorflow visualization
Last synced: 14 Nov 2024
https://github.com/mumtaz4118/scraping-medium-and-data-analytics
The file DataExtraction.py extracts information from the json files scrapped by the scrapper medium_scrapper_post.py. To extract information from json files scrapped by medium_scrapper_tag_archive.py (scrapping from tags archive) then use Data_Extraction_Archive_Tags.py
data data-analysis data-analytics data-extraction data-preprocessing data-science data-scraping deep-learning machine-learning python
Last synced: 07 Nov 2024
https://github.com/mumtaz4118/covid-19-data-visualization
covid-19-data-visulaization
covid-19 covid-data data-analysis data-analytics data-science machine-learning
Last synced: 07 Nov 2024
https://github.com/mumtaz4118/nlp-course
Programming Assignments and Lectures for Stanford's CS 224: Natural Language Processing with Deep Learning
course data data-analysis data-analytics data-science data-visualization deep-learning education machine-learning natural-language-processing neural-network transfer-learning
Last synced: 07 Nov 2024
https://github.com/karan-malik/r-basic-data-analysis
Basic Data Reading and Analysis using R
data-analysis data-frame data-science john-hopkins-coursera john-hopkins-university r r-programming
Last synced: 14 Nov 2024
https://github.com/mumtaz4118/amazon-iphone-12-data-scrapped
Beautiful Soup is a Python library for getting data out of HTML, XML, and other markup languages.
data-analysis data-extraction data-science data-scraping html mark-up python
Last synced: 07 Nov 2024
https://github.com/mateusoliveira30/top-intelligent-people
This project performs an exploratory analysis of the top_intelligent_people_in_the_world_5000.csv dataset, featuring some of the world's most intelligent individuals. Using pandas and matplotlib, the analysis includes checking for missing values, describing variables, and visualizing data.
data-analysis graphics kaggle-dataset python3
Last synced: 08 Nov 2024
https://github.com/tmmvn/analytics-notebooks
A bunch of data analytics notebooks done testing out JetBrains DataLore
ai algorithms data-analysis datalore elements-of-ai helsinki-university-mooc python
Last synced: 07 Nov 2024
https://github.com/jbn/vaquero
A Python library for iterative and interactive data wrangling at laptop-scale.
data data-analysis data-cleaning data-mining dirty-data elt etl etl-framework
Last synced: 13 Nov 2024
https://github.com/manishkr1754/nifty50_data_analysis_nsetools_nsepy_python
NIFTY50 Data Analysis from scratch (Data Extraction & Visualization to Investment Insights)
candlestick-chart cufflinks data-analysis data-visualization heikin-ashi interactive-visualizations moving-average nsepy nsetools plotly portfolio-analysis portfolio-analytics portfolio-and-investment-analysis python sharpe-ratio technical-analysis
Last synced: 23 Oct 2024
https://github.com/chrispsang/healthcare-dataanalysis
Analyze synthetic patient data to identify trends, improve healthcare delivery, and predict patient outcomes using machine learning models. Includes data exploration, preprocessing, model building, and visualizations.
data-analysis data-science data-visualization healthcare jupyter-notebook machine-learning python
Last synced: 06 Nov 2024
https://github.com/thisisashukla/survival-analysis
Hands-On Survival Analysis in Python
data-analysis data-science survival-analysis
Last synced: 07 Nov 2024
https://github.com/kheriberto/pandas_and_seabron_project
In this project I showcase my ability using pandas and seaborn to mold, transform and plot data.
data-analysis pandas python seaborn
Last synced: 10 Nov 2024
https://github.com/andryadsm/asset-analyzer
📈 Project Asset Analyzer (Python)
commodities data-analysis data-visualization economics financial-markets investing matplotlib numpy pandas python seaborn stock-market strategy trading
Last synced: 06 Nov 2024
https://github.com/andryadsm/predicting-house-prices
🏘️ Project Predicting House Prices (Python)
data-analysis data-preprocessing data-visualization feature-engineering house-prices machine-learning matplotlib numpy pandas python real-estate seaborn sklearn
Last synced: 06 Nov 2024
https://github.com/andryadsm/pizza-sales-report
🍕 Project Pizza Sales Report (MySQL, Tableau)
dashboards data-analysis data-visualization database-management mysql sales sql tableau
Last synced: 06 Nov 2024
https://github.com/kwonnayeon/medium-post-projects
Contains the code and projects from my Medium posts. I share what I've learned through trial and error to help others tackle similar work smoothly.
data-analysis data-science data-visualization medium-articles python r-language sql
Last synced: 13 Nov 2024
https://github.com/mlund2k/project-1-baseball-performance-vs.-attendance
Project assets for my first exploratory data analysis: Baseball Performance vs. Attendance.
bigquery data-analysis data-cleaning data-visualization excel rstudio sql tableau tidyverse
Last synced: 12 Oct 2024
https://github.com/sunsided/esc2024
Exploratory Data Analysis on the ESC 2024 results
csv data-analysis eurovision-song-contest scraping
Last synced: 02 Nov 2024
https://github.com/darkdk123/handwashing-discovery-analysis
A Guided Project in a Boot camp to Analyse the Original Data used in the Discovery of Viruses & Hand Washing By Dr. Ignaz Semmelweis in Vienna General Hospital in the 1840s.
data-analysis data-science data-visualization matplotlib-pyplot numpy pandas plotly-python python seaborn-plots
Last synced: 07 Nov 2024
https://github.com/darkdk123/house-valuation-model
A Challenge Project in a Boot-Camp to create a ML Model to predict the prices of houses in Boston Massachusetts from multiple parameters Using Multivariable Regression.
data-analysis data-science data-visualization matplotlib-pyplot multivariate-regression predictive-modeling statistics
Last synced: 07 Nov 2024
https://github.com/pranav016/exploratory-data-analysis-of-google-app-store-dataset
This is a data analysis done on the Google app store dataset to answer a few questions related to the data through data visualization techniques.
Last synced: 05 Nov 2024
https://github.com/pranav016/exploratory-data-analysis-of-sp500-dataset
This a data-analysis that I performed on the S&P 500 dataset and answered a few questions through data visualization techniques.
Last synced: 05 Nov 2024
https://github.com/virajbhutada/article-clustered-recommendation-system-ml
This project aims to redefine content discovery by delivering personalized article recommendations tailored to individual user preferences. We use advanced machine learning techniques like PCA and K-means clustering to analyze user behavior and article characteristics to provide highly accurate recommendations.
anaconda article-recommendation clustering-algorithm data-analysis data-science keras-tensorflow machine-learning machine-learning-algorithms ml-models numpy pandas plotly python scikit-learn scipy
Last synced: 15 Oct 2024
https://github.com/maazie-khan/olympics-data-enigeering
Worked with Azure Data Factory, Databricks, Data Lake Storage, and Synapse Analytics to build an ETL pipeline for processing and analyzing Olympic Games data from Kaggle.
azure big-data data-analysis dataengineering devops pipeline
Last synced: 14 Oct 2024
https://github.com/an4pdm/relatorio-de-vendas
O presente projeto foi feito através das ferramentas oferecidas pelo Power BI afim de aprimorar meus conhecimentos sobre ETL. Os dados utilizados foram de origem do site "Kaggle".
data-analysis data-visualization database etl powerbi
Last synced: 13 Nov 2024
https://github.com/dionixius7/titanic-disaster-ml-model
This project predicts the survival of passengers on the Titanic by using Kaggle Titanic Disaster Dataset. The dataset contains information related to passengers, such as age, gender, and class. Different machine learning algorithms have been applied for this predictive model to accomplish an accurate prediction that will define the survival chances
data-analysis data-science data-visualization eda knn-classifier machine-learning neural-network python scikit-learn svm tensorflow titanic-kaggle titanic-survival-prediction
Last synced: 17 Nov 2024