Projects in Awesome Lists by Gregoritsch3
A curated list of projects in awesome lists by Gregoritsch3 .
https://github.com/gregoritsch3/ml_eda_classification_loanapprovalprediction
An EDA and Machine Learning Classification exercise on the Loan Approval dataset demonstrating EDA, feature engineering, StratifiedKFold and the use of Tensorflow NN, SVC, LinearSVC, XGBoost, Naive-Bayes, Bagging, Random Forest and Decision Tree algorithms.etc. The modela are optimized using hyperparameter tuning through GridSearchCV.
eda feature-engineering machine-learning matplotlib numpy pandas scikit-learn scipy seaborn tensorflow
Last synced: 28 Mar 2025
https://github.com/gregoritsch3/ml_eda_classification_diabetes
An EDA and Machine Learning Classification exercise on the Diabetes dataset demonstrating the use of SQLAlchemy data import from an SQL database (PostgreSQL), Pre-processing Pipelines, ANOVA, 9 ScikitLearn ML models, Hyperparamter Tuning for the best performing one, and feature importance.
anova machine-learning matplotlib numpy pandas pipelines scikit-learn seaborn sql sqlalchemy statistics
Last synced: 17 Feb 2025
https://github.com/gregoritsch3/ml_eda_clustering_aidassessment
An EDA and Machine Learning Clustering exercise on the Country Aid Assessment dataset demonstrating the use of PCA, KMeans and DBSCAN clustering, Elbow Methods, etc. The clustering algorithm successfully demarcates countries that are in most dire need of aid based on their GDPP and Child Mortality rate.
anova dbscan kmeans machine-learning matplotlib numpy pandas pca scikit-learn seaborn statistics
Last synced: 03 Apr 2025
https://github.com/gregoritsch3/ml_eda_classification_goldpriceprediction
An EDA and Machine Learning Classification project on the IAU Gold ETF demonstrating the use of yfinance, stockstats, Time Series Split, Feature Expansion (SMA, EMA, lagged features, RSI-14, CL=F Close Price, etc.), Model Evaluation and Hyperparameter Tuning. The model predicts Gold price movement (1-up, 0-down) on a weekly basis and performs well.
price-prediction quantitative-finance randomforestclassifier scikit-learn yfinance
Last synced: 20 Feb 2025
https://github.com/gregoritsch3/markov_weather_model
A 5-State Markov Chain Weather Model whose transition probabilites are inferred from pre-existing daily weather data (https://www.kaggle.com/datasets/ananthr1/weather-prediction).
ergodic-stationary-processes jupyter-notebook markov-chain pandas prediction python
Last synced: 08 Apr 2025
https://github.com/gregoritsch3/ml_clustering_eda_customersegmentation
An EDA and Machine Learning Clustering exercise on the Mall Customer Segmentation synthetic dataset demonstrating the use of KMeans Clustering and the Elbow Method. The clustering algorithm successfully segments the customer base into groups distinguishable by their annual income and spending score.
clustering kmeans-clustering machine-learning matplotlib numpy pandas scikit-learn scipy seaborn
Last synced: 26 Mar 2025
https://github.com/gregoritsch3/dl_cv_e2e_potatodiseaseclassification
A guided CodeBasics Deep Learning Project where a Convolutional Model is deployed onto a Website (FastAPI) and Mobile App (React Native, Google Cloud). Its purpose is the classification of potato plant images into "healthy", "Early Blight" and "Late Blight" categories.
cnn-classification gcp model-deployment scikit-learn tensorflow
Last synced: 27 Mar 2025
https://github.com/gregoritsch3/dl_cnn_resnet50_digitrecognition
A Deep Learning Computer Vision project on the MNIST Digit Dataset. The project demonstrates the use of four TensorFlow Neural Network architectures, ranging from a basic Shallow Sigmoid Model to a Deep Convolutional Model constructed using the FunctionalAPI, and even a modified Resnet50 Model. Includes Error Analysis and test runs on real images.
cnn convolutional-neural-networks cv2 keras matplotlib numpy resnet-50 tensorflow
Last synced: 25 Feb 2025
https://github.com/gregoritsch3/project_radioactive_decay
An analytic ( system of coupled ODEs), numeric (Runge-Kutta 4, Newton-Raphson) and stochastic (Monte Carlo) treatment of the chained radioactive decay of Bi[210, 83] into Po[210,84], as well as the emission of alpha-particles. Associated comments and paper are in Croatian.
cplusplus matplotlib montecarlo-simulation numerical-methods numpy python radioactive-decay scipy
Last synced: 28 Feb 2025
https://github.com/Gregoritsch3/Exercise_Pandas_1
A Pandas exercise demonstrating the loading, cleaning and reorganization of data in .xlsx or .csv files, descriptive statistics, data visualization, statistical approximation of data with the normal distribution, etc.). Libraries include Pandas, NumPy, Scipy, SymPy, MatplotLib,
data-cleaning data-visualization descriptive-statistics matplotlib numpy pandas scipy sympy
Last synced: 01 May 2025
https://github.com/gregoritsch3/projects
Repository of projects (Josip Gregorić)
Last synced: 28 Feb 2025
https://github.com/gregoritsch3/exercise_pandas_1
A Pandas exercise demonstrating the loading, cleaning and reorganization of data in .xlsx or .csv files, descriptive statistics, data visualization, statistical approximation of data with the normal distribution, etc.). Libraries include Pandas, NumPy, Scipy, SymPy, MatplotLib,
data-cleaning data-visualization descriptive-statistics matplotlib numpy pandas scipy sympy
Last synced: 28 Feb 2025
https://github.com/gregoritsch3/project_python_jobs_data_analysis
A multi-library Data Analysis Python project that draws useful insights from existing job data. It provides useful exploratory insight about job postings in the Data sector, such as the distribution of remote/non-remote jobs, jobs with/without healthcare, top paying and most demanded data roles, and more.
matplotlib numpy pandas python scipy seaborn
Last synced: 28 Feb 2025
https://github.com/gregoritsch3/project_excel_dataanalysis_carsales
An Excel Data Analysis project based on a vehicle vendor's car sales data from 2014 and 2015 showcasing data cleaning and formatting, DAX, pivot tables and charts, timelines, slicers, an interactive Dashboard, descriptive Statistics and more.
analysis dashboard data excel sales statistics
Last synced: 28 Feb 2025
https://github.com/gregoritsch3/project_sql_job_analysis
An SQL project that draws useful insights from existing job data. It provides information regarding top-paying Data Scientist jobs and skills (both job-opportunity-based and salary-based), the most in-demand skills for Data Scientist roles, as well as a list of optimal Data Science skills to learn (skills high in demand and market value).
chatgpt dataanalysis postgresql sql statistics
Last synced: 28 Feb 2025
https://github.com/gregoritsch3/project_spring_pendulum
A solution to the chaotic equations of motion of a spring pendulum using Lagrangean mechanics integrated through symbolic and numeric Python libraries (Sympy, Numpy, Scipy, Matplotlib, etc.).
lagrangian-mechanics maptlotlib numpy python scipy spring-pendulum sympy
Last synced: 28 Feb 2025