An open API service indexing awesome lists of open source software.

Projects in Awesome Lists by Gregoritsch3

A curated list of projects in awesome lists by Gregoritsch3 .

https://github.com/gregoritsch3/ml_eda_classification_loanapprovalprediction

An EDA and Machine Learning Classification exercise on the Loan Approval dataset demonstrating EDA, feature engineering, StratifiedKFold and the use of Tensorflow NN, SVC, LinearSVC, XGBoost, Naive-Bayes, Bagging, Random Forest and Decision Tree algorithms.etc. The modela are optimized using hyperparameter tuning through GridSearchCV.

eda feature-engineering machine-learning matplotlib numpy pandas scikit-learn scipy seaborn tensorflow

Last synced: 28 Mar 2025

https://github.com/gregoritsch3/ml_eda_classification_diabetes

An EDA and Machine Learning Classification exercise on the Diabetes dataset demonstrating the use of SQLAlchemy data import from an SQL database (PostgreSQL), Pre-processing Pipelines, ANOVA, 9 ScikitLearn ML models, Hyperparamter Tuning for the best performing one, and feature importance.

anova machine-learning matplotlib numpy pandas pipelines scikit-learn seaborn sql sqlalchemy statistics

Last synced: 17 Feb 2025

https://github.com/gregoritsch3/ml_eda_clustering_aidassessment

An EDA and Machine Learning Clustering exercise on the Country Aid Assessment dataset demonstrating the use of PCA, KMeans and DBSCAN clustering, Elbow Methods, etc. The clustering algorithm successfully demarcates countries that are in most dire need of aid based on their GDPP and Child Mortality rate.

anova dbscan kmeans machine-learning matplotlib numpy pandas pca scikit-learn seaborn statistics

Last synced: 03 Apr 2025

https://github.com/gregoritsch3/ml_eda_classification_goldpriceprediction

An EDA and Machine Learning Classification project on the IAU Gold ETF demonstrating the use of yfinance, stockstats, Time Series Split, Feature Expansion (SMA, EMA, lagged features, RSI-14, CL=F Close Price, etc.), Model Evaluation and Hyperparameter Tuning. The model predicts Gold price movement (1-up, 0-down) on a weekly basis and performs well.

price-prediction quantitative-finance randomforestclassifier scikit-learn yfinance

Last synced: 20 Feb 2025

https://github.com/gregoritsch3/markov_weather_model

A 5-State Markov Chain Weather Model whose transition probabilites are inferred from pre-existing daily weather data (https://www.kaggle.com/datasets/ananthr1/weather-prediction).

ergodic-stationary-processes jupyter-notebook markov-chain pandas prediction python

Last synced: 08 Apr 2025

https://github.com/gregoritsch3/ml_clustering_eda_customersegmentation

An EDA and Machine Learning Clustering exercise on the Mall Customer Segmentation synthetic dataset demonstrating the use of KMeans Clustering and the Elbow Method. The clustering algorithm successfully segments the customer base into groups distinguishable by their annual income and spending score.

clustering kmeans-clustering machine-learning matplotlib numpy pandas scikit-learn scipy seaborn

Last synced: 26 Mar 2025

https://github.com/gregoritsch3/dl_cv_e2e_potatodiseaseclassification

A guided CodeBasics Deep Learning Project where a Convolutional Model is deployed onto a Website (FastAPI) and Mobile App (React Native, Google Cloud). Its purpose is the classification of potato plant images into "healthy", "Early Blight" and "Late Blight" categories.

cnn-classification gcp model-deployment scikit-learn tensorflow

Last synced: 27 Mar 2025

https://github.com/gregoritsch3/dl_cnn_resnet50_digitrecognition

A Deep Learning Computer Vision project on the MNIST Digit Dataset. The project demonstrates the use of four TensorFlow Neural Network architectures, ranging from a basic Shallow Sigmoid Model to a Deep Convolutional Model constructed using the FunctionalAPI, and even a modified Resnet50 Model. Includes Error Analysis and test runs on real images.

cnn convolutional-neural-networks cv2 keras matplotlib numpy resnet-50 tensorflow

Last synced: 25 Feb 2025

https://github.com/gregoritsch3/project_radioactive_decay

An analytic ( system of coupled ODEs), numeric (Runge-Kutta 4, Newton-Raphson) and stochastic (Monte Carlo) treatment of the chained radioactive decay of Bi[210, 83] into Po[210,84], as well as the emission of alpha-particles. Associated comments and paper are in Croatian.

cplusplus matplotlib montecarlo-simulation numerical-methods numpy python radioactive-decay scipy

Last synced: 28 Feb 2025

https://github.com/Gregoritsch3/Exercise_Pandas_1

A Pandas exercise demonstrating the loading, cleaning and reorganization of data in .xlsx or .csv files, descriptive statistics, data visualization, statistical approximation of data with the normal distribution, etc.). Libraries include Pandas, NumPy, Scipy, SymPy, MatplotLib,

data-cleaning data-visualization descriptive-statistics matplotlib numpy pandas scipy sympy

Last synced: 01 May 2025

https://github.com/gregoritsch3/projects

Repository of projects (Josip Gregorić)

Last synced: 28 Feb 2025

https://github.com/gregoritsch3/exercise_pandas_1

A Pandas exercise demonstrating the loading, cleaning and reorganization of data in .xlsx or .csv files, descriptive statistics, data visualization, statistical approximation of data with the normal distribution, etc.). Libraries include Pandas, NumPy, Scipy, SymPy, MatplotLib,

data-cleaning data-visualization descriptive-statistics matplotlib numpy pandas scipy sympy

Last synced: 28 Feb 2025

https://github.com/gregoritsch3/project_python_jobs_data_analysis

A multi-library Data Analysis Python project that draws useful insights from existing job data. It provides useful exploratory insight about job postings in the Data sector, such as the distribution of remote/non-remote jobs, jobs with/without healthcare, top paying and most demanded data roles, and more.

matplotlib numpy pandas python scipy seaborn

Last synced: 28 Feb 2025

https://github.com/gregoritsch3/project_excel_dataanalysis_carsales

An Excel Data Analysis project based on a vehicle vendor's car sales data from 2014 and 2015 showcasing data cleaning and formatting, DAX, pivot tables and charts, timelines, slicers, an interactive Dashboard, descriptive Statistics and more.

analysis dashboard data excel sales statistics

Last synced: 28 Feb 2025

https://github.com/gregoritsch3/project_sql_job_analysis

An SQL project that draws useful insights from existing job data. It provides information regarding top-paying Data Scientist jobs and skills (both job-opportunity-based and salary-based), the most in-demand skills for Data Scientist roles, as well as a list of optimal Data Science skills to learn (skills high in demand and market value).

chatgpt dataanalysis postgresql sql statistics

Last synced: 28 Feb 2025

https://github.com/gregoritsch3/project_spring_pendulum

A solution to the chaotic equations of motion of a spring pendulum using Lagrangean mechanics integrated through symbolic and numeric Python libraries (Sympy, Numpy, Scipy, Matplotlib, etc.).

lagrangian-mechanics maptlotlib numpy python scipy spring-pendulum sympy

Last synced: 28 Feb 2025