An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with model-evaluation

A curated list of projects in awesome lists tagged with model-evaluation .

https://github.com/thomasp85/lime

Local Interpretable Model-Agnostic Explanations (R port of original Python package)

caret model-checking model-evaluation modeling r

Last synced: 08 Apr 2025

https://github.com/eugeneyan/testing-ml

🔍 Minimal examples of machine learning tests for implementation, behaviour, and performance.

machine-learning model-evaluation testing

Last synced: 09 Apr 2025

https://github.com/hongping-zh/circular-bias-detection

a comprehensive statistical framework for detecting circular reasoning bias in AI algorithm evaluation

ai-ethics bias-detection data-science llm machine-learning model-evaluation

Last synced: 07 Mar 2026

https://github.com/IliaZenkov/sklearn-audio-classification

An in-depth analysis of audio classification on the RAVDESS dataset. Feature engineering, hyperparameter optimization, model evaluation, and cross-validation with a variety of ML techniques and MLP

audio audio-data classification deep-learning-tutorial deep-neural-networks dnns emotion emotion-detection emotion-recognition feature-engineering machine-learning machine-learning-tutorials mlp-model model-evaluation ravdess-dataset scikit-learn sklearn

Last synced: 22 Mar 2025

https://github.com/roboflow/cvevals

Evaluate the performance of computer vision models and prompts for zero-shot models (Grounding DINO, CLIP, BLIP, DINOv2, ImageBind, models hosted on Roboflow)

computer-vision model-evaluation

Last synced: 18 Aug 2025

https://github.com/metno/pyaerocom

Python tools for climate and air quality model evaluation

aerocom aerosol air-quality climate-science earth-observation model-evaluation

Last synced: 21 Oct 2025

https://github.com/lechmazur/deception

Benchmark evaluating LLMs on their ability to create and resist disinformation. Includes comprehensive testing across major models (Claude, GPT-4, Gemini, Llama, etc.) with standardized evaluation metrics.

ai-benchmarks ai-evaluation ai-safety ai-security claude disinformation gemini gpt4o language-model llama llm llm-benchmarking machine-learning mistral model-evaluation nlp

Last synced: 13 Oct 2025

https://github.com/medoidai/skrobot

skrobot is a Python module for designing, running and tracking Machine Learning experiments / tasks. It is built on top of scikit-learn framework.

artificial-intelligence data-science feature-engineering feature-selection hyperparameter-tuning machine-learning model-evaluation model-selection model-training model-tuning open-source predictive-modelling python scikit-learn

Last synced: 02 Aug 2025

https://github.com/animator/titus2

Titus 2 : Portable Format for Analytics (PFA) implementation for Python 3.4+

analytics inference inference-engine ml-engine model-deployment model-evaluation model-serving pfa pfa-standard pmml python scoring scoring-engine titus

Last synced: 09 Mar 2026

https://github.com/mr-easy/badminton-stroke-classification

Classifying badminton strokes based on accelorometer and gyroscope sensor data attached to player's wrist. An end-to-end Machine Learning project, from data collection and preprocessing to final model evaluation.

badminton-stroke-classification data-analysis data-analytics data-science deep-learning machine-learning model-evaluation notebook project time-series-analysis tutorial

Last synced: 31 Aug 2025

https://github.com/amirhosseinhonardoust/underwriting-decision-safety-lab

A decision-safety lab for loan approval: trains a baseline classifier, calibrates probabilities (ECE/Brier), sweeps confidence thresholds to build a coverage, quality frontier and outputs a defensible abstention policy (auto-decide vs review). Includes a Streamlit dashboard for report cards, triage UI, and data quality checks.

abstention calibration classification credit-risk data-quality data-science decision-policy loan-approval machine-learning mlops model-evaluation monitoring pandas reliability responsible-ai scikit-learn selective-classification streamlit uncertainty underwriting

Last synced: 10 Jun 2026

https://github.com/chaitanyac22/deep-rl-project---maximize-total-profits-earned-by-cab-driver

The goal of this project is to build an RL-based algorithm that can help cab drivers maximize their profits by improving their decision-making process on the field. Taking long-term profit as the goal, a method is proposed based on reinforcement learning to optimize taxi driving strategies for profit maximization. This optimization problem is formulated as a Markov Decision Process i.e. MDP.

actions convergence data-visualization deep-reinforcement-learning dqn epsilon-decay epsilon-greedy hyperparameter-tuning markov-decision-process mdp-framework minibatch-gradient-descent model-building model-evaluation optimal-policy prediction q-values-tracking rewards rl states training-dqn-agent

Last synced: 16 Oct 2025

https://github.com/venkat-0706/twalyze

Twitter sentiment analysis project using machine learning to classify tweets and understand audience mood, opinions, and behavior trends in real-time.

logistic-regression machine-learning model-evaluation naive-bayes-classifier pandas python scikitlearn-machine-learning tfidf-vectorizer tokenization

Last synced: 07 May 2026

https://github.com/ACCESS-NRI/access-nri-intake-catalog

Tools and configuration info used to manage ACCESS-NRI's intake catalogue

med-team model-evaluation

Last synced: 20 Jul 2025

https://github.com/lenarddome/psp

Implements an n-dimensional parameter space partitioning algorithm for evaluating the global behaviour of formal computational models as described by Pitt, Kim, Navarro and Myung (2006)

cognitive-science computational-modelling cran global-analysis model-evaluation parameter-space-analysis psychology r r-package

Last synced: 18 Feb 2026

https://github.com/ksm26/reinforcement-learning-from-human-feedback

Embark on the "Reinforcement Learning from Human Feedback" course and align Large Language Models (LLMs) with human values.

fine-tuning generative-ai google-cloud large-language-models llama-2 model-evaluation reinforcement-learning rlhf

Last synced: 30 Apr 2026

https://github.com/sushant1827/fashion-clothing-classification

Kaggle Machine Learning Competition Project : In this project, we will create a classifier to classify fashion clothing into 10 categories learned from Fashion MNIST dataset of Zalando's article images

adam-optimizer categorical-cross-entropy classification classification-report conv2d dropout keras keras-classification-models keras-neural-networks maxpooling model-evaluation reshaping splitting subplots

Last synced: 07 Mar 2026

https://github.com/evaluation-context-protocol/ecp

ECP is a standardized interface for orchestrating, auditing, and enforcing authority limits in AI Agent evaluations. It moves evaluation from "brittle Python scripts" to a deterministic infrastructure protocol

evaluation-metrics evaluations llm-evaluation model-evaluation

Last synced: 25 Apr 2026

https://github.com/sushant1827/Fashion-Clothing-Classification

Kaggle Machine Learning Competition Project : In this project, we will create a classifier to classify fashion clothing into 10 categories learned from Fashion MNIST dataset of Zalando's article images

adam-optimizer categorical-cross-entropy classification classification-report conv2d dropout keras keras-classification-models keras-neural-networks maxpooling model-evaluation reshaping splitting subplots

Last synced: 14 Oct 2025

https://github.com/sayamalt/resume-classification-using-fine-tuned-bert

Successfully developed a resume classification model which can accurately classify the resume of any person into its corresponding job with a tremendously high accuracy of more than 99%.

bert-model exploratory-data-analysis fine-tuning-bert model-evaluation nlp text-preprocessing text-tokenization word-embeddings

Last synced: 31 Aug 2025

https://github.com/sminerport/sequencepredictionann

Predict next number in a sequence using a simple ANN. Modularized code with classes for data preparation, neural network architecture, and training.

artificial-neural-networks data-prep deep-learning machine-learning model-evaluation model-training neural-network numpy python scikit-learn sequence-prediction supervised-learning time-series-forecasting

Last synced: 03 Apr 2025

https://github.com/jesussantana/ibm-data-analysis-with-python-da0101en

This course will take you from the basics of Python to exploring many different types of data.

anova correlation data-analysis model-evaluation numpy pandas prepare-data python regression-models statistics

Last synced: 17 Jul 2025

https://github.com/haloapping/cotomks

Kumpulan referensi untuk belajar mengenai pemrograman Python, Data Science, Machine Learning dan Deep Learning.

data-science deep-learning machine-learning model-evaluation probabilistic statistics time-series

Last synced: 24 Feb 2025

https://github.com/chaitanyac22/multi-input-multi-output-mnist-image-digit-and-summed-output-classification

The goal of this project is to build a neural network that takes an MNIST handwritten digit (0-9) image and a random number (digit 0-9) as inputs and returns the predicted class label (0-9) for the input image and its addition (sum) with the input random number as summed output (range 0-18) label as outputs.

cnn-classification data-visualization deep-neural-networks exploratory-data-analysis functional-api image-data-generator image-processing keras mnist-classification model-building model-evaluation multi-inputs multi-outputs python3 tensorflow

Last synced: 18 Apr 2026

https://github.com/sayamalt/language-detection-using-fine-tuned-xlm-roberta-base-transformer-model

Successfully developed a language detection transformer model that can accurately recognize the language in which any given text is written.

bert-fine-tuning feature-engineering fine-tuning model-evaluation model-evaluation-metrics nlp text-classification text-preprocessing xlm-roberta

Last synced: 31 Aug 2025

https://github.com/nafisalawalidris/predicting-credit-card-approvals

Explore credit card approval prediction through data analysis and machine learning. Preprocess data, train logistic regression models, and optimize hyperparameters. Learn data preprocessing, feature engineering, model training, and evaluation. Dive into the world of machine learning with Python and popular libraries.

approval-prediction credit-card data-analysis data-preprocessing feature-engineering hyperparameter-optimization libraries logistic-regression machine-learning model-evaluation model-training python python3

Last synced: 19 Apr 2025

https://github.com/mayurasandakalum/breast-cancer-detection

Code for classifying breast cancer tumors using machine learning. Includes preprocessing, visualizations, and models like Logistic Regression, Decision Tree, and Random Forest. Evaluated with accuracy, precision, recall, and F1-score. Clone, install dependencies, and run the Jupyter notebook for full analysis.

accuracy breast-cancer-classification data-preprocessing decision-tree exploratory-data-analysis f1-score jupyter-notebook logistic-regression machine-learning ml model-evaluation precision random-forest recall svm visualizations

Last synced: 21 Feb 2026

https://github.com/shridhar1504/sales-forecasting-datascience-project

Develop a data science project using historical sales data to build a regression model that accurately predicts future sales. Preprocess the dataset, conduct exploratory analysis, select relevant features, and employ regression algorithms for model development. Evaluate model performance, optimize hyperparameters, and provide actionable insights.

data-analytics data-cleaning data-science data-testing data-visualization forecasting-models machin model-evaluation model-fitting prediction predictive-modeling python3 regression-algorithms salesforecast sklearn-library supervised-learning

Last synced: 30 Oct 2025

https://github.com/praveen1664/solved-python-machine-learning-book-book-1st-edition

Solved problem of famous book in machine learning, deep learning for learners

logistic-regression machine machine-learning model-evaluation nbviewer scikit-learn

Last synced: 20 May 2026

https://github.com/chaitanyac22/house-price-prediction-project-for-a-us-based-housing-company

The goal of this project is to garner data insights using data analytics to purchase houses at a price below their actual value and flip them on at a higher price. This project aims at building an effective regression model using regularization (i.e. advanced linear regression: Ridge and Lasso regression) in order to predict the actual values of prospective housing properties and decide whether to invest in them or not.

advanced-linear-regression business-analytics data-analysis data-cleaning data-manipulation data-visualization exploratory-data-analysis feature-engineering lasso-regression linear-regression machine-learning model-building model-evaluation prediction-model python3 regularization rfe ridge-regression statistics

Last synced: 30 Apr 2026

https://github.com/dngvmnh/revolving_credit_behavior_modeling

The Revolving Credit Behavior Modeling project analyzes revolving credit to facilitate flexible access to funds within a credit limit, assisting financial institutions in setting accurate pricing strategies by addressing risk factors like inflation and interest rates.

credit-contracts financial-flexibility interest-rate-risks model-evaluation predictive-modeling random-forest-regressor revolving-credit simulation time-value-of-money withdrawal-patterns

Last synced: 02 Apr 2025

https://github.com/sayamalt/stellar-classification---sloan-digital-sky-survey-17

Successfully established a machine learning model which can predict an appropriate stellar class, on the basis of a distinct set of spectral characteristics, to a substantially high level of accuracy.

cross-validation data-preprocessing data-visualization exploratory-data-analysis feature-engineering feature-scaling imbalanced-learning model-deployment model-evaluation model-training multiclass-classification supervised-machine-learning

Last synced: 11 Jan 2026

https://github.com/chaitanyac22/autistic-spectrum-disorder-asd-detection

This project aims to develop a robust classification model using test-takers' demographics and questionnaire responses from the ASD screening dataset to accurately identify individuals with Autistic Spectrum Disorder (ASD) through optimization of performance metrics.

classification-algorithm healthcare jupyter-notebook kneighborsclassifier logistic-regression machine-learning machine-learning-algorithms model-building model-evaluation modular-code pipelines python3 random-forest social-sciences xgboost

Last synced: 17 May 2026

https://github.com/mj3b/governed-decision-intelligence

Defines the decision layer for AI systems where deployment outcomes are governed, recorded, and reconstructable. 5th conforming implementation of draft-farley-acta-signed-receipts (IETF).

ai-governance auditability cryptographic-receipts decision-governance decision-systems interoperability model-evaluation responsible-ai risk-management

Last synced: 16 May 2026

https://github.com/ehtisham-sadiq/ai-pioneers-datascience-arena

This repository is dedicated to the AI Amigos team's participation in the Artificial Intelligence (AI) competition with a focus on Data Science.

artificial-intelligence competition data-analysis data-science data-visualization machine-learning model-building model-evaluation numpy pandas python3 supervised-learning unsupervised-learning

Last synced: 05 May 2026

https://github.com/raihan4520/ml

A collection of machine learning projects showcasing various algorithms and techniques, including a final project for the Machine Learning course at AIUB.

data-preprocessing jupyter-notebook machine-learning model-evaluation numpy pandas python scikit-learn

Last synced: 11 Jan 2026

https://github.com/virajbhutada/music-recommendation-system

This project is designed to provide personalized music recommendations for relaxation and meditation. Leveraging ML and data analysis, the system suggests tracks based on user preferences such as tempo, energy, and genre. Join us in enhancing music discovery through advanced algorithms and community-driven contributions.

data-analysis data-science-projects data-visualization eda html machine-learning ml-algortihms model-deployment model-evaluation music-recommendation-system nlp pivot-table principal-component-analysis python python-library similarity-matrix spotify-data streamlit-web user-experience

Last synced: 24 Jan 2026

https://github.com/abhinavsharma07/kaggle-comp.

A repository showcasing solutions to Kaggle competitions with end-to-end workflows in machine learning and data science.

data-preprocessing datascience deeplearning feature-engineering kaggle machinelearning model-evaluation model-training predictive-modeling

Last synced: 26 Feb 2026

https://github.com/andrei-vataselu/data-science-snippets

🧰 Essential EDA and Data Cleaning Helpers for Any DataFrame This collection of functions is designed to accelerate exploratory data analysis (EDA), quickly surface data quality issues, and offer high-level insights into the structure and content of your dataset.

artificial-intelligence data data-science eda feature-engineering hyperparamater-tunning library loading model-evaluation modeling preprocessing python snippets text-processing time-series visualization

Last synced: 10 Mar 2026

https://github.com/deaneeth/churn-prediction-production-pipeline

Production-grade machine learning pipeline for customer churn prediction with modular components for data preprocessing, model training, and streaming inference. This third repo in a series builds on previous work to create a deployment-ready prediction system with comprehensive configuration, evaluation metrics, and scalable architecture.

churn-prediction data-science machine-learning mlops model-evaluation pipeline production-pipeline python

Last synced: 10 Oct 2025

https://github.com/chaitanyac22/telecom-churn-prediction

In this project, data analytics is used to analyze customer-level data of a leading telecom firm, build predictive models to identify customers at high risk of churn, and identify the main indicators of churn. The project focuses on a four-month window, wherein the first two months are the ‘good’ phase, the third month is the ‘action’ phase, while the fourth month is the ‘churn’ phase. The business objective is to predict the churn in the last i.e. fourth month using the data from the first three months.

class-imbalance classification data-analytics data-cleaning data-manipulation evaluation-metrics feature-engineering hyperparameter-tuning logistic-regression machine-learning model-building model-evaluation over-sampling pca random-forest-classifier rfe smote statistics telecom xgboost

Last synced: 12 Jun 2025

https://github.com/allenleizhao/model_validation_strategy_comparison

A comparison of cross-validation techniques and classification models using synthetic data. Includes evaluation of Logistic Regression, Random Forest, and SVM with K-Fold and Repeated K-Fold methods.

classification cross-validation data-science machine-learning model-evaluation python sklearn

Last synced: 06 May 2026

https://github.com/sayamalt/news-category-classification

Successfully developed a news category classification model using fine-tuned BERT which can accurately classify any news text into its respective category i.e. Politics, Business, Technology and Entertainment.

bert-embeddings exploratory-data-analysis feature-engineering fine-tuning-bert model-evaluation nlp text-classification text-cleaning text-preprocessing text-tokenization

Last synced: 15 Jun 2025

https://github.com/sameetasadullah/cnn-benchmark-suite

A modular deep learning evaluation framework for benchmarking multiple CNN architectures across varied optimization strategies and training configurations. Built for scalable experimentation and transferability to real-world image classification tasks.

alexnet cifar10 cnn computer-vision deep-learning googlenet hyperparameter-tuning image-classification mobilenet model-benchmarking model-evaluation pytorch resnet torchvision

Last synced: 19 Jun 2026

https://github.com/redayzarra/cnn_trafficsigns

This is a machine learning project where I utilized the LeNet-5 architecture to create a convolutional deep network that classifies 43 different kind of traffic signs. I've made sure to include a full step-by-step implementation of the project as well as detailed notes for every step.

classification classification-model computer-vision deep-learning deep-neural-networks lenet lenet-5 lenet-architecture machine-learning model-evaluation model-selection

Last synced: 15 May 2026

https://github.com/arpanpramanik2003/object-detection-resnet50

This repository contains a deep learning project for CIFAR-10 image classification using the ResNet50 pre-trained model. The project includes data preprocessing, model training, evaluation, and visualization of results. Achieved high accuracy by fine-tuning the model and optimizing hyperparameters.

cifar-10 cifar10 cnn deep-learning keras machine-learning model-evaluation object-detection opencv pre-trained-model python regression-models resnet-50 streamlit tensorflow2 transformer-models

Last synced: 07 Apr 2026

https://github.com/steciuk/ium-recommendation-system

Evaluation and comparison of 3 different recommendations models for web shopping service simulation.

data-analysis model-evaluation recomendation-system

Last synced: 29 Oct 2025

https://github.com/lucianoscarpaci/resnet18-evaluation-pytorch

This project demonstrates how to fine-tune a pre-trained ResNet18 model using PyTorch for binary classification. This model is adapted to identify images in two classes Positive and Negative.

binary-classification data-preprocessing fine-tuning image-classification model-evaluation model-training neural-networks performance-metrics pre-trained-models pytorch resnet-18 transfer-learning

Last synced: 21 May 2026

https://github.com/dmarks84/ind_project_california-housing-data--kaggle

Independent Project - Kaggle Dataset-- I worked on the California Housing dataset, performing data cleaning and preparation; exploratory data analysis; feature engineering; regression model buildings; model evaluation.

cross-validation data-modeling data-reporting data-visualization eda folium grid-search matplotlib model-evaluation numpy pandas pca python seaborn sklearn statistics supervised-ml unsupervised-ml

Last synced: 08 Apr 2026

https://github.com/abdullah321umar/internee.pk-dataanalytics_internship-assignment4

🌟 Fraud Detection in Application 🌟 Through Isolation Forest and K-Means Clustering, the project detects suspicious patterns like inconsistent income, duplicate entries, and unrealistic employment data. This end-to-end workflow transforms raw data into actionable fraud insights — enhancing trust and accuracy.

anomaly-detection csv-handling data-cleaning data-exporting data-import data-normalization exploratory-data-analysis export interpretation matplotlib model-evaluation pandas pca python reporting scaling scikit-learn seaborn

Last synced: 06 May 2026

https://github.com/avijit-jana/seqflipattention

SeqFlipAttention is a forward‑looking PyTorch demonstration of sequence‑to‑sequence learning enhanced by attention, trained on a synthetic reverse‑sequence task and complete with training scripts, loss and accuracy visualizations, and a quantitative analysis of attention’s impact on performance.

attention-mechanism deep-learning deeplearning machine-learning machine-translation model-evaluation modelevaluation natural-language-processing nlp python pytorch seq2seq synthetic-data syntheticdata text-generation

Last synced: 10 May 2026

https://github.com/qtle3/logistic-regression

A Python implementation of Logistic Regression to classify social network ads based on age and estimated salary, featuring data visualization and performance metrics such as confusion matrix and accuracy score.

data-visualization feature-scaling logistic-regression logistic-regression-algorithm model-evaluation

Last synced: 01 Aug 2025

https://github.com/md-emon-hasan/evidently-ai

Evidently AI in tracking, analyzing, and visualizing machine learning model performance and data drift ensure their reliability over time.

evidently evidentlyai model-evaluation model-monitoring monitoring-tool tracking tracking-system

Last synced: 17 Aug 2025

https://github.com/Zen204/airbnb-availability

A machine learning model that predicts Airbnb listing availability, utilizing feature engineering and supervised learning techniques to improve guest experience and optimize host management.

binary-classification data-analysis data-preprocessing data-visualization feature-engineering machine-learning matplotlib model-evaluation nlp pandas predictive-modeling python scikit-learn seaborn supervised-learning

Last synced: 02 Apr 2025

https://github.com/allanotieno254/codsoft

This repository showcases a series of data science projects completed during an internship with CODESOFT. Each project utilizes Python and various machine learning techniques to solve specific problems in data analysis, classification, regression, and predictive modeling.

classification data-analysis data-science feature-engineering machine-learning model-evaluation predictive-modeling python-programming regression

Last synced: 15 May 2025

https://github.com/davidogalo/automatic-license-plate-recognition-system

Designed and implemented an ANPR system using deep learning for accurate license plate identification. The project involved data preprocessing with OpenCV, TensorFlow, and OCR techniques for real-time identification. Model performance was evaluated using cross-validation and metrics like accuracy and F1 score.

cross-validation data-preprocessing deep-learning image-processing model-evaluation model-training-and-optimization optical-character-recognition performance-metrics

Last synced: 16 May 2025

https://github.com/satvikpraveen/fashionmnist-analysis

A comprehensive analysis of the Fashion MNIST dataset using PyTorch. Covers data preparation, EDA, baseline modeling, and fine-tuning CNNs like ResNet. Includes modular folders for data, notebooks, and results. Features CSV exports, visualizations, metrics comparison, and a requirements.txt for easy setup. Ideal for ML workflow exploration.

computer-vision confusion-matrix convolutional-neural-networks deep-learning-algorithms exploratory-data-analysis fashion-mnist-dataset fine-tuning hyperparameter-tuning image-classification jupyter-notebook machine-learning-algorithms matplotlib-pyplot model-evaluation numpy pandas pytorch resnet-18 scikit-learn seaborn vgg

Last synced: 22 Apr 2025

https://github.com/zen204/airbnb-availability

A machine learning model that predicts Airbnb listing availability, utilizing feature engineering and supervised learning techniques to improve guest experience and optimize host management.

binary-classification data-analysis data-preprocessing data-visualization feature-engineering machine-learning matplotlib model-evaluation nlp pandas predictive-modeling python scikit-learn seaborn supervised-learning

Last synced: 21 Jan 2026

https://github.com/adilrasheed139/ai-powered-resume-screening-using-bert

Successfully developed a resume classification model which can accurately classify the resume of any person into its corresponding job with a tremendously high accuracy of more than 99%.

bert-model deep-learning exploratory-data-analysis-eda fine-tuning-bert model-evaluation nlp nlp-machine-learning text-preprocessing text-tokenization word-embeddings word-embeddings-for-nlp

Last synced: 03 Apr 2025

https://github.com/varun-khorgade/churnshield-customer-retention-predictor

Built an ML-based classification model to predict customer churn. Applied data preprocessing, feature engineering, and ensemble algorithms to improve prediction accuracy and help businesses implement retention strategies.

classification-algorithm datapreprocessing f1-score feature-engineering hyperparameter-tuning logistic-regression matplotlib model-evaluation numpy pandas python ran roc-auc scikit-learn seaborn xgboost

Last synced: 07 May 2026

https://github.com/alpha-one-index/ai-llmops-index

Comprehensive LLMOps reference index: observability platforms, inference cost intelligence, failure mode taxonomy, stack compatibility matrices, and regulatory compliance mapping for LLMs in production.

ai-compliance ai-governance ai-infrastructure ai-observability ai-safety awesome-list llm-benchmarks llm-cost-comparison llm-evaluation llm-failure-modes llm-gateway llm-inference-cost llm-monitoring llm-observability llm-pricing llm-security llmops mlops model-evaluation prompt-engineering

Last synced: 13 Mar 2026

https://github.com/kashifmoin1410/computer-vision-traditional-vs.-deep-learning-approaches

This project compares traditional Bag-of-Words with SVM and a custom ResNet-style CNN for image classification on the CIFAR-10 dataset. It covers the full workflow: feature extraction, model building, training, evaluation, and visualization. Results demonstrate the superior accuracy and robustness of deep learning models over classic ML pipelines.

bag-of-words cifar10 cnn comparative-analysis computer-vision deep-learning feature-extraction image-classification keras knn-classification machine-learning model-evaluation neural-network python3 resnet scikit-learn sift-algorithm svm-classifier

Last synced: 06 May 2026

https://github.com/donishadsmith/vswift

A R package for evaluating ML classification models.

classification cross-validation data-science machine-learning model-evaluation r

Last synced: 21 Apr 2026

https://github.com/ehtisham-sadiq/building-an-ml-based-heart-disease-diagnosis-system-with-flask

It is an end-to-end project that combines machine learning to create a user-friendly Heart Disease Diagnosis System, powered by Flask.

data-analysis exploratory-data-analysis feature-engineering flask machine-learning model-building model-evaluation pipelines python3 rest-api

Last synced: 04 May 2026

https://github.com/mohammadvhossein/ml-gym

The ML-GYM repository showcases machine learning projects using **scikit-learn**, covering classification, regression, and clustering. It offers educational resources for beginners and practical examples for experienced users, complete with detailed instructions.

classification-algorithms clustering-methods cross-validation data-preprocessing data-science decision-trees feature-engineering machine-learning model-evaluation neural-networks python-programming random-forests regression-techniques scikit-learn supervised-learning unsupervised-learning

Last synced: 06 May 2026

https://github.com/savinrazvan/traffic

This project aims to develop a neural network using TensorFlow to classify traffic signs from images, utilizing the German Traffic Sign Recognition Benchmark (GTSRB) dataset.

ai cnn data-augmentation data-preprocessing deep-learning gtsrb image-recognition machine-learning model-evaluation model-training opencv tensorflow traffic-sign-classification

Last synced: 06 May 2026

https://github.com/avijit-jana/cnn-architectures-benchmark

A comparative benchmark of popular Convolutional Neural Network architectures (LeNet‑5, AlexNet, GoogLeNet, ResNet, Xeception) on MNIST, Fashion‑MNIST and CIFAR‑10 using PyTorch. Includes analysis of loss curves, accuracy, precision, recall and F1‑scores.

benchmarking cifar-10 cnn cnn-classification computer-vision deep-learning fmnist image-classification machine-learning mnist model-evaluation open-source pytorch tensorflow

Last synced: 08 May 2026

https://github.com/sedatdikbas/cifar10-transfer-learning

Bu proje, CIFAR-10 veri seti üzerinde transfer öğrenme yöntemlerini kullanarak görüntü sınıflandırma yapmayı amaçlamaktadır. Farklı derin öğrenme modelleri (VGG16, InceptionV3, ResNet50, MobileNetV2, DenseNet121) ile eğitim yapılmış ve her modelin performansı karışıklık matrisleri ile değerlendirilmiştir.

cifar10 data-augmentation deep-learning image-classification machine-learning model-evaluation neural-networks python tensorflow

Last synced: 12 May 2026

https://github.com/davidogalo/twitter-sentiment-analysis

Developed a sentiment analysis model to measure tweet positivity across regions using advanced NLP techniques. This project involved data preprocessing, feature engineering with TF-IDF and Doc2Vec, and training supervised machine learning models. Performance was validated using cross-validation and metrics like accuracy and precision

cross-validation data-preprocessing feature-engineering machine-learning model-evaluation model-training-and-tuning natural-language-processing performance-metrics

Last synced: 16 May 2025

https://github.com/transcentlin/api-probe-platform

先进的大模型 API 多服务商性能与兼容性评估平台 | An advanced, multi-provider LLM API performance benchmark and evaluation platform.

api-benchmark api-evaluation deepseek fastapi llm-benchmark llm-evaluation llm-testing model-evaluation ollama openai-compatibility python react-dashboard tool-calling

Last synced: 17 Jun 2026

https://github.com/lucianoscarpaci/vgg16-item-classification

This project accurately classifies waste items using transfer learning and fine-tuning techniques. The project utilizes the waste item dataset for training and evaluation.

classification convolutional-neural-networks data-preprocessing dataset deep-learning fine-tuning image-classification keras model-evaluation model-training performance-metrics tensorflow transfer-learning

Last synced: 08 May 2026

https://github.com/ayax537/codsoft-task2

Second task on CodSoft Internship Transaction Fraud Detection! During my CodSoft internship, I worked on a challenging project focused on detecting fraudulent credit card transactions

data-modelling data-visualization eda machine-learning model-evaluation

Last synced: 29 Oct 2025

https://github.com/sayamalt/customer-churn-prediction

Successfully established a machine learning model which can predict whether any given customer currently utilizing the products and services offered by a company will churn at anytime in the future or not, depending upon a set of unique features/characteristics pertaining to that specific individual, to a great level of accuracy.

classification data-visualization exploratory-data-analysis feature-engineering hyperparameter-tuning model-deployment model-evaluation model-optimization model-training supervised-machine-learning

Last synced: 09 Nov 2025

https://github.com/konnik88/heart-disease-ml-practice

Practice notebook on heart-disease risk with a small/noisy dataset: EDA → preprocessing → classic ML baselines (scikit-learn). Not for clinical use

classification eda healthcare heart-disease imbalanced-data jupyter-notebook machine-learning model-evaluation optuna reproducibility scikit-learn

Last synced: 18 May 2026

https://github.com/naveen88112/final_education

Student Performance Prediction This project examines the student performance data, pre-processes the features, and implements machine learning methods (Random Forest) for the forecasting of final grades. The data is subjected to exploratory data analysis (EDA) and feature engineering prior to model training and assessment.

data-preprocessing exploratory-data-analysis machine-learning model-evaluation numpy pandas python

Last synced: 11 Apr 2026

https://github.com/piras-s/tuningcurvesnestedbayesianinference

Bayesian inference of neural tuning curves using nested sampling (PyMultiNest), with theory, simulation, and diagnostic visualizations.

bayesian-inference data-visualization machine-learning model-evaluation nested-sampling neuroscience pymultinest python3 simulation

Last synced: 18 May 2026

https://github.com/monish-nallagondalla/positional-salaries

This project implements a Decision Tree Regression model to predict salaries based on position levels using the dataset Position_Salaries.csv. The dataset contains position levels and their corresponding salaries. The model uses a decision tree to learn the relationships between position levels and salaries and predict salaries for any given positi

data-preprocessing decision-tree-algorithm decision-tree-regression feature-engineering ml-for-salary-estimation model-evaluation position-level-prediction regression-models salary-prediction supervised-learning

Last synced: 25 Jun 2025