An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with catboost

A curated list of projects in awesome lists tagged with catboost .

https://github.com/catboost/catboost

A fast, scalable, high performance Gradient Boosting on Decision Trees library, used for ranking, classification, regression and other machine learning tasks for Python, R, Java, C++. Supports computation on CPU and GPU.

big-data catboost categorical-features coreml cuda data-mining data-science decision-trees gbdt gbm gpu gpu-computing gradient-boosting kaggle machine-learning python r tutorial

Last synced: 12 May 2025

https://github.com/xiaodaigh/jlboost.jl

A 100%-Julia implementation of Gradient-Boosting Regression Tree algorithms

catboost data-science gbdt gbrt lightgbm machine-learning tree tree-boosting-algorithms xgboost

Last synced: 17 Jan 2026

https://github.com/duoan/ijcai18-mama-ads-competition

IJCAI-18 阿里妈妈搜索广告转化预测初赛方案

admin catboost ctr data-processing ijcai-18 lightgbm tianchi

Last synced: 18 Jun 2026

https://github.com/auto-flow/auto-flow

AutoFlow : Automatic machine learning workflow modeling platform

automl catboost data-minig data-sicence lightgbm machine-learning workflow

Last synced: 04 Apr 2026

https://github.com/erdogant/hgboost

hgboost is a python package for hyper-parameter optimization for xgboost, catboost or lightboost using cross-validation, and evaluating the results on an independent validation set. hgboost can be applied for classification and regression tasks.

catboost crossvalidation gridsearch hyperoptimization lightboost machine-learning python xgboost

Last synced: 06 Apr 2026

https://github.com/bsharchilev/influence_boosting

Supporting code for the paper "Finding Influential Training Samples for Gradient Boosted Decision Trees"

catboost gradient-boosting influence-functions machine-learning machine-learning-algorithms paper python

Last synced: 27 Mar 2025

https://github.com/ashishpatel26/datascienv

datascienv is package that helps you to setup your environment in single line of code with all dependency and it is also include pyforest that provide single line of import all required ml libraries

catboost data-science data-science-env datascienv imbalanced-data lightgbm matplotlib numpy pandas pycaret scikit-learn seaborn tensorflow2 xgboost

Last synced: 24 Oct 2025

https://github.com/kwokhing/yandexcatboost-python-demo

Demo on the capability of Yandex CatBoost gradient boosting classifier on a fictitious IBM HR dataset obtained from Kaggle. Data exploration, cleaning, preprocessing and model tuning are performed on the dataset

catboost data-analysis data-preprocessing data-science feature-selection gradient-boosting gradient-boosting-classifier one-hot-encode pandas pearson-correlation python python27 seaborn variance-analysis visualization yandex-catboost

Last synced: 09 Apr 2025

https://github.com/mirecl/catboost-cgo

CatBoost a fast, scalable, high performance Gradient Boosting on Decision Trees library. Golang using Cgo for blazing fast inference CatBoost Model 🚀

catboost cgo deep-learning golang gradient-boosting inference

Last synced: 13 Oct 2025

https://github.com/marketcalls/openadvisor

Self Hostable - Personal Machine Learning Based Stock Recommendation Platform

catboost daisyui flask investment-analysis machine-learning python sqlite stock-price-prediction tailwindcss tradingview

Last synced: 13 Aug 2025

https://github.com/ahmed-maher77/wind-turbine-power-prediction-app-using-machine-learning

"Wind Power Predictor" is a machine learning project that forecasts turbine output using real-time data from Turkish wind farms. Its web app interface offers convenient access to predictions, enabling informed decisions for maximizing energy production and advancing renewable energy usage.

ai catboost data-analysis data-science flask html-css-javascript javascript machine-learning matplotlib numpy pandas predictive-modeling pwa python sklearn web web-development wind wind-turbine wind-turbine-operational-optimization

Last synced: 10 Apr 2025

https://github.com/koldim2001/time_series_theory

Решение задач по анализу временных рядов: детекция пиков QRS на сигналах ЭКГ с помощью ML, прогнозирование заболеваемости COVID с помощью LSTM и др.

catboost covid-19 ecg-qrs-detection ecg-signal econometrics fft-analysis forecasting gradient-boosting kalman-filter lstm-neural-networks machine-learning rcnn signal-processing time-series

Last synced: 29 Aug 2025

https://github.com/filipspl/degronopedia-ml-psi

Predict Protein Stability Index from the sequence

bioinformatics bioinformatics-analysis catboost degrons ml prediction

Last synced: 18 Sep 2025

https://github.com/davidromanovizc/data_fusion_contest

The solution that achieved 8th place in private. Data Fusion 2022

catboost python

Last synced: 26 Oct 2025

https://github.com/ahmedshahriar/customer-churn-prediction

Extensive EDA of the IBM telco customer churn dataset, implemented various statistical hypotheses tests and Performed single-level Stacking Ensemble and tuned hyperparameters using Optuna.

binary-classification catboost classification-models customer-churn-prediction ensemble-classifier hyperparameter-optimization kaggle lightgbm optuna pandas-python scipy stacking-ensemble xgboost

Last synced: 27 Sep 2025

https://github.com/nizarassad/stroke-prediction

This project studies the use of machine learning techniques to predict the long-term outcomes of stroke victims.

brain-stroke-prediction catboost decision-trees logistic-regression naive-bayes-classifier python random-forest svm xgboost

Last synced: 07 May 2025

https://github.com/34j/sklearn-utilities

Utilities for scikit-learn. Append prediction to x, append prediction to x single, append x prediction to x, compose var estimator, data frame wrapper, drop by noise prediction, drop missing rows y, dummy regressor var, estimator wrapper base, excluded column transformer pandas, feature union pandas, id transformer, included column transformer pand

catboost feature-engine feature-engineering multioutput pandas pca python pytorch regression scikit-learn sklearn sklearn-compatible skorch torch tqdm

Last synced: 13 Apr 2025

https://github.com/jasonzhu1313/kagglepipeline

This repository provides commonly used modules from feature engineering to model training for machine learning tasks and kaggle competition.

bayesian-optimization catboost feature-engineering feature-selection kaggle-competition kaggle-elo lightgbm machine-learning model-selection pipeline-as-code recommender-system xgboost

Last synced: 08 Apr 2025

https://github.com/owenodriscoll/automl

Python package for automated hyperparameter-optimization of common machine-learning algorithms

automl catboost classification hyperparameter-optimization lightgbm machine-learning optuna regression scikit-learn xgboost

Last synced: 16 Mar 2025

https://github.com/fatimaafzaal/multiple-ensemble-models-diabetes-prediction-project-

This project focuses on predicting the likelihood of diabetes in individuals using ensemble machine learning models. It combines various ensemble techniques, including Random Forest, AdaBoost, Gradient Boosting, Bagging, Extra Trees, XGBoost, Voting Classifier and some others to get predictions.

adaboost catboost colab-notebook diabetes diabetes-prediction ensemble-classifier ensemble-learning ensemble-machine-learning ensemble-model gradient-boosting machine-learning python stacking-classifier voting-classifier xgboost

Last synced: 18 Apr 2026

https://github.com/kriss024/anaconda-python-and-pytorch

Anaconda Python 3.8.8 with PyTorch Docker image

catboost data-science docker jupyter-notebook python pytorch xgboost

Last synced: 17 Apr 2026

https://github.com/kozistr/catboost-server-rs

CatBoost server in Rust + gRPC

catboost grpc machine-learning rust server serving

Last synced: 16 May 2026

https://github.com/screengreen/prediction-of-the-investors-class

Скрипт, который предсказывает класс инвестора по сделкам и начальному состоянию портфеля.

catboost classification lstm sklearn

Last synced: 01 May 2026

https://github.com/mubshr07/heartdiseaseprediction

This repo is the Machine Learning practice on NHANES dataset of Heart Disease prediction. The ML algorithms like LR, DT, RF, SVM, KNN, NB, MLP, AdaBoost, XGBoost, CatBoost, LightGBM, ExtraTree, etc. The results are good. I also explore the class-balancing (SMOTE) because the original dataset contains only 5% of patient and 95% of healthy record.

adaboost catboost dicision-tree extratreesclassifier knn-classification lightgbm logistic-regression machine-learning mlp-classifier mlp-networks navies-bayes-classifer nhanes nhanes-data random-forest svm svm-classifier xgboost

Last synced: 07 Oct 2025

https://github.com/errhythm/nyctaxifarepred-extended

NYC Taxi Fare Prediction with 7 models (Linear Regression, Random Forest, XGBoost, LightGBM, CatBoost, KNN, and Decision Tree) The models used range from simple linear regression to more complex ensemble methods such as boosting algorithms. The aim was to improve prediction accuracy and handle categorical features efficiently.

catboost decision-tree ensemble-model knn lightgbm nyc-taxi-dataset regression xgboost

Last synced: 29 Apr 2025

https://github.com/konnik88/blendcal-conversion-prediction

End-to-end ML pipeline for predicting conversion in web sessions: feature engineering, CatBoost+XGBoost+LightGBM ensemble with calibration, FastAPI service, Streamlit UI, Airflow DAG orchestration, Dockerized.

airflow catboost docker fastapi lightgbm machine-learning mlops streamlit xgboost

Last synced: 29 Apr 2026

https://github.com/deaneeth/telco-churn-prediction-mlops

Production-ready ML pipeline for telco customer churn prediction using advanced ensemble methods (XGBoost, CatBoost, Random Forest). Handles class imbalance, provides business insights, and includes modular MLOps architecture. Built with scikit-learn, featuring comprehensive EDA, feature engineering, and business impact analysis.

catboost data-preprocessing ensemble-methods feature-engineering machine-learning mlops pipeline-development python random-forest scikit-learn telco-analytics xgboost

Last synced: 15 Apr 2026

https://github.com/yantonov/ml-docker

Playground for common python ml libraries

anaconda catboost docker docker-image jupiter numpy pyplot python scipy sklearn

Last synced: 27 Feb 2026

https://github.com/mhmudfzli/exploring-mental-health-data

This project demonstrates a comprehensive approach to solving a regression problem using various machine learning models. The notebook includes: Data Preprocessing, Exploratory Data Analysis (EDA), Model Training, Hyperparameter Tuning, Model Evaluation, Feature Importance

catboost lightgbm matplotlib numpy pandas scikit-learn seaborn xgboost

Last synced: 09 Apr 2026

https://github.com/amr-yasser226/intrusion-detection-kaggle

End-to-end pipeline for multi-class cyber-attack detection using per-flow network features: data profiling, deduplication, skew-correction, outlier treatment, feature engineering, imbalance handling, and tree-based modeling (XGBoost, LightGBM, CatBoost, stacking), with a final Kaggle submission scoring 91.46% public / 91.63% private.

catboost cyber-security data-preprocessing ensemble-learning feature-engineering imbalanced-data jupyter-notebooks kaggle lightgbm machine-learning outlier-detection random-forest xgboost

Last synced: 18 May 2026

https://github.com/harmanveer2546/credit-card-fraud-detection

The Credit Card Fraud Detection Problem includes modeling past credit card transactions with the knowledge of the ones that turned out to be a fraud. This model is then used to identify whether a new transaction is fraudulent or not. Our aim here is to detect 100% of the fraudulent transactions while minimizing the incorrect fraud classifications.

ann catboost eda lightgbm machine-learning matplotlib neural-network numpy pandas python random-forest seaborn xgboost

Last synced: 11 Apr 2026

https://github.com/paulo-santos-ds/rotatividade_de_clientes

A operadora de comunicações InternetGO está interessada em prever a rotatividade de seus clientes (churn). Se for identificado que um usuário está planejando trocar de operadora, a empresa poderá oferecer códigos promocionais e opções de planos especiais para evitar a perda desse cliente.

catboost numpy pandas pyplot python seaborn sklearn

Last synced: 09 Apr 2026

https://github.com/priboy313/pandasflow

A set of custom python modules for friendly workflow on pandas

catboost data-analysis data-science pandas phik python scikit-learn shap

Last synced: 20 Jan 2026

https://github.com/romanthekat/drivendata-pump-it-up

https://www.drivendata.org/competitions/7/pump-it-up-data-mining-the-water-table/

catboost jupyter-notebook machine-learning ml python random-forest sklearn xgboost

Last synced: 09 May 2026

https://github.com/tanishq-ctrl/house-price-prediction-and-visualization

This repository contains code and data for analyzing real estate trends, predicting house prices, estimating time on the market, and building an interactive dashboard for visualization. It is structured to cater to data scientists, real estate analysts, and developers looking to understand property market dynamics.

catboost catboost-classifier catboostregressor dataanlaytics datavisualization-project housepriceprediction lightgbm lightgbm-classifier lightgbm-regressor machinelearningalgorithms r2score rmse-score xgboost xgboost-algorithm xgboost-classifier

Last synced: 22 Mar 2025

https://github.com/nicovandenhooff/wids-datathon-2022

This repository contains solution for the 2022 Women in Data Science Kaggle competition that I participated in, which obtained a top 10% leaderboard standing.

catboost data-visualization datascience energy-consumption ensemble-learning exploratory-data-analysis kaggle lightgbm machine-learning scikit-learn women-in-data-science xgboost

Last synced: 07 May 2026

https://github.com/egorumaev/2023-steel-energy

Предсказание температуры стали, выплавляемой на металлургическом комбинате

catboost featureengineering lightgbm pandas pipeline python3 regression sklearn xgboost

Last synced: 09 May 2026

https://github.com/saniyaabushakimova/ames-housing-price-prediction

Implemented regularized linear regression (Lasso, Ridge, ElasticNet) and tree-based models (Random Forest, XGBoost, CatBoost, LightGBM) to predict house prices in Ames, Iowa. The project explores feature engineering, outlier handling, and model tuning to improve predictive accuracy. Tech: Python (numpy, pandas, sklearn, catboost, os)

catboost elasticnet feature-engineering hyperparameter-tuning lasso-regression python random-forest ridge-regression xgboost

Last synced: 09 May 2026

https://github.com/hhrh/real-time-fraud-detection

A production-style real-time fraud detection pipeline using Kafka, FastAPI, XGBoost/CatBoost/LightGBM, Prometheus, and Grafana.

apache-kafka catboost data-visualization fastapi grafana lightgbm mlops prometheus python xgboost

Last synced: 10 May 2026

https://github.com/victoryfanfare/car-price-prediction

ML модель для определения рыночной стоимости автомобилей с пробегом. Проект включает анализ данных, feature engineering и сравнение различных алгоритмов машинного обучения.

catboost data-analysis jupyter-notebook lightgbm machine-learning pandas python regression

Last synced: 15 Jun 2026

https://github.com/aasmirnov-webdev/data_science_projects

Сборник всех выполненных учебных проектов курса Яндекс.Практикум "Специалист по Data Science".

bert catboost data-science database lgbm mashine-learning matplotlib numpy pandas python pytorch scikit-learn scipy seaborn sql xgboost

Last synced: 06 Apr 2026

https://github.com/gilevatanya/yandex-practicum-projects

Кейсы решенные на курсах Яндекс Практикума.

bert bootstrap catboost keras lightgbm matplotlib nltk numpy pandas postgresql python pytorch scikit-learn scipy seaborn sql

Last synced: 06 Jan 2026

https://github.com/hariprasath-v/intel_oneapi_hackerearth_predict-the-quality-of-freshwater

Build a machine model to predict whether the freshwater is safe to drink or not.Based on the measures like pH, TDS, etc.

catboost classification exploratory-data-analysis f1score lightgbm modin onedal pandas python3 shapash xgboost

Last synced: 19 Apr 2026

https://github.com/hariprasath-v/doceree_machine-learning-hackathon_round_1

Create a model that can accurately predict whether a user belongs to the HCP(Healthcare Professional) category or not. Based on server logs.

accuracy binaryclassification catboost exploratory-data-analysis machine-learning optuna python shap

Last synced: 03 May 2026

https://github.com/ai-naymul/churncrafter-ml-suite

this project leverages machine learning to predict customer churn, and we've built an interactive interface using Streamlit and containerized everything with Docker for easy deployment.

ai catboost catboost-model generative-ai machine-learning machine-learning-algorithms machinelearning mlops

Last synced: 18 Mar 2025

https://github.com/simeonhristov99/kickstarter

Course project for "Data Mining" university course.

catboost classification machine-learning pandas regression

Last synced: 28 Apr 2026

https://github.com/kriss024/anaconda-python-and-pytorch-lightning

Anaconda Python 3.8.8 with PyTorch Lightning Docker image

catboost data-science docker pyhton pytorch-lightning xgboost

Last synced: 07 Apr 2026

https://github.com/viniciusmecosta/cvclassifier

A REST API that classifies resumes into occupation fields and seniority levels using machine learning. Trained on 3,000+ resumes across 26 occupations, the API provides accurate classifications with efficient PDF text extraction.

catboost fastapi python3 sklearn spacy

Last synced: 07 Apr 2026

https://github.com/hariprasath-v/dphi-data-sprint-52---covid-19-sars-b-cell-epitope-prediction

Predicting epitope regions using a machine learning model

catboost optuna pandas python shap

Last synced: 08 May 2026

https://github.com/hariprasath-v/machinehack-odetocode_predicting_weather_using_alien_fruit_properties

Identify the type of climate the exoplanet has based on the properties of the fruit by using machine learning.

catboost machine-learning matplotlib numpy pandas seaborn shap sklearn

Last synced: 11 Apr 2026