https://github.com/akai01/ngboost

An R interface to the NGBoost.
https://github.com/akai01/ngboost

ngboost pyhon r

Last synced: 6 months ago
JSON representation

An R interface to the NGBoost.

Host: GitHub
URL: https://github.com/akai01/ngboost
Owner: Akai01
License: other
Created: 2021-09-08T18:14:35.000Z (over 4 years ago)
Default Branch: master
Last Pushed: 2022-11-08T20:05:10.000Z (about 3 years ago)
Last Synced: 2025-03-22T15:51:43.343Z (10 months ago)
Topics: ngboost, pyhon, r
Language: R
Homepage:
Size: 210 KB
Stars: 5
Watchers: 1
Forks: 3
Open Issues: 0
Metadata Files:
- Readme: README.Rmd
- Changelog: NEWS.md
- License: LICENSE.md

Awesome Lists containing this project

README

          ---

output: github_document

---

```{r, include = FALSE}

knitr::opts_chunk$set(

  collapse = TRUE,

  comment = "#>",

  fig.path = "man/figures/README-",

  out.width = "100%"

)

```

# ngboost

The goal of ngboost is to provide an R interface for the Python package [NGBoost](https://stanfordmlgroup.github.io/ngboost/intro.html). 

## What is Natural Gradient Boosting?

"NGBoost is a method for probabilistic prediction with competitive state-of-the-art performance on a variety of datasets. NGBoost combines a multiparameter boosting algorithm with the natural gradient to efficiently estimate how parameters of the presumed outcome distribution vary with the observed features. NGBoost performs as well as existing methods for probabilistic regression but retains major advantages: NGBoost is flexible, scalable, and easy-to-use." (From the paper, Duan, et at., 2019, [see here](https://arxiv.org/pdf/1910.03225.pdf))

## Installation

The development version from [GitHub](https://github.com/) with:

``` r

# install.packages("devtools")

devtools::install_github("Akai01/ngboost")

```

## Example

A probabilistic regression example on the Boston housing dataset: 

```{r example_regression}

library(ngboost)

data(Boston, package = "MASS")

dta <- rsample::initial_split(Boston)

train <- rsample::training(dta)

test <- rsample::testing(dta)

x_train = train[,1:13]

y_train = train[,14]

x_test = test[,1:13]

y_test = test[,14]

model <- NGBRegression$new(Dist = Dist("Exponential"),

                           Base = sklearner(),

                           Score = Scores("MLE"),

                           natural_gradient =TRUE,

                           n_estimators = 600,

                           learning_rate = 0.002,

                           minibatch_frac = 0.8,

                           col_sample = 0.9,

                           verbose = TRUE,

                           verbose_eval = 100,

                           tol = 1e-5)

model$fit(X = x_train, Y = y_train, X_val = x_test, Y_val = y_test)

model$feature_importances()

model$plot_feature_importance()

model$predict(x_test)%>%head()

distt <- model$pred_dist(x_test) # it returns a NGBDist

class(distt)

?NGBDist # see the available methods

distt$interval(confidence = .9)

```

Classification example:

```{r example_class, echo=TRUE, warning = FALSE, message = FALSE}

data(BreastCancer, package = "mlbench")

dta <- na.omit(BreastCancer)

dta <- rsample::initial_split(dta)

train <- rsample::training(dta)

test <- rsample::testing(dta)

x_train = train[,2:10]

y_train = as.integer(train[,11])

x_test = test[,2:10]

y_test = as.integer(test[,11])

model <- NGBClassifier$new(Dist = Dist("k_categorical", k = 3),

                           Base = sklearner(),

                           Score = Scores("LogScore"),

                           natural_gradient = TRUE,

                           n_estimators = 100,

                           tol = 1e-5,

                           random_state = NULL)

model$fit(x_train, y_train, X_val = x_test, Y_val = y_test)

model$feature_importances()

model$plot_feature_importance()

model$predict(x_test)

model$predict_proba(x_test)%>%head()

```

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/akai01/ngboost

Awesome Lists containing this project

README