https://github.com/queelius/compositional.mle

Composable MLE solvers: a DSL for maximum likelihood estimation where solvers are first-class functions that combine via chaining, racing, and restarts
composable dsl estimation maximum-likelihood mle mle-estimation numerical-methods optimization r-package statistics

Host: GitHub
URL: https://github.com/queelius/compositional.mle
Owner: queelius
License: other
Created: 2023-05-06T22:02:22.000Z (about 3 years ago)
Default Branch: master
Last Pushed: 2026-02-01T07:18:40.000Z (5 months ago)
Last Synced: 2026-02-01T16:34:43.868Z (5 months ago)
Topics: composable, dsl, estimation, maximum-likelihood, mle, mle-estimation, numerical-methods, optimization, r-package, statistics
Language: R
Homepage: https://queelius.github.io/compositional.mle/
Size: 5.52 MB
Stars: 0
Watchers: 1
Forks: 0
Open Issues: 0
Metadata Files:
- Readme: README.Rmd
- License: LICENSE

---
output: github_document
---

```{r, include = FALSE}
knitr::opts_chunk$set(
  collapse = TRUE,
  comment = "#>",
  fig.path = "man/figures/README-",
  out.width = "100%"
)
```

# compositional.mle

[![R-CMD-check](https://github.com/queelius/compositional.mle/workflows/R-CMD-check/badge.svg)](https://github.com/queelius/compositional.mle/actions)

An R package for **composable maximum likelihood estimation**. Solvers are first-class functions that combine via sequential chaining, parallel racing, and random restarts.

## When to Use This Package

**Use compositional.mle when:**

- **Multi-modal likelihoods**: Your likelihood surface has multiple local optima and you need global search strategies (simulated annealing, random restarts)

- **Coarse-to-fine optimization**: You want to start with a rough global search and progressively refine with local methods

- **Comparing strategies**: You're unsure which optimizer works best and want to race them automatically

- **Building robust pipelines**: You need reliable estimation that handles edge cases gracefully

- **Research/experimentation**: You want to explore optimization strategies and visualize convergence

**Stick with `optim()` when:**

- You have a simple, well-behaved likelihood with a single optimum

- You know exactly which method works and don't need composition

### Example: Why Composition Matters

```{r why-compose, message=FALSE}
library(compositional.mle)

# A tricky bimodal likelihood
set.seed(42)
bimodal_loglike <- function(theta) {
  # Two peaks: one at theta=2, one at theta=8
  log(0.3 * dnorm(theta, 2, 0.5) + 0.7 * dnorm(theta, 8, 0.5))
}

problem <- mle_problem(
  loglike = bimodal_loglike,
  constraint = mle_constraint(support = function(theta) TRUE)
)

# Single gradient ascent gets trapped at local optimum
result_local <- gradient_ascent()(problem, theta0 = 0)

# Simulated annealing + gradient ascent finds global optimum
strategy <- sim_anneal(temp_init = 5, max_iter = 200) %>>% gradient_ascent()
result_global <- strategy(problem, theta0 = 0)

cat("Local search found:", round(result_local$theta.hat, 2),
    "(log-lik:", round(result_local$loglike, 2), ")\n")
cat("Global strategy found:", round(result_global$theta.hat, 2),
    "(log-lik:", round(result_global$loglike, 2), ")\n")
```

## Installation

```r
# From CRAN (when available)
install.packages("compositional.mle")

# Development version
devtools::install_github("queelius/compositional.mle")
```

## Design Philosophy

Following the design principles of *Structure and Interpretation of Computer Programs* (SICP), the package provides:

1. **Primitive solvers** - `gradient_ascent()`, `newton_raphson()`, `bfgs()`, `sim_anneal()`, etc.

2. **Composition operators** - `%>>%` (sequential), `%|%` (race), `with_restarts()`

3. **Closure property** - Combining solvers yields a solver
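
The closure property can be illustrated with plain higher-order functions. The sketch below is not the package's implementation: `make_optim_solver()`, `chain()`, and `race()` are hypothetical names, and a solver is assumed to be any function of `(loglike, theta0)` that returns a list with `theta.hat` and `loglike`. Only base R's `optim()` is used.

```r
# A solver factory wrapping base R's optim(); it returns a solver function
make_optim_solver <- function(method) {
  function(loglike, theta0) {
    # fnscale = -1 makes optim() maximize instead of minimize
    fit <- optim(theta0, loglike, method = method,
                 control = list(fnscale = -1))
    list(theta.hat = fit$par, loglike = fit$value)
  }
}

# Sequential composition: the second solver starts where the first ended
chain <- function(s1, s2) {
  function(loglike, theta0) s2(loglike, s1(loglike, theta0)$theta.hat)
}

# Racing: run both from the same start, keep the higher log-likelihood
race <- function(s1, s2) {
  function(loglike, theta0) {
    r1 <- s1(loglike, theta0)
    r2 <- s2(loglike, theta0)
    if (r1$loglike >= r2$loglike) r1 else r2
  }
}

# Composites have the same signature as primitives, so they nest freely
nm   <- make_optim_solver("Nelder-Mead")
bfgs <- make_optim_solver("BFGS")
strategy <- race(chain(nm, bfgs), bfgs)

set.seed(1)
y <- rnorm(50, mean = 3, sd = 1)
# Log-likelihood in (mu, log sigma), so the search is unconstrained
loglike <- function(theta) sum(dnorm(y, theta[1], exp(theta[2]), log = TRUE))
fit <- strategy(loglike, c(0, 0))
```

Because `chain()` and `race()` return functions with the same `(loglike, theta0)` signature they accept, the result of one combinator can be passed straight into another, which is the property item 3 describes.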

## Quick Start

```{r example, message=FALSE}
# Generate sample data
set.seed(42)
x <- rnorm(100, mean = 5, sd = 2)

# Define the problem (separate from solver strategy)
problem <- mle_problem(
  loglike = function(theta) {
    if (theta[2] <= 0) return(-Inf)
    sum(dnorm(x, theta[1], theta[2], log = TRUE))
  },
  score = function(theta) {
    mu <- theta[1]; sigma <- theta[2]; n <- length(x)
    c(sum(x - mu) / sigma^2,
      -n / sigma + sum((x - mu)^2) / sigma^3)
  },
  constraint = mle_constraint(
    support = function(theta) theta[2] > 0,
    project = function(theta) c(theta[1], max(theta[2], 1e-8))
  )
)

# Simple solve
result <- gradient_ascent()(problem, theta0 = c(0, 1))
result$theta.hat
```
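
For the normal model the MLE has a closed form, which gives a package-independent check on both the score function and any solver's output. The sketch below uses only base R; `loglike` and `score` are copied from the problem definition above, while the evaluation point `c(4, 3)` and the finite-difference step `h` are arbitrary choices made for illustration.

```r
set.seed(42)
x <- rnorm(100, mean = 5, sd = 2)

loglike <- function(theta) sum(dnorm(x, theta[1], theta[2], log = TRUE))
score <- function(theta) {
  mu <- theta[1]; sigma <- theta[2]; n <- length(x)
  c(sum(x - mu) / sigma^2,
    -n / sigma + sum((x - mu)^2) / sigma^3)
}

# Closed-form MLE: sample mean and the 1/n (not 1/(n - 1)) standard deviation
theta_mle <- c(mean(x), sqrt(mean((x - mean(x))^2)))

# The score should vanish at the MLE
score_at_mle <- score(theta_mle)

# Central finite differences should agree with the analytic score elsewhere
h <- 1e-5
theta <- c(4, 3)
num_score <- sapply(1:2, function(i) {
  e <- replace(numeric(2), i, h)  # step of size h in coordinate i
  (loglike(theta + e) - loglike(theta - e)) / (2 * h)
})
max_abs_diff <- max(abs(num_score - score(theta)))
```

A solver's `theta.hat` can then be compared against `theta_mle`, and a large `max_abs_diff` would point to a mistake in a hand-written score.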

## Composing Solvers

### Sequential Chaining (`%>>%`)

Chain solvers for coarse-to-fine optimization:

```{r sequential}
# Grid search -> gradient ascent -> Newton-Raphson
strategy <- grid_search(lower = c(-10, 0.5), upper = c(10, 5), n = 5) %>>%
  gradient_ascent(max_iter = 50) %>>%
  newton_raphson(max_iter = 20)

result <- strategy(problem, theta0 = c(0, 1))
result$theta.hat
```
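
Coarse-to-fine search can also be sketched without the package: evaluate the log-likelihood on a grid, then hand the best grid point to a local optimizer. This is a base R illustration, not the package's `grid_search()`. The grid bounds mirror the call above, the 5 x 5 layout is an assumption about what `n = 5` means, and the data are regenerated so the block runs on its own.

```r
set.seed(42)
x <- rnorm(100, mean = 5, sd = 2)
loglike <- function(theta) {
  if (theta[2] <= 0) return(-Inf)
  sum(dnorm(x, theta[1], theta[2], log = TRUE))
}

# Coarse stage: evaluate a 5 x 5 grid over (mu, sigma)
grid <- expand.grid(mu = seq(-10, 10, length.out = 5),
                    sigma = seq(0.5, 5, length.out = 5))
grid$loglike <- apply(grid[, c("mu", "sigma")], 1, loglike)
start <- unlist(grid[which.max(grid$loglike), c("mu", "sigma")])

# Fine stage: L-BFGS-B keeps sigma positive through a lower bound;
# fnscale = -1 makes optim() maximize
fit <- optim(start, loglike, method = "L-BFGS-B",
             lower = c(-Inf, 1e-8), control = list(fnscale = -1))
theta_hat <- fit$par
```

The grid only has to land in the right basin; the local method supplies the precision, which is why a coarse grid is usually enough for the first stage.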

### Parallel Racing (`%|%`)

Race multiple methods, keep the best:

```{r race}
# Try multiple approaches, pick winner by log-likelihood
strategy <- gradient_ascent() %|% bfgs() %|% nelder_mead()

result <- strategy(problem, theta0 = c(0, 1))
c(result$theta.hat, loglike = result$loglike)
```

### Random Restarts

Escape local optima with multiple starting points:

```{r restarts}
strategy <- with_restarts(
  gradient_ascent(),
  n = 10,
  sampler = uniform_sampler(c(-10, 0.5), c(10, 5))
)

result <- strategy(problem, theta0 = c(0, 1))
result$theta.hat
```
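
Random restarts need little machinery. The following base R sketch is a stand-in, not the package's `with_restarts()`: the helper `restart_optim()` is hypothetical, starts are drawn uniformly from an interval, and the best local fit by log-likelihood is kept. It is run on a two-mode mixture log-likelihood so that the choice of start matters.

```r
bimodal_loglike <- function(theta) {
  log(0.3 * dnorm(theta, 2, 0.5) + 0.7 * dnorm(theta, 8, 0.5))
}

# Hypothetical helper: n local BFGS fits from uniform random starts
restart_optim <- function(loglike, n, lower, upper) {
  starts <- runif(n, lower, upper)
  fits <- lapply(starts, function(s) {
    # fnscale = -1 makes optim() maximize
    fit <- optim(s, loglike, method = "BFGS", control = list(fnscale = -1))
    list(theta.hat = fit$par, loglike = fit$value, start = s)
  })
  # Keep the fit with the highest log-likelihood
  fits[[which.max(vapply(fits, function(f) f$loglike, numeric(1)))]]
}

set.seed(42)
best <- restart_optim(bimodal_loglike, n = 10, lower = 0, upper = 10)
```

Each individual fit may stop at either mode; taking the maximum over fits is what makes the overall strategy robust to an unlucky start.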

## Visualization

Track and visualize the optimization path:

```{r visualization, fig.height=4, fig.width=8}
# Enable tracing
trace_cfg <- mle_trace(values = TRUE, gradients = TRUE, path = TRUE)

result <- gradient_ascent(max_iter = 50)(problem, c(0, 1), trace = trace_cfg)

# Plot convergence
plot(result, which = c("loglike", "gradient"))
```

Extract the trace as a data frame for custom analysis:

```{r trace-df}
path_df <- optimization_path(result)
head(path_df)
```

## Available Solvers

| Factory | Method | Best For |
|---------|--------|----------|
| `gradient_ascent()` | Steepest ascent with line search | General purpose, smooth likelihoods |
| `newton_raphson()` | Second-order Newton | Fast convergence near optimum |
| `bfgs()` | Quasi-Newton BFGS | Good balance of speed/robustness |
| `lbfgsb()` | L-BFGS-B with box constraints | High-dimensional, bounded parameters |
| `nelder_mead()` | Simplex (derivative-free) | Non-smooth or noisy likelihoods |
| `sim_anneal()` | Simulated annealing | Global optimization, multi-modal |
| `coordinate_ascent()` | One parameter at a time | Different parameter scales |
| `grid_search()` | Exhaustive grid | Finding starting points |
| `random_search()` | Random sampling | High-dimensional exploration |

## Function Transformers

```{r transformers, eval=FALSE}
# Stochastic gradient (mini-batching for large data)
loglike_sgd <- with_subsampling(loglike, data = x, subsample_size = 32)

# Regularization
loglike_l2 <- with_penalty(loglike, penalty_l2(), lambda = 0.1)
loglike_l1 <- with_penalty(loglike, penalty_l1(), lambda = 0.1)
```
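
A function transformer takes a log-likelihood and returns a modified log-likelihood with the same signature. The sketch below shows the idea for an L2 penalty in base R; `add_l2_penalty()` is a hypothetical helper written for illustration, and the package's `with_penalty()` may differ in signature and scaling.

```r
# Hypothetical transformer: subtract lambda * sum(theta^2) from the log-likelihood
add_l2_penalty <- function(loglike, lambda) {
  function(theta) loglike(theta) - lambda * sum(theta^2)
}

set.seed(42)
x <- rnorm(100, mean = 5, sd = 1)
# Normal log-likelihood in mu, with sigma fixed at 1
loglike <- function(mu) sum(dnorm(x, mu, 1, log = TRUE))
penalized <- add_l2_penalty(loglike, lambda = 10)

# One-dimensional maximization with base R's optimize()
maximize <- function(f) optimize(f, interval = c(-20, 20), maximum = TRUE)$maximum
mu_mle <- maximize(loglike)
mu_pen <- maximize(penalized)

# For this model the penalized maximizer is sum(x) / (n + 2 * lambda)
mu_pen_exact <- sum(x) / (length(x) + 2 * 10)
```

The penalized estimate is pulled toward zero relative to the sample mean, and because `penalized` is still an ordinary function of `theta`, it can be passed to any optimizer that accepts a log-likelihood.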

## Documentation

- Full documentation: <https://queelius.github.io/compositional.mle/>

- Vignettes:

  - [Getting Started](https://queelius.github.io/compositional.mle/articles/getting-started.html)

  - [Case Studies](https://queelius.github.io/compositional.mle/articles/case-studies.html)

  - [Theory and Intuition](https://queelius.github.io/compositional.mle/articles/theory-and-intuition.html)

## License

MIT
