https://dylanpieper.github.io/hellmer/

Batch Processing for Chat Models
Batch Processing for Chat Models
Host: GitHub
URL: https://dylanpieper.github.io/hellmer/
Owner: dylanpieper
License: other
Created: 2025-02-10T16:03:08.000Z (5 months ago)
Default Branch: main
Last Pushed: 2025-03-21T16:52:40.000Z (3 months ago)
Last Synced: 2025-04-02T22:51:13.707Z (3 months ago)
Topics: batch, batch-processing, ellmer, llm, package, r
Language: R
Homepage: https://dylanpieper.github.io/hellmer/
Size: 4.94 MB
Stars: 8
Watchers: 1
Forks: 0
Open Issues: 0
Metadata Files:
- Readme: README.md
- Changelog: NEWS.md
- License: LICENSE
Awesome Lists containing this project

awesome-generative-ai-data-scientist - hellmer
README

# hellmer

[![CRAN status](https://www.r-pkg.org/badges/version/hellmer)](https://cran.r-pkg.org/package=hellmer) [![R-CMD-check](https://github.com/dylanpieper/hellmer/actions/workflows/testthat.yml/badge.svg)](https://github.com/dylanpieper/hellmer/actions/workflows/testthat.yml)

Enable sequential and parallel batch processing for [chat models](https://ellmer.tidyverse.org/reference/index.html#chatbots) supported by `ellmer`.

## Features

Process multiple chat interactions with:

-   [Tooling](https://ellmer.tidyverse.org/articles/tool-calling.html) and [structured data extraction](https://ellmer.tidyverse.org/articles/structured-data.html)

-   Judgments (i.e., thinking or reasoning) for structured data refinement

-   Progress tracking and recovery

-   Automatic retry with backoff

-   Sound notifications

## Installation

You can install the package from CRAN with:

``` r
install.packages("hellmer")
```

## Setup API Keys

API keys allow access to chat models and are stored as environment variables. I recommend the `usethis` package to set up API keys in your `.Renviron`, such as `OPENAI_API_KEY=your-key`.

``` r
# Opens the user-level .Renviron; use scope = "project" for a project-level file
usethis::edit_r_environ(scope = "user")
```
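Because `.Renviron` is read when R starts, a key added this way is only visible after restarting the session. As a minimal sketch, the check below confirms that a key is set; `has_api_key()` is a hypothetical helper written for this example, not part of `hellmer` or `ellmer`.

```r
# Hypothetical helper (not part of hellmer or ellmer): returns TRUE if the
# named environment variable is set to a non-empty value.
has_api_key <- function(name = "OPENAI_API_KEY") {
  nzchar(Sys.getenv(name, unset = ""))
}

if (!has_api_key()) {
  message("OPENAI_API_KEY is not set; edit .Renviron and restart R.")
}
```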

## Basic Usage

### Sequential Processing

Sequential processing uses the current R process to call one chat at a time and saves the data to disk.

``` r
library(hellmer)

chat <- chat_sequential(chat_openai(system_prompt = "Reply concisely, one sentence"))

prompts <- list(
  "What is R?",
  "Explain base R versus tidyverse"
)

batch <- chat$batch(prompts)
```

Access the batch results:

``` r
batch$progress()
#> $total_prompts
#> [1] 2
#> 
#> $completed_prompts
#> [1] 2
#> 
#> $completion_percentage
#> [1] 100
#> 
#> $remaining_prompts
#> [1] 0
#> 
#> $state_path
#> [1] "/var/folders/.../chat_c5383b1279ae.rds"

batch$texts()
#> [[1]]
#> [1] "R is a programming language and software environment primarily used for 
#> statistical computing and data analysis."
#> 
#> [[2]]
#> [1] "Base R refers to the R language's core packages and functionalities, 
#> whereas Tidyverse is a collection of R packages designed for data science 
#> that provides a more intuitive and consistent syntax."

batch$chats()
#> [[1]]
#> 
#> ── system [0] ───────────────────────────────────────────────────────────────
#> Reply concisely, one sentence
#> ── user [22] ────────────────────────────────────────────────────────────────
#> What is R?
#> ── assistant [18] ───────────────────────────────────────────────────────────
#> R is a programming language and software environment primarily used for
#> statistical computing and data analysis.
#> [[2]]
#> 
#> ── system [0] ───────────────────────────────────────────────────────────────
#> Reply concisely, one sentence
#> ── user [24] ────────────────────────────────────────────────────────────────
#> Explain base R versus tidyverse
#> ── assistant [37] ───────────────────────────────────────────────────────────
#> Base R refers to the R language's core packages and functionalities, whereas 
#> Tidyverse is a collection of R packages designed for data science 
#> that provides a more intuitive and consistent syntax.
```

### Parallel Processing

Parallel processing spins up multiple R processes, or parallel workers, to chat at the same time.

By default, the upper limit on the number of `workers` is `parallel::detectCores()`, and the number of prompts processed at a time is `chunk_size = parallel::detectCores() * 5`. The chats in each chunk are distributed across the available R processes. When a chunk finishes, the data is saved to disk.

``` r
chat <- chat_future(chat_openai(system_prompt = "Reply concisely, one sentence"))
```

For maximum performance, set `chunk_size` to the number of prompts, which is \~4-5x faster. However, progress will not be saved to the disk until all chats are processed.

``` r
batch <- chat$batch(
  prompts,
  chunk_size = length(prompts)
)
```
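To make the relationship between `chunk_size` and checkpointing concrete, here is a minimal base R sketch of how a prompt list divides into chunks. It is illustrative only and not `hellmer`'s internal code; `split_into_chunks()` is a hypothetical helper, and the defaults are simply the ones described above.

```r
# Illustrative only: this is not hellmer's internal code.
prompts <- as.list(paste("Prompt", 1:12))

# Defaults as described in the text; detectCores() can return NA on some
# platforms, so fall back to a single core in that case.
cores <- parallel::detectCores()
if (is.na(cores)) cores <- 1L
workers <- cores
chunk_size <- cores * 5

# Hypothetical helper: split prompts into consecutive chunks of `size`.
split_into_chunks <- function(x, size) {
  unname(split(x, ceiling(seq_along(x) / size)))
}

# With a chunk size of 5, 12 prompts form three chunks, so progress
# would be saved three times.
chunks <- split_into_chunks(prompts, size = 5)
lengths(chunks)
#> [1] 5 5 2
```

Setting the size to `length(prompts)` in this sketch yields a single chunk, which matches the trade-off above: fewer save points in exchange for less overhead.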

## Features

### Tooling

Register and use tools/function calling:

``` r
get_current_time <- function(tz = "UTC") {
  format(Sys.time(), tz = tz, usetz = TRUE)
}

chat$register_tool(tool(
  get_current_time,
  "Gets the current time in the given time zone.",
  tz = type_string(
    "The time zone to get the current time in. Defaults to `\"UTC\"`.",
    required = FALSE
  )
))

prompts <- list(
  "What time is it in Chicago?",
  "What time is it in New York?"
)

batch <- chat$batch(prompts)

batch$texts()
#> [[1]]
#> [1] "The current time in Chicago is 9:29 AM CDT."
#> 
#> [[2]]
#> [1] "The current time in New York is 10:29 AM EDT."
```

### Structured Data Extraction

Extract structured data using type specifications:

``` r
type_sentiment <- type_object(
  "Extract sentiment scores",
  positive_score = type_number("Positive sentiment score, 0.00 to 1.00"),
  negative_score = type_number("Negative sentiment score, 0.00 to 1.00"),
  neutral_score = type_number("Neutral sentiment score, 0.00 to 1.00")
)

prompts <- list(
  "The R community is really supportive and welcoming.",
  "R has both base functions and tidyverse functions for data manipulation.",
  "R's object-oriented system is confusing, inconsistent, and painful to use."
)

batch <- chat$batch(prompts, type_spec = type_sentiment)

batch$texts()
#> [[1]]
#> $positive_score
#> [1] 0.95
#> 
#> $negative_score
#> [1] 0.05
#> 
#> $neutral_score
#> [1] 0
#> ...
```

To have the chat model evaluate and refine its structured data extractions, add iterative thinking or reasoning turns to the chat with the `judgements` parameter (this increases token use):

``` r
batch <- chat$batch(prompts, type_spec = type_sentiment, judgements = 1)

batch$texts()
#> [[1]]
#> [[1]]$positive_score
#> [1] 0.95
#> 
#> [[1]]$negative_score
#> [1] 0
#> 
#> [[1]]$neutral_score
#> [1] 0.05
#> ...
```

![Console output of LLM streaming the evaluation and refinement of the structured data extractions using `progress` = `FALSE` and `echo` = `TRUE`.](man/figures/judgements.gif)
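When a type specification is supplied, `batch$texts()` returns a nested list, which is often easier to work with as a data frame. The sketch below shows one base R way to flatten it, assuming every element has the same scalar fields. The `results` object is a stand-in for `batch$texts()`: its first element mirrors the output shown above, and the second is made up for illustration.

```r
# Stand-in for batch$texts(); the first element mirrors the output above,
# the second is a made-up value for illustration only.
results <- list(
  list(positive_score = 0.95, negative_score = 0, neutral_score = 0.05),
  list(positive_score = 0.10, negative_score = 0.10, neutral_score = 0.80)
)

# Turn each element into a one-row data frame, then stack the rows.
scores <- do.call(rbind, lapply(results, as.data.frame))
scores
```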

### Progress Tracking and Recovery

Batch processing automatically saves progress to an `.rds` file on the disk and allows you to resume interrupted operations:

``` r
batch <- chat$batch(prompts, state_path = "chat_state.rds")

batch$progress()
```

If `state_path` is not defined, a temporary file will be created by default.
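The general save-and-resume pattern can be sketched in base R with `saveRDS()` and `readRDS()`. This is not `hellmer`'s implementation; `process_with_checkpoint()` and `process_one` are hypothetical names used only to show the idea of skipping prompts that were already completed in an earlier run.

```r
# Illustrative only: not hellmer's internal code.
# `process_one` stands in for a single chat call.
process_with_checkpoint <- function(prompts, process_one, state_path) {
  # Resume from saved state if it exists, otherwise start fresh.
  results <- if (file.exists(state_path)) readRDS(state_path) else list()
  start <- length(results) + 1
  for (i in seq_along(prompts)) {
    if (i < start) next            # already completed in an earlier run
    results[[i]] <- process_one(prompts[[i]])
    saveRDS(results, state_path)   # checkpoint after every prompt
  }
  results
}

state_path <- tempfile(fileext = ".rds")
out <- process_with_checkpoint(list("a", "b", "c"), toupper, state_path)
```

If the loop is interrupted partway through, calling the function again with the same `state_path` picks up at the first unfinished prompt.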

### Automatic Retry

Automatically retry failed requests with exponential backoff, which acts as a wide guardrail against temporary API errors. `ellmer` uses `httr2` as a narrow guardrail against specific API errors and limits, with most chat provider functions defaulting to a single retry.

Be aware that this retry is a brute-force approach: as long as all other validation passes, the retry will persist. However, it will stop if it detects an authorization or API key issue.

``` r
batch <- chat$batch(
  prompts = prompts,   # list or vector of prompts
  max_retries = 3,     # maximum retry attempts
  initial_delay = 20,  # initial delay in seconds
  max_delay = 80,      # maximum delay between retries
  backoff_factor = 2   # multiply delay by this factor after each retry
)
```
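To see what these settings imply, the sketch below computes the wait before each retry. It assumes the delay starts at `initial_delay`, is multiplied by `backoff_factor` after each retry, and is capped at `max_delay`, as the parameter comments above describe; `retry_delays()` is a hypothetical helper and not part of `hellmer`.

```r
# Hypothetical helper (not part of hellmer). Assumes the delay starts at
# `initial_delay`, is multiplied by `backoff_factor` after each retry,
# and is capped at `max_delay`.
retry_delays <- function(max_retries = 3, initial_delay = 20,
                         max_delay = 80, backoff_factor = 2) {
  attempts <- seq_len(max_retries)
  pmin(initial_delay * backoff_factor^(attempts - 1), max_delay)
}

retry_delays()
#> [1] 20 40 80

retry_delays(max_retries = 5)
#> [1] 20 40 80 80 80
```

Under these assumptions, the three retries in the example above would wait up to 140 seconds in total before giving up.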

### Sound Notifications

Toggle sound notifications on batch completion, interruption, and error:

``` r
chat <- chat_sequential(
  chat_openai,
  beep = TRUE
)
```

### Echoing

By default, the chat `echo` is set to `FALSE` to show a progress bar. However, you can still configure `echo` in the `$batch` call by first setting `progress` to `FALSE`:

``` r
batch <- chat$batch(prompts, progress = FALSE, echo = "all")
#> > What is R?
#> < R is a programming language and software environment used for statistical computing,
#> < data analysis, and graphical representation.
#> < 
#> > Explain base R versus tidyverse
#> < Base R refers to the functions and paradigms built into the R language, while
#> < tidyverse is a collection of R packages designed for data science, emphasizing 
#> < a more consistent and human-readable syntax for data manipulation.
#> < 
```

### Methods

-   `progress()`: Returns processing status

-   `texts()`: Returns response texts in the same format as the input prompts (i.e., a list if prompts were provided as a list, or a character vector if prompts were provided as a vector). When a type specification is provided, it returns structured data instead of plain text.

-   `chats()`: Returns a list of chat objects

## Further Reading

-   [Using Ellmer Chat Models](https://dylanpieper.github.io/hellmer/articles/using-chat-models.html)