https://github.com/ollama/ollama-js

Ollama JavaScript library
https://github.com/ollama/ollama-js

javascript js ollama

Last synced: 5 months ago
JSON representation

Ollama JavaScript library

Host: GitHub
URL: https://github.com/ollama/ollama-js
Owner: ollama
License: mit
Created: 2023-09-13T22:58:51.000Z (about 2 years ago)
Default Branch: main
Last Pushed: 2025-04-16T21:55:33.000Z (6 months ago)
Last Synced: 2025-05-07T22:00:01.655Z (5 months ago)
Topics: javascript, js, ollama
Language: TypeScript
Homepage: https://ollama.com
Size: 528 KB
Stars: 3,347
Watchers: 28
Forks: 298
Open Issues: 61
Metadata Files:
- Readme: README.md
- License: LICENSE

Awesome Lists containing this project

stars - ollama-js
stars - ollama-js
StarryDivineSky - ollama/ollama-js - js是Ollama项目的JavaScript客户端库，用于与Ollama模型服务进行交互，支持在浏览器和Node.js环境中运行大型语言模型。该项目通过封装Ollama的REST API接口，提供便捷的模型加载、推理和管理功能，用户可直接调用Llama、Mistral等开源模型进行文本生成、对话等任务。核心特性包括支持异步模型推理、模块化设计、跨平台兼容性以及对模型参数的灵活配置。工作原理基于HTTP协议与Ollama服务通信，通过发送JSON格式的请求实现模型调用，同时提供类型安全的TypeScript接口和简洁的API封装。项目适用于需要在JavaScript生态中集成本地化大模型服务的场景，如开发AI聊天机器人、自动化文本处理工具等。开发者可通过npm安装依赖，结合Ollama服务快速构建应用，且支持自定义模型参数和流式输出处理，满足不同场景下的性能需求。该项目持续更新维护，社区活跃度高，是连接JavaScript应用与Ollama模型服务的重要桥梁。 (A01_文本生成_文本对话 / 大语言对话模型及数据)

README

          # Ollama JavaScript Library

The Ollama JavaScript library provides the easiest way to integrate your JavaScript project with [Ollama](https://github.com/jmorganca/ollama).

## Getting Started

```

npm i ollama

```

## Usage

```javascript

import ollama from 'ollama'

const response = await ollama.chat({

  model: 'llama3.1',

  messages: [{ role: 'user', content: 'Why is the sky blue?' }],

})

console.log(response.message.content)

```

### Browser Usage

To use the library without node, import the browser module.

```javascript

import ollama from 'ollama/browser'

```

## Streaming responses

Response streaming can be enabled by setting `stream: true`, modifying function calls to return an `AsyncGenerator` where each part is an object in the stream.

```javascript

import ollama from 'ollama'

const message = { role: 'user', content: 'Why is the sky blue?' }

const response = await ollama.chat({ model: 'llama3.1', messages: [message], stream: true })

for await (const part of response) {

  process.stdout.write(part.message.content)

}

```

## API

The Ollama JavaScript library's API is designed around the [Ollama REST API](https://github.com/jmorganca/ollama/blob/main/docs/api.md)

### chat

```javascript

ollama.chat(request)

```

- `request` ``: The request object containing chat parameters.

  - `model` `` The name of the model to use for the chat.

  - `messages` ``: Array of message objects representing the chat history.

    - `role` ``: The role of the message sender ('user', 'system', or 'assistant').

    - `content` ``: The content of the message.

    - `images` ``: (Optional) Images to be included in the message, either as Uint8Array or base64 encoded strings.

  - `format` ``: (Optional) Set the expected format of the response (`json`).

  - `stream` ``: (Optional) When true an `AsyncGenerator` is returned.

  - `keep_alive` ``: (Optional) How long to keep the model loaded. A number (seconds) or a string with a duration unit suffix ("300ms", "1.5h", "2h45m", etc.)

  - `tools` ``: (Optional) A list of tool calls the model may make.

  - `options` ``: (Optional) Options to configure the runtime.

- Returns: ``

### generate

```javascript

ollama.generate(request)

```

- `request` ``: The request object containing generate parameters.

  - `model` `` The name of the model to use for the chat.

  - `prompt` ``: The prompt to send to the model.

  - `suffix` ``: (Optional) Suffix is the text that comes after the inserted text.

  - `system` ``: (Optional) Override the model system prompt.

  - `template` ``: (Optional) Override the model template.

  - `raw` ``: (Optional) Bypass the prompt template and pass the prompt directly to the model.

  - `images` ``: (Optional) Images to be included, either as Uint8Array or base64 encoded strings.

  - `format` ``: (Optional) Set the expected format of the response (`json`).

  - `stream` ``: (Optional) When true an `AsyncGenerator` is returned.

  - `keep_alive` ``: (Optional) How long to keep the model loaded. A number (seconds) or a string with a duration unit suffix ("300ms", "1.5h", "2h45m", etc.)

  - `options` ``: (Optional) Options to configure the runtime.

- Returns: ``

### pull

```javascript

ollama.pull(request)

```

- `request` ``: The request object containing pull parameters.

  - `model` `` The name of the model to pull.

  - `insecure` ``: (Optional) Pull from servers whose identity cannot be verified.

  - `stream` ``: (Optional) When true an `AsyncGenerator` is returned.

- Returns: ``

### push

```javascript

ollama.push(request)

```

- `request` ``: The request object containing push parameters.

  - `model` `` The name of the model to push.

  - `insecure` ``: (Optional) Push to servers whose identity cannot be verified.

  - `stream` ``: (Optional) When true an `AsyncGenerator` is returned.

- Returns: ``

### create

```javascript

ollama.create(request)

```

- `request` ``: The request object containing create parameters.

  - `model` `` The name of the model to create.

  - `from` ``: The base model to derive from.

  - `stream` ``: (Optional) When true an `AsyncGenerator` is returned.

  - `quantize` ``: Quanization precision level (`q8_0`, `q4_K_M`, etc.).

  - `template` ``: (Optional) The prompt template to use with the model.

  - `license` ``: (Optional) The license(s) associated with the model.

  - `system` ``: (Optional) The system prompt for the model.

  - `parameters` `>`: (Optional) Additional model parameters as key-value pairs.

  - `messages` ``: (Optional) Initial chat messages for the model.

  - `adapters` `>`: (Optional) A key-value map of LoRA adapter configurations.

- Returns: ``

Note: The `files` parameter is not currently supported in `ollama-js`.

### delete

```javascript

ollama.delete(request)

```

- `request` ``: The request object containing delete parameters.

  - `model` `` The name of the model to delete.

- Returns: ``

### copy

```javascript

ollama.copy(request)

```

- `request` ``: The request object containing copy parameters.

  - `source` `` The name of the model to copy from.

  - `destination` `` The name of the model to copy to.

- Returns: ``

### list

```javascript

ollama.list()

```

- Returns: ``

### show

```javascript

ollama.show(request)

```

- `request` ``: The request object containing show parameters.

  - `model` `` The name of the model to show.

  - `system` ``: (Optional) Override the model system prompt returned.

  - `template` ``: (Optional) Override the model template returned.

  - `options` ``: (Optional) Options to configure the runtime.

- Returns: ``

### embed

```javascript

ollama.embed(request)

```

- `request` ``: The request object containing embedding parameters.

  - `model` `` The name of the model used to generate the embeddings.

  - `input` ` | `: The input used to generate the embeddings.

  - `truncate` ``: (Optional) Truncate the input to fit the maximum context length supported by the model.

  - `keep_alive` ``: (Optional) How long to keep the model loaded. A number (seconds) or a string with a duration unit suffix ("300ms", "1.5h", "2h45m", etc.)

  - `options` ``: (Optional) Options to configure the runtime.

- Returns: ``

### ps

```javascript

ollama.ps()

```

- Returns: ``

### abort

```javascript

ollama.abort()

```

This method will abort **all** streamed generations currently running with the client instance.

If there is a need to manage streams with timeouts, it is recommended to have one Ollama client per stream.

All asynchronous threads listening to streams (typically the ```for await (const part of response)```) will throw an ```AbortError``` exception. See [examples/abort/abort-all-requests.ts](examples/abort/abort-all-requests.ts) for an example.

## Custom client

A custom client can be created with the following fields:

- `host` ``: (Optional) The Ollama host address. Default: `"http://127.0.0.1:11434"`.

- `fetch` ``: (Optional) The fetch library used to make requests to the Ollama host.

```javascript

import { Ollama } from 'ollama'

const ollama = new Ollama({ host: 'http://127.0.0.1:11434' })

const response = await ollama.chat({

  model: 'llama3.1',

  messages: [{ role: 'user', content: 'Why is the sky blue?' }],

})

```

## Building

To build the project files run:

```sh

npm run build

```

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/ollama/ollama-js

Awesome Lists containing this project

README