https://github.com/googleapis/nodejs-vertexai

Last synced: 3 months ago
JSON representation
Host: GitHub
URL: https://github.com/googleapis/nodejs-vertexai
Owner: googleapis
License: apache-2.0
Created: 2023-11-17T13:35:34.000Z (over 1 year ago)
Default Branch: main
Last Pushed: 2024-08-26T22:48:22.000Z (11 months ago)
Last Synced: 2024-08-28T04:34:26.510Z (11 months ago)
Language: TypeScript
Size: 459 KB
Stars: 105
Watchers: 17
Forks: 35
Open Issues: 30
Metadata Files:
- Readme: README.md
- Changelog: CHANGELOG.md
- Contributing: CONTRIBUTING.md
- License: LICENSE
- Code of conduct: CODE_OF_CONDUCT.md
- Codeowners: .github/CODEOWNERS
- Security: SECURITY.md
Awesome Lists containing this project

README

        [![NPM Downloads](https://img.shields.io/npm/dm/%40google-cloud%2Fvertexai)](https://www.npmjs.com/package/@google-cloud/vertexai)

[![Node Current](https://img.shields.io/node/v/%40google-cloud%2Fvertexai)](https://www.npmjs.com/package/@google-cloud/vertexai)

> [!NOTE] A new Javascript/Typescript SDK, `@google/genai`

> ([github](https://github.com/googleapis/js-genai/tree/main)), is currently

> available in a *experimental preview launch* - designed to work with Gemini

> 2.0 features. and support both the Gemini API and the Vertex API.

# Vertex AI SDK for Node.js quickstart

The Vertex AI SDK for Node.js lets you use the Vertex AI Gemini API to build

AI-powered features and applications. Both TypeScript and JavaScript are supported.

The sample code in this document is written in JavaScript only.

For detailed samples using the Vertex AI Node.js SDK, see the

[samples repository](https://github.com/GoogleCloudPlatform/nodejs-docs-samples/tree/main/generative-ai/snippets)

on GitHub.

For the latest list of available Gemini models on Vertex AI, see the

[Model information](https://cloud.google.com/vertex-ai/docs/generative-ai/learn/models#gemini-models)

page in Vertex AI documentation.

## Before you begin

1.  Make sure your node.js version is 18 or above.

1.  [Select](https://console.cloud.google.com/project) or [create](https://cloud.google.com/resource-manager/docs/creating-managing-projects#creating_a_project) a Google Cloud project.

1.  [Enable billing for your project](https://cloud.google.com/billing/docs/how-to/modify-project).

1.  [Enable the Vertex AI API](https://console.cloud.google.com/flows/enableapi?apiid=aiplatform.googleapis.com).

1.  [Install the gcloud CLI](https://cloud.google.com/sdk/docs/install).

1.  [Initialize the gcloud CLI](https://cloud.google.com/sdk/docs/initializing).

1.  Create local authentication credentials for your user account:

    ```sh

    gcloud auth application-default login

    ```

A list of accepted authentication options are listed in [GoogleAuthOptions](https://github.com/googleapis/google-auth-library-nodejs/blob/3ae120d0a45c95e36c59c9ac8286483938781f30/src/auth/googleauth.ts#L87) interface of google-auth-library-node.js GitHub repo.

1.  Official documentation is available in the [Vertex AI SDK Overview](https://cloud.google.com/vertex-ai/generative-ai/docs/reference/nodejs/latest/overview) page. From here, a complete list of documentation on classes, interfaces, and enums are available.

## Install the SDK

Install the Vertex AI SDK for Node.js by running the following command:

```shell

npm install @google-cloud/vertexai

```

## Initialize the `VertexAI` class

To use the Vertex AI SDK for Node.js, create an instance of `VertexAI` by

passing it your Google Cloud project ID and location. Then create an instance of

the GenerativeModel class using the VertexAI class methods.

```javascript

const {

  FunctionDeclarationSchemaType,

  HarmBlockThreshold,

  HarmCategory,

  VertexAI

} = require('@google-cloud/vertexai');

const project = 'your-cloud-project';

const location = 'us-central1';

const textModel =  'gemini-1.5-flash';

const visionModel = 'gemini-1.5-flash';

const vertexAI = new VertexAI({project: project, location: location});

// Instantiate Gemini models

const generativeModel = vertexAI.getGenerativeModel({

    model: textModel,

    // The following parameters are optional

    // They can also be passed to individual content generation requests

    safetySettings: [{category: HarmCategory.HARM_CATEGORY_DANGEROUS_CONTENT, threshold: HarmBlockThreshold.BLOCK_MEDIUM_AND_ABOVE}],

    generationConfig: {maxOutputTokens: 256},

    systemInstruction: {

      role: 'system',

      parts: [{"text": `For example, you are a helpful customer service agent.`}]

    },

});

const generativeVisionModel = vertexAI.getGenerativeModel({

    model: visionModel,

});

const generativeModelPreview = vertexAI.preview.getGenerativeModel({

    model: textModel,

});

```

## Send text prompt requests

You can send text prompt requests by using `generateContentStream` for streamed

responses, or `generateContent` for nonstreamed responses.

### Get streamed text responses

The response is returned in chunks as it's being generated to reduce the

perception of latency to a human reader.

```typescript

async function streamGenerateContent() {

  const request = {

    contents: [{role: 'user', parts: [{text: 'How are you doing today?'}]}],

  };

  const streamingResult = await generativeModel.generateContentStream(request);

  for await (const item of streamingResult.stream) {

    console.log('stream chunk: ', JSON.stringify(item));

  }

  const aggregatedResponse = await streamingResult.response;

  console.log('aggregated response: ', JSON.stringify(aggregatedResponse));

};

streamGenerateContent();

```

### Get nonstreamed text responses

The response is returned all at once.

```javascript

async function generateContent() {

  const request = {

    contents: [{role: 'user', parts: [{text: 'How are you doing today?'}]}],

  };

  const result = await generativeModel.generateContent(request);

  const response = result.response;

  console.log('Response: ', JSON.stringify(response));

};

generateContent();

```

## Send multiturn chat requests

Chat requests use previous messages as context when responding to new prompts.

To send multiturn chat requests, use `sendMessageStream` for streamed responses,

or `sendMessage` for nonstreamed responses.

### Get streamed chat responses

The response is returned in chunks as it's being generated to reduce the

perception of latency to a human reader.

```javascript

async function streamChat() {

  const chat = generativeModel.startChat();

  const chatInput = "How can I learn more about Node.js?";

  const result = await chat.sendMessageStream(chatInput);

  for await (const item of result.stream) {

      console.log("Stream chunk: ", item.candidates[0].content.parts[0].text);

  }

  const aggregatedResponse = await result.response;

  console.log('Aggregated response: ', JSON.stringify(aggregatedResponse));

}

streamChat();

```

### Get nonstreamed chat responses

The response is returned all at once.

```javascript

async function sendChat() {

  const chat = generativeModel.startChat();

  const chatInput = "How can I learn more about Node.js?";

  const result = await chat.sendMessage(chatInput);

  const response = result.response;

  console.log('response: ', JSON.stringify(response));

}

sendChat();

```

## Include images or videos in your prompt request

Prompt requests can include either an image or video in addition to text.

For more information, see

[Send multimodal prompt requests](https://cloud.google.com/vertex-ai/generative-ai/docs/multimodal/send-multimodal-prompts)

in the Vertex AI documentation.

### Include an image

You can include images in the prompt either by specifying the Cloud Storage URI

where the image is located or by including a base64 encoding of the image.

#### Specify a Cloud Storage URI of the image

You can specify the Cloud Storage URI of the image in `fileUri`.

```javascript

async function multiPartContent() {

    const filePart = {fileData: {fileUri: "gs://generativeai-downloads/images/scones.jpg", mimeType: "image/jpeg"}};

    const textPart = {text: 'What is this picture about?'};

    const request = {

        contents: [{role: 'user', parts: [textPart, filePart]}],

      };

    const streamingResult = await generativeVisionModel.generateContentStream(request);

    for await (const item of streamingResult.stream) {

      console.log('stream chunk: ', JSON.stringify(item));

    }

    const aggregatedResponse = await streamingResult.response;

    console.log(aggregatedResponse.candidates[0].content);

}

multiPartContent();

```

#### Specify a base64 image encoding string

You can specify the base64 image encoding string in `data`.

```javascript

async function multiPartContentImageString() {

    // Replace this with your own base64 image string

    const base64Image = 'iVBORw0KGgoAAAANSUhEUgAAAAEAAAABCAYAAAAfFcSJAAAADUlEQVR42mP8z8BQDwAEhQGAhKmMIQAAAABJRU5ErkJggg==';

    const filePart = {inline_data: {data: base64Image, mimeType: 'image/jpeg'}};

    const textPart = {text: 'What is this picture about?'};

    const request = {

        contents: [{role: 'user', parts: [textPart, filePart]}],

      };

    const streamingResult = await generativeVisionModel.generateContentStream(request);

    const contentResponse = await streamingResult.response;

    console.log(contentResponse.candidates[0].content.parts[0].text);

}

multiPartContentImageString();

```

### Include a video

You can include videos in the prompt by specifying the Cloud Storage URI

where the video is located in `fileUri`.

```javascript

async function multiPartContentVideo() {

    const filePart = {fileData: {fileUri: 'gs://cloud-samples-data/video/animals.mp4', mimeType: 'video/mp4'}};

    const textPart = {text: 'What is in the video?'};

    const request = {

        contents: [{role: 'user', parts: [textPart, filePart]}],

      };

    const streamingResult = await generativeVisionModel.generateContentStream(request);

    for await (const item of streamingResult.stream) {

      console.log('stream chunk: ', JSON.stringify(item));

    }

    const aggregatedResponse = await streamingResult.response;

    console.log(aggregatedResponse.candidates[0].content);

}

multiPartContentVideo();

```

## Function calling

The Vertex AI SDK for Node.js supports

[function calling](https://cloud.google.com/vertex-ai/docs/generative-ai/multimodal/function-calling)

in the `sendMessage`, `sendMessageStream`, `generateContent`, and

`generateContentStream` methods. We recommend using it through the chat methods

(`sendMessage` or `sendMessageStream`) but have included examples of both

approaches below.

### Declare a function

The following examples show you how to declare a function.

```javascript

const functionDeclarations = [

  {

    functionDeclarations: [

      {

        name: "get_current_weather",

        description: 'get weather in a given location',

        parameters: {

          type: FunctionDeclarationSchemaType.OBJECT,

          properties: {

            location: {type: FunctionDeclarationSchemaType.STRING},

            unit: {

              type: FunctionDeclarationSchemaType.STRING,

              enum: ['celsius', 'fahrenheit'],

            },

          },

          required: ['location'],

        },

      },

    ],

  },

];

const functionResponseParts = [

  {

    functionResponse: {

      name: "get_current_weather",

      response:

          {name: "get_current_weather", content: {weather: "super nice"}},

    },

  },

];

```

### Function calling using `sendMessageStream`

After the function is declared, you can pass it to the model in the

`tools` parameter of the prompt request.

```javascript

async function functionCallingChat() {

  // Create a chat session and pass your function declarations

  const chat = generativeModel.startChat({

    tools: functionDeclarations,

  });

  const chatInput1 = 'What is the weather in Boston?';

  // This should include a functionCall response from the model

  const streamingResult1 = await chat.sendMessageStream(chatInput1);

  for await (const item of streamingResult1.stream) {

    console.log(item.candidates[0]);

  }

  const response1 = await streamingResult1.response;

  console.log("first aggregated response: ", JSON.stringify(response1));

  // Send a follow up message with a FunctionResponse

  const streamingResult2 = await chat.sendMessageStream(functionResponseParts);

  for await (const item of streamingResult2.stream) {

    console.log(item.candidates[0]);

  }

  // This should include a text response from the model using the response content

  // provided above

  const response2 = await streamingResult2.response;

  console.log("second aggregated response: ", JSON.stringify(response2));

}

functionCallingChat();

```

### Function calling using `generateContentStream`

```javascript

async function functionCallingGenerateContentStream() {

  const request = {

    contents: [

      {role: 'user', parts: [{text: 'What is the weather in Boston?'}]},

      {role: 'model', parts: [{functionCall: {name: 'get_current_weather', args: {'location': 'Boston'}}}]},

      {role: 'user', parts: functionResponseParts}

    ],

    tools: functionDeclarations,

  };

  const streamingResult =

      await generativeModel.generateContentStream(request);

  for await (const item of streamingResult.stream) {

    console.log(item.candidates[0]);

  }

}

functionCallingGenerateContentStream();

```

## Counting tokens

```javascript

async function countTokens() {

  const request = {

      contents: [{role: 'user', parts: [{text: 'How are you doing today?'}]}],

    };

  const response = await generativeModel.countTokens(request);

  console.log('count tokens response: ', JSON.stringify(response));

}

countTokens();

```

## Grounding (Preview)

Grounding is preview only feature.

Grounding lets you connect model output to verifiable sources of information to

reduce hallucination. You can specify Google Search or Vertex AI search as the

data source for grounding.

### Grounding using Google Search (Preview)

```javascript

async function generateContentWithGoogleSearchGrounding() {

  const generativeModelPreview = vertexAI.preview.getGenerativeModel({

    model: textModel,

    // The following parameters are optional

    // They can also be passed to individual content generation requests

    safetySettings: [{category: HarmCategory.HARM_CATEGORY_DANGEROUS_CONTENT, threshold: HarmBlockThreshold.BLOCK_MEDIUM_AND_ABOVE}],

    generationConfig: {maxOutputTokens: 256},

  });

  const googleSearchRetrievalTool = {

    googleSearchRetrieval: {

      disableAttribution: false,

    },

  };

  const result = await generativeModelPreview.generateContent({

    contents: [{role: 'user', parts: [{text: 'Why is the sky blue?'}]}],

    tools: [googleSearchRetrievalTool],

  })

  const response = result.response;

  const groundingMetadata = response.candidates[0].groundingMetadata;

  console.log("GroundingMetadata is: ", JSON.stringify(groundingMetadata));

}

generateContentWithGoogleSearchGrounding();

```

### Grounding using Vertex AI Search (Preview)

```javascript

async function generateContentWithVertexAISearchGrounding() {

  const generativeModelPreview = vertexAI.preview.getGenerativeModel({

    model: textModel,

    // The following parameters are optional

    // They can also be passed to individual content generation requests

    safetySettings: [{category: HarmCategory.HARM_CATEGORY_DANGEROUS_CONTENT, threshold: HarmBlockThreshold.BLOCK_MEDIUM_AND_ABOVE}],

    generationConfig: {maxOutputTokens: 256},

  });

  const vertexAIRetrievalTool = {

    retrieval: {

      vertexAiSearch: {

        datastore: 'projects/.../locations/.../collections/.../dataStores/...',

      },

      disableAttribution: false,

    },

  };

  const result = await generativeModelPreview.generateContent({

    contents: [{role: 'user', parts: [{text: 'Why is the sky blue?'}]}],

    tools: [vertexAIRetrievalTool],

  })

  const response = result.response;

  const groundingMetadata = response.candidates[0].groundingMetadata;

  console.log("Grounding metadata is: ", JSON.stringify(groundingMetadata));

}

generateContentWithVertexAISearchGrounding();

```

## System Instruction

You can include an optional system instruction when instantiating a generative model to provide additional context to the model.

The system instruction can also be passed to individual text prompt requests.

### Include system instruction in generative model instantiation

```javascript

const generativeModel = vertexAI.getGenerativeModel({

    model: textModel,

    // The following parameter is optional.

    systemInstruction: {

      role: 'system',

      parts: [{"text": `For example, you are a helpful customer service agent.`}]

    },

});

```

### Include system instruction in text prompt request

```javascript

async function generateContent() {

  const request = {

    contents: [{role: 'user', parts: [{text: 'How are you doing today?'}]}],

    systemInstruction: { role: 'system', parts: [{ text: `For example, you are a helpful customer service agent.` }] },

  };

  const result = await generativeModel.generateContent(request);

  const response = result.response;

  console.log('Response: ', JSON.stringify(response));

};

generateContent();

```

## FAQ

### What if I want to specify authentication options instead of using default options?

**Step1**: Find a list of accepted authentication options in [GoogleAuthOptions](https://github.com/googleapis/google-auth-library-nodejs/blob/3ae120d0a45c95e36c59c9ac8286483938781f30/src/auth/googleauth.ts#L87) interface of google-auth-library-node.js GitHub repo.

**Step2:** Instantiate the `VertexAI` class by passing in the `GoogleAuthOptions` interface as follows:

```javascript

const { VertexAI } = require('@google-cloud/vertexai');

const { GoogleAuthOptions } = require('google-auth-library');

const vertexAI = new VertexAI(

  {

    googleAuthOptions: {

      // your GoogleAuthOptions interface

    }

  }

)

```

## License

The contents of this repository are licensed under the

[Apache License, version 2.0](http://www.apache.org/licenses/LICENSE-2.0).
ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/googleapis/nodejs-vertexai

Awesome Lists containing this project

README