🔀 Bedrock Proxy Endpoint ⇢ Spin up your own custom OpenAI API server endpoint for easy AWS Bedrock inference (using standard `baseUrl` and `apiKey` params)
- Host: GitHub
- URL: https://github.com/jparkerweb/bedrock-proxy-endpoint
- Owner: jparkerweb
- License: mit
- Created: 2024-05-01T13:00:04.000Z (over 1 year ago)
- Default Branch: main
- Last Pushed: 2025-03-29T16:05:17.000Z (6 months ago)
- Last Synced: 2025-05-01T09:57:15.135Z (5 months ago)
- Topics: api, aws-bedrock, endpoint, inference, inference-api, llama3, llm, mistral, mixtral, openai-api, proxy, serverless, wrapper
- Language: JavaScript
- Homepage:
- Size: 495 KB
- Stars: 9
- Watchers: 1
- Forks: 5
- Open Issues: 0
Metadata Files:
- Readme: README.md
- Changelog: CHANGELOG.md
- Funding: .github/FUNDING.yml
- License: LICENSE
README
# 🔀 Bedrock Proxy Endpoint
Spin up your own custom OpenAI API server endpoint for easy AWS Bedrock LLM text inference (using standard `baseUrl` and `apiKey` params)
---
### Why Bedrock Proxy Endpoint?
Are you stuck using AWS Bedrock for all LLM text inference, but want to keep your application platform agnostic? Are you tired of figuring out how to format your LLM inference calls to work with the Bedrock SDK? Are you going crazy with all the configuration differences from model to model?
- `Bedrock Proxy Endpoint` makes it easy to keep using the OpenAI API client you are used to, by standing up your own OpenAI-compatible text endpoint that proxies all your calls to Bedrock.
- Great for getting existing OpenAI API compatible applications working with AWS Bedrock.
### Prerequisites
Before getting started, make sure you have the following installed:
- Node.js (version 12 or higher)
- npm (Node Package Manager)

### Installation
1. Clone the repository:
```bash
git clone https://github.com/jparkerweb/bedrock-proxy-endpoint.git
```
2. Navigate to the project directory:
```bash
cd bedrock-proxy-endpoint
```
3. Install the dependencies:
```bash
npm ci
```

### Configuration
* Update the `.env` file in the root directory of the project with the following environment variables based on your desired configuration:

| key | value type | example | notes |
|----------------------------|------------|---------------------------|--------------------------------|
| CONSOLE_LOGGING | boolean | false | Show realtime logs |
| HTTP_ENABLED | boolean | true | Start a HTTP server |
| HTTP_PORT | integer | 80 | HTTP server port |
| MAX_REQUEST_BODY_SIZE | string | 50mb | Maximum size for request body |
| HTTPS_ENABLED | boolean | false | Start a HTTPS server |
| HTTPS_PORT | integer | 443 | HTTPS server port |
| HTTPS_KEY_PATH | string | ./path/mykey.key | Path to key file for HTTPS |
| HTTPS_CERT_PATH | string | ./path/mycert.pem | Path to cert file for HTTPS |
| IP_RATE_LIMIT_ENABLED | boolean | true | Enable rate limiting by IP |
| IP_RATE_LIMIT_WINDOW_MS | integer | 60000 | Window in milliseconds |
| IP_RATE_LIMIT_MAX_REQUESTS | integer | 100 | Max requests per IP per window |
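For example, a minimal `.env` for the default plain-HTTP setup might look like this (illustrative values only, mirroring the table above):

```bash
# example .env (illustrative values)
CONSOLE_LOGGING=false
HTTP_ENABLED=true
HTTP_PORT=80
MAX_REQUEST_BODY_SIZE=50mb
HTTPS_ENABLED=false
IP_RATE_LIMIT_ENABLED=true
IP_RATE_LIMIT_WINDOW_MS=60000
IP_RATE_LIMIT_MAX_REQUESTS=100
```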
### Authentication
`Bedrock Proxy` authenticates with AWS via `IAM`. Since the OpenAI API client instance accepts an API Key, we will use this value to hold your credentials. Construct your `apiKey` for inference in the next step following this format:
- `${AWS_REGION}.${AWS_ACCESS_KEY_ID}.${AWS_SECRET_ACCESS_KEY}`
- example `apiKey` value:
`us-west-2.AKIAWSXXXXXXXXXXX.YYYYYYYYYYYYYYYYYYYYYYYYY`
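If your AWS credentials already live in the standard environment variables, a small sketch (not from the README) for assembling the key:

```javascript
// Sketch: build the proxy apiKey from standard AWS environment variables.
// The proxy unpacks this single value to authenticate with IAM.
const apiKey = [
    process.env.AWS_REGION,            // e.g. "us-west-2"
    process.env.AWS_ACCESS_KEY_ID,     // e.g. "AKIAWS..."
    process.env.AWS_SECRET_ACCESS_KEY,
].join(".");
```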
### Usage
- Start the server via: `node server`
You are now ready to make standard chat completion calls to the server.
- Important values
- `baseUrl`: Root address of server based on your `.env` configuration.
- `apiKey`: Described in the *Authentication* section above.
- `messages`: Array of objects in role / content format.
- `model`: This is the `modelName` from the list of supported models found in the `Bedrock Wrapper` README file [here](https://github.com/jparkerweb/bedrock-wrapper?tab=readme-ov-file#supported-models); the `/models` endpoint of this server will also return a list of supported models (see the sketch after this list).
- `include_thinking_data`: _Optional_ boolean parameter that, when set to `true`, includes the model's thinking process in the response (only used with thinking models such as `Claude-3-7-Sonnet-Thinking`).
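To see which models your running server supports, a quick sketch (assumes Node 18+ for the built-in `fetch`, and the default `http://localhost` base URL):

```javascript
// Sketch: list the supported models from the proxy's /models endpoint.
fetch("http://localhost/models")
    .then((res) => res.json())
    .then((models) => console.log(models))
    .catch((err) => console.error(err));
```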
### Example OpenAI API Call
Look at the example folder for complete examples of how to use the server:
- `example.js` - Basic text completion example
- `example-vision.js` - Vision model example with image processing (image can be passed as a base64 string or a URL)

```javascript
import OpenAI from 'openai';

const messages = [
    {
        role: "system",
        content: "You are a helpful AI assistant that follows instructions extremely well. Answer the user questions accurately.",
    },
    {
        role: "user",
        content: "Describe why the OpenAI API standard is so great. Limit your response to five sentences.",
    },
    {
        role: "assistant",
        content: "",
    },
];

const baseURL = "http://localhost"; // URL of the Bedrock Proxy Endpoint
const apiKey = `${AWS_REGION}.${AWS_ACCESS_KEY_ID}.${AWS_SECRET_ACCESS_KEY}`; // Your AWS Creds / API Key

const openai = new OpenAI({
    baseURL: baseURL,
    apiKey: apiKey,
});

async function main() {
    try {
        const chatCompletion = await openai.chat.completions.create({
            messages: messages,
            model: "Claude-3-7-Sonnet-Thinking",
            max_tokens: 2048,
            temperature: 0.4,
            top_p: 0.7,
            stream: true,
            include_thinking_data: true, // Enable to include the model's thinking process in the response
        });

        if (chatCompletion) {
            for await (const chunk of chatCompletion) {
                const response = chunk.choices[0]?.delta?.content || "";
                process.stdout.write(response);
            }
        }
    } catch (error) {
        console.error('Error:', error);
    } finally {
        process.exit(0);
    }
}

main();
```

### Root Info Page
Point your browser to the root of your endpoint server to view the info page (example: `http://localhost`).
### Note
Alternatively, you can incorporate the 🪨 `Bedrock Wrapper` core directly into your code base. If you would like to explore that option, check out the npm package here: https://www.npmjs.com/package/bedrock-wrapper
---
Please consider sending me a tip to support my work 😀
# [🍵 tip me here](https://ko-fi.com/jparkerweb)