An open API service indexing awesome lists of open source software.

https://github.com/truefoundry/tfy-gateway-nemotron-example

This project demonstrates how to configure nvidia nemotron with Truefoundry's AI Gateway
https://github.com/truefoundry/tfy-gateway-nemotron-example

Last synced: 3 months ago
JSON representation

This project demonstrates how to configure nvidia nemotron with Truefoundry's AI Gateway

Awesome Lists containing this project

README

          

# Nemotron 3 on TrueFoundry AI Gateway

[Nemotron 3 Super](https://research.nvidia.com/labs/nemotron/Nemotron-3/) is NVIDIA's latest open model โ€” 120B parameters, 12B active at inference via a hybrid Mamba-Transformer MoE. 5x higher throughput than the previous generation.

๐Ÿš€ [Sign up for TrueFoundry](https://www.truefoundry.com/register) ย ยทย  ๐Ÿ”‘ [Get NVIDIA API key](https://build.nvidia.com)

---

## Option 1: UI

Add Nemotron to the gateway directly from the TrueFoundry dashboard โ€” no code needed.

๐Ÿ“บ [Watch the walkthrough](https://www.loom.com/share/5342a8be7e62444f810dcb9e2144c0c7)

---

## Option 2: CLI

**1. Store your NVIDIA API key as a TFY secret**

Dashboard โ†’ Secrets โ†’ create a group โ†’ add `NVIDIA-API-KEY = nvapi-...` โ€” [docs](https://docs.truefoundry.com/docs/manage-secrets)

Update `bearer_token` in `gateway.yaml` with your secret reference.

**2. Apply**

```bash
export TFY_HOST=https://.truefoundry.cloud
export TFY_API_KEY=

tfy apply -f gateway.yaml
```

**3. Call the model**

Set `TFY_BASE_URL` and `TFY_API_KEY` in `example.py` and run it.

---

## Option 3: Claude + TrueFoundry Skills

Install [TrueFoundry Agent Skills](https://github.com/truefoundry/tfy-agent-skills), set these env vars:

```bash
TFY_HOST=https://.truefoundry.cloud
TFY_BASE_URL=https://.truefoundry.cloud
TFY_API_KEY=
```

Then ask Claude:

> "Deploy Nemotron 3 thinking model to TrueFoundry"