# Azure SQL Samples Search with AI

This sample demonstrates how to use Azure SQL to store and search data with AI. The sample uses the RAG pattern, implemented with Azure SQL, to search for the most relevant code and end-to-end samples based on the user query. The results are then generated by AI to provide the most relevant documents to the user, along with a description of why each result is relevant.

The live website is available here: https://ai.awesome.azuresql.dev/

## Architecture

The architecture of the sample is as follows:

[![Architecture](./_assets/ai-samples-search-azure-sql.png)](./_assets/ai-samples-search-azure-sql.png)

1. **Azure SQL database** stores the samples data and the related embeddings.
- The **embeddings** are generated when a sample is added to or updated in the database, using an embedding model hosted in Azure OpenAI (see the sketch after this list).
- The **RAG pattern** is fully implemented in the database, using the GPT-4o model to generate the results in a well-defined **structured** format, so that results can be easily joined back with the samples data.

2. **Data API builder** is used to expose the stored procedures for searching and adding samples to the database. The API is used by the web application to interact with the database.

3. **React** front-end application is used to search for samples and display the results.
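
As a minimal sketch of that embedding step (the table, column, and procedure names here are hypothetical, and the endpoint URL is a placeholder; the actual objects deployed by this sample may differ), the call to Azure OpenAI can be made directly from T-SQL:

```sql
-- Hypothetical sketch: generate and store the embedding for one sample.
-- Assumes a [dbo].[samples] table with an [embedding] VECTOR(1536) column,
-- and a database scoped credential named after the (placeholder)
-- Azure OpenAI endpoint URL.
CREATE OR ALTER PROCEDURE [dbo].[update_sample_embedding]
    @sample_id INT
AS
BEGIN
    DECLARE @text NVARCHAR(MAX), @payload NVARCHAR(MAX), @response NVARCHAR(MAX);

    SELECT @text = [description] FROM [dbo].[samples] WHERE [id] = @sample_id;

    SET @payload = JSON_OBJECT('input': @text);

    -- Call the embedding model deployed in Azure OpenAI
    EXEC sp_invoke_external_rest_endpoint
        @url = 'https://my-open-ai.openai.azure.com/openai/deployments/text-embedding-3-small/embeddings?api-version=2023-05-15',
        @method = 'POST',
        @credential = [https://my-open-ai.openai.azure.com/],
        @payload = @payload,
        @response = @response OUTPUT;

    -- Extract the returned vector and store it next to the sample
    UPDATE [dbo].[samples]
    SET [embedding] = CAST(JSON_QUERY(@response, '$.result.data[0].embedding') AS VECTOR(1536))
    WHERE [id] = @sample_id;
END
```

Keeping this logic in the database means the embedding cannot drift out of sync with the sample text it describes.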

## (Hybrid) RAG Pattern

In order to answer questions that could have either a precise answer ("return the last 5 samples") or a semantic one ("return samples using an insurance use case"), the usual RAG pattern is not enough, as it falls short for precise answers. To have both precise and semantic answers, we implemented a **hybrid RAG pattern** that uses two different approaches to generate the results.

The first step is to use an LLM to understand whether the question can be answered using the available database schema and just the SQL (or any other) language. If the answer is precise, then a SQL query is generated and executed against the database. If the question cannot be answered using just the database schema and SQL, then the RAG pattern is used to find the semantically closest results.

In either case, the output is then sent back to the LLM to generate the final answer. The answer is requested as a well-defined JSON object, so it can be easily parsed, turned into a table, and joined back with the data in the database to provide the final, structured, complete answer to the user.

I'm calling this process **Hybrid RAG** as it uses both the database schema and the RAG pattern to generate the results.

[![Hybrid RAG](./_assets/hybrid-rag.png)](./_assets/hybrid-rag.png)

The hybrid RAG pattern is implemented in the database, using a stored procedure that is called by Data API builder. The stored procedure generates the SQL query and the RAG query, executes them and then generates the final answer using the LLM.
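
To make the flow concrete, here is an illustrative T-SQL sketch of that routing logic (again with hypothetical names, simplified to plain JSON mode instead of a full JSON schema, and with the final answer-generation step omitted):

```sql
-- Hypothetical sketch of the routing step: classify the question, then
-- either run generated SQL (precise) or do a vector search (semantic).
-- Table, procedure, and endpoint names are placeholders, and a real
-- implementation should validate any generated SQL before executing it.
CREATE OR ALTER PROCEDURE [dbo].[hybrid_search]
    @question NVARCHAR(1000)
AS
BEGIN
    DECLARE @payload NVARCHAR(MAX), @response NVARCHAR(MAX);

    -- Step 1: ask the chat model to classify the question and, when it
    -- is precise, to also return the T-SQL query that answers it.
    SET @payload = JSON_OBJECT(
        'messages': JSON_ARRAY(
            JSON_OBJECT('role': 'system', 'content':
                'Given the table dbo.samples(id, name, description, created_on), ' +
                'reply with a JSON object {"mode": "precise" or "semantic", "query": "<t-sql or empty>"}.'),
            JSON_OBJECT('role': 'user', 'content': @question)
        ),
        'response_format': JSON_OBJECT('type': 'json_object')
    );

    EXEC sp_invoke_external_rest_endpoint
        @url = 'https://my-open-ai.openai.azure.com/openai/deployments/gpt-4o/chat/completions?api-version=2024-08-01-preview',
        @method = 'POST',
        @credential = [https://my-open-ai.openai.azure.com/],
        @payload = @payload,
        @response = @response OUTPUT;

    DECLARE @answer NVARCHAR(MAX) = JSON_VALUE(@response, '$.result.choices[0].message.content');

    IF JSON_VALUE(@answer, '$.mode') = 'precise'
    BEGIN
        -- Step 2a: precise question -> run the generated SQL directly.
        DECLARE @sql NVARCHAR(MAX) = JSON_VALUE(@answer, '$.query');
        EXEC sp_executesql @sql;
    END
    ELSE
    BEGIN
        -- Step 2b: semantic question -> classic RAG via vector search.
        SET @payload = JSON_OBJECT('input': @question);
        EXEC sp_invoke_external_rest_endpoint
            @url = 'https://my-open-ai.openai.azure.com/openai/deployments/text-embedding-3-small/embeddings?api-version=2023-05-15',
            @method = 'POST',
            @credential = [https://my-open-ai.openai.azure.com/],
            @payload = @payload,
            @response = @response OUTPUT;

        DECLARE @qv VECTOR(1536) = CAST(JSON_QUERY(@response, '$.result.data[0].embedding') AS VECTOR(1536));

        SELECT TOP (10) [id], [name], [description],
               VECTOR_DISTANCE('cosine', [embedding], @qv) AS distance
        FROM [dbo].[samples]
        ORDER BY distance ASC;
    END

    -- Step 3 (omitted here): send the selected rows back to the chat
    -- model, asking for a structured JSON answer that is then joined
    -- back with the samples data.
END
```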

Clean, easy and efficient.

## Azure Services

The whole architecture is built using the following Azure services:

- [Azure SQL Database](https://learn.microsoft.com/en-us/azure/azure-sql/database/sql-database-paas-overview?view=azuresql): to store and query the samples data and embeddings.
- [Azure Static Web Apps](https://learn.microsoft.com/en-us/azure/static-web-apps/overview): to host the full-stack application, including the front-end and the [backend API for Azure SQL](https://learn.microsoft.com/en-us/azure/static-web-apps/database-overview).
- [Azure OpenAI](https://learn.microsoft.com/azure/ai-services/openai/): to generate the embeddings for the samples data and to generate the results using the RAG pattern.

## Solution

### Pre-requisites

Local development is possible, but you still need an Azure OpenAI subscription to generate the embeddings and the results.
All you need to run the sample locally is:

- [Node](https://nodejs.org/en) - to run the front-end application.
- [DotNet SDK 8](https://dotnet.microsoft.com/en-us/download/dotnet/8.0) or later - to run Data API builder and deploy the database.
- [Static Web Apps CLI](https://learn.microsoft.com/en-us/azure/static-web-apps/static-web-apps-cli-overview) - to run the full-stack application locally.
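
For example, the Static Web Apps CLI can be installed globally via npm:

```bash
npm install -g @azure/static-web-apps-cli
```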

### Azure Open AI

Make sure to have two models deployed: one for generating embeddings (the *text-embedding-3-small* model is recommended) and one for handling the chat. The chat model **must** support structured outputs: *gpt-4o* version *2024-08-06* is recommended. You can use the Azure OpenAI service to deploy the models; make sure to have the endpoint and the API key ready. The two models are assumed to be deployed with the following names:

- Embedding model: `text-embedding-3-small`
- Chat model: `gpt-4o`, version `2024-08-06`
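
If you prefer the command line over the Azure portal, the chat model can be deployed with the Azure CLI, assuming a hypothetical Azure OpenAI resource `my-open-ai` in resource group `my-rg` (repeat with the embedding model's name and version for the second deployment):

```bash
# Deploy the gpt-4o chat model (resource names are placeholders)
az cognitiveservices account deployment create \
  --name my-open-ai \
  --resource-group my-rg \
  --deployment-name gpt-4o \
  --model-name gpt-4o \
  --model-version "2024-08-06" \
  --model-format OpenAI \
  --sku-name "Standard" \
  --sku-capacity 10
```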

### Configure environment

Create a `.env` file starting from the `.env.sample` file:

- `OPENAI_URL`: specify the URL of your Azure OpenAI endpoint, e.g. 'https://my-open-ai.openai.azure.com/'
- `OPENAI_KEY`: specify the API key of your Azure OpenAI endpoint
- `OPENAI_EMBEDDING_DEPLOYMENT_NAME`: specify the deployment name of your Azure OpenAI embedding model, e.g. 'text-embedding-3-small'
- `OPENAI_CHAT_DEPLOYMENT_NAME`: specify the deployment name of your Azure OpenAI chat model, e.g. 'gpt-4o'
- `MSSQL`: the connection string to the Azure SQL database where you want to deploy the database objects and sample data
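
A filled-in `.env` file would look something like this (all values are placeholders):

```text
OPENAI_URL='https://my-open-ai.openai.azure.com/'
OPENAI_KEY='<your-api-key>'
OPENAI_EMBEDDING_DEPLOYMENT_NAME='text-embedding-3-small'
OPENAI_CHAT_DEPLOYMENT_NAME='gpt-4o'
MSSQL='Server=<your-server>.database.windows.net;Database=<your-database>;Authentication=Active Directory Default;'
```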

### Database

Since the database uses the new `vector` data type, you need to use Azure SQL (you can use the [Free offer](https://learn.microsoft.com/en-us/azure/azure-sql/database/free-offer?view=azuresql)), the just-announced [SQL database in Microsoft Fabric (Public Preview)](https://aka.ms/announcingsqlfabric), or [SQL Server 2025 (EAP)](https://aka.ms/sqleapsignup).

> [!NOTE]
> Vector Functions are in Public Preview. Learn the details about vectors in Azure SQL here: https://aka.ms/azure-sql-vector-public-preview
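
To give an idea of what the new data type looks like, a table holding samples and their embeddings could be declared like this (an illustrative sketch; the schema actually deployed by this sample may differ):

```sql
-- The vector type stores a fixed-dimension array of floats;
-- text-embedding-3-small returns 1536 dimensions by default.
CREATE TABLE [dbo].[samples]
(
    [id] INT IDENTITY PRIMARY KEY,
    [name] NVARCHAR(100) NOT NULL,
    [description] NVARCHAR(MAX) NOT NULL,
    [embedding] VECTOR(1536) NULL
);
```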

To deploy the database, make sure you have created the `.env` file as explained in the previous section, and then run the following command:

```bash
dotnet run --project ./db
```

That will connect to Azure SQL and deploy the needed database objects and some sample data.

### Application

Once the database has been deployed, you can run the full-stack application locally. Install the Node dependencies by running the following command from the root folder:

```bash
cd client && npm install && cd ..
```

Then, you can run the application locally using the Static Web Apps CLI. Make sure you have the `.env` file created as explained in the previous section, and then run the following command:

```bash
swa start
```

Once the application is running, you can access it at `http://localhost:4280`.

Data API builder may take a few seconds to start if you are using Azure SQL and Microsoft Entra ID authentication. You can check whether Data API builder is running by accessing `http://localhost:4280/data-api/rest/countSamples`.
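
For example, from another terminal:

```bash
curl http://localhost:4280/data-api/rest/countSamples
```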

### Azure deployment

The easiest way to deploy the full-stack application is to fork the repository and then create a new Static Web App pointing to the forked repository. Once the Static Web App is deployed, configure the [Database Connection](https://learn.microsoft.com/en-us/azure/static-web-apps/database-overview) feature of Static Web Apps to connect to the Azure SQL database where you have deployed the database objects and the sample data.

Make sure you grant the user that you are using to connect to the database permission to run the needed stored procedures:

```sql
GRANT EXECUTE ON SCHEMA::[web] TO [<database-user>];
GRANT EXECUTE ANY EXTERNAL ENDPOINT TO [<database-user>];
GRANT REFERENCES ON DATABASE SCOPED CREDENTIAL::[<credential-name>] TO [<database-user>];
```

Replace `<database-user>` with the user you are using to connect to the database, and `<credential-name>` with the name of the database scoped credential used to connect to Azure OpenAI, which is the same as the value you used for the `OPENAI_URL` variable in the `.env` file.
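
If you have not created the database scoped credential yet, it can be created with something like the following sketch (a database master key must already exist; URL and key are placeholders):

```sql
-- The credential is named after the Azure OpenAI endpoint URL, as
-- sp_invoke_external_rest_endpoint matches credentials by URL.
CREATE DATABASE SCOPED CREDENTIAL [https://my-open-ai.openai.azure.com/]
WITH
    IDENTITY = 'HTTPEndpointHeaders',
    SECRET = '{"api-key": "<your-api-key>"}';
```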

That's it! You can now access the full-stack application deployed in Azure Static Web Apps.

## Sample Data

You can add some data to play with using the `sample-data.sql` file in the `db/sample` folder.