https://github.com/hariprasath-v/workshop_open_source_data_ingestion_rags_dlt

Data ingestion pipeline using dlt to load data from a REST API into LanceDB.

colab-notebook dlt lancedb llama2 ollama rest-api sentence-transformers

# Workshop_Open_Source_Data_Ingestion_RAGs_dlt

Build a data ingestion pipeline using dlt to load data from a REST API into LanceDB.

### Covered topics

* Extracting data from REST APIs
* Loading and vectorizing data into LanceDB, which, unlike many vector databases, stores the original data alongside its embeddings
* Loading data incrementally, so that each run picks up only new or changed records
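
The incremental loading topic can be illustrated with a minimal sketch in plain Python. It does not use dlt or LanceDB and is not the workshop's code: the `FAKE_API_RECORDS` list, the `fetch_records` and `load_incrementally` helpers, and the `updated_at` cursor field are all hypothetical stand-ins for a real REST API and a real destination. dlt offers the same idea through `dlt.sources.incremental`, which tracks a cursor value between pipeline runs.

```python
from typing import Iterable, Iterator

# Hypothetical stand-in for the records a REST API would return.
FAKE_API_RECORDS = [
    {"id": 1, "text": "first doc", "updated_at": "2024-01-01T00:00:00Z"},
    {"id": 2, "text": "second doc", "updated_at": "2024-01-02T00:00:00Z"},
    {"id": 3, "text": "third doc", "updated_at": "2024-01-03T00:00:00Z"},
]


def fetch_records() -> Iterator[dict]:
    """Yield every record the (fake) API currently exposes."""
    yield from FAKE_API_RECORDS


def load_incrementally(
    records: Iterable[dict], state: dict, cursor_field: str = "updated_at"
) -> list:
    """Return only records newer than the cursor stored in `state`.

    ISO 8601 timestamps in one fixed format sort correctly as strings,
    so a plain string comparison is enough for this sketch.
    """
    last_seen = state.get("last_value")
    new_rows = [
        r for r in records if last_seen is None or r[cursor_field] > last_seen
    ]
    if new_rows:
        # Advance the cursor so the next run skips these rows.
        state["last_value"] = max(r[cursor_field] for r in new_rows)
    return new_rows


state = {}  # persisted between runs in a real pipeline

first_run = load_incrementally(fetch_records(), state)   # loads everything
second_run = load_incrementally(fetch_records(), state)  # nothing new

# A new record appears upstream; only it is picked up on the next run.
FAKE_API_RECORDS.append(
    {"id": 4, "text": "fourth doc", "updated_at": "2024-01-04T00:00:00Z"}
)
third_run = load_incrementally(fetch_records(), state)
```

In this sketch the first run returns all three records, the second returns none, and the third returns only the newly added one. A real pipeline would persist the cursor outside the process and write the new rows to the destination instead of returning them.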