Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/narius2030/datalake-solution-imcp

This project involved the development and implementation of a Data Lake architecture to support an AI model capable of generating image captions. The architecture was designed to efficiently ingest, process, and centralized store large volumes of image and text data.
https://github.com/narius2030/datalake-solution-imcp

data-lake docker-container etl-pipeline fastapi medallion-architecture mlops nosql-database object-storage

Last synced: about 1 month ago
JSON representation

This project involved the development and implementation of a Data Lake architecture to support an AI model capable of generating image captions. The architecture was designed to efficiently ingest, process, and centralized store large volumes of image and text data.

Awesome Lists containing this project

README

        

## Overal Architecture
![image](https://github.com/user-attachments/assets/e7fc0152-fe7c-4c00-8d83-c268d4fee4a9)

## Detailed Architecture
![image](https://github.com/user-attachments/assets/13726b7e-6c91-4453-a291-1dda31684cd1)

## Overal Data Pipeline
![image](https://github.com/user-attachments/assets/0f0e0040-8681-4b8f-9ba0-ec1eea828972)

## Practical Data Pipeline
![image](https://github.com/user-attachments/assets/8ba59a0d-701b-4007-b248-0db1138f9263)