Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/georgia-tech-db/evadb
Database system for AI-powered apps
https://github.com/georgia-tech-db/evadb
agent ai auto-gpt chatgpt data-analysis database eva gpt-4 gpt4all hacktoberfest huggingface labeling langchain llm object-detection serving video-analytics
Last synced: 2 days ago
JSON representation
Database system for AI-powered apps
- Host: GitHub
- URL: https://github.com/georgia-tech-db/evadb
- Owner: georgia-tech-db
- License: apache-2.0
- Created: 2018-09-10T02:26:03.000Z (over 6 years ago)
- Default Branch: staging
- Last Pushed: 2024-05-17T16:33:06.000Z (8 months ago)
- Last Synced: 2024-10-29T15:06:40.955Z (2 months ago)
- Topics: agent, ai, auto-gpt, chatgpt, data-analysis, database, eva, gpt-4, gpt4all, hacktoberfest, huggingface, labeling, langchain, llm, object-detection, serving, video-analytics
- Language: Python
- Homepage: https://evadb.ai/docs
- Size: 155 MB
- Stars: 2,631
- Watchers: 27
- Forks: 262
- Open Issues: 142
-
Metadata Files:
- Readme: README.md
- Changelog: CHANGELOG.md
- License: LICENSE.txt
Awesome Lists containing this project
- awesome-ChatGPT-repositories - evadb - Database system for AI-powered apps (Langchain)
- awesome-repositories - georgia-tech-db/evadb - Database system for AI-powered apps (Python)
- awesome-chatgpt - georgia-tech-db/evadb - EvaDB is a Python library that provides a database system for AI-powered apps (SDK, Libraries, Frameworks / Python)
- StarryDivineSky - georgia-tech-db/evadb
README
Database system for AI-powered apps
Follow EvaDB
EvaDB enables software developers to build AI apps in a few lines of code. Its powerful SQL API simplifies AI app development for both structured and unstructured data. EvaDB's benefits include:
🔮 Easy to connect the EvaDB query engine with your data sources, such as PostgreSQL or S3 buckets, and build AI-powered apps with SQL queries.
Structured Data Sources
Unstructured Data Sources
Application Data Sources- PostgreSQL
- SQLite
- MySQL
- MariaDB
- Clickhouse
- Snowflake- Local filesystem
- AWS S3 bucket- Github
More details on the supported data sources is [available here](https://evadb.readthedocs.io/en/latest/source/reference/databases/index.html).
🤝 Query your connected data with a pre-trained AI model from Hugging Face, OpenAI, YOLO, Stable Diffusion, etc.
Hugging Face
OpenAI
YOLO- Audio Classification
- Automatic Speech Recognition
- Text Classification
- Summarization
- Text2Text Generation
- Text Generation
- Image Classification
- Image Segmentation
- Image-to-Text
- Object Detection
- Depth Estimation- gpt-4
- gpt-4-0314
- gpt-4-32k
- gpt-4-32k-0314
- gpt-3.5-turbo
- gpt-3.5-turbo-0301- yolov8n.pt
- yolov8s.pt
- yolov8m.pt
- yolov8l.pt
- yolov8x.ptMore details on the supported AI models is [available here](https://evadb.readthedocs.io/en/latest/source/reference/ai/index.html)
🔧 Create or fine-tune AI models for regression, classification, and time series forecasting.
Regression
Classification
Time Series Forecasting- Ludwig
- Sklearn
- Xgboost- Ludwig
- Xboost- Statsforecast
- NeuralforecastMore details on the supported AutoML frameworks is [available here](https://evadb.readthedocs.io/en/latest/source/reference/ai/index.html).
💰 Faster AI queries thanks to AI-centric query optimizations such as caching, batching, and parallel processing.
- Function result caching helps reuse results of expensive AI function calls.
- LLM batching reduces token usage and dollars spent on LLM calls.
- Parallel query processing saves money and time spent on running AI models by better utilizing CPUs and/or GPUs.
- Query predicate re-ordering and predicate push-down accelerates queries over both structured and unstructured data.More details on the optimizations in EvaDB is [available here](https://evadb.readthedocs.io/en/latest/source/reference/optimizations.html).
👋 Hey! If you're excited about our vision of bringing AI inside database systems, show some ❤️ by:
- ⭐ starring our GitHub 🐙 Repo
- 📟 joining our Slack Community
- 🐦 following us on Twitter
- 📝 following us on Medium
We would love to learn about your AI app. Please complete this 1-minute form: https://v0fbgcue0cm.typeform.com/to/BZHZWeZm
## Quick Links
- [Quick Links](#quick-links)
- [Documentation](#documentation)
- [Why EvaDB](#why-evadb)
- [How does EvaDB work](#how-does-evadb-work)
- [Illustrative Queries](#illustrative-queries)
- [Illustrative Apps](#illustrative-apps)
- [More Illustrative Queries](#more-illustrative-queries)
- [Architecture of EvaDB](#architecture-of-evadb)
- [Community and Support](#community-and-support)
- [Contributing](#contributing)
- [Star History](#star-history)
- [License](#license)
## Documentation
You can find the complete documentation of EvaDB at [evadb.ai/docs](https://evadb.ai/docs/) 📚✨🚀
## Why EvaDB
In the world of AI, we've reached a stage where many AI tasks that were traditionally handled by AI or ML engineers can now be automated. EvaDB enables software developers with the ability to perform advanced AI tasks without needing to delve into the intricate details.
EvaDB covers many AI applications, including regression, classification, image recognition, question answering, and many other generative AI applications. EvaDB targets 99% of AI problems that are often repetitive and can be automated with a simple function call in an SQL query. Until now, there is no comprehensive open-source framework for bringing AI into an existing SQL database system with a principled AI optimization framework, and that's where EvaDB comes in.
Our target audience is software developers who may not necessarily have a background in AI but require AI capabilities to solve specific problems. We target programmers who write simple SQL queries inside their CRUD apps. With EvaDB, it is possible to easily add AI features to these apps by calling built-in AI functions in the queries.
## How does EvaDB work
- Connect EvaDB to your SQL and vector database systems with the `CREATE DATABASE` and `CREATE INDEX` statements.
- Write SQL queries with AI functions to get inference results:
- Pick a pre-trained AI model from Hugging Face, Open AI, Ultralytics, PyTorch, and built-in AI frameworks for generative AI, NLP, and vision applications;
- or pick from a variety of state-of-the-art ML engines for classic ML use-cases (classification, regression, etc.);
- or bring your custom model built with any AI/ML framework using `CREATE FUNCTION`.
Follow the [getting started](https://evadb.readthedocs.io/en/stable/source/overview/getting-started.html) guide to get on-boarded as fast as possible.
## Illustrative Queries
* Get insights about Github stargazers using GPT4.
```sql
SELECT name, country, email, programming_languages, social_media, GPT4(prompt,topics_of_interest)
FROM gpt4all_StargazerInsights;
--- Prompt to GPT-4
You are given 10 rows of input, each row is separated by two new line characters.
Categorize the topics listed in each row into one or more of the following 3 technical areas - Machine Learning, Databases, and Web development. If the topics listed are not related to any of these 3 areas, output a single N/A. Do not miss any input row. Do not add any additional text or numbers to your output.
The output rows must be separated by two new line characters. Each input row must generate exactly one output row. For example, the input row [Recommendation systems, Deep neural networks, Postgres] must generate only the output row [Machine Learning, Databases].
The input row [enterpreneurship, startups, venture capital] must generate the output row N/A.
```
* Build a vector index on the feature embeddings returned by the SIFT Feature Extractor on a collection of Reddit images. Return the top-5 similar images for a given image.
```sql
CREATE INDEX reddit_sift_image_index
ON reddit_dataset (SiftFeatureExtractor(data))
USING FAISS
SELECT name FROM reddit_dataset ORDER BY
Similarity(
SiftFeatureExtractor(Open('reddit-images/g1074_d4mxztt.jpg')),
SiftFeatureExtractor(data)
)
LIMIT 5
```
## Illustrative Apps
Here are some illustrative AI apps built using EvaDB (each notebook can be opened on Google Colab):
* 🔮 Sentiment Analysis using LLM within PostgreSQL
* 🔮 ChatGPT-based Video Question Answering
* 🔮 Text Summarization on PDF Documents
* 🔮 Analysing Traffic Flow with YOLO
* 🔮 Examining Emotions of Movie
* 🔮 Image Similarity Search
## More Illustrative Queries
* Get a transcript from a video stored in a table using a Speech Recognition model. Then, ask questions on the extracted transcript using ChatGPT.
```sql
CREATE TABLE text_summary AS
SELECT SpeechRecognizer(audio) FROM ukraine_video;
SELECT ChatGPT('Is this video summary related to Ukraine russia war', text)
FROM text_summary;
```
* Train a classic ML model for prediction using the Ludwig AI engine.
```sql
CREATE FUNCTION IF NOT EXISTS PredictHouseRent FROM
(SELECT * FROM HomeRentals)
TYPE Ludwig
PREDICT 'rental_price'
TIME_LIMIT 120;
```
## Architecture of EvaDB
EvaDB's AI-centric query optimizer takes a query as input and generates a query plan. The query engine takes the query plan and hits the relevant backends to efficiently process the query:
1. SQL Database Systems (Structured Data)
2. AI Frameworks (Transform Unstructured Data to Structured Data; Unstructured data includes PDFs, text, images, etc. stored locally or on the cloud)
3. Vector Database Systems (Feature Embeddings)
## Community and Support
We would love to learn about your AI app. Please complete this 1-minute form: https://v0fbgcue0cm.typeform.com/to/BZHZWeZm
If you run into any bugs or have any comments, you can reach us on our Slack Community 📟 or create a [Github Issue :bug:](https://github.com/georgia-tech-db/evadb/issues).
Here is EvaDB's public [roadmap 🛤️](https://github.com/orgs/georgia-tech-db/projects/3). We prioritize features based on user feedback, so we'd love to hear from you!
## Contributing
We are a lean team on a mission to bring AI inside database systems! All kinds of contributions to EvaDB are appreciated 🙌 If you'd like to get involved, here's information on where we could use your help: [contribution guide](https://evadb.readthedocs.io/en/latest/source/dev-guide/contribute.html) 🤗
CI Status:
[![CI Status](https://circleci.com/gh/georgia-tech-db/evadb.svg?style=svg)](https://circleci.com/gh/georgia-tech-db/evadb)
[![Documentation Status](https://readthedocs.org/projects/evadb/badge/?version=latest)](https://evadb.readthedocs.io/en/latest/index.html)
## Star History
## License
Copyright (c) [Georgia Tech Database Group](http://db.cc.gatech.edu/).
Licensed under an [Apache License](LICENSE.txt).