Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/metriccoders/ai-studio

AI Studio by Metric Coders: A No-Code Software to train, download and deploy Large Language Models.
https://github.com/metriccoders/ai-studio

ai-studio artificial-intelligence generative-ai keras langchain large-language-models llm machine-learning natural-language-processing nlp python python3 scikit-learn tensorflow

Last synced: 9 days ago
JSON representation

AI Studio by Metric Coders: A No-Code Software to train, download and deploy Large Language Models.

Awesome Lists containing this project

README

        


AI Studio


The no-code software to train and deploy ML models and Large Language Models, and eventually your AI SaaS

## Overview
**AI Studio** is a GitHub repository created by **Metric Coders**. It is a no-code software to create and train various Large Language Models by configuring different hyperparameters through the UI.

Due to lack of processing power, we are currently focussing only on providing a robust platform for creating ML models using scikit-learn.

The platform is designed to simplify the process of building and experimenting with LLMs, including ML models, making it accessible for users with varying levels of expertise in machine learning. This application is built using **Streamlit** to provide an interactive and user-friendly experience.

The platform offers a **one-click deployment** zip file that contains everything you need to directly deploy the model as an API into your development and production environments.
## Homepage

Here is how the webpage looks when pre-trained datasets are loaded
![Pre-trained Datasets](readme_images/home_page.png)

Here is how the homepage looks when a custom dataset is loaded
![Dataset](readme_images/file_upload.png)

Visualization your data effortlessly in different charts.
![Charts](readme_images/charts.png)

Visualization your data effortlessly in different tables.
![Charts](readme_images/data_table.png)

## Features
### Current Features
- **Streamlit**: Users can easily create ML models using a graphical interface.
- **Hyperparameter Tuning**: Adjust and experiment with different hyperparameters to optimize model performance.
- **Preloaded Datasets**: Access to a variety of datasets provided by scikit-learn, facilitating immediate experimentation and learning.
- **Model Export**: Functionality to export trained models for deployment and further use.
- **Custom Datasets**: Users can upload and use their own datasets for model training.

### Upcoming Features
- **Data Cleaning and Preprocessing**: Tools for data cleaning and preprocessing will be integrated to prepare datasets for model training.
- **Support for LLMs**: Expansion to include large language models (LLMs), broadening the scope of the platform to natural language processing tasks.

## Getting Started
### Prerequisites
- Python 3.6+
- Scikit-learn library
- Streamlit

### Installation
1. Clone the repository:
```bash
git clone https://github.com/MetricCoders/AI-Studio.git
```
2. Navigate to the project directory:
```bash
cd AI-Studio
```
3. Install the required dependencies:
```bash
pip install -r requirements.txt
```

### Running the Application
To start the Streamlit application, use the following command:
```bash
streamlit run Home.py
```
This will launch the AI Studio web UI. Open your web browser and navigate to the URL provided in the terminal to start using the platform.

## Usage
### Creating a Model
1. **Select a Dataset**: Choose from a variety of preloaded datasets or load your own datasets.
2. **Choose an Algorithm**: Select the machine learning algorithm you wish to use (e.g., linear regression, decision trees, etc.).
3. **Set Hyperparameters**: Configure the hyperparameters for your chosen algorithm.
4. **Train the Model**: Click the "Train" button to build and train your model.

### Evaluating a Model
After training, AI Studio provides performance metrics and visualizations to help you evaluate the effectiveness of your model. Use these insights to refine your model by adjusting hyperparameters or selecting different algorithms.

### Downloading the Trained Model
The "**Download Model**" button allows you to instantly download the model to your local directory.

### Downloading the Deployment Zip File
The "**Download Deployment Zip**" button allows you to instantly download a zip file that contains everything you need to run the model, including the model itself. It contains:
1. **backend_api.py**: This is full-fledged backend api written in fastapi that contains the complete code necessary to run the model in a development/production setup.
2. **model.joblib**: This is the trained model available to be used instantly in the production environment. This model is used by **backend_api.py** in order to make predictions.
3. **requirements.txt**: This is the requirements file required to run the backend_api.py correctly. The packages present in the requirements.txt file needs to installed in the following manner:

```bash
pip install -r requirements.txt
```

### Large Language Models
In the menu on the left, Large Language Models section allows you to train and fine-tune large models. You can add as many layers as you need.
Due to Streamlit cloud constraints for the demo purpose, the values are limited. In your local, you can change it to whatever you need depending on your system.

The number of epochs have been limited to 10 and the batch_size is 16.

### Loading Own Datasets
The datasets should have headers and the column by the name 'target' should contain the output/result. A sample is given below:

feature1,feature2,feature3,target

1,2,3,0

2,3,4,0

3,4,5,1

4,5,6,1

5,6,7,0

6,7,8,1

7,8,9,0

8,9,10,1

9,10,11,1

10,11,12,0

## License
This project is licensed under the MIT License. See the `LICENSE` file for details.

## Contact
For questions, feedback, or support, please contact me via email at [email protected].

---

**AI Studio by Metric Coders** aims to democratize machine learning by providing an intuitive platform for model creation and experimentation. With upcoming features and continuous improvements, it strives to be a comprehensive tool for both novice and experienced data scientists.