https://github.com/underneathall/pinferencia

Python + Inference - Model Deployment library in Python. Simplest model inference server ever.
https://github.com/underneathall/pinferencia

ai artificial-intelligence computer-vision data-science deep-learning huggingface inference inference-server machine-learning model-deployment model-serving modelserver nlp paddlepaddle predict python pytorch serving tensorflow transformers

Last synced: about 1 month ago
JSON representation

Python + Inference - Model Deployment library in Python. Simplest model inference server ever.

Host: GitHub
URL: https://github.com/underneathall/pinferencia
Owner: underneathall
License: apache-2.0
Created: 2022-04-04T15:50:47.000Z (about 3 years ago)
Default Branch: main
Last Pushed: 2023-02-14T22:50:48.000Z (over 2 years ago)
Last Synced: 2025-04-03T13:16:22.767Z (3 months ago)
Topics: ai, artificial-intelligence, computer-vision, data-science, deep-learning, huggingface, inference, inference-server, machine-learning, model-deployment, model-serving, modelserver, nlp, paddlepaddle, predict, python, pytorch, serving, tensorflow, transformers
Language: Python
Homepage: https://pinferencia.underneathall.app
Size: 9.57 MB
Stars: 557
Watchers: 39
Forks: 86
Open Issues: 17
Metadata Files:
- Readme: Readme.md
- Contributing: CONTRIBUTING.md
- License: LICENSE
- Code of conduct: CODE_OF_CONDUCT.md

Awesome Lists containing this project

README

        ![Pinferencia](/docs/assets/images/logo_header.png)



    Simple, but Powerful.





    

        

    

    

        

    

    

        

    

    

        

    

    



---





    English Doc

 |



    中文文档

|



    中文Readme







    Help wanted. Translation, rap lyrics, all wanted. Feel free to create an issue.



---

**Pinferencia** tries to be the simplest machine learning inference server ever!

**Three extra lines and your model goes online**.

Serving a model with GUI and REST API has never been so easy.

![Pinferencia-GUI](/docs/assets/images/examples/translation-gui.png)

![Pinferencia-REST API](/docs/assets/images/examples/translate-app.png)

If you want to

- give your model a **GUI** and **REST API**

- find a **simple but robust** way to serve your model

- write **minimal** codes while maintain controls over you service

- **avoid** any **heavy-weight** solutions

- **compatible** with other tools/platforms

You're at the right place.

## Features

**Pinferencia** features include:

- **Fast to code, fast to go alive**. Minimal codes needed, minimal transformation needed. Just based on what you have.

- **100% Test Coverage**: Both statement and branch coverages, no kidding. Have you ever known any model serving tool so seriously tested?

- **Easy to use, easy to understand**.

- **A pretty and clean GUI** out of box.

- **Automatic API documentation page**. All API explained in details with online try-out feature.

- **Serve any model**, even a single function can be served.

- **Support Kserve API**, compatible with Kubeflow, TF Serving, Triton and TorchServe. There is no pain switching to or from them, and **Pinferencia** is much faster for prototyping!

## Install

### Recommend

```bash

pip install "pinferencia[streamlit]"

```

### Backend Only

```bash

pip install "pinferencia"

```

## Quick Start

**Serve Any Model**

```python title="app.py"

from pinferencia import Server

class MyModel:

    def predict(self, data):

        return sum(data)

model = MyModel()

service = Server()

service.register(model_name="mymodel", model=model, entrypoint="predict")

```

Just run:

```

pinfer app:service

```

Hooray, your service is alive. Go to http://127.0.0.1:8501/ and have fun.

**Any Deep Learning Models?** Just as easy. Simple train or load your model, and register it with the service. Go alive immediately.

**Hugging Face**

Details: [HuggingFace Pipeline - Vision](https://pinferencia.underneathall.app/ml/huggingface/pipeline/vision/)

```python title="app.py" linenums="1"

from transformers import pipeline

from pinferencia import Server

vision_classifier = pipeline(task="image-classification")

def predict(data):

    return vision_classifier(images=data)

service = Server()

service.register(model_name="vision", model=predict)

```

**Pytorch**

```python title="app.py"

import torch

from pinferencia import Server

# train your models

model = "..."

# or load your models (1)

# from state_dict

model = TheModelClass(*args, **kwargs)

model.load_state_dict(torch.load(PATH))

# entire model

model = torch.load(PATH)

# torchscript

model = torch.jit.load('model_scripted.pt')

model.eval()

service = Server()

service.register(model_name="mymodel", model=model)

```

**Tensorflow**

```python title="app.py"

import tensorflow as tf

from pinferencia import Server

# train your models

model = "..."

# or load your models (1)

# saved_model

model = tf.keras.models.load_model('saved_model/model')

# HDF5

model = tf.keras.models.load_model('model.h5')

# from weights

model = create_model()

model.load_weights('./checkpoints/my_checkpoint')

loss, acc = model.evaluate(test_images, test_labels, verbose=2)

service = Server()

service.register(model_name="mymodel", model=model, entrypoint="predict")

```

Any model of any framework will just work the same way. Now run `uvicorn app:service --reload` and enjoy!

## Contributing

If you'd like to contribute, details are [here](./CONTRIBUTING.md)

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/underneathall/pinferencia

Awesome Lists containing this project

README