# Pico MLX Server

Pico MLX Server is the easiest way to get started with Apple's [MLX AI framework](https://github.com/ml-explore/mlx).

Pico MLX Server provides a GUI for [MLX Server](https://github.com/mustafaaljadery/mlxserver). MLX Server exposes an API for local MLX models that conforms to the [OpenAI API](https://platform.openai.com/docs/api-reference/completions/create), which allows you to use most existing OpenAI-compatible chat clients with Pico MLX Server.
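Because the server speaks the OpenAI completions API, any OpenAI client library can be pointed at it. Below is a minimal sketch using the official `openai` Python package, assuming the default server described later in this README (port `8080`, model `mlx-community/Nous-Hermes-2-Mistral-7B-DPO-4bit-MLX`) is running; whether the local server checks API keys is an assumption:

```
# Minimal sketch: call a running Pico MLX Server through the OpenAI Python SDK.
# Assumes the default server from this README (port 8080 and the
# mlx-community/Nous-Hermes-2-Mistral-7B-DPO-4bit-MLX model).
from openai import OpenAI

client = OpenAI(
    base_url="http://127.0.0.1:8080/v1",  # local Pico MLX Server instead of api.openai.com
    api_key="not-needed",                 # assumption: the local server ignores API keys
)

completion = client.completions.create(
    model="mlx-community/Nous-Hermes-2-Mistral-7B-DPO-4bit-MLX",
    prompt="Write me a poem about the ocean",
    max_tokens=256,
)
print(completion.choices[0].text)
```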

![Menu extra screenshot](Images/menuExtra.png)

## Highlights

- Start and stop servers quickly via menu bar extra
- Download MLX models from the [MLX community on HuggingFace](https://huggingface.co/mlx-community)
- Install and set up a Python environment, MLX, and MLX Server from within Pico MLX Server
- Run multiple servers on different ports
- View logs of the servers in separate windows
- Custom link to open your favorite chat client (defaults to [Pico AI Assistant](https://apps.apple.com/us/app/pico-ai-copilot/id1668205047))

### Supported MLX models

See [MLX Community on HuggingFace](https://huggingface.co/mlx-community)

## Getting Started

To install Pico MLX Server, build the source using Xcode, or download the [notarized executable directly from GitHub](https://github.com/ronaldmannak/PicoMLXServer/tags).

To set up Pico MLX Server, open the app and either:
- Install and set up Python, pip, [MLX](https://github.com/ml-explore/mlx), [MLX Server](https://github.com/mustafaaljadery/mlxserver), and optionally Conda manually, or
- Use Pico MLX Server's automated setup (`MLX -> Setup...`)

By default, Pico MLX Server uses [Conda](https://docs.conda.io/en/latest/) to create a virtual environment for the servers called `pico`. This (hopefully) avoids any Python version issues. Conda can be disabled in `Settings`.
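After setup, you can quickly confirm that MLX itself works in the environment the servers will use. A minimal sketch, assuming the `pico` Conda environment created above is active:

```
# Sanity-check sketch: run inside the "pico" Conda environment created during setup
# to confirm MLX imports and runs. This script is not part of Pico MLX Server.
import mlx.core as mx

a = mx.array([1.0, 2.0, 3.0])
print(mx.sum(a))  # printing forces MLX's lazy evaluation; expect array(6, dtype=float32)
```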

![Setup window screenshot](Images/setup.png)

### Requirements
- macOS 14.0 (Sonoma) or later

### Create a New Server

![Server manager screenshot](Images/serverManager.png)

- Select `MLX -> Servers -> New Server...`
- Press `Create` to create the default server `mlx-community/Nous-Hermes-2-Mistral-7B-DPO-4bit-MLX` on port `8080`
- To use a different model, click the `v` button or manually type in a model from the [MLX Community](https://huggingface.co/mlx-community) on HuggingFace (make sure to use the `mlx-community/` prefix)
- Press the `View Logs` button to open a window with the server's real-time logs

![Server log screenshot](Images/log.png)

### Use Pico MLX Server with an AI client

- Point any OpenAI API-compatible AI assistant to `http://127.0.0.1:8080` (or whichever port you chose in Pico MLX Server). (Instructions for Pico AI Assistant coming soon)
- Curl:

```
curl -X GET 'http://127.0.0.1:8080/generate?prompt=write%20me%20a%20poem%20about%20the%20ocean&stream=true'
```

### API Endpoints

- Pico MLX Server uses OpenAI's `POST /v1/completions` API. See https://platform.openai.com/docs/api-reference/completions/create for more information.
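For reference, a raw request against that endpoint looks roughly like the sketch below, using Python's `requests`. The port and model name are the defaults used earlier in this README; which optional OpenAI parameters MLX Server honors beyond `model` and `prompt` is an assumption:

```
# Sketch of a raw OpenAI-style completions request against a local Pico MLX Server.
# Endpoint and port follow this README; support for optional parameters such as
# max_tokens is an assumption.
import requests

resp = requests.post(
    "http://127.0.0.1:8080/v1/completions",
    json={
        "model": "mlx-community/Nous-Hermes-2-Mistral-7B-DPO-4bit-MLX",
        "prompt": "Write me a poem about the ocean",
        "max_tokens": 256,
    },
    timeout=120,
)
resp.raise_for_status()
print(resp.json()["choices"][0]["text"])
```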

## Known Issues

- Pico MLX Server doesn't detect whether a port is already in use (use `lsof -i:8080` in the terminal to find the PID of the server occupying it; a programmatic check is sketched after this list)
- There is a SwiftUI issue in the `New Server` window and the `Servers` menu where the state of servers isn't updated
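Until the app checks this itself, a minimal workaround sketch for testing a port from Python (this helper is hypothetical and not part of Pico MLX Server):

```
# Workaround sketch: check whether a port is free before creating a server.
# Uses only the Python standard library.
import socket

def port_is_free(port: int, host: str = "127.0.0.1") -> bool:
    with socket.socket(socket.AF_INET, socket.SOCK_STREAM) as s:
        return s.connect_ex((host, port)) != 0  # non-zero means nothing is listening

print(port_is_free(8080))
```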

## Roadmap

- Switch from Python to [MLX Swift](https://github.com/ml-explore/mlx-swift)
- Swift-based HTTP server

## Related projects

- [MLX](https://github.com/ml-explore/mlx)
- [MLX Swift](https://github.com/ml-explore/mlx-swift)
- [MLX Server](https://github.com/mustafaaljadery/mlxserver)
- [MLX Community](https://huggingface.co/mlx-community)

Pico MLX Server is part of a bundle of open source Swift tools for AI engineers.
Looking for a server-side Swift OpenAI proxy to protect your OpenAI keys? Check out [Swift OpenAI Proxy](https://github.com/ronaldmannak/SwiftOpenAIProxy).

## Authors and Acknowledgements

Pico MLX Server, [Swift OpenAI Proxy](https://github.com/ronaldmannak/SwiftOpenAIProxy), and [Pico AI Assistant](https://apps.apple.com/us/app/pico-ai-copilot/id1668205047) were created by [Ronald Mannak](https://twitter.com/ronaldmannak) with help from [Ray Fernando](https://twitter.com/rayfernando1337/).

MLX Server was created by [Mustafa Aljadery](https://www.maxaljadery.com/) and [Siddharth Sharma](https://stanford.edu/~sidshr/).

Pico MLX Server uses code from [MLX Swift Chat](https://github.com/PreternaturalAI/mlx-swift-chat) and [Swift Chat](https://github.com/huggingface/swift-chat).