Metadata-Version: 2.4
Name: kooka-server
Version: 0.0.5
Summary: local inference web server for MLX / mlx-lm.
Author: Anchen
Requires-Python: >=3.12
Description-Content-Type: text/markdown
License-File: LICENSE
Requires-Dist: mlx-lm==0.30.5
Provides-Extra: dev
Requires-Dist: pytest>=8.0.0; extra == "dev"
Requires-Dist: pytest-timeout>=2.3.0; extra == "dev"
Requires-Dist: requests>=2.31.0; extra == "dev"
Dynamic: license-file

# kooka-server

Local inference web server for MLX / `mlx-lm`.

Note: `mlx-lm==0.30.2` depends on `transformers==5.0.0rc1`, so `uvx` needs pre-releases enabled:

```bash
export UV_PRERELEASE=allow
```

## Quickstart (single machine)
```bash
uvx kooka-server serve --model mlx-community/MiniMax-M2.1-4bit --host 127.0.0.1 --port 8080
```

## Quickstart (distributed)
See `docs/DISTRIBUTED_SERVER.md`.
