"Local model runtime and manager for downloading, switching between, and serving open language models through a simple CLI, desktop app, and REST API."
tags