VoxServe Documentation#

VoxServe is a streaming-centric serving system for Speech Language Models (SpeechLMs), supporting both text-to-speech (TTS) and speech-to-speech (STS) workloads.

Getting Started

Quickstart
Core Concepts

Usage Guides

Usage Guides
- Text-to-Speech (TTS) Models
- Speech-to-Speech (STS) Models

Reference

API Reference
Python API
- Package overview
- Key modules
CLI Reference
- Usage
- Arguments
Supported Models
Architecture

Contributing

Development