TTSLab is a browser-based platform that allows users to test and compare Text-to-Speech (TTS) and Speech-to-Text (STT) models without the need for a server, API keys, or data collection. Built using WebGPU and WASM, it enables on-device inference, ensuring that all text and audio remain within the user's device. This makes TTSLab ideal for developers, researchers, and product teams who want to evaluate model performance, compare voices, or build applications with on-device speech AI.
The platform supports a wide range of models such as Kokoro 82M, Whisper, Moonshine, and Supertonic 2. These models are downloaded once and cached locally, allowing for instant reuse. TTSLab also includes features like voice comparison, benchmarking, and a Voice Agent that runs entirely in the browser. It emphasizes privacy, speed, and open-source collaboration, making it a powerful tool for anyone working with speech AI.
TTSLab operates through a simple four-step process:
| Benefit | Description |
|---|---|
| Privacy | No data is sent to external servers, ensuring sensitive content remains secure. |
| Speed | WebGPU acceleration reduces latency, enabling real-time applications like voice agents and live captioning. |
| Flexibility | Supports multiple models and allows side-by-side comparisons for evaluation and benchmarking. |
| Open Source | MIT licensed, encouraging community contributions and customization. |
| Accessibility | No setup required — just run it in your browser. |