Coqui STT (πΈSTT) is a fast, open-source, multi-platform, deep-learning toolkit for training and deploying speech-to-text models. πΈSTT is battle tested in both production and research π
πΈSTT features
High-quality pre-trained STT model.
Efficient training pipeline with Multi-GPU support.
Streaming inference.
Multiple possible transcripts, each with an associated confidence score.