The speaker diarization is quite good as compared to other users and the transcription is fairly good.
As most speech recognition tools I’ve tried, it has trouble with names, puntuation, and similar words, which demands careful proofreading.
The best part is support for multiple languages. The transcripts are very accurate, and having it done as a cloud service helps us separate the concern of having accurate STT frmo the rest of our app.
1. Lack of documentation, or sometime the available document is incomplete or not up to date. 2. The default model does seem to pick the best model for the current audio and the user to pick a different model which performance better.
The speaker diarization is quite good as compared to other users and the transcription is fairly good.
The best part is support for multiple languages. The transcripts are very accurate, and having it done as a cloud service helps us separate the concern of having accurate STT frmo the rest of our app.
As most speech recognition tools I’ve tried, it has trouble with names, puntuation, and similar words, which demands careful proofreading.
1. Lack of documentation, or sometime the available document is incomplete or not up to date. 2. The default model does seem to pick the best model for the current audio and the user to pick a different model which performance better.