To improve Speech to text recognition accuracy, customization is available for some languages and base models. Depending on the locale, you can upload audio + human-labeled transcripts, plain text, structured text, and pronunciation data. By default, plain text customization is supported for all available base models.

Supported customization data varies by locale (BCP-47). The combinations that appear in the support table are:

Audio + human-labeled transcript, Plain text
Audio + human-labeled transcript, Plain text, Pronunciation
Audio + human-labeled transcript, Plain text, Structured text, Pronunciation
Audio + human-labeled transcript, Plain text, Structured text, Pronunciation, Phrase list
Audio + human-labeled transcript, Audio, Plain text, Structured text, Pronunciation, Phrase list
Audio + human-labeled transcript, Audio, Plain text, Structured text, Pronunciation
Audio + human-labeled transcript, Audio, Plain text, Pronunciation
Audio + human-labeled transcript, Plain text, Phrase list
Audio + human-labeled transcript, Plain text, Structured text, Phrase list
Audio + human-labeled transcript, Plain text, Structured text

Locales in the table include Chinese (Southwestern Mandarin, Simplified) and Chinese (Taiwanese Mandarin, Traditional).

Try out the Real-time Speech to text tool without having to use any code.
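Because support differs from locale to locale, it can help to check a planned upload against the table before preparing data. The following is only a minimal sketch of such a check, not part of any Speech SDK: the names DataType, SUPPORT, and unsupported_types are hypothetical, and the locale keys "xx-AA" and "xx-BB" are placeholders to be replaced with real rows from the official support table.

```python
from enum import Enum


class DataType(Enum):
    """Customization data types named in the support table."""
    AUDIO_WITH_TRANSCRIPT = "Audio + human-labeled transcript"
    AUDIO = "Audio"
    PLAIN_TEXT = "Plain text"
    STRUCTURED_TEXT = "Structured text"
    PRONUNCIATION = "Pronunciation"
    PHRASE_LIST = "Phrase list"


# Placeholder rows only. "xx-AA" and "xx-BB" are NOT real locales;
# copy the actual locale codes and their data types from the official table.
SUPPORT = {
    "xx-AA": {DataType.AUDIO_WITH_TRANSCRIPT, DataType.PLAIN_TEXT},
    "xx-BB": {
        DataType.AUDIO_WITH_TRANSCRIPT,
        DataType.PLAIN_TEXT,
        DataType.PRONUNCIATION,
    },
}


def unsupported_types(locale, requested, table=SUPPORT):
    """Return the requested data types that the table does not list for locale."""
    if locale not in table:
        raise KeyError(f"no support row for locale {locale!r}")
    return set(requested) - table[locale]


if __name__ == "__main__":
    planned = [DataType.PLAIN_TEXT, DataType.PRONUNCIATION]
    for code in SUPPORT:
        missing = unsupported_types(code, planned)
        names = sorted(t.value for t in missing)
        print(code, "unsupported:", names)
```

Keeping the table as a plain dictionary of sets means a new locale row is a one-line change, and the set difference reports every unsupported type at once instead of stopping at the first.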