ROBUST UZBEK ASR AND TTS FOR DIALECTAL AND NOISY SETTINGS

loading.default
thumbnail.default.alt

item.page.date

item.page.journal-title

item.page.journal-issn

item.page.volume-title

item.page.publisher

American Journals Publishing

item.page.abstract

In this article we present a unified recipe for robust Automatic Speech Recognition (ASR) and Text-to-Speech (TTS) for Uzbek under dialectal variation, code-switching, and real-world noise. We combine self-supervised pretraining, dialect-aware lexicons, multi-script text normalization, targeted augmentation, and test-time adaptation. On simulated and field recordings, the ASR reduces WER by large margins; the TTS maintains naturalness and intelligibility across accents and SNRs.

item.page.description

item.page.citation

item.page.collections

item.page.endorsement

item.page.review

item.page.supplemented

item.page.referenced