END-TO-END UZBEK-RUSSIAN SPEECH TRANSLATION WITH SELF-SUPERVISED PRETRAINING
Web of Journals Publishing
Abstract
In this article we study end-to-end Uzbek→Russian speech translation under realistic low-resource and code-switching conditions. We couple a wav2vec-style encoder pre-trained on unlabeled audio with a Transformer decoder, add multi-task ASR/CTC objectives, and distill from a strong cascade teacher. Script-aware tokenization and data augmentation reduce sparsity. On conversational and broadcast test sets, the model improves BLEU and chrF at fixed latency and produces fewer morphology and named-entity errors.
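The multi-task objective described above (a translation cross-entropy term plus an auxiliary ASR/CTC term on the encoder) can be sketched as follows. This is a minimal illustration, not the authors' implementation: the class name, the interpolation weight, and all tensor shapes are assumptions for the sake of the example.

```python
import torch
import torch.nn as nn

class JointSTLoss(nn.Module):
    """Illustrative joint objective: ST cross-entropy + auxiliary ASR CTC.

    The 0.3 CTC weight and the blank id are hypothetical defaults,
    not values taken from the article.
    """

    def __init__(self, ctc_weight: float = 0.3, blank_id: int = 0):
        super().__init__()
        self.ctc_weight = ctc_weight
        self.ctc = nn.CTCLoss(blank=blank_id, zero_infinity=True)
        self.ce = nn.CrossEntropyLoss(ignore_index=-100)

    def forward(self, enc_log_probs, asr_targets, enc_lens, asr_lens,
                dec_logits, st_targets):
        # enc_log_probs: (T, B, V_src) log-softmax over the source (ASR) vocab
        # dec_logits:    (B, L, V_tgt) decoder logits over the target vocab
        loss_ctc = self.ctc(enc_log_probs, asr_targets, enc_lens, asr_lens)
        loss_st = self.ce(dec_logits.reshape(-1, dec_logits.size(-1)),
                          st_targets.reshape(-1))
        # Interpolate the translation loss with the auxiliary ASR term.
        return (1 - self.ctc_weight) * loss_st + self.ctc_weight * loss_ctc
```

In this formulation the CTC branch regularizes the pre-trained encoder toward monotonic source transcription while the decoder learns the (possibly reordering) translation mapping.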