| Title |
Investigation of VITS text-to-speech for the Lithuanian language |
| Authors |
Lėveris, Vytautas ; Korvel, Gražina |
| DOI |
10.15388/LMITT.2026.15 |
| Is Part of |
Lietuvos magistrantų informatikos ir IT tyrimai: konferencijos darbai, 2026 m. gegužės 6 d. Vilnius. Vilnius : Vilniaus universiteto leidykla. 2026, p. 140-148. eISSN 2783-784X |
| Keywords [eng] |
text-to-speech ; Lithuanian language ; VITS ; speech synthesis ; phoneme-based modeling |
| Abstract [eng] |
This study investigates the performance of the VITS model for Lithuanian speech synthesis under different training configurations. Experiments were conducted using datasets with phoneme-based and grapheme-based text representations, accented text, and both single-speaker and multi-speaker setups. The goal was to evaluate how linguistic pre-processing and speaker diversity influence synthesis quality. Model outputs were compared using objective measures. The results provide insights into the impact of phoneme representation and accent information on the quality of Lithuanian neural TTS systems. |
| Published |
Vilnius : Vilniaus universiteto leidykla |
| Type |
Conference paper |
| Language |
English |
| Publication date |
2026 |