Title Investigation of VITS text-to-speech for the Lithuanian language
Authors Lėveris, Vytautas ; Korvel, Gražina
DOI 10.15388/LMITT.2026.15
Full Text Download
Is Part of Lietuvos magistrantų informatikos ir IT tyrimai: konferencijos darbai, 2026 m. gegužės 6 d. Vilnius.. Vilnius : Vilniaus universiteto leidykla. 2026, p. 140-148.. eISSN 2783-784X
Keywords [eng] text-to-speech ; Lithuanian language ; VITS ; speech synthesis ; phone- me-based modeling
Abstract [eng] This study investigates the performance of the VITS model for Lithuanian speech synthesis under different training configurations. Experiments were conducted using datasets with phoneme-based and grapheme-based text representations, accented text, and both single-speaker and multi-speaker setups. The goal was to evaluate how linguistic pre-processing and speaker diversity influence synthesis quality. Model outputs were compared using objective measures. The results provide insights into the impact of phoneme representation and accent information on the quality of Lithuanian neural TTS systems.
Published Vilnius : Vilniaus universiteto leidykla
Type Conference paper
Language English
Publication date 2026
CC license CC license description