Title Evaluation of Lithuanian speech-to-text transcribers
Authors Kasparaitis, Pijus
DOI 10.15388/25-INFOR591
Full Text Download
Is Part of Informatica.. Vilnius : Vilniaus universiteto leidykla. 2025, vol. 36, no. 2, p. 369-384.. ISSN 0868-4952. eISSN 1822-8844
Keywords [eng] speech-to-text transcription ; automatic speech recognition ; word error rate ; character error rate ; Lithuanian
Abstract [eng] For more than two decades, Lithuanian speech recognition has been researched solely in Lithuania due to the need for deep knowledge of Lithuanian. AI advancements now allow high-quality speech-to-text systems to be built without native knowledge, given sufficient annotated data is available. This study evaluated as many as 18 Lithuanian speech transcribers using a small piece of recording; 7 best ones were selected and evaluated using extensive data. The top system achieved a WER of 5.1% for Lithuanian words, with three others showing 8.7–9.2%. For other word-size tokens, such as numbers, speech disfluencies, abbreviations, foreign words, a classification adapted to the Lithuanian language was proposed. Different processing strategies for tokens of these classes were examined and it was assessed which transcribers tend to follow which strategies.
Published Vilnius : Vilniaus universiteto leidykla
Type Journal article
Language English
Publication date 2025
CC license CC license description