Title |
Evaluation of Lithuanian speech-to-text transcribers |
Authors |
Kasparaitis, Pijus |
DOI |
10.15388/25-INFOR591 |
Full Text |
|
Is Part of |
Informatica.. Vilnius : Vilniaus universiteto leidykla. 2025, vol. 36, no. 2, p. 369-384.. ISSN 0868-4952. eISSN 1822-8844 |
Keywords [eng] |
speech-to-text transcription ; automatic speech recognition ; word error rate ; character error rate ; Lithuanian |
Abstract [eng] |
For more than two decades, Lithuanian speech recognition has been researched solely in Lithuania due to the need for deep knowledge of Lithuanian. AI advancements now allow high-quality speech-to-text systems to be built without native knowledge, given sufficient annotated data is available. This study evaluated as many as 18 Lithuanian speech transcribers using a small piece of recording; 7 best ones were selected and evaluated using extensive data. The top system achieved a WER of 5.1% for Lithuanian words, with three others showing 8.7–9.2%. For other word-size tokens, such as numbers, speech disfluencies, abbreviations, foreign words, a classification adapted to the Lithuanian language was proposed. Different processing strategies for tokens of these classes were examined and it was assessed which transcribers tend to follow which strategies. |
Published |
Vilnius : Vilniaus universiteto leidykla |
Type |
Journal article |
Language |
English |
Publication date |
2025 |
CC license |
|