Title |
Large language models for lithuanian language / |
Translation of Title |
Didieji kalbos modeliai lietuvių kalbai. |
Authors |
Plesevičius, Dominykas |
Full Text |
|
Pages |
45 |
Keywords [eng] |
Large language model, LLM, natural language processing, NLP, multilingual models, low-resource languages, model training, didieji kalbos modeliai, natūralios kalbos apdorojimas, daugiakalbiai modeliai, modelių apmokymas |
Abstract [eng] |
Efforts to develop large language models for the Lithuanian language have been limited, primarily due to data and resource constraints. In this work, we aim to address this issue by training models specifically tailored for Lithuanian. We enhance existing multilingual large language models through additional training and develop a new Lithuanian-specific model with an optimized tokenizer. To evaluate their performance, we test these models across a diverse set of benchmarks. The results highlight both the strengths and weaknesses of Lithuanian LLMs while suggesting areas for improvement, including higher quality data collection, synthetic data generation, advanced training techniques, and more effective model design. |
Dissertation Institution |
Vilniaus universitetas. |
Type |
Master thesis |
Language |
English |
Publication date |
2025 |