Recognising the contents in digitised financial documents

Title	Recognising the contents in digitised financial documents
Authors	Rimašauskas, Simas ; Belovas, Igoris
DOI	10.15388/LMITT.2025.22
Full Text
Is Part of	Lietuvos magistrantų informatikos ir IT tyrimai: konferencijos darbai, 2025 m. gegužės 13 d.... Vilnius : Vilniaus universiteto leidykla. 2025, p. 187-196.. eISSN 2783-784X
Keywords [eng]	machine learning ; natural language processing ; optical character recognition ; text recognition ; table recognition
Abstract [eng]	he necessity of content recognition in digital documents is everincreasing in the financial sector. Extracted data is used for fundamental analysis, modelling and portfolio selection. In the most prominent markets, there is a wide array of available sources to obtain the data, such as SEC filings easily. However, it is not so in markets with less investor interest, such as the CEE region or Latin America. Often, the only sources containing the data are primary reports by the company itself. Scarce secondary sources may provide data of dubious reliability. This leads to an excessive workload for analysts, implying the necessity to adapt existing intelligent methods for processing financial data.
Published	Vilnius : Vilniaus universiteto leidykla
Type	Conference paper
Language	English
Publication date	2025
CC license

„Recognising the contents in digitised financial documents“