Title Tikimybinis dažnų posekių paieškos algoritmas /
Another Title Probabilistic algorithm for mining frequent sequences.
Authors Pragarauskaitė, Julija ; Dzemyda, Gintautas
DOI 10.15388/LMR.2010.57
Full Text Download
Is Part of Lietuvos matematikos rinkinys. Lietuvos matematikų draugijos darbai. 2010, T. 51, p. 313-318.. ISSN 0132-2818
Keywords [eng] Sequence mining, frequent ; Probabilistic algorithm ; Data mining
Abstract [eng] Frequent sequence mining in large volume databases is important in many areas, e.g., biological, climate, financial databases. Exact frequent sequence mining algorithms usually read the whole database many times, and if the database is large enough, then frequent sequence mining is very long or requires supercomputers. A new probabilistic algorithm for mining frequent sequences is proposed. It analyzes a random sample of the initial database. The algorithm makes decisions about the initial database according to the random sample analysis results and performs much faster than the exact mining algorithms. The probability of errors made by the probabilistic algorithm is estimated using statistical methods.
Type Conference paper
Language Lithuanian
Publication date 2010
CC license CC license description