Το έργο με τίτλο SPIRIT: Sequential pattern mining with regular expression constraints από τον/τους δημιουργό/ούς Garofalakis Minos, Rastogi Rajeev, Shim Kyuseok διατίθεται με την άδεια Creative Commons Αναφορά Δημιουργού 4.0 Διεθνές
Βιβλιογραφική Αναφορά
M.N. Garofalakis, R. Rastogi and K. Shim, "SPIRIT: Sequential pattern mining with
regular expression constraints", in 25th VLDB Conference, September 1999, pp. 223-234.
Discovering sequential patterns is an important problem indata mining with a host of application domains includingmedicine, telecommunications, and the World Wide Web.Conventional mining systems provide users with only avery restricted mechanism (based on minimum support)for specifying patterns of interest. In this paper, we proposethe use of Regular Expressions (REs) as a flexibleconstraint specification tool that enables user-controlledfocus to be incorporated into the pattern mining process.We develop a family of novel algorithms (termed SPIRIT– Sequential Pattern mIning with Regular expressIon consTraints)for mining frequent sequential patterns that alsosatisfy user-specified RE constraints. The main distinguishingfactor among the proposed schemes is the degreeto which the RE constraints are enforced to prune thesearch space of patterns during computation. Our solutionsprovide valuable insights into the tradeoffs that arisewhen constraints that do not subscribe to nice properties(like anti-monotonicity) are integrated into the mining process.A quantitative exploration of these tradeoffs is conductedthrough an extensive experimental study on syntheticand real-life data sets.