
Please use this identifier to cite or link to this item:
https://scholar.dlu.edu.vn/handle/123456789/555
Title: | Fast generation of sequential patterns with item constraints from concise representations |
Authors: | Dương, Văn Hải; Trương, Chí Tín; Trần, Ngọc Anh; Bac Le |
Keywords: | Sequential pattern; Generator and closed sequences; Equivalence relation; Partition; Constraint-based pattern mining; Item constraint |
Issue Date: | 2020-11 |
Publisher: | Springer Link |
Journal: | Knowl Inf Syst |
Volume: | 62 |
Pages: | 2191–2223 |
Abstract: | Constraint-based frequent sequence mining is an important and necessary task in data mining since it shows results very close to the requirements and interests of users. Most existing algorithms for performing this task are based on a traditional approach that mines patterns directly from a sequence database (SDB). However, in fact, SDBs are often very large. The algorithms thus often exhibit poor performance because the number of generated candidates and the search space are enormous, especially for low minimum support thresholds. In addition, these algorithms must read an SDB again when a constraint is changed by the user. In the context of frequently varied constraints, repeatedly scanning SDBs consumes much time. To address this issue, we propose a novel approach for generating frequent sequences with various constraints from the two sets of frequent closed sequences (FCS) and frequent generator sequences (FGS), which are the concise representations of the set FS of all frequent sequences. The proposed approach is based on novel theoretical results that show an explicit relationship between FS and these two sets and have been strictly proved. The approach is then used to develop an efficient algorithm named MFS-IC for quickly generating frequent sequences with item constraints, a task that has many real-life applications. Extensive experiments on real-life and synthetic databases show that the proposed MFS-IC algorithm outperforms state-of-the-art algorithms, which directly mine frequent sequences with constraints from an SDB, in terms of runtime, memory usage and scalability. |
URI: | https://scholar.dlu.edu.vn/handle/123456789/555 |
DOI: | 10.1007/s10115-019-01418-2 |
Type: | Article published in an ISI-indexed journal, including book chapters |
Appears in Collections: | Scientific research projects (Faculty of Mathematics and Computer Science) |
Files in This Item:
File | Description | Size | Format |
---|---|---|---|
Fast Gen. of Seq Patterns_Hai Duong_KAIS.pdf | | 3.32 MB | Adobe PDF |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.