Volume 9 Number 8 (Aug. 2014)
Home > Archive > 2014 > Volume 9 Number 8 (Aug. 2014) >
JSW 2014 Vol.9(8): 2080-2086 ISSN: 1796-217X
doi: 10.4304/jsw.9.8.2080-2086

Efficient Method for Mining Patterns from Highly Similar and Dense Database based on Prefix-Frequent-Items

Meng Han1, 2, Zhihai Wang1, Jidong Yuan1

1School of Computer and Information Technology Beijing Jiaotong University, Beijing, 100044, P.R. China
2School of Computer Science and Engineering Beifang University of Nationalities, Yinchuan, 750021, P.R. China


Abstract—In recent years, there are a great deal of efforts on sequential pattern mining, but some challenges have not been resolved, such as large search spaces and the ineffectiveness in handling highly similar, dense and long sequences. This paper mainly focuses on how to design some effective search space pruning methods to accelerate the mining process. We present a novel structure, Prefix- Frequent-Items Graph (PFI-Graph), which presents the prefix frequent items of other items in sequential patterns. An efficient algorithm PFI-PrefixSpan (Prefix-Frequent- Items PrefixSpan) based on PFI-Graph is proposed in this paper. It avoids redundant data scanning, and thus can effectively speed up the discovery process of new patterns. Extensive experimental results on some synthetic and real sequence datasets show that the proposed novel structure is substantially more efficient than PrefixSpan with physicalprojection and pseudo-projection, especially for dense and highly similar sequence databases.

Index Terms—sequential pattern mining; dense database; highly similar sequence; long sequence; prefix frequent items

[PDF]

Cite: Meng Han, Zhihai Wang, Jidong Yuan, "Efficient Method for Mining Patterns from Highly Similar and Dense Database based on Prefix-Frequent-Items," Journal of Software vol. 9, no. 8, pp. 2080-2086, 2014.

General Information

  • ISSN: 1796-217X (Online)

  • Abbreviated Title: J. Softw.

  • Frequency:  Quarterly

  • APC: 500USD

  • DOI: 10.17706/JSW

  • Editor-in-Chief: Prof. Antanas Verikas

  • Executive Editor: Ms. Cecilia Xie

  • Abstracting/ Indexing: DBLP, EBSCO,
           CNKIGoogle Scholar, ProQuest,
           INSPEC(IET), ULRICH's Periodicals
           Directory, WorldCat, etc

  • E-mail: jsweditorialoffice@gmail.com

  • Oct 22, 2024 News!

    Vol 19, No 3 has been published with online version   [Click]

  • Jan 04, 2024 News!

    JSW will adopt Article-by-Article Work Flow

  • Apr 01, 2024 News!

    Vol 14, No 4- Vol 14, No 12 has been indexed by IET-(Inspec)     [Click]

  • Apr 01, 2024 News!

    Papers published in JSW Vol 18, No 1- Vol 18, No 6 have been indexed by DBLP   [Click]

  • Jun 12, 2024 News!

    Vol 19, No 2 has been published with online version   [Click]