Publikace UTB
Repozitář publikační činnosti UTB

Incremental clickstream pattern mining with search boundaries

Repozitář DSpace/Manakin

Zobrazit minimální záznam


dc.title Incremental clickstream pattern mining with search boundaries en
dc.contributor.author Huynh, Minh Huy
dc.contributor.author Pham, Ngoc Nam
dc.contributor.author Komínková Oplatková, Zuzana
dc.contributor.author Nguyen, Loan T. T.
dc.contributor.author Thanh Nguyen, Ngoc
dc.contributor.author Yun, Unil
dc.contributor.author Vo, Bay
dc.relation.ispartof Information Sciences
dc.identifier.issn 0020-0255 Scopus Sources, Sherpa/RoMEO, JCR
dc.identifier.issn 1872-6291 Scopus Sources, Sherpa/RoMEO, JCR
dc.date.issued 2024
utb.relation.volume 662
dc.type article
dc.language.iso en
dc.publisher Elsevier Inc.
dc.identifier.doi 10.1016/j.ins.2024.120257
dc.relation.uri https://www.sciencedirect.com/science/article/pii/S0020025524001701
dc.relation.uri https://www.sciencedirect.com/science/article/pii/S0020025524001701/pdfft?md5=b369d752f5eeed61709b7f71da0f768f&pid=1-s2.0-S0020025524001701-main.pdf
dc.subject clickstream pattern mining en
dc.subject pre-large concept en
dc.subject progressive search border en
dc.subject incremental pattern mining en
dc.description.abstract Recently, there has been a growing interest in sequential pattern mining in data mining, with a particular focus on clickstream pattern mining. These areas hold the potential for discovering valuable patterns. However, traditional mining algorithms in these domains often assume that databases are static, simplifying the mining process. In reality, databases are updated incrementally over time, partially rendering a portion of the previous results invalid. This necessitates rerunning algorithms on updated databases to obtain accurate frequent patterns. As database size increases, this approach can become time-consuming and affect performance. To tackle this issue, we propose PSB-CUP to mine frequent clickstream patterns in an incremental update manner. PSB-CUP employs the concept of search borders to reduce the search space and the information retained in memory. Furthermore, an IDList generation method called “partial imbalance join” was proposed to reconstruct possibly missing information during the incremental process. This join method, however, requires more extra information to be cached in exchange for speed. We then improve this technique by introducing “recursive imbalance join”, removing the need for extra cached data in the PSB-CUP + algorithm. The experimental results show that our proposed algorithms are efficient for incremental clickstream pattern mining. en
utb.faculty Faculty of Applied Informatics
dc.identifier.uri http://hdl.handle.net/10563/1011907
utb.identifier.scopus 2-s2.0-85185494013
utb.identifier.wok 001182274800001
utb.identifier.coden ISIJB
utb.source j-scopus
dc.date.accessioned 2024-03-05T08:38:52Z
dc.date.available 2024-03-05T08:38:52Z
dc.description.sponsorship Faculty of Applied Informatics, Tomas Bata University in Zlin; Internal Grant Agency of Tomas Bata University, (IGA/CebiaTech/2023/004)
dc.description.sponsorship Internal Grant Agency of Tomas Bata University [IGA/CebiaTech/2023/004]
utb.contributor.internalauthor Huynh, Minh Huy
utb.contributor.internalauthor Pham, Ngoc Nam
utb.contributor.internalauthor Komínková Oplatková, Zuzana
utb.fulltext.affiliation Huy M. Huynh a, Nam N. Pham a, Zuzana K. Oplatkova a, Loan T.T. Nguyen b,c, Ngoc Thanh Nguyen d,e, Unil Yun f, Bay Vo g a Faculty of Applied Informatics, Tomas Bata University in Zlín, Nam. T.G. Masaryka 5555, Zlín 76001, Czech Republic b School of Computer Science and Engineering, International University, Ho Chi Minh City 700000, Viet Nam c Vietnam National University, Ho Chi Minh City 700000, Vietnam d Department of Applied Informatics, Wroclaw University of Science and Technology, Wroclaw, Poland e Faculty of Information Technology, Nguyen Tat Thanh University, Viet Nam f Department of Computer Engineering, Sejong University, Seoul 05006, Republic of Korea g Faculty of Information Technology, HUTECH University, Ho Chi Minh City 700000, Viet Nam
utb.fulltext.dates Received 8 September 2023 Received in revised form 26 January 2024 Accepted 26 January 2024 Available online 28 January 2024
utb.fulltext.sponsorship This work was supported by the Internal Grant Agency of Tomas Bata University under Project No. IGA/CebiaTech/2023/004. The work was further supported by resources of A. I. Lab at the Faculty of Applied Informatics, Tomas Bata University in Zlin, Czechia (ailab.fai.utb.cz).
utb.wos.affiliation [Huynh, Huy M.; Pham, Nam N.; Oplatkova, Zuzana K.] Tomas Bata Univ Zlin, Fac Appl Informat, Nam TG Masaryka 5555, Zlin 76001, Czech Republic; [Nguyen, Loan T. T.] Int Univ, Sch Comp Sci & Engn, Ho Chi Minh City 700000, Vietnam; [Nguyen, Loan T. T.] Vietnam Natl Univ, Ho Chi Minh City 700000, Vietnam; [Nguyen, Ngoc Thanh] Wroclaw Univ Sci & Technol, Dept Appl Informat, Wroclaw, Poland; [Nguyen, Ngoc Thanh] Nguyen Tat Thanh Univ, Fac Informat Technol, Ho Chi Minh, Vietnam; [Yun, Unil] Sejong Univ, Dept Comp Engn, Seoul 05006, South Korea; [Vo, Bay] HUTECH Univ, Fac Informat Technol, Ho Chi Minh City 700000, Vietnam
utb.scopus.affiliation Faculty of Applied Informatics, Tomas Bata University in Zlín, Nám. T.G. Masaryka 5555, Zlín, 76001, Czech Republic; School of Computer Science and Engineering, International University, Ho Chi Minh City, 700000, Viet Nam; Vietnam National University, Ho Chi Minh City, 700000, Viet Nam; Department of Applied Informatics, Wroclaw University of Science and Technology, Wroclaw, Poland; Faculty of Information Technology, Nguyen Tat Thanh University, Viet Nam; Department of Computer Engineering, Sejong University, Seoul, 05006, South Korea; Faculty of Information Technology, HUTECH University, Ho Chi Minh City, 700000, Viet Nam
utb.fulltext.projects IGA/CebiaTech/2023/004
utb.fulltext.projects CZ.10.03.01/00/22-003/0000048
utb.fulltext.projects SP2023/050
utb.fulltext.faculty Faculty of Applied Informatics
utb.fulltext.ou -
Find Full text

Soubory tohoto záznamu

Zobrazit minimální záznam