Volume 39, Issue 2 e12881
ORIGINAL ARTICLE

A horizontal partitioning-based method for frequent pattern mining in transport timetable

Claudio Teixeira

Claudio Teixeira

CEFET/RJ - Federal Center for Technological Education of Rio de Janeiro, Rio de Janeiro, Brazil

Search for more papers by this author
Luana Fragoso

Luana Fragoso

PSI Technologies, Saskatchewan, Canada

Search for more papers by this author
Marta Mattoso

Marta Mattoso

COPPE/UFRJ, Federal University of Rio de Janeiro, Rio de Janeiro, Brazil

Search for more papers by this author
Diego Carvalho

Diego Carvalho

CEFET/RJ - Federal Center for Technological Education of Rio de Janeiro, Rio de Janeiro, Brazil

Search for more papers by this author
Eduardo Bezerra

Eduardo Bezerra

CEFET/RJ - Federal Center for Technological Education of Rio de Janeiro, Rio de Janeiro, Brazil

Search for more papers by this author
Jorge Soares

Jorge Soares

CEFET/RJ - Federal Center for Technological Education of Rio de Janeiro, Rio de Janeiro, Brazil

Search for more papers by this author
Glauco Amorim

Glauco Amorim

CEFET/RJ - Federal Center for Technological Education of Rio de Janeiro, Rio de Janeiro, Brazil

Search for more papers by this author
Eduardo Ogasawara

Corresponding Author

Eduardo Ogasawara

CEFET/RJ - Federal Center for Technological Education of Rio de Janeiro, Rio de Janeiro, Brazil

Correspondence

Eduardo Ogasawara, CEFET/RJ - Federal Center for Technological Education of Rio de Janeiro, Rio de Janeiro, Brazil.

Email: [email protected]

Search for more papers by this author
First published: 19 November 2021

Funding information: Conselho Nacional de Desenvolvimento Científico e Tecnológico; Coordenação de Aperfeiçoamento de Pessoal de Nível Superior; Fundação Carlos Chagas Filho de Amparo à Pesquisa do Estado do Rio de Janeiro

Abstract

Analysing transport timetables is an important task, as it brings the opportunity to discover which routes commonly lead to delays. Frequent pattern mining is a technique used to support such type of discovery. However, functional dependencies are intrinsic properties present in timetables, particularly related to attributes derived from the origin–destination matrix. Such functional dependencies compromise the search for patterns in timetables in both the number of association rules (ARs) generated and the computational cost. Several of these ARs refer to the same information. Redundancy removal techniques can reduce the number of ARs. However, these techniques are designed to be used after mining finishes, which increases the computational cost of finding useful ARs. This work presents timetable pattern mining (T-mine), a novel method for frequent pattern mining that improves knowledge discovery in timetables. We evaluated T-mine using Brazilian Flight Data and compared T-mine with the direct application of frequent pattern mining approaches with and without functional dependencies. Our experiments indicate that T-mine is about one order magnitude faster than other methods with functional dependencies.

DATA AVAILABILITY STATEMENT

The data that support the findings of this study are openly available in Brazilian Flights Dataset at https://doi.org/10.21227/k10b-qn21.

The full text of this article hosted at iucr.org is unavailable due to technical difficulties.