Partitioning strategies for distributed association rule mining

FRANS COENEN; PAUL LENG

doi:10.1017/S0269888906000786

Partitioning strategies for distributed association rule mining

Published online by Cambridge University Press: 07 July 2006

FRANS COENEN and

PAUL LENG

Show author details

FRANS COENEN: Affiliation:
Department of Computer Science, University of Liverpool, Liverpool L69 3BX, UK. E-mail: [email protected], [email protected]
PAUL LENG: Affiliation:
Department of Computer Science, University of Liverpool, Liverpool L69 3BX, UK. E-mail: [email protected], [email protected]

Article contents

Abstract

Get access

Rights & Permissions

Abstract

In this paper a number of alternative strategies for distributed/parallel association rule mining are investigated. The methods examined make use of a data structure, the T-tree, introduced previously by the authors as a structure for organizing sets of attributes for which support is being counted. We consider six different approaches, representing different ways of parallelizing the basic Apriori-T algorithm that we use. The methods focus on different mechanisms for partitioning the data between processes, and for reducing the message-passing overhead. Both ‘horizontal’ (data distribution) and ‘vertical’ (candidate distribution) partitioning strategies are considered, including a vertical partitioning algorithm (DATA-VP) which we have developed to exploit the structure of the T-tree. We present experimental results examining the performance of the methods in implementations using JavaSpaces. We conclude that in a JavaSpaces environment, candidate distribution strategies offer better performance than those that distribute the original dataset, because of the lower messaging overhead, and the DATA-VP algorithm produced results that are especially encouraging.

Type: Research Article
Information: The Knowledge Engineering Review , Volume 21 , Issue 1 , March 2006 , pp. 25 - 47

DOI: https://doi.org/10.1017/S0269888906000786 [Opens in a new window]

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)

Article contents

Partitioning strategies for distributed association rule mining

Abstract

Access options

Article purchase

Temporarily unavailable

This article has been cited by the following publications. This list is generated based on data provided by Crossref.

Article contents

Partitioning strategies for distributed association rule mining

Abstract

Access options

Article purchase

Temporarily unavailable

Save article to Kindle

Save article to Dropbox

Save article to Google Drive

Reply to: Submit a response

Your details

You have entered the maximum number of contributors

Conflicting interests