Data stream analysis is growing in popularity in the last years since several application domains require to continuously and quickly analyse data produced by sensors with the aim of, for instance, reacting immediately when problems arise, or detecting new trends. The specificity of these domains imposes strict temporal constraints on machine learning algorithms to be used for mining useful insights. The Hoeffding Decision Tree (HDT) is a well-known classification algorithm for efficient streaming data classification. In this paper, with the aim of improving HDT accuracy and capability of handling noisy data, we exploit the learning procedure proposed in HDT for adapting a recently proposed fuzzy decision tree to cope with streaming data classification problems. We tested the fuzzy approach on a benchmark dataset for the on-line learning of data stream classification models. Results show that, during the on-line learning process, the fuzzy approach outperforms HDT in terms of accuracy.

Incremental Learning of Fuzzy Decision Trees for Streaming Data Classification

Riccardo Pecori
;
Pietro Ducange;
2019-01-01

Abstract

Data stream analysis is growing in popularity in the last years since several application domains require to continuously and quickly analyse data produced by sensors with the aim of, for instance, reacting immediately when problems arise, or detecting new trends. The specificity of these domains imposes strict temporal constraints on machine learning algorithms to be used for mining useful insights. The Hoeffding Decision Tree (HDT) is a well-known classification algorithm for efficient streaming data classification. In this paper, with the aim of improving HDT accuracy and capability of handling noisy data, we exploit the learning procedure proposed in HDT for adapting a recently proposed fuzzy decision tree to cope with streaming data classification problems. We tested the fuzzy approach on a benchmark dataset for the on-line learning of data stream classification models. Results show that, during the on-line learning process, the fuzzy approach outperforms HDT in terms of accuracy.
2019
978-94-6252-770-6
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11389/27979
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 17
  • ???jsp.display-item.citation.isi??? ND
social impact