A Multivariate Fuzzy System Applied for Outliers Detection
CATENI, Silvia;COLLA, Valentina;NASTASI, Gianluca
2013-01-01
Abstract
This paper presents an application of fuzzy logic to the problem of outlier detection. The aim of the work is to identify anomalous data arising from different causes by combining several traditional outlier detection methods for multivariate datasets; the combination is achieved through a fuzzy inference system. The proposed solution is designed to be automatic and self-adaptive: some of the parameters required to combine the different approaches are estimated automatically from the available data, without a priori assumptions or information on a subset of the data. The method therefore belongs to the class of unsupervised outlier detection methods. To demonstrate its effectiveness, extensive tests were performed on both a simple case study and a database from a real industrial context, where the data must be filtered before being used for process control. The numerical results are presented and discussed.