Pakistan Science Abstracts
Article details & metrics
No Detail Found!!
The Effect of Different Similarity Distance Measures in Detecting Outliers Using Single-Linkage Clustering Algorithm for Univariate Circular Biological Data
Author(s):
1. Nur Syahirah Zulkipli: Centre for Mathematical Sciences,Universiti Malaysia Pahang,Malaysia
2. Siti Zanariah Satari: Centre for Mathematical Sciences,Universiti Malaysia Pahang,Malaysia
3. Wan Nur Syahidah Wan Yusoff: Centre for Mathematical Sciences,Universiti Malaysia Pahang,Malaysia
Abstract:
Clustering algorithms can be used to create an outlier detection procedure in univariate circular data. The circular distance between each point of angular observation in circular data is used to calculate the similarity measure to appropriately group observations. In this paper, we present a clustering-based procedure for detecting outliers in univariate circular biological data using various similarity distance measures. Three circular similarity distance measures; Satari distance, Di distance and Chang-chien distance were used to detect outliers using a single-linkage clustering algorithm. Satari distance and Di distance are two similarity measures that have similar formulas for univariate circular data. This study aims to develop and demonstrate the effectiveness of the proposed clusteringbased procedure with various similarity distance measures in detecting outliers. The circular similarity distance of SL-Satari/Di and other similarity measures, including SL-Chang, were compared at various dendrogram cutting points. It is found that a clustering-based procedure using a single-linkage algorithm with various similarity distances is a practical and promising approach to detect outliers in univariate circular data, particularly for biological data. According to the results, the SL-Satari/Di distance outperformed the SL-Chang distance for certain data conditions.
Page(s): 561-573
Published: Journal: Pakistan Journal of Statistics and Operation Research, Volume: 18, Issue: 3, Year: 2022
Keywords:
circular distance , Similarity measure , Clustering algorithm , Outliers , Circular data
References:
References are not available for this document.
Citations
Citations are not available for this document.
0

Citations

0

Downloads

5

Views