
Analysis of a Similarity Measure for Non-Overlapped Data

A similarity measure evaluates the degree of similarity between two fuzzy data sets and has become an essential tool in many applications, including data mining, pattern recognition, and clustering. In this paper, we propose a similarity measure capable of handling non-overlapped data as well as overlapped data, and we analyze its characteristics on data distributions. We first design the similarity measure based on a distance measure and apply it to overlapped data distributions. From calculations on example data distributions, we find that, although the similarity calculation is effective for overlapped data, the designed similarity measure cannot distinguish between two non-overlapped data distributions: it returns the same value for both data sets. To obtain discriminative similarity values for non-overlapped data, we consider two approaches. The first is to apply a conventional similarity measure after preprocessing the non-overlapped data. The second is to take neighbor data information into account when designing the similarity measure, where we consider the relation to specific data and residual data information. Two artificial patterns of non-overlapped data are analyzed in an illustrative example. The calculation results demonstrate that the proposed similarity measures can discriminate non-overlapped data.
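The limitation described in the abstract can be illustrated with a minimal sketch. It assumes one common distance-based form, similarity equal to one minus the normalized Hamming distance between membership vectors; this is an illustrative choice, not necessarily the measure designed in the paper. The function name `hamming_similarity` and the membership values are hypothetical.

```python
def hamming_similarity(a, b):
    """Similarity as 1 minus the normalized Hamming distance (an assumed form)."""
    if len(a) != len(b):
        raise ValueError("membership vectors must share one universe of discourse")
    distance = sum(abs(x - y) for x, y in zip(a, b)) / len(a)
    return 1.0 - distance


# Hypothetical membership degrees over a universe of ten points.
reference = [0.8, 1.0, 0.6, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0]
overlapping = [0.0, 0.5, 0.9, 0.4, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0]
# Two sets that share no support with the reference: one adjacent, one far away.
near = [0.0, 0.0, 0.0, 0.8, 1.0, 0.6, 0.0, 0.0, 0.0, 0.0]
far = [0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.8, 1.0, 0.6]

print("overlapping:", hamming_similarity(reference, overlapping))
print("near:       ", hamming_similarity(reference, near))
print("far:        ", hamming_similarity(reference, far))
```

Under this assumed measure, the pointwise differences for `near` and `far` are the same set of values, so both receive the same similarity to `reference` even though one lies much closer. This is the kind of indistinguishability that motivates the preprocessing and neighbor-information approaches mentioned above.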

Related Results

Similarity Search with Data Missing
Similarity search is a fundamental research problem with broad applications in various research fields, including data mining, information retrieval, and machine learning. The core...
Using covariance weighted euclidean distance to assess the dissimilarity between integral experiments
Integral experiments, especially criticality experiments, help greatly in designing either a new nuclear reactor or a criticality assembly. The calculation uncertainty of the integral para...
SNOMED CT Primitive Concept Similarity Measure by Concept Name Text Similarity Approach
In the last few years, Concept Similarity Measures (CSMs) have become important for biomedical ontologies in order to find adaptable treatments from the conceptually similar disease...
Bend-Net: Bending Loss Regularized Multitask Learning Network for Nuclei Segmentation in Histopathology Images
Separating overlapped nuclei is a significant challenge in histopathology image analysis. Recently published approaches have achieved promising overall performance on nuclei segmen...
Improved Cosine Similarity Measures for q-Rung Orthopair Fuzzy Sets
In this paper, we introduce some novel cosine similarity measures for \(q\)-rung orthopair fuzzy sets (\(q\)-ROFSs), which capture both direction and magnitude aspects of fuzzy set...
New Graph Based Trust Similarity Measure
A trust network in a social network can be considered a graph in which trustors and trustees are the vertices and the edges represent the trust between them with measured values. To evaluate tr...
Adapting Document Similarity Measures for Ligand-Based Virtual Screening
Quantifying the similarity of molecules is considered one of the major tasks in virtual screening. There are many similarity measures that have been proposed for this purpose, some...
