Efficient Similarity Measures for Texts Matching
Brak miniatury
Data
2015
Autorzy
Tytuł czasopisma
ISSN czasopisma
Tytuł tomu
Wydawca
Wydawnictwo Politechniki Łódzkiej
Lodz University of Technology. Press
Lodz University of Technology. Press
Abstrakt
Calculation of similarity measures of exact matching texts is a
critical task in the area of pattern matching that needs a great attention.
There are many existing similarity measures in literature but the best methods
do not exist for closeness measurement of two strings. The objective of
this paper is to explore the grammatical properties and features of generalized
n-gram matching technique of similarity measures to find exact text in
electronic computer applications. Three new similarity measures have been
proposed to improve the performance of generalized n-gram method. The
new methods assigned high values of similarity measures and performance
to price with low values of running time. The experiment with the new methods
demonstrated that they are universal and very useful in words that could
be derived from the word list as a group and retrieve relevant medical terms
from database . One of the methods achieved best correlation of values for
the evaluation of subjective examination.
Opis
Słowa kluczowe
similarity measures, fuzzy relations, n-gram, word list, set theory, subjective examination
Cytowanie
Journal of Applied Computer Science, 2015 Vol.23 nr 1 s.7-28