A fuzzy ranking approach for improving search results in Turkish as an agglutinative language
► The ranking of search results with stemming and without stemming is examined for Turkish. ► Stemming techniques have negative effects in some search queries. ► A fuzzy ranking approach is proposed for eliminating these negative effects. ► This approach takes user feedback into account in ranking p...
Saved in:
Published in: | Expert systems with applications Vol. 39; no. 5; pp. 5658 - 5664 |
---|---|
Main Author: | |
Format: | Journal Article |
Language: | English |
Published: |
Elsevier Ltd
01-04-2012
|
Subjects: | |
Online Access: | Get full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Summary: | ► The ranking of search results with stemming and without stemming is examined for Turkish. ► Stemming techniques have negative effects in some search queries. ► A fuzzy ranking approach is proposed for eliminating these negative effects. ► This approach takes user feedback into account in ranking process.
This study proposes a fuzzy ranking approach, designed for Turkish as an agglutinative language, that focuses on improving stemming techniques via using distances of characters in its search algorithm. Various studies focused on search engines are based on using stemming techniques in indexing process because of the higher percentage of relevancy that these techniques provide. However, stemming techniques may have negative effects on search results in some queries. While analyzing the search results to find the query terms those give irrelevant results and why, we observe that user’s query suffixes are crucial in search performance. Therefore, the proposed fuzzy ranking approach supports traditional stemming approaches with the use of suffixes. The search results of this approach are significantly better than stemming techniques in where stemming technique is ineffective. In terms of overall results, the fuzzy ranking approach also gives satisfactory results when compared with stemming techniques such as a Turkish stemmer (19.43% of improvement) and word truncation technique (12.61% of improvement). Moreover, it is statistically better than no stemming with 28.61% of improvement. |
---|---|
ISSN: | 0957-4174 1873-6793 |
DOI: | 10.1016/j.eswa.2011.11.105 |