Techniques for Improving Web Search by Understanding Queries
<p>This thesis investigates the refinement of web search results with a special focus on the use of clustering and the role of queries. It presents a collection of new methods for evaluating clustering methods, performing clustering effectively, and performing query refinement. The thesis identifies different types of query, the situations where refinement is necessary, and the factors affecting search difficulty. It then analyses hard searches and argues that many of them fail because users and search engines have different query models. The thesis identifies best practice for evaluating web search results and search refinement methods. It finds that none of the commonly used evaluation measures for clustering has all of the properties of a good evaluation measure. It then presents new quality and coverage measures that satisfy all the desired properties and that rank clusterings correctly in all web page clustering situations. The thesis argues that current web page clustering methods work well when different interpretations of the query have distinct vocabulary, but still have several limitations and often produce incomprehensible clusters. It then presents a new clustering method that uses the query to guide the construction of semantically meaningful clusters. The new clustering method significantly improves performance. Finally, the thesis explores how searches and queries are composed of different aspects and shows how to use aspects to reduce the distance between the query models of search engines and users. It then presents fully automatic methods that identify query aspects, identify underrepresented aspects, and predict query difficulty. Used in combination, these methods have many applications; the thesis describes methods for two of them. The first method improves the search results for hard queries with underrepresented aspects by automatically expanding the query using semantically orthogonal keywords related to the underrepresented aspects.
The second method helps users refine hard ambiguous queries by identifying the different query interpretations using a clustering of a diverse set of refinements. Both methods significantly outperform existing methods.</p>
Related Results
Graph-based Interactive Bibliographic Information Retrieval Systems
In the big data era, we have witnessed the explosion of scholarly literature. This explosion has imposed challenges to the retrieval of bibliographic information. Retrieval of inte...
Evaluating the Science to Inform the Physical Activity Guidelines for Americans Midcourse Report
Abstract
The Physical Activity Guidelines for Americans (Guidelines) advises older adults to be as active as possible. Yet, despite the well documented benefits of physical a...
Query Recommendation for Improving Search Engine Results
As web content grows, search engines become more important, yet user satisfaction decreases. Query recommendation is a new approach to improve sear...
ERROR ESTIMATION FOR A PIEZOELECTRIC CONTACT PROBLEM WITH WEAR AND LONG MEMORY
We study a mathematical model for a quasistatic behavior of electro-viscoelastic materials. The problem is related to highly nonlinear and non-smooth phenomena like contact, fricti...
WEB PROGRAMMING
"Web Programming" is a comprehensive book that provides a detailed overview of various aspects of web programming. The book is co-authored by Dr. Chitra Ravi and Dr. Mohan Kumar S,...
Eliciting Single-Peaked Preferences Using Comparison Queries
Voting is a general method for aggregating the preferences of multiple agents. Each agent ranks all the possible alternatives, and based on this, an aggregate ranking of the alter...
Using Metadata to Understand Search Behavior in Digital Libraries
This thesis explores how search log analysis can be used to gain a deeper understanding of online search behavior in curated collections by leveraging the metadata. For this, we us...
Search engines and their search strategies: the effective use by Indian academics
Purpose
– The purpose of this paper is to examine the use of various search engines and meta search engines by Indian academics for retrieving information on the we...

