In search engines, different users may search for different information by issuing the same query. To satisfy more users with limited search results, search result diversification re-ranks the results to cover as many...In search engines, different users may search for different information by issuing the same query. To satisfy more users with limited search results, search result diversification re-ranks the results to cover as many user intents as possible. Most existing intent-aware diversification algorithms recognize user intents as subtopics, each of which is usually a word, a phrase, or a piece of description. In this paper, we leverage query facets to understand user intents in diversification, where each facet contains a group of words or phrases that explain an underlying intent of a query. We generate subtopics based on query facets and propose faceted diversification approaches. Experimental results on the public TREC 2009 dataset show that our faceted approaches outperform state-of-the-art diversification models.展开更多
The result diversification problem is to select an optimal subset with high“quality”and“diversity”from a given ground set of items,which is popular in various applications such as web-based search,multi-document s...The result diversification problem is to select an optimal subset with high“quality”and“diversity”from a given ground set of items,which is popular in various applications such as web-based search,multi-document summarization and ensemble pruning.The diversity relies on the distance between items.Previous works mainly focused on metric distances,and applied a greedy or local search algorithm with theoretical guarantees.As a kind of global search algorithm inspired by Darwin’s theory of evolution,evolutionary algorithms(EAs)can have a better optimization ability than greedy and local search,but often lack theoretical support.Recently,EAs have been introduced to result diversification,achieving good theoretical guarantees besides superior empirical performances.In this paper,we study whether EAs can still achieve good theoretical guarantees for result diversification with negative type distances,which are also a class of important dissimilarity measures,especially in information retrieval and sketching techniques.We propose to reformulate the result diversification problem with negative type distances as a bi-objective maximization problem,and solve it by multi-objective evolutionary algorithms(MOEAs).We prove that a simple MOEA(i.e.,GSEMO)can achieve the best-known polynomial-time approximation ratio.Experiments are also performed to examine the performance of different MOEAs on the application of web-based search.展开更多
文摘In search engines, different users may search for different information by issuing the same query. To satisfy more users with limited search results, search result diversification re-ranks the results to cover as many user intents as possible. Most existing intent-aware diversification algorithms recognize user intents as subtopics, each of which is usually a word, a phrase, or a piece of description. In this paper, we leverage query facets to understand user intents in diversification, where each facet contains a group of words or phrases that explain an underlying intent of a query. We generate subtopics based on query facets and propose faceted diversification approaches. Experimental results on the public TREC 2009 dataset show that our faceted approaches outperform state-of-the-art diversification models.
基金supported by the National Natural Science Foundation of China(62022039,62276124).
文摘The result diversification problem is to select an optimal subset with high“quality”and“diversity”from a given ground set of items,which is popular in various applications such as web-based search,multi-document summarization and ensemble pruning.The diversity relies on the distance between items.Previous works mainly focused on metric distances,and applied a greedy or local search algorithm with theoretical guarantees.As a kind of global search algorithm inspired by Darwin’s theory of evolution,evolutionary algorithms(EAs)can have a better optimization ability than greedy and local search,but often lack theoretical support.Recently,EAs have been introduced to result diversification,achieving good theoretical guarantees besides superior empirical performances.In this paper,we study whether EAs can still achieve good theoretical guarantees for result diversification with negative type distances,which are also a class of important dissimilarity measures,especially in information retrieval and sketching techniques.We propose to reformulate the result diversification problem with negative type distances as a bi-objective maximization problem,and solve it by multi-objective evolutionary algorithms(MOEAs).We prove that a simple MOEA(i.e.,GSEMO)can achieve the best-known polynomial-time approximation ratio.Experiments are also performed to examine the performance of different MOEAs on the application of web-based search.