資訊管理學報

陳林志;林育任;
頁: 97-129
日期: 2013/01
摘要: 本論文發展了一套具有分群能力之個人化系統,Personalization Web-Snippet Clustering System(PWSC),此系統是基於元搜尋技術。此系統的第一階段根據使用者所輸入之查詢,針對不同搜尋引擎匯集相關網頁摘要文件。第二階段,透過Mean Reciprocal Rank(MRR)計算模型重新排列網頁摘要文件。第三階段,將收集到的網頁摘要文件,經由N字詞語言模型產生分群標籤。第四階段,依據分群標籤建構出階層式分群。最後階段為建立個人化系統,其能依據使用者所選擇的標籤及運算,產生不同的搜尋結果,這樣將能幫助使用者快速尋找想要的資訊。根據實驗結果,本系統的性能優於商業和學術系統。
關鍵字: 網頁摘要文件分群;個人化搜尋引擎;階層式分群;分群標籤;元搜尋技術;

A Personal Search System with the Clustering Ability


Abstract: In this paper, we develop a personal search system with the clustering ability, called Personalization Web-Snippet Clustering System (PWSC) that is based on a Metasearch technique. The first stage of the system is to collect the relevant snippets from different search engines based on the user's query. The second stage is to rearrange the weight of the collected snippets based on a Mean Reciprocal Rank (MRR) measure. The third stage is to use word N-gram for language model to generate the clustering labels from our collected snippets. The fourth stage is to build a hierarchical tree based on all clustering labels. The final stage is to build a personal search system by the user to select some of the most interesting labels and operations to help the user quickly locate information of interest. According to all experiment results, the performance of our system is superior to the commercial and academic systems.
Keywords: Web-Snippet Clustering;Personal Search Engine;Hierarchical Clustering;Clustering Label;Metasearch Technique;

瀏覽次數: 9813     下載次數: 3103

引用     導入Endnote

相關文章推薦

Top Downlaod Papers