Clustering may mean to have two or more computer systems working together or multiple servers linked together for the purpose of handling variable workloads as well as to provide continued operation in case one fails. It may also refer to data clustering which is a technique used for data analysis by dividing a data set into subsets whose elements share common traits. Search result clustering aims to change the way people search online by organizing search result into folders that group similar items together.
Why Clustering is Needed
The use of the vast information available online cannot be maximized unless an effective means of organizing it can be provided. Clustering engines put search results together based on textual and linguistic similarity. This basic similarity is supported by heuristics which are coded by programmers using as basis the users preference on what they want to see on clustered documents. Clusters are presented using the style of folders and sub-folders.
When a search engine provides millions of results for a particular query, the searcher can either sift through the endless pages of results or depend on the search engines judgment...