MetaSpider: Meta-Searching and Categorization on the Web
| dc.contributor.author | Chen, Hsinchun | |
| dc.contributor.author | Fan, Haiyan | |
| dc.contributor.author | Chau, Michael | |
| dc.contributor.author | Zeng, Daniel | |
| dc.date.accessioned | 2004-08-16T00:00:01Z | |
| dc.date.available | 2010-06-18T23:23:50Z | |
| dc.date.issued | 2001 | en_US |
| dc.date.submitted | 2004-08-16 | en_US |
| dc.identifier.citation | MetaSpider: Meta-Searching and Categorization on the Web 2001, 52(13):1134-1147 Journal of the American Society for Information Science & Technology | en_US |
| dc.identifier.uri | http://hdl.handle.net/10150/105331 | |
| dc.description | Artificial Intelligence Lab, Department of MIS, Univeristy of Arizona | en_US |
| dc.description.abstract | It has become increasingly difficult to locate relevant information on the Web, even with the help of Web search engines. Two approaches to addressing the low precision and poor presentation of search results of current search tools are studied: meta-search and document categorization. Meta-search engines improve precision by selecting and integrating search results fromgeneric or domain-specific Web search engines or other resources. Document categorization promises better organization and presentation of retrieved results. This article introduces MetaSpider, a meta-search engine that has real-time indexing and categorizing functions. We report in this paper the major components of MetaSpider and discuss related technical approaches. Initial results of a user evaluation study comparing Meta- Spider, NorthernLight, and MetaCrawler in terms of clustering performance and of time and effort expended show that MetaSpider performed best in precision rate, but disclose no statistically significant differences in recall rate and time requirements. Our experimental study also reveals that MetaSpider exhibited a higher level of automation than the other two systems and facilitated efficient searching by providing the user with an organized, comprehensive view of the retrieved documents. | |
| dc.format.mimetype | application/pdf | en_US |
| dc.language.iso | en | en_US |
| dc.publisher | Wiley Periodicals, Inc | en_US |
| dc.subject | Web Mining | en_US |
| dc.subject | Internet | en_US |
| dc.subject | Knowledge Management | en_US |
| dc.subject | World Wide Web | en_US |
| dc.subject.other | National Science Digital Library | en_US |
| dc.subject.other | NSDL | en_US |
| dc.subject.other | Artificial Intelligence lab | en_US |
| dc.subject.other | AI lab | en_US |
| dc.subject.other | MetaSpider | en_US |
| dc.title | MetaSpider: Meta-Searching and Categorization on the Web | en_US |
| dc.type | Journal Article (Paginated) | en_US |
| dc.identifier.journal | Journal of the American Society for Information Science & Technology | en_US |
| refterms.dateFOA | 2018-08-21T11:28:52Z | |
| html.description.abstract | It has become increasingly difficult to locate relevant information on the Web, even with the help of Web search engines. Two approaches to addressing the low precision and poor presentation of search results of current search tools are studied: meta-search and document categorization. Meta-search engines improve precision by selecting and integrating search results fromgeneric or domain-specific Web search engines or other resources. Document categorization promises better organization and presentation of retrieved results. This article introduces MetaSpider, a meta-search engine that has real-time indexing and categorizing functions. We report in this paper the major components of MetaSpider and discuss related technical approaches. Initial results of a user evaluation study comparing Meta- Spider, NorthernLight, and MetaCrawler in terms of clustering performance and of time and effort expended show that MetaSpider performed best in precision rate, but disclose no statistically significant differences in recall rate and time requirements. Our experimental study also reveals that MetaSpider exhibited a higher level of automation than the other two systems and facilitated efficient searching by providing the user with an organized, comprehensive view of the retrieved documents. |
