Search results for: search engine result page enhancement

Number of results: 1151683

2008
Carrie Grimes, Daniel Ford

Search engines strive to maintain a “current” repository of all pages on the web to index for user queries. However, crawling all pages all the time is costly and inefficient: many small websites don’t support that much load and while some pages change very rapidly others don’t change at all. Therefore, estimated frequency of change is often used to decide how often to crawl a page. Here we con...
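As a rough illustration of how an estimated change frequency could drive crawl scheduling, here is a minimal sketch. It assumes page changes follow a Poisson process and that the page is visited at regular intervals; the function names, the 0.5 smoothing constant, and the interval bounds are illustrative assumptions, not the estimator from the paper (whose abstract is truncated above).

```python
import math


def naive_change_rate(changes_detected: int, visits: int, interval_days: float) -> float:
    """Detected changes per day. Undercounts, because several changes
    between two visits are only ever seen as one."""
    return changes_detected / (visits * interval_days)


def poisson_change_rate(changes_detected: int, visits: int,
                        interval_days: float, smoothing: float = 0.5) -> float:
    """Change rate under a Poisson assumption.

    With rate r, the chance of seeing no change over one interval is
    exp(-r * interval), so r = -ln(unchanged fraction) / interval.
    The smoothing term (an assumption here) avoids log(0) when every
    visit detected a change.
    """
    unchanged_fraction = (visits - changes_detected + smoothing) / (visits + smoothing)
    return -math.log(unchanged_fraction) / interval_days


def recrawl_interval_days(rate: float, min_days: float = 0.25,
                          max_days: float = 30.0) -> float:
    """Revisit roughly once per expected change, clamped to hypothetical bounds."""
    if rate <= 0:
        return max_days
    return min(max_days, max(min_days, 1.0 / rate))


if __name__ == "__main__":
    rate = poisson_change_rate(changes_detected=5, visits=10, interval_days=1.0)
    print(rate, recrawl_interval_days(rate))
```

The Poisson-based estimate is always at least as large as the naive one, which matters most for pages that change faster than they are crawled.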

2010
Djoerd Hiemstra, Claudia Hauff

This draft report presents preliminary results for the TREC 2010 adhoc web search task. We ran our MIREX system on 0.5 billion web documents from the ClueWeb09 crawl. On average, the system retrieves at least 3 relevant documents on the first result page containing 10 results, using a simple index consisting of anchor texts, page titles, and spam removal.
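The figure of at least 3 relevant documents among the first 10 results corresponds to a mean precision at 10 of at least 0.3. A minimal sketch of that measure follows; the judgment lists are made-up examples, not TREC data.

```python
def precision_at_k(relevance, k=10):
    """Fraction of the top-k results judged relevant.

    `relevance` is a ranked list of 0/1 (or False/True) judgments;
    positions missing below rank k count as non-relevant.
    """
    return sum(1 for judged in relevance[:k] if judged) / k


def mean_precision_at_k(runs, k=10):
    """Average precision at k over several queries."""
    return sum(precision_at_k(r, k) for r in runs) / len(runs)


if __name__ == "__main__":
    # Hypothetical judgments for two queries, in rank order.
    query_a = [1, 0, 0, 1, 0, 0, 0, 1, 0, 0]
    query_b = [0, 1, 1, 0, 1, 0, 0, 0, 0, 1]
    print(mean_precision_at_k([query_a, query_b]))
```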

2010
Dilip Kumar Sharma, A. K. Sharma

The Web is expanding day by day, and people generally rely on search engines to explore it. In such a scenario it is the duty of the service provider to provide proper, relevant and quality information to the internet user in response to the query submitted to the search engine. It is a challenge for the service provider to provide proper, relevant and quality information to the internet user by using the we...

2013
Thomas Demeester, Dong Nguyen, Dolf Trieschnigg, Chris Develder, Djoerd Hiemstra

We summarize findings from [1]. What is the likelihood that a Web page is considered relevant to a query, given the relevance assessment of the corresponding snippet? Using a new Federated Web Search test collection that contains search results from over a hundred search engines on the internet, we are able to investigate such research questions from a global perspective. Our test collection co...
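The question posed here is a conditional probability, which can be estimated from paired judgments by simple counting. The sketch below assumes each judged result is reduced to a (snippet label, page relevant) pair; the label names and sample data are hypothetical and are not drawn from the test collection.

```python
from collections import Counter


def page_relevance_given_snippet(pairs):
    """Estimate P(page relevant | snippet label) from judged pairs.

    `pairs` is an iterable of (snippet_label, page_is_relevant) tuples.
    Returns a dict mapping each snippet label to the fraction of its
    pages that were judged relevant.
    """
    pairs = list(pairs)
    joint = Counter(pairs)                        # counts of (label, bool)
    totals = Counter(label for label, _ in pairs)  # counts per label
    return {label: joint[(label, True)] / totals[label] for label in totals}


if __name__ == "__main__":
    judged = [
        ("relevant", True), ("relevant", True), ("relevant", False),
        ("not_relevant", False), ("not_relevant", True),
    ]
    print(page_relevance_given_snippet(judged))
```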

Journal: CoRR 2016
Tri Nguyen, Mir Rosenberg, Xia Song, Jianfeng Gao, Saurabh Tiwary, Rangan Majumder, Li Deng

This paper presents our recent work on the design and development of a new, large scale dataset, which we name MS MARCO, for MAchine Reading COmprehension. This new dataset is aimed to overcome a number of well-known weaknesses of previous publicly available datasets for the same task of reading comprehension and question answering. In MS MARCO, all questions are sampled from real anonymized us...

2009
Fabio Clarizia, Francesco Colace, Massimo De Santo, Paolo Napoletano

In this paper we address the problem of modeling large collections of data, namely web pages by exploiting jointly traditional information retrieval techniques with probabilistic ones in order to find semantic descriptions for the collections. This novel technique is embedded in a real Web Search Engine in order to provide semantics functionalities, as prediction of words related to a single te...

Journal: JCP 2012
Xinyue Liu, Hongfei Lin, Cong Zhang

The HITS algorithm is a very popular and effective algorithm to rank web documents based on the link information among a set of web pages. However, it assigns every link with the same weight. This assumption results in topic drift. In this paper, we firstly define the generalized similarity between a query and a page, and the popularity of a web page. Then we propose a weighted HITS algorithm w...
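For orientation, here is a minimal sketch of the standard HITS iteration with an optional per-link weight. Setting every weight to 1.0 gives the uniform behaviour the abstract criticises. The weights in the example are placeholders; the paper's own weighting, built from query similarity and page popularity, is not reproduced here.

```python
import math


def _normalize(scores):
    """Scale scores to unit Euclidean length (no-op if all are zero)."""
    norm = math.sqrt(sum(v * v for v in scores.values()))
    return {k: (v / norm if norm else 0.0) for k, v in scores.items()}


def hits(edges, iterations=50):
    """Run HITS on a directed graph.

    `edges` maps (source, target) -> link weight. Returns (hub, authority)
    score dicts. A page's authority is the weighted sum of the hub scores
    of pages linking to it; its hub score is the weighted sum of the
    authority scores of the pages it links to.
    """
    nodes = {node for edge in edges for node in edge}
    hub = {n: 1.0 for n in nodes}
    auth = {n: 1.0 for n in nodes}
    for _ in range(iterations):
        new_auth = {n: 0.0 for n in nodes}
        for (src, dst), weight in edges.items():
            new_auth[dst] += weight * hub[src]
        new_hub = {n: 0.0 for n in nodes}
        for (src, dst), weight in edges.items():
            new_hub[src] += weight * new_auth[dst]
        auth, hub = _normalize(new_auth), _normalize(new_hub)
    return hub, auth


if __name__ == "__main__":
    uniform = {("a", "c"): 1.0, ("b", "c"): 1.0, ("a", "d"): 1.0}
    hub, auth = hits(uniform)
    print(sorted(auth.items(), key=lambda kv: -kv[1]))
```

Raising the weight on a single link, for example `("a", "d"): 3.0`, is enough to change which page ends up with the highest authority score, which is the lever a weighted variant uses to counter topic drift.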

2012
Thomas Demeester, Dong Nguyen, Dolf Trieschnigg, Chris Develder, Djoerd Hiemstra

What is the likelihood that a Web page is considered relevant to a query, given the relevance assessment of the corresponding snippet? Using a new federated IR test collection that contains search results from over a hundred search engines on the internet, we are able to investigate such research questions from a global perspective. Our test collection covers the main Web search engines like Go...

Journal: Online Information Review 2006
Jin Zhang, Iris Jastram

Purpose – This paper aims to investigate the internet web page metadata usage behavior in terms of their metadata element co-occurrences. Metadata are designed to facilitate both web publishers/authors to organize their web pages and search engines to index the web pages accurately. Design/methodology/approach – This study examines the types of metadata elements employed by different profession...

2010
Karl Gyllstrom, Elin Rønby Pedersen

We describe LostRank, a project in its formative stage which aims to produce a way to rank results in re-finding search engines according to the likelihood of their being lost to the user. To this end, we have explored a number of ideas, including applying users’ temporal document access patterns to determine the documents that are both important and have not been recently accessed (indicating ...
