web documents

نتایج جستجو برای: web documents

تعداد نتایج: 276761 فیلتر نتایج به سال:

Towards Scalable Web Documents

1998

Anne-Marie Kermarrec Ihor Kuz Maarten van Steen Andrew S. Tanenbaum

Article summary. The current Web is running into serious scalability problems. The standard solution is to apply techniques like caching, replication, and distribution. Unfortunately, as the variety of Web applications continues to grow, it will be impossible to find a single solution that fits all needs. The authors advocate a different approach to tackling scaling problems. Instead of seeking...

متن کامل

Application of Agent-Based Personal Web of Trust to Local Document Ranking

2007

Marek Kopel Przemyslaw Kazienko

Web is the boundless source of information and no one is able to process the vast amount of new documents published on the web every day, even with filtering out the documents the user is not interested in. However, most of the recent web documents are blog posts, news and other documents with the author information established. Each author who is also the receiver of web documents possesses th...

متن کامل

Adaptive Replicated Web Documents

2000

Guillaume Pierre Ihor Kuz Maarten van Steen

Caching and replication techniques can improve latency of the Web, while reducing network traffic and balancing load among servers. However, no single strategy is optimal for replicating all documents. Depending on its access pattern, each document should use the policy that suits it best. This paper presents an architecture for adaptive replicated documents. Each adaptive document monitors its...

متن کامل

Web Document Modelling and Clustering*

1997

William Song

Great number of the web documents, created by people of a great variety of walks and used by all the people being able to access to the Internet, gives rise to a problem of how to search the Internet to easily obtain what users want and to filter out what they don't. The problem is strongly related to how to describe or characterize the web documents. On the other hand, labels are being introdu...

متن کامل

Structured Information Retrieval for Web Documents

2007

Cheng-Hai Tan Ee-Peng Lim Wee-Keong Ng Boon-Wan Lim

To overcome the limitations of conventional Web search engines in retrieving Web documents relevant to users' queries, one has to exploit semantic structures embedded in Web documents. We propose a Web Information Retrieval (WebIR) model for Web documents containing semantic elements which are text segments enclosed by special tags. These special tags, known as semantic tags, can either be inde...

متن کامل

Retrieval of Web Documents Using a Fuzzy Hierarchical Clustering

2010

Deepti Gupta Nidhi Tyagi Komal Kumar Bhatia A. K. Sharma Els Lefever Timur Fayruzov Veronique Hoste Martine De Cock Sadaaki Miyamoto

The World Wide Web has huge amount of information that is retrieved using information retrieval tool like Search Engine. Page repository of Search Engine contains the web documents downloaded by the crawler. This repository contains variety of web documents from different domains. In this paper, a technique called “Retrieval of Web documents using a fuzzy hierarchical clustering” is being propo...

متن کامل

An Interactive Classification of Web Documents by Self-Organizing Maps and Search Engines

1999

Kenji Hatano Ryouichi Sano Yiwei Duan Katsumi Tanaka

In this paper, we propose an effective classification view mechanism for hypertext data such as web documents based on Kohonen’s Self-Organizing Map (SOM) and search engines. Web documents collected by search engines are automatically classified by SOM and the obtained SOMs are incrementally modified according to the interaction between users and SOMs. At present, various search engines are des...

متن کامل

Kokono Search: A Location Based Search Engine

2001

Seiji Yokoji Katsumi Takahashi Nobuyuki Miura

We have developed a location-based search system for web documents on the Internet. This system can find web documents based on the distance between locations that are described in web documents and a location specified by a user. It consists of three modules. (1) A robot that gathers documents from the Internet, (2) a parser that extracts address strings from web documents and associates latit...

متن کامل

Hierarchical Clustering of documents-A brief study and implementation in MATLAB

2015

Tulika Narang

The paper discusses and implements hierarchical clustering of documents. The objective is to group similar documents together using hierarchical clustering methods. The paper aims at organizing a set of documents into clusters. The paper is focused on Web Content mining by clustering web documents. Clustering is done on document corpus in MATLAB environment. The result is groups or clusters of ...

متن کامل

The Number of Scholarly Documents on the Public Web

Journal: :PLoS ONE 2014

متن کامل

نمودار تعداد نتایج جستجو در هر سال

با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید