Search results for: web documents
Number of results: 276761
Article summary. The current Web is running into serious scalability problems. The standard solution is to apply techniques like caching, replication, and distribution. Unfortunately, as the variety of Web applications continues to grow, it will be impossible to find a single solution that fits all needs. The authors advocate a different approach to tackling scaling problems. Instead of seeking...
The Web is a boundless source of information, and no one can process the vast number of new documents published on it every day, even after filtering out the documents the user is not interested in. However, most recent web documents are blog posts, news items, and other documents with established author information. Each author who is also the receiver of web documents possesses th...
Caching and replication techniques can improve latency of the Web, while reducing network traffic and balancing load among servers. However, no single strategy is optimal for replicating all documents. Depending on its access pattern, each document should use the policy that suits it best. This paper presents an architecture for adaptive replicated documents. Each adaptive document monitors its...
The great number of web documents, created by people from all walks of life and used by everyone with access to the Internet, gives rise to the problem of how to search the Internet so that users easily obtain what they want and filter out what they don't. The problem is closely related to how web documents are described or characterized. On the other hand, labels are being introdu...
To overcome the limitations of conventional Web search engines in retrieving Web documents relevant to users' queries, one has to exploit semantic structures embedded in Web documents. We propose a Web Information Retrieval (WebIR) model for Web documents containing semantic elements which are text segments enclosed by special tags. These special tags, known as semantic tags, can either be inde...
The World Wide Web holds a huge amount of information, which is retrieved using information retrieval tools such as search engines. The page repository of a search engine contains the web documents downloaded by the crawler. This repository contains a variety of web documents from different domains. In this paper, a technique called “Retrieval of Web documents using a fuzzy hierarchical clustering” is being propo...
In this paper, we propose an effective classification view mechanism for hypertext data such as web documents based on Kohonen’s Self-Organizing Map (SOM) and search engines. Web documents collected by search engines are automatically classified by SOM and the obtained SOMs are incrementally modified according to the interaction between users and SOMs. At present, various search engines are des...
We have developed a location-based search system for web documents on the Internet. This system finds web documents based on the distance between the locations described in the documents and a location specified by the user. It consists of three modules: (1) a robot that gathers documents from the Internet, (2) a parser that extracts address strings from web documents and associates latit...
The paper discusses and implements hierarchical clustering of documents. The objective is to group similar documents together using hierarchical clustering methods. The paper aims at organizing a set of documents into clusters. The paper focuses on web content mining by clustering web documents. Clustering is performed on a document corpus in the MATLAB environment. The result is groups or clusters of ...