Class-Based Delta-Encoding: A Scalable Scheme for Caching Dynamic Web Content
Abstract
Caching static HTTP traffic in proxy caches has reduced bandwidth consumption and download latency. However, web-caching performance is hard to improve further because of the growing number of non-cacheable dynamic web documents. Delta-encoding is a promising technique that exploits temporal correlation among different snapshots of a dynamic document and renders dynamic traffic cacheable. It does so by combining a cacheable previous snapshot of a document, called the base-file, with a small difference file, called the delta, to generate the current snapshot of the document. However, it has not yet been deployed because of significant scalability concerns about the storage that base-files require on the server side. In this paper we introduce class-based delta-encoding, a scalable scheme for performing delta-encoding on dynamic web traffic. The idea is to group documents into classes and store one document per class on the server side. The proposed scheme thus exploits both temporal correlation within a dynamically evolving document and spatial correlation among different documents. Finally, we present an architecture for deploying the scheme that is transparent to clients, proxy caches, and web servers. Experimental results show that class-based delta-encoding combined with compression reduces bandwidth consumption by a factor of 30, and the latency perceived by most users by a factor of 10 on average, without incurring enormous storage requirements on the server side.
Keywords: web-caching, dynamic document, delta-encoding, delta, base-file, grouping, privacy.
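The reconstruction step the abstract describes (base-file plus delta yields the current snapshot) can be illustrated with a minimal Python sketch. It is not the paper's differencing algorithm: it assumes the standard-library difflib matcher as a stand-in, and the class name "news", the base-file text, and the personalized page are all hypothetical.

```python
from difflib import SequenceMatcher

def make_delta(base: str, current: str) -> list:
    """Encode `current` as copy/insert operations against `base`."""
    delta = []
    matcher = SequenceMatcher(None, base, current, autojunk=False)
    for tag, i1, i2, j1, j2 in matcher.get_opcodes():
        if tag == "equal":
            # The client already holds this range in its cached base-file.
            delta.append(("copy", i1, i2))
        elif tag in ("replace", "insert"):
            # New text that must travel over the network.
            delta.append(("insert", current[j1:j2]))
        # "delete": that part of the base-file is simply never copied.
    return delta

def apply_delta(base: str, delta: list) -> str:
    """Rebuild the current snapshot from the cached base-file and a delta."""
    parts = []
    for op in delta:
        if op[0] == "copy":
            parts.append(base[op[1]:op[2]])
        else:
            parts.append(op[1])
    return "".join(parts)

# Assumption: one base-file is stored per class rather than per document.
class_bases = {
    "news": "<html><h1>News</h1><p>Hello, guest. Top story: none.</p></html>",
}

# A hypothetical personalized page that belongs to the "news" class.
page = "<html><h1>News</h1><p>Hello, Alice. Top story: rain.</p></html>"

delta = make_delta(class_bases["news"], page)
rebuilt = apply_delta(class_bases["news"], delta)
inserted_chars = sum(len(op[1]) for op in delta if op[0] == "insert")
```

In this toy setting only the inserted text needs to be sent, since the copy operations refer to the base-file that a proxy cache or client can already hold. How documents are grouped into classes and how base-files are chosen is left open here.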
Related papers
Evaluation of ESI and Class-Based Delta Encoding
The portion of web traffic attributed to dynamic web content is substantial and continues to grow as users expect more personalization and tailored information. Unfortunately, dynamic content is costly to generate. Moreover, traditional web caching schemes are not very effective for dynamically-created pages. In this paper we study two acceleration techniques for dynamic content. The first tech...
Evaluation of Delivery Techniques for Dynamic Web Content
The portion of web traffic attributed to dynamic web content is substantial and continues to grow as users expect more personalization and tailored information. Unfortunately, dynamic content is costly to generate. Moreover, traditional web caching schemes are not very effective for dynamically-created pages. In this paper we study two new acceleration techniques for dynamic content. The first ...
HPP: HTML Macro-Preprocessing to Support Dynamic Document Caching
A number of techniques are available for reducing latency and bandwidth requirements for resources on the World Wide Web, including caching, compression, and delta-encoding [12]. These approaches are limited: much data on the Web is dynamic, for which traditional caching is of limited use, and delta-encoding requires both a common version base against which to apply a delta and the complete gen...
Class-Oriented Page Invalidation for Caching Dynamic Web Content
Caching dynamic pages at a server is beneficial in reducing server resource demands and it also helps dynamic page caching at proxy sites. Previous work has used fine-grain dependence graphs among individual dynamic pages and underlying data sets to enforce result consistency. Such an approach can be cumbersome or inefficient for a Web site to manage a cache in dealing with an arbitrarily large numb...
Scaling Java-based Dynamic Web Services: A Second-Year Project Proposal
A popular new class of Web-based services generates dynamic content and does not benefit from traditional static content caching. To improve the performance of these dynamic services, we must develop techniques to make them more scalable, reliable, and available. My approach to achieving better performance is to design a service cache that replicates the code and underlying data used to create t...
Publication date: 2002