نتایج جستجو برای: web wrapper generation

تعداد نتایج: 567401  

2006
Lynn Wu Aykut Firat Tarik Alatovic Stuart E. Madnick

The Web is undoubtedly the largest and most diverse repository of data, but it was not designed to offer the capabilities of traditional data base management systems – which is unfortunate. In a true data federation, all types of data sources, such as relational databases and semi-structured Web sites, could be used together. IBM WebSphere uses the “request-reply-compensate” protocol to communi...

2000
Boris Chidlovskii Jon Ragetli Maarten de Rijke

Journal: :Data Knowl. Eng. 2001
Arnaud Sahuguet Fabien Azavant

The Web so far has been incredibly successful at delivering information to human users. So successful actually, that there is now an urgent need to go beyond a browsing human. Unfortunately, the Web is not yet a well organized repository of nicely structured documents but rather a conglomerate of volatile HTML pages. To address this problem, we present the World Wide Web Wrapper Factory (W4F), ...

Journal: :Future Generation Comp. Syst. 2004
Maozhen Li P. van Santen David W. Walker Omer F. Rana Mark A. Baker

This paper presents SGrid, a service-oriented model for the Semantic Grid. Each Grid service in SGrid is a Web service with certain domain knowledge. A Web services oriented wrapper generator has been implemented to automatically wrap legacy codes as Grid services exposed as Web services. Each wrapped Grid service is supplemented with domain ontology and registered with a Semantic Grid Service ...

2000
Aykut Firat Denis Peleshchuk Prakash Rao

In this paper, we describe an automatic Web wrapper generator that creates specification files, which contain the schema information and extraction rules for a class of Web pages. These specification files can then used by a wrapper engine (e.g. MIT COIN Grenouille) to extract information from the semi-structured Web sites. We create specification files through a WYSIWYG GUI with minimal user i...

2003
Chun-Nan Hsu Chia-Hui Chang Harianto Siek Jiann-Jyh Lu Jen-Jie Chiou

In this paper, we presented a tool to exploit online Web data sources using reconfigurable Web wrapper agents. We described how these agents can be rapidly generated and executed based on the script language WNDL and extraction rule generator IEPAD. WNDL is an XML-based language that provides a representation of a Web browsing session. A WNDL script describes how to locate the data, extract the...

2001
Robert Baumgartner Sergio Flesca Georg Gottlob

We illustrate basic features of the Lixto wrapper generator such as the user and system interaction, the capacious visual interface, the marking and selecting procedures, and the extraction tasks by describing the construction of a simple example program in the current Lixto prototype.

2005
Youngju Son Hasan Jamil Farshad Fotouhi

Biological data sources are useful to bioinformatics researches. Several computational tools have been developed so that these data sources can be used as easily as possible. Most of biological data has been provided over the web. Web data is almost represented in unstructured format and cannot be queried using traditional querying language. Furthermore, the problems, which integration of biolo...

2002
Yang Li Zhan Cui Hongji Yang Hewijin Christine Jiau

We introduce an innovative approach to wrapping semi-structured web pages in order to generate structured data. Unlike other work in this area based on physically specifying the location of information, our approach is based on human design psychology that captures more stable features across web pages, which we believe renders a more robust result in coping with changes in the web pages. In th...

نمودار تعداد نتایج جستجو در هر سال

با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید