DEQUE: querying the deep web
Authors
Abstract
In this paper, we present DEQUE (Deep WEb QUery SystEm), a system for modeling and querying the deep Web. We propose a data model for representing and storing HTML forms, and a web form query language called DEQUEL for retrieving data from the deep Web and storing it in a format convenient for further processing. Our system can query forms (single and consecutive) with input values taken from relations as well as from result pages (the results of querying web forms). We present a novel approach to modeling consecutive forms and introduce the concept of the super form. A prototype has been implemented on a SUN workstation running Solaris 2.7, using Perl version 5.005_2 and the MySQL DBMS (version 3.23.49) for data storage. © 2004 Elsevier B.V. All rights reserved.
Similar resources
Querying Capability Modeling and Construction of Deep Web Sources
Information in a deep Web source can be accessed through queries submitted on its query interface. Many Web applications, such as deep Web crawling and comparison shopping, need to interact with the query interfaces of deep Web sources. Analyzing the querying capability of a query interface is critical in supporting such interactions automatically and effectively. In this paper, we propose a quer...
SEEDEEP: A System for Exploring and Querying Scientific Deep Web Data Sources
A recent and emerging trend in scientific data dissemination involves online databases that are hidden behind query forms, thus forming what is referred to as the deep web. In this paper, we propose SEEDEEP, a System for Exploring and quErying scientific DEEP web data sources. SEEDEEP is able to automatically mine deep web data source schemas, integrate heterogeneous data sources, answer cross-...
SemaForm: Semantic Wrapper Generation for Querying Deep Web Data Sources (Interim Report)
A wealth of data on the World Wide Web is hidden behind web form query interfaces and cannot be found through regular search engines. Querying across multiple such sources is a tedious and error-prone process; it involves manually filling in many related, but different, web forms. SemaForm automates this process by correlating web form labels to entries in a domain ontology through the use of a...
SemaForm: Semantic Wrapper Generation for Querying Deep Web Data Sources
A wealth of data on the World Wide Web is hidden behind web form query interfaces and cannot be found through regular search engines. Querying across multiple such sources is a tedious and error-prone process; it involves manually filling in many related, but different, web forms. SemaForm automates this process by correlating web form labels to entries in a domain ontology through the use of a...
Journal title:
- Data Knowl. Eng.
Volume 52, Issue
Pages -
Publication date: 2005