Structure based Data Extraction from Hidden Web Sources: A Review
Similar resources
Structure based Data Extraction from Hidden Web Sources: A Review
To extract data from the web pages of hidden web sources, many semi-automatic and automatic techniques have been proposed based on the structure and tags of HTML documents. These ...
On the Automatic Extraction of Data from the Hidden Web
An increasing amount of Web data is accessible only by filling out HTML forms to query an underlying data source. While this is most welcome from a user perspective (queries are easy and precise) and from a data management perspective (static pages need not be maintained; databases can be accessed directly), automated agents have greater difficulty accessing data behind forms. In this paper we ...
Unsupervised object extraction from data-intensive web sources
A long-term challenge for the Web extraction community is to devise technologies for automatically converting Web content from raw HTML (which has no explicit semantics and usually contains large quantities of spurious content), into some sort of structured machine-processable format (such as XML conforming to some given schema). We address this question in the context of interactive dataintens...
Journal
Journal title: International Journal of Computer Applications
Year: 2011
ISSN: 0975-8887
DOI: 10.5120/3010-4060