نتایج جستجو برای: web information extraction
تعداد نتایج: 1428884 فیلتر نتایج به سال:
In this work we approach relationships on the Linked Open Data Web as key facilitators of information exploration. Linked Open Data (LOD) principles contribute to a shift in paradigm for information representation and access, enhancing the ability of users and computers to connect, browse and query data on the Web through standard languages and protocols. We present a brief discussion on the cu...
This paper presents DeepEC (Deep Web Extraction and Cataloguing Process), a new method for content extraction of Deep Web databases and its subsequent cataloguing. Our focus is on the extraction of hidden Web content presented in HTML pages generated from Web forms query submissions. While state-of-the-art information extraction and cataloguing methods address this issue separately, DeepEC is a...
AUTOMATING THE EXTRACTION OF DOMAIN SPECIFIC INFORMATION FROM THE WEB—A CASE STUDY FOR THE GENEALOGICAL DOMAIN Troy Walker Department of Computer Science Master of Science Current ways of finding genealogical information within the millions of pages on the Web are inadequate. In an effort to help genealogical researchers find desired information more quickly, we have developed GeneTIQS, a Genea...
During the past few decades, social and behavioral sciences experienced a proliferation of vibrant research communities, leading to a rapid accumulation of theories, articles, and constructs (Lee et al. 2004; Straker 2008). Many efforts have been dedicated to exploration and integration of this large network of behavioral research data. For example, the inter-nomological network, a construct se...
In this paper we describe past and present work dealing with the use of textual resources, out of which semantic information can be extracted in order to provide for semantic annotation and indexing of associated image or video material. Since the emergence of semantic web technologies and resources, entities, relations and events extracted from textual resources by means of Information Extract...
چکیده ندارد.
Information extraction is initially applied for identification of desired information from natural language documents and conversion of the extracted text into a self-defined presentation. With the rapidly increasing amount of available information sources and electronic documents on the World Wide Web, information extraction is extended for identification from structured and semi-structured we...
This paper describes a Semantic Annotation Tool for extraction of knowledge structures from web pages through the use of simple user-defined knowledge extraction patterns. The semantic annotation tool contains: an ontology-based mark-up component which allows the user to browse and to mark-up relevant pieces of information; a learning component (Crystal from the University of Massachusetts at A...
In this chapter we discuss the possible application of new concepts in web content extraction: utility assessment, utility annealing, and dynamic aggregated document generation. After analysis of the state of the art in web content extraction, results of a survey study among Polish managers are presented. The discussion covers a web content extraction system with possible extensions that may he...
Semantic browser technologies such as Magpie require the construction of lexicons to support the identification of terms in Web pages which are linked to a user’s chosen ontology. We frame the generation of such lexicons from ontologies as a problem of finding synonyms and hyponyms. Synonym finding using the hypothesis of semantic substitutability relies upon the discovery of patterns in which ...
نمودار تعداد نتایج جستجو در هر سال
با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید