Survey on Common Strategies of Vocabulary Reuse in Linked Open Data Modeling
Abstract
The choice of which vocabulary to reuse when modeling and publishing Linked Open Data (LOD) is far from trivial. To date, no study has investigated the different strategies of reusing vocabularies for LOD modeling and publishing. In this paper, we present the results of a survey with 79 participants that examines the most preferred vocabulary reuse strategies in LOD modeling. The participants, LOD publishers and practitioners, were asked to assess different vocabulary reuse strategies and to explain their ranking decisions. We found significant differences between the modeling strategies, which range from reusing popular vocabularies, to minimizing the number of vocabularies, to staying within one domain vocabulary. A particularly interesting insight is that popularity in the sense of how frequently a vocabulary is used in a data source is more important than how often individual classes and properties are used in the LOD cloud. Overall, the results of this survey help to better understand the strategies by which data engineers reuse vocabularies, and they may also be used to develop future vocabulary engineering tools.
Related resources
Extended Description of the Survey on Common Strategies of Vocabulary Reuse in Linked Open Data Modeling
Modeling and publishing Linked Open Data (LOD) involves the choice of which vocabulary to use. This choice is far from trivial and poses a challenge to a Linked Data engineer. It covers the search for appropriate vocabulary terms, making decisions regarding the number of vocabularies to consider in the design process, as well as the way of selecting and combining vocabularies. Until today, ther...
TermPicker: Recommending Vocabulary Terms for Reuse When Modeling Linked Open Data
Linked Open Data (LOD) refers to data published on the Web in such a way that it is machine-readable, its meaning is explicitly defined, and it is linked to other data sets. So-called Resource Description Framework (RDF) vocabularies are employed for LOD modeling. An RDF vocabulary is a collection of unique vocabulary terms comprising classes, which describe the type of a data entity, and properties...
A Quantitative Survey on the Use of the Cube
There is a striking increase in the availability of statistical data in the Linked Open Data (LOD) cloud, and the Cube vocabulary has become the de facto standard for the description of multi-dimensional data. However, the reuse of a standard vocabulary needs to be paired with modeling strategies that make it easy to locate, consume, and integrate information. In this paper, we developed a quantitati...
Datavore: A Vocabulary Recommender Tool Assisting Linked Data Modeling
In this paper, we introduce the vocabulary recommendation system Datavore (Data vocabulary recommender). The tool is aimed at metadata designers and provides ranked lists of vocabulary terms to reuse in the Web of Data modeling process, together with additional metadata and cross-term relations. Datavore relies on the Linked Open Vocabulary ecosystem for acquiring vocabularies and metadat...
TermPicker: Enabling the Reuse of Vocabulary Terms by Exploiting Data from the Linked Open Data Cloud - An Extended Technical Report
Deciding which vocabulary terms to use when modeling data as Linked Open Data (LOD) is far from trivial. Choosing overly general vocabulary terms, or terms from vocabularies that are not used by other LOD datasets, is likely to lead to a data representation that is harder for humans to understand and for Linked Data applications to consume. In this technical report, we propose TermPicker:...
Publication date: 2014