The Third Man: hierarchy formation in Wikipedia
نویسندگان
چکیده
*Correspondence: [email protected] 1Department of Computer and Information Science, University of Konstanz, Universitätsstr. 10, 78464 Konstanz, Germany Full list of author information is available at the end of the article Abstract Wikipedia articles are written by teams of independent volunteers in the absence of formal hierarchical organizational structures. How is coordination achieved under such conditions of extreme decentralization? Building on studies on the organization of dominance relations in animal and human societies, we theorize that coordination in Wikipedia is made possible by an emergent hierarchical order sustained by self-organizing sequences of text editing events. We propose a new method to turn the editing history of Wikipedia pages into an evolving multiplex network resulting from three types of interaction events: dyadic undo, dyadic redo, and third-party based edit events. We develop new relational event models for signed networks that specify how the probability of observing various types of edit events depends on their embeddedness in sequences of past edit events. Using a random sample of page histories comprising 12,719 revisions produced by 7,657 unique users, we examine the relation between theoretically defined sequences of text editing events, and the emergence of linear dominance hierarchies that regulate production relations within Wikipedia. We find evidence that dyadic interaction gives rise to systematic extra-dyadic dependence structures that are partially consistent with a hierarchical interpretation of the Wikipedia editing network. We support and complement the statistical analysis of multiplex event networks with data visualizations that provide qualitative validation of our main results.
منابع مشابه
Dominance, Deference, and Hierarchy Formation in Wikipedia Edit-Networks
Does co-editing of Wikipedia articles reveal users dominating others? Do these dyadic dominance orderings (if any) lead to a global linear hierarchy among contributing users? In this article we claim that dominance (respectively deference) is revealed by users undoing (respectively redoing) edits of others. We propose methods to turn the history of Wikipedia pages into a dynamic multiplex netwo...
متن کاملTransforming Wikipedia into an Ontology-based Information Retrieval Search Engine for Local Experts using a Third-Party Taxonomy
Wikipedia is widely used for finding general information about a wide variety of topicss. Its vocation is not to provide local information. For example, it provides plot, cast, and production information about a given movie, but not showing times in your local movie theatre. Here we describe how we can connect local information to Wikipedia, without altering its content. The case study we prese...
متن کاملExploring Wikipedia and DMoz as Knowledge Bases for Engineering a User Interests Hierarchy for Social Network Applications
The outgrowth of social networks in the recent years has resulted in opportunities for interesting data mining problems, such as interest or friendship recommendations. A global ontology over the interests specified by the users of a social network is essential for accurate recommendations. We propose, evaluate and compare three approaches to engineering a hierarchical ontology over user intere...
متن کاملTowards Conceptual Indexing of the Blogosphere through Wikipedia Topic Hierarchy
This paper studies the issue of conceptually indexing the blogosphere through the whole hierarchy of Wikipedia entries. About 300,000 Wikipedia entries are used for representing a hierarchy of topics. Based on the results of judging whether each blog feed is relevant to a given Wikipedia entry, this paper proposes how to judge whether there exist blog feeds to be linked from the given entry. In...
متن کاملCleansing Wikipedia Categories using Centrality
We propose a novel general technique aimed at pruning and cleansing the Wikipedia category hierarchy, with a tunable level of aggregation. Our approach is endogenous, since it does not use any information coming from Wikipedia articles, but it is based solely on the user-generated (noisy) Wikipedia category folksonomy itself. We show how the proposed techniques can help reduce the level of nois...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Applied Network Science
دوره 2 شماره
صفحات -
تاریخ انتشار 2017