Optimizing Open-Ended Crowdsourcing: The Next Frontier in Crowdsourced Data Management

Authors

  • Aditya G. Parameswaran
  • Akash Das Sarma
  • Vipul Venkataraman
Abstract

Crowdsourcing is the primary means to generate training data at scale, and when combined with sophisticated machine learning algorithms, crowdsourcing is an enabler for a variety of emergent automated applications impacting all spheres of our lives. This paper surveys the emerging field of formally reasoning about and optimizing open-ended crowdsourcing, a popular and crucially important, but severely understudied, class of crowdsourcing: the next frontier in crowdsourced data management. The underlying challenges include distilling the right answer when none of the workers agree with each other, teasing apart the various perspectives adopted by workers when answering tasks, and effectively selecting between the many open-ended operators appropriate for a problem. We describe the approaches that we've found to be effective for open-ended crowdsourcing, drawing from our experiences in this space.

Similar resources

White Paper Series Crowdsourcing Pedestrian and Cyclist Activity Data

expressed in this publication are those of the Author(s) and do not necessarily reflect the view of the Federal Highway Administration. Executive Summary Collecting cyclist and pedestrian activity data can be challenging due to data gaps and unique characteristics of active transportation that set it apart from other modes. Technological advances continue to improve data collection for demograp...

Toward Hands-Off Crowdsourcing: Crowdsourced Entity Matching for the Masses

Recent approaches to crowdsourcing entity matching (EM) are limited in that they crowdsource only parts of the EM workflow, requiring a developer to execute the remaining parts. Consequently, these approaches do not scale to the growing EM need at enterprises and crowdsourcing startups, and cannot handle scenarios where ordinary users (i.e., the masses) want to leverage crowdsourcing to match e...

An evaluation methodology for crowdsourced design

In recent years, the “power of the crowd” has been repeatedly demonstrated and various Internet platforms have been used to support applications of collaborative intelligence in tasks ranging from open innovation to image analysis. However, crowdsourcing applications in the fields of design research and creative innovation have been much slower to emerge. So, although there have been reports of...

DIYgenomics Crowdsourced Health Research Studies: Personal wellness and Preventive Medicine through Collective Intelligence

The current era of internet-facilitated bigger data, better tools, and collective intelligence community computing is accelerating advances in many areas ranging from artificial intelligence to knowledge generation to public health. In the health sector, data volumes are growing with genomic, phenotypic, microbiomic, metabolomic, self-tracking, and other data streams. Simultaneously, tools are ...

LingoTurk: managing crowdsourced tasks for psycholinguistics

LingoTurk is an open-source, freely available crowdsourcing client/server system aimed primarily at psycholinguistic experimentation where custom and specialized user interfaces are required but not supported by popular crowdsourcing task management platforms. LingoTurk enables user-friendly local hosting of experiments as well as condition management and participant exclusion. It is compatible...

Journal:
  • Bulletin of the Technical Committee on Data Engineering

Volume 39, Issue 4

Pages: -

Publication date: 2016