Optimizing Open-Ended Crowdsourcing: The Next Frontier in Crowdsourced Data Management
نویسندگان
چکیده
Crowdsourcing is the primary means to generate training data at scale, and when combined with sophisticated machine learning algorithms, crowdsourcing is an enabler for a variety of emergent automated applications impacting all spheres of our lives. This paper surveys the emerging field of formally reasoning about and optimizing open-ended crowdsourcing, a popular and crucially important, but severely understudied class of crowdsourcing-the next frontier in crowdsourced data management. The underlying challenges include distilling the right answer when none of the workers agree with each other, teasing apart the various perspectives adopted by workers when answering tasks, and effectively selecting between the many open-ended operators appropriate for a problem. We describe the approaches that we've found to be effective for open-ended crowdsourcing, drawing from our experiences in this space.
منابع مشابه
White Paper Series Crowdsourcing Pedestrian and Cyclist Activity Data
expressed in this publication are those of the Author(s) and do not necessarily reflect the view of the Federal Highway Administration. Executive Summary Collecting cyclist and pedestrian activity data can be challenging due to data gaps and unique characteristics of active transportation that set it apart from other modes. Technological advances continue to improve data collection for demograp...
متن کاملToward Hands-Off Crowdsourcing: Crowdsourced Entity Matching for the Masses
Recent approaches to crowdsourcing entity matching (EM) are limited in that they crowdsource only parts of the EM workflow, requiring a developer to execute the remaining parts. Consequently, these approaches do not scale to the growing EM need at enterprises and crowdsourcing startups, and cannot handle scenarios where ordinary users (i.e., the masses) want to leverage crowdsourcing to match e...
متن کاملAn evaluation methodology for crowdsourced design
In recent years, the “power of the crowd” has been repeatedly demonstrated and various Internet platforms have been used to support applications of collaborative intelligence in tasks ranging from open innovation to image analysis. However, crowdsourcing applications in the fields of design research and creative innovation have been much slower to emerge. So, although there have been reports of...
متن کاملDIYgenomics Crowdsourced Health Research Studies: Personal wellness and Preventive Medicine through Collective Intelligence
The current era of internet-facilitated bigger data, better tools, and collective intelligence community computing is accelerating advances in many areas ranging from artificial intelligence to knowledge generation to public health. In the health sector, data volumes are growing with genomic, phenotypic, microbiomic, metabolomic, self-tracking, and other data streams. Simultaneously, tools are ...
متن کاملLingoTurk: managing crowdsourced tasks for psycholinguistics
LingoTurk is an open-source, freely available crowdsourcing client/server system aimed primarily at psycholinguistic experimentation where custom and specialized user interfaces are required but not supported by popular crowdsourcing task management platforms. LingoTurk enables user-friendly local hosting of experiments as well as condition management and participant exclusion. It is compatible...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Bulletin of the Technical Committee on Data Engineering
دوره 39 4 شماره
صفحات -
تاریخ انتشار 2016