Crowdsourcing Based on Clustering

نویسنده

  • Sheetal Jadhav
چکیده

Crowdsourcing is an act of outsourcing tasks, traditionally performed by an employee or contractor, which are now performed by a large group of people. Recent survey deals with the problem of evaluating the submissions to crowdsourcing websites on which data is increasing rapidly in both volume and complexity. Thus, with an increasing number of submissions, the process of rate submissions, select winners and adjust monetary rewards is getting more complex, time consuming and hence more expensive. To overcome this problem text mining methodology can be used which consist of series of operations like Data extraction, pre-processing tf-idf calculation, calculating similarity and clustering and finally cluster submission. But results using these operations in existing methodology also show that this aspect does not do the entire trick of evaluating submissions. Hence propose methodology uses classification using Apriori algorithm which will find relations of clustered terms.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Semi-Crowdsourced Clustering: Generalizing Crowd Labeling by Robust Distance Metric Learning

One of the main challenges in data clustering is to define an appropriate similarity measure between two objects. Crowdclustering addresses this challenge by defining the pairwise similarity based on the manual annotations obtained through crowdsourcing. Despite its encouraging results, a key limitation of crowdclustering is that it can only cluster objects when their manual annotations are ava...

متن کامل

Identifying and Rewarding Subcrowds in Crowdsourcing

Identifying and rewarding truthful workers are key to the sustainability of crowdsourcing platforms. In this paper, we present a clustering based reward mechanism that rewards workers based on their truthfulness while accommodating differences in workers’ preferences. Experimental results show that the proposed approach can effectively discover subcrowds under various conditions; and truthful w...

متن کامل

Weighted Ensemble Clustering for Increasing the Accuracy of the Final Clustering

Clustering algorithms are highly dependent on different factors such as the number of clusters, the specific clustering algorithm, and the used distance measure. Inspired from ensemble classification, one approach to reduce the effect of these factors on the final clustering is ensemble clustering. Since weighting the base classifiers has been a successful idea in ensemble classification, in th...

متن کامل

UPMC at MediaEval 2013: Relevance by Text and Diversity by Visual Clustering

In the diversity task, our strategy was to, first, try to improve relevance, and then to cluster similar images to improve diversity. We propose a four step framework, based on AHC clustering and different reranking strategies. A large number of tests on devset showed that most of the best strategies include text based reranking for pertinence, and visual clustering for diversity even compared ...

متن کامل

An Iterative and Re-weighting Framework for Rejection and Uncertainty Resolution in Crowdsourcing

In practical applications of crowdsourcing, labelers may be uncertain or refuse to label a particular instance (or reject) due to the inherent difficulty, and each labeler may be given a different set of instances for big dataset applications. These various issues lead to missing and uncertain labels. Existing crowdsourcing methods have limited capabilities when these two problems exist. In thi...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2016