Assessing the Quality of Web Content
نویسندگان
چکیده
This paper describes our approach towards the ECML/PKDD Discovery Challenge 2010. The challenge consists of three tasks: (1) a Web genre and facet classification task for English hosts, (2) an English quality task, and (3) a multilingual quality task (German and French). In our approach, we create an ensemble of three classifiers to predict unseen Web hosts whereas each classifier is trained on a different feature set. Our final NDCG on the whole test set is 0.537 for Task 1, 0.844 for Task 2, and 0.823 (French) and 0.793 (German) for Task 3, which ranks fourth place in the ECML/PKDD Discovery Challenge 2010.
منابع مشابه
ارزیابی کیفیت صفحات وب پژوهشگاههای وابسته به وزارت علوم، تحقیقات و فنآوری مستقر در شهر تهران از دیدگاه کاربران
Especially in research centers, evaluating the quality of web pages from clients' point of view has a constructive role in their design and development, since it makes the web developers familiar with client's perspective and assists them in designing client-oriented web sites in scientific and research environment. As a model for assessing the quality of web pages, "webQual" attempts to provid...
متن کاملAnalyzing new features of infected web content in detection of malicious web pages
Recent improvements in web standards and technologies enable the attackers to hide and obfuscate infectious codes with new methods and thus escaping the security filters. In this paper, we study the application of machine learning techniques in detecting malicious web pages. In order to detect malicious web pages, we propose and analyze a novel set of features including HTML, JavaScript (jQuery...
متن کاملA Multi-Dimensional Model for Assessing the Quality of Answers in Social Q&A Sites
The quality of user-generated content in Web 2.0 dramatically varies from professional to abusive. Quality assessment is therefore a critical problem in producing, managing and retrieving information in Web 2.0. In this paper, we develop a multi-dimensional model for assessing the quality of answers in social Q&A (Question & Answer) sites.
متن کاملAssessing the Internal Structure of the Ellis Information Retrieval Model in Order to Present the Persian Norm of Web Retrieval Tools
Introduction: Study evaluated the internal structure of Ellis information seeking model in the student community with the aim of presenting the Persian norm. Methods: This is a descriptive-analytical study conducted by cross-sectional survey method in the second semester of the academic year 1399-1400. Population comprise of 280 graduate students at Ahvaz Jundishapur University of Medical Scien...
متن کاملInvestigating Healthcare Personnel’s Satisfaction with Quality of Web-based Learning in Teaching Preventive Behaviors of Hepatitis B Virus Infection
Introduction: Acceptance and implementation of preventive behaviors through new methods by healthcare personnel are of great importance. The aim of this study was to investigate healthcare personnel’s satisfaction with quality of web-based learning in teaching preventive behaviors of hepatitis B virus infection.Methods: This descriptive study was conducted on 120 healthcare employees in Tehran ...
متن کاملارزیابی کیفیت صفحات وبسایت دانشگاه علوم پزشکی گناباد براساس مدل E-Qual
Introduction: Quality assessment of web pages from users’ perspective has a fundamental role in its designing and development especially in research centers. In this study, the degree of desirability and quality of web pages of Gonabad University of Medical Sciences from the users’ perspective has been evaluated. Methods: This survey study conducted using the new edition of web Q...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- CoRR
دوره abs/1406.3188 شماره
صفحات -
تاریخ انتشار 2014