TREC 2004 Web Track Experiments at CAS-ICT

Authors

  • Zhaotao Zhou
  • Yan Guo
  • Bin Wang
  • Xueqi Cheng
  • Hongbo Xu
  • Gang Zhang
Abstract

This report presents CAS-ICT's experiments on the mixed query task of the TREC 2004 Web track. Our work focused on combining different kinds of Web page evidence to improve overall retrieval performance. Four kinds of evidence were considered for retrieval: body content (C), anchor text (AT), basic structural information (S0), and extended structural information (S1). Six combination functions were investigated in our experiments. The experimental results show that most of the functions improve retrieval performance. Some heuristic re-ranking techniques were also introduced and tested in the task. No query classification was performed during the experiments.
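As an illustration of what combining evidence can look like, the sketch below merges per-evidence retrieval scores with a weighted linear sum after min-max normalization. The abstract does not name its six combination functions, so this particular function, the normalization step, the weights, and every identifier here (`min_max_normalize`, `combine_linear`, `rank`) are assumptions made for illustration, not the authors' method. Only the evidence labels C, AT, S0, and S1 come from the abstract.

```python
from typing import Dict, List

Scores = Dict[str, float]  # document id -> retrieval score for one evidence


def min_max_normalize(scores: Scores) -> Scores:
    """Rescale one evidence's scores to [0, 1] so sources are comparable."""
    if not scores:
        return {}
    lo, hi = min(scores.values()), max(scores.values())
    if hi == lo:
        # All scores equal: no ranking signal from this evidence.
        return {doc: 0.0 for doc in scores}
    return {doc: (s - lo) / (hi - lo) for doc, s in scores.items()}


def combine_linear(per_evidence: Dict[str, Scores],
                   weights: Dict[str, float]) -> Scores:
    """Weighted sum of normalized scores (an assumed combination function).

    A document missing from one evidence's result list contributes 0
    for that evidence.
    """
    combined: Scores = {}
    for name, scores in per_evidence.items():
        w = weights.get(name, 0.0)
        for doc, s in min_max_normalize(scores).items():
            combined[doc] = combined.get(doc, 0.0) + w * s
    return combined


def rank(combined: Scores) -> List[str]:
    """Sort by descending combined score; ties broken by document id."""
    return sorted(combined, key=lambda d: (-combined[d], d))


# Toy scores for two of the four evidence types (values are made up).
per_evidence = {
    "C": {"d1": 2.0, "d2": 1.0},    # body content
    "AT": {"d2": 5.0, "d3": 0.0},   # anchor text
}
weights = {"C": 0.5, "AT": 0.5}     # hypothetical weights
combined = combine_linear(per_evidence, weights)
ranking = rank(combined)
```

In practice the weights would be tuned on training topics, and other combination functions (for example, rank-based ones) could be swapped in behind the same interface.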


Similar resources

TREC-10 Experiments at CAS-ICT: Filtering, Web and QA

CAS-ICT took part in the TREC conference for the first time this year. We have participated in three tracks of TREC-10. For adaptive filtering track, we paid more attention to feature selection and profile adaptation. For web track, we tried to integrate different ranking methods to improve system performance. For QA track, we focused on question type identification, named entity tagging and an...


TREC 11 Experiments at CAS-ICT: Filtering and Web

CAS-ICT took part in the TREC conference for the second time this year and we undertook two tracks of TREC-11. For filtering track, we have submitted results of all three subtasks. In adaptive filtering, we paid more attention to undetermined documents processing, profile building and adaptation. In batch filtering and routing, a centroid-based classifier is used with preprocessed samples. For ...


ICTNET at TREC 2017 Common Core Track

Xu Chang1,2,3, Liying Jiao1,2,3, Jinlong Liu1,2,3, Weijian Zhu1,2,3, Yuanhai Xue1,2, Li Zha1,2, Yue Liu1,2, Xueqi Cheng4. 1) Institute of Computing Technology, Chinese Academy of Sciences, Beijing, 100190; 2) Key Laboratory of Web Data Science and Technology, CAS; 3) University of Chinese Academy of Sciences, Beijing, 100190; 4) Institute of Network Technology, ICT (YANTAI), CAS. {changxu, jiaoliying...


Research on Enterprise Track of TREC 2007

We (ICT-CAS team) participated in the Enterprise Track of TREC 2007. This paper reports our experimental results on this track.


Experiments in TREC 2007 Blog Opinion Task at CAS-ICT

This paper describes our participation in TREC 2007 Blog Track Tasks: Opinion retrieval and Polarity classification. As for Opinion retrieval task, a two-step approach is used to retrieve opinion relevant blog unit (that is blog post and its comments) given a query after filtering Spam blog and extracting blog unit. With Polarity Classification, Drag-push [1] based classifier is employed to get...




Publication date: 2004