I2R-NUS-MSRA at TAC 2011: Entity Linking
Abstract
In this paper, we report the joint participation of the I2R-NUS team and the MSRA team in the entity linking task for Knowledge Base Population at the Text Analysis Conference 2011. The I2R-NUS team submitted two results, one from the full system and one from a partial system for diagnostic purposes. Both results incorporate the new techniques proposed in our recent papers: acronym expansion, instance selection, and topic modeling. In the clustering step, the full system combines three clustering algorithms: spectral graph partitioning (SGP), hierarchical agglomerative clustering (HAC), and latent Dirichlet allocation (LDA). The full system achieves a competitive F-score of 0.8311. The partial system uses only the Wikipedia source to generate candidates for KB linking and only LDA for clustering, which leads to an F-score of 0.813. Although time constraints prevented us from submitting the combination of the I2R-NUS full system with the MSRA KB linking result, that combination was later found to achieve an F-score of 0.828.
Related resources
NUS-I2R: Learning a Combined System for Entity Linking
In this paper, we report the joint participation of NUS and I2R team in Knowledge Base Population at Text analysis conference 2010. For Entity Linking, we analyze IR approaches and SVM classification in the disambiguation stage and develop a supervised learner for combining these approaches. The combined system performs better than the individual components and achieves results much better than...
MSRA at TAC 2011: Entity Linking
The Knowledge Base Population task aims at advancing the state of the art for systems that automatically discover information about named entities and then incorporate this information in a knowledge source. The overall task of populating a knowledge base is decomposed into two related tasks: Entity Linking, where names must be aligned to entities in the KB, and Slot Filling, which involves min...
ECNU: Brief System Description of Submission to Knowledge Base Population at TAC 2011
This paper briefly reports our submissions to the three tasks in TAC KBP 2011, i.e., Slot Filling (SF for short), Entity Linking (EL for short) and Cross-lingual Entity Linking (CEL for short).
THUNLP at TAC KBP 2011 in Entity Linking
Entity Linking is to link a name string from plain-text documents to the corresponding entry in given knowledge base. In this paper we demonstrate our entity linking system for TAC KBP 2011 Track. Our system implements pairwise and listwise learning to rank methods to create a ranking list of candidates with several kinds of features, including context similarity, term frequency, key entity ext...
HITS' Cross-lingual Entity Linking System at TAC 2011: One Model for All Languages
This paper presents HITS’ system for crosslingual entity linking at TAC 2011. We approach the task in three stages: (1) context disambiguation to obtain a language-independent representation, (2) entity disambiguation, (3) clustering of the queries that have not been linked in the second step. For each of these steps one single model is trained and applied to both languages, i.e. English and Ch...
Publication date: 2011