ESearch: Incorporating Text Corpus and Structured Knowledge for Open Domain Entity Search
KAUST DepartmentComputer Science Program
Computer, Electrical and Mathematical Sciences and Engineering (CEMSE) Division
Online Publication Date2018-01-11
Print Publication Date2017
Permanent link to this recordhttp://hdl.handle.net/10754/631132
MetadataShow full item record
AbstractThe paper introduces an open domain entity search system called ESearch, which aims at finding a list of relevant entities to an open domain entity search query (a natural language question). The system is built on top of a Wikipedia text corpus, as well as the structured DBPedia knowledge base. Entities are initially ranked by a model which effectively associates context matching (based on the contexts of entities in the unstructured text corpus) and category matching (based on the types of entities in the structured knowledge base). They are ranked further by a re-ranking component supported by blind feedback or user feedback on entities. We show that category matching is critical for the search performance and the re-ranking component can boost the performance largely. Category matching therefore needs some query entity types (especially specific entity types) as input. However, it is often hard for systems to detect specific entity types because users may not be familiar with how the types of desired entities are defined in the structured knowledge base. In ESearch, we design an effective ranking model of entity types to facilitate blind feedback and user feedback on desired entity types for category matching, so that users can effectively perform entity search without the need of explicitly providing any query entity types as inputs.
CitationMa D, Chen Y, Chen J, Du X, Zhang X (2017) ESearch. Proceedings of the 26th International Conference on World Wide Web Companion - WWW ’17 Companion. Available: http://dx.doi.org/10.1145/3041021.3054720.
SponsorsThis work is supported by the National Science Foundation of China under grant (No. 61472426 and 61432006), 863 key project under grant No. 2015AA015307, the open research program of State Key Laboratory of Computer Architecture, Institute of Computing Technology, Chinese Academy of Science (No. CARCH201510), and the ECNU-RUC-InfoSys Joint Data Science Lab.
JournalProceedings of the 26th International Conference on World Wide Web Companion - WWW '17 Companion
Conference/Event name26th International World Wide Web Conference, WWW 2017 Companion
Except where otherwise noted, this item's license is described as 2017 International World Wide Web Conference Committee (IW3C2), published under Creative Commons CC BY 4.0 License.