TRIP: An interactive retrieving-inferring data imputation approach
KAUST DepartmentComputer, Electrical and Mathematical Sciences and Engineering (CEMSE) Division
Computer Science Program
Online Publication Date2016-06-25
Print Publication Date2016-05
Permanent link to this recordhttp://hdl.handle.net/10754/621293
MetadataShow full item record
AbstractData imputation aims at filling in missing attribute values in databases. Existing imputation approaches to nonquantitive string data can be roughly put into two categories: (1) inferring-based approaches , and (2) retrieving-based approaches . Specifically, the inferring-based approaches find substitutes or estimations for the missing ones from the complete part of the data set. However, they typically fall short in filling in unique missing attribute values which do not exist in the complete part of the data set . The retrieving-based approaches resort to external resources for help by formulating proper web search queries to retrieve web pages containing the missing values from the Web, and then extracting the missing values from the retrieved web pages . This webbased retrieving approach reaches a high imputation precision and recall, but on the other hand, issues a large number of web search queries, which brings a large overhead . © 2016 IEEE.
CitationLi Z, Qin L, Cheng H, Zhang X, Zhou X (2016) TRIP: An interactive retrieving-inferring data imputation approach. 2016 IEEE 32nd International Conference on Data Engineering (ICDE). Available: http://dx.doi.org/10.1109/ICDE.2016.7498375.
Conference/Event name32nd IEEE International Conference on Data Engineering, ICDE 2016