Hong-Zhi Wang, Zhi-Xin Qi, Ruo-Xi Shi, Jian-Zhong Li, Hong Gao. COSSET+: Crowdsourced Missing Value Imputation Optimized by Knowledge Base[J]. Journal of Computer Science and Technology, 2017, 32(5): 845-857. DOI: 10.1007/s11390-017-1768-1

COSSET+: Crowdsourced Missing Value Imputation Optimized by Knowledge Base

  • Missing value imputation with crowdsourcing is a novel data cleaning method for capturing missing values that can hardly be filled by automatic approaches. However, the time cost and overhead of crowdsourcing are high. Therefore, we have to reduce the cost while guaranteeing the accuracy of crowdsourced imputation. To achieve this optimization goal, we present COSSET+, a crowdsourced framework optimized by a knowledge base. We combine the advantages of a knowledge-based filter and a crowdsourcing platform to capture missing values. Since the number of crowd values affects the cost of COSSET+, we aim to select only part of the missing values to be crowdsourced. We prove that the crowd value selection problem is NP-hard and develop an approximation algorithm for it. Extensive experimental results demonstrate the efficiency and effectiveness of the proposed approaches.