We use cookies to improve your experience with our site.
Zhao D, Zhao SY, Chen H et al. Hadamard encoding based frequent itemset mining under local differential privacy. JOURNAL OFCOMPUTER SCIENCE AND TECHNOLOGY 38(6): 1403−1422 Nov. 2023. DOI: 10.1007/s11390-023-1346-7.
Citation: Zhao D, Zhao SY, Chen H et al. Hadamard encoding based frequent itemset mining under local differential privacy. JOURNAL OFCOMPUTER SCIENCE AND TECHNOLOGY 38(6): 1403−1422 Nov. 2023. DOI: 10.1007/s11390-023-1346-7.

Hadamard Encoding Based Frequent Itemset Mining under Local Differential Privacy

  • Local differential privacy (LDP) approaches to collecting sensitive information for frequent itemset mining (FIM) can reliably guarantee privacy. Most current approaches to FIM under LDP add “padding and sampling” steps to obtain frequent itemsets and their frequencies because each user transaction represents a set of items. The current state-of-the-art approach, namely set-value itemset mining (SVSM), must balance variance and bias to achieve accurate results. Thus, an unbiased FIM approach with lower variance is highly promising. To narrow this gap, we propose an Item-Level LDP frequency oracle approach, named the Integrated-with-Hadamard-Transform-Based Frequency Oracle (IHFO). For the first time, Hadamard encoding is introduced to a set of values to encode all items into a fixed vector, and perturbation can be subsequently applied to the vector. An FIM approach, called optimized united itemset mining (O-UISM), is proposed to combine the padding-and-sampling-based frequency oracle (PSFO) and the IHFO into a framework for acquiring accurate frequent itemsets with their frequencies. Finally, we theoretically and experimentally demonstrate that O-UISM significantly outperforms the extant approaches in finding frequent itemsets and estimating their frequencies under the same privacy guarantee.
  • loading

Catalog

    /

    DownLoad:  Full-Size Img  PowerPoint
    Return
    Return