We use cookies to improve your experience with our site.

多布隆过滤器热数据识别:区块级别决策vs I/O请求级别决策

Hot Data Identification with Multiple Bloom Filters: Block-Level Decision vs I/O Request-Level Decision

  • 摘要: 热数据识别对于很多应来说至关重要,但目前这方面的研究并不多。几乎所有的现存研究都把重点放在了频率上,然而,有效地识别热数据需要同时考虑新近和频率。而且,以往的研究均在数据块层级上作热数据决策。因为闪存存储的随机访问的性能与其顺序访问的性能一样优秀,如此细粒度的决策特别适合闪存存储。但是,硬盘驱动器(HDD)在顺序访问和随机访问的性能有显著差异,因此,与闪存存储不同,HDD不对称的访问性能使得需要粗粒度的决策。本文提出了一个新的热数据识别方案,它利用多布隆过滤器有效地描述新近和频率。因此,它不仅少消耗了50%的内存和高达58%的计算花销,而且与现有代表性方案相比,它降低了高达65%的错误识别率。此外,我们将此方案应用到下一代HDD技术,叠瓦式磁破纪录(SMR),以验证它的效用,。为此,我们设计了一个全新的基于SMR驱动的粗粒度决策的热数据识别方法。实验揭示了准确的热数据识别的重要性和好处;它为本文提出的SMR驱动性能提升高达42%。

     

    Abstract: Hot data identification is crucial for many applications though few investigations have examined the subject. All existing studies focus almost exclusively on frequency. However, effectively identifying hot data requires equally considering recency and frequency. Moreover, previous studies make hot data decisions at the data block level. Such a fine-grained decision fits particularly well for flash-based storage because its random access achieves performance comparable with its sequential access. However, hard disk drives (HDDs) have a significant performance disparity between sequential and random access. Therefore, unlike flash-based storage, exploiting asymmetric HDD access performance requires making a coarse-grained decision. This paper proposes a novel hot data identification scheme adopting multiple bloom filters to efficiently characterize recency as well as frequency. Consequently, it not only consumes 50% less memory and up to 58% less computational overhead, but also lowers false identification rates up to 65% compared with a state-of-the-art scheme. Moreover, we apply the scheme to a next generation HDD technology, i.e., Shingled Magnetic Recording (SMR), to verify its effectiveness. For this, we design a new hot data identification based SMR drive with a coarse-grained decision. The experiments demonstrate the importance and benefits of accurate hot data identification, thereby improving the proposed SMR drive performance by up to 42%.

     

/

返回文章
返回