Kent Academic Repository

Memory Deduplication: An Effective Approach to Improve the Memory System

Wang, Frank Z. (2017) Memory Deduplication: An Effective Approach to Improve the Memory System. Journal of Information Science and Engineering, 33 (5). pp. 1103-1120. ISSN 1016-2364. (doi:keyId=158_2059) (KAR id:69676)

Warning
There is a more recent version of this item available.

Abstract

Programs now place heavier demands on memory to hold their data than before. This paper analyzes the characteristics of memory data using seven real memory traces. It observes that the traces contain a large volume of memory pages with identical contents. Furthermore, the amount of unique memory content accessed is much smaller than the number of unique memory addresses accessed. This is caused by traditional address-based cache replacement algorithms, which replace memory pages by checking their addresses rather than their contents, so that many identical memory contents with different addresses end up stored in memory. For example, within the same file system, opening two identical files stored in different directories, or opening two similar files in the same directory that share a certain amount of content, results in identical data blocks being stored in the cache. Based on these observations, this paper evaluates memory compression and memory deduplication. As expected, memory deduplication greatly outperforms memory compression. For example, the best deduplication ratio is 4.6 times higher than the best compression ratio. Deduplication and restoration are 121 times and 427 times faster than compression and decompression, respectively. The experimental results in this paper should offer useful insights for designing systems that require abundant memory to improve system performance.
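
The contrast the abstract draws between address-based and content-based handling of pages can be illustrated with a minimal sketch. It is not the paper's implementation: the 4 KiB page size, the use of SHA-256 as a content fingerprint, zlib as the compressor, the ratio definitions (total pages over unique pages, original bytes over compressed bytes), and all function names are assumptions chosen for illustration.

```python
import hashlib
import zlib

PAGE_SIZE = 4096  # assumed page size in bytes


def split_pages(memory: bytes, page_size: int = PAGE_SIZE) -> list[bytes]:
    """Cut a memory image into fixed-size pages (the last one may be shorter)."""
    return [memory[i:i + page_size] for i in range(0, len(memory), page_size)]


def deduplicate(pages: list[bytes]) -> tuple[dict[bytes, bytes], list[bytes]]:
    """Store each distinct page content once, keyed by its SHA-256 digest.

    Returns (store, table): store maps digest -> content, and table holds one
    digest per original page position, so different addresses with identical
    contents share a single stored copy.
    """
    store: dict[bytes, bytes] = {}
    table: list[bytes] = []
    for page in pages:
        digest = hashlib.sha256(page).digest()
        store.setdefault(digest, page)  # keep only the first copy
        table.append(digest)
    return store, table


def restore(store: dict[bytes, bytes], table: list[bytes]) -> bytes:
    """Rebuild the original memory image from the deduplicated form."""
    return b"".join(store[digest] for digest in table)


def dedup_ratio(pages: list[bytes], store: dict[bytes, bytes]) -> float:
    """Assumed definition: total pages divided by unique pages."""
    return len(pages) / len(store) if store else 1.0


def compression_ratio(memory: bytes) -> float:
    """Assumed definition: original size divided by zlib-compressed size."""
    return len(memory) / len(zlib.compress(memory)) if memory else 1.0


if __name__ == "__main__":
    page_a = bytes([1]) * PAGE_SIZE
    page_b = bytes([2]) * PAGE_SIZE
    memory = page_a + page_b + page_a + page_a  # four addresses, two contents
    pages = split_pages(memory)
    store, table = deduplicate(pages)
    print("dedup ratio:", dedup_ratio(pages, store))
    print("compression ratio:", compression_ratio(memory))
    print("restored correctly:", restore(store, table) == memory)
```

The synthetic pages in the demo are trivially compressible, so the two ratios it prints say nothing about real workloads; the figures reported in the abstract come from the paper's seven memory traces.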

Item Type: Article
DOI/Identification number: keyId=158_2059
Additional information: Received the Best Paper Award among all papers published in the Journal of Information Science and Engineering in 2017
Uncontrolled keywords: Memory deduplication, address-based cache, content-based cache, memory compression, data characteristics
Subjects: Q Science > QA Mathematics (inc Computing science) > QA 75 Electronic computers. Computer science
Divisions: Divisions > Division of Computing, Engineering and Mathematical Sciences > School of Computing
Depositing User: Frank Wang
Date Deposited: 19 Oct 2018 04:15 UTC
Last Modified: 16 Feb 2021 13:58 UTC
Resource URI: https://kar.kent.ac.uk/id/eprint/69676 (The current URI for this page, for reference purposes)
