Ripple: A Decentralized Edge Based Data Deduplication Frame Work
DOI:
https://doi.org/10.63665/adq26c36Keywords:
Edge computing, data redundancy, data retrieval latency, data deduplication, data index, cloud deduplicationAbstract
With its advantages in ensuring low data retrieval latency and reducing backhaul network traffic, edge computing is becoming a backbone solution for many latency-sensitive appli cations. An increasingly large number of data is being generated at the edge, stretching the limited capacity of edge storage sys tems. Improving resource utilization for edge storage systems has become a significant challenge in recent years. Existing solutions attempt to achieve this goal through data placement optimization, data partitioning, data sharing, etc. These approaches overlook the data redundancy in edge storage systems, which produces substantial storage resource wastage. This motivates the need for an approach for data deduplication at the edge. However, existing data deduplication methods rely on centralized control, which is not always feasible in practical edge computing environments. This article presents Ripple, the first approach that enables edge servers to deduplicate their data in a decentralized manner. At its core, it builds a data index for each edge server, enabling them to deduplicate data without central control. With Ripple, edge servers can 1) identify data duplicates; 2) remove redundant data without violating data retrieval latency constraints; and 3) ensure data availability after deduplication. The results of trace driven experiments conducted in a testbed system demonstrate the use fulness of Ripple in practice. Compared with the state-of-the-art approach,Ripple improves the deduplication ratio by upto 16.79% and reduces data retrieval latency by an average of 60.42%. ,
Downloads
References
[1] W. Shi, G. Pallis, and Z. Xu, “Edge computing,” Proc. IEEE, vol. 107, no. 8, pp. 1474–1481, Aug. 2019.
[2] AWS, “AWS Wavelength for media & entertainment,” 2021. [Online]. Available: https://d1.awsstatic.com/Wavelength2020/AWS-Wavelengthfor-Media-Entertainment-SolutionBrief-Feb2021-Final.pdf
[3] G. Zhu et al., “Pushing AI to wireless network edge: An overview on integrated sensing, communication, and computation towards 6G,” Sci. China Inf. Sci., vol. 66, no. 3, 2023, Art. no. 130301.
[4] Q. He, Z. Dong, F. Chen, S. Deng, W. Liang, and Y. Yang, “Pyramid: Enabling hierarchical neural networks with edge computing,” in Proc. ACM Web Conf., 2022, pp. 1860–1870.
[5] H. Jin, R. Luo, Q. He, S. Wu, Z. Zeng, and X. Xia, “Cost-effective data placement in edge storage systems with erasure code,” IEEE Trans. Serv. Comput., vol. 16, no. 2, pp. 1039–1050, Mar./Apr. 2023.
[6] W. Xiao, Y. Hao, J. Liang, L. Hu, S. A. Alqahtani, and M. Chen, “Adaptive compression offloading and resource allocation for edge vision computing,” IEEE Trans. Cogn. Commun. Netw., to be published, doi: 10.1109/TCCN.2024.3400820.
[7] J. Peng et al., “MagNet: Cooperative edge caching by automatic content congregating,” in Proc. ACM Web Conf., 2022, pp. 3280–3288.
[8] L. Yuan et al., “CSEdge: Enabling collaborative edge storage for multiaccess edge computing based on blockchain,” IEEE Trans. Parallel Distrib. Syst., vol. 33, no. 8, pp. 1873–1887, Aug. 2022.
[9] R. Li, Z. Zhou, X. Zhang, and X. Chen, “Joint application placement and request routing optimization for dynamic edge computing service management,” IEEE Trans. Parallel Distrib. Syst., vol. 33, no. 12, pp. 4581–4596, Dec. 2022.
[10] X. Xia, F. Chen, Q. He, J. Grundy, M. Abdelrazek, and H. Jin, “Online collaborative data caching in edge computing,” IEEE Trans. Parallel Distrib. Syst., vol. 32, no. 2, pp. 281–294, Feb. 2021.
[11] J. Zhou, F. Chen, G. Cui, Y. Xiang, and Q. He, “FEUAGame: Fairnessaware edge user allocation for app vendors,” IEEE Trans. Parallel Distrib. Syst., vol. 35, no. 8, pp. 1429–1443, Aug. 2024.
[12] X. Ge, S. Tu, G. Mao, C.-X. Wang, and T. Han, “5G ultra-dense cellular networks,” IEEE Wireless Commun., vol. 23, no. 1, pp. 72–79, Feb. 2016.
Downloads
Published
Issue
Section
License
Copyright (c) 2026 Authors

This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.
