Ripple: A Decentralized Edge Based Data Deduplication Frame Work

Authors

  • Mohd Abraar B.E.Students; Dept of CSE, ISL engineering College (affiliated to Osmania University)Hyderabad India Author
  • Mohd Rafay B.E.Students; Dept of CSE, ISL engineering College (affiliated to Osmania University)Hyderabad India Author
  • Syed Mahmood Hussaini B.E.Students; Dept of CSE, ISL engineering College (affiliated to Osmania University)Hyderabad India Author
  • Dr. Md Zainlabuddin Associate professor; Dept of CSE, ISL engineering College (affiliated to Osmania University) Hyderabad India Author

DOI:

https://doi.org/10.63665/adq26c36

Keywords:

Edge computing, data redundancy, data retrieval latency, data deduplication, data index, cloud deduplication

Abstract

With its advantages in ensuring low data retrieval latency and reducing backhaul network traffic, edge computing is becoming a backbone solution for many latency-sensitive appli cations. An increasingly large number of data is being generated at the edge, stretching the limited capacity of edge storage sys tems. Improving resource utilization for edge storage systems has become a significant challenge in recent years. Existing solutions attempt to achieve this goal through data placement optimization, data partitioning, data sharing, etc. These approaches overlook the data redundancy in edge storage systems, which produces substantial storage resource wastage. This motivates the need for an approach for data deduplication at the edge. However, existing data deduplication methods rely on centralized control, which is not always feasible in practical edge computing environments. This article presents Ripple, the first approach that enables edge servers to deduplicate their data in a decentralized manner. At its core, it builds a data index for each edge server, enabling them to deduplicate data without central control. With Ripple, edge servers can 1) identify data duplicates; 2) remove redundant data without violating data retrieval latency constraints; and 3) ensure data availability after deduplication. The results of trace driven experiments conducted in a testbed system demonstrate the use fulness of Ripple in practice. Compared with the state-of-the-art approach,Ripple improves the deduplication ratio by upto 16.79% and reduces data retrieval latency by an average of 60.42%. ,

Downloads

Download data is not yet available.

References

[1] W. Shi, G. Pallis, and Z. Xu, “Edge computing,” Proc. IEEE, vol. 107, no. 8, pp. 1474–1481, Aug. 2019.

[2] AWS, “AWS Wavelength for media & entertainment,” 2021. [Online]. Available: https://d1.awsstatic.com/Wavelength2020/AWS-Wavelengthfor-Media-Entertainment-SolutionBrief-Feb2021-Final.pdf

[3] G. Zhu et al., “Pushing AI to wireless network edge: An overview on integrated sensing, communication, and computation towards 6G,” Sci. China Inf. Sci., vol. 66, no. 3, 2023, Art. no. 130301.

[4] Q. He, Z. Dong, F. Chen, S. Deng, W. Liang, and Y. Yang, “Pyramid: Enabling hierarchical neural networks with edge computing,” in Proc. ACM Web Conf., 2022, pp. 1860–1870.

[5] H. Jin, R. Luo, Q. He, S. Wu, Z. Zeng, and X. Xia, “Cost-effective data placement in edge storage systems with erasure code,” IEEE Trans. Serv. Comput., vol. 16, no. 2, pp. 1039–1050, Mar./Apr. 2023.

[6] W. Xiao, Y. Hao, J. Liang, L. Hu, S. A. Alqahtani, and M. Chen, “Adaptive compression offloading and resource allocation for edge vision computing,” IEEE Trans. Cogn. Commun. Netw., to be published, doi: 10.1109/TCCN.2024.3400820.

[7] J. Peng et al., “MagNet: Cooperative edge caching by automatic content congregating,” in Proc. ACM Web Conf., 2022, pp. 3280–3288.

[8] L. Yuan et al., “CSEdge: Enabling collaborative edge storage for multiaccess edge computing based on blockchain,” IEEE Trans. Parallel Distrib. Syst., vol. 33, no. 8, pp. 1873–1887, Aug. 2022.

[9] R. Li, Z. Zhou, X. Zhang, and X. Chen, “Joint application placement and request routing optimization for dynamic edge computing service management,” IEEE Trans. Parallel Distrib. Syst., vol. 33, no. 12, pp. 4581–4596, Dec. 2022.

[10] X. Xia, F. Chen, Q. He, J. Grundy, M. Abdelrazek, and H. Jin, “Online collaborative data caching in edge computing,” IEEE Trans. Parallel Distrib. Syst., vol. 32, no. 2, pp. 281–294, Feb. 2021.

[11] J. Zhou, F. Chen, G. Cui, Y. Xiang, and Q. He, “FEUAGame: Fairnessaware edge user allocation for app vendors,” IEEE Trans. Parallel Distrib. Syst., vol. 35, no. 8, pp. 1429–1443, Aug. 2024.

[12] X. Ge, S. Tu, G. Mao, C.-X. Wang, and T. Han, “5G ultra-dense cellular networks,” IEEE Wireless Commun., vol. 23, no. 1, pp. 72–79, Feb. 2016.

Downloads

Published

2026-04-26

How to Cite

Ripple: A Decentralized Edge Based Data Deduplication Frame Work. (2026). International Journal of Multidisciplinary Engineering In Current Research, 11(4s), 71-80. https://doi.org/10.63665/adq26c36