Accomplishments

DATA DEDUPLICATION BASED ON HADOOP
- Abstract
The volume of data generated by users on social media and by companies grows every day, and storing these heterogeneous data in real time without redundancy is a significant challenge. To eliminate duplicate copies and improve data reliability, a deduplication system was designed on top of the Hadoop Distributed File System (HDFS), using MapReduce and HBase together with the SHA-3 (Keccak) standard hash to speed up the deduplication procedure. Files with identical content share a single stored copy, which improves the utilization of cloud storage space.
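
As an illustration only (not the project's actual code), the sketch below shows the core deduplication check under the approach described above: a file's SHA3-256 fingerprint is used as the row key of an HBase index table; if the fingerprint already exists the file is a duplicate and the existing copy is shared, otherwise the fingerprint is recorded before the file would be written to HDFS. The table name "dedup_index", column family "f", and qualifier "path" are assumptions made for the example.

    import java.io.IOException;
    import java.io.InputStream;
    import java.nio.file.Files;
    import java.nio.file.Path;
    import java.security.MessageDigest;
    import java.security.NoSuchAlgorithmException;

    import org.apache.hadoop.conf.Configuration;
    import org.apache.hadoop.hbase.HBaseConfiguration;
    import org.apache.hadoop.hbase.TableName;
    import org.apache.hadoop.hbase.client.Connection;
    import org.apache.hadoop.hbase.client.ConnectionFactory;
    import org.apache.hadoop.hbase.client.Get;
    import org.apache.hadoop.hbase.client.Put;
    import org.apache.hadoop.hbase.client.Table;
    import org.apache.hadoop.hbase.util.Bytes;

    public class DedupCheck {

        // Computes the SHA3-256 (Keccak-based SHA-3) fingerprint of a file,
        // streamed in 8 KB chunks so large files are not loaded into memory.
        static byte[] fingerprint(Path file) throws IOException, NoSuchAlgorithmException {
            MessageDigest sha3 = MessageDigest.getInstance("SHA3-256");
            try (InputStream in = Files.newInputStream(file)) {
                byte[] buf = new byte[8192];
                int n;
                while ((n = in.read(buf)) != -1) {
                    sha3.update(buf, 0, n);
                }
            }
            return sha3.digest();
        }

        public static void main(String[] args) throws Exception {
            Path file = Path.of(args[0]);
            byte[] key = fingerprint(file);

            Configuration conf = HBaseConfiguration.create();
            try (Connection conn = ConnectionFactory.createConnection(conf);
                 Table index = conn.getTable(TableName.valueOf("dedup_index"))) {

                if (index.exists(new Get(key))) {
                    // Duplicate content: reuse the single stored copy instead of writing again.
                    System.out.println("Duplicate content; existing copy is shared.");
                } else {
                    // New content: the file would be uploaded to HDFS here, and the
                    // fingerprint-to-location mapping recorded in the index table.
                    Put put = new Put(key);
                    put.addColumn(Bytes.toBytes("f"), Bytes.toBytes("path"),
                                  Bytes.toBytes(file.toString()));
                    index.put(put);
                    System.out.println("New file; fingerprint recorded.");
                }
            }
        }
    }

In a MapReduce setting, the same logic would typically run in the map phase over incoming files, with HBase acting as the shared fingerprint index across the cluster.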