International Journal on Science and Technology

E-ISSN: 2229-7677     Impact Factor: 9.88

A Widely Indexed Open Access Peer Reviewed Multidisciplinary Bi-monthly Scholarly International Journal

Legacy to Cloud-Native: Scalable Strategies for Migrating Huge Hadoop Clusters To AWS

Author(s) Praveen Kodakandla
Country India
Abstract As organizations seek greater flexibility and the ability to scale quickly, migrating from traditional Hadoop systems to cloud-based platforms has become a key strategy. This paper describes a simple, adaptable approach to moving a 5-petabyte, 200-node Hadoop cluster from an on-premises deployment to Amazon Web Services (AWS). We first review the limitations of conventional on-premises Hadoop and highlight the relevant features of Amazon EMR, S3, Glue and Athena. A case study then demonstrates that, with planned phases, appropriate technology and open standards, businesses can transition smoothly, with less downtime, better performance and lower cost. The paper also examines the main challenges, namely data egress, reproducibility, schema validation and vendor lock-in, and offers suggestions for addressing them. We find that cloud migration, done correctly, helps modernize data systems and makes organizations more adaptable, faster and more reliable. We conclude with a discussion of emerging trends such as serverless analytics, metadata management and multi-cloud operation.
Field Engineering
Published In Volume 13, Issue 4, October-December 2022
Published On 2022-11-10
DOI https://doi.org/10.71097/IJSAT.v13.i4.6415
Short DOI https://doi.org/g9qtjs
