International Journal on Science and Technology

E-ISSN: 2229-7677     Impact Factor: 9.88

A Widely Indexed Open Access Peer Reviewed Multidisciplinary Bi-monthly Scholarly International Journal

Call for Paper Volume 16 Issue 2 April-June 2025 Submit your research before last 3 days of June to publish your research paper in the issue of April-June.

Advanced Strategies For AI Model Deployment In Cloud Environment

Author(s) Janavi G prabhakar, G R Amrutha, Ambaldhage Anusha
Country India
Abstract Deploying AI models in cloud environments comes with several challenges, including security risks, cost inefficiencies, latency concerns, and performance issues. These obstacles can hinder the widespread adoption and scalability of AI-powered applications. This paper investigates strategies such as Zero Trust Security models, federated learning, autoscaling, serverless AI, multi-cloud deployment, and 5G-enabled edge computing to optimize AI scalability and efficiency. The study also delves into model optimization techniques, including quantization, pruning, and knowledge distillation, aimed at improving inference speed while reducing computational costs. Experimental results show that combining cloud-edge hybrid models with autoscaling and model optimization leads to cost savings, better security, and enhanced real-time AI processing
Keywords Real-time AI Processing, Cloud-Edge AI, Cost Optimization, Model Optimization, Zero Trust Security
Field Computer > Artificial Intelligence / Simulation / Virtual Reality
Published In Volume 16, Issue 2, April-June 2025
Published On 2025-05-11
DOI https://doi.org/10.71097/IJSAT.v16.i2.4936
Short DOI https://doi.org/g9kc6t

Share this