Search engine for discovering works of Art, research articles, and books related to Art and Culture
ShareThis
Javascript must be enabled to continue!

Efficient Workload Allocation and Scheduling Strategies for AI-Intensive Tasks in Cloud Infrastructures.

View through CrossRef
- The rapid proliferation of Artificial Intelligence (AI) applications has underscored the need for advanced cloud infrastructures capable of efficiently managing AI-intensive workloads. This paper delves into the intricacies of workload allocation and scheduling in the context of cloud environments, specifically focusing on the challenges posed by AI-intensive tasks. Our research endeavors to scrutinize existing strategies, discern their limitations, and proffer innovative approaches tailored to optimize the allocation and scheduling of AI workloads within cloud infrastructures. In elucidating the challenges, we pinpoint resource heterogeneity, dynamic workload characteristics, and scalability as the crux of the issues confronting AI-intensive workload management. The diverse computational demands of AI workloads make it challenging to allocate resources optimally, while the dynamic nature of these tasks necessitates adaptive strategies to accommodate varying computational requirements over time. [1] Additionally, as AI models and datasets burgeon in complexity and size, ensuring scalability becomes paramount for sustaining performance in cloud environments. Our literature review encompasses an examination of both traditional and state-of-the-art workload allocation strategies, shedding light on their respective strengths and shortcomings. We also delve into scheduling techniques employed for managing AI-intensive tasks, providing a comprehensive overview of the existing landscape. To address these challenges, we propose a novel framework centered around dynamic resource provisioning, machine learning-based scheduling, and efficient task migration strategies. The framework aims to adaptively allocate resources based on the evolving nature of AI workloads, leveraging machine learning algorithms to predict workload characteristics and employing efficient task migration to handle workload fluctuations. The paper concludes with an experimental evaluation of the proposed strategies, conducted in a simulated environment using diverse datasets. Key performance metrics, such as throughput, latency, and resource utilization, are employed to assess the effectiveness of our strategies compared to existing approaches. By offering insights into the efficient management of AI-intensive workloads in cloud infrastructures, this research contributes to the ongoing efforts to enhance the scalability and performance of cloud environments in the face of burgeoning AI applications. DOI: https://doi.org/10.52783/pst.160
Title: Efficient Workload Allocation and Scheduling Strategies for AI-Intensive Tasks in Cloud Infrastructures.
Description:
- The rapid proliferation of Artificial Intelligence (AI) applications has underscored the need for advanced cloud infrastructures capable of efficiently managing AI-intensive workloads.
This paper delves into the intricacies of workload allocation and scheduling in the context of cloud environments, specifically focusing on the challenges posed by AI-intensive tasks.
Our research endeavors to scrutinize existing strategies, discern their limitations, and proffer innovative approaches tailored to optimize the allocation and scheduling of AI workloads within cloud infrastructures.
In elucidating the challenges, we pinpoint resource heterogeneity, dynamic workload characteristics, and scalability as the crux of the issues confronting AI-intensive workload management.
The diverse computational demands of AI workloads make it challenging to allocate resources optimally, while the dynamic nature of these tasks necessitates adaptive strategies to accommodate varying computational requirements over time.
[1] Additionally, as AI models and datasets burgeon in complexity and size, ensuring scalability becomes paramount for sustaining performance in cloud environments.
Our literature review encompasses an examination of both traditional and state-of-the-art workload allocation strategies, shedding light on their respective strengths and shortcomings.
We also delve into scheduling techniques employed for managing AI-intensive tasks, providing a comprehensive overview of the existing landscape.
To address these challenges, we propose a novel framework centered around dynamic resource provisioning, machine learning-based scheduling, and efficient task migration strategies.
The framework aims to adaptively allocate resources based on the evolving nature of AI workloads, leveraging machine learning algorithms to predict workload characteristics and employing efficient task migration to handle workload fluctuations.
The paper concludes with an experimental evaluation of the proposed strategies, conducted in a simulated environment using diverse datasets.
Key performance metrics, such as throughput, latency, and resource utilization, are employed to assess the effectiveness of our strategies compared to existing approaches.
By offering insights into the efficient management of AI-intensive workloads in cloud infrastructures, this research contributes to the ongoing efforts to enhance the scalability and performance of cloud environments in the face of burgeoning AI applications.
DOI: https://doi.
org/10.
52783/pst.
160.

Related Results

CLOUD COMPUTING - NAVIGATING THE DIGITAL SKY
CLOUD COMPUTING - NAVIGATING THE DIGITAL SKY
“Cloud Computing – Navigating the Digital Sky” is an extensive guide designed to provide a thorough understanding of cloud computing, an essential technology in today’s digital age...
Hybrid Cloud Scheduling Method for Cloud Bursting
Hybrid Cloud Scheduling Method for Cloud Bursting
In the paper, we consider the hybrid cloud model used for cloud bursting, when the computational capacity of the private cloud provider is insufficient to deal with the peak number...
Leveraging Artificial Intelligence for smart cloud migration, reducing cost and enhancing efficiency
Leveraging Artificial Intelligence for smart cloud migration, reducing cost and enhancing efficiency
Cloud computing has become a critical component of modern IT infrastructure, offering businesses scalability, flexibility, and cost efficiency. Unoptimized cloud migration strategi...
Supporting cloud resource allocation in configurable business process models
Supporting cloud resource allocation in configurable business process models
Supporter l'allocation des ressources cloud dans les processus métiers configurables Les organisations adoptent de plus en plus les Systèmes (PAIS) pour gérer leurs...
Workflow Scheduling Based on Mobile Cloud Computing Machine Learning
Workflow Scheduling Based on Mobile Cloud Computing Machine Learning
In recent years, cloud workflow task scheduling has always been an important research topic in the business world. Cloud workflow task scheduling means that the workflow tasks subm...
Reinforcement Learning-Based Framework for Optimal Task Scheduling in Cloud Computing
Reinforcement Learning-Based Framework for Optimal Task Scheduling in Cloud Computing
Cloud computing enables the execution of large-scale computing tasks in a pay-per-use manner, allowing users worldwide to submit diverse workloads to cloud infrastructures. In this...
COMPREHENSIVE METHOD OF ENERGY-EFFICIENT WORKLOAD PROCESSING IN THE INFORMATION AND COMMUNICATION NETWORK
COMPREHENSIVE METHOD OF ENERGY-EFFICIENT WORKLOAD PROCESSING IN THE INFORMATION AND COMMUNICATION NETWORK
Background. Peculiarities of the workload in a modern information and communication network (ICN) determine specific requirements for energy efficiency, performance and availabilit...
AI-driven zero-touch orchestration of edge-cloud services
AI-driven zero-touch orchestration of edge-cloud services
(English) 6G networks demand orchestration systems capable of managing thousands of distributed microservices under sub-millisecond latency constraints. Traditional centralized app...

Back to Top