[Retracted] Design of Virtual Machine Scheduling Algorithm in Cloud Computing Environment
Abstract
With the application of cloud computing services in more and more fields, it will undertake more computing tasks and storage tasks. The problem of high energy consumption in data centers will become more serious. Virtualization technology is very important in cloud computing, which can improve the utilization rate of resources. At the same time, it has flexibility in resource scheduling and can integrate multiple virtual machines to achieve power efficiency. Using online virtual machine migration technology for energy-saving planning in cloud environment is a hot research topic in academic circles. The scheduling strategy proposed in this paper can reduce the server downtime and the number of server hosts, so as to achieve the maximum use of resources. The main research work of this paper includes the following aspects: Firstly, the energy consumption in the process of virtual machine energy-saving design is modeled, and the relationship between energy consumption and resource usage under different load conditions is analyzed, and the problems are abstracted, e.g., packaging box problem. Secondly, this paper uses genetic algorithm to solve the problem of high energy consumption. Finally, based on the target allocation scheme of virtual machines obtained by the above method, the migration problem of virtual machines is abstracted as the problem of finding the maximum weighted independent set of graphs. And a greedy algorithm is designed to solve this problem. In this paper, CloudSim simulation platform is used to verify the effectiveness of the proposed algorithm. Experiments show that the proposed algorithm can reduce data energy consumption and avoid frequent migration of virtual machines.
1. Introduction
If the virtual machine scheduling problem is not considered carefully, the system load will be unbalanced. To solve this problem, this paper proposes a scheduling strategy based on genetic algorithm to reduce dynamic migration and achieve the best load balance [1]. Because cloud computing has many difficult problems to solve, so how to effectively solve these problems and improve efficiency, this paper proposes a dual fitness genetic algorithm. The research shows that this method can not only shorten the total task time but also shorten the average task time [2]. Big data technology cannot solve the problems in data mining efficiently. Therefore, this paper puts forward the model of cloud computing, which can provide more benefits. It has the characteristics of flexible cost and more flexibility in computing resources [3]. In cloud computing, providing appropriate and reliable application services is the basic requirement. The lightweight virtual machine scheduler of metascheduler proposed in this paper can improve resource utilization and can be further expanded to achieve adaptive load balancing [4]. Genetic algorithm can solve the load balancing problem well. Experiments show that this strategy solves the problems of unbalanced load and high migration cost to a great extent [5]. At present, the offline method cannot be used to solve the problems in uncertain environment, because this method can only solve the certain problems. Efficient cloud computing scheduling method can reduce the energy consumption of cloud computing platform and improve resource utilization [6]. Cloud computing has high requirements on basic equipment and resources. Efficient load balancing can improve the utilization of resources. In order to solve different cloud environments, this paper proposes a variety of load balancing schemes [7]. Experimental results show that ant colony optimization resource allocation method be able to meet the needs of cloud computing environment [8]. In order to solve the problems in cloud security, this paper proposes keystroke dynamics method, which can provide a very effective solution [9]. Aiming at the problem of resource utilization in cloud computing, this paper proposes a virtual machine scheduling algorithm based on K-means clustering, which can make full use of resources and has strong stability [10]. In this paper, we propose a virtual machine scheduling algorithm to solve the mixed load scheduling problem, which uses processors to evaluate the recent energy consumption problem of virtual machines. Experiments show that this method is used in scheduling deviation and normalized energy efficiency [11]. Virtual machine has been widely used because it can improve the utilization rate of resources. With the change of cloud computing scale, the requirements for resource utilization are getting higher and higher, so a more efficient method is needed to improve the allocation of resources [12]. Sustainable green intelligent computer can solve the problem of high energy consumption in cloud computing. It only needs to pay attention to the scheduling of cloud virtual machines, consider the actual configuration of servers, and avoid overheating in the central local location [13]. Task scheduling is the focus of cloud data research. At present, the long execution time means that its cost is also very high, and reducing the execution time means reducing the cost, so this paper proposes a task scheduling algorithm with short average execution time to solve this problem [14]. There are many problems to be solved in resource scheduling of virtual machines. The algorithm proposed in this paper combines genetic algorithm to continuously adjust the host load, so as to achieve the optimal balance [15].
2. Features of Cloud Computing
2.1. Virtualization
The basic feature of cloud computing is virtualization technology. The resources retrieved by users in cloud services are virtualized, not real resources. Cloud services take advantage of virtualization technology to a certain extent, and eliminate the differences between different server architectures by protecting the details of specific hardware and virtualizing IT resources. In cloud services, virtualization of CPU resources is the purpose of virtualization. These resources are usually provided to users in the form of virtual machines, which is convenient for users to use. For cloud providers, resource virtualization helps build an easy-to-use management platform.
2.2. High Availability
Cloud computing platform provides reliable services according to users’ needs. In the cloud server center, the stable operation of the whole system mainly depends on the centralized management of large-scale resources under the unified platform. Data redundancy technology and load balancing technology provide reliable guarantee for the high availability of services. Hadoop’s heartbeat mechanism can monitor the survival of nodes at any time. Once a node goes down, it will be replaced immediately.
2.3. Low Cost
Cloud computing does not require high-performance computing components and professional storage devices. Users only need to pay a very low rent and a terminal that can connect to the Internet, and they can get high-performance computing power and large-scale storage capacity. Cloud application service providers are responsible for maintaining the software and hardware in the cloud, which greatly saves the purchase and maintenance costs of users’ software and hardware.
2.4. On-Demand Service
Cloud service providers charge rent according to the actual usage of users, which can allocate resources in time to increase the actual service capacity of users when the number of resource requests suddenly increases and can also release the resources occupied by users when the number of resources required by users decreases to avoid wasting resources.
2.5. Scalability
Cloud computing architecture and scale have certain scalability, allowing computing nodes to join and exit computer clusters dynamically, which can not only build a usable high-performance computing platform in a short time but also facilitate data center maintenance personnel to maintain equipment.
2.6. Safe and Reliable
Data center uses virtual machines to schedule tasks and uses the isolation of virtual machines to provide users with good security. At the same time, the redundancy technology of the data center ensures the security of the data stored in the cloud. Even if a certain node or a certain area is paralyzed, the standby node in another area can replace the failed node at the first time.
3. Resource Model and Scheduling Algorithm Model
3.1. Basic Model
3.2. Resource Request Model
The above Formulas (5) and (6) indicate that when creating a virtual machine in host hk, the corresponding sum of the various types of resources already allocated thereto plus the value of the corresponding resources to be allocated thereto cannot exceed 80% of the total value of the corresponding resources that this host can accommodate. This filtering can be called Filter filtering.
3.3. Host Resource Load Model
3.4. Scheduling Target Model
In the above formula, hall_average_load represents the average load of all hosts, Variancehost represents the load variance of all hosts, Stdhost represents the load standard deviation of all hosts, and Ratiovm_success represents the success rate of creating virtual machines. The load dispersion of each host can be clearly expressed by variance and standard deviation.
3.5. Host Estimates Remaining Resources
3.6. Predicted Average Weight of Host Computer
- (1)
Calculate the estimated maximum residual resource value and the estimated minimum residual resource value of each resource. The purpose of calculating this value is to obtain the available residual resources on the host, so as to judge which host is more suitable for creating a new virtual machine. According to Formula (15), the following Formula (16) is defined as
- (2)
Calculating the estimated residual resource difference of each resource (which is equal to the estimated maximum resource value minus the estimated minimum resource value), the purpose of calculating the value is to obtain the difference between the estimated residual resources of all hosts and calculate the estimated weight value of each host accordingly. According to Formula (16), Formula (17) is defined as
- (3)
This paper improves the calculation method of host weights in OpenStack’s original algorithm (the original algorithm only considers the weights of a single resource), calculates the estimated average weights of the whole host by using the weights of multiple resources, and defines Formula (18) according to Formulas (15)–(17).
4. Experiment of Virtual Machine Placement Algorithm
4.1. Experimental Simulation Environment
In order to test the performance of the algorithm, 10000 MIPs CPU, 50 GB memory, 1 TB storage, and 10 G bandwidth are set up. Specific settings of virtual machine and algorithm parameters are shown in Tables 1 and 2.
Virtual machine type | CPU (MIPs) | Memory (GB) | Storage (GB) | Bandwidth (G) |
---|---|---|---|---|
1 | 1000 | 4 | 20 | 1 |
2 | 2000 | 8 | 50 | 2 |
3 | 3000 | 16 | 100 | 2 |
4 | 5000 | 24 | 200 | 4 |
Algorithm parameters | Value |
---|---|
Population sizeS | 50 |
Number of iterationsG | 100 |
Crossing rateRc | 0.8 |
Variation rateRm | 0.02 |
In order to analyze the performance of the algorithm, the virtual machines with loads of 100, 200, 300, 400, and 500 are run 10 times, respectively, and the average value of 10 times is taken as the final data of experimental evaluation.
4.2. Experimental Analysis
In order to verify the performance and efficiency of IGGA proposed in this paper, the virtual machine placement algorithm based on FFD is compared with the traditional virtual machine placement algorithm based on GA. FFD algorithm is a commonly used heuristic algorithm, which is generally used to solve the problem of virtual machine placement, while GA algorithm is the basis of improved genetic algorithm, and is often used to evaluate the performance of improved algorithm.
As shown in Figures 1and 2, under the same virtual machine request scale, IGGA algorithm is obviously superior to FFD algorithm and GA algorithm in terms of the number of physical machines activated and the comprehensive utilization rate of resources. Compared with the other two algorithms, the number of physical machines activated is reduced by about 13% and 7%, respectively, and the resource utilization rate is increased by about 30% and 14%. Moreover, the number of activated physical machines is closely related to the request scale of virtual machines. At the same time, the request scale of virtual machine also affects the comprehensive utilization of resources, which fluctuates in a certain range.


As shown in Figure 3, IGGA algorithm is superior to FFD algorithm and GA algorithm in system energy consumption, and the energy consumption is reduced by about 12% and 6%, respectively. It has the least number of physical machines, the highest comprehensive utilization rate of resources, and the minimum energy consumption required by the system. Therefore, the less the number of physical machines started by the system, the higher the resource utilization rate, and then, the energy consumption of the system can be reduced.

As shown in Figure 4, it reflects the fluctuation of load imbalance of the three algorithms under different virtual machine sizes. Under the same virtual machine request size, IGGA algorithm is less unbalanced than FFD algorithm and GA algorithm, which shows that the improved algorithm is better than them in load balancing performance. The performance of FFD algorithm is the worst in load balancing and energy consumption. The algorithm reduces the number of active physical machines to a great extent, which makes the load of individual hosts too high, and the unbalanced load leads to waste of resources and energy consumption. Traditional GA algorithm has the disadvantages of high redundancy and repeated coding, resulting in poor performance of the algorithm. The algorithm proposed in this paper is optimized in energy consumption and load balancing and improves the energy efficiency of the data center as a whole.

How to improve the overall energy efficiency of data center is a problem worthy of people’s attention. By establishing the equilibrium index and combining with the traditional algorithm, the genetic algorithm proposed in this paper can be better optimized. Experiments show that the optimized algorithm can minimize data energy consumption.
4.3. Experimental Simulation Environment
The experiment extends the cloud simulation platform and compares it with set-top box.
In the experiment, 500 physical machines are selected to form a data center. Each physical machine contains four resources with the same performance to verify the algorithm. The experiment also sets virtual machine parameters, specific data center, and algorithm, as shown in Tables 3 and 4.
Resource type | Physical machinery | Virtual machine |
---|---|---|
CPU | 1000/2000/3000 MIPs | 250/500/850/1000 MIPs |
Memory | 100 GB | 256 MB |
Storage | 1 TB | 2.5 GB |
Bandwidth | 20 Gbps | 20 Mbps |
Algorithm parameters | Value |
---|---|
Threshold interval[Tlow,Thigh] | [0.4,0.8] |
Load saturation interval[Lmin,Lmax] | [0.85,0.9] |
Threshold backlashCoffset | 0.05 |
RdcUpper bound constantR0 | 0.1 |
The data processing method is the same as that in Tables 1 and 2.
4.4. Experimental Results and Analysis
In order to build a green data center, we should try to avoid the adverse effects of virtual machine migration, and this algorithm also considers minimizing unnecessary migration of virtual machines. In the algorithm, the statistics of migration times of virtual machines are mainly based on the situation that virtual machines are not in the source physical machines after integration.
As shown in Figure 5, the migration number of virtual machines in different virtual machine sizes is compared between DTMA algorithm and STMA algorithm, and both algorithms set resource load threshold intervals of 0.4 and 0.8. Compared with DTMA algorithm, the number of virtual machines migrated in STMA algorithm fluctuates greatly, which is due to the frequent migration of virtual machines caused by the changing load of physical machines reaching instantaneous peak and valley values. However, DTMA algorithm is lower than STMA algorithm in the number of virtual machine migration under all sizes of virtual machines, which is reduced by about 19%. This is because DTMA algorithm considers the overall load of the data center, dynamically changes the threshold interval of resource load, and deals with unnecessary migration caused by instantaneous peak and valley according to the load of the data center.

The system energy consumption is the sum of the energy consumption of all physical machines in the data center during the system operation, as shown in Figure 6, which reflects the energy consumption of DTMA algorithm and STMA algorithm under different virtual machine scales, and the system energy consumption increases with the increase of the number of virtual machines. Under the same virtual machine request size, the system energy consumption of DTMA algorithm is less than that of STMA algorithm, which is about 12% lower than that of STMA algorithm, indicating that DTMA algorithm is better than STMA algorithm in system energy consumption. Compared with STMA algorithm, DTMA algorithm reduces the energy consumption of cloud data center.

In this paper, a virtual threshold migration algorithm is proposed to solve the problem of unreasonable use of resources. This method can predict load, select migration opportunity, object, and destination physical machine, dynamically changes the threshold value, and migrate in combination with policies, thus reducing the number of servers, thus reducing the power consumption of the data center to the lowest. Through simulation experiments, this method can consider the load change of the whole data center and the dynamic adjustment of threshold interval, avoid unnecessary migration times of virtual machines, and play an important role in reducing energy consumption of data center.
This chapter mainly compares and analyzes the simulation results. Firstly, the simulation results of server number, energy consumption, and load imbalance between the proposed virtual machine placement algorithm and other algorithms are compared and analyzed, and then, the simulation results of virtual machine migration times and energy consumption between the proposed virtual machine migration algorithm and other algorithms are compared and analyzed, respectively.
5. Conclusion
In recent years, virtual machine scheduling based on virtual machines has become an important supporting technology for power management and dynamic expansion of cloud servers. Virtual machine resource scheduling problems can be divided into two categories: deployment and migration. Virtual machine deployment is the process of checking and selecting the appropriate physical target machine from the empty cloud service center according to the virtual machine type and resource request, which has been proved to be a NP-hard problem, similar to the compression problem. In the virtual machine placement stage, the virtual machine placement mainly adopts the corresponding greedy or heuristic virtual machine placement strategy, which is difficult to ensure the balanced utilization of resources and lacks consideration of the data center load type. The purpose of migration is to study the process of data change. The whole process must be handled according to the load of the computer room. In the phase of virtual machine migration, we need to consider the following aspects, choose the migration timing, object, and target physical machine, and adjust the source physical machine according to the load.
Conflicts of Interest
The authors declared that they have no conflicts of interest regarding this work.
Acknowledgments
This work was sponsored in part by the 2020 Teaching Research and Scientific Research Project of Jiuzhou Polytechnic (Yjx202003).
Open Research
Data Availability
The experimental data used to support the findings of this study are available from the corresponding author upon request.