WiMi Hologram Cloud Inc. has developed a deep reinforcement learning-based task scheduling algorithm in cloud computing to improve the performance and resource utilization of cloud computing systems. This algorithm can automatically adjust the policy according to the changes in the environment and can be adapted to complex task scheduling scenarios. It includes state representation, action selection, reward function and training and optimization of the algorithm.