The Harbin Institute of Technology (Shenzhen) Computing and Intelligence Research Institute team has established a multimodal large-scale model development enterprise, Shenzhen Ruoyu Technology Co., Ltd. The first model under the company, ‘JiuTian’, has topped the OpenCompass multimodal large-scale model ranking upon its debut evaluation. With over billions of parameters, JiuTian has achieved multimodal fusion of text, images, audio, and video, and its intelligent understanding and response capabilities cover fields such as natural language processing, computer vision, and speech recognition. CEO of Ruoyu Technology, Dr. Sun Teng, explained that the model transcends the boundaries of various modes such as text, images, audio, and video with its powerful understanding and responsive capabilities.