Microsoft is pushing its AI capabilities beyond the cloud by creating hardware-aware optimization tools for ONNX models, as well as collaborating with Qualcomm to produce Arm processors with AI accelerators. These accelerators, known as NPUs, allow users to run AI workloads locally, without relying on cloud compute. However, different tool chains are needed for different hardware implementations, making it difficult to write general purpose AI applications.
