Quadric today announced that support for the Llama 2 large language model (LLM) is immediately available on its Chimera general purpose neural processing unit (GPNPU) intellectual property (IP) core. Unlike other IP and semiconductor application processor suppliers, Quadric was able to add this support with a simple software port with no hardware changes, allowing existing designs to immediately run this model. Other suppliers have announced plans to change their hardware to offer support in 2024 or beyond. Meta, Qualcomm, Mediatek, Ceva, and Cadence have all announced plans to support LLMs, but all imply that silicon respins are required to gain this new capability.
