Qualcomm Technologies announced on Oct. 27 that it will launch its AI200 and AI250 inference chips and integrated rack-scale systems in the next two years, marking its entry into the AI data center hardware market currently dominated by Nvidia.
Both chips are available in direct-to-chip rack cooling systems, which use significantly less water than other cooling methods deployed in large hyperscale data centers. Qualcomm also noted that it plans to introduce additional AI products to meet demand from the country’s burgeoning data center market.
“We’re redefining what’s possible for rack-scale AI inference,” said Durga Malladi, Qualcomm’s senior vice president and general manager of technology planning, edge solutions and data centers. “These innovative new AI infrastructure solutions empower customers to deploy generative AI at unprecedented [total cost of ownership], while maintaining the flexibility and security modern data centers demand.”
Qualcomm is entering an AI chip and full-rack systems market that’s currently dominated by Nvidia and, to a lesser extent, AMD.
Qualcomm said its AI chips support machine learning frameworks, inference engines, and generative AI frameworks, and can run demanding AI workloads at a lower total cost than competitors’ chips because they consume less power.
“Our rich software stack and open ecosystem support make it easier than ever for developers and enterprises to integrate, manage, and scale already trained AI models on our optimized AI inference solutions,” Malladi said. “With seamless compatibility for leading AI frameworks and one-click model deployment, Qualcomm AI200 and AI250 are designed for frictionless adoption and rapid innovation.”
The collaboration also includes development of semiconductor infrastructure and manufacturing capabilities in the Arab kingdom by the year 2030.