OpenAI and Broadcom debut Jalapeño chip, a custom LLM inference processor

OpenAI and Broadcom unveiled Jalapeño, a custom ASIC designed from scratch for large language model inference. Engineering samples are running GPT-5.3 workloads. The chip aims to deliver substantially better performance per watt than current offerings.

Listen to this article 3 min

OpenAI and Broadcom on Wednesday unveiled Jalapeño, a custom inference processor co-designed from the ground up for large language model workloads. The chip is OpenAI's first Intelligence Processor and represents a move by the AI company to own more of its compute stack.

The chip was developed from initial design to manufacturing tape-out in just nine months, which the companies claim is likely the fastest ASIC development cycle ever achieved in high-performance semiconductors. Engineering samples are already running ML workloads in the lab at production target frequency and power, including OpenAI's GPT-5.3-Codex-Spark model.

A blank-slate architecture for LLM inference

Unlike general-purpose accelerators adapted from earlier AI workloads, Jalapeño was designed from scratch around OpenAI's deep understanding of LLM fundamentals. The architecture reduces data movement and balances compute, memory, and networking resources to achieve utilization much closer to theoretical peak performance, according to the company. Broadcom's Tomahawk networking silicon is used in large-scale production configurations.

Celestica, an electronics manufacturing firm, also contributed to bringing the silicon to production.
Richard Ho, head of OpenAI's hardware program, said the architecture was optimized around kernels, memory movement, networking, and serving patterns that matter most for frontier AI models.
OpenAI used its own AI models to accelerate parts of the design and optimization process, contributing to the rapid nine-month development cycle.
Initial testing indicates Jalapeño will deliver performance per watt substantially better than current state-of-the-art, though final metrics have not been published.

Full-stack infrastructure strategy takes shape

Greg Brockman, president and co-founder of OpenAI, said Jalapeño is part of a long-term full-stack strategy to make compute more abundant, resulting in faster, more reliable, and more affordable AI. By designing more of the stack itself, OpenAI aims to serve more intelligence with greater efficiency and push advanced AI toward broader access.

The chip announcement signals OpenAI's intent to reduce reliance on merchant silicon for inference. The company has not disclosed when Jalapeño will be deployed at scale, but a detailed technical report on performance is expected in the coming months. The chip is the first in a planned multi-generation compute platform.

Fact check

Jalapeño was developed from initial design to tape-out in nine months, possibly the fastest ASIC development cycle for high-performance semiconductors.

reported · source
Engineering samples are running GPT-5.3-Codex-Spark at production target frequency and power.

reported · source
Jalapeño's architecture is a blank-slate design for LLM inference, not a general-purpose accelerator.

reported · source
OpenAI used its own AI models to accelerate parts of the chip design and optimization.

reported · source

Source reporting (3)

0 Comments

No comments yet

Be the first to share your thoughts on this article.

Join the conversation

You need to be registered and logged in to comment on blog articles.

OpenAI and Broadcom Unveil Jalapeño, a Custom AI Inference Chip Challenging Nvidia

Jun 24, 2026

SK Hynix plans record $29 billion US listing as AI memory demand explodes

Jun 24, 2026

Slate Auto Launches $24,950 Electric Pickup, Targets DIY Repairs and Customization

Jun 24, 2026

Back to News Desk

OpenAI and Broadcom debut Jalapeño chip, a custom LLM inference processor

A blank-slate architecture for LLM inference

Full-stack infrastructure strategy takes shape

Fact check

Source reporting (3)

0 Comments

Related Articles

OpenAI and Broadcom Unveil Jalapeño, a Custom AI Inference Chip Challenging Nvidia

SK Hynix plans record $29 billion US listing as AI memory demand explodes

Slate Auto Launches $24,950 Electric Pickup, Targets DIY Repairs and Customization

Who Is Online