News Article · Jun 24, 2026 at 7:40 PM
2 min read 0
Member
OpenAI and Broadcom debut Jalapeño chip, a custom LLM inference processor
Industry #Broadcom #OpenAI #Jalapeño #AI chip #LLM inference #custom silicon

OpenAI and Broadcom debut Jalapeño chip, a custom LLM inference processor

OpenAI and Broadcom unveiled Jalapeño, a custom ASIC designed from scratch for large language model inference. Engineering samples are running GPT-5.3 workloads. The chip aims to deliver substantially better performance per watt than current offerings.

Listen to this article 3 min

OpenAI and Broadcom on Wednesday unveiled Jalapeño, a custom inference processor co-designed from the ground up for large language model workloads. The chip is OpenAI's first Intelligence Processor and represents a move by the AI company to own more of its compute stack.

The chip was developed from initial design to manufacturing tape-out in just nine months, which the companies claim is likely the fastest ASIC development cycle ever achieved in high-performance semiconductors. Engineering samples are already running ML workloads in the lab at production target frequency and power, including OpenAI's GPT-5.3-Codex-Spark model.

A blank-slate architecture for LLM inference

Unlike general-purpose accelerators adapted from earlier AI workloads, Jalapeño was designed from scratch around OpenAI's deep understanding of LLM fundamentals. The architecture reduces data movement and balances compute, memory, and networking resources to achieve utilization much closer to theoretical peak performance, according to the company. Broadcom's Tomahawk networking silicon is used in large-scale production configurations.

  • Celestica, an electronics manufacturing firm, also contributed to bringing the silicon to production.
  • Richard Ho, head of OpenAI's hardware program, said the architecture was optimized around kernels, memory movement, networking, and serving patterns that matter most for frontier AI models.
  • OpenAI used its own AI models to accelerate parts of the design and optimization process, contributing to the rapid nine-month development cycle.
  • Initial testing indicates Jalapeño will deliver performance per watt substantially better than current state-of-the-art, though final metrics have not been published.

Full-stack infrastructure strategy takes shape

Greg Brockman, president and co-founder of OpenAI, said Jalapeño is part of a long-term full-stack strategy to make compute more abundant, resulting in faster, more reliable, and more affordable AI. By designing more of the stack itself, OpenAI aims to serve more intelligence with greater efficiency and push advanced AI toward broader access.

The chip announcement signals OpenAI's intent to reduce reliance on merchant silicon for inference. The company has not disclosed when Jalapeño will be deployed at scale, but a detailed technical report on performance is expected in the coming months. The chip is the first in a planned multi-generation compute platform.

Fact check

  • Jalapeño was developed from initial design to tape-out in nine months, possibly the fastest ASIC development cycle for high-performance semiconductors.

    reported · source

  • Engineering samples are running GPT-5.3-Codex-Spark at production target frequency and power.

    reported · source

  • Jalapeño's architecture is a blank-slate design for LLM inference, not a general-purpose accelerator.

    reported · source

  • OpenAI used its own AI models to accelerate parts of the chip design and optimization.

    reported · source

Source reporting (3)

0 Comments

No comments yet

Be the first to share your thoughts on this article.

Join the conversation

You need to be registered and logged in to comment on blog articles.

Who Is Online

In total there are 183 users online: 0 registered, 176 guests and 7 bots.

Most users ever online was 1,755 on 17 Jun 2026, 5:11 pm.

Bots: AhrefsBot Applebot Baiduspider Majestic Other Bot Other Crawler SemrushBot

Users active in the past 15 minutes. Total registered members: 360