OpenAI's first custom silicon, co-developed with Broadcom, is designed from scratch for large language model inference and is targeted for gigawatt-scale deployment by late 2026.

OpenAI and Broadcom have jointly announced “Jalapeño,” a custom accelerator chip built specifically for large language model (LLM) inference — the process of running a trained AI model to generate outputs. [1] The chip is described as OpenAI’s first “Intelligence Processor” and the first in a planned multi-generation hardware platform the two companies are developing together. [1]

Broadcom CEO Hock Tan and President Charlie Kawwas presented the first wafer to OpenAI CEO Sam Altman and President Greg Brockman at the announcement. [1] For OpenAI, which has until now focused on model and product development rather than hardware, the announcement marks its first move into custom silicon. [1]

According to OpenAI, Jalapeño was not derived from a general-purpose chip but was designed from the ground up for modern LLM inference workloads. [1] The division of labor in the project has OpenAI leading chip design, Broadcom contributing silicon manufacturing and networking technology — including its Tomahawk networking chips — and Celestica handling boards, racks, and system integration. [1]

OpenAI says the chip’s architecture reduces data movement and pushes hardware utilization closer to its theoretical maximum. [1] Engineering samples are already running machine learning workloads in the lab, including the GPT-5.3-Codex-Spark model, which currently also runs on Cerebras hardware. [1] Early internal tests showed performance per watt that is “substantially better” than current state-of-the-art hardware, though OpenAI itself notes these figures have not been finalized. [1] The company says a technical report will follow, but at this stage the specific chips Jalapeño was benchmarked against, the tasks used, and the testing conditions have not been disclosed. [1]

OpenAI claims the design-to-tape-out process — the full cycle from initial design to a completed chip layout ready for manufacturing — took nine months, which the company describes as the fastest application-specific integrated circuit (ASIC) development cycle for high-performance semiconductors it is aware of. [1] OpenAI also says its own AI models helped accelerate parts of the chip design process. [1] Reports of OpenAI’s custom chip ambitions had been circulating since 2023. [1]

Broadcom CEO Tan has said the first deployment is planned for late 2026 at gigawatt scale, in partnership with Microsoft and other unnamed partners. [1] Broadcom has reportedly required Microsoft to guarantee it will purchase 40 percent of the chips in order to secure the first phase of production. [1] OpenAI frames the broader effort as evidence that controlling the full stack — from chip to product — enables faster, more reliable, and lower-cost model serving. [1]

Sources

The Decoder — OpenAI and Broadcom unveil "Jalapeño," a custom chip built for LLM inference

This article was drafted with AI from the cited sources and checked against them before publication. Spot an error? Let us know.

OpenAI and Broadcom Unveil ‘Jalapeño’ Custom LLM Inference Chip

Sources

More from the Newsroom

OpenAI to Release GPT-5.6 in Limited Preview After U.S. Government Request

Meta Races to Replace Human Moderators with LLMs, Staff Warn of Risks

OpenAI Updates GPT-5.5 Instant With Better Intent and Context Handling

Ready to build something that ships?