OpenAI's First AI Chip Signals a Push for Greater Control of the AI Stack

OpenAI and Broadcom have unveiled "Jalapeño," OpenAI’s first custom AI processor, marking the latest step in the company’s effort to gain greater control over the infrastructure that powers its artificial intelligence services.

The chip is designed primarily for AI inference, the process of running trained models to generate responses, recommendations, code, and other outputs. While model training has historically attracted the most attention because of its enormous computational requirements, many industry observers believe inference will become an increasingly important part of the AI economy as usage expands across consumer, enterprise, and agentic applications.

The announcement places OpenAI alongside a growing group of technology companies that have invested in custom silicon. Google developed its Tensor Processing Units, or TPUs, to support AI workloads, while Amazon Web Services and Microsoft have also pursued custom AI chip strategies aimed at reducing dependence on third-party suppliers.

For OpenAI, the move comes as demand for AI services continues to grow. The company has relied heavily on graphics processing units supplied by NVIDIA and, increasingly, hardware from AMD. Custom silicon offers the potential to lower inference costs, improve energy efficiency, and provide more direct control over hardware development.

The focus on inference is notable because inference and training represent different economic challenges. Training frontier AI models requires massive bursts of computing power during development. Inference, by contrast, occurs every time a user submits a prompt, an enterprise application invokes an AI service, or an autonomous agent performs a task.

As AI systems become more deeply embedded in software, search, coding tools, and enterprise workflows, demand for inference is expected to increase substantially. That has prompted many AI companies and cloud providers to seek hardware optimized for those workloads.

Broadcom has emerged as a significant player in that market by helping major technology companies design application-specific integrated circuits (ASICs) tailored to their requirements. Unlike general-purpose processors, ASICs are designed for specific workloads and can often deliver better performance-per-watt and lower operating costs.

Jalapeño appears to be the first major product resulting from a broader strategic collaboration between OpenAI and Broadcom, announced in 2025. At the time, the companies said they planned to work together on AI accelerators and networking technologies intended to support OpenAI's growing infrastructure needs.

The announcement does not signal an end to OpenAI's use of third-party hardware. Industry reports indicate the company will continue to deploy NVIDIA and AMD products alongside custom silicon, reflecting a strategy focused on supplementing rather than replacing existing infrastructure.

One aspect of the project that has drawn attention is OpenAI's reported use of its own AI models to assist portions of the chip-design process. While AI-assisted chip design is still an emerging field, several semiconductor companies have begun exploring how to use machine learning to accelerate design, verification, and optimization.

The broader significance of Jalapeño may extend beyond the chip itself. Over the past several years, OpenAI has evolved from a research organization into a company operating across multiple layers of the AI ecosystem, including models, developer platforms, enterprise software, infrastructure partnerships, and data center investments.

The custom processor suggests OpenAI is pursuing a strategy increasingly similar to that of large cloud providers, which seek to optimize hardware, software, networking, and services as an integrated system.

That approach reflects a growing belief across the industry that competitive advantage in AI may depend not only on model capabilities, but also on control of the infrastructure that delivers those capabilities at scale.

As AI adoption expands, the economics of inference, energy consumption, and infrastructure efficiency are likely to become increasingly important. Jalapeño represents one example of how AI companies are responding by investing in specialized hardware designed for their own workloads.

Whether the strategy ultimately reduces costs or creates a meaningful competitive advantage remains to be seen. For now, the launch underscores how the AI race is increasingly being fought not just in models, but across the entire technology stack that supports them.

About the Author

John K. Waters is the editor in chief of a number of Converge360.com sites, with a focus on high-end development, AI and future tech. He's been writing about cutting-edge technologies and culture of Silicon Valley for more than two decades, and he's written more than a dozen books. He also co-scripted the documentary film Silicon Valley: A 100 Year Renaissance, which aired on PBS.  He can be reached at [email protected].

Featured