Baseten, an AI inference platform, is closing in on a $1.5 billion funding round that values the company at $13 billion, according to reports. The raise comes just months after the startup's previous mega-round, reflecting investor appetite for infrastructure focused on running trained AI models at scale.

Inference, the computational process of deploying and executing AI models in production, has become a major battleground for venture capital. Companies racing to serve enterprises need fast, cost-effective ways to run models after training them. Baseten positions itself as a platform that simplifies inference deployment across different hardware and model architectures.

The timing highlights the shift in AI investment focus. While foundation model development commanded most attention in 2023 and 2024, the infrastructure layer for actually operating those models is now attracting serious capital. Baseten competes directly with other inference-focused startups and cloud providers racing to own the production deployment space.

The $13 billion valuation marks significant growth for a company focused on a relatively narrow slice of the AI stack. Investors appear convinced that inference will consume substantial enterprise spend as AI adoption scales beyond chatbots and into mission-critical business processes.

The "inference gold rush" mentality reflects real economics. Running large language models and other AI systems in production is expensive. Every inference call incurs computational costs that multiply across thousands or millions of user requests. Platforms that reduce latency, optimize hardware utilization, or cut costs per inference attract enterprise buyers with direct ROI benefits.

Baseten's back-to-back mega-rounds demonstrate how quickly valuations climb in hot sectors. The company now ranks among the most well-funded infrastructure startups in the AI ecosystem, sitting alongside players like Hugging Face and Mistral AI in terms of total capital raised.

Whether these valuations prove sustainable depends on execution. The inference market remains fragmented, with cloud