Thinking Machines demonstrated a working preview of AI systems capable of near-realtime voice and video conversation, moving beyond the turn-based interaction model that defines current AI assistants.
Today's AI operates in discrete exchanges: a user submits a prompt and waits for the model to generate a complete response. This latency, which can stretch from milliseconds to hours for complex queries, imposes a conversational rhythm unlike anything in human dialogue.
Thinking Machines' new "interaction models" compress this cycle substantially. The system responds to voice and video inputs with minimal delay, enabling fluid back-and-forth exchanges that more closely mirror natural conversation. It processes human input continuously, beginning to generate output before a prompt is complete rather than waiting for each discrete turn to finish.
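The difference between the two modes can be sketched in a few lines. This is an illustrative toy only: the function names and event structure are hypothetical, and nothing here reflects Thinking Machines' actual system. It simply contrasts a loop that stays silent until the full utterance arrives with one that responds incrementally as each chunk comes in.

```python
# Hypothetical sketch: turn-based vs. continuous interaction loops.
# Input "chunks" stand in for fragments of a spoken utterance.

def turn_based(chunks):
    """Collect the full utterance, then emit a single response."""
    buffer = []
    for chunk in chunks:
        buffer.append(chunk)  # model is idle while input arrives
    return [("response", " ".join(buffer))]

def continuous(chunks):
    """Emit partial responses while the utterance is still arriving."""
    events, context = [], []
    for chunk in chunks:
        context.append(chunk)
        # respond incrementally to whatever has been heard so far
        events.append(("partial", " ".join(context)))
    events.append(("final", " ".join(context)))
    return events

utterance = ["book", "a", "table", "for", "two"]
print(turn_based(utterance))   # one event, only after all input
print(continuous(utterance))   # a partial per chunk, then a final
```

The turn-based loop produces exactly one event, after the user has finished; the continuous loop produces output at every step, which is what makes the exchange feel fluid rather than query-and-wait.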
The distinction matters for practical applications. Current AI struggles with jobs requiring real-time interaction, negotiation, or immediate feedback loops. A customer service representative powered by existing AI models would feel stilted. Real-time capable systems could handle live customer conversations, conduct interviews, provide coaching, or facilitate collaborative work without the jarring pauses that break conversational flow.
The preview suggests that latency, not capability, has been the primary barrier to more human-like AI interaction. Reducing response times to near-instantaneous levels changes the user experience fundamentally. It shifts AI from a tool you query to a participant in genuine dialogue.
Technical challenges remain. Maintaining context across rapid exchanges, handling interruptions gracefully, and managing computational load for simultaneous processing all present obstacles. Early previews show promise but don't guarantee production readiness across diverse use cases.
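Interruption handling, one of the challenges named above, is easy to state and hard to do well: the assistant must yield the floor the moment the user starts speaking. A minimal sketch of this "barge-in" behavior, with hypothetical names and no real speech stack, might look like:

```python
# Hypothetical sketch of barge-in: stop streaming a response as soon
# as new user speech is detected. No real audio API is used here.

def speak_with_barge_in(response_tokens, user_speaks_at=None):
    """Stream a response token by token; abort if the user barges in.

    user_speaks_at: token index at which new user speech is detected
    (None means the user stays silent for the whole response).
    """
    spoken = []
    for i, token in enumerate(response_tokens):
        if user_speaks_at is not None and i >= user_speaks_at:
            return spoken, "interrupted"  # yield the floor immediately
        spoken.append(token)
    return spoken, "completed"

print(speak_with_barge_in(["Sure,", "I", "can", "help"]))
print(speak_with_barge_in(["Sure,", "I", "can", "help"], user_speaks_at=2))
```

A production system would also need to decide what to do with the abandoned half-response, which is where maintaining context across rapid exchanges becomes the harder problem.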
The shift toward continuous interaction models represents a practical evolution rather than a conceptual breakthrough. It acknowledges that even sophisticated AI falls flat when users must wait awkwardly for responses. Companies racing to deploy AI in customer-facing roles understand this intimately.
If Thinking Machines delivers production systems with near-realtime responsiveness across voice and video, the turn-based interaction model that defines today's assistants may quickly come to feel like a transitional stage rather than the default.
