Compatibility at a Glance
| Service | Works with Flows |
|---|---|
| Cascade LLMs (OpenAI, Anthropic, Gemini, AWS Bedrock, and OpenAI-compatible) | Yes |
Gemini Live (GeminiLiveLLMService, GeminiLiveVertexLLMService) | No |
OpenAI Realtime (OpenAIRealtimeLLMService) | No |
AWS Nova Sonic (AWSNovaSonicLLMService) | No |
| Grok S2S, Inworld S2S, Ultravox | No |
Why
Flows currently requires a cascade LLM service (STT → LLM → TTS). Native S2S support is currently being developed.Recommended Path: Use a Cascade Pipeline
If you want structured conversation flows today, build a cascade pipeline with a separate STT, LLM, and TTS service. Any cascade LLM that supports function calling works. Install Pipecat Flows along with Pipecat and the services you want to use. This example uses Deepgram (STT), Google Gemini (LLM), and Cartesia (TTS):FlowManager:
If You Specifically Need Realtime S2S
If speech-to-speech is a hard requirement, build with plain Pipecat (without Flows) and manage conversation state in your own code. The S2S service pages have everything you need to get started:Gemini Live
Realtime speech-to-speech with Google Gemini Live
OpenAI Realtime
Realtime speech-to-speech with OpenAI’s Realtime API