Shayne Parmelee - LiveKit Blog

Prompting voice agents to sound more realistic

One of the most common questions we hear voice AI developers ask is: “Should I use a speech-to-speech (S2S) model or stick with the cascade (STT-LLM-TTS) approach?” What they’re really asking is: how do I make my agent sound more human? A cascaded pipeline can be just as fast