AldeaAldea
ConceptsStreaming controls

VAD events

Voice Activity Detection events with vad_events=true

Voice Activity Detection (VAD) events notify your application when the API detects that someone has started speaking. This is useful for building UI indicators, triggering recording, or implementing push-to-talk workflows.

Usage

Enable VAD events by adding vad_events=true to the WebSocket query string:

wss://api.aldea.ai/v1/listen?vad_events=true&encoding=mp3

SpeechStarted message

When voice activity is detected, the server sends a SpeechStarted message:

{
  "type": "SpeechStarted",
  "channel": [0],
  "timestamp": 0.0
}
FieldDescription
typeAlways "SpeechStarted"
channelArray indicating which channel detected speech
timestampTime offset in seconds from the start of the stream

Next steps