Input
Executionexec_in
Initiate Execution
AI/ML/ONNX/Audio
Detect speech segments in audio. Download Silero VAD model from: https://github.com/snakers4/silero-vad/raw/master/src/silero_vad/data/silero_vad.onnx
Scores range from 0 to 10. Higher values mean more impact, exposure, or operational weight.
Initiate Execution
ONNX VAD Model
Cache ID for Session
Input audio data
Sample rate in Hz
Number of channels (1 = mono, 2 = stereo)
Audio samples (normalized to -1.0 to 1.0)
Duration in seconds
Speech probability threshold
Minimum speech duration (ms)
Minimum silence duration (ms)
Done
VAD result
Speech segments detected
Start time in seconds
End time in seconds
Average confidence
Frame-level speech probabilities
Frame duration in seconds
Speech segments