Input
Execution (exec_in)
Initiate Execution
AI/ML/ONNX/Audio
Convert audio to mel spectrogram for speech models
Input audio (16 kHz mono)
Sample rate in Hz
Number of channels (1 = mono, 2 = stereo)
Audio samples (normalized to the range -1.0 to 1.0)
Duration in seconds
Number of mel bands
Hop length in samples
FFT window size
Done
Mel spectrogram [n_mels, time]
Number of time frames
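
The pins above can be read as a short pipeline: window the samples, take a power spectrum per frame, and project it onto triangular mel filters. The sketch below illustrates that flow and how the frame count follows from the FFT window size and hop length. It is not this node's actual implementation: the function names, the HTK-style mel formula, the Hann window, and the no-padding frame convention are all assumptions made for illustration, and it uses only the Python standard library (a naive DFT) so it runs anywhere, at the cost of speed.

```python
import cmath
import math


def hz_to_mel(hz):
    # HTK-style mel scale (assumed; other conventions exist).
    return 2595.0 * math.log10(1.0 + hz / 700.0)


def mel_to_hz(mel):
    return 700.0 * (10.0 ** (mel / 2595.0) - 1.0)


def num_frames(n_samples, n_fft, hop_length):
    # Assumed convention: no padding, only full windows are counted.
    if n_samples < n_fft:
        return 0
    return 1 + (n_samples - n_fft) // hop_length


def mel_filterbank(sample_rate, n_fft, n_mels):
    # Triangular filters whose edges are evenly spaced on the mel scale
    # between 0 Hz and the Nyquist frequency.
    n_bins = n_fft // 2 + 1
    mel_max = hz_to_mel(sample_rate / 2.0)
    edges = [mel_to_hz(mel_max * i / (n_mels + 1)) for i in range(n_mels + 2)]
    bin_hz = [k * sample_rate / n_fft for k in range(n_bins)]
    bank = []
    for m in range(n_mels):
        lo, mid, hi = edges[m], edges[m + 1], edges[m + 2]
        row = []
        for f in bin_hz:
            rising = (f - lo) / (mid - lo)
            falling = (hi - f) / (hi - mid)
            row.append(max(0.0, min(rising, falling)))
        bank.append(row)
    return bank


def power_spectrum(frame):
    # Naive DFT over the non-negative frequencies; O(n^2), fine for a demo.
    n = len(frame)
    out = []
    for k in range(n // 2 + 1):
        acc = sum(x * cmath.exp(-2j * math.pi * k * t / n)
                  for t, x in enumerate(frame))
        out.append(abs(acc) ** 2)
    return out


def mel_spectrogram(samples, sample_rate, n_mels, hop_length, n_fft):
    # Returns (mel, frames) where mel is a list of n_mels rows,
    # each holding one value per time frame: shape [n_mels, time].
    frames = num_frames(len(samples), n_fft, hop_length)
    window = [0.5 - 0.5 * math.cos(2.0 * math.pi * i / n_fft)
              for i in range(n_fft)]  # Hann window (assumed)
    bank = mel_filterbank(sample_rate, n_fft, n_mels)
    mel = [[0.0] * frames for _ in range(n_mels)]
    for t in range(frames):
        start = t * hop_length
        frame = [samples[start + i] * window[i] for i in range(n_fft)]
        power = power_spectrum(frame)
        for m in range(n_mels):
            mel[m][t] = sum(w * p for w, p in zip(bank[m], power))
    return mel, frames
```

Under the no-padding convention assumed here, one second of 16 kHz audio with an FFT window of 400 samples and a hop of 160 samples gives 1 + (16000 - 400) // 160 = 98 frames. Implementations that pad the signal before framing will report a different count, so the "Number of time frames" output should be treated as the authoritative value. Speech models also commonly apply a log to the mel energies afterwards; that step is not shown because the source does not say whether this node does so.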