Skip to content

Audio to Mel Spectrogram Node

AI/ML/ONNX/Audio

Audio to Mel Spectrogram

Convert audio to mel spectrogram for speech models

audio_to_mel_spectrogramonnx
Inputs5
Outputs3
Security exposure10/10
Packageonnx

Ratings

Scores range from 0 to 10. Higher values mean more impact, exposure, or operational weight.

No score metadata has been set for this node yet.

Input Pins

5

Input

Execution
exec_in

Initiate Execution

Audio

Struct
audio

Input audio (16kHz mono)

AudioDataAudioData4 fields
sample_rateinteger:uint32required

Sample rate in Hz

format uint32min 0
channelsinteger:uint16required

Number of channels (1 = mono, 2 = stereo)

format uint16min 0max 65535
samplesArray<number:float>required

Audio samples (normalized to -1.0 to 1.0)

itemsnumber:floatarray item
format float
duration_secsnumber:floatrequired

Duration in seconds

format float

N Mels

Integer
n_mels

Number of mel bands

Default 80

Hop Length

Integer
hop_length

Hop length in samples

Default 160

N FFT

Integer
n_fft

FFT window size

Default 400

Output Pins

3

Output

Execution
exec_out

Done

Spectrogram

Generic
spectrogram

Mel spectrogram [n_mels, time]

Frames

Integer
frames

Number of time frames

Node Info

Internal name
audio_to_mel_spectrogram
Category
AI/ML/ONNX/Audio