Skip to content

Speech to Text Node

AI/Generative/Audio

Speech to Text

Transcribes or translates audio with an existing provider Bit.

ai_audio_speech_to_textmediaLong running
Inputs4
Outputs5
Security exposure5/10
Packagemedia

Ratings

Scores range from 0 to 10. Higher values mean more impact, exposure, or operational weight.

SecurityAttack surface and exposure impact.
5/10Medium
PrivacyPotential sensitivity of processed data.
6/10Medium
PerformanceRuntime or resource pressure.
5/10Medium
GovernancePolicy, audit, or compliance impact.
4/10Medium
ReliabilityOperational stability considerations.
4/10Medium
CostExternal or compute cost impact.
7/10Low

Input Pins

4

Input

Execution
exec_in

Trigger transcription

Provider

Struct
provider

Existing provider Bit

BitBit19 fields
idstring
default ""
typeBitTypes
enum "Llm", "Vlm", "Tts", "Stt"...default "Other"
metaMap<string, Metadata>
default {}
*Metadatamap value
namestringrequired
descriptionstringrequired
long_descriptionstring | null
release_notesstring | null
tagsArray<string>required
itemsstringarray item
+11 more fields
authorsArray<string>
default []
itemsstringarray item
repositorystring | null
default null
download_linkstring | null
default null
file_namestring | null
default null
hashstring
default ""
sizeinteger | null
format uint64default nullmin 0
hubstring
default ""
parametersvalue
default null
versionstring | null
default null
licensestring | null
default null
dependenciesArray<string>
default []
itemsstringarray item
dependency_tree_hashstring
default ""
createdstring
default ""
updatedstring
default ""
model_slugstring | null
default null
+1 more fields
Schema enforced

Audio

Struct
audio

Audio FlowPath

FlowPathFlowPath3 fields
pathstringrequired
store_refstringrequired
cache_store_refstring | null
Schema enforced

Provider Options

Struct
provider_options

Typed provider-specific speech-to-text options

SpeechToTextProviderOptionsSpeechToTextProviderOptions4 fields
variant 1objectvariant
providerconstrequired
const "default"
variant 2objectvariant
providerconstrequired
const "open_ai_compatible"
optionsOpenAiCompatibleSpeechToTextOptionsrequired
promptstring | null
default null
languagestring | null
default null
response_formatSpeechResponseFormat
enum "json", "text", "verbose_json", "srt"...default "json"
translateboolean
default false
variant 3objectvariant
providerconstrequired
const "xai"
optionsXaiSpeechToTextOptionsrequired
promptstring | null
default null
languagestring | null
default null
variant 4objectvariant
providerconstrequired
const "google"
optionsGoogleSpeechToTextOptionsrequired
promptstring | null
default null
Default {"provider":"default"}
Schema enforced

Output Pins

5

Output

Execution
exec_out

Done

Text

String
text

Transcript text

Message

Struct
message

Transcript as a user HistoryMessage

HistoryMessageHistoryMessage6 fields
roleRolerequired
enum "system", "user", "assistant", "function"...
contentMessageContentrequired
variant 1stringvariant
variant 2Array<Content>variant
itemsContentarray item
variant 1objectvariant
variant 2objectvariant
variant 3objectvariant
variant 4objectvariant
variant 5objectvariant
namestring | null
tool_callsarray | null
itemsToolCallarray item
idstringrequired
typestringrequired
functionToolCallFunctionrequired
namestringrequired
argumentsstringrequired
tool_call_idstring | null
annotationsarray | null
itemsAnnotationarray item
typestringrequired
url_citationanyOf (2)
variant 1UrlCitationvariant
variant 2nullvariant

History

Struct
history

Transcript wrapped in History

HistoryHistory19 fields
modelstringrequired
messagesArray<HistoryMessage>required
itemsHistoryMessagearray item
roleRolerequired
enum "system", "user", "assistant", "function"...
contentMessageContentrequired
variant 1stringvariant
variant 2Array<Content>variant
namestring | null
tool_callsarray | null
itemsToolCallarray item
tool_call_idstring | null
+1 more fields
presetstring | null
streamboolean | null
stream_optionsanyOf (2)
variant 1StreamOptionsvariant
include_usagebooleanrequired
variant 2nullvariant
max_completion_tokensinteger | null
format uint32min 0
top_pnumber | null
format float
temperaturenumber | null
format float
thinkinganyOf (2)
variant 1HistoryThinkingvariant
enum "off", "low", "mid", "high"
variant 2nullvariant
seedinteger | null
format uint32min 0
presence_penaltynumber | null
format float
frequency_penaltynumber | null
format float
userstring | null
stoparray | null
itemsstringarray item
response_formatanyOf (2)
variant 1ResponseFormatvariant
variant 1stringvariant
variant 2nullvariant
ninteger | null
format uint32min 0
toolsarray | null
itemsToolarray item
typeToolTyperequired
enum "function"
functionHistoryFunctionrequired
namestringrequired
descriptionstring | null
parametersHistoryFunctionParametersrequired
tool_choiceanyOf (2)
variant 1ToolChoicevariant
variant 1nullvariant
variant 2nullvariant
variant 3nullvariant
variant 4objectvariant
typeToolTyperequired
enum "function"
functionHistoryFunctionrequired
variant 2nullvariant
+1 more fields

Metadata

Struct
metadata

Transcription metadata

Node Info

Internal name
ai_audio_speech_to_text
Category
AI/Generative/Audio
Version
3