Skip to content

Extract Text Node

Document/PDF

Extract Text

Extract all text content from a PDF document.

pdf_extract_textmedia
Inputs2
Outputs2
Security exposure3/10
Packagemedia

Ratings

Scores range from 0 to 10. Higher values mean more impact, exposure, or operational weight.

SecurityAttack surface and exposure impact.
3/10High
PrivacyPotential sensitivity of processed data.
4/10Medium
PerformanceRuntime or resource pressure.
5/10Medium
GovernancePolicy, audit, or compliance impact.
2/10High
ReliabilityOperational stability considerations.
4/10Medium
CostExternal or compute cost impact.
0/10High

Input Pins

2

Input

Execution
exec_in

Trigger

Template

Struct
template

PDF file

FlowPathFlowPath3 fields
pathstringrequired
store_refstringrequired
cache_store_refstring | null
Schema enforced

Output Pins

2

Done

Execution
exec_out

Continues

Text

String
text

Extracted text

Node Info

Internal name
pdf_extract_text
Category
Document/PDF