Skip to content

Split Dataset Node

AI/ML/Dataset

Split Dataset

Split a dataset into training and testing subsets

ai_ml_dataset_splitml
Inputs5
Outputs1
Security exposure4/10
Packageml

Ratings

Scores range from 0 to 10. Higher values mean more impact, exposure, or operational weight.

SecurityAttack surface and exposure impact.
4/10Medium
PrivacyPotential sensitivity of processed data.
5/10Medium
PerformanceRuntime or resource pressure.
4/10Medium
GovernancePolicy, audit, or compliance impact.
4/10Medium
ReliabilityOperational stability considerations.
3/10High
CostExternal or compute cost impact.
4/10Medium

Input Pins

5

Input

Execution
exec_in

Execution trigger that starts the split

Split

Float
split

Ratio used for assigning rows to the training set (rest goes to test)

Default 0.8
Range 0 to 1

Data Source

Struct
source

Data Source (DB or CSV)

NodeDBConnectionNodeDBConnection1 fields
cache_keystringrequired
Schema enforced

Training Database

Struct
train

Destination database connection that receives the training rows

NodeDBConnectionNodeDBConnection1 fields
cache_keystringrequired
Schema enforced

Test Database

Struct
test

Destination database connection that receives the testing rows

NodeDBConnectionNodeDBConnection1 fields
cache_keystringrequired
Schema enforced

Output Pins

1

Done

Execution
exec_out

Activated once the split has finished

Node Info

Internal name
ai_ml_dataset_split
Category
AI/ML/Dataset