Create a parade flow for declarative data processing
flow.RdA flow defines a computational pipeline with typed stages that operate on a parameter grid. Each stage can depend on previous stages and produce typed outputs with optional error handling policies.
Arguments
- grid
A data frame or tibble containing parameter combinations
- seed_col
Optional column name for reproducible random seeds
- error
Error handling policy: "propagate" (default), "keep", "omit", or "stop"
- retries
Flow-level retry attempts per stage (default
0).- retry_backoff
Backoff strategy for retries:
"none","fixed","linear","exponential".- retry_base
Base delay in seconds for retry backoff.
- retry_on
Optional retry classifier.
NULLretries all errors; a character vector retries matching condition classes; a function receives the condition and should returnTRUE/FALSE.- cancel
Cancellation propagation mode used when
error = "propagate":"deps"(default),"all", or"none".- resources
Optional flow-level resource defaults as a named list with keys
cpus,memory,time(aliases:cpus_per_task,ncpus,mem).- cpus
Optional flow-level default CPUs per task.
- memory
Optional flow-level default memory (e.g.
"8G").- time
Optional flow-level default walltime (e.g.
"2:00:00").
Examples
# Create a simple flow
grid <- data.frame(x = 1:3, y = letters[1:3])
fl <- flow(grid)
print(fl)
#> <parade_flow>
#> Grid : 3 rows × 2 cols [x, y]
#> Stages : 0 []
#> Error : propagate
# Flow with seed column for reproducibility
fl_seed <- flow(grid, seed_col = "x")