Batch Processing

An operations team needs to process a dataset of variable size, but the processing service fails on payloads larger than a few hundred records. The input must be split into manageable chunks, each chunk processed independently, and the results combined into a unified summary, without losing track of which chunk failed if one does.

Pipeline

[bp_prepare_batches]
         |
         v
+── loop ───────────────+
|  [bp_process_batch]   |
+───────────────────────+
         |
         v
[bp_summarize]

Workflow inputs: records, batchSize

Workers

PrepareBatchesWorker (task: bp_prepare_batches)

Prepares batches from input records by splitting them into chunks.

Applies Math.ceil(), clamps with Math.min()
Reads records, batchSize. Writes totalRecords, totalBatches, batchSize, batches
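
The splitting step described above can be sketched as follows. This is a minimal illustration, not the repository's worker: the class name `PrepareBatchesSketch` and its method names are hypothetical, and it assumes the batch count is the ceiling of totalRecords / batchSize and that Math.min() clamps the end index of the last (possibly short) batch.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical sketch of the chunking logic; not the actual PrepareBatchesWorker.
final class PrepareBatchesSketch {

    /** Number of batches needed to cover totalRecords at the given batchSize. */
    static int totalBatches(int totalRecords, int batchSize) {
        if (batchSize <= 0) {
            throw new IllegalArgumentException("batchSize must be positive");
        }
        // Round up so a partial final batch still counts as a batch.
        return (int) Math.ceil((double) totalRecords / batchSize);
    }

    /** Splits records into consecutive chunks of at most batchSize elements. */
    static <T> List<List<T>> split(List<T> records, int batchSize) {
        int total = records.size();
        int count = totalBatches(total, batchSize);
        List<List<T>> batches = new ArrayList<>(count);
        for (int i = 0; i < count; i++) {
            int start = i * batchSize;
            // Clamp the end index so the last batch does not run past the input.
            int end = Math.min(start + batchSize, total);
            batches.add(new ArrayList<>(records.subList(start, end)));
        }
        return batches;
    }
}
```

With five records and a batchSize of 2, this sketch yields three batches of sizes 2, 2, and 1.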

ProcessBatchWorker (task: bp_process_batch)

Processes a single batch of records: validates fields, normalizes strings, and computes per-record status. This does real per-item transformation work.

Trims whitespace, clamps with math.min(), filters with predicates
Reads iteration, batchSize, totalRecords, batches. Writes batchIndex, processedCount, rangeStart, rangeEnd, processedItems
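
A rough sketch of how one loop iteration might derive its outputs from the inputs listed above. Several details here are assumptions rather than facts from the repository: `ProcessBatchSketch` is a hypothetical name, iteration is taken to be zero-based, rangeEnd is treated as exclusive, records are modeled as plain strings, and the only predicate applied is "non-null and non-blank after trimming".

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical sketch of one batch iteration; not the actual ProcessBatchWorker.
final class ProcessBatchSketch {

    static Map<String, Object> process(int iteration, int batchSize,
                                       int totalRecords, List<List<String>> batches) {
        // Assumption: iteration is zero-based and rangeEnd is exclusive.
        int rangeStart = Math.min(iteration * batchSize, totalRecords);
        int rangeEnd = Math.min(rangeStart + batchSize, totalRecords);

        // An out-of-range iteration is treated as an empty batch.
        List<String> raw = (iteration >= 0 && iteration < batches.size())
                ? batches.get(iteration)
                : Collections.<String>emptyList();

        List<String> processedItems = new ArrayList<>();
        for (String item : raw) {
            if (item == null) {
                continue; // predicate: skip missing values
            }
            String trimmed = item.trim(); // normalize surrounding whitespace
            if (!trimmed.isEmpty()) {     // predicate: skip blank values
                processedItems.add(trimmed);
            }
        }

        Map<String, Object> output = new LinkedHashMap<>();
        output.put("batchIndex", iteration);
        output.put("processedCount", processedItems.size());
        output.put("rangeStart", rangeStart);
        output.put("rangeEnd", rangeEnd);
        output.put("processedItems", processedItems);
        return output;
    }
}
```

Returning the range alongside the batch index means a failed iteration can be traced back to the exact slice of the input it covered.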

SummarizeWorker (task: bp_summarize)

Summarizes batch processing results.

Reads totalRecords, iterations. Writes summary
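
The README does not specify the shape of summary, so the following is only an illustration of combining the two inputs into a single value. The class name `SummarizeSketch` and the message format are hypothetical.

```java
// Hypothetical sketch of the summary step; not the actual SummarizeWorker.
final class SummarizeSketch {

    /** Builds a one-line summary from the record count and the number of loop iterations. */
    static String summarize(int totalRecords, int iterations) {
        if (totalRecords < 0 || iterations < 0) {
            throw new IllegalArgumentException("counts must be non-negative");
        }
        // Assumed format; the real worker may emit a different structure.
        return String.format("Processed %d records in %d batches", totalRecords, iterations);
    }
}
```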

30 tests | Workflow: batch_processing | Timeout: 60s

See RUNNING.md for setup and usage.
