
feat: concurrent task execution with DAG scheduler#288

Draft
branchseer wants to merge 11 commits into main from 03-20-feat_concurreny

Conversation

@branchseer
Member

Summary

  • Replace sequential task execution with a concurrent DAG scheduler using FuturesUnordered + tokio::sync::Semaphore (limit 10 per graph level)
  • Each nested Expanded graph gets its own semaphore for independent concurrency limits
  • Failure propagation via CancellationToken — any task failure immediately kills all in-flight processes
  • Add barrier test tool (cross-platform, fs.watch-based) for concurrency testing
  • Add e2e tests proving concurrent execution and kill-on-failure behavior

Test plan

  • All existing e2e snapshot tests pass (10 consecutive runs)
  • New concurrent-execution fixture proves independent tasks run concurrently (barrier with 2 participants — would timeout if sequential)
  • Failure test proves cancellation kills concurrent tasks (task b hangs on stdin after barrier, killed when task a fails)
  • cargo clippy clean
  • cargo test -p vite_task unit tests pass

🤖 Generated with Claude Code

branchseer and others added 2 commits March 23, 2026 11:40
…n_with_tracking`

Accept a `tokio_util::sync::CancellationToken` in the spawn pipeline so
in-flight child processes can be killed when cancellation is signalled.

For fspy-tracked processes, the token is passed into fspy's background
task which selects between `child.wait()` and `token.cancelled()`. For
plain tokio processes, `spawn_with_tracking` spawns its own background
task with the same pattern. Both kill the child via `Child::start_kill()`
then await normal exit — no PID-based killing. The read loop needs no
cancellation branch: killing the child closes its pipes and reads return
EOF naturally.

`spawn_inherited` uses the select pattern directly since it has no
read loop.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Replace the sequential execution loop with a concurrent DAG scheduler.
Independent tasks now run in parallel, bounded by a per-graph semaphore
(limit 10). Failure cancels all in-flight tasks via CancellationToken.

- Use FuturesUnordered + tokio::sync::Semaphore for bounded concurrency
- Each nested ExecutionGraph (Expanded items) gets its own semaphore
- Wrap reporter in RefCell for shared access from concurrent futures
- On failure, close semaphore so pending acquires fail immediately
- Add `barrier` test tool (fs.watch-based cross-platform barrier)
- Add e2e tests proving concurrency (barrier) and kill-on-failure
- Stabilize existing e2e tests by adding deps between independent packages

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

branchseer and others added 3 commits March 23, 2026 19:39
- Collapse nested if-let + if in spawn_node (clippy::collapsible_if)
- Use setInterval instead of stdin.resume() in barrier --hang for
  cross-platform reliability (stdin.resume may not keep process alive
  on Windows in PTY environments)

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
On Windows, TerminateProcess only kills the direct child, leaving
grandchildren (e.g., node.exe spawned by a .cmd shim) alive. This
caused the "failure kills concurrent tasks" e2e test to timeout.

Add a Win32 Job Object with JOB_OBJECT_LIMIT_KILL_ON_JOB_CLOSE to
spawn_inherited. The child process and all its descendants are assigned
to the job; when the handle drops, the entire tree is killed.

This makes the kill-on-failure test cross-platform (no platform split).

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
- Remove unused DWORD import
- Use pub(super) visibility for OwnedHandle and assign_child_to_kill_on_close_job

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
@branchseer branchseer marked this pull request as draft March 23, 2026 13:11
branchseer and others added 6 commits March 23, 2026 21:27
Use the child's existing process handle directly via AsRawHandle
instead of re-opening it by PID. Simpler, removes the OpenProcess
call and PROCESS_SET_QUOTA/PROCESS_TERMINATE permissions.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
- Apply assign_to_kill_on_close_job to spawn_with_tracking (piped/fspy path)
  in addition to spawn_inherited, covering all spawn modes on Windows
- Expose duplicated process_handle on fspy::TrackedChild (Windows) so callers
  can assign it to a Job Object without modifying fspy internals
- Use DuplicateHandle (via try_clone_to_owned) so the handle stays valid after
  tokio closes its copy when the process exits
- Add "failure kills concurrent cached tasks" e2e test exercising the --cache
  (piped stdio / fspy) path

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
On Windows with piped stdio, killing the direct child (cmd.exe) doesn't
close pipes held by grandchild processes (node.exe). The pipe read loop
in spawn_with_tracking blocks forever waiting for EOF.

Fix: add a cancellation branch to the pipe read loop that calls
TerminateJobObject to kill the entire process tree, closing all pipes.
Also add TerminateJobObject import and terminate() method on OwnedJobHandle.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
The `else => break` in tokio::select! only fires when ALL other arms are
disabled. The `cancelled()` arm stays pending (not disabled) even when
both pipes have EOF'd, preventing `else` from ever triggering. This
caused every piped-stdio task to hang on Windows.

Fix: check the exit condition (both pipes done) at the top of the loop
instead of relying on the `else` arm.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Remove the background tokio::spawn for the non-fspy path. Instead,
handle cancellation directly in the pipe read loop alongside pipe reads.
This eliminates:
- The WaitState enum and background task indirection
- The cancellation_for_pipes token clone
- The need for Send on the Job Object handle

The pipe read loop now has a unified cancellation arm (all platforms)
that kills the direct child and terminates the Job Object on Windows.
The exit condition is checked at the top of the loop to avoid the
tokio::select! else-arm issue with the always-pending cancelled() future.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Add cancellation-aware wait after pipe reads in spawn_with_tracking.
If a child closes stdout/stderr but stays alive (e.g., daemonizes),
the pipe reads EOF but child.wait() would block forever without
cancellation support.

Add --daemonize flag to barrier test tool and e2e test verifying
that daemonized concurrent tasks are properly killed on failure.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>