Claude/daily sdk update action by jeremyeder · Pull Request #661 · ambient-code/platform

jeremyeder · 2026-02-20T07:12:40Z

I created this from my phone.

Adds a scheduled workflow that runs daily at 8 AM UTC to check PyPI for the latest claude-agent-sdk version and opens a PR if a newer version is available. https://claude.ai/code/session_01AF1ro2VAMS7WSb7QayTetZ

- Replace pip index with PyPI JSON API (curl+jq) — removes setup-python step - Add version format validation to reject unexpected PyPI responses - Use sort -V for proper semver comparison — prevents downgrade PRs - Escape dots in sed regex to avoid wildcard matches - Merge update and validate steps into one - Build PR body in a variable instead of inline heredoc - Add failure case to step summary - Use POSIX = instead of bash == in test expressions https://claude.ai/code/session_01AF1ro2VAMS7WSb7QayTetZ

- Pin actions/checkout to commit SHA (de0fac2e, v6.0.2) for supply chain security - Move all ${{ steps.*.outputs.* }} into env: blocks to prevent expression injection - Add concurrency group to prevent parallel workflow races - Add timeout-minutes: 15 to the job - Add --max-time 30 to curl for PyPI requests - Replace silent 2>/dev/null || true with diagnostic echo on branch delete - Remove --label "dependencies" flag (label may not exist in repo) https://claude.ai/code/session_01AF1ro2VAMS7WSb7QayTetZ

github-actions · 2026-02-20T07:28:03Z

Claude Code Review

Summary

This PR adds a scheduled GitHub Actions workflow (daily-sdk-update.yml) that checks PyPI daily for new claude-agent-sdk versions and auto-opens a PR when one is found. The workflow is well-structured with good security practices from its three-commit iteration history. Two issues need attention before this is fully reliable in production.

Issues by Severity

🚫 Blocker Issues

None.

🔴 Critical Issues

1. uv.lock not updated in auto-generated PRs

The workflow updates pyproject.toml (e.g., >=0.1.23 → >=0.1.24) but does not regenerate uv.lock. The lockfile currently pins claude-agent-sdk at version 0.1.23 with hashes. After a bump, uv.lock would still contain version = "0.1.23" while pyproject.toml requires >=0.1.24 — making them inconsistent.

Impact:

Any developer running uv sync --frozen (reproducible local setup) gets a resolution conflict and a broken environment.
The lockfile would accumulate drift with each auto-update until someone regenerates it manually.

Fix — add a uv lock step before committing in Create branch, commit, and open PR:

- name: Regenerate uv.lock
  if: steps.check.outputs.needs_update == 'true' && steps.existing_pr.outputs.pr_exists == 'false'
  run: |
    pip install uv
    cd components/runners/claude-code-runner
    uv lock

Then include uv.lock in the commit:

git add components/runners/claude-code-runner/pyproject.toml \
        components/runners/claude-code-runner/uv.lock

Note: The Dockerfile uses pip install (not uv sync --frozen) so builds still pass today — but this is a latent breakage for local dev workflows.

2. Auto-generated PRs will not trigger CI workflows

PRs opened with GITHUB_TOKEN do not trigger pull_request-based workflow runs (GitHub limitation). This means runner-tests.yml and components-build-deploy.yml will not run on the auto-generated PR automatically — reviewers would have to push an empty commit or manually trigger them.

The test plan checkboxes in the PR body (Runner tests pass, Container image builds) imply CI should validate these, but it won't run automatically.

Options:

Use a PAT stored as a secret (e.g., secrets.BOT_TOKEN) in the token: field of the checkout step and the GITHUB_TOKEN env for gh pr create. This allows PR events to trigger workflows.
Document in the PR body that CI must be manually triggered.

🟡 Major Issues

3. actions/checkout SHA comment may be inaccurate

uses: actions/checkout@de0fac2e4500dabe0009e67214ff5f5447ce83dd # v6.0.2

SHA pinning is the correct security practice, but the comment v6.0.2 should be verified against the upstream actions/checkout repository to confirm de0fac2e is the actual commit for that tag. A wrong comment is harmless but misleading during audits. The other workflows in this repo use actions/checkout@v6 (unpinned), so the tag version lineage is unclear.

🔵 Minor Issues

4. PR_EXISTS unset in Summary when existing_pr step is skipped

If needs_update == 'false', the existing_pr step is skipped and steps.existing_pr.outputs.pr_exists is empty. In the Summary step, PR_EXISTS will be an empty string. The condition chain still works correctly (the NEEDS_UPDATE=false branch fires first), but it's fragile. Adding a default to the env block would be defensive:

PR_EXISTS: ${{ steps.existing_pr.outputs.pr_exists || 'false' }}

5. Race condition window between PR check and PR creation

The concurrency: cancel-in-progress: false correctly serializes workflow runs, but if two runs are somehow queued and both check for an existing PR before either creates it (unlikely given the serial concurrency), a duplicate PR could be opened. This is extremely low probability for a daily cron, but gh pr create would fail gracefully with a non-zero exit if a PR already exists for the branch, so the practical impact is minimal.

6. No auto-merge for auto-generated dependency PRs

The repo already auto-merges Dependabot patch/minor PRs via dependabot-auto-merge.yml. For consistency, the auto-generated SDK PRs (which are semantically equivalent to minor dependency bumps) could also be set to auto-merge if CI passes, reducing manual overhead. This is a design choice, not a bug.

Positive Highlights

SHA pinning on actions/checkout is a good supply chain security practice; most other workflows in this repo use unpinned @v6.
Version format validation (grep -qE '^[0-9]+(\.[0-9]+)+$') prevents a malformed PyPI response from propagating into sed or the commit message.
sort -V for semver comparison correctly handles all cases including equal versions and prevents accidental downgrades.
All ${{ steps.*.outputs.* }} expressions moved to env: blocks — this is the correct defense against expression injection (CVE class: workflow injection via untrusted step outputs).
Idempotency — branch deletion before re-creation and open-PR check together prevent duplicate PRs and stale branches.
concurrency group with cancel-in-progress: false correctly queues runs rather than dropping them.
timeout-minutes: 15 on the job prevents hanging runs from blocking the runner pool.
--max-time 30 on curl prevents hanging on a slow PyPI response.
Dot escaping in sed regex (CURRENT_ESC) prevents version dots from acting as wildcards in the substitution pattern.
Step Summary is comprehensive and covers all terminal states (no-op, existing PR, failure, success).

Recommendations

Priority order:

Fix uv.lock regeneration (Critical) — add uv lock + include uv.lock in the commit. This is the most impactful correctness fix.
Decide on CI triggering strategy (Critical) — either document the manual step requirement or switch to a PAT for the gh pr create call.
Verify the actions/checkout SHA comment (Major) — confirm de0fac2e = v6.0.2 in the upstream repo.
Add || 'false' default to PR_EXISTS (Minor) — defensive coding for the Summary step.

🔍 View AI decision process (logs available for 90 days)

📋 View memory system files loaded (click to expand)

What Amber Loaded for Code Review

Amber automatically loaded these repository standards from the memory system:

CLAUDE.md - Master project instructions, development standards
backend-development.md - Go backend, K8s integration patterns
frontend-development.md - NextJS, Shadcn UI, React Query patterns
security-standards.md - Auth, RBAC, token handling
k8s-client-usage.md - User token vs service account patterns
error-handling.md - Consistent error patterns
react-query-usage.md - Data fetching patterns

Impact: This review used your repository's specific code quality standards, security patterns, and best practices from the memory system (PRs #359, #360) - not just generic code review guidelines.

- Add uv lock regeneration step after pyproject.toml update to keep uv.lock in sync and prevent broken local dev environments - Include uv.lock in the auto-generated commit - Document GITHUB_TOKEN CI triggering limitation in auto-generated PR body (PRs created by GITHUB_TOKEN don't trigger pull_request workflows) - Add defensive default (|| 'false') for PR_EXISTS env in Summary step when existing_pr step is skipped - Fix PR body formatting via heredoc to avoid indentation artifacts https://claude.ai/code/session_01AF1ro2VAMS7WSb7QayTetZ

github-actions · 2026-02-21T02:57:24Z

Claude Code Review

Summary

This PR adds .github/workflows/daily-sdk-update.yml — a scheduled workflow that polls PyPI daily for claude-agent-sdk updates and opens a PR when a new version is available. After three rounds of iteration (4 commits total), the workflow has addressed nearly all issues from prior reviews: SHA pinning, expression injection via env: blocks, concurrency control, timeout, --max-time on curl, diagnostic error handling, and uv.lock regeneration.

This review covers the current final state of the file.

Issues by Severity

Blocker Issues

None.

Critical Issues

1. actions/checkout SHA/tag comment needs verification

uses: actions/checkout@de0fac2e4500dabe0009e67214ff5f5447ce83dd # v6.0.2

SHA pinning is correct security practice. However, de0fac2e4500dabe0009e67214ff5f5447ce83dd should be confirmed against the upstream actions/checkout repository. If the comment v6.0.2 is wrong, a future auditor will be misled during supply chain audits. This matters especially because the workflow carries contents: write and pull-requests: write permissions.

Verify with: gh api repos/actions/checkout/git/commits/de0fac2e4500dabe0009e67214ff5f5447ce83dd --jq .message

Major Issues

2. Auto-generated PRs do not trigger CI automatically — process gap not fully mitigated

The PR body documents this limitation clearly, but the test plan checkboxes (runner-tests, components-build-deploy) still require a human to manually kick off CI on every auto-generated SDK PR. For a daily automation meant to reduce toil, this is a consistent manual step on every update.

Clean fix: store a PAT with repo scope as secrets.BOT_TOKEN and use it in the checkout step token and gh pr create call. No other changes required.

If a PAT is not acceptable, document the manual trigger step explicitly in the team runbook so it is not silently skipped.

Minor Issues

3. pip install uv — no caching, no version pin

- name: Regenerate uv.lock
  run: |
    pip install uv
    cd components/runners/claude-code-runner
    uv lock

pip install uv downloads uv on every run without caching. astral-sh/setup-uv@v4 provides automatic caching and explicit version pinning for reproducibility:

- uses: astral-sh/setup-uv@v4
  with:
    version: '>=0.5'

4. Commit message body has leading whitespace from shell indentation

The continuation lines in the git commit -m string carry the shell's indentation into the Git commit body. git log --format="%B" will show body lines with leading spaces. A heredoc avoids this:

git commit -m "$(cat <<'MSGEOF'
chore(runner): update claude-agent-sdk to latest

Automated daily update of the Claude Agent SDK minimum version.

Release notes: https://pypi.org/project/claude-agent-sdk/
MSGEOF
)"

5. uv lock Python version is not pinned

uv lock uses whatever Python is on ubuntu-latest. A future runner image upgrade could silently change the resolved lockfile if Python version affects wheel selection or marker evaluation. Adding --python 3.11 matches the runner-tests.yml pin and the requires-python = ">=3.11" constraint in pyproject.toml.

Positive Highlights

After three rounds of iteration, the workflow is well-hardened. Specifically worth calling out in the current state:

All steps.*.outputs.* expressions in env: blocks — correct expression injection defense from round-1 feedback.
uv.lock regenerated and committed — resolves the critical lockfile drift issue from round-2; uv sync --frozen in local dev will no longer conflict.
concurrency: cancel-in-progress: false serializes daily runs without silently dropping queued ones.
timeout-minutes: 15 prevents hung jobs from consuming runner hours.
--max-time 30 on curl protects against a slow PyPI response hanging the job.
sort -V semver comparison handles multi-digit version components correctly and prevents accidental downgrades.
Dot-escaping in sed (CURRENT_ESC) prevents version dots from acting as regex wildcards — a common shell scripting mistake avoided correctly.
Post-sed verification catches silent no-ops.
PR_EXISTS output with || 'false' fallback correctly handles the skipped-step empty-value case in the Summary step.
Branch cleanup before re-creation avoids stale branch conflicts from prior merged/closed PRs.
Step summary covers all four terminal states: no-op, existing PR, failure, success.

Recommendations

[Critical] Verify de0fac2e4500dabe0009e67214ff5f5447ce83dd is the correct commit SHA for the intended actions/checkout tag and update the comment if needed.
[Major] Decide on CI triggering: add a secrets.BOT_TOKEN PAT to enable automatic pull_request workflow runs, or document the manual trigger step in the team runbook.
[Minor] Fix commit message indentation using a heredoc.
[Minor] Pin uv version and add --python 3.11 to uv lock.

The workflow is ready to merge pending SHA verification (item 1) and a decision on CI auto-triggering (item 2).

Memory system files loaded for this review

CLAUDE.md - Master project instructions, development standards
backend-development.md - Go backend, K8s integration patterns
frontend-development.md - NextJS, Shadcn UI, React Query patterns
security-standards.md - Auth, RBAC, token handling
k8s-client-usage.md - User token vs service account patterns
error-handling.md - Consistent error patterns
react-query-usage.md - Data fetching patterns

🔍 View AI decision process (logs available for 90 days)

📋 View memory system files loaded (click to expand)

What Amber Loaded for Code Review

Amber automatically loaded these repository standards from the memory system:

CLAUDE.md - Master project instructions, development standards
backend-development.md - Go backend, K8s integration patterns
frontend-development.md - NextJS, Shadcn UI, React Query patterns
security-standards.md - Auth, RBAC, token handling
k8s-client-usage.md - User token vs service account patterns
error-handling.md - Consistent error patterns
react-query-usage.md - Data fetching patterns

Impact: This review used your repository's specific code quality standards, security patterns, and best practices from the memory system (PRs #359, #360) - not just generic code review guidelines.

Replace `pip install uv` with `astral-sh/setup-uv` (SHA-pinned v7.3.0) which provides built-in caching keyed on uv.lock. This avoids re-downloading uv and re-resolving the dependency graph on each run. https://claude.ai/code/session_01AF1ro2VAMS7WSb7QayTetZ

github-actions · 2026-02-21T03:48:25Z

Claude Code Review

Summary

This PR adds a scheduled GitHub Actions workflow (daily-sdk-update.yml) that runs daily at 8 AM UTC to detect new claude-agent-sdk releases on PyPI and automatically opens a PR bumping the minimum version constraint in pyproject.toml alongside a regenerated uv.lock. The workflow is well-engineered overall - supply-chain hardening, expression injection prevention, and idempotency controls are all present. A few correctness and scope concerns are worth addressing before merge.

Issues by Severity

Blocker Issues

None.

Critical Issues

1. uv lock regenerates the full lock file, not just the target package

.github/workflows/daily-sdk-update.yml lines 130-133

The uv lock command resolves the whole dependency graph and will update any dependency that has a newer version satisfying its constraints. An auto-generated PR titled "update claude-agent-sdk" could silently update fastapi, anthropic, langfuse, etc. alongside the intended change, making review harder and broadening the blast radius.

Fix: Use the targeted flag to pin the upgrade to the intended package only:

uv lock --upgrade-package claude-agent-sdk

Major Issues

2. Missing set -o pipefail on the PyPI fetch step

.github/workflows/daily-sdk-update.yml line 34

The curl | jq pipeline exit code is taken from jq when curl fails. If curl exits non-zero (network error, 5xx), jq may succeed on empty input and return null. The explicit null-check guard catches this case, but only by accident - the step itself will not fail due to the curl error.

Fix: Add set -o pipefail at the top of the run: block so the pipeline fails fast on curl errors.

3. SHA comment for actions/checkout should be verified

.github/workflows/daily-sdk-update.yml line 26

SHA pinning is the correct security practice. However, the comment # v6.0.2 should be confirmed to match the actual tag - mislabeled comments propagate when copy-pasted and make audits misleading. Verify with:

gh api repos/actions/checkout/git/refs/tags/v6.0.2 --jq ".object.sha"

Minor Issues

4. No reviewer or team assigned to auto-generated PRs

gh pr create has no --reviewer flag. Auto-generated dependency PRs can sit unreviewed for days. Consider adding --reviewer <team-slug> for the runner component owners.

5. cache-dependency-glob points to a file that is regenerated in the same run

The cache key is computed from uv.lock before the "Regenerate uv.lock" step runs. The cache will never be a hit on a run that produces an update (the lock file changes). Low impact but worth noting.

6. Branch-delete failure is silenced, masking unexpected errors

The || echo pattern suppresses all errors, including unexpected ones (permissions, auth failures). Distinguishing "branch not found" (expected) from other errors would make failures easier to diagnose.

7. sed pattern is fragile if constraints ever include upper bounds

The sed pattern matches exactly the current version constraint format. If the constraint is ever written with an upper bound (e.g. claude-agent-sdk>=0.1.23,<2.0), the sed will silently fail (caught by the subsequent grep verification). Current format is compatible; worth noting for future maintainers.

Positive Highlights

SHA-pinned actions: Both actions/checkout and astral-sh/setup-uv are pinned to commit SHAs - good supply-chain security practice that is absent from several other workflows in this repo (e.g., amber-dependency-sync.yml uses @v6 without pinning).
Expression injection prevention: All template expression values are placed in env: blocks before use in run: scripts, preventing script injection attacks.
Version format validation: The version regex on the PyPI response guards against unexpected or malicious version strings before they reach sed or git.
Proper semver comparison with sort -V: Correct portable approach for comparing version strings without requiring Python or Node.
Idempotent PR creation: The existing-PR check prevents duplicate PRs when the workflow runs while a previous update is still unmerged.
uv.lock included in commit: Keeping the lock file in sync with pyproject.toml prevents broken local dev environments and ensures reproducible builds in CI.
Documented CI triggering limitation: The PR body clearly explains that GITHUB_TOKEN-created PRs do not auto-trigger workflows, proactively saving future confusion.
concurrency: cancel-in-progress: false: Correct for an update workflow - in-progress runs should complete rather than be cancelled.
Comprehensive summary step: The if: always() summary with GITHUB_STEP_SUMMARY gives clear human-readable output for all outcomes.
15-minute job timeout: Prevents runaway jobs from consuming Actions minutes indefinitely.

Recommendations

Prioritized action items:

(Critical) Change uv lock to uv lock --upgrade-package claude-agent-sdk to scope lock file changes to only the intended dependency.
(Major) Add set -o pipefail to the PyPI fetch step for fail-fast behavior on pipe errors.
(Major) Verify the actions/checkout SHA matches the tagged v6.0.2 release and update the comment if not.
(Minor) Add --reviewer to gh pr create so auto-generated PRs get routed to the right team.
(Minor) Consider re-adding a dependencies label to the repo and applying it here.

Overall the workflow is production-ready modulo item 1, which affects its correctness guarantee. Items 2 and 3 are hardening improvements worth addressing before this pattern is used as a template for other packages.

Review by Claude Code (claude-sonnet-4-6)

🔍 View AI decision process (logs available for 90 days)

📋 View memory system files loaded (click to expand)

What Amber Loaded for Code Review

Amber automatically loaded these repository standards from the memory system:

CLAUDE.md - Master project instructions, development standards
backend-development.md - Go backend, K8s integration patterns
frontend-development.md - NextJS, Shadcn UI, React Query patterns
security-standards.md - Auth, RBAC, token handling
k8s-client-usage.md - User token vs service account patterns
error-handling.md - Consistent error patterns
react-query-usage.md - Data fetching patterns

Impact: This review used your repository's specific code quality standards, security patterns, and best practices from the memory system (PRs #359, #360) - not just generic code review guidelines.

claude added 2 commits February 20, 2026 06:55

feat(ci): add daily GitHub Action to update claude-agent-sdk

da7c0cf

Adds a scheduled workflow that runs daily at 8 AM UTC to check PyPI for the latest claude-agent-sdk version and opens a PR if a newer version is available. https://claude.ai/code/session_01AF1ro2VAMS7WSb7QayTetZ

This comment has been minimized.

Sign in to view

jeremyeder mentioned this pull request Feb 23, 2026

feat: automated Vertex AI model discovery with feature-flag-gated availability #667

Open

8 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Comments

Claude/daily sdk update action#661

Claude/daily sdk update action#661
jeremyeder wants to merge 5 commits intoambient-code:mainfrom
jeremyeder:claude/daily-sdk-update-action-tuNV9

jeremyeder commented Feb 20, 2026

Uh oh!

This comment has been minimized.

github-actions bot commented Feb 20, 2026 •

edited

Loading

What Amber Loaded for Code Review

Uh oh!

github-actions bot commented Feb 21, 2026 •

edited

Loading

What Amber Loaded for Code Review

Uh oh!

github-actions bot commented Feb 21, 2026 •

edited

Loading

What Amber Loaded for Code Review

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Comments

Conversation

jeremyeder commented Feb 20, 2026

Uh oh!

This comment has been minimized.

github-actions bot commented Feb 20, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Claude Code Review

Summary

Issues by Severity

🚫 Blocker Issues

🔴 Critical Issues

🟡 Major Issues

🔵 Minor Issues

Positive Highlights

Recommendations

What Amber Loaded for Code Review

Uh oh!

github-actions bot commented Feb 21, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Claude Code Review

Summary

Issues by Severity

Blocker Issues

Critical Issues

Major Issues

Minor Issues

Positive Highlights

Recommendations

What Amber Loaded for Code Review

Uh oh!

github-actions bot commented Feb 21, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Claude Code Review

Summary

Issues by Severity

Blocker Issues

Critical Issues

Major Issues

Minor Issues

Positive Highlights

Recommendations

What Amber Loaded for Code Review

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

github-actions bot commented Feb 20, 2026 •

edited

Loading

github-actions bot commented Feb 21, 2026 •

edited

Loading

github-actions bot commented Feb 21, 2026 •

edited

Loading