ci(runpod): auto-update template image after worker push by quevon24 · Pull Request #65 · freelawproject/scanning

quevon24 · 2026-05-11T19:09:16Z

Summary

After the Build and Push RunPod Worker workflow uploads a new image to Docker Hub, it PATCHes the RunPod template's imageName to the SHA-pinned tag via rest.runpod.io/v1/templates/<id>. Next cold-start workers pull the new image automatically, replacing the manual "edit image tag in the RunPod UI" step.

Setup before merging

RunPod

Console → Settings → API Keys → Create API Key with Restricted permission. Set GraphQL: Read/Write and leave AI API: None. Copy the key.

Why GraphQL R/W: RunPod exposes only two scopes on restricted keys, GraphQL (the management plane covering templates, endpoints, pods, secrets) and AI API (per-endpoint scope, only for invoking serverless jobs). Updating a template is a management operation, so it requires GraphQL R/W; the REST /v1/templates/<id> route used by the workflow is a thin shim over the GraphQL saveTemplate mutation internally. There is no per-template scope, so GraphQL R/W is the tightest setting that lets us do this. It does, however, grant full management of every template, endpoint, and pod on the account, so treat the key as a production credential: store only in GitHub Actions secrets, never echo to logs, and rotate if it leaks.
Find the template ID. Endpoints created manually still have a backing template, hidden from the default listing, surface it with the includeEndpointBoundTemplates flag:
```
curl -sS -H "Authorization: Bearer <API_KEY>" \
  "https://rest.runpod.io/v1/templates?includeEndpointBoundTemplates=true"
```
Find the entry where isServerless: true and name matches your endpoint (e.g. Blackletter gpu worker). Its id is the template ID.

GitHub
3. Repo → Settings → Secrets and variables → Actions → New repository secret. Add both:

RUNPOD_API_KEY from step 1
RUNPOD_TEMPLATE_ID from step 2

Verifying after merge

Run the workflow manually (Actions → Build and Push RunPod Worker → Run workflow) and confirm the Update RunPod template image step logs HTTP 200. The template in the RunPod UI should then show the new freelawproject/blackletter-gpu-worker:<sha> tag, and the next job dispatched to the endpoint will cold-start on the new image.

Notes

This relies on Min Workers = 0: warm workers drain on the 5-min idle timeout and the next job picks up the new image. If we raise Min Workers > 0 later, we'll need to add an explicit roll step to force existing workers to recycle.
The workflow still triggers only on changes under scanning/runpod/** (plus workflow_dispatch), unchanged from before.

Closes #55

quevon24 · 2026-05-15T22:49:13Z

Tested the new step via workflow_dispatch from this branch:

Without secrets set: workflow ran end-to-end and the step failed cleanly with HTTP 400 and RunPod's "templateId is required" error, confirming the error path, status parsing, and ::error:: annotation all behave as expected.
With personal RUNPOD_API_KEY and RUNPOD_TEMPLATE_ID: step returned HTTP 200, the response showed imageName updated to the new SHA, and all other template fields (containerDiskInGb, volumeMountPath, startSsh, etc.) were preserved, confirming RunPod's PATCH is partial-update semantics. GitHub Actions also masked the templateId in the logged response body.

Then removed the personal secrets. Ready for the production RUNPOD_API_KEY / RUNPOD_TEMPLATE_ID to be added before merge.

grossir

LGTM!

grossir · 2026-05-18T21:23:52Z

@quevon24 Both RUNPOD_API_KEY and RUNPOD_TEMPLATE_ID are missing from the repo secrets. Should we open a ticket on the infra repo? Or can you add the values? I have permissions to add secrets

ci(runpod): update template image after worker push

a6be661

quevon24 linked an issue May 11, 2026 that may be closed by this pull request

feat(ci): automate RunPod template update on new image release #55

Closed

quevon24 added 2 commits May 11, 2026 13:13

docs(runpod): document automated release workflow

6d49083

docs(runpod): note partial-PATCH caveat for template env vars

59b7ccf

quevon24 marked this pull request as ready for review May 15, 2026 22:46

quevon24 requested a review from grossir May 15, 2026 22:50

quevon24 assigned grossir May 15, 2026

quevon24 added this to Sprint (Case Law) May 15, 2026

quevon24 moved this to PRs to Review in Sprint (Case Law) May 15, 2026

grossir approved these changes May 18, 2026

View reviewed changes

grossir merged commit 1192c16 into main May 18, 2026
8 checks passed

github-project-automation Bot moved this from PRs to Review to Done in Sprint (Case Law) May 18, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ci(runpod): auto-update template image after worker push#65

ci(runpod): auto-update template image after worker push#65
grossir merged 3 commits into
mainfrom
55-featci-automate-runpod-template-update-on-new-image-release

quevon24 commented May 11, 2026

Uh oh!

quevon24 commented May 15, 2026

Uh oh!

grossir left a comment

Uh oh!

Uh oh!

grossir commented May 18, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

quevon24 commented May 11, 2026

Summary

Setup before merging

Verifying after merge

Notes

Uh oh!

quevon24 commented May 15, 2026

Uh oh!

grossir left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

grossir commented May 18, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

grossir commented May 18, 2026 •

edited

Loading