Add proposal for queues and services in Tyger by johnstairs · Pull Request #291 · microsoft/tyger

johnstairs · 2026-02-10T16:45:48Z

Adding a functional proposal for queues and services in Tyger. The proposal does not get into implementation details. Comments welcome!

hansenms

This looks pretty nice. Any thoughts on the underlying implementation? We handle this in the database or do we use Azure Storage Queue? Would this be supported in Docker mode (I think not)?

The main concern I have is with have the CLI authenticated in the container without being scoped down or something?

proposals/queues.md

hansenms · 2026-02-13T05:38:31Z

proposals/queues.md

+
+while true; do
+  # Receive returns {"status": "...", "items": [...]}
+  response=$(tyger queue receive "$queue" --create-output-buffers)


So this implies that the container running in the service has Tyger control plane access. We have not had that before but I see how it could be needed now. We should think about if/how this could be scoped down? Would it be possible that it only has queue access and only to the queues and associated buffers relevant for this service?

A few models for this:

tyger is logged in with some unspecified account that has, contributor access.

When creating the codespec, you explicitly like the tyger credentials to an existing workload identity.

We have a separate token that is independent of Entra that is mounted as a secret in the container and that is rotated (like we do for SAS tokens). That token would have claims that have been explicitly granted to the service or run. They could grant access to one or more queues, read and write to buffers related to those queues, create buffers, etc.

hansenms · 2026-02-13T05:39:33Z

proposals/queues.md

+  output_buffer=$(echo "$item_json" | jq -r '.outputs.result')
+
+  # Start heartbeat in background
+  tyger queue item heartbeat "$item_id" --lease "$lease" --while-alive &


I know this is just an example, but there is a good chance if something goes wrong below that this will zombie and the queue will be open forever.

This is a convenience feature. Ideally, you would take care of the heartbeat in your own code in a background thread with proper error handling etc. In this case, if the main script exits because of a failure, the heartbeat process would detect that its parent is no longer alive and exit.

hansenms · 2026-02-13T05:41:37Z

proposals/queues.md

+`<QUEUE_KEY>_QUEUE_NAME` containing the actual queue name (e.g.
+`REQUESTS_QUEUE_NAME=inference-requests`).
+
+#### Scaling Services


Should we (at some point) have autoscaling? It is somewhere between trigger and service if you let it scale to zero when idle for long enough?

Yeah, I think autoscaling would be a valuable feature, but I think how it is defined (function of queue length, based a target average response time?) would require a whole other spec. Scale to 0 would be a really interesting capability.

hansenms · 2026-02-13T05:44:14Z

proposals/queues.md

@@ -0,0 +1,697 @@
+# Proposal: Queues and Services in Tyger


Not sure where to park this comment, so it will be at the top....any thoughts on this in a multi-tenant environment. There multiple tenants could really benefit from this.

I had not thought about multi-tenancy. All of the scenarios in this document are scoped per organization. Are you thinking about queues that are shared across organizations?

I guess you're thinking about a shared inferencing service. Ok this gets complicated because now a service has to be able to access queues and buffers across multiple organizations.

Yes, I don't think we should consider it right now, but if we consider our own multi-tenancy use cases, this would be pretty handy. Anyways also a reminder that creating these services is probably a privileged operation?

hansenms · 2026-02-13T05:50:22Z

proposals/queues.md

+The container is completely unaware of queues—it just reads from input pipes and
+writes to output pipes.
+
+#### Parameter Forwarding


It is a bit unclear what should happen if you have output parameters since the processor is required to provide them?

I think it should be an error to create a trigger on a queue with output parameters.

proposals/queues.md

johnstairs · 2026-02-13T13:27:08Z

This looks pretty nice. Any thoughts on the underlying implementation? We handle this in the database or do we use Azure Storage Queue? Would this be supported in Docker mode (I think not)?

The main concern I have is with have the CLI authenticated in the container without being scoped down or something?

I think that the implementation would probably be handled in the database, because the functionality is a little different than normal message queues in that the queue message is durable and can be updated with a response. And if we implement it in the database, it would be relatively straightforward to get this working in Docker mode as well.

Add proposal for queues and services in Tyger

81688d6

johnstairs requested review from hansenms, lv-ms, naegelejd and yuliadub February 10, 2026 16:46

hansenms reviewed Feb 13, 2026

View reviewed changes

Conversation

johnstairs commented Feb 10, 2026

Uh oh!

hansenms left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

johnstairs Feb 13, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

johnstairs commented Feb 13, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

johnstairs Feb 13, 2026 •

edited

Loading