Batching #2051
…dal-client into cathy/batching-integration
For parameter names,
Does this mean we can't do batching with webhook functions?
I think it should be intuitive in the context of a
Where is "linger_ms" coming from by the way? At one point I thought we had
Yes, we can't batch webhook functions, but we can do this. I think most use cases would involve declaring a class for the model and making batched inference one of its methods.
What's the technical reason that we can't decorate an endpoint function or method with the batching decorator? I think users will naively expect those to compose.
We're looking into getting it to work with web endpoints, but the short summary is that each web endpoint invocation has its own ASGI streams of incoming and outgoing data. It's difficult to batch four streams into a single request/response pattern unless the user defines their request handler explicitly.
Describe your changes
Enable batching in Modal functions and class methods.
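To make the idea concrete, here is a minimal, self-contained sketch of dynamic batching in plain `asyncio`. It is not the implementation in this PR and does not use the Modal client. The `DynamicBatcher` class and the `max_batch_size` and `wait_ms` parameter names are hypothetical, chosen only for illustration. Single-item calls are queued and handed to a list-in, list-out function once the batch is full or the wait time elapses.

```python
import asyncio
from typing import Any, Callable, List, Optional, Tuple


class DynamicBatcher:
    """Hypothetical helper: groups single-item calls into list-in, list-out calls."""

    def __init__(self, batch_fn: Callable[[List[Any]], List[Any]],
                 max_batch_size: int, wait_ms: int) -> None:
        self._batch_fn = batch_fn
        self._max_batch_size = max_batch_size
        self._wait_s = wait_ms / 1000
        self._pending: List[Tuple[Any, asyncio.Future]] = []
        self._timer: Optional[asyncio.TimerHandle] = None

    async def call(self, item: Any) -> Any:
        loop = asyncio.get_running_loop()
        fut = loop.create_future()
        self._pending.append((item, fut))
        if len(self._pending) >= self._max_batch_size:
            # Batch is full: run it immediately.
            self._flush()
        elif self._timer is None:
            # First item of a new batch: start the wait timer.
            self._timer = loop.call_later(self._wait_s, self._flush)
        return await fut

    def _flush(self) -> None:
        if self._timer is not None:
            self._timer.cancel()
            self._timer = None
        batch, self._pending = self._pending, []
        if not batch:
            return
        try:
            results = self._batch_fn([item for item, _ in batch])
        except Exception as exc:
            # A failure in the batched function fails every caller in the batch.
            for _, fut in batch:
                if not fut.done():
                    fut.set_exception(exc)
            return
        # Hand each caller the result at the same position as its input.
        for (_, fut), result in zip(batch, results):
            if not fut.done():
                fut.set_result(result)


batch_sizes: List[int] = []


def square_batch(xs: List[int]) -> List[int]:
    """Example batched function: takes a list of inputs, returns a list of outputs."""
    batch_sizes.append(len(xs))
    return [x * x for x in xs]


async def demo() -> List[int]:
    batcher = DynamicBatcher(square_batch, max_batch_size=4, wait_ms=50)
    # Six concurrent single-item calls: one full batch, then a timed-out partial batch.
    return await asyncio.gather(*(batcher.call(i) for i in range(6)))


results = asyncio.run(demo())
```

In this sketch each caller sees an ordinary single-input, single-output call, while the wrapped function only ever sees lists. That request/response shape is what makes batching straightforward for plain functions and methods.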
Interface Examples:
Backward/forward compatibility checks
Check these boxes or delete any item (or this section) if not relevant for this PR.
Note on protobuf: protobuf message changes in one place may affect
multiple entities (client, server, worker, database). See points above.
Changelog