
Batching #2051

Closed
wants to merge 15 commits into from

Conversation

cathyzbn
Contributor

@cathyzbn cathyzbn commented Jul 25, 2024

Describe your changes

Enable batching in Modal functions and class methods.
Interface examples:

@app.function()
@modal.batch(batch_max_size=4, batch_linger_ms=1000)
async def batch_squared(x):
    return [x_i ** 2 for x_i in x]

@app.cls()
class Batch:
    @modal.batch(batch_max_size=4, batch_linger_ms=1000)
    async def batch_squared(self, x):
        return [x_i ** 2 for x_i in x]
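
Callers pass a single input per call; the runtime assembles up to batch_max_size inputs (waiting at most batch_linger_ms), and the decorated function receives them as a list and returns a list of the same length. A sketch of the call site:

# Each .remote() call supplies one input; the runtime forms the batch.
squared = batch_squared.remote(3)  # resolves to 9 once its batch runs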
Backward/forward compatibility checks

Check these boxes or delete any item (or this section) if not relevant for this PR.

  • Client+Server: this change is compatible with old servers
  • Client forward compatibility: this change ensures client can accept data intended for later versions of itself

Note on protobuf: protobuf message changes in one place may have an impact on
multiple entities (client, server, worker, database). See the points above.


Changelog

@cathyzbn cathyzbn marked this pull request as ready for review July 31, 2024 17:52
@cathyzbn cathyzbn requested review from mwaskom and gongy July 31, 2024 18:11
@cathyzbn
Contributor Author

cathyzbn commented Jul 31, 2024

For parameter names: @batch combined with batch_max_size / batch_linger_ms may be redundant, but a bare max_size seems unintuitive, especially for ML users who are accustomed to batch_size?

@mwaskom
Contributor

mwaskom commented Jul 31, 2024

Does this mean we can't do batching with webhook functions?

@mwaskom
Contributor

mwaskom commented Jul 31, 2024

> @batch combined with batch_max_size / batch_linger_ms may be redundant, but a bare max_size seems unintuitive, especially for ML users who are accustomed to batch_size?

I think it should be intuitive in the context of a batch decorator with a small number of parameters. If the decorator had a lot of parameters maybe the link would be less clear.

@mwaskom
Contributor

mwaskom commented Jul 31, 2024

Where is "linger_ms" coming from by the way? At one point I thought we had max_size and max_wait which I thought nicely contrasted the two parameters that you trade off. "linger" isn't really evoking anything specific for me relating to batching, but maybe I'm missing a reference point.

@cathyzbn
Contributor Author

> Does this mean we can't do batching with webhook functions?

Right, we can't batch the webhook functions themselves, but we can compose them. I think most use cases would involve declaring a class for the model and making batched inference one of its methods:

@app.function()
@modal.batch(batch_max_size=4, batch_linger_ms=1000)
async def batched_function_async(x):
    return [x_i**2 for x_i in x]

@app.function()
@modal.web_endpoint()
async def f(x: int):
    output = await batched_function_async.remote.aio(x)
    return output

@mwaskom
Contributor

mwaskom commented Jul 31, 2024

What's the technical reason that we can't decorate an endpoint function or method with the batching decorator? I think users will naively expect those to compose.

@gongy
Contributor

gongy commented Jul 31, 2024

We're looking into getting it to work with web endpoints, but the short summary is that each web endpoint invocation has its own ASGI streams of incoming/outgoing data. It's difficult to batch four such streams into a single request/response pattern unless the user defines their request handler explicitly.
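
To illustrate with a minimal ASGI handler (a generic sketch, not Modal's internals): every invocation owns its own receive/send channel pair, so N concurrent requests are N independent streams rather than one list of inputs.

# Minimal ASGI echo app: each call to `app` gets its own receive/send
# pair. Batching would mean weaving several of these independent stream
# pairs into one function call and fanning the results back out.
async def app(scope, receive, send):
    assert scope["type"] == "http"
    body = b""
    while True:
        message = await receive()  # this request's body, on this request's channel
        body += message.get("body", b"")
        if not message.get("more_body", False):
            break
    await send({"type": "http.response.start", "status": 200, "headers": []})
    await send({"type": "http.response.body", "body": body})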

@cathyzbn
Contributor Author

cathyzbn commented Aug 5, 2024

@cathyzbn cathyzbn closed this Aug 5, 2024