
Improve the way LLM eval runs in the background #6

Open
kasnerz opened this issue Jun 10, 2024 · 1 comment
Labels: enhancement (New feature or request), low priority (Tasks which can be postponed)

Comments

kasnerz (Collaborator) commented Jun 10, 2024

We currently have no specialized solution whatsoever for running LLM evals in the background.

After receiving the request, the backend simply starts iterating over the examples to annotate. At each iteration of the loop, we check the running flag and stop if the flag is set to False.
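
For context, a minimal sketch of that pattern (the helpers and attributes below, e.g. get_examples, annotate_example, and campaign.running, are illustrative placeholders, not the actual factgenie code):

def run_llm_eval(app, campaign_id):
    # illustrative sketch of the current flag-checking loop, not the actual factgenie code
    campaign = app.db.get_campaign(campaign_id)   # placeholder accessor for the campaign state
    for example in get_examples(campaign_id):     # placeholder iterator over examples to annotate
        if not campaign.running:                  # flag flipped to False by a "stop" request
            break
        annotate_example(example, campaign_id)    # placeholder: one LLM call + saving the annotation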

Surprisingly, this seems to work quite OK so far, probably because Flask takes care of the threading.

However, this approach seems too YOLO, and I don't expect it to work robustly, especially if users start launching multiple tasks at once.

At first, I also tried using Python threads manually in the code, something along the lines of:

from threading import Thread

# run the evaluation in a background daemon thread so the request handler can return immediately
thread = Thread(target=utils.run_llm_eval, args=(app, campaign_id))
thread.daemon = True
thread.start()

But that actually rendered the frontend unresponsive (I might have just messed it up, though). In any case, implementing a more principled solution would be much appreciated.
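
One possible direction for a more principled solution would be a shared worker pool with explicit per-campaign cancellation, along these lines (just a sketch; it assumes utils.run_llm_eval is changed to accept and periodically check a stop event):

from concurrent.futures import ThreadPoolExecutor
from threading import Event

executor = ThreadPoolExecutor(max_workers=4)   # one shared pool for all background evals
stop_events = {}                               # campaign_id -> Event used for cooperative cancellation

def start_llm_eval(app, campaign_id):
    stop_events[campaign_id] = Event()
    # assumes utils.run_llm_eval takes the stop event and checks event.is_set() each iteration
    return executor.submit(utils.run_llm_eval, app, campaign_id, stop_events[campaign_id])

def stop_llm_eval(campaign_id):
    stop_events[campaign_id].set()

This way, multiple campaigns can run concurrently without spawning a raw daemon thread per request, and each one can be stopped cleanly.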

kasnerz added the enhancement (New feature or request) and help wanted (Extra attention is needed) labels on Jun 10, 2024
kasnerz added the low priority (Tasks which can be postponed) label and removed the help wanted (Extra attention is needed) label on Jul 25, 2024
oplatek (Member) commented Aug 1, 2024

I would rather switch to asyncio / async HTTP, where we can wait for thousands of responses without affecting server performance. We always delegate the heavy work to another server, and I think this works well for us.
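
As a rough illustration of that direction (a sketch only; the /generate endpoint and payload format are placeholders, not the actual factgenie or LLM server API):

import asyncio
import aiohttp

async def annotate_all(examples, url="http://llm-server:8000/generate"):
    # cap concurrency so thousands of pending requests don't overwhelm the LLM server
    sem = asyncio.Semaphore(16)

    async def annotate_one(session, example):
        async with sem:
            async with session.post(url, json={"text": example}) as resp:
                return await resp.json()

    async with aiohttp.ClientSession() as session:
        return await asyncio.gather(*(annotate_one(session, ex) for ex in examples))

The server only keeps lightweight coroutines waiting on I/O, so thousands of in-flight requests do not tie up worker threads.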

If you are worried about scaling the worker nodes, I would start solving that only once the waiting times for users become too long in some use case. Somebody would have to simulate it first. Personally, I don't think I will ever need it in factgenie.
