Add torch.no_grad, fix greedy_until bug #161

Merged: 3 commits into main on Apr 19, 2024

Conversation

OyvindTafjord (Contributor):

As spotted by @jjyang77, we weren't wrapping model calls with torch.no_grad(), an unnecessary oversight. This adds it for the language_model.py model type. In a couple of experiments it doesn't seem to speed evaluation up by much, but it does improve memory usage (I was able to run Llama-3-8B on a single GPU rather than needing two).
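For illustration, a minimal sketch of the kind of change this makes, assuming a Hugging Face-style model held on `self.model` inside language_model.py (the method and variable names here are illustrative, not the repository's actual ones):

```python
import torch

class LanguageModel:
    # ... existing setup omitted ...

    def score(self, input_ids, attention_mask):
        # Wrapping inference in torch.no_grad() stops autograd from recording
        # the forward pass, so no activation buffers are kept around for a
        # backward pass -- lower memory use, identical outputs.
        with torch.no_grad():
            outputs = self.model(input_ids=input_ids, attention_mask=attention_mask)
        return outputs.logits
```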

Also fixed a bug in the greedy_until method that caused crashes for some models (like Mistral) when primary_until is left as None, as noted by @dmh43.

```diff
- primary_until = None
- for tokenized_until in tokenizer(untils)["input_ids"]:
+ primary_until = tokenizer.eos_token_id
+ for tokenized_until in tokenizer(untils, add_special_tokens=False)["input_ids"]:
```
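For context, a rough sketch of how this value is typically consumed later in greedy_until (this is not the repository's exact code; names like `max_gen_tokens` are illustrative): generation expects an integer stop-token id, so falling back to tokenizer.eos_token_id avoids passing None through.

```python
# Illustrative sketch only: pick a single-token stop sequence if one exists,
# otherwise fall back to the EOS token id instead of None.
primary_until = tokenizer.eos_token_id
for tokenized_until in tokenizer(untils, add_special_tokens=False)["input_ids"]:
    if len(tokenized_until) == 1:
        primary_until = tokenized_until[0]
        break

# Downstream, the stop id is handed to generation; an integer keeps this
# well-defined for models (like Mistral) that crashed when it was None.
output = model.generate(
    input_ids,
    max_new_tokens=max_gen_tokens,  # illustrative parameter name
    eos_token_id=primary_until,
)
```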
A reviewer asked:
what's this about?

OyvindTafjord (Contributor, Author) replied:

The add_special_tokens=False prevents the tokenizer from adding a <bos>-type token at the start of the string, which some tokenizers do and which would mess up the test that there's only one token. Unfortunately, other tokenizers, like Llama/Mistral, also add a "space" token when you tokenize "\n", so this is still not effective for those; it will be improved more centrally in the next iteration of model handling.
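A small sketch of the tokenizer behavior being described (the model name is just an example; exact token ids and counts vary by tokenizer):

```python
from transformers import AutoTokenizer

# Any BOS-adding tokenizer works as an example here.
tokenizer = AutoTokenizer.from_pretrained("mistralai/Mistral-7B-v0.1")

# With default settings, many tokenizers prepend a BOS token, so even a
# one-character stop string tokenizes to more than one id.
with_specials = tokenizer("\n")["input_ids"]

# add_special_tokens=False drops the BOS, but some SentencePiece tokenizers
# (e.g. Llama/Mistral) still emit a leading "space" token for "\n", so the
# result can still be longer than one token.
without_specials = tokenizer("\n", add_special_tokens=False)["input_ids"]

print(len(with_specials), len(without_specials))
```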

OyvindTafjord merged commit c3eb82e into main on Apr 19, 2024
10 of 17 checks passed