Any plans of supporting INT8 and INT4 precision with tensor core support? #220
Replies: 3 comments
-
INT8 is planned but currently delayed because the developer working on it has gone missing. I will give him at least one more month before we explore other options to get it supported. The hard part of getting support is the very deep integration on our side, plus the fact that not every developer has a GPU that supports it. In my own case, for example, I cannot finish his INT8 work because my card cannot do this at all. INT4 is currently not planned since huggingface has no support for it. Once they add support, we can explore that too.
-
We are waiting for your decision and implementation; hakurei has already released the lit-6B-8bit model.
-
Since the last update we have been focused on overhauling our backend to make implementations like this easier. So currently the holdup is getting that finished. There are already unofficial versions from the community that you can find in our discord.
-
This could significantly reduce VRAM requirements and speed up the AI models.
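To illustrate where the VRAM savings come from, here is a minimal sketch of symmetric per-tensor INT8 quantization in NumPy. This is purely an illustration of the general idea, not the project's actual implementation (real INT8 inference also needs integer tensor-core kernels and per-channel or block-wise scales to preserve accuracy); all function names here are hypothetical.

```python
import numpy as np

def quantize_int8(weights: np.ndarray):
    """Map float32 weights to int8 values plus one float scale.
    Hypothetical helper for illustration only."""
    scale = np.abs(weights).max() / 127.0
    q = np.clip(np.round(weights / scale), -127, 127).astype(np.int8)
    return q, scale

def dequantize_int8(q: np.ndarray, scale: float) -> np.ndarray:
    """Recover an approximate float32 tensor from int8 + scale."""
    return q.astype(np.float32) * scale

w = np.random.randn(1024, 1024).astype(np.float32)
q, scale = quantize_int8(w)

# int8 storage is 4x smaller than float32 for the same tensor shape.
print(w.nbytes // q.nbytes)  # 4
```

The rounding error per element is bounded by the scale, which is why 8-bit weights usually stay close to the full-precision model; INT4 halves the storage again but needs more careful calibration.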