
Is there a faster way to read large Vec<u8>? #462

Open
MeguminSama opened this issue Jul 31, 2024 · 5 comments

Comments


MeguminSama commented Jul 31, 2024

At the moment, say I have a struct like this:

#[derive(Debug, DekuRead, DekuWrite)]
#[deku(ctx = "size: usize", ctx_default = "0")]
pub struct Rfc {
	#[deku(bytes_read = "size")]
	pub data: Vec<u8>,
}

The problem is that deku seems to loop through the reader for each u8 in bytes_read, which makes it very slow on large vectors. #[deku(read_all)] and #[deku(count = "size")] are also very slow.

At the moment, we're using our own read function to do something like this:

match reader.read_bytes(size, &mut buf) {
	Ok(ReaderRet::Bytes) => Ok(Rfc { data: buf }),
	_ => {...}
}

Which is significantly faster.

But I was wondering if there was a built-in way to do this with deku, instead of deku looping over each u8?

If this isn't a feature currently, I might consider implementing it if it's something you'd want in deku.

Thanks!

@wcampbell0x2a (Collaborator)

For read_all performance, check out the following MR: #441

Since deku makes small repeated reads, wrapping your reader in a BufReader (https://doc.rust-lang.org/std/io/struct.BufReader.html) should reduce the read overhead.
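
For illustration, a minimal, untested sketch of that suggestion. The Container type and dump.bin file are made up for the example, and it assumes the from_reader interface from recent deku releases that takes a (reader, bit offset) pair; the Rfc struct from this issue takes its size via ctx and would normally sit inside such a parent type.

use std::fs::File;
use std::io::BufReader;

use deku::prelude::*;

// Hypothetical parent container, just for illustration.
#[derive(Debug, DekuRead, DekuWrite)]
struct Container {
    len: u32,
    #[deku(count = "len")]
    data: Vec<u8>,
}

fn main() -> Result<(), Box<dyn std::error::Error>> {
    let file = File::open("dump.bin")?;
    // Buffer the file so deku's many small reads hit memory instead of
    // issuing a syscall per read.
    let mut reader = BufReader::new(file);
    let (_amt_read, container) = Container::from_reader((&mut reader, 0))?;
    println!("read {} payload bytes", container.data.len());
    Ok(())
}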

@MeguminSama (Author)

Thanks for getting back so quickly!

At the moment, our reader is already using a BufReader. We tried doing this in an attempt to speed it up, but unfortunately it's still much too slow when reading the vectors compared to our own read function.

Would some kind of #[deku(read_buffer = "size")] attribute be something you'd consider? Or is this out of scope for deku?

@wcampbell0x2a (Collaborator)

Definitely try out the merge request; it's really slow without that change.

> Would some kind of #[deku(read_buffer = "size")] attribute be something you'd consider? Or is this out of scope for deku?

Sure. I don't have the code in front of me, but I think we only store the leftover as a u8, so we would need to store leftovers in a Vec<u8> if needed. I'd want to allocate that only when read_buffer is used, since on embedded platforms you don't want allocations all the time.

@wcampbell0x2a (Collaborator)

I also don't know; it could be an improvement in our impl of Vec. I think it currently reads and evaluates one element at a time.

@MeguminSama (Author)

I will take a look, thanks :)

wcampbell0x2a added a commit that referenced this issue Sep 18, 2024
* Add read_exact, which can only be used for Vec<u8> but allows faster reading
  that doesn't have ctx or limits.

See #462
wcampbell0x2a added a commit that referenced this issue Sep 20, 2024
* For Vec<u8> when using count, specialize into reading the bytes all
  at once

See #462
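
For reference, a minimal sketch of the shape that benefits from that count specialization, modeled on the struct from the original report (untested; it simply swaps bytes_read for count so the specialized Vec<u8> path applies):

use deku::prelude::*;

// Same layout as the Rfc struct from this issue, but using count so the
// Vec<u8> payload can be read in one block instead of byte by byte.
#[derive(Debug, DekuRead, DekuWrite)]
#[deku(ctx = "size: usize", ctx_default = "0")]
pub struct Rfc {
    #[deku(count = "size")]
    pub data: Vec<u8>,
}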