toHexString util: try String.fromCharCode api #277

twoeths · 2022-07-29T10:39:43Z

Is your feature request related to a problem? Please describe.

Refer to achingbrain/uint8arrays#30 (comment)

Given how efficiently protobuf creates a string from a Uint8Array in one go, we could try using the String.fromCharCode() API for our toHexString() function instead of string concatenation, which creates temporary strings that make the GC run more frequently.

Describe the solution you'd like

Create a function that maps a number from 0 to 15 to the char code of its hex digit
For each byte, extract the high 4 bits and the low 4 bits as two unsigned integers
Collect the results into an array of char codes
Create the string from that array in one go
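
The four steps above can be sketched roughly as follows. This is only an illustration: the names `nibbleToCharCode`, `byteToHexCharCodes`, and `bytesToHex` are hypothetical, and the output here is lowercase hex without a prefix.

```typescript
// Hypothetical helper: map a 4-bit value (0..15) to the char code of its
// lowercase hex digit. "0" is 48 and "a" is 97, so 10..15 use an offset of 87.
function nibbleToCharCode(nibble: number): number {
  return nibble < 10 ? nibble + 48 : nibble + 87;
}

// Hypothetical helper: split one byte into its high and low nibbles and
// return the two corresponding char codes.
function byteToHexCharCodes(byte: number): [number, number] {
  return [nibbleToCharCode(byte >> 4), nibbleToCharCode(byte & 0x0f)];
}

// Collect all char codes first, then build the string in a single call.
// Spreading is fine for short inputs like the one below.
function bytesToHex(bytes: Uint8Array): string {
  const codes: number[] = [];
  for (let i = 0; i < bytes.length; i++) {
    codes.push(...byteToHexCharCodes(bytes[i]));
  }
  return String.fromCharCode(...codes);
}

console.log(bytesToHex(new Uint8Array([0x00, 0xab, 0xff]))); // "00abff"
```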


dapplion · 2022-07-29T12:16:04Z

For Node.js, where performance is important, we should just use the Buffer utils, which will probably be faster and more memory efficient.
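
For reference, a minimal sketch of what the Buffer route might look like on Node.js. `toHexNode` is a hypothetical name. `Buffer.from(arrayBuffer, byteOffset, length)` creates a view over the same memory instead of copying, and `toString("hex")` yields lowercase hex with no prefix, so the `0x` is added by hand here.

```typescript
import { Buffer } from "node:buffer";

// Hypothetical helper: hex-encode a Uint8Array via Node's Buffer.
function toHexNode(bytes: Uint8Array): string {
  // Wrap the existing memory; byteOffset and byteLength matter when
  // `bytes` is a subarray of a larger ArrayBuffer.
  const view = Buffer.from(bytes.buffer, bytes.byteOffset, bytes.byteLength);
  return "0x" + view.toString("hex");
}

console.log(toHexNode(new Uint8Array([1, 171, 255]))); // "0x01abff"
```

Passing the offset and length explicitly keeps the result correct for subarrays, which would otherwise expose the whole backing buffer.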

Given how efficiently protobuf creates a string from a Uint8Array in one go

Efficient in terms of CPU time, memory or what specifically?

twoeths · 2022-07-30T06:41:36Z

Efficient in terms of CPU time, memory or what specifically?

In CPU time, but I guess it could be better in terms of memory too, since no intermediate strings are created (more benchmarks are needed to confirm).

export function toHexString2(bytes: Uint8Array): string {
  // Two char codes per byte, plus two for the "0x" prefix
  const chunks = new Array<number>(bytes.length * 2 + 2);
  // "0".charCodeAt(0)
  chunks[0] = 48;
  // "x".charCodeAt(0)
  chunks[1] = 120;
  for (let i = 0; i < bytes.length; i++) {
    const byte = bytes[i];
    // High and low nibbles of the byte
    const first = (byte & 0xf0) >> 4;
    const second = byte & 0x0f;

    // "0".charCodeAt(0) = 48
    // "a".charCodeAt(0) = 97 => delta = 87
    chunks[2 + 2 * i] = first < 10 ? first + 48 : first + 87;
    chunks[2 + 2 * i + 1] = second < 10 ? second + 48 : second + 87;
  }
  // return String.fromCharCode.apply(String, chunks);
  return String.fromCharCode(...chunks);
}
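
One caveat with the spread call: every char code becomes a separate function argument, and engines cap the number of arguments, so very large inputs can throw a RangeError. A possible variation, which is a sketch and not what was benchmarked in this thread, converts the codes in fixed-size slices. The name `toHexStringChunked` and the `CHUNK_SIZE` value are assumptions chosen for illustration.

```typescript
// Hypothetical slice size, picked to stay well below typical argument limits.
const CHUNK_SIZE = 8192;

function toHexStringChunked(bytes: Uint8Array): string {
  // Two char codes per byte, stored in a typed array.
  const codes = new Uint16Array(bytes.length * 2);
  for (let i = 0; i < bytes.length; i++) {
    const hi = bytes[i] >> 4;
    const lo = bytes[i] & 0x0f;
    codes[2 * i] = hi < 10 ? hi + 48 : hi + 87;
    codes[2 * i + 1] = lo < 10 ? lo + 48 : lo + 87;
  }

  // Convert one slice at a time; this costs one concatenation per slice
  // rather than one per byte.
  let out = "0x";
  for (let start = 0; start < codes.length; start += CHUNK_SIZE) {
    const slice = codes.subarray(start, start + CHUNK_SIZE);
    out += String.fromCharCode.apply(null, slice as unknown as number[]);
  }
  return out;
}

console.log(toHexStringChunked(new Uint8Array([0x00, 0xab, 0xff]))); // "0x00abff"
```

This trades a handful of concatenations for safety on large inputs; whether it keeps the speed advantage would need its own benchmark.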

Some quick benchmarks:

  toHexString vs String.fromCharCode
    ✓ fromCharCode                                                        320.1892 ops/s    3.123153 ms/op        -      19043 runs   60.0 s
    ✓ toHexString                                                         211.8084 ops/s    4.721247 ms/op   x0.985      12598 runs   60.0 s
    ✓ Buffer.toString hex                                                 342.6784 ops/s    2.918188 ms/op   x0.995      20385 runs   60.0 s

dapplion · 2022-07-30T21:26:51Z

@tuyennhv For memory efficiency there's this library that flattens strings. Check it out it's magic https://github.com/davidmarkclements/flatstr

wemeetagain · 2022-08-01T16:41:44Z

The fromCharCode approach looks good. Probably the best browser-compatible implementation

dapplion · 2022-08-02T14:55:13Z

@tuyennhv For memory efficiency there's this library that flattens strings. Check it out it's magic https://github.com/davidmarkclements/flatstr

Note this was recommended by Ben (the libuv maintainer)

twoeths mentioned this issue Aug 19, 2024

fix: avoid ssz toHexString ChainSafe/lodestar#7036

Closed

twoeths mentioned this issue Aug 29, 2024

feat: implement isomorphic utils for nodejs and browser ChainSafe/lodestar#7060

Merged
