Add fast path for utf16le encoding in stringToBuffer()/bufferToString() by wh201906 · Pull Request #981 · margelo/react-native-quick-crypto

wh201906 · 2026-04-26T17:09:59Z

The native implementation is way much faster

name	utf16le encode 32B	utf16le encode 1MB	utf16le encode 32B (ASCII only)	utf16le encode 1MB (ASCII only)	utf16le decode 32B	utf16le decode 1MB	utf16le decode 32B (ASCII only)	utf16le decode 1MB (ASCII only)
ratio	2.18x	318.79x	2.09x	164.03x	3.39x	2005.62x	3.27x	886.29x

Screenshot

In the current mainstream React Native JavaScript engine, Hermes, strings are internally represented using UTF-16 or ASCII. Therefore, when the native side needs access to the UTF-16 representation of a string, Hermes can provide the underlying data with minimal overhead. However, in the current implementation of Nitro, JavaScript strings are always converted to UTF-8 by default. For UTF-16 data, this introduces unnecessary conversion overhead and may also lead to data loss (e.g., unpaired surrogates) during the conversion process.

To address this, I bypass the Nitrogen-generated conversion path from JS string to std::string by accessing jsi::String object directly. For other encodings, the existing Nitrogen-like code path is preserved (call jsi::String::utf8() like what nitro does). For UTF-16 encoding, a lower-level fast path is used whenever possible (call jsi::String::getStringData()).

Note: this optimized UTF-16 encoding/decoding path is only available in the Hermes environment and for React Native 0.78+. Therefore, I added conditional checks on both the JavaScript side and the C++ side to selectively enable this feature.

For testing, I added UTF-16LE-related test cases based on Node.js, as well as performance benchmarks for the UTF-16 encoding path.

(text polished by ChatGPT)

Add argument count check Wrap exceptions in the same style as Nitro HybridFunction

And use createFromUtf8() override with less overhead

No significant performance improvements for normal cases

wh201906 · 2026-04-27T01:48:21Z

Test cases from Node.js v24.15.0

Roundtrips ASCII text through utf16le encoding.

Current encoding_tests.ts:

test(SUITE, '[Node.js] Roundtrips ASCII text through utf16le encoding.', () => {
  const str = 'foo';
  const ab = stringToBuffer(str, 'utf16le');
  expect(bufferToString(ab, 'utf16le')).to.equal(str);
});

Original Node.js (test/parallel/test-buffer-tostring.js):

// utf8, ucs2, ascii, latin1, utf16le
for (const encoding of [
  'utf8',
  'utf-8',
  'ucs2',
  'ucs-2',
  'ascii',
  'latin1',
  'binary',
  'utf16le',
  'utf-16le',
].flatMap(e => [e, e.toUpperCase()])) {
  assert.strictEqual(Buffer.from('foo', encoding).toString(encoding), 'foo');
}

Roundtrips UTF-16LE text containing an unpaired high surrogate.

Current encoding_tests.ts:

test(
  SUITE,
  'Roundtrips UTF-16LE text containing an unpaired high surrogate.',
  () => {
    const str = 'A\uD83DB';
    const ab = stringToBuffer(str, 'utf16le');
    expect(toU8(ab)).to.deep.equal(
      new Uint8Array([0x41, 0x00, 0x3d, 0xd8, 0x42, 0x00]),
    );
    expect(bufferToString(ab, 'utf16le')).to.equal(str);
  },
);

Original Node.js:

No direct matching test case was found in Node.js v24.15.0.

Verified Node.js runtime behavior:

const str = 'A\uD83DB';
const buf = Buffer.from(str, 'utf16le');
assert.deepStrictEqual([...buf], [0x41, 0x00, 0x3d, 0xd8, 0x42, 0x00]);
assert.strictEqual(buf.toString('utf16le'), str);

Roundtrips UTF-16LE text containing an unpaired low surrogate.

Current encoding_tests.ts:

test(
  SUITE,
  'Roundtrips UTF-16LE text containing an unpaired low surrogate.',
  () => {
    const str = 'A\uDC00B';
    const ab = stringToBuffer(str, 'utf16le');
    expect(toU8(ab)).to.deep.equal(
      new Uint8Array([0x41, 0x00, 0x00, 0xdc, 0x42, 0x00]),
    );
    expect(bufferToString(ab, 'utf16le')).to.equal(str);
  },
);

Original Node.js:

No direct matching test case was found in Node.js v24.15.0.

Verified Node.js runtime behavior:

const str = 'A\uDC00B';
const buf = Buffer.from(str, 'utf16le');
assert.deepStrictEqual([...buf], [0x41, 0x00, 0x00, 0xdc, 0x42, 0x00]);
assert.strictEqual(buf.toString('utf16le'), str);

UTF-16LE encoding of "über"

Current encoding_tests.ts:

test(SUITE, '[Node.js] UTF-16LE encoding of "über"', () => {
  expect(toU8(stringToBuffer('über', 'utf16le'))).to.deep.equal(
    new Uint8Array([252, 0, 98, 0, 101, 0, 114, 0]),
  );
});