fix(win32): encode KEY_EVENT_RECORD.uChar as UTF-8 for IME input by KoalaHao · Pull Request #78 · Norbert515/nocterm

KoalaHao · 2026-06-05T06:28:37Z

KEY_EVENT_RECORD.uChar is a single UTF-16 code unit, but the downstream InputParser decodes as UTF-8 (1/2/3/4-byte sequences dispatched by leading byte). The old code returned String.fromCharCode(char).codeUnits, which for e.g. 你 (U+4F60) produced the int 0x4F60 - not a valid UTF-8 byte - so IME characters (Chinese, Japanese, Korean, anything CJK) never reached the text field.

Now encoded as proper UTF-8 via utf8.encode(), with a _pendingHighSurrogate field to reassemble supplementary-plane characters (e.g. emoji) across two consecutive key events. Orphans are discarded so a half-pair followed by a different key never produces a wrong character.

Verified: 你 (U+4F60) -> 'e4 bd a0', (surrogate pair 0xD83D/0xDE00) -> 'f0 9f 98 80', both round-trip through utf8.decode.

KEY_EVENT_RECORD.uChar is a single UTF-16 code unit, but the downstream InputParser decodes as UTF-8 (1/2/3/4-byte sequences dispatched by leading byte). The old code returned String.fromCharCode(char).codeUnits, which for e.g. 你 (U+4F60) produced the int 0x4F60 - not a valid UTF-8 byte - so IME characters (Chinese, Japanese, Korean, anything CJK) never reached the text field. Now encoded as proper UTF-8 via utf8.encode(), with a _pendingHighSurrogate field to reassemble supplementary-plane characters (e.g. emoji) across two consecutive key events. Orphans are discarded so a half-pair followed by a different key never produces a wrong character. Verified: 你 (U+4F60) -> 'e4 bd a0', (surrogate pair 0xD83D/0xDE00) -> 'f0 9f 98 80', both round-trip through utf8.decode.

…xtField The cursor position was displayed wrong when the text had multiple consecutive newlines (e.g. "hello\n\nworld"). The _paintCursor method tracked character offsets across layout lines incorrectly. The old code checked textSoFar.endsWith('\n') — whether the text *up to* the current offset ends with a newline. But textSoFar only includes characters before position charCount, so it would never detect the newline that sits AT position charCount. Fix: check _text[charCount] == '\n' instead, consistent with the correct pattern used in CursorMovement.getCursorPosition and selection_utils.lineStartOffsets. Bug introduced in a0c5bd4 (Feature/soft wrapping textfield Norbert515#6).

KoalaHao added 2 commits June 5, 2026 14:18

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix(win32): encode KEY_EVENT_RECORD.uChar as UTF-8 for IME input#78

fix(win32): encode KEY_EVENT_RECORD.uChar as UTF-8 for IME input#78
KoalaHao wants to merge 2 commits into
Norbert515:mainfrom
marsup-space:fix/windows-utf8-ime

KoalaHao commented Jun 5, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

KoalaHao commented Jun 5, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant