perf: SIMD scan ASCII runs in input loop #288

dyxushuai · 2025-12-28T15:25:53Z

Problem

Input parsing reads one byte at a time and funnels everything through Parser, even when large ASCII runs are present. This adds avoidable per-byte overhead in common input streams.

Fix

Add a SIMD-assisted scan in the input loop to detect contiguous printable ASCII runs (0x20..0x7E). For these runs we emit key_press events directly. If the next byte begins a combining mark, we leave the last ASCII byte for the parser to avoid breaking combining/keycap sequences.
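The scan and the boundary rule described above can be sketched roughly as follows. This is an illustration only, not the code from this PR: the names `printableRunLen` and `directEmitLen`, the 16-lane vector width, and the hold-back test are all assumptions. In particular, the hold-back here is deliberately conservative: it gives up the last ASCII byte whenever the following byte is any non-ASCII byte (0x80 or above), which is a superset of "begins a combining mark". Runs that end exactly at the end of the buffer are not treated specially. The single-argument `@splat` assumes Zig 0.11 or later.

```zig
const std = @import("std");

/// Hypothetical sketch: length of the leading run of printable ASCII
/// (0x20..0x7E) in `bytes`.
fn printableRunLen(bytes: []const u8) usize {
    const lanes = 16; // assumed vector width, not taken from the PR
    const Chunk = @Vector(lanes, u8);
    const low: Chunk = @splat(0x20);
    const span: Chunk = @splat(0x7E - 0x20);

    var i: usize = 0;
    // Vector step: skip whole chunks while every lane is printable.
    while (i + lanes <= bytes.len) : (i += lanes) {
        const chunk: Chunk = bytes[i..][0..lanes].*;
        // Wrapping subtraction maps 0x20..0x7E onto 0..0x5E, so a single
        // unsigned comparison checks both bounds.
        const in_range = (chunk -% low) <= span;
        if (!@reduce(.And, in_range)) break;
    }
    // Scalar step: finish the chunk that failed, or the short tail.
    while (i < bytes.len and bytes[i] >= 0x20 and bytes[i] <= 0x7E) : (i += 1) {}
    return i;
}

/// Hypothetical sketch: how many leading bytes may bypass the parser.
fn directEmitLen(bytes: []const u8) usize {
    const run = printableRunLen(bytes);
    if (run == 0 or run == bytes.len) return run;
    // Conservative stand-in for "the next byte begins a combining mark":
    // any non-ASCII follower makes us leave the last ASCII byte to the parser.
    if (bytes[run] >= 0x80) return run - 1;
    return run;
}

test "sketch: run stops at ESC, last byte held back before UTF-8" {
    try std.testing.expectEqual(@as(usize, 5), directEmitLen("hello\x1b[A"));
    // 'e' followed by U+0301 (0xCC 0x81): the 'e' is left for the parser.
    try std.testing.expectEqual(@as(usize, 0), directEmitLen("e\xcc\x81"));
}
```

Checking the follower byte rather than decoding the code point keeps the fast path branch-light, at the cost of occasionally sending one extra ASCII byte through the parser.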

Bench (local, zig build bench, iterations=200, 80x24)

Mixed stream: ASCII + CSI + UTF-8

Baseline ns/frame	SIMD ns/frame	Δ ns/frame	Δ%	Speedup
24,020	5,481	-18,539	-77.2%	4.38x

Improvement: -18,539 ns/frame (-77.2%), 4.38x speedup.
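For readers who want to re-derive the table columns from the two ns/frame measurements, a minimal sketch follows. The `Summary` struct and `summarize` function are hypothetical names used only here; they are not part of the benchmark harness in this PR.

```zig
const std = @import("std");

/// Hypothetical helper type: the three derived columns of the table.
const Summary = struct {
    delta_ns: f64,
    delta_pct: f64,
    speedup: f64,
};

/// Derives delta, percentage change, and speedup from two ns/frame values.
fn summarize(baseline_ns: f64, simd_ns: f64) Summary {
    const delta = simd_ns - baseline_ns;
    return .{
        .delta_ns = delta,
        .delta_pct = 100.0 * delta / baseline_ns,
        .speedup = baseline_ns / simd_ns,
    };
}

test "sketch: figures match the reported table" {
    const s = summarize(24020, 5481);
    try std.testing.expectApproxEqAbs(@as(f64, -18539), s.delta_ns, 0.5);
    try std.testing.expectApproxEqAbs(@as(f64, -77.2), s.delta_pct, 0.05);
    try std.testing.expectApproxEqAbs(@as(f64, 4.38), s.speedup, 0.005);
}
```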

Tests

zig build test
zig build bench

Copilot

Pull request overview

This PR adds a SIMD-accelerated fast path for scanning and processing contiguous runs of printable ASCII characters (0x20-0x7E) in the input loop, bypassing the parser for these common cases to reduce per-byte overhead. The optimization shows a ~4x improvement in the benchmark (22,725 ns/frame → 5,743 ns/frame for mixed input streams).

Key Changes:

Added asciiPrintableRunLen() function that uses SIMD vector operations to scan for printable ASCII runs
Integrated ASCII fast path in the non-Windows input loop to emit key_press events directly for ASCII characters
Added benchmark functions to compare baseline parser performance against the SIMD-optimized path

Reviewed changes

Copilot reviewed 2 out of 2 changed files in this pull request and generated 3 comments.

File	Description
src/Loop.zig	Implements `asciiPrintableRunLen()` SIMD function and integrates ASCII fast path in the input loop before parser invocation
bench/bench.zig	Adds `asciiPrintableRunLen()` function copy and benchmark harnesses (`benchParseStreamBaseline`, `benchParseStreamSimd`) to measure performance improvement

Files commented on in this review: src/Loop.zig, bench/bench.zig

Copilot follow-up reviews

Round	Files reviewed	New comments	Files commented on
2	4 of 4	2	bench/bench.zig, src/ascii.zig
3	4 of 4	1	src/Loop.zig
4	4 of 4	2	src/ascii.zig
5	4 of 4	2	src/ascii.zig
6	4 of 4	0	(none)
Commits and review timeline (December 28, 2025)

Commit caa7637: perf: SIMD scan ASCII runs in input loop
15:25 Copilot AI review requested due to automatic review settings
15:26 Copilot started reviewing on behalf of dyxushuai
Copilot AI reviewed: src/Loop.zig (resolved), src/Loop.zig (outdated, resolved), bench/bench.zig (outdated, resolved)

Commit 72ae497: refactor: share ASCII run scanner
15:36 dyxushuai marked this pull request as draft

Commit 6da8289: fix: refine ASCII run boundary handling
15:41 dyxushuai requested a review from Copilot
15:41 Copilot started reviewing on behalf of dyxushuai
15:44 dyxushuai marked this pull request as ready for review
Copilot AI reviewed: bench/bench.zig (outdated, resolved), src/ascii.zig (resolved)

Commit 109f291: bench: isolate parser instances
16:07 dyxushuai requested a review from Copilot
16:07 Copilot started reviewing on behalf of dyxushuai
Copilot AI reviewed: src/Loop.zig (outdated, resolved)

Commit 4a06cd6: fix: include buffered bytes in read loop
16:19 dyxushuai requested a review from Copilot
16:20 Copilot started reviewing on behalf of dyxushuai
Copilot AI reviewed: src/ascii.zig (resolved), src/ascii.zig (resolved)

Commit c222cf5: test: cover ASCII boundary and incomplete UTF-8 cases
16:37 dyxushuai requested a review from Copilot
16:38 Copilot started reviewing on behalf of dyxushuai
Copilot AI reviewed: src/ascii.zig (resolved), src/ascii.zig (resolved)

Commit d5d295b: fix: treat variation selectors as combining
16:55 dyxushuai requested a review from Copilot
16:56 Copilot started reviewing on behalf of dyxushuai
Copilot AI reviewed

dyxushuai mentioned this pull request Dec 28, 2025

Perf tracking: cumulative impact of optimization PRs #289

Open

4 tasks
