fix: correct Kanji character detection #400

ren1244 · 2025-07-31T04:30:01Z

I encountered the same issue as described in #173.
After reviewing the source code, I found that the problem was caused by an incorrect Unicode range check for Kanji characters.
This pull request fixes that issue.

For example, the input string 兩個黃鸝鳴翠柳 should be segmented as:

Kanji: 兩個
Byte: 黃鸝
Kanji: 鳴翠柳

In the previous version, 黃鸝 was mistakenly classified as Kanji. Neither character is a valid Kanji-mode character, so both must be encoded as Byte.
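
For readers unfamiliar with the check, here is a minimal sketch of the idea. It is not the code from this PR. It assumes that "Kanji" means QR Kanji mode as defined by the QR Code specification: a character qualifies only if its Shift JIS encoding is a two-byte value in 0x8140-0x9FFC or 0xE040-0xEBBF. The names `is_qr_kanji` and `segment` are hypothetical, and Python's built-in `shift_jis` codec stands in for whatever lookup the library actually uses.

```python
def is_qr_kanji(ch: str) -> bool:
    """Return True if a single character fits QR Kanji mode (assumed definition)."""
    try:
        # Assumption: Python's strict shift_jis codec approximates the
        # character set the QR specification expects for Kanji mode.
        raw = ch.encode("shift_jis")
    except UnicodeEncodeError:
        # No Shift JIS mapping at all, so it cannot be Kanji mode.
        return False
    if len(raw) != 2:
        # Single-byte characters (ASCII, half-width kana) are not Kanji mode.
        return False
    value = (raw[0] << 8) | raw[1]
    # The two double-byte ranges that QR Kanji mode can encode.
    return 0x8140 <= value <= 0x9FFC or 0xE040 <= value <= 0xEBBF


def segment(text: str) -> list[tuple[str, str]]:
    """Group consecutive characters into (mode, substring) runs."""
    segments: list[tuple[str, str]] = []
    for ch in text:
        mode = "Kanji" if is_qr_kanji(ch) else "Byte"
        if segments and segments[-1][0] == mode:
            # Extend the current run when the mode does not change.
            segments[-1] = (mode, segments[-1][1] + ch)
        else:
            segments.append((mode, ch))
    return segments


if __name__ == "__main__":
    for mode, chunk in segment("兩個黃鸝鳴翠柳"):
        print(f"{mode}: {chunk}")
```

Deciding by Shift JIS value instead of by Unicode block is the point of the sketch: a broad range such as the CJK Unified Ideographs block also contains characters that have no Shift JIS mapping, which is the kind of misclassification described above.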

fix: correct Kanji character detection

3498e63
