Move accent-insensitive filtering to common #276798

dmitrivMS · 2025-11-11T20:17:21Z

Added new tryNormalizeToBase method to normalization.ts which will perform NFD+accent removal+lower casing and cache the result.

Removed removeAccents since it was not used after this change and it is not safe to use (does not preserve the indices).

Moved accent-insensitive logic to filters.txt by using the new tryNormalizeToBase method.

Replaced filters that match on natural language and used matchesPrefix or matchesContiguousSubString with a single call to matchesBaseContiguousSubString filter which will do both. This does change the order of preferences (contiguous substring will now take precedence other words), but it will make one less filtering call per item.

Updated unit-tests.

Note: The intent of getAlternateCodes is similar to what tryNormalizeToBase is doing, but at the same time building tables for all Unicode accents and combined characters into our code seems impractical, so keep those two separate, although normalization will occur before Korean replacement, so the two will work together in matchesWords case.

vs-code-engineering · 2025-11-11T20:17:54Z

📬 CODENOTIFY

The following users are being notified based on files changed in this PR:

@rzhao271

Matched files:

src/vs/workbench/contrib/preferences/browser/preferencesSearch.ts

Copilot

Pull Request Overview

This PR moves accent-insensitive filtering logic from various files into a centralized tryNormalizeToBase method in normalization.ts. The new method performs NFD normalization, accent removal, and lower casing with caching support, while ensuring string length preservation to maintain index consistency.

Adds new tryNormalizeToBase method that combines NFD normalization, accent removal, and lower casing with LRU caching
Introduces matchesBaseContiguousSubString filter that uses the new normalization
Removes the removeAccents function and custom normalization logic from commandsQuickAccess.ts

Reviewed Changes

Copilot reviewed 8 out of 8 changed files in this pull request and generated 3 comments.

Show a summary per file

File	Description
src/vs/base/common/normalization.ts	Removes `removeAccents`, adds `tryNormalizeToBase` with caching and length-preserving logic
src/vs/base/common/filters.ts	Adds `matchesBaseContiguousSubString` filter, updates `matchesWords` and `matchesContiguousSubString` to use normalization, fixes typo in comment
src/vs/platform/quickinput/browser/commandsQuickAccess.ts	Removes custom normalization logic and unused fields, simplifies filtering to use new `matchesBaseContiguousSubString`
src/vs/workbench/services/preferences/browser/keybindingsEditorModel.ts	Updates wordFilter to use `matchesBaseContiguousSubString` instead of `matchesPrefix` and `matchesContiguousSubString`
src/vs/workbench/contrib/preferences/browser/preferencesSearch.ts	Updates to use `matchesBaseContiguousSubString` for description matching
src/vs/workbench/contrib/chat/browser/chatManagement/chatModelsViewModel.ts	Updates wordFilter to use `matchesBaseContiguousSubString`
src/vs/base/test/common/normalization.test.ts	Updates tests from `removeAccents` to `tryNormalizeToBase`, adds new test cases for case handling
src/vs/base/test/common/filters.test.ts	Adds test coverage for new `matchesBaseContiguousSubString` filter

src/vs/base/common/normalization.ts

src/vs/base/common/filters.ts

Copilot · 2025-11-11T20:30:16Z

@dmitrivMS I've opened a new pull request, #276800, to work on those changes. Once the pull request is ready, I'll request review from you.

Copilot · 2025-11-11T20:31:12Z

@dmitrivMS I've opened a new pull request, #276801, to work on those changes. Once the pull request is ready, I'll request review from you.

TylerLeonhardt

Very nice

Move accent-insensitive filtering to common

864c563

Copilot AI review requested due to automatic review settings November 11, 2025 20:17

dmitrivMS added the debt Code quality issues label Nov 11, 2025

dmitrivMS requested a review from TylerLeonhardt November 11, 2025 20:17

dmitrivMS enabled auto-merge (squash) November 11, 2025 20:17

dmitrivMS self-assigned this Nov 11, 2025

Copilot started reviewing on behalf of dmitrivMS November 11, 2025 20:18 View session

Copilot finished reviewing on behalf of dmitrivMS November 11, 2025 20:18

vs-code-engineering bot added this to the November 2025 milestone Nov 11, 2025

Copilot AI reviewed Nov 11, 2025

View reviewed changes

src/vs/base/common/normalization.ts Outdated Show resolved Hide resolved

src/vs/base/common/normalization.ts Outdated Show resolved Hide resolved

src/vs/base/common/filters.ts Show resolved Hide resolved

Reuse normalizeNFD and its cache.

a2e4bef

This was referenced Nov 11, 2025

[WIP] WIP address feedback on accent-insensitive filtering #276800

Closed

[WIP] WIP address feedback on accent-insensitive filtering changes #276801

Closed

PR feedback

bb465a4

TylerLeonhardt approved these changes Nov 11, 2025

View reviewed changes

dmitrivMS merged commit bd64c17 into main Nov 11, 2025
28 checks passed

dmitrivMS deleted the dev/dmitriv/filtering branch November 11, 2025 23:58

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Move accent-insensitive filtering to common #276798

Move accent-insensitive filtering to common #276798

Uh oh!

dmitrivMS commented Nov 11, 2025 •

edited

Loading

Uh oh!

vs-code-engineering bot commented Nov 11, 2025 •

edited

Loading

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Copilot AI commented Nov 11, 2025

Uh oh!

Copilot AI commented Nov 11, 2025

Uh oh!

TylerLeonhardt left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Move accent-insensitive filtering to common #276798

Move accent-insensitive filtering to common #276798

Uh oh!

Conversation

dmitrivMS commented Nov 11, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

vs-code-engineering bot commented Nov 11, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

📬 CODENOTIFY

@rzhao271

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull Request Overview

Reviewed Changes

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Copilot AI commented Nov 11, 2025

Uh oh!

Copilot AI commented Nov 11, 2025

Uh oh!

TylerLeonhardt left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

dmitrivMS commented Nov 11, 2025 •

edited

Loading

vs-code-engineering bot commented Nov 11, 2025 •

edited

Loading