
Conversation

Copilot AI (Contributor) commented Jun 14, 2025

This PR implements the missing quantize_per_channel and dequantize_per_channel operations in the torchlib quantized_decomposed module.

Changes

Added two new functions to onnxscript/function_libs/torch_lib/ops/quantized_decomposed.py:

quantized_decomposed_quantize_per_channel

  • Implements per-channel quantization using ONNX QuantizeLinear with per-axis support
  • Takes tensor inputs for scales and zero_points (one value per channel)
  • Supports axis parameter to specify the quantization dimension
  • Uses ONNX opset23 for per-axis quantization capabilities
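
A minimal sketch of what this lowering can look like (not the PR's actual diff; it assumes onnxscript exposes opset23 as described and omits the decorator and type annotations):

```python
# Hedged sketch: per-channel quantization lowered to ONNX QuantizeLinear,
# whose `axis` attribute selects the channel dimension.
from onnxscript import opset23 as op


def quantize_per_channel_sketch(input, scales, zero_points, axis: int):
    # One scale/zero_point per channel along `axis`; QuantizeLinear
    # broadcasts them over the remaining dimensions and infers the
    # quantized output type from the dtype of zero_points.
    return op.QuantizeLinear(input, scales, zero_points, axis=axis)
```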

quantized_decomposed_dequantize_per_channel

  • Implements per-channel dequantization using ONNX DequantizeLinear with per-axis support
  • Takes tensor inputs for scales and optional zero_points
  • zero_points parameter is Optional[TensorType] matching PyTorch reference
  • Supports both default output type and explicit output_dtype parameter
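
A corresponding sketch for the dequantize direction, under the same opset assumption; ONNX's DequantizeLinear makes its zero_point input optional, which is what lets zero_points be Optional[TensorType] here:

```python
# Hedged sketch, not the PR code: per-channel dequantization via
# DequantizeLinear's per-axis form.
from onnxscript import opset23 as op


def dequantize_per_channel_sketch(input, scales, zero_points=None, axis: int = 0):
    if zero_points is None:
        # Omitting zero_point in ONNX is equivalent to an all-zeros zero point.
        return op.DequantizeLinear(input, scales, axis=axis)
    return op.DequantizeLinear(input, scales, zero_points, axis=axis)
```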

Implementation Details

Both functions:

  • Follow the existing code patterns using @torch_op decorator with trace_only=True
  • Have function signatures that match the PyTorch reference implementations from torch.ao.quantization.fx._decomposed
  • Use ONNX opset23 to leverage the axis and output_dtype parameters for per-axis quantization
  • Are properly registered and accessible as TracedOnnxFunctions

The implementation leverages ONNX's native per-axis quantization support rather than implementing the tensor manipulation logic from the PyTorch reference, making it more efficient and aligned with ONNX best practices.
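
For intuition, per-axis quantization applies one (scale, zero_point) pair to each slice along the chosen axis. A small numpy illustration of the underlying math (not from the PR):

```python
import numpy as np

x = np.array([[0.0, 1.0], [0.0, 10.0]], dtype=np.float32)
scales = np.array([0.01, 0.1], dtype=np.float32)  # one scale per row (axis=0)
zero_points = np.array([0, 0], dtype=np.int8)     # one zero point per row

# Per-axis quantization: each row uses its own scale/zero_point, clamped
# to the int8 range - the same math QuantizeLinear performs natively.
q = np.clip(np.round(x / scales[:, None]) + zero_points[:, None], -128, 127)
print(q.astype(np.int8))  # [[0 100], [0 100]]
```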

Testing

Validated that:

  • Functions import correctly and have proper signatures
  • Function schemas are correctly generated with appropriate parameter types
  • No existing functionality is broken (per-tensor functions still work)
  • Functions can be used to build ONNX models
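
A smoke test along these lines might look as follows; the module path comes from the PR description, but the attribute access is an assumption based on the function names given above:

```python
# Hypothetical import check for the new per-channel functions.
from onnxscript.function_libs.torch_lib.ops import quantized_decomposed

fn = quantized_decomposed.quantized_decomposed_quantize_per_channel
print(type(fn).__name__)  # expected: TracedOnnxFunction, per the notes above
```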

Fixes #2389.



Co-authored-by: justinchuby <11205048+justinchuby@users.noreply.github.com>
Copilot AI changed the title from "[WIP] [torchlib] Implement quantize_per_channel and dequantize_per_channel" to "[torchlib] Implement quantize_per_channel and dequantize_per_channel" Jun 14, 2025
Copilot AI requested a review from justinchuby June 14, 2025 15:50
Collaborator commented on lines +81 to +82:

    quant_min: int,
    quant_max: int,

Are these unused? Why is that?

Copilot AI (Contributor, Author) replied:

These parameters are unused because the ONNX QuantizeLinear/DequantizeLinear operators don't take explicit quant_min/quant_max inputs; they derive the quantization range from the output data type and the quantization parameters. The parameters are kept in the function signature for API compatibility with PyTorch's reference implementation, following the same pattern as the existing per-tensor functions above.
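
In other words, the PyTorch reference clamps explicitly while ONNX clamps implicitly via the target dtype. A paraphrase of the reference math (not the PR code):

```python
import numpy as np

def reference_quantize(x, scale, zero_point, quant_min, quant_max):
    # Per-element math of the PyTorch reference implementation (paraphrased).
    return np.clip(np.round(x / scale) + zero_point, quant_min, quant_max)

# QuantizeLinear targeting int8 implicitly clamps to [-128, 127], which is
# exactly quant_min/quant_max for torch.int8 - so the explicit bounds are
# redundant in the ONNX lowering.
```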

justinchuby marked this pull request as ready for review June 14, 2025 16:10
codecov bot commented Jun 14, 2025

Codecov Report

❌ Patch coverage is 54.54545% with 5 lines in your changes missing coverage. Please review.
✅ Project coverage is 70.09%. Comparing base (43b1b74) to head (973e372).
✅ All tests successful. No failed tests found.

Files with missing lines                                 Patch %   Lines
...unction_libs/torch_lib/ops/quantized_decomposed.py    54.54%    5 Missing ⚠️
Additional details and impacted files
@@            Coverage Diff             @@
##             main    #2390      +/-   ##
==========================================
- Coverage   70.09%   70.09%   -0.01%     
==========================================
  Files         228      228              
  Lines       27393    27404      +11     
  Branches     2781     2782       +1     
==========================================
+ Hits        19202    19208       +6     
- Misses       7234     7239       +5     
  Partials      957      957              

☔ View full report in Codecov by Sentry.

justinchuby (Collaborator) commented:

Please format the code using lintrunner. You can do

pip install --upgrade lintrunner lintrunner-adapters
lintrunner init

then

lintrunner f --all-files

Copilot AI (Contributor, Author) commented Jun 18, 2025

Please format the code using lintrunner. You can do

...

Done! Code has been formatted using lintrunner. The formatting fixed import sorting, whitespace issues, and line wrapping. Commit: 82c8f9e

Copilot AI requested a review from justinchuby June 18, 2025 00:28
justinchuby requested a review from xadupre December 30, 2025 06:16
