Feat/quant/per block #2849

laggui · 2025-02-27T14:29:36Z

Pull Request Template

Checklist

Confirmed that run-checks all script has been executed.
Made sure the book is up to date with changes in this PR.

Changes

More quantization granularity!

Refactored QuantizationScheme enum
Changed Calibration to an enum
Added per-block quantization
- Flat: linear segments (implemented for ndarray and cubecl backends)
- Grid: m x n blocks (ndarray only via QuantizationStrategy)
- Quantization parameters are stored as [offset_1, offset_2, ..., offset_num_blocks, scale_1, scale_2, ..., scale_num_blocks] (with offsets being optional)

Test utils:

Added #[might_panic] test attribute (for ops configuration that are not strictly required, e.g. different quantization schemes)

For the CI:

Disabled incremental compilation for the test profile (reduces total artifact sizes quite significantly, finally fixing the intermittent No space left on device issues).

Testing

Unit tests for new schemes

codecov · 2025-02-27T14:51:47Z

Codecov Report

Attention: Patch coverage is 83.27781% with 251 lines in your changes missing coverage. Please review.

Project coverage is 82.29%. Comparing base (17d9753) to head (24d5857).
Report is 6 commits behind head on main.

Files with missing lines	Patch %	Lines
...es/burn-cubecl/src/kernel/quantization/quantize.rs	67.16%	88 Missing ⚠️
.../burn-cubecl/src/kernel/quantization/dequantize.rs	60.95%	41 Missing ⚠️
crates/burn-tch/src/ops/qtensor.rs	0.00%	38 Missing ⚠️
...tes/burn-cubecl/src/kernel/quantization/qtensor.rs	34.54%	36 Missing ⚠️
crates/burn-tch/src/tensor.rs	0.00%	11 Missing ⚠️
...es/burn-tensor/src/tensor/quantization/strategy.rs	96.20%	11 Missing ⚠️
crates/burn-tensor-testgen/src/lib.rs	88.88%	9 Missing ⚠️
...ates/burn-tensor/src/tensor/quantization/scheme.rs	94.54%	6 Missing ⚠️
crates/burn-tensor/src/tensor/element/base.rs	0.00%	3 Missing ⚠️
crates/burn-tensor/src/tensor/data.rs	88.23%	2 Missing ⚠️
... and 4 more

Additional details and impacted files

@@            Coverage Diff             @@
##             main    #2849      +/-   ##
==========================================
+ Coverage   82.18%   82.29%   +0.11%     
==========================================
  Files         854      861       +7     
  Lines      114059   116887    +2828     
==========================================
+ Hits        93734    96194    +2460     
- Misses      20325    20693     +368

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

laggui added 15 commits February 27, 2025 09:19

Refactor scheme to have quantization mode + add per-block variant

c51e348

Add per-block calibration range

aa8bc1b

Add per-block qparams compute

cb1d36d

Update book

1744e15

Add per-block quantization strategy

2f3c80d

Working per-block tests

fb029b5

Remove dead code

a833a49

Quantize works

39c937c

Dequantize works

fc2c005

TODO

b1a3190

Add test with block_size > line_size

90bc2ee

Clean up

1a4bd87

Fix clippy + typos

2fe9015

Fix no-std

008a1d0

Add might_panic test attribute

ab9114f

laggui added 3 commits February 27, 2025 11:13

Feature gate might_panic for no-std

1bcd205

Add from/to data tests + make panic reason more specific

a6790c3

We forgot default std

55d62ef

laggui force-pushed the feat/quant/per-block branch 2 times, most recently from 4a083fa to 08a4b7f Compare February 28, 2025 17:10

laggui added 2 commits February 28, 2025 12:16

Merge branch 'main' into feat/quant/per-block

8097fe8

WIP macos debug

a7bf68b

laggui force-pushed the feat/quant/per-block branch from 08a4b7f to a7bf68b Compare February 28, 2025 17:18

laggui added 6 commits February 28, 2025 12:50

More debug

3cc94ff

Debug

d50bff5

Fix precision issues

8ab78f8

Debug ci

fe0002d

More ci

d44dc0a

Clean up ci

eda3e6a

laggui added 2 commits February 28, 2025 15:20

Remove todo comment

a40340a

Remove comment

24d5857

laggui marked this pull request as ready for review February 28, 2025 20:42

laggui requested a review from nathanielsimard February 28, 2025 20:42

nathanielsimard approved these changes Mar 3, 2025

View reviewed changes

laggui merged commit a6b5210 into main Mar 3, 2025
11 checks passed

laggui deleted the feat/quant/per-block branch March 3, 2025 17:59

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Feat/quant/per block #2849

Feat/quant/per block #2849

laggui commented Feb 27, 2025 •

edited

Loading

codecov bot commented Feb 27, 2025 •

edited

Loading

Feat/quant/per block #2849

Feat/quant/per block #2849

Conversation

laggui commented Feb 27, 2025 • edited Loading

Pull Request Template

Checklist

Changes

Testing

codecov bot commented Feb 27, 2025 • edited Loading

Codecov Report

laggui commented Feb 27, 2025 •

edited

Loading

codecov bot commented Feb 27, 2025 •

edited

Loading