Add tensor quantization #1963

laggui · 2024-07-03T17:03:09Z

Checklist

Confirmed that run-checks all script has been executed.
Made sure the book is up to date with changes in this PR.

Related Issues/PRs

Progress towards #464

Changes

Support for static per-tensor quantization.

Added new DType::QFloat
Added backend trait associative type QuantizedTensorPrimitive
Added QTensorOps for quantize/dequantize ops
Added TensorPrimitive enum for float tensors to include float and qfloat (quantized type)
- All existing float ops now automatically retrieve the .tensor() (dequantized) before calling the backend implementation on the FloatTensorPrimitive
Added QuantizationStrategy enum with per-tensor affine and symmetric int8 quantization

Note: QAT support should be added in a future PR

Testing

Added unit tests for affine and symmetric per-tensor quantization

codecov · 2024-07-03T19:15:11Z

Codecov Report

Attention: Patch coverage is 69.59707% with 249 lines in your changes missing coverage. Please review.

Project coverage is 85.15%. Comparing base (1ad2a63) to head (1a224c2).

Files	Patch %	Lines
crates/burn-tensor/src/tensor/data.rs	9.75%	37 Missing ⚠️
...es/burn-tensor/src/tensor/quantization_strategy.rs	75.00%	33 Missing ⚠️
crates/burn-ndarray/src/ops/qtensor.rs	0.00%	29 Missing ⚠️
crates/burn-tch/src/ops/qtensor.rs	0.00%	29 Missing ⚠️
crates/burn-tensor/src/tensor/api/autodiff.rs	45.71%	19 Missing ⚠️
crates/burn-autodiff/src/ops/qtensor.rs	0.00%	16 Missing ⚠️
crates/burn-candle/src/ops/qtensor.rs	0.00%	16 Missing ⚠️
crates/burn-fusion/src/ops/qtensor.rs	0.00%	16 Missing ⚠️
crates/burn-jit/src/ops/qtensor.rs	0.00%	16 Missing ⚠️
crates/burn-autodiff/src/backend.rs	0.00%	10 Missing ⚠️
... and 8 more

Additional details and impacted files

@@            Coverage Diff             @@
##             main    #1963      +/-   ##
==========================================
- Coverage   85.29%   85.15%   -0.15%     
==========================================
  Files         798      804       +6     
  Lines       95512    96050     +538     
==========================================
+ Hits        81471    81788     +317     
- Misses      14041    14262     +221

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

nathanielsimard

Some comments, but LGTM great job!

nathanielsimard · 2024-07-04T18:02:49Z

crates/burn-tensor/src/tensor/api/autodiff.rs

+        TensorPrimitive::Float(B::inner(tensor.tensor()))
    }

    fn from_inner<const D: usize>(
        inner: <Self::InnerKind as TensorKind<<B as AutodiffBackend>::InnerBackend>>::Primitive<D>,
    ) -> <Self as TensorKind<B>>::Primitive<D> {
-        B::from_inner(inner)
+        TensorPrimitive::Float(B::from_inner(inner.tensor()))


We will need to implement a q_inner and q_from_inner at some point.

Oh, this should be fairly straightforward. I'll add it.

crates/burn-tensor/src/tensor/api/float.rs

laggui · 2024-07-04T18:33:34Z

Not sure what's up with the wgpu tests, they seem to be failing intermittently even on main

error: test failed, to rerun pass `-p burn-wgpu --lib`
  
  Caused by:
    process didn't exit successfully: `/home/runner/work/burn/burn/target/debug/deps/burn_wgpu-c73d9b81b38abf9c --color=always` (signal: 11, SIGSEGV: invalid memory reference)

nathanielsimard · 2024-07-04T19:27:54Z

Not sure what's up with the wgpu tests, they seem to be failing intermittently even on main

error: test failed, to rerun pass `-p burn-wgpu --lib`
  
  Caused by:
    process didn't exit successfully: `/home/runner/work/burn/burn/target/debug/deps/burn_wgpu-c73d9b81b38abf9c --color=always` (signal: 11, SIGSEGV: invalid memory reference)

Working on it

…et attributes

laggui marked this pull request as ready for review July 3, 2024 21:53

laggui requested a review from nathanielsimard July 3, 2024 21:53

nathanielsimard approved these changes Jul 4, 2024

View reviewed changes

laggui added 12 commits July 5, 2024 10:16

Add QuantizationBackend, QTensorOps and QTensor

8cbc4a4

Refactor QTensorOps as part of Backend trait

eead11b

Add tensor dequantize, QFloat dtype and default affine/symmetric quant

dbaf335

Add ndarray default quantization implementation

72b6bfd

Fix clippy

97fd335

Add rayon parallel iter

ce305b6

Add quantization operations to book

494e184

Add q_shape and q_device ops to avoid converting the tensor just to g…

52810e9

…et attributes

Implement autodiff grad ops

c56c319

Mark autodiff todo for QAT

32878d5

Remove note

faf8e51

Add q_inner and q_from_inner

1a224c2

laggui force-pushed the feat/quant/tensor branch from 84d8ee7 to 1a224c2 Compare July 5, 2024 14:18

Merge branch 'main' into feat/quant/tensor

3ecf185

laggui merged commit c0211e2 into main Jul 8, 2024
12 checks passed

laggui deleted the feat/quant/tensor branch July 8, 2024 14:17

thedevleon mentioned this pull request Jan 3, 2025

4 bit / 8 bit model training / inference capabilities #464

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add tensor quantization #1963

Add tensor quantization #1963

Uh oh!

laggui commented Jul 3, 2024 •

edited

Loading

Uh oh!

codecov bot commented Jul 3, 2024 •

edited

Loading

Uh oh!

nathanielsimard left a comment

Uh oh!

nathanielsimard Jul 4, 2024

Uh oh!

laggui Jul 4, 2024

Uh oh!

Uh oh!

laggui commented Jul 4, 2024

Uh oh!

nathanielsimard commented Jul 4, 2024

Uh oh!

Uh oh!

Uh oh!

Add tensor quantization #1963

Add tensor quantization #1963

Uh oh!

Conversation

laggui commented Jul 3, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Checklist

Related Issues/PRs

Changes

Testing

Uh oh!

codecov bot commented Jul 3, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

nathanielsimard left a comment

Choose a reason for hiding this comment

Uh oh!

nathanielsimard Jul 4, 2024

Choose a reason for hiding this comment

Uh oh!

laggui Jul 4, 2024

Choose a reason for hiding this comment

Uh oh!

Uh oh!

laggui commented Jul 4, 2024

Uh oh!

nathanielsimard commented Jul 4, 2024

Uh oh!

Uh oh!

Uh oh!

laggui commented Jul 3, 2024 •

edited

Loading

codecov bot commented Jul 3, 2024 •

edited

Loading