[Perf] Convolution migration to NHWC #3090

wingertge · 2025-04-27T15:38:41Z

Pull Request Template

Checklist

Confirmed that run-checks all script has been executed.
Made sure the book is up to date with changes in this PR.

Related Issues/PRs

Requires tracel-ai/cubecl#646 to get merged first

Changes

Reworks im2col and direct convolution to use NHWC layout, making the 2D forward pass completely NHWC. This saves unnecessary transpositions and is a prerequisite for eventually moving weights to NHWC overall. It also improves performance.

Testing

All tests pass with both the specific algorithms and autotune enabled.

codecov · 2025-04-28T18:53:39Z

Codecov Report

Attention: Patch coverage is 43.67816% with 196 lines in your changes missing coverage. Please review.

Project coverage is 81.06%. Comparing base (4360f36) to head (45a4393).
Report is 1 commits behind head on main.

Files with missing lines	Patch %	Lines
...rates/burn-cubecl/src/kernel/conv/conv2d/im2col.rs	38.55%	102 Missing ⚠️
...rates/burn-cubecl/src/kernel/conv/conv2d/direct.rs	46.08%	62 Missing ⚠️
crates/burn-cubecl/src/kernel/contiguous.rs	26.31%	14 Missing ⚠️
crates/burn-cubecl/src/kernel/utils.rs	60.00%	10 Missing ⚠️
crates/burn-cubecl-fusion/src/shared/kernel.rs	0.00%	4 Missing ⚠️
crates/burn-cubecl-fusion/src/shared/io.rs	0.00%	2 Missing ⚠️
crates/burn-cubecl/src/kernel/conv/conv2d/base.rs	80.00%	1 Missing ⚠️
...ecl/src/kernel/conv/conv2d/implicit_gemm/launch.rs	66.66%	1 Missing ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##             main    #3090      +/-   ##
==========================================
- Coverage   81.08%   81.06%   -0.03%     
==========================================
  Files         817      817              
  Lines      117326   117408      +82     
==========================================
+ Hits        95131    95173      +42     
- Misses      22195    22235      +40

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

laggui

Ran the tests locally for cuda and wgpu/vulkan. Got a couple of failures on vulkan with f16 for convolutions, but it looks like they are also failing on main. So something broke since the 0.17 release.

With the failing tests I found a print statement for into data but that will be fixed in #3114.

Also linux-std runner keeps running out of disk space with all the updates we're doing recently. I think I'm just gonna disable the caching on this runner, more of an annoyance than anything else!

Sorry for the rant 😅 just happened to stumble upon multiple issues while testing the changes..

TL;DR: this PR looks good to me!

wingertge added 10 commits April 24, 2025 21:22

Refactor interpolate to use NHWC and fix OOB issue

5011d57

Fix slice negative range

f83ccc5

Merge branch 'main' into perf/interpolate

7ac435c

WIP

60d1f3d

Merge branch 'main' into perf/conv

83d5068

Merge branch 'main' into perf/conv

1c050ef

Migrate im2col to NHWC

c8d44a3

Remove groups support from im2col

d572d2c

Update cubecl and fixes

8d9c017

Migrate direct conv2d to NHWC

6c9bc70

wingertge marked this pull request as draft April 27, 2025 15:39

wingertge added 3 commits April 28, 2025 13:41

Fix strides for sliced batches

56c1582

Update cubecl rev

e8fb0c2

lockfile

ccc4ae2

wingertge marked this pull request as ready for review April 28, 2025 14:32

wingertge added 2 commits April 28, 2025 20:17

Merge remote-tracking branch 'upstream/main' into perf/conv

bd51ae4

Revert debug change

4edcbb0

wingertge added 2 commits April 28, 2025 20:57

Clippy

a6d0205

Fix contiguity check for weight

2383b7e

laggui self-requested a review April 29, 2025 11:40

Merge branch 'main' into perf/conv

45a4393

laggui approved these changes Apr 29, 2025

View reviewed changes

laggui merged commit ceab6d4 into tracel-ai:main Apr 29, 2025
26 of 29 checks passed

wingertge deleted the perf/conv branch April 29, 2025 17:12

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[Perf] Convolution migration to NHWC #3090

[Perf] Convolution migration to NHWC #3090

Uh oh!

wingertge commented Apr 27, 2025

Uh oh!

codecov bot commented Apr 28, 2025 •

edited

Loading

Uh oh!

laggui left a comment

Uh oh!

Uh oh!

Uh oh!

[Perf] Convolution migration to NHWC #3090

[Perf] Convolution migration to NHWC #3090

Uh oh!

Conversation

wingertge commented Apr 27, 2025

Pull Request Template

Checklist

Related Issues/PRs

Changes

Testing

Uh oh!

codecov bot commented Apr 28, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

laggui left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

codecov bot commented Apr 28, 2025 •

edited

Loading