-
Notifications
You must be signed in to change notification settings - Fork 3.6k
[RELAX][PASS] Annotate Custom Scope layout pass for Adreno GPU #17599
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: main
Are you sure you want to change the base?
Conversation
fe15d5b
to
1549733
Compare
@tvm-bot rerun |
@Hzfengsy do you mind take a look given it touches FuseOps/TIR |
also cc @yongwww for memory scope related changes |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
some initial comments
@tvm-bot rerun |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM
Thanks @srkreddy1238 for updates. I take a closer look and now understands the motivation behind However, attaching op attributes into the I also now understand the demand is to enable the finally fused I think it is more cleaner to try a different approach. Instead of relying on legalize pass, let us introduce an adreno specific Now the remaining question is where the schedule can appear
To further enable fusion, one can try adopt the following customized legalize sequence |
Off late realized, I could have drafted an RFC to describe the approach. Have done now https://discuss.tvm.apache.org/t/rfc-annotate-custom-scope-layout-relax-pass-for-adreno-gpu/18052 @tqchen thanks for the thoughts. Few concerns I have in this approach
Pls advice. |
c1b22d5
to
57e46e0
Compare
@tvm-bot rerun |
294b7a6
to
3b402e9
Compare
@tqchen can you take a look at this ? |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks @srkreddy1238 , sorry for the delayed review. I think we are getting close. The main comments are related to testcases and group some of the adreno specific passes to relax/backend
3b402e9
to
68b615b
Compare
@tvm-bot rerun |
Adreno transforms under relax.backend.adreno.tansforms across cpp and python.
9479ee9
to
a43d9f7
Compare
a43d9f7
to
f1d6847
Compare
Refer
https://discuss.tvm.apache.org/t/rfc-annotate-custom-scope-layout-relax-pass-for-adreno-gpu/18052/6
for details about texture scope handling.