graph: sdpa: support dropout seed/offset/prob in fused sdpa#4961
Conversation
Noticed a correctness issue via benchdnn. Debugging...
# with fused sdpa kernel
$ _ONEDNN_GRAPH_SDPA_FORCE_PRIMITIVE=0 ./tests/benchdnn/benchdnn --graph --engine=gpu --case=complex_fusion/mha/gqa-plain-training-fwd-w-dropout-bf16-f32.json
[COMPARE_STATS][DST]: trh=0 err_max_diff: 2.01562 err_max_rdiff:8.37618e+37 all_max_diff: 2.01562 all_max_rdiff:8.37618e+37
[COMPARE_STATS] Norm check is prohibited; error_to_total_ratio: 233469/262144; allowed_ratio: 256/262144;
Error: Function 'doit' at (/nfs/pdx/disks/hal9000/lvtao/oneDNN/tests/benchdnn/graph/graph.cpp:787) returned '1'
0:FAILED (errors:233469 total:262144) (3079 ms) __REPRO: --graph --engine=gpu --case=complex_fusion/mha/gqa-plain-training-fwd-w-dropout-bf16-f32.json
===========================================================
= Failed cases summary (--summary=no-failures to disable) =
===========================================================
0:FAILED (errors:233469 total:262144) (3079 ms) __REPRO: --graph --engine=gpu --case=complex_fusion/mha/gqa-plain-training-fwd-w-dropout-bf16-f32.json
============================
tests:1 passed:0 skipped:0 mistrusted:0 unimplemented:0 invalid_arguments:0 failed:1 listed:0
total: 3.09s; create_pd: 0.09s (3%); create_prim: 0.96s (31%); fill: 0.00s (0%); execute: 0.01s (0%); compute_ref: 0.00s (0%); compare: 0.00s (0%);
# with primitive based kernel
$ _ONEDNN_GRAPH_SDPA_FORCE_PRIMITIVE=1 ./tests/benchdnn/benchdnn --graph --engine=gpu --case=complex_fusion/mha/gqa-plain-training-fwd-w-dropout-bf16-f32.json
0:PASSED (10122 ms) __REPRO: --graph --engine=gpu --case=complex_fusion/mha/gqa-plain-training-fwd-w-dropout-bf16-f32.json
tests:1 passed:1 skipped:0 mistrusted:0 unimplemented:0 invalid_arguments:0 failed:0 listed:0
total: 10.12s; create_pd: 0.07s (1%); create_prim: 0.67s (7%); fill: 0.00s (0%); execute: 0.00s (0%); compute_ref: 0.00s (0%); compare: 0.00s (0%);
make test
make test
We are not enabling mask here? Also, we will need a backport to the v3.12 branch as well.
Dropout mask output is not required for SDPA training in PyTorch.
make test |
dzarukin left a comment:
(Minor) It looks to me that if the output_mask from dropout is requested, the pattern won't be picked up. If that's the case, it would probably be good to reflect that in the documentation and/or a code comment. If this is a false impression, then OK.
make test
For SDPA forward with dropout seed/offset/prob.
SDPA backward will be fixed later. Update: SDPA backward is also fixed.
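For context on why seed/offset/prob inputs are enough (and why no dropout mask output is needed, per the discussion above): a counter-based dropout can regenerate the same mask deterministically from the seed and offset, so forward and backward agree without materializing the mask. The sketch below is purely illustrative, NOT oneDNN's actual kernel; the function name and the hash-based RNG are hypothetical stand-ins for a Philox-style counter-based generator.

```python
# Illustrative, stdlib-only sketch of stateless dropout driven by
# (seed, offset, prob). This is NOT oneDNN's implementation; the hash
# construction is a hypothetical stand-in for a counter-based RNG.
import hashlib
import struct

def stateless_dropout(x, prob, seed, offset):
    """Drop elements of x with probability `prob`, deterministically.

    The per-element random draw depends only on (seed, offset, index),
    so re-running with the same seed/offset reproduces the same mask --
    which is why the mask itself never needs to be an output.
    """
    if prob <= 0.0:
        return list(x)
    scale = 1.0 / (1.0 - prob)  # inverted-dropout scaling of kept values
    out = []
    for i, v in enumerate(x):
        # Derive a uniform value in [0, 1) from (seed, offset, i).
        h = hashlib.sha256(struct.pack("<QQQ", seed, offset, i)).digest()
        u = int.from_bytes(h[:8], "little") / 2**64
        out.append(0.0 if u < prob else v * scale)
    return out

x = [1.0, 2.0, 3.0, 4.0]
a = stateless_dropout(x, 0.5, seed=42, offset=0)
b = stateless_dropout(x, 0.5, seed=42, offset=0)
assert a == b  # same seed/offset -> identical mask, no mask tensor stored
```

The same property is what a fused SDPA backward relies on: it replays the dropout mask from seed/offset instead of reading a saved mask tensor.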