Commit 4506142

[GPU] New SDPA approach for 1st token (openvinotoolkit#25316)
### Details:
- This change improves the SDPA implementation for 1st token processing, primarily when head_size is small (i.e., head_size < 128). It also reduces memory consumption in all cases.
1 parent 9833e84 commit 4506142

4 files changed: 522 additions and 337 deletions
