-
Notifications
You must be signed in to change notification settings - Fork 65
Pull requests: intel/xFasterTransformer
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[Eval] Add eval test with opencompass.
benchmark
performance or accuracy benchmark
enhancement
New feature or request
[Kernel] Add dynamic onednn matmul.
performance
performance related.
#425
opened May 28, 2024 by
changqi1
•
Review required
[Layers] Increased the threshold for enabling flashAttn
performance
performance related.
#428
opened Jun 3, 2024 by
abenmao
•
Review required
Add env param KV_CACHE_LOCATION to control kv cache memory numanode location
#462
opened Jun 28, 2024 by
a3213105
•
Review required
add bf16_int8 support for invokeLayerLLaMA API
#470
opened Jul 22, 2024 by
miaojinc
•
Review required
ProTip!
no:milestone will show everything without a milestone.