Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

update fp8 implementation, design and implement save&load #1605

Merged
merged 22 commits into from
Feb 27, 2024
Merged

Conversation

xin3he
Copy link
Contributor

@xin3he xin3he commented Feb 5, 2024

Type of Change

feature

Description

update implementation and example for habana fp8.

  • refine example for less memory & better performance
  • remove workaround for old habana conda env
  • implement save & load interface

Expected Behavior & Potential Risk

fp8 inference with lower memory usage and better performance.

How has this PR been tested?

local tested

@xin3he xin3he marked this pull request as draft February 5, 2024 03:50
@xin3he xin3he added WIP PyTorch Related to PyTorch F/W labels Feb 5, 2024
@xin3he xin3he changed the title update fp8 implementation update fp8 implementation, design and implement save&load Feb 20, 2024
@xin3he xin3he marked this pull request as ready for review February 20, 2024 07:11
Signed-off-by: xinhe3 <xinhe3@habana.ai>
pre-commit-ci bot and others added 3 commits February 20, 2024 07:18
Signed-off-by: xinhe3 <xinhe3@habana.ai>
Signed-off-by: xinhe3 <xinhe3@habana.ai>
xinhe3 and others added 5 commits February 23, 2024 11:01
Signed-off-by: xinhe3 <xinhe3@habana.ai>
Signed-off-by: xinhe3 <xinhe3@habana.ai>
Signed-off-by: xinhe3 <xinhe3@habana.ai>
Signed-off-by: xinhe3 <xinhe3@habana.ai>
Signed-off-by: xinhe3 <xinhe3@habana.ai>
pre-commit-ci bot and others added 2 commits February 25, 2024 05:49
Signed-off-by: xin3he <xin3.he@intel.com>
@xin3he xin3he requested a review from yiliu30 February 26, 2024 02:10
xin3he and others added 7 commits February 26, 2024 10:33
Signed-off-by: xin3he <xin3.he@intel.com>
Co-authored-by: Yi30 <106061964+yiliu30@users.noreply.github.com>
Signed-off-by: xin3he <xin3.he@intel.com>
Signed-off-by: xin3he <xin3.he@intel.com>
…nto habana/fp8

Signed-off-by: xin3he <xin3.he@intel.com>
Signed-off-by: xin3he <xin3.he@intel.com>
Signed-off-by: xin3he <xin3.he@intel.com>
Signed-off-by: xin3he <xin3.he@intel.com>
Signed-off-by: xin3he <xin3.he@intel.com>
@chensuyue chensuyue added INC3.X and removed WIP labels Feb 27, 2024
@chensuyue chensuyue merged commit f812e67 into master Feb 27, 2024
60 of 68 checks passed
@chensuyue chensuyue deleted the habana/fp8 branch February 27, 2024 14:14
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
INC3.X PyTorch Related to PyTorch F/W
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants