Add blip-2 to bettertransformer #1125
Conversation
The documentation is not available anymore as the PR was closed or merged.
LGTM, thank you! Could you just resolve the conflict?
We should definitely have a nice table in the docs with the speedups (if there are any; I suspect that for some architectures/settings the speedup is not necessarily large).
@fxmarty yes, I agree such a comparison would be nice! If there are already benchmarks, I would be interested in working on this. I also thought about adding tests that assert the speedups, but if the speeds fluctuate, that could make the CI flaky.
There are some scripts that we used for blog posts, but we did not put the results in the documentation itself: https://github.com/huggingface/optimum/tree/main/tests/benchmark The encoder implementation may need to be revamped soon though, as we currently error out when encoders are used for training, while there is no longer any real reason to.
@baskrahmer thank you very much for contributing BLIP-2 support! Also, does it show any improvement in inference speed on a T4 GPU? I could not get any during my experiments; maybe I've done something wrong.
Hey @kirillsemenov1314 :)
I believe this statement no longer holds. The BetterTransformer implementation of the T5 layer is found in this file, so I suggest going through it if you're interested in the details.
This is an interesting topic. AFAIK, active work is being done on this tool, which can also be used to run benchmarks on BetterTransformer architectures. Inference speed is influenced by a variety of factors, such as the model, dataset, and hardware. It can thus very well be that there is no significant speedup from BetterTransformer in your case, and that does not necessarily imply you are doing something wrong.
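For reference, here is a minimal timing sketch for checking whether the conversion helps in your own setting; the helper name and iteration counts are illustrative and this is not one of the benchmark scripts linked above:

```python
import time
import torch

def time_forward(model, inputs, n_warmup=3, n_iters=10):
    # Warm up first to avoid measuring one-time kernel / cache costs.
    with torch.inference_mode():
        for _ in range(n_warmup):
            model(**inputs)
        if torch.cuda.is_available():
            torch.cuda.synchronize()
        start = time.perf_counter()
        for _ in range(n_iters):
            model(**inputs)
        if torch.cuda.is_available():
            torch.cuda.synchronize()
    # Average latency per forward pass, in seconds.
    return (time.perf_counter() - start) / n_iters

# Run this on the same inputs with the original model and the converted
# model, and compare the two averages.
```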
What does this PR do?
Add BLIP-2 to the BetterTransformer API.
Part of #1056
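As a rough usage sketch of what this PR enables (the checkpoint name is illustrative, and loading in float16 is just one reasonable choice for GPU inference):

```python
import torch
from transformers import Blip2ForConditionalGeneration
from optimum.bettertransformer import BetterTransformer

# Load a BLIP-2 checkpoint from the Hub.
model = Blip2ForConditionalGeneration.from_pretrained(
    "Salesforce/blip2-opt-2.7b", torch_dtype=torch.float16
)

# Swap supported attention layers for their BetterTransformer equivalents.
model = BetterTransformer.transform(model)
```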
Before submitting