Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

how about the speed of depth convolution on cpu VS original caffe? #2

Open
lyk125 opened this issue Aug 3, 2017 · 7 comments
Open

Comments

@lyk125
Copy link

lyk125 commented Aug 3, 2017

No description provided.

@farmingyard
Copy link
Owner

@lyk125
The original caffe's group conv is very slow, and the depthwise conv is more faster.

@libra7
Copy link

libra7 commented Oct 26, 2017

@farmingyard
i have try the depthwise(cpu i7 -4790hq), group conv 130ms depthwise conv 128ms
gtx1060 group conv 35ms depthwise conv 5ms
Why hasn't the CPU model accelerated
Hope to get your answer

@Laulian
Copy link

Laulian commented Oct 8, 2018

@libra7 just as confused as you are,do you have idea now?

@HolmesShuan
Copy link

Using OpenMP may help. E.g.,

#pragma omp parallel for num_threads(...)

@Laulian @libra7

@Rookielike
Copy link

Using OpenMP may help. E.g.,

#pragma omp parallel for num_threads(...)

@Laulian @libra7

@HolmesShuan can you tell me where did you add ? in conv_dw_layer.cpp?

@HolmesShuan
Copy link

HolmesShuan commented Mar 29, 2019

@Rookielike I have tried several settings:

#pragma omp parallel for num_threads(...)
#pragma omp parallel for

in several places. However, the run-time speed becomes unstable. You may regard it as a failed attempt.
But I find ncnn utilizes this technique to boost the runtime speed of depthwise convolution on ARM CPUs. Here are some useful inference libs you may have an interest.

@AlexofNTHU
Copy link

@libra7
which customized depthconvolution layers did you use to speed up?
Did you use this one?
https://github.com/yonghenglh6/DepthwiseConvolution
Achieving 5ms for mobilenet v1 with GTX1060 sounds really good!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

7 participants