CUDA performance #8
geomat0101
started this conversation in
Show and tell
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
FYI, here's the examples rendered with a CUDA-enabled dlib. Most of these are going from a couple seconds down to < 0.02 seconds. The one with all the bald dudes (25 faces) went from about 20 seconds to < 0.6!
Note that there is noticeable startup cost factored into the first one rendered. We need to do our best to ensure that our face-analyzer stays active and primed so we don't eat that startup cost for every image (or batch of) that we want to process.
Beta Was this translation helpful? Give feedback.
All reactions