Thank you for your great work! In the paper, I see that sparsity can reduce memory movement in the shift operation, but in the code the shift operation (i.e., `ssl_cuda_kernel`) always copies or moves all channels, so the sparsity does not actually reduce the memory cost of the shift. I therefore wonder whether the shift implementation in inference mode should differ from the one used in training. If so, would you mind sharing the `ssl_cuda_kernel` implementation for inference mode? Thanks a lot!
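For reference, here is a minimal sketch of the kind of inference-only kernel I have in mind, assuming the learned shifts are rounded to integers after training. This is not the repository's `ssl_cuda_kernel`; the kernel name, the compacted active-channel list, and the launch setup are my own assumptions for illustration. The idea is to precompute the indices of channels whose shift is non-zero, launch threads only for those channels, and leave zero-shift channels untouched (copied once here, but they could simply alias the input buffer), so memory traffic scales with sparsity:

```cuda
// Hypothetical sketch of an inference-mode sparse shift (NOT the repo's
// ssl_cuda_kernel). Only channels with a non-zero shift are touched.
#include <cuda_runtime.h>
#include <cstdio>
#include <vector>

__global__ void shift_inference_kernel(
    const float* __restrict__ in, float* __restrict__ out,
    const int* __restrict__ active,   // indices of channels with shift != 0
    const int* __restrict__ sy,       // integer row shift per active channel
    const int* __restrict__ sx,       // integer column shift per active channel
    int n_active, int H, int W)
{
    int idx = blockIdx.x * blockDim.x + threadIdx.x;
    if (idx >= n_active * H * W) return;

    int w = idx % W;
    int h = (idx / W) % H;
    int a = idx / (H * W);            // position in the active-channel list
    int c = active[a];                // actual channel index

    int hs = h - sy[a], ws = w - sx[a];   // source pixel before the shift
    float v = 0.f;                        // zero padding at the border
    if (hs >= 0 && hs < H && ws >= 0 && ws < W)
        v = in[(c * H + hs) * W + ws];
    out[(c * H + h) * W + w] = v;
}

int main() {
    const int C = 4, H = 3, W = 3;
    // Example per-channel shifts after training: channels 0 and 2 unshifted.
    int sy_all[C] = {0, 1, 0, 0}, sx_all[C] = {0, 0, 0, -1};

    // Host-side compaction: keep only channels that actually move.
    std::vector<int> active, sy, sx;
    for (int c = 0; c < C; ++c)
        if (sy_all[c] || sx_all[c]) {
            active.push_back(c); sy.push_back(sy_all[c]); sx.push_back(sx_all[c]);
        }
    int n_active = (int)active.size();

    std::vector<float> h_in(C * H * W);
    for (int i = 0; i < C * H * W; ++i) h_in[i] = (float)i;

    float *d_in, *d_out;
    int *d_active, *d_sy, *d_sx;
    cudaMalloc(&d_in,  C * H * W * sizeof(float));
    cudaMalloc(&d_out, C * H * W * sizeof(float));
    cudaMalloc(&d_active, n_active * sizeof(int));
    cudaMalloc(&d_sy, n_active * sizeof(int));
    cudaMalloc(&d_sx, n_active * sizeof(int));
    cudaMemcpy(d_in, h_in.data(), C * H * W * sizeof(float), cudaMemcpyHostToDevice);
    cudaMemcpy(d_active, active.data(), n_active * sizeof(int), cudaMemcpyHostToDevice);
    cudaMemcpy(d_sy, sy.data(), n_active * sizeof(int), cudaMemcpyHostToDevice);
    cudaMemcpy(d_sx, sx.data(), n_active * sizeof(int), cudaMemcpyHostToDevice);

    // Copy only the zero-shift channels; in a production path these could
    // alias the input buffer so that no copy happens at all.
    for (int c = 0; c < C; ++c)
        if (!sy_all[c] && !sx_all[c])
            cudaMemcpy(d_out + c * H * W, d_in + c * H * W,
                       H * W * sizeof(float), cudaMemcpyDeviceToDevice);

    int total = n_active * H * W, threads = 256;
    shift_inference_kernel<<<(total + threads - 1) / threads, threads>>>(
        d_in, d_out, d_active, d_sy, d_sx, n_active, H, W);
    cudaDeviceSynchronize();

    std::vector<float> h_out(C * H * W);
    cudaMemcpy(h_out.data(), d_out, C * H * W * sizeof(float), cudaMemcpyDeviceToHost);
    printf("channel 1, row 1: %.0f %.0f %.0f\n",
           h_out[(1 * H + 1) * W + 0], h_out[(1 * H + 1) * W + 1],
           h_out[(1 * H + 1) * W + 2]);

    cudaFree(d_in); cudaFree(d_out);
    cudaFree(d_active); cudaFree(d_sy); cudaFree(d_sx);
    return 0;
}
```

With this compaction, the kernel's reads and writes cover only the `n_active` shifted channels rather than all `C` channels, which is where I understand the memory-movement savings claimed in the paper should come from. Is something along these lines what the inference-mode implementation looks like?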