-
Notifications
You must be signed in to change notification settings - Fork 6
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add rawtodigi implementation using the Cupla backend #2
Conversation
Using CUDA as the cupla backend, I see the same performance as with native CUDA. Using serial operations as the cupla backend is slower than the naive and CUDA implementations; however it was not optimised, and I am not sure what |
Thanks, looks good. Before merging I'd like to ask for clarification This PR fully includes #1, right? For the the Cupla CPU backend, the input data "transfers" I found it interesting to see that exactly the same code works fine also in the |
Yes.
Yes, and I think this is one main limitation of Cupla: to keep the code unchanged, it has to introduce these extra operations.
That is the other problem I am thinking about: both libraries export the same symbols - |
One option could be to put all those symbols in different a namespace for each backend, and rely on the macro definition to pick the right one. An other option could be to link the cupla backend statically into each plugin. |
Anyway, I am going to try the other backends and check their performance, as well. |
Thanks, merging. |
No description provided.