Add proof of concept for NVRTC support #1170

Merged

mborland merged 8 commits into develop from nvrtc on Aug 8, 2024
Conversation

@mborland (Member) commented Aug 6, 2024

Development and completed CUDA runs can be found here: cppalliance/cuda-math#8

This one was requested by @izaid for CuPy usage. Since NVRTC has extremely strict rules, it seems beneficial to me to alias libcu++ wherever we can for this level of support. We're going to have to alias everything in std:: to point to either cuda::std:: or thrust::, or roll our own implementations for things that don't exist, such as the already completed implementation of <limits>. I'm also not quite sure how to pass relative paths to nvrtcCompileProgram, so for now I have everything as hard-coded full paths. Because this is runtime compilation, compilation alone tells us less here than it does with NVCC, but everything is still run first through the cuda-math repo.
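For context, a minimal sketch of the NVRTC flow in question (the paths and kernel below are placeholders, not the ones used in this PR): include search paths go in as -I options to nvrtcCompileProgram, and a relative -I path resolves against the process's working directory at runtime, which is why full paths are the safe default.

```cpp
#include <nvrtc.h>
#include <cstdio>

// Minimal NVRTC sketch: the kernel source is compiled at runtime, so any
// Boost.Math headers it includes must be findable through -I options.
int main()
{
    const char* src =
        "#include <boost/math/special_functions/gamma.hpp>\n"
        "extern \"C\" __global__ void k(double* out)"
        "{ out[0] = boost::math::tgamma(5.0); }\n";

    nvrtcProgram prog;
    nvrtcCreateProgram(&prog, src, "test.cu", 0, nullptr, nullptr);

    // NVRTC has no notion of a source directory, so a relative -I path
    // resolves against the CWD at runtime; the path below is a placeholder.
    const char* opts[] = { "-I/full/path/to/boost-root", "--std=c++14" };
    nvrtcResult res = nvrtcCompileProgram(prog, 2, opts);

    // On failure the compile log (nvrtcGetProgramLog) holds the errors;
    // on success nvrtcGetPTX yields PTX to load with the CUDA driver API.
    std::printf("compile %s\n", res == NVRTC_SUCCESS ? "succeeded" : "failed");
    nvrtcDestroyProgram(&prog);
}
```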

Any thoughts, @jzmaddock, @ckormanyos, @NAThompson?

@izaid commented Aug 6, 2024

Thanks for setting this up @mborland! And ccing @steppi here.

Yes, NVRTC is quite strict, but unfortunately it's used in various places so we would need to support it. We went through this as well, and our solution indeed was just to alias the relevant parts of std. It definitely wasn't a complete implementation, but we did copy-and-paste a lot of <limits>, etc.
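For illustration, that kind of copy-and-paste shim looks roughly like this (a hypothetical sketch with a made-up gpu_compat namespace, not SciPy's actual code):

```cpp
// Hypothetical sketch of a pared-down <limits> shim of the kind described
// above; not SciPy's (or Boost's) actual code. Only the members the math
// code actually calls get implemented.
namespace gpu_compat {

template <class T> struct numeric_limits;

template <> struct numeric_limits<double>
{
    static constexpr bool is_specialized = true;
    // Parenthesized (min)/(max) dodge any min/max macros.
    __host__ __device__ static constexpr double epsilon() { return 2.2204460492503131e-16; }
    __host__ __device__ static constexpr double (min)()   { return 2.2250738585072014e-308; }
    __host__ __device__ static constexpr double (max)()   { return 1.7976931348623157e+308; }
};

} // namespace gpu_compat
```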

@mborland (Member, Author) commented Aug 6, 2024

@izaid, @steppi, @dschmitz89 Do any of you want or need ROCm support while we're going back through this anyway?

edit: To me it doesn't seem especially valuable, since translations from SYCL or CUDA code are available. It looks like additional maintenance without much benefit.

@izaid commented Aug 6, 2024

I don't think we need ROCm support; CUDA is the big win.

codecov bot commented Aug 6, 2024

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 94.08%. Comparing base (ab09ece) to head (135208b).

@@             Coverage Diff             @@
##           develop    #1170      +/-   ##
===========================================
+ Coverage    94.06%   94.08%   +0.01%     
===========================================
  Files          780      780              
  Lines        65797    65797              
===========================================
+ Hits         61892    61904      +12     
+ Misses        3905     3893      -12     
Files                                            Coverage Δ
include/boost/math/special_functions/gamma.hpp   91.75% <ø> (ø)

... and 1 file with indirect coverage changes


Legend: Δ = absolute <relative> (impact), ø = not affected, ? = missing data

@jzmaddock (Collaborator) commented

We're going to have to alias everything in std:: to point to either cuda::std:: or thrust::

Since our calls are all unqualified, can we not just change BOOST_MATH_STD_USING to point to the appropriate namespace?
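For readers outside the project: BOOST_MATH_STD_USING expands to a block of using-declarations so that subsequent unqualified calls resolve correctly. The suggestion amounts to something like the following simplified sketch (the real macro in boost/math/tools/config.hpp imports many more names, and device code would also need __device__ markup, omitted here):

```cpp
// Simplified sketch of the suggestion; the real BOOST_MATH_STD_USING in
// boost/math/tools/config.hpp imports many more names than shown here.
#ifdef __CUDACC_RTC__
  // Under NVRTC the C math functions already live in the global
  // namespace, so the macro can simply expand to nothing.
  #define BOOST_MATH_STD_USING
#else
  #include <cmath>
  #define BOOST_MATH_STD_USING \
     using std::abs; using std::fabs; using std::sqrt; \
     using std::exp; using std::log;  using std::pow;
#endif

template <class T>
T hypot_naive(T x, T y)
{
   BOOST_MATH_STD_USING  // unqualified 'sqrt' now resolves in either mode
   return sqrt(x * x + y * y);
}
```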

@steppi commented Aug 7, 2024

Since our calls are all unqualified, can we not just change BOOST_MATH_STD_USING to point to the appropriate namespace?

@izaid and I wanted to do the equivalent of this in SciPy. The hitch was that to get things working in CuPy (which uses NVRTC) we needed a patchwork of different namespaces: thrust:: for complex arithmetic, mostly cuda::std::, sometimes just :: from cuda_runtime.h, and I think we even had to copy and paste some things from the standard library and add the CUDA markup.
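Concretely, that patchwork looks something like this hypothetical sketch (illustrative only, not SciPy's code):

```cpp
// Hypothetical sketch of the namespace patchwork described above;
// illustrative only, not SciPy's actual code.
#include <cuda/std/type_traits>   // utilities from libcu++
#include <thrust/complex.h>       // complex arithmetic from Thrust

template <class T>
__host__ __device__ T abs_plus_eps(thrust::complex<T> z)
{
    static_assert(cuda::std::is_floating_point<T>::value, "real type required");
    // 'sqrt' from cuda_runtime.h lives in the global namespace on device:
    return sqrt(z.real() * z.real() + z.imag() * z.imag()) + T(1e-6);
}
```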

@mborland (Member, Author) commented Aug 7, 2024

We're going to have to alias everything in std:: to point to either cuda::std:: or thrust::

Since our calls are all unqualified, can we not just change BOOST_MATH_STD_USING to point to the appropriate namespace?

The equivalent of BOOST_MATH_STD_USING is easy, since all the mathematical functions require no special headers and are all in the global namespace. The things we will need from thrust:: are containers, and we already have aliases for some of them, like tuple: https://github.com/boostorg/math/blob/develop/include/boost/math/tools/tuple.hpp#L14. Utilities such as a version of <type_traits> are in cuda::std:: (see https://nvidia.github.io/cccl/libcudacxx/standard_api/utility_library/type_traits.html). My thought is to create a bunch of alias headers in boost/math/tools, like boost::math::tuple, so that if you call, say, boost::math::is_same_v, it resolves to either std::is_same_v or cuda::std::is_same_v depending on context. These could all later be extracted and put in a central place. @jzmaddock, what do you think about landing all these kinds of context-aware aliases in Config? I thought it might be out of scope, but wanted your thoughts. I'd rather not propose a Boost.GPU_Compat library.
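The proposed alias headers could look roughly like this (a sketch of the idea with a hypothetical boost/math/tools/type_traits.hpp; not code from this PR):

```cpp
// Hypothetical boost/math/tools/type_traits.hpp along the lines proposed
// above; not code from this PR.
#ifdef __CUDACC_RTC__

#include <cuda/std/type_traits>

namespace boost { namespace math {
template <class T, class U>
inline constexpr bool is_same_v = cuda::std::is_same_v<T, U>;
}} // namespace boost::math

#else

#include <type_traits>

namespace boost { namespace math {
template <class T, class U>
inline constexpr bool is_same_v = std::is_same_v<T, U>;
}} // namespace boost::math

#endif

// Callers write boost::math::is_same_v<T, double> and get the right
// implementation for host or NVRTC compilation automatically.
```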

@jzmaddock (Collaborator) commented

My thought is to create a bunch of alias headers in boost/math/tools, like boost::math::tuple, so that if you call, say, boost::math::is_same_v, it resolves to either std::is_same_v or cuda::std::is_same_v depending on context. These could all later be extracted and put in a central place. @jzmaddock, what do you think about landing all these kinds of context-aware aliases in Config? I thought it might be out of scope, but wanted your thoughts. I'd rather not propose a Boost.GPU_Compat library.

It's not unreasonable to push this to Config: I'm sure other libraries could make use of this stuff too... but do we want to support this in standalone mode? It might be worth at least raising this on the mailing list?

@mborland (Member, Author) commented Aug 7, 2024

It's not unreasonable to push this to Config: I'm sure other libraries could make use of this stuff too... but do we want to support this in standalone mode? It might be worth at least raising this on the mailing list?

I would keep the original copies here in math so we can continue supporting standalone mode, because that is what the Python packages depend on. I'll post on the ML.

@mborland (Member, Author) commented Aug 7, 2024

Looks like the ML is in favor of everything going in its own library. Pending any other comments, I think this is a good limited proof that we can make it work. I'll keep building out the basic functions not provided by CUDA (e.g. fpclassify) and see if we can get a simple distribution working.
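For a sense of what building out those basic functions involves, here is an illustrative sketch of an NVRTC-friendly fpclassify (hypothetical result codes and logic; not the implementation that later landed):

```cpp
// Illustrative sketch only, not the implementation that later landed in
// Boost.Math. NVRTC exposes isnan/isinf in the global namespace, so a
// device-friendly fpclassify can be built on top of them.
#define BOOST_MATH_FP_NAN       0   // hypothetical result codes; real code
#define BOOST_MATH_FP_INFINITE  1   // would mirror FP_NAN, FP_INFINITE, ...
#define BOOST_MATH_FP_ZERO      2
#define BOOST_MATH_FP_SUBNORMAL 3
#define BOOST_MATH_FP_NORMAL    4

__host__ __device__ inline int fpclassify(double t)
{
    if (isnan(t)) return BOOST_MATH_FP_NAN;
    if (isinf(t)) return BOOST_MATH_FP_INFINITE;
    const double a = t < 0 ? -t : t;
    if (a == 0) return BOOST_MATH_FP_ZERO;
    if (a < 2.2250738585072014e-308) return BOOST_MATH_FP_SUBNORMAL;  // below DBL_MIN
    return BOOST_MATH_FP_NORMAL;
}
```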

@mborland merged commit 2a4351e into develop Aug 8, 2024
79 checks passed
@mborland deleted the nvrtc branch August 8, 2024