Commit 01ed593c authored by Jakub Klinkovský's avatar Jakub Klinkovský
Browse files

Merge branch 'JK/MPI' into 'develop'

MPI refactoring

See merge request !82
parents 9e8685af fb3807b5
Loading
Loading
Loading
Loading
+1 −1
Original line number Diff line number Diff line
@@ -210,7 +210,7 @@ if( ${WITH_CUDA} )
               set( CUDA_HOST_COMPILER ${CMAKE_CXX_COMPILER} )
            endif()
        endif()
        set(CUDA_NVCC_FLAGS ${CUDA_NVCC_FLAGS} ; -DHAVE_CUDA --expt-relaxed-constexpr --expt-extended-lambda)
        set(CUDA_NVCC_FLAGS ${CUDA_NVCC_FLAGS} ; -DHAVE_CUDA --expt-relaxed-constexpr --expt-extended-lambda --default-stream per-thread)
        # disable false compiler warnings
        #   reference for the -Xcudafe --diag_suppress and --display_error_number flags: https://stackoverflow.com/a/54142937
        #   incomplete list of tokens: http://www.ssl.berkeley.edu/~jimm/grizzly_docs/SSL/opt/intel/cc/9.0/lib/locale/en_US/mcpcom.msg
+3 −3
Original line number Diff line number Diff line
@@ -109,9 +109,9 @@ computing platform, and (optionally) some libraries.
    - [CUDA](https://docs.nvidia.com/cuda/index.html) 9.0 or later -- for
      computations on Nvidia GPUs.
    - [MPI](https://en.wikipedia.org/wiki/Message_Passing_Interface) -- TNL can
      use an MPI library such as [OpenMPI](https://www.open-mpi.org/) for
      distributed computing. For distributed CUDA computations, the library must
      be [CUDA-aware](
      a library implementing the MPI-3 standard for distributed computing (e.g.
      [OpenMPI](https://www.open-mpi.org/)). For distributed CUDA computations,
      the library must be [CUDA-aware](
      https://developer.nvidia.com/blog/introduction-cuda-aware-mpi/).

- __Libraries:__
+7 −8
Original line number Diff line number Diff line
@@ -2,11 +2,10 @@

## Tutorials

1. [Building applications with TNL](tutorial_building_applications_with_tnl.html)
2. [General concepts](tutorial_GeneralConcepts.html)
3. [Arrays](tutorial_Arrays.html)
4. [Vectors](tutorial_Vectors.html)
5. [Flexible parallel reduction and scan](tutorial_ReductionAndScan.html)
6. [For loops](tutorial_ForLoops.html)
7. [Cross-device pointers](tutorial_Pointers.html)
8. [Matrices](tutorial_Matrices.html)
1. [General concepts](tutorial_GeneralConcepts.html)
2. [Arrays](tutorial_Arrays.html)
3. [Vectors](tutorial_Vectors.html)
4. [Flexible parallel reduction and scan](tutorial_ReductionAndScan.html)
5. [For loops](tutorial_ForLoops.html)
6. [Cross-device pointers](tutorial_Pointers.html)
7. [Matrices](tutorial_Matrices.html)
+6 −0
Original line number Diff line number Diff line
install( DIRECTORY mpark Leksys TYPE INCLUDE
         MESSAGE_NEVER
         FILES_MATCHING PATTERN "*.h" PATTERN "*.hpp" )

if( ${WITH_PYTHON} )
   install( DIRECTORY cctbx TYPE INCLUDE
            MESSAGE_NEVER
            FILES_MATCHING PATTERN "*.h" PATTERN "*.hpp" )
endif()
+532 −0

File added.

Preview size limit exceeded, changes collapsed.

Loading