This project is archived. Its data is read-only.

Commit 01ed593c authored Jan 23, 2021 by Jakub Klinkovský

Merge branch 'JK/MPI' into 'develop'

MPI refactoring

See merge request !82

parents 9e8685af fb3807b5

CMakeLists.txt

+1 −1

Original line number	Diff line number	Diff line
		@@ -210,7 +210,7 @@ if( ${WITH_CUDA} )
		set( CUDA_HOST_COMPILER ${CMAKE_CXX_COMPILER} )
		endif()
		endif()
		set(CUDA_NVCC_FLAGS ${CUDA_NVCC_FLAGS} ; -DHAVE_CUDA --expt-relaxed-constexpr --expt-extended-lambda)
		set(CUDA_NVCC_FLAGS ${CUDA_NVCC_FLAGS} ; -DHAVE_CUDA --expt-relaxed-constexpr --expt-extended-lambda --default-stream per-thread)
		# disable false compiler warnings
		# reference for the -Xcudafe --diag_suppress and --display_error_number flags: https://stackoverflow.com/a/54142937
		# incomplete list of tokens: http://www.ssl.berkeley.edu/~jimm/grizzly_docs/SSL/opt/intel/cc/9.0/lib/locale/en_US/mcpcom.msg

Documentation/Pages/main-page.md

+3 −3

Original line number	Diff line number	Diff line
		@@ -109,9 +109,9 @@ computing platform, and (optionally) some libraries.
		- [CUDA](https://docs.nvidia.com/cuda/index.html) 9.0 or later -- for
		computations on Nvidia GPUs.
		- [MPI](https://en.wikipedia.org/wiki/Message_Passing_Interface) -- TNL can
		use an MPI library such as [OpenMPI](https://www.open-mpi.org/) for
		distributed computing. For distributed CUDA computations, the library must
		be [CUDA-aware](
		a library implementing the MPI-3 standard for distributed computing (e.g.
		[OpenMPI](https://www.open-mpi.org/)). For distributed CUDA computations,
		the library must be [CUDA-aware](
		https://developer.nvidia.com/blog/introduction-cuda-aware-mpi/).

		- __Libraries:__

Documentation/Tutorials/index.md

+7 −8

Original line number	Diff line number	Diff line
		@@ -2,11 +2,10 @@

		## Tutorials

		1. [Building applications with TNL](tutorial_building_applications_with_tnl.html)
		2. [General concepts](tutorial_GeneralConcepts.html)
		3. [Arrays](tutorial_Arrays.html)
		4. [Vectors](tutorial_Vectors.html)
		5. [Flexible parallel reduction and scan](tutorial_ReductionAndScan.html)
		6. [For loops](tutorial_ForLoops.html)
		7. [Cross-device pointers](tutorial_Pointers.html)
		8. [Matrices](tutorial_Matrices.html)
		1. [General concepts](tutorial_GeneralConcepts.html)
		2. [Arrays](tutorial_Arrays.html)
		3. [Vectors](tutorial_Vectors.html)
		4. [Flexible parallel reduction and scan](tutorial_ReductionAndScan.html)
		5. [For loops](tutorial_ForLoops.html)
		6. [Cross-device pointers](tutorial_Pointers.html)
		7. [Matrices](tutorial_Matrices.html)

src/3rdparty/CMakeLists.txt

+6 −0

Original line number	Diff line number	Diff line
		install( DIRECTORY mpark Leksys TYPE INCLUDE
		MESSAGE_NEVER
		FILES_MATCHING PATTERN ".h" PATTERN ".hpp" )

		if( ${WITH_PYTHON} )
		install( DIRECTORY cctbx TYPE INCLUDE
		MESSAGE_NEVER
		FILES_MATCHING PATTERN ".h" PATTERN ".hpp" )
		endif()

src/3rdparty/async/README.md

0 → 100644

+532 −0

File added.

Preview size limit exceeded, changes collapsed.