This project is archived. Its data is read-only. This project is read-only.

Commit b19f7a2e authored Jul 16, 2019 by Tomáš Oberhuber

Creating tutorial on parallel reduction.

parent 0b1c97a2

Documentation/Tutorials/CMakeLists.txt

+2 −1

Original line number	Diff line number	Diff line
		add_subdirectory( Arrays )
		add_subdirectory( Vectors )
		add_subdirectory( Reduction )
		No newline at end of file

Documentation/Tutorials/Reduction/CMakeLists.txt

0 → 100644

+9 −0

Original line number	Diff line number	Diff line
		IF( BUILD_CUDA )
		# CUDA_ADD_EXECUTABLE( ArrayAllocation ArrayAllocation.cu )
		# ADD_CUSTOM_COMMAND( COMMAND ArrayAllocation > ArrayAllocation.out OUTPUT ArrayAllocation.out )
		ENDIF()

		IF( BUILD_CUDA )
		ADD_CUSTOM_TARGET( TutorialsReduction-cuda ALL DEPENDS
		)
		ENDIF()

Documentation/Tutorials/Reduction/tutorial_03_Reduction.md

0 → 100644

+18 −0

Original line number	Diff line number	Diff line
		\page tutorial_01_arrays Flexible (parallel) reduction tutorial

		## Introduction

		This tutorial introduces flexible parallel reduction in TNL. It shows how to easily implement parallel reduction with user defined operations which may run on both CPU and GPU. Parallel reduction is a programming pattern appering very often in different kind of algorithms for example in scalar product, vector norms or mean value evaluation but also in sequences or strings comparison.

		## Table of Contents
		1. [Flexible Parallel Reduction](#flexible_parallel_reduction)
		1. [Sum](#flexible_parallel_reduction_sum)

		## Flexible parallel reduction<a name="flexible_parallel_reduction"></a>

		We will explain the flexible parallel reduction on several examples. We start with the simplest sum of sequence of numbers folowed by more advanced problems like scalar product or vector norms.

		### Sum

		We start with simple problem.