Public Git Hosting - gromacs.git/commit

commit	cb313c7e66b36b25e4c54db9934cad613418ecd1
author	Berk Hess <hess@kth.se>
	Wed, 24 Feb 2016 13:53:08 +0000 (24 14:53 +0100)
committer	Gerrit Code Review <gerrit@gerrit.gromacs.org>
	Sat, 27 Feb 2016 17:48:49 +0000 (27 18:48 +0100)
tree	5690033d17a42ca0757794bb65f235814e2e0b92	tree \| snapshot (tar.gz zip)
parent	34f6027048b863a2909e09b3dd4bfb1a0fe7557b	commit \| diff

Minor code reordering in GPU kernels

Updating bCalcFshift just before use instead at the top of the kernel
improves performance by 1-2% on CUDA. This also improves readability.
Making specialized (no)shift kernels will only add 1% gain.
Also updated the OpenCL kernels for consistency and readability
(the perfromance impact is negligible with current hardware/compiler).

Change-Id: I309f90ad61e5815726d55254e2cd38d5e4e7662d

The ultimate molecular dynamics simulation package

RSS Atom

src/gromacs/mdlib/nbnxn_cuda/nbnxn_cuda_kernel.cuh		diff \| blob \| blame \| history
src/gromacs/mdlib/nbnxn_ocl/nbnxn_ocl_kernel_amd.clh		diff \| blob \| blame \| history
src/gromacs/mdlib/nbnxn_ocl/nbnxn_ocl_kernel_nowarp.clh		diff \| blob \| blame \| history
src/gromacs/mdlib/nbnxn_ocl/nbnxn_ocl_kernel_nvidia.clh		diff \| blob \| blame \| history