Add CUDA bonded kernels
CUDA bonded kernels are added for the most common bonded and LJ-14
interactions.
The default auto settings of mdrun offloads these interactions
to the GPU when possible.
Currently these interactions are computed in the local or non-local
nbnxn non-bonded streams. We should consider using a separate stream.
This change uses synchronous transfers. A child change will change
these to asynchronous.
Updated release notes and performance guide.
Fixes #2678
Refs #2675
Change-Id: Ifc6d97854cc7afa8526602942ec3b1712ba45bac
24 files changed: