cuda: allow copy to/from shared memory outside of outermost loop
commit6b99e2a972d6ec2ea537ace626380517ee0b9b8e
authorSven Verdoolaege <skimo@kotnet.org>
Mon, 6 Feb 2012 10:15:08 +0000 (6 11:15 +0100)
committerSven Verdoolaege <skimo@kotnet.org>
Mon, 6 Feb 2012 10:15:08 +0000 (6 11:15 +0100)
treee7d9da5f5da37ce856de32cf164a35e83ce5bc84
parent88d1ce7aeda435febcd53eba3fd0296c8eaf98e7
cuda: allow copy to/from shared memory outside of outermost loop

We already look for the innermost loop affecting the position
of the shared/private memory tile of a particular array and then
copy that array to/from shared memory inside that loop, but if
this position is not affected by any loop, we copying would be
performed inside the innermost loop.
Fix this by also allowing the copying to be performed outside
of any loops.

Signed-off-by: Sven Verdoolaege <skimo@kotnet.org>
cuda.c