gpu: add synchronization only after core-computation writes that require it