foreach_{8,diag}neighbor(): Simplify+speed up, use persistent shift array