Initial all-vs-all kernels. Single precision SSE working.