Fix and restructure fastmemcpybench. It is now one binary that runs all