Add per-thread cache to malloc