Optimize pop_1st_bit() on 32-bit x86