I think I found my solution by rewriting my code to shift before movemask epi8 however it didn\'t look like I could shift a 128/256 value by 1 bit. Is that true? searching s