At a high level, I understand that when you create a per-CPU variable, each processor on the system gets its own copy of that variable. Some implementation details are here. I st