Why does the GK110 have 192 cores, and 4 warps?
I wanted to get a feel for Kepler's architecture, but it doesn't make sense to me. If a warp is 32 threads, and 4 of them get scheduled/executed, that would mean 128 cores are in use and 64 are left idle. In the whitepaper it said something about independent instructions, so are the 64 cores reserved for those instructions? If so, can someone give me an example of when an independent instruction would be needed? Each SM in Kepler has 192 (SP) cores, and 4 warp schedulers. Each warp scheduler is capable of dual-issue which means that it can actually issue 2 instructions from a given threadblock