There is absolutely a cost to creating additional threads. You should have a non-trivial amount of work needed before spinning up a new thread.
I assume this means you have a quad-core CPU. The optimal number of threads actually depends on the workload, if threads are waiting for whatever reason they may be able to context switch to a another thread and you may see a benefit with a number of threads greater than the number of physical cores.