问题
After reading the famous C10k article and searching on the web about how things have evolved since it was written, I would like to know if it would be possible for a today's standard server to handle >10000 concurrent connections using a thread per connection (possibly with the help of a pool of threads to avoid the creation/killing process).
Some details that may affect the approach to the problem:
- Input, intermediate processing and output.
- Length of each connection.
- Technical specifications of the server (cores, processors, RAM, etc...)
- Combining this system with alternative techniques like AIO, polling, green threads, etc...
Obviously I'm not an expert in the matter, so any remarks or advices will be highly appreciated :)
回答1:
Absolutely. A standard server can handle more than 10K concurrent connections using the model with one thread per connection. I have build such an application, and five years ago, it was running with more than 50K concurrent connections per process on a standard Linux server. Nowadays, it should be possible to run the same application with more than 250K concurrent connections on current hardware.
There are only a few things to keep in mind:
- Reuse threads by using a thread pool. There is no need to kill threads if they are not used, because the resource usage should be optimized for peak loads.
- Stack size: By default each Linux thread reserves 8 MB for its stack. That sums up to 80 GB for 10K threads. You should set the default stack size to some value between 64k and 512k, which isn't a problem, because most applications don't require deeper call stacks.
- If the connections are short-lived, optimize for new connections by creating several sockets on the same endpoint with the option
SO_REUSEPORT
. - Increase the user limits:
open files
(default 1.024),max user processes
- Increase system limits, e.g.
/proc/sys/kernel/pid_max
(default 32K),/proc/sys/kernel/threads-max
, and/proc/sys/vm/max_map_count
(default 65K).
The application mentioned above was initially designed to handle only 2K concurrent connections. However, with the growth in use, we didn't have to make significant changes to the code in order to scale up to 50K connections.
回答2:
You might like a recent follow-up on the subject: The Secret To 10 Million Concurrent Connections -The Kernel Is The Problem, Not The Solution.
回答3:
The usual approaches for servers are either: (a) thread per connection (often with a thread pool), or (b) single threaded with asynchronous IO (often with epoll or kqueue). My thinking is that some elements of these approaches can, and often should, be combined to use asynchronous IO (with epoll or kqueue) and then hand off the connection request to a thread pool to process. This approach would combine the efficient dispatch of asynchronous IO with the parallelism provided by the thread pool.
I have written such a server for fun (in C++) that uses epoll on Linux and kqueue on FreeBSD and OSX along with a thread pool. I just need to run it through its paces for heavy testing, do some code cleanup, and then toss it out on github (hopefully soon).
来源:https://stackoverflow.com/questions/17593699/tcp-ip-solving-the-c10k-with-the-thread-per-client-approach