TCP retransmission timer overrides/kills TCP keepalive timer, delaying disconnect discovery
Question

Machine: Linux, 3.10.19 kernel.

This is a large distributed system with several servers and clients (on the same node as well as on different nodes/machines) that hold TCP connections with each other.

Test case: The client program's node/machine is deliberately switched off, so the only way for the server to learn of the disconnection is the keepalive timer (idle time = 40 sec, 4 probes, probe interval = 10 sec).

Good case: This works fine in most cases; the server gets to know that the