I am running a CFD simulation with a 200,000-vertex mesh. I\'ve decomposed the mesh into 2 load-balanced sub-domains to test my parallel implementation. In the specific function