Computational fluid dynamics (CFD) is an increasingly important application domain for computational scientists. In this paper, we propose and analyze the optimizations needed to run CFD simulations on multibillion-cell mesh models on large processor systems. Our investigation uses Code_Saturne, a general-purpose industrial Navier–Stokes CFD application developed by Electricité de France for incompressible and nearly compressible flows. We outline the main bottlenecks and challenges posed by massively parallel systems and by emerging processor features such as many-core designs, transactional memory, and thread-level speculation. We also present an approach based on an octree search algorithm that facilitates joining mesh parts to build larger, complex unstructured meshes of several billion grid cells. We describe two parallel strategies for an algebraic multigrid solver, and we detail how to introduce new levels of parallelism, based on OpenMP compiler directives, transactional memory, and thread-level speculation, for the cell-centered finite volume formulation and its face-based loops. Finally, we propose a renumbering scheme for mesh faces that enhances thread-level parallelism. Copyright © 2012 John Wiley & Sons, Ltd.
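
To illustrate why face renumbering matters for threaded face-based loops, the following minimal sketch may help. It is not the scheme from the paper, which the summary does not spell out. It is a generic greedy grouping, and the names `group_faces` and `accumulate_fluxes` and the tiny one-dimensional mesh are all hypothetical. In a cell-centered finite volume code, each interior face updates its two adjacent cells, so two threads handling faces that share a cell would write to the same entry. Grouping faces so that no two faces in a group touch the same cell removes that conflict inside each group.

```python
# Hypothetical sketch: greedy grouping of mesh faces so that no two faces
# in the same group touch the same cell. Not the paper's actual scheme.

def group_faces(face_cells):
    """Put each face in the first group whose faces share no cell with it."""
    groups = []      # groups[g] is a list of face indices
    cells_used = []  # cells_used[g] is the set of cells touched by group g
    for f, (c0, c1) in enumerate(face_cells):
        for g, used in enumerate(cells_used):
            if c0 not in used and c1 not in used:
                groups[g].append(f)
                used.update((c0, c1))
                break
        else:
            # No existing group is conflict-free for this face: open a new one.
            groups.append([f])
            cells_used.append({c0, c1})
    return groups


def accumulate_fluxes(face_cells, face_flux, n_cells, groups):
    """Face-based loop: each face adds its flux to one cell, subtracts from the other."""
    balance = [0.0] * n_cells
    for group in groups:   # groups are processed one after another
        for f in group:    # faces in a group write to distinct cells,
            c0, c1 = face_cells[f]  # so this inner loop could be split across threads
            balance[c0] += face_flux[f]
            balance[c1] -= face_flux[f]
    return balance


# Assumed toy mesh: a chain of 5 cells connected by 4 interior faces.
face_cells = [(0, 1), (1, 2), (2, 3), (3, 4)]
face_flux = [1.0, 2.0, 3.0, 4.0]
groups = group_faces(face_cells)
balance = accumulate_fluxes(face_cells, face_flux, 5, groups)
```

Renumbering the faces so that each group occupies a contiguous index range would then let a compiler directive apply to the inner loop without atomic updates. Whether that beats transactional memory or thread-level speculation on a given machine is the kind of question the paper investigates.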

Volume 25, Issue 6

Special Issue: Latest Trends in Computer Architectures and Parallel and Distributed Technologies

25 April 2013

Pages 843-861

Multiple threads and parallel challenges for large simulations to accelerate a general Navier–Stokes CFD code on massively parallel systems
