Volume 21, Issue 3-4 682356 pp. 93-107
Article
Open Access

A Divide and Conquer Strategy for Scaling Weather Simulations with Multiple Regions of Interest

Preeti Malakar

Corresponding Author

Preeti Malakar

Department of Computer Science and Automation Indian Institute of Science Bangalore, India

Search for more papers by this author
Thomas George

Thomas George

IBM India Research Lab New Delhi, India , ibm.com

Search for more papers by this author
Sameer Kumar

Sameer Kumar

IBM T.J. Watson Research Center Yorktown Heights NY, USA , ibm.com

Search for more papers by this author
Rashmi Mittal

Rashmi Mittal

IBM India Research Lab New Delhi, India , ibm.com

Search for more papers by this author
Vijay Natarajan

Vijay Natarajan

Department of Computer Science and Automation Indian Institute of Science Bangalore, India

Supercomputer Education and Research Centre Indian Institute of Science Bangalore, India

Search for more papers by this author
Yogish Sabharwal

Yogish Sabharwal

IBM India Research Lab New Delhi, India , ibm.com

Search for more papers by this author
Vaibhav Saxena

Vaibhav Saxena

IBM India Research Lab New Delhi, India , ibm.com

Search for more papers by this author
Sathish S. Vadhiyar

Sathish S. Vadhiyar

Supercomputer Education and Research Centre Indian Institute of Science Bangalore, India

Search for more papers by this author
First published: 01 January 2013

Abstract

Accurate and timely prediction of weather phenomena, such as hurricanes and flash floods, require high-fidelity compute intensive simulations of multiple finer regions of interest within a coarse simulation domain. Current weather applications execute these nested simulations sequentially using all the available processors, which is sub-optimal due to their sub-linear scalability. In this work, we present a strategy for parallel execution of multiple nested domain simulations based on partitioning the 2-D processor grid into disjoint rectangular regions associated with each domain. We propose a novel combination of performance prediction, processor allocation methods and topology-aware mapping of the regions on torus interconnects. Experiments on IBM Blue Gene systems using WRF show that the proposed strategies result in performance improvement of up to 33% with topology-oblivious mapping and up to additional 7% with topology-aware mapping over the default sequential strategy.

The full text of this article hosted at iucr.org is unavailable due to technical difficulties.