Volume 12, Issue 8 pp. 727-753
Research Article

Javelin++: scalability issues in global computing

Michael O. Neary

Corresponding Author

Michael O. Neary

Department of Computer Science, University of California at Santa Barbara, Santa Barbara, CA 93106, U.S.A.

Department of Computer Science, University of California at Santa Barbara, Santa Barbara, CA 93106, U.S.A.Search for more papers by this author
Sean P. Brydon

Sean P. Brydon

Department of Computer Science, University of California at Santa Barbara, Santa Barbara, CA 93106, U.S.A.

Search for more papers by this author
Paul Kmiec

Paul Kmiec

Department of Computer Science, University of California at Santa Barbara, Santa Barbara, CA 93106, U.S.A.

Search for more papers by this author
Sami Rollins

Sami Rollins

Department of Computer Science, University of California at Santa Barbara, Santa Barbara, CA 93106, U.S.A.

Search for more papers by this author
Peter Cappello

Peter Cappello

Department of Computer Science, University of California at Santa Barbara, Santa Barbara, CA 93106, U.S.A.

Search for more papers by this author

Abstract

Javelin is a Java-based infrastructure for global computing. This paper presents Javelin++, an extension of Javelin, intended to support a much larger set of computational hosts. Contributions to scalability and fault tolerance are presented. This is the focus of the paper. Two scheduling schemes are presented: probabilistic work stealing and deterministic work stealing. The distributed deterministic work stealing is integrated with a distributed deterministic eager scheduler, which is one of the paper's primary original contributions. An additional fault tolerance mechanism is implemented for replacing hosts that have failed or retreated. A Javelin++ API is sketched, then illustrated on a raytracing application. Performance results for the two schedulers are reported, indicating that Javelin++, with its broker network, scales better than the original Javelin. Copyright © 2000 John Wiley & Sons, Ltd.

The full text of this article hosted at iucr.org is unavailable due to technical difficulties.