Volume 17, Issue 3 pp. 197-240
ARTICLE
Full Access

A social interaction model with an extreme order statistic

Ji Tao

Ji Tao

School of Economics, Shanghai University of Finance and Economics, 777 Guoding Road, Shanghai, 200433 China

Search for more papers by this author
Lung-fei Lee

Lung-fei Lee

Department of Economics, Ohio State University, 410 Arps Hall, 1945 N. High St., Columbus, OH, 43210 USA

Search for more papers by this author
First published: 24 April 2014
Citations: 8

Summary

In this paper, we introduce a social interaction econometric model with an extreme order statistic to model peer effects. We show that the model is a well-defined system of equations and that it is a static game with complete information. The social interaction model can include exogenous regressors and group effects. Instrumental variables estimators are proposed for the general model that includes exogenous regressors. We also consider distribution-free methods that use recurrence relations to generate moment conditions for estimation. For a model without exogenous regressors, the maximum likelihood approach is computationally feasible.

1. INTRODUCTION

There is a growing body of literature in which the influence of social peers in economics and other social sciences is addressed; see Durlauf (2004) for a recent survey. Conventional social interaction models, including the spatial autoregressive model and the linear-in-means model, assume that an individual's outcome depends linearly on the mean of the outcomes of the individual's peers; see, e.g. Anselin (1988), Manski (1993), Moffitt (2001), Lee (2007) and Bramoulle et al. (2009). For instance, a student's test score would be affected linearly by the mean test score of their classmates. In certain situations, we believe that a better specified model might involve the full distribution or other distributional characteristics, such as order statistics, rather than the mean (e.g. Ioannides and Soetevent, 2007).

In this paper, we introduce a social interaction econometric model with an extreme order statistic to allow non-linearity in modelling endogenous peer effects. Let urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0001 be a finite set of players in a social network (group) and let urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0002 be the set of players, except the ith player, who are i's neighbours. For example, a simple social interaction model with the maximum statistic for the ith player is specified as urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0003, where urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0004 denotes urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0005 and urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0006 is a random element. The social interaction model with the minimum statistic can be defined in a similar way.

Empirical studies have found evidence regarding peer effects on educational outcomes; see, e.g. Hanushek (1992), Hanushek et al. (2003) and Hoxby (2000). It is plausible that peers affect educational outcomes by aiding learning through questions and answers or by hindering learning through disruptive behaviour. Lazear (2001) has suggested that disruptive students could interfere with the educational outcomes of their peers. For instance, a student who has bad grades in class might hinder learning by asking poor questions. This disruptive behaviour might have large negative effects on the academic achievements of this student's peers, and thus might generate strong endogenous peer effects. Figlio (2007) has found empirical evidence of the effects of disruptive students on the performance of their peers, which indicates that students suffer academically from the presence of classroom disruption. Thus, our social interaction model with an extreme order statistic might be useful to explore such ‘bad apple’ effects in some empirical studies.

In many real-world situations, rewards are based on relative performance rather than absolute performance, and competition in terms of extreme outcomes may occur. Holmstrom (1982) has suggested that a relative performance scheme provides a better evaluation than an absolute performance scheme in the presence of costly monitoring of the agent's effort. Lazear and Rosen (1981) have suggested that there is a convex relation between pay and performance level in tournament schemes. Frank and Cook (1995) have attributed much of this phenomenon to winner-take-all markets, which compete fiercely for the best. Examples of such schemes include companies offering bonuses to the ‘salesperson of the year’, universities rewarding researchers for writing the ‘best paper’ and athletes being rewarded for having the best Olympic performances. Main et al. (1993) have conducted a detailed survey of executive compensations and have found some support for a relative performance scheme. Ehrenberg and Bognanno (1990a,b) have found that the performances of professional golf players are influenced by the level and structure of prizes in major golf tournaments.

In contrast to the situation where individuals are competitors in the labor market, individuals can benefit from their peers in a social group. The benefit might take the form of the sharing of know-how between individuals. This is an efficient way for individuals to obtain valuable experiences from their peers and especially from those who are most capable. Therefore, individuals might react to the performance of their best peers in a strategic model of social network.

The paper is organized as follows. In Section 2., we provide some theoretical justifications for the social interaction model with the minimum or maximum statistic on economic grounds. In Section 3., we introduce a general model specification with an extreme order statistic. We show that the social interaction model with an extreme order statistic is a well-defined system of equations. The solution to the system exists and is unique. Our model has the essential features of a static game with complete information for a finite number of players. In particular, this game is asymmetric and has a unique pure-strategy Nash equilibrium. In Section 4., we provide a simple instrumental variables (IV) approach for a general model where exogenous regressors are relevant. In Section 5., we consider the generalized method of moments (GMM), which uses IV moments and recurrence relations for moments of order statistics. Our asymptotic analysis of estimation methods (IV and GMM) is based on a sample consisting of a large number of groups where group sizes remain fixed and bounded (as the number of groups tends to infinity). In Section 6., we give a brief introduction of an extended social interaction model with mixed mean and maximum. In Section 7., we provide some Monte Carlo results and in Section 8., we provide an empirical example. To illustrate the practical use of this model, we apply it to the National Collegiate Athletic Association (NCAA) men's basketball data and we obtain some interesting results. In Section 9., we draw our conclusions. In Appendix A, we list some useful notations and some moments of ordered normal random variables. In Appendix B, we give detailed proofs of the theorems provided in Sections 3., 4. and 6.. In Appendix C, we provide a brief description of the maximum likelihood estimation (MLE) of simple model specifications and some Monte Carlo results. In Appendix D, we give a useful lemma and some technical details about the GMM estimation with recurrence moments discussed in Section 5..

2. THEORETICAL ECONOMIC CONSIDERATIONS

Our model specification can be derived from a simultaneous move game with perfect information. The sample can be viewed as repetitions of a game played among individuals with possibly varying finite numbers of players. We give three theoretical examples to illustrate the model specification within the game-theoretic framework. These theoretical models are for illustrative purposes and are not exclusive for possible economic applications of the proposed econometric model.

2.1. A synergistic relationship

A game has m players involved in a synergistic relationship (Osborne, 2004). It is a static game with complete information, in which each player knows the pay-offs and strategies available to other players. Each player's set of actions is the set of performance levels. Denote by urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0007 a performance profile chosen by players and by urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0008, which is a subvector of p with dimension urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0009, the performance profile of all players except the ith player in the game. Player i's preference is represented by the pay-off function urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0010, which depends on p in the following way. First, urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0011 and urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0012 capture the concave relationship between pay-offs and performance levels. Secondly, urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0013, urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0014, captures the synergistic relationship between player i and player i's peer j.

For analytical simplicity, assume that each player is endowed with one unit of social capital (i.e. urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0015 for all i). The social capital can only be used to establish social connections between players. If player i spends an amount urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0016 on j, then player i's pay-off can increase proportionally by urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0017. Intuitively, the player will spend all their social capital in establishing connections with their peers such that urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0018. Formally, we have the pay-off function
urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0019(2.1)
with urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0020. Note that urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0021 represents the cost of performance, which is similar to the moral cost in the crime network study of Liu et al. (2011). It should be stressed that urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0022, which is a function of characteristics of the individual i, varies across individuals. In other words, 2.1 depends on which player chooses the action. So, the game is asymmetric. To maximize the pay-off, the player will spend all their capital on their best peer who has the highest performance level in the synergistic relationship. So, we have that
urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0023(2.2)
Suppose that each player chooses the optimal performance level urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0024 to maximize (2.2), which gives the reaction function of player i as
urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0025(2.3)
where the coefficient ϕ is non-negative. For empirical estimation, we expect that urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0026, the performance profile, is observable, and the index urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0027 will be specified as urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0028, where urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0029 is a vector of observed characteristics of individual i and urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0030 represents the characteristics that are unobservable by econometricians. Here, ϕ and β are unknown parameters for estimation.

As an alternative specification, it is possible to justify the game in terms of choosing effort instead of performance level directly, as long as the response of an individual to their peers is on their maximum performance level. Suppose that an individual chooses their effort to respond to the best performance of their peers. Then, we have urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0031. Suppose that the performance of individual i is the sum of their effort and unobserved disturbance urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0032 (i.e. urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0033). Then, the model with the measurement relation becomes urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0034, which is similar to 2.3, where urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0035 is the overall disturbance.

2.2. A tournament scheme

In a tournament game, we assume that a player is rewarded based on relative performance. We assume that the gross pay-off urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0036 depends on p in the following way. First, urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0037 and urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0038 capture the convex relationship between pay and performance. Secondly, urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0039 reflects the competition among players (Calvo-Armengol and Zenou, 2004). Formally, we have the gross pay-off function in a tournament game
urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0040(2.4)
where urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0041 varies across individuals. With urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0042, player i will be rewarded for winning the tournament, and penalized for losing. Suppose that player i's cost is a convex function of performance with urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0043 and urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0044. Formally,
urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0045(2.5)
Combining all the above information, the net pay-off can be written as
urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0046(2.6)
Each player will do their work urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0047 in order to maximize the net pay-off according to the first-order condition
urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0048(2.7)
The second-order condition requires urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0049, which is sufficient for a maximum at urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0050. It follows that the reaction function is
urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0051(2.8)
where the coefficient of urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0052 is negative. For estimation, urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0053 and urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0054 would be functions of the observable and unobserved characteristics of individual i by econometricians. Here, ρ and ϱ would be unknown parameters. Because urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0055 and urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0056 would also involve unknown coefficients of individual characteristics, only the composite parameter urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0057 can be identifiable in this model if only urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0058 are observed but not the costs urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0059.

2.3. A game that combines the tournament scheme and the synergistic relationship

In the above example, the cost function (2.5) can be written as
urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0060(2.9)
which captures the benefits of social interactions between connected players (Calvo-Armengol and Zenou, 2004). To achieve the most benefit, a player will allocate all their social capital to the best peer in order to minimize the cost. So, we have that
urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0061(2.10)
We obtain the net pay-off function
urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0062(2.11)
which gives the best response function
urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0063(2.12)

Note that the coefficients of urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0064 in (2.3) and (2.8) are positive and negative, respectively. We can interpret the former as the social effect and the latter as the competition effect. The net pay-off function (2.11) combines the tournament scheme and the synergistic relation into a single game. In this situation, the coefficient of urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0065 in (2.12 ) would represent the difference between the social effect and the competition effect, so that it can take either a negative or a positive value. Its sign indicates the domination of one effect over the other. They cannot be separately identified from (2.12) alone. These effects would be separately identified if the cost or pay-off of a player could be observed. In the next section, we shall see that the best response function (2.3), (2.8) or (2.12) has a unique Nash equilibrium under some general conditions.

3. THE SOCIAL INTERACTION MODEL WITH MAXIMUM AND REGRESSORS

The various reaction functions on the performance of a player with respect to the maximum performance of their peers in Section 2. are just examples. In order to provide a general model and its estimation, we consider a model specification with the outcome of an individual being urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0066 and the outcome profile of the individual's peers being urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0067. The individual's observed characteristics will be represented by a vector of regressors urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0068 and unobserved characteristics involved will be summarized as urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0069. So, in general, we consider the social interaction model with maximum and regressors:
urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0070(3.1)

Here, m is the total number of players in a game, urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0071 is a vector of observed individual characteristics and urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0072 is an individual disturbance, which is known to the agent but unobservable to econometricians. Thus, the reaction functions 2.3, 2.8 and 2.12 can all be included in this framework. For example, λ will be urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0073 for 2.12, and urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0074 will become urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0075 in this general formulation. Because urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0076 and possibly urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0077 vary across individuals, 3.1 can be interpreted as the reaction function for an asymmetric game with perfect information. With given individual characteristics urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0078 and urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0079, 3.1 can have a unique Nash equilibrium when urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0080. The following theorem summarizes the solution of 3.1 as the unique Nash equilibrium.

Theorem 3.1.Denote urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0081 for simplicity. The system 3.1 with urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0082 but urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0083 is equivalent to the following system with order statistics:

urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0084(3.2)

Here, urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0085 and urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0086 are the corresponding order statistics of urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0087 and urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0088 in ascending order. The system 3.2 has the unique solution:

urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0089(3.3)

Analogously, for the social interaction model with minimum, urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0090, the solution exists and is unique as urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0091 and urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0092 for urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0093 given urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0094 but urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0095. We note that the existence and uniqueness of the solution are valid without any distributional assumption on urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0096.

We see that, from the arguments of the proof for Theorem 3.1, as long as urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0097 but urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0098, the order statistics urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0099 have the same ascending order as urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0100. In this case, the solution (Nash equilibrium) of the system exists and is unique. The restriction urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0101 provides the stability of the system. However, the system is unstable or divergent when urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0102.

It is apparent that the system has many possible solutions when λ is 1 or −1. This is also the case with urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0103. For any urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0104, consider the quantities defined by urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0105, and urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0106 with urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0107 but urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0108. The quantities of these elements have the order urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0109. They also imply urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0110 and urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0111. When urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0112, urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0113; hence, these urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0114 for urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0115 satisfy the structure urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0116 and form a solution set. For urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0117, because i can be any individual of urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0118, there are multiple solutions for the system.

For certain estimation methods, such as the method of maximum likelihood (ML), a unique solution of the system is essential, because it requires a well-defined mapping of the disturbance vector urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0119 to the sample vector urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0120 conditional on observed exogenous variables in order to set up the likelihood function. However, for certain estimation methods, such as IV or the two-stage least-squares (2SLS) estimation, even though the system might have multiple solutions, asymptotic analysis of such estimation methods could still be valid as long as proper instrumental variables are available for the endogenous explanatory variables of the structural equation and the rank condition is satisfied, under the scenario that the number of groups R tends to infinity but the group sizes remain finite and bounded.

The social interaction model with an extreme order statistic can be compared with the group interaction model based on the average outcome of peers,
urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0121(3.4)
with urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0122. This group interaction model allows individuals to interact with each other within a group, and peer effects are reciprocal in the model. The social interaction model in 3.2 implicitly assumes that every player except the best player depends on the first-order statistic (i.e. the group maximum). There are no reciprocal effects between players except for the first- and second-order statistics in 3.2. The conventional group interaction model is a special case of spatial autoregressive models, which assume exogenous interaction structures. In contrast, the interaction structure of the model with an extreme order statistic is endogenously determined. It resembles an extension of a star network but the star is endogenously determined with the adjacency matrix of the form
urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0123

4. ESTIMATION

Suppose that the economy has a well-defined group structure. Each player belongs to a social group. The players can interact with each other within a group but not with members in another group. The sample consists of R groups with urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0124 players in the rth group. The total sample size is urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0125. For empirical relevance, we allow variable group sizes in the model but group sizes remain finite and bounded as the number of groups increases.

Assumption 4.1.Assume that urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0126 for all r, where the lower bound urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0127 and the upper bound mU is finite.

For the estimation, we make the distinction whether exogenous regressors are present or not in the model. Because the model with exogenous regressors might be more relevant for practical use, we focus on the estimation of this general case in the main text, whereas the estimation of the model without regressors is given in Appendix C.

4.1 The model with exogenous regressors

The model 3.1 can include observed group factors urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0128 as well as individual characteristics urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0129:
urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0130(4.1)

Here, urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0131 and urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0132 denote urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0133 and urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0134 vectors of exogenous variables. If urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0135 and urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0136 are relevant regressors, the IV approach is possible and is distribution-free.

Assumption 4.2.urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0137 are i.i.d. random variables with zero mean and a finite variance urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0138, and are independent of urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0139 and urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0140.

With urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0141, in terms of order statistics in Theorem 3.1, the system 4.1 is equivalent to
urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0142(4.2)
where urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0143 and urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0144 are the transformed variables of x and ε, respectively, under the permutation of y into an ascending order. The urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0145 and urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0146 are not necessarily in ascending order, even though urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0147.
With urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0148 and urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0149 being constants, urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0150 are independent but not identically distributed, so the likelihood function for urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0151 can be complicated. Under normality, the density of urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0152 conditional on urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0153 and urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0154 is urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0155, where urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0156 is the standard normal density function. Conditional on urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0157 and urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0158, the joint density of the order statistics urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0159 is
urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0160(4.3)
where the summation S extends over all permutations urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0161 of urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0162 (David, 1981, p. 22). The density function for urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0163 is
urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0164(4.4)

Because the possible number of permutations is large in general, the ML approach will be computationally demanding unless the number of players in a game is very small. Furthermore, a parametric ML approach will not be distribution-free. For tractable estimation, we consider an IV approach.

Because urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0165 and urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0166 are exogenous, important moment restrictions of 4.1 are urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0167, urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0168 and urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0169 for all urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0170. The number of such IV moments is urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0171. In order to distinguish possible parameter values from those of the true values that generate the sample observations, a subscript 0 on parameters is added to indicate their true values. Because urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0172 and urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0173, the model via (4.2) implies that
urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0174(4.5)
with urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0175. Because urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0176,
urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0177(4.6)
Similarly,
urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0178(4.7)
and
urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0179(4.8)
Similar to urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0180, urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0181 can also have
urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0182(4.9)
Denote urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0183 and urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0184 for urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0185 or 3. Then, urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0186 urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0187, which has zero mean by mutual independence of x and ε. It follows that urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0188 provides a set of valid instruments,
urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0189(4.10)
as long as urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0190 does not collapse into any of its lower-order power variables for urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0191.
Denote urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0192 and urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0193 Writing 4.1 in vector form, we have
urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0194(4.11)
where urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0195, urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0196, urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0197 and urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0198 are the corresponding vectors and matrices in the rth group, urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0199 is the urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0200-dimensional vector consisting of ones in its entries, urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0201 collects all these explanatory variables in a single matrix and δ1 is the complete vector of coefficients. Let urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0202 be a generic IV matrix for urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0203, which satisfies the following assumption.

Assumption 4.3.The probability limits of urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0204 and urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0205 exist and are non-singular. Furthermore, urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0206 and

urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0207

Assumption 4.3 requires that urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0208 has full rank asymptotically, which is a sufficient condition for the identification of δ1. The IV estimator with urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0209 for 4.11 is
urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0210(4.12)

Theorem 4.1.Under Assumptions 4.14.3, the IV estimator urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0211 with urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0212 is consistent and

urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0213

Let urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0214 denote a matrix consisting of urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0215, urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0216, urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0217, urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0218 and some powers of urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0219. The 2SLS estimator with urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0220 is
urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0221(4.13)
with the asymptotic variance urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0222. For the estimation of σ2, the moment urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0223 implies that a variance estimator can be
urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0224(4.14)

4.2. The model with exogenous regressors and group effects

The model 4.1 can be extended to include an unobservable group factor urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0225
urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0226(4.15)
which is equivalent to
urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0227(4.16)
and
urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0228
For generality, we allow urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0229 to correlate with urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0230 or urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0231. Hence, it is desirable to treat urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0232 as fixed effects. However, there is a need to eliminate the incidental parameter's problem as R tends to infinity, for which purpose, we consider the deviation from a group mean operation. Denote urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0233, urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0234 and urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0235, where urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0236, urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0237 and urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0238 are the corresponding group means. Pre-multiplying 4.16 by the transformation matrix urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0239, individual observations are measured as deviations from group means
urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0240(4.17)
and
urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0241
with urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0242.
Because urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0243 are exogenous, important moments of 4.15 are urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0244 for all urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0245. By multiplying each equation in 4.17 by the corresponding urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0246, urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0247, and taking their summation, the model via 4.17 implies that
urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0248
because urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0249. Hence,
urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0250(4.18)
Note that any constant regressors within the group will not be useful for the within-group equation because the summation over all units of 4.17 in a group is identically zero. For IV estimation, additional instruments are needed for urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0251 on the right-hand side of 4.17. Denote urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0252 for urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0253 or 3. Because urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0254, which has zero mean,
urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0255(4.19)
Denote urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0256. The within-group equation of 4.15 is
urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0257(4.20)
where urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0258, urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0259, urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0260, urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0261 and urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0262 are the corresponding vectors and matrices in the rth group. Let urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0263 be a generic IV matrix for urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0264, which satisfies the following assumption.

Assumption 4.4.The probability limits of urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0265 and urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0266 urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0267 exist and are non-singular. Furthermore, urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0268 and

urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0269

Assumption 4.4 requires that urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0270 has full rank asymptotically, which is a sufficient condition for the identification of δ2. The IV estimator with urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0271 for 4.20 is
urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0272(4.21)

Theorem 4.2.Under Assumptions 4.1, 4.2 and 4.4, the IV estimator urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0273 with urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0274 is consistent and

urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0275

Let urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0276 denote a matrix consisting of urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0277 and some of its powers. The 2SLS estimator with urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0278 is
urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0279(4.22)
where its asymptotic variance is urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0280. For the estimation of σ2, the moment urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0281 implies that a variance estimator can be
urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0282(4.23)

5. THE GMM APPROACH

If all the exogenous regressors were irrelevant, the above IV methods would no longer be applicable. In other words, if β0 and γ0 were zero, IV moments alone would not identify the model parameters. Furthermore, it is impossible to test the joint significance of all the exogenous regressors based on those IV estimators. To remedy such a possible but unknown scenario, we suggest a combination of IV moments with some recurrence relations for moments of order statistics. The resulting estimation method is distribution-free. However, when regressors are really relevant, the validity of the recurrence relations would require that the regressors urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0283 are i.i.d. for all i and r, which is a strong assumption of this estimation strategy for the model.

We consider the GMM approach for the general model, namely 4.15. Denote urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0284, which has the same ascending order as urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0285 in terms of order statistics. We can eliminate the common group effect of 4.15 by the differencing method, and we obtain the following system
urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0286(5.1)
and
urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0287(5.2)

Assumption 5.1.urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0288 and urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0289 are i.i.d. for all members in a group as well as across groups.

Recurrence relations for moments of order statistics are based on the i.i.d. property of random variables. Let urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0290 and urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0291 be the ith-order statistics in random samples of size m and urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0292, respectively. We have a useful recurrence relation
urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0293(5.3)
which follows directly from a lemma of Balakrishnan and Sultan (1998) in Appendix D. Denote urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0294 for urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0295. Equation 5.3 implies
urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0296(5.4)
for urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0297. By taking urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0298, 5.4 implies that
urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0299(5.5)
for urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0300. Denote urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0301 and urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0302 for urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0303. For a generic group of size m, the system of equations 5.1 and 5.2 implies that
urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0304(5.6)
for urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0305, and
urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0306(5.7)
for urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0307. It follows that urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0308 and urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0309 for urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0310. The corresponding moment in terms of dependent variables of 5.1 and 5.2 is
urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0311(5.8)
for urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0312. As in the general GMM estimation framework, let urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0313 be some weighting matrix such that urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0314 is non-negative definite. The GMM estimator with weights urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0315 is urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0316, which minimizes urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0317, where urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0318 is an empirical moment vector consisting of the recurrence moment 5.8 and the IV moments for the model 4.15. The GMM estimator satisfies the first-order condition
urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0319

Assumption 5.2.urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0320 converges in probability to a constant weighting matrix a0 with a rank no less than urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0321, and urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0322 converges in probability to D0 that exists and is finite.

Theorem 5.1.Under Assumptions 4.1, 4.2, 4.4, 5.1 and 5.2,

urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0323
where urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0324.

By the generalized Schwartz inequality, the optimal choice of a0 is urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0325. It follows that the optimal GMM estimator minimizes urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0326, where urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0327 is a consistent estimate of V2 (see Appendix D for details on the Proof of Theorem 5.1).

6. THE MODEL WITH MIXED MEAN AND MAXIMUM

We consider a generalized social interaction model with mixed mean and maximum as follows,
urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0328(6.1)
with urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0329, or, equivalently,
urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0330(6.2)
Following Theorem 3.1, 6.1 with urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0331 but urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0332 is a well-defined system and has a unique solution. It includes the social interaction with the mean or maximum of peers as a special case. For empirical application, we provide a simple IV estimator under the fixed-effects specification.
Denote urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0333. As urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0334, it implies that urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0335 urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0336 and urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0337. The within-group equation of 6.1 is
urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0338
Writing the within-group equation of 6.1 in vector form, we have
urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0339(6.3)
where urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0340 collects all explanatory variables in a single matrix, and urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0341 is the corresponding parameter vector. Let urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0342 be a generic IV matrix for urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0343, which satisfies the following assumptions.

Assumption 6.1.Assume that urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0344 for all r where the lower bound urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0345 and the upper bound mU are finite. Furthermore, either urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0346 or there are sufficient number of groups with urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0347.

Assumption 6.2.The probability limits of urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0348 and urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0349 urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0350 exist and are non-singular. Furthermore, urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0351 and

urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0352

Assumption 6.1 requires that group sizes are not all less than three for model identification, because urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0353 and urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0354 are identical for urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0355. Assumption 6.2 requires that urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0356 has full rank asymptotically, which is a sufficient condition for the identification of δ3. The IV estimator with urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0357 for 6.3 is
urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0358(6.4)

Theorem 6.1.Under Assumptions 4.1, 4.2 and 6.2, the IV estimator urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0359 with urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0360 is consistent and

urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0361

Following Lee (2007), we can use urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0362 as an IV for urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0363. Let urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0364 denote a matrix consisting of urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0365 and urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0366 in 4.22. The 2SLS estimator with urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0367 is
urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0368(6.5)
with the asymptotic variance urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0369. Similarly to 4.23, the variance estimator of urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0370 is
urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0371(6.6)

7. MONTE CARLO RESULTS

The models used to generate urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0372 are given by 4.1 and 4.15, and these are
urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0373

We randomly draw urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0374 from the standardized χ2(5) distribution to investigate finite sample properties of distribution-free estimators under this non-normal distribution. The two regressors urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0375 and urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0376 are i.i.d. N(0, 1) for all r and i, and are independent of urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0377. The group effect urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0378 is generated as urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0379. The true parameters are urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0380, urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0381, urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0382 and urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0383. We allow group sizes to vary from three to five. We design the same number of groups for each group size. The average group size is four by design. In the simulation, we experiment with different numbers of groups R from 60 to 1920. The number of Monte Carlo repetitions is 300.

We try two different IV estimators. IV1 uses the group average urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0384 of 4.9 as instruments, and IV2 uses urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0385 and urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0386 of 4.10 in the first model. Because urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0387 is invariant within the group, only IV2 is relevant to the second model. We also consider the GMM and the two-step procedure (TSP). The GMM uses IV1 and recurrence relation 5.3 for the first model, and uses IV2 and 5.8 for the second model. For computational simplicity, we use the identity matrix as a weighting matrix in a GMM objective function. The TSP uses the moments of 5.3 or 5.8 to estimate λ in the first step, and then regresses fitted residuals on exogenous variables to estimate their coefficients by the ordinary least-squares method in the second step.

Table 1 presents the bias (Bias) and standard deviation (SD) of IV1, IV2, GMM and TSP estimates of Model 1. The IV1 and IV2 estimates of λ are unbiased for all R. The GMM estimate of λ is biased upward when urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0388 or 120, and this bias decreases as R increases. The TSP estimate of λ has a large upward or downward bias when urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0389 or 120, but this bias decreases sharply when urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0390 or more. All the estimates of λ have smaller SDs as R increases. The IV1 estimate of λ has much smaller SDs than those of the other estimates. The IV2 estimate of λ has larger SDs than those of the GMM estimates for all R, and has smaller SDs than that of the TSP estimate when urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0391 or less. However, it has larger SDs than that of the TSP estimate when urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0392 or more. These estimates of β, γ, μ and σ have similar properties in bias and standard deviation. Table 2 presents finite sample results of IV2, GMM and TSP estimates of Model 2. The IV2 estimate of λ is unbiased for all R. The GMM estimate of λ is biased downward when urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0393 or less, and this bias decreases as R increases. The TSP estimate of λ has large upward bias when urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0394 or 120, but this bias decreases sharply when urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0395 or more. The GMM estimate of λ has smaller SDs than those of the IV2 and TSP estimates. These estimates of β and σ have similar properties in bias and standard deviation.

Table 1. Model 1 under χ2(5) distribution
R IV1 IV2 GMM TSP R IV1 IV2 GMM TSP
λ Bias 60 −0.0040 0.0737 0.1958 0.8023 480 −0.0004 0.0026 0.0484 −0.0421
SD 0.0743 0.9290 0.2477 11.1507 0.0227 0.2368 0.1325 0.2097
β Bias −0.0065 −0.0373 −0.0856 −0.2203 0.0005 −0.0017 −0.0217 0.0142
SD 0.0687 0.2474 0.1403 3.2108 0.0235 0.0807 0.0650 0.0750
γ Bias 0.0048 −0.1467 −0.3810 −1.7597 0.0012 −0.0039 −0.0981 0.0804
SD 0.1669 1.8454 0.4833 24.1389 0.0519 0.4807 0.2612 0.4082
μ Bias −0.0013 0.0097 −0.0185 0.0289 0.0014 0.0024 −0.0030 −0.0011
SD 0.0659 0.1450 0.0978 1.4228 0.0236 0.0274 0.0250 0.0318
σ Bias 0.0075 0.3261 0.0727 2.1491 0.0012 0.0588 0.0117 0.0516
SD 0.0701 1.0161 0.1516 15.5359 0.0247 0.1256 0.0567 0.1660
λ Bias 120 0.0013 0.0129 0.1451 −0.2968 960 −0.0013 −0.0122 0.0273 −0.0138
SD 0.0464 0.5421 0.2495 3.3297 0.0181 0.1603 0.0890 0.1065
β Bias −0.0020 −0.0051 −0.0609 0.1065 −0.0002 0.0033 −0.0135 0.0037
SD 0.0502 0.1934 0.1261 1.1990 0.0170 0.0548 0.0438 0.0402
γ Bias 0.0014 −0.0296 −0.2809 0.6019 0.0028 0.0239 −0.0549 0.0272
SD 0.1028 1.0825 0.4884 6.8572 0.0414 0.3201 0.1803 0.2147
μ Bias −0.0030 −0.0027 −0.0218 0.0182 −0.0001 0.0003 −0.0025 −0.0003
SD 0.0441 0.0812 0.0624 0.2489 0.0176 0.0188 0.0190 0.0187
σ Bias 0.0007 0.2169 0.0541 0.9507 0.0003 0.0313 0.0034 0.0155
SD 0.0515 0.4537 0.1307 5.2488 0.0187 0.0692 0.0300 0.0535
λ Bias 240 −0.0025 −0.0043 0.0863 −0.0016 1920 −0.0014 0.0008 0.0085 −0.0141
SD 0.0349 0.4986 0.1721 1.7710 0.0124 0.1097 0.0638 0.0722
β Bias 0.0016 −0.0021 −0.0360 0.0045 0.0013 0.0003 −0.0032 0.0054
SD 0.0354 0.1546 0.0835 0.5087 0.0122 0.0376 0.0310 0.0266
γ Bias 0.0051 0.0085 −0.1759 0.0102 0.0018 −0.0029 −0.0188 0.0262
SD 0.0792 1.0067 0.3481 3.3791 0.0287 0.2190 0.1253 0.1424
μ Bias −0.0037 −0.0048 −0.0091 −0.0056 −0.0004 −0.0001 −0.0010 −0.0006
SD 0.0345 0.0557 0.0363 0.1201 0.0121 0.0128 0.0125 0.0125
σ Bias −0.0009 0.1600 0.0200 0.3584 0.0003 0.0137 0.0026 0.0094
SD 0.0340 0.4625 0.0676 2.5571 0.0126 0.0369 0.0221 0.0270
Table 2. Model 2 under χ2(5) distribution
R IV2 GMM TSP R IV2 GMM TSP
λ Bias 60 0.0282 −0.2355 4.2695 480 0.0068 −0.0457 0.0188
SD 0.9543 0.5486 71.3090 0.2629 0.2407 0.2746
β Bias −0.0073 −0.0661 1.1213 0.0014 −0.0122 0.0041
SD 0.2071 0.1582 19.1270 0.0640 0.0646 0.0661
σ Bias 0.0403 0.0106 1.5919 0.0046 −0.0038 0.0061
SD 0.1949 0.0966 23.2660 0.0467 0.0461 0.0529
λ Bias 120 0.0279 −0.1006 0.2327 960 0.0119 −0.0073 0.0239
SD 0.6435 0.4291 1.3350 0.1861 0.1848 0.1962
β Bias −0.0006 −0.0308 0.0465 0.0021 −0.0006 0.0051
SD 0.1453 0.1235 0.2943 0.0443 0.0508 0.0484
σ Bias 0.0183 −0.0067 0.0591 0.0030 0.0001 0.0048
SD 0.1193 0.0793 0.3586 0.0355 0.0342 0.0364
λ Bias 240 0.0000 −0.1326 0.0057 1920 0.0006 −0.0141 0.0017
SD 0.3751 0.3166 0.4717 0.1394 0.1320 0.1426
β Bias 0.0002 −0.0324 0.0024 0.0014 −0.0021 0.0018
SD 0.0894 0.0867 0.1090 0.0330 0.0352 0.0356
σ Bias 0.0049 −0.0156 0.0074 0.0008 −0.0016 0.0008
SD 0.0671 0.0560 0.0948 0.0242 0.0231 0.0248

If exogenous regressors are irrelevant to the model, the IV method is inconsistent. However, GMM and TSP, which use recurrence relations for moments of order statistics, remain valid. For illustration, we present finite sample results of the IV, GMM and TSP estimates of the models when β0 and γ0 are zero. The IV1 estimate of λ has significant upward or downward bias for all R in Table 3, and the IV2 estimate of λ has significant downward bias for all R in Table 4. In contrast, the GMM and TSP estimates of λ have smaller biases and these biases decrease as the number of groups increases.

Table 3. Model 1 with irrelevant IV under χ2(5) distribution
R IV1 GMM TSP R IV1 GMM TSP
λ Bias 60 1.1676 0.0801 −0.1009 480 −1.67 × 102 0.0190 −0.0041
SD 11.0140 0.2450 0.7187 2.82 × 103 0.0819 0.0938
β Bias −0.0004 −0.0026 −0.0055 −2.31 × 10−2 0.0010 0.0008
SD 0.1408 0.0826 0.0733 3.03 × 10−1 0.0232 0.0225
γ Bias −0.0860 −0.0021 −0.0018 6.20 × 100 0.0004 0.0000
SD 1.0608 0.0797 0.0845 1.06 × 102 0.0224 0.0223
μ Bias −0.7194 −0.0583 0.0660 1.13 × 102 −0.0129 0.0037
SD 6.5073 0.1760 0.4654 1.92 × 103 0.0602 0.0664
σ Bias 1.5278 0.0073 0.0917 1.38 × 102 −0.0011 0.0040
SD 8.2338 0.0877 0.3772 2.33 × 103 0.0314 0.0330
λ Bias 120 17.5350 0.0367 −0.0860 960 −0.8384 0.0071 −0.0056
SD 225.1000 0.1828 0.3066 65.9200 0.0571 0.0665
β Bias −0.2452 0.0047 −0.0007 0.0037 −0.0007 −0.0005
SD 3.5708 0.0774 0.0492 0.0868 0.0157 0.0158
γ Bias 0.3809 0.0099 0.0035 0.2426 0.0001 0.0002
SD 9.5496 0.0782 0.0508 3.1064 0.0169 0.0167
μ Bias −12.7630 −0.0318 0.0523 0.5270 −0.0054 0.0038
SD 165.9500 0.1275 0.2043 43.4560 0.0424 0.0497
σ Bias 16.3410 0.0053 0.0374 6.3651 −0.0007 0.0022
SD 178.1700 0.0731 0.1335 55.7460 0.0218 0.0241
λ Bias 240 0.6479 0.0365 −0.0171 1920 0.7953 0.0068 0.0007
SD 11.7950 0.1213 0.1576 6.8995 0.0402 0.0472
β Bias 0.0113 0.0020 0.0013 0.0017 0.0009 0.0009
SD 0.0927 0.0330 0.0332 0.0192 0.0114 0.0113
γ Bias −0.0418 0.0004 0.0002 −0.0002 −0.0014 −0.0013
SD 0.4317 0.0348 0.0369 0.1063 0.0115 0.0113
μ Bias −0.4100 −0.0298 0.0084 −0.5260 −0.0052 −0.0008
SD 8.2165 0.0865 0.1098 4.5290 0.0295 0.0345
σ Bias 1.6218 −0.0052 0.0079 1.3726 −0.0011 0.0003
SD 9.5404 0.0398 0.0536 5.4796 0.0146 0.0158
Table 4. Model 2 with irrelevant IV under χ2(5) distribution
R IV2 GMM TSP R IV2 GMM TSP
λ Bias 60 −2.7994 −0.2025 0.2221 480 −2.3605 −0.0198 0.0133
SD 11.1320 0.5884 1.1072 5.2979 0.2540 0.2783
β Bias 0.0024 −0.0033 −0.0050 0.0014 0.0000 0.0007
SD 0.1454 0.0972 0.0791 0.0371 0.0317 0.0256
σ Bias 0.3681 −0.0106 0.0422 0.0732 −0.0010 0.0026
SD 1.6721 0.0926 0.1724 0.6112 0.0428 0.0444
λ Bias 120 −2.0023 −0.0949 0.1380 960 −2.6978 −0.0080 0.0126
SD 6.6691 0.4514 0.6714 4.7140 0.1734 0.1889
β Bias 0.0001 −0.0052 −0.0026 −0.0019 0.0017 0.0002
SD 0.0728 0.0664 0.0554 0.0235 0.0237 0.0189
σ Bias 0.1047 −0.0092 0.0189 0.0297 −0.0007 0.0016
SD 0.9480 0.0743 0.1002 0.5201 0.0289 0.0305
λ Bias 240 −2.5636 −0.0647 0.0074 1920 −1.4783 −0.0102 −0.0022
SD 6.2293 0.3319 0.3822 10.8100 0.1250 0.1332
β Bias 0.0012 0.0026 0.0023 −0.0011 0.0017 0.0016
SD 0.0597 0.0467 0.0391 0.0278 0.0165 0.0135
σ Bias 0.0767 −0.0070 0.0007 0.2684 −0.0011 −0.0002
SD 0.8438 0.0521 0.0573 1.7928 0.0199 0.0208

8. AN EMPIRICAL EXAMPLE

In college basketball, all major individual player awards are given based on regular season performance. For example, the John R. Wooden Award is given to the most outstanding college basketball players every year. For the winners of these awards, it is not only a great honour, but it also increases the probability of being drafted by a National Basketball Association (NBA) team, which has the highest average salary among major sports leagues. In our example, we assume that a college basketball player i in a team r will maximize his net pay-off function 2.11. After the reparametrization of 2.12, we obtain his best response function:
urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0398(8.1)
Note that urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0399 and urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0400 capture observed and unobserved common group factors (e.g. coach effects), respectively, which are eliminated by the covariance transformation in our fixed-effects estimation framework. As a comparison, we estimate the group interaction model with the mean specification as well as the generalized model 6.1.

There are over 10,000 men's basketball players competing in three divisions at about 1,000 colleges and universities within the NCAA. Division I (D-I) has the most prestigious basketball programmes, which have the greatest financial resources and attract the most athletically talented students. We collect men's basketball player statistics of D-I teams during the period 2005–2010 from web sites including ncaa.org, espn.com and statsheet.com. The total number of teams (groups) is 1,673 in the five seasons. The total number of observations is 22,122. The average team size is 13, with a minimum of eight and a maximum of 20.

Table 5 reports the descriptive statistics of variables used in the regression analysis. We measure per-game performances of a player, based on his various accomplishments, such as points, assists, rebounds, steals and blocks. These accomplishments are the most important factors in determining his chance of winning the awards and/or becoming an NBA player. Our measures are different from the player efficiency rating (PER), which measures a player's per-minute performance. Some experts have criticized the PER on the grounds that it gives undue weight to a player's contribution in limited minutes. Hence, we use the per-game performance to measure a player's contribution to the team. We use several per-game performance measures (urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0401) as the dependent variables of our regression models. There are five urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0402 measures that are different combinations of points, assists, rebounds, blocks and steals. Estimated results of each of these are reported separately in Tables 610. Specifically, we have the following measures for the dependent variable urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0403 for a player:
  1. the dependent variable in Table 6 is the points per game of a player;
  2. the dependent variable in Table 7 is the sum of points and assists per game;
  3. the dependent variable in Table 8 is the sum of points, assists and rebounds per game;
  4. the dependent variable in Table 9 is the sum of points, assists, rebounds, and steals per game;
  5. the dependent variable in Table 10 is the sum of points, assists, rebounds, steals and blocks per game.
Table 5. Summary of regression variables
Variable Obs. Mean SD Min Max
Points per game 22,122 5.71 4.90 0.00 28.65
Assists per game 22,122 1.10 1.17 0.00 11.69
Rebounds per game 22,122 2.65 2.06 0.00 14.80
Steals per game 22,122 0.57 0.50 0.00 4.77
Blocks per game 22,122 0.28 0.43 0.00 6.53
Height (inches) 22,122 76.69 3.58 63.00 91.00
Weight (pounds) 22,122 203.76 26.27 80.00 380.00
Minutes per game 22,122 17.02 10.51 0.30 40.00
Freshman 22,122 0.32 0.47 0.00 1.00
Sophomore 22,122 0.26 0.44 0.00 1.00
Junior 22,122 0.23 0.42 0.00 1.00
Senior 22,122 0.19 0.39 0.00 1.00
Table 6. Basketball performance: points per game
IV IV IV IV GMM
Regressor (1) (2) (3) (4) (5)
Best performance of peers −1.359 −1.253 −1.355
(0.218) (0.220) (0.217)
Average performance of peers −2.173 −0.884
(0.477) (0.556)
Height 0.012 0.015 0.010 0.015 0.015
(0.007) (0.006) (0.006) (0.005) (0.006)
Weight 0.047 0.045 0.038 0.042 0.045
(0.001) (0.008) (0.008) (0.007) (0.008)
Minutes per game 4.379 3.729 3.594 3.460 3.731
(0.038) (0.109) (0.199) (0.186) (0.109)

Note

  • Number of observations is 22,122. Individual coefficients are statistically significant at the urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0396 or urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0397 levels. Standard errors are given in parentheses under coefficients.
Table 7. Basketball performance: sum of points and assists per game
IV IV IV IV GMM
Regressor (1) (2) (3) (4) (5)
Best performance of peers −1.057 −0.948 −1.056
(0.139) (0.1318) (0.139)
Average performance of peers −2.173 −1.241
(0.477) (0.413)
Height −0.094 −0.073 −0.078 −0.065 −0.073
(0.007) (0.006) (0.007) (0.006) (0.006)
Weight 0.027 0.032 0.022 0.029 0.032
(0.010) (0.008) (0.008) (0.007) (0.008)
Minutes per game 5.176 4.611 4.355 4.141 4.612
(0.038) (0.081) (0.199) (0.170) (0.080)

Note

  • Number of observations is 22,122. Individual coefficients are statistically significant at the urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0404 or urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0405 levels. Standard errors are given in parentheses under coefficients.
Table 8. Basketball performance: sum of points, assists and rebounds per game
IV IV IV IV GMM
Regressor (1) (2) (3) (4) (5)
Best performance of peers −1.174 −1.092 −1.169
(0.205) (0.212) (0.204)
Average performance of peers −1.983 −0.766
(0.477) (0.458)
Height 0.053 0.047 0.046 0.045 0.047
(0.008) (0.007) (0.007) (0.006) (0.007)
Weight 0.161 0.138 0.135 0.130 0.139
(0.011) (0.010) (0.011) (0.010) (0.010)
Minutes per game 6.714 6.017 5.619 5.642 6.020
(0.044) (0.127) (0.236) (0.218) (0.127)

Note

  • Number of observations is 22,122. Individual coefficients are statistically significant at the urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0406 or urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0407 levels. Standard errors are given in parentheses under coefficients.
Table 9. Basketball performance: sum of points, assists, rebounds and steals per game
IV IV IV IV GMM
Regressor (1) (2) (3) (4) (5)
Best performance of peers −1.131 −1.067 −1.126
(0.204) (0.214) (0.204)
Average performance of peers −1.777 −0.579
(0.425) (0.462)
Height 0.031 0.030 0.028 0.029 0.030
(0.008) (0.007) (0.007) (0.007) (0.007)
Weight 0.152 0.132 0.129 0.126 0.133
(0.011) (0.010) (0.011) (0.010) (0.010)
Minutes per game 7.086 6.391 6.045 6.093 6.393
(0.046) (0.131) (0.252) (0.232) (0.131)

Note

  • Number of observations is 22,122. Individual coefficients are statistically significant at the urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0408 or urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0409 levels. Standard errors are given in parentheses under coefficients.
Table 10. Basketball performance: sum of points, assists, rebounds, steals and blocks per game
IV IV IV IV GMM
Regressor (1) (2) (3) (4) (5)
Best performance of peers −1.121 −1.051 −1.118
(0.205) (0.211) (0.204)
Average performance of peers −1.814 −0.698
(0.426) (0.456)
Height 0.090 0.080 0.078 0.076 0.081
(0.009) (0.007) (0.008) (0.007) (0.007)
Weight 0.155 0.133 0.132 0.125 0.133
(0.012) (0.010) (0.011) (0.010) (0.010)
Minutes per game 7.233 6.534 6.154 6.673 6.536
(0.047) (0.134) (0.258) (0.119) (0.133)

Note

  • Number of observations is 22,122. Individual coefficients are statistically significant at the urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0410 or urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0411 levels. Standard errors are given in parentheses under coefficients.

The explanatory variables urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0412 for each of the Tables 610 are the same, and include height, weight and minutes per game. Height gives a major advantage in professional basketball, because taller players generally achieve more rebounds, block more shots and make more dunks than shorter players. Weight might also give a college player some advantages, because strength is an important factor in many aspects of the game. Both height and weight will have a positive effect on a player's performance. Minutes per game is defined as the total minutes played in one season divided by the number of games played, and this is expected to have a positive coefficient. For reasons of scale, we divide weight and minutes per game by ten in the actual regressions.

Tables 610 report the regression estimates under the fixed-effects specification. Column (1) presents the within-group IV estimates of 6.1, for which there are no endogenous interaction regressors. We find that the coefficients of these regressors (i.e. the elements of the vector urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0413 discussed above) are statistically significant. Physical characteristics (with an exception in Table 7) have positive estimates, which indicate that taller and stronger athletes play better basketball. Table 7 uses the sum of points and assists as the performance measure, and finds that height has a negative effect. The point guard is one of the most important positions on a basketball team, and players at this position are shorter and are awarded more assists, which might explain the unexpected sign of height. Minutes per game is potentially endogenous because coaches generally give better players more time on the court. Hence, we use the years of college education (freshman, sophomore and junior) as the instruments for this endogenous regressor, and the IV estimates are statistically significant and positive.

Because interaction regressors are endogenous, we use the instruments proposed in Section 4.2 for the best performance of peers and the instruments proposed by Lee (2007) for the average performance of peers. Columns (2) and (3) show that the interaction regressor has a negative and statistically significant effect in the respective model. Column (4) shows that both interaction regressors remain negative in the generalized model 6.1. The best performance of peers is statistically significant, whereas the average performance of peers is statistically insignificant, except for the case that uses the sum of points and assists as a performance measure. Column (5) presents the GMM estimates of the model 8.1, which are similar to the IV estimates. We conclude that a college basketball player reacts negatively and statistically significantly to the best performance of his teammates, which suggests that there exist important competition effects among basketball players in an NCAA D-I team.

9. CONCLUSION

In this paper, we introduce a social interaction econometric model with an extreme order statistic, which differs from conventional social interaction models in that an individual's outcome is affected by the extreme outcome of peers. Our model represents one of the possible extensions to allow non-linearity in modelling endogenous social peer effects.

We show that the social interaction model with an extreme order statistic is a well-defined system of equations. The solution to the system exists and can be unique. We provide a simple IV method for model estimation if exogenous regressors are relevant. A distribution-free method that uses recurrence relations is applicable, even when regressors might be irrelevant. It should be noted that ML estimation would be infeasible for the model including exogenous regressors (unless the number of players is very small), even if one were willing to make a distributional assumption of the disturbances, because the likelihood function of order statistics for independent but not identically distributed variates is rather complicated.

We apply the social interaction models with peer's maximum, peer's average and a mixed specification to the NCAA D-I men's basketball data. We find that a college player performs negatively and statistically significantly in response to the best performance of his peers, which suggests the existence of competition effects in the basketball game. For future research, we can apply this model to explore the possible ‘bad apple’ effect in an empirical study of education. We can also consider a model with reaction to certain quantiles of the outcome distribution.

ACKNOWLEDGEMENTS

We thank two referees and the co-editor, Jaap Abbring, for their valuable comments and suggestions, which have improved the presentation of this paper.

  1. 1 Alternative but related non-linear models might consider interactions in terms of certain quantiles of a distribution.
  2. 2 A tournament scheme is an extreme form of relative performance schemes.
  3. 3 This is also true in crime, education, academic research and sports; see, e.g. Calvo-Armengol and Zenou (2004) for details of the influence of friends on criminal activities.
  4. 4 We thank an anonymous referee for pointing this out.
  5. 5 The presence of social capital concerns the formation of the network links. It represents the time constraint or cost of an individual in socializing (or learning) with peers. Without such a constraint, an individual would not need to target effort to learn from the best peer. This is because learning from more peers would be beneficial if learning would enhance performance.
  6. 6 However, even for the case urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0820 with a unique solution, there would be some technical difficulty for the MLE method because the parameter space of λ is not a connected set due to the need to rule out urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0821. When the parameter space is not connected, the use of the mean value theorem for analysis of asymptotic distribution of the ML estimator might not be justifiable. Furthermore, the determinant of the Jacobian transformation urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0822 would have its sign changed. All of the above would have created technical difficulties for estimation using the ML method. So, for the ML estimation method, it would be desirable to limit attention to the stable model with the restriction urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0823. However, because there is no specific constraint imposed on λ, the IV or 2SLS methods do not have such technical difficulties. We are grateful to a referee who has directed our attention to the tractability of the IV and 2SLS methods for the estimation of the structural equation.
  7. 7 A star-shaped network is one where one central agent is in direct contact with all the other peripheral agents who, in turn, are only connected to this central agent in the star; see footnote 5 of Calvo-Armengol and Zenou (2004, p. 940). In our social interaction model with the maximum, the star is simply the one with the maximum value. Hence, our network resembles an extension of a star network.
  8. 8 Our model assumes that the coefficient λ is independent of urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0824, which could be a restrictive assumption. If necessary, we might extend the model with its parameters to depend on its group size as additional interactive components.
  9. 9 If urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0825 is a dichotomous variable, then its power is the same as urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0826 itself.
  10. 10 We suggest that the second and third powers of x are used in the estimation. Our Monte Carlo experiments find that higher powers of x as instrumental variables would only slightly improve efficiency.
  11. 11 In this case, the GMM estimates do not necessarily have smaller empirical variances than the IV estimates.
  12. 12 There are 346 D-I basketball teams. Each season has several teams with missing data. These teams are not included in our data.
  13. 13 The PER has been developed by John Hollinger, an NBA analyst and ESPN writer.
  14. 14 The coefficients of the best performance of peers are smaller than −1. This might be because the IV estimators impose no bounds on parameters and happen to have large empirical standard errors. However, the coefficients of the average performance of peers can be smaller than −1, as pointed out by Lee (2007). Assumption 4.4 says that urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0827 for urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0828 (Lee, 2007). In our case, urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0829, because the minimum size is 8.
  15. 15 We do not impose the lower bound on λ in the GMM estimation.
  16. 16 When urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0830 are continuously distributed, the probability that urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0831 for urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0832 is zero. Hence, it is appropriate to assume that urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0833 are distinct.
  17. APPENDIX A: NOTATION LIST AND MOMENTS OF ORDERED NORMAL VARIABLES

    Table A.1 summarizes some frequently used notations in the text.

    Table A.1. Notation list
    Notation Description
    R total number of groups
    urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0414 member size of the rth group
    n total number of observations
    urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0415 empirical mean of group size
    urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0416 urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0417-dimensional column vector of ones
    urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0418 orthogonal projector to the linear space spanned by the vector urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0419
    urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0420 covariance transformation matrix
    urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0421 column vector of disturbances in the rth group
    urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0422 column vector of dependent variables in the rth group
    urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0423 column vector of order statistics of urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0424 in ascending order
    urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0425 column vector of order statistics of urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0426 in ascending order
    urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0427 ith-order statistic of ε in a group of size m
    urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0428 ith-order statistic of y in a group of size m
    urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0429 spacing statistic between urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0430 and urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0431
    urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0432 spacing statistic between urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0433 and urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0434
    urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0435 difference between largest and ith-order statistics of ε
    urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0436 difference between largest and ith-order statistics of y

    The following properties for normal-order statistics are useful.

    Lemma A.1.Suppose that urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0437 are i.i.d. N(0, 1). Let the corresponding urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0438 be order statistics of standard normal variables in the ascending order. Then, (a) urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0439, and (b) urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0440, where urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0441 is the sample mean of urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0442.

    The first result is a special case of the recurrence relation for the standard normal distribution (Balakrishnan and Sultan, 1998, p. 174).

    Lemma A.2.Let urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0443 denote order statistics of standard normal variables of a random sample with size m in the ascending order. Then, (a) urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0444; (b) urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0445 for urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0446; (c) urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0447 for urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0448; (d) urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0449; (e) urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0450; (f) urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0451 for urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0452; (g) urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0453; (h) urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0454.

    The property that urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0455 is independent of urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0456 under normality is useful for the preceding properties. In addition, the technique of the integration by parts can be used to derive some of the above results. Detailed proofs of these results are available upon request.

    APPENDIX B: PROOFS OF RESULTS IN THE MAIN TEXT

    Proof of Theorem 3.1.Given urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0457, without loss of generality, assume that values of ξ are distinct. They can be permuted into the order statistics with an ascending order urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0458, i.e. urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0459. Because urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0460, define

    urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0461(B.1)

    When urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0462 (and urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0463), the solution vector urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0464 in B.1 satisfying the system 3.1 can be seen as follows. First, the values of urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0465 also have an ascending order like those of urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0466, i.e., urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0467 if urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0468. This is so, because urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0469 by assumption,

    urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0470
    Furthermore, for urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0471, urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0472, and urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0473. Second, the system B.1 implies that urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0474. From urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0475, it implies that urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0476. By substitution,
    urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0477
    This also implies that urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0478. Therefore, the system of urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0479 follows. By permuting the indices back to the original ones in terms of ξ, the system consisting of urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0480 implies the existence of a solution to the original system 3.1.

    It remains to show that 3.1 has a unique solution for given urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0481. We can show that the inverse mapping theorem from y to ξ has an order preserving property. If urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0482 in 3.1 with urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0483, then it is necessary that urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0484. This is so, because urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0485, and 3.1 becomes urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0486 and urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0487 for urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0488. Hence, urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0489 and urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0490 for urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0491. This property is used to guarantee the unique solution of the system 3.1 for given ξ.

    Finally, we can show that the system 3.1 has a unique solution for given ξ. Without loss of generality, assume that urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0492. From the preceding arguments, there exists a vector urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0493 such that urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0494 satisfying 3.1. In particular, it has

    urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0495
    Suppose that urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0496 is another solution of 3.1 with an ascending order urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0497. Then
    urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0498
    (i.e. urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0499 and urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0500). Because both urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0501 and urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0502 for urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0503, it follows that urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0504. Next, suppose that there exists y such that its components are not in ascending order. In that case, there exists a permutation such that urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0505. Previous arguments will imply that the inverse vector of urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0506 from 3.1 must have an ascending order. This is a contradiction because the original elements of the vector ξ are in ascending order.

    From the above results, given any urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0507, there exists a unique solution vector y to the system 3.1. By reordering ξ into ascending order, 3.1 can be conveniently rewritten as the equivalent explicit expression 3.2. urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0508

    Proof of Theorems 4.1, 4.2 and 6.1: The IV estimator with urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0509 implies that
    urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0510
    Under Assumption 4.3, urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0511 is non-singular, and urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0512urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0513. Hence, urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0514 is consistent. For the limiting distribution,
    urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0515
    Under Assumptions 4.2 and 4.3, urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0516 urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0517, it follows that
    urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0518
    Similarly, we can show that urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0519 and urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0520 are consistent and are asymptotically normal. urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0521

    APPENDIX C: LIKELIHOOD FUNCTIONS OF MODELS

    The ML approach is feasible for the simple model, urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0522, if a specific parametric distribution is specified for the disturbances. Let urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0523, urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0524 and
    urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0525
    Let urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0526 and urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0527 be column vectors of order statistics. The structural equation system is urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0528, and the reduced-form system is urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0529, with
    urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0530
    Suppose that urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0531 are i.i.d. with a density function urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0532. The implied density function of urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0533 is urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0534 (David, 1981, p. 10). Because the determinant of Λ is urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0535 by the partition matrix formulae (see Proposition 30 of Dhrymes, 1978), the density function of urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0536 is
    urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0537
    Because the normality assumption is of special interest, we consider the ML approach in some detail under the assumption that urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0538 are normally distributed with zero mean and a finite variance σ2. The model equation in the rth group can be written as
    urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0539(C.1)
    Denote urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0540. The log likelihood function is
    urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0541(C.2)
    For any λ, the MLE of σ2 is
    urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0542
    We can work with the concentrated likelihood on λ, which might have a computational advantage because it involves only a single unknown parameter. It has analytical advantage too, if our interest is focused on the asymptotic distribution of the estimate of λ alone; see Amemiya (1985). The concentrated log likelihood function at λ is
    urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0543(C.3)
    Let urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0544 be the true parameter vector. To investigate the asymptotic properties of the MLE of (C.2), the following basic regularity conditions are assumed.

    Assumption C.1.urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0545 are i.i.d. urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0546.

    Assumption C.2.The parameter space Θ is a compact convex subset of urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0547, and the true λ0 is in the interior of Λ, a connected compact subset of the interval ( − 1, 1).

    The compact parameter space in Assumption C.2 is needed because the ML approach works with the concentrated likelihood function C.3, which is non-linear in λ. The condition on the lower and upper bounds of λ guarantees that urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0548 is well defined and that urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0549 is bounded away from zero for all urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0550. Assumption C.2 also requires σ2 to be bounded away from zero. The first-order derivatives of the log likelihood function C.2 are
    urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0551
    urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0552
    At λ0, urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0553 and urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0554 for urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0555. Hence,
    urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0556
    where urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0557, because urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0558. From the moments for the normal-order statistics,
    urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0559
    The zero expected score at θ0 relies on the moment properties that urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0560 of the normal-order statistics from Lemma A.1. Because these moment properties would not necessarily hold for other distributions, this illustrates that the consistency of the MLE depends on the distributional assumption. In this case, it is the normality in Assumption C.1. In other words, the MLE assuming normality would not be robust to other distributions.
    Define a non-stochastic function
    urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0561
    where
    urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0562
    with urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0563. Following Lemma A.1, we have
    urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0564
    where
    urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0565
    which is positive because of the expectation of the average of squared normal ordered variates. Note that urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0566 is bounded from above because group sizes are finite and bounded, and that it is bounded below by Lemma A.1, because
    urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0567
    where the last equality follows from urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0568 in a group of size two. Therefore, such positiveness would not vanish in the limit, which guarantees the non-singularity of the information matrix. It follows that urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0569 for urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0570 and urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0571.

    Theorem C.1.Under Assumptions 4.1, C.1 and C.2, θ0 is globally identifiable and urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0572 is a consistent estimator of θ0.

    Proof.The consistency of urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0573 will follow from the uniform convergence of urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0574 urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0575 to zero on Λ and the uniqueness identification condition that, for any urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0576,

    urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0577
    where urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0578 is the complement of an open neighbourhood of λ0 in Λ of diameter ε; see Theorem 3.4 of White (1994). Note that urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0579, where
    urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0580
    and urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0581.

    By Kolmogorov's strong law of large numbers or Chebyshev's weak law of large numbers for independent random variables, urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0582 uniformly on Λ, where urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0583 is bounded away from zero on Λ. Therefore, urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0584.

    A third-order Taylor series expansion of the function urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0585 at λ0 can be written as

    urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0586
    where urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0587 lies between λ and λ0, and
    urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0588
    urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0589
    urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0590
    with
    urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0591
    and
    urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0592

    For urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0593,

    urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0594
    Under Assumptions 4.1, C.1 and C.2, we have that
    urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0595
    are of order O(1) uniformly in λ on the bounded set Λ. Thus, there exists a constant urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0596 such that, for large enough R,
    urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0597
    Note that urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0598 for urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0599 and urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0600. Hence, urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0601 urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0602 for some urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0603 for large R. Let urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0604. For urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0605, we have urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0606. Therefore, the consistency of urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0607 follows from the uniform convergence of urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0608 to zero on Λ and the identification uniqueness condition that holds on urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0609. urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0610

    The asymptotic distribution of the MLE urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0611 can be derived from the Taylor series expansion of urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0612 at θ0. The variance matrix of urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0613 urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0614 is
    urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0615
    which is positive definite because urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0616 is strictly positive for urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0617 and urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0618.

    Theorem C.2.Under Assumptions 4.1, C.1 and C.2, urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0619.

    Proof.Applying the mean-value theorem to urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0620 at θ0, we obtain

    urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0621
    where urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0622 lies between urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0623 and θ0. The asymptotic distribution of urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0624 follows by the application of a uniform law of large numbers of independent observations to the Hessian matrix and Lyapunov's central limit theorem to the score vector.

    The second-order derivatives of the log likelihood function are

    urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0625
    urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0626
    urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0627

    Denote urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0628. Then, urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0629 can be expanded in terms of a polynomial of λ as follows:

    urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0630
    Because the stochastic terms on the right-hand side are urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0631 by Chebyshev's law of large numbers of independent observations and because Λ is a bounded set, it follows that
    urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0632

    Similarly, denote urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0633 urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0634. We have urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0635. It follows that

    urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0636
    The uniform convergence of urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0637 in θ follows directly from Assumptions 4.1, C.1 and C.2.

    The first-order derivatives of the log likelihood function at θ0 are

    urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0638
    and
    urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0639

    Let k be an even integer and let urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0640 denote the kth moment of a normal variate. Note that

    urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0641
    Under Assumptions 4.1 and C.1, we have that for all r,
    urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0642
    where the first and second inequalities follow from the urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0643 inequality, the third inequality is from the Cauchy–Schwarz inequality and C is a finite constant. Then
    urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0644
    which goes to zero in the limit. It follows that Lyapunov's central limit theorem can be applied and urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0645. urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0646

    The social interaction model with maximum can include a group effect and an intercept:
    urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0647(C.4)
    We assume that urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0648 and urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0649 are mutually independent and can be characterized by i.i.d. random variables with zero mean and finite variances, urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0650 and σ2, respectively.

    It is revealing to decompose C.4 into two parts, urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0651, which can be called a between-group (BG) equation, and urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0652, which is a within-group (WG) equation, because they bear some similarity to the parts of a panel data regression model (Hsiao, 1986).

    Ignoring potential group effects, the MLE would be inconsistent, because the above likelihood function is misspecified. In panel data models, it often makes a difference in estimation whether researchers treat urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0653 as fixed or random. In the fixed-effects model, we encounter an incidental parameters problem when m is finite but R is large. We can eliminate the group effects (by either the first-differencing method or WG transformation) or we can use the random-effects approach to obtain consistent estimates under the normality assumption. Note that the mutual independence between urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0654 and urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0655 is not necessary for the consistency of the first-differencing or WG MLE.

    1 Derivation of the score equation that ignores common group factors

    Ignoring the group effect urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0656, C.4 reduces to urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0657. The misspecified log likelihood function is
    urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0658
    At λ0 and μ0, urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0659 and urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0660 for urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0661. Denote urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0662, the ratio of urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0663 to σ2. Let ρ0 be the true parameter of ρ. From the moments relations of the normal-order statistics in Lemma A.1,
    urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0664
    Thus, the MLE ignoring group effects would be inconsistent if there exist significant group factors in the model.

    2 Derivation of the likelihood function of fixed-effects specification

    Denote urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0665 for urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0666, and urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0667. Under the condition that ε are i.i.d. urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0668, the likelihood function of the fixed-effects specification of C.4 is
    urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0669
    Substituting the fixed-effects MLEs urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0670, for all r at any given value of λ, into the above likelihood function, the concentrated fixed-effects log likelihood function, urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0671, follows:
    urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0672
    From the moments for the normal-order statistics,
    urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0673
    which is negative as urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0674. Thus, the fixed-effects MLE is inconsistent.

    3 Derivation of the likelihood function for first-differenced samples

    Given ordered random variables urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0675, urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0676, in ascending order, spacing random variables urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0677, for urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0678, are defined as the order statistics via urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0679, urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0680. The relation between the vector urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0681 and urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0682 is urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0683, where
    urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0684
    Because the Jacobian of J is 1, and the density of urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0685 is urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0686, the density of spacings s is urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0687. When ε are i.i.d. urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0688, the density of urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0689 is
    urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0690
    Note that
    urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0691
    Hence,
    urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0692
    Using the first-differencing method to our model, we have
    urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0693
    and
    urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0694
    Let urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0695 and urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0696 represent the spacings between the order statistics of ε and y for the rth group. Because the joint density of urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0697 is urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0698, it follows that
    urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0699
    where urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0700 for urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0701 and urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0702. The log likelihood function of a first-differenced sample with R groups is
    urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0703
    For any λ, the MLE of σ2 from the above likelihood function is
    urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0704

    4 Derivation of the partial likelihood function of the within-group equation

    Let urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0705 and urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0706 be column vectors of demeaned order statistics. The WG equation is urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0707, where
    urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0708
    and urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0709.
    The joint density function of urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0710 is
    urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0711
    which implies that the density function of urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0712 is urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0713. The partial likelihood function of the WG equation follows because the Jacobian is urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0714.
    Under the condition that ε are i.i.d. urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0715, the log likelihood function of the WG equation is
    urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0716
    where urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0717 and urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0718 for urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0719. The MLE of σ2 is
    urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0720
    which is identical to the MLE of σ2 of the first-differencing equation. The numerical equivalence of first-differencing and WG transformation can be easily established because
    urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0721
    where urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0722 for urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0723. Thus, the first-differencing and WG transformations lead to the numerically exact likelihood function.

    5 Derivation of the likelihood function of the random-effects approach

    The variance matrix of the disturbance vector urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0724 is urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0725, which has the determinant urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0726 and the inverse urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0727 with urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0728, urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0729 and urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0730. The likelihood function of the random effects (RE) specification follows. When urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0731 and urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0732 are mutually independent and normally distributed, the log likelihood function under the RE specification is
    urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0733
    where urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0734 and urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0735.

    6 Derivation of the conditional likelihood function of the between-group equation

    From the likelihood function of the RE model and that of the WG equation, their ratio gives the log likelihood function of the BG equation. The likelihood function is the conditional likelihood function of urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0736 conditional on urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0737 for urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0738. It is a partial likelihood function:
    urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0739
    Explicitly, in terms of urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0740, the BG equation becomes
    urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0741
    By a simple recursive argument, urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0742. Thus, the BG equation can be rewritten as
    urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0743
    Note that urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0744 and urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0745 for urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0746, and urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0747 for all i under normality. Hence, we can regard all the variables urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0748 for urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0749 as exogenous variables and the dependent variable is urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0750. The likelihood function of the BG equation is simply the conditional likelihood function of urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0751 conditional on urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0752 for urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0753. The likelihood function of the WG equation is thus a partial likelihood function.
    This likelihood is also related to the conditional likelihood of spacing variables. For the spacing variables, urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0754 for urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0755, the conditional density of urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0756 given urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0757 and urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0758 is
    urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0759
    Because urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0760 for urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0761, urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0762, and
    urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0763
    the conditional density of urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0764 given urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0765 for urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0766 is
    urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0767
    Note that
    urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0768
    By taking into account the random effect urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0769 as the extra source of variation, we see that the density function of the BG equation is related to the density of urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0770.

    The likelihood functions of all the proposed approaches (i.e. the first-differencing approach, RE specification, WG equation and BG equation) are well defined and depend on a fixed number of parameters. Similar to Theorem C.2, under the i.i.d. normality assumption, these estimators can be consistent and asymptotically normal as R goes to infinity. Among these consistent estimators, the RE MLE will be asymptotically efficient.

    7 Monte Carlo results

    We investigate the finite sample performance of MLEs under various specifications for Models C.1 and C.2.
    urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0771
    In contrast, we consider the method of moments estimator (MOME) that uses recurrence relations under the i.i.d. assumption. We randomly draw urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0772 from N(0, 1) to investigate the asymptotic property of the MLE and we also draw urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0773 from the standardized χ2(5) distribution in order to investigate its sensitivity against non-normality. The group effect urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0774 is randomly drawn from N(0, 1). The true parameters are urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0775 and urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0776.

    Table C.1 presents the bias (Bias) and standard deviation (SD) of the MLE and MOMEs of Model C.1 under the normal distribution. The MLEs of λ have very small biases for all R, and its SDs tend to decrease as R increases. The MOMEs of λ have large biases for small R and its biases become small when R is 240 or larger. When R increases, the MOMEs have smaller standard errors. In general, the MLE has smaller SDs than the MOME. The estimates of μ and σ have similar properties as those of λ under the normal distribution. Table C.2 presents finite sample results of Model C.1 under the χ2 distribution. The MLE of λ has significant downward biases. The MOME of λ has large biases for small R, but small biases when R is 240 or larger. These results reveal that the MLE is sensitive to distributional assumptions, and that the MOME is distribution-free and has satisfactory performance for reasonably large R. Table C.3 presents finite sample results of various MLEs and MOMEs of Model C.2 under the normal distribution. As expected, the inconsistent MLEs that ignore group effects (IGE) or treat them as fixed effects (FEs) are biased in different directions, and the consistent MLEs (WG, BG and RE) have small biases for all R. The MOMEs of Model C.2 perform similar to those of Model C.1. The RE-MLE approach performs best in terms of SDs among the consistent estimators, and all the consistent MLEs perform better than the MOMEs.

    Table C.1. Model C.1 under N(0, 1) distribution
    R MLE MOM R MLE MOM
    λ Bias 60 −0.0043 14.9160 480 −0.0006 −0.0112
    SD 0.0432 263.4300 0.0144 0.0802
    μ Bias 0.0048 −6.1766 0.0008 0.0070
    SD 0.0659 109.8200 0.0245 0.0502
    σ Bias 0.0029 4.6755 0.0000 −0.0023
    SD 0.0481 79.8610 0.0192 0.0300
    λ Bias 120 −0.0023 −0.0717 960 −0.0001 −0.0020
    SD 0.0273 0.2855 0.0101 0.0559
    μ Bias −0.0008 0.0394 −0.0001 0.0009
    SD 0.0464 0.1767 0.0157 0.0339
    σ Bias −0.0021 −0.0135 −0.0002 −0.0003
    SD 0.0340 0.0665 0.0135 0.0194
    λ Bias 240 −0.0008 −0.0375 1920 −0.0007 −0.0059
    SD 0.0213 0.1685 0.0070 0.0390
    μ Bias 0.0016 0.0264 0.0004 0.0032
    SD 0.0361 0.1178 0.0114 0.0238
    σ Bias −0.0027 −0.0120 0.0001 −0.0012
    SD 0.0553 0.0693 0.0091 0.0135
    Table C.2. Model C.1 under standardized χ2(5) distribution
    R MLE MOM R MLE MOM
    λ Bias 60 −0.0898 −0.1833 480 −0.0815 −0.0228
    SD 0.0589 0.9894 0.0201 0.1039
    μ Bias 0.0537 0.1203 0.0546 0.0157
    SD 0.0777 0.6868 0.0269 0.0729
    σ Bias 0.0177 −0.0018 0.0206 −0.0028
    SD 0.0703 0.1270 0.0256 0.0315
    λ Bias 120 −0.0856 −0.0398 960 −0.0809 −0.0125
    SD 0.0407 0.3414 0.0142 0.0768
    μ Bias 0.0537 0.0254 0.0533 0.0079
    SD 0.0517 0.2451 0.0194 0.0537
    σ Bias 0.0188 −0.0013 0.0191 −0.0024
    SD 0.0501 0.0635 0.0170 0.0212
    λ Bias 240 −0.0826 −0.0177 1920 −0.0810 −0.0026
    SD 0.0283 0.1764 0.0097 0.0497
    μ Bias 0.0546 0.0118 0.0548 0.0026
    SD 0.0386 0.1229 0.0135 0.0340
    σ Bias −0.0208 −0.0002 0.0204 −0.0002
    SD 0.0350 0.0449 0.0128 0.0159
    Table C.3. Model C.2 under N(0, 1) distribution
    R IGE FE WG BG RE MOM
    λ Bias 120 0.2385 −0.7780 0.0057 −0.0309 −0.0085 0.3536
    SD 0.0199 0.0247 0.1367 0.1951 0.0935 3.2983
    μ Bias −0.1249 0.4874 0.0049 0.0260 0.0131 −0.1810
    SD 0.0579 0.2626 0.1377 0.1694 0.1251 2.0278
    σ Bias 0.0942 −0.1673 −0.0010 −0.0641 −0.0030 0.1069
    SD 0.0380 0.0357 0.0471 1.0738 0.0424 1.0339
    urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0777 Bias −0.1088 0.0011 0.8097
    SD 0.4993 0.2508 7.3440
    λ Bias 240 0.2391 −0.7753 0.0143 −0.0101 0.0015 0.0698
    SD 0.0135 0.0167 0.0955 0.1136 0.0666 0.4111
    μ Bias −0.1270 0.4676 −0.0083 0.0052 −0.0013 −0.0336
    SD 0.0399 0.1958 0.0925 0.0974 0.0809 0.2270
    σ Bias 0.0956 −0.1641 0.0051 −0.1537 0.0027 0.0216
    SD 0.0300 0.0279 0.0357 0.8178 0.0314 0.1052
    urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0778 Bias −0.0630 −0.0162 0.0258
    SD 0.3402 0.1794 0.7527
    λ Bias 480 0.2418 −0.7765 0.0049 −0.0036 −0.0016 0.0211
    SD 0.0096 0.0116 0.0647 0.0808 0.0483 0.3042
    μ Bias −0.1314 0.4669 −0.0047 −0.0001 −0.0011 −0.0133
    SD 0.0308 0.1407 0.0687 0.0755 0.0633 0.1745
    σ Bias 0.0925 −0.1668 −0.0010 −0.1795 −0.0022 0.0055
    SD 0.0199 0.0174 0.0227 0.7383 0.0206 0.0716
    urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0779 Bias −0.0331 0.0034 0.0085
    SD 0.2455 0.1304 0.6488
    λ Bias 960 0.2408 −0.7776 −0.0029 0.0037 −0.0043 0.0062
    SD 0.0064 0.0088 0.0480 0.0498 0.0331 0.1844
    μ Bias −0.1287 0.4765 0.0033 −0.0009 0.0040 −0.0011
    SD 0.0209 0.0986 0.0488 0.0485 0.0433 0.1059
    σ Bias 0.0928 −0.1679 −0.0017 −0.1993 −0.0019 0.0013
    SD 0.0136 0.0131 0.0177 0.6127 0.0152 0.0444
    urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0780 Bias −0.0190 0.0083 −0.0206
    SD 0.1521 0.0924 0.4518
    λ Bias 1920 0.2412 −0.7769 0.0014 0.0045 −0.0005 −0.0021
    SD 0.0048 0.0059 0.0326 0.0362 0.0230 0.1223
    μ Bias −0.1296 0.4685 −0.0017 −0.0035 −0.0006 0.0004
    SD 0.0145 0.0677 0.0335 0.0337 0.0296 0.0706
    σ Bias 0.0936 −0.1662 0.0006 −0.2441 0.0003 0.0002
    SD 0.0099 0.0089 0.0113 0.5209 0.0104 0.0284
    urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0781 Bias 0.0052 0.0012 −0.0061
    SD 0.1167 0.0645 0.3297

    APPENDIX D: RECURRENCE MOMENTS

    Lemma D.1.Let urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0782 be a random sample of size m and let urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0783 be the corresponding order statistics in the ascending order. Assume that the first moment of any one order statistics in a sample of size m exists. Then,

    urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0784

    This lemma follows directly from Balakrishnan and Sultan (1998, p. 155).

    Proof of Theorem 5.1.For analytical convenience, we assume that there are urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0785 different group sizes. Each group size has the same number of groups T, and R and T have the same order of magnitude, because urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0786 and mU is bounded. Denote urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0787 and urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0788, where urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0789 is the ith-order statistics in the tth subgroup with group size m for urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0790 and urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0791. The GMM estimation vector for the model 4.15 is

    urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0792
    By the Taylor expansion of urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0793 at δ20,
    urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0794
    where urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0795 is between urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0796 and δ20, and
    urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0797
    By the law of large numbers of i.i.d. random variables, urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0798 for bounded m. It follows that urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0799, where
    urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0800
    By Lyapunov's central limit theorem, urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0801, and
    urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0802
    where Δ11 is the variance of recurrence moments and Δ12 (and Δ21) is the covariance of the linear IV and recurrence moments. Under Assumption 5.2 that urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0803 exists and has full column rank urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0804, it follows that
    urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0805
    It is obvious that the above analysis can be applied to the general case that each group size has a different number of groups. urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0806

    1 Consistent estimation of Δ11 and Δ12

    We can use an initial estimate such as 4.22 to obtain consistent estimates of Δ11 and Δ12. Note that Δ11 depends only on λ0. The variance of a generic recurrence moment, urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0807, can be estimated by
    urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0808
    where urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0809 is an initial estimate of λ0. If two different recurrence moments have common group size, their covariance is non-zero and can be estimated with cross moments in a similar way.
    Let urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0810 be a generic disturbance vector of m rows and let urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0811 be the corresponding IV matrix. Under Assumption 5.1, the covariance of the linear IV moment urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0812 and a generic recurrence moment urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0813 for urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0814 is urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0815. Let urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0816 be the disturbance vector of the tth group of size m, and let urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0817 be the corresponding IV matrix. The above expectation can be estimated by
    urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0818
    where urn:x-wiley:13684221:media:ectj12031:ectj12031-math-0819 is the residual of an IV regression.

      The full text of this article hosted at iucr.org is unavailable due to technical difficulties.