Volume 41, Issue 6 pp. 1529-1548
ORIGINAL ARTICLE
Full Access

Heterogeneous immigrants, exports and foreign direct investment: The role of language skills

Matthias Lücke

Matthias Lücke

Institut fur Weltwirtschaft an der Universitat Kiel, Kiel, Germany

Search for more papers by this author
Tobias Stöhr

Tobias Stöhr

Institut fur Weltwirtschaft an der Universitat Kiel, Kiel, Germany

Search for more papers by this author
First published: 19 December 2017
Citations: 8

Abstract

We investigate the interplay of language skills and immigrant stocks in determining bilateral FDI outstocks of OECD reporting countries. Applying a Poisson panel estimator to 2004–11 data, we find robust evidence for a positive effect of bilateral immigrants on bilateral FDI-provided that residents of the two countries have few language skills in common. We find a similar effect for immigrants from third countries that speak the language(s) of the FDI host country, making them potential substitutes for bilateral migrants. Our findings suggest that immigrants facilitate outgoing FDI through their language skills, rather than through other characteristics like cultural familiarity.

1 MOTIVATION

One important feature of many international labour migrants is their transnationalism—that is, their familiarity with the cultures of both, their countries of origin and destination. Therefore, the presence of immigrants may facilitate international economic transactions, especially foreign direct investment and trade, by reducing international communication costs. Indeed, over the past two decades, many empirical studies have found a positive effect of immigrant stocks on foreign direct investment in the immigrants' countries of origin (Aubry, Kugler, & Rapoport, 2012; Javorcik, Özden, Spatareanu, & Neagu, 2011) as well as on bilateral trade (Gould, 1994; Head & Ries, 1998; Rauch & Trindade, 2002). However, migrants are not an homogeneous group. In addition to differences in education levels that are often studied, they are highly heterogeneous in terms of the languages they speak.

Most existing studies are based on a gravity model of bilateral trade or FDI where the bilateral migrant stock is added as an explanatory variable to reflect its possible dampening impact on the cost of international economic transactions. One concern with this approach is that migration, FDI and trade are driven in part by common, unobserved, country-pair-specific determinants. Many existing studies do not address the resulting omitted-variable bias; therefore, they probably overstate the positive effect of the migrant stock on bilateral outward FDI.

In this paper, we extend the empirical literature on the impact of immigrants on FDI in two directions. First, we estimate a gravity equation with panel data and country-pair-specific fixed effects because this approach has been shown to avoid omitted-variable bias (Parsons, 2012). This is a particularly demanding test of our underlying hypothesis. We address the impact of short-term, year-to-year variations in the number of immigrants on outward FDI, rather than the long-term effect that may be identified through cross-section analysis. While our short-term effect is economically significant in its own right, we also view it as a lower bound on the total migration effect, following the full social and economic integration of immigrants in the reporting country.

Second, we investigate how the impact of immigrants on bilateral outward FDI depends on the ease with which residents of the two countries can communicate, given their common language skills. “Transnational” immigrants are familiar with the cultures and languages of both countries. We hypothesise that their presence is associated with higher FDI particularly when only few residents of the two countries can otherwise communicate in a common language—be it the official language of either country or any other language. By contrast, if most residents of the two countries have a language in common, the presence of immigrants may not reduce communication costs much further. Empirically, we try to eliminate all variation except for changes within country pairs over time. We thus try to look well beyond the simple effect of migration or language and conduct a very demanding test of the correlation.

While our focus on the interaction between immigrants and common language skills at the country-pair level is novel (to the best of our knowledge), we draw on several strands of literature. First, irrespective of migrant stocks, many studies have found that a common language increases bilateral trade (Egger & Lassmann, 2012) or FDI (Selmier & Oh, 2013); recent work such as Egger and Lassmann (2014) presents robust evidence on the direction of causality from language to trade.

Melitz and Toubal (2014) develop a global language data set that includes not only the official language(s) of each country, but also any other language spoken by at least 4% of the population (such as English as a foreign language). The authors estimate a conventional gravity model for bilateral trade flows and find separate positive effects for migrant stocks and language variables (common official and spoken languages). They interpret this finding as a reflection of the role of migrants as translators and facilitators in international trade. This is similar to our hypothesis that migrants are more important when there is less knowledge of common languages.

Gould (1994) and others have argued informally that the language skills of immigrants explain the positive empirical association between the number of immigrants and bilateral exports. More generally, language skills are one example of a wide range of skills and characteristics of immigrants that may help to ease informational asymmetries and solve agency problems that are central to the proximity-concentration trade-off (Felbermayr & Jung, 2009; Portes & Rey, 2005; Rauch, 2001). Therefore, immigrants may facilitate various international economic transactions that involve an element of investment, including bilateral trade (because of the fixed cost of market entry), portfolio and direct investment. The trade and FDI facilitating roles of migrants may thus work through a language channel, but they may also be based on other skills or characteristics that are beyond the scope of this paper (e.g., culture).

Kleinert and Toubal (2010) derive a gravity equation for FDI from a heterogeneous firm model of international trade and FDI similar to Helpman, Melitz, and Yeaple (2004). The firm's decision to undertake FDI, rather than to export or to produce only for the domestic market, is driven by the fixed cost of exporting versus the fixed cost of setting up production abroad. It is natural to think that the presence of immigrants may reduce both kinds of fixed costs. On this basis, migrant stocks become part of a rigorously derived gravity equation for FDI.

We explain our approach to estimating the gravity equation for FDI in Section 2 below. In Section 3, we discuss data sources, particularly the use of the language matrices from Melitz and Toubal (2014). We present our econometric results in Section 4 and discuss robustness checks in Section 5 In Section 6, we summarise our findings and point out the implications for the role of migrants from developing countries in facilitating FDI from high-income countries in their countries of origin.

2 EMPIRICAL APPROACH

Several recent papers estimate a single-period cross-sectional gravity model of FDI with the stock of migrants included as an “exogenous” variable (most notably Leblang, 2010). However, this approach will yield biased estimates for at least two reasons: first, there is likely to be reverse causality between FDI and migrant stocks. Second, both FDI and migrant stocks are probably subject to unobserved heterogeneity. Unfortunately, there is no easy way to address these concerns: there are no known instruments for bilateral migration in an FDI or trade context with a valid exclusion restriction at the global level. Furthermore, the bias does not occur at the country level but at the country-pair level; for example, specific historical relations between two countries may be a reason for both high bilateral migration and high bilateral FDI flows. Therefore, cross-sectional country-level estimates are potentially severely biased.

Thus, our analysis of the impact of immigrant stocks on FDI involves a difficult trade-off between bias reduction and the time horizon of our estimates. We estimate a panel model with bilateral fixed effects to eliminate the likely omitted-variable bias from unobserved historical ties that may affect all economic interactions—bilateral migration, FDI and trade. Our bilateral fixed effects are time-invariant, which is a sensible assumption only over a relatively short time horizon. To reduce the scope for reverse causality even further, we use 1-year lagged immigrant stocks to explain FDI stocks. Thus, we estimate a correlation that can be seen as the short-term “effect” of the immigrant stock on outward FDI as precisely as possible; we consider this short-term effect a lower bound on the total, long-term effect that may be realised only as immigrants integrate in the reporting country economically and socially. We can see no way of estimating the long-term effect remotely as cleanly as the short-term effect and therefore do not attempt it.

When we choose our estimator, we take into account that bilateral trade and FDI data contain many zeros and are highly heteroscedastic (cf. Silva & Tenreyro, 2006, and follow-up papers). We use a fixed-effects Poisson model with cluster-robust standard errors (Stock & Watson, 2008; Wooldridge, 1999). This estimator requires only minimal assumptions, particularly that the conditional mean is correctly specified and that the independent variables are strictly exogenous.

The explanatory variables in our gravity model for bilateral FDI outstocks include log migrant stocks; time-variant gravity variables such as GDP, GDP per capita and free trade agreements; time-invariant bilateral fixed effects; and year fixed effects. Thus, the direct effect of all time-invariant factors is subsumed under the bilateral fixed effect. Instead of log-linearising the outcome variable, our estimator directly estimates the exponential regression function in its multiplicative form. Using the notation of Westerlund and Wilhelmsson (2011), we are interested in the simple gravity equation with bilateral fixed effects:
urn:x-wiley:03785920:media:twec12598:twec12598-math-0001(1)

Yit and Yjt contain time-variant, country-specific gravity variables for reporting and partner countries, respectively. Dijt contains variables such as the 1-year lagged bilateral log migrant stock which vary over time as well as across country pairs. θij denotes the time-invariant bilateral fixed effects.

We then directly estimate the exponential regression function using maximum likelihood with the fixed-effects conditional mean given by:
urn:x-wiley:03785920:media:twec12598:twec12598-math-0002(2)

With this approach, we neither need to truncate the dependent variable (which includes many zeros) nor subject it to an arbitrary non-linear transformation that could lead to severely biased and inconsistent results (cf. Silva & Tenreyro, 2006; Westerlund & Wilhelmsson, 2011).

3 DATA

3.1 Overview

Of the data required for our analysis, immigrant stocks represent the bottleneck. The widely used World Bank Bilateral Migration Database relies on census data and is decadal. This is insufficient for our analysis where we seek to eliminate unobserved heterogeneity at the country-pair level as much as possible and therefore rely on much of this unobserved heterogeneity being time-invariant. This assumption can be argued to hold with yearly data in our context but is unlikely to hold across decades. Therefore, we use yearly data from the OECD international migration database that provides good coverage of immigrants in OECD countries from all countries of the world (Arslan et al., 2014; Dumont & Lemaitre, 2004; Widmaier & Dumont, 2011).

Most reporting countries define immigrants either by country of birth or nationality; only a few report both definitions. Therefore, if we were to limit our analysis to data based on only one definition, we would exclude a substantial number of countries. In the interest of external validity, we include each country's immigrant stock according to the country's preferred definition. If both are available, such as in the case of Norway, we use the country of birth. Since we only rely on within-bilateral-tie variation and eliminate differences between countries, it seems unlikely that these differing definitions will affect our findings. However, we ensure that definitions and reporting of migration numbers do not change within bilateral ties since this would lead to purely artificial time-variation in migration numbers. Between 2002 and 2004, several OECD countries adjusted their definition of migrants in these statistics, for example, by cleaning up register data, regularising and thus suddenly counting irregular migrants or changing reporting standards. Therefore, we restrict our data to the years from 2004 that are unaffected by these changes. Otherwise our analysis would have been based partly on purely statistical or artificial fluctuations in the migration data, resulting in low comparability even within countries. Furthermore, we use variables from the Doing Business Report to account for the fact that the investment climate in some countries may have been so poor as to preclude most FDI except to exploit natural resources.

Our final data set includes all OECD countries except Luxembourg and Turkey. Data range from 2004 to 2011. In our Poisson panel estimations, where values for consecutive years are required, results are based on 26 host countries with an average of 82 partners each. Average bilateral data points per reporting country range from 3 to 5.6 years with an average of 3.9 years.

We match our immigrant stock data with FDI outstocks and outflows from the OECD International Direct Investment Statistics (OECD, 2014) and to UN Comtrade data on exports and imports. Our definition of FDI thus comprises vertical and horizontal FDI. Language data are from Melitz and Toubal (2014); we discuss the construction of our “common language scarcity” variable in the following subsection. We add standard gravity variables and take GDP and population data from the World Bank's World Development Indicators.

The final data set consists of 11,999 bilateral observations in 3,089 bilateral ties (Table 1). Naturally, FDI, exports and immigrant stocks are highly skewed with the mass of observations close to zero.

Table 1. Descriptive statistics for key variables as used in main specification
Variable Obs Mean Median SD Min Max
FDI outstock in mill. US$ 11,999 3,180 8.59 18,583 0 595,139
Exports in mill. US$ 11,999 2,207 74.5 9,981 0 280,710
Log bilateral immigrant stock 11,999 6.26 6.35 3.40 0.00 16.3
Scarcity of bilateral language ties (1 − αij) 11,999 0.78 0.91 0.27 0.00 1.00
(1 − αij) × Log bilateral immigrant stock 11,999 4.70 4.58 3.07 0.00 14.2
Log GDP of reporter 11,999 6.15 5.93 1.42 2.19 9.62
Log GDP of partner 11,999 10.4 10.5 0.48 8.77 11.5
Log GDP per capita of reporter 11,999 3.61 3.60 2.35 −3.84 8.90
Log GDP per capita of partner 11,999 8.33 8.40 1.56 4.68 11.4

Notes

  • For sources of variables see text. GDP in billion US$ and GDP per capita in US$1,000 before taking logs.

3.2 Language ties and the bridging role of migrants

We hypothesise that the scarcer are the language skills needed to do business with another country, the more valuable is the presence of immigrants from that country. As a first test, we allow the coefficient of the bilateral immigrant stock in our gravity model to vary with the share of individuals in the two countries who have no spoken language in common. Thus, the bilateral immigrant stock appears among our explanatory variables not only on its own, but also interacted with our “common language scarcity” variable.

We calculate “common language scarcity” from the “common spoken language” variable in Melitz and Toubal (2014). For every country pair, Melitz and Toubal estimate the chance that two randomly selected individuals, one from each country, have a language in common. They compile a global database of all 42 languages that are spoken by at least 4% of the population in at least two countries. For every country i, they report the share Lli of the population who speak language l. Then, for every country pair (ij) and language l = 1, …, L, they calculate the chance that two randomly selected residents both speak language l (LliLlj). They sum this term over all languages l for each country pair to yield an estimate of the chance that two randomly selected residents have a language in common: αij = ∑lLliLlj. Finally, they standardise their common spoken language variable to lie between 0 and 1.

For our analysis, we are interested in the share of residents of the two countries who cannot communicate in any language. Therefore, we use (1 − αij) as a proxy for the scarcity of language ties. With standardisation, our common language scarcity variable takes a value of 1 for two countries whose residents have no language skills in common whatsoever. By contrast, when all residents can communicate with one another (say, because two countries share the same official language and this is spoken by everyone), the language scarcity variable takes a value of 0. When we estimate our gravity equation for bilateral FDI outstocks, we expect that common language scarcity interacted with the (lagged) immigrant stock will have a positive effect on outgoing FDI. Within the short time span of the panel, there is hardly any change in common languages through other channels than migration. The time-invariant effect of a shared language between two countries is eliminated by the bilateral fixed effect.

In our data, “common language scarcity” is distributed between 0 and 1 and provides sufficient common support (Figure A1 in Appendix S1) to make inferences on a cross-country basis. The average is .78, and the median is .91. As one might expect, English language skills are the most important contributor to bilateral communication.

We also take a step further and investigate whether immigrants from third countries with similar language skills act as substitutes for immigrants from the FDI host country. Take the example of United Kingdom FDI in Mozambique. Residents from the two countries have almost no language skills in common (common language scarcity is close to 1), and there are few immigrants from Mozambique in the UK. However, maybe immigrants in the UK from other Portuguese-speaking countries such as Brazil or Portugal facilitate UK FDI in Mozambique in a similar fashion—provided that language skills, rather than other factors like familiarity with local institutions and culture, are at the heart of the link between immigrants and outward FDI.

To capture the possible effect of third-country immigrants in reporting country i who speak one of the languages of FDI host country j (“same language migrants”), we estimate their number based on the languages spoken in the immigrants' countries of origin:
urn:x-wiley:03785920:media:twec12598:twec12598-math-0003(3)
We include “same language migrants” in the explanatory variables of our gravity model in parallel with bilateral immigrants.

4 RESULTS

As expected, our decision to estimate the gravity model from panel data with bilateral fixed effects constitutes a demanding test of our hypotheses (Table 2). To facilitate comparison with other literature, we begin by estimating our basic gravity equation for both exports and FDI outstocks with bilateral migrant stocks, but without interaction terms, using three different specifications: the widely used PPML estimator with reporter, partner and year fixed effects (columns (1) and (2)); our preferred panel Poisson set-up with bilateral and year fixed effects and gravity variables (detailed in table notes) that will be used in the rest of the paper on different samples (columns (3)–(8)); a robustness check combining bilateral, reporter-year, and partner-year fixed effects (columns (9) and (10)). In addition, we provide evidence of our results' robustness using alternative language data (columns (11) and (12)).

Table 2. Migrants, exports and FDI—alternative estimators and fixed effects
Panel A: Estimates without conditioning on unrestricted sample
(1) (2) (3) (4) (5) (6)
Estimator PPML Panel Poisson with FE
Dependent variable Exports FDI Exports FDI Exports FDI
Lagged log (bilateral migrants) .207 .269 .051 −.059 .003 −.460
(.011) (.019) (.027) (.060) (.082) (.237)
Lagged log (bilateral migrants) × common language scarcity .066 .526
(.107) (.292)
Gravity variables
Reporter & Partner FE
Bilateral FE
Year FE
Used observations 11,999 11,973 11,616 7,442 11,616 7,442
Panel B: Preferred panel Poisson fixed-effects specifications on better quality migration data
(7) (8) (9) (10) (11) (12)
Dependent variable Exports FDI Exports FDI Exports FDI
Subsample All All All All All available
Lagged log (bilateral migrants) .068 −.522 −.050 −.739 .102 −.040
(.090) (.214) (.090) (.224) (.030) (.107)
Lagged log (bilateral migrants) × common language scarcity .038 .799 .090 1.022
(.125) (.312) (.104) (.292)
Lagged log (bilateral migrants) × LangF −.035 .238
(.041) (.020)
Gravity variables
Bilateral FE
Year FE
Reporter-year and partner-year FE
Non-zero observations 7,432 4,733 7,432 4,733 5,426 4,126
Number of bilateral non-zero observations 1,960 1,204 1,960 1,204 1,383 1,014
Panel C: Estimates using panel Poisson with fixed effects and language scarcity by ease of doing business
(13) (14) (15) (16) (17) (18)
Dependent variable Exports Exports FDI FDI FDI FDI
Subsample Time to import Property registration time
<p(50) p(50) <p(50) p(50) <p(50) p(50)
Lagged log (bilateral migrants) .007 .704 −.474 .576 −.514 −.465
(.089) (.387) (.215) (.643) (.227) (.481)
Lagged log (bilateral migrants) × common language scarcity .113 −.731 .700 −.180 .865 .922
(.119) (.450) (.296) (.779) (.343) (.650)
Gravity variables
Bilateral FE
Year FE
Non-zero observations 3,580 3,691 2,902 1,717 2,657 1,984
Number of bilateral non-zero observations 1,002 1,074 772 520 696 567

Notes

  • Standard errors in parentheses. Panel A uses any available data, Panel B and Panel C work with data that excludes large artificial jumps in the migration data that occurred in 2002–04 due to definitorial changes, amnesties, etc. These would otherwise create artificial within-bilateral-tie variation biasing the estimates. Number of non-zero observations and non-zero bilateral ties reported for the respective estimation. Gravity variables are reporter's and partner's respective log GDP and log GDP per capita, common legal system, common currency, regional trade agreement, log distance, contiguity, colony, common coloniser, common official language and common spoken language. Time-invariant ones dropped automatically. The median of scarcity is .91. A scarcity value of .2 corresponds to its 8th percentile. Time to import and time to register property in panel C are measured in days. Columns (11) and (12) report estimates not based on the common spoken language variable, but the alternative LangF variable of Dow and Karunaratna (2006), which indicates differences in languages between countries. The index incorporates linguistic similarity as well as how frequently languages are spoken in reporter and partner country. R2 could only be calculated for columns (1) and (2) (.945, .882) and columns (9) and (10) (.999, .996).
  • ***p < .01, **p < .05, *p < .1.

Under PPML without bilateral fixed effects, the bilateral immigrant stock has a statistically significant and positive coefficient for both exports and FDI as the literature suggests (columns (1) and (2)). With our preferred panel Poisson estimator, which relies only on variation over time and also eliminates bilateral tie-specific time-invariant differences, we find a positive significant coefficient for bilateral immigrant stocks only for exports (column (3)), but not FDI (column (4)). These results suggest that the positive association between immigrant stocks and FDI that is typically found in conventionally estimated gravity models (similar to columns (1) and (2)) may be severely upward-biased.

We analyse our main hypothesis by allowing the impact of the bilateral immigrant stock on FDI to vary with our “common language scarcity” variable in columns (5)–(8). The coefficient of the corresponding interaction term is significantly positive for FDI (columns (6), (8) and (10)), but not for exports (columns (5), (7) and (9)). While Panel A includes the full available migration data from 2000 to 11 including the changes in definition mentioned in the data section, in Panel B we restrict our working sample to the more reliable migration data from 2004 onwards. The stylised results are very similar, and results become statistically more significant when excluding low-quality data. The results in Panel B are therefore our preferred estimates. The stylised finding holds both for the specification with bilateral and year effects and the very rigid specification with bilateral, reporter-year and partner-year fixed effects in columns (9) and (10). As hypothesised, bilateral immigrants facilitate FDI the more, the scarcer are the common language skills between the two countries.

By contrast, although we have earlier found that exports are facilitated by bilateral immigrants, the size of this effect does not significantly depend on common language scarcity. Therefore, in columns (5), (7) and (9), this effect is split into two components that are both positive, yet statistically insignificant, or close to zero and add approximately to the single coefficient in column 3 when applied to countries with average or median language scarcity. This pattern indicates that the correlation between migration and exports does not vary across the range of the interacted variable. This finding is support by the specification with an alternative language scarcity proxy from Dow and Karunaratna (column (11)).

The theoretical argument for the role of migrants in facilitating exports and FDI relate to variable and fixed trade/FDI costs. However, the scope for reducing these costs by employing migrants is not unlimited because these costs are largely driven by a country's business climate. In Table 2, Panel C we therefore test the role of the cost of doing business by splitting the sample along the median value of export-/FDI-receiving countries for two indicators frequently used to study the conditions for international trade and FDI: time to import and property registration time. The results suggest that, while there is no clear pattern for exports, the statistically significant link between FDI, migration and language scarcity only exists if business conditions are not too difficult. By contrast, in countries with a poor business climate, the related administrative costs apparently limit FDI irrespective of how migrants might otherwise contribute to lowering communication and other international transaction costs.

The combination of a straightforward positive effect of bilateral immigrants on exports and a more nuanced effect on FDI can be motivated in the framework of a heterogeneous firm model (e.g. Helpman et al., 2004). Exporting means adapting domestically produced goods to foreign preferences and marketing these goods in an unfamiliar environment. It seems plausible that greater access to immigrants as potential employees with extensive cultural knowledge of both countries may reduce the cost of exporting even when communication is not impeded by the scarcity of common languages.

By contrast, both vertical and horizontal FDI implies reporting country staff managing substantial part of the activities that are carried out by the local staff in the FDI host country. This is very communication-intensive and may be infeasible if few in the FDI host country speak the sending country's language or a substitute such as English. Bilateral immigrants may thus provide highly specialised communication tasks in the FDI sending country without which there might be less scope for FDI to their country of origin. However, since most other tasks in production and marketing are carried in the FDI host country—after all, this is why firms decide to use FDI rather than exports—there may be little other use for immigrants' origin country-specific skills left.

Should this pattern hold in the data, we would expect bilateral migrants not only to matter more for FDI if there is little language overlap in general but also to be more positively associated with FDI in receiving country where there is little knowledge of English, the world's business language. In Table 3, we analyse whether bilateral migrants are indeed more strongly associated with FDI the lower the share of English speakers. Indeed, point estimates of the corresponding coefficient are substantially higher if we restrict the estimation to, for example, partner countries with fewer than 10% English speakers. For exports, there is no such pattern, suggesting that common language skills or English as a substitute is not the main driver of the correlation between bilateral immigrants and exports.

Table 3. Migrants, exports and FDI—By share of population speaking English in partner country
(1) (2) (3) (4) (5) (6) (7) (8) (9)
Share of English speakers in partner country <.1 <.2 <.3 <.4 <.5 <.6 <.7 <.8 <.9
Dependent variable: FDI
Lagged log(bilateral migrants) .438 .377 .140 .158 .159 .146 .158 .149 −.087
(.237) (.194) (.095) (.084) (.083) (.081) (.079) (.078) (.110)
Non-zero observations 1,898 2,250 2,928 3,302 3,559 3,818 3,882 3,967 4,358
Number of non-zero bilateral ties 526 616 770 854 914 975 989 1,015 1,114
Dependent variable: Exports
Lagged log(bilateral migrants) −.068 .002 .049 .062 .063 .071 .073 .074 .090
(.069) (.080) (.044) (.037) (.037) (.034) (.033) (.033) (.031)
Non-zero observations 3,738 4,224 4,976 5,436 5,803 6,142 6,210 6,315 6,857
Number of non-zero bilateral ties 1,031 1,161 1,336 1,446 1,534 1,619 1,635 1,667 1,814

Notes

  • Standard errors in parentheses. All estimations specified as in Table 2, columns (7) and (8) using bilateral fixed effects and year fixed effects. Number of non-zero observations and non-zero bilateral ties reported in each case.
  • ***p < .01, **p < .05, *p < .1.

Our estimates imply a negative effect of immigrants on FDI unless common languages are scarce (negative coefficient for the direct effect of the lagged immigrant stock in Table 2, column (8)). This result is probably an artefact due to our linear specification of the variable coefficient model, combined with the fact that about 75% of observations on “common language scarcity” lie between .75 and 1 (Figure 1). If the mass of observations lies in an area where the true overall effect increases and there is little statistical weight close to the origin, a linearity assumption on the effect can imply that the estimate undershoots the true effect of 0 to indicate negative values. Within the relevant range of values of common language scarcity, the composite effect of immigrants on FDI is predicted to be positive: given the baseline coefficient estimates (Table 2, column (8)), the composite effect of the immigrant stock on FDI exceeds 0 if common language scarcity exceeds .65.

Details are in the caption following the image
Cumulative density function of language scarcity (1 − αij) [Colour figure can be viewed at wileyonlinelibrary.com]

To formalise this argument, we test alternative specifications of our gravity model that only enter the bilateral immigrant stock weighted by common language scarcity if the latter exceeds a specific threshold. This lifts the linearity assumption on the relationship between language scarcity and the log migrant stock. Thus, the hypothesis that bilateral migrants only become relevant for outgoing FDI as common language skills become scarce can be tested using a single migration-related variable. The hypothesised relationship between immigrants and FDI is nil up to the respective threshold and linear thereafter. This way, the function has no jumps and the estimated coefficients is the predicted effect for a bilateral tie without any common language. Figure 2 shows the estimated coefficient and 95% confidence interval for each threshold p ∈ [0, 1), which is shown on the vertical axis. For thresholds above .65, there is strong evidence of a statistically significant, linear relationship. This includes the mass of our observations on common language scarcity and is consistent with our main estimate. We are confident therefore that the negative direct effect of the immigrant stock on FDI is merely a technical consequence of our linear variable coefficient model and does not indicate that immigrants actually reduce FDI for low values of common language scarcity (see also Footnote 2).

Details are in the caption following the image
Point estimates and 95% confidence intervals for regressions that use urn:x-wiley:03785920:media:twec12598:twec12598-math-0005 above a threshold p ∈ [0, 1) as the only migration-related covariate [Colour figure can be viewed at wileyonlinelibrary.com]

Our estimates suggest that the impact of the bilateral immigrant stock on outward FDI is sizable: for a country pair with median common language scarcity at .91 (cf. Table 1), the composite effect is .21 for the full sample: approximately, a 1% increase in the bilateral immigrant stock implies a 0.21% increase in the FDI outstock. When residents in the two countries share almost no language skills (common language scarcity = .99), the corresponding increase in FDI is 0.27%.

Now we turn to our second language-related hypothesis regarding the impact of immigrants on outward FDI: FDI may be affected by immigrants from third countries who speak the same language(s) as FDI host country residents (our “same language migrants”—Section 3). It would be natural to add “same language migrants” to our gravity model exactly in parallel with bilateral migrants, that is, both on its own and interacted with common language scarcity. However, there is too little meaningful variation left when all four migrant-related variables are included. With an average of only five data points for each country pair and many zeros, our panel data do not permit reliable within-estimates for so many regressors that vary over time and across country pairs.

Therefore, we include “same language migrants” in our gravity equation under three alternative specifications: (i) for the whole sample (Table 4, columns (1) and (2)); (ii) only for country pairs with common language scarcity above .9 (columns (3) and (4)); and (iii) only for country pairs with common language scarcity above .95 (columns (5) and (6)). For FDI, the corresponding coefficient increases along with the threshold on common language scarcity (columns (2), (4) and (6)) and becomes significant at the .95 threshold; the latter specification still covers 46% of all observations. The effect from “same language migrants” is large, with a coefficient of .78 for the .95 threshold (column (6)); it comes on top of the broadly unchanged effect for bilateral immigrants—a composite effect of .18 when common language scarcity is set to .95 which is close its median value. Thus, for country pairs with few common language skills, third-country immigrants with the “right” languages may act as substitutes for bilateral immigrants. This substitutive relationship also remains statistically significant when we use a kinked functional form similar to the one discussed earlier to model the scarcity threshold (cf. Figure 2).

Table 4. Migrants, exports and FDI—third-country migrants interacted with common language scarcity
(1) (2) (3) (4) (5) (6)
Dependent variable Exports FDI Exports FDI Exports FDI
Lagged log (bilateral migrants) .084 −.528 .033 −.478 .059 −.475
(.091) (.211) (.090) (.212) (.090) (.208)
Lagged log (bilateral migrants) × common language scarcity .044 .785 .102 .691 .055 .693
(.117) (.301) (.118) (.297) (.118) (.284)
Lagged log (same language migrants) −.055 .084
(.050) (.219)
Lagged log (same language migrants) with common language scarcity >.9 −.164 .388
(.076) (.297)
Lagged log (same language migrants) with common language scarcity >.95 −.069 .780
(.082) (.300)
Gravity variables
Bilateral FE
Year FE
Non-zero observations 7,432 4,733 7,432 4,733 7,432 4,733
Number of non-zero bilateral ties 1,960 1,204 1,960 1,204 1,960 1,204

Notes

  • Standard errors in parentheses. Number of non-zero observations and non-zero bilateral ties reported for the respective estimation. The scarcity of language ties is captured as 1 − αij (1 − common spoken language). Gravity variables are reporter's and partner's respective log GDP and log GDP per capita, common legal system, common currency, regional trade agreement, log distance, contiguity, colony, common coloniser, common official language and common spoken language.
  • ***p < .01, **p < .05, *p < .1.

It is remarkable that we find a robust effect on FDI exactly for those third-country immigrants that have the “right” language skills to communicate with FDI host country residents. It is nevertheless conceivable that factors other than language skills may enable migrants that do not come from the FDI host country to nevertheless facilitate FDI. One possible such factor is cultural familiarity, which is often proxied by geographical proximity. We test this hypothesis by including immigrants from the FDI host country's neighbours in our gravity equation. It turns out that there is no significant effect from “neighbourhood” immigrants on bilateral FDI in general (Table 5, columns (1) and (2)). However, if neighbouring countries have a large language overlap with the FDI host country (e.g., Chile and Peru) and if common language skills between FDI origin and host country are also scarce, there is a small but significant positive correlation between immigrants from neighbouring countries and bilateral outward FDI (Table 5, columns 4 and 6). Once again, the coefficients for bilateral immigrants are almost unchanged from our basic results (Table 2).

Table 5. Migrants, exports and FDI—migrants from the partner country's neighbours
(1) (2) (3) (4) (5) (6)
Dependent variable Exports FDI Exports FDI Exports FDI
Lagged log (bilateral migrants) .072 −.527 .045 −.401 .069 −.503
(.090) (.216) (.100) (.193) (.092) (.211)
Lagged log (bilateral migrants) × common language scarcity .036 .799 .069 .628 .041 .754
(.119) (.293) (.131) (.280) (.122) (.286)
Lagged log (migrants from partner country's neighbours) −.004 .017 −.005 .014 −.002 −.019
(.006) (.014) (.008) (.012) (.008) (.030)
Lagged log (migrants from partner country's neighbours (csl-weighted)) .040 −.299
(.077) (.179)
Lagged log (migrants from partner country's neighbours (csl-weighted)) common language scarcity −.044 .356 −.004 .061
(.083) (.188) (.013) (.033)
Gravity variables
Bilateral FE
Year FE
Non-zero observations 7,432 4,733 5,868 3,945 5,868 3,945
Number of non-zero bilateral ties 1,960 1,204 1,510 990 1,510 990

Notes

  • Standard errors in parentheses. Number of non-zero observations and non-zero bilateral ties reported for the respective estimation. The scarcity of language ties is captured as 1 − αij (1 − common spoken language). Gravity variables are reporter's and partner's respective log GDP and log GDP per capita, common legal system, common currency, regional trade agreement, log distance, contiguity, colony, common coloniser, common official language and common spoken language. Time-invariant ones dropped. The neighbourhood of a country of origin is defined as adjacent countries. The note “csl-weighted” indicates that migrants from neighbouring countries m = 1, …, N are multiplied by the common language αim they have with the respective adjacent country of origin.
  • ***p < .01, **p < .05, *p < .1.

So far, we have assumed a double-log-linear relationship between immigrants and outward FDI. We now explore whether FDI is subject to decreasing returns to immigrants (Table 6, columns (1)–(4)). First, we exclude the 25% largest bilateral immigrant stocks from our sample because we expect immigrants to have a larger effect in less developed migration corridors (columns (1) and (2)). Second, we exclude the favourite corridor of each country of origin from the sample (columns (3) and (4)). Under both specifications, the composite effect of the bilateral immigrant stock on outward FDI at median “common language scarcity” is substantially smaller (at .08 and .10, respectively) than in our basic regression (.21; Table 2). In the presence of decreasing returns, we would expect the composite effect to be larger, rather than smaller, than for the full sample. Hence, our estimates provide no evidence of decreasing returns in the relationship between bilateral immigrants and outward FDI.

Table 6. Diminishing returns and high-skilled migrants
(1) (2) (3) (4) (5) (6) (7) (8)
Dependent variable Exports FDI Exports FDI Exports FDI Exports FDI
Subsample <p(75) mig stock Without favourite destination With migrants skill data Only best data
Lagged log (bilateral migrants) .406 −1.267 .071 −.603 .275 −.238
(.171) (.579) (.097) (.232) (.099) (.271)
Lagged log (bilateral migrants) × common language scarcity −.383 1.463 .028 .767 −.221 .681
(.216) (.681) (.126) (.312) (.126) (.352)
Lagged log (low and medium-skilled bilateral migrants) .073 .026
(.060) (.210)
Lagged log (high-skilled bilateral migrants) −.167 −1.055
(.250) (.536)
Lagged log (high-skilled bilateral migrants) × common language scarcity .480 1.486
(.327) (.757)
Gravity variables
Bilateral FE
Year FE
Non-zero observations 4,771 2,468 6,933 4,292 5,430 3,731 5,829 3,817
Number of non-zero bilateral 1,311 658 1,835 1,095 1,376 903 1,571 984

Notes

  • Standard errors in parentheses. Number of non-zero observations and non-zero bilateral ties reported for the respective estimation. Gravity variables are reporter's and partner's respective log GDP and log GDP per capita, common legal system, common currency, regional trade agreement, log distance, contiguity, colony, common coloniser, common official language and common spoken language. In columns (1) and (2) the bilateral ties with the 25% largest migration stocks are dropped. In columns (3) and (4), the favourite destination per country of origin is dropped. In column (5) and (6) the sample size dropped because of missing data on skills of bilateral migrants. In columns (7) and (8), only countries which provide migration data from population registers or residence permits are included. Countries which use surveys to estimate immigrant numbers are excluded.
  • ***p < .01, **p < .05, *p < .1.

Finally, we use the limited data on bilateral migrant stocks broken down by skill level from DIOC 2000 to distinguish between high-skilled and other migrants (Table 6, columns (5) and (6)). In line with much of the immigration literature, we defined high-skilled migrants as those with ISCED levels 5 or 6, that is, those with at least some tertiary education. This is a considerable simplification, because the effect of migration on FDI is more likely to be tied to occupations or tasks than to education levels. With this caveat in mind, our findings suggest that low to medium-skilled immigrants have no significant effect on outward FDI, whereas high-skilled immigrants have a higher composite impact (.31) than total bilateral immigrant stocks according to our baseline results. However, without proper annual data on immigrant stocks by skill levels, it is impossible to corroborate this hypothesis further.

5 FURTHER ROBUSTNESS CHECKS

In most of our reporting countries, annual immigrant stock data come from administrative sources such as population registers or residence permits. A few countries, including the United States and the United Kingdom, rely on census data or even surveys such as the Current Population Survey or the Labour Force Survey. Survey-based data may be unreliable for small groups of immigrants (defined by country of origin); hence, regression estimates based on within variation in such data may be problematic. On the other hand, census data as such tend to be fairly precise, but the gaps in between censuses are sometimes filled by national statistical offices using procedures that are not fully reported. Therefore, to assess the robustness of our findings to different sources of migration data, we restrict our sample to the best available administrative data (Table 6, columns (7) and (8)). The composite effect of the bilateral immigrant stock on outward FDI is now larger (at .39) than in our baseline regression as could be expected if measurement error was present. The direct effect of the immigrant stock is still negative, but no longer significant. We consider these estimates as support for the robustness of our baseline results. We also stick with our baseline results because we do not wish to exclude several important migrant destination countries from our sample, as we would have to do in order to use administrative data only.

One important requirement for a causal interpretation of our results is the exogeneity of immigrant stocks, conditional on the covariates and fixed effects. This is potentially problematic because changes in FDI outstocks could be correlated with changes in immigrant stocks due to some underlying factor or due to reverse causality. This concern can be assessed through placebo regressions that rearrange the chronological order of events in our presumed causal chain. If correlations remain similar in a framework without lags or a specification with leads, this indicates simultaneity bias, reverse causality or severe autocorrelation. Reassuringly, when we replace the lags of the immigrant stock and interaction variables with leads, our gravity regression does not produce the same significant effects of immigrants on FDI. Likewise, when we leave out the lag structure altogether, we obtain only statistically insignificant estimates. These findings support our hypothesis that the direction of causality runs from immigrants to FDI because we find that the positive influence of immigrants on FDI takes some time to take effect. Our findings also imply that immigration is not caused by outward FDI. Furthermore, we confirm that immigration is not significantly correlated with FDI in-stocks; thus, immigrants do not come to OECD countries in tandem with, following, or leading inward investment from their origin countries. In spite of this tentative evidence, our identification strategy only permits conclusions about correlations, not causal relationships.

Our analysis might also be subject to a systematic bias if past bilateral immigrants cause current immigration due to network or similar effects (Munshi, 2003). If this led to autocorrelation in the inflow of migrants that is correlated with time-invariant factors, this would have the potential to spill over into the within variation on which we rely for our estimates with bilateral fixed effects. We therefore show through additional tests that the growth of the log immigrant stock is not higher when there is a larger immigrant presence—as a network effect would imply (Figure A2 in Appendix S1). Therefore, Card-style shift-share instrumental variables lack instrumental relevance in this study—in contrast to destination-specific studies where they can work well (cf. Peri & Requena-Silvente, 2010). Also we would find it unconvincing to use the exogenous part of variation in push factors because many push factors simultaneously decrease the attractiveness for incoming FDI (e.g., instability) and factors that shift the importance of given levels of push factors (e.g., migration policy in Mayda, 2009) are often determined specifically with the aim of improving economic ties between countries. There is also no evidence of significantly more bilateral immigration based on common spoken languages in the short run (Figure A3 in Appendix S1). Furthermore, the results are robust to the exclusion of longstanding bilateral migration ties, based on per capita bilateral immigrant stocks in 1960 that are taken from the “Worldbank Bilateral Migration Database 1960–2000” (Özden, Parsons, Schiff, & Walmsley, 2011).

We have tested extensively whether the immigrant stock—language scarcity interaction might reflect any other correlated factor. In Table A1 in Appendix S1, we test whether similar patterns to language scarcity can be found for several gravity and language-related variables, including distance between the two countries, past colonial ties, common official languages and language proximity (a proxy of how much overlap there is between two languages in linguistic terms, yielding for instance a high value for Dutch and English). If we had found a pattern similar to that of migration and language scarcity, it might have indicated a high level of collinearity between language scarcity and the respective other variable, which would substantially weaken our main results regarding migrants and scarce languages. Reassuringly, we do not find any similar pattern to that for common language scarcity among the four variables that we have tested. Furthermore, we show in Table A2 of Appendix S1 that including these additional interaction terms does not substantially affect our baseline estimates.

Our results continue to hold when we exclude country pairs with common borders from our sample. In this case, the estimated coefficient on the immigrant stock—language scarcity interaction term is (insignificantly) smaller than in the full sample. It appears (plausibly) that contiguous countries whose residents cannot communicate easily benefit substantially from the presence of immigrants with respect to their FDI.

Furthermore, our estimates are robust to excluding all zero bilateral migrant stocks or the smallest quarter of bilateral migrant stocks. We can also control for inflows of asylum seekers without affecting our results. Finally, our results are robust to using language shares that have been cleaned of bilateral immigrants.

6 CONCLUSION

We conduct a demanding test of the hypothesis that immigrants facilitate FDI from their country of destination to their country of origin. We use panel data for OECD reporting countries with bilateral and year fixed effects to estimate a gravity model for bilateral FDI outstocks in OECD and other host countries. We find a robust positive impact of bilateral immigrants on FDI only if residents of the two countries have few language skills in common. While earlier studies have concluded that immigrants may facilitate FDI, our finding suggests that immigrants' language skills play a crucial role in mediating this effect.

Quantitatively, we find that a 1% increase in bilateral immigrants in a country pair with median “common language scarcity” implies a 0.21% higher FDI outstock according to our baseline estimates. When residents in the two countries share almost no language skills (common language scarcity = .99), the FDI outstock increases by 0.27%. However, if FDI host countries' business climate is too adverse, we do not find evidence of a positive relationship. In this case, other constraints to FDI thus seem to outweigh the possible positive effect of migrants.

Our conclusion receives strong support from an additional set of regressions that focus on those third-country immigrants in the OECD reporting country that have the same language skills as FDI host country residents. Through several specifications, we find that this group has a positive effect on bilateral FDI similar to bilateral immigrants. To further corroborate the key role of language skills, we play devil's advocate and search for a similar effect through other third-country immigrants. We focus on those from the FDI host country's neighbours because geographic proximity is a good proxy for cultural familiarity, which may help to facilitate FDI. However, we fail to find a positive effect on bilateral FDI outstocks through this group. However, these robustness checks cannot definitely rule out that part of the association between FDI, migration and language is caused by cultural similarity or knowledge of the country of origin. Future research at the microlevel may be able to disentangle these effects more conclusively.

Our finding has a particular bearing for FDI from high-income to medium and low-income countries. In pairs of rich countries, residents can often communicate either in English or in the countries' official languages, obviating the need for immigrants to facilitate FDI. By contrast, for most country pairs with one rich and one developing country, common language skills are scarce. For these country pairs, immigrants in the rich country with the “right” language skills have an important role to play in facilitating FDI. Promoting foreign (as well as domestic) investment is often thought of as an important development strategy. Our findings suggest that developing countries, as well as aid donors, should seek to link their investment promotion and diaspora policies.

ACKNOWLEDGEMENTS

Financial support from Volkswagen Foundation Project “Europe's Global Linkages and the Impact of the Financial Crisis: Policies for Sustainable Trade, Capital Flows, and Migration” is acknowledged. We would like to thank participants in the Lunchtime Seminar in International Economics in Kiel as well and the 2015 SMYE, AEL and VfS conferences for helpful comments, and in particular Toman Barsbai, Holger Görg, Gordon Hanson, Nicolas Keller, Magnus Lodefalk, Thierry Mayer, Rainer Thiele, the editor and two anonymous referees. The usual disclaimer applies.

    Notes

  1. 1 Kugler, Rapoport, and Levintal (2013) report a similar positive effect on bilateral financial flows for their common official language dummy.
  2. 2 In related work, Oldenski (2012) shows that communication-intensive industries are more likely to market their products abroad through FDI than exports. This finding points to the importance of communication with customers, adjusting products to local preferences, etc. in successfully undertaking FDI. It is plausible to think that “transnational” immigrants help with these crucial tasks.
  3. 3 Aubry et al. (2012) follow a similar approach but use the ratio of FDI stocks (as a proxy for FDI sales) to exports as their dependent variable, as pioneered by Helpman, Melitz, and Rubinstein (2008). This approach has been criticised by Santos Silva and Tenreyro (2015) for requiring unrealistically strong normality and homoscedasticity assumptions.
  4. 4 These assumptions are discussed in the robustness section below. A Monte Carlo study is provided by Westerlund and Wilhelmsson (2011). Note that the estimator should not be confused with either the negative binomial MLE or the panel qMLE.
  5. 5 Doing Business, The World Bank (http://www.doingbusiness.org).
  6. 6 Luxembourg is missing because the language scarcity variable (see below) is only available for “Belgium and Luxembourg” combined. Inspecting the data and comparing independent estimates of the share of Dutch, French and German speakers in Belgium and Luxembourg, we decided to treat the “Belgium and Luxembourg” values as representative of Belgium, but not Luxembourg. In the case of Turkey, migration data exist only for a single year, making panel estimation impossible.
  7. 7 The reductions in years per bilateral tie is due to incomplete migration data for some countries.
  8. 8 https://comtrade.un.org/db/default.aspx.
  9. 9 http://data.worldbank.org/data-catalog/world-development-indicators.
  10. 10 Here we follow Ginsburgh and Prieto-Rodriguez (2011) in our reasoning.
  11. 11 This is an overestimate of the true language overlap to the extent that some random pairs of residents may be able to communicate in more than one language.
  12. 12 The sign of the correlation between migration and outward FDI in the restrictive panel Poisson fixed-effects estimation heavily depends on the inclusion of cases with very high common language values. While in the full sample, there is no statistically significant correlation between migration and FDI (column (4)), when excluding the <10% of the sample where more than 80% speak a common language, the coefficient on FDI is very similar to the PPML setting and statistically significant. For exports, conditioning on language scarcity does not affect the coefficient in a similar way. These findings suggest that the contribution of migrants to bilateral exports may work less through their language skills than in the case of FDI.
  13. 13 This proxy is only available for a subsample of countries. The stylised result also holds with this subsample in the other specifications of Panel B.
  14. 14 Since sample sizes are substantially lower, standard errors are larger.
  15. 15 For interpretation, we standardise the new covariate depending on threshold p ∈ [0, 1] as:
    urn:x-wiley:03785920:media:twec12598:twec12598-math-0004
  16. 16 The negative effect on exports, which is significant in column (3), may simply reflect more third-country migrants making exports to their home country more profitable.
  17. 17 We assume that the skill ratio observed in 2000 is representative of our whole period of observation (2004–11), knowing that in the long-term maturing migration networks can lead to slowly falling skill ratios due to falling migration costs that benefits low-skilled migrants in particular. Our regression results are therefore highly tentative (Table 6, columns (5) and (6)), but additional tests using DIOC 2010/11 data indicate their robustness.
  18. 18 For example, the likelihood that a French and a UK resident can communicate in a common language will increase if UK residents include many French immigrants, and French residents include many immigrants from the UK.
    • The full text of this article hosted at iucr.org is unavailable due to technical difficulties.