Vowel processing evokes a large sustained response anterior to primary auditory cortex
Abstract
The present study uses electroencephalography (EEG) and a new stimulation paradigm, the ‘continuous stimulation paradigm’, to investigate the neural correlate of phonological processing in human auditory cortex. Evoked responses were recorded to stimuli consisting of a control sound (1000 ms) immediately followed by a test sound (150 ms). On half of the trials, the control sound was a noise and the test sound a vowel; to control for unavoidable effects of spectral change at the transition, the roles of the stimuli were reversed on the other half of the trials. The acoustical properties of the vowel and noise sounds were carefully matched to isolate the response specific to phonological processing. As the unspecific response to sound energy onset has subsided by the transition to the test sound, we hypothesized that the transition response from a noise to a vowel would reveal vowel-specific processing. Contrary to this expectation, however, the most striking difference between vowel and noise processing was a large, vertex-negative sustained response to the vowel control sound, which had a fast onset (30–50 ms) and remained constant throughout presentation of the vowel. The vowel-specific response was isolated using a subtraction technique analogous to that commonly applied in neuroimaging studies. This similarity in analysis methodology enabled close comparison of the EEG data collected in the present study with the relevant functional magnetic resonance imaging (fMRI) literature. Dipole source analysis revealed the vowel-specific component to be located anterior and inferior to primary auditory cortex, consistent with previous data investigating speech processing with fMRI.
Introduction
A major goal in cognitive neuroscience is understanding how and where language is processed in the brain. The present study investigates the suitability of a novel approach to the study of speech sound processing in human auditory cortex using electroencephalography (EEG). Both EEG and magnetoencephalography (MEG) have become widely used in the study of speech and language processing as their millisecond temporal resolution enables investigation of the temporal properties of the brain response. A popular tool for the study of auditory processing using electrophysiological techniques is the mismatch negativity (MMN), which is a brain response elicited by a rare (deviant) stimulus occasionally presented in a sequence of frequent (standard) stimuli (see, e.g. Näätänen et al., 1978; Alho, 1995; Näätänen & Alho, 1995; Picton, 1995; Picton et al., 2000). The amplitude and topography of the MMN have been shown to depend upon the nature and salience of the acoustic or perceptual difference introduced in the deviant stimulus and the paradigm has been applied to the study of many sound features, including speech sound processing (Näätänen et al., 1997; Schulte-Körne et al., 1998, 2001; Rinne et al., 1999; Koyama et al., 2000; Shtyrov et al., 2000; Jaramillo et al., 2001; Pulvermüller et al., 2001, 2004; Eulitz & Lahiri, 2004).
An alternative approach for the study of feature-specific processing utilizes the ‘continuous stimulation paradigm’ (CSP) which involves preceding a test sound, which possesses the feature of interest, with a control sound that does not possess the feature, but whose acoustical properties match those of the test sound as far as possible. It is assumed that the response elicited by the transition from the control to the test sound reflects processing specific to the test feature as the response to sound energy onset has subsided and those neurons responsible for processing the features common to both the test and the control sound have adapted (Arlinger et al., 1982; Jones et al., 1991; Jones, 2003; Krumbholz et al., 2003; Ungan & Özmen, 1996; May et al., 1999; Ungan et al., 2001). An advantage of the CSP over the MMN approach is that recording times may be reduced; in the MMN paradigm, the standard stimulus must be presented on ∼ 80% of the trials in order to generate a reliable mismatch response, despite the fact that it is primarily the response to the deviant that is of interest.
Independent of the specific stimulation paradigm, the selection of an appropriate baseline stimulus is imperative in the study of speech processing to ensure appropriate interpretation of data with regard to speech- or language-specific processing (Scott & Wise, 2004). The current study carefully matched speech and nonspeech sounds such that the perception of the two sounds differed whilst the acoustic properties of the stimuli remained as consistent as possible. The use of such carefully matched stimuli together with the CSP revealed that the processing of vowel sounds is associated with a sustained response with an exceptionally short latency generated by an independent source anterior and inferior to that activated by nonspeech sounds.
Materials and methods
Participants
Fifteen participants (eight female, seven male, age range 23–40 years) took part in this study after giving written informed consent. All subjects were strongly right-handed, as assessed by a revised version of the Edinburgh inventory (Oldfield, 1971), and had no history of audiological or neurological disease. The experimental procedures conformed with the Code of Ethics of the World Medical Association (Declaration of Helsinki) and were approved by the Ethics Committee of the University of Nottingham Medical School.
Data acquisition
Auditory evoked potentials were recorded in an acoustically shielded room with an equidistantly arranged 61-channel EEG cap (Easy Cap; Falk Minow Services, Munich, Germany). Data were recorded continuously at a sampling rate of 500 Hz and were high-pass filtered on-line at 0.1 Hz. The participants watched a self-chosen silent movie during the recording. The quality of the EEG recording was monitored throughout and the data were stored on the computer for off-line processing. Three additional skin electrodes were positioned around the left eye to enable eye movement correction. The ground electrode was placed on a midline position on the forehead and, for on-line monitoring, data were referenced to an additional midline electrode in the occipital region, a little superior to the inion.
Stimuli
The stimuli consisted of a 1000-ms ‘control’ sound and a 150-ms ‘test’ sound; on half of the trials the control sound was a randomly filtered noise and the test sound was a vowel, and for the remaining half of the trials the reverse was true (Fig. 1). The acoustic properties of the noise and vowel sounds were matched as far as possible, whilst still creating the perception of a vowel in one case but not the other. The vowel sounds consisted of noise, bandpass-filtered around the first three canonical formant frequencies of one of the vowels /a/, /e/, /i/ or /o/. Each of the three pass-bands had a bandwidth of ±10% around the respective formant frequency and was weighted according to a sloping (−6 dB per octave) spectral profile. The noise sounds were similarly filtered around three frequencies. However, in this case, each of the three frequencies was selected randomly from the range set by the lowest and highest frequency values of the respective formant for the four vowels, and the bandwidth of the filter pass-band was increased to ±30% around the selected filter frequency. The larger bandwidth was used as a precaution to minimize the chance of the noises sounding like a vowel. The larger bandwidth also made the noises sound less like nonspeech human utterances (e.g. burps). The spectral change response at the transition was very similar for the noise-to-vowel stimulus and the vowel-to-noise stimulus (see Results), indicating that this difference in bandwidth had no significant effect on the response. Both vowel and noise sounds were multiplied by a periodic envelope mimicking the glottal pulse signal. The shape of each glottal pulse was approximated by a gamma function with a fast attack and an exponential decay with a half-life of 2.5 ms; the pulses were repeated at a rate of 100 Hz. The control and test sounds were gated on and off with a 5-ms cosine-squared ramp. At the transition from the control to the test sound the ramps overlapped, so that the envelope of the composite stimulus remained flat. The overall intensity of the sound remained constant throughout, at 65 dB SPL. The vowel sounds created a somewhat degraded (pathological), but highly identifiable, perception of the respective vowel. The pathological character of the vowels was due to the fact that they were produced using a filtered-noise rather than a complex-tone carrier. The sound quality of the noise could change considerably from trial to trial, depending on the randomly selected filter frequencies; however, the noise sounds were judged by the experimenters never to appear vowel- or speech-like. Although the vowel and noise stimuli were as acoustically similar as possible, we expected the unavoidable change in spectral composition from control to test sound to elicit a transition response in itself, independent of the sounds' perceptual attributes. Thus, both the transition from noise to vowel and from vowel to noise were presented to dissociate any neural activity specific to the perception of a vowel from the unspecific spectral change response present in both transitions.
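To make the synthesis procedure concrete, the following is a minimal sketch of the vowel-sound construction described above, written in Python with NumPy/SciPy rather than the Matlab/TDT system actually used. The formant frequencies given for /a/ are illustrative assumptions (the exact values are not listed here), and the gamma-shaped glottal pulse is approximated by an instantaneous attack with an exponential decay.

```python
import numpy as np
from scipy.signal import butter, lfilter

FS = 12200  # stimulus sampling rate given in the Methods (Hz)

def bandpass_noise(dur, centres, rel_bw, fs=FS):
    """Sum of three noise bands, each +/- rel_bw around a centre frequency,
    weighted to follow a -6 dB/octave spectral slope (amplitude ~ 1/f)."""
    n = int(dur * fs)
    out = np.zeros(n)
    for f in centres:
        b, a = butter(4, [f * (1 - rel_bw), f * (1 + rel_bw)],
                      btype='band', fs=fs)
        out += lfilter(b, a, np.random.randn(n)) * (centres[0] / f)
    return out

def glottal_envelope(dur, rate=100.0, half_life=0.0025, fs=FS):
    """100-Hz pulse train; each gamma-shaped glottal pulse is approximated
    here by a fast (instantaneous) attack and an exponential decay with a
    2.5-ms half-life."""
    period = int(fs / rate)
    t = np.arange(period) / fs
    pulse = np.exp(-np.log(2) * t / half_life)
    n = int(dur * fs)
    return np.tile(pulse, n // period + 1)[:n]

def cos2_gate(x, ramp_dur=0.005, fs=FS):
    """5-ms cosine-squared onset/offset ramps (at the control-to-test
    transition the ramps were overlapped to keep the envelope flat)."""
    r = int(ramp_dur * fs)
    w = np.sin(0.5 * np.pi * np.arange(r) / r) ** 2
    x[:r] *= w
    x[-r:] *= w[::-1]
    return x

# Illustrative (assumed) formants for /a/; +/-10% bands for the vowel.
vowel = cos2_gate(bandpass_noise(1.0, [730.0, 1090.0, 2440.0], 0.10)
                  * glottal_envelope(1.0))
# The matched noise uses three random centre frequencies drawn from the
# range spanned by the four vowels' formants, with wider +/-30% bands.
```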

Fig. 1. Temporal waveform of a noise that becomes a vowel (panel a; an /a/ in this example) and a vowel that becomes a noise (panel b). Panels (c) and (d) show the long-term spectra of a vowel (/a/) and a noise, respectively. The three formant frequencies around which the vowel sound was filtered are indicated by downward arrows and are labelled F1, F2 and F3 (panel c) and the three random frequencies selected for the noise are labelled Frq1, Frq2 and Frq3 (panel d). Note the difference in bandwidth of the frequency bands between the vowel and noise.
The stimuli were generated digitally with a sampling rate of 12.2 kHz and a 24-bit resolution using Tucker Davis Technologies (TDT, Florida, USA) System 3 and Matlab. They were passed through a headphone amplifier (HB7, TDT) and presented diotically through headphones (K240 DF; AKG, Vienna, Austria). The stimuli were generated afresh throughout the experiment using new noise samples for every trial. The interstimulus interval (ISI; from the end of one stimulus to the onset of the next) was 1000 ms. Noise-to-vowel transitions were randomly interleaved with vowel-to-noise transitions, with each type of stimulus being presented a total of 600 times. Responses to individual vowel types were not analysed separately; rather, responses to all vowel sounds were pooled for each of the two types of transition. Comparing responses to different vowels would not have been meaningful in the current study, because any differences between the responses would have been confounded with the spectral differences between the corresponding vowels. Presentation of the stimuli occurred in three blocks of equal length, between which subjects had a short break.
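The trial schedule implied by this description is straightforward; a sketch under the stated assumptions (equiprobable vowels per trial, which is our reading of "randomly presenting one of four different vowels" below) might look like this:

```python
import numpy as np

rng = np.random.default_rng()

# 600 noise-to-vowel and 600 vowel-to-noise trials, randomly interleaved;
# the vowel on each trial is one of /a/, /e/, /i/, /o/ (responses pooled).
conditions = np.array(['noise_to_vowel'] * 600 + ['vowel_to_noise'] * 600)
rng.shuffle(conditions)
vowels = rng.choice(['a', 'e', 'i', 'o'], size=conditions.size)

blocks = np.split(conditions, 3)  # three presentation blocks of equal length
ISI = 1.0                         # silent gap from stimulus offset to next onset (s)
```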
Data processing and source analysis
The continuous raw data files were corrected for eye-blink artefacts using the Gratton et al. (1983) algorithm implemented in the BrainVision Analyser software (Brain Products GmbH, Munich, Germany) and re-referenced to the average of all 61 channels. Data exceeding a max − min difference of 150 µV within 100 ms were considered artefactual and a 500-ms window surrounding the artefact was removed from subsequent analysis. After low-pass filtering at 35 Hz (with a 48 dB/octave slope), the data were divided into 2000-ms epochs, including a 500-ms prestimulus period, and baseline-corrected to the 200-ms period before the onset of the control stimulus. The epochs for each condition were then averaged for each subject, and the grand average across participants was calculated for each stimulation condition. Equivalent dipole source analysis was employed to estimate the location of the neural generators of the responses (BESA 5.1; Gräfelfing, Germany) using a four-shell ellipsoidal volume conductor as head model. Further details of the dipole analysis are given in the Results section (Source modelling).
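For readers wishing to approximate this pipeline, the following MNE-Python sketch reproduces the main steps. It is not the authors' pipeline: the original analysis used BrainVision Analyser, the Gratton et al. (1983) ocular correction has no direct MNE equivalent (ICA is the usual substitute), and MNE's peak-to-peak rejection is applied per epoch rather than within a sliding 100-ms window. The file name is a placeholder.

```python
import mne

raw = mne.io.read_raw_brainvision('subject01.vhdr', preload=True)  # placeholder file
# Ocular correction: the paper used the Gratton et al. (1983) algorithm;
# in MNE one would typically remove blink components with ICA instead.
raw.set_eeg_reference('average')      # re-reference to the mean of all 61 channels
raw.filter(l_freq=None, h_freq=35.0)  # 35-Hz low-pass (slope differs from 48 dB/oct)

events, event_id = mne.events_from_annotations(raw)
epochs = mne.Epochs(raw, events, tmin=-0.5, tmax=1.5,  # 2000-ms epochs
                    baseline=(-0.2, 0.0),              # 200-ms prestimulus baseline
                    reject=dict(eeg=150e-6),           # 150-µV peak-to-peak limit,
                                                       # applied per epoch (see note)
                    preload=True)
evoked = epochs.average()  # per-subject average; grand averaging across subjects follows
```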
Results
Contrary to our initial expectation that vowel-specific processing would be reflected in the transition response, the most apparent difference between the responses elicited by the noise-to-vowel and vowel-to-noise stimuli was in the vertex-negative sustained response elicited by the respective sounds. Figure 2a and b shows that both the initial control portion and, to a lesser degree, the (much shorter) test portion of the stimuli elicited a sustained response. The sustained response was considerably larger when the eliciting sound was a vowel rather than a noise. This difference in sustained response between vowel and noise sounds can be seen particularly clearly in the time period from ∼ 400–1000 ms after stimulus onset as, in this time period, no transient responses are superposed on the sustained response. A paired t-test of the average root-mean-square (rms) amplitude of the individual responses to the vowel and noise sounds within this time range confirmed that the difference was highly significant (t14 = 5.06, P < 0.001; compare Fig. 2c).
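The rms comparison described here is simple to state in code. A minimal sketch, using random placeholder arrays in place of the real per-subject averages (15 subjects, 61 channels, 2000-ms epochs at 500 Hz with the 500-ms prestimulus period included):

```python
import numpy as np
from scipy.stats import ttest_rel

fs, pre = 500, 0.5  # EEG sampling rate (Hz); 500-ms prestimulus period
# Placeholder data of shape (n_subjects, n_channels, n_samples); in the
# real analysis these would be the per-subject averaged responses.
resp_vowel = np.random.randn(15, 61, 1000)
resp_noise = np.random.randn(15, 61, 1000)

win = slice(int((pre + 0.4) * fs), int((pre + 1.0) * fs))  # 400-1000 ms post-onset

def mean_window_rms(resp):
    """rms across channels at each time point, averaged over the window."""
    return np.sqrt((resp[:, :, win] ** 2).mean(axis=1)).mean(axis=1)

t, p = ttest_rel(mean_window_rms(resp_vowel), mean_window_rms(resp_noise))
print(f't(14) = {t:.2f}, p = {p:.3f}')  # the paper reports t14 = 5.06, P < 0.001
```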

Fig. 2. Grand-average evoked responses elicited by (a) the noise-to-vowel stimulus and (b) the vowel-to-noise stimulus from all 61 electrodes (grey traces). The recording from the vertex electrode (Cz) is shown in bold in these two panels. The root mean square (rms) amplitude, calculated across all channels, is shown in panel (c) for each stimulus. The response to the noise sound is represented by the dashed line and the response to the vowel sound by the solid line. The period from 400 to 1000 ms after stimulus onset, during which the difference in the size of the sustained responses elicited by the two kinds of stimuli can be seen most clearly, is highlighted in grey in panel (c).
Transient responses were elicited both by the onset of the control sound and by the transition from the control to the test sound (see Fig. 2). The general morphology of these onset and transition responses differed considerably from one another. The onset response to both the noise (Fig. 2a) and vowel (Fig. 2b) sounds was triphasic, comprising a vertex-positive deflection peaking at ∼ 55 ms after sound onset (P1), a negative deflection peaking at ∼ 95 ms (N1) and another positive deflection peaking at ∼ 177 ms (P2; the vertex channel, Cz, is shown in bold in Fig. 2a and b). In contrast, the transition response appeared to consist of only two deflections, a negative deflection peaking at ∼ 117 ms after the transition (henceforth referred to as tN1), followed by a positive deflection peaking at ∼ 208 ms (tP2). The negative deflections in both the onset and the transition response (N1 and tN1) were significantly larger in amplitude when the response was elicited by a vowel rather than a noise (N1: t14 = −2.45, P = 0.014; tN1: t14 = −4.19, P < 0.001; see upward pointing arrows in Fig. 2a and b). Conversely, there was a trend for the positive deflections (P1, P2 and tP2) to be smaller in amplitude for the vowel than the noise sounds; this trend was significant for the P2 and tP2 (P2: t14 = 1.92, P = 0.038; tP2: t14 = 3.50, P = 0.002; see downward pointing arrows). This pattern of results suggests that the differences between the transient (onset and transition) responses to the noise and vowel sounds were a consequence of the difference in the sustained responses elicited by the two types of stimuli (superposed on the transient responses), rather than representing actual differences in the transient responses themselves. This conjecture was confirmed by the difference between the responses for the two stimulus conditions shown in Fig. 3, which was generated by subtracting the response to the noise-to-vowel stimulus (Fig. 2a) from that to the vowel-to-noise stimulus (Fig. 2b). An enlarged view of the difference response around sound onset (Fig. 3b) reveals that the difference in sustained response between vowel and noise sounds (red line) began within the time range of the P1 (between 30 and 50 ms after stimulus onset) and had almost reached its full amplitude within the time range of the N1 (∼ 100 ms). Like the sustained parts of the original responses (black lines), the difference response then remained remarkably constant up to the transition, after which it switched its sign with dynamics as fast and early as those of its onset (see Fig. 3c).
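The subtraction that produces the difference response of Fig. 3 is a simple channel-wise operation on the grand averages. A minimal sketch, again with random placeholder arrays standing in for the real grand averages:

```python
import numpy as np

# Placeholder grand averages, shape (n_channels, n_samples):
# 61 channels, 2000-ms epochs at 500 Hz = 1000 samples, onset at sample 250.
grand_noise_to_vowel = np.random.randn(61, 1000)
grand_vowel_to_noise = np.random.randn(61, 1000)

# Difference response of Fig. 3: vowel-to-noise minus noise-to-vowel.
difference = grand_vowel_to_noise - grand_noise_to_vowel

# Channel with the largest mean difference in the 400-1000-ms window
# (the trace highlighted in red in Fig. 3a).
win = slice(int((0.5 + 0.4) * 500), int((0.5 + 1.0) * 500))
peak_channel = np.abs(difference[:, win].mean(axis=1)).argmax()
```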

Fig. 3. The grand average of the response specific to the vowel over and above that to the noise (i.e. the difference response) is shown in grey for all channels. The channel with the largest difference response between 400 and 1000 ms after stimulus onset is highlighted in red. The vertex signals (Cz) of the original responses to the noise-to-vowel and vowel-to-noise stimuli are shown in black; as in Fig. 2c, the dashed line represents the response to a noise and the solid line represents the response to a vowel. Panels (b) and (c) show enlarged views of the time ranges around stimulus onset and transition (indicated by the bold lines on the abscissa of panel a).
In addition to the difference in size of the sustained responses generated by the noise and vowel sounds, the channel traces in Fig. 2a and b also indicate a difference in the topography of the two responses. The bold trace illustrates the signal recorded from the vertex channel; this represents the largest (most negative) sustained response to the noise (Fig. 2a), but not to the vowel sound (Fig. 2b). Figure 4a and b, which shows the scalp distribution of the noise and vowel sustained responses, averaged over the period from 400 to 1000 ms after stimulus onset, reveals that the sustained response to the vowel (Fig. 4b) exhibited a more anterior topography than the response to the noise (Fig. 4a). This suggests that the larger sustained response to the vowel was generated by an additional source located anterior to the source of the sustained response to the noise. The topography of this additional source, which would be assumed to represent processes specific to the perception of vowels, is reflected in the scalp distribution of the difference between the vowel and noise responses (again, averaged over the 400–1000-ms time window), shown in Fig. 4c.

Fig. 4. Scalp distributions of the grand-average sustained response averaged over 400–1000 ms after stimulus onset to (a) the noise and (b) vowel control sounds. The scalp distribution of the grand-average difference response (vowel − noise) averaged over the same time region is shown in panel (c).
Source modelling
Equivalent dipole modelling was used to estimate the source locations of the noise- and vowel-evoked sustained responses (BESA 5.1; see above). Using an approach similar to that of Gutschalk et al. (2002), we first determined the source of the noise-evoked sustained response by fitting the locations and orientations of two dipoles, one in each hemisphere, to the grand-average sustained response to the noise control sound within the time window from 400 to 1000 ms after stimulus onset (Fig. 2a). The locations of the dipoles were constrained to be mirror-symmetric about the mid-sagittal plane. Their orientations were unconstrained; this means that the dipoles would reflect not only tangential but also radial contributions to the activity. In auditory EEG data, the symmetry constraint is introduced to avoid obtaining biologically implausible solutions, which often come about because the scalp distribution of a bilateral response from auditory cortex looks similar to the distribution of a response from a single source in the centre of the head. In the current case, however, the symmetry constraint would not strictly have been necessary, as re-running the analysis without it yielded practically identical results. Based on previous findings (Gutschalk et al., 2002, 2004), it was assumed that the noise-evoked sustained response represented an unspecific response to the presence of sound energy and that it would thus also be active during the vowel sounds. Using the same procedure, two other dipoles were then fitted to the grand-average difference response (i.e. the difference between the vowel- and noise-evoked sustained responses; see Fig. 3) within the same time window to reflect the additional sustained activity generated by the vowel, over and above the unspecific noise response. The two sets of dipoles were then combined into a four-dipole model (with two dipoles in each hemisphere); within the fit window, the residual variance of the dipole model was 2.86% for the noise-sustained response and 1.34% for the vowel-sustained response. This model was used as a spatial filter to derive the activation time-course of each of the four sources (source waveforms) for the two stimulus conditions separately for each individual.
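The residual variance quoted here has a simple definition: the fraction of measured signal power over the fit window that the dipole model leaves unexplained. A minimal sketch of that computation, and of how a fixed dipole model acts as a linear spatial filter, assuming a known leadfield matrix L (the paper itself used BESA's implementation; the arrays below are random placeholders):

```python
import numpy as np

def residual_variance(measured, modelled):
    """Fraction of signal power unexplained by the dipole model over the
    fit window; the paper reports 2.86% (noise) and 1.34% (vowel)."""
    return np.sum((measured - modelled) ** 2) / np.sum(measured ** 2)

# Placeholder leadfield (n_channels x 4 dipoles) and data (n_channels x n_times).
L = np.random.randn(61, 4)
channel_data = np.random.randn(61, 1000)

# Applying the fixed four-dipole model as a spatial filter: least-squares
# projection of the channel data onto the model's source space.
source_waveforms = np.linalg.pinv(L) @ channel_data  # (4, n_times)
```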
Based on the scalp distributions of the noise-evoked sustained response (Fig. 4a), we expected the source of the unspecific, energy-related response (reflected by the noise response) to be located within the region of the auditory cortex on the supra-temporal plane. In contrast, the scalp distribution of the difference response (Fig. 4c), representing the additional vowel-specific response, suggests a more anterior source, possibly outside the ‘classical’ (unimodal) auditory areas. This was indeed found to be the case, with the dipoles fitted to the noise response (shown in black in Fig. 5) located in the region of the auditory cortex on the supra-temporal plane, and the dipoles fitted to the difference response (shown in grey) located 15 mm anterior and 19 mm inferior to the noise dipoles. The location of the difference dipoles is consistent with activity arising from supra-temporal areas anterior to the primary auditory cortex, within the region of the planum polare.

Fig. 5. Locations and orientations of the noise (black) and difference (grey) dipoles fitted to the grand average data. The dipoles are shown projected onto (a) a sagittal, (b) a coronal and (c) a horizontal plane of an average brain (see crosshair). The noise and difference dipoles had approximate Talairach coordinates of ±42.3, −25.7, 16.6 mm and ±39.8, 9.4, 0.2 mm, respectively.
Figure 6 shows the grand average source waveforms for the noise dipoles (Fig. 6, upper panels, a and b) and the difference dipoles (Fig. 6, lower panels, c and d) to the noise-to-vowel and vowel-to-noise stimuli; the left and right panels show the results for the dipoles in the left and right hemispheres, respectively. In order to estimate each of the dipoles' source strengths for the noise- and vowel-evoked sustained responses, the mean amplitude of the respective source waveform was calculated over the time window 400–1000 ms after stimulus onset (highlighted in grey in Fig. 6) for each individual; these average source amplitudes are presented in Fig. 7. The source strength of the noise dipoles during the noise- and vowel-evoked sustained responses was practically identical (compare black bars across Fig. 7a and b), confirming the assumption that the noise dipoles represent an unspecific source common to the two types of sound. In contrast, the difference dipoles exhibited a large sustained response to the vowel but not the noise sounds (compare grey bars). This indicates that the increased sustained response to the vowels was indeed due to an additional source, represented by the difference dipoles. The interaction between source type (noise vs. difference dipoles) and stimulus condition (noise vs. vowel) in Fig. 7 was significant (F1,29 = 5.87, P = 0.022).
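The source-by-stimulus interaction reported here could be tested along the following lines. This is a sketch with synthetic placeholder amplitudes, using statsmodels' repeated-measures ANOVA rather than whatever package the authors used, and with hemispheres collapsed for brevity:

```python
import numpy as np
import pandas as pd
from statsmodels.stats.anova import AnovaRM

# Placeholder mean source amplitudes over the 400-1000-ms window: one value
# per subject x source type (noise/difference dipole) x stimulus (noise/vowel).
rng = np.random.default_rng(0)
rows = [{'subject': s, 'source': src, 'stimulus': stim,
         'amplitude': rng.normal()}
        for s in range(15)
        for src in ('noise_dipole', 'difference_dipole')
        for stim in ('noise', 'vowel')]
df = pd.DataFrame(rows)

# Source-by-stimulus interaction (the paper reports F = 5.87, P = 0.022).
res = AnovaRM(df, depvar='amplitude', subject='subject',
              within=['source', 'stimulus']).fit()
print(res)
```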

Fig. 6. Grand-average source waveforms for each of the four dipoles. The source waveforms for the noise dipoles are shown in the top panels (a and b) and those for the difference dipoles are shown below (panels c and d). The left and right panels show responses from the left- and right-hemisphere dipoles, respectively. As in previous figures, the dashed lines represent the response to the noise and the solid lines the response to the vowel. Stimulus onset and the transition from one sound to the other are indicated by dashed vertical lines. The shaded area indicates the area over which source amplitudes were averaged for further analysis.

Fig. 7. Mean (and SEM) source strengths (averaged over 400–1000 ms after stimulus onset, see grey shaded area in Fig. 6) of the noise (solid bars) and difference dipoles (shaded bars) for (a) the noise- and (b) vowel-evoked sustained responses, shown for the left and right hemispheres.
Hemispheric differences
It is often assumed that processes relating to speech perception are lateralized towards the left hemisphere. However, the literature on this question remains highly controversial (for a recent review, see Shtyrov et al., 2005). In the current data, there was a general trend for the unspecific, energy-related sustained response, reflected by the noise dipoles, to be larger in the right hemisphere than the left (see black bars in Fig. 7). This effect is also evident in the scalp distribution of the original channel data, where the response to the noise sounds is skewed towards the right hemisphere (Fig. 4a). This is particularly visible at positive polarity (highlighted in red). Interestingly, neither the scalp distribution nor the source strengths for the vowel-specific sustained response, represented by the difference between the vowel- and noise-evoked responses (Fig. 4c and grey bars in Fig. 7b), show this pattern. Here, the trend was for the response to be larger in the left hemisphere than the right. However, this trend did not reach statistical significance across the whole data set. A closer inspection of the data prompted us to consider the effects of gender on the lateralization of the vowel-specific response. We found that, whilst the interaction between gender and hemisphere did not reach statistical significance, t-tests showed that the vowel-specific response was significantly larger in the left hemisphere than the right for males (t6 = 2.832, P = 0.03) but not for females (t7 = −0.553, P = 0.598).
Discussion
This study applied the continuous stimulation paradigm to investigate the neural correlates of vowel perception in human auditory cortex. The speech (vowels) and nonspeech stimuli (noises) were carefully matched to ensure that any difference in the evoked responses to the two types of sounds could be attributed to ‘vowel-specific’ processing mechanisms, rather than acoustic differences between the sounds. We initially expected any vowel-specific response to be transient in nature and, similar to other feature-specific responses, such as the pitch onset response (Krumbholz et al., 2003) or the response to a change in interaural correlation (Chait et al., 2005), to be most obvious at the transition from the control to the test portion of the stimuli. However, we found that the main difference between the noise- and vowel-evoked responses was a large and rapid sustained response to the vowels, which was present during the control portion of the vowel-to-noise stimulus and, to a lesser degree, also the test portion of the noise-to-vowel stimulus. This vowel-specific sustained response began within the time window of the P1 deflection in the transient response at sound onset (30–50 ms after sound onset) and lasted throughout stimulus presentation. Due to this sustained response, the negative deflections in the transient responses to the onset of the vowel control sounds and the transition from a noise to a vowel appeared enhanced, and the positive deflections diminished relative to the corresponding deflections in the respective noise-evoked responses.
Eulitz et al. (1995) similarly found a large sustained negativity following prolonged (600 ms) presentation of vowel sounds. However, in their study subjects performed a task in which they were asked to detect a particular vowel, and so increased attention or vigilance during the vowel compared to the control stimuli could not be excluded as the cause of this sustained response. Evidence that the amplitude of the sustained response increases when subjects attend to auditory stimuli (Picton et al., 1978; Hari et al., 1989; Sieroka et al., 2003) would support the attention-related explanation of the response. In contrast, as all stimuli in the present study were presented passively whilst subjects watched a silent movie, we are confident that the vowel-evoked sustained response was a result of stimulus-related differences rather than attentional discrepancies between stimulus conditions.
The selection of an appropriate baseline against which to compare responses evoked by speech sounds is an issue of concern in the study of speech and language processing, as it can be difficult to separate the effects of phonological processing from effects due to more basic (nonphonological) perceptual and acoustic differences between the test and control stimuli (Scott & Wise, 2004). In the current experiment, the general properties of the noise sounds almost exactly matched those of the vowel sounds, and the actual spectral composition of both types of sounds was randomized from trial to trial (by randomly presenting one of four different vowels in the case of the vowels, and by randomizing the ‘formant’ frequencies in the case of the noises). Thus, it seems justified to interpret the differences between the noise- and vowel-evoked responses in the current study on the basis of the phonological differences between the sounds, because the other differences were varied randomly from trial to trial.
One approach for the study of speech-specific processing is to conduct cross-lingual studies, which exploit the fact that different languages comprise different speech sounds (Dehaene-Lambertz, 1997; Näätänen et al., 1997; Winkler et al., 1999; Dehaene-Lambertz et al., 2000). Rather than presenting speech and nonspeech sounds to individual participants, responses to identical sounds are compared across participants with different linguistic backgrounds. As the stimuli are acoustically identical, issues regarding appropriate baselines are avoided. Typically, a control sound is chosen which is a prototypical speech sound in the native language of both groups of subjects, whilst the test sound is a prototypical speech sound in the language of one group of participants but not the other. For example, Näätänen et al. (1997) conducted an MMN experiment on Finnish and Estonian participants, in which the standard (/e/) and all deviants (/ö/, /õ/ and /o/) were prototypical speech sounds in Estonian, and all but the deviant /õ/ were prototypical speech sounds in Finnish. The amplitude of the MMN in Estonian participants depended only on the size of the frequency difference between standard and deviant sounds. In the Finnish participants, the same pattern was true for the prototypical speech sounds; however, the MMN was considerably smaller to the deviant /õ/, which was not perceived as a speech sound by these participants. That the MMN is larger to familiar speech sounds (Dehaene-Lambertz, 1997, 2000; Näätänen et al., 1997; Winkler et al., 1999), and to words compared to pseudo-words (Pulvermüller et al., 2001, 2004; Shtyrov & Pulvermüller, 2002; Endrass et al., 2004) has been interpreted in terms of the activation of a ‘language-specific’ memory trace in addition to the activation resulting from the acoustical properties of the sound. The current study indicates the existence of a phonology-specific vertex-negative sustained response. Such a phonology-related response may explain the enhanced size of the MMN to language-relevant stimuli, as it would enhance the amplitude of the negative deflections to speech deviants relative to nonspeech deviants.
The scalp distribution maps and the results of the source analysis indicate that the vowel-specific sustained response was generated by a different neuronal population to that responsible for the sustained response to the noise. The dipole location for the noise-evoked sustained response was consistent with activity within the region of auditory cortex on the supra-temporal plane. In contrast, the dipole location for the vowel-specific ‘difference’ response indicated activity further anterior and inferior, probably in the region of the planum polare or, less probably, the superior temporal sulcus. The difference dipole seems too anterior to be in the region of the ‘classical’ (unimodal) auditory cortex. The planum polare has previously been implicated in processes associated with language comprehension (Friederici et al., 2000a,b; Meyer et al., 2000) as well as music processing (Koelsch et al., 2002). Phonology-specific activity anterior to primary auditory regions is inconsistent with conventional neuroanatomical models of language processing, which stress the importance of the posterior extent of the superior temporal gyrus, including the planum temporale (PT; Wernicke, 1874; Geschwind & Levitsky, 1968; Braak, 1978; Foundas et al., 1994). However, it has since been suggested that the role of the PT may have been over-emphasized in traditional models of language (Binder et al., 1996, 1997), as much recent neuroimaging research has converged on the importance of relatively anterior and inferior regions for the processing of speech and language (Binder et al., 1996, 1997, 2004; Binder, 2000; Scott et al., 2000; Obleser et al., 2003, 2006; Scott & Johnsrude, 2003; Warren et al., 2006). These results are supported by findings that the processing of object-related sound features predominantly engages a pathway of areas anterior to primary auditory cortex (‘what’ pathway), whereas areas posterior to the primary auditory cortex, most importantly the PT, seem to be more responsive to changes in spatial sound attributes (‘where’ pathway; Rauschecker & Tian, 2000; Tian et al., 2001; Zatorre et al., 2004; Scott, 2005). Scott & Wise (2004) propose that regions posterior to primary auditory cortex, including the PT, are involved in the analysis of temporal patterns within sounds as opposed to the processing of speech sounds per se. This hypothesis is based on reports that the PT responds to sequences of sounds as simple as tones (Binder et al., 1996), and to signal-correlated noise (noise modulated with the temporal envelope of speech), which contains the temporal information mediated by speech but does not contain the corresponding spectral information necessary to recognize the words from which it was formed (Wise et al., 2001). Similarly, Jäncke et al. (2002) explain their finding that the PT responds more strongly to consonant–vowel (CV) syllables than to tones, noises or vowels in the framework of Tallal and coworkers' hypothesis that the PT contains neurons that specialize in the processing of rapidly changing acoustic cues, irrespective of the speech-like qualities of the sounds (Tallal et al., 1993; but see also Schönwiesner et al., 2005). The data reported in the current study are consistent with these hypotheses, as the source of the response to the vowel sounds, which contained rich spectral information but little temporal variation, was found to be anterior to primary auditory cortex rather than on the PT.
Scott (2005) suggests that the anterior stream becomes progressively more responsive to intelligible speech along its length, until areas specifically in left anterior superior temporal sulcus respond to intelligible speech (Scott et al., 2000; Narain et al., 2003). Our data indicate that there is a centre on the anterior supra-temporal plane, in the region of the planum polare, which is sensitive to the perception of simple phonemes. The sounds' vowel-like perceptual quality, in addition to their acoustic structure, was necessary for activation in this region, as the equally spectrally complex noises did not activate this area. However, this region appears to represent a relatively ‘low-level’ language area, as semantic content was not necessary to elicit activation. Recent functional magnetic resonance imaging (fMRI) data similarly indicate that activity is evoked in regions anterior to auditory cortex during the processing of simple vowel stimuli (Obleser et al., 2006). The subtraction technique employed in the current study to identify the ‘vowel-specific’ component of the response closely mirrors the technique commonly employed in fMRI studies, in which the response to a control stimulus is subtracted from that elicited by a test stimulus to reveal the response associated with processes invoked by the test but not the control stimulus. This similarity in analysis procedure allows close comparison between results obtained by the two complementary techniques, and may explain why the current EEG results show such high concordance with previous fMRI data.
The transition from a noise to a vowel and from a vowel to a noise produced a prominent transient response; however, the main difference between the responses to these two kinds of transition was due to the difference in the sustained response elicited by the vowel and the noise sounds, rather than a difference in the transient responses themselves. This suggests that the transient responses represent a spectral change response, similar to that elicited by a change in frequency or intensity of an otherwise continuous pure tone (see, e.g. Arlinger et al., 1982; for a review, see Näätänen & Picton, 1987) and show little or no sensitivity to phonological processing. In MMN studies, a sequence of discrete sounds is presented in relatively fast succession. Presenting a speech deviant in a sequence of different speech or nonspeech standards would be expected to be accompanied by a spectral change similar to the spectral change between the noise and vowel sounds in the current experiment. Unless a cross-lingual approach is used (Dehaene-Lambertz, 1997; Näätänen et al., 1997; Winkler et al., 1999; Dehaene-Lambertz et al., 2000) or more complex stimulus designs are applied (Pulvermüller et al., 2001; Shtyrov & Pulvermüller, 2002; Shtyrov et al., 2005), it would be difficult to dissociate the speech-specific components from the unspecific spectral change response. This may explain why the locus of the speech-related MMN has on occasion been reported to be close to Heschl's gyrus, within ‘classical’ (unimodal) auditory cortex (Alho et al., 1998; Shestakova et al., 2002). Indeed, fitting a dipole to the response elicited by the vowel sound in the current study gave rise to a dipole much closer to primary auditory cortex within the region of the anterior bank of Heschl's gyrus (data not shown); it was only by subtracting the response to the noise stimulus and fitting the dipole to the vowel-specific difference response that the anterior location of the vowel-specific response became apparent.
Evidence that language processing occurs predominantly in the left hemisphere was reported over a hundred years ago, when it was shown that left-hemisphere lesions lead to disturbances in speech production and perception (Broca, 1861; Wernicke, 1874). The findings of many, though not all, experimental studies conducted since this time have supported the hypothesis that the left hemisphere is specialized for the processing of language. What is more controversial, however, is the level at which the processing of speech sounds becomes lateralized. Some researchers have found a left-hemisphere dominance for the passive processing of speech sounds (Rinne et al., 1999; Shtyrov et al., 2000), whilst others have found that attention to the speech sounds is required before lateralization is reliably detected (Poeppel et al., 1996). Other findings indicate that functional lateralization is not evident until phonemes are presented in the context of words, suggesting that it is the higher levels of speech perception that are left-lateralized rather than the processing of the acoustic properties of speech sounds (Pulvermüller et al., 2001; Shtyrov et al., 2005). Dipole source analysis was employed in the current study to determine the presence or otherwise of vowel-specific hemispheric lateralization in the absence of attention to the stimuli. The source analysis revealed that the nonspecific response to the noise stimulus tended to be larger in the right hemisphere than the left, and that the opposite was true for the vowel-sustained response represented by the difference dipoles. Closer inspection of the data revealed that this trend was a consequence of a significant left-hemisphere lateralization in the vowel-specific responses recorded from the male participants. In contrast, the responses recorded from female participants showed no trend towards leftward lateralization for the vowel response. While the interaction between gender and hemisphere did not reach statistical significance, these results suggest that there may be a link between gender and hemispheric lateralization in speech perception. Research into gender effects on the lateralization of language processing has shown mixed findings, seemingly dependent on the level of language processing investigated, but many studies have indicated that males do tend to show a greater degree of functional lateralization than females (see Kansaku & Kitazawa, 2001 for a review). One proposed explanation cites interhemispheric conduction delay (Ringo et al., 1994); the relative size of the isthmus of the corpus callosum, which contains commissural fibres connecting the language areas, has been shown to be larger in female than in male brains (Steinmetz et al., 1992; though see Bermudez & Zatorre, 2001), suggesting a greater efficiency and speed of interhemispheric connections between language areas, and thus a reduced need for functional lateralization, in females. Not all studies have found greater language-related lateralization in male brains, however. A study that would appear to be particularly relevant in the present context investigated the lateralization of vowel processing and, contrary to its initial hypothesis as well as the current results, found evidence that the evoked response to a vowel sound was left-lateralized in females but not males (Obleser et al., 2001).
In that study, the analysis was restricted to the peak amplitude of the N1 response, and the large sustained fields elicited by the sounds were not considered. In addition, the subjects attended to the sounds and, as attention is known to interact with the processing of vowels (Hugdahl et al., 2003), it is possible that this interaction may have influenced the relationship between gender and lateralization of the evoked responses.
Conclusion
By presenting vowel sounds immediately after carefully matched noise sounds we hoped to isolate any vowel-specific neural processing from that elicited by the onset of sound energy, and from the features common to both noise and vowel sounds. We believe that we have isolated features of the neural response which are specific to the processing of speech sounds, though not necessarily in the way that we had anticipated. Rather than the transition response from a noise to a vowel revealing features of ‘vowel-specific’ processing, it was the sustained response, generated by the prolonged presentation of the vowel sound as the control stimulus, which provided the most striking vowel-specific response. Source analysis revealed that this vowel-specific response was located anterior to primary auditory cortex, and may form part of the anterior ‘what’ processing stream.
Acknowledgements
This research was supported by the Medical Research Council (UK) and Deafness Research UK.
Abbreviations
CSP, continuous stimulation paradigm; EEG, electroencephalography; fMRI, functional magnetic resonance imaging; MEG, magnetoencephalography; MMN, mismatch negativity; PT, planum temporale