Skip Navigation


Cerebral Cortex Advance Access originally published online on February 21, 2007
Cerebral Cortex 2007 17(12):2777-2779; doi:10.1093/cercor/bhm005
This Article
Right arrow Abstract Freely available
Right arrow FREE Full Text (PDF) Freely available
Right arrow All Versions of this Article:
17/12/2777    most recent
bhm005v1
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Ruusuvirta, T.
Right arrow Articles by Näätänen, R.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Ruusuvirta, T.
Right arrow Articles by Näätänen, R.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

© The Author 2007. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oxfordjournals.org

Preperceptual Human Number Sense for Sequential Sounds, as Revealed by Mismatch Negativity Brain Response?

Timo Ruusuvirta1, Minna Huotilainen1,2,3 and Risto Näätänen1,3

1 Cognitive Brain Research Unit, Department of Psychology, 2 Collegium for Advanced Studies, University of Helsinki, Helsinki, Finland, 3 Helsinki Brain Research Centre, Helsinki, Finland

Address correspondence to Timo Ruusuvirta, Cognitive Brain Research Unit, Department of Psychology, PO Box 9, 00014 University of Helsinki, Helsinki, Finland. Email: timo.ruusuvirta{at}helsinki.fi.


    Abstract
 Top
 Abstract
 Introduction
 Materials and Methods
 Results
 Discussion
 References
 
Humans and some other species can nonlinguistically operate on the quantities of things or events, including sounds. Whether this ability is restricted to conscious percepts of sounds developing in ~200 ms is, however, unclear. To this end, we recorded the mismatch negativity (MMN) brain response, an index of preperceptual auditory change detection, of adult humans who passively listened to rare sequences of four 50-ms tones ("deviants") interspersed among frequently repeated tones ("standards"). Each tone was either 1000 or 1500 Hz in frequency. Deviants differed from standards in a ratio of the tones of the 2 frequencies. MMN was found for deviants by 160 ms from the onset of their largest ratio difference from standards (2:2 vs. 4:0), suggesting some ability of the human brain to operate on the number of sequential sounds of specific frequencies at a preperceptual time scale.

Key Words: auditory event–related potential • human • mismatch negativity • preattentive


    Introduction
 Top
 Abstract
 Introduction
 Materials and Methods
 Results
 Discussion
 References
 
Humans, just as some other species, can operate on approximate numbers (as quantities) of things or events without the support of language (Dehaene et al. 1998Go; Gallistel and Gelman 2000Go; Gordon 2004Go). The speed of related neural computations remains, however, largely unclear. One of the key issues in this respect is whether items must be fully perceived before their quantities can be operated on. Here, using the mismatch negativity (MMN, Näätänen 1990Go) of the event-related potential (ERP) of the human brain, we tackled this question regarding sequential sounds of specific frequencies.

The percept of a discrete, and thereby easily countable, auditory event takes ~200 ms to develop (Cowan 1984Go; Näätänen and Winkler 1999Go). MMN, in turn, can be recorded at ~100 to 200 ms following an auditory event ("deviant") that violates some invariant attribute of previous auditory events ("standards") (Näätänen 1990Go; Näätänen et al. 2001Go). It is held to reflect the preattentive detection, albeit not necessarily conscious perception, of deviants. Consistently, MMN-eliciting events can be composed of a number of tones of specific frequencies sequenced at intervals of just some tens of milliseconds (e.g., Schröger et al. 1992Go), tones that cannot contribute to MMN as discrete conscious percepts.

We recorded MMN from adult humans, who passively listened to 200-ms sequences of 4 consecutive tones, each of which could be of 1 of the 2 alternative frequencies. Each series of the sequences was divided into rare sequences (deviants) and frequently repeated sequences (standards). In the main series, deviants differed from standards in the ratio of the tones of the 2 frequencies. In the control series, the 2 frequencies themselves distinguished deviants from standards. The control series were simply aimed at demonstrating MMN per se (Näätänen 1990Go) in order to be able to attribute its possible absence in the main series to numerical rather than elementary aspects of auditory processing.


    Materials and Methods
 Top
 Abstract
 Introduction
 Materials and Methods
 Results
 Discussion
 References
 
Participants

Twelve participants (5 males and 7 females) between 19 and 37 years of age participated in the experiment. Participants provided their informed consent after the nature of the experiment was explained to them. During the experiment, they were comfortably seated in an electrically shielded and acoustically attenuated chamber. The participants were instructed to ignore sounds and to concentrate on a silent, subtitled playing movie on a video monitor.

Stimuli

Sequences of 4 consecutive tones of 50 ms in duration (including 10-ms onset/offset ramps) and 50 dB (sound pressure level) in intensity were emitted above the subject's hearing level as defined by the staircase method (Fig. 1a). Each tone of a sequence was either 1000 or 1500 Hz in frequency. There were 1000 sequences presented at a stimulus–onset–asynchrony (SOA) of 500 ms in each series. The order of the series was counterbalanced across the participants. The sequences were divided into those of low (deviants, P = 0.1) and those of high (standards, P = 0.9) sequential probability (Fig. 1b and c).


Figure 1
View larger version (29K):
[in this window]
[in a new window]
[Download PowerPoint slide]
 
Figure 1. (a) The illustration of the waveform of 1 sequence used. (b) All sequences used. Three sets of sequences (4:0, 2:2, and 3:1) corresponded to 3 different ratios of the tones of the 2 frequencies as specified by the names of these sets. They were used in the main series (S2:2 D4:0, S3:1 D4:0, S3:1 D2:2, and S2:2 D3:1, where S refers to standard and D to deviant, and 2 numbers separated by a colon, to the ratio of the tones of the 2 frequencies). Each of the sequences in a set had the same probability to be assigned to a series, and this probability eventually depended on the stimulus type to which the sequences were assigned. The remaining 2 sequences (1000 and 1500 Hz) corresponded to 2 sound frequencies that were used to distinguish standards from deviants in the control series (Controls A and B). (c) A schematic illustration of the temporal structure of the series of sequences.

 
Figure 1b illustrates the sequences used in the experiment. There were 3 sets of sequences for 3 different ratios of the tones of the 2 frequencies (4:0, 2:2, and 3:1) from which individual sequences were pseudorandomly (equiprobably) assigned to standards and deviants in the 4 main series (S2:2 D4:0, S3:1 D4:0, S2:2 D3:1, and S3:1 D2:2). The resulting nonsystematic ordering of the tones over their different sequences in the series prevented deviants from being detected as mere alterations in a constant melodic pattern of standards (Schröger et al. 1992Go). Rather, a given number of tones of a particular sequence had to occur until this detection could be made. More explicitly, for the S2:2 D4:0 series, for example, the first and the second tone of the D4:0 sequence could still belong to the S2:2 sequence (as only 2 identical tones had occurred thus far), but not the third identical tone (which could only match the 3:1 or the 4:0 ratio). In the 2 control series (A and B), 1 sequence (1000 or 1500 Hz) was assigned to standards and the other to deviants.

Recordings

The electroencephalogram was recorded (NeuroScan software, NeuroScan Co., El Paso, Texas) with Ag-AgCl electrodes from the F3, Fz F4, C3, Cz, C4, P3, Pz, and P4 scalp sites according to the 10–20 system. Eye-related activity was monitored with electrodes positioned at the outer canthus of and below the right eye. The signals from the electrodes referenced to the electrode at the tip of the nose were amplified, band-pass filtered (0.1–40 Hz, 24 dB per octave roll off), and digitized at a 200-Hz sampling rate.

Analysis

ERPs were averaged for each subject, series, and stimulus type (deviants and those standards that immediately preceded the deviants) and were corrected relative to the baseline (50-ms prestimulus mean amplitude). ERPs exceeding ±150 µV at any recording channel were rejected from the averages. Subsequently, we measured the mean amplitudes during the 60-ms period starting at 100 ms from the onset of the earliest tone of a sequence distinguishing deviants from standards. Depending on the series, this onset was of the first (control series), third (S2:2 D4:0 and S2:2 D3:1), or fourth (S3:1 D4:0 and S3:1 D2:2) tone of a sequence.

The resultant values were analyzed with a repeated measures analysis of variance with stimulus types (standard and deviant), laterality (left, middle, and right, respectively), and anterior–posterior (frontal, central, and parietal, respectively) as factors. An alpha level of 0.05 was used in all analyses. Greenhouse–Geisser–adjusted degrees of freedom for the averaged tests of significance were used whenever the sphericity assumption was violated; P values were reported accordingly.


    Results
 Top
 Abstract
 Introduction
 Materials and Methods
 Results
 Discussion
 References
 
The elicitation of MMN was found in the control series (Controls A and B in Fig. 1b). The average ERP amplitude shifted toward negative polarity by deviants relative to standards between 100 and 160 ms from deviance onset (Fig. 2), as shown by the significant main effect of stimuli (standard and deviant: Control A, F1,11 = 33.70, P < 0.001; Control B, F1,11 = 11.22, P < 0.01). The frontocentral scalp distribution of this shift was further indicated by the significant interactions of the main effect of stimuli with the main effect of anterior–posterior (frontal, central, and parietal: Control A, F2,22 = 13.50, P < 0.01; Control B, F2,22 = 26.11, P < 0.001) and of laterality (left, middle, and right: Control A, F2,22 = 9.60, P < 0.01; Control B, F2,22 = 8.64, P < 0.01). Stimuli x anterior posterior x laterality interaction was not significant.


Figure 2
View larger version (22K):
[in this window]
[in a new window]
[Download PowerPoint slide]
 
Figure 2. Grand-averaged electrophysiological responses to deviants and standards obtained at different scalp sites for the different series. For the explanations of the names of the main series, see the legend in Figure 1b. A sequence is illustrated below the x axis.

 
The elicitation of MMN was also found in one of the main series where it was the ratio of the tones of the 2 frequencies that distinguished deviants from standards. Namely, again, the average ERP amplitude shifted toward negative polarity by the deviant 4:0 ratio relative to the standard 2:2 ratio, especially in frontal leads between 100 and 160 ms from the onset of deviance (series S2:2 D4:0 in Figs. 1b and 2). This was indicated by a significant stimuli x anterior–posterior interaction (F2,22 = 6.43, P < 0.05), the main effect of stimuli remaining insignificant.

However, no MMN could be found with the deviant 4:0 ratio against the standard 3:1 ratio (series S3:1 D4:0 in Figs. 1b and 2). There was no significant stimuli x anterior–posterior interaction (F2,22 = 0.90, P = 0.4), and there was only a trend for the main effect of stimuli (F1,11 = 3.51, P = 0.088). The same held true with the deviant 2:2 ratio against the standard 3:1 ratio, and vice versa (series S3:1 D2:2 and S2:2 D3:1 in Figs. 1b and 2). In neither case was the main effect of stimuli or its interaction with the other main effects significant.


    Discussion
 Top
 Abstract
 Introduction
 Materials and Methods
 Results
 Discussion
 References
 
As expected, we found a robust frontocentral ERP deflection of negative polarity at 100–160 ms poststimulus, most likely MMN (Näätänen 1990Go), for passively heard deviants that differed from standards in their frequency (control series).

However, when deviants differed from standards in the ratio of the tones of the 2 frequencies, MMN was found only when this difference was large (S2:2 D4:0). With the small ratio differences (series S3:1 D4:0, S3:1 D2:2, and S2:2 D3:1), no MMN was observed, presumably because the fast pace of the tones made them difficult to quantify. After all, although one may easily discriminate between 4 seen or heard items from their respective sets of 3 (Dehaene et al. 1998Go; Gallistel and Gelman 2000Go; Gordon 2004Go), fast pace of sounds make them difficult to count even as general auditory activations (Garner 1951Go).

The standard 2:2 ratio was of 2 tones of each frequency, so already the third tone of a sequence made the deviant 4:0 ratio different from this ratio. This was because the ratio 2:2 could not include 3 identical tones, only 2. But the third tone alone was insufficient to actually elicit MMN. The next, fourth tone had to play a role also, given that no MMN was obtained for a difference of only one tone of each frequency between the deviant (4:0) and the standard (3:1) ratio.

MMN was observed by 160 ms from the onset of the earliest tone of a sequence distinguishing the deviant 4:0 from the standard 2:2 ratio, namely, the third one. Thus, in this short time period, a whole set of neural tasks were accomplished. The third tone was processed in the context of the 2 preceding tones to establish a neural correlate of the 3 tones of one frequency or of the absence of tones of another frequency, or both. One or both of these correlates, in turn, were compared with their transient memory (standard) counterparts (for 2 tones of one frequency or one tone of the other frequency, or both), and MMN was initially triggered as a result of a mismatch between the two.

But how were the numbers of the tones of the 2 frequencies per sequence actually reflected in the brain? The involvement of a visual (i.e., nonauditory) object-tracking mechanism (Kahneman et al. 1992Go) can obviously be excluded. Instead, a so-called accumulator mechanism could have automatically converted these numbers to continuous magnitudes (Meck and Church 1983Go), which is not the only possible explanation, however. The numbers could also have paralleled a frequency average (Cowan 1984Go) or a frequency variability (Wolff and Schröger 2001Go) across the tones of the 2 frequencies at preperceptual auditory storage (Näätänen and Winkler 1999Go). The frequency average, however, would also have reflected the cumulative durations of these tones that, in addition to their numbers, could have recruited the accumulator mechanism to process them (Meck and Church 1983Go).

Finally, there was only 1 frequency present in the deviant 4:0 ratio, whereas 2 frequencies were present in the standard 2:2 ratio. This implies that the MMN-eliciting changes might have been in the number of different frequencies per se rather than the number of tones of these frequencies. Further studies should therefore be conducted with higher total numbers of tones in a sequence to clarify this problem.

In conclusion, our findings suggest that the human brain possesses some ability to operate on the numbers of sequential sounds of specific frequencies, as yet incomplete in their conscious perceivability and thereby voluntary countability.


    Acknowledgments
 
We thank Jenni Väre and Vesa Putkinen for their help in data acquisition and analysis. The study was supported by the Academy of Finland. Conflict of Interest: None declared.


    References
 Top
 Abstract
 Introduction
 Materials and Methods
 Results
 Discussion
 References
 
Cowan N. On short and long auditory stores. Psychol Bull (1984) 96:341–370.[CrossRef][ISI][Medline]

Dehaene S, Dehaene-Lambertz G, Cohen L. Abstract representations of numbers in the animal and human brain. Trends Neurosci (1998) 21:355–361.[CrossRef][ISI][Medline]

Gallistel CR, Gelman R. Non-verbal numerical cognition: from reals to integers. Trends Cognit Sci (2000) 4:59–65.[CrossRef][ISI][Medline]

Garner WR. The accuracy of counting repeated short tones. J Exp Psychol (1951) 41:310–316.[CrossRef][ISI][Medline]

Gordon P. Numerical cognition without words: evidence from Amazonia. Science (2004) 306:469–499.[Abstract/Free Full Text]

Kahneman D, Treisman A, Gibbs BJ. Reviewing object files. Cognit Psychol (1992) 24:175–219.[CrossRef][ISI][Medline]

Meck WH, Church RM. A mode control model of counting and timing processes. J Exp Psychol Anim Behav Process (1983) 9:320–334.[CrossRef][ISI][Medline]

Näätänen R. The role of attention in auditory information processing as revealed by event-related potentials and other brain measures of cognitive function. Behav Brain Sci (1990) 13:201–288.[ISI]

Näätänen R, Tervaniemi M, Sussman E, Paavilainen P, Winkler I. ‘Primitive intelligence’ in the auditory cortex. Trends Neurosci (2001) 24:283–288.[CrossRef][ISI][Medline]

Näätänen R, Winkler I. The concept of auditory stimulus representation in cognitive neuroscience. Psychol Bull (1999) 125:826–859.[CrossRef][ISI][Medline]

Schröger E, Näätänen R, Paavilainen P. Event-related potentials reveal how non-attended complex sound patterns are represented by the human brain. Neurosci Lett (1992) 146:183–186.[CrossRef][ISI][Medline]

Wolff C, Schröger E. Activation of the auditory pre-attentive change detection system by tone repetitions with fast stimulation rate. Brain Res Cogn Brain Res (2001) 10:323–327.[CrossRef][Medline]


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
J. Cogn. Neurosci.Home page
R. S. Ehrlichman, C. R. Maxwell, S. Majumdar, and S. J. Siegel
Deviance-elicited Changes in Event-related Potentials are Attenuated by Ketamine in Mice
J. Cogn. Neurosci., August 1, 2008; 20(8): 1403 - 1414.
[Abstract] [Full Text] [PDF]


This Article
Right arrow Abstract Freely available
Right arrow FREE Full Text (PDF) Freely available
Right arrow All Versions of this Article:
17/12/2777    most recent
bhm005v1
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Ruusuvirta, T.
Right arrow Articles by Näätänen, R.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Ruusuvirta, T.
Right arrow Articles by Näätänen, R.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?