• Nem Talált Eredményt

A new paradigm for measuring the ERP correlates of auditory multistable

85 Study IV. A new paradigm for measuring the ERP correlates of auditory multistable

86

87

88

89

90

91

92

93

94

95

96

97

General discussion

The main goal of the studies presented in this thesis was to investigate the percept-inducing and percept-stabilizing cues and the ERP correlates of the perceptual processes of auditory stream segregation. The studies reported here utilized behavioral and electrophysiological measurements for better understanding auditory stream segregation.

Continuous measurements during bi-/multistable perception permit one to investigate various aspects of stream segregation while eliminating the effects originating from physical stimulus changes. Therefore, multistable stimulation was used in all four studies while the participant’s perception of the stimuli was continuously recorded. Our results showed that similarly to other cues based on spectral similarity/separation, amplitude modulation can both induce and stabilize auditory stream segregation, and it interacts with carrier frequency difference (Study I). We demonstrated that some cues based on spectro-temporal regularities, such as a melody, may also induce stream segregation (Study II). Studies on the neural correlates of auditory stream segregation (Study III and IV) demonstrated that the current perceptual interpretation influences sound processing both at the level of feature extraction and sound evaluation.

Percept-inducing and stabilizing cues

Whereas most of the studies on auditory stream segregation used cues based on spectral differences between the sounds (Hartmann & Johnson, 1991) there is much evidence showing that the auditory system also utilizes other types of cues (for reviews, see Moore &

Gockel, 2002; Moore & Gockel, 2012). Grimault et al. (2002) found that differences between the modulation frequency of two sets of sounds can evoke auditory stream segregation. Study I investigated the effects of amplitude modulation in a bistable auditory streaming paradigm together with carrier-frequency differences and differences in the perceived location. Based on the results, amplitude modulation difference not only stabilized streams but also induced

98 stream segregation, and the effect of amplitude modulation difference was partly overlapping with that of the carrier frequency difference and location difference. Also, we found that carrier frequency difference and location difference interacted with each other. This result contradicts the finding of independence between these two cues by Denham et al. (2010). It is likely that these interactions resulted from a ceiling effect: there was not enough room for further facilitating stream segregation by adding another cue.

Several studies indicated that rhythm perception doesn’t require musical training and the ability is present even in infancy (Demany et al., 1977; Honing et al., 2009; Winkler et al., 2009b). However, only few studies exist which investigated the effect rhythmic structure on stream segregation (but see Andreou et al., 2011; French-St George & Bregman, 1989;

Rimmele et al., 2012; Rogers & Bregman, 1993). Also, the effect of melody is not well understood. It was shown by Dowling and colleagues (Dowling, 1973; Dowling et al., 1987) that when a piece of familiar melody was presented to listeners and afterward this melody was presented together with distracting sounds from the same pitch range, participants were able to separate the different sound streams and recognize the familiar melody. These studies suggest that sequential stream segregation can depend on previous knowledge. Study II demonstrated that the presence of a melody facilitated switching towards segregation reducing the overall phase duration of the integrated percept and therefore utilized as a percept-inducing cue. However the effect of rhythm was due to the overlap between the two sets of sounds rather than the temporal arrangement of sounds within one set. An important finding is that the melodic pattern used in Study II induced stream segregation regardless of the familiarity. This result is contradictory to the results of Bendixen et al. (Bendixen et al., 2013;

Bendixen et al., 2010) who suggested that predictability-based cues such as temporal regularities can stabilize the sound streams but they do not induce stream segregation. In our case unfamiliar sound patterns were unpredictable as participants could not have predicted

99 upcoming sounds from the preceding ones. Similarly to our result Devergie et al. (Devergie et al., 2010) found that familiar melodies helped perceiving two stream without spectral differences between the two interleaved sequences. However in our experiment the effect was present even without familiarity of the melodies. Possible explanations of this result are discussed in detail in Study II. However, here we only wish to emphasize that Study II brought evidence against the notion that higher-order cues can only have percept stabilizing effects: we have shown that these cues can also induce stream segregation.

Percept-dependent processing of regular sounds

In Study IV we found that the amplitude of the P1 ERP component correlated with whether the sound was a part of the fore- or the background. When a sound was part of the background, it elicited a higher-amplitude P1 than when the same sound belonged to the foreground. In Study III the P1 was also enhanced when the percept was segregated for the first A and the B tones of the ABA triplets, whereas N1 was enhanced for the integrated percept. These results suggest that the perceived sound organization influences sound processing as early as 70 ms from stimulus onset.

Two different hypotheses have been put forward about the function of the P1 component. According to the baseline hypothesis, originally suggested by Luck et al. (Luck &

Hillyard, 1995; Luck et al., 1994) and Hillyard et al. (1998) the amplitude of the P1 decreases in unattended conditions compared to a neutral baseline. In contrast, the inhibition hypothesis proposes that the amplitude of the P1 increases with inhibition (Klimesch, 2011). This hypothesis suggests that the P1 doesn’t reflect sensory processing and cannot be explained as a sensory evoked component as it is not affected by the physical stimulus properties per se: P1 is only modulated by the stimulus properties when these features are relevant for early categorization or when the sounds are targets in the listeners’ task. Rather, P1 reflects

100 inhibition of task-irrelevant stimuli and/or networks. (Note, however that this hypothesis was put forward for the visual P1.) According to the inhibition hypothesis, larger P1 would be expected for the background sounds and not for the foreground sounds as the baseline hypothesis suggests. The results of Study IV are compatible with the inhibition hypothesis.

When participants perceived timbre A in the foreground, sounds with the timbre B acted as distractors (and vice versa). Therefore, perceiving a coherent falling A stream, B sounds might have been suppressed and, as a consequence, they elicited higher P1 compared to the falling A sounds. In contrast, in the rising percept we don’t find this difference between timbre A and B sounds as they constructed together the integrated percept (and no inhibition of the sounds was needed). However this interpretation is put into doubt by the results of Study III where the first A tone and the B tone of the ABA triplet elicited larger P1 when the percept was segregated. One possible explanation can be based on the fact that during the segregated percept, one of the two tones is part of the foreground, whereas the other is part of the background, and the foreground-background can switch both within and between segregated phases. If both the A and the B stream was perceived as background for sufficient proportion of time during the segregated phases (either by switching foreground and background within listeners or by some listeners predominantly perceiving one while others the other as background), then P1’s for both of these tones could have been enhanced at the group level.

Unfortunately, Study III provided no information regarding the selection of foreground within the segregated phases. It is also possible that the P1 modulations observed in the two studies may reflect partly different or interacting underlying effects, as the paradigms are not directly comparable. P1 is generally regarded as a component involved in feature extraction (see e.g., Näätänen & Winkler, 1999). Thus it is possible that the P1 enhancement reflects a modulation of feature extraction stream during segregation, or even the inhibition of this process for background sounds within the segregated sound organization. Therefore, further studies are

101 required regarding the P1 effects found in these two studies. Regardless of the precise theoretical interpretation, these studies showed that the percept influenced the sound processing as early as 70 ms from stimulus onset.

Percept-dependent processing of irregular sounds

There are several studies indicating that deviant sound processing follows the dominant percept (Rahne et al., 2007; Sussman et al., 1998, 1999; Winkler et al., 2003a;

Winkler et al., 2003b; Winkler et al., 2003c; Winkler et al., 2005). Most of these studies investigated the MMN elicitation corresponding to conscious perception (but see Winkler et al., 2005). However in the study of Ross et al. (1996) MMN elicitation did not match the listener’s percept. Rather, it followed the acoustic make-up of the sounds. Similarly, our results in Study IV showed that MMN elicitation did not follow the perceptual organization of the sounds. Rather, it followed the physical arrangement supporting that MMN is not directly related to auditory stream segregation. This result is consistent with results of other studies demonstrating dissociation between perception and MMN elicitation (Sussman et al., 2002;

Takegata et al., 2005; van Zuijen et al., 2006).

Some other ERP components displayed percept-related differences in processing irregular sounds. In Winkler et al.’ (2005) study, the auditory streaming paradigm was used with occasional sound omissions. They found an enhanced frontocentrally negative difference for the omission deviants when integration was perceived compared to the segregated percept.

The observed ERP component peaked at 176 ms from the onset of the expected tone. In Study III, we found similar effects for our frequency-deviant tones: Deviant tones encountered during experiencing the integrated percept elicited larger N2 than those encountered during the segregated percept. In contrast, the P3a component was larger during segregation than integration. This result might reflect the perceptual salience of the deviant tones in the two

102 cases. In a heterogeneous set of sounds (integrated streams contained two types of tones), a deviant sound may catch less attention, because a wider frequency range is covered by the auditory system. Both N2 and P3a are regarded as electrophysiological correlates of processes evaluating the contextual relevance of incoming sounds. Whereas the N2 component is regarded as an index of stimulus classification (Ritter et al., 1979), the P3a has been suggested to reflect attentional switching (Escera et al., 2000; Friedman et al., 2001; Polich, 2007;

Schröger, 1996), awareness of the participants that an unexpected event has occurred (Leppert et al., 2003), or contextual evaluation of novelty (Horváth et al., 2008). Our N2 finding might be explained by considering the position of the deviant tones. In Study III, deviants were always placed in the third position of the ABA triplets. Within this paradigm, when the sequence is perceived as integrated, the third tone defines the end of the repeating pattern. In contrast, when segregation is perceived, a continuous flow of sounds is heard and the information about the position is lost. Therefore deviation at the end of the triplets might be more relevant than in an arbitrary position within a homogeneous sequence. This could have caused the more negative N2 response for the integrated than for the segregated deviants. In summary irregular sound processing is strongly influenced by the current percept as was reflected by the N2 and P3a components.

A new stimulus paradigm for investigating auditory multistability

Study IV presented listeners with a multistable auditory stimulus sequence based on Wessel’s (1979) stimulus configuration. In this paradigm sounds of two different timbres were alternated. Within this paradigm, integration is defined as the perception of a coherent pattern composed of alternating timbres, whereas segregation as the perception of patterns with homogeneous timbres. The behavioral results found in Study IV showed similarities to studies employing variants of the auditory streaming paradigm (Bendixen et al., 2013;

Bendixen et al., 2010; Bőhm et al., 2013; Denham et al., 2014; Denham et al., 2010, 2013;

103 Pressnitzer & Hupé, 2006; Roberts et al., 2002). Participants reported perceptual switching throughout the stimulus blocks. Using timbre difference between the sounds, participants were able to easily decide which sound organization they perceived in the foreground.

Participants’ subjective report was that when they heard two streams only one of the timbres was perceived in the foreground. Therefore we suggest that this paradigm is a possible tool for investigating foreground-background discrimination with the advantages of bi-/multistable auditory stimulus configurations, that is, different perceptual foreground-background configurations without corresponding changes in the stimuli.

Conclusions

In conclusion, we found that both spectral and temporal-structure-based cues can induce auditory stream segregation. The fact that unfamiliar melodies induced stream segregation brings up the possibility that tonality analysis may influence auditory stream segregation. The nature of these processes requires further experiments. Our results showed multiple interactions between the processing of incoming sounds and the currently dominant representation of the sound sequence. Early effects of perceptual modulation in the ERP components reflected in the P1 and N1 occurred for regular sounds. Our results for the P1 suggest that enhancement of this component reflects inhibition of background tones rather than attention towards foreground tones. The processing of irregularity was affected in a later, evaluation stage, as reflected by modulations of the N2 and P3a ERP components. We have also tested a new auditory multistability paradigm for investigating the foreground-background decomposition of auditory scenes. With this paradigm, percept-dependent modulation of the P1 was obtained, yet MMN elicitation was independent of the percept.

Future studies should be conducted to clarify the relationship between the representations underlying the MMN ERP component and those involved in conscious perception.

104 References

Akeroyd, M. A., Carlyon, R. P., & Deeks, J. M. (2005). Can dichotic pitches form two streams? Journal of the Acoustical Society of America, 118(2), 977-981.

Alain, C., Achim, A., & Richer, F. (1993). Perceptual context and the selective attention effect on auditory event-related brain potentials. Psychophysiology, 30(6), 572-580.

Alain, C., & Bernstein, L. J. (2008). From sounds to meaning: the role of attention during auditory scene analysis. Current Opinion in Otolaryngology & Head and Neck Surgery, 16, 485–489.

Alain, C., Schuler, B. M., & McDonald, K. L. (2002). Neural activity associated with distinguishing concurrent auditory objects. Journal of the Acoustical Society of America, 111(2), 990-995.

Alain, C., & Woods, D. L. (1994). Signal clustering modulates auditory cortical activity in humans. Perception & Psychophysics, 56(5), 501-516.

Andreou, L. V., Kashino, M., & Chait, M. (2011). The role of temporal regularity in auditory segregation. Hear. Res., 280(1-2), 228-235.

Anstis, S., & Saida, S. (1985). Adaptation to auditory streaming of frequency-modulated tones. Journal of Experimental Psychology: Human Perception and Performance, 11(3), 257-271.

Baldeweg, T. (2006). Repetition effects to sounds: evidence for predictive coding in the auditory system. Trends in Cognitive Sciences, 10(3), 93-94.

Beauvois, M. W., & Meddis, R. (1996). Computer simulation of auditory stream segregation in alternating-tone sequences. Journal of the Acoustical Society of America, 99, 2270-2280.

Bee, M. A., & Klump, G. M. (2004). Primitive auditory stream segregation: a neurophysiological study in the songbird forebrain. Journal of Neurophysiology, 92(2), 1088-1104.

Bee, M. A., & Klump, G. M. (2005). Auditory stream segregation in the songbird forebrain:

effects of time intervals on responses to interleaved tone sequences. Brain, Behavior and Evolution, 66(3), 197-214.

Bendixen, A. (2014). Predictability effects in auditory scene analysis: a review. Frontiers in Neuroscience, 8, 60.

Bendixen, A., Bőhm, T., Szalárdy, O., Mill, R., Denham, S. L., & Winkler, I. (2013).

Different roles of similarity and predictability in auditory stream segregation.

Learning and Perception, 5(2), 37-54.

Bendixen, A., Denham, S. L., Gyimesi, K., & Winkler, I. (2010). Regular patterns stabilize auditory streams. Journal of the Acoustical Society of America, 128(6), 3658-3666.

Bendixen, A., Denham, S. L., & Winkler, I. (2014). Feature predictability flexibly supports auditory stream segregation or integration. Acta Acustica united with Acustica, 100(5), 888-899.

Bendixen, A., Schröger, E., & Winkler, I. (2009). I heard that coming: event-related potential evidence for stimulus-driven prediction in the auditory system. Journal of Neuroscience, 29(26), 8447-8451.

Bey, C., & McAdams, S. (2002). Schema-based processing in auditory scene analysis.

Perception & Psychophysics, 64(5), 844-854.

Bey, C., & McAdams, S. (2003). Postrecognition of interleaved melodies as an indirect measure of auditory stream formation. Journal of Experimental Psychology-Human Perception and Performance, 29(2), 267-279.

105 Blake, R., & Logothetis, N. K. (2002). Visual competition. Nature Reviews Neuroscience,

3(1), 13-23.

Boehnke, S. E., & Phillips, D. P. (2005). The relation between auditory temporal interval processing and sequential stream segregation examined with stimulus laterality differences. Perception & Psychophysics, 67(6), 1088-1101.

Bőhm, T. M., Shestopalova, L., Bendixen, A., Andreou, A. G., Georgiou, J., Garreau, G., et al. (2013). The role of perceived source location in auditory stream segregation:

Separation affects sound organization, common fate does not. Learning and Perception, 5(Supplement 2), 55-72.

Bregman, A. S. (1990). Auditory Scene Analysis: The Perceptual Organization of Sound.

Cambridge, MA: MIT Press.

Bregman, A. S., Ahad, P. A., Crum, P. A., & O'Reilly, J. (2000). Effects of time intervals and tone durations on auditory stream segregation. Attention, Perception & Psychophysics, 62(3), 626-636.

Broadbent, D. E. (1952). Listening to one of two synchronous messages. Journal of Experimental Psychology, 44(1), 51-55.

Brochard, R., Drake, C., Botte, M. C., & McAdams, S. (1999). Perceptual organization of complex auditory sequences: effect of number of simultaneous subsequences and frequency separation. Journal of Experimental Psychology- Human Perception and Performance, 25(6), 1742-1759.

Carlyon, R. P. (2004). How the brain separates sounds. Trends in Cognitive Sciences, 8(10), 465-471.

Carlyon, R. P., Cusack, R., Foxton, J. M., & Robertson, I. H. (2001). Effects of attention and unilateral neglect on auditory stream segregation. Journal of Experimental Psychology- Human Perception and Performance, 27(1), 115-127.

Carter, O., & Cavanagh, P. (2007). Onset rivalry: brief presentation isolates an early independent phase of perceptual competition. Plos One, 2(4), e343.

Cherry, E. C. (1953). Some experiments on the recognition of speech, with one and with two ears Journal of the Acoustical Society of America, 25, 975-981.

Conway, A. R., Cowan, N., & Bunting, M. F. (2001). The cocktail party phenomenon revisited: the importance of working memory capacity. Psychonomical Bulletin &

Review, 8(2), 331-335.

Cowan, N. (1984). On short and long auditory stores. Psychological Bulletin, 96(2), 341-370.

Cowan, N. (1988). Evolving conceptions of memory storage, selective attention, and their mutual constraints within the human information-processing system. Psychological Bulletin, 104(2), 163-191.

Cusack, R., Deeks, J., Aikman, G., & Carlyon, R. P. (2004). Effects of location, frequency region, and time course of selective attention on auditory scene analysis. Journal of Experimental Psychology: Human Perception and Performance, 30(4), 643-656.

Cusack, R., & Roberts, B. (2000). Effects of differences in timbre on sequential grouping.

Perception & Psychophysics, 62(5), 1112-1120.

Dannenbring, G. L., & Bregman, A. S. (1976). Stream segregation and the illusion of overlap.

Journal of Experimental Psychology- Human Perception and Performance, 2(4), 544-555.

Deike, S., Heil, P., Böckmann-Barthel, M., & Brechmann, A. (2012). The Build-up of Auditory Stream Segregation: A Different Perspective. Frontiers in Psychology, 3, 461.

Demany, L., McKenzie, B., & Vurpillot, E. (1977). Rhythm perception in early infancy.

Nature, 266(5604), 718-719.

106 Demany, L., & Semal, C. (2008). The Role of Memory in Auditory Perception. In W. A.

Yost, A. N. Popper & R. A. Fay (Eds.), Auditory perception of sound sources.

Springer handbook of auditory research (Vol. 29, pp. 77-113). New York: Springer.

Denham, S., Bőhm, T. M., Bendixen, A., Szalárdy, O., Kocsis, Z., Mill, R., et al. (2014).

Stable individual characteristics in the perception of multiple embedded patterns in multistable auditory stimuli. Frontiers in Neuroscience, 8(25).

Denham, S. L., Gyimesi, K., Stefanics, G., & Winkler, I. (2010). Stability of perceptual organisation in auditory streaming. In E. A. Lopez-Poveda, A. R. Palmer & R. Meddis (Eds.), The Neurophysiological Bases of Auditory Perception. (pp. 477-487). New York: Springer.

Denham, S. L., Gyimesi, K., Stefanics, G., & Winkler, I. (2013). Perceptual bi-stability in auditory streaming: How much do stimulus features matter? Learning and Perception, 5(2), 73-100.

Denham, S. L., & Winkler, I. (2006). The role of predictive models in the formation of auditory streams. Journal of Physiology, Paris, 100(1-3), 154-170.

Deutsch, D. (1974). An auditory illusion. Nature, 251(5473), 307-309.

Devergie, A., Grimault, N., Tillmann, B., & Berthommier, F. (2010). Effect of rhythmic attention on the segregation of interleaved melodies. Journal of the Acoustical Society of America, 128(1), 1-7.

Dolležal, L.-V., Beutelmann, R., & Klump, G. M. (2012). Stream segregation in the perception of sinusoidally amplitude-modulated tones. PloS One, 7(9), e43615.

Dowling, W. J. (1973). The perception of interleaved melodies. Cognitive Psychology, 5, 322-337.

Dowling, W. J., Lung, K. M., & Herrbold, S. (1987). Aiming attention in pitch and time in the perception of interleaved melodies. Perception & Psychophysics, 41(6), 642-656.

Du, Y., He, Y., Ross, B., Bardouille, T., Wu, X. H., Li, L. A., et al. (2011). Human Auditory Cortex Activity Shows Additive Effects of Spectral and Spatial Cues during Speech Segregation. Cerebral Cortex, 21(3), 698-707.

Duke, R. A. (1989). Musicians' Perception of Beat in Monotonic Stimuli. Journal of Research in Music Education, 37, 61-71.

Durlach, N. I., Mason, C. R., Kidd, G., Jr., Arbogast, T. L., Colburn, H. S., & Shinn-Cunningham, B. G. (2003). Note on informational masking. Journal of the Acoustical Society of America, 113(6), 2984-2987.

Elhilali, M., Ma, L., Micheyl, C., Oxenham, A. J., & Shamma, S. A. (2009). Temporal coherence in the perceptual organization and cortical representation of auditory scenes. Neuron, 61(2), 317-329.

Escera, C., Alho, K., Schröger, E., & Winkler, I. (2000). Involuntary attention and distractibility as evaluated with event-related brain potentials. Audiology and Neuro-Otology, 5(3-4), 151-166.

Fishman, Y. I., Arezzo, J. C., & Steinschneider, M. (2004). Auditory stream segregation in monkey auditory cortex: effects of frequency separation, presentation rate, and tone duration. Journal of the Acoustical Society of America, 116(3), 1656-1670.

Fishman, Y. I., Reser, D. H., Arezzo, J. C., & Steinschneider, M. (2001). Neural correlates of auditory stream segregation in primary auditory cortex of the awake monkey. Hearing Research, 151(1-2), 167-187.

French-St George, M., & Bregman, A. S. (1989). Role of predictability of sequence in auditory stream segregation. Perception & Psychophysics, 46(4), 384-386.

Friedman, D., Cycowicz, Y. M., & Gaeta, H. (2001). The novelty P3: an event-related brain potential (ERP) sign of the brain's evaluation of novelty. Neuroscience and Biobehavioral Reviews, 25(4), 355-373.

107 Fritz, J. B., Elhilali, M., David, S. V., & Shamma, S. A. (2007). Does attention play a role in dynamic receptive field adaptation to changing acoustic salience in A1? Hearing Research, 229(1-2), 186-203.

Furst, M., Levine, R. A., & McGaffigan, P. M. (1985). Click lateralization is related to the beta component of the dichotic brainstem auditory evoked potentials of human subjects. Journal of the Acoustical Society of America, 78(5), 1644-1651.

Grey, J. M. (1977). Multidimensional perceptual scaling of musical timbres. Journal of the Acoustical Society of America, 61(5), 1270-1277.

Griffiths, T. D., & Warren, J. D. (2004). What is an auditory object? Nature Reviews Neuroscience, 5(11), 887-892.

Grimault, N., Bacon, S. P., & Micheyl, C. (2002). Auditory stream segregation on the basis of amplitude-modulation rate. Journal of the Acoustical Society of America, 111(3), 1340-1348.

Grimm, S., & Schröger, E. (2007). The processing of frequency deviations within sounds:

evidence for the predictive nature of the Mismatch Negativity (MMN) system.

Restorative Neurology and Neuroscience, 25(3-4), 241-249.

Gutschalk, A., & Dykstra, A. R. (2014). Functional imaging of auditory scene analysis.

Hearing Research, 307, 98-110.

Gutschalk, A., Micheyl, C., Melcher, J. R., Rupp, A., Scherg, M., & Oxenham, A. J. (2005).

Neuromagnetic correlates of streaming in human auditory cortex. Journal of Neuroscience, 25(22), 5382-5388.

Hartmann, W. M., & Johnson, D. (1991). Stream segregation and peripheral channeling.

Music Perception, 9(2), 155-184.

Hartmann, W. M., McAdams, S., & Smith, B. K. (1990). Hearing a mistuned harmonic in an otherwise periodic complex tone. Journal of the Acoustical Society of America, 88(4), 1712-1724.

Hautus, M. J., & Johnson, B. W. (2005). Object-related brain potentials associated with the perceptual segregation of a dichotically embedded pitch. Journal of the Acoustical Society of America, 117(1), 275-280.

Haykin, S., & Chen, Z. (2005). The cocktail party problem. Neural Comput, 17(9), 1875-1902.

Hillyard, S. A., & Anllo-Vento, L. (1998). Event-related brain potentials in the study of visual selective attention. Proceedings of the National Academy of Sciences of the United States of America, 95(3), 781-787.

Hillyard, S. A., Hink, R. F., Schwent, V. L., & Picton, T. W. (1973). Electrical signs of selective attention in the human brain. Science, 182(108), 177-180.

Hillyard, S. A., & Picton, T. W. (1987). Electrohysiology of cognition. Bethesda: Waverly Press.

Honing, H., Ladinig, O., Háden, G. P., & Winkler, I. (2009). Is beat induction innate or learned? Probing emergent meter perception in adults and newborns using event-related brain potentials. Annals of the New York Academy of Sciences, 1169, 93-96.

Horváth, J., Winkler, I., & Bendixen, A. (2008). Do N1/MMN, P3a, and RON form a strongly coupled chain reflecting the three stages of auditory distraction? Biological Psychology, 79(2), 139-147.

Hulse, S. H., MacDougall-Shackleton, S. A., & Wisniewski, A. B. (1997). Auditory scene analysis by songbirds: stream segregation of birdsong by European starlings (Sturnus vulgaris). Journal of Comparative Physiology, 111(1), 3-13.

Itatani, N., & Klump, G. M. (2009). Auditory streaming of amplitude-modulated sounds in the songbird forebrain. Journal of Neurophysiology, 101(6), 3212-3225.

108 Iverson, P. (1995). Auditory stream segregation by musical timbre: Effects of static and dynamic acoustic attributes. Journal of Experimental Psychology: Human Perception and Performance, 21(4), 751-763.

Iverson, P., & Krumhansl, C. L. (1993). Isolating the dynamic attributes of musical timbre.

Journal of the Acoustical Society of America, 94(5), 2595-2603.

Jones, D., Alford, D., Bridges, A., Tremblay, S., & Macken, B. (1999a). Organizational factors in attention: The interplay of acoustic distinctiveness and auditory streaming in the irrelevant sound effect. Journal of Experimental Psychology: Learning, Memory and Cognition, 25, 464–473.

Jones, D. M., Saint-Aubin, J., & Tremblay, S. (1999b). Modulation of the irrelevant sound effect by organizational factors: Further evidence from streaming by location.

Quarterly Journal of Experimental Psychology, 52A, 545-554.

Jones, M. R. (1976). Time, our lost dimension: toward a new theory of perception, attention, and memory. Psychological Review, 83, 323-355.

Jones, M. R., & Boltz, M. (1989). Dynamic attending and responses to time. Psychological Review, 96(3), 459-491.

Joris, P. X., Schreiner, C. E., & Rees, A. (2004). Neural processing of amplitude-modulated sounds. Physiological Reviews, 84(2), 541-577.

Kaernbach, C. (2004). The memory of noise. Experimental Psychology, 51(4), 240-248.

Kang, M. S., & Blake, R. (2010). What causes alternation in dominance during binocular rivalry? Proceedings of the National Academy of Sciences of the United States of America, 93, 179-186.

Klimesch, W. (2011). Evoked alpha and early access to the knowledge system: the P1 inhibition timing hypothesis. Brain Research, 1408, 52-71.

Korzyukov, O. A., Winkler, I., Gumenyuk, V. I., & Alho, K. (2003). Processing abstract auditory features in the human auditory cortex. Neuroimage, 20(4), 2245-2258.

Kowalski, N., Depireux, D. A., & Shamma, S. A. (1996). Analysis of dynamic spectra in ferret primary auditory cortex. II. Prediction of unit responses to arbitrary dynamic spectra. Journal of Neurophysiology, 76(5), 3524-3534.

Köhler, W. (1947). Gestalt psychology: An introduction to new concepts in modern psychology. New York: Liveright Publishing Corporation.

Krumhansl, C. L. (1989). Why is musical timbre so hard to understand? In S. Nielzén & O.

Olssen (Eds.), Structure and perception of electroacoustic sound and music (pp. 43-53). Amsterdam: Excerpta Medica.

Kujala, T., Tervaniemi, M., & Schröger, E. (2007). The mismatch negativity in cognitive and clinical neuroscience: theoretical and methodological considerations. Biological Psychology, 74(1), 1-19.

Ladinig, O., Honing, H., Háden, G. P., & Winkler, I. (2009). Probing attentive and pre-attentive emergent meter in adult listeners without extensive music training. Music Perception, 26(4), 377-386.

Leopold, D. A., & Logothetis, N. K. (1999). Multistable phenomena: Changing views in perception. Trends in Cognitive Sciences, 3(7), 254-264.

Leppert, D., Goodin, D. S., & Aminoff, M. J. (2003). Stimulus recognition and its relationship to the cerebral event-related potential. Neurology, 61(11), 1533-1537.

Liberman, A. M., Isenberg, D., & Rakerd, B. (1981). Duplex perception of cues for stop consonants: evidence for a phonetic mode. Perception & Psychophysics, 30(2), 133-143.

Liégeois-Chauvel, C., Lorenzi, C., Trébuchon, A., Régis, J., & Chauvel, P. (2004). Temporal envelope processing in the human left and right auditory cortices. Cerebral Cortex, 14(7), 731-740.

109 Lu, T., Liang, L., & Wang, X. (2001). Temporal and rate representations of time-varying signals in the auditory cortex of awake primates. Nature Neuroscience, 4(11), 1131-1138.

Luck, S. J., & Hillyard, S. A. (1995). The role of attention in feature detection and conjunction discrimination: an electrophysiological analysis. International Journal of Neuroscience, 80(1-4), 281-297.

Luck, S. J., Hillyard, S. A., Mouloua, M., Woldorff, M. G., Clark, V. P., & Hawkins, H. L.

(1994). Effects of spatial cuing on luminance detectability: psychophysical and electrophysiological evidence for early selection. Journal of Experimental Psychology: Human Perception and Performance, 20(4), 887-904.

MacDougall-Shackleton, S. A., Hulse, S. H., & Ball, G. F. (1998). Neural bases of song preferences in female zebra finches (Taeniopygia guttata). Neuroreport, 9(13), 3047-3052.

Mamassian, P., & Goutcher, R. (2005). Temporal dynamics in bistable perception. J Vis, 5(4), 361-375.

McAdams, S., & Bregman, A. S. (1979). Hearing musical streams. Computer Music Journal, 4, 26-43.

McCabe, S., & Denham, M. J. (1997). A model of auditory streaming. Journal of the Acoustical Society of America, 101, 1611-1621.

Meddis, R., & Hewitt, M. J. (1991). Virtual pitch and phase sensitivity of a computer model of the auditory periphery. I: Pitch identification. Journal of the Acoustical Society of America, 89(6), 2866-2882.

Micheyl, C., Tian, B., Carlyon, R. P., & Rauschecker, J. P. (2005). Perceptual organization of tone sequences in the auditory cortex of awake macaques. Neuron, 48(1), 139-148.

Mill, R., Sheik, S., Indiveri, G., & Denham, S. L. (2011). A Model of Stimulus-Specific Adaptation in Neuromorphic Analog VLSI. IEEE Trans Biomed Circuits Syst, 5(5), 413-419.

Mill, R. W., Bőhm, T. M., Bendixen, A., Winkler, I., & Denham, S. L. (2013). Modelling the emergence and dynamics of perceptual organisation in auditory streaming. PLoS Computational Bioligy, 9(3), e1002925.

Moore, B. C., Glasberg, B. R., & Peters, R. W. (1986). Thresholds for hearing mistuned partials as separate tones in harmonic complexes. Journal of the Acoustical Society of America, 80(2), 479-483.

Moore, B. C., & Gockel, H. E. (2011). Resolvability of components in complex tones and implications for theories of pitch perception. Hearing Research, 276(1-2), 88-97.

Moore, B. C. J., & Gockel, H. (2002). Factors influencing sequential stream segregation. Acta Acustica United with Acustica, 88(3), 320-333.

Moore, B. C. J., & Gockel, H. E. (2012). Properties of auditory stream formation.

Philosophical transactions of the Royal Society of London. Series B, Biological sciences, 367(1591), 919-931.

Moray, N. (1959). Attention in dichotic listening: Affective cues and the influence of instructions. Quarterly Journal of Experimental Psychology, 11(1), 56-60.

Moss, C. F., & Surlykke, A. (2001). Auditory scene analysis by echolocation in bats. Journal of the Acoustical Society of America, 110(4), 2207-2226.

Müller, D., Widmann, A., & Schröger, E. (2005). Auditory streaming affects the processing of successive deviant and standard sounds. Psychophysiology, 42(6), 668-676.

Näätänen, R., Gaillard, A. W. K., & Mantysalo, S. (1978). Early Selective-Attention Effect on Evoked-Potential Reinterpreted. Acta Psychologica, 42(4), 313-329.

110 Näätänen, R., Paavilainen, P., Rinne, T., & Alho, K. (2007). The mismatch negativity (MMN) in basic research of central auditory processing: A review. Clinical Neurophysiology, 118(12), 2544-2590.

Näätänen, R., & Winkler, I. (1999). The concept of auditory stimulus representation in cognitive neuroscience. Psychological Bulletin, 125(6), 826-859.

Nager, W., Teder-Salejärvi, W., Kunze, S., & Münte, T. F. (2003). Preattentive evaluation of multiple perceptual streams in human audition. Neuroreport, 14(6), 871-874.

Neff, D. L., & Green, D. M. (1987). Masking produced by spectral uncertainty with multicomponent maskers. Perception & Psychophysics, 41(5), 409-415.

Niebur, E., Hsiao, S. S., & Johnson, K. O. (2002). Synchrony: a neuronal mechanism for attentional selection? Current Opinion in Neurobiology, 12(2), 190-194.

Paavilainen, P., Arajärvi, P., & Takegata, R. (2007). Preattentive detection of nonsalient contingencies between auditory features. Neuroreport, 18(2), 159-163.

Plack, C. J., & Oxenham, A. J. (2005). Overview: The Present and Future of Pitch. In R. R.

Fay & A. N. Popper (Eds.), Pitch: Neural Coding and Perception (pp. 1-7). New York: Springer.

Polich, J. (2007). Updating p300: An integrative theory of P3a and P3b. Clinical Neurophysiology, 118(10), 2128-2148.

Pressnitzer, D., & Hupé, J.-M. (2006). Temporal dynamics of auditory and visual bistability reveal common principles of perceptual organization. Current Biology, 16(13), 1351-1357.

Pressnitzer, D., Sayles, M., Micheyl, C., & Winter, I. M. (2008). Perceptual organization of sound begins in the auditory periphery. Current Biology, 18(15), 1124-1128.

Rahne, T., Böckmann, M., von Specht, H., & Sussman, E. S. (2007). Visual cues can modulate integration and segregation of objects in auditory scene analysis. Brain Research, 1144, 127-135.

Rand, T. C. (1974). Letter: Dichotic release from masking for speech. Journal of the Acoustical Society of America, 55(3), 678-680.

Riecke, L., Esposito, F., Bonte, M., & Formisano, E. (2009). Hearing illusory sounds in noise:

the timing of sensory-perceptual transformations in auditory cortex. Neuron, 64(4), 550-561.

Riedel, H., & Kollmeier, B. (2002). Auditory brain stem responses evoked by lateralized clicks: is lateralization extracted in the human brain stem? Hearing Research, 163(1-2), 12-26.

Rimmele, J., Schröger, E., & Bendixen, A. (2012). Age-related changes in the use of regular patterns for auditory scene analysis. Hearing Research, 289(1-2), 98-107.

Ritter, W., Simson, R., Vaughan, H. G., & Friedman, D. (1979). Brain Event Related to the Making of a Sensory Discrimination. Science, 203(4387), 1358-1361.

Ritter, W., Sussman, E., & Molholm, S. (2000). Evidence that the mismatch negativity system works on the basis of objects. Neuroreport, 11(1), 61-63.

Roberts, B., Glasberg, B. R., & Moore, B. C. J. (2002). Primitive stream segregation of tone sequences without differences in fundamental frequency or passband. Journal of the Acoustical Society of America, 112(5), 2074-2085.

Rogers, W. L., & Bregman, A. S. (1993). An experimental evaluation of three theories of auditory stream segregation. Perception & Psychophysics, 53(2), 179-189.

Rose, M. M., & Moore, B. C. (2000). Effects of frequency and level on auditory stream segregation. Journal of the Acoustical Society of America, 108(3 Pt 1), 1209-1214.

Ross, J., Tervaniemi, M., & Näätänen, R. (1996). Neural mechanisms of the octave illusion:

electrophysiological evidence for central origin. Neuroreport, 8(1), 303-306.

111 Sach, A. J., & Bailey, P. J. (2004). Some characteristics of auditory spatial attention revealed

using rhythmic masking release. Perception & Psychophysics, 66(8), 1379-1387.

Schröger, E. (1996). A neural mechanism for involuntary attention shifts to changes in auditory stimulation. Journal of Cognitive Neuroscience, 8(6), 527-539.

Schröger, E. (2007). Mismatch negativity: A microphone into auditory memory. Journal of Psychophysiology, 21(3-4), 138-146.

Schröger, E., Bendixen, A., Denham, S. L., Mill, R. W., Bőhm, T. M., & Winkler, I. (2014).

Predictive regularity representations in violation detection and auditory stream segregation: from conceptual to computational models. Brain Topography, 27(4), 565-577.

Schul, J., & Sheridan, R. A. (2006). Auditory stream segregation in an insect. Neuroscience, 138(1), 1-4.

Schwartz, J. L., Grimault, N., Hupé, J. M., Moore, B. C., & Pressnitzer, D. (2012).

Multistability in perception: binding sensory modalities, an overview. Philosophical Transactions of the Royal Society London B: Biological Sciences, 367(1591), 896-905.

Shamma, S. A., Elhilali, M., & Micheyl, C. (2011). Temporal coherence and attention in auditory scene analysis. Trends in Neurosciences, 34(3), 114-123.

Sheft, S. (2008). Envelope processing and sound-source perception. In W. A. Yost, A. N.

Popper & R. R. Fay (Eds.), Auditory Perception of Sound Sources (Vol. 29, pp. 233-280): Springer.

Shinn-Cunningham, B. G. (2008). Object-based auditory and visual attention. Trends in Cognitive Sciences, 12(5), 182-186.

Singh, P. G., & Bregman, A. S. (1997). The influence of different timbre attributes on the perceptual segregation of complex-tone sequences. Journal of the Acoustical Society of America, 102(4), 1943-1952.

Snyder, J. S., & Alain, C. (2007). Toward a neurophysiological theory of auditory stream segregation. Psychological Bulletin, 133(5), 780-799.

Snyder, J. S., Gregg, M. K., Weintraub, D. M., & Alain, C. (2012). Attention, awareness, and the perception of auditory scenes. Frontiers in Psychology, 3, 15.

Snyder, J. S., & Weintraub, D. M. (2011). Pattern specificity in the effect of prior Δƒ on auditory stream segregation. Journal of Experimental Psychology- Human Perception and Performance, 37(5), 1649-1656.

Sussman, E., Ritter, W., & Vaughan, H. G. (1999). An investigation of the auditory streaming effect using event-related brain potentials. Psychophysiology, 36(1), 22-34.

Sussman, E., Ritter, W., & Vaughan, H. G., Jr. (1998). Attention affects the organization of auditory input associated with the mismatch negativity system. Brain Research, 789(1), 130-138.

Sussman, E., Winkler, I., Huotilainen, M., Ritter, W., & Näätänen, R. (2002). Top-down effects can modify the initially stimulus-driven auditory organization. Cognitive Brain Research, 13(3), 393-405.

Sussman, E. S. (2007). A new view on the MMN and attention debate - The role of context in processing auditory events. Journal of Psychophysiology, 21(3-4), 164-175.

Sussman, E. S., Bregman, A. S., Wang, W. J., & Khan, F. J. (2005). Attentional modulation of electrophysiological activity in auditory cortex for unattended sounds within multistream auditory environments. Cognitive, Affective, and Behavioral Neuroscience, 5(1), 93-110.

Sussman, E. S., Horváth, J., Winkler, I., & Orr, M. (2007). The role of attention in the formation of auditory streams. Perception & Psychophysics, 69(1), 136-152.