In the auditory system, acoustic information conveyed by hair cells and auditory nerves is broken up into different components in central neural circuits, which can then be processed and encoded separately. This process relies on the generation of a variety of response features different from auditory nerve activity. Inhibition in the central circuits proves crucial for the creation and refinement of these functional response properties. Previous understanding of inhibitory mechanisms for auditory information processing has been limited by the methodology of deriving inhibition indirectly from spike and membrane potential responses. Recent application of in vivo whole cell voltage-clamp recordings (iVCRs) to auditory cortical neurons directly reveals the spectral and temporal properties of synaptic inhibition evoked by auditory stimuli. These findings provide new insights into how cortical inhibition shapes spike responses of excitatory neurons through its specific interaction with their excitatory synaptic input. This review highlights our current understanding of cortical inhibitory mechanisms underlying several fundamental functional properties of auditory cortical neurons. In particular, we propose that the variation in spectrotemporal pattern of cortical inhibition in relation to excitation contributes to the functional diversity of auditory cortex.
In the central auditory system, a variety of response features that do not resemble auditory nerve activity are found. For example, although the auditory nerve fire spikes continuously during sound duration, some central auditory neurons only respond transiently to the onset or the offset of sound stimuli (Fig. 1). The firing rate of the auditory nerve increases monotonically as sound intensity increases, whereas that of some central auditory neurons reaches a peak and then declines with further intensity increments. These diverse functional properties may set a foundation for parallel processing of different components of acoustic information. It is believed that the generation of many of these novel response properties depends on inhibitory circuits. However, the detailed underlying mechanisms remain largely unclear, as it has been difficult to directly reveal synaptic inhibition in previous studies. Recently, in vivo whole cell recording techniques, especially, those of iVCR, have been successfully applied to auditory cortical neurons. These studies provided new insights into the inhibitory synaptic circuitry basis for the generation and refinement of these functional response properties, even though many of them are originated in subcortical nuclei.
Cortical responses are know to be strongly influenced by synaptic inhibition, which plays important roles in defining frequency–intensity receptive fields (RFs) and shaping sound-evoked responses of individual cortical neurons (e.g., Feng and Ratnam, 2000; Wang et al., 2000, 2002; Ojima and Murakami, 2002; Oswald et al., 2006). The inhibitory control is mediated by cortical GABAergic interneurons through their feedforward or feedback projections. To understand the inhibitory contribution to cortical information processing, two major questions need to be addressed: (1) How do GABAergic interneurons respond during cortical processing? (2) What kind of interplay between coactivated excitatory and inhibitory synaptic inputs to the cortical neuron determines the inhibitory shaping of its responses?
Response properties of inhibitory neurons
Because of the relative sparseness of GABAergic interneurons, which account for only 15–25% of total cortical neurons (Peters and Kara, 1985; Hendry et al., 1987; Prieto et al., 1994), our knowledge on in vivo functional properties of these cells has lagged far behind that of pyramidal cells. The inhibitory neuron population contains more than a dozen morphologically and neurochemically distinct subgroups (Kawaguchi and Kondo, 2002; Markram et al., 2004). The high heterogeneity of inhibitory neurons, as well as the technical challenge of identifying inhibitory neurons with either extracellular or intracellular recordings, greatly increases the difficulty in studying this population of cortical cells. Current understandings of response properties of cortical inhibitory neurons are mostly on fast-spike neurons, which exhibit distinctive narrow spike waveforms and can be identified with extracellular recordings (Mountcastle et al., 1969; Swadlow, 2003; Atencio and Schreiner, 2008; Wu et al., 2008). These cells have also been categorized as basket or chandelier cells based on morphology, and parvalbumin-positive neurons based on molecular markers (Kawaguchi and Kondo, 2002; Markram et al., 2004). Extracellular recording studies demonstrate that fast-spike inhibitory neurons exhibit different functional properties from excitatory neurons (Atencio and Schreiner, 2008; Wu et al., 2008). They display broader spectral tuning, shorter response latency, lower intensity threshold, and higher response reliability, whereas their preferred frequency is essentially the same as that of excitatory neurons in the same column. Intracellular recordings further indicate that the spectral range of excitatory inputs received by fast-spike neurons is not different from that of nearby excitatory neurons (Wu et al., 2008), suggesting that the two classes of cells receive common thalamocortical inputs. However, fast-spike neurons are more efficient in converting synaptic input to spike output, resulting in more broadly tuned spike responses than excitatory neurons. This observed broader tuning of fast-spike neurons is also consistent with results in other sensory cortices (Bruno and Simons, 2002; Swadlow, 2003; Niell and Stryker, 2008; Liu et al., 2009; Kerlin et al., 2010; Ma et al., 2010). These unique response properties of fast-spike neurons place them in a strong position to supply fast, reliable, and temporally precise feedforward inhibition to other cells (Gabernet et al., 2005). Functional properties of other inhibitory neuron subtypes and their involvement in auditory cortical processing remain to be addressed.
Approaches to unraveling cortical inhibitory mechanisms
To understand the role of inhibition in shaping auditory cortical function, two approaches have been applied. First, functional properties of neurons have been compared before and after removing cortical inhibition pharmacologically. Local iontophoretic application of an antagonist of GABAA receptors results in the broadening of frequency tuning of cortical neurons (Wang et al., 2000, 2002). These results provide evidence for an inhibitory sharpening of frequency tuning. However, they could not generate deeper insights into detailed underlying mechanisms. Second, RF properties of cortical inhibition have been indirectly derived by examining spike responses. In a two-tone forward-masking paradigm, inhibition is revealed by the suppression of responses to a characteristic frequency (CF) tone caused by a leading tone (Calford and Semple, 1995; Chen and Jen, 2000; Sutter and Loftus, 2003; Zhang et al., 2003). In these studies, it is found that inhibitory RFs flank the excitatory RFs, leading to the proposal of a lateral inhibition model. Inhibition has also been estimated based on the suppression of spontaneous spiking activity (e.g., Qin and Sato, 2004; Sadagopan and Wang, 2010; Zhou et al., 2010). Furthermore, intracellular sharp-electrode recording has revealed inhibition as a hyperpolarizing membrane potential response (Ojima and Murakami, 2002). The above studies, although providing persuasive evidence for inhibition, cannot give a quantitative measurement of inhibition. For example, in the two-tone suppression experiments, the leading tone may result in both excitation and inhibition, and the apparent suppression of the CF-tone response cannot be simply viewed as a pure inhibitory effect. In addition, short-term plasticity of excitatory and inhibitory inputs should also be considered because the temporally close two tones may not activate completely independent pathways.
In previous intracellular or whole cell recording experiments, many attempts to derive synaptic conductances may have been compromised by the high impedance of recording microelectrodes. Recently, the successful application of high-quality iVCR technique opens the door to probing into synaptic circuits underlying cortical processing. Only with low access–resistance whole cell recording and sufficient voltage clamp of neuronal membranes has it become possible to isolate excitatory and inhibitory synaptic conductances reliably (Wehr and Zador, 2003; Tan et al., 2004; Wu et al., 2006; Liu et al., 2007). It is worth noting that an evaluation of clamping quality cannot be based simply on the linearity of current–voltage relationship, but should also be based on the proximity of measured reversal potentials of synaptic currents to theoretical values. The iVCR technique is particularly important and useful for investigating inhibition because indirect derivations of inhibition could be problematic, as discussed above. Recent studies in the primary auditory cortex (A1) using the iVCR technique have revealed inhibitory patterns underlying several fundamental functional properties of cortical neurons, such as selectivity for auditory features and specific temporal response profiles (Wehr and Zador, 2003, 2005; Zhang et al., 2003; Tan et al., 2004, 2007; Wu et al., 2006, 2008; Liu et al., 2007; Sun et al., 2010; Zhou et al., 2010). The results provide us with a more thorough picture on the inhibitory mechanisms underlying auditory cortical processing.
Temporal shaping of auditory responses by cortical inhibition
Sound-evoked responses of individual cortical neurons are primarily determined by the temporal integration of coactivated excitatory and inhibitory synaptic inputs to the cell. Recent iVCR studies have allowed a detailed comparison of onset latency between excitation and inhibition evoked by the same stimulus. The results revealed that the temporal relationship between excitation and inhibition is not fixed, but varies in different cortical locations as to fulfill different processing functions. Three salient excitatory–inhibitory temporal relationships have been observed.
Canonical inhibitory delay.
A stereotyped excitatory–inhibitory temporal relationship is found for layer 4 neurons in the auditory cortex, with the onset of inhibition delayed by 2–3 ms compared with that of excitation (Wehr and Zador, 2003; Tan et al., 2004; Wu et al., 2008; Zhou et al., 2010). This relative onset of inhibition appears constant across different tone frequencies (Wehr and Zador, 2003). A similar excitation–inhibition response sequence is also widely observed in other sensory cortices (Douglas and Martin, 1991, 2004; Higley and Contreras, 2006; Liu et al., 2010). As shown in Fig. 2 A, the briefly delayed inhibition has three effects on the membrane potential response: (1) suppressing the depolarization response level; (2) narrowing the time window for depolarizing response; and (3) creating a relatively long period of hyperpolarization after the initial depolarization. Such excitatory–inhibitory interplay selectively allows spikes to occur within the narrow depolarization window while it prevents spiking during the delayed long period of hyperpolarization (Fig. 2 A). This leads to transient onset spike responses in cortical neurons with relatively precise spike timing, and also allows the neuron to behave as a coincidence detector for synchronous inputs (Pouille and Scanziani, 2001; Wehr and Zador, 2003; Tan et al., 2004; Higley and Contreras, 2006). This canonical excitatory–inhibitory temporal relationship can be attributed to a feedforward inhibitory circuit, with disynaptic inhibitory inputs provided most likely by fast-spike inhibitory neurons (Tan et al., 2004; Gabernet et al., 2005; Wu et al., 2008; Ma et al., 2010).
Intensity-dependent inhibitory delay.
Intensity-tuned auditory neurons are characterized by their nonmonotonic responses to sound intensities (Phillips et al., 1995; Heil and Irvine, 1998; Sutter and Loftus, 2003) and have been proposed to play important roles in encoding sound loudness and envelop transients (Heil and Irvine, 1998; Polley et al., 2004). Because auditory nerve responses all exhibit monotonic rate-level functions, intensity tuning must be created in the central auditory pathway, likely through specific spectral and temporal interactions between excitation and inhibition (Suga and Manabe, 1982; Faingold et al., 1991; Pollak and Park, 1993; Calford and Semple, 1995; Ojima and Murakami, 2002; Wang et al., 2002; Sutter and Loftus, 2003; Sivaramakrishnan et al., 2004). Recent iVCR recordings from intensity-tuned cortical neurons indicate that although excitatory input already exhibits intensity tuning, cortical intensity tuning is greatly strengthened by inhibitory input recruited in an imbalanced manner (Wu et al., 2006; Tan et al., 2007). As intensity increases, the amplitude of inhibition increases monotonically, and the temporal delay of inhibition relative to excitation is shortened. As a result, the suppression of excitation by the inhibitory input is enhanced at intensities above the preferred intensity, and intensity selectivity of spike responses is sharpened. More interestingly, neuron modeling work indicates that even if excitatory input is not intensity tuned, intensity tuning can still be generated by shortening the relative onset of inhibition with intensity increments (Wu et al., 2006) (Fig. 2 B). This result suggests that controlling the relative timing between excitation and inhibition can be a good strategy used by synaptic circuits to achieve a de novo construction of intensity selectivity.
Inhibitory advance in layer 6 (L6).
L6 of the A1 has been implicated in a major corticothalamic feedback loop. It receives direct thalamocortical input and conversely sends feedback projections predominantly to the first-order thalamic nucleus (Ojima, 1994; Prieto and Winer, 1999; Rouiller and Welker, 2000; Winer et al., 2001, 2005; Kaur et al., 2005; Winer, 2005; Takayanagi and Ojima, 2006; Lakatos et al., 2007; Llano and Sherman, 2008; Wallace and Palmer, 2008). This corticothalamic feedback has been thought to mediate thalamic responses (Villa et al., 1991; Zhang and Suga, 1997; Yan and Ehret, 2002). However, sensory stimuli do not drive spike responses in a large proportion of L6 excitatory neurons (Tsumoto and Suda, 1980; Sirota et al., 2005; Zhou et al., 2010), but suppress their spontaneous firing within the expected tonal RF (Zhou et al., 2010). The suppression of evoked spike responses results from a strong inhibitory input preceding the coactivated excitatory input (Zhou et al., 2010) (Fig. 2 C). Such a reversed excitatory–inhibitory temporal relationship can be attributed to a parallel feedforward circuit in L6, with excitatory and inhibitory inputs both disynaptically relayed from the thalamus. Because of earlier spiking of the first-order L6 inhibitory neurons than the first-order excitatory neurons (Zhou et al., 2010), the second-order excitatory neuron would receive inhibition before the arrival of the disynaptic excitatory inputs. It is proposed that the preceding inhibition may be relieved under specific conditions, such as during the arrival of attention-related inputs. Then, the corticothalamic feedback is allowed to be activated to mediate the induction of sound-specific plasticity in the auditory thalamus (Zhang and Suga, 2000; Suga and Ma, 2003; Zhang and Yan, 2008).
Spectral shaping of auditory feature selectivity by cortical inhibition
In the auditory system, frequency or spectral information is mainly coded spatially in a tonotopic map. Therefore, the spatial distribution of presynaptic neurons largely determines the spectral range of synaptic inputs to the postsynaptic cell. Through specific spectral interactions between excitation and inhibition, feature selectivity represented by auditory cortical neurons can be enhanced or even created.
Three models have been proposed to explain the inhibitory sharpening of frequency selectivity. First, in the balanced excitation and inhibition model (Fig. 3 A), excitation and inhibition exhibit the same tuning profiles (Fig. 3 A); that is, they are cotuned (Wehr and Zador, 2003; Zhang et al., 2003; Tan et al., 2004; Oswald et al., 2006). In this model, inhibition scales down the level of membrane depolarization responses and thus narrows the frequency range for spike responses through an “iceberg” or thresholding effect (Wehr and Zador, 2003; Tan et al., 2004). The cotuned inhibition can be explained by a feedforward circuit in which inhibitory neurons providing the inhibition receive the same set of thalamic inputs as the excitatory neuron under examination. Second, in the lateral inhibition model (Fig. 3 B), the spectral range of inhibitory inputs is much broader than that of excitatory inputs, resulting in suppressive sidebands flanking the excitatory RF and the narrowing of frequency tuning of spike responses (Suga and Manabe, 1982; Shamma, 1985; Shamma and Symmes, 1985; Calford and Semple, 1995; Sutter and Loftus, 2003; Oswald et al., 2006). This second model is primarily based on extracellular recording results of two-tone suppression experiments (Suga and Manabe, 1982; Calford and Semple, 1995; Sutter and Loftus, 2003), and can only be explained by a circuit in which inhibitory neurons receive a broader range of thalamic inputs than excitatory neurons. Third, detailed analysis of frequency tunings of synaptic inputs at high resolutions reveals that the spectral range of inhibition is in fact slightly narrower than that of excitation, and the shapes of excitatory and inhibitory tuning curves are different (Wu et al., 2008; Sun et al., 2010). The inhibitory tuning curve appears broader, especially within a putative spiking frequency range around the CF (Fig. 3 C). This current model unites the two previous models by demonstrating that on a global scale, excitation and inhibition are approximately balanced, but on a finer scale, excitation and inhibition can be significantly imbalanced. It also fits better with the properties of inhibitory neurons. Recordings from layer 4 fast-spike inhibitory neurons show that their spectral range of synaptic inputs (which is primarily determined by thalamocortical inputs) is not different from nearby excitatory neurons, but spike responses of fast-spike inhibitory neurons are more broadly tuned than those of excitatory neurons (Atencio and Schreiner, 2008; Wu et al., 2008). Because layer 4 excitatory neurons receive a significant amount of excitatory input from other cortical excitatory neurons (Liu et al., 2007; Happel et al., 2010; Zhou et al., 2010), this differential tuning between fast-spike inhibitory neurons and excitatory neurons introduces a break of excitatory–inhibitory balance; that is, inhibitory inputs are more broadly tuned than summed excitatory inputs (Fig. 3 C). Compared with the cotuned inhibition, the more broadly tuned inhibition around the preferred frequency has an advantage in that it can exert an equivalent lateral inhibition effect and further narrow the frequency range of spike responses (Wu et al., 2008).
Frequency-modulated (FM) direction selectivity.
Neurons selective for direction of FM sweeps are found in the A1 (Suga, 1965; Mendelson and Cynader, 1985; Zhang et al., 2003). Mapping studies suggest that direction selectivity is topographically ordered in parallel with frequency representation. Low CF neurons prefer upward sweeps, whereas high CF neurons prefer downward sweeps (Heil et al., 1992; Zhang et al., 2003; Godey et al., 2005). It is found that the spectral distribution of excitatory synaptic input is asymmetric or skewed in direction-selective neurons, and the skewness is strongly correlated with direction selectivity (Zhang et al., 2003). However, the skewed excitatory inputs by themselves do not account for the generation of direction selectivity, as the integration of single-tone evoked excitatory inputs sequentially to simulate FM sweep stimulation results in an optimal direction for excitation opposite to the cell’s preferred direction (Zhang et al., 2003). The correct directional preference on the other hand can be achieved by spectral and temporal interplays between excitatory and inhibitory inputs. There appears to be a spectral offset between excitation and inhibition in direction-selective neurons (Suga, 1965; Shamma et al., 1993; Nelken and Versnel, 2000; Zhang et al., 2003; Razak and Fuzessery, 2006; Ye et al., 2010), and relatively stronger inhibition appears on one side of the excitatory frequency tuning curve in a manner consistent with the cell’s preferred direction (Zhang et al., 2003; Ye et al., 2010). It has been thought that this spectral offset is important for the generation of direction selectivity. In fact, the apparent offset can be attributed to the broader tuning of inhibition than excitation and its less asymmetric tuning shape (Fig. 3 C). When FM sweeps are applied in the preferred direction, both strong excitation and inhibition are activated earlier, but the delayed nature of inhibition allows spiking response to be generated. In contrast, in the opposite direction, relatively strong inhibition is activated earlier, which effectively suppresses the later arriving strong excitation and prevents spiking response (Zhang et al., 2003). Thus, the detailed excitatory–inhibitory imbalance can contribute significantly to direction selectivity in response to FM sweeps.
Variety of excitatory–inhibitory interplay and functional diversity
Having been supported by several earlier iVCR studies (Wehr and Zador, 2003; Zhang et al., 2003; Tan et al., 2004; Oswald et al., 2006; Tan and Wehr, 2009), the concept of balanced excitation and inhibition has received broad attention. The balance is characterized by a relatively constant ratio between amplitudes of excitation and inhibition across different stimuli. In addition, a stereotyped sequence of excitation followed by inhibition is evoked by sensory input, with the time interval between them relatively constant across stimuli. Under balanced excitation and inhibition, functional selectivity is primarily determined by the property of excitatory synaptic inputs, and inhibition only helps to sharpen the selectivity. Considering that central auditory neurons exhibit a wide variety of functional properties very different from auditory nerves (Schreiner et al., 2000; de la Rocha et al., 2008), neural circuits with only balanced excitation and inhibition seem too limited for explaining the functional diversity. Indeed, in the cortex, even the balanced excitation and inhibition can only be viewed as being approximate. Neural circuits with different organization principles together with neuronal populations with different response properties can result in various patterns of excitatory–inhibitory interplay deviating from the perfect balance, which would be essential for creating diverse functional properties.
The inhibitory mechanisms discussed so far are largely consistent with the functional properties of fast-spike inhibitory neurons and their involved circuits. Functional properties of other types of inhibitory neurons in the auditory cortex are yet unknown. Although in a few cases, in vivo whole cell recordings combined with post hoc histology have helped to identify inhibitory neurons in the visual cortex (Azouz et al., 1997; Hirsch et al., 2003), in general it is extremely difficult to encounter inhibitory neurons in blind recordings, especially those minor types of inhibitory neurons. The fast growing transgenic mouse lines wherein genetic labeling of specific subtypes of inhibitory neurons has been achieved greatly facilitate a targeted examination of a desired inhibitory cell type. With technical innovations on in vivo Ca2+ imaging of neuronal responses (Sohya et al., 2007; Bandyopadhyay et al., 2010; Kerlin et al., 2010; Runyan et al., 2010; Zariwala et al., 2011) as well as two-photon imaging–guided patch-clamp recording (Margrie et al., 2003; Liu et al., 2009; Gentet et al., 2010; Ma et al., 2010), some major breakthroughs will likely be made in the next few years to greatly enhance our understanding of the differential functional roles of inhibitory neuron subtypes (Ma et al., 2010) and their associated synaptic circuitry.
An important functional aspect of inhibition, its dynamic property, has not been extensively studied previously. The dynamic property concerns the temporal profile (duration and decay time) and short-term plasticity of inhibitory synaptic responses, as well as the firing pattern of inhibitory neurons (transient or sustained). Such information is important for understanding the inhibitory shaping of responses under temporally complex sounds, which are predominant in a natural acoustic environment. The current data on the duration of inhibition remain somewhat controversial (with short duration: Wehr and Zador, 2003, 2005; with relatively long duration: Tan et al., 2004; Wu et al., 2008). For example, the short duration of inhibition observed in excitatory cells in a couple of studies (Wehr and Zador, 2003, 2005) seems inconsistent with the more sustained spike responses elicited in inhibitory neurons by similar tone stimulation (Wu et al., 2008). Further studies will be needed for our understanding on the contribution of the temporal profile of inhibition to the outstanding phenomenon of two-tone feedforward masking (Tan et al., 2004; Wehr and Zador, 2005).
In addition to comparing patterns of sound-evoked excitation and inhibition, a straightforward strategy for elucidating the inhibitory shaping of single cortical neuron’s functional properties is to compare its responses before and after eliminating inhibition in that cell. With iVCR, it remains extremely difficult to change the intrapipette solution to directly record the responses of the same cell with and without intracellular blockers for GABA receptors. However, essential understandings of inhibitory mechanisms can be gained first by modeling membrane potential responses with neuronal models. By including or removing the inhibitory conductance, these computational models can qualitatively reveal the effects of inhibition (Wu et al., 2008; Liu et al., 2010). Second, the dynamic-clamp technique (Sharp et al., 1993) can be combined with in vivo whole cell recording to inject experimentally determined synaptic conductances into the cell and monitor its membrane potential response. Finally, the newly developed optogenetic technique (Lima et al., 2009; Zhang et al., 2010) will allow light-controlled reversible inactivation of local inhibitory neurons, while leaving the network activity largely intact. Responses of single cells can be examined before and after activation or inactivation. Collectively, these experiments will be able to provide invaluable insights into the inhibitory synaptic mechanisms underlying auditory cortical functions.
This work was supported by National Institutes of Health (grants DC008983 and DC008588 to L.I. Zhang and grant EY019049 to H.W. Tao), the Searle Scholar Program, the Esther A. and Joseph Klingenstein Fund, and the David and Lucile Packard Foundation.
Robert S. Farley served as guest editor.
- Abbreviations used in this paper:
- characteristic frequency
- frequency modulated
- in vivo whole cell voltage-clamp recording
- receptive field
This article is distributed under the terms of an Attribution–Noncommercial–Share Alike–No Mirror Sites license for the first six months after the publication date (see http://www.rupress.org/terms). After six months it is available under a Creative Commons License (Attribution–Noncommercial–Share Alike 3.0 Unported license, as described at http://creativecommons.org/licenses/by-nc-sa/3.0/).