Alternative splicing in humans (Bioinformatics)

1. Introduction

In the human genome, the protein coding sequence of most genes is organized into discrete blocks of sequence, the exons, which are interrupted by noncoding sequences called introns. Exons and introns are transcribed together as long nuclear pre-mRNAs that must be spliced so as to excise the introns and ligate the exons, forming mature mRNAs that are exported to the cytoplasm for translation. For small genes, this process of RNA splicing is often hard-wired so that the same array of exons is always joined to create a single unique mRNA. Many large eukaryotic genes, in contrast, exhibit alternative competing pathways for splicing, generating multiple mRNA isoforms that differ with respect to inclusion or exclusion of selected exons or parts of exons. Moreover, because alternative exons can encode unique peptides or introduce new translation initiation or termination signals, any aspect of protein structure and function may be affected, having important physiological impacts on virtually every major cellular process. In recognition of the critical role of splicing as a key mediator of gene expression, its discovery was celebrated with the 1993 Nobel Prize award in Physiology or Medicine to Richard Roberts and Phillip Sharp.

Others have reviewed the structure and function of the eukaryotic spliceosomal machinery and the biochemical mechanisms involved in catalysis of the splicing reactions (Black, 2003). Here, we will focus briefly on the importance of alternative splicing in the human genome. It is now known that a majority of human genes are alternatively spliced with respect to one or more exons (Modrek and Lee, 2002), and that this phenomenon allows a relatively small genome to encode an enormously complex proteome. A number of efforts are under way to catalog and classify human alternative splicing events on a genome-wide basis, and to compare alternative splicing among genomes of other species (e.g., Lee et al., 2003; Nurtdinov et al., 2003; Pospisil et al., 2004; Thanaraj et al., 2004). Some splicing events are poorly conserved phylogenetically and may represent experiments of nature, while others are very highly conserved through evolution and likely regulate important cellular functions. Many conserved splicing events are tightly regulated in cell type- and differentiation stage-specific patterns, so that individual exons or sets of exons can be switched on or off in response to appropriate environmental signals. Moreover, regulation of alternative splicing is functionally coupled to other RNA processing events including transcription initiation and polyadenylation (Hirose and Manley, 2000; Maniatis and Reed, 2002). A major challenge in the splicing field remains identification and characterization of the regulatory networks that control alternative splicing switches and thereby play important roles in defining the specialized properties of differentiated cells.


2. Cis-acting regulatory sequences in the genome

The decision to activate splicing of an alternative exon requires mechanism(s) for recruiting spliceosomal machinery to the appropriate splice sites, a daunting task given the observed degeneracy of splice site consensus sequences and the abundance of cryptic splice sites in the genome. The cellular splicing machinery meets this challenge by employing not only the splice sites themselves but also accessory regulatory elements in the exon and/or its flanking introns, for the process of “exon definition” (Robberson et al., 1990). Analogous to regulatory elements that stimulate or repress transcription, positively acting enhancer elements and negatively acting silencer elements can exert dramatic effects on the control of pre-mRNA splicing by promoting or inhibiting spliceosome recruitment. While exons are generally enriched in enhancers and introns contain abundant silencers (Cartegni et al., 2002; Fairbrother and Chasin, 2000; Fairbrother et al., 2002), tissue-specific alternative exons often contain both types of elements as an integral part of the regulatory mechanism. As a result, exon sequences are evolutionarily constrained by the need to ensure their own splicing via retention of appropriate enhancers and silencers.

3. Trans-acting splicing factors

A major paradigm in pre-mRNA splicing regulation is the concept that splicing is controlled by mutually antagonistic splicing factor proteins that bind to enhancer and silencer elements in pre-mRNA, in order to control recruitment of the spliceosomes. Typically, these proteins fall into two classes, the heterogeneous nuclear ribonucleoprotein (hnRNP) proteins, and the SR proteins characterized by domains rich in arginine and serine. Splicing activator proteins typically act as spliceosome recruiting proteins by binding to regulatory site(s) in or near an alternative exon (via a consensus RNA binding motif), and to a component(s) of the spliceosome (via a protein interaction domain). Interestingly, recruitment itself can involve any one of several different adaptors interacting with distinct spliceosomal proteins, a flexibility that may enable a cell to switch on specific sets of alternative exons in response to specific environmental signals. Equally important to splicing regulation are splicing silencer proteins that can interfere with spliceosome recruitment by blocking activity of the positive regulators. This interplay between splicing activators and inhibitors, termed “dynamic antagonism” (Charlet et al., 2002), plays a critical role in regulating cell type-specific splicing.

4. Alternative splicing and normal development

Implementation of alternative splicing switches during differentiation and development requires signaling mechanisms to transduce extracellular stimuli into changes in expression or activity of splicing activators and inhibitors. Alternative splicing switches can be signaled by insulin, stress hormones, or antigen treatment (Stamm, 2002), and may ultimately be mediated by tissue-restricted expression of a splicing regulatory protein, or by physiological changes in expression of a widely expressed factor (e.g., Hou et al., 2002; Jensen et al., 2000; Ladd et al., 2001). However, at present these mechanisms are poorly understood.

5. Alternative splicing and human disease

This subject is clearly presented in a recent review.Mutations that alter normal splice sites or create aberrant splice sites have long been known to underlie many human diseases. More recently it has been appreciated that subtle exon mutations and distant intron mutations can also cause disease by disrupting enhancer or silencer elements. Alternatively, disturbances in the splicing machinery itself, whether engineered in mouse models or occurring naturally in human disease states, have been shown to affect splicing patterns of many genes and cause global disturbances of cellular metabolism. Quantitative shifts in the relative concentrations of enhancer and silencer proteins could alter splice site selection of key growth regulatory genes, DNA repair functions, or apoptosis genes, and contribute to cancer progression. Many individual reports of altered splicing in cancer, and two broad bioinformatic surveys of cancer-associated splicing defects, have been published recently (Wang et al., 2003; Xu and Lee, 2003).

6. Future perspectives

This is an exciting time in the alternative splicing field, as new approaches offer great promise for a better understanding of the extent of alternative splicing in the human genome and better knowledge of the mechanisms by which alternative splicing is regulated. Advances in genome-wide sequence analysis and microarray technologies offer opportunities to deduce global patterns of alternative splicing in normal tissues (Johnson et al., 2003). This will provide a more detailed and perhaps more accurate view of splicing than what is available today from studies that are often derived from transformed cell lines or tumors that may exhibit aberrant splicing compared to normal differentiated cells. Although computational analysis of splicing regulatory elements has already begun (Brudno etal., 2001; Cartegni et al., 2003b; Fairbrother et al., 2002), improved methods should increase our ability to mine this sequence data for splicing regulatory signals. Together with new biochemical and genetic methods for identifying the targets of splicing regulator proteins, these advances should ultimately lead to a map of the splicing regulatory networks that control alternative splicing. Finally, the next several years should see significant progress in understanding splicing aberrations in human disease and, more importantly, in the development of exciting new potential therapies for the correction of such splicing defects (Cartegni and Krainer, 2003a; Sazani and Kole, 2003).

Next post:

Previous post: