Sarcoglycanopathies are autosomal recessive muscle-wasting disorders that result from genetic defects of four transmembrane glycoproteins, α-, β-, γ- and δ-sarcoglycan. These four subunits form a distinct complex at the cell membrane of skeletal and cardiac muscle. The sarcoglycan complex, whose role is only now being revealed, is a component of a major complex formed by dystrophin at costameres in the cell membrane (Refs Reference Blake1, Reference Davies and Nowak2, Reference Hack3, Reference Ozawa4). Sarcoglycanopathies are included in a large group of limb-girdle muscular dystrophy (LGMD) because these progressive muscle disorders predominantly affect proximal muscles around the scapular and the pelvic girdles. Mutations in individual sarcoglycans are responsible for LGMD-2C (γ-sarcoglycan), LGMD-2D (α-sarcoglycan), LGMD-2E (β-sarcoglycan) and LGMD-2F (δ-sarcoglycan) (Refs Reference Bönnemann5, Reference Lim6, Reference Nigro7, Reference Noguchi8, Reference Roberds9). The clinical phenotype of sarcoglycanopathies is very heterogeneous, and age of onset, rate of progression and severity can vary between and within affected families (Ref. Reference Angelini10). In general, the disease is characterised by progressive weakness and degeneration of skeletal muscle, leading to loss of ambulation, difficulties in breathing and often premature death. The majority of sarcoglycanopathies are associated with missense mutations that generate substitution of single residues that could lead to a misfolded protein. Analysis of muscle samples from patients showed that these mutations result in either the complete absence or the presence of only trace amounts of the protein in the cell membrane. Sarcoglycans are transmembrane proteins that mature in the endoplasmic reticulum (ER) where nascent proteins reach their native conformation through the activity of an efficient quality control system. Misfolded proteins are identified by the quality control system and retrotranslocated to the cytosol for proteasomal degradation through the ER-associated protein degradation (ERAD) pathway (Refs Reference Anelli and Sitia11, Reference Hebert and Molinari12). Occasionally, defective proteins may pass the quality control and reach the cell membrane, where, since they are nonfunctional and thus unstable, they are dismantled and degraded. Defects in each sarcoglycan have destabilising consequences on the entire sarcoglycan complex. Therefore, the pathogenetic mechanisms may comprise: (1) the processing of defective sarcoglycan subunits; (2) the inability to assemble into a complete complex; or (3) the targeting of a dysfunctional sarcoglycan complex to the cell membrane. Sequence analysis of sarcoglycans indicated that although many missense mutations might not have functional consequences, they are intercepted by the quality control system, which significantly slows their processing and results in the disposal of the mutant protein. Recent evidence shows that these misfolded ‘functional’ sarcoglycans could be rescued to the cell membrane by assisting them in the maturation process along the ER secretory pathway (Refs Reference Bartoli13, Reference Gastaldello14). The purpose of this review is to provide a comprehensive overview of the molecular organisation of the sarcoglycan complex and of the pathogenetic events causing the disease. Novel therapeutic strategies for the treatment of sarcoglycanopathies based on interference of the ER-processing machinery are also evaluated.
The dystrophin–glycoprotein complex
The four sarcoglycans form a tetrameric subcomplex within the multimeric complex based on dystrophin in the cell membrane of skeletal and cardiac muscle. Dystrophin is a large actin-binding cytoskeletal protein that is essential for the organisation of the cell-membrane-associated dystrophin–glycoprotein complex (DGC). The DGC is enriched in costameres, which are specialised cytoskeletal structures connecting the plasma membrane and Z discs of peripheral myofibrils (Refs Reference Davies and Nowak2, Reference Ervasti15). Dystrophin is composed of four distinct functional domains: the N-terminal actin-binding domain, a long central domain containing 24 spectrin-like repeats, a cysteine-rich domain and the C-terminal domain. The protein forms a tight link with the actin cytoskeleton through the N-terminal domain, an association reinforced by additional binding sites within some spectrin repeats (Ref. Reference Blake1). At the distal region of the protein, dystrophin forms, via its cysteine-rich domain, a tight association with the transmembrane protein β-dystroglycan, which in turn is strongly linked to the extracellular α-dystroglycan (Fig. 1). The two dystroglycans are products of a single gene that is post-translationally cleaved into two subunits (Ref. Reference Durbeej, Henry and Campbell17). The highly glycosylated α-dystroglycan forms a strong interaction with laminin α2, a major constituent of muscle fibre basement membrane. Thus, the backbone formed by dystrophin, dystroglycans and laminin α2 represents a transmembrane structure directly connecting the actin cytoskeleton to the extracellular matrix. The localisation of DGC in line with costameres is assured by the association of β-dystroglycan and dystrophin to two ankyrins (B and G), which are components of the subsarcolemmal cytoskeleton (Ref. Reference Ayalon18). The DGC is thus thought to provide structural support to the plasma membrane and to protect it from the mechanical stress of contractile activity by transmitting the lateral tension generated by muscle contraction to the extracellular matrix (Refs Reference Ervasti15, Reference Petrof19). The critical protective role of dystrophin and of the entire membrane-associated DGC is demonstrated by the fact that genetic defects of dystrophin are responsible for the most common type of muscular dystrophy, the incurable Duchenne muscular dystrophy (DMD) and the milder Becker muscular dystrophy (BMD), whereas defects of laminin α2 and α-dystroglycan glycosylating enzymes cause diverse forms of muscular dystrophy (Refs Reference Davies and Nowak2, Reference Helbling-Leclerc20, Reference McNally and Pytel21). With a defective DGC, the backbone structure is dismantled so that the cell membrane becomes exposed to muscle contraction stresses. As a consequence, cell membrane focal ruptures might occur, leading to transient intracellular calcium influx, which triggers a series of pathogenic events that result in muscle degeneration and the dystrophic phenotype (Refs Reference Davies and Nowak2, Reference McNally and Pytel21).
Role of sarcoglycans in the dystrophin–glycoprotein complex
Loss of sarcoglycan is the specific cause of sarcoglycanopathy, whereas the absence of dystrophin in DMD causes instability of the complex and also leads to the severe reduction of sarcoglycans. Thus, an altered sarcoglycan complex is a common trait of the two muscular dystrophies and probably represents an aggravating condition in DMD.
The role of the four sarcoglycans in the molecular organisation of the DGC is not yet well defined. The sarcoglycan complex is known to form a tight side-association with dystroglycan (Refs Reference Chan22, Reference Brenman23), but is also involved in composite molecular relationships with other constitutive DGC elements such as α-dystrobrevin and syntrophin (Ref. Reference Sakamoto24), neuronal nitric oxide synthase (nNOS) (Ref. Reference Yoshida25) and sarcospan (Ref. Reference Crosbie26) (Fig. 1). Sarcospan is an integral component of DGC, which directly interacts with and stabilises sarcoglycans. The relevance of this transmembrane protein is, however, still uncertain, because ablation of the sarcospan gene in mice does not result in myopathy (Ref. Reference Lebakken27). The sarcoglycan complex, excluding γ-sarcoglycan, is thought to form an association with α-dystroglycan (Ref. Reference Brenman23) via an extracellular proteoglycan, biglycan (Ref. Reference Rafii28). The intracellular tail of β- and δ-sarcoglycan seems to associate directly with the C-terminus of dystrophin (Ref. Reference Chen29), whereas the N-terminal region of α-dystrobrevin secures sarcoglycans to dystrophin (Ref. Reference Sakamoto24). α-Dystrobrevins, syntrophin and nNOS form a signalling subunit associated with dystrophin through the C-terminal domain of dystrobrevin. Additional proteins seem to interact with sarcoglycans at the cell membrane. Two hybrid screens identified γ-filamin as a γ- and δ-sarcoglycan-interacting protein (Ref. Reference Thompson30) and, as a result, γ-filamin is reduced in LGMD-2C (γ-sarcoglycan) but not in LGMD-2D (α-sarcoglycan) (Ref. Reference Anastasi31). Since γ-filamin is an actin-binding protein, it provides, through the interaction with the sarcoglycan complex, additional structural linkages between the DGC and the actin cytoskeleton. The precise physiological role of biglycan and γ-filamin, however, is not well understood. Recently, aquaporin 4, a muscle-specific water channel, was shown to be associated with sarcoglycan through α1-syntrophin, an interaction validated by its severe reduction in muscles of sarcoglycanopathy patients (Ref. Reference Assereto32).
Altogether, these data demonstrate that the sarcoglycan complex forms multiple interactions with the other DGC components and enforce the view of its crucial role in stabilising the whole DGC structure. However, the sarcoglycans might also be involved in signal transduction (Refs Reference Davies and Nowak2, Reference Hack3, Reference McNally and Pytel21, Reference Lapidos33), a role that is primarily supported by the anchorage provided by the complex to nNOS through dystrobrevin and syntrophin (Ref. Reference Yoshida25). Moreover, the sarcoglycan complex has been shown to cooperate with integrins in mediating cell adhesion to the extracellular matrix (Ref. Reference Yoshida34). In support of this, integrins are upregulated when the complex is missing (Ref. Reference Allikian35). The sarcoglycan complex also interacts with the 16 kDa subunit of vacuolar H+-ATPase, and thence with β1-integrin (Ref. Reference Chen36) – a finding that further supports bidirectional signalling with integrins (Ref. Reference Yoshida34). Notably, it has been demonstrated that the cytoplasmic tail of γ-sarcoglycan is phosphorylated after mechanical stimulation, suggesting that it is a signalling sensor of contractility (Ref. Reference Barton37). In addition, α-sarcoglycan possesses an ATP-binding site, conferring ATP-hydrolysing activity to the protein (Refs Reference Betto38, Reference Sandonà39). This activity suggests a possible role in modulating the signalling initiated by extracellular nucleotides and a role in the extracellular ATP-dependent modulation of skeletal muscle contractility (Ref. Reference Sandonà40). The critical role of the sarcoglycan complex, whether mechanical or signalling, is further corroborated by a recent study showing that the complete absence of the sarcoglycan complex in mice lacking both δ-sarcoglycan and dystrophin exacerbates the pathological course (Ref. Reference Li41). This evidence also suggests that the residual expression of the sarcoglycan complex contributes to the milder phenotype of dystrophin-deficient mdx mice compared with that of patients with DMD (Ref. Reference Li41).
The sarcoglycan complex in sarcoglycanopathies
Studies of patient muscle biopsies provide additional important information on the sarcoglycan complex and its contribution to the molecular organisation of the DGC. Analyses of muscle biopsies from sarcoglycanopathy patients demonstrated that the absence or reduced expression of one sarcoglycan has important, but variable consequences for the stability of the other remaining components at the cell membrane. In general, the most clinically severe course was observed in patients in which the involved sarcoglycan was absent, with a variable milder phenotype when residual protein is present. Early studies on patients affected by sarcoglycanopathies reported that mutations in one sarcoglycan gene led to the reduction or absence of the other sarcoglycans, with negligible consequences for the DGC (Refs Reference Lim6, Reference Nigro7, Reference Noguchi8, Reference Barresi42, Reference Bönnemann43, Reference Carrié44, Reference Crosbie45, Reference Duggan46, Reference Jung47, Reference Moreira48, Reference Piccolo49). By contrast, more recent evidence shows that the absence of individual sarcoglycans has indirect consequences on DGC stability. In LGMD-2E and -2F patients, defects in genes encoding β- and δ-sarcoglycan results in the absence of the four sarcoglycans from the plasma membrane and the reduction of dystrophin and dystroglycan (Refs Reference Draviam50, Reference Duclos51, Reference Fanin and Angelini52, Reference Klinge53, Reference Vainzof54). In LGMD-2D patients, defects in α-sarcoglycan causes the secondary loss of β- and δ-sarcoglycan and a severe reduction in γ-sarcoglycan, which can be absent altogether in more severely affected patients; β-dystroglycan and dystrophin were reduced in most cases (Refs Reference Draviam50, Reference Fanin and Angelini52, Reference Klinge53, Reference Vainzof54). In γ-sarcoglycan-deficient patients (LGMD-2C) trace amounts of residual sarcoglycans are evident at the cell membrane, whereas no consequences for the DGC or a reduction of dystrophin are reported (Refs Reference Draviam50, Reference Fanin and Angelini52, Reference Klinge53, Reference Vainzof54, Reference Bönnemann55, Reference Higuchi56). The absence of the sarcoglycan complex in muscle of patients affected by these diseases is reported to expose β-dystroglycan to the activity of a matrix metalloproteinase, thus providing an explanation for its reduced level in sarcoglycanopathies (Ref. Reference Matsumura57).
Animal models exist for each of the four sarcoglycanopathies and show similar results to the human situation. The absence of α-, β- and δ-sarcoglycan causes the complete loss of the other three sarcoglycans and some destabilisation of dystroglycans (Refs Reference Brenman23, Reference Duclos58, Reference Araishi59, Reference Coral-Vasquez60, Reference Durbeej61, Reference Hack62, Reference Matsumura63, Reference Straub64). By contrast, the absence of γ-sarcoglycan in mice weakens the stability of β- and δ-sarcoglycan and causes severe reduction of α-sarcoglycan, but the expression and localisation of dystrophin, dystroglycan and laminin-α2 are unaffected (Refs Reference Hack62, Reference Hack65, Reference Mizuno66, Reference Sasaoka67). δ-Sarcoglycan-deficient animals develop cardiomyopathy, which is rarely observed in LGMD-2F patients, with loss of the sarcoglycan complex even in smooth muscle (Ref. Reference Coral-Vasquez60); β- and γ-sarcoglycan-deficient mice, but not α-sarcoglycan-null mice, also display some cardiac involvement.
The sarcoglycan family of proteins in flies is simpler than in vertebrates: the fly genome encodes a single orthologue of vertebrate α- and ε-sarcoglycans (αε-sarcoglycan), a β-sarcoglycan and a single orthologue of γ- and δ-sarcoglycans (γδ-sarcoglycan). In Drosophila, a tetrameric organisation of the complex is suggested, with the association of αε-sarcoglycan, β-sarcoglycan and two copies of γδ-sarcoglycan (Ref. Reference Greener and Roberts68). A γδ-sarcoglycan-deficient model has been generated, which caused skeletal and cardiac defects (Ref. Reference Allikian69); however, the fate of the other sarcoglycans was not determined. Orthologues of sarcoglycan genes have also been identified in zebrafish, which represent an established model of vertebrate development (Ref. Reference Chambers70). Knockdown of δ-sarcoglycan caused skeletal and cardiac development defects and reduced the levels of β- and γ-sarcoglycan (Ref. Reference Cheng71).
In summary, muscle analysis of various LGMD patients and of related animal models shows that the absence of or defects in β- and δ-sarcoglycan has dramatic consequences on the expression of all other subunits, whereas defects in γ-sarcoglycan have the least effect on expression of the other sarcoglycans. Interestingly, the absence of α-sarcoglycan in humans causes the absence or severe reduction of γ-sarcoglycan, but β- and δ-sarcoglycan are slightly spared. By contrast, in α-sarcoglycan-deficient mice, the other sarcoglycans are completely lost. Interestingly, expression of γ-sarcoglycan and, to a lesser extent, α-sarcoglycan, is reduced more than β- and δ-sarcoglycan across all sarcoglycanopathies.
The molecular organisation of the sarcoglycan complex
Data from LGMD patients and related animal models demonstrate that the four sarcoglycans have an important role at the plasma membrane, but only when they exist as a complex. A better understanding of sarcoglycan complex organisation, processing and function is thus central to improve our knowledge of the pathogenic mechanisms responsible for the dystrophic phenotype.
Sarcoglycans are single-pass transmembrane proteins, with a short intracellular tail and a large extracellular glycosylated portion that is rich in conserved cysteine residues. Six sarcoglycans have been cloned so far: α-, β-, γ-, δ-, ε- and ζ-sarcoglycan (Table 1). α- and γ-sarcoglycan are expressed exclusively in skeletal and cardiac muscle, whereas ε-, β-, δ- and ζ-sarcoglycan are more widely distributed (Refs Reference Bönnemann5, Reference Lim6, Reference Nigro7, Reference Noguchi8, Reference Roberds9, Reference Ettinger, Feng and Sanes72, Reference McNally, Ly and Kunkel73, Reference Wheeler74). The α- and ε-sarcoglycans, are type I membrane proteins with an extracellular N-terminal domain, whereas β-, γ-, δ- and ζ-sarcoglycans are type II membrane proteins, with an extracellular C-terminal domain. α- and ε-sarcoglycan are highly homologous, as are β-, γ-, δ- and ζ-sarcoglycan, with the latter being functionally more similar to γ-sarcoglycan than to δ-sarcoglycan (Ref. Reference Shiga75). A cysteine cluster is present in the extracellular domain of all sarcoglycans and is predicted to be important for the tertiary structure of the protein and for assembly of the whole complex (Ref. Reference Chan22). All four sarcoglycans possess putative phosphorylation sites in the intracellular domain, indicating a possible post-translational modulation of the protein or complex.
aObserved molecular mass (kDa) based on results from SDS-PAGE.
Abbreviations: LGMD, limb-girdle muscular dystrophy; MDS, Myoclonus–dystonia syndrome; AA, number of amino acid residues; N, number of putative N-linked glycosylation sites in the extracellular portion of the protein, according to http://www.uniprot.org; C, number of cysteine residues.
Protein sequence analysis shows that α-sarcoglycan contains an N-terminal signal sequence, one transmembrane domain, five extracellular cysteines and two putative N-linked glycosylation sites, which are conserved in all species (Table 1 and Fig. 5). Cleavage of the signal sequence and glycosylation generate a protein of about 50 kDa. In the N-terminal portion, α-sarcoglycan has a cadherin-like domain and Ca2+-binding pockets, which have also been described in α-dystroglycan, suggesting a possible Ca2+-dependent heterotypic adhesion between the two proteins (Ref. Reference Dickens, Beatons and Ponting76). In addition, α-sarcoglycan has a putative ATP-binding sequence in the extracellular domain, which is conserved in all species (Ref. Reference Betto38). The predicted molecular mass of glycosylated β-sarcoglycan is about 43 kDa and its large extracellular portion has five cysteine residues and three putative N-linked glycosylation sites. The mass of glycosylated γ- and δ-sarcoglycan is ~35 kDa. γ-sarcoglycan has four cysteine residues and one putative N-linked glycosylation site in its extracellular portion, whereas δ-sarcoglycan has four cysteine residues and three putative N-linked glycosylation sites. The C-terminus of β-, δ- and γ-sarcoglycan has three conserved cysteine residues in a fixed position that is similar to those in epidermal growth factor (EGF), suggesting a receptor-like function for an unknown agonist (Refs Reference Hack3, Reference Chen29). It is worth noting that the presence of a putative N-linked glycosylation site in a protein does not necessarily equate to a real glycosylation event. In fact, although the molecular masses of unglycosylated γ- and δ-sarcoglycan are identical (32 kDa), the presence of three N-glycans in δ-sarcoglycan is expected to generate a larger mass than that of γ-sarcoglycan, which has one N-linked glycosylation site. Therefore, either only one of the δ-sarcoglycan N-linked glycosylation sites is eventually glycosylated or the two proteins are modified differently upon post-translation processing.
It is widely assumed that sarcoglycans are organised at the cell membrane as a tetramer, in a complex that is exclusively expressed in striated muscles; this is consistent with the notion that mutations in α-, β-, γ- and δ-sarcoglycan are responsible for muscle sarcoglycanopathies. Mutations in the gene encoding ε-sarcoglycan are associated with myoclonus dystonia syndrome (Ref. Reference Zimprich77), whereas no disease has been so far associated with mutations in the ζ-sarcoglycan gene. Organisation of the sarcoglycan complex occurs in a strict equimolar stoichiometry (Ref. Reference Jung47), although atypical organisation and localisation of sarcoglycans have also been reported. For example, in human smooth muscle, a pentameric or exameric sarcoglycan complex has been proposed (Ref. Reference Anastasi78). Moreover, δ- and γ-sarcoglycan seem to have an additional intracellular membrane localisation, but any contribution to muscle physiology and/or pathology is unknown (Ref. Reference Ueda79).
Assembly and trafficking of the sarcoglycan complex
Studies on the assembly of the sarcoglycan complex during the early stage of myotube differentiation show that sarcoglycans are cotranslationally translocated in the ER, where they associate during transport through the Golgi to the plasma membrane (Refs Reference Chan22, Reference Hegde and Kang80) (Fig. 2).
In the ER, the quality control machinery facilitates polypeptide folding to ensure that nascent proteins assume the correct conformation, and, if necessary, it recycles the proteins until the native conformation is obtained (Fig. 3a–c). Glycosylation of proteins such as sarcoglycans is performed by an oligosaccharyltransferase, which attaches a composite oligosaccharide composed of three glucose, nine mannose and two n-acetyl glucosamine residues (N-glycan) to the nascent polypeptide. Subsequently, two glucoses are sequentially trimmed from the N-glycan by glucosidase I and II, while two lectin chaperons, calnexin and calreticulin, associate with the monoglycosylated proteins to attain their pre-native conformation (Refs Reference Anelli and Sitia11, Reference Hebert and Molinari12). Protein folding is then performed by oxidoreductases of the PDI family and peptidyl-propyl cis/trans isomerases, which facilitate the oxidative folding of the protein, formation and isomerisation of disulfide bonds, the correct cis or trans isomerisation and prevent the formation of aggregates (Ref. Reference Hebert and Molinari12). At the end of the calnexin–calreticulin cycle, the enzyme UDP-glucose glycoprotein glucosyltransferase (UGGT) reattaches a glucose to imperfectly folded proteins, causing their reintroduction into the calnexin and calreticulin cycle (Ref. Reference Caramelo and Parodi81) (Fig. 3b). Properly folded proteins are ignored by UGGT and proceed along the maturation pathway (Fig. 3c). Export of native proteins occurs at ER-specific sites where budding of vesicles coated with coat protein complex II takes place and by which they enter the Golgi complex (Fig. 3d).
Trafficking of transmembrane multimeric complexes requires more elaborate processing because the individual membrane proteins are usually assembled in the ER and then transported as a unit to the Golgi and the cell membrane (Fig. 2). Evidence indicates that multimeric complexes may also encounter post-ER quality control checkpoints that monitor and assist the oligomerisation process, probably at ER–Golgi intermediate and cis-Golgi compartments (Ref. Reference Anelli and Sitia11). Experimental evidence in CHO cells demonstrated that the simultaneous synthesis of the four sarcoglycans is mandatory for proper assembly and cell membrane localisation of the sarcoglycan complex (Ref. Reference Holt and Campbell82). Biochemical analyses in C2C12 myogenic cells showed that sarcoglycans associate to form a complex in the ER as soon as they are synthesised. Thereafter, the sarcoglycan complex associates with dystroglycans and sarcospan only during transportation from the Golgi to the cell membrane (Ref. Reference Noguchi83). The latter finding implies that sarcoglycans, sarcospan and dystroglycan may exit the ER independently and then associate during transport to the cell membrane (Ref. Reference Allikian and McNally84) (Fig. 2). The association of the sarcoglycan complex with dystroglycans en route to the cell membrane probably favours the subsequent recruitment of cytosolic dystrophin once it reaches its destination. In primary mouse myotubes, it was demonstrated that β-, δ- and γ-sarcoglycans form a strong interaction, whereas α-sarcoglycan seems to be less tightly associated (Ref. Reference Chan22). The overall evidence indicates the following sequential events for sarcoglycan association: (1) β-sarcoglycan initiates a strong interaction with δ-sarcoglycan; (2) γ-sarcoglycan is added to the βδ-sarcoglycan core; (3) α-sarcoglycan is recruited by interacting with γ-sarcoglycan (Refs Reference Chan22, Reference Shi85). An alternative association sequence has also been proposed, where α-sarcoglycan directly interacts with the βδ-sarcoglycan core (Refs Reference Draviam, Shand and Watkins86, Reference Hack87). An intriguing finding in COS-1 (Ref. Reference Shi85) and in HEK293 (Ref. Reference Draviam, Shand and Watkins86) cells shows that β- and δ-sarcoglycans might be transported to the cell membrane even when expressed without α- and γ-sarcoglycan. In any case, these data reinforce the view of the fundamental role of the βδ-sarcoglycan core in the assembly process of the complex. However, the precise molecular interactions controlling assembly of the four sarcoglycans remain undefined.
A recent study in COS-1 cells identified novel functional domains in the extracellular portion of sarcoglycans that seem to regulate association and favour transport to the cell membrane. In particular, regions of the extracellular portion close to the transmembrane domain appear to be important for regulation of the interaction between β- and δ-sarcoglycan and α- and γ-sarcoglycan, whereas a more distal region seems to control the interaction between β- and γ-sarcoglycan (Ref. Reference Chen29). Moreover, these authors also demonstrate that N-glycosylation and the cysteine-rich domain of δ-sarcoglycan are crucial for the localisation of sarcoglycans to the cell membrane, and suggest that the intracellular tails of β- and δ-sarcoglycan interact with dystrophin (Ref. Reference Chen29).
The individual heterologous expression of α-sarcoglycan, but not of the other subunits, results in the localisation of the protein at the cell membrane even in the absence of the remaining sarcoglycans. Surprisingly, this occurs in cells of human origin, such as HEK293 and HeLa cells (Refs Reference Sandonà39, Reference Draviam88) but not in COS-1 cells from monkey or CHO cells from hamster (Refs Reference Holt and Campbell82, Reference Shi85). Moreover, α-sarcoglycan cell membrane localisation seems to be dynamic, because the protein recycles from the membrane to endosomes (Ref. Reference Draviam88). This evidence suggests that undefined sequence motifs of human α-sarcoglycan might permit its preferential solitary trafficking to the cell membrane. Conversely, it is possible that the difference might reside in the maturation, sorting and trafficking rules of the human protein-export apparatus. This possibility is validated by recent evidence showing that the R77C α-sarcoglycan mutant, the most common LGMD-2D mutation, remains trapped in the ER when expressed in human cells, either alone or together with the other sarcoglycans (Refs Reference Bartoli13, Reference Gastaldello14, Reference Draviam, Shand and Watkins86). By contrast, in knock-in mice homozygous for the H77C mutation (in mice, R at position 77 is conservatively substituted by H), α-sarcoglycan and the entire complex is correctly localised at the cell membrane and mice did not develop muscular dystrophy (Refs Reference Bartoli13, Reference Kobuke89). Since no information on the tertiary structure of α-sarcoglycan is available, it is possible that the protein region around H77 in mice might differ from that in humans such that this substitution could be considered a polymorphism in mice. Intriguingly, when human R77C α-sarcoglycan is transfected into muscles of α-sarcoglycan-knockout mice some aggregates do form, but the majority reaches the cell membrane (Refs Reference Bartoli13, Reference Kobuke89).
Pathogenetic mechanisms
In sarcoglycanopathy, mutations in any of the four sarcoglycan genes cause a reduction or the absence of the defective protein, and secondarily affect expression of the other subunits. A reduction in protein levels can have several causes, such as defects that hamper gene expression or lead to aberrant maturation of the mRNA transcript, or defects in protein processing (Ref. Reference Loo and Clarke90), where its fate then depends on the ER quality control system. Complete absence of a protein can result from nonsense mutations, which instead of producing truncated sarcoglycans, trigger nonsense-mediated mRNA decay. Absence of a protein might also result from missense mutations that lead to production of misfolded proteins, which are then recognised and retained in the ER, and after unsuccessful cycling in the calnexin and calreticulin system, are ultimately degraded through the ubiquitin–proteasome system (Fig. 3e, f). A mutant protein might evade the quality control mechanism, but since it will probably not be able to form a stable tetramer it will still be degraded (Fig. 3c, f). Finally, a mutant sarcoglycan protein might fold and form a tetramer, exit the ER and reach the cell membrane, where it could cause loss of function or instability of the entire sarcoglycan complex. In this case, the whole structure will be dismantled and undergo lysosomal–proteasomal degradation. The residual expression level of the remaining wild-type sarcoglycans probably depends on the association rules and trafficking possibilities described above. However, the presence of residual expression of a mutant protein in the cell membrane suggests that the mutant might escape the ER quality control system, at least in part, and forms a tetramer that is transported to the cell membrane. The resulting dystrophic phenotype might depend on insufficient levels of either the protein or the complete complex, or to loss-of-function consequences.
According to the Leiden Open Variation Database (Ref. Reference Fokkema, den Dunnen and Taschner91) (http://www.lovd.org), and to recent evidence (Refs Reference Klinge53, Reference Trabelsi92), at least 176 sequence variants in the coding region of sarcoglycans have been so far reported to cause LGMD-2C to LGMD-2F. Seventy-one are mutations that generate a truncated or aberrant protein or lead to nonsense-mediated decay of the transcript, whereas 105 are missense mutations that cause protein misfolding. Usually, mutated misfolded proteins synthesised in the ER are intercepted, retrotranslocated in the cytosol and delivered for dismantling by the ubiquitin–proteasome through the multifaceted ERAD system (Fig. 4). Recognition of misfolded proteins is quite rapid, and proteins are delivered for degradation in a ubiquitin-dependent manner, without any detectable lag following their synthesis. The process is intended to prevent the accumulation of defective proteins and formation of toxic aggregates that might block trafficking in the secretory pathway. However, several misfolded proteins are reported to be prone to aggregation, as is the case for many myofibrillar myopathies (Ref. Reference Ferrer and Olivè93) and neurodegenerative disorders, such as Parkinson, Alzheimer and prion-associated diseases (Ref. Reference Olzmann, Li and Chin94).
Recently, two different groups have demonstrated the involvement of the cell quality control system in the pathogenesis of sarcoglycanopathies in heterologous cell models (Refs Reference Bartoli13, Reference Gastaldello14). Both papers show that different α-sarcoglycan mutants with single amino acid substitutions are ubiquitinated and rapidly degraded by the proteasome. Intriguingly, proteasomal inhibition by MG132 or bortezomib (Velcade) was able to rescue expression of D97G, R98H, P228Q and V247M, but not R77C, α-sarcoglycan mutants, and more importantly, the mutants formed a stable sarcoglycan complex that localised at the cell membrane (Ref. Reference Gastaldello14). This finding demonstrates that inhibition of the proteasome, the last step in the ERAD pathway (Fig. 4, Step 8), has a retrograde effect on mutant processing, permitting the assembly of partly misfolded proteins with the other sarcoglycan components, which favours their transport to the cell membrane. Moreover, these data suggest that D97G, R98H, P228Q and V247M α-sarcoglycan mutants maintain the ability to form a potentially functional complex at the cell membrane. Regarding the inability to recover the R77C mutation, it is worth mentioning that the defective protein is known to form aggregates and is retained in the ER when expressed in heterologous cells (Refs Reference Bartoli13, Reference Gastaldello14, Reference Draviam88); however, R77C α-sarcoglycan can occasionally be detected at the cell membrane, either with or without proteasome inhibitor treatment (Ref. Reference Gastaldello14), suggesting that the R77C mutant might escape the ERAD system. Importantly, to prevent degradation of sarcoglycan mutants, it has been shown that it is also possible to intervene upstream of the proteasome. Mannosidase I is a key ER enzyme involved in retrotranslocation of terminally misfolded proteins (Fig. 4, Step 1). Inhibition of α-mannosidase I, with the selective inhibitors kifunensine and deoxymannojirimycine, favoured the localisation of the R77C α-sarcoglycan mutant at the cell membrane in HER911 cells cotransfected with β-, γ- and δ-sarcoglycan (Ref. Reference Bartoli13). However, in mouse, the substitution of H77 with a cysteine does not lead to a pathological phenotype (Refs Reference Bartoli13, Reference Kobuke89), suggesting that if there are no differences in the protein quality control system among species, the C77 variant has no effect on α-sarcoglycan function.
Recent evidence indicates that pharmacological therapies could assist ER protein processing of disease-causing mutants (Ref. Reference Loo and Clarke90). Molecules that interfere with the ER quality control system, ERAD processing and ubiquitin proteasome degradation may result in a greater level of mutated sarcoglycan proteins, which increases their chance of reaching the cell membrane. An essential prerequisite of this salvage approach is that the rescued protein mutant maintains residual activity and/or function at the cell membrane. It is thus essential to first define the criteria for the identification of treatable mutants. Diverse computational tools, based on both protein structural and evolutionary information, are currently available to facilitate the prediction of the functional consequences of missense mutations. In general, these methods give clues about whether replaced residues are associated with a disease or are simply neutral polymorphisms. Adverse mutations can either destabilise the structure or disrupt a functional site, such as ligand binding, catalytic or protein–protein interaction sites.
Unfortunately, structural data on the sarcoglycan complex are incomplete, so predictive approaches have to compensate for the lack of both experimental data and identification of homologous structures. In almost all cases of sarcoglycanopathy analysed so far, the mutants may be considered to be processing mutants. However, several mutants might retain their function if they could escape the quality control system. Mutations of critical residues within hypothetical functional domains and in highly conserved sequences may be expected to have loss-of-function consequences. By contrast, substitutions in noncritical regions might generate functional proteins. Below, we provide a concise summary of mutations in the four sarcoglycans.
Mutations in α-sarcoglycan
Seventy-three mutations have been reported in the α-sarcoglycan gene (SGCA) that cause changes in the protein. Fifty-two are missense mutations that generate a complete protein with a single residue substitution (Fig. 5), whereas 21 mutations (nucleotide substitution, duplication, deletion or insertions) produce truncated, incomplete or anomalous proteins. All but one of the α-sarcoglycan missense mutations are mapped in the extracellular domain – a critical region for the organisation of a stable sarcoglycan tetramer and association with dystroglycan (Refs Reference Chan22, Reference Brenman23).
One third of the α-sarcoglycan missense mutations map to the cadherin-like domain, which is believed to be important for Ca2+-dependent association with the similar domain present in α-dystroglycan (Ref. Reference Dickens, Beatons and Ponting76). R34C, R34H, D97G, R98S, R98C and R98H missense mutations map to putative Ca2+-binding sites in the cadherin-like domain (Ref. Reference Dickens, Beatons and Ponting76). Substitution in each of these three critical residues (R34, D97 and R98) causes LGMD in homozygous patients (Refs Reference Carrié44, Reference Duggan46). R74 causes a severe phenotype in a homozygous patient (Ref. Reference Vainzof54). Intriguingly, substitution of D97 and R98 residues seems to produce processing mutants, because they can be pharmacologically rescued at the cell membrane (Ref. Reference Gastaldello14). The R77C missense substitution in the cadherin-like domain is the most frequently reported α-sarcoglycan mutation. Analysis of muscle samples from homozygous R77C LGMD patients shows that α-sarcoglycan is absent in the majority of cases, with variable residual expression of the sole γ-sarcoglycan (Refs Reference Roberds9, Reference Carrié44, Reference Duggan46, Reference Jung47). Protein expression data, however, do not tell us whether mutations within the cadherin-like domain also compromise α-sarcoglycan function.
It is worth noting that seven α-sarcoglycan missense mutations cause the substitution of diverse residues with a cysteine, which are expected to cause abnormal disulphide bonds and abnormal conformation of the nascent protein. However, it has been demonstrated in a heterologous cell system that it is the replacement of a positive residue at position 77, rather than the introduction of a cysteine, that causes retention of the mutant protein (Ref. Reference Hebert and Molinari12). This finding highlights just how complicated the prediction of pathological consequences associated with single mutations is in proteins of undetermined function.
Two mutations, V175A and V247M, map within the N-glycosylation sequence of α-sarcoglycan (174NVT and 246NVT become, respectively, 174NAT and 246NMT), which are conservative substitutions that usually do not affect glycosylation. Nevertheless, homozygous V247M causes LGMD (Refs Reference Duggan46, Reference Moreira48), suggesting that they are processing mutants. Interestingly, five additional α-sarcoglycan missense mutations reside in the highly conserved putative α-sarcoglycan ATP-binding domain (Ref. Reference Betto38, blue box in Fig. 5). One of these, R221H, targets a critical residue in the ATP-binding pocket of the protein and causes a severe reduction of α-sarcoglycan, with a slight reduction of the other subunits (Ref. Reference Boito96). Thus, R221H probably causes a loss of function, whereas substitutions in the remaining four residues did not affect the ATP-binding capacity. Nonetheless, the C232S mutation causes a severe LGMD in homozygous patients (Ref. Reference Klinge53).
A single α-sarcoglycan mutation, M312R, maps in the intracellular domain, apparently without having adverse consequences on the extracellular domain; nevertheless, this mutation causes LGMD in a homozygous patient (Ref. Reference Guglieri97). Similarly, two mutations caused by deletions generate two α-sarcoglycan proteins with normal extracellular domains but with truncated, scrambled intracellular tails. At present, no data regarding the function of the α-sarcoglycan intracellular tail are available. It is therefore not clear whether these mutations generate a misfolded protein intercepted by the ERAD-C machinery or whether they also abrogate unknown functions located therein.
Mutations in β-sarcoglycan
Forty-nine mutations have been so far described in the β-sarcoglycan gene (SGCB), 29 of which are missense mutations that generate a full protein with a single residue substitution (Fig. 5). Removal of glycosylation sites in β-sarcoglycan is known to affect the assembly of the complex (Ref. Reference Shi85). The βδ-sarcoglycan core is suggested to link dystrophin through undefined portions of their intracellular tail (Ref. Reference Chen29); five missense mutations reside in the β-sarcoglycan tail and might have either loss-of-function consequences or simply generate processing mutants; two of these (Q11G and G56R) cause LGMD in homozygous patients. In the large extracellular domain, the portion of β-sarcoglycan proximal to the transmembrane domain is thought to be required for the binding to δ-sarcoglycan (Ref. Reference Chen29); nine missense mutations reside in this portion of β-sarcoglycan and may thus perturb the assembly of the sarcoglycan complex. Of these nine mutations, seven cause LGMD in homozygous patients (http://www.lovd.org). Two additional missense mutations reside in apparently noncritical residues of the β-sarcoglycan cysteine-rich domain (N304S and D311N), which is a putative EGF-like sequence, and might not be functionally relevant, even though homozygous N304S causes a mild phenotype.
Mutations in γ-sarcoglycan
Forty mutations are described in the γ-sarcoglycan gene (SGCG), 16 missense mutations generating a full protein with a single residue substitution (Fig. 5) and 24 that generate a truncated protein or no protein at all. Three portions of the γ-sarcoglycan extracellular domain display possible critical functions, two for the assembly with either β-sarcoglycan or α-sarcoglycan, and the putative EGF-like domain. Residues 60–155 in the γ-sarcoglycan portion proximal to the transmembrane domain are thought to be essential for the interaction with α- and β-sarcoglycan (Ref. Reference Chen29). Although the relevant residues for binding to the sarcoglycan partners are not known, it is expected that some of the missense mutations residing in this region compromise the correct assembly of the complex. The homozygous Q82R mutation actually causes severe LGMD and the almost complete absence of the four sarcoglycans from the cell membrane (Ref. Reference Klinge53). However, again, some of these might simply be processing mutants. In CHO cells, mutations producing a truncated γ-sarcoglycan without the cysteines in the EGF-like domain do not permit sarcoglycan complex assembly (Ref. Reference Holt and Campbell82), as demonstrated in muscle samples from patients (Ref. Reference McNally98). By contrast, in two LGMD-2C patients, a homozygous Δ525T mutation generates a truncated γ-sarcoglycan protein without the EGF-like domain, which is apparently able to assemble with the other sarcoglycans (Ref. Reference Crosbie45). The C283Y missense mutation in the γ-sarcoglycan cysteine-rich domain could be functionally relevant, because this cysteine is crucial in the EGF-like domain (Ref. Reference McNally98). In fact, C283Y can cause severe LGMD and it severely perturbs the assembly of α-sarcoglycan to the complex in COS-1 cells (Ref. Reference Shi85). Intriguingly, the L53P mutation in the transmembrane domain causes mild LGMD with the complete absence of the protein but traces of the other sarcoglycans (Ref. Reference Klinge53).
Mutations in δ-sarcoglycan
Fourteen δ-sarcoglycan gene (SGCD) mutations have been so far found to cause LGMD-2F. Eight are missense substitution and all reside in the extracellular portion (Fig. 5). The portion in the extracellular domain between residues 57 and 92 of δ-sarcoglycan (proximal to the transmembrane domain) was shown to be critical for the association with β-sarcoglycan (Ref. Reference Chen29). Three disease-causing mutations are within this portion. Considering the critical role of the βδ-sarcoglycan core in the assembly of sarcoglycans, these substitutions are expected to have functional consequences. Similarly, the portion between residues 92 and 200 is believed to be important for δ- to γ-sarcoglycan interaction (Ref. Reference Chen29). One substitution, R198P, resides in a recently identified highly conserved sequence (L190RLESPTRSL) (Ref. Reference Chen29). As highly conserved sequences are predicted to be functionally important, it is not surprising that homozygous expression causes a DMD-like phenotype (Refs Reference Nigro7, Reference Moreira48). Five δ-sarcoglycan mutations generate a protein with a truncated extracellular domain and cause clinical phenotypes from LGMD to DMD-like in homozygous patients.
To date, there is not sufficient information to precisely predict the number of mutations in the four sarcoglycans that cause loss-of-function consequences or which result in processing mutants. We estimate that about half of LGMD-causing missense mutations in the four sarcoglycans are potentially recoverable to the cell membrane, by preventing their premature degradation through pharmacological interference with the ERAD machinery. This percentage could be even higher if the residues substituted within supposedly functional sarcoglycan regions do not affect that function. To this end, studies in heterologous cell systems could facilitate the identification of critical residues of sarcoglycan functional domains. Heterologous cells could be used in preliminary tests for the identification of appropriate molecules and/or interventions that would favour the recovery of mutant sarcoglycans.
Therapeutic approaches for sarcoglycanopathies
Sarcoglycanopathies are autosomal recessive disorders. The majority of LGMD cases are due to compound heterozygous conditions and only a small percentage result from homozygous mutations. This situation could be advantageous to some extent, because even partial restoration of a functional protein might correct the dystrophic phenotype. Two therapeutic approaches are currently being investigated for delivery of healthy genes into dystrophic muscles: gene and cell therapy (Refs Reference Cossu and Sampaolesi99, Reference Danièle, Richard and Bartoli100, Reference Lim and Rando101). Gene therapy has been carried out in all animal models of sarcoglycanopathies, and is aided by the small size of the cDNA. The most effective gene delivery was obtained with nonpathogenic adeno-associated viruses (reviewed in Ref. Reference Danièle, Richard and Bartoli100). Importantly, in animal models of sarcoglycanopathies, the best results were seen when the vector was injected before the onset of the pathology (Refs Reference Allamand102, Reference Fougerousse103, Reference Xiao104). However, to be effective in patients, gene therapy must overcome critical issues, such as targeting the largest tissue of the body and low distribution of the vector, as well as the immune response (Ref. Reference Lim and Rando101).
Grafting of genetically modified or healthy donor cells is another approach to treat dystrophic muscle. Several stem cell populations exhibit myogenic potential and have been analysed for their ability to correct the dystrophic phenotype. These include muscle-derived satellite cells, bone-marrow-derived mesenchymal stem cells, muscle side-population cells, and cells derived from blood vessel walls, such as mesangioblasts or pericytes (reviewed by Refs Reference Cossu and Sampaolesi99, Reference Price105). Although a pluripotent bone-marrow-derived side population of stem cells failed to deliver δ-sarcoglycan (Ref. Reference Lapidos106), muscle satellite cells have been more successful (Ref. Reference Wallace107). Mesangioblasts injected into muscles of α-sarcoglycan-null mice and dystrophin-deficient dogs show potential for gene delivery (Refs Reference Sampaolesi108, Reference Sampaolesi109). In addition to the difficulty of treating the most abundant tissue of the body, stem cell therapy presents several potential disadvantages, such as the requirement for immunosuppression, difficulty in identification of suitable adult stem cells, and possible diffusion and colonisation efficiencies. Last, but not least, caution must be used in transferring the encouraging results obtained in animal models to humans.
Pharmacological strategies are becoming an important alternative to gene delivery, because drugs can more easily target each tissue of the body. In general, pharmacological therapies can be directed either to circumvent the primary defect or to treat its pathological consequences. Recent evidence indicates that this approach can also be used to facilitate the processing of misfolded but functional protein mutants. The first treatment to produce beneficial effects was the use of corticosteroids in DMD; however, besides the anti-inflammatory effect, the mechanisms of action remain undefined. Another pharmaceutical approach successfully overcame premature stop codons in mutated mRNAs, thus favouring translational read-through (Refs Reference Lim and Rando101, Reference Hermann110). Initially, this approach used aminoglycoside antibiotics, which successfully rescued dystrophin in mdx mice (Ref. Reference Barton-Davis111) and was the object of clinical trials in DMD patients. However, additional work in mdx mice demonstrated divergent results (Ref. Reference Dunant112). Recently, PTC124, a very promising nonaminoglycoside agent, was demonstrated to selectively induce ribosomal read-through of premature stop codons but not the normal stop codon (Refs Reference Du113, Reference Welch114). The use of specific antisense oligonucleotides to induce exon skipping and generate in-frame transcripts translated into functional, albeit smaller, protein could be an additional approach for the therapy of diseases caused by mutations generating premature stop codons. This approach is useful for large proteins possessing many repeated functional units (e.g. dystrophin spectrin-like repeats), some of which can be deleted without dramatic functional consequences (Refs Reference Lim and Rando101, Reference van Ommen, van Deutekom and Aartsma-Rus115). This option is not applicable to sarcoglycanopathies because there are no dispensable domains in the extracellular portion of sarcoglycans and the genomic structure of the sarcoglycan genes prevents any meaningful exon skipping.
Alternative strategies aim to compensate for the muscle mass wasting or to directly cope with the pathological mechanisms in muscular dystrophy (reviewed by Ref. Reference Danièle, Richard and Bartoli100). However, contrasting results were obtained by targeting the activity of regulators of muscle growth. For example, treatment with insulin-like growth factor-1 (IGF-1) improved life expectancy of the δ-sarcoglycan-deficient hamster (Ref. Reference Serose116) and beneficial effects were obtained by treating α-sarcoglycan-deficient mice with a deacetylase inhibitor that induced the expression of the myostatin inhibitor follistatin in satellite cells (Ref. Reference Minetti117). By contrast, administration of inactive myostatin ameliorated the dystrophic phenotype of calpain 3 but not of α-sarcoglycan-null mice (Ref. Reference Bartoli118). Similarly, the antibody-mediated inhibition of myostatin in γ-sarcoglycan-deficient mice was ineffective (Ref. Reference Bogdanovich, McNally and Khurama119). Additional work is thus necessary to explore this important therapeutic option for sarcoglycanopathies.
Possible pharmacological treatment of sarcoglycan-processing mutants
As indicated above, all the sarcoglycan mutants produced by single-residue substitutions can be considered to be processing mutants. Therefore the possibility to intervene in the different steps of the cell quality control system offers great opportunities for therapeutic intervention. Considering that the majority of LGMD patients are compound heterozygotes, it is reasonable to expect that rescue of just one of the two allele products would ameliorate the dystrophic phenotype.
Chemical and pharmacological chaperones can assist processing mutants by stabilising the defective protein. Chemical chaperones create a more favourable environment for protein mutant processing and trafficking. Typical chemical chaperones are small molecules (e.g. glycerol, dimethyl sulfoxide, 4-phenylbutyrate, trimethylamine) that stabilise the folding process of mutant proteins and/or the assembly of multimeric complexes by increasing hydration of the folding mutant (Ref. Reference Loo and Clarke90). However, to be effective, these molecules must be used at very high concentrations, a condition that rarely permits their utilisation in clinical trials.
Numerous molecules have been proposed as pharmacological chaperones, which might be more useful than chemical chaperones because they act in a more specific manner, are less toxic and are utilised at lower levels. An ideal pharmacological chaperone is a molecule that interacts with the functional sites of the mutant, stabilising the protein until it reaches the cell membrane. For example, ligands of cell membrane receptors have been shown to stabilise and facilitate their processing (reviewed by Ref. Reference Loo and Clarke90). β-, γ- and δ-sarcoglycan possess an EGF-like domain in their extracellular portion (Ref. Reference Hack3), although the putative ligand modulating this receptor-like function is unknown. Identification of this factor would reveal a possible crucial function of the sarcoglycan complex and provide a candidate drug to promote the processing of sarcoglycan complex.
As illustrated above, there are additional sites for intervention to promote the processing of potentially functional sarcoglycan mutants. In addition to a general intervention aimed at modification of the ER environment towards more favourable conditions (produced by molecular chaperones that affect oxidative state or calcium levels), other more specific actions can be envisaged. The eight steps indicated in Figure 4 are all suitable targets for pharmaceutical intervention. The retrograde beneficial effects produced by inhibiting the final steps of the proteasome degradation process (Step 8) suggest that intervention in the intermediate steps of the ERAD pathway might also favour a reduction of sarcoglycan mutant disposal, permitting their assembly into a complete tetramer, and trafficking to the cell membrane. Below, we provide a concise description of the ERAD–ubiquitin–proteasome pathway.
The ERAD process can be split into three basic parts: (1) initial recognition of the ERAD substrate; (2) targeting and retrotranslocation of the ERAD substrate from the ER into the cytosol; and (3) ubiquitination and degradation of the ERAD substrate into small peptides within the 20S proteasome core (Fig. 4) (Refs Reference Anelli and Sitia11, Reference Hebert and Molinari12, Reference Vembar and Brodsky120). Sarcoglycans mature in the ER membrane, and are processed by distinct pathways of the quality control machinery, according to whether the defects reside in cytosolic, transmembrane or extracellular (lumenal) domains (Refs Reference Anelli and Sitia11, Reference Vembar and Brodsky120, Reference Vashist and Ng121). Three distinct ERAD pathways are in fact used according to the topology of the misfolded protein. ER-soluble lumenal proteins and integral membrane proteins with misfolded extracellular (lumenal) domains are degraded via the ERAD-L pathway, membrane proteins with cytosolic defects are degraded by the ERAD-C pathway, whereas those with misfolded transmembrane domains are degraded by the ERAD-M pathway. As most of sarcoglycans mutations reside in the extracellular domain, ERAD-L is almost certainly used for their degradation. Since some patients have defects in the intracellular portion of α- and β-sarcoglycan, cytosolic ER-associated components of the ERAD-C pathway could instead be involved in processing of these mutants (Ref. Reference Vembar and Brodsky120).
Exit of misfolded proteins from the calnexin–calreticulin cycle is determined by the ‘inactivity’ of UDP-glucose:glycoprotein glucosyltransferase-1 (UGGT) (Fig. 3). Mannosidase I is a key enzyme in directing misfolded proteins to ERAD; by trimming terminal mannoses, mannosidase I inhibits glucose readdition and thus re-entry into the calnexin–calreticulin cycle. In this initial recognition function, mannosidase I is assisted by a family of mannosidase-like proteins, EDEM1–EDEM3, which operate both as mannosidases and chaperones (Fig. 4, Step 1) (Ref. Reference Hebert and Molinari12). Before directing terminally misfolded protein to retrotranslocation to the cytosol, ERAD substrates are reduced by protein disulfide isomerases (PDIs), such as the oxidoreductase ERp57. This action probably requires the assistance of BiP (GRP78). The ER surveillance apparatus can also detect redundant monomers of unassembled multiprotein complexes, which may be the case for sarcoglycans, causing the degradation of correctly folded proteins, whose recognition as ERAD substrates remains an intriguing process.
Subsequently, a composite lumenal and/or membrane-associated surveillance system targets ERAD substrates for retrotranslocation and degradation. Sarcoglycans defective in the extracellular domain, which represent the vast majority of cases, are probably recognised by the ER lectins OS-9 and XTP3-B and the chaperones BiP and GRP94 (Fig. 4, Step 2) (Ref. Reference Christianson122), and then recruited to a transmembrane complex that operates the retrotranslocation process. This complex is formed by SEL1 and HRD1, with SEL1 facing the lumenal side and HRD1, an E3 ubiquitin ligase, the cytosolic ER side (Ref. Reference Vembar and Brodsky120) (Fig. 4, Step 3). The presence of a ubiquitin ligase (HRD1) in this complex indicates that ubiquitination is strictly associated with translocation of ERAD substrates (Ref. Reference Vembar and Brodsky120). Ubiquitination of proteins is in fact a prerequisite for their degradation. This process is operated by the concerted and sequential action of three families of enzymes: E1 and E2 enzymes that respectively activate and conjugate ubiquitin to E3 enzymes, which eventually ligate ubiquitin to lysine-competent residues in the ERAD substrate (Fig. 4, Step 4). Sec61, which controls cotranslational entry of nascent proteins, is probably the channel protein through which luminal misfolded proteins pass from the ER to the cytosol. In addition, Derlin proteins have been found in the retrotranslocation machinery, and possibly form a pore themselves (Ref. Reference Nakatsukasa and Brodsky123).
A cytosolic complex comprising the ATPase p97, UFD1 and NPL4 is at the crossroads of all three ERAD pathways, with p97 providing the energy for retrotranslocation of monoubiquitinated ERAD substrates (Refs Reference Vembar and Brodsky120, Reference Romish124). The p97–UFD1–NPL4 complex, associated with the ER membrane via Derlin proteins (Fig. 4, Step 5), escorts the polyubiquitinated mutant protein destined to degradation toward the proteasome (Ref. Reference Vembar and Brodsky120). Next, the polyubiquitinated protein is deglycosylated by the cytosolic enzyme peptide:N-glycanase (PNGase) (Fig. 4, Step 6), deubiquitinated (Fig. 4, Step 7) and finally degraded in the 20S proteasome core (Ref. Reference Suzuki125) (Fig. 4, Step 8).
It is important to mention that accumulation of misfolded proteins in the ER lumen may elicit the unfolded protein response (UPR) signal transduction pathway (Ref. Reference Ron and Walter126). The function of the UPR is to quickly remove accumulated aberrant substrates, which are frequently in the form of aggregates, and restore the ER to an efficiently operating maturation compartment. Activation of the UPR causes (1) upregulation of ER-resident protein expression to facilitate the proper folding of misfolded proteins; (2) reduced activity of the secretory pathway by decreasing the expression of secretory cargo; (3) increased degradation of terminally misfolded proteins; and (4) restrained translational initiation until the abnormality has been minimised. However, long-lasting activation of the UPR might have severe consequences for cell survival (Ref. Reference Ron and Walter126). Therefore, prolonged interference with the ERAD pathway should be considered cautiously.
In summary, we propose that misfolded or truncated sarcoglycans are intercepted by the ER quality control system, which sequentially activates ERAD, ubiquitin–proteasome degradation and possibly UPR, to rapidly remove defective proteins and to maintain an efficient ER. Table 2 lists potential molecular targets for therapeutical intervention within the diverse components of the ERAD machinery that could promote the rescue of misfolded sarcoglycan mutants. To this end, it would also be important to determine whether the sarcoglycan complex is specifically targeted in ERAD degradation; the identification of muscle-specific quality control or ERAD factors that participate in the degradation process of sarcoglycan mutants would be of a great help in the development of novel therapeutical approaches. For example, although glycoproteins are bound to calnexin and/or calreticulin, disulphide bond formation and folding are promoted by the recruitment of ERp57, a member of the protein disulphide isomerase family. ERp57 is essential for efficient folding of glycoproteins sharing common structural domains, for instance Type 1 transmembrane proteins, such as α-sarcoglycan, and proteins that possess EGF-like domains, such as β-, γ- and δ-sarcoglycan (Ref. Reference Jessop127). ERp57 or an analogous ER component could thus be involved in the processing of sarcoglycans and would be a possible target for therapy. Interestingly, some antibiotics have been recently described to inhibit oxidoreductase activity of ERp57 (Ref. Reference Gaucci128).
The components indicated in this table are derived from recent ERAD reviews (Refs Reference Anelli and Sitia11, Reference Hebert and Molinari12, Reference Vembar and Brodsky120, Reference Nakatsukasa and Brodsky123); for many components, the specific role has not yet been conclusively established. A partial list of molecules known to interfere with the activity of some ERAD components is also included: Eeyarestatin I (Ref. Reference Wang, Li and Ye136), geldanamycin (Ref. Reference Turnbull130), kifunensine and deoxymannojirimycine (deoxyMR) (Ref. Reference Bartoli13), ribostamycin, inhibitor of PDI chaperone activity (Ref. Reference Horibe139), vancomycin (Ref. Reference Gaucci128), chloroacetamidyl chitobiose (Ref. Reference Hagihara137). Abbreviations: BiP, immunoglobulin heavy chain binding protein (GRP78); DUB, deubiquitinating enzyme; ERAD, ER-associated protein degradation; PDI, protein disulfide isomerase; UGGT, UDP-glucose:glycoprotein glucosyltransferase-1.
Biochemical and gene-silencing approaches have identified many of the ERAD components indicated in Fig. 4, but have also demonstrated that modification of their activity might have substantial consequences on the fate of processing mutants. The first and most widely studied ERAD substrate is the cystic fibrosis transmembrane regulator (CFTR), a cell membrane chloride channel whose mutations are responsible for cystic fibrosis (Refs Reference Loo and Clarke90, Reference Gelman and Kopito129, Reference Turnbull130). ΔF508 is the most frequent mutation of CFTR that causes misfolding of a partially functional protein. Similarly to α-sarcoglycan mutants, proteasome inhibition (Fig. 4, Step 8) permitted the partial rescue of ΔF508 CFTR (Refs Reference Norez131, Reference Vij, Fang and Zeritlin132). Miglustat (N-butyldeoxynojirimycin), an inhibitor of α-1,2 glucosidase (Fig. 3b), effectively corrected the ΔF508 and G622D CFTR mutants in nasal epithelial cells through an action that interferes with the progress of the misfolded protein in the calnexin–calreticulin cycle (Ref. Reference Norez131). Geldanamycin treatment in vitro causes the release of CFTR protein from competent chaperones and the concomitant termination of ubiquitination and formation of a stable protein (Ref. Reference Fuller and Cuthbert133). Derlin-1 (Fig. 4, Step 5) is thought to recognise misfolded nonubiquitinated CFTR, and reduction of its expression led to significantly increased levels of ΔF508 CFTR (Ref. Reference Sun134). The p97 ATPase (Fig. 4, Step 5) is also a potential target for pharmacological therapy of sarcoglycanopathies, because downregulation of p97 partly rescued ΔF508 CFTR (Ref. Reference Vij, Fang and Zeritlin132). However, a similar approach must be considered very cautiously in skeletal muscle, because mutations affecting p97 activity are responsible for inclusion body myopathy, which is characterised by excessive accumulation of ubiquitinated proteins (Ref. Reference Weihl135). Conversely, a novel inhibitor of p97, Eeyarestatin I, was shown to efficiently block degradation of ERAD substrates (Ref. Reference Wang, Li and Ye136). N-glycanase is the sole deglycosylating enzyme directing misfolded proteins to 26S proteasome (Fig. 4, Step 6). Inhibition of N-glycanase, by very low doses of the pan-caspase inhibitor z-VAD-fmk, or by the novel inhibitor chloroacetamidyl chitobiose (Ref. Reference Hagihara137), was shown to prevent deglycosylation and slow down retrotranslocation of misfolded proteins (Ref. Reference Misaghi138). This section and Table 2 briefly summarise some of the diverse approaches used for the treatment of CFTR processing mutants that could also be applied to sarcoglycans.
Finally, it is important to note that proteasome inhibition has been successfully used to rescue the expression and localisation of components of the DGC in skeletal muscle of mdx mice, and DMD and BDM explants, an effect obtained by reducing the elevated dismantling rate of disassembled proteins, which is due to the absence of dystrophin (Refs Reference Bonuccelli140, Reference Bonucelli141, Reference Assereto142). Recently, it has been shown that ε-sarcoglycan mutants that cause myoclonus–dystonia syndrome, are also intercepted by ER quality control and degraded by the ubiquitin–proteasome system (Ref. Reference Esapa143), suggesting that interfering with the ERAD process could be beneficial, even with this syndrome. Last, but not least, recent evidence shows that inhibition of the proteasome with the FDA-approved inhibitor bortezomib (Velcade) successfully rescued the localisation of D97G α-sarcoglycan mutant in muscle explants from an LGMD-2D patient (Ref. Reference Gastaldello14). Additional studies are certainly necessary, but current knowledge offers an exciting perspective for the pharmacological treatment of sarcoglycan processing mutants.
Clinical implications
It is worth reporting that no clinical trials are underway for the pharmaceutical treatment of sarcoglycanopathies. However, recent research shows some promise (Refs Reference Bartoli13, Reference Gastaldello14), particularly with sarcoglycan mutations tightly linked to the ER quality control system. Evidence indicates that interfering with the ERAD machinery may be a relatively safe approach. Apparently, inhibition of mannosidase I with kifunensine does not produce pathological consequences in mice (Ref. Reference Bartoli13). Moreover, inhibition of the proteasome with bortezomib is already in Phase III clinical trials for the treatment of multiple myeloma (http://www.velcade.com). Thus, both drugs have the potential for treating sarcoglycanopathy patients. In this review, we highlight additional components within the ERAD pathway whose activity could be regulated to favour processing of sarcoglycan mutants. Both currently used and novel drugs are emerging as suitable candidates for the pharmaceutical treatment of sarcoglycanopathies, and research should concentrate in ERAD-modulating compounds with the aim of identifying promising therapeutical approaches for future clinical trials.
Outstanding research questions
Research on ERAD pathways and the proteasome system is dynamic, and novel components are continuously being discovered. Research on muscle-specific ERAD components should be encouraged and the discovery of sarcoglycan-specific chaperones and/or enzymes specifically assisting their processing would offer new avenues for future therapy. It is hoped that many molecules will be identified as potential candidates for the treatment of sarcoglycanopathies. To this end, however, preclinical studies are needed to validate their effectiveness and possible unwanted consequences. Preliminary studies in heterologous cell systems will be mandatory to test the pharmacology and toxicity of potential candidates in preparation for clinical trials.
Acknowledgements and funding
R.B. is funded by the Italian National Research Council (C.N.R.) and the Association Francaise contre les Miopathies. D.S. is funded by an Athenaeum Project of the University of Padova. The contribution of the Italian Space Agency is gratefully acknowledged. Comments from the peer reviewers are greatly appreciated.