- Original research
- Open Access
In silico, biologically-inspired modelling of genomic variation generation in surface proteins of Trypanosoma cruzi
© Azuaje et al; licensee BioMed Central Ltd. 2007
- Received: 27 February 2007
- Accepted: 10 July 2007
- Published: 10 July 2007
Protozoan parasites improve the likelihood of invading or adapting to the host through their capacity to present a large repertoire of surface molecules. The understanding of the mechanisms underlying the generation of antigenic diversity is crucial to aid in the development of therapies and the study of evolution. Despite advances driven by molecular biology and genomics, there is a need to gain a deeper understanding of key properties that may facilitate variation generation, models for explaining the role of genomic re-arrangements and the characterisation of surface protein families on the basis of their capacity to generate variation. Computer models may be implemented to explore, visualise and estimate the variation generation capacity of gene families in a dynamic fashion. In this paper we report the dynamic simulation of genomic variation using real T. cruzi coding sequences as inputs to a computational simulation system. The effects of random, multiple-point mutations and gene conversions on genomic variation generation were quantitatively estimated and visualised. Simulations were also implemented to investigate the potential role of pseudogenes as a source of antigenic variation in T. cruzi.
Computational models of variation generation were applied to real coding sequences from surface proteins in T. cruzi: trans-sialidase-like proteins and putative surface protein dispersed gene family-1. In the simulations the sequences self-replicated, mutated and re-arranged during thousands of generations. Simulations were implemented for different mutation rates to estimate the relative robustness of the protein families in the face of DNA multiple-point mutations and sequence re-arrangements. The gene super-families and families showed distinguishing evolutionary responses, which may be used to characterise them on the basis of their capacity to generate variability. The simulations showed that sequences from T. cruzi nuclear genes tend to be relatively more robust against random, multiple-point mutations than those obtained from surface protein genes. Simulations also showed that a gene conversion model may act as an effective variation generation mechanism. Differential variation responses can be used to characterise the sequence groups under study. For example, unlike other families, sequences from the DGF1 family have the capacity to maximise variation at the amino acid level under relatively low mutation rates and through gene conversion. However, in relation to the other protein families, they exhibit more robust behaviour in response to more severe modifications through intra-family genomic sequence exchange. Independent simulations indicate that DGF1 pseudogenes might play a role in the generation of greater genomic variation in the DGF1 gene family through gene conversion under different experimental conditions.
Digital, dynamic simulations may be implemented to characterise gene families on the basis of their capacity to generate variation in the face of genomic perturbations. Such simulations may be useful to explore antigenic variation mechanisms and hypotheses about robustness at the genomic level. This investigation illustrated how sequences derived from surface protein genes and computer simulations can be used to investigate variation generation mechanisms. Such in silico experiments of self-replicating sequences undergoing random mutations and genomic re-arrangements can offer insights into the diversity generation potential of the genes under study. Biologically-inspired simulations may support the study of genomic variation mechanisms in pathogens whose genomes have been recently sequenced.
- Gene Conversion
- Simulation Step
- Variable Amino Acid Sequence
- Surface Protein Gene
- Result Amino Acid Sequence
Trypanosoma cruzi is the etiological agent of Chagas disease, which is an incurable and debilitating illness affecting millions of people in Latin America [1, 2]. Chagas disease also represents a serious health concern for industrially-developed countries. There is a potential for infection in the USA and Europe due to the risk of contamination of the blood supplies. Additionally, HIV/AIDS patients may experience reactivation of Chagas disease. T. cruzi infects and adapts to the vertebrate host by exploiting evolutionary strategies to invade target cells and to evade (or to confuse) the immune system [4, 5]. The invasion, evasion and infection processes involve different families of surface proteins. The generation and presentation of variable surface antigens is a key strategy [5–7]. The parasite may take advantage of this strategy to adhere to different molecules on the host cell membrane and the extracellular matrix.
In general, an understanding of the roles of genetic damage (mutations) and recombination in the generation of antigenic diversity in trypanosomes is important for addressing key questions about the evolution, adaptation and robustness of parasites and their interactions with the hosts. However, the experimental analysis of antigenic diversity generation represents a massive challenge, even in the case of parasites that have traditionally received relatively more international attention than T. cruzi, such as the malaria and sleeping sickness parasites. The availability of gene sequences derived from the T. cruzi Genome Project  further motivates the proposal of data- and discovery-driven approaches to characterising functional properties of proteins at different levels of organisation.
The computer-based, dynamic modelling of biological processes and mechanisms may provide tools to aid researchers in addressing fundamental questions about the genomic basis of evolution, adaptation and complexity. The area of digital genetics, also known as artificial life, offers tools in which self-replicating strands of computer code are able to mutate, compete, evolve and adapt to a computing environment with space and resource constraints. Darwinian evolution has also inspired the development of the area of evolutionary computation, which provides algorithms that mimic the mutation, recombination and selection of data structures based on predetermined "fitness" functions. Evolutionary computation methods, such as genetic algorithms, have been successfully applied to solve optimisation problems in many disciplines. The reader is referred to Adami and to De Jong (see the reference list) for reviews of the areas of digital genetics and evolutionary computation, respectively.
In previous work we investigated the potential capacity of T. cruzi surface protein genes to maximise phenotypic variation, which may be seen as a key attribute to expand the repertoire of surface antigens. The robustness of a parasite gene against mutations was addressed in terms of several gene volatility and diversity indicators. The potential impact of point-mutation errors on surface antigen genes was explored through the analysis of codon usage and its potential for generating different amino acid mutants.
In this paper we propose in silico, yet biologically-inspired, models that dynamically simulate genomic variation generation through random mutations and a relatively simple approximation of recombination. The models and simulations implemented in this exploratory study processed coding sequences from two surface protein super-families: the TS (trans-sialidase)-like proteins and putative surface protein DGF-1 (dispersed gene family-1) super-families [5, 8, 12]. These families are known to be important components in the infection process, but relatively little is known about their functional and adaptive properties. Moreover, the mechanisms underlying their variation generation process are not well-understood.
We propose the implementation of dynamic simulations of mutation and genomic arrangements to estimate the antigenic generation capacity of these families. In this approach sets of gene sequences of biological interest for the study of T. cruzi are used as inputs to a system that recreates mutation and replication mechanisms. The system can also be used to simulate a gene conversion mechanism under different in silico conditions, such as mutation rates and the number of simulation generations. Such simulations allowed us to visualise the generation of variation at the amino acid level using the available sequences (inputs) as starting points, as well as comparative references, in the simulations. Thus, this methodology has the potential to be applied for assessing the effect of mutation and genomic exchange (e.g. gene conversions) in the generation of diversity of surface proteins. Moreover, it may be exploited to characterise gene families in terms of putative models of antigenic variation through mutation and conversion. This investigation explores some of these applications and discusses their potential implications for addressing fundamental questions relevant to the understanding of mechanisms driving genomic diversity.
A variation generation simulation requires DNA coding sequences, a user-defined mutation rate value and the number of simulation steps (self-replication generations) as inputs. The input sequences are used as variation references, i.e. the degree of variation of subsequent mutated sequences is measured in relation to these input (wild) sequences. The variation of a given sequence at a particular time is estimated here by measuring the divergence (distance) between the resulting amino acid sequence (encoded by a mutated DNA coding sequence) and the reference amino acid sequence. Mutations, i.e. random multiple-point mutation and sequence re-arrangements, occur at the DNA level. At each generation step a multiple-point mutation or a gene conversion event occurs, depending on the model being simulated. The degree of variation for each gene sequence is calculated based on the resulting amino acid sequence. An average variation value for each family of genes is calculated at each simulation step. At the end of a simulation the system outputs the average variation values at each simulation step for the sequence family under consideration. The Methods section provides more details about these information processing steps.
Different simulations using genes from the TS and DGF1 super-families were implemented. The TS super-family consists of a large number of genes that encode major surface antigens of the infective forms of T. cruzi. This super-family can be categorised into four groups according to sequence similarity, molecular mass and function [5, 8]. There are 1430 TS sequences in the T. cruzi genome: 737 genes and 693 pseudogenes. In this investigation we concentrated on 261 complete TS proteins whose biological and structural properties had been previously described. The TS super-family is divided into 14 families [5, 11]: ASP-1 (25 sequences), ASP-2 (37 sequences), CEA (17 sequences), CRP-10 (24 sequences), FL-160 (16 sequences), GP82 (19 sequences), GP85 (92 sequences), MVar1-GP90 (101 sequences), SA-85 (98 sequences), SAPA (30 sequences), Tc85-11 (93 sequences), TESA-1 (57 sequences), TS EPI (30 sequences) and TSA (37 sequences). The DGF1 genes [12, 13] investigated here were represented by 85 coding sequences.
It has been suggested that genes encoding proteins involved in maintaining core biological functions, such as house-keeping and nuclear proteins, tend to be genetically robust to mutations (at the protein level). Therefore, simulations were also performed on a set of non-surface sequences to illustrate possible differences between genes from these cellular components. We selected a group of 40 sequences encoding nuclear proteins in T. cruzi to further illustrate observed differences between these families and as a baseline reference for discussions. The potential role of pseudogenes in the generation of genetic variability was also explored by performing independent experiments in which 15 DGF1 pseudogenes and 23 GP85 pseudogenes from the group II of TS superfamily drove random genomic modifications in their respective gene families. Thus, pseudogene sub-sequences provided the potential sources of variation in random gene conversion events. The resulting variation patterns were compared to those obtained from simulations that did not involve the participation of pseudogenes.
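The difference between the intra-family and the pseudogene-driven experiments lies only in where the donor sequence comes from. The following is a minimal sketch of that choice, not the authors' implementation. The function name `choose_donor`, its parameters and the rule that a gene is never its own donor are illustrative assumptions; the text only states that a donor is selected at random.

```python
import random


def choose_donor(recipient_index, family, rng, pseudogenes=None):
    """Pick a donor sequence for one gene conversion event.

    If a pseudogene pool is supplied, the donor is drawn from it;
    otherwise it is drawn from the other members of the same family.
    """
    if pseudogenes:
        return rng.choice(pseudogenes)
    # Assumption: a gene does not donate to itself (that would be a no-op).
    candidates = [s for i, s in enumerate(family) if i != recipient_index]
    return rng.choice(candidates)


if __name__ == "__main__":
    rng = random.Random(0)
    family = ["ATGGCCAAG", "ATGCCCGGG", "ATGTTTAAA"]   # toy sequences
    pseudo = ["ATGTAGAAG"]                              # toy pseudogene pool
    print(choose_donor(0, family, rng))                 # intra-family donor
    print(choose_donor(0, family, rng, pseudogenes=pseudo))
```

Running the same simulation twice, once with and once without the `pseudogenes` argument, gives the two variation curves that the comparison described above relies on.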
A multiple-point mutation model
A variation generation model based on gene conversion
The second variation generation model explored mimics genomic exchange through recombination. This model may be related to a gene conversion model based on synthesis dependent strand annealing (SDSA) homologous recombination. Because unequal recombination mechanisms involving chromatid exchange at interstitial chromosomal regions may alter synteny, which has not been observed in the trypanosomatids studied so far, we envisioned that the recombination mechanism operating in this exchange can be defined as gene conversion via SDSA homologous recombination without crossover.
An exploration of the potential role of pseudogenes in variation generation
Potential biological relevance of models and observations
The study of mechanisms for the generation of genetic diversity is important because such mechanisms are needed for the survival and adaptation of the parasite in the hosts. T. cruzi exploits such a capacity to generate a massive genetic heterogeneity to increase its chances to adapt to different hosts. The study of genomic variation generation may have applications and implications for the development of novel therapies against these parasites. For example, depending on the level of antigenic variation of a potential vaccine target, different strategies (e.g. focus on function domains, focus on the vector-specific surface antigens) may be considered in the absence of multivalent solutions . Thus, highly volatile (or anti-robust) genes may not be suitable drug targets, in comparison to relatively more robust genes. Relatively more robust genes (i.e. less diversity production ability) might represent an even more feasible target if the gene in question is predicted to be essential for the survival of the pathogen.
Previous research has shown that homologous recombination and gene conversions may be a significant mechanism for achieving antigenic variation . Recent investigations have shown the importance of DNA recombination for the generation of genetic variability in protozoan parasites, including African trypanosomes . However, there is a relative lack of research on genes and mechanisms relevant to DNA mutation, recombination and repair in this and other kinetoplastid organisms.
An important feature illustrated by the proposed methodology is the differentiating response capacity of specific gene families in the face of random mutations. Previous research has suggested that mutation rates (and its effects) strongly depend on an evolutionary compromise between a need to create diversity (a basis for adaptive evolution as in the case of surface protein genes) and a need to preserve core or essential cellular functions (as in the case of genes encoding nuclear proteins) . The results obtained here underlined such a principle and indicated distinguishing genetic variability responses between the gene families analysed. Moreover, the simulations showed differential patterns and responses between surface gene families, which may be used to implement other sequence-based characterisations.
Although genetic exchange among distantly related strains of T. cruzi could occur, asexual reproduction is by far the most prevalent way of propagation in this parasite [18, 19]. T. cruzi displays a high level of genetic diversity in the different isolates and even in the cloned cultures [8, 20]. Genomic rearrangement, including single nucleotide replacement, could play a key role in maintaining the genetic heterogeneity of its population. Our results are in agreement with previous experimental results that show the importance of genomic rearrangement in the evolution of T. cruzi multigenic families. For instance, the diversity found in T. cruzi mucin genes of group I (TcMUC I) is due to some point mutations, but mainly to insertions/deletions of complete codons, whereas in the group II (TcMUC) most of the differences are due to non-synonymous point mutations and some small insertions/deletions. These authors suggested that the accumulation of non-synonymous point mutations could be the mechanism involved in the generation of diversity within T. cruzi mucin genes.
Previous experimental investigations have suggested that genetic re-arrangements based on recombination might be relevant to DNA damage repair mechanisms [17, 22]. Thus, DNA recombination or gene conversions may play a role in the reduction of variability. Although our models did not directly address specific aspects of DNA damage and repair through recombination, our results point out interesting quantitative patterns and indicators. For example, our simulations showed that genomic rearrangements by a gene conversion mechanism tend to produce less variable modifications (at the protein sequence level) in comparison to the diversity generated by pure random, multiple-point mutations. This behaviour was observed, for instance, in the DGF1 family in terms of overall average values of genetic variation. Figures 3 and 6 (as well as other simulation result figures) depict that for the same amount of mutated nucleotides (i.e. the same mutation rates), the DGF1 genes generated less variable amino acid sequences through (intra-family) gene conversion than through pure random, multiple-point mutations.
Our simulations indicated that gene conversion might act as an effective variation generation mechanism in the TS family. Our results are in agreement with experimental results on TSA genes (group II of the TS super-family). Comparison of TSA genes revealed that they are analogous, supporting the hypothesis that short segments between the family members are exchanged by gene conversion events. Another critical factor that should be taken into account to interpret the gene conversion simulation results is the genomic diversity that is already present in the gene sequences under study, particularly in the TS family. The potential roles of such a rich diversity should be considered in future modelling studies.
The possibility that sequence length may be an influential factor in the simulations is excluded by the following evidence. First, we ensured that the number of mutating bases in each simulation step is defined in relation to the length of the sequence under consideration. That is, the number of mutations and the variation effects are always estimated relative to each sequence, and not as absolute measures. More importantly, in previous research we did not detect quantitative correlations between sequence length and variation potential in any of the gene families analysed here.
The simulations implemented here also illustrated how the participation of pseudogenes might contribute to an increase in diversity generation through gene conversion. However, our simulations suggested this potential role in the case of the DGF1 family only. Clearer or more significant relationships between the participation of pseudogenes and genomic variation generation through gene conversion were not observed in the case of the GP85 family. Therefore, there is a need to implement additional, more powerful statistical tests to identify possible systematic differences. Previous experimental research has provided evidence that combinations of silent genes may be involved in the generation of diverse surface proteins in trypanosomes. This may be achieved based on partial duplications of pseudogenes or the insertion of sub-sequences from the silent donor into the variable gene. Recombinational processes have been proposed as a key mechanism for diversity generation, in which the rearrangement of sequences may allow the partial expression of pseudogenes. For example, in T. brucei arrangements based on duplicative transposition of pseudogene subsequences have been suggested as an important source of genetic variation. Moreover, the existence of expressed sequences derived from combinations of several donor pseudogenes has been demonstrated. At least in the case of African trypanosomes such pseudogene-based recombinational processes have been suggested as a key mechanism to enhance antigenic diversity of surface protein gene sequences. Further studies (experimental and bioinformatics using these and other protein families) are required to characterise and explain potential roles of T. cruzi pseudogenes in the generation of genetic diversity. For example, the results obtained in the GP85 family analysis might provide a basis for motivating the study of the potential role of pseudogenes in protein family preservation or stability.
Future studies should also include additional, more powerful statistical analyses to test for systematic effects.
Summary of key findings and contributions
The simulations allowed us to assess the relative robustness or variation capacity of the protein families in the face of multiple-point mutations or sequence re-arrangements. Distinguishing evolutionary responses were observed between families, which may be used to characterise them on the basis of their capacity to generate genetic diversity. A group of genes encoding nuclear proteins showed an ability to minimise the phenotypic variation generated by random, multiple-point mutations in comparison to the responses observed in the surface protein genes. Simulations also showed that a gene conversion model can act as an effective variation generation mechanism in the DGF1 family.
This study and research published elsewhere suggest that the variability patterns and responses observed may be useful features to characterise gene groups. For example, in relation to the other families, sequences from the DGF1 family have a greater capacity to maximise variation at the amino acid level under relatively low mutation rates and through gene conversion. However, in relation to the TS families, the DGF1 family exhibited more robust behaviour in response to more severe mutations through gene conversion. This investigation also illustrated and evaluated a mechanism of genetic variation generation at the amino acid level that was driven by pseudogenes. Simulations using DGF1 sequences indicated that pseudogenes might contribute to the generation of variation of genes through gene conversion-based re-arrangements under different experimental conditions.
Thus, although this research does not report conclusive and experimentally-validated results, it proposes digital dynamic simulations as a tool to support the characterisation of gene families on the basis of their capacity to generate variability in the face of genomic perturbations. These simulations require minimum user-defined inputs and relatively low mathematical complexity to describe models and outcomes. These and future in silico experiments of self-replicating sequences undergoing random mutations and genomic re-arrangements can also offer insights into the mechanisms underlying variation generation of the genes under study.
To the best of our knowledge this is the first computational simulation study involving these variation models, sequence data and the proposed simulation approach. A recent study by Lythgoe et al. presents a mathematical model of the ordered appearance of variants in Trypanosoma brucei during infection. Such a model is based on sets of differential equations describing the dynamics of the host-parasite responses. In our study the only critical, user-dependent parameters are the mutation rates, which allow the user to perform different simulations. The patterns and responses shown here do not depend on other analytical factors or computing variables. As part of future work we will make a computing platform-independent, user-friendly tool publicly available.
The proposed method may be applied as an alternative approach to characterising surface protein families on the basis of the visualisation and quantitative estimation of genomic variation. Therefore, we aim to extend the proposed methodology in order to approach such modelling challenges in T. cruzi and other kinetoplastid organisms. Nevertheless, in the long term the scientific value of these and future in silico modelling proposals will greatly depend on the availability of experimentally-obtained data, which should be used to corroborate, modify and refine models and simulations. At the time of completing this revision we have not identified a specific strategy or experimental technique to verify all the results reported here. Regarding a possible approach to confirm the nature of the variability of the TS super-family, the approach implemented by Kahn et al. based on monoclonal antibodies is a feasible solution. However, this type of approach can now be assessed through new high-throughput techniques, such as antibody microarrays, which could tell us how many different TS or DGF-1 members are being expressed at a given time in different T. cruzi populations. Moreover, we expect that these and future results will help us to direct our attention to more specific experimental approaches that may allow us to test these or related hypotheses.
In this investigation we concentrated on 261 complete TS proteins whose functional and structural properties had been previously described. The DGF1 genes investigated here were represented by 85 coding sequences [8, 12]. The TS super-family consisted of 14 families: ASP-1 (25 sequences), ASP-2 (37 sequences), CEA (17 sequences), CRP-10 (24 sequences), FL-160 (16 sequences), GP82 (19 sequences), GP85 (92 sequences), MVar1-GP90 (101 sequences), SA-85 (98 sequences), SAPA (30 sequences), Tc85-11 (93 sequences), TESA-1 (57 sequences), TS EPI (30 sequences) and TSA (37 sequences) [8, 23, 27–39]. Our analyses also incorporated 40 sequences encoding nuclear proteins in T. cruzi to further illustrate observed differences between these families. This group represented a set of sequences described as "nuclear" proteins in the T. cruzi Genome Resource. The variation generation simulation analyses also included 15 DGF1 and 23 GP85 pseudogene groups [8, 12, 13, 39], which represented the donor sources in the intra-family gene conversion model. The sequences used in these analyses were obtained after performing Blastp searches in the GenBank database using sequence probes identified by the authors. The sequences were chosen based on their relationship with gene sequences isolated from different laboratories, and whose functions have been confirmed or demonstrated experimentally. For instance, trans-sialidase genes code for proteins with trans-sialidase enzymatic activity; GP82 and Tc85 have adhesin/binding properties; CRP has complement activity, etc. In the case of the DGF-1 genes used here, their sequences have been validated by comparisons with our own sequences (2 of them) and the original report by Wincker et al. Since the selection of genes was done considering the entire ORF, which is 10,000 nucleotides long, and comparing these three sequences and the ones in the database, it is unlikely that we have mosaics or misassembled DGF1 gene sequences.
Although we cannot rule out the existence of misassembled sequences, such sequences would represent exceptions in the T. cruzi Genome Resource.
Estimation of gene variation
The implemented models quantitatively estimated genetic variation or diversity by calculating the distance between the amino acid sequence resulting from a mutated gene sequence and the corresponding native amino acid sequence encoded by the input gene sequence. Average and overall average variation values were used to summarise variability responses in each family. Distances were calculated by counting the number of different pair-wise amino acids at the same position in the respective sequences. This distance metric is also known as the Hamming distance between two symbol sequences of equal length. It calculates the number of amino acid positions for which the corresponding amino acids differ. In this study each distance was scaled to the length of the sequence under consideration.
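The variation measure described above can be sketched in a few lines. This is a minimal illustration under stated assumptions, not the authors' code: the names `translate` and `variation` are hypothetical, the standard genetic code is assumed, stop codons are kept as `*` so that positions stay aligned, and any trailing incomplete codon is ignored.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (assumed here).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(dna):
    """Translate a coding sequence codon by codon; unknown codons become 'X'."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3          # drop a trailing partial codon
    return "".join(CODON_TABLE.get(dna[i:i + 3], "X") for i in range(0, usable, 3))


def variation(reference_dna, mutated_dna):
    """Hamming distance between the encoded proteins, scaled by protein length."""
    ref, mut = translate(reference_dna), translate(mutated_dna)
    if len(ref) != len(mut):
        raise ValueError("Hamming distance needs sequences of equal length")
    differing = sum(a != b for a, b in zip(ref, mut))
    return differing / len(ref)


if __name__ == "__main__":
    print(variation("ATGGCC", "ATGGCT"))  # synonymous change (Ala -> Ala): 0.0
    print(variation("ATGGCC", "ATGGAC"))  # Ala -> Asp at 1 of 2 positions: 0.5
```

Because the distance is computed on the translated sequences, a synonymous nucleotide change contributes nothing to the variation value, which is what lets the measure separate DNA-level change from protein-level change.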
Figures 1 and 4 depict the two variation generation models implemented. A variation generation simulation requires DNA coding sequences, a user-defined mutation rate value, mr, and the number of simulation steps (self-replication generations) as inputs. In both models mr is proportional to the percentage of nucleotide bases to be mutated. An mr of 0.0005, for example, means that at each simulation step (generation) 0.05% (0.0005 × 100%) of the nucleotides will be randomly selected and mutated. Each nucleotide in the (mutating) sequence has the same probability of being selected, and each nucleotide base has the same probability of being chosen as the substitute. Gene sequences are replicated and mutations are accumulated from generation to generation. In the gene conversion-based model each of the sequences used as inputs to the simulation system undergoes a multiple-point, contiguous mutation whose length is defined by the mutation rate, mr. The starting point of the mutation for a given sequence is randomly selected. This is followed by a random selection of a donor sequence from the same family, which represents the source of mutated nucleotides for the sequence to be re-arranged. The starting point of the sub-sequence to be donated is randomly selected and its length is defined by the mr value. The re-arranged sequence encodes a mutated amino acid sequence, which is compared to the reference sequence (original input sequence) as in the first model. Results reported here were observed in simulations implemented with different mr values and 1000 generations.
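As a rough guide to scale, an mr of 0.0005 applied to a 10,000-nucleotide sequence corresponds to 0.0005 × 10,000 = 5 nucleotides per generation. The two operators can be sketched as follows. This is an illustrative reading of the description, not the authors' implementation: the function names are hypothetical, and rounding the number of affected bases while forcing it to be at least one is an assumption, since the text does not say how fractional counts are handled.

```python
import random

NUCLEOTIDES = "ACGT"


def affected_bases(seq, mr):
    """Number of bases touched in one generation (assumption: rounded, minimum 1)."""
    return min(len(seq), max(1, round(mr * len(seq))))


def multiple_point_mutation(seq, mr, rng):
    """Substitute mr * len(seq) randomly chosen positions with random bases.

    Every position is equally likely to be picked and every base is equally
    likely to be the substitute, so a substitution may leave a base unchanged.
    """
    bases = list(seq)
    for pos in rng.sample(range(len(seq)), affected_bases(seq, mr)):
        bases[pos] = rng.choice(NUCLEOTIDES)
    return "".join(bases)


def gene_conversion(seq, donor, mr, rng):
    """Overwrite a contiguous tract of seq with an equally long tract of donor.

    The start points in the recipient and in the donor are chosen independently.
    """
    length = min(affected_bases(seq, mr), len(donor))
    start = rng.randrange(len(seq) - length + 1)
    donor_start = rng.randrange(len(donor) - length + 1)
    tract = donor[donor_start:donor_start + length]
    return seq[:start] + tract + seq[start + length:]


if __name__ == "__main__":
    rng = random.Random(42)
    recipient, donor = "A" * 40, "C" * 40     # toy sequences, easy to inspect
    print(multiple_point_mutation(recipient, 0.1, rng))
    print(gene_conversion(recipient, donor, 0.1, rng))
```

Both operators preserve sequence length, which keeps the mutated and reference proteins position-aligned for the distance calculation.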
The input sequences (see data description) are used as references to estimate variation, i.e. the degree of variation of subsequent mutated sequences is measured in relation to these input (native) sequences. Mutations, i.e. random multiple-point mutation and sequence re-arrangements, occur at the DNA level. At each generation step a multiple-point mutation or conversion event occurs. Both models were implemented independently. The variation value for each gene sequence is calculated based on the resulting amino acid sequence as explained above. An average variation value for each family of genes was calculated at each simulation step. At the end of a simulation the system outputs the average variation values at each simulation step for the sequence family under consideration.
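The generation loop itself is simple enough to sketch. The version below is a minimal outline under stated assumptions rather than the system used in the study: `run_simulation` is a hypothetical name, and `step` and `variation` are placeholder callables standing for whichever mutation model and distance measure are in use. The toy stand-ins in the demo work on raw nucleotide strings only to keep the example short; the study measures variation on the encoded amino acid sequences.

```python
import random


def run_simulation(references, step, variation, generations, rng):
    """Return the family-average variation after each generation.

    references  : input (native) sequences, kept fixed as the comparison baseline
    step        : callable(population, rng) -> new population (one generation)
    variation   : callable(reference, current) -> value in [0, 1]
    """
    population = list(references)             # generation 0 is the input family
    averages = []
    for _ in range(generations):
        population = step(population, rng)    # mutations accumulate over time
        scores = [variation(ref, cur) for ref, cur in zip(references, population)]
        averages.append(sum(scores) / len(scores))
    return averages


def toy_step(population, rng):
    """Stand-in operator: substitute one random base in every sequence."""
    mutated = []
    for seq in population:
        pos = rng.randrange(len(seq))
        mutated.append(seq[:pos] + rng.choice("ACGT") + seq[pos + 1:])
    return mutated


def toy_variation(reference, current):
    """Stand-in measure: fraction of positions that differ."""
    return sum(a != b for a, b in zip(reference, current)) / len(reference)


if __name__ == "__main__":
    family = ["ATGGCCAAGTTT", "ATGCCCGGGTTT"]  # toy input family
    curve = run_simulation(family, toy_step, toy_variation, 20, random.Random(7))
    print([round(v, 3) for v in curve])
```

Keeping the references separate from the evolving population is what makes each value a distance from the original input rather than from the previous generation.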
This work was supported in part by an international travel grant from the UK Royal Society to FA and JLR, by a grant from the Ministry of Science and Technology of Venezuela to JLR, and by grants from FAPESP and CNPq (Brazil) to JFS. We thank the reviewers for helpful comments and corrections.
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.