Journal of Animal and Veterinary Advances

Year: 2010
Volume: 9
Issue: 13
Page No. 1811 - 1815

Detecting Adaptive Evolution of Galliform and Anseriform Avians Mx Genes

Authors : Y.F. Zhu, H.F. Li, W. Han, J.T. Shu, W.T. Song, X.Y. Zhang and K.W. Chen

Abstract: The objective of this study was to trace the route of avian Mx (myxo-virus resistance) gene evolution and knowledge of its structure and function variations. Mx gene cDNA sequences of six galliform and anseriform avians were used in this study. Datamonkey and DAMBE were used to detect recombination events and test nucleotide substitution saturation. PAML4b were used to test the selective pressure on amino acid sites. Saturation test did not indicate any sign of substitution saturation. Single breakpoint scanning and genetic algorithm scanning by Datamonkey found two breakpoints, located, respectively in 162 and 999 bp and divided the sequences into three nonrecombination split partitions. The selection test of site-specific mode showed that avian Mx gene sequences had suffered positive selection pressure. Likelihood Ratio Test (LTR) suggested M2a and M8 be more advantageous models,and in a total of eight positive sites with >95% posterior probability were identified in three nonrecombination split partitions. These detected positive sites distributed in N-ternimal and GTP-binding domain of Mx protein and might be important candidate marks for improving avian antiviral activity.

How to cite this article:

Y.F. Zhu, H.F. Li, W. Han, J.T. Shu, W.T. Song, X.Y. Zhang and K.W. Chen, 2010. Detecting Adaptive Evolution of Galliform and Anseriform Avians Mx Genes. Journal of Animal and Veterinary Advances, 9: 1811-1815.

INTRODUCTION

Comparison of Synonymous (dS) and Nonsynonymous (dN) substitution rates (ω = dN dS-1 ) in protein encoding genes provided an important means for understanding molecular evolution. The value of ω ratio <1, = 1 and >1, respectively indicated purifying selection, neutral selection and positive selection (Yang et al., 2000a). It had been mainly used to identify adaptive molecular evolution of genes correlated with reproduction and antivirus (Yang et al., 2000b; Sainudiin et al., 2005). Mx proteins, part of the dynamin family of large GTPases induced by IFN-I (Accola et al., 2002) expressed antiviral activity in many species, e.g., human (Aebi et al., 1989), murine (Jin et al., 1999), sheep (Charleston and Stewart, 1993), bovine (Ellinwood et al., 1998) as well as fish (Ooi et al. 2006) viral activity had some difference between species. MxA protein of humans had antiviral activity to Vesicular Stomatitis (VSV) and avian influenza virus but MxB protein had no antiviral activity. Similarly, Mx1 protein of rats conferred antiviral activity to avian influenza virus in vivo or in vitro, Mx2 and Mx3 conferred no activity. In poultry, Ko et al. (2002, 2004) found that the amino acid at position 631 of chicken Mx could determine the antiviral activity to Vesicular Stomatitis (VSV) and avian influenza virus and that cell lines expressing chicken Mx proteins with asparagine at position 631 had higher antiviral activity than those with serine. It was not clear if the antiviral activity of avian Mx has evolved to inhibit a set of species-specific pathogens.

In the present study, Maximum-likelihood models of codon substitution were used to determine the evolutionary pattern of six galliform and anseriform avians Mx genes and identify amino acid sites under diversifying selection. This could help to trace the route of chicken Mx evolution and knowledge of its structure and functions. The identified positive sites could contribute to chicken resistant breedings.

MATERIALS AND METHODS

Sequence data: The full Mx cDNA sequences of six species of anseriform and galliform birds were collected from GenBank. Of which, G. gallus Mx gene sequence sets were composed of RIR (Accession No: EF575635), TB (Accession No: EF575608), GSP line (Accession No: AB088536), BM line (Accession No: AB088535), WLF (Accession No: AB088534), Comtaier (Accession No: AY695797), SK (Accession No: EF575611), BY (Accession No: EF575614), WL (Accession No: EF575622) and WLN (Accession No: AB088533). The other five species were C. coturnix (Accession No: EF575605), M. gallopavo (Accession No: EF575607), P. colchicus(Accession No: EF575606), L. lagopus (Accession No: EF575604)and A. platyrhynchos (Accession No: Z215549).

Statistical analysis: The cDNA sequences were aligned using CLUSTAL W program implemented in MEGA 3.1 (Kumar et al., 2004). The whole level of saturation was determined using the index of substitution saturation, implemented in the DAMBE (http://dambe.bio.uottawa.ca/dambe.asp). Recombination analysis and identification of non-recombinant regions in multiple sequence alignments was done using single breakpoint and Genetic Algorithms (GARD) implemented in Datamonkey (http://www.datamonkey.org/GARD/). Several site-specific models that allow for various dN/dS ratios among sites implemented in Codeml program from PAML4b (http://abacus.gene.ucl.ac.uk/software/paml.html) were used to detect positive selection. The fitness of models was evaluated through Likelihood Ratio Test (LRT). When the likelihood ratio test was significant, the empirical Bayes method (Yang et al., 2005) was used to calculate posterior probability for sites classes.

RESULTS AND DISCUSSION

Substitution saturation test: Before carrying out tests of positive selection, the substitution saturation of Mx gene data sets was examined, as saturation could lead to the underestimation of dS and an inflation of the dN/dS ratio. The results showed that the Index Score (ISS) was significantly lower than the critical score (ISS.C) for the Mx gene data sets (ISS = 0.4659, ISS.C = 0.7850; p< 0.001). Therefore, Mx gene data sets of six anseriform and galliform birds did not indicated any sign of substitution saturation.

Recombination analysis and Positive site identification: Datamonkey implemented a pragmatic approach, Genetic Algorithm Recombination Detection (GARD for short) to find all the recombination breakpoints. Given the maximum number of break points (B this number can also be inferred), the method will search the space of all possible locations for B or fewer break points in the alignment, inferring phylogenies for each putative nonrecombinant fragment and assessed goodness of fit by small sample Akaike Information Criterion (AIC). For Mx gene data sets of six anseriform and galliform birds, two breakpoints,located, respectively in 162 bp and 999 bp were found (Table 1) and the sequences were divided into three nonrecombination split partitions.


Table 1: Screen of recombination breakpoints using GARD method
The locus of breakpoint was accorded to sequences of chicken in alignments (EF575614)

Fig. 1: Breakpoint placement support of Mx gene sequences using c-AIC

Figure 1 showed the breakpoint placement support using c-AIC. The topology of inferred trees based on different split partitions was treated as user-tree in Codeml program to test positive sites.

Positive site identification: A number of models were implemented in codeml program. Model 0 (one ratio) assumeed one ω for all sites. The neutral Model (M1) assumeed a proportion p0 of conserved sites with ω0 = 0 and a proportion p1 of neutral sites with ω1 = 1. The selection Model (M2) added a third class of sites with ω2 estimated from data. The discrete Model (M3) used a discrete distribution with three site classes, estimating the proportions (p0, p1 and p2) and the ω ratios (ω0, ω1 and ω2) from the data. Model M7 (beta) used a beta distribution B (p, q) which depending on parameters p and q can take various shapes in the interval (0, 1). Model M8 (β and ω) added an extra class of sites with ω>1 to the beta (M7) model. Comparison of two different models constituted an Likelihood Ratio Rest (LRT) which was done by comparing the log-likelihood values. If the null hypothesis was correct, twice the log-likelihood difference between the two models (2ΔL) asymptotically had a χ2 distribution. Parameter estimation of the above models for three nonrecombination fragment sequences of Mx gene and results of three LRTs were showed in Table 2-4. About 4, 1 and 3 positive sites were respectively identified in three nonrecombination fragment sequences. Early adaptive evolution studies calculated Synonymous (dS) and Nonsynonymous (dN) substitution rates by averaging over all sites in the protein. However,due to stronger structural and functional constraints in a functional protein, adaptive selection pressure often only acted on a few sites at a few time points (Yang, 1998).


Table 2: Parameters estimation of first nonrecombination sequences and test of positive sites

Table 3: Parameters estimation of second nonrecombination sequences and test of positive sites

Table 4: Parameters estimation of third nonrecombination sequences and test of positive sites
General font sites with posterior probability >95% level, bold font sites with posterior probability >99% level

Thus, such analysis rarely find ω ratio >1 sites or detect positive selection.

In the past twenty years, many stochastic models for studying evolution due to point mutations were reported (Muse and Gaut, 1994; Felsenstein and Churchill, 1996; Savill et al., 2001). These advances led to rapid development of popular phylogeny-based inference methods for detecting adaptive evolution at the level of sequence codons (Nielsen and Yang, 1998; Suzuki and Gojobori, 1999; Kosakovsky and Frost, 2005). Of which, Maximum Likelihood (ML) methods developed by Nielsen and Yang (1998) and Yang et al. (2000a), based on explicit models of codon substitution assuming variable ω ratios among sites had been found to be powerful in detecting adaptive evolution and identifing positive sites (Suzuki and Gojobori, 1999).

Some researchers pointed that the Maximum likelihood methods may suffer high rates of false positives when the analyzed sequences had undergone recombination (Anisimova et al., 2003; Shriner et al., 2003). So, an essential step before analysis was to screen for and quantify evidence of recombination. Many algorithms and software tools analyzing sequence recombination (Archibald and Roger, 2002) rely on a sliding window approach, however, the length of the sliding window could strongly influence recombination inference. Although, several methods based on Markov Chain Monte Carlo were not limited by sliding window, they were too time expensive and could only be used to test small or medium data sets. Sergei et al. (2006) proposed a pragmatic approach-Genetic Algorithm Recombination Detection (GARD) implemented in datamonkey to rapidly screen multiple-sequence alignments for recombination, quantified the level of support for their locations and identified sequences or clades involved in putative recombination events. Based on reanalysis of previously published biological sequence alignments (Posada, 2002; Chare et al., 2003) and several simulation scenarios, it showed that GARD had good power and accuracy to detect recombination and low rates of false positives.

In this study, GARD approach identified two breakpoints in aligments of Mx gene sequences from six species of anseriform and galliform bird with a total of 300.9047 improvement in c-AIC. The alignments of Mx gene were divided into three nonrecombination split partitions. Then site-special models, allowed the ω ratio to vary among sites (Nielsen and Yang, 1998; Yang et al., 2000b) were used to test diversifying selection for each nonrecombination fragments. The results suggested extreme variation in selective pressure among amino acid sites (M0 VS M3, p<0.05) and showed significant positive selection signatures (M1a VS M2a, p<0.05; M7 VS M8, p<0.05, p<0.01) in Mx gene sequences of six anseriform and galliform birds. Hou et al. (2007) used sequence comparisons among several mammals, chicken and duck to detect positively selected sites by conducting phylogenetic analysis, also suggested that Mx gene had strong positive selection in chicken and duck lineages. This was consistent with our results.

In this study, the number of positive sites identified by M2a was less than that of M8 and all were contained in the results of M8. In case of false positive, the positive sites identified by M2a were considered. Especially in the first nonrecombination partition sequences, >90% sites suffered positive selection with ω = 1.97985. In the 2nd and 3rd partitions, the rates were also >30 and 20% with higher ω values. The detected positive sites located in the avian specific N-terminal end of the Mx gene and GTPase domain that was essential for activiral activity of Mx protein.

CONCLUSION

In this study, association had analyzed between those detected positive sites with immune traits in Baier chicken (a Chinese indigenous chicken breeds) and found several to be important candidate marks for improving antiviral activity.

ACKNOWLEDGEMENTS

This research was supported by the Program of National Technological Basis from Ministry of Science and Technology of China (No.2008BADB2B08), the high Technological programme of Jiangsu province, China (No. BG2007323) and National High Technology Research and Development Program of China (No. 2008 AA101009).

Design and power by Medwell Web Development Team. © Medwell Publishing 2024 All Rights Reserved