搜档网
当前位置:搜档网 › Genome-Wide Analysis of Nucleotide-Binding Site (NBS) Disease Resistance (R) Genes in Sacred Lotus

Genome-Wide Analysis of Nucleotide-Binding Site (NBS) Disease Resistance (R) Genes in Sacred Lotus

Genome-Wide Analysis of Nucleotide-Binding Site (NBS) Disease Resistance (R) Genes in Sacred Lotus
Genome-Wide Analysis of Nucleotide-Binding Site (NBS) Disease Resistance (R) Genes in Sacred Lotus

Genome-Wide Analysis of Nucleotide-Binding Site (NBS)Disease Resistance (R)Genes in Sacred Lotus (Nelumbo

nucifera Gaertn.)Reveals Their Transition Role During Early Evolution of Land Plants

Rui Zong Jia &Ray Ming &Yun J.Zhu

Received:29March 2013/Accepted:6June 2013/Published online:6July 2013#Springer Science+Business Media New York 2013

Abstract Nucleotide-binding site (NBS)containing genes comprise the largest class in identified plant resistance genes.A total of 137NBS class resistance genes were identified from the newly sequenced sacred lotus genome (Nelumbo nucifera Gaertn.)through a reiterative computational sequence analysis.Three distinct groups of NBS-encoding genes were classified:5with Toll/interleukin-1receptor homology (TIR)domain at N-terminal (TIR-NBS [-LRR (leucine-rich repeat)]),37with CC (coiled coil)domain (CC-NBS [-LRR]),and 95with neither TIR nor CC at N-terminal (NBS [-LRR]).Sequence analysis revealed high divergence of NBS-LRR genes in sacred lotus.The result of cluster and syntenic analysis of NBS genes suggested a duplication and recombination event,which is consistent with the correspondent result of whole genome analysis.In addition,we also identified 52NBS genes which have a putative NACHT domain embedded in the NBS domains.This characteristic has only been reported in animals,fungi and bacteria,suggesting that NACHT and NBS domains shared a similar ancient origin;and

sacred lotus NBS (NACHT)genes may represent a transition role during the early evolution of disease resistance in land plants.

Keywords Disease resistance gene .Divergence .NBS-encoding gene .Sacred lotus .Synteny Abbreviations NBS Nucleotide-binding site NB-ARC NB,ARC1,and ARC2

NACHT NAIP,CIITA,HET-E,and TP1TIR Toll/interleukin-1receptor (TIR)

homology domain

CC Coiled coil domain LRR Leucine-rich repeat R Resistance

Introduction

Plant disease resistance (R)genes confer resistance to patho-gens,and play critical roles in plant immune responses.Triggered by pathogen invasion,R gene cognates pathogen avirulence genes (Avr)in a GENE-for-GENE interaction lead-ing to hypersensitive response,termed as programmed cell death (Flor 1971;Chisholm et al.2006).There are two major classes of R genes:the NBS containing genes (McHale et al.2006)and the cell surface pattern recognition receptors (PRR)(Song et al.1995).NBS containing genes comprise the largest class in identified plant R genes.R proteins recognize a con-served pathogen-associated or microbe-associated molecular pattern (PAMP or MAMP),to trigger a series of nonspecific innate immune system responses (Chisholm et al.2006).

Communicated by Paul Arruda

Electronic supplementary material The online version of this article (doi:10.1007/s12042-013-9122-4)contains supplementary material,which is available to authorized users.

R.Z.Jia :Y .J.Zhu (*)

Hawaii Agriculture Research Center,P.O.Box 100,Kunia,HI 96759,USA

e-mail:JZhu@https://www.sodocs.net/doc/198932842.html,

R.Ming

Department of Plant Biology,University of Illinois,Urbana,IL 61801,USA

Y .J.Zhu

Institute of Tropical Bioscience and Biotechnology,China Academy of Tropical Agricultural Sciences,Haikou,Hainan 571101,People ’s Republic of China

Tropical Plant Biol.(2013)6:98–116DOI 10.1007/s12042-013-9122-4

Most of the characterized NBS genes encode an N-terminal variable domain,a central NBS domain,and a C-terminal leucine-rich repeat(LRR)domain.The NBS do-mains contain the three-layered alpha-beta fold and subse-quent short alpha-helical region characteristic of the AAA+ ATPase domain superfamily(van der Biezen and Jones 1998).It is proposed that NBS domains are involved in signaling pathways by binding and hydrolysis of ATP,in-ducing conformational changes of the overall protein,lead-ing to the formation of the apoptosome,and subsequently cell defense(Yan et al.2005).The LRR motif is typically involved in pathogen recognition specificity in protein–pro-tein interactions and ligand binding with pathogen-derived molecules.The well characterized modeling for NBS do-main for plants is a NB-ARC domain(Pfam:PF00931, current version17)(Punta et al.2012).

The NBS family of proteins can be subdivided further based on the presence or absence of an N-terminal Toll/Interleukin-1 Receptor(TIR)motif(Cannon et al.2002;Meyers et al.2003) and most of those proteins lacking a TIR may have a coiled-coil(CC)motif(Pan et al.2000).TIR and its associated signaling pathways may represent the most ancient host de-fense system found in mammals,insects and plants(Kopp and Medzhitov1999).The alpha-helical CC motif is the principal subunit of oligomerization motifs in proteins;a particular coiled-coil domain determines its oligomerization state,rigid-ity and ability to function as a molecular recognition system (Burkhard et al.2001).NACHT domain(named after NAIP, CIITA,HET-E and TP1)as a central domain,like NBS in plant,was reported to be mainly distributed in animals,fungi and bacteria(Koonin and Aravind2000).They work as recep-tors to perceive non-self and modified-self molecules inside host cells and mediate innate immune responses(Ausubel 2005;Ronald and Beutler2010).

Proteins that have homology with the plant NB-ARC-LRR protein function in mammalian defense response which is dom-inated by NACHT-LRR proteins,together called NLR (Maekawa et al.2011).Sharing similar biological function and protein architecture,the NLR family of receptors in plants and animals perceive signals and mediate innate immune responses to microbial pathogens.While in plants NLR typically detects strain-specific pathogen effectors,in animals NLR normally recognizes conserved microbe/damage associated molecular patterns(Maekawa et al.2011).The NB-ARC domain consists of three subdomains:NB,ARC1and ARC2,and highly con-served methionine–histidine–aspartate(MHD)motif at C-terminus of ARC2(van Ooijen et al.2008).The NB-ARC domain is a functional A TPase domain that hydrolyzes A TP in vitro(Tameling et al.2006).An example is the immunity in tomato(Solanum lycopersicum)to Pseudomonas syringae bac-teria by expression of Pto kinase activity and N-myristoylation for signaling via Pto interacting with a unique Prf(NB-ARC-LRR domain protein for recognition)(Mucyn et al.2006).The NACHT domain was used to reconstruct the evolutionary his-tory of vertebrate NACHT-containing proteins(Hughes2006).

There are two different theories for how the NACHT-related and NBS-related resistance proteins are thought to be involved in the immune system,namely convergent evolution (Ausubel2005;Ronald and Beutler2010;Maekawa et al. 2011)and independent-origin evolution(Yue et al.2011). The convergent evolution theory explains the evolution of the innate immune systems of plants and animals as a conse-quence of the biochemical constrains of building sensors for non-self and signaling cascades from a limited number of eukaryotic protein modules in both phyla(Maekawa et al. 2011).Also,it is thought as a consequence of convergent evolution reflecting inherent constraints on how an innate immune system can be constructed(Ausubel2005). Meanwhile,the theory of independent origin and evolution considers that,by counting the usage frequency patterns among all major kingdoms of organisms on the planet,the NACHT domain remains quite conserved from prokaryotes to eukaryotes,while the NBS domains shows considerable flex-ibility in different evolutionary lineages(Y ue et al.2011).

Sacred lotus(Nelumbo nucifera Gaertn.)is known for its agricultural,medicinal and cultural(religious)importance. Sacred lotus is cultivated for its edible rhizomes,seeds and leaves,which have been used as food for over4,000years in Asia.Its seeds exhibit exceptional longevity,as long as1,300-years,and its rhizomes remain healthy for more than50years. The genome-wide survey of defense/disease resistance asso-ciated NBS-encoding genes in sacred lotus was briefly de-scribed in the recently published draft genome sequence (238Mb containing26,685genes)(Ming et al.2013).In this study,a detailed analysis of the predicted NBS-containing resistance genes was carried out to identify a resource for comparative genomics and to characterize the disease resis-tance involved in innate immune system evolution. Results

Identification and Classification of NBS Genes

One hundred and thirty-seven(137)NBS-encoding resistance genes were identified from the draft genome of sacred lotus. Comparing to other sequenced plants,sacred lotus has less NBS genes except papaya(54),spike mosses(38),and moss (61)(Table1).Arabidopsis thaliana has a similar number of predicted total genes as that in sacred lotus while it has almost twice the number of NBS genes(208)than sacred lotus. Grapevine and rice both have535NBS genes,nearly four-times the number in sacred lotus.Poplar has416NBS genes, three-times more than sacred lotus.The variation of the total number of NBS genes in different plants is high,perhaps due to the difference in genome sizes(Richter and Ronald2000)and

the selection pressure of diseases and stresses during their evolution(Michelmore and Meyers1998).

NBS genes have been subdivided into the functionally distinct TIR-domain-containing(TNL)and CC-domain-containing(CNL)subfamilies(McHale et al.2006).Out of 137genes,5genes(4%)were classified as belonging to the TNL subfamily,and37genes(27%)were classified as belong-ing to the CNL subfamily.However,a large number(95also 69%)of NBS genes were without either CC domain or TIR domains and were assigned to“other”non-regular subfamily (Table1).In A.thaliana,all the NBS genes are classified as either TNL or CNL subfamilies,while NBS genes in other plants,including sacred lotus,are classified into TNL or CNL families,and also another subfamily with neither TIR nor CC domains.All monocots have less than0.1%of NBS genes belonging to the TNL subfamily.There were no TNL genes identified from sorghum,maize,or purple false brome,and only3TNL genes(0.06%of total NBS genes)were found in rice.On the other hand,all dicots have more than10%of NBS genes belonging to the TNL subfamily:The percentages of NBS genes belonging to the TNL subfamily are71%,20%, 21%,and13%in Arabidopsis,grapevine,poplar and papaya, respectively.The vascular spikemoss has3%of total NBS genes as TNL,and a non-vascular moss,has16%of total NBS genes as TNL.TNL were reported to be common in dicots and gymnosperms but rare in monocots and magnoliid orders(Tarr and Alexander2009).It is not clear whether the TNL were lost independently or prior to plant divergence.

The remarkable difference in percentages of TNL genes in total NBS genes among dicot and monocot plants led us to further analyze the distribution of genes in the CNL subfamily. The percentage of CNL genes in total NBS genes among dicots ranged from11%to43%and those in monocots ranged from 31%to86%,suggesting that the number of CNL varies among plants.The absence of the CNL subfamily in spike mosses and moss suggested that the CNL subfamily may have diversified during the evolution of vascular plants.The num-bers of non-regular NBS subfamily were also compared.The highest numbers of non-regular NBS were found in spike mosses(92%)and moss(82%),followed by papaya(76%), sacred lotus(69%),rice(68%),and maize(64%).In A. thaliana,there is no non-regular NBS found.The non-regular NBS encoding genes are considered to be highly divergent or simply truncated structures or pseudogenes with an incomplete coding sequence(Zhou et al.2004;Cheng et al.2010). Motif Architectures of Predicted NBS Encoding Genes

A highly conserved NBS motif with the consensus sequence VIxIVGMGGLGKTTLAxxVYN(Fig.1)was identified from NBS encoding genes using MEME analysis(Bailey et al. 2009).This conserved motif was found in119out of the137 NBS genes in sacred lotus,with cut off P-value10-9,and in some genes it appeared multiple times.Genome comparison of all published NBS genes(PFAM,https://www.sodocs.net/doc/198932842.html,, PF00391,v17,12,258sequences)showed the amino acid

Table1The total number of predicted NBS-encoding(R)genes identified in the whole genome of sequenced Angiosperms

Species Groups b Sacred

lotus(Nn)Mouse-ear

cress(At)

Grape

(Vv)

Poplar

(Pt)

Papaya

(Cp)

Rice

(Os)

Sorghum

(Sb)

Maize

(Zm)

Purple false

brome(Bd)

Spike

mosses(Sm)

Moss

(Pp)

Genome(Mb)2381254874853723897302,500272212.5480 No.of genes26,68525,49830,43445,55524,74637,54434,49632,00025,53222,27338,354 NBS-encoding(R)genes

TIR TN12314101300014 TNL49597786000006

Sub-total51471118873000110 CC CN106261427104404821 CNL27552031204160391915700

Sub-total376122913461671435920521 Other N300366218451047016293 NL65015913223320273618647

Sub-total95019519441365131106343550 Total R137208535416545352741652393861

The total number of predicted protein-encoding genes and the genome size of each species are also provided.See references for details

Bold number is highlighted for the number from Sacred lotus(Nn)

a References for each plant genome and NBS-encoding gene annotation was listed as footnote.Nn:Nelumbo nucifera(Ming et al.2013);At: Arabidopsis thaliana(Meyers et al.2003);Vv:Vitis vinifera(Yang et al.2008);Pt:Populus trichocarpa(Yang et al.2008);Cp:Carica papaya (Porter et al.2009);Os:Oryza sativa(Zhou et al.2004);Sb:Sorghum bicolor(Cheng et al.2010);Zm:Zea mays;Bd:Brachypodium distachyon(Tan and Wu2012);Sm:Selaginella moellendorffii;Pp:Physcomitrella patens

b TN:TIR_NB-ARC,TNL:TIR_NB-ARC_LRR,CN:CC_NB_ARC,CNL:CC_NB-ARC_LRR,N:NB-ARC,NL:NB-ARC_LRR

residues to be highly conserved.Glycine (G)residue in relative entropy position 6,9,and 11,and lysine (L)residue at position 12,were highly conserved,consistent with PFAM database.Furthermore,most of the residues in sacred lotus showed a similar preference with other NBS genes in the database.Exceptions to this were relative entropy 2where isoleucine residue (I)had a higher distribution than valine (V),relative entropy 10where lysine (L)was higher than isoleucine (I)and at relative entropy 21,asparagine (A)was preferred in sacred lotus instead of lysine (L).

To verify and classify other motif architectures in NBS-encoding genes,each candidate was evaluated using PFAM database including the CC motif,TIR motif as well as LRR,and other possible disease resistance associated motifs such as RPW8domain (Arabidopsis powdery mildew resistance gene),Pkinase domain,and WKRY domain (Table 2).We identified 10genes with CC-NBS (CN)and 28genes with CC-NBS-LRR (CNL).CNL is the most common architecture among sacred lotus NBS genes.In addition,we were able to find one gene with TIR-NBS (TN),4genes with TIR-NBS-LRR architectures (TNL)retaining 2~5LRR motifs.The TNL with diverged putative LRR motifs were considered to play different roles in plant defense (Wan et al.2012).

In addition,5NBS genes have the RPW8domain;NNU_16329has one copy of RPW8(CC)-NBS-LRR,NNU_006591has two copies of RPW8-NBS-LRR domains,NNU_003698has RPW-NBS domain,and NNU6589and NNU_011785have RPW8-NBS-LRR domain.RPW8is known as Arabidopsis broad-spectrum mildew resistance protein which confers resistance to powdery mildew by inducing localized,salicylic acid-dependent defenses (Xiao et al.2001).Two genes,NNU_016985and NNU_014596,were found to contain a Pkinase domain at the C-terminal as Pkinase-NBS-LRR-LRR.Such a Pkinase-NBS-LRR-LRR architecture has been found in only 12sequences in the PFAM database,all from the wheat family (2from common wheat,8from durum wheat,and 2from goatgrass)(Punta et al.2012).The Pkinase domain was reported to be involved in the NBS gene catalytic function of protein kinases.Tsn1(D7PDA0_9POAL),a wheat disease resistance-like

gene,

C o n t r i b u t i o n

C o n t r i b u t i o

n

1

123451718192021

678910111213

141516A

B

Relative Entropy

Relative Entropy

Partial NBS motif in sacred lotus

4

NB-ARC motif from Pfam database

Relative Entropy

C o n t r i b u t i o n

C

NACHT motif from Pfam database

4

Fig.1Conserved NBS motif from predicted sacred lotus NBS genes.a NBS motif logo revealed via MEME analysis.Contribution was calcu-lated by surveying the percentage of amino acid appearance.Relative entropy is reflected for each position of corresponding sequence.b Partially HMM logo of NB-ARC (PF00931)domain (truncated from original generated by https://www.sodocs.net/doc/198932842.html, ).Up to date,logo of PF00931(version 17)was generated from 12,258sequences.c Partially HMM logo of NACHT (PF05729)domain (truncated from original logo in https://www.sodocs.net/doc/198932842.html, ).Up to date,logo of PF05729(version 7)was generated from 6,525sequences

which has the typical architecture of Pkinase-NBS-LRRs,governs the effector-triggered susceptibility to necrotrophic pathogens (Faris et al.2010).Another domain called FMN-red domain (NADPH-dependent FMN reductase domain)was also found in the C-terminal of one NBS gene (NNU_024486).This novel domain combination was never reported before in the PFAM database.A Blast-P search (NCBI)identified one protein (XP_002281054,disease re-sistance protein RPP13-like from Vitis vinifera )that was highly matched to NNU_024486gene.Further analysis of this NBS gene with NBS-FMN-red architecture is needed to predict its function.One WRKY domain was also discovered

Table 2Predicted motif architecture of sacred lotus NBS-encoding genes and their most similar Genbank entries by

BLAST-P

to be in the T-terminal of NBS gene(NNU_007278).There are a total of9sequences containing the WRKY domain in PFAM database and all of them are from Arabidopsis(6 genes from A.thaliana:WRK52,WRK16,Q0WWA0, C4B7M5,C4B7M6,and E1B328;and3from A.lyrata: D7MKX1,D7MKY8,and D7MUR9).A typical model of NBS-LRR-WRKY architecture was found in the WRK52, also known as a novel structure combining domains in the Arabidopsis disease resistance protein RRS1.In A thaliana, RRS1-R-mediated resistance is partially salicylic acid-and partially NDR1-dependent conferring specific resistance to Ralstonia solanacearum(Deslandes et al.2002).

a Motif architectures of each predicted R gene were distinguished with multi-domains

b E:for E-value of the most similar Genbank entries via BLAST-P

c S:indicate species,Vv:Vitis vinifera,Pt:Populus trichocarpa,Rc:Ricinus communis,Os:Oryza sativa,Mt:Medicago truncatula,Gm:Glycine max,Md:Malus x domestica,Pa:Platanus x acerifolia,Sb:Solanum bulbocastanum,At:Arabidopsis thaliana,Pd:Pinus taeda

Comparison of NBS and NACTH Domains

Fifty-two(52)out of137sacred lotus NBS-encoding genes were found to have the NACTH domain embedded in the NBS domain(Supplementary Table1).Comparison of HMM logo of NACHT and NBS domains(Fig.1b and c) showed a similarity of amino acid patterns in both domains.

A phylogenetic analysis of NBS and NACHT domains among animals and plants revealed independent recruitment in evolution(Yue et al.2011).Since the NBS domain and NACHT sequences from eubacteria and archaebacteria show fundamental differences,Yue et al.(2011)proposed that those two domains either diverged a long time ago or actu-ally originated independently.

In addition to differences in domain structures,NBS-LRR genes from the subclasses also differ considerably in their phy-letic distribution and downstream signaling pathways, suggesting possible functional divergence.Analogous to the case for plants,animal innate immune systems also rely on two groups of receptors,the transmembrane Toll-like receptors (TLRs)and cytosolic Nod-like receptors(NLRs)(Ausubel 2005).TLRs are characterized by an extracellular LRR domain and an intracellular TIR domain,whereas NLRs often dem-onstrate a tripartite domain architecture consisting of a vari-able N-terminal domain,a central NACHT domain and also a C-terminal LRR domain,like NBS-LRR proteins in plants. However,the NACHT’s have been placed in a separate phy-logeny,suggesting different evolutionary histories of these two domains.

In this study,we found52out of the137NBS genes from the sacred lotus genome to have some similarity to the NACHT domain which is completely or partially embedded in the NBS domain(Supplementary Table1).Evidence has suggested that fundamental differences between the NBS and NACHT do-mains existed before the split of eubacteria,archaebacteria and eukaryotes(Y ue et al.2011).While NBS genes are found in most land plants,NACHT are mostly found in animals,fungi and bacteria.Therefore,these two apparently similar domains must have either diverged a long time ago or actually originated independently.It is interesting to explore the distinct role of sacred lotus in the largely unknown evolutionary history for the NBS and NACHT domains.Phylogenetic analysis of NBS domains from plant,and NACHT domains from animals re-vealed that the NBS domains of the spikemoss and sacred lotus were hyper-distributed among the animal and plant kingdoms (Fig.2and Supplementary Figure1).Based on our current finding,we developed a putative hypothesis that the evolution of the NBS genes(also retaining NACHT domain)in spikemoss and sacred lotus may play a transitionary role during the evolu-tion of land plants.By surveying the number of NBS and NACHT among plants,fungi,bacteria and animals,we were able to present the distinct distribution of the NBS(retaining NACHT)genes of sacred lotus(Supplementary Figure1).It was found that they were clustered with domains found not only in plant but also in the animal kingdom.However,current knowl-edge cannot explain the evolutionary role of NBS embedded with NACHT.Our data may suggest that there is a common origin of NBS and NACHT domains in early evolution and those NBS domains retaining a NACHT domain in sacred lotus may indicate a transitionary role or incomplete divergence before separation of the plant and animal kingdoms.

Divergence of the NBS Encoding Genes

The amino acid sequence of NBS-encoding genes in three sub-families;TNL,CNL and NL,were aligned separately to inves-tigate their sequence diversity and phylogenetic relations (Fig.3).We adopted Arabidopsis disease resistance cluster/groups(Meyers et al.2003).Four TNL genes of sacred lotus were clustered with Arabidopsis RPS4disease resistance TNL_B/C group,and one of them NNU_005300was clustered with At4G23440,a TN group(Fig.3a).RPS4is a resistance gene against P.syringae in Arabidopsis corresponding to the avirulence gene avrRps4(Hinsch and Staskawicz1996),and also confers resistance to oomycetes by EDS1-dependent path-way(Aarts et al.1998).All37CNL genes of sacred lotus were grouped into five subgroups according to the Arabidopsis sys-tem;CNL_A,CNL_B/B3(RPS2),CNL_C(RPP13),CNL_D (RPP8),and CNL others(Fig.3b).The role of RPS2in Arabidopsis is specifically to the recognize Pseudomonas syringae strains expressing the avirulence gene avrRpt2 (Kunkel et al.1993).Mutation studies on rps2,ndr1,and Atrar1revealed that RPS2initiates the signaling based upon perception of RIN4disappearance rather than the direct recog-nition of AvrRpt2(Axtell and Staskawicz2003).Three RPS2-like NBS genes in sacred lotus;NNU_007430,NNU003655 and NNU_007362,were identified.In addition,three RPP13-like NBS genes(CNL_C)were found in the sacred lotus ge-nome.RPP13in A.thaliana specifies resistance to downy mildew(Peronospora parasitica)(Bittner-Eddy et al.2000). The RPP13locus contains either a single gene capable of recognizing multiple isolates or a group of tightly linked genes such as pbs2,pad4-1,npr1-1,and rps5-1genes(Bittner-Eddy and Beynon2001).We also identified five CNL_D(RPP8-like) resistance genes in sacred lotus.RPP8in Arabidopsis confers resistance to turnip crinkle virus and oomycete pathogens (Cooley et al.2000).RPP8is induced by pathogens and salicylic acid treatment and WRKY transcription factors regulate the expression of surveillance genes upstream of the defense-signaling cascade(Cooley et al.2000).The remaining13CNL NBS genes were not clustered into any Arabidopsis subgroups. An additional95of N-or NL-type NBS genes were highly diversified,and were grouped into20subgroups(Fig.3c). Sequences in this group were hyper-variable suggesting high divergence of their functions.

Distribution of NBS Genes Across the Sacred Lotus Chromosome

The majority of sacred lotus NBS-encoding genes(80out of 137)were distributed among the nine linkage groups except LG7and LG8.An additional55NBS-encoding genes were scattered in28scaffolds which could not be assigned to linkage groups(Fig.4).The majority(109out of137, 80%)of NBS-encoding genes are members of complex loci with more than2NBS genes located in a single scaffold.Nine (9)NBS genes were clustered in S82on LG9;8NBS genes were clustered in S16on LG1and S70on LG2;5NBS genes were clustered in S196on LG1;4NBS genes were clustered in S39 on LG5;3NBS genes were clustered respectively in S85,S8on LG2,S32on LG9and S13on LG4;finally,2NBS genes were clustered respectively in S121,S119,S15on LG1,in S20,S23, S41on LG3,in S235on LG4,in S4,S21,S60on LG6,in S126 on LG9.A total of80NBS genes were clustered in scaffolds on these linkage groups.In scaffolds which were not mapped to any linkage groups,7NBS genes were clustered in S59and S67,5 NBS genes were cluster in S104,4NBS genes were clustered in S6,S100,3NBS genes were clustered in S138,S322and finally 2NBS genes were found to be clustered in S51,S76,S64,S131. The total NBS genes clustered in this group was49. Superclusters have been reported in poplar(Kohler et al. 2008),with31NBS genes super-clustered on Linkage group XIX,18NBS genes super-clustered on Linkage group II and13 NBS genes super-clustered on Linkage group XI.Clustered NBS genes are also present in B.distachyon(Tan and Wu 2012),rice(Monosi et al.2004;Zhou et al.2004),and papaya (Porter et al.2009).Thus highly tandem repeats with clusters of NBS genes found in sacred lotus(Fig.4)and other crops suggest that these NBS genes arose from whole genome duplication (WGD)events or have been maintained without substantial divergence(Porter et al.2009).

Further analysis revealed18syntenic pairs among the NBS-encoding genes in sacred lotus,supporting the WGD event(Ming et al.2013).Interestingly,among them,two pairs, NNU_022164/NNU_007845and NNU_000216/NNU_01 6337,were not all NBS-coding genes(Supplementary Table2).Sequence analysis showed that NNU_007845and NNU_016337were missing the entire NBS domains and only LRR domains remained,suggesting potential divergence of NBS genes by losing domains after WGD event.

The sacred lotus that holds the world record for seed longevity(200~1,300years),is a primitive angiosperm and one of the oldest plants surviving from the last ice age(Shen-Miller et al.1995,2002).The unique environmental stresses during its growth might engage and curtail the specialization of disease resistance genes in sacred https://www.sodocs.net/doc/198932842.html,parative ge-nome analyses with other genome-sequenced plants indicated that plant genomes usually encode a large number of NBS

(208, 0)

(38, 24)

(61, 32)

Fig.2Overview of the distribution of NBS encoding genes and NACHT encoding genes.a The number of genes containing the NBS and NACHT domains in bacteria,archaea,fungi,animals,and plants, shown as branches of a phylogenetic tree.Red area with number indicates the number of species or genus containing the NB-ARC domain,Green area with number indicates the number of species containing NACHT domain.b The distribution of NBS and NACHT domains in green plants.The phylogenetic tree was re-regenerated from the Taxonomy Browser(https://www.sodocs.net/doc/198932842.html,/taxonomy).The first number in brackets is the total number of NBS encoding genes and the second number is the number of NBS encoding genes that possessed NACHT domains.The numbers are referred to in Table1,except that data from castor bean indicated with“asterisk”,was directly adopted from PFAM database

genes,with a great diversity in the number of NBS-LRR genes as well as in the distribution of subfamilies of NBS-LRR genes.An interesting example is provided by TNL genes, which are absent from grass genomes surveyed to date(Bai et al.2002).Typically,dicot plants contain more TNL(more than13%of NBS genes,Table1),for example Arabidopsis (147or70.7%),Vitis vinifera(111or20.7%),populus(88or 21.2%)and Carica papaya(7or13%)(Meyers et al.2003; Yang et al.2008;Porter et al.2009).With the availability of the sacred lotus genome,analysis of NBS-encoding genes revealed the number of TNL genes(3.5%of total NBS genes) in sacred lotus,falling between dicot plants and grass species. Therefore,it is of particular interest to try to elucidate the origin and evolutionary history of this important gene family and also the evolutionary relationship between TIR-and non-TIR-NBS-LRR genes.Meanwhile,the CC domain was iden-tified among the NBS genes in both dicot and monocot plants with ranges from11.1%in C.papaya to55.8%in Sorghum bicolor.This comparison supports the evolution theory in-volving ancestral polyploidy events with substantial gene expansion and diversification of this gene family in plants (Yue et al.2011).Such substantial gene expansion of this gene family in the land plants was possibly driven by their func-tional specialization in terms of plant immunity(Tan et al. 2007).

Discussion

The draft genome of sacred lotus appears to contain distinct NBS encoding genes(Ming et al.2013).A total of137genes were identified as NBS encoding https://www.sodocs.net/doc/198932842.html,paring the sacred lotus with other plants which have had their genome se-quenced,the number of subfamilies of TIR-NBS-LRR genes was higher than in monocots,but lower than in dicots and most similar to spikemoss,suggesting that the sacred lotus is an ancient plant in terms of https://www.sodocs.net/doc/198932842.html,parison of the sub-families of CC-NBS-LRR genes in sacred lotus and Arabidopsis showed that sacred lotus has the most types of CC-NBS-LRR genes and shows great divergence in the NBS-LRR subgroup.Eighteen(18)syntenic pairs of were found among137NBS encoding genes which points to a WGD event.Additionally,52out of137NBS-encoding genes contained a NACTH domain anchored/embedded in the NBS domain.A unique character of NACHT is that it is only found in disease resistance genes in animals and fungi.Thus our analysis indicates that the NACHT or NBS domains were largely curtailed by environmental stresses and may play a role in the evolution of land plants.Our data may suggest that there is a common origin of NBS and NACHT domains in early evolution and those NBS domains retaining a NACHT domain in sacred lotus may indicate a transitionary role or incomplete divergence before the separation of plant and animal kingdoms.Sacred Lotus Genome Contains NBS Encoding Genes

in Distinct Subgroups

We have characterized the complete set of137NBS encoding genes in the current version of the sacred lotus genome.This number represented~0.5%of all predicted ORFs,the same percentage as that in Arabidopsis(Meyers et al.2003).Based on the gene sequence,protein motif and domain architecture,we classified them into2TNL sub-groups and5CNL subgroups,compared to8and4corre-sponding subgroups in Arabidopsis.Meanwhile,we also classified20subgroups in NL groups showing great diver-gence.Genome-wide analysis previously resulted in416 poplar NBS genes being classified into10subgroups based on the motif architectures,and535grapevine NBS genes being classified into14subgroups(Yang et al.2008).Due to

NNU_015526_(CNL)

NNU_020694_(CNL)

NNU_021746_(CNL)

NNU_025710_(CNL)

NNU_019857_(CNL)

NNU_019863_(CNL)

NNU_022959_(CNL)

NNU_026019_(CNL)

NNU_012692_(CNL)

NNU_012696_(CNL)

NNU_019400_(CNL)

NNU_000216_(CNL)

NNU_026014_(CNL)

AT5G43470_(CNL_D)

NNU_023678_(CNL)

NNU_008770_(CNL)

NNU_011282_(CNL)

NNU_016986_(CNL)

NNU_016988_(CNL)

AT3G46530_(CNL_C)

NNU_018022_(CNL)

NNU_017037_(CNLNLNL)

NNU_018017_(CNL)

NNU_018068_(CNL)

AT4G26090_(CNL_B)

NNU_007430_(CNL)

AT1G61190_(CNL_B3)

NNU_003655_(CNL)

NNU_007362_(CNL)

NNU_016329_(CNL)

AT5G66900_(CNL_A)

NNU_006591_(CNLNL)

99

95

93

65

87

60

54

87

66

60

56

57

61

60

95

0.1

AT5G45260_(TNL_A)

AT5G45050_(TNL_WRKY)

AT4G36140_(TNTNL)

AT5G17890_(TNLX)

AT2G17050_(TNLTX_1)

AT4G19520_(TNLTX)

AT4G12020_(WRKY_TNL)

NNU_006926_(TNL)

NNU_006927_(TNL)

AT5G45250_(TNL_B)

AT1G27170_(TNL_C)

NNU_004665_(TNL)

NNU_011281_(TNL)

AT3G44630_(TNL_G)

AT1G56510_(TNL_H)

AT5G46520_(TNL_F)

AT4G19500_(TNTNL_1)

AT4G16950_(TNL_E)

AT1G72840_(TNL_D)

AT1G72950_(TN)

AT4G23440_(XTNX)

NNU_005300_(TN)

100

98

98

54

95

96

95

76

76

53

67

51

86

99

0.1

TNL_B/C

(RPS4)

TN

CNL

(other)

CNL_D

(RPP8)

CNL_C

(RPP13)

CNL_B/B3

(RPS2)

CNL_A A

B

Fig.3Phylogeny of subfamilies of TNL(a),CNL(b),and N or NL(c). Reference genes of each subgroup of TNL and CNL proteins were adopted from A.thaliana(Meyers et al.2003).Bootstrapped(1,000 replicates)neighbor-joining trees from distance matrices were constructed using the Clustal-W aligned NBS sequences.Numbers less than50%(clustering50out of100times)were omitted due to possible collapse of the branch.Branch lengths proportional to genetic distance are indicated in the scale bar

NNU 024486

XP 002274414 (RPP8-like)NNU 001119NNU 003698NNU 023855

XP 002528440 (RPP8)NNU 007239

XP 003634282 (RGA2-like)NNU 015616NNU 002459NNU 025043NNU 025044NNU 021135NNU 021136NNU 019222NNU 026022NNU 019855NNU 019860NNU 025365

XP 002271203 (RGA4-like)NNU 021753NNU 008143NNU 015503NNU 015544NNU 015580NNU 015581NNU 015582

XP 002297751 (R gene)NNU 004711NNU 025149NNU 007361

XP 003633547 (RPP13-like)NNU 021757NNU 016519NNU 019401NNU 018021NNU 018024NNU 018042NNU 018043NNU 002767NNU 020088NNU 015617NNU 024027NNU 019403NNU 016275NNU 016495NNU 016496NNU 016500NNU 016497NNU 019909NNU 021817NNU 016506NNU 022698NNU 007431NNU 024034NNU 012695NNU 026649NNU 012693NNU 016498NNU 018018NNU 009351NNU 018364NNU 021232NNU 013084NNU 015525NNU 018067NNU 004589

XP 002264704 (hypothetical R protein)NNU 011393NNU 006589NNU 011785NNU 015009NNU 008651NNU 020790NNU 007278NNU 008652NNU 020792NNU 024928NNU 014763NNU 004084NNU 025600NNU 014596NNU 014605NNU 017556NNU 026508NNU 020623NNU 007355NNU 018366

XP 002326566 (R protein)AFP82245(R protein)NNU 016976NNU 016978NNU 016979NNU 016984NNU 016985NNU 008736NNU 019089

XP 002265617 (hypothetical R protein)NNU 024029NNU 024033NNU 024035

XP 002525518 (RPM1-like)NNU 018955NNU 003525NNU 003502NNU 022330

100

100100

100100100

10010099

100

100

10010064100

100

10010097

100

61

100

100

9499

98100

97

99

99

90

99

99

98

979695

92

72

65

96

80

77

63

95

92

95

95

94

92

9191

73645364

71

69

68

5551

51

66

63

63

59

55

71

5889

5184

76

53

83

79

85

100

10099

99

76

65

99

XP 002281054 (hypothetical R protein)

0.1

C

Fig.3(continued)

2CM

S92

S10

S16

S213S145S193S118S55S253S119/S95

S196

S1

S230S158S121LG1

NNU_018955

NNU_005300

NNU_007431NNU_007430NNU_007412NNU_007362NNU_007361NNU_007355NNU_007278NNU_007239

NNU_024486NNU_022330NNU_023855NNU_020623NNU_015009NNU_025365

NNU_019089NNU_020694NNU_024029NNU_024034NNU_024033NNU_024027NNU_024035

NNU_001119NNU_024928

NNU_020790NNU_020792NNU_022698S70S8

S85

NNU_016985NNU_016986NNU_016988NNU_016979NNU_016976NNU_017037NNU_016978NNU_016984

NNU_004589NNU_004711NNU_004665

NNU_018366NNU_018364NNU_018334

LG2

S23

S41

S20

LG3

NNU_009330NNU_009351

NNU_013088NNU_013084NNU_008651NNU_008652

S13/ S0

S235

LG4

NNU_006591NNU_006589NNU_000216

NNU_025043NNU_025044

LG7

S21

S60

S4

LG6

NNU_008770NNU_008736

NNU_015617NNU_015616

NNU_002459NNU_002767S82

S32

S126

LG9

NNU_018017NNU_018024NNU_018042NNU_018068NNU_018067NNU_018022NNU_018021NNU_018018NNU_018043

NNU_011282NNU_011281NNU_011393NNU_021135NNU_021136LG8

S97

S39

LG5

NNU_019222

NNU_012695NNU_012696NNU_012692NNU_012693

A

S15NNU_006926NNU_006927

S6NNU_003698

NNU_003655

NNU_003525

NNU_003502

S7NNU_004084

S51

NNU_014596NNU_014605

S67

NNU_016519NNU_016506

NNU_016500

NNU_016498

NNU_016497NNU_016495NNU_016496

S34

NNU_011785

S52

NNU_014763

S63

NNU_016055

S76NNU_017583

NNU_017556

S109

NNU_020088

S131

NNU_021232

NNU_021230

S136

NNU_021817

S147NNU_022164S165

NNU_022959S180

NNU_023678

S240

NNU_025149

S244

NNU_025239

S277

NNU_025600

S297

NNU_025710S612

NNU_026508

S2697NNU_026649

S322

NNU_026022

NNU_026019NNU_026014S104

NNU_019909NNU_019863NNU_019860NNU_019857NNU_019855

S100

NNU_019404

NNU_019401

NNU_019400NNU_019403

S64

NNU_016329

NNU_016275S59

NNU_015582NNU_015581NNU_015580NNU_015544

NNU_015526NNU_015525

NNU_015503

S18NNU_008143

S138

NNU_021757

NNU_021753NNU_0217461Mb

B

Fig.4Distribution of identified NBS encoding genes across linkage groups (LG).a Each linkage group is shown in Centimorgans (cM);b Individual scaffolds were scaled in Mbp.The corresponding position of each predicted NBS gene was indicated with ascaffold number (Ming et al.2013)

the missing TNL group,519rice NBS genes were classified into7subgroups(Zhou et al.2004),and274sorghum NBS genes,also missing a TNL group,were classified into13 subgroups(Cheng et al.2010).The observed diversity of these NBS-LRR genes indicates the variety of recognition molecules available in detecting disease pathogens.

All dicot plants which have been sequenced so far have more than10%NBS with a TIR domain,and all genome sequenced monocots have less than1%.Arabidopsis is an exception among dicots with a ratio of approximately2:1of TNL and CNL(Meyers et al.2003).Sacred lotus has~4% which is in between the dicot and monocot plants and is similar to moss or spikemoss.Apparently,during the evolu-tion of land plants,TNL subgroups may have less preferen-tially amplified and diversified in cereal plant lineages(Bai et al.2002).Before work on the sacred lotus,researchers attempted to deduce the evolution of TNL and CNL genes and suggested that additional data as to the R gene compo-sition of other plant families was required to infer evolution events(Bai et al.2002;Cannon et al.2002;Meyers et al. 2003).Sacred lotus,an ancient plant species,with its specific feature of R gene composition could help to reveal how TNL and CNL were involved in land plant evolution.A recent study tracing both origin and evolutionary history concluded that TNL probably had an earlier origin than CNL(Yue et al. 2011).However,the molecular roles and evolution status of TNL and CNL remain to be confirmed in future studies. NB-ARC and NACHT Shared a Similar Ancient Origin, and Diverged During Evolution

Sharing a similar biological function and protein architec-ture,the NLR family of receptors in plants and animals,NBS and NACHT,perceive signals and mediate innate immune responses to microbial pathogens.While plant NLR typical-ly detect strain-specific pathogen effectors,animal NLR normally recognize conserved microbe/damage associated molecular patterns(Maekawa et al.2011).The similarity between NACHT and NB-ARC,as well as the R gene architectures suggest that resistance genes in plant and ani-mal systems may have evolved by the convergent evolution theory(Ausubel2005;Maekawa et al.2011).A trace of NACHT motif in sacred lotus NBS domain could lead us to conclude that NACHT and NB-ARC domains shared a similar ancient origin,and this transition status in sacred lotus confirms it as an early evolving land plant.

Genetic Events Shaped the Composition of the NBS Encoding Genes

Genome duplication plays a critical role in complex genetic systems like disease resistance,as a means of creating new loci,altering genes by recombination,or generating repeat sequences(Ronald and Beutler2010).Many NBS encoding genes were reported as clustered gene groups in all plants sequenced to date(Meyers et al.2003;Monosi et al.2004; Zhou et al.2004;Kohler et al.2008;Porter et al.2009;Tan and Wu2012)Including sacred lotus,NBS-LRR loci in plants are found both as isolated genes(singletons)and tightly linked arrays of related genes(Leister2004).Those gene clusters containing NBS-LRR genes from different phylogenetic clades may be driven by selection pressures on allelic variation created by mutations and re-assorted by recombination between alleles and sometimes between dif-ferent gene family members(Hulbert et al.2001).A sug-gested model for gene clusters in the genome is by chance or by certain mechanism to select NBS-LRR genes into clusters (Leister2004).

The duplication and recombination of NBS-LRR genes in sacred lotus also resulted in divergence.It has been sug-gested that the diversification of NBS-LRR genes in Arabidopsis resulted from intra-locus crossovers,sequence exchange,and selection mechanisms(Leister2004).To un-derstand the evolution of NBS-LRR in sacred lotus,the synteny was screened among the NBS-encoding genes,as well as non-NBS-encoding genes.Arabidopsis has the less synteny may resulting from heterogeneous originated dis-tance by chance association rather than by ectopic recombi-nation(Baumgarten et al.2003).Sacred lotus was also found less synteny among the NBS-encoding gene,may indicates those NBS gene different origin.Less selection mechanisms for clusters available in current knowledge,one of them was that different NBS-LRR genes mediated resistance against the same or different pathogen,and such a positive selection to promote the co-segregation to increase the total fitness to the plant(Hulbert et al.2001;Leister2004).

Function of NBS Encoding Genes

A fair amount of biochemical data has been reported to describe the function of the NBS genes in plants(Gassmann et al.1999;Zhang and Gassmann2003,2007;Wirthmueller et al.2007).The observed number of NBS proteins in sacred lotus represents a major part of the spectrum of recognition molecules to detect diverse pathogens.Thirty-one(31)CNL and TNL genes in sacred lotus have uninterrupted full-length ORFs.All full-length TNL genes were subgrouped with Arabidopsis RPS4gene which confers specific resistance to strains of Pseudomonas syringae pv.tomato expressing avrRps4(Gassmann et al.1999).The full length of RPS4is required to mediate disease resistance,indicating regulation at the protein level rather than as regulatory RNAs(Zhang and Gassmann2003).The PRS4accumulation could trigger the EDS1-dependent defense(Wirthmueller et al.2007),as well

as other signaling components,e.g.SGT1and HSP90,in activation of cell death with HR(hypersensitive response) (Zhang et al.2004).Negative regulators such as srfr(suppres-sor of rps4)mediate RPS4-dependent gene-for-gene disease resistance(Kwon et al.2004).Expression of truncated RPS4 proteins in Nicotiana benthamiana induced HR-like cell death in the absence of AvrRps4,suggesting that RPS4is regulated at multiple levels,including gene expression,alternative splic-ing,and protein stability(Zhang and Gassmann2007).RRS1 and RPS4are required for a dual resistance gene system that prevents infection of Colletotrichum higginsianum,R. solanacearum and Pst-avrRps4.The CNL genes in sacred lotus were mostly grouped together with Arabidopsis RPS2,RPP8, and RPP13.RPS2is also an Arabidopsis disease resistance locus specifying the recognition of specific Pseudomonas syringae strains expressing the avirulence gene avrRpt2(Kunkel et al. 1993).RPP8and RPP13confer resistance to downy-mildew via different avirulence determinants in Peronospora parasitica (Bittner-Eddy et al.2000;Mohr et al.2010).

Compared to Arabidopsis,sacred lotus apparently has fewer full-length NBS encoding genes.However,few sacred lotus pathogens/insect pests are known so far. Cercospora nymplaeacea(lotus leaf brown spot disease), Alternaria nelumbii(lotus leaf brown spot disease), Fusarium bulbigenum(lotus rot disease),Colletotrichum gloeosporioides(wilting),Cylindrocladium hawksworthii (leaf spot and rhizome rot),Prodenia litura,and waterlily aphid are common pathogens/insect pests of sacred lotus found in China(Xu1990;Chen et al.2005).Further molec-ular and biological analysis of the genes involved in the interaction of sacred lotus with its pathogens and insect pests would provide some insight as to the functions of the NBS-encoding genes.

Materials and Methods

Sacred Lotus Genome Sequence

The drafted genome of sacred lotus‘China Antique’became available in GenBank under genome ID14095:http:// https://www.sodocs.net/doc/198932842.html,/genome/14095(Ming et al.2013). MAKER(v2.22),SNAP and Augustus were used for gene prediction and annotation of NBS-encoding genes.A total of 26,685protein coding genes were predicted from the sacred lotus genome sequence.

Identification of NBS-Encoding Genes

Methods used in this study to identify the sacred lotus NBS type R genes and analyze motif architecture have been de-scribed by Porter et al.(2009).Hidden Markov Models method(HMMER,v3.0)was used to search for NBS(NB-ARC,Pfam family PF00931domain,up-to-date version17, https://www.sodocs.net/doc/198932842.html,)(Punta et al.2012).Preliminary search with lower stringency(expectation-value cut off of 1.0)to make sure covering all putative NBS proteins.The retrieved candidate proteins were further validated by MEME(Multiple Expectation Maximization for Motif Elicitation,version4.8.1,https://www.sodocs.net/doc/198932842.html,)(Bailey et al.2009),and NBCI Conserved Domain search tool to search against CDD(conserved domain database,v3.05, https://www.sodocs.net/doc/198932842.html,)(Marchler-Bauer et al.2011). Thus the critical validation yields a more stringent threshold (E-value10-6)to filter out false hits.Furthermore,BLASTP (https://www.sodocs.net/doc/198932842.html,)was conducted to confirm the candidate proteins(Altschul et al.1997).

Identification of TIR,CC,LRR and Other Motif Architectures

All candidate NBS genes were further searched manually and automatically against the HMM database(Pfam-A.hmm,v26.0) (Punta et al.2012)to find any potential architecture,such as TIR (Pfam domain module:PF01582,PF13676),LRR(Pfam do-main module:PF00560,PF12799,PF13306,PF13504, PF13516,PF13855,PF07723and PF07725),NACHT(Pfam domain module,PF05729).Additional CC domain analysis was carried out using Paircoil2program,with0.025as cut off value for P-score(McDonnell et al.2006).

Phylogenetic Analysis of NBS Resistance Gene

Full NBS-encoding amino acid sequences,truncated NBS-domains and truncated NACHT domain sequences were aligned separately using CLUSTALW program(Thompson et al.1994).Pairwise distance measurement was taken using PAUP(v4.0)to construct Neighbor-Joining method(NJ) (Saitou and Nei1987)with multiple alignment resampling set bootstrap at1,000(Felsenstein1996).The MEGA(v5.0) was used to view the phylogenetic tree(Tamura et al.2011).

Linkage Group Assignment

Based on the restriction associated DNA sequencing (RADseq)markers and simple sequence repeat(SSR) markers,the sacred lotus genome was mapped into9 linkage groups to anchor the assembled scaffolds(Ming et al.2013).Scaffold and Megascaffold(multi-scaffold) position were identified for each supercontig containing predicted NBS-encoding genes.In accordance with previ-ous genome-wide syntenic analysis(Ming et al.2013),the synteny among the predicted NBS-coding genes was also investigated.

Acknowledgments We thank Dr.Ming-Li Wang at the Hawai‘i Ag-riculture Research Center(HARC)for providing assistance with bioin-formatics and Dr.Heather McCafferty at HARC for critical reviewing of the manuscript.This work was supported partially by a cooperative agreement(No.CA58-5320-3-460)between the U.S.Department of Agriculture-Agricultural Research Service and HARC. References

Aarts N,Metz M,Holub E et al(1998)Different requirements for EDS1 and NDR1by disease resistance genes define at least two R gene-mediated signaling pathways in Arabidopsis.PNAS95:10306–10311

Altschul S,Madden T,Schaffer A et al(1997)Gapped BLAST and PSI-BLAST:a new generation of protein database search programs.

Nucleic Acids Res25:3389–3402

Ausubel FM(2005)Are innate immune signaling pathways in plants and animals conserved?Nat Immunol6:973–979

Axtell MJ,Staskawicz BJ(2003)Initiation of RPS2-specified disease resistance in Arabidopsis is coupled to the AvrRpt2-directed elim-ination of RIN4.Cell112:369–377

Bai J,Pennill LA,Ning J et al(2002)Diversity in nucleotide binding site-leucine-rich repeat genes in cereals.Genome Res12:1871–1884

Bailey TL,Boden M,Buske FA et al(2009)MEME SUITE:tools for motif discovery and searching.Nucleic Acids Res37:W202–W208

Baumgarten A,Cannon S,Spangler R et al(2003)Genome-level evolution of resistance genes in Arabidopsis thaliana.Genetics165:309–319 Bittner-Eddy PD,Beynon JL(2001)The Arabidopsis downy mildew resistance gene,RPP13-Nd,functions independently of NDR1and EDS1and does not require the accumulation of salicylic acid.Mol Plant Microbe Interact14:416–421

Bittner-Eddy PD,Crute IR,Holub EB et al(2000)RPP13is a simple locus in Arabidopsis thaliana for alleles that specify downy mildew resis-tance to different avirulence determinants in Peronospora parasitica.

Plant J21:177–188

Burkhard P,Stetefeld J,Strelkov SV(2001)Coiled coils:a highly versatile protein folding motif.Trends Cell Biol11:82–88 Cannon SB,Zhu H,Baumgarten AM et al(2002)Diversity,distribution, and ancient taxonomic relationships within the TIR and non-TIR NBS-LRR resistance gene subfamilies.J Mol Evol54:548–562 Chen LS,Lin CW,Liu CD et al(2005)Identification of pathogens causing rhizome rot of the East Indian lotus.Plant Prot Bull(Taipei)47 Cheng X,Jiang H,Zhao Y et al(2010)A genomic analysis of disease-resistance genes encoding nucleotide binding sites in Sorghum bicolor.Genet Mol Biol33:292–297

Chisholm ST,Coaker G,Day B et al(2006)Host-microbe interactions: shaping the evolution of the plant immune response.Cell 124:803–814

Cooley MB,Pathirana S,Wu H-J et al(2000)Members of the Arabidopsis HRT/RPP8family of resistance genes confer resistance to both viral and oomycete pathogens.Plant Cell12:663–676

Deslandes L,Olivier J,Theulieres F et al(2002)Resistance to Ralstonia solanacearum in Arabidopsis thaliana is conferred by the reces-sive RRS1-R gene,a member of a novel family of resistance genes.PNAS99:2404–2409

Faris JD,Zhang Z,Lu H et al(2010)A unique wheat disease resistance-like gene governs effector-triggered susceptibility to necrotrophic pathogens.PNAS107:13544–13549

Felsenstein J(1996)Inferring phylogenies from protein sequences by parsimony,distance,and likelihood methods.Methods Enzymol 266:418–427Flor HH(1971)Current status of the gene-for-gene concept.Annu Rev Phytopathol9:275–296

Gassmann W,Hinsch ME,Staskawicz BJ(1999)The Arabidopsis RPS4bacterial-resistance gene is a member of the TIR-NBS-LRR family of disease-resistance genes.Plant J20:265–277 Hinsch M,Staskawicz B(1996)Identification of a new Arabidopsis disease resistance locus,RPs4,and cloning of the corresponding avirulence gene,avrRps4,from Pseudomonas syringae pv.pisi.

Mol Plant Microbe Interact9:55–61

Hughes A(2006)Evolutionary relationships of vertebrate NACHT domain-containing proteins.Immunogenetics58:785–791 Hulbert SH,Webb CA,Smith SM et al(2001)Resistance gene complexes: evolution and utilization.Annu Rev Phytopathol39:285–312 Kohler A,Rinaldi C,Duplessis S et al(2008)Genome-wide identifica-tion of NBS resistance genes in Populus trichocarpa.Plant Mol Biol66:619–636

Koonin EV,Aravind L(2000)The NACHT family—a new group of predicted NTPases implicated in apoptosis and MHC transcription activation.Trends Biochem Sci25:223–224

Kopp EB,Medzhitov R(1999)The Toll-receptor family and control of innate immunity.Curr Opin Immunol11:13–18

Kunkel BN,Bent AF,Dahlbeck D et al(1993)RPS2,an Arabidopsis disease resistance locus specifying recognition of Pseudomonas syringae strains expressing the avirulence gene avrRpt2.Plant Cell 5:865–875

Kwon SI,Koczan JM,Gassmann W(2004)Two Arabidopsis srfr (suppressor of rps4-RLD)mutants exhibit avrRps4-specific dis-ease resistance independent of RPS4.Plant J40:366–375 Leister D(2004)Tandem and segmental gene duplication and recom-bination in the evolution of plant disease resistance genes.Trends Genet20:116–122

Maekawa T,Kufer TA,Schulze-Lefert P(2011)NLR functions in plant and animal immune systems:so far and yet so close.Nat Immunol 12:817–826

Marchler-Bauer A,Lu S,Anderson JB et al(2011)CDD:a Conserved Domain Database for the functional annotation of proteins.

Nucleic Acids Res39:D225–D229

McDonnell A V,Jiang T,Keating AE et al(2006)Paircoil2:improved prediction of coiled coils from sequence.Bioinformatics22:356–358 McHale L,Tan X,Koehl P et al(2006)Plant NBS-LRR proteins: adaptable guards.Genome Biol7:212

Meyers BC,Kozik A,Griego A et al(2003)Genome-wide analysis of NBS-LRR-encoding genes in Arabidopsis.Plant Cell15:809–834 Michelmore RW,Meyers BC(1998)Clusters of resistance genes in plants evolve by divergent selection and a birth-and-death process.

Genome Res8:1113–1130

Ming R,Van Buren R,Liu Y et al(2013)Genome of the long-living sacred lotus(Nelumbo nucifera Gaertn.).Genome Biol14:R41.

doi:10.1186/gb-2013-14-5-r41

Mohr TJ,Mammarella ND,Hoff T et al(2010)The Arabidopsis downy mildew resistance gene RPP8is induced by pathogens and salicylic acid and is regulated by W box cis elements.Mol Plant Microbe Interact23:1303–1315

Monosi B,Wisser R,Pennill L et al(2004)Full-genome analysis of resistance gene homologues in rice.Theor Appl Genet109:1434–1447 Mucyn TS,Clemente A,Andriotis VME et al(2006)The tomato NBARC-LRR protein Prf interacts with Pto kinase in vivo to regulate specific plant immunity.Plant Cell18:2792–2806

Pan Q,Wendel J,Fluhr R(2000)Divergent evolution of plant NBS-LRR resistance gene homologues in dicot and cereal genomes.J Mol Evol50:203–213

Porter B,Paidi M,Ming R et al(2009)Genome-wide analysis of Carica papaya reveals a small NBS resistance gene family.Mol Genet Genomics281:609–626

Punta M,Coggill PC,Eberhardt RY et al(2012)The Pfam protein families database.Nucleic Acids Res40:D290–D301

Richter TE,Ronald PC(2000)The evolution of disease resistance genes.Plant Mol Biol42:195–204

Ronald PC,Beutler B(2010)Plant and animal sensors of conserved microbial signatures.Science330:1061–1064

Saitou N,Nei M(1987)The neighbor-joining method:a new method for reconstructing phylogenetic trees.Mol Biol Evol4:406–425 Shen-Miller J,Mudgett MB,Schopf JW et al(1995)Exceptional seed longevity and robust growth:ancient sacred lotus from China.Am J Bot82:1367–1380

Shen-Miller J,Schopf JW,Harbottle G et al(2002)Long-living lotus: germination and soil Y-irradiation of centuries-old fruits,and cultivation,growth,and phenotypic abnormalities of offspring.

Am J Bot89:236–247

Song WY,Wang GL,Chen LL et al(1995)A receptor kinase-like protein encoded by the rice disease resistance gene,Xa21.

Science270:1804–1806

Tameling WI,V ossen JH,Albrecht M et al(2006)Mutations in the NB-ARC domain of I-2that impair ATP hydrolysis cause autoactivation.

Plant Physiol140:1233–1245

Tamura K,Peterson D,Peterson N et al(2011)MEGA5:molecular evolu-tionary genetics analysis using maximum likelihood,evolutionary dis-tance,and maximum parsimony methods.Mol Biol Evol8:2731–2739 Tan S,Wu S(2012)Genome wide analysis of nucleotide-binding site disease resistance genes in Brachypodium https://www.sodocs.net/doc/198932842.html,p Funct Genomics2012:418208

Tan X,Meyers BC,Kozik A et al(2007)Global expression analysis of nucleotide binding site-leucine rich repeat-encoding and related genes in Arabidopsis.BMC Plant Biol7:56

Tarr DE,Alexander H(2009)TIR-NBS-LRR genes are rare in monocots: evidence from diverse monocot orders.BMC Res Notes2:197 Thompson JD,Higgins DG,Gibson TJ(1994)CLUSTAL W:improv-ing the sensitivity of progressive multiple sequence alignment through sequence weighting,position-specific gap penalties and weight matrix choice.Nucleic Acids Res22:4673–4680

van der Biezen EA,Jones JD(1998)The NB-ARC domain:a novel signalling motif shared by plant resistance gene products and regulators of cell death in animals.Curr Biol8:R226–R228van Ooijen G,Mayr G,Kasiem MMA et al(2008)Structure-function analysis of the NB-ARC domain of plant disease resistance pro-teins.J Exp Bot59:1383–1397

Wan H,Yuan W,Ye Q et al(2012)Analysis of TIR-and non-TIR-NBS-LRR disease resistance gene analogous in pepper:characteriza-tion,genetic variation,functional divergence and expression pat-terns.BMC Genomics13:502

Wirthmueller L,Zhang Y,Jones JD et al(2007)Nuclear accumulation of the Arabidopsis immune receptor RPS4is necessary for trig-gering EDS1-dependent defense.Curr Biol17:2023–2029

Xiao S,Ellwood S,Calis O et al(2001)Broad-spectrum mildew resistance in Arabidopsis thaliana mediated by RPW8.Science 291:118–120

Xu MH(1990)Sacred louts disease and control[in Chinese].China Flower&Penjing6:6

Yan N,Chai J,Lee ES et al(2005)Structure of the CED-4-CED-9 complex provides insights into programmed cell death in Caenorhabditis elegans.Nature437:831–837

Yang S,Zhang X,Yue J-X et al(2008)Recent duplications dominate NBS-encoding gene expansion in two woody species.Mol Gen Genomics280:187–198

Yue JX,Meyers BC,Chen JQ et al(2011)Tracing the origin and evolutionary history of plant nucleotide-binding site-leucine-rich repeat(NBS-LRR)genes.New Phytol193:1049–1063

Zhang X-C,Gassmann W(2003)RPS4-mediated disease resistance re-quires the combined presence of RPS4transcripts with full-length and truncated open reading frames.Plant Cell15:2333–2342 Zhang X-C,Gassmann W(2007)Alternative splicing and mRNA levels of the disease resistance gene RPS4are induced during defense responses.Plant Physiol145:1577–1587

Zhang Y,Dorey S,Swiderski M et al(2004)Expression of RPS4in tobacco induces an AvrRps4-independent HR that requires EDS1, SGT1and HSP90.Plant J40:213–224

Zhou T,Wang Y,Chen J-Q et al(2004)Genome-wide identifica-tion of NBS genes in japonica rice reveals significant expan-sion of divergent non-TIR NBS-LRR genes.Mol Gen Genomics271:402–415

相关主题