- Extended genome report
- Open Access
Genome of Russian wheat aphid an economically important cereal aphid
Standards in Genomic Sciencesvolume 12, Article number: 90 (2017)
Although the hemipterans (Aphididae) are comprised of roughly 50,000 extant insect species, only four have sequenced genomes that are publically available, namely Acyrthosiphon pisum (pea aphid), Rhodnius prolixus (Kissing bug), Myzus persicae (Green peach aphid) and Diuraphis noxia (Russian wheat aphid). As a significant proportion of agricultural pests are phloem feeding aphids, it is crucial for sustained global food security that a greater understanding of the genomic and molecular functioning of this family be elucidated. Recently, the genome of US D. noxia biotype US2 was sequenced but its assembly only incorporated ~ 32% of produced reads and contained a surprisingly low gene count when compared to that of the model/first sequenced aphid, A. pisum . To this end, we present here the genomes of two South African Diuraphis noxia (Kurdjumov, Hemiptera: Aphididae) biotypes (SA1 and SAM), obtained after sequencing the genomes of the only two D. noxia biotypes with documented linked genealogy. To better understand overall targets and patterns of heterozygosity, we also sequenced a pooled sample of 9 geographically separated D. noxia populations (MixIX). We assembled a 399 Mb reference genome (PRJNA297165, representing 64% of the projected genome size 623 Mb) using ± 28 Gb of 101 bp paired-end HiSeq2000 reads from the D. noxia biotype SAM, whilst ± 13 Gb 101 bp paired-end HiSeq2000 reads from the D. noxia biotype SA1 were generated to facilitate genomic comparisons between the two biotypes. Sequencing the MixIX sample yielded ±26 Gb 50 bp paired-end SOLiD reads which facilitated SNP detection when compared to the D. noxia biotype SAM assembly. Ab initio gene calling produced a total of 31,885 protein coding genes from the assembled contigs spanning ~ 399 Mb (GCA_001465515.1).
Diuraphis noxia (Kurdjumov), commonly known as the Russian wheat aphid, is an economically important hemipteran pest species (Hemiptera: Aphididae) afflicting wheat and barley yield in dry-land production regions . Diuraphis noxia was first reported as a pest of small grains in South Africa during 1978 . In 1986, D. noxia was detected in the US Texas Panhandle , where after it spread to 16 other states and two Canadian provinces within a few years. In 1988, D. noxia was recorded in Chile, by 1992 in Argentina  and finally spread to Australia in 2016 .The feeding of D. noxia results in foliar damage which include distinct white, yellow, purple or reddish-purple longitudinal streaks (chlorotic streaking), with severe leaf rolling in fully expanded leaves and the inhibition of leaf unfolding of developing leaves. This inability of the leaves to unfold traps the developing spike of the plant (termed “head-trapping”) which results in no seeds being produced [6, 7]. The rolling of leaves has the added unwanted effect of protecting the aphid from harsh environmental conditions (such as insecticide spraying or extreme temperatures) and from natural predators . Overall, D. noxia infested wheat also suffers from stunted growth leading to a lowered biomass and a decrease in the number of tillers produced  thereby greatly affecting yield potential. Seed obtained from D. noxia infested wheat also tend to have lowered protein content and other negative attributes for the flour industry  which only adds to the economic injury of this pest. In D. noxia , it is common for mothers to carry both their daughters and granddaughters, as parthenogenetically produced granddaughter embryos develop directly within daughters, even before their own birth. This process allows for short D. noxia generation times and rapid population growth in favorable environments , but is thought to limit the available diversity possible within D. noxia populations . Since its appearance in South Africa in the late 1970’s, D. noxia has undergone several biotypification events as there are currently five different biotypes recognized in South Africa [11, 12] and eight in the USA . Biotypification, as referenced here, is when an aphid population is able to overcome previously established resistance within wheat . Recently, the genome of the United States Diuraphis noxia biotype US2 was released  with an assembly size of ~395 Mb (296 Mb represented by contigs) and containing 19,097 genes. While the study was able to produce a total of 1.3 Gb of sequence data, it could only incorporate ~32% of this into an assembly comprising ~ 70% of their predicted genome size. A partial assembly due to an under estimation of genome size may explain why their values differ so greatly to that of the closest relative of D. noxia , A. pisum (37, 865 genes and 541 Mb assembly).
Here we present the genomes of the most virulent  South African D. noxia biotype SAM and its progenitor, the least virulent South African D. noxia biotype, SA1 , as well as information on the heterozygosity within geographically separated D. noxia populations. This study forms part of a larger survey encompassing global D. noxia genomic variation.
Classification and features
Diuraphis noxia Kurdjumov (Hemiptera: Aphididae) (Table 1) is a phloem feeding Hemipteran that predominantly feeds on winter wheat and spring barley , with the ability to utilize other grasses as alternate hosts [3, 16]. It is pale green and up to 2 mm long with short and rounded cornicles (Fig. 1). Cornicles are structures limited to aphids on the posterior abdomen and its presence is used to assist in the identification of D. noxia . The cornicles above the cauda give the aphid the appearance of having two tails and it is believed that these structures help aphids with predator defense . Alignments using whole mitochondrial genomes  indicate that the closest relative of D. noxia is Acyrthosiphon pisum (Fig. 2). Reproduction of D. noxia can either be holocyclic (sexually reproducing males and females), as in areas where D. noxia is deemed endemic such as Hungary and Russia [20, 21], or anholocyclic (parthenogenic females), where D. noxia is deemed invasive . Reproduction through asexual means can lead to a fecundity rate of between 3 and 5 aphids per day with an average lifespan of roughly 50 days, of which 9 are spent as nymphs . Both forms of reproduction can lead to two morphological morphs, namely alatae (wingless forms) and apterae (winged form), with the latter form responsible for the wider geographical dispersal of the aphid .
Genome sequencing information
Genome project history
The genome of the most virulent South African D. noxia biotype, SAM, was sequenced, along with that of its less virulent progenitor, biotype SA1, in an attempt to determine the genomic factors responsible for biotypification. With this, a pooled sample comprising of geographically separated D. noxia populations (MixIX) was also sequenced to ascertain the scope of heterogeneity experienced by the species as a whole. The draft genome sequence, as well as that all produced sequences, has been deposited at the NCBI in the GenBank database under ID GCA_001465515.1 and BioProject PRJNA297165. The project information and its association with MIGS version 2.0 compliance are summarized in Table 2 .
Growth conditions and genomic DNA preparation
Insectary reared strains of D. noxia , kept at ± 22 °C and natural lighting, were utilized for all confirmed D. noxia populations’ genomic DNA extractions. Genomic DNA from adult aphids of the South African D. noxia biotypes SAM  and SA1 , and that of the pooled MixIX sample, was used for next generation sequencing (NGS) during the study. The genomic DNA extraction of aphid DNA was conducted as follows; whole aphids were flash frozen in liquid nitrogen, ground and DNA extracted using the Qiagen DNeasy Blood and Tissue kit according to the manufacturer’s protocol . The MixIX sample consisted of 2 μg of genomic DNA for each D. noxia representative, which consisted of three field collected South African D. noxia populations (SA1 < SA2 < SA3 in order of increasing virulence); one field collected Czech D. noxia population; two insectary reared US D. noxia populations (US1 < US2 in order of increasing virulence); one field collected Syrian D. noxia population; and two field collected Argentinian D. noxia populations. The integrity of the extracted DNA was then verified through electrophoresis, making use of a 1.5% agarose gel, and quantified using a Qubit v2.0 fluorometer.
Genome sequencing and assembly
269,657,598 (biotype SAM) and 119,235,662 (biotype SA1) 101 bp reads were obtained from single paired-end libraries constructed with the Illumina TruSeq Nano DNA Library Preparation Kit, with an average 500 bp insert size, that were sequenced on the Illumina HiSeq2000 sequencing platform by Macrogen, Inc. (Seoul, Korea). Whole genomic DNA obtained from the MixIX sample produced 334,866,714 50 bp reads using the SOLiD sequencing platform from a 3–4 Kbp long mate-paired library by SEQOMICS Biotechnológia Kft. (Budapest, Hungary).
Raw sequences obtained from the Illumina HiSeq2000 sequencing of the D. noxia SAM biotype, and from the SOLiD system for the MixIX sample, were trimmed and filtered so that all bases had a minimum Phred score of 20. Reads mapping to Buchnera aphidicola of D. noxia [CP013259.1] and that of the mitochondrion of D. noxia  were removed from further analysis. Optimal k-mer length for the D. noxia biotype SAM assembly was determined using KMERGENIE , while using DSK  to estimate the optimal k-mer frequency cut-off. GCE  was utilized to estimate the genome size of D. noxia through using the optimal k-mer size generated by KMERGENIE and the frequency of the optimal k-mer size as determined by DSK. The D. noxia genome of biotype SAM was assembled using the SOAP de novo software package . After contig assembly, scaffolds were constructed by realignment of useable paired-end reads onto the contig sequences.
Trimmed D. noxia biotype SAM reads were also iteratively mapped (3 iterations), using the Geneious (v7.1.5) software package , against the assembled scaffolds of Acyrthosiphon pisum (Acyr2.0) obtained from ENSEMBL . The consensus sequences obtained from the reference mapping were then compared through the use of the BLASTn  application to the de novo contigs. Any sequences that produced no match through use of BLASTn were added to the contigs obtained from the SAM de novo assembly to build the final draft genome. A total of 190,686 contigs greater than 300 bp in length was produced with an average coverage of 44.8×, representing ~83% of the total reads generated. Using the assembled contigs, BUSCO v1.1  was utilized to assess the completeness of the assembly and found that of the 2675 single-copy orthologues 85% were present, 7% were fragmented and 8% were missing (Fig. 3a). To allow for comparison, analysis using BUSCO was also performed with the scaffolds of the D. noxia biotype RWA2 genome (GCA_001186385.1)  and that of Acyrthosiphon pisum (GCA_000142985.2) (Fig. 3b and c respectively).
Gene prediction was performed using the ab initio gene caller Augustus  using the 36,195 protein coding genes of A. pisum (build v2.1) obtained from ENSEMBL  as a training set. Predicted protein coding genes were then assigned putative identity through the use of the BLASTp and BLASTx applications of the NCBI . Protein coding genes were considered shared if they presented with at least 70% sequence identity over at least 70% of the total protein length. Blast2GO  was used to obtain the putative Gene Ontology (GO)  of the D. noxia protein coding genes predicted by Augustus. KOG  functional categories were assigned to predicted protein coding genes through use of the NCBI’s RPS-BLAST  and Conserved Domain KOG database , with an E-value smaller than 10e-3 accepted as significant. Protein coding genes were analyzed for their amino acid content through use of the Geneious (v7.1.5) platform  and CRISPR sites were predicted using the CRISPR Recognition Tool v1.1 .
Reads obtained from the SOLiD system were then mapped to the predicted protein coding genes of the SAM assembly to facilitate nucleotide variant calling using the Geneious (v7.1.5) software package . The minimal criteria for assigning a single nucleotide polymorphism (SNP) required that the area in question had a mapping coverage of more than ×10, the variant was present in at least 2 sequences, and that the p-value predicted for the SNP should be smaller than 1 × 10−6 (calculated by first averaging the base quality of each base equal to the proposed SNP and averaging the qualities of each base not equal to the proposed SNP).
An EDTA-mediated exudation protocol  was used to collect phloem from uninfested, susceptible Triticum aestivum subsp. aestivum L cultivar Gamtoos-S leaves in triplicate. The exudates were blown to dryness under nitrogen at 55 °C and the residues were reconstituted in 200 ul 1 M (pH 8.0) borate buffer containing internal amino acid standards by the Central Analytical Facilities (CAF), Stellenbosch University. Ten microlitres of each reconstituted sample was derivatized using the Waters AQC derivatization kit. Derivatized amino acids were then separated and detected using a Waters Acquity UPLC fitted with an UltraTag C18 column and a photodiode array detector. Peaks were detected and integrated by the MassLynx software (Waters Corporation).
The genome of female D. noxia consists of 5 holocentric chromosome pairs (4 autosomes and 1 sex or X chromosome) giving it an XX/XO sex determination system . The final assembly totaled 399,704,836 bp which represents ~64% of the predicted genome size of between 593 and 623 Mb obtained through using GCE , the optimal k-mer size (KMERGENIE ) and distribution graphs (DSK ) (Table 3). The assembly GC content was 29.5% and ab initio gene calling, through Augustus , identified 31,885 protein coding genes greater than 32 amino acids in length. The total gene complement represented 66,633,929 bp of the assembly (16.67%) of which 20,316,122 (5.08%) consisted of coding domain sequence. Amino acid usage in the protein coding genes complement of D. noxia (Table 4) indicated that leucine followed by serine are the most frequently used amino acids, while tryptophan was the least frequently occurring amino acid. Of the 31,885 protein coding genes, 27,386 (~ 86%) sequences were putatively identified through BLASTx and BLASTp and only 12,791 (~ 47%) of these had a GO term assigned to them through Blast2GO .
Insights from the genome sequence
With an AT content of 70.5%, D. noxia is the most AT-rich insect genome sequenced to date. This is very similar to its closest aphid relative, A. pisum , which has an AT content of 70.4%. A cursory comparison of the genic complement between D. noxia biotypes SAM and SA1 shows no differences, with SA1 reads mapping to all predicted protein coding genes, and no indication of genomic rearrangements. Genome size estimations, utilizing GCE and k-mer counting, were also inconclusive with both biotypes predicted to have roughly equal genome sizes. The predicted genome size of roughly 623 Mb containing 31,885 protein coding genes is also comparable to that of A. pisum which currently has an assembly size of 542 Mb with 36,195 protein coding genes assigned to it.
In order to assess whether there is a bias for selected amino acids during transcription in the D. noxia genome, we analyzed the frequency of specific amino acids and codon usage (Table 4). From the data it was evident that leucine followed by serine are the most frequently used amino acids in the predicted protein coding genes, while tryptophan was the least frequently occurring amino acid.
With regards to codon usage, leucine codons were used in the following order from most used TTA > TTG > CTT > CTG > CTA > CTC, while in the case of serine they were as follows TCA > TCT > AGT > TCG > AGC > TCC. Codons with low usage include tryptophan (TGG), cysteine (TGT > TGC) and histidine (CAT > CAC). The start codon (methionine, ATG) and stop codons (TAA, TGA and TAG) also occurred as expected at lower frequencies.
When comparing the amino acid usage of D. noxia protein coding genes to that of the free amino acid composition of wheat phloem (Fig. 4), it was interesting to note that of the ten most abundant amino acids present in D. noxia protein coding genes (in order: Leu > Ser > Lys > Ile > Glu > Val > Asn > Thr > Asp > Ala), seven were also most abundant in wheat phloem (i.e., Asp, Glu, Ser, Ile, Thr, Leu and Ala). Previous studies that either utilized an EDTA mediated phloem exudation method and/or stylectomy to investigate wheat phloem reported similar levels in unchallenged wheat plants (Additional file 1: Figure S1) [37, 39]. The apparent organization of D. noxia protein coding genes around the availability of free amino acids within its diet could illustrate the adaptation of the aphid towards its limited host range.
Assigning predicted D. noxia protein coding genes into KOG categories revealed that, out of the 31,885 predicted genes, 13,523 (42.42%) were successfully assigned of which the largest comprised of the general function category R (12.98%) > the signal transduction mechanisms category T (12.26%) > the transcription category K (7.61%) > posttranslational modification, protein turnover and chaperones category O (7.29%) > the intracellular trafficking, secretion, and vesicular transport category U (6.15%) (Table 5; Fig. 5). The large grouping of genes associated with protein modification and turnover is interesting in that it has been shown previously that phloem feeding aphids, despite low levels of heterogeneity, display various levels of virulence towards single host cultivars [40,41,42,43], as is the case for Diuraphis noxia biotypes SA1 and SAM [11, 44]. The basis for this observed variance may include the adaptability of the aphid’s salivary cohort in response to its feeding environment [45, 46] as this is central to the molecular interaction between aphids and their hosts. In a study by Lapitan et al. , where fractionated aphid extracts from different D. noxia biotypes were injected into resistant and susceptible wheat cultivars, it was found that the D. noxia effector(s) modulating aphid-host interactions was proteinaceous in nature and differed between biotypes. Thus D. noxia , as well as other Hemipterans, would require an adaptive and responsive salivary enzyme cohort that is able to adjust for their continually changing feeding environment .
The pooling, and subsequent sequencing, of different D. noxia geographically separated populations was performed to give a clearer indication of the level of variation present overall within the species. The total number of polymorphic sites identified between the predicted protein coding genes of the South African D. noxia biotype SAM assembly and the MixIX sample was 92,125 (Table 6). The majority of these polymorphic sites were either synonymous (19.9%) or resulted in an amino acid substitution (68.4%). Other SNPs resulted in major underlying protein effects such as the introduction of aberrant stop codons leading to truncated transcripts (7.4%), frame shift alterations (2.6%), in-frame insertions (0.6%) and deletions (0.5%) and the extension of transcripts through disrupting stop codons (0.5%). In total, out of the predicted 31,885 protein coding genes 10,934 (34.29%) contained SNPs. The KOG general function category R (1657 genes) was assigned the most genes, followed by the translation, ribosomal structure and biogenesis category T (1434 genes); the replication, recombination and repair category L (1092); the posttranslational modification, protein turnover, chaperones category O (1061); the transcription category K (1025); and the cell cycle control, cell division, chromosome partitioning category D (938) (Fig. 6). Again, genes allocated to the general category of protein modification and turnover features prominently within the top genes containing the most SNPs, especially so when comparing SNPs with an underlying protein effect.
The overall low number of SNPs leading to protein content variation (i.e., insertion and deletion of in-frame codons) was the least represented. This may indicate a conservation of local amino acid identity within proteins. Although SNPs resulting in an amino acid substitution were the highest recorded type of all the SNP effect types, these generally don’t incur significant functional changes. Substitutions involving amino acids possessing similar properties would constrain protein folding and target specificity. Any prediction on the underlying protein effects of these types of SNPs would also require site specific information and corroborating molecular evidence. Truncation SNPs, polymorphisms introducing aberrant stop codons within the coding domain of genes, was the third most prevalent SNP type observed (after synonymous and substitution type SNPs). Arguably, the effect of these types of SNPs can be considered more significant as they have the potential of producing transcripts of varying lengths, possibly altering the molecular action and target affinities of proteins and their underlying complexes. This could potentially afford the aphid with a wider array of “molecular machinery” to adapt to defensive responses from its host.
The genome of the South African Diuraphis noxia biotype SAM was successfully assembled into contigs spanning roughly 400 Mb and predicted to contain 31,885 protein coding genes. A large proportion of predicted genes were assigned to KOG functional categories relating to protein modification and turnover that may help explain the differential adaptability of different D. noxia biotypes towards their host. The overall low variation across the genome of D. noxia is consistent with previous studies that have found limited variation between biotypes [48, 49]. It is though interesting that most of the functional nucleotide variation observed was predominantly present in genes governing protein modification and turnover which in turn is supportive of the adaptability of D. noxia when facing resistance mechanisms from its host.
Basic local alignment search tool
Eukaryotic version of Clusters of Orthologous Groups (or COG)
Reversed Position Specific BLAST
Morrison WP, Peairs FB. Response model concept and economic impact. Response model for an introduced pest—the Russian wheat aphid. Lanham: Ann Entomol Soc Am; 1998.
Walters MC, Penn F, du Toit F, Botha TC, Aalbersberg K, Mewitt PH, Broodryk SW. The Russian wheat aphid. Farming in South Africa. Leaflet Series. Wheat G3: 1-6. South Africa: Department of Agriculture; 1980.
Stoetzel MB. Information on and identification of Diuraphis noxia (Homoptera: Aphididae) and other aphid species colonizing leaves of wheat and barley in the United States. J Econ Entomol. 1987;80(3):696–704.
Clua AR, Castro AM, Ramos SI, Gimenez DO, Vasicek AR, Chidichimo HO, Dixon AF. The biological characteristics and distribution of the greenbug, Schizaphis graminum, and Russian wheat aphid, Diuraphis noxia (Hemiptera: Aphididae), in Argentina and Chile. Eur. J. Entomol. 2004;101(1):193–8.
International Plant Protection Convention: Detection of Russian wheat aphid (Diuraphis noxia) in South Australia and Victoria. https://www.ippc.int/en/countries/australia/pestreports/2016/06/detection-of-russian-wheat-aphid-diuraphis-noxia-in-south-australia-and-victoria/. Accessed 30 Oct 2016.
Burd JD, Burton RL. Characterization of plant damage caused by Russian wheat aphid (Homoptera: Aphididae). J Econ Entomol. 1992;85(5):2017–22.
Burd JD, Burton RL, Webster JA. Evaluation of Russian wheat aphid (Homoptera: Aphididae) damage on resistant and susceptible hosts with comparisons of damage ratings to quantitative plant measurements. J Econ Entomol. 1993;86(3):974–80.
Botha AM. A coevolutionary conundrum: the arms race between Diuraphis noxia (Kurdjumov) a specialist pest and its host Triticum aestivum (L.). Arthropod Plant Interact. 2013;7(4):359–72.
Girma M, Wilde GE, Harvey TL. Russian wheat aphid (Homoptera: Aphididae) affects yield and quality of wheat. J Econ Entomol. 1993;86(2):594–601.
Shufran KA, Kirkman LR, Puterka GJ. Absence of mitochondrial DNA sequence variation in Russian wheat aphid (Hemiptera: Aphididae) populations consistent with a single introduction into the United States. J Kans Entomol Soc. 2007;80(4):319–26.
Botha AM, Burger NF, Van Eck L. Hypervirulent Diuraphis noxia (Hemiptera: Aphididae) biotype SAM avoids triggering defenses in its host (Triticum aestivum)(Poales: Poaceae) during feeding. Environ Entomol. 2014;43(3):672–81.
Jankielsohn A. Changes in the Russian Wheat Aphid (Hemiptera: Aphididae) biotype complex in South Africa. J Econ Entomol. 2016;109(2):907-12.
Burd JD, Porter DR, Puterka GJ, Haley SD, Peairs FB. Biotypic variation among north American Russian wheat aphid (Homoptera: Aphididae) populations. J Econ Entomol. 2006;99(5):1862–6.
Smith CM, Liu X, Wang LJ, Liu X, Chen MS, Starkey S, Bai J. Aphid feeding activates expression of a transcriptome of oxylipin-based defense signals in wheat involved in resistance to herbivory. J Chem Ecol. 2010;36(3):260–76.
Nicholson SJ, Nickerson ML, Dean M, Song Y, Hoyt PR, Rhee H, Kim C, Puterka GJ. The genome of Diuraphis noxia, a global aphid pest of small grains. BMC Genomics. 2015;16(1):1.
Jankielsohn A. Distribution and diversity of Russian wheat aphid (Hemiptera: Aphididae) biotypes in South Africa and Lesotho. J Econ Entomol. 2011;104(5):1736–41.
Brewer MJ, Elliott NC. Biological control of cereal aphids in North America and mediating effects of host plant and habitat manipulations. Annu Rev Entomol. 2004;49(1):219–42.
Kovalev OV, Poprawski TJ, Stekolshchikov AV, Vereshchagina AB, Gandrabur SA. Diuraphis Aizenberg (Hom., Aphididae): key to apterous viviparous females, and review of Russian language literature on the natural history of Diuraphis noxia (Kurdjumov, 1913). J Appl Entomol. 1991;112(1–5):425–36.
De Jager L, Burger NF, Botha AM. Complete mitochondrial genome of Diuraphis noxia (Hemiptera: Aphididae) from nine populations, SNP variation between populations, and comparison with other Aphididae species. Afr Entomol. 2014;22(4):847–62.
Basky Z, Jordaan J. Comparison of the development and fecundity of Russian wheat aphid (Homoptera: Aphididae) in South Africa and Hungary. J Econ Entomol. 1997;90(2):623–7.
Starý P, Basky Z, Tanigoshi LK, Tomanovicć Z. Distribution and history of Russian wheat aphid, Diuraphis noxia (Kurdj.) in the Carpathian Basin (Hom., Aphididae). Anzeiger für Schädlingskunde. 2003;76(1):17–21.
Field D, Garrity G, Gray T, Morrison N, Selengut J, Sterk P, Tatusova T, Thomson N, Allen MJ, Angiuoli SV, Ashburner M. The minimum information about a genome sequence (MIGS) specification. Nat Biotechnol. 2008;26(5):541–7.
Chikhi R, Medvedev P. Informed and automated k-mer size selection for genome assembly. Bioinformatics. 2013:btt310. http://kmergenie.bx.psu.edu. Accessed 18 Jul 2015.
Rizk G, Lavenier D, Chikhi R. DSK: k-mer counting with very low memory usage. Bioinformatics. 2013:btt020. http://minia.genouest.org/dsk/. Accessed 18 Jul 2015.
Binghang L, Yujian S, Jianying Y, Xuesong H, Hao Z, Nan L, Zhenyu L, Yanxiang C, Desheng M, Wei F. Estimation of genomic characteristics by analyzing k-mer frequency in de novo genome projects. arXiv preprint arXiv:1308.2012. 2012.
Li R, Fan W, Tian G, Zhu H, He L, Cai J, Huang Q, Cai Q, Li B, Bai Y, Zhang Z. The sequence and de novo assembly of the giant panda genome. Nature. 2010;463(7279):311–7. http://soap.genomics.org.cn. Accessed 7 Nov 2013
Kearse M, Moir R, Wilson A, Stones-Havas S, Cheung M, Sturrock S, Buxton S, Cooper A, Markowitz S, Duran C, Thierer T. Geneious basic: an integrated and extendable desktop software platform for the organization and analysis of sequence data. Bioinformatics. 2012;28(12):1647–9.
Kersey PJ, Lawson D, Birney E, Derwent PS, Haimel M, Herrero J, Keenan S, Kerhornou A, Koscielny G, Kähäri A, Kinsella RJ. Ensembl genomes: extending Ensembl across the taxonomic space. Nucleic Acids Res. 2010;38(suppl 1):D563–9. http://metazoa.ensembl.org. Accessed 16 Jun 2014
Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ. Basic local alignment search tool. J Mol Biol. 1990;215(3):403–10.
Simão FA, Waterhouse RM, Ioannidis P, Kriventseva EV, Zdobnov EM. BUSCO: assessing genome assembly and annotation completeness with single-copy orthologs. Bioinformatics. 2015;31(19):3210-2.
Stanke M, Morgenstern B. AUGUSTUS: a web server for gene prediction in eukaryotes that allows user-defined constraints. Nucleic Acids Res. 2005;33(suppl 2):W465–7.
Conesa A, Götz S, García-Gómez JM, Terol J, Talón M, Robles M. Blast2GO: a universal tool for annotation, visualization and analysis in functional genomics research. Bioinformatics. 2005;21(18):3674–6.
Ashburner M, Ball CA, Blake JA, Botstein D, Butler H, Cherry JM, Davis AP, Dolinski K, Dwight SS, Eppig JT, Harris MA. Gene ontology: tool for the unification of biology. Nat Genet. 2000;25(1):25–9.
Tatusov RL, Fedorova ND, Jackson JD, Jacobs AR, Kiryutin B, Koonin EV, Krylov DM, Mazumder R, Mekhedov SL, Nikolskaya AN, Rao BS. The COG database: an updated version includes eukaryotes. BMC Bioinforma. 2003;4(1):1.
Marchler-Bauer A, Lu S, Anderson JB, Chitsaz F, Derbyshire MK, DeWeese-Scott C, Fong JH, Geer LY, Geer RC, Gonzales NR, Gwadz M. CDD: a conserved domain database for the functional annotation of proteins. Nucleic Acids Res. 2011;39(suppl 1):D225–9.
Bland C, Ramsey TL, Sabree F, Lowe M, Brown K, Kyrpides NC, Hugenholtz P. CRISPR recognition tool (CRT): a tool for automatic detection of clustered regularly interspaced palindromic repeats. BMC Bioinforma. 2007;8(1):209.
Wilkinson TL, Douglas AE. Phloem amino acids and the host plant range of the polyphagous aphid, Aphis fabae. Entomol Exp Appl. 2003;106(2):103–13.
Novotná J, Havelka J, Starý P, Koutecký P, Vítková M. Karyotype analysis of the Russian wheat aphid, Diuraphis noxia (Kurdjumov)(Hemiptera: Aphididae) reveals a large X chromosome with rRNA and histone gene families. Genetica. 2011;139(3):281–9.
Telang A, Sandström J, Dyreson E, Moran NA. Feeding damage by Diuraphis noxia results in a nutritionally enhanced phloem diet. Entomol Exp Appl. 1999;91(3):403–12.
Basky Z. Biotypic and pest status differences between Hungarian and South African populations of Russian wheat aphid, Diuraphis noxia (Kurdjumov)(Homoptera: Aphididae). Pest Manag Sci. 2003;59(10):1152–8.
Zaayman D, Lapitan NL, Botha AM. Dissimilar molecular defense responses are elicited in Triticum aestivum after infestation by different Diuraphis noxia biotypes. Physiol Plant. 2009;136(2):209–22.
Lombaert E, Carletto J, Piotte C, Fauvergue X, Lecoq H, Vanlerberghe-Masutti F, Lapchin L. Response of the melon aphid, Aphis gossypii, to host-plant resistance: evidence for high adaptive potential despite low genetic variability. Entomol Exp Appl. 2009;133(1):46–56.
Lu H, Yang P, Xu Y, Luo L, Zhu J, Cui N, Kang L, Cui F. Performances of survival, feeding behavior, and gene expression in aphids reveal their different fitness to host alteration. Sci Rep. 2016;6:19344.
Botha AM, Swanevelder ZH, Lapitan NL. Transcript profiling of wheat genes expressed during feeding by two different biotypes of Diuraphis noxia. Environ Entomol. 2010;39(4):1206–31.
Miles PW. Aphid saliva. Biol Rev Camb Philos Soc. 1999;74(01):41–85.
Habibi J, Backus EA, Coudron TA, Brandt SL. Effect of different host substrates on hemipteran salivary protein profiles. Entomol Exp Appl. 2001;98(3):369–75.
Lapitan NL, Li YC, Peng J, Botha AM. Fractionated extracts of Russian wheat aphid eliciting defense responses in wheat. J Econ Entomol. 2007;100(3):990–9.
Puterka GJ, Black WC, Steiner WM, Burton RL. Genetic variation and phylogenetic relationships among worldwide collections of the Russian wheat aphid, Diuraphis noxia (Mordvilko), inferred from allozyme and RAPD-PCR markers. Heredity. 1993;70:604.
Swanevelder ZH, Surridge AK, Venter E, Botha AM. Limited endosymbiont variation in Diuraphis noxia (Hemiptera: Aphididae) biotypes from the United States and South Africa. J Econ Entomol. 2010;103(3):887–97.
Nielsen C, Scharff N, Eibye-Jacobsen D. Cladistic analyses of the animal kingdom. Biol J Linnean Soc. 1996;57:385–410.
Stys P, Zrzavy J. Phylogeny and classification of extant Arthropoda: review of hypotheses and nomenclature. Eur J Entomol. 1994;91:257–75.
Labandeira CC, Sepkoski JJ. Insect diversity in the fossil record. Science. 1993;261:310–5.
Dolling WR. The Hemiptera. London: Oxford University Press; 1991.
Heie O. Aphid ecology in the past and a new view on the evolution of Macrosiphini. In: Leather SR, Watt AD, Mills NJ, Walters KFA, editors. Individuals, populations and patterns in ecology. Andover: Intercept; 1994.
Aizenberg, 1935. Zap. Bolshev biol. Stan. Nos. 7–8: 157. Obtained from Nomenclator Zoologicus 7:94; http://ubio.org/NZ/search.php?search=aizenberg+&quickSearch=QuickSearch&selectall=Check+All&colname=on&colcategory=on&colauthority=on&colcomments=on&page=&vol. Accessed 25 June 2017.
Swofford DL. PAUP*. Phylogenetic Analysis Using Parsimony (*and Other Methods). Version 4. Sunderland: Sinauer Associates; 2002.
Katoh K, Standley DM. MAFFT multiple sequence alignment software version 7: improvements in performance and usability. Mol Biol Evol. 2013;30(4):772–80.
Sandström J, Pettersson J, Amino acid composition of phloem sap and the relation to intraspecific variation in pea aphid (Acyrthosiphon pisum) performance. J. Insect Physiol. 1994;40 (11):947–55.
The authors express their sincere gratitude to the Winter Cereal Trust, the National Research Foundation (NRF) and Technology and Human Resources for Industry Programme (THRIP) of South Africa for financial assistance. The research was funded through THRIP Grant: TP2010072900011; NRF Competitive Programme for Rated Researchers (CPRR) Grant: CPR20110615000019459 and NRF Incentive Funding for Rated Researchers Programme (IFR) Grant: IFR201004200013.
The authors declare that they have no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.