GENOME ANNOUNCEMENT
“Candidatus Hamiltonella,” a gammaproteobacterium, is a maternally transmitted endosymbiont found sporadically in some sap-feeding insects, such as aphids and whiteflies. In some aphid individuals, the facultative symbiont “
Candidatus Hamiltonella defensa” provides protection against parasitoid wasps by causing high mortality of wasp larvae (
5) or induces the expression of host heat shock genes to increase host heat tolerance (
6). In the whitefly
Bemisia tabaci complex, “
Candidatus Hamiltonella defensa” is found in only two important invasive cryptic species, Mediterranean and Middle East-Asia Minor 1 (formally referred to as the Q biotype and B biotype, respectively) (
1,
2,
4,
8). Unlike “
Candidatus Hamiltonella defensa” in aphids, the symbiont of
B. tabaci shares bacteriocytes with the obligate symbiont “
Candidatus Portiera aleyrodidarum” (
4,
7). A recent study reported that the GroEL protein produced by “
Candidatus Hamiltonella defensa” in Middle East-Asia Minor 1 cryptic species of
B. tabaci can facilitate tomato yellow leaf curl virus transmission (
3). However, as a whole, the functions of “
Candidatus Hamiltonella defensa” in
B. tabaci remain largely unknown. The genome sequence of “
Candidatus Hamiltonella defensa” could provide useful information to understand the roles of this endosymbiont in the invasive
B. tabaci.
Uncultured symbionts were isolated from the Mediterranean cryptic species of B. tabaci, and the whole genome of purified bacteria was amplified by multiple displacement amplification (MDA) using a Repli-g ultrafast minikit (Qiagen). Two paired-end libraries with average insert sizes of 200 bp and 2 kb were generated. Amplified DNA was sequenced using an Illumina HiSeq 2000 instrument at BGI-Shenzhen (Shenzhen, China). We obtained 240 Mb and 84 Mb of high-quality useful data from each library. The paired-end reads were de novo assembled using SOAPdenovo v1.05; and contigs were connected according to the 2-kb paired-end relationships. The coding sequences (CDSs) were predicted by Glimmer v3.0, and homologous comparison to the NCBI nonredundant database was performed by BLAST for function annotation. The rRNA and tRNA were identified using rRNAmmer and tRNAscan.
The results show that the draft genome of the B. tabaci symbiont “Candidatus Hamiltonella defensa” has approximately 1.84 Mbp with a G+C content of 40.3% and comprises 404 contigs (N50, 11.8 kb), which can be assembled into 372 scaffolds (N50, 14.2 kb).
A total of 1,806 genes and 38 tRNA genes are predicted within the genome of “Candidatus Hamiltonella defensa.” A total of 1,112 genes were mapped to pathways in the Kyoto Encyclopedia of Genes and Genomes, among which “Membrane Transport” (n = 301) has the highest percentage of genes, followed by “Replication and Repair” (n = 227), “Translation” (n = 136), “Carbohydrate Metabolism” (n = 128), “Metabolism of Cofactors and Vitamins” (n = 88), and “Amino Acid Metabolism” (n = 87). These annotations provide a valuable resource for investigation of the functions of “Candidatus Hamiltonella defensa” in the whitefly. Further genomic analyses and comparative studies will provide additional insight into this symbiont-host system.
Nucleotide sequence accession numbers.
This Whole Genome Shotgun project has been deposited at DDBJ/EMBL/GenBank under the accession number
AJLH00000000. The version described in this paper is the first version,
AJLH01000000.