Species within the genus
Propionibacterium were recently divided into four genera, i.e.,
Propionibacterium, Acidipropionibacterium,
Cutibacterium, and
Pseudopropionibacterium (
1). The name
Pseudopropionibacterium was taxonomically corrected to
Arachnia because it was a homotypic synonym (
2).
Arachnia propionica (
3) and
Arachnia rubra (
4) are the only two recognized species in the genus
Arachnia, and both are members of the human oral microbiome (
5). A 16S rRNA neighbor-joining tree for oral species within the family
Propionibacteriaceae with current taxonomy is shown in
Fig. 1. Both
Arachnia species are hosts for species of the phylum
Saccharibacteria (TM7), ultrasmall parasitic epibionts (
6–8). Several strains of
Saccharibacteria species HMT-488 and HMT-955 have been grown in coculture with both
A. propionica and
A. rubra (
8), and their genomes are listed under BioProject accession number
282954 (
9,
10).
To fully examine the interactions of
Saccharibacteria species with
Arachnia hosts, it would be useful to have a genetically tractable strain of
A. rubra and use it as a model host. Restriction modification (RM) systems are a major barrier to genetic transformation, and RM systems can be identified from the methylome obtained during single-molecule real-time (SMRT) genome sequencing (
11). Based on the methylome data, plasmid vectors can be modified to eliminate RM incompatibilities with the target species, using techniques such as construction of syngenic DNA (
12). The methylome reported here should facilitate efforts to make
Arachnia rubra strain DSM 100122
T genetically tractable.
Strain DSM 100122
T was acquired from the German Collection of Microorganisms and Cell Cultures (DSM). For DNA isolation, the strain was grown in a 50:50 mixture of Trypticase soy broth and brain heart infusion broth with 1% yeast extract. Genomic DNA was extracted using a MasterPure DNA isolation kit (Lucigen) with a modified protocol that included bead beating for cell lysis. SMRT sequencing was carried out on a Sequel instrument (Pacific Biosciences, Menlo Park, CA, USA) with v3 chemistry, following standard SMRTbell template preparation protocols for base modification detection. Genomic DNA samples (5 to 10 μg) were sheared to an average size of 15 kbp via g-TUBE (Covaris, Woburn, MA, USA), end repaired, and ligated to hairpin-barcoded adapters prior to sequencing. Finally, prior to sequencing, the SMRTbell library was purified and size selected using AMPure PB beads to remove <3-kbp templates. Sequencing reads were processed using the Pacific Biosciences SMRT Link pipeline v8 (
https://www.pacb.com/support/software-downloads) with Microbial Assembly under default parameters. A total of 145,877 subreads were obtained, covering 632,860,315 subread bases, with a mean read length of 4,329 bp and a read
N50 value of 4,635 bp. The mean depth of coverage across the genome was 185×. A single circular contig of 3,316,958-bp length was assembled. The genomic GC content was 64.2%. The genome was annotated with the NCBI Prokaryotic Genome Annotation Pipeline (PGAP). A total of 3,005 genes were identified, including 2,923 predicted protein-coding genes, 56 predicted RNAs, and 26 predicted pseudogenes. Three motifs were identified as methylated throughout the genome, i.e.,
CTGCAm6G (2,690 modified motifs),
ACGAm6BCT (2,130 modified motifs), and
GAAAm6TG (712 modified motifs). REBASE analysis assigned the type II methyltransferase M.Aru100122I as being responsible for the
CTGCAm6G motif modification, while the remaining modifications could not be assigned unambiguously to the remaining methyltransferase identified within the genome. Additionally, the genome harbors open reading frames (Aru100122McrBC) consistent with an active type IV restriction system, which should be taken into consideration during genetic engineering.