Of the 3,657 genes predicted, 3,601 were protein-coding genes, and 56 RNAs. The majority of the protein-coding genes (81.8%) were assigned a putative function while the remaining ones were annotated as hypothetical proteins. The inhibitor Lenalidomide distribution of genes into COGs functional categories is presented in Table 4. Table 3 Genome Statistics Figure 3 Graphical map of the largest scaffold. From bottom to top: genes on forward strand (color by COG categories), genes on reverse strand (color by COG categories), RNA genes (tRNAs green, rRNAs red, other RNAs black), GC content, GC skew. Table 4 Number of genes associated with the general COG functional categories Insights into the genome The genome comprises a single extrachromosomal element (with not yet validated circularity), ‘thalar_Contig204.17′, 139.
9 kbp in size containing 130 protein-coding genes including a large RTX-toxin gene and a F0F1-type ATPase operon. It contains the typical replication modules. Its replication system is of the ABC-9 type with the compatibility group RepC-9 [48]. This type of repABC operon was found in two representatives of the genera Octadecabacter and Roseobacter, respectively, as well as in Dinoroseobacter shibae [48]. The presence of the replication-initiation gene DnaA (Thalar_03034) reveals the chromosomal origin of largest, 1.091 Mbp long scaffold ‘thalar_Contig148.14′, but it harbors neither a plasmid stability module nor a type IV secretion system.
Genome analysis of strain DSM 19593T revealed the presence of genes encoding proteins associated to carbon monoxide utilization (thalar_00241, thalar_00242, thalar_02265, thalar_03324, thalar_03325, thalar_03395, thalar_03397) as well as genes forming a putative operon, which are involved in the oxidation of sulfur (thalar_01786 to_01792) indicating the oxidation of sulfur to produce energy. Additional gene sequences of interest encode a homogentisate 1,2-dioxygenase (thalar_03573), several haloacid dehalogenase superfamily proteins (thalar_00489, thalar_00580, thalar_01120, thalar_01943, thalar_02401) and a 2-haloalkanoic acid dehalogenase type II (thalar_00287). The presence of such genes could indicate a respiratory degradation of recalcitrant compounds by strain DSM 19593T in its ecological niche. Further genes encoding a N-acyl-L-homoserine lactone synthetase (thalar_00160) and a response regulator (thalar_00161) associated to quorum sensing were observed [49-52].
Genome analysis of strain DSM 19593T also revealed the presence of genes encoding a bacteriophage associated genes (e.g., thalar_00003 to 00007). A gene AV-951 encoding a sensor of blue light using FAD (BLUF, thalar_02670) was also detected, indicating possible blue-light dependent signal transduction. Acknowledgements The authors gratefully acknowledge the assistance of Iljana Schr?der for technical assistance and Evelyne-Marie Brambilla for DNA extraction and quality control (both at the DSMZ).