There are 27 complete genomes available within Rickettsiales.
These include, 4 Wolbachia, including wBm, 3 genomes from the genus Anaplasma, 5 Ehrlichia, 11 Rickettsia, 1 Neorickettsia, 2 Orientia, and 1 Pelagibacter (Table 3). Of these genomes, all but Pelagibacter are obligate endosymbionts residing either in vacuoles or within the host cell cytoplasm. Of the endosymbionts, all but Wolbachia replicate within vertebrate hosts with most transmitted via an invertebrate vector. Wolbachia, on the other hand infects a diverse XMU-MP-1 order spectrum of arthropod hosts as well as filarial nematodes, many of which are themselves vertebrate parasites . Table 3 Genomes available within the order Rickettsiales Genus species Strain Taxon ID Anaplasma marginale St Maries 234826 Anaplasma phagocytophilum HZ 212042 Anaplasma marginale Florida 320483 Candidatus Pelagibacter ubique HTCC1062 335992 Ehrlichia canis Jake 269484 Ehrlichia chaffeensis Arkansas 205920 Ehrlichia buy C59 wnt ruminantium Gardel 302409 Ehrlichia ruminantium Welgevonden UPSA 254945 Ehrlichia ruminantium Welgevonden CIRAD 254945 Orientia tsutsugamushi Boryong 357244 Orientia tsutsugamushi Ikeda 334380 Neorickettsia sennetsu Miyayama 222891 Rickettsia akari Hartford 293614 Rickettsia bellii OSU 85-389 391896 Rickettsia bellii RML369-C 336407 Rickettsia canadensis McKiel 293613 Rickettsia conorii Malish 7 272944 Rickettsia felis
URRWXCal2 315456 selleck chemicals llc Rickettsia massiliae MTU5 416276 Rickettsia Pyruvate dehydrogenase prowazekii Madrid E 272947 Rickettsia rickettsii Iowa 452659 Rickettsia rickettsii Sheila Smith 392021 Rickettsia typhi wilmington 257363 Wolbachia Drosophila
melanogaster 163164 Wolbachia Drosophila simulans 66084 Wolbachia Culex quinquefasciatus 570417 Wolbachia Brugia malayi TRS 292805 Refseq protein sequences from the 27 available genomes (as of April 1, 2009) were retrieved from NCBI. The OrthoMCL package was used to predict clusters of orthologs among the genomes . To gauge the extent of taxonomic diversity within each orthologous gene cluster, we initially tallied the number of taxa represented in the cluster. However, this measure inflated the phylogenetic diversity for groups containing multiple highly related taxa. To compensate, a minimum spanning tree (MST) was constructed using distances derived from aligned 16S rRNA gene sequences as edge weights between taxonomic nodes. A score for the MST was calculated by summing the distances between the connected taxonomic nodes. The MST was used to minimize the contributions from closely related taxa, while reflecting the overall taxonomic diversity. The MST distances for each cluster were incorporated into a metric we termed the gene conservation score (GCS), which represents both the extent of gene conservation across species, as well as the quality of that conservation.