3  Explore composition

3.1 Pre-processing

The trans() function contains many normalization methods, suitable for pre-processing of different omics, some refer to vegan::decostand().

Table 3.1: Normalization methods used in omics
Method Description
cpm Counts per million
minmax linear transfer to (min, max)
acpm Counts per million, then asinh transfer
log1 log(n+1) transformat
total divide by total
max divide by maximum
frequency divide by total and multiply by the number of non-zero items, so that the average of non-zero entries is one.
normalize make margin sum of squares equal to one
range standardize values into range (0,1) (same to minmax(0,1)). If all values are constant, they will be transformed to 0.
rank rank replaces abundance values by their increasing ranks leaving zeros unchanged.
rrank rrank is similar but uses relative ranks with maximum 1.
pa scale x to presence/absence scale (0/1).
standardize scale x to zero mean and unit variance.
hellinger square root of method = "total"
log logarithmic transformation as suggested by Anderson et al. (2006): logb(x)+1 for x>0, where b is the base of the logarithm; zeros are left as zeros.
alr Additive log ratio ('alr') transformation (Aitchison 1986) reduces data skewness and compositionality bias.
clr centered log ratio ('clr') transformation proposed by Aitchison (1986) reduces data skewness and compositionality bias.
rclr robust clr ('rclr') is similar to regular clr (see above) but allows data that contains zeroes.
trans(otutab, method = "log1") %>% head()
NS1 NS2 NS3 NS4 NS5 NS6 WS1 WS2 WS3 WS4 WS5 WS6 CS1 CS2 CS3 CS4 CS5 CS6
s__un_f__Thermomonosporaceae 6.996682 7.560601 6.698268 7.211557 6.970730 6.976348 7.133296 7.376508 7.193686 6.848005 7.118016 6.919684 7.746733 7.831617 7.444249 7.588830 7.266827 7.331715
s__Pelomonas_puraquae 7.582229 7.118826 7.767687 7.712891 7.973844 7.512071 6.469250 6.206576 7.115582 7.158514 6.860664 6.455199 7.174724 7.324490 6.739337 7.029088 7.302496 7.069023
s__Rhizobacter_bergeniae 6.378426 6.129050 6.791221 6.804614 7.112327 6.749931 6.405228 6.154858 6.976348 6.936343 6.741701 6.508769 6.937314 7.497207 6.910751 7.090910 7.085902 6.637258
s__Flavobacterium_terrae 5.501258 5.459586 7.501634 6.513230 7.276556 6.198479 5.765191 7.563720 7.309212 6.903747 6.359574 5.886104 6.985642 7.105786 6.626718 6.049734 6.940222 7.253470
s__un_g__Rhizobacter 7.267525 6.023448 6.280396 6.633318 7.162397 6.228511 6.222576 6.381816 6.100319 6.431331 6.489205 6.063785 7.032624 7.277939 6.311735 6.369901 7.008505 6.806829
s__un_o__Burkholderiales 6.787845 6.527958 6.715383 6.816736 7.315218 6.937314 5.463832 5.533390 5.886104 5.945421 5.961005 5.863631 6.313548 6.293419 6.169611 6.327937 6.242223 6.208590

rm_low(), guolv() and hebing() functions can help filter or aggregate the omics data.

3.2 Stackplot

# sum the abundance to Phylum level
hebing(otutab, taxonomy$Phylum, margin = 1, act = "sum") -> phylum
stackplot(phylum, metadata, group = "Group", topN = 10) +
  scale_fill_manual(values = get_cols(10))
stackplot(phylum, metadata,
  group = "Group", style = "sample",
  group_order = TRUE, flow = TRUE, relative = FALSE
) +
  scale_fill_manual(values = get_cols(10))
Figure 3.1: Stack plot
Figure 3.2: Stack flow plot with sample style

The stackplot function offers a wide range of parameters to facilitate the adjustment of the final graphical output. For further details and examples on drawing bar plots and their transformations in R, please refer to R Draw the bar plot and its transformation.

3.3 Rarefaction

Rarefaction is a method used to assess the relationship between sample community data sampling effort and species richness within the sample. Commonly used in ecology, it evaluates whether the observed species diversity saturates given a certain number of samples or sequences, and whether additional sampling is needed to fully capture species richness within the sample.

For sample data, Rarefaction curves display the number of observed species at different levels of sampling effort (or sequence numbers). By plotting Rarefaction curves, one can assess how species diversity changes with varying sampling depths and determine if saturation has been reached.

For species data, Rarefaction curves display the number of observed samples at different levels of sampling effort (or sequence numbers). Through this method, one can evaluate how sample numbers change with different sampling depths and determine if additional samples are necessary to fully capture species richness.

a <- rare_curve_sample(otutab)
plot(a)

a <- rare_curve_species(otutab, mode = 1)
plot(a)
Figure 3.3: Rarefaction curves of sample data
Figure 3.4: Rarefaction curves of species data

3.4 Phylogenetic tree

ann_tree(taxonomy, otutab) -> tree
easy_tree(tree, add_abundance = FALSE)
Figure 3.5: Phylogenetic tree
easy_tree(tree, add_tiplab = FALSE) -> p
some_tax <- table(taxonomy$Phylum) %>%
  sort(decreasing = TRUE) %>%
  head(5) %>%
  names()
add_strip(p, some_tax)
Figure 3.6: Phylogenetic tree with some strips

You can refer to Basic phylogenetic tree and Advanced phylogenetic tree to plot the phylogenetic tree using ggtree.

In addition, you can also choose to use online platforms like iPhylo and iTOL for interactive plotting.

iPhylo is a free online platform for generating, annotating, and visualizing phylogenetic trees. It allows you to plot various tree diagrams for species, compounds, and other hierarchical structures and conveniently add complex annotation information.

iPhylo Website: https://iphylo.net

For more information, you can check out our Online Guide Book.

sangji_plot(tree, width = 900, height = 700)
k__Bacteria → p__Proteobacteria 11.1k k__Bacteria → p__Actinobacteria 6.89k p__Actinobacteria → c__Actinobacteria 6.88k c__Actinobacteria → o__Actinomycetales 6.63k p__Proteobacteria → c__Betaproteobacteria 4.42k c__Betaproteobacteria → o__Burkholderiales 4.32k p__Proteobacteria → c__Alphaproteobacteria 3.26k p__Proteobacteria → c__Gammaproteobacteria 2.86k o__Burkholderiales → f__Comamonadaceae 2.20k c__Alphaproteobacteria → o__Rhizobiales 2.19k c__Gammaproteobacteria → o__Pseudomonadales 1.96k o__Pseudomonadales → f__Pseudomonadaceae 1.96k o__Actinomycetales → f__Thermomonosporaceae 1.73k o__Actinomycetales → f__Pseudonocardiaceae 1.69k f__Pseudomonadaceae → g__Rhizobacter 1.69k g__un_f__Thermomonosporaceae → s__un_f__Thermomonosporaceae 1.45k f__Thermomonosporaceae → g__un_f__Thermomonosporaceae 1.45k g__Pelomonas → s__Pelomonas_puraquae 1.40k f__Comamonadaceae → g__Pelomonas 1.40k k__Bacteria → p__Bacteroidetes 1.38k p__Bacteroidetes → c__Flavobacteriia 1.13k c__Flavobacteriia → o__Flavobacteriales 1.13k o__Flavobacteriales → f__Flavobacteriaceae 1.09k f__Flavobacteriaceae → g__Flavobacterium 1.06k o__Actinomycetales → g__Streptomyces 984 g__Rhizobacter → s__Rhizobacter_bergeniae 922 g__Flavobacterium → s__Flavobacterium_terrae 916 g__Rhizobacter → s__un_g__Rhizobacter 772 k__Bacteria → p__Firmicutes 578 k__Bacteria → p__Chloroflexi 301 s__un_f__Thermomonosporaceae 1.45k s__un_f__Thermomonosporaceae1.45ks__Pelomonas_puraquae 1.40k s__Pelomonas_puraquae1.40ks__Rhizobacter_bergeniae 922 s__Rhizobacter_bergeniae922s__Flavobacterium_terrae 916 s__Flavobacterium_terrae916s__un_g__Rhizobacter 772 s__un_g__Rhizobacter772k__Bacteria 20.5k k__Bacteria20.5kp__Actinobacteria 6.89k p__Actinobacteria6.89kp__Proteobacteria 11.1k p__Proteobacteria11.1kp__Bacteroidetes 1.38k p__Bacteroidetes1.38kp__Firmicutes 578 p__Firmicutes578p__Chloroflexi 301 p__Chloroflexi301c__Actinobacteria 6.88k c__Actinobacteria6.88kc__Betaproteobacteria 4.42k c__Betaproteobacteria4.42kc__Gammaproteobacteria 2.86k c__Gammaproteobacteria2.86kc__Flavobacteriia 1.13k c__Flavobacteriia1.13kc__Alphaproteobacteria 3.26k c__Alphaproteobacteria3.26ko__Actinomycetales 6.63k o__Actinomycetales6.63ko__Burkholderiales 4.32k o__Burkholderiales4.32ko__Pseudomonadales 1.96k o__Pseudomonadales1.96ko__Flavobacteriales 1.13k o__Flavobacteriales1.13ko__Rhizobiales 2.19k o__Rhizobiales2.19kf__Thermomonosporaceae 1.73k f__Thermomonosporaceae1.73kf__Comamonadaceae 2.20k f__Comamonadaceae2.20kf__Pseudomonadaceae 1.96k f__Pseudomonadaceae1.96kf__Flavobacteriaceae 1.09k f__Flavobacteriaceae1.09kf__Pseudonocardiaceae 1.69k f__Pseudonocardiaceae1.69kg__un_f__Thermomonosporaceae 1.45k g__un_f__Thermomonosporaceae1.45kg__Pelomonas 1.40k g__Pelomonas1.40kg__Rhizobacter 1.69k g__Rhizobacter1.69kg__Flavobacterium 1.06k g__Flavobacterium1.06kg__Streptomyces 984 g__Streptomyces984KingdomPhylumClassOrderFamilyGenusSpecies
Figure 3.7: Sankey plot of taxanomy
sunburst(tree)
r__rootk__Bacteriap__Proteobacteriap__Actinobacteriap__Bacteroidetesp__Firmicutesp__Chloroflexip__Acidobacteriap__Verrucomicrobiap__Planctomycetesp__Spirochaetesp__Chlamydiaec__Betaproteobacteriac__Alphaproteobacteriac__Gammaproteobacteriac__Deltaproteobacteriac__un_p__Proteobacteriac__Oligoflexiac__Actinobacteriac__un_p__Actinobacteriac__Flavobacteriiac__Cytophagiac__Sphingobacteriiac__un_p__Bacteroidetesc__Bacteroidiac__Bacillic__Clostridiac__Erysipelotrichiac__Chloroflexiac__un_p__Chloroflexic__Caldilineaec__Acidobacteria_Gp4c__Acidobacteria_Gp10c__Acidobacteria_Gp3c__Acidobacteria_Gp6c__Acidobacteria_Gp17c__un_p__Acidobacteriac__Acidobacteria_Gp11c__Acidobacteria_Gp25c__Acidobacteria_Gp5c__Acidobacteria_Gp7c__Verrucomicrobiaec__un_p__Verrucomicrobiac__Spartobacteriac__Subdivision3c__Opitutaec__Planctomycetiac__un_p__Planctomycetesc__Spirochaetiac__Chlamydiiao__Burkholderialeso__Methylophilaleso__un_c__Betaproteobacteriao__Rhodocyclaleso__Nitrosomonadaleso__Rhizobialeso__Sphingomonadaleso__Caulobacteraleso__Rhodospirillaleso__un_c__Alphaproteobacteriao__Alphaproteobacteria_incertae_sediso__Sneathiellaleso__Rhodobacteraleso__Pseudomonadaleso__Xanthomonadaleso__un_c__Gammaproteobacteriao__Enterobacterialeso__Chromatialeso__Myxococcaleso__un_c__Deltaproteobacteriao__Bdellovibrionaleso__Desulfuromonadaleso__un_p__Proteobacteriao__Oligoflexaleso__Actinomycetaleso__Acidimicrobialeso__Solirubrobacteraleso__un_c__Actinobacteriao__Gaiellaleso__Rubrobacteraleso__un_p__Actinobacteriao__Flavobacterialeso__Cytophagaleso__Sphingobacterialeso__un_p__Bacteroideteso__Bacteroidaleso__Bacillaleso__Clostridialeso__un_c__Clostridiao__Erysipelotrichaleso__Herpetosiphonaleso__Chloroflexaleso__un_c__Chloroflexiao__un_p__Chloroflexio__Caldilinealeso__un_c__Acidobacteria_Gp4o__un_c__Acidobacteria_Gp10o__un_c__Acidobacteria_Gp3o__un_c__Acidobacteria_Gp6o__un_c__Acidobacteria_Gp17o__un_p__Acidobacteriao__un_c__Acidobacteria_Gp11o__un_c__Acidobacteria_Gp25o__un_c__Acidobacteria_Gp5o__un_c__Acidobacteria_Gp7o__Verrucomicrobialeso__un_p__Verrucomicrobiao__un_c__Spartobacteriao__un_c__Subdivision3o__Opitutaleso__Planctomycetaleso__un_p__Planctomyceteso__Spirochaetaleso__Chlamydialesf__Comamonadaceaef__Oxalobacteraceaef__un_o__Burkholderialesf__Burkholderiales_incertae_sedisf__Burkholderiaceaef__Alcaligenaceaef__Methylophilaceaef__un_c__Betaproteobacteriaf__Rhodocyclaceaef__Nitrosomonadaceaef__Rhizobiaceaef__Bradyrhizobiaceaef__Hyphomicrobiaceaef__un_o__Rhizobialesf__Phyllobacteriaceaef__Rhodobiaceaef__Methylobacteriaceaef__Rhizobiales_incertae_sedisf__Xanthobacteraceaef__Beijerinckiaceaef__Brucellaceaef__Sphingomonadaceaef__Erythrobacteraceaef__un_o__Sphingomonadalesf__Caulobacteraceaef__Hyphomonadaceaef__Rhodospirillaceaef__un_o__Rhodospirillalesf__un_c__Alphaproteobacteriaf__un_o__Alphaproteobacteria_incertae_sedisf__Sneathiellaceaef__Rhodobacteraceaef__Pseudomonadaceaef__Xanthomonadaceaef__Sinobacteraceaef__un_o__Xanthomonadalesf__un_c__Gammaproteobacteriaf__Enterobacteriaceaef__Ectothiorhodospiraceaef__Sandaracinaceaef__Polyangiaceaef__Cystobacteraceaef__un_o__Myxococcalesf__Haliangiaceaef__Myxococcaceaef__Nannocystaceaef__Labilitrichaceaef__un_c__Deltaproteobacteriaf__Bacteriovoracaceaef__Bdellovibrionaceaef__Geobacteraceaef__un_o__Desulfuromonadalesf__un_p__Proteobacteriaf__Oligoflexaceaef__Thermomonosporaceaef__Pseudonocardiaceaef__Streptomycetaceaef__Micromonosporaceaef__Nocardioidaceaef__Kineosporiaceaef__Microbacteriaceaef__un_o__Actinomycetalesf__Streptosporangiaceaef__Promicromonosporaceaef__Nocardiaceaef__Mycobacteriaceaef__Geodermatophilaceaef__Micrococcaceaef__Cellulomonadaceaef__Intrasporangiaceaef__Glycomycetaceaef__Cryptosporangiaceaef__Propionibacteriaceaef__Sporichthyaceaef__Acidimicrobiaceaef__Iamiaceaef__un_o__Acidimicrobialesf__Acidimicrobineae_incertae_sedisf__Solirubrobacteraceaef__Conexibacteraceaef__un_o__Solirubrobacteralesf__un_c__Actinobacteriaf__Gaiellaceaef__Rubrobacteraceaef__un_p__Actinobacteriaf__Flavobacteriaceaef__Cryomorphaceaef__un_o__Flavobacterialesf__un_o__Cytophagalesf__Flammeovirgaceaef__Chitinophagaceaef__Sphingobacteriaceaef__un_o__Sphingobacterialesf__un_p__Bacteroidetesf__Prolixibacteraceaef__Bacillaceae_1f__Paenibacillaceae_1f__Planococcaceaef__Bacillaceae_2f__Thermoactinomycetaceae_2f__Paenibacillaceae_2f__Alicyclobacillaceaef__un_o__Bacillalesf__Clostridiaceae_1f__Peptostreptococcaceaef__Lachnospiraceaef__Ruminococcaceaef__Gracilibacteraceaef__un_c__Clostridiaf__Erysipelotrichaceaef__Herpetosiphonaceaef__Chloroflexaceaef__Oscillochloridaceaef__un_o__Chloroflexalesf__un_c__Chloroflexiaf__un_p__Chloroflexif__Caldilineaceaef__un_c__Acidobacteria_Gp4f__un_c__Acidobacteria_Gp10f__un_c__Acidobacteria_Gp3f__un_c__Acidobacteria_Gp6f__un_c__Acidobacteria_Gp17f__un_p__Acidobacteriaf__un_c__Acidobacteria_Gp11f__un_c__Acidobacteria_Gp25f__un_c__Acidobacteria_Gp5f__un_c__Acidobacteria_Gp7f__Verrucomicrobiaceaef__un_p__Verrucomicrobiaf__un_c__Spartobacteriaf__un_c__Subdivision3f__Opitutaceaef__Planctomycetaceaef__un_p__Planctomycetesf__Leptospiraceaef__Spirochaetaceaef__Chlamydiaceaef__Parachlamydiaceaef__un_o__Chlamydialesg__Pelomonasg__un_f__Comamonadaceaeg__Kinneretiag__Variovoraxg__Hydrogenophagag__Acidovoraxg__Ramlibacterg__Roseatelesg__Xenophilusg__Caenimonasg__Pseudorhodoferaxg__Schlegelellag__Noviherbaspirillumg__un_f__Oxalobacteraceaeg__Massiliag__Pseudoduganellag__Duganellag__Paraherbaspirillumg__Herbaspirillumg__un_o__Burkholderialesg__un_f__Burkholderiales_incertae_sedisg__Piscinibacterg__Aquabacteriumg__Inhellag__Rubrivivaxg__Burkholderiag__Cupriavidusg__Ralstoniag__un_f__Burkholderiaceaeg__Achromobacterg__Azohydromonasg__un_f__Methylophilaceaeg__Methylotenerag__Methylovorusg__Methylophilusg__un_c__Betaproteobacteriag__un_f__Rhodocyclaceaeg__Uliginosibacteriumg__Methyloversatilisg__Nitrosospirag__Rhizobiumg__Ensiferg__Shinellag__un_f__Rhizobiaceaeg__Bradyrhizobiumg__Boseag__un_f__Bradyrhizobiaceaeg__Rhodopseudomonasg__Devosiag__Hyphomicrobiumg__Pedomicrobiumg__un_f__Hyphomicrobiaceaeg__un_o__Rhizobialesg__Mesorhizobiumg__Phyllobacteriumg__un_f__Phyllobacteriaceaeg__un_f__Rhodobiaceaeg__Methyloceanibacterg__Microvirgag__Bauldiag__Vasilyevaeag__Phreatobacterg__un_f__Xanthobacteraceaeg__Starkeyag__un_f__Beijerinckiaceaeg__Ochrobactrumg__Sphingomonasg__Sphingopyxisg__un_f__Sphingomonadaceaeg__Sphingobiumg__Sphingosinicellag__Novosphingobiumg__Sphingorhabdusg__Altererythrobacterg__un_f__Erythrobacteraceaeg__un_o__Sphingomonadalesg__Phenylobacteriumg__Caulobacterg__Brevundimonasg__Asticcacaulisg__un_f__Caulobacteraceaeg__Hyphomonasg__un_f__Hyphomonadaceaeg__Dongiag__un_f__Rhodospirillaceaeg__Inquilinusg__Pelagibiusg__Reyranellag__un_o__Rhodospirillalesg__un_c__Alphaproteobacteriag__Rhizomicrobiumg__Ferrovibriog__un_f__Sneathiellaceaeg__Amaricoccusg__Rhizobacterg__Pseudomonasg__Cellvibriog__Permianibacterg__Lysobacterg__Pseudoxanthomonasg__un_f__Xanthomonadaceaeg__Arenimonasg__Rhodanobacterg__Dokdonellag__Rudaeag__Stenotrophomonasg__Steroidobacterg__un_f__Sinobacteraceaeg__Povalibacterg__un_o__Xanthomonadalesg__un_c__Gammaproteobacteriag__un_f__Enterobacteriaceaeg__Pantoeag__Cedeceag__Enterobacterg__Thioalkalivibriog__Sandaracinusg__Sorangiumg__un_f__Polyangiaceaeg__Polyangiumg__Byssovoraxg__Cystobacterg__un_f__Cystobacteraceaeg__Archangiumg__un_o__Myxococcalesg__Haliangiumg__un_f__Myxococcaceaeg__Aggregicoccusg__Nannocystisg__un_f__Nannocystaceaeg__Labilithrixg__un_c__Deltaproteobacteriag__Peredibacterg__un_f__Bacteriovoracaceaeg__Bacteriovoraxg__Bdellovibriog__Geobacterg__un_o__Desulfuromonadalesg__un_p__Proteobacteriag__Oligoflexusg__un_f__Thermomonosporaceaeg__Actinocoralliag__Actinomadurag__Lentzeag__Amycolatopsisg__Actinophytocolag__Pseudonocardiag__Kibdelosporangiumg__un_f__Pseudonocardiaceaeg__Saccharothrixg__Lechevalieriag__Umezawaeag__Streptomycesg__Actinoplanesg__un_f__Micromonosporaceaeg__Verrucosisporag__Dactylosporangiumg__Polymorphosporag__Micromonosporag__Rhizocolag__Couchioplanesg__Phytohabitansg__Catellatosporag__Plantactinosporag__Catelliglobosisporag__Planosporangiumg__Nocardioidesg__Kribbellag__Aeromicrobiumg__Marmoricolag__un_f__Nocardioidaceaeg__Kineosporiag__un_f__Kineosporiaceaeg__Angustibacterg__Agromycesg__Microbacteriumg__Cryobacteriumg__Yonghaparkiag__un_f__Microbacteriaceaeg__un_o__Actinomycetalesg__un_f__Streptosporangiaceaeg__Nonomuraeag__Promicromonosporag__Cellulosimicrobiumg__Nocardiag__Gordoniag__Mycobacteriumg__Blastococcusg__Geodermatophilusg__Arthrobacterg__Cellulomonasg__Intrasporangiumg__un_f__Intrasporangiaceaeg__Phycicoccusg__Glycomycesg__Cryptosporangiumg__un_f__Propionibacteriaceaeg__Sporichthyag__Ilumatobacterg__Iamiag__un_f__Iamiaceaeg__Aquihabitansg__un_o__Acidimicrobialesg__Aciditerrimonasg__Solirubrobacterg__Conexibacterg__un_o__Solirubrobacteralesg__un_c__Actinobacteriag__Gaiellag__Rubrobacterg__un_p__Actinobacteriag__Flavobacteriumg__Chryseobacteriumg__un_f__Flavobacteriaceaeg__un_f__Cryomorphaceaeg__un_o__Flavobacterialesg__Ohtaekwangiag__un_o__Cytophagalesg__Chryseolineag__un_f__Flammeovirgaceaeg__Niastellag__Terrimonasg__Chitinophagag__un_f__Chitinophagaceaeg__Lacibacterg__un_f__Sphingobacteriaceaeg__un_o__Sphingobacterialesg__un_p__Bacteroidetesg__Mangrovibacteriumg__Bacillusg__un_f__Bacillaceae_1g__Fictibacillusg__Paenibacillusg__Brevibacillusg__Cohnellag__un_f__Paenibacillaceae_1g__Ammoniibacillusg__Paenisporosarcinag__un_f__Planococcaceaeg__Lysinibacillusg__Terribacillusg__un_f__Bacillaceae_2g__Gracilibacillusg__Planifilumg__un_f__Paenibacillaceae_2g__Tumebacillusg__un_o__Bacillalesg__Clostridium_sensu_strictog__Sporacetigeniumg__Romboutsiag__Clostridium_XIg__un_f__Lachnospiraceaeg__Clostridium_IIIg__un_f__Gracilibacteraceaeg__un_c__Clostridiag__Turicibacterg__Herpetosiphong__Roseiflexusg__un_f__Chloroflexaceaeg__Oscillochlorisg__un_o__Chloroflexalesg__un_c__Chloroflexiag__un_p__Chloroflexig__un_f__Caldilineaceaeg__un_c__Acidobacteria_Gp4g__Blastocatellag__Aridibacterg__Gp10g__Gp3g__Gp6g__Gp17g__un_p__Acidobacteriag__Gp11g__Gp25g__Gp5g__Gp7g__Luteolibacterg__Roseimicrobiumg__un_f__Verrucomicrobiaceaeg__un_p__Verrucomicrobiag__Spartobacteria_genera_incertae_sedisg__Subdivision3_genera_incertae_sedisg__un_c__Subdivision3g__Opitutusg__Blastopirellulag__un_f__Planctomycetaceaeg__Pirellulag__un_p__Planctomycetesg__Turneriellag__un_f__Leptospiraceaeg__Leptospirag__un_f__Spirochaetaceaeg__un_f__Chlamydiaceaeg__un_f__Parachlamydiaceaeg__Parachlamydiag__un_o__Chlamydialess__Pelomonas_puraquaes__un_f__Comamonadaceaes__Kinneretia_asaccharophilas__un_g__Variovoraxs__Variovorax_boronicumulanss__Variovorax_solis__un_g__Hydrogenophagas__Acidovorax_solis__un_g__Acidovoraxs__un_g__Ramlibacters__Ramlibacter_henchirensiss__un_g__Roseateless__Xenophilus_aerolatuss__Caenimonas_terraes__un_g__Pseudorhodoferaxs__Schlegelella_aquaticas__un_g__Noviherbaspirillums__Noviherbaspirillum_suwonenses__un_f__Oxalobacteraceaes__un_g__Massilias__Massilia_aerilatas__Massilia_namucuonensiss__Pseudoduganella_violaceinigras__Duganella_radiciss__Duganella_phyllosphaeraes__Paraherbaspirillum_solis__Herbaspirillum_aquaticums__un_o__Burkholderialess__un_f__Burkholderiales_incertae_sediss__Piscinibacter_aquaticuss__un_g__Aquabacteriums__Aquabacterium_communes__Aquabacterium_parvums__Inhella_inkyongensiss__Rubrivivax_gelatinosuss__un_g__Burkholderias__Burkholderia_heleias__Cupriavidus_necators__un_g__Cupriaviduss__un_g__Ralstonias__un_f__Burkholderiaceaes__un_g__Achromobacters__Azohydromonas_australicas__un_f__Methylophilaceaes__un_g__Methyloteneras__un_g__Methylovoruss__un_g__Methylophiluss__un_c__Betaproteobacterias__un_f__Rhodocyclaceaes__Uliginosibacterium_gangwonenses__Methyloversatilis_universaliss__Nitrosospira_multiformiss__un_g__Rhizobiums__Rhizobium_alveis__Rhizobium_subbaraoniss__Rhizobium_laguerreaes__Rhizobium_rosettiformanss__Rhizobium_vignaes__Rhizobium_nepotums__Rhizobium_rhizoryzaes__un_g__Ensifers__Ensifer_sahelis__Shinella_kummerowiaes__Shinella_zoogloeoidess__un_g__Shinellas__un_f__Rhizobiaceaes__Bradyrhizobium_neotropicales__un_g__Bradyrhizobiums__Bradyrhizobium_daqingenses__un_g__Boseas__Bosea_eneaes__Bosea_lathyris__Bosea_massiliensiss__un_f__Bradyrhizobiaceaes__Rhodopseudomonas_rhenobacensiss__un_g__Devosias__Devosia_insulaes__Devosia_chinhatensiss__Devosia_yakushimensiss__Devosia_pacificas__Devosia_riboflavinas__Hyphomicrobium_vulgares__un_g__Hyphomicrobiums__Hyphomicrobium_sulfonivoranss__Hyphomicrobium_zavarziniis__Pedomicrobium_manganicums__un_g__Pedomicrobiums__un_f__Hyphomicrobiaceaes__un_o__Rhizobialess__un_g__Mesorhizobiums__Mesorhizobium_gobienses__Mesorhizobium_silamurunenses__Phyllobacterium_bourgognenses__Phyllobacterium_myrsinacearums__un_f__Phyllobacteriaceaes__un_f__Rhodobiaceaes__Methyloceanibacter_caenitepidis__Microvirga_guangxiensiss__un_g__Microvirgas__Microvirga_lupinis__Microvirga_aerilatas__Bauldia_consociatas__un_g__Bauldias__un_g__Vasilyevaeas__Phreatobacter_oligotrophuss__un_f__Xanthobacteraceaes__Starkeya_koreensiss__un_f__Beijerinckiaceaes__un_g__Ochrobactrums__un_g__Sphingomonass__Sphingomonas_solis__Sphingomonas_daechungensiss__Sphingopyxis_solis__un_g__Sphingopyxiss__un_f__Sphingomonadaceaes__un_g__Sphingobiums__Sphingosinicella_vermicompostis__un_g__Novosphingobiums__Sphingorhabdus_planktonicas__un_g__Altererythrobacters__Altererythrobacter_troitsensiss__Altererythrobacter_xinjiangensiss__un_f__Erythrobacteraceaes__un_o__Sphingomonadaless__Phenylobacterium_compostis__un_g__Phenylobacteriums__Phenylobacterium_koreenses__Caulobacter_henriciis__Caulobacter_fusiformiss__un_g__Caulobacters__Brevundimonas_basaltiss__Brevundimonas_kwangchunensiss__Brevundimonas_aveniformiss__Brevundimonas_faecaliss__un_g__Brevundimonass__Asticcacaulis_solisilvaes__un_f__Caulobacteraceaes__un_g__Hyphomonass__un_f__Hyphomonadaceaes__Dongia_mobiliss__un_f__Rhodospirillaceaes__Inquilinus_ginsengisolis__Pelagibius_litoraliss__Reyranella_solis__Reyranella_massiliensiss__Reyranella_graminifoliis__un_o__Rhodospirillaless__un_c__Alphaproteobacterias__un_g__Rhizomicrobiums__Rhizomicrobium_electricums__Ferrovibrio_denitrificanss__un_f__Sneathiellaceaes__Amaricoccus_tamworthensiss__un_g__Amaricoccuss__Rhizobacter_bergeniaes__un_g__Rhizobacters__Pseudomonas_chlororaphis_subsp._pisciums__un_g__Pseudomonass__Pseudomonas_solis__Cellvibrio_mixtus_subsp._mixtuss__Cellvibrio_ostraviensiss__un_g__Cellvibrios__Permianibacter_aggreganss__Lysobacter_brunescenss__Lysobacter_panaciterraes__Lysobacter_terraes__un_g__Lysobacters__Lysobacter_enzymogeness__Lysobacter_panacisolis__Lysobacter_dokdonensiss__Lysobacter_oryzaes__Pseudoxanthomonas_geis__Pseudoxanthomonas_wuyuanensiss__un_g__Pseudoxanthomonass__Pseudoxanthomonas_indicas__un_f__Xanthomonadaceaes__un_g__Arenimonass__Arenimonas_daejeonensiss__Arenimonas_oryziterraes__Arenimonas_metallis__Arenimonas_donghaensiss__Rhodanobacter_denitrificanss__un_g__Dokdonellas__Rudaea_cellulosilyticas__un_g__Stenotrophomonass__Steroidobacter_denitrificanss__Steroidobacter_agariperforanss__un_g__Steroidobacters__un_f__Sinobacteraceaes__Povalibacter_uvarums__un_o__Xanthomonadaless__un_c__Gammaproteobacterias__un_f__Enterobacteriaceaes__un_g__Pantoeas__un_g__Cedeceas__un_g__Enterobacters__Thioalkalivibrio_sulfidiphiluss__Sandaracinus_amylolyticuss__Sorangium_cellulosums__un_f__Polyangiaceaes__Polyangium_fumosums__Byssovorax_cruentas__un_g__Cystobacters__Cystobacter_violaceuss__Cystobacter_graciliss__un_f__Cystobacteraceaes__Archangium_gephyras__un_o__Myxococcaless__un_g__Haliangiums__Haliangium_tepidums__un_f__Myxococcaceaes__Aggregicoccus_edonensiss__Nannocystis_exedenss__un_f__Nannocystaceaes__Labilithrix_luteolas__un_c__Deltaproteobacterias__Peredibacter_starriis__un_f__Bacteriovoracaceaes__Bacteriovorax_stolpiis__Bdellovibrio_bacteriovoruss__un_g__Bdellovibrios__Bdellovibrio_exovoruss__un_g__Geobacters__un_o__Desulfuromonadaless__un_p__Proteobacterias__Oligoflexus_tunisiensiss__un_f__Thermomonosporaceaes__Actinocorallia_herbidas__un_g__Actinomaduras__Lentzea_flaviverrucosas__un_g__Lentzeas__Lentzea_jiangxiensiss__Lentzea_kentuckyensiss__un_g__Amycolatopsiss__Amycolatopsis_keratiniphila_subsp._keratiniphilas__Actinophytocola_burenkhanensiss__un_g__Actinophytocolas__Actinophytocola_timorensiss__un_g__Pseudonocardias__Pseudonocardia_adelaidensiss__Pseudonocardia_xinjiangensiss__Pseudonocardia_seranimatas__Pseudonocardia_kunmingensiss__Pseudonocardia_parietiss__Kibdelosporangium_phytohabitanss__un_f__Pseudonocardiaceaes__Saccharothrix_longisporas__un_g__Saccharothrixs__un_g__Lechevalierias__Lechevalieria_xinjiangensiss__Umezawaea_tangerinas__un_g__Streptomycess__Streptomyces_ederensiss__Streptomyces_acidiscabiess__Streptomyces_bangladeshensiss__Streptomyces_bulliis__Streptomyces_chrysomallus_subsp._chrysomalluss__Streptomyces_seranimatuss__Streptomyces_fimbriatuss__un_g__Actinoplaness__Actinoplanes_xinjiangensiss__un_f__Micromonosporaceaes__Verrucosispora_andamanensiss__un_g__Verrucosisporas__un_g__Dactylosporangiums__Polymorphospora_rubras__un_g__Micromonosporas__Micromonospora_siamensiss__Rhizocola_helleboris__Couchioplanes_caeruleus_subsp._azureuss__Phytohabitans_houttuyneaes__Phytohabitans_flavuss__un_g__Catellatosporas__un_g__Plantactinosporas__Catelliglobosispora_koreensiss__un_g__Planosporangiums__un_g__Nocardioidess__Nocardioides_albuss__Nocardioides_agariphiluss__Nocardioides_jenseniis__Nocardioides_dilutuss__Nocardioides_panacisolis__Nocardioides_daeguensiss__Nocardioides_mesophiluss__Nocardioides_caricicolas__Nocardioides_iriomotensiss__Nocardioides_terrigenas__Nocardioides_marinquilinuss__Kribbella_catacumbaes__Aeromicrobium_ginsengisolis__Aeromicrobium_panaciterraes__un_g__Aeromicrobiums__un_g__Marmoricolas__Marmoricola_scoriaes__Marmoricola_aequoreuss__Marmoricola_bigeumensiss__un_f__Nocardioidaceaes__Kineosporia_rhamnosas__un_f__Kineosporiaceaes__un_g__Angustibacters__un_g__Agromycess__Agromyces_indicuss__Agromyces_subbeticuss__Microbacterium_aerolatums__Microbacterium_fluviis__Microbacterium_arthrosphaeraes__Cryobacterium_arcticums__Yonghaparkia_alkaliphilas__un_f__Microbacteriaceaes__un_o__Actinomycetaless__un_f__Streptosporangiaceaes__Nonomuraea_salmoneas__Nonomuraea_maritimas__Nonomuraea_muscovyensiss__Nonomuraea_jabiensiss__Nonomuraea_kuesteris__Promicromonospora_xylanilyticas__un_g__Promicromonosporas__Cellulosimicrobium_funkeis__un_g__Nocardias__Nocardia_salmonicidas__un_g__Gordonias__un_g__Mycobacteriums__Mycobacterium_canariasenses__Mycobacterium_vaccaes__Blastococcus_saxobsidenss__un_g__Blastococcuss__Blastococcus_jejuensiss__un_g__Geodermatophiluss__un_g__Arthrobacters__Arthrobacter_globiformiss__Cellulomonas_humilatas__Cellulomonas_hominiss__un_g__Cellulomonass__un_g__Intrasporangiums__un_f__Intrasporangiaceaes__un_g__Phycicoccuss__un_g__Glycomycess__Cryptosporangium_minutisporangiums__un_f__Propionibacteriaceaes__un_g__Sporichthyas__Ilumatobacter_fluminiss__un_g__Ilumatobacters__Ilumatobacter_nonamienses__Iamia_majanohamensiss__un_f__Iamiaceaes__Aquihabitans_daechungensiss__un_o__Acidimicrobialess__Aciditerrimonas_ferrireducenss__un_g__Solirubrobacters__Solirubrobacter_taibaiensiss__Solirubrobacter_paulis__Conexibacter_arvaliss__un_g__Conexibacters__un_o__Solirubrobacteraless__un_c__Actinobacterias__Gaiella_occultas__Rubrobacter_bracarensiss__un_g__Rubrobacters__un_p__Actinobacterias__Flavobacterium_terraes__un_g__Flavobacteriums__Flavobacterium_saliperosums__Flavobacterium_anhuienses__Flavobacterium_gyeonganenses__Flavobacterium_subsaxonicums__Flavobacterium_hauenses__Flavobacterium_urocaniciphilums__Chryseobacterium_defluviis__Chryseobacterium_wanjuenses__un_g__Chryseobacteriums__un_f__Flavobacteriaceaes__un_f__Cryomorphaceaes__un_o__Flavobacterialess__Ohtaekwangia_koreensiss__un_g__Ohtaekwangias__Ohtaekwangia_kribbensiss__un_o__Cytophagaless__Chryseolinea_serpenss__un_f__Flammeovirgaceaes__un_g__Niastellas__un_g__Terrimonass__Terrimonas_aquaticas__un_g__Chitinophagas__un_f__Chitinophagaceaes__Lacibacter_daechungensiss__un_f__Sphingobacteriaceaes__un_o__Sphingobacterialess__un_p__Bacteroidetess__Mangrovibacterium_diazotrophicums__un_g__Bacilluss__Bacillus_thuringiensiss__Bacillus_abyssaliss__Bacillus_infantiss__Bacillus_infernuss__Bacillus_nealsoniis__Bacillus_plakortidiss__Bacillus_hortis__Bacillus_litoraliss__Bacillus_idriensiss__Bacillus_invictaes__Bacillus_kokeshiiformiss__un_f__Bacillaceae_1s__un_g__Fictibacilluss__Paenibacillus_panacisolis__un_g__Paenibacilluss__Paenibacillus_terraes__Paenibacillus_glycanilyticuss__Paenibacillus_urinaliss__Paenibacillus_pectinilyticuss__Paenibacillus_prosopidiss__Paenibacillus_brasilensiss__Paenibacillus_barcinonensiss__Brevibacillus_ginsengisolis__un_g__Brevibacilluss__Cohnella_panacarvis__un_g__Cohnellas__un_f__Paenibacillaceae_1s__Ammoniibacillus_agariperforanss__Paenisporosarcina_macmurdoensiss__un_g__Paenisporosarcinas__un_f__Planococcaceaes__un_g__Lysinibacilluss__Terribacillus_goriensiss__un_g__Terribacilluss__un_f__Bacillaceae_2s__un_g__Gracilibacilluss__Planifilum_compostis__un_g__Planifilums__un_f__Paenibacillaceae_2s__Tumebacillus_ginsengisolis__un_g__Tumebacilluss__un_o__Bacillaless__un_g__Clostridium_sensu_strictos__Clostridium_roseums__Sporacetigenium_mesophilums__un_g__Romboutsias__un_g__Clostridium_XIs__un_f__Lachnospiraceaes__Clostridium_straminisolvenss__un_f__Gracilibacteraceaes__un_c__Clostridias__Turicibacter_sanguiniss__un_g__Herpetosiphons__un_g__Roseiflexuss__Roseiflexus_castenholziis__un_f__Chloroflexaceaes__un_g__Oscillochloriss__un_o__Chloroflexaless__un_c__Chloroflexias__un_p__Chloroflexis__un_f__Caldilineaceaes__un_c__Acidobacteria_Gp4s__Blastocatella_fastidiosas__Aridibacter_famiduranss__un_g__Gp10s__un_g__Gp3s__un_g__Gp6s__un_g__Gp17s__un_p__Acidobacterias__un_g__Gp11s__un_g__Gp25s__un_g__Gp5s__un_g__Gp7s__un_g__Luteolibacters__Roseimicrobium_gellanilyticums__un_f__Verrucomicrobiaceaes__un_p__Verrucomicrobias__un_g__Spartobacteria_genera_incertae_sediss__un_g__Subdivision3_genera_incertae_sediss__un_c__Subdivision3s__Opitutus_terraes__Blastopirellula_cremeas__un_g__Blastopirellulas__un_f__Planctomycetaceaes__Pirellula_staleyis__un_p__Planctomycetess__Turneriella_parvas__un_f__Leptospiraceaes__un_g__Leptospiras__un_f__Spirochaetaceaes__un_f__Chlamydiaceaes__un_f__Parachlamydiaceaes__Parachlamydia_acanthamoebaes__un_o__Chlamydiales
Figure 3.8: Sunburst plot of taxanomy

3.5 Rtaxonkit

Taxonkit is a Practical and Efficient NCBI Taxonomy Toolkit ().

We recommend you download this excellent software to help next analysis. Or you can use Taxonkit in R by pctax interface as followed:

# 1. This function help you install suitable version taxonkit
install_taxonkit()
# taxonkit has been successfully installed!

# 2. Then download the NCBI taxonomy database.
download_taxonkit_dataset()
# Taxonkit files downloaded and copied successfully.

# 3. Check whether taxonkit is ready
check_taxonkit()
# ==============Taxonkit is available if there is help message above==============
# =========================Taxonkit dataset is available!=========================

Then you can use taxonkit in R just like in terminal.

?taxonkit_lineage

# taxonkit_list
# taxonkit_reformat
# taxonkit_name2taxid
# taxonkit_filter
# taxonkit_lca
lineage <- taxonkit_lineage("9606\n63221", show_name = TRUE, show_rank = TRUE, text = T)
lineage
[1] "9606\tcellular organisms;Eukaryota;Opisthokonta;Metazoa;Eumetazoa;Bilateria;Deuterostomia;Chordata;Craniata;Vertebrata;Gnathostomata;Teleostomi;Euteleostomi;Sarcopterygii;Dipnotetrapodomorpha;Tetrapoda;Amniota;Mammalia;Theria;Eutheria;Boreoeutheria;Euarchontoglires;Primates;Haplorrhini;Simiiformes;Catarrhini;Hominoidea;Hominidae;Homininae;Homo;Homo sapiens\tHomo sapiens\tspecies"                                                   
[2] "63221\tcellular organisms;Eukaryota;Opisthokonta;Metazoa;Eumetazoa;Bilateria;Deuterostomia;Chordata;Craniata;Vertebrata;Gnathostomata;Teleostomi;Euteleostomi;Sarcopterygii;Dipnotetrapodomorpha;Tetrapoda;Amniota;Mammalia;Theria;Eutheria;Boreoeutheria;Euarchontoglires;Primates;Haplorrhini;Simiiformes;Catarrhini;Hominoidea;Hominidae;Homininae;Homo;Homo sapiens;Homo sapiens neanderthalensis\tHomo sapiens neanderthalensis\tsubspecies"