In human metagenomics studies, for example, the characterization of bacterial, viral, and eukaryotic organisms taken from a specific site on the human body, by gross sampling followed by DNA isolation, has been greatly accelerated by next-generation sequencing (NGS) technology.
Generating NGS data from these metagenomic isolates addressed the two major issues, cost and ease of data generation, that kept early metagenomic studies from achieving their full potential.
With the decreased cost of NGS reads, significantly deeper sampling of each population isolate can be achieved and hence minor species detected. The digital nature of these reads further enables a measure of the relative proportions of each population member.
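The idea that read counts act as a digital measure of population composition can be illustrated with a minimal sketch. It assumes each read has already been assigned to a taxon by some upstream classifier; the function name `relative_abundance` and the taxon labels are hypothetical, and the calculation ignores corrections a real analysis would likely apply, such as genome size or marker-gene copy number.

```python
from collections import Counter


def relative_abundance(read_assignments):
    """Return each taxon's fraction of the assigned reads.

    read_assignments: iterable of taxon labels, one per read
    (hypothetical output of an upstream read classifier).
    """
    counts = Counter(read_assignments)
    total = sum(counts.values())
    if total == 0:
        return {}
    return {taxon: n / total for taxon, n in counts.items()}


# Hypothetical per-read taxon labels for a tiny sample of ten reads.
reads = ["Bacteroides"] * 6 + ["Escherichia"] * 3 + ["Akkermansia"] * 1
abundance = relative_abundance(reads)

# Print taxa from most to least abundant.
for taxon, fraction in sorted(abundance.items(), key=lambda kv: -kv[1]):
    print(f"{taxon}\t{fraction:.2f}")
```

In this toy sample the minor member is represented by a single read out of ten, so a shallower sample could easily have missed it; this is why deeper sampling improves detection of minor species.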
The simplicity of NGS library preparation, the ability to make these libraries from tiny input amounts of DNA, and the elimination of a bacterial cloning intermediate have been invaluable to improving the representation of all species present in the populations sampled and to expanding the types of samples we can address.
The development of data-analysis approaches that can handle the growing size of the sequence datasets produced by deep NGS sampling, and that can accurately mine information from them, has re-engaged computational biologists and has resulted in a staggering amount of innovation.
Resulting “big science” projects such as the NIH’s Human Microbiome Project, Europe’s MetaHIT, and other international projects, as well as the research of independent investigators, have begun to define bacterial diversity in human health and disease [2-3].
Interestingly, we now have a bacterial census of the intestines of cats and dogs [4], pigs [5], and the octopus [6]. Novel pathogenic viruses have been discovered by mining metagenomic datasets, and etiologic agents have been identified in disease outbreaks [7-8].
Studying RNA isolates converted to cDNA from various sources has given rise to a new experimental approach called “metatranscriptomics,” which can be applied to characterize the metabolic potential of each population.
Metagenomics has also been applied to characterize environments unrelated to human health, such as soils [9-10], lakes [11], and thermal springs in Russia [12], among myriad others. In fact, a quick search of PubMed for “metagenomic” returns 1,382 references. Most have been published since 2005, across an incredible breadth of topics that reflects the explosion of this scientific endeavor in basic biological discovery, data analysis, data mining, and methods development.
As the transformation of metagenomics by NGS and advanced analytical approaches continues, it will be interesting to see its impact on diverse areas such as food safety monitoring (witness the need for continuing vigilance as yet another E. coli strain affects human health in Europe) and pharmaceutical product development and quality control (such as in vaccines or other live-cell products).
Another possible use will be diagnosis to guide optimal antibiotic treatment in patients infected with pathogens known to harbor a spectrum of antibiotic resistance, such as methicillin-resistant Staphylococcus aureus (MRSA). There are numerous other applications.