Researchers at the University of Maryland report the development of a new, web-based tool that enables researchers to quickly and easily visualize and compare large amounts of genomic information resulting from high-throughput sequencing experiments. The free tool, called Epiviz, was described in a paper (“Epiviz: interactive visual analytics for functional genomics data”) published online in Nature Methods.
Next-generation sequencing has revolutionized functional genomics. These techniques are key to understanding the molecular mechanisms underlying cell function in healthy and diseased individuals and the development of diseases like cancer. Data from multiple experiments need to be integrated, but the growing number of datasets makes a thorough comparison and analysis of results challenging.
To visualize and browse entire genomes, graphical interfaces that display information from a database of genomic data (genome browsers) were created. Epiviz offers a major advantage over browsers currently available, according to the Maryland team: “Epiviz seamlessly integrates with the open-source Bioconductor analysis software widely used by genomic scientists, through its Epivizr Bioconductor package,” says Héctor Corrada-Bravo, Ph.D., assistant professor in computer science at UMD.
“Visualization is an integral aspect of genomics data analysis,” wrote the UMD investigators. “Algorithmic-statistical analysis and interactive visualization are most effective when used iteratively. Epiviz, a web-based genome browser, and the Epivizr Bioconductor package allow interactive, extensible, and reproducible visualization within a state-of-the-art data-analysis platform.”
“Prior tools limited visualization to presentation and dissemination, rather than a hybrid tool integrating interactive visualization with algorithmic analysis,” explained Dr. Corrada-Bravo, who also has an appointment in the Center for Bioinformatics and Computational Biology of the university's Institute for Advanced Computer Studies.
Because Epiviz is based on the Bioconductor infrastructure, the tool supports many popular next-generation sequencing techniques, such as ChIP-seq, which is used to analyze protein interactions with DNA; RNA-seq, which reveals a comprehensive snapshot of the abundance of RNAs in cells; and DNA methylation analyses,” according to Dr. Corrada-Bravo.
Epiviz implements multiple visualization methods for location-based data (such as genomic regions of interest) and feature-based data (such as gene expression), using interactive data visualization techniques not available in web-based genome browsers. For example, because display objects are mapped directly to data elements, Epiviz links data across different visualizations giving users visual insights of the spatial relationships of multiple datasets. The tool is designed to allow biomedical scientists to easily incorporate their own visualizations.
In the Nature Methods paper, the UMD group and colleagues from Williams College in Massachusetts and Washington University in St. Louis used Epiviz to visualize and analyze DNA methylation and gene expression data in colon cancer. Changes in DNA methylation patterns compared with normal tissue have been associated with a large number of human malignancies.
Using Epiviz and Bioconductor, the researchers found consistent regions of DNA methylation changes in colon cancer samples generated by the public Cancer Genome Atlas project, and similar gene expression in these regions of DNA methylation changes in other cancer types. The results were in agreement with previous experiments, which were conducted by researchers at Johns Hopkins University in collaboration with Dr. Corrada-Bravo, showing DNA methylation changes across large regions in the colon cancer genome.
“Epiviz helps biomedical scientists meet the challenge of visualizing large genomic datasets while supporting creative data analysis in a collaborative environment,” pointed out Dr. Corrada-Bravo.