Large-scale multiple sequence alignment visualization through gradient vector flow analysis

You are here

TitleLarge-scale multiple sequence alignment visualization through gradient vector flow analysis
Publication TypeConference Paper
Year of Publication2013
AuthorsNguyen, KTan, Ropinski, T
Conference Name2013 IEEE Symposium on Biological Data Visualization (BioVis)2013 IEEE Symposium on Biological Data Visualization (BioVis)
PublisherIEEE
Conference LocationAtlanta, GA, USA
Accession Number13898659
KeywordsAlgorithm design and analysis, Data visualization, Feature extraction, Force, Image color analysis, Vectors, Visualization
Abstract

Multiple sequence alignment (MSA) is essential as an initial step in studying molecular phylogeny as well as during the identification of genomic rearrangements. Recent advances in sequencing techniques have led to a tremendous increase in the number of sequences to be analyzed. As a result, a greater demand is being placed on visualization techniques, as they have the potential to reveal the underlying information in large-scale MSAs. In this work, we present a novel visualization technique for conveying the patterns in large-scale MSAs. By applying gradient vector flow analysis to the MSA data, we can extract and visually emphasize conservations and other patterns that are relevant during the MSA exploration process. In contrast to the traditional visual representation of MSAs, which exploits color-coded tables, the proposed visual metaphor allows us to provide an overview of large MSAs as well as to highlight global patterns, outliers, and data distributions. We will motivate and describe the proposed algorithm, and further demonstrate its application to large-scale MSAs.

URLhttp://ieeexplore.ieee.org/lpdocs/epic03/wrapper.htm?arnumber=6664341
DOI10.1109/BioVis.2013.6664341