TU Darmstadt / ULB / TUbiblio

Substitution matrix based color schemes for sequence alignment visualization.

Kunzmann, Patrick and Mayer, Benjamin E. and Hamacher, Kay (2020):
Substitution matrix based color schemes for sequence alignment visualization.
In: BMC bioinformatics, 21 (1), p. 209. ISSN 1471-2105,
DOI: 10.1186/s12859-020-3526-6,
[Article]

Abstract

BACKGROUND

Visualization of multiple sequence alignments often includes colored symbols, usually characters encoding amino acids, according to some (physical) properties, such as hydrophobicity or charge. Typically, color schemes are created manually, so that equal or similar colors are assigned to amino acids that share similar properties. However, this assessment is subjective and may not represent the similarity of symbols very well.

RESULTS

In this article we propose a different approach for color scheme creation: We leverage the similarity information of a substitution matrix to derive an appropriate color scheme. Similar colors are assigned to high scoring pairs of symbols, distant colors are assigned to low scoring pairs. In order to find these optimal points in color space a simulated annealing algorithm is employed.

CONCLUSIONS

Using the substitution matrix as basis for a color scheme is consistent with the alignment, which itself is based on the very substitution matrix. This approach allows fully automatic generation of new color schemes, even for special purposes which have not been covered, yet, including schemes for structural alphabets or schemes that are adapted for people with color vision deficiency.

Item Type: Article
Erschienen: 2020
Creators: Kunzmann, Patrick and Mayer, Benjamin E. and Hamacher, Kay
Title: Substitution matrix based color schemes for sequence alignment visualization.
Language: English
Abstract:

BACKGROUND

Visualization of multiple sequence alignments often includes colored symbols, usually characters encoding amino acids, according to some (physical) properties, such as hydrophobicity or charge. Typically, color schemes are created manually, so that equal or similar colors are assigned to amino acids that share similar properties. However, this assessment is subjective and may not represent the similarity of symbols very well.

RESULTS

In this article we propose a different approach for color scheme creation: We leverage the similarity information of a substitution matrix to derive an appropriate color scheme. Similar colors are assigned to high scoring pairs of symbols, distant colors are assigned to low scoring pairs. In order to find these optimal points in color space a simulated annealing algorithm is employed.

CONCLUSIONS

Using the substitution matrix as basis for a color scheme is consistent with the alignment, which itself is based on the very substitution matrix. This approach allows fully automatic generation of new color schemes, even for special purposes which have not been covered, yet, including schemes for structural alphabets or schemes that are adapted for people with color vision deficiency.

Journal or Publication Title: BMC bioinformatics
Journal volume: 21
Number: 1
Divisions: 10 Department of Biology
10 Department of Biology > Computational Biology and Simulation
Date Deposited: 02 Jun 2020 11:53
DOI: 10.1186/s12859-020-3526-6
Identification Number: pmid:32448181
Export:
Suche nach Titel in: TUfind oder in Google
Send an inquiry Send an inquiry

Options (only for editors)
Show editorial Details Show editorial Details