Teichert, Florian (2009)
Protein Sequence and Structure Comparison based on vectorial Representations.
Technische Universität Darmstadt
Dissertation, Erstveröffentlichung
Kurzbeschreibung (Abstract)
Proteins are very complex physical objects consisting of thousands of atoms and hundreds of amino acids with complicated local and global interactions on length scales ranging from the microscopic neighbourhood of atoms to the macroscopic size of organisms. The spatial configuration, in spite of that, is encoded into one single character per amino acid using a twenty character alphabet, an apparent contradiction that is not fully understood to date. This thesis is concerned with problems of protein structure and the relationship of protein sequence and structure. It is tried to integrate the different approaches typically carried out by physicists in the field that investigate very simplified model systems, e.g. single helices, with the bioinformatics approach to build powerful analysis tools. The first approach often leads to oversimplified systems that do not describe native proteins as a whole, while the second can be too heuristic and too involved to answer fundamental questions. We start from defining vectorial descriptions of protein structure, similar in form to sequence descriptions, to firstly compare protein structures, i.e. to perform structure alignments, and discuss several measures for structural similarity. From these we derive a statistical structural similarity score for pairs of protein structure based on their spatial superimposition. Then we utilize a previously known ansatz to exploit the sequence to structure correlation in order to predict vectorial structure descriptions from protein sequence. These predicted profiles are then used within the same alignment framework to align protein sequences. For these alignments a basic evolutionary similarity measure between protein sequences is derived. Large part of this thesis is dedicated to the objective assessment of alignment methods including the new method presented and a number of establish programs. A commonly used measure of structural similarity, the Percentage of Structural Identity (PSI), is discussed and generalized to cover an internal degree of freedom in structure that was ignored formerly. The improvement is achieved by very simple but powerful reasoning. The resulting scheme is also applicable to detect hinges in protein structures. Concluding, we state that protein structure, despite its complexity, is indeed to a large extent one-dimensional. The unification of structure and sequence alignments under a single formalism gives some insight into the relation of sequence and structure in proteins.
Typ des Eintrags: | Dissertation | ||||
---|---|---|---|---|---|
Erschienen: | 2009 | ||||
Autor(en): | Teichert, Florian | ||||
Art des Eintrags: | Erstveröffentlichung | ||||
Titel: | Protein Sequence and Structure Comparison based on vectorial Representations | ||||
Sprache: | Englisch | ||||
Referenten: | Porto, Prof. Dr. Markus ; Drossel, Prof. Dr. Barbara | ||||
Publikationsjahr: | 17 März 2009 | ||||
Ort: | Darmstadt | ||||
Verlag: | Technische Universität | ||||
Datum der mündlichen Prüfung: | 16 Februar 2009 | ||||
URL / URN: | urn:nbn:de:tuda-tuprints-13474 | ||||
Kurzbeschreibung (Abstract): | Proteins are very complex physical objects consisting of thousands of atoms and hundreds of amino acids with complicated local and global interactions on length scales ranging from the microscopic neighbourhood of atoms to the macroscopic size of organisms. The spatial configuration, in spite of that, is encoded into one single character per amino acid using a twenty character alphabet, an apparent contradiction that is not fully understood to date. This thesis is concerned with problems of protein structure and the relationship of protein sequence and structure. It is tried to integrate the different approaches typically carried out by physicists in the field that investigate very simplified model systems, e.g. single helices, with the bioinformatics approach to build powerful analysis tools. The first approach often leads to oversimplified systems that do not describe native proteins as a whole, while the second can be too heuristic and too involved to answer fundamental questions. We start from defining vectorial descriptions of protein structure, similar in form to sequence descriptions, to firstly compare protein structures, i.e. to perform structure alignments, and discuss several measures for structural similarity. From these we derive a statistical structural similarity score for pairs of protein structure based on their spatial superimposition. Then we utilize a previously known ansatz to exploit the sequence to structure correlation in order to predict vectorial structure descriptions from protein sequence. These predicted profiles are then used within the same alignment framework to align protein sequences. For these alignments a basic evolutionary similarity measure between protein sequences is derived. Large part of this thesis is dedicated to the objective assessment of alignment methods including the new method presented and a number of establish programs. A commonly used measure of structural similarity, the Percentage of Structural Identity (PSI), is discussed and generalized to cover an internal degree of freedom in structure that was ignored formerly. The improvement is achieved by very simple but powerful reasoning. The resulting scheme is also applicable to detect hinges in protein structures. Concluding, we state that protein structure, despite its complexity, is indeed to a large extent one-dimensional. The unification of structure and sequence alignments under a single formalism gives some insight into the relation of sequence and structure in proteins. |
||||
Alternatives oder übersetztes Abstract: |
|
||||
Sachgruppe der Dewey Dezimalklassifikatin (DDC): | 500 Naturwissenschaften und Mathematik > 500 Naturwissenschaften 500 Naturwissenschaften und Mathematik > 530 Physik 500 Naturwissenschaften und Mathematik > 570 Biowissenschaften, Biologie |
||||
Fachbereich(e)/-gebiet(e): | 05 Fachbereich Physik 05 Fachbereich Physik > Institut für Festkörperphysik (2021 umbenannt in Institut für Physik Kondensierter Materie (IPKM)) |
||||
Hinterlegungsdatum: | 20 Mär 2009 10:42 | ||||
Letzte Änderung: | 26 Aug 2018 21:25 | ||||
PPN: | |||||
Referenten: | Porto, Prof. Dr. Markus ; Drossel, Prof. Dr. Barbara | ||||
Datum der mündlichen Prüfung / Verteidigung / mdl. Prüfung: | 16 Februar 2009 | ||||
Export: | |||||
Suche nach Titel in: | TUfind oder in Google |
Frage zum Eintrag |
Optionen (nur für Redakteure)
Redaktionelle Details anzeigen |