AnCoRe - Analysis of Complementary Residues
|
AnCoRe is aimed to identify complementary residues in multiple-sequence alignments (MSA). This allows the prediction of potentially intramolecular interacting residues for homologous proteins, which share a common fold like in the GPHRs. The assumption is, if one of two interacting residues has changed the side chain property by evolutionary mutation, the other residue changed simultaneously the side chain to maintain the interaction by complementary properties. To find these residue positions in a MSA, the complementarity of the amino acids is evaluated by comparing the properties of the participating amino acids. Beside the prediction of potentially intramolecular interacting residues AnCoRe can also be used to predict the residues which are responsible for the selective interaction of two sets of homologous proteins sharing a conserved intramolecular binding mechanism. For this application AnCoRe is applied to two corresponding MSAs. To find these residue positions, the complementarity of the amino acids is evaluated by comparing the properties of the participating amino acids. Introduction
Most cellular processes depend largely on specific intra-protein interactions for proper protein folding and on inter-protein interactions for specific protein-protein interactions. However, experimental analyses of these interactions (e.g. by Mutation) are unfortunately expensive and time consuming. To reduce the number of wet lab experiments necessary, good working hypotheses are needed. Such working hypotheses can often be obtained by deciphering the hidden information within the vast amount of sequence information available. Especially alignments of homologous proteins sharing a common fold contain information of great value, which is utilized in the approach presented here. Rationale The specificity of intra- and protein-protein interactions is mediated for the most part by the properties of the amino acid side chains. Only complementary properties, suitable for the interaction between amino acid side chains, will allow for binding of the respective poly-peptide chains. For the interaction to remain throughout the course of evolution, a change in the property of one of the interacting partners has to be compensated by a corresponding change of the other partner. These complementary mutations can be spotted in multiple sequence alignments (MSA) and permit the identification of potentially interacting residues. Prerequisites
Intra-molecular (fold stabilizing) interactions:
Inter-molecular (protein-protein) interactions:
|
|||||||||||||||||||||||||||||||||||
|
Summary
AnCoRe is a fast and easy to use tool which aids the user in analyzing multiple sequence alignments for potentially interacting residues if no other prior knowledge is available. It can be applied to intra-protein interactions for evaluation of the protein fold, e.g. in homology modeling applications especially for segments exhibiting less sequence conservation. It can also be applied to protein-protein interactions, by comparing two sequence alignments of the participating proteins, to identify residues being potentially responsible for the interaction. |
||||||||||||||||||||||||||||||||||
|
Implementation Tcl/Tk was used to implement AnCoRe in a cross-platform manner. Therefore AnCoRe should in principle run on every platform supported by Tcl/Tk. AnCoRe has been successfully tested on the following platforms using Tcl/Tk 8.3:
The MSA consists of m sequences S, each containing n residues aa taken from the amino acid alphabet AA (1). ![]() Amino acids are grouped into classes corresponding to similar amino acid properties. Using this reduced class alphabet, with each member c representing a subset CDc from the amino acid alphabet, the MSA is converted into m sequences C (2). ![]() The class break pattern is obtained by evaluating the changes of amino acid classes from sequence to sequence at a given homologous residue position. CB is then the set of sequence numbers for which the residue class cs at a given residue position r is different from the residue class cs-1 of the preceding sequence (3). ![]() The complementarity of two residue positions in one multiple-sequence alignment (intra-protein) or two corresponding multiple-sequence alignments (protein-protein) is evaluated by AnCoRe using three scores: 1. CIS - the amino acid class complementarity score: CIS is the mean score of all class pairs at the two residue positions (4). ![]() In addition to predefined class definitions [Murphy et al. 2000], the user can supply customized ones. The class complementarity matrix MCIS [Betancourt et al. 1999], defining the corresponding complementarity scores, is also user customizable. This enables the user to search for specific interaction types only (e.g. charged interactions, aromatic interactions, etc.) while suppressing unwanted less specific potential interactions. 2. AIS - the amino acid complementarity score: AIS is the mean score of all amino acid pairs at two residue positions taken from an amino acid complementarity matrix MAIS [Betancourt et al. 1999] (5). ![]() Different matrices can be used and even customized. By additionally calculating the AIS the coarse grained results from evaluating the CIS can be refined while retaining its capability to filter out less likely interactions. 3. CBS - the class break similarity score: The comparison of two class break patterns at two residue positions for similarity yields the CBS (6). ![]() The CBS emphasizes the simultaneity of property changes at the two residue positions. RIR - the three scores are combined into a composite RIR score, which is used to rank the potentially interacting residue pairs (7). ![]() Additionally it is possible to analyze each score individually to focus only on a specific aspect. Tutorial In order to follow the tutorial please download the exemplary synthetic alignments tutmsa1.msf, tutmsa2.msf and install AnCoRe as described below. The you can follow the tutorial outlined in the next section: |
|||||||||||||||||||||||||||||||||||
|
tutmsa1.msf tutmsa2.msf
|
Prior to starting a new AnCoRe project:
|
||||||||||||||||||||||||||||||||||
|
Start Ancore and define a new Project:
|
||||||||||||||||||||||||||||||||||
|
Select the matrices to use for this analysis. The AnCoRe installation provides a basic set of matrices (see below in the installation section) which are installed in the AnCoRe installation path under "matrices".
|
||||||||||||||||||||||||||||||||||
|
An important feature of AnCoRe is the possibility to define custom matrices focusing on a specific type of interaction while suppressing many other potential interactions. The user can also customize the provided matrices. In order to edit a given matrix click on [Edit...]:
|
||||||||||||||||||||||||||||||||||
|
Set the thresholds and weights for the AnCoRe analysis and start the calculation.
|
||||||||||||||||||||||||||||||||||
|
The AnCoRe analysis produces a number of files in the project directory.
|
|||||||||||||||||||||||||||||||||||
|
AnCoRe itself provides a tool to visualize and filter the results.
|
||||||||||||||||||||||||||||||||||
|
You have successfully finished the tutorial! Download The current release of AnCoRe V.1.0.1 is available as a .tar.gz or .zip archive. The archives contain the AnCoRe script together with some exemplary matrices contained in a predefined directory structure.
Installation To install the AnCoRe package please download one of the above archives. Unpack the downloaded archive to a directory of your choice. under Linux and Unix you can use the following command: % gunzip -c ancore_101.tar.gz | tar xvf - Under Windows use WinZip or a similar tool and make sure you extract also the directory structure.
The unpacking of the archive produces the following directory structure:
A working Tcl/Tk (version 8.3 or later) scripting environment is needed in order to run the AnCoRe script. Most Linux distributions are delivered with a current version of Tcl/Tk, but it may be necessary to install the package separately. For Windows and some other platforms the sources and for some platforms also pre-compiled binaries are available at:
Especially for SGI Irix, the TclPro package provides a current pre-compiled binary.
For Unix/Linux edit the /ancore/bin/ancore Bourne shell script as follows:
For Windows edit the /ancore/bin/ancore.bat shell script as follows:
Now you should now be able to start AnCoRe using the start scripts: % ancore
Under Windows: References
|
|||||||||||||||||||||||||||||||||||















