Untitled Document
 
     
Untitled Document

UNIX
        SGI IRIX

HOMOLOGY MODELING
        USES & BACKGROUND
        BASIC GUIDE
        FOLD ASSIGNMENT
        TEMPLATE & ALIGNMENT
        BUILDING THE MODEL
        REFINEMENT & EVALUATION

DOCKING
        USES
        BACKGROUND
        SETTING UP THE SYSTEM
        ACCESSING THE RESULTS

RATIONAL DRUG DESIGN
        USES
        BACKGROUND

MOLECULAR DYNAMICS
        USES
        BACKGROUND
        SETTING UP THE SYSTEM
        ACCESSING THE RESULTS

 

 

 

Homology Modeling

The following document gives some indepth information about homology modeling. A modeling tutorial using DS Modeling (Accelrys) can be found here.

STEP 3: Model building
 

Methods of model building

    Two main methods are used to built the 3D structure in homology modelling that differ on the definition of function F transforming sequence space in structure space. The first method is based on rigid body superimposition and the second in geometric restraints, with analogy to the molecular replacement and distance geometry methodologies decribed for Xray and NMR structure determination, respectively.

    Several algorithms have been developed in order to obtain a rigid body superimposition between sequences no directly related (JIG-SAW , COMPOSER , among others). SCR construction follows the original approach of Greer using sequentially similar SCR from homologous proteins to define the new core from a multiple alignment: 1) superimposing the known structures of homologous proteins (parents) using the SCRs to construct a framework; 2) superimposing the closest template sequence to the target sequence in the averaged main chain of framework; 3) building the SVRs main chain conformations by fitting compatible structures in the anchored stumps of the framework (see section on SVRs modelling for identification of the stretches to use); and 4) completing the target structure by modelling the side-chains of the target sequence.

    The methods based on the satisfaction of spatial restraints (like MODELLER ) are based on generating as many constraints (or restraints) as possible from the structural alignments of the parents and building the target structure like in the NMR methods (using additional energetic restraints according to the correct stereochemistry of the protein polymer). It is clear that regions where the structure of the homologous templates can not be structurally aligned, or where an alignment between the target and the multiple alignment of the templates is not given, will have to be built with an additional function. Most of the structural changes are produced in the loop regions, but occasional secondary structures may also be involved in variable regions . In the case of multiple superimposed parents the coordinates are separated into conserved secondary structure elements and conserved loops.
     
     

Model building of SVRs

    SVRs modelling can be seen as a mini protein folding problem, consequently the number of methods for predicting loop conformation are twofold: ab initio methods and adopting database searching techniques or knowledge-based approaches

    1. The ab initio prediction is based on a conformational search guided by a scoring or energy function: (f,y) space sampling ; minimum perturbation random tweak method ; systematic conformational search ; global energy minimization , local energy minimization ; molecular dynamics simulations ; genetic algorithms ; Monte Carlo and molecular dynamics ; Monte Carlo sampling ; multiple copy sampling ; searching discrete conformations by dynamic programming ; self-consistent field optimization ; among others (for a review see )

    2. The database approach to loop prediction consists of finding a segment of main chain that fits the two stem regions of a loop. The procedure has improved since the early works on modeling and in the last few years instead of a single conformation a number of loop conformations are selected for each gap that is as uniformely spread as possible . Hence, the remaining loops from the multiple parent modelling and all loops in the single parent modelling are modelled from database searches in three different databases: 1) homologous structures ; 2) cluster database of loops ; and 3) nonredundant database of proteins with less than 25% homology and accuracy higher than 2.5 A.

    The requirements of the chosen loop cluster of conformations are twofold: 1) the fitting between the two bracing secondary structures, and 2) a sequence pattern presented in the target loop to model. This procedure is based on the successful work on canonical loop structures of immunoglobulin complementary determining regions (CDR) by Chothia et al.. Nevertheless, the database search is valid only for short and medium sized loops or for special cases where homologous proteins share some structural commonalities on the loops although still being considered variable regions (as is the case for immunoglobulins ). Up to date classifications of long loops have failed, and it has been demonstrated that a correlation between the geometric variables describing the loop stems is needed in order to obtain such classification. This was only asserted for short and medium sized loops .
     

Side-chain construction. The side chains of the components need to be changed to those of the target structure. The side-chain packing problem is concerned with obtaining the arrangement of side-chain conformations on a given fixed backbone. Vasquez reviewed on various approaches to side-chain modelling , the major problem for predicting side-chain conformations being again of combinatorial nature. The strategy to model side-chains is also to reduce the dimension of the problem by incorporating as much empirical information as possible. Heuristic procedures either forego any attempt to solve the combinatorial problem, or conduct some degree of combinatorial optimization in a solution space that has been reduced as a result of local analysis. For example, significant correlations are found between side-chain dihedral angles and backbone that go beyond the dependence on the secondary structure . Therefore, the conformation of the side-chains are copied from a homologous template in homology building: a single rotamer for each side chain is built that traces as far as possible the path of the original side-chain. Nevertheless, there is a rapid decrease in the side-chain packing conservation when the sequence identity falls under 30% which implies the need of other strategies for dimensional reduction. An important piece of information is that side-chains can be grouped in representative sets of rotamers with specific distributions. Consequently, the library of rotamers taken from the database of protein structures can be used as an alternative to model the side-chains. First, additional internal coordinates to complete the side chain are taken from a secondary structure dependent rotamer library . Second, the side-chain is chosen by optimization procedures derived from the mean field theory approximation from additional rotamers representing high population densities in PDB. Energy-based procedures rely on the assumption that lower values necessarily correlate with more accurate positioning . This puts the burden on the quality of the particular energy function used. There are several limitations on the potential energy function for structure prediction in vacuum. When modelling side-chains on the surface of the protein it is not possible to calculate its interaction with solvent, because water molecules can not be included with the rotamers from the library. Karplus and cow. have obtained an accuracy of around 70% on de modeling of side-chains by testing the accuracy of new force fields . They demonstrate that the absence of solvent introduces an error in the hydrogen-bonding pattern of polar residues, being necessary the inclusion of electrostatic and solvation effects. The success in the solution of the rotamer-packing problem has enabled incorporation of strategies that solve this problem in docking procedures that evaluate protein-protein interactions.
 
 



     

 

Sign Guest Book
View Guest Book

This site is maintained by Arzhang Fallahi
Last Updated: August 2, 2004
Comments/Suggestions


Visits to site: