Protein Folding

The problem of how a 1-D poly peptide chain acquires the form of a native 3-D protein structure is referred to as the protein folding problem.
This is one of the most challenging and important research areas in Biochemistry. All the information needed to specify the protein's 3-D structure is contained within its amino acid sequence, and given suitable conditions, most proteins will spontaneously fold into their native states.
Although studies have provided much information on the folding process, our understanding of a way proteins fold is not full yet.

The main reasons the protein folding problem is considered such a difficult one are:if a protein had to search all the possible conformations to find the energetically most favorable one, even a small protein would have to search for an unreasonable amount of time. Since it is known that proteins usually fold into their native states within much shorter time, it is clear that the protein retains partially correct intermediate rather than randomally scan all its possible conformations. Therefore, the energy calculations involve millions of atomic interactions, and going over all of them to find the most favorable conformation is beyond the ability of even the most powerful computers available today.
Structure is much more conserved than sequence. Similar sequences usually give similar structures, but different sequences may lead to a similar structures. As a result, there is a much smaller number of known structures than sequences. This means that the relationship between sequence and structure is not one-to-one, and the tules that predict the structure out of the sequence are not fully understood.

There are several methods to devise the 3-D structure of a protein:

Experimental methods:

NMR (nuclear magnetic resonance)
X-ray Crystallography

Theoretical and computational methods:

Homology modeling assumes that similar sequence leads to similar structure. It involves fitting a sequence to a known 3-D structure of a similar sequence. Two proteins are considered homologous if they have identical amino acid residues in a significant number of sequential positions along the polypeptide chains (usually at least 30-40% of identity). The results of homology modeling are more satisfying than those of ab-initio modeling methods. About 2% of the known structures were determined this way.
Ab initio methods do not rely on known structures, rather they use the physics of interactions among atoms, including the solvent, to define a force field and then use methods like molecular dynamics or monte carlo to determine the most stable structure.
threading (inverse folding): Since it is assumed that there is only a limited number of stable folds and many different sequences having the same fold, the inverse folding method tries to figure which specific patterns are compatible with a specific fold. The used methodology is called threading, since it involves "threading" a specific sequence through all known folds and estimate the probability that the sequence can have such a fold.