[Home][Help][About RsiteDB][Download files]

Abstract: The database details the interactions of extruded, unpaired RNA nucleotide bases. It presents and classifies the protein binding pockets that accommodate them. It allows the recognition of similar protein binding patters involved in interactions with different RNA molecules. Given an unbound structure of a target protein it allows the prediction of its RNA nucleotide binding sites.
Input: The user should specify a PDB code of the protein-RNA complex of interest.

General analysis: Given the input PDB code, RsiteDB will present its description and will detail its interacting protein-RNA chains. For each pair of interacting chains, it will list the number of protein binding sites that accommodate extruded nucleotides and dinucleotides.

A nucleotide binding site is defined by the protein Connolly solvent-accessible surface area within 2A from the surface of the RNA nucleotide ring. Only nucleotides with a protein binding site area larger than 3A are considered as protein interacting. Each pair of extruded consecutive nucleotides that interact with the protein, will be considered as dinucleotides and their binding site will be analyzed as a single unit. Binding sites of single nucleotides that are not a part of a pair of consecutive interacting nucleotides are considered as single-nucleotide binding sites.

Dinucleotide binding sites:
Given a pair of extruded consecutive nucleotides that interact with the protein, a dinucleotide binding site is defined by the pair of the corresponding nucleotide binding sites. RsiteDB presents all the details of these nucleotides. Specifically, it details the nucleotide identities and sequence numbering, where we present the numbers listed in the PDB file (PDB numbering) as well as enumerate the nucleotides according their location on the specified RNA chain (Sequence numbering). To analyze the geometries of the dinucleotides, we measure the angle between the planes of the nucleotide base rings (Ring Angle, measured by the angle cosine) as well as the distance between the ring centroids. In addition, we detail the protein binding site surface area (in squared A) that is in contact with the dinucleotide bases (less than 2A from the rings). We specify whether the dinucleotides are pi-stacked with each other. A pair of consecutive extruded nucleotides as pi-stacking, if the corresponding nucleotides' planes are either parallel or perpendicular and have an angle of 180+-2 or 90+-20 respectively. In addition, we require that the distance between nucleotide ring centroids is less 7A. Finally, we provide a Jmol visualization of the dinucleotides of interest and give a link to the binding sites with similar binding patterns (Cluster members). If the considered binding site was not recognized to be similar to any other dinucleotide binding site it is described as "Unique".

Single nucleotide binding site:
Given a nucleotide binding site, that does not have a protein interacting RNA strand neighbor, we analyze it as a single nucleotide binding site. Similarly to dinucleotide binding sites, we detail the nucleotide identity and sequence numbering, where present the numbers listed in the PDB file (PDB numbering) as well as enumerate the nucleotides according their location on the specified RNA chain (Sequence numbering). We detail the binding site surface area and provide a link to a Jmol visualization and a list of binding sites with similar patterns (Cluster members).
Cluster members:
RsiteDB is based on a classification of the nucleotide and dinuclaotide binding sites. For each binding site we detail its cluster members, which are binding sites with similar spatial physico-chemical binding patterns. Moreover, we present the details of this pattern shared by all cluster members. Specifically, each physico-chemical property is defined by the following description: Chain.ID - the protein chain identifier and the amino acid PDB sequence number;
Amino Acid - The one letter amino acid code. However it must be noted that the method is based on the physico-chemical properties and does not consider the identity of the amino acids. These are only displayed for the convenience of analysis.
Type - The type of physico-chemical property that is matched by the algorithm. The method is based on a representation of each amino acid of a protein as a set of features that are important for its interaction with other molecules. The abbreviations of these features are: DON - Hydrogen bond donor; ACC - Hydrogen bond acceptor; DAC - Hydrogen bond donor and acceptor (e.g in histidine); ALI - Aliphatic Hydrophobic property; PII - Aromatic property (pi contacts).
Source This field specifies whether the matched property is contributed by the backbone or the side-chain of the amino acid. The abbreviations are: b - feature contributed by the protein backbone atoms; s - feature contributed by the protein side-chain atoms.
Same AA: Marks the features shared by the two molecules that are contributed by residues with the same identity of the amino acid.

Each solution can be visualized with Jmol. The default view presents the superimposition of all the complexes, with their corresponding nucleotides/dinucleotides and the matched physico-chemical properties. The molecules are colored by model (according to the order of input molecules, the first is cyan, the second is magenta, the third is yellow). The physico-chemical properties (pseudocenters) are represented as balls. Hydrogen bond donors are blue, acceptors - red, donors/acceptors - green, hydrophobic aliphatic - orange and aromatic - gray. The surface points that have a similar location in all the molecules are represented as smaller dots with the same coloring.


Reference: Shulman-Peleg A, Shatsky M, Nussinov R. and Wolfson H. J. (2008) Prediction of interacting single-stranded RNA bases by protein binding patterns, Journal of Molecular Biology, Vol 379, pp 299-316.
Email:shulmana@tau.ac.il, ppdock@tau.ac.il