- Single sequence, one loop
- Multiple sequences, one loop
- Multiple sequences, one loop (fasta)
- Single sequence, many loops, with secondary structure
- Single sequence, many loops, with secondary structure (fasta)
- Multiple sequences, many loops, with secondary structure
- Multiple sequences, many loops, with secondary structure (fasta)
JAR3D scores RNA hairpin and internal loop sequences against motif groups from the RNA 3D Motif Atlas, by exact sequence match for sequences already observed in 3D and by probabilistic scoring and edit distance for novel sequences. RNA hairpin and internal loops are often represented on secondary structure diagrams as if they are unstructured, but in fact most are structured by non-Watson-Crick basepairs, base stacking, and base-backbone interactions. Analysis of 3D structures shows that different RNA sequences can form the same RNA 3D motif, as is apparent in many motif groups in the RNA 3D Motif Atlas. JAR3D scores sequences to motif groups based on the ability of the sequences to form the same pattern of interactions observed in 3D structures of the motif. Because the RNA 3D Motif Atlas incorporates new RNA 3D structures every four weeks, the performance of JAR3D will improve over time.
Functions of recurrent 3D motifs include:
- Architectural roles introducing bends in helices (e.g. kink-turns) or changing helical twist (e.g. C-loops)
- Anchoring RNA tertiary interactions (e.g., GNRA loops and loop-receptors)
- Providing sites for proteins or small molecules to bind.
Inferring the 3D structures of hairpin and internal loops is a step on the way toward correctly predicting full RNA 3D structures starting from sequence.
Input and Output
JAR3D accepts single or multiple sequences having one or many loops. See the Examples above. One loop: To specify the break between strands in internal loops, use an asterisk *. Sequence(s) without an asterisk are interpreted as hairpins. Internal and hairpin loops should include closing Watson-Crick basepairs, with nucleotides running in 5' to 3' order within each strand. Individual loops do not need the nucleotides to be aligned. Many loops: JAR3D will extract internal and hairpin loops from longer sequences if a dot-bracket secondary structure is provided as the first line of the input. Multiple sequences need to be aligned to one another.
The output shows the best-scoring motif groups from the RNA 3D Motif Atlas using a variety of metrics. The user can view a representative instance from each motif group and explore the group further at the RNA 3D Motif Atlas page for the motif group. The user can also see how their input input sequences align to known 3D instances of a motif.
The tutorial can be found at this link.
- We extract all hairpin and internal loops from a non-redundant set of RNA 3D structures from the PDB/NDB and cluster them in geometrically similar families.
- For each recurrent motif, we construct a probabilistic model for sequence variability based on a hybrid Stochastic Context-Free Grammar/Markov Random Field (SCFG/MRF) method we developed.
- To parameterize each model, we use all instances of the motif found in the non-redundant dataset and knowledge of RNA nucleotide interactions, especially isosteric basepairs and their substitution patterns.
- For each motif group, we form an acceptance region that is consistent with the geometry and basepairing of that group. If the score is in the cutoff region, we infer that the new sequence can form the same 3D structure.
- For more infromation you can read:
Identifying novel sequence variants of RNA 3D motifs, by Craig L. Zirbel, James Roll, Blake A. Sweeney, Anton I. Petrov, Meg Pirrung, and Neocles Leontis. Nucl. Acids Res. (2015) doi: 10.1093/nar/gkv651 link