Help

Introduction
Input
Prediction and Analysis
Output
Search

1. Introduction top

Regulatory proteins (RPs) are involved in the control of diverse cellular systems, allowing adaptive responses to changes in environmental conditions. They are found in prokaryotes where they control diverse aspects of prokaryotic metabolism, such as cell differentiation, morphogenesis, central metabolism, motility, biofilm formation and virulence. To understand RP functions, it is critical to characterise and analyse their modular complexity. P2RP, a freely accessible web server, has been developed for computational analysis of the modular RPs of prokaryotic genomes and metagenomes. It predicts transcription factors (TFs) and two-component systems (TCSs).

2. Input top

The program can perform several types of analyses depending on the type of request. P2RP requires inputs in Fasta format or gbk file (GenBank) in the case of genome re-annotation. Examples of nucleotide and amino acid sequences are provided to help for testing.

Protein sequence(s)
The user uploads one or more protein sequences and gets a detailed analysis of the submitted sequences.

DNA sequence(s)
Here the user has the opportunity to upload one nucleotide sequence such as a new genome DNA sequence, or more nucleotide sequences such as a set of metagenome contigs. In these cases, P2RP uses first MED-Start, a prokaryotic gene finding algorithm (Zhu et al., 2007), to define a set of potential genes. Thereafter the sequences are translated to constitute the proteome used for the RPs prediction.

Genome re-annotation
A GenBank available genome can be re-analysed to predict RPs. The user uploades a GBK format file and gets a complete analysis of the proteome.

3. Prediction and Analysis top

The identification of RP candidates is accomplished by domain analysis of each predicted protein. PHP scripts were developed to search the numerous combinations of RP modules and to categorise RP proteins into families based on similarity and/or domain architecture. Finally, the cellular localisation of each RP protein is determined by the presence or absence of transmembrane (TM) segments, using the HMMTOP predictor (Krogh et al., 2001).
The P2RP homepage contains a navigation bar that allows users to execute a new RP analysis, to search for a previous job, to access the help page or to contact the authors.

The - RP Analysis - menu, allows users to submit one or more sequences for RPs prediction and analysis. The web server takes as input:

Nucleotide sequence(s) in FASTA format

Amino acid sequence(s) in FASTA format

Genome in GBK format (GenBank), re-annotation process

The graphical user interface is provided with a progress bar, used to convey the progress of each task (source data analysis, rpsBlast, RP analysis, RP data refinement). The execution of the different processes is defined by a colour progressive changing using three colours.

4. Output top

Once processing is complete, the results are displayed as a web interface page, and data can be outputted for download in a variety of user-specified formats.

An example of an Excel file output.

The results page shows global counts of the different categories of RPs and detailed class counts of each category. Each class result provides a detailed gene list, via a popup window, when the mouse is passed over active text.

Selecting an object from the identifiers list in the popup window displays a detailed protein description page including 2D structure, domain architecture and aminoacid composition and usage. Blast searches can be performed for the sequence, using external links to numerous public databases.

Click on the plus sign (+) to display the corresponding information.

5. Search top

To keep user session private, every user query is given an ID which allows later retrieval of results, using the - Search - menu. So, please remember to save your analysis ID for later use. The data are stored on the server for one month.