Version 4 (latest)
(of 4)
|
Version created on:
26/10/08 @ 21:10:09
by:
Hamish McWilliam
|
Revision comments
Title: Nucleotide_InterProScan
Type: Taverna 1
Preview
(Click on the image to get the full size)
Description
Run InterProScan using a nucleotide sequence as input.
The InterProScan tool (http://www.ebi.ac.uk/Tools/InterProScan/) searches a protein sequence against a selection of protein domain, feature and family signature databases, and integrates the results giving potential assignments to InterPro entries and Gene Ontology terms. Since InterProScan is a protein search tool to use it with a nucleotide sequence, the sequence must be translated into a protein sequence. There are a number of ways of doing this, depending on the properties of the nucleotide sequence, in this case a simple open reading frame (ORF) model is used to obtain the candidate translations. These translations are filtered for length (>80aa) and a search against UniProtKB (http://www.uniprot.org/) is performed to ensure that only sequences which have some relationship with known protein space, on which the signatures used are based, are passed to InterProScan. Once the set of translations has been filtered the remaining sequences as passed on to InterProScan for analysis.
Note: the coordinates in the InterProScan output are in protein coordinates relative to the input translated sequence, to map these on to the input nucleotide sequence see the fasta header of the corresponding translated ORF where the nucleotide coordinates are shown.
This implementation uses:
1. EBI’s WSDbfetch web service (http://www.ebi.ac.uk/Tools/webservices/services/dbfetch) to retreive enties specified by database identifer.
2. EMBOSS seqret tool (http://emboss.sourceforge.net/apps/release/5.0/emboss/apps/getorf.html) via Soaplab (http://www.ebi.ac.uk/Tools/webservices/soaplab/overview) to ensure input sequences are in an appropriate format (i.e. fasta format).
3. EMBOSS getorf tool (http://emboss.sourceforge.net/apps/release/5.0/emboss/apps/getorf.html) via Soaplab (http://www.ebi.ac.uk/Tools/webservices/soaplab/overview) to find the ORFs, perform the translation and filter the translations for length.
4. EBI’s WSNCBIBlast web service (http://www.ebi.ac.uk/Tools/webservices/services/ncbiblast) to perform the filtering BLAST search against UniProtKB.
5. EBI’s WSInterProScan web service (http://www.ebi.ac.uk/Tools/webservices/services/interproscan) to access InterProScan for the final search.
and is based on the proceedure described for nucleotide InterProScan searches described on the WSInterProScan web pages (see http://www.ebi.ac.uk/Tools/webservices/services/interproscan).
Download
Run
Option 1:
Note: you need to have both the WHIP Launcher and the Taverna myExperiment/WHIP plugin installed on your machine for this to work. See here for information.
Option 2:
Copy and paste this link into File > 'Open workflow location...'
http://www.myexperiment.org/workflows/229/download?version=4
[ More Info
]
Workflow Components
All versions of this Workflow are licensed under the Creative Commons Attribution 3.0 License.
Log in to add Tags
Shared with Groups (0)
None
Current:
0.0 / 5
(0 ratings)
Log in to rate and see breakdown of ratings
Statistics
532 viewings
474 downloads
None
Earliest Version:
[1] - Nucleotide_InterProScan
Previous Versions:
[2] - Nucleotide_InterProScan
Latest Version:
[4] - Nucleotide_InterProScan
Reviews
(0)
Copyright (c) 2007 - 2008 The University of Manchester and University of Southampton
No comments yet
Log in to make a comment