Workflow Entry: Nucleotide_InterProScan

Created at: 06/06/08 @ 21:45:10      Last updated: 26/10/08 @ 21:10:10
Information Version 4 (latest) (of 4)
View version:

Version created on: 26/10/08 @ 21:10:09 by: Hamish McWilliam   |   Revision comments Expand

Title: Nucleotide_InterProScan

Type: Taverna 1


Information Preview

(Click on the image to get the full size)

Nucleotide_interproscan


Information Description

Run InterProScan using a nucleotide sequence as input.

The InterProScan tool (http://www.ebi.ac.uk/Tools/InterProScan/) searches a protein sequence against a selection of protein domain, feature and family signature databases, and integrates the results giving potential assignments to InterPro entries and Gene Ontology terms. Since InterProScan is a protein search tool to use it with a nucleotide sequence, the sequence must be translated into a protein sequence. There are a number of ways of doing this, depending on the properties of the nucleotide sequence, in this case a simple open reading frame (ORF) model is used to obtain the candidate translations. These translations are filtered for length (>80aa) and a search against UniProtKB (http://www.uniprot.org/) is performed to ensure that only sequences which have some relationship with known protein space, on which the signatures used are based, are passed to InterProScan. Once the set of translations has been filtered the remaining sequences as passed on to InterProScan for analysis.

Note: the coordinates in the InterProScan output are in protein coordinates relative to the input translated sequence, to map these on to the input nucleotide sequence see the fasta header of the corresponding translated ORF where the nucleotide coordinates are shown.

This implementation uses:

1. EBI’s WSDbfetch web service (http://www.ebi.ac.uk/Tools/webservices/services/dbfetch) to retreive enties specified by database identifer.

2. EMBOSS seqret tool (http://emboss.sourceforge.net/apps/release/5.0/emboss/apps/getorf.html) via Soaplab (http://www.ebi.ac.uk/Tools/webservices/soaplab/overview) to ensure input sequences are in an appropriate format (i.e. fasta format).

3. EMBOSS getorf tool (http://emboss.sourceforge.net/apps/release/5.0/emboss/apps/getorf.html) via Soaplab (http://www.ebi.ac.uk/Tools/webservices/soaplab/overview) to find the ORFs, perform the translation and filter the translations for length.

4. EBI’s WSNCBIBlast web service (http://www.ebi.ac.uk/Tools/webservices/services/ncbiblast) to perform the filtering BLAST search against UniProtKB.

5. EBI’s WSInterProScan web service (http://www.ebi.ac.uk/Tools/webservices/services/interproscan) to access InterProScan for the final search.

and is based on the proceedure described for nucleotide InterProScan searches described on the WSInterProScan web pages (see http://www.ebi.ac.uk/Tools/webservices/services/interproscan).


Information Download


Information Run

Run this Workflow in the Taverna Workbench...

Option 1:

Note: you need to have both the WHIP Launcher and the Taverna myExperiment/WHIP plugin installed on your machine for this to work. See here for information.

Option 2:

Copy and paste this link into File > 'Open workflow location...'
http://www.myexperiment.org/workflows/229/download?version=4
[ More InfoExpand ]


Information Workflow Components

Inputs (2)
Processors (3)
Outputs (7)
Links (12)
Coordinations (0)
Taverna 1 workflow

Information Original Uploader

Information License

All versions of this Workflow are licensed under the Creative Commons Attribution 3.0 License.

Information Tags (11)

Log in to add Tags

Information Shared with Groups (0)

None

Information Featured In Packs (1)

  • Private pack

Log in to add to one of your Packs

Information Ratings (0)

Current:

0.0 / 5

(0 ratings)

Log in to rate and see breakdown of ratings

Information Attributed By (0)

(Workflows/Files)

None

Information Favourited By (0)

No one

 

Citations (0)

None


Version History

Earliest Version:
[1] - Nucleotide_InterProScan

Created on: Friday 06 June 2008 @ 21:45:09 (BST)

Created by: Hamish McWilliam

Last edited on: Friday 06 June 2008 @ 21:45:10 (BST)

Last edited by: Hamish McWilliam

Revision comments:

None

Previous Versions:
[2] - Nucleotide_InterProScan

Created on: Friday 06 June 2008 @ 21:45:09 (BST)

Created by: Hamish McWilliam

Last edited on: Friday 06 June 2008 @ 21:47:13 (BST)

Last edited by: Hamish McWilliam

Revision comments:

With the nested workflows...

[3] - Nucleotide_InterProScan

Created on: Friday 06 June 2008 @ 21:45:09 (BST)

Created by: Hamish McWilliam

Last edited on: Saturday 07 June 2008 @ 09:06:50 (BST)

Last edited by: Hamish McWilliam

Revision comments:

Tweak BLAST filtering rules. Add additional metadata descriptions.

Latest Version:
[4] - Nucleotide_InterProScan

Created on: Sunday 26 October 2008 @ 21:10:09 (GMT)

Created by: Hamish McWilliam

Revision comments:

Update for changes to poll output names in the WSInterProScan service. Also add GFF output from InterProScan.

 



Reviews Reviews (0)

No reviews yet

Be the first to review!



Comments Comments (0)

No comments yet

Log in to make a comment


New/Upload

Log in / Register

Username or Email:

Password:

Remember me:

OR

Use OpenID:


(eg: name.myopenid.com)

Need an account?
Click here to register

Forgot Password?

Front Page

Home

About Us

Mailing List

Contact Us

API

Publications

Taverna Workflow Workbench

Google Coop Search


Invite people to myExperiment

myGrid

BioCatalogue

JISC

Microsoft

Powered by:

Rails

Icons:
Silk icon set 1.3