Workflow Entry: SifterWorkflow_Prod_Neu

Created at: 23/11/07 @ 16:42:24      Last updated: 29/04/08 @ 10:37:13
Information Version 2 (latest) (of 2)
View version:

Version created on: 23/11/07 @ 16:42:24 by: Anika Joecker   |   Revision comments Expand

Last edited on: 29/04/08 @ 10:37:12 by: Anika Joecker

Title: SifterWorkflow_Prod_Neu

Type: Taverna 1


Information Preview

(Click on the image to get the full size)

Sifterworkflow_prod_neu_15943_2


Information Description

Phylogenomic workflow with SIFTER.

The workflow first runs an iterative Blast search against the RefSeq database (Pruitt et al. 2007) (only fully sequenced organisms) and filters the results to get putative orthologous and in-paralogous proteins. These sequences are used to build a multiple alignment with MAFFT (Katoh et al. 2005). After filtering out alignment columns with more than 60% gaps, a phylogenetic tree is build. If there are less than 20 proteins in the alignment Phyml (Guindon and Gascuel 2003), a maximum likelihood approach, is used. For more than 20 proteins BioNJ (Gascuel et al. 1997), a neighbour joining method, is applied to speed up the pipeline. FORESTER (Zmasek and Eddy 2001) is called to reconcile the tree with the species tree, thereby annotating duplication and speciation nodes. Finally SIFTER is run to transfer Gene Ontology (The Gene Ontology Consortium 2000) terms inside the phylogenetic tree.

Please use as inputs for organism the latin name with upper case letter in front:

e.g. Medicago truncatula

 

References:

Engelhardt, B.E., Jordan, M.I., Murator, K.E., Brenner, S.E. (2005) Protein Molecular Function Prediction by Bayesian Phylogenomics. PLoS Computational Biology, 1(5), e45.

Gascuel, O. (1997) BIONJ: an improved version of the NJ algorithm based on a simple model of sequence data. Molecular Biology and Evolution, 14(7), 685-695.

Guindon, S., Gascuel, O. (2003). A simple, fast, and accurate algorithm to estimate large phylogenies by maximum likelihood. Systematic Biology, 52(5), 696-704.

Katoh, K., Kuma, K., Toh, H. and Miyata, T. (2005). MAFFT version 5: improvement in accuracy of multiple sequence alignment. Nucleic Acids Res., 33, 511-518.

Pruitt, K.D., Tatusova, T., Maglott, D.R. (2007) NCBI Reference Sequence (RefSeq): a curated non-redundant sequence database of genomes, transcripts and proteins. Nucleic Acids Res., 35, D61-65.

The Gene Ontology Consortium (2000) Gene Ontology: tool for the unification of biology. Nature Genet., 25, 25-29.

Zmasek, C.M., Eddy, S.R. (2001) A simple algorithm to infer gene duplication and speciation events on a gene tree. Bioinformatics, 17, 821–828.


Information Download


Information Run

Run this Workflow in the Taverna Workbench...

Option 1:

Note: you need to have both the WHIP Launcher and the Taverna myExperiment/WHIP plugin installed on your machine for this to work. See here for information.

Option 2:

Copy and paste this link into File > 'Open workflow location...'
http://www.myexperiment.org/workflows/95/download?version=2
[ More InfoExpand ]


Information Workflow Components

Inputs (4)
Processors (10)
Outputs (9)
Links (25)
Coordinations (0)
Taverna 1 workflow

Information Original Uploader

Information License

All versions of this Workflow are licensed under the Creative Commons Attribution-Share Alike 3.0 License.

Information Credits (1)

(People/Groups)

Information Attributions (0)

(Workflows/Files)

None

Information Tags (6)

Log in to add Tags

Information Shared with Groups (2)

Information Featured In Packs (0)

None

Log in to add to one of your Packs

Information Ratings (0)

Current:

0.0 / 5

(0 ratings)

Log in to rate and see breakdown of ratings

Information Attributed By (0)

(Workflows/Files)

None

Information Favourited By (0)

No one

Information Statistics

 

Citations (1)

1. B.E. Engelhardt, M.I. Jordan, K.E. Muratore, and S.E. Brenner, Protein molecular function prediction by Bayesian ph\ylogenomics, 07 October 2005


Version History

Earliest Version:
[1] - SifterWorkflow1

Created on: Friday 23 November 2007 @ 16:42:24 (GMT)

Created by: Anika Joecker

Last edited on: Friday 14 December 2007 @ 13:08:46 (GMT)

Last edited by: Anika Joecker

Revision comments:

None

Latest Version:
[2] - SifterWorkflow_Prod_Neu

Created on: Friday 23 November 2007 @ 16:42:24 (GMT)

Created by: Anika Joecker

Last edited on: Tuesday 29 April 2008 @ 10:37:12 (BST)

Last edited by: Anika Joecker

Revision comments:

Change to the new BioMOBY repository at Calgary



Reviews Reviews (0)

No reviews yet

Be the first to review!



Comments Comments (0)

No comments yet

Log in to make a comment


New/Upload

Log in / Register

Username or Email:

Password:

Remember me:

OR

Use OpenID:


(eg: name.myopenid.com)

Need an account?
Click here to register

Forgot Password?

Front Page

Home

About Us

Mailing List

Contact Us

API

Publications

Taverna Workflow Workbench

Google Coop Search


Invite people to myExperiment

myGrid

BioCatalogue

JISC

Microsoft

Powered by:

Rails

Icons:
Silk icon set 1.3