Workflow Entry: tblastx non-redundant alignment

Created at: 30/06/11 @ 15:26:13      Last updated: 04/07/11 @ 16:51:41
Information Version 1 (of 1)

Version created on: 30/06/11 @ 15:26:13 by: Carol Lushbough   |   Revision comments Expand

Last edited on: 04/07/11 @ 16:51:41 by: Carol Lushbough

Title: tblastx non-redundant alignment

Type: BioExtract Server


Information Preview

(Click on the image to get the full size)

Medium


Information Description

This workflow carries out alignments using TCoffee and ClustalW2 for a set of non-redundant proteins where the starting point is a particular genomic coding sequence representing only one member of the gene family in a given species.

 

For the BioExtract Server implementation, the necessary steps for accomplishing this task involve:

1.   Selecting the NCBI tblastx tool and providing the accession number of the known nucleotide sequence record as input.

2.   The output from this tool, a BLAST report along with a set of records representing similar sequences, is parsed using a formatting template to produce an initial extract (a set of matching nucleotide sequences).

3.   The resulting data extract is saved

4.       The resulting data extract is used as input into Vmatch (see http://www.vmatch.de/) to remove duplicate sequences.

5.   The “fetchTranslation” tool is invoked. This tool is defined to use the current nucleotide sequence extract as input (in GenBank format) and returns the protein translations from the GenBank-annotated coding sequence (CDS) regions (in FASTA format).

6.   The ClustalW tool is selected to create the multiple sequence alignment with the input specified as coming from the previously executed tool (i.e., the extracted protein sequences) and to define and draw a dendrogram that represents how the sequences are related.

7.   The TCoffee tool is selected to create the multiple sequence alignment with the input specified as coming from the previously executed tool (i.e., the extracted protein sequences) and to define and draw a dendrogram that represents how the sequences are related.


Information Download



Information Run

Run this Workflow in the BioExtract Server...



Information Workflow Components

Not available

Information Workflow Type

BioExtract Server

Information Original Uploader

Information License

All versions of this Workflow are licensed under:

Information Credits (1)

(People/Groups)

Information Attributions (0)

(Workflows/Files)

None

Information Tags (7)

Log in to add Tags

Information Shared with Groups (0)

None

Information Featured In Packs (0)

None

Log in to add to one of your Packs

Information Ratings (0)

Current:

0.0 / 5

(0 ratings)

Log in to rate and see breakdown of ratings

Information Attributed By (0)

(Workflows/Files)

None

Information Favourited By (0)

No one

 

Citations (0)

None


Version History

Earliest Version:
[1] - tblastx non-redundant alignment

Created on: Thursday 30 June 2011 @ 15:26:13 (GMT)

Created by: Carol Lushbough

Last edited on: Monday 04 July 2011 @ 16:51:41 (GMT)

Last edited by: Carol Lushbough

Revision comments:

None

This Workflow only has one version.



Reviews Reviews (0)

No reviews yet

Be the first to review!



Comments Comments (0)

No comments yet

Log in to make a comment




Workflow Other workflows that use similar services (0)

There are no workflows in myExperiment that use similar services to this Workflow.

What is this?

Linked Data

Non-Information Resource URI: http://www.myexperiment.org/workflows/2200


Alternative Formats

HTML
RDF
XML

New/Upload

Log in / Register

Username or Email:

Password:

Remember me:

OR

Use OpenID:


(eg: name.myopenid.com)

Need an account?
Click here to register

Forgot Password?

Front Page

Home

Invite people to myExperiment

Help pages

About Us

News and Events

Mailing List

Contact Us

Developers

Publications


Taverna Workflow Workbench

myGrid

BioCatalogue

Trident

Google Coop Search

EPSRC

JISC

Microsoft

Powered by:

Rails

Icons:
Silk icon set 1.3