Workflow Entry: BioAID_ProteinDiscovery_filterOnHumanUniprot_perDoc_html

Created at: 28/05/09 @ 12:21:05
Information Version 11 (latest) (of 11)
View version:

Version created on: 28/05/09 @ 12:21:05 by: Marco Roos   |   Revision comments Expand

Title: BioAID_ProteinDiscovery_filterOnHumanUniprot_perDoc_html

Type: Taverna 1


Information Preview

(Click on the image to get the full size)

Medium


Information Description

This workflow finds proteins relevant to the query string via the following steps:

  1. A user query: a single gene/protein name. E.g.: (EZH2 OR "Enhancer of Zeste").
  2. Retrieve documents: finds 'maximumNumberOfHits' relevant documents (abstract+title) based on query (the AIDA service inside is based on Apache's Lucene)
  3. Discover proteins: extract proteins discovered in the set of relevant abstracts with a 'named entity recognizer' trained on genomic terms using a Bayesian approach; the AIDA service inside is based on LingPipe. This subworkflow also 'filters' false positives from the discovered protein by requiring a discovery has a valid UniProt ID. Martijn Schuemie's service to do that contains only human UniProt IDs, which is why this workflow only works for human proteins.

Workflow by Marco Roos (AID = Adaptive Information Disclosure, University of Amsterdam; http://adaptivedisclosure.org)

Text mining services by Sophia Katrenko and Edgar Meij (AID), and Martijn Schuemie (BioSemantics, Erasmus University Rotterdam).

Changes to our original BioAID_DiseaseDiscovery workflow:

* Stops at protein discovery * Use of Martijn Schuemie's synsets service to * add synonyms to the query. * provide uniprot ids to discovered proteins * filter false positive discoveries, only proteins with a uniprot id go through; this introduces some false negatives (e.g. discovered proteins with a name shorter than 3 characters) * Counting of results in various ways, but no outputs defined in this simplified workflow. * Output into simple html table.


Information Download



Information Run

Run this Workflow in the Taverna Workbench...

Option 1:

Note: you need to have both the WHIP Launcher and the Taverna myExperiment/WHIP plugin installed on your machine for this to work. See here for information.

Option 2:

Copy and paste this link into File > 'Open workflow location...'
http://www.myexperiment.org/workflows/154/download?version=11
[ More InfoExpand ]


Information Workflow Components

Inputs (1)
Processors (22)
Beanshells (11)
Outputs (3)
Links (29)
Coordinations (4)

Information Workflow Type

Taverna 1

Information Original Uploader

Information License

All versions of this Workflow are licensed under:

Information Credits (4)

(People/Groups)

Information Attributions (1)

(Workflows/Files)

Information Tags (9)

Log in to add Tags

Information Shared with Groups (2)

Information Featured In Packs (1)

Log in to add to one of your Packs

Information Ratings (0)

Current:

0.0 / 5

(0 ratings)

Log in to rate and see breakdown of ratings

Information Attributed By (0)

(Workflows/Files)

None

Information Favourited By (0)

No one

 

Citations (0)

None


Version History

Earliest Version:
[1] - BioAID_ProteinDiscovery_filterOnHumanUniprot_perDoc_html

Created on: Friday 29 February 2008 @ 01:34:46 (GMT)

Created by: Marco Roos

Last edited on: Friday 29 February 2008 @ 01:34:47 (GMT)

Last edited by: Marco Roos

Revision comments:

None

Previous Versions:
[2] - BioAID_ProteinDiscovery_filterOnHumanUniprot_perDoc_html

Created on: Friday 29 February 2008 @ 01:34:46 (GMT)

Created by: Marco Roos

Last edited on: Wednesday 05 March 2008 @ 08:12:04 (GMT)

Last edited by: Marco Roos

Revision comments:

Demo

[3] - BioAID_ProteinDiscovery_filterOnHumanUniprot_perDoc_html

Created on: Friday 29 February 2008 @ 01:34:46 (GMT)

Created by: Marco Roos

Last edited on: Thursday 15 May 2008 @ 11:41:50 (GMT)

Last edited by: Marco Roos

Revision comments:

Balanced list levels for I/O of all beanshells.
Temporarily switched to development service for document search service due to problems with index files.

[4] - BioAID_ProteinDiscovery_filterOnHumanUniprot_perDoc_html

Created on: Friday 29 February 2008 @ 01:34:46 (GMT)

Created by: Marco Roos

Last edited on: Thursday 15 May 2008 @ 17:37:31 (GMT)

Last edited by: Marco Roos

Revision comments:

Added new simple web service that provides the html document on a publicly accessible URL.

[5] - BioAID_ProteinDiscovery_filterOnHumanUniprot_perDoc_html

Created on: Friday 29 February 2008 @ 01:34:46 (GMT)

Created by: Marco Roos

Last edited on: Thursday 15 May 2008 @ 22:26:46 (GMT)

Last edited by: Marco Roos

Revision comments:

Added initial 'results pending' html doc.

[6] - BioAID_ProteinDiscovery_filterOnHumanUniprot_perDoc_html

Created on: Friday 29 February 2008 @ 01:34:46 (GMT)

Created by: Marco Roos

Last edited on: Thursday 15 May 2008 @ 23:22:52 (GMT)

Last edited by: Marco Roos

Revision comments:

updated mime type of url output

[7] - BioAID_ProteinDiscovery_filterOnHumanUniprot_perDoc_html

Created on: Friday 29 February 2008 @ 01:34:46 (GMT)

Created by: Marco Roos

Last edited on: Monday 28 July 2008 @ 20:48:45 (GMT)

Last edited by: Marco Roos

Revision comments:

Repaired this workflow. Creating the html is done by a beanshell again.

[8] - BioAID_ProteinDiscovery_filterOnHumanUniprot_perDoc_html

Created on: Friday 29 February 2008 @ 01:34:46 (GMT)

Created by: Marco Roos

Last edited on: Wednesday 29 October 2008 @ 09:29:36 (GMT)

Last edited by: Marco Roos

Revision comments:

Repaired this workflow. Creating the html is done by a beanshell again.

[9] - BioAID_ProteinDiscovery_filterOnHumanUniprot_perDoc_html

Created on: Sunday 14 December 2008 @ 21:42:40 (GMT)

Created by: Marco Roos

Last edited on: Sunday 14 December 2008 @ 21:44:19 (GMT)

Last edited by: Marco Roos

Revision comments:

 Workflow running from production servers

[10] - BioAID_ProteinDiscovery_filterOnHumanUniprot_perDoc_html

Created on: Thursday 26 March 2009 @ 20:18:55 (GMT)

Created by: Marco Roos

Last edited on: Thursday 26 March 2009 @ 20:22:03 (GMT)

Last edited by: Marco Roos

Revision comments:

Minor changes to compensate for the changes caused by a migration to a new server. In some cases the changes are temporary until everything is migrated. The functionality of the workflow did not change.

Latest Version:
[11] - BioAID_ProteinDiscovery_filterOnHumanUniprot_perDoc_html

Created on: Thursday 28 May 2009 @ 12:21:05 (GMT)

Created by: Marco Roos

Revision comments:

synsets service moved



Reviews Reviews (0)

No reviews yet

Be the first to review!



Comments Comments (1)

Log in to make a comment

  • Saturday 01 November 2008 @ 13:52:29 (GMT)

    I am not sure I understand what this workflow does.

    Can you please add some use case/example of how to use it?
    What do you mean exactly with 'proteins relevant to the query string'? Proteins that interact with the query gene? Or that are involved in the same metabolism?

    With which data have you tested this workflow? Which queries have you tried?




Workflow Other workflows that use similar services (9)

Only the first 2 workflows that use similar services are shown. View all workflows that use these services.


Original Uploader

Workflow BioAID_DiseaseDiscovery_RatHumanMouseUniprotFilter (v4)

Created: 15/12/08 @ 20:46:09 | Last updated: 11/08/11 @ 09:22:23

Credits: User Marco Roos Network-member AID

License: Creative Commons Attribution-Share Alike 3.0 Unported License

Thumb

This workflow finds disease relevant to the query string via the following steps: 1. A user query: a list of terms or boolean query - look at the Apache Lucene project for all details. E.g.: (EZH2 OR "Enhancer of Zeste" +(mutation chromatin) -clinical); consider adding 'ProteinSynonymsToQuery' in front of the input if your query is a protein. 2. Retrieve documents: finds 'maximumNumberOfHits' relevant documents (abstract+title) based on query (the AIDA service inside is based on Apa...

Rating: 4.0 / 5 (2 ratings) | Versions: 4 | Reviews: 0 | Comments: 3 | Citations: 0

Viewed: 3861 times | Downloaded: 576 times

Tags (9):

Show View Download Download (v4)

Original Uploader

Workflow BioAID_ProteinToDiseases (v1)

Created: 14/11/07 @ 12:47:57 | Last updated: 15/11/07 @ 09:00:44

Credits: User Marco Roos User Martijn Schuemie Network-member AID

Attributions: Workflow BioAID_DiseaseDiscovery_RatHumanMouseUniprotFilter

License: Creative Commons Attribution-Share Alike 3.0 Unported License

Thumb

This workflow was based on BioAID_DiseaseDiscovery, changes: expects only one protein name, adds protein synonyms). This workflow finds diseases relevant to the query string via the following steps: A user query: a single protein name Add synonyms (service courtesy of Martijn Scheumie, Erasmus University Rotterdam) Retrieve documents: finds relevant documents (abstract+title) based on query Discover proteins: extract proteins discovered in the set of relevant abstracts 5. Link proteins ...

Rating: 0.0 / 5 (0 ratings) | Versions: 1 | Reviews: 0 | Comments: 0 | Citations: 0

Viewed: 175 times | Downloaded: 83 times

Tags (8):

Show View Download Download (v1)

What is this?

Linked Data

Non-Information Resource URI: http://www.myexperiment.org/workflows/154


Alternative Formats

HTML
RDF
XML

New/Upload

Log in / Register

Username or Email:

Password:

Remember me:

OR

Use OpenID:


(eg: name.myopenid.com)

Need an account?
Click here to register

Forgot Password?

Front Page

Home

Invite people to myExperiment

Help pages

About Us

News and Events

Mailing List

Contact Us

Developers

Publications


Taverna Workflow Workbench

myGrid

BioCatalogue

Trident

Google Coop Search

EPSRC

JISC

Microsoft

Powered by:

Rails

Icons:
Silk icon set 1.3