Workflow Entry: BioAID_DiseaseDiscovery_byHumanUniprot

Created at: 05/12/07 @ 23:11:44      Last updated: 15/05/08 @ 21:10:32
Information Version 3 (latest) (of 3)
View version:

Version created on: 05/12/07 @ 23:11:42 by: Marco Roos   |   Revision comments Expand

Last edited on: 15/05/08 @ 21:10:32 by: Marco Roos

Title: BioAID_DiseaseDiscovery_byHumanUniprot

Type: Taverna 1


Information Preview

(Click on the image to get the full size)

Bioaid_diseasediscovery_byhumanuniprot_22938_3


Information Description

This workflow finds disease relevant to the query string via the following steps: 1. A user query: a list of terms or boolean query – look at the Apache Lucene project for all details. E.g.: (EZH2 OR “Enhancer of Zeste” +(mutation chromatin) -clinical); consider adding ‘ProteinSynonymsToQuery’ in front of the input if your query is a protein. 2. Retrieve documents: finds ‘maximumNumberOfHits’ relevant documents (abstract+title) based on query (the AIDA service inside is based on Apache’s Lucene) 3. Discover proteins: extract proteins discovered in the set of relevant abstracts with a ‘named entity recognizer’ trained on genomic terms using a Bayesian approach; the AIDA service inside is based on LingPipe. This subworkflow also ‘filters’ false positives from the discovered protein by requiring a discovery has a valid UniProt ID. Martijn Schuemie’s service to do that contains only human UniProt IDs, which is why this workflow only works for human proteins. 4. Link proteins to disease contained in the OMIM disease database (with a service from Japan that interrogates OMIM)

Workflow by Marco Roos (AID = Adaptive Information Disclosure, University of Amsterdam; http://adaptivedisclosure.org)

Text mining services by Sophia Katrenko and Edgar Meij (AID), and Martijn Schuemie (BioSemantics, Erasmus University Rotterdam). OMIM service from the Center for Information Biology and DNA Data Bank of Japan, National Institute of Genetics, director Hideaki Sugawara (see http://xml.nig.ac.jp)

Changes to our original BioAID_DiseaseDiscovery workflow: * Use of Martijn Schuemie’s synsets service to * provide uniprot ids to discovered proteins * filter false positive discoveries, only proteins with a uniprot id go through; this introduces some false negatives (e.g. discovered proteins with a name shorter than 3 characters) * solve a major issue with the original workflow where some false positives could contribute disproportionately to the number of discovered diseases * Counting of results in various ways.


Information Download


Information Run

Run this Workflow in the Taverna Workbench...

Option 1:

Note: you need to have both the WHIP Launcher and the Taverna myExperiment/WHIP plugin installed on your machine for this to work. See here for information.

Option 2:

Copy and paste this link into File > 'Open workflow location...'
http://www.myexperiment.org/workflows/110/download?version=3
[ More InfoExpand ]


Information Workflow Components

Inputs (2)
Processors (9)
Outputs (8)
Links (18)
Coordinations (0)
Taverna 1 workflow

Information Original Uploader

Information License

All versions of this Workflow are licensed under the Creative Commons Attribution-Share Alike 3.0 License.

Information Credits (3)

(People/Groups)

Information Attributions (0)

(Workflows/Files)

None

Information Tags (9)

Log in to add Tags

Information Shared with Groups (1)

Information Featured In Packs (0)

None

Log in to add to one of your Packs

Information Ratings (0)

Current:

0.0 / 5

(0 ratings)

Log in to rate and see breakdown of ratings

Information Attributed By (3)

(Workflows/Files)

Information Favourited By (0)

No one

 

Citations (0)

None


Version History

Earliest Version:
[1] - BioAID_DiseaseDiscovery_byHumanUniprot

Created on: Wednesday 05 December 2007 @ 23:11:42 (GMT)

Created by: Marco Roos

Last edited on: Monday 10 December 2007 @ 23:13:06 (GMT)

Last edited by: Marco Roos

Revision comments:

None

Previous Versions:
[2] - BioAID_DiseaseDiscovery_byHumanUniprot

Created on: Wednesday 05 December 2007 @ 23:11:42 (GMT)

Created by: Marco Roos

Last edited on: Monday 04 February 2008 @ 10:26:41 (GMT)

Last edited by: Marco Roos

Revision comments:

Update with more uniprots

Latest Version:
[3] - BioAID_DiseaseDiscovery_byHumanUniprot

Created on: Wednesday 05 December 2007 @ 23:11:42 (GMT)

Created by: Marco Roos

Last edited on: Thursday 15 May 2008 @ 21:10:32 (BST)

Last edited by: Marco Roos

Revision comments:

Temporarily switched SearcherWS service to development server due to problems with the medline index on the production server.



Reviews Reviews (0)

No reviews yet

Be the first to review!



Comments Comments (0)

No comments yet

Log in to make a comment


New/Upload

Log in / Register

Username or Email:

Password:

Remember me:

OR

Use OpenID:


(eg: name.myopenid.com)

Need an account?
Click here to register

Forgot Password?

Front Page

Home

About Us

Mailing List

Contact Us

API

Publications

Taverna Workflow Workbench

Google Coop Search


Invite people to myExperiment

myGrid

BioCatalogue

JISC

Microsoft

Powered by:

Rails

Icons:
Silk icon set 1.3