Version 2 (latest)
(of 2)
|
Version created on:
03/10/07 @ 18:36:12
by:
Marco Roos
|
Revision comments
Last edited on: 15/11/07 @ 09:02:44 by: Marco Roos
Title: DiscoverProteinLink
Type: Taverna 1
Preview
(Click on the image to get the full size)
Description
This workflow implements Swanson’s prinicple with services from the AIDA toolbox. It tries to find proteins that link two topics, while they never mentioned together with both topics in any one of the top ranking papers related to either topic 1 or topic 2.
It uses the following logic: Discovered Protein Link = (Protein[Topic1 AND NOT Topic2] AND Protein[Topic2 AND NOT Topic1]) AND NOT Protein[Topic1 AND Topic2] where ‘Protein[Topic1 OPERATOR Topic2]’ represents a protein discovered in abstracts returned from Medline using ‘Topic1 OPERATOR Topic2’ as query.
Comments: - It may be useful to optimize the queries for the topics by experimenting with a DiscoverProteins subworkflow first. For example ‘cancer’ surprisingly does not return any proteins, possibly because clinical papers dominate the retrieval results. The query ‘+cancer -(therapy clinic) +(protein10.0 proteins10.0 gene9 genes9)’ performs much better. It contains the Lucene priority operator ‘^[priority], where priority=1 is the default. - The nature of the Swansson algorithm makes it much more likely that this workflow returns no results or false positives, than that it returns true positives. - True positives returned by this workflow are true with respect to the results of the information retrieval step and information extraction step. Limits: 1. Information retrieval: limited number of documents returned, uses indexes for searching, searches and returns abstracts only; 2. entity recognition: not guaranteed to recognize all instances of proteins.
Download
Run
Option 1:
Note: you need to have both the WHIP Launcher and the Taverna myExperiment/WHIP plugin installed on your machine for this to work. See here for information.
Option 2:
Copy and paste this link into File > 'Open workflow location...'
http://www.myexperiment.org/workflows/31/download?version=2
[ More Info
]
Workflow Components
All versions of this Workflow are licensed under the Creative Commons Attribution-Share Alike 3.0 License.
Log in to add Tags
Shared with Groups (1)
Current:
0.0 / 5
(0 ratings)
Log in to rate and see breakdown of ratings
Statistics
925 viewings
810 downloads
None
Earliest Version:
[1] - DiscoverProteinLink
Latest Version:
[2] - DiscoverProteinLink
Reviews
(0)
Copyright (c) 2007 - 2008 The University of Manchester and University of Southampton
No comments yet
Log in to make a comment