Version 2 (latest) (of 2)
Version created on: 03/10/07 @ 18:36:12 by: Marco Roos
Revision comments
Last edited on: 15/11/07 @ 09:02:44 by: Marco Roos
Title: DiscoverProteinLink
Type: Taverna 1
Description
COMPETITION (for friends only): If you find any two topics that return true positives with this workflow, I will buy you a bottle of wine (or equivalent). Terms: if we confirm that the protein was indeed never mentioned together with both input topics in one article, we will publish this together.
This workflow implements Swanson's principle with services from the AIDA toolbox. It tries to find proteins that link two topics but are never mentioned together with both topics in any one of the top-ranking papers related to either topic 1 or topic 2.
It uses the following logic:
Discovered Protein Link = (Protein[Topic1 AND NOT Topic2] AND Protein[Topic2 AND NOT Topic1]) AND NOT Protein[Topic1 AND Topic2]
where 'Protein[Topic1 OPERATOR Topic2]' represents a protein discovered in abstracts returned from Medline using 'Topic1 OPERATOR Topic2' as the query.
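The logic above amounts to a set intersection followed by a set difference. A minimal sketch, assuming the three protein lists have already been extracted from the Medline results; the function name and the sample protein labels are hypothetical and are not part of the workflow or the AIDA services:

```python
def discover_protein_links(topic1_not_topic2, topic2_not_topic1, topic1_and_topic2):
    """Return proteins found for each topic separately but never for both together.

    Each argument is an iterable of protein names extracted from the abstracts
    returned by the corresponding query (hypothetical inputs).
    """
    # Proteins that appear in both single-topic result sets.
    shared = set(topic1_not_topic2) & set(topic2_not_topic1)
    # Drop proteins already mentioned in abstracts that cover both topics.
    return shared - set(topic1_and_topic2)


# Made-up protein labels, for illustration only.
links = discover_protein_links(
    {"P1", "P2", "P3"},  # Protein[Topic1 AND NOT Topic2]
    {"P2", "P3", "P4"},  # Protein[Topic2 AND NOT Topic1]
    {"P3"},              # Protein[Topic1 AND Topic2]
)
print(sorted(links))  # ['P2']
```

Here P3 is dropped because it already co-occurs with both topics, so only P2 remains as a candidate link.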
Comments:
- It may be useful to optimize the queries for the topics by experimenting with a DiscoverProteins subworkflow first. For example, 'cancer' surprisingly does not return any proteins, possibly because clinical papers dominate the retrieval results. The query '+cancer -(therapy clinic) +(protein^10.0 proteins^10.0 gene^9 genes^9)' performs much better. It contains the Lucene boost (priority) operator '^[priority]', where priority=1 is the default.
- The nature of the Swanson algorithm makes it much more likely that this workflow returns no results or false positives than that it returns true positives.
- True positives returned by this workflow are true with respect to the results of the information retrieval step and the information extraction step.
Limits:
1. Information retrieval: limited number of documents returned, uses indexes for searching, searches and returns abstracts only.
2. Entity recognition: not guaranteed to recognize all instances of proteins.
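The optimized query quoted in the comments can be assembled from its parts. A minimal sketch, assuming plain string building is enough; the helper name and its parameters are hypothetical, and only the resulting query string comes from the description above:

```python
def build_topic_query(required, excluded, boosted):
    """Build a Lucene-style query: one required term, excluded terms, boosted terms.

    `boosted` maps each term to its boost factor (1 is the Lucene default).
    """
    boosts = " ".join(f"{term}^{weight}" for term, weight in boosted.items())
    return f"+{required} -({' '.join(excluded)}) +({boosts})"


query = build_topic_query(
    "cancer",
    ["therapy", "clinic"],
    {"protein": 10.0, "proteins": 10.0, "gene": 9, "genes": 9},
)
print(query)
# +cancer -(therapy clinic) +(protein^10.0 proteins^10.0 gene^9 genes^9)
```

Keeping the boosts in a dictionary makes it easy to try different weights when experimenting with the DiscoverProteins subworkflow.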
Workflow URL: http://rdf.adaptivedisclosure.org/~marco/BioAID/Public/Workflows/BioAID/SwansonProteins.xml
Download
Run
Option 1:
Note: you need to have both the WHIP Launcher and the Taverna myExperiment/WHIP plugin installed on your machine for this to work.
Option 2:
Copy and paste this link into File > 'Open workflow location...'
http://www.myexperiment.org/workflows/31/download?version=2
Linked Data
Non-Information Resource URI: http://www.myexperiment.org/workflows/31
Copyright © 2007 - 2011 The University of Manchester and University of Southampton