Created: 2007-10-03 18:36:12      Last updated: 2007-11-15 09:02:44

COMPETITION: For friends only: If you find any two topics that return true positives with this workflow I will buy you a bottle of wine (or equivalent). Terms: if we confirm that the protein was indeed never mentioned together with both input topics in one article, we will publish this together. ----

This workflow implements Swanson's prinicple with services from the AIDA toolbox. It tries to find proteins that link two topics, while they never mentioned together with both topics in any one of the top ranking papers related to either topic 1 or topic 2.

It uses the following logic: Discovered Protein Link = (Protein[Topic1 AND NOT Topic2] AND Protein[Topic2 AND NOT Topic1]) AND NOT Protein[Topic1 AND Topic2] where 'Protein[Topic1 OPERATOR Topic2]' represents a protein discovered in abstracts returned from Medline using 'Topic1 OPERATOR Topic2' as query.

Comments: - It may be useful to optimize the queries for the topics by experimenting with a DiscoverProteins subworkflow first. For example 'cancer' surprisingly does not return any proteins, possibly because clinical papers dominate the retrieval results. The query '+cancer -(therapy clinic) +(protein^10.0 proteins^10.0 gene^9 genes^9)' performs much better. It contains the Lucene priority operator '^[priority], where priority=1 is the default. - The nature of the Swansson algorithm makes it much more likely that this workflow returns no results or false positives, than that it returns true positives. - True positives returned by this workflow are true with respect to the results of the information retrieval step and information extraction step. Limits: 1. Information retrieval: limited number of documents returned, uses indexes for searching, searches and returns abstracts only; 2. entity recognition: not guaranteed to recognize all instances of proteins.

Workflow URL:

Information Preview

Information Run

Run this Workflow in the Taverna Workbench...

Option 1:

Copy and paste this link into File > 'Open workflow location...'
[ More InfoExpand ]

Information Workflow Components

Inputs (2)
Processors (7)
Beanshells (4)
Outputs (5)
Links (15)
Coordinations (0)

Information Workflow Type

Taverna 1

Information Uploader

Information License

All versions of this Workflow are licensed under:

Information Version 2 (latest) (of 2)

View version:

Information Credits (2)


Information Attributions (0)



Information Tags (8)

Log in to add Tags

Information Shared with Groups (1)

Information Featured In Packs (0)


Log in to add to one of your Packs

Information Attributed By (0)



Information Favourited By (0)

No one

Information Statistics


Citations (0)


Version History

In chronological order:

Reviews Reviews (0)

No reviews yet

Be the first to review!

Comments Comments (1)

Log in to make a comment

  • Monday 11 March 2013 14:05:05 (UTC)

    Is that bottle of wine still available? :)

Workflow Other workflows that use similar services (0)

There are no workflows in myExperiment that use similar services to this Workflow.