Lucene_bioquery_optimizer_MR1

Created: 2007-12-10 22:14:26

This workflow does four things:

  1. it retrieves documents relevant for the query string
  2. it discovers entities in those documents, these are considered relevant entities
  3. it filters proteins from those entities (on the tag protein_molecule)
  4. it removes all terms from the list produced by 3 (query terms temporarily considered proteins)

ToDo

  • Replace step 4 by the following procedure:

1. remove the query terms from the output of NER (probably by a regexp matching on what is inside the tag, possibly case-insensitive) 2. remove tag_as_protein_molecule (obsolete)

  • Add synonym service/workflow

Note that Remove_inputquery has an alternative iteration strategy (dot product instead of cross product). Idem for 'Join' in 'SplitQuery'.

Information Preview

Information Run

Run this Workflow in the Taverna Workbench...

Option 1:

Copy and paste this link into File > 'Open workflow location...'
http://www.myexperiment.org/workflows/116/download?version=1
[ More InfoExpand ]


Information Workflow Components

Inputs (1)
Processors (2)
Beanshells (1)
Outputs (1)
Links (3)
Coordinations (0)

Information Workflow Type

Taverna 1

Information Uploader

Information License

All versions of this Workflow are licensed under:

Information Version 1 (of 1)

Information Credits (2)

(People/Groups)

Information Attributions (0)

(Workflows/Files)

None

Information Tags (4)

Log in to add Tags

Information Shared with Groups (0)

None

Information Featured In Packs (0)

None

Log in to add to one of your Packs

Information Attributed By (0)

(Workflows/Files)

None

Information Favourited By (0)

No one

Information Statistics

 

Citations (0)

None


Version History

In chronological order:



Reviews Reviews (0)

No reviews yet

Be the first to review!



Comments Comments (0)

No comments yet

Log in to make a comment




Workflow Other workflows that use similar services (0)

There are no workflows in myExperiment that use similar services to this Workflow.