Workflow Entry: Discover_entities

Created at: 10/12/07 @ 21:48:33      Last updated: 10/12/07 @ 22:54:42
Version 2 (latest) (of 2)

Version created on: 10/12/07 @ 21:48:33 by: Marco Roos

Last edited on: 10/12/07 @ 22:54:42 by: Marco Roos

Title: Discover_entities

Type: Taverna 1



Description

This workflow wraps the 'Named Entity Recognize' web service from the AIDA toolbox, created by Sophia Katrenko. It discovers entities of a given type (determined by 'learned_model') in documents supplied in Lucene output format.

Known issues:

The output of NErecognize contains concepts with '/' characters, which break the XML. When post-processing the results, string manipulation is therefore more reliable than XML manipulation. The output is produced per document, so an entity that occurs in more than one document appears more than once.
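
The two issues above can be handled together in post-processing. The following is a minimal Python sketch, not part of the workflow itself. It assumes, purely for illustration, that each entity sits between <entity> and </entity> markers; that tag name, the function names, and the sample strings are all hypothetical, and the pattern has to be adapted to the actual NErecognize output. It uses plain string matching instead of an XML parser and collapses entities repeated across documents.

```python
import re


def extract_entities(raw_output, tag="entity"):
    """Pull entity strings out of raw output without using an XML parser.

    ASSUMPTION: entities sit between <entity> and </entity>. The tag name is
    hypothetical; adapt it to the real NErecognize output.
    """
    pattern = re.compile(
        r"<{0}\b[^>]*>(.*?)</{0}>".format(re.escape(tag)), re.DOTALL
    )
    return [match.strip() for match in pattern.findall(raw_output)]


def unique_entities(per_document_outputs, tag="entity"):
    """Merge per-document results, keeping the first occurrence of each entity."""
    seen = set()
    ordered = []
    for raw_output in per_document_outputs:
        for entity in extract_entities(raw_output, tag):
            if entity not in seen:
                seen.add(entity)
                ordered.append(entity)
    return ordered


# Invented sample data: the stray <a/b> fragment would stop a strict XML
# parser, but plain pattern matching simply skips over it.
SAMPLE_OUTPUTS = [
    "<doc><a/b><entity>EZH2</entity><entity>HDAC1/2</entity></doc>",
    "<doc><entity>EZH2</entity><entity>PRC2</entity></doc>",
]

if __name__ == "__main__":
    print(unique_entities(SAMPLE_OUTPUTS))
```

Keeping the first-seen order makes the merged list easy to compare against the per-document output; a plain set would also work if order does not matter.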


Run

Run this Workflow in the Taverna Workbench...

Option 1:

Note: both the WHIP Launcher and the Taverna myExperiment/WHIP plugin must be installed on your machine for this option to work.

Option 2:

Copy and paste this link into File > 'Open workflow location...'
http://www.myexperiment.org/workflows/111/download?version=2


Workflow Components

Inputs (2)
Processors (3)
Beanshells (0)
Outputs (1)
Links (5)
Coordinations (0)

Original Uploader

License

All versions of this Workflow are licensed under:

Credits (3)

(People/Groups)

Attributions (0)

(Workflows/Files)

None

Tags (4)

Shared with Groups (0)

None

Featured In Packs (0)

None

Ratings (1)

Current: 5.0 / 5 (1 rating)

Attributed By (0)

(Workflows/Files)

None

Favourited By (0)

No one

 

Citations (0)

None


Version History

Earliest Version:
[1] - Discover_entities

Created on: Monday 10 December 2007 @ 21:48:33 (GMT)

Created by: Marco Roos

Last edited on: Monday 10 December 2007 @ 22:07:30 (GMT)

Last edited by: Marco Roos

Revision comments:

None

Latest Version:
[2] - Discover_entities

Created on: Monday 10 December 2007 @ 21:48:33 (GMT)

Created by: Marco Roos

Last edited on: Monday 10 December 2007 @ 22:54:42 (GMT)

Last edited by: Marco Roos

Revision comments:

Added example input



Reviews (0)

No reviews yet

Comments (0)

No comments yet




Other workflows that use similar services (5)

Only the first 2 workflows that use similar services are shown.


BioAID_ProteinDiscovery_filterOnHumanUniprot_perDoc_html (v11)

Created: 28/05/09 @ 12:21:05

Credits: Marco Roos (user), Martijn Schuemie (user), AID (network member), AID_myGrid_collaboration (network member)

Attributions: BioAID_DiseaseDiscovery_RatHumanMouseUniprotFilter (workflow)

License: Creative Commons Attribution-Share Alike 3.0 Unported License

This workflow finds proteins relevant to the query string via the following steps: A user query: a single gene/protein name. E.g.: (EZH2 OR "Enhancer of Zeste"). Retrieve documents: finds 'maximumNumberOfHits' relevant documents (abstract+title) based on query (the AIDA service inside is based on Apache's Lucene). Discover proteins: extract proteins discovered in the set of relevant abstracts with a 'named entity recognizer' trained on genomic terms using a Bayesian approach; the AIDA serv...

Rating: 0.0 / 5 (0 ratings) | Versions: 11 | Reviews: 0 | Comments: 1 | Citations: 0

Viewed: 454 times | Downloaded: 167 times

Tags (9):

BioAID_DiseaseDiscovery_RatHumanMouseUniprotFilter (v4)

Created: 15/12/08 @ 20:46:09 | Last updated: 11/08/11 @ 09:22:23

Credits: Marco Roos (user), AID (network member)

License: Creative Commons Attribution-Share Alike 3.0 Unported License

This workflow finds diseases relevant to the query string via the following steps: 1. A user query: a list of terms or a boolean query; see the Apache Lucene project for all details. E.g.: (EZH2 OR "Enhancer of Zeste" +(mutation chromatin) -clinical); consider adding 'ProteinSynonymsToQuery' in front of the input if your query is a protein. 2. Retrieve documents: finds 'maximumNumberOfHits' relevant documents (abstract+title) based on query (the AIDA service inside is based on Apa...

Rating: 4.0 / 5 (2 ratings) | Versions: 4 | Reviews: 0 | Comments: 3 | Citations: 0

Viewed: 4032 times | Downloaded: 616 times

Tags (9):

Linked Data

Non-Information Resource URI: http://www.myexperiment.org/workflows/111

