Discover_entities

Created: 2007-12-10 21:48:33      Last updated: 2007-12-10 22:54:42

This workflow contains the 'Named Entity Recognize' web service from the AIDA toolbox, created by Sophia Katrenko. It can be used to discover entities of a certain type (determined by 'learned_model') in documents provided in a lucene output format.

Known issues:

The output of NErecognize contains concepts with / characters, breaking the xml. For post-processing its results it is better to use string manipulation than xml manipulations. The output is per document, which means entities will be redundant if they occur in more than one document.

Information Preview

Information Run

Run this Workflow in the Taverna Workbench...

Option 1:

Copy and paste this link into File > 'Open workflow location...'
https://www.myexperiment.org/workflows/111/download?version=1
[ More InfoExpand ]


Information Workflow Components

Inputs (2)
Processors (3)
Beanshells (0)
Outputs (1)
Links (5)
Coordinations (0)

Information Workflow Type

Taverna 1

Information Uploader

Information License

All versions of this Workflow are licensed under:

Information Version 1 (earliest) (of 2)

View version:

Information Credits (3)

(People/Groups)

Information Attributions (0)

(Workflows/Files)

None

Information Tags (4)

Log in to add Tags

Information Shared with Groups (0)

None

Information Featured In Packs (0)

None

Log in to add to one of your Packs

Information Attributed By (0)

(Workflows/Files)

None

Information Favourited By (0)

No one

Information Statistics

 

Citations (0)

None


Version History

In chronological order:



Reviews Reviews (0)

No reviews yet

Be the first to review!



Comments Comments (0)

No comments yet

Log in to make a comment




Workflow Other workflows that use similar services (5)

Only the first 2 workflows that use similar services are shown. View all workflows that use these services.


Workflow BioAID_DiseaseDiscovery_RatHumanMouseUnipr... (4)

Thumb
This workflow finds disease relevant to the query string via the following steps: 1. A user query: a list of terms or boolean query - look at the Apache Lucene project for all details. E.g.: (EZH2 OR "Enhancer of Zeste" +(mutation chromatin) -clinical); consider adding 'ProteinSynonymsToQuery' in front of the input if your query is a protein. 2. Retrieve documents: finds 'maximumNumberOfHits' relevant documents (abstract+title) based on query (the AIDA service inside is based on Apa...

Created: 2008-12-15 | Last updated: 2011-08-11

Credits: User Marco Roos Network-member AID

Workflow BioAID_ProteinToDiseases (1)

Thumb
This workflow was based on BioAID_DiseaseDiscovery, changes: expects only one protein name, adds protein synonyms). This workflow finds diseases relevant to the query string via the following steps: A user query: a single protein name Add synonyms (service courtesy of Martijn Scheumie, Erasmus University Rotterdam) Retrieve documents: finds relevant documents (abstract+title) based on query Discover proteins: extract proteins discovered in the set of relevant abstracts 5. Link proteins ...

Created: 2007-11-14 | Last updated: 2007-11-15

Credits: User Marco Roos User Martijn Schuemie Network-member AID

Attributions: Workflow BioAID_DiseaseDiscovery_RatHumanMouseUniprotFilter