Workflow Entry: BioAID_EnirchBioModelWithProteinsFromText

Created at: 16/05/09 @ 01:06:26      Last updated: 16/05/09 @ 01:13:18
Version 7 (latest) (of 7)

Version created on: 16/05/09 @ 01:06:26 by: Marco Roos

Last edited on: 16/05/09 @ 01:13:19 by: Marco Roos

Title: BioAID_EnirchBioModelWithProteinsFromText

Type: Taverna 1


Description

This workflow is for demonstration purposes only. Please contact the authors if you wish to try it. We will gladly collaborate with you.

Summary

This workflow extracts proteins and protein relations from Medline. Extracted protein names (symbols of at least 3 characters) are validated against mouse, rat, and human UniProt symbols, so the results are limited to these species. The workflow follows these basic steps:

  1. it retrieves documents relevant to the query string
  2. it discovers proteins in those documents that are considered relevant to the query string and related to the proteins mentioned in the query (colocation in text mining jargon)
  3. it stores the results in a semantic repository
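
The three steps can be pictured with a minimal Python sketch. It is not the workflow's implementation: the real workflow calls web services from Taverna, whereas here the document collection, the symbol set, and every function name are hypothetical in-memory stand-ins, and naive token matching takes the place of the actual protein recognition.

```python
import re

# Hypothetical stand-ins for Medline abstracts and for the set of
# mouse/rat/human UniProt symbols used for validation.
DOCUMENTS = {
    "doc1": "EZH2 interacts with EED and SUZ12 in the PRC2 complex.",
    "doc2": "HDAC1 was found together with EZH2 in the nucleus.",
    "doc3": "An unrelated abstract about soil chemistry.",
}
KNOWN_SYMBOLS = {"EZH2", "EED", "SUZ12", "HDAC1"}


def retrieve_documents(query, documents):
    """Step 1: keep the documents whose text mentions the query term."""
    return {doc_id: text for doc_id, text in documents.items() if query in text}


def discover_proteins(query, documents, known_symbols, min_length=3):
    """Step 2: collect validated symbols that occur alongside the query term."""
    found = {}
    for doc_id, text in documents.items():
        tokens = set(re.findall(r"[A-Za-z0-9]+", text))
        hits = {
            token for token in tokens
            if len(token) >= min_length and token in known_symbols and token != query
        }
        if hits:
            found[doc_id] = hits
    return found


def store_results(found, repository):
    """Step 3: append (protein, 'discoveredIn', document) statements."""
    for doc_id, proteins in sorted(found.items()):
        for protein in sorted(proteins):
            repository.append((protein, "discoveredIn", doc_id))
    return repository


relevant = retrieve_documents("EZH2", DOCUMENTS)
proteins = discover_proteins("EZH2", relevant, KNOWN_SYMBOLS)
repository = store_results(proteins, [])
```

Symbols shorter than three characters, or absent from the symbol set, never reach the repository, which mirrors the validation rule described in the summary.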

To support hypothesis formation, the results are added to a repository containing proto-ontologies with biological classes and procedural classes to log evidence. The models are based on RDF and OWL.
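
As an illustration of keeping biological statements apart from the procedural statements that log evidence, here is a small sketch using plain Python tuples as triples. The prefixes (`bio:`, `proc:`, `doc:`) and all class and property names are invented for this example and are not taken from the workflow's ontologies; an actual RDF library and repository would replace the two lists.

```python
# Two separate statement collections. All vocabulary below is hypothetical.
BIOLOGICAL = []   # what is claimed about biology, e.g. "EED is a protein"
PROCEDURAL = []   # how the claim was obtained, i.e. the evidence


def add_discovery(protein, document_id, run_id):
    """Record one discovered protein and, separately, the evidence for it."""
    BIOLOGICAL.append((f"bio:{protein}", "rdf:type", "bio:Protein"))
    evidence = f"proc:discovery_{run_id}_{protein}"
    PROCEDURAL.append((evidence, "rdf:type", "proc:ProteinDiscovery"))
    PROCEDURAL.append((evidence, "proc:discovered", f"bio:{protein}"))
    PROCEDURAL.append((evidence, "proc:inDocument", f"doc:{document_id}"))


def render(statements):
    """Render statements one per line as 'subject predicate object .'."""
    return "\n".join(f"{s} {p} {o} ." for s, p, o in statements)


add_discovery("EED", "doc1", run_id=1)
print(render(BIOLOGICAL))
print(render(PROCEDURAL))
```

Because the evidence node points at the biological instance rather than the other way round, the biological statements can be queried or replaced without touching the log of how they were found.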

Acknowledgements:

Synonyms and UniProt services: Martijn Schuemie, BioSemantics Group, University of Rotterdam, The Netherlands (BioRange project)

Known issues

Occasionally the workflow fails when an intermediate step returns no results (e.g. because of a timeout or a bug in the workflow). This problem will be addressed in Taverna 2 using its stricter list iteration mechanism and the AIDA plugin for Taverna 2. The workflow also contains some elements that are not yet functional; these show as failed when the workflow is run and can be ignored.
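
The failure mode described here, a step receiving nothing from the step before it, can be illustrated outside Taverna with a short Python sketch. It does not show how Taverna 2 or the AIDA plugin handle the problem; `run_step`, `flaky_lookup`, and the accession `ACC_0001` are made-up names for this example.

```python
def run_step(step, inputs):
    """Apply `step` to each input; set aside items that time out or yield nothing."""
    results = []
    skipped = []
    for item in inputs:
        try:
            output = step(item)
        except TimeoutError:
            skipped.append(item)      # the service did not answer in time
            continue
        if not output:
            skipped.append(item)      # empty result: do not pass it downstream
            continue
        results.append(output)
    return results, skipped


def flaky_lookup(name):
    """Hypothetical service call: one known name, one timeout, otherwise empty."""
    if name == "slow":
        raise TimeoutError("service did not answer")
    return {"EZH2": ["ACC_0001"]}.get(name, [])   # ACC_0001 is a placeholder


results, skipped = run_step(flaky_lookup, ["EZH2", "unknown", "slow"])
```

Downstream steps then only see `results`, while `skipped` keeps a record of the inputs that produced nothing.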

Please contact us if you have any questions about the workflow, our approach, or if you experience technical difficulties.


Run

Run this Workflow in the Taverna Workbench...

Option 1:

Note: both the WHIP Launcher and the Taverna myExperiment/WHIP plugin must be installed on your machine for this to work.

Option 2:

Copy and paste this link into File > 'Open workflow location...'
http://www.myexperiment.org/workflows/379/download?version=7
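
For scripted use, the link can be assembled and checked with Python's standard library. This sketch assumes that other workflow IDs and versions follow the same URL pattern as the link above, which this page does not confirm; `download_url` is a hypothetical helper, and no request is sent.

```python
from urllib.parse import parse_qs, urlencode, urlsplit

BASE = "http://www.myexperiment.org/workflows"


def download_url(workflow_id, version):
    """Build a download link following the pattern of the link shown above."""
    return f"{BASE}/{workflow_id}/download?{urlencode({'version': version})}"


url = download_url(379, 7)

# Read the version back out of the query string as a sanity check.
version = parse_qs(urlsplit(url).query)["version"][0]
print(url, version)
```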


Workflow Components

Inputs (4)
Processors (32)
Beanshells (87)
Outputs (7)
Links (112)
Coordinations (8)

Workflow Type

Taverna 1

License

All versions of this Workflow are licensed under:

Credits (8)

(People/Groups)

Attributions (0)

(Workflows/Files)

None

Tags (10)

Featured In Packs (2)

Ratings (0)

Attributed By (0)

(Workflows/Files)

None

Favourited By (0)

No one

 

Citations (0)

None


Version History

Earliest Version:
[1] - BioAID_ProteinDiscovery_HomoSapiens

Created on: Friday 22 August 2008 @ 11:00:29 (GMT)

Created by: Marco Roos

Last edited on: Tuesday 26 August 2008 @ 15:56:58 (GMT)

Last edited by: Marco Roos

Revision comments:

None

Previous Versions:
[2] - BioAID_EnirchBioModelWithProteinsFromText

Created on: Friday 22 August 2008 @ 11:00:29 (GMT)

Created by: Marco Roos

Last edited on: Tuesday 26 August 2008 @ 16:05:11 (GMT)

Last edited by: Marco Roos

Revision comments:

Protein discovery workflow that stores instances in a semantic model that separates biology (intensional) knowledge from procedural (extensional) knowledge.
Semantic types in the semantic sub-workflows are obtained provisionally as strings produced by a workflow (GetFromSesame.xml) that gets types from a Sesame repository containing the template ontologies.

[3] - BioAID_EnirchBioModelWithProteinsFromText

Created on: Friday 22 August 2008 @ 11:00:29 (GMT)

Created by: Marco Roos

Last edited on: Wednesday 27 August 2008 @ 23:15:01 (GMT)

Last edited by: Marco Roos

Revision comments:

Minor update. Changed the description.

[4] - BioAID_EnirchBioModelWithProteinsFromText

Created on: Wednesday 29 October 2008 @ 09:14:51 (GMT)

Created by: Marco Roos

Revision comments:

Adjustments for semantic model updates.

[5] - BioAID_EnirchBioModelWithProteinsFromText

Created on: Wednesday 29 October 2008 @ 09:27:18 (GMT)

Created by: Marco Roos

Last edited on: Friday 15 May 2009 @ 16:38:18 (GMT)

Last edited by: Marco Roos

Revision comments:

 Minor updates to get the syncing of document instances and proteins right.

[6] - BioAID_EnirchBioModelWithProteinsFromText

Created on: Saturday 16 May 2009 @ 00:57:14 (GMT)

Created by: Marco Roos

Revision comments:

  • Temporary switch to development server because of minor unresolved issues with production server
  • Changed location of SynSets service.

Latest Version:
[7] - BioAID_EnirchBioModelWithProteinsFromText

Created on: Saturday 16 May 2009 @ 01:06:26 (GMT)

Created by: Marco Roos

Last edited on: Saturday 16 May 2009 @ 01:13:19 (GMT)

Last edited by: Marco Roos

Revision comments:

  1. Temporary return to the AIDA development server due to minor issues with the production server
  2. Changed URL of SynSets service (it moved)

 



Comments (1)

  • Friday 27 November 2009 @ 11:43:02 (GMT)

    This workflow may need some work because of a recent server migration... Our apologies.




Other workflows that use similar services (2)

Workflow BioAID_ProteinDiscovery_filterOnHumanUniprot_perDoc_html (v11)

Created: 28/05/09 @ 12:21:05

Credits: Marco Roos (user), Martijn Schuemie (user), AID (network member), AID_myGrid_collaboration (network member)

Attributions: Workflow BioAID_DiseaseDiscovery_RatHumanMouseUniprotFilter

License: Creative Commons Attribution-Share Alike 3.0 Unported License

This workflow finds proteins relevant to the query string via the following steps:

  1. A user query: a single gene/protein name, e.g. (EZH2 OR "Enhancer of Zeste").
  2. Retrieve documents: finds 'maximumNumberOfHits' relevant documents (abstract + title) based on the query (the AIDA service inside is based on Apache's Lucene).
  3. Discover proteins: extract proteins discovered in the set of relevant abstracts with a 'named entity recognizer' trained on genomic terms using a Bayesian approach; the AIDA serv...
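
A query of the form shown in the example, (EZH2 OR "Enhancer of Zeste"), can be assembled from a list of names. The sketch below is only an illustration: `build_query` is a hypothetical helper, it quotes multi-word names as phrases, and it does not escape characters that are special in Lucene query syntax.

```python
def build_query(names):
    """Join names with OR, quoting any name that contains a space."""
    terms = [f'"{name}"' if " " in name else name for name in names]
    return "(" + " OR ".join(terms) + ")"


query = build_query(["EZH2", "Enhancer of Zeste"])
print(query)
```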

Rating: 0.0 / 5 (0 ratings) | Versions: 11 | Reviews: 0 | Comments: 1 | Citations: 0

Viewed: 454 times | Downloaded: 167 times

Tags (9):

Workflow BioAID_ProteinDiscovery (v7)

Created: 10/05/10 @ 16:21:09 | Last updated: 20/03/12 @ 17:16:11

Credits: Marco Roos (user), AID (network member)

License: Creative Commons Attribution-Share Alike 3.0 Unported License

This protein discovery workflow extracts protein names from documents retrieved from MedLine based on a user Query (cf Apache Lucene syntax). The protein names are filtered by checking if there exists a valid UniProt ID for the given protein name.

Rating: 0.0 / 5 (0 ratings) | Versions: 7 | Reviews: 0 | Comments: 1 | Citations: 0

Viewed: 292 times | Downloaded: 131 times

Tags (12):

Linked Data

Non-Information Resource URI: http://www.myexperiment.org/workflows/379

