Use Case C, Worflow 4

Created: 2014-05-27 08:29:43      Last updated: 2014-11-25 16:12:17

Workflow used to obtain data for the research paper ‘The application of the Open Pharmacological Concepts Triple Store (Open PHACTS) to support Drug Discovery Research’, currently submitted to PLOS ONE.  

Requirements:
- Knime v2.9
- Open PHACTS Knime nodes 1.0.0 (DON'T use any later version!): https://github.com/openphacts/OPS-Knime

Installation:
- Download "org.openphacts.utils.json_1.0.0.zip" and unzip it in the plugins folder of your KNIME installation
- Download the workflow (Use Case C_Workflow 4.zip)
- Start your Knime environment (you should see already a couple of new nodes at the ‘Node Repository’, like OPS_KNIME)
- Import the workflow (file>>import-workflow...>>select archive file)

Running:
- Double click on the desired workflow
- Reset the OPS nodes (in orange)
- Execute the nodes. Because of a bug found, the OPS nodes have to be run one by one to avoid the retrieved columns of the metanodes get mixed up.

Description of the Workflow:
1. We start with the text ‘regulation of Vitamin D’ and obtain GO terms associated by using ‘Free text to Concept’ API call. The output of the API was restricted to ‘alternative’ exact match type, to include only GO terms.
2. After removing duplicates and GO:0046137 (not specific of ‘Vitamind D’) we use GO terms urls as input for ‘Hierarchies: Child’ API call to retrieve Children of the GO terms.
3. The output from above is splitted into positive and negative regulators by using the ‘Row Filter’ node. After this we remove duplicates using the ‘GroupBy’ node and prepare the input for ‘Target Class Member: List’ API by renaming ‘Childnode’ to ‘uri’.
4. ‘Target Class Member: List’ API call is used to retrieve gene products associated with GO terms. Then we remove duplicates and keep only human genes using the ‘Nominal Value Row filter’ node.
5. In order to compare with proteins from the initial pathway (a table including all ‘Uniprot IDs’ from initial pathway need to be created manually), the string http://purl.uniprot.org/uniprot/  is removed by using the ‘String Replacer’ node.
6. Using the ‘Reference Row Filter’ node, we are able to show genes that are common to initial pathway and genes that are not from positive and negative regulators.    

Information Preview

Medium

Information Run

Not available


Information Workflow Components

Not available

Information Workflow Type

KNIME

Information License

All versions of this Workflow are licensed under:

Information Version 1 (of 1)

Information Credits (4)

(People/Groups)

Information Attributions (0)

(Workflows/Files)

None

Information Tags (18)

Log in to add Tags

Information Shared with Groups (1)

Information Featured In Packs (0)

None

Log in to add to one of your Packs

Information Attributed By (0)

(Workflows/Files)

None

Information Favourited By (0)

No one

Information Statistics

 

Citations (0)

None


Version History

In chronological order:



Reviews Reviews (0)

No reviews yet

Be the first to review!



Comments Comments (0)

No comments yet

Log in to make a comment




Workflow Other workflows that use similar services (0)

There are no workflows in myExperiment that use similar services to this Workflow.