Version 2 (latest)
(of 2)
|
Version created on:
18/02/11 @ 13:45:51
by:
Paul Fisher
|
Revision comments
Last edited on: 18/02/11 @ 13:47:08 by: Paul Fisher
Title: Pathway and Gene to Pubmed
Type: Taverna 2
Preview
(Click on the image to get the full size)
Description
This workflow takes in a list of gene names and KEGG pathway descriptions, and searches the PubMed database for corresponding articles. Any matches to the genes are then retrieved (abstracts only). These abstracts are then used to calculate a cosine vector space between two sets of corpora (gene and phenotype, or pathway and phenotype). The workflow counts the number of articles in the pubmed database in which each term occurs, and identifies the total number of articles in the entire PubMed database. It also identified the total number of articles within pubmed so that a term enrichment score may be calculated. The workflow also takes in a document containing abstracts that are related to a particular phenotype. Scientiifc terms are then extracted from this text and given a weighting according to the number of terms that appear in the document. The higher the value the better the score. This is given as: X = log((a / b) / (c / d)) where: a = number of occurnaces of individual terms in phenotype corpus b = number of abstracts in entire phenotype corpus c = number of occurnaces of individual terms in entire pubmed d = number of articles in entire pubmed Once this has been created, the pathways obtained from the QTL and microarray pathway analysis workflows are analysed. The documents from a search of each pathway in pubmed are merged into a single document of pathway abstracts. The (unweighted) phenotype terms are then searched in the pathways corpus. This will determine if the phenotype term is listed with the given pathway. The higher the value the better the score. Each term is then assigned a weight as: Y = log((e / f) / (c /d)) where: a = number of occurnaces of individual terms in pathway corpus b = number of abstracts in pathway corpus (per pathway) c = number of occurnaces of individual terms in entire pubmed d = number of articles in entire pubmed The weighted terms are then given a link score. This is the total of: X + Y. This gives the link between the pathway and the phenotype a score / significance value. The higher the score the more "appropriate/interesting" the link between the pathway and the phenotype. The terms are also ranked according to the number of pathways which have been given a weight. This is calculated as: W = ( X + Y). The higher the value the better the score.
Download
Run
Option 1:
Copy and paste this link into File > 'Open workflow location...'
http://www.myexperiment.org/workflows/1846/download?version=2
[ More Info
]
Workflow Components
Authors (1)
Titles (2)
Descriptions (3)
Workflow Type
Log in to add Tags
Shared with Groups (0)
None
Log in to add to one of your Packs
Current:
0.0 / 5
(0 ratings)
Log in to rate and see breakdown of ratings
Statistics
None
Earliest Version:
[1] - Pathway and Gene to Pubmed
Latest Version:
[2] - Pathway and Gene to Pubmed
Reviews
(0)
Other workflows that use similar services
(30)
Only the first 2 workflows that use similar services are shown. View all workflows that use these services.
|
Original Uploader |
Created: 08/02/11 @ 13:04:06 | Last updated: 10/02/11 @ 16:01:41
Credits:
Attributions:
License: Creative Commons Attribution-Share Alike 3.0 Unported License
This workflow takes in a list of gene names and searches the PubMed database for corresponding articles. Any matches to the genes are then retrieved (abstracts only). These abstracts are then returned to the user.
Rating: 0.0 / 5 (0 ratings) | Versions: 4 | Reviews: 0 | Comments: 0 | Citations: 0 Viewed: 43 times | Downloaded: 23 times Tags (30): |
View
Download (v4)
|
|
Original Uploader |
Created: 05/07/10 @ 14:07:33 | Last updated: 11/01/11 @ 12:07:22
Credits:
License: Creative Commons Attribution-Share Alike 3.0 Unported License
This workflow takes in a phenotype search term, and searches for abstracts in the PubMed database. These are passed to the eSearch function and searched for in PubMed. Those abstracts found are returned to the user
Rating: 0.0 / 5 (0 ratings) | Versions: 3 | Reviews: 0 | Comments: 0 | Citations: 0 Viewed: 109 times | Downloaded: 28 times Tags (18): |
View
Download (v3)
|
Linked Data
Non-Information Resource URI: http://www.myexperiment.org/workflows/1846
Alternative Formats
Copyright © 2007 - 2011 The University of Manchester and University of Southampton
No comments yet
Log in to make a comment