All content

Search filter terms
Filter by category
Filter by type
Filter by tag
Filter by user
Filter by licence
Filter by group
Filter by wsdl
Results per page:
Sort by:
Showing 19 results. Use the filters on the left and the search box below to refine the results.
Tag: term extraction

Workflow Termine with c-value threshold (1)

Thumb
This workflow accepts a list of sentences from a single document and returns the terms found by the TerMine web service. It also allows you to set a threshold c-value score so that only terms with a user-controlled probability (of being a real term) are returned as an output.   To get sentences to supply to this workflow you can use the sentence splitting workflow.  The TerMine service (used in this workflow) only accepts text in ASCII encoding, so you should also use the Clean p...

Created: 2010-02-19 | Last updated: 2011-12-13

Credits: User James Eales

Workflow Terms from collection of PDF files (2)

Thumb
This workflow will give you a set of candidate terms for each PDF document in a user-specified directory. You can also specify a c-value threshold that will restrict the terms to those with higher scores. This workflow was created using only nested workflows.  These workflow components work on their own and can be linked together to form more complex workflows such as this. You can view the text mining workflow components in this pack. If you receive errors when running this workflow t...

Created: 2010-02-19 | Last updated: 2011-12-13

Credits: User James Eales

Workflow Extract Scientific Terms (1)

Thumb
This workflow takes in a document containg text and removes any non-ascii characters. The cleaned text is then sent to a service in Dresden, to extract all scientific terms. These terms represent a concept profile for the input concpet. Any null values are also removed.

Created: 2009-08-10 | Last updated: 2009-08-10

Credits: User Paul Fisher

Uploader

Workflow Termine Webservice (1)

Thumb
Termine is a service provided by the National Centre for Text Mining (NaCTeM) to assist in the discovery of terms in text. More information on the Termine service can be found here. This workflow represents the simplest method of using Termine. The input represents a text string with the output being an string containing a representation of the list of terms, with their C-Value scores (representing significance in the text), in a simple xml format. Other variations of this tools will be adde...

Created: 2008-05-19 | Last updated: 2008-05-19

Credits: User Brian Rea Network-member National Centre for Text Mining (NaCTeM)

Workflow Terms from collection of text files (1)

Thumb
This workflow will give you a set of candidate terms for each text file in a user-specified directory. You can also specify a c-value threshold that will restrict the terms to those with higher scores. This workflow was created using only nested workflows.  These workflow components work on their own and can be linked together to form more complex workflows such as this. You can view the text mining workflow components in this pack. If you receive errors when running this workflow then...

Created: 2010-02-22 | Last updated: 2011-12-13

Credits: User James Eales

Workflow Rank Phenotype Terms (1)

Thumb
This workflow counts the number of articles in the pubmed database in which each term occurs, and identifies the total number of articles in the entire PubMed database. It also identified the total number of articles within pubmed so that a term enrichment score may be calculated. The workflow also takes in a document containing abstracts that are related to a particular phenotype. Scientiifc terms are then extracted from this text and given a weighting according to the number of terms that ...

Created: 2009-08-10

Credits: User Paul Fisher

Uploader

Blob Pathway Term Enrichment Scores

Created: 2009-08-11 14:23:49

Credits: User Paul Fisher

License: Creative Commons Attribution-Share Alike 3.0 Unported License

This file contains a list of each pathway identified from day 7 post infection and linked to the Tir1 QTL. With each pathway is a list of terms that are common to both pathway and phenotype corpora. These terms were ranked accoring to their enrichement scores. The higher the score, the more significant the term is in relation to correlating the pathway with the African trypanosomiasis resistance phenotype.

File type: Plain text

Comments: 0 | Viewed: 79 times | Downloaded: 0 times

Tags:

Uploader

Blob Pathway Abstracts for Day7 Microarray Tir1 QTL

Created: 2009-08-11 14:08:41 | Last updated: 2009-08-11 14:15:58

Credits: User Paul Fisher

License: Creative Commons Attribution-Share Alike 3.0 Unported License

This file contains all the abstracts for pathways found to be differentially expressed at day 7 post infection and intersect the Tir1 QTL region, from the African Trypanosomiasis project. Each pathway is listed as ">> [Pathway Name]", together with a PubMed identifier, date, and abstract for each article. Each pathway has been restricted to 500 abstracts, and is given in the date range 31/12/2007 to 01/01/2009. Note, some pathways do not have any abstracts available due to th...

File type: Plain text

Comments: 0 | Viewed: 47 times | Downloaded: 0 times

Tags:

Uploader

Blob PubMed Term Counts

Created: 2009-08-11 13:59:35

Credits: User Paul Fisher

License: Creative Commons Attribution-Share Alike 3.0 Unported License

This file contains a count of each phenotype term extracted from corpus of phenotype abstracts. Each value represents the number of articles in MEDLINE the term appears. The use of this file is to calculate a cosine vector score for correlating a given concept (e.g. pathway or gene) with a phenotype.

File type: Plain text

Comments: 0 | Viewed: 50 times | Downloaded: 0 times

Tags:

Uploader

Blob Phenotype Term Counts (in Phenotype Corpus)

Created: 2009-08-11 13:34:42 | Last updated: 2009-08-11 13:58:28

Credits: User Paul Fisher

License: Creative Commons Attribution-Share Alike 3.0 Unported License

This file contains a count of each phenotype term extracted from corpus of phenotype abstracts. Each value represents the number of articles in the phenotype corpus the term appears. The use of this file is to calculate a cosine vector score for correlating a given concept (e.g. pathway or gene) with a phenotype.

File type: Plain text

Comments: 0 | Viewed: 43 times | Downloaded: 0 times

Tags:

Results per page:
Sort by: