Version 1 (of 1)
Created: 18/02/10 @ 18:59:35 by James Eales | Last edited: 18/02/10 @ 19:03:45 by James Eales
Title: Clean plain text
Type: Taverna 2
Description
This workflow removes all XML-invalid characters from the text supplied to its input port. Such characters often appear in the output of PDF-to-text software.
This is a workflow component, designed to be used as a nested workflow inside a larger text mining or text processing workflow.
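The workflow's internals are not shown on this page, so the following is only a minimal Python sketch of the same idea, assuming "XML-invalid" means any character outside the `Char` production of XML 1.0. The function name `clean_plain_text` is hypothetical and is not taken from the workflow.

```python
import re

# Assumption: "XML-invalid" means outside the XML 1.0 Char production:
#   #x9 | #xA | #xD | [#x20-#xD7FF] | [#xE000-#xFFFD] | [#x10000-#x10FFFF]
_XML_INVALID = re.compile(
    r"[^\u0009\u000A\u000D\u0020-\uD7FF\uE000-\uFFFD\U00010000-\U0010FFFF]"
)


def clean_plain_text(text: str) -> str:
    """Return text with every XML 1.0-invalid character removed.

    Hypothetical helper for illustration; not the workflow's own code.
    """
    return _XML_INVALID.sub("", text)


if __name__ == "__main__":
    # Form feeds (\x0c) commonly separate pages in PDF-to-text output.
    sample = "Page one.\x0cPage two.\x00\n"
    print(repr(clean_plain_text(sample)))
```

Tabs, line feeds and carriage returns are kept because XML 1.0 allows them; other control characters, such as form feed and NUL, are dropped.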
Download and run
Option 1: Copy and paste this link into File > 'Open workflow location...'
http://www.myexperiment.org/workflows/1055/download?version=1
Other workflows that use similar services (5)
Only the first 2 of these workflows are shown.
|
Created: 06/05/11 @ 16:52:35 | Last updated: 13/12/11 @ 15:58:54
License: Creative Commons Attribution-Share Alike 3.0 Unported License
This workflow accepts plain text input and produces a single text document per input, containing one sentence per line. Newline characters are removed from the original input.
The text is split using the OpenNLP sentence splitter, which is provided by University of Manchester Web Services.
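To illustrate the input and output described above, here is a rough Python sketch. It does not call OpenNLP or the Manchester web service: a naive regular expression stands in for the trained sentence splitter, so abbreviations such as "e.g." will be split incorrectly. The function name `one_sentence_per_line` is hypothetical.

```python
import re

# Naive stand-in for a trained sentence splitter (an assumption, not OpenNLP):
# treat whitespace that follows '.', '!' or '?' as a sentence boundary.
_BOUNDARY = re.compile(r"(?<=[.!?])\s+")


def one_sentence_per_line(text: str) -> str:
    """Remove original newlines, then emit one sentence per line."""
    # Collapsing all whitespace runs removes the newlines of the input.
    flat = " ".join(text.split())
    sentences = _BOUNDARY.split(flat)
    return "\n".join(s for s in sentences if s)


if __name__ == "__main__":
    wrapped = "First sentence.\nSecond one is\nwrapped. Third?"
    print(one_sentence_per_line(wrapped))
```

Flattening the whitespace before splitting means line breaks inside a sentence do not survive into the output.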
Rating: 0.0 / 5 (0 ratings) | Versions: 1 | Reviews: 0 | Comments: 0 | Citations: 0 | Viewed: 19 times | Downloaded: 14 times | Tags: 7
|
Created: 19/02/10 @ 10:52:29 | Last updated: 13/12/11 @ 15:56:08
License: Creative Commons Attribution-Share Alike 3.0 Unported License
This workflow produces a set of candidate terms for each PDF document in a user-specified directory. You can also specify a c-value threshold to restrict the output to terms with higher scores.
This workflow was created using only nested workflows. These workflow components work on their own and can be linked together to form more complex workflows such as this. You can view the text mining workflow components in this pack.
If you receive errors when running this workflow t...
Rating: 0.0 / 5 (0 ratings) | Versions: 2 | Reviews: 0 | Comments: 0 | Citations: 0 | Viewed: 71 times | Downloaded: 34 times | Tags: 4
Linked Data
Non-Information Resource URI: http://www.myexperiment.org/workflows/1055
Copyright © 2007 - 2011 The University of Manchester and University of Southampton