Terms_from_collection_of_PDF_filesWorkflow6_pdfDirectoryPathIn00 single input value, absolute path to a directory containing one or more PDF files 2010-02-19 09:48:04.478 GMT /Users/jameseales/temp/pdf 2010-02-19 09:48:12.534 GMT Workflow10_cValueThreshold00 c-value threshold, terms with score below this value are excluded from the "above threshold" output 2010-02-19 09:49:13.222 GMT 5.0 2010-02-19 09:49:16.902 GMT Workflow10_allTermCandidates All terms found by termine 2010-02-19 09:49:41.614 GMT Workflow10_termCandidatesAboveThreshold Terms with c-value scores above threshold 2010-02-19 09:50:04.870 GMT Workflow6pdfDirectoryPathIn0pdfFileContentsOut11net.sf.taverna.t2.activitiesdataflow-activity1.0.1net.sf.taverna.t2.activities.dataflow.DataflowActivitynet.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Parallelize 1 net.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.ErrorBouncenet.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Failovernet.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Retry 1.0 1000 5000 0 net.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.InvokeWorkflow7pdfFileContentsIn0textFileContentsOut00net.sf.taverna.t2.activitiesdataflow-activity1.0.1net.sf.taverna.t2.activities.dataflow.DataflowActivitynet.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Parallelize 1 net.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.ErrorBouncenet.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Failovernet.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Retry 1.0 1000 5000 0 
net.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.InvokeWorkflow18plainTextForCleaning0cleanedTextASCII00net.sf.taverna.t2.activitiesdataflow-activity1.0.1net.sf.taverna.t2.activities.dataflow.DataflowActivitynet.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Parallelize 1 net.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.ErrorBouncenet.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Failovernet.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Retry 1.0 1000 5000 0 net.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.InvokeWorkflow9plainText0sentencesList11net.sf.taverna.t2.activitiesdataflow-activity1.0.1net.sf.taverna.t2.activities.dataflow.DataflowActivitynet.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Parallelize 1 net.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.ErrorBouncenet.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Failovernet.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Retry 1.0 1000 5000 0 net.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.InvokeWorkflow10sentencesList1cValueThreshold0allTermCandidates11termCandidatesAboveThreshold11net.sf.taverna.t2.activitiesdataflow-activity1.0.1net.sf.taverna.t2.activities.dataflow.DataflowActivitynet.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Parallelize 1 
net.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.ErrorBouncenet.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Failovernet.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Retry 1.0 1000 5000 0 net.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.InvokeWorkflow6pdfDirectoryPathInWorkflow6_pdfDirectoryPathInWorkflow7pdfFileContentsInWorkflow6pdfFileContentsOutWorkflow18plainTextForCleaningWorkflow7textFileContentsOutWorkflow9plainTextWorkflow18cleanedTextASCIIWorkflow10sentencesListWorkflow9sentencesListWorkflow10cValueThresholdWorkflow10_cValueThresholdWorkflow10_allTermCandidatesWorkflow10allTermCandidatesWorkflow10_termCandidatesAboveThresholdWorkflow10termCandidatesAboveThreshold PDF to terms with c-value 2010-02-19 10:35:26.900 GMT This workflow will give you a set of candidate terms for each PDF document in a user-specified directory. You can also specify a c-value threshold that will restrict the terms to those with higher scores. 
2010-02-19 10:51:42.213 GMT Terms from collection of PDF files 2010-02-19 10:35:55.32 GMT James Eales 2010-02-19 10:35:02.187 GMT Workflow18plainTextForCleaning00cleanedTextASCIIcleanTextASCIIparameters0parameters00net.sf.taverna.t2.activitieswsdl-activity1.0.1net.sf.taverna.t2.activities.wsdl.WSDLActivity http://gnode1.mib.man.ac.uk:8080/FullTextWebServices/TextCleanerService?wsdl cleanTextASCII net.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Parallelize 1 net.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.ErrorBouncenet.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Failovernet.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Retry 1.0 1000 5000 0 net.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.InvokecleanTextASCII_inputtextStringBytesUTF80output00net.sf.taverna.t2.activitieswsdl-activity1.0.1net.sf.taverna.t2.activities.wsdl.xmlsplitter.XMLInputSplitterActivity <s:extensions xmlns:s="http://org.embl.ebi.escience/xscufl/0.1alpha"><s:complextype optional="false" unbounded="false" typename="cleanTextASCII" name="parameters" qname="{http://webservices.fulltextwebservices/}cleanTextASCII"><s:elements><s:basetype optional="true" unbounded="false" typename="base64Binary" name="textStringBytesUTF8" qname="cleanTextASCII&gt;textStringBytesUTF8" /></s:elements></s:complextype></s:extensions> false textStringBytesUTF8 0 'application/octet-stream' 0 output 0 'text/xml' net.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Parallelize 1 
net.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.ErrorBouncenet.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Failovernet.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Retry 1.0 1000 5000 0 net.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.InvokecleanTextASCII_outputinput0cleanedTextASCII00net.sf.taverna.t2.activitieswsdl-activity1.0.1net.sf.taverna.t2.activities.wsdl.xmlsplitter.XMLOutputSplitterActivity <s:extensions xmlns:s="http://org.embl.ebi.escience/xscufl/0.1alpha"><s:complextype optional="false" unbounded="false" typename="cleanTextASCIIResponse" name="parameters" qname="{http://webservices.fulltextwebservices/}cleanTextASCIIResponse"><s:elements><s:basetype optional="true" unbounded="false" typename="string" name="cleanedTextASCII" qname="cleanTextASCIIResponse&gt;cleanedTextASCII" /></s:elements></s:complextype></s:extensions> java.lang.String false input 0 'text/xml' 0 cleanedTextASCII 0 'text/plain' net.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Parallelize 1 net.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.ErrorBouncenet.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Failovernet.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Retry 1.0 1000 5000 0 net.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.InvokeencodeByteArrayToBase64bytes0base6400net.sf.taverna.t2.activitieslocalworker-activity1.0net.sf.taverna.t2.activities.localworker.LocalworkerActivity org.embl.ebi.escience.scuflworkers.java.EncodeBase64 commons-codec:commons-codec:1.3 workflow commons-codec commons-codec 1.3 1635363350 [B true bytes 0 
'application/octet-stream' 0 base64 0 'text/plain' net.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Parallelize 1 net.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.ErrorBouncenet.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Failovernet.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Retry 1.0 1000 5000 0 net.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.InvokecleanTextASCIIparameterscleanTextASCII_inputoutputcleanTextASCII_inputtextStringBytesUTF8encodeByteArrayToBase64base64cleanTextASCII_outputinputcleanTextASCIIparametersencodeByteArrayToBase64bytesplainTextForCleaningcleanedTextASCIIcleanTextASCII_outputcleanedTextASCIIWorkflow6pdfDirectoryPathIn00pdfFileContentsOutList_Files_by_Extensiondirectory0extension0filelist11net.sf.taverna.t2.activitieslocalworker-activity1.0net.sf.taverna.t2.activities.localworker.LocalworkerActivity net.sourceforge.taverna.scuflworkers.io.FileListByExtTask workflow java.lang.String true directory 0 'text/plain' java.lang.String true extension 0 'text/plain' 1 filelist 1 l('text/plain') net.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Parallelize 1 net.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.ErrorBouncenet.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Failovernet.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Retry 1.0 1000 5000 0 net.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Invokeextension_valuevalue00net.sf.taverna.t2.activitiesstringconstant-activity1.0net.sf.taverna.t2.activities.stringconstant.StringConstantActivity 
pdf net.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Parallelize 1 net.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.ErrorBouncenet.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Failovernet.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Retry 1.0 1000 5000 0 net.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.InvokebinaryFileReaderabsoluteFilePath0fileContents00net.sf.taverna.t2.activitiesbeanshell-activity1.0net.sf.taverna.t2.activities.beanshell.BeanshellActivity workflow java.lang.String true absoluteFilePath 0 text/plain 0 fileContents 0 net.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Parallelize 1 net.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.ErrorBouncenet.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Failovernet.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Retry 1.0 1000 5000 0 net.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.InvokeList_Files_by_ExtensiondirectorypdfDirectoryPathInList_Files_by_Extensionextensionextension_valuevaluebinaryFileReaderabsoluteFilePathList_Files_by_ExtensionfilelistpdfFileContentsOutbinaryFileReaderfileContentsWorkflow10sentencesList11cValueThreshold00 5.0 2010-02-14 16:49:44.949 GMT c-value threshold (a higher score indicates a higher probability of being a real term). To get good-quality terms, raise this value; to get all terms, lower it. 
2010-02-14 16:49:37.741 GMT termCandidatesAboveThresholdxpathOutputallTermCandidatessentenceListNormalisersentences1sentenceListString00net.sf.taverna.t2.activitiesbeanshell-activity1.0net.sf.taverna.t2.activities.beanshell.BeanshellActivity workflow java.lang.String true sentences 1 text/plain 0 sentenceListString 0 net.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Parallelize 1 net.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.ErrorBouncenet.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Failovernet.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Retry 1.0 1000 5000 0 net.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Invokeanalyzefilter0input_format0output_format0stoplist0src0result00net.sf.taverna.t2.activitieswsdl-activity1.0.1net.sf.taverna.t2.activities.wsdl.WSDLActivity http://www.nactem.ac.uk/software/termine/webservice/termine.wsdl analyze net.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Parallelize 1 net.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.ErrorBouncenet.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Failovernet.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Retry 1.0 1000 5000 0 net.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Invokefilter_valuevalue00net.sf.taverna.t2.activitiesstringconstant-activity1.0net.sf.taverna.t2.activities.stringconstant.StringConstantActivity net.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Parallelize 1 
net.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.ErrorBouncenet.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Failovernet.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Retry 1.0 1000 5000 0 net.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Invokeinput_format_valuevalue00net.sf.taverna.t2.activitiesstringconstant-activity1.0net.sf.taverna.t2.activities.stringconstant.StringConstantActivity net.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Parallelize 1 net.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.ErrorBouncenet.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Failovernet.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Retry 1.0 1000 5000 0 net.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Invokeoutput_format_valuevalue00net.sf.taverna.t2.activitiesstringconstant-activity1.0net.sf.taverna.t2.activities.stringconstant.StringConstantActivity xml net.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Parallelize 1 net.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.ErrorBouncenet.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Failovernet.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Retry 1.0 1000 5000 0 
net.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Invokestoplist_valuevalue00net.sf.taverna.t2.activitiesstringconstant-activity1.0net.sf.taverna.t2.activities.stringconstant.StringConstantActivity net.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Parallelize 1 net.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.ErrorBouncenet.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Failovernet.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Retry 1.0 1000 5000 0 net.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.InvokejamesXPathxml0xPathString0resultValues11net.sf.taverna.t2.activitiesbeanshell-activity1.0net.sf.taverna.t2.activities.beanshell.BeanshellActivity workflow java.lang.String true xml 0 text/plain java.lang.String true xPathString 0 text/plain 1 resultValues 1 net.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Parallelize 1 net.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.ErrorBouncenet.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Failovernet.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Retry 1.0 1000 5000 0 net.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.InvokeConcatenate_two_stringsstring20string10output00net.sf.taverna.t2.activitieslocalworker-activity1.0net.sf.taverna.t2.activities.localworker.LocalworkerActivity org.embl.ebi.escience.scuflworkers.java.StringConcat workflow java.lang.String true string1 0 'text/plain' java.lang.String true string2 0 'text/plain' 0 output 0 'text/plain' 
net.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Parallelize 1 net.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.ErrorBouncenet.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Failovernet.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Retry 1.0 1000 5000 0 net.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Invokexpath_first_bitvalue00net.sf.taverna.t2.activitiesstringconstant-activity1.0net.sf.taverna.t2.activities.stringconstant.StringConstantActivity //termCandidate[@cValueScore>= net.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Parallelize 1 net.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.ErrorBouncenet.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Failovernet.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Retry 1.0 1000 5000 0 net.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.InvokeConcatenate_two_strings_2string10string20output00net.sf.taverna.t2.activitieslocalworker-activity1.0net.sf.taverna.t2.activities.localworker.LocalworkerActivity org.embl.ebi.escience.scuflworkers.java.StringConcat workflow java.lang.String true string1 0 'text/plain' java.lang.String true string2 0 'text/plain' 0 output 0 'text/plain' net.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Parallelize 1 
net.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.ErrorBouncenet.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Failovernet.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Retry 1.0 1000 5000 0 net.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Invokexpath_last_bitvalue00net.sf.taverna.t2.activitiesstringconstant-activity1.0net.sf.taverna.t2.activities.stringconstant.StringConstantActivity ]/text() net.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Parallelize 1 net.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.ErrorBouncenet.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Failovernet.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Retry 1.0 1000 5000 0 net.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.InvokejamesXPath_2xml0xPathString0resultValues11net.sf.taverna.t2.activitiesbeanshell-activity1.0net.sf.taverna.t2.activities.beanshell.BeanshellActivity workflow java.lang.String true xml 0 text/plain java.lang.String true xPathString 0 text/plain 1 resultValues 1 net.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Parallelize 1 net.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.ErrorBouncenet.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Failovernet.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Retry 1.0 1000 5000 0 
net.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.InvokexPathString_valuevalue00net.sf.taverna.t2.activitiesstringconstant-activity1.0net.sf.taverna.t2.activities.stringconstant.StringConstantActivity //termCandidate/text() net.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Parallelize 1 net.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.ErrorBouncenet.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Failovernet.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Retry 1.0 1000 5000 0 net.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.InvokesentenceListNormalisersentencessentencesListanalyzefilterfilter_valuevalueanalyzeinput_formatinput_format_valuevalueanalyzeoutput_formatoutput_format_valuevalueanalyzestopliststoplist_valuevalueanalyzesrcsentenceListNormalisersentenceListStringjamesXPathxmlanalyzeresultjamesXPathxPathStringConcatenate_two_strings_2outputConcatenate_two_stringsstring2cValueThresholdConcatenate_two_stringsstring1xpath_first_bitvalueConcatenate_two_strings_2string1Concatenate_two_stringsoutputConcatenate_two_strings_2string2xpath_last_bitvaluejamesXPath_2xmlanalyzeresultjamesXPath_2xPathStringxPathString_valuevaluetermCandidatesAboveThresholdjamesXPathresultValuesxpathOutputConcatenate_two_strings_2outputallTermCandidatesjamesXPath_2resultValuesWorkflow9plainText00sentencesListsplitIntoSentencesparameters0parameters00net.sf.taverna.t2.activitieswsdl-activity1.0.1net.sf.taverna.t2.activities.wsdl.WSDLActivity http://gnode1.mib.man.ac.uk:8080/FullTextWebServices/SentenceSplitterService?wsdl splitIntoSentences net.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Parallelize 1 
net.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.ErrorBouncenet.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Failovernet.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Retry 1.0 1000 5000 0 net.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.InvokesplitIntoSentences_inputtextToSplit0output00net.sf.taverna.t2.activitieswsdl-activity1.0.1net.sf.taverna.t2.activities.wsdl.xmlsplitter.XMLInputSplitterActivity <s:extensions xmlns:s="http://org.embl.ebi.escience/xscufl/0.1alpha"><s:complextype optional="false" unbounded="false" typename="splitIntoSentences" name="parameters" qname="{http://webservices.fulltextwebservices/}splitIntoSentences"><s:elements><s:basetype optional="true" unbounded="false" typename="string" name="textToSplit" qname="splitIntoSentences&gt;textToSplit" /></s:elements></s:complextype></s:extensions> false textToSplit 0 'text/plain' 0 output 0 'text/xml' net.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Parallelize 1 net.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.ErrorBouncenet.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Failovernet.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Retry 1.0 1000 5000 0 net.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.InvokesplitIntoSentences_outputinput0sentencesArray11net.sf.taverna.t2.activitieswsdl-activity1.0.1net.sf.taverna.t2.activities.wsdl.xmlsplitter.XMLOutputSplitterActivity <s:extensions xmlns:s="http://org.embl.ebi.escience/xscufl/0.1alpha"><s:complextype optional="false" unbounded="false" typename="splitIntoSentencesResponse" name="parameters" 
qname="{http://webservices.fulltextwebservices/}splitIntoSentencesResponse"><s:elements><s:arraytype optional="true" unbounded="true" wrapped="false" typename="string" name="sentencesArray" qname="splitIntoSentencesResponse&gt;sentencesArray"><s:elementtype><s:basetype optional="false" unbounded="false" typename="string" name="" qname="{http://www.w3.org/2001/XMLSchema}string" /></s:elementtype></s:arraytype></s:elements></s:complextype></s:extensions> java.lang.String false input 0 'text/xml' 1 sentencesArray 1 l('text/plain') net.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Parallelize 1 net.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.ErrorBouncenet.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Failovernet.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Retry 1.0 1000 5000 0 net.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.InvokesplitIntoSentencesparameterssplitIntoSentences_inputoutputsplitIntoSentences_inputtextToSplitplainTextsplitIntoSentences_outputinputsplitIntoSentencesparameterssentencesListsplitIntoSentences_outputsentencesArrayWorkflow7pdfFileContentsIn00textFileContentsOutpdfToTextparameters0parameters00net.sf.taverna.t2.activitieswsdl-activity1.0.1net.sf.taverna.t2.activities.wsdl.WSDLActivity http://gnode1.mib.man.ac.uk:8080/FullTextWebServices/PdfToTextService?wsdl pdfToText net.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Parallelize 1 
net.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.ErrorBouncenet.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Failovernet.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Retry 1.0 1000 5000 0 net.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.InvokepdfToText_inputpdfFile0output00net.sf.taverna.t2.activitieswsdl-activity1.0.1net.sf.taverna.t2.activities.wsdl.xmlsplitter.XMLInputSplitterActivity <s:extensions xmlns:s="http://org.embl.ebi.escience/xscufl/0.1alpha"><s:complextype optional="false" unbounded="false" typename="pdfToText" name="parameters" qname="{http://webservices.fulltextwebservices/}pdfToText"><s:elements><s:basetype optional="true" unbounded="false" typename="base64Binary" name="pdfFile" qname="pdfToText&gt;pdfFile" /></s:elements></s:complextype></s:extensions> false pdfFile 0 'application/octet-stream' 0 output 0 'text/xml' net.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Parallelize 1 net.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.ErrorBouncenet.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Failovernet.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Retry 1.0 1000 5000 0 net.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.InvokepdfToText_outputinput0extractedText00net.sf.taverna.t2.activitieswsdl-activity1.0.1net.sf.taverna.t2.activities.wsdl.xmlsplitter.XMLOutputSplitterActivity <s:extensions xmlns:s="http://org.embl.ebi.escience/xscufl/0.1alpha"><s:complextype optional="false" unbounded="false" typename="pdfToTextResponse" name="parameters" 
qname="{http://webservices.fulltextwebservices/}pdfToTextResponse"><s:elements><s:basetype optional="true" unbounded="false" typename="base64Binary" name="extractedText" qname="pdfToTextResponse&gt;extractedText" /></s:elements></s:complextype></s:extensions> java.lang.String false input 0 'text/xml' 0 extractedText 0 'application/octet-stream' net.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Parallelize 1 net.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.ErrorBouncenet.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Failovernet.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Retry 1.0 1000 5000 0 net.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.InvokeEncode_Byte_Array_to_Base_64bytes0base6400net.sf.taverna.t2.activitieslocalworker-activity1.0net.sf.taverna.t2.activities.localworker.LocalworkerActivity org.embl.ebi.escience.scuflworkers.java.EncodeBase64 commons-codec:commons-codec:1.3 workflow commons-codec commons-codec 1.3 1635363350 [B true bytes 0 'application/octet-stream' 0 base64 0 'text/plain' net.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Parallelize 1 net.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.ErrorBouncenet.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Failovernet.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Retry 1.0 1000 5000 0 
net.sf.taverna.t2.coreworkflowmodel-impl1.0net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.InvokepdfToTextparameterspdfToText_inputoutputpdfToText_inputpdfFileEncode_Byte_Array_to_Base_64base64pdfToText_outputinputpdfToTextparametersEncode_Byte_Array_to_Base_64bytespdfFileContentsIntextFileContentsOutpdfToText_outputextractedText
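
The threshold step in Workflow10 builds its XPath by concatenating the constant `//termCandidate[@cValueScore>=`, the cValueThreshold input, and the constant `]/text()`, then evaluates it against the TerMine XML result. The all-terms output uses `//termCandidate/text()`. The following is a minimal Python sketch of that logic, not the workflow's own Beanshell code. The function names, the `termine` wrapper element, and the sample terms and scores are hypothetical; only the `termCandidate` element and the `cValueScore` attribute come from the XPath strings above. Python's standard `xml.etree.ElementTree` does not evaluate numeric comparisons in XPath predicates, so the sketch applies the `>=` comparison in plain Python instead.

```python
import xml.etree.ElementTree as ET

# The two string constants used by the workflow (xpath_first_bit, xpath_last_bit).
XPATH_FIRST_BIT = "//termCandidate[@cValueScore>="
XPATH_LAST_BIT = "]/text()"


def build_threshold_xpath(threshold: float) -> str:
    """Mirror the two string concatenations that assemble the XPath."""
    return f"{XPATH_FIRST_BIT}{threshold}{XPATH_LAST_BIT}"


def all_terms(xml_text: str) -> list[str]:
    """Equivalent of //termCandidate/text(): every candidate term."""
    root = ET.fromstring(xml_text)
    return [(el.text or "").strip() for el in root.iter("termCandidate")]


def terms_above_threshold(xml_text: str, threshold: float) -> list[str]:
    """Equivalent of the threshold XPath: keep terms scoring >= threshold."""
    root = ET.fromstring(xml_text)
    kept = []
    for el in root.iter("termCandidate"):
        score = el.get("cValueScore")
        # Candidates without a score are skipped (an assumption of this sketch).
        if score is not None and float(score) >= threshold:
            kept.append((el.text or "").strip())
    return kept


# Hypothetical sample; the real TerMine output format may differ.
SAMPLE = (
    "<termine>"
    '<termCandidate cValueScore="7.5">gene expression</termCandidate>'
    '<termCandidate cValueScore="2.0">cell</termCandidate>'
    '<termCandidate cValueScore="5.0">protein kinase</termCandidate>'
    "</termine>"
)

if __name__ == "__main__":
    print(build_threshold_xpath(5.0))
    print(all_terms(SAMPLE))
    print(terms_above_threshold(SAMPLE, 5.0))
```

Because the comparison is `>=`, a term whose score equals the threshold is kept, which matches the input description that only terms scoring below the threshold are excluded.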