Text_preprocessingText00 An example of the input text is this one. The workflow will first remove the characters from this set: `~!@#$%^&*()_+=-{}|\][":;'?><,./. Then it will transform the text to lower case. The user will be prompted to select a dictionary for stop words from a list. The workflow will then remove the stop words based on the selected list. Stop words are words that carry little meaning, such as "the" and "an". The web service for stop word removal integrates six English stop word dictionaries and one for the Slovenian language. 2011-01-07 17:07:42.147 CET Plain text. 2011-01-07 16:36:58.151 CET An example of the input text is this one. The workflow will remove the stop words based on the selected dictionary. Stop words are words that carry little meaning, such as "the" and "an". The web service for stop word removal integrates six English stop word dictionaries and one for the Slovenian language. The user is prompted to choose a stop word dictionary. 2011-01-07 16:40:54.71 CET An example of the input text is this one. The workflow will remove the stop words based on the selected dictionary. Stop words are words that carry little meaning, such as "the" and "an". 
The 2011-01-07 16:38:33.236 CET TextWithoutStopWordsremoveStopwordsparameters0parameters00net.sf.taverna.t2.activitieswsdl-activity1.2net.sf.taverna.t2.activities.wsdl.WSDLActivity http://zulu.ijs.si:8086/SW_service?wsdl removeStopwords net.sf.taverna.t2.coreworkflowmodel-impl1.2net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Parallelize 1 net.sf.taverna.t2.coreworkflowmodel-impl1.2net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.ErrorBouncenet.sf.taverna.t2.coreworkflowmodel-impl1.2net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Failovernet.sf.taverna.t2.coreworkflowmodel-impl1.2net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Retry 1.0 1000 5000 0 net.sf.taverna.t2.coreworkflowmodel-impl1.2net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.InvokeremoveStopwords_inputstopwordsCollectioID0inputText0output00net.sf.taverna.t2.activitieswsdl-activity1.2net.sf.taverna.t2.activities.wsdl.xmlsplitter.XMLInputSplitterActivity <s:extensions xmlns:s="http://org.embl.ebi.escience/xscufl/0.1alpha"><s:complextype optional="false" unbounded="false" typename="removeStopwords" name="parameters" qname="{http://www.example.org/Stopwords/}removeStopwords"><s:elements><s:basetype optional="false" unbounded="false" typename="string" name="inputText" qname="&gt;removeStopwords&gt;inputText" /><s:basetype optional="true" unbounded="false" typename="string" name="stopwordsCollectioID" qname="&gt;removeStopwords&gt;stopwordsCollectioID" /></s:elements></s:complextype></s:extensions> false inputText 0 'text/plain' false stopwordsCollectioID 0 'text/plain' 0 output 0 'text/xml' net.sf.taverna.t2.coreworkflowmodel-impl1.2net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Parallelize 1 
net.sf.taverna.t2.coreworkflowmodel-impl1.2net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.ErrorBouncenet.sf.taverna.t2.coreworkflowmodel-impl1.2net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Failovernet.sf.taverna.t2.coreworkflowmodel-impl1.2net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Retry 1.0 1000 5000 0 net.sf.taverna.t2.coreworkflowmodel-impl1.2net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.InvokeWorkflow1wsdl0VariableName0Selection00net.sf.taverna.t2.activitiesdataflow-activity1.2net.sf.taverna.t2.activities.dataflow.DataflowActivitynet.sf.taverna.t2.coreworkflowmodel-impl1.2net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Parallelize 1 net.sf.taverna.t2.coreworkflowmodel-impl1.2net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.ErrorBouncenet.sf.taverna.t2.coreworkflowmodel-impl1.2net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Failovernet.sf.taverna.t2.coreworkflowmodel-impl1.2net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Retry 1.0 1000 5000 0 net.sf.taverna.t2.coreworkflowmodel-impl1.2net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Invokewsdl_valuevalue00net.sf.taverna.t2.activitiesstringconstant-activity1.2net.sf.taverna.t2.activities.stringconstant.StringConstantActivity http://zulu.ijs.si:8086/SW_service?wsdl net.sf.taverna.t2.coreworkflowmodel-impl1.2net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Parallelize 1 net.sf.taverna.t2.coreworkflowmodel-impl1.2net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.ErrorBouncenet.sf.taverna.t2.coreworkflowmodel-impl1.2net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Failovernet.sf.taverna.t2.coreworkflowmodel-impl1.2net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Retry 1.0 1000 5000 0 
net.sf.taverna.t2.coreworkflowmodel-impl1.2net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.InvokeVariableName_valuevalue00net.sf.taverna.t2.activitiesstringconstant-activity1.2net.sf.taverna.t2.activities.stringconstant.StringConstantActivity stopwordsCollectioID net.sf.taverna.t2.coreworkflowmodel-impl1.2net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Parallelize 1 net.sf.taverna.t2.coreworkflowmodel-impl1.2net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.ErrorBouncenet.sf.taverna.t2.coreworkflowmodel-impl1.2net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Failovernet.sf.taverna.t2.coreworkflowmodel-impl1.2net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Retry 1.0 1000 5000 0 net.sf.taverna.t2.coreworkflowmodel-impl1.2net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.InvokeremoveStopwords_outputinput0filteredText00net.sf.taverna.t2.activitieswsdl-activity1.2net.sf.taverna.t2.activities.wsdl.xmlsplitter.XMLOutputSplitterActivity <s:extensions xmlns:s="http://org.embl.ebi.escience/xscufl/0.1alpha"><s:complextype optional="false" unbounded="false" typename="removeStopwordsResponse" name="parameters" qname="{http://www.example.org/Stopwords/}removeStopwordsResponse"><s:elements><s:basetype optional="false" unbounded="false" typename="string" name="filteredText" qname="&gt;removeStopwordsResponse&gt;filteredText" /></s:elements></s:complextype></s:extensions> java.lang.String false input 0 'text/xml' 0 filteredText 0 'text/plain' net.sf.taverna.t2.coreworkflowmodel-impl1.2net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Parallelize 1 net.sf.taverna.t2.coreworkflowmodel-impl1.2net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.ErrorBouncenet.sf.taverna.t2.coreworkflowmodel-impl1.2net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Failovernet.sf.taverna.t2.coreworkflowmodel-impl1.2net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Retry 1.0 1000 5000 0 
net.sf.taverna.t2.coreworkflowmodel-impl1.2net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.InvokeBeanshelltext0LowerText00net.sf.taverna.t2.activitiesbeanshell-activity1.2net.sf.taverna.t2.activities.beanshell.BeanshellActivity workflow java.lang.String true text 0 text/plain 0 LowerText 0 net.sf.taverna.t2.coreworkflowmodel-impl1.2net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Parallelize 1 net.sf.taverna.t2.coreworkflowmodel-impl1.2net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.ErrorBouncenet.sf.taverna.t2.coreworkflowmodel-impl1.2net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Failovernet.sf.taverna.t2.coreworkflowmodel-impl1.2net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Retry 1.0 1000 5000 0 net.sf.taverna.t2.coreworkflowmodel-impl1.2net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.InvokefilterTextparameters0parameters00net.sf.taverna.t2.activitieswsdl-activity1.2net.sf.taverna.t2.activities.wsdl.WSDLActivity http://zulu.ijs.si:8086/TF_service?wsdl filterText net.sf.taverna.t2.coreworkflowmodel-impl1.2net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Parallelize 1 net.sf.taverna.t2.coreworkflowmodel-impl1.2net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.ErrorBouncenet.sf.taverna.t2.coreworkflowmodel-impl1.2net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Failovernet.sf.taverna.t2.coreworkflowmodel-impl1.2net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Retry 1.0 1000 5000 0 net.sf.taverna.t2.coreworkflowmodel-impl1.2net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.InvokefilterText_inputinputText0unwantedCharacters0output00net.sf.taverna.t2.activitieswsdl-activity1.2net.sf.taverna.t2.activities.wsdl.xmlsplitter.XMLInputSplitterActivity <s:extensions xmlns:s="http://org.embl.ebi.escience/xscufl/0.1alpha"><s:complextype optional="false" unbounded="false" typename="filterText" name="parameters" 
qname="{http://www.example.org/TextFiltering/}filterText"><s:elements><s:basetype optional="false" unbounded="false" typename="string" name="inputText" qname="&gt;filterText&gt;inputText" /><s:basetype optional="false" unbounded="false" typename="string" name="unwantedCharacters" qname="&gt;filterText&gt;unwantedCharacters" /></s:elements></s:complextype></s:extensions> false inputText 0 'text/plain' false unwantedCharacters 0 'text/plain' 0 output 0 'text/xml' net.sf.taverna.t2.coreworkflowmodel-impl1.2net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Parallelize 1 net.sf.taverna.t2.coreworkflowmodel-impl1.2net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.ErrorBouncenet.sf.taverna.t2.coreworkflowmodel-impl1.2net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Failovernet.sf.taverna.t2.coreworkflowmodel-impl1.2net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Retry 1.0 1000 5000 0 net.sf.taverna.t2.coreworkflowmodel-impl1.2net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.InvokeunwantedCharacters_valuevalue00net.sf.taverna.t2.activitiesstringconstant-activity1.2net.sf.taverna.t2.activities.stringconstant.StringConstantActivity `~!@#$%^&*()_+=-{}|\][":;'?><,./ net.sf.taverna.t2.coreworkflowmodel-impl1.2net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Parallelize 1 net.sf.taverna.t2.coreworkflowmodel-impl1.2net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.ErrorBouncenet.sf.taverna.t2.coreworkflowmodel-impl1.2net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Failovernet.sf.taverna.t2.coreworkflowmodel-impl1.2net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Retry 1.0 1000 5000 0 
net.sf.taverna.t2.coreworkflowmodel-impl1.2net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.InvokeremoveStopwordsparametersremoveStopwords_inputoutputremoveStopwords_inputstopwordsCollectioIDWorkflow1SelectionremoveStopwords_inputinputTextBeanshellLowerTextWorkflow1wsdlwsdl_valuevalueWorkflow1VariableNameVariableName_valuevalueremoveStopwords_outputinputremoveStopwordsparametersBeanshelltextfilterTextparametersfilterTextparametersfilterText_inputoutputfilterText_inputinputTextTextfilterText_inputunwantedCharactersunwantedCharacters_valuevalueTextWithoutStopWordsremoveStopwords_outputfilteredText fde29c0f-d18c-4611-9f30-ea363fe29db4 2011-01-07 16:58:03.586 CET Text preprocessing 2011-01-07 17:11:07.368 CET 127a86af-28de-413e-9594-d5da707f421b 2011-01-07 16:41:10.938 CET 1049c9c8-384e-48e8-a221-1cbf4507c655 2011-01-07 17:07:52.901 CET e53229f5-0f9d-439b-a9df-2b3a91752404 2011-01-07 16:38:48.670 CET e83f04fe-a7de-4e4e-ae6f-585ae42ae00e 2011-01-07 17:11:37.634 CET 6d3e3fdd-5136-46e9-8afd-d95e355c52fa 2011-01-07 17:03:44.662 CET dr. Petra Kralj Novak 2011-01-07 17:10:08.327 CET f5f48975-efe4-42ed-8003-e396e3eaaf5f 2011-01-07 15:59:26.931 CET Removes non-alphanumeric symbols, transforms the text to lower case, and removes stop words from the input text. 
2011-01-07 17:11:37.498 CET Workflow1VariableName00wsdl00SelectiongetValuesgetValues0return00net.sf.taverna.t2.activitieswsdl-activity1.2net.sf.taverna.t2.activities.wsdl.WSDLActivity http://ropot.ijs.si/webservices/janez/getvalues.php?wsdl getValues net.sf.taverna.t2.coreworkflowmodel-impl1.2net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Parallelize 1 net.sf.taverna.t2.coreworkflowmodel-impl1.2net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.ErrorBouncenet.sf.taverna.t2.coreworkflowmodel-impl1.2net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Failovernet.sf.taverna.t2.coreworkflowmodel-impl1.2net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Retry 1.0 1000 5000 0 net.sf.taverna.t2.coreworkflowmodel-impl1.2net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.InvokegetValues_2varname0wsdl0output00net.sf.taverna.t2.activitieswsdl-activity1.2net.sf.taverna.t2.activities.wsdl.xmlsplitter.XMLInputSplitterActivity <s:extensions xmlns:s="http://org.embl.ebi.escience/xscufl/0.1alpha"><s:complextype optional="false" unbounded="false" typename="getValues" name="getValues" qname="{urn:getvalues}getValues"><s:elements><s:basetype optional="false" unbounded="false" typename="string" name="wsdl" qname="getValues&gt;wsdl" /><s:basetype optional="false" unbounded="false" typename="string" name="varname" qname="getValues&gt;varname" /></s:elements></s:complextype></s:extensions> false wsdl 0 'text/plain' false varname 0 'text/plain' 0 output 0 'text/xml' net.sf.taverna.t2.coreworkflowmodel-impl1.2net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Parallelize 1 net.sf.taverna.t2.coreworkflowmodel-impl1.2net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.ErrorBouncenet.sf.taverna.t2.coreworkflowmodel-impl1.2net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Failovernet.sf.taverna.t2.coreworkflowmodel-impl1.2net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Retry 1.0 1000 5000 0 
net.sf.taverna.t2.coreworkflowmodel-impl1.2net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.InvokegetValues_returninput0values00net.sf.taverna.t2.activitieswsdl-activity1.2net.sf.taverna.t2.activities.wsdl.xmlsplitter.XMLOutputSplitterActivity <s:extensions xmlns:s="http://org.embl.ebi.escience/xscufl/0.1alpha"><s:complextype optional="false" unbounded="false" typename="getValuesResponse" name="return" qname="{urn:getvalues}getValuesResponse"><s:elements><s:basetype optional="false" unbounded="false" typename="string" name="values" qname="getValuesResponse&gt;values" /></s:elements></s:complextype></s:extensions> java.lang.String false input 0 'text/xml' 0 values 0 'text/plain' net.sf.taverna.t2.coreworkflowmodel-impl1.2net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Parallelize 1 net.sf.taverna.t2.coreworkflowmodel-impl1.2net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.ErrorBouncenet.sf.taverna.t2.coreworkflowmodel-impl1.2net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Failovernet.sf.taverna.t2.coreworkflowmodel-impl1.2net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Retry 1.0 1000 5000 0 net.sf.taverna.t2.coreworkflowmodel-impl1.2net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.InvokeSelecttitle0message0valueList1answer00net.sf.taverna.t2.activitieslocalworker-activity1.2net.sf.taverna.t2.activities.localworker.LocalworkerActivity net.sourceforge.taverna.scuflworkers.ui.SelectWorker workflow java.lang.String true valueList 1 l('text/plain') java.lang.String true message 0 'text/plain' java.lang.String true title 0 'text/plain' 0 answer 0 'text/plain' net.sf.taverna.t2.coreworkflowmodel-impl1.2net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Parallelize 1 
net.sf.taverna.t2.coreworkflowmodel-impl1.2net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.ErrorBouncenet.sf.taverna.t2.coreworkflowmodel-impl1.2net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Failovernet.sf.taverna.t2.coreworkflowmodel-impl1.2net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Retry 1.0 1000 5000 0 net.sf.taverna.t2.coreworkflowmodel-impl1.2net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Invoketitle_valuevalue00net.sf.taverna.t2.activitiesstringconstant-activity1.2net.sf.taverna.t2.activities.stringconstant.StringConstantActivity Select net.sf.taverna.t2.coreworkflowmodel-impl1.2net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Parallelize 1 net.sf.taverna.t2.coreworkflowmodel-impl1.2net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.ErrorBouncenet.sf.taverna.t2.coreworkflowmodel-impl1.2net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Failovernet.sf.taverna.t2.coreworkflowmodel-impl1.2net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Retry 1.0 1000 5000 0 net.sf.taverna.t2.coreworkflowmodel-impl1.2net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.InvokeSplit_string_into_string_list_by_regular_expressionstring0regex0split11net.sf.taverna.t2.activitieslocalworker-activity1.2net.sf.taverna.t2.activities.localworker.LocalworkerActivity org.embl.ebi.escience.scuflworkers.java.SplitByRegex workflow java.lang.String true string 0 'text/plain' java.lang.String true regex 0 'text/plain' 1 split 1 l('text/plain') net.sf.taverna.t2.coreworkflowmodel-impl1.2net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Parallelize 1 net.sf.taverna.t2.coreworkflowmodel-impl1.2net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.ErrorBouncenet.sf.taverna.t2.coreworkflowmodel-impl1.2net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Failovernet.sf.taverna.t2.coreworkflowmodel-impl1.2net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Retry 1.0 1000 5000 0 
net.sf.taverna.t2.coreworkflowmodel-impl1.2net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Invokeregex_valuevalue00net.sf.taverna.t2.activitiesstringconstant-activity1.2net.sf.taverna.t2.activities.stringconstant.StringConstantActivity , net.sf.taverna.t2.coreworkflowmodel-impl1.2net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Parallelize 1 net.sf.taverna.t2.coreworkflowmodel-impl1.2net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.ErrorBouncenet.sf.taverna.t2.coreworkflowmodel-impl1.2net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Failovernet.sf.taverna.t2.coreworkflowmodel-impl1.2net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.Retry 1.0 1000 5000 0 net.sf.taverna.t2.coreworkflowmodel-impl1.2net.sf.taverna.t2.workflowmodel.processor.dispatch.layers.InvokegetValuesgetValuesgetValues_2outputgetValues_2varnameVariableNamegetValues_2wsdlwsdlgetValues_returninputgetValuesreturnSelecttitletitle_valuevalueSelectmessageVariableNameSelectvalueListSplit_string_into_string_list_by_regular_expressionsplitSplit_string_into_string_list_by_regular_expressionstringgetValues_returnvaluesSplit_string_into_string_list_by_regular_expressionregexregex_valuevalueSelectionSelectanswer 09c5b15f-4661-4315-9d8b-7b853d65d43d 2010-12-23 13:39:32.118 CET 2564e998-3269-470a-a98e-6d5dfb2eceff 2010-12-23 13:40:38.453 CET
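The three preprocessing steps described in the workflow annotations (removing unwanted characters, lower-casing, and removing stop words), together with the splitting of the comma-separated dictionary list that feeds the selection prompt, can be sketched locally in Python. This is only an illustration under stated assumptions: the real workflow delegates to the remote filterText and removeStopwords web services, whose exact behaviour is not documented here. The function names, the `english_minimal` dictionary and its contents are hypothetical stand-ins, and deleting unwanted characters (rather than replacing them with spaces) is an assumption.

```python
import re

# Character set taken from the workflow's unwantedCharacters constant.
UNWANTED_CHARACTERS = "`~!@#$%^&*()_+=-{}|\\][\":;'?><,./"

# Hypothetical stand-in for the service's stop word dictionaries; the real
# service offers six English dictionaries and one Slovenian dictionary.
STOP_WORD_DICTIONARIES = {
    "english_minimal": {"the", "an", "a", "is", "of"},
}


def filter_text(input_text, unwanted_characters=UNWANTED_CHARACTERS):
    """Delete every unwanted character (assumed behaviour of filterText)."""
    return input_text.translate({ord(c): None for c in unwanted_characters})


def remove_stopwords(input_text, stop_words):
    """Drop whitespace-separated tokens that appear in the stop word set."""
    return " ".join(w for w in input_text.split() if w not in stop_words)


def split_choices(values, regex=","):
    """Split a delimited string of dictionary IDs into a list of choices."""
    return re.split(regex, values)


def preprocess(text, dictionary_id):
    """Run the steps in workflow order: filter, lower-case, remove stop words."""
    filtered = filter_text(text)
    lowered = filtered.lower()
    return remove_stopwords(lowered, STOP_WORD_DICTIONARIES[dictionary_id])


if __name__ == "__main__":
    print(split_choices("english_minimal,another_dictionary"))
    print(preprocess("An example of the input text is this one.", "english_minimal"))
```

Lower-casing before stop word removal matters in this sketch because the hypothetical dictionary holds lower-case entries only, so "An" would otherwise survive the filter.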