Workflow: Text_preprocessing
Input port: Text (depth 0)

An example of the input text is this one. The workflow first removes every character in the following set:

    `~!@#$%^&*()_+=-{}|\][":;'?><,./

It then transforms the text to lower case. The user is prompted to select a stop-word dictionary from a list, and the workflow removes the stop words found in the selected dictionary.
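The first two steps described above, character removal and lower-casing, can be sketched in Python. This is only an illustration: the function names `filter_text` and `to_lower` are hypothetical, and the sketch assumes that unwanted characters are deleted outright rather than replaced by spaces, which the workflow description does not specify.

```python
# Character set copied from the workflow description (32 characters).
UNWANTED_CHARACTERS = "`~!@#$%^&*()_+=-{}|\\][\":;'?><,./"


def filter_text(input_text: str, unwanted_characters: str = UNWANTED_CHARACTERS) -> str:
    """Delete every occurrence of each unwanted character (assumed behaviour)."""
    deletion_table = {ord(ch): None for ch in unwanted_characters}
    return input_text.translate(deletion_table)


def to_lower(text: str) -> str:
    """Transform the text to lower case."""
    return text.lower()


sample = "An example of the input text is this one."
print(to_lower(filter_text(sample)))
# prints: an example of the input text is this one
```

If the real service substitutes a space for each removed character, only the translation table changes; the order of the two steps stays the same.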
Stop words are words that carry little meaning, such as "the" and "an". The stop-word removal web service integrates six English stop-word dictionaries and one for the Slovenian language. (Annotation dated 2011-01-07 17:07:42.147 CET.)

Port annotation: Plain text. (2011-01-07 16:36:58.151 CET)

Two earlier revisions of the description (2011-01-07 16:40:54.71 CET and 2011-01-07 16:38:33.236 CET) cover only the stop-word removal step. The first of these adds that the user is prompted to choose a stop-word dictionary; the second is cut off after "The".

Output port: TextWithoutStopWords


Processors

All activities come from net.sf.taverna.t2.activities, version 1.2. Every processor uses the same dispatch stack (net.sf.taverna.t2.core, workflowmodel-impl 1.2, package net.sf.taverna.t2.workflowmodel.processor.dispatch.layers): Parallelize (1), ErrorBounce, Failover, Retry (1.0, 1000, 5000, 0), Invoke.

removeStopwords
    Activity:  net.sf.taverna.t2.activities.wsdl.WSDLActivity (wsdl-activity)
    WSDL:      http://zulu.ijs.si:8086/SW_service?wsdl
    Operation: removeStopwords
    Input:     parameters (depth 0)
    Output:    parameters (depth 0)

removeStopwords_input
    Activity:  net.sf.taverna.t2.activities.wsdl.xmlsplitter.XMLInputSplitterActivity (wsdl-activity)
    Inputs:    stopwordsCollectioID (depth 0, 'text/plain'), inputText (depth 0, 'text/plain')
    Output:    output (depth 0, 'text/xml')
    Type definition:
    <s:extensions xmlns:s="http://org.embl.ebi.escience/xscufl/0.1alpha">
      <s:complextype optional="false" unbounded="false" typename="removeStopwords" name="parameters" qname="{http://www.example.org/Stopwords/}removeStopwords">
        <s:elements>
          <s:basetype optional="false" unbounded="false" typename="string" name="inputText" qname=">removeStopwords>inputText" />
          <s:basetype optional="true" unbounded="false" typename="string" name="stopwordsCollectioID" qname=">removeStopwords>stopwordsCollectioID" />
        </s:elements>
      </s:complextype>
    </s:extensions>

Workflow1
    Activity:  net.sf.taverna.t2.activities.dataflow.DataflowActivity (dataflow-activity), a nested workflow described below
    Inputs:    wsdl (depth 0), VariableName (depth 0)
    Output:    Selection (depth 0)

wsdl_value
    Activity:  net.sf.taverna.t2.activities.stringconstant.StringConstantActivity (stringconstant-activity)
    Value:     http://zulu.ijs.si:8086/SW_service?wsdl

VariableName_value
    Activity:  net.sf.taverna.t2.activities.stringconstant.StringConstantActivity (stringconstant-activity)
    Value:     stopwordsCollectioID

removeStopwords_output
    Activity:  net.sf.taverna.t2.activities.wsdl.xmlsplitter.XMLOutputSplitterActivity (wsdl-activity)
    Input:     input (depth 0, 'text/xml', java.lang.String)
    Output:    filteredText (depth 0, 'text/plain')
    Type definition:
    <s:extensions xmlns:s="http://org.embl.ebi.escience/xscufl/0.1alpha">
      <s:complextype optional="false" unbounded="false" typename="removeStopwordsResponse" name="parameters" qname="{http://www.example.org/Stopwords/}removeStopwordsResponse">
        <s:elements>
          <s:basetype optional="false" unbounded="false" typename="string" name="filteredText" qname=">removeStopwordsResponse>filteredText" />
        </s:elements>
      </s:complextype>
    </s:extensions>

Beanshell
    Activity:  net.sf.taverna.t2.activities.beanshell.BeanshellActivity (beanshell-activity)
    Input:     text (depth 0, text/plain, java.lang.String)
    Output:    LowerText (depth 0)
    Note:      the script body is not preserved in this text; judging by the workflow description and the output name, this is the step that transforms the text to lower case.

filterText
    Activity:  net.sf.taverna.t2.activities.wsdl.WSDLActivity (wsdl-activity)
    WSDL:      http://zulu.ijs.si:8086/TF_service?wsdl
    Operation: filterText
    Input:     parameters (depth 0)
    Output:    parameters (depth 0)

filterText_input
    Activity:  net.sf.taverna.t2.activities.wsdl.xmlsplitter.XMLInputSplitterActivity (wsdl-activity)
    Inputs:    inputText (depth 0, 'text/plain'), unwantedCharacters (depth 0, 'text/plain')
    Output:    output (depth 0, 'text/xml')
    Type definition:
    <s:extensions xmlns:s="http://org.embl.ebi.escience/xscufl/0.1alpha">
      <s:complextype optional="false" unbounded="false" typename="filterText" name="parameters" qname="{http://www.example.org/TextFiltering/}filterText">
        <s:elements>
          <s:basetype optional="false" unbounded="false" typename="string" name="inputText" qname=">filterText>inputText" />
          <s:basetype optional="false" unbounded="false" typename="string" name="unwantedCharacters" qname=">filterText>unwantedCharacters" />
        </s:elements>
      </s:complextype>
    </s:extensions>

unwantedCharacters_value
    Activity:  net.sf.taverna.t2.activities.stringconstant.StringConstantActivity (stringconstant-activity)
    Value:     `~!@#$%^&*()_+=-{}|\][":;'?><,./


Data links

Only the sink of each link is preserved in this text; the sources are not.

    removeStopwords: parameters
    removeStopwords_input: stopwordsCollectioID
    removeStopwords_input: inputText
    Workflow1: wsdl
    Workflow1: VariableName
    removeStopwords_output: input
    Beanshell: text
    filterText: parameters
    filterText_input: inputText
    filterText_input: unwantedCharacters
    TextWithoutStopWords (workflow output)


Workflow annotations

    Title:       Text preprocessing (2011-01-07 17:11:07.368 CET)
    Author:      dr. Petra Kralj Novak (2011-01-07 17:10:08.327 CET)
    Description: Removes non-alphanumeric symbols, transforms the text to lower case, and removes stop words from the input text. (2011-01-07 17:11:37.498 CET)

    Revision identifiers:
    fde29c0f-d18c-4611-9f30-ea363fe29db4  2011-01-07 16:58:03.586 CET
    127a86af-28de-413e-9594-d5da707f421b  2011-01-07 16:41:10.938 CET
    1049c9c8-384e-48e8-a221-1cbf4507c655  2011-01-07 17:07:52.901 CET
    e53229f5-0f9d-439b-a9df-2b3a91752404  2011-01-07 16:38:48.670 CET
    e83f04fe-a7de-4e4e-ae6f-585ae42ae00e  2011-01-07 17:11:37.634 CET
    6d3e3fdd-5136-46e9-8afd-d95e355c52fa  2011-01-07 17:03:44.662 CET
    f5f48975-efe4-42ed-8003-e396e3eaaf5f  2011-01-07 15:59:26.931 CET


Nested workflow: Workflow1

Input ports: VariableName (depth 0), wsdl (depth 0)
Output port: Selection

This nested workflow uses the same activity versions and the same dispatch stack as the parent workflow.

getValues
    Activity:  net.sf.taverna.t2.activities.wsdl.WSDLActivity (wsdl-activity)
    WSDL:      http://ropot.ijs.si/webservices/janez/getvalues.php?wsdl
    Operation: getValues
    Input:     getValues (depth 0)
    Output:    return (depth 0)

getValues_2
    Activity:  net.sf.taverna.t2.activities.wsdl.xmlsplitter.XMLInputSplitterActivity (wsdl-activity)
    Inputs:    varname (depth 0, 'text/plain'), wsdl (depth 0, 'text/plain')
    Output:    output (depth 0, 'text/xml')
    Type definition:
    <s:extensions xmlns:s="http://org.embl.ebi.escience/xscufl/0.1alpha">
      <s:complextype optional="false" unbounded="false" typename="getValues" name="getValues" qname="{urn:getvalues}getValues">
        <s:elements>
          <s:basetype optional="false" unbounded="false" typename="string" name="wsdl" qname="getValues>wsdl" />
          <s:basetype optional="false" unbounded="false" typename="string" name="varname" qname="getValues>varname" />
        </s:elements>
      </s:complextype>
    </s:extensions>

getValues_return
    Activity:  net.sf.taverna.t2.activities.wsdl.xmlsplitter.XMLOutputSplitterActivity (wsdl-activity)
    Input:     input (depth 0, 'text/xml', java.lang.String)
    Output:    values (depth 0, 'text/plain')
    Type definition:
    <s:extensions xmlns:s="http://org.embl.ebi.escience/xscufl/0.1alpha">
      <s:complextype optional="false" unbounded="false" typename="getValuesResponse" name="return" qname="{urn:getvalues}getValuesResponse">
        <s:elements>
          <s:basetype optional="false" unbounded="false" typename="string" name="values" qname="getValuesResponse>values" />
        </s:elements>
      </s:complextype>
    </s:extensions>

Select
    Activity:  net.sf.taverna.t2.activities.localworker.LocalworkerActivity (localworker-activity)
    Worker:    net.sourceforge.taverna.scuflworkers.ui.SelectWorker
    Inputs:    title (depth 0, 'text/plain'), message (depth 0, 'text/plain'), valueList (depth 1, l('text/plain'))
    Output:    answer (depth 0, 'text/plain')

title_value
    Activity:  net.sf.taverna.t2.activities.stringconstant.StringConstantActivity (stringconstant-activity)
    Value:     Select

Split_string_into_string_list_by_regular_expression
    Activity:  net.sf.taverna.t2.activities.localworker.LocalworkerActivity (localworker-activity)
    Worker:    org.embl.ebi.escience.scuflworkers.java.SplitByRegex
    Inputs:    string (depth 0, 'text/plain'), regex (depth 0, 'text/plain')
    Output:    split (depth 1, l('text/plain'))

regex_value
    Activity:  net.sf.taverna.t2.activities.stringconstant.StringConstantActivity (stringconstant-activity)
    Value:     ,

Data links (sinks only; the sources are not preserved):

    getValues: getValues
    getValues_2: varname
    getValues_2: wsdl
    getValues_return: input
    Select: title
    Select: message
    Select: valueList
    Split_string_into_string_list_by_regular_expression: string
    Split_string_into_string_list_by_regular_expression: regex
    Selection (workflow output)

Revision identifiers:
    09c5b15f-4661-4315-9d8b-7b853d65d43d  2010-12-23 13:39:32.118 CET
    2564e998-3269-470a-a98e-6d5dfb2eceff  2010-12-23 13:40:38.453 CET
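
The dictionary selection and stop-word removal steps can be illustrated with a small local Python sketch. It does not call the web services named in this workflow. The dictionary identifiers, their contents, the comma-separated reply format, and the function names `split_by_regex` and `remove_stopwords` are all assumptions made for illustration; only the "," separator and the general flow (fetch allowed values, split them into a list, let the user pick one, remove the stop words) come from the workflow itself.

```python
import re

# Hypothetical dictionary identifiers and contents, for illustration only.
# The real service offers six English dictionaries and one Slovenian one.
DICTIONARIES = {
    "english_small": {"an", "of", "the", "is", "this"},
    "slovenian": {"in", "je", "na"},
}


def split_by_regex(string: str, regex: str = ",") -> list[str]:
    """Split a string into a list on a regular expression, "," by default."""
    return re.split(regex, string)


def remove_stopwords(input_text: str, stopwords: set[str]) -> str:
    """Drop every whitespace-separated word that appears in the stop-word set."""
    return " ".join(word for word in input_text.split() if word not in stopwords)


# Assumed shape of the reply listing the allowed dictionary identifiers.
values = "english_small,slovenian"
choices = split_by_regex(values)

# Stand-in for the interactive selection dialog: take the first choice.
selected = choices[0]

print(remove_stopwords("an example of the input text is this one", DICTIONARIES[selected]))
# prints: example input text one
```

The sketch compares words exactly, which is why it expects text that has already been stripped of punctuation and lower-cased by the earlier steps.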