AIT Matchbox Scenario Professional

Created: 2012-11-24 22:47:17      Last updated: 2012-11-24 22:56:24

In this scenario, matchbox finds duplicates in a given digital collection and returns a list of duplicates as the result. Each step of the matchbox workflow can be executed separately. Matchbox is installed on a remote Linux VM, while the digital collection is stored on a Windows machine.

The workflow starts the duplicate-finding process using the FindDuplicates Python script of the matchbox tool. The matchbox tool supports Python 2.7. Execution starts from the directory where the Python scripts are located; if you use the source code from GitHub, this is the scape/pc-qa-matchbox/Python/ directory.

The Python script accepts several parameters. Experienced users can pass the extract, train, bowhist and compare parameters to execute the corresponding step of the matchbox duplicate-search workflow. The order of the steps must not be changed, because each step requires the output of the previous one. For example, to repeat the comparison step you must already have the BOWHistogram files calculated by the bowhist step.
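
The step ordering described above can be sketched as a small helper that builds one command line per step and refuses an out-of-order range. This is only an illustration: the helper names (build_commands, run_steps), the interpreter name python2.7, the script file name FindDuplicates.py and the argument layout "script, collection directory, step" are assumptions not confirmed by this page; consult the script's own help output for the real invocation.

```python
import subprocess

# Step order taken from the description: each step consumes the
# output of the step before it, so the order is fixed.
STEPS = ["extract", "train", "bowhist", "compare"]


def build_commands(collection_dir, first_step="extract", last_step="compare",
                   python_exe="python2.7", script="FindDuplicates.py"):
    """Return one argv list per step, in the required order.

    ASSUMPTION: the script is invoked as `<script> <collection_dir> <step>`.
    The real argument layout may differ.
    """
    start = STEPS.index(first_step)  # raises ValueError for an unknown step
    stop = STEPS.index(last_step)
    if start > stop:
        raise ValueError("first_step must not come after last_step")
    return [[python_exe, script, collection_dir, step]
            for step in STEPS[start:stop + 1]]


def run_steps(commands, scripts_dir):
    """Run each command from the scripts directory; stop at the first failure."""
    for argv in commands:
        # check_call raises CalledProcessError on a non-zero exit code,
        # so a failed step prevents the later steps from running.
        subprocess.check_call(argv, cwd=scripts_dir)
```

For example, build_commands("/data/collection", first_step="bowhist") yields only the bowhist and compare commands, which is appropriate only if the outputs of the extract and train steps already exist. Passing the result to run_steps with the scape/pc-qa-matchbox/Python/ directory as scripts_dir would execute them from the location the description requires.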

Workflow Components

Authors (1)
Titles (1)
Descriptions (1)
Dependencies (0)
Inputs (2)
Processors (2)
Beanshells (1)
Outputs (4)
Datalinks (8)
Coordinations (0)

Workflow Type

Taverna 2

Version 3 (latest) (of 3)
