All content

Search filter terms
Filter by category
Filter by type
Filter by tag
Filter by user
Filter by licence
Filter by group
Filter by wsdl
Results per page:
Sort by:
Showing 20 results. Use the filters on the left and the search box below to refine the results.
Type: Taverna 2 Tag: quality assurance Licence: by-sa
Uploader

Workflow ARC to WARC Migration with CDX Index and w... (1)

Thumb
Workflow for migrating ARC to WARC and comparing the CDX index files (Linux). The workflow has an input port “input_directory” which is a local path to the directory containing the ARC files, and an input port “output_directory” which is the directory where the workflow outputs are created. The files in the input directory are migrated using the “arc2warc_migration_cli” tool service component to perform the migration. The “cdx_creator_arc” and “cdx_creator_warc” tool service components creat...

Created: 2014-07-09

Credits: User Sven

Uploader

Workflow TIF to JP2 file format migration with qual... (1)

Thumb
This workflow reads a textfile containing absolute paths to TIF image files and converts them to JP2 image files using OpenJPEG (https://code.google.com/p/openjpeg). Based on the input text file, the workflow creates a Taverna list to be processed file by file. A temporary directory is created (createtmpdir) where the migrated image files and some temporary tool outputs are stored. Before starting the actual migration, it is checked if the TIF input images are valid file format instances u...

Created: 2014-05-07 | Last updated: 2014-05-07

Credits: User Sven

Workflow Slim Migrate And QA mp3 to Wav Using Hadoo... (4)

Thumb
This workflow migrates an input list (available on HDFS) of mp3 files (available on NFS) to wav files (in output directory on NFS) using an ffmpeg Hadoop job. The workflow then compares content of the original mp3 and the migrated wav by first converting the two files to wav using an mpg123 Hadoop job and the identity function respectively, and then using an xcorrSound waveform-compare Hadoop job. The needed Hadoop jobs are available from https://github.com/statsbiblioteket/scape-audio-qa-ex...

Created: 2014-02-21 | Last updated: 2014-06-30

Uploader

Workflow MatchboxHadoopAPI (1)

Thumb
The workflow MatchboxHadoopApi.t2flow enables using of matchbox tool on Hadoop with Taverna. This workflow is based on Python scripts and Hadoop Streaming API included in"pythonwf" folder of pc-qa-matchbox project on github (https://github.com/openplanets/scape/tree/master/pc-qa-matchbox/hadoop/pythonwf).For this workflow we assume that digital collection is located on HDFS and we have a list of input files in format "hdfs:///user/training/collection/00000032.jp2" - one ro...

Created: 2013-11-05

Credits: User Roman Network-member SCAPE

Workflow Validate_Compare_Compare_List (1)

Thumb
QA for file migrated to wav. The QA steps include File Format Validation, Significant Property Comparison and xcorrSound waveform-compare file content comparison (all CLI). Input the migrated wav and a "compare-to-wav" file.

Created: 2013-04-15

Credits: User Bolette Jurik

Uploader

Workflow JP2 to TIFF file format migration with qua... (1)

Thumb
This workflow reads a textfile containing absolute paths to JP2 image files and converts them to TIFF image files using Kakadu's j2k_to_image command line application (http://www.kakadusoftware.com). Based on the input text file, the workflow creates a Taverna list to be processed file by file. A temporary directory is created (createtmpdir) where the migrated image files and some temporary tool outputs are stored. Before converting the files, the JP2 input files are validated using the SC...

Created: 2013-02-07

Credits: User Sven

Uploader

Workflow AIT Matchbox Scenario Compare Image Pair b... (1)

Thumb
In this scenario matchbox will compare given image pair based on extracted profile information. User will get a histogram intersection distance value in result. Small value means high similarity, high value means different images. Matchbox in this scenario is installed on remote Linux VM. Digital collection is stored on Windows machine.

Created: 2012-11-24

Credits: User Roman

Uploader

Workflow AIT Matchbox Scenario Check Duplicate Pair... (1)

Thumb
In this scenario matchbox will check duplicate pair of previously found duplicates in passed digital collection if output information was lost. The pair check does not require the time consumpting whole analysis and is very fast. Matchbox in this scenario is installed on remote Linux VM. Digital collection is stored on Windows machine.

Created: 2012-11-24

Credits: User Roman Network-member SCAPE

Uploader

Workflow AIT Matchbox Scenario Find Duplicates usin... (1)

Thumb
In this scenario matchbox will find duplicates in passed digital collection. All matchbox workflow steps are defined separately using input parameter sequence: clean, extract, train, bowhist and compare. User will get a list of duplicates in result. Matchbox in this scenario is installed on remote Linux VM. Digital collection is stored on Windows machine.

Created: 2012-11-24

Credits: User Roman Network-member SCAPE

Uploader

Workflow AIT Matchbox Scenario Professional (3)

Thumb
In this scenario matchbox will find duplicates in passed digital collection. Each matchbox workflow step can be executed separately. User will get a list of duplicates in result. Matchbox in this scenario is installed on remote Linux VM. Digital collection is stored on Windows machine. This workflow starts duplicate finding process using the FindDuplicates python script of the matchbox tool. Matchbox tool support python in version 2.7. Execution starts from the directory where python scripts ...

Created: 2012-11-24 | Last updated: 2012-11-24

Credits: User Roman Network-member SCAPE

Results per page:
Sort by: