Workflow Entry: Data Set Metadata Generator

Created at: 19/08/08 @ 21:17:14      Last updated: 19/08/08 @ 21:25:54
Version 1 (of 1)

Version created on: 19/08/08 @ 21:17:14 by: Andrea Wiggins

Last edited on: 19/08/08 @ 21:25:54 by: Andrea Wiggins

Title: Data Set Metadata Generator

Type: Taverna 1



Description

This workflow generates ePrints XML import files containing data set metadata for the FLOSSmole project. It reads an input file produced by a SQL query against the Notre Dame SourceForge data dump and uses regular expressions to parse each filename for the data set’s source repository, download URL, and basic description. It also converts the epoch date into a SQL date format suitable for import, and converts the file size from bytes into larger units such as MB or GB. These data are inserted into an XML eprint record template (specific to the FLOSSmole ePrints repository configuration at wp.floss.syr.edu), and the individual eprint records are aggregated into a single XML import file.
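The two conversion steps mentioned above (epoch date to a SQL date format, and bytes to larger units) can be sketched in Python as follows. This is only an illustration, not the Taverna processors themselves: the function names are hypothetical, UTC is assumed for the timestamp, and 1024-based units with one decimal place are an assumption about the desired output.

```python
from datetime import datetime, timezone


def epoch_to_sql(epoch_seconds: int) -> str:
    """Format a Unix epoch timestamp as 'YYYY-MM-DD HH:MM:SS' (UTC assumed)."""
    moment = datetime.fromtimestamp(epoch_seconds, tz=timezone.utc)
    return moment.strftime("%Y-%m-%d %H:%M:%S")


def human_size(num_bytes: int) -> str:
    """Express a byte count in bytes, KB, MB, or GB (1024-based, assumed)."""
    size = float(num_bytes)
    unit = "bytes"
    for unit in ("bytes", "KB", "MB", "GB"):
        # Stop once the value fits the current unit, or at the largest unit.
        if size < 1024 or unit == "GB":
            break
        size /= 1024
    if unit == "bytes":
        return f"{num_bytes} bytes"
    return f"{size:.1f} {unit}"
```

For example, `human_size(1536)` yields `"1.5 KB"` and `epoch_to_sql(0)` yields `"1970-01-01 00:00:00"`.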

Unfortunately, I’m not sure that I can provide the input file due to license restrictions. I can provide the SQL query, however, so that anyone who has signed a license agreement for access to the ND SourceForge data can retrieve the same input:

SELECT f.filename, f.file_id, f.file_size, f.post_date
FROM sf0508.frs_file AS f, sf0508.groups AS g
WHERE g.unix_group_name = 'ossmole'
  AND f.group_id = g.group_id
ORDER BY f.post_date
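The query returns four columns per file: filename, file_id, file_size, and post_date. As a rough illustration only, the Python sketch below assumes the results were exported as tab-separated text (the actual export format is not stated here) and wraps each row in a placeholder XML record before aggregating the records into one document. The element names, function names, and sample rows are all hypothetical; they are not the FLOSSmole ePrints template.

```python
import csv
import io
import xml.etree.ElementTree as ET


def read_rows(tsv_text):
    """Parse tab-separated rows into (filename, file_id, file_size, post_date)."""
    reader = csv.reader(io.StringIO(tsv_text), delimiter="\t")
    return [(name, int(fid), int(size), int(date))
            for name, fid, size, date in reader]


def build_import(rows):
    """Aggregate one placeholder record per row into a single XML string."""
    root = ET.Element("eprints")  # hypothetical root element
    for name, fid, size, date in rows:
        record = ET.SubElement(root, "eprint")  # hypothetical record element
        ET.SubElement(record, "title").text = name
        ET.SubElement(record, "file_id").text = str(fid)
        ET.SubElement(record, "file_size").text = str(size)
        ET.SubElement(record, "post_date").text = str(date)
    return ET.tostring(root, encoding="unicode")


# Made-up sample rows, not real FLOSSmole data.
SAMPLE = (
    "example_a.txt.bz2\t101\t2048\t1200000000\n"
    "example_b.txt.bz2\t102\t4096\t1200003600\n"
)
xml_text = build_import(read_rows(SAMPLE))
```

Building the tree with `xml.etree.ElementTree` rather than string concatenation means special characters in filenames are escaped automatically.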


Run

Run this Workflow in the Taverna Workbench...

Option 1:

Note: you need to have both the WHIP Launcher and the Taverna myExperiment/WHIP plugin installed on your machine for this to work.

Option 2:

Copy and paste this link into File > 'Open workflow location...'
http://www.myexperiment.org/workflows/376/download?version=1


Workflow Components

Inputs (0)
Processors (13)
Outputs (1)
Links (19)
Coordinations (0)
Taverna 1 workflow

License

All versions of this Workflow are licensed under the Creative Commons Attribution-Share Alike 3.0 License.


Version History

Earliest Version:
[1] - Data Set Metadata Generator

Created on: Tuesday 19 August 2008 @ 21:17:14 (BST)

Created by: Andrea Wiggins

Last edited on: Tuesday 19 August 2008 @ 21:25:54 (BST)

Last edited by: Andrea Wiggins

Revision comments:

None

This Workflow only has one version.


