Workflow Entry: FLOSS Communication Centralization Plot, Exponentially Weighted

Created at: 07/02/09 @ 18:50:44
Information Version 1 (earliest) (of 4)

Version created on: 10/02/08 @ 01:15:28 by: Andrea Wiggins

Last edited on: 10/02/08 @ 01:41:21 by: Andrea Wiggins

Title: FLOSS Communication Centralization Plot

Type: Taverna 1



Information Description

The analysis in this workflow is the basis of the analysis in our paper, "Social dynamics of FLOSS team communication across channels." This workflow uses WSDL components to select periodized data from the FLOSSmole database and generate sociomatrices. The workflow parses the threaded list structure into a communication network based on reply-to relationships. During the analysis, edges are weighted with an exponential decay function so that older messages receive less weight; this is intended to moderate the effects of using a sliding window of observations for dynamic analysis. The weighted sociomatrices are then dichotomized according to a threshold, their centralities are calculated using R's sna package, and the results are plotted as a time series. The final output shows communication centralization trends over time in a FLOSS project.
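
The weighting, dichotomization, and centralization steps described above can be sketched roughly as follows. This is a minimal Python illustration, not the workflow's actual WSDL or RShell code. The half-life parameter, the threshold value, the function names, and the choice of Freeman out-degree centralization are all assumptions made for the example; the description does not state which decay rate or centrality measure the workflow uses.

```python
import math
from datetime import date


def weighted_sociomatrix(replies, window_end, half_life_days=30.0):
    """Build a weighted sociomatrix from (sender, recipient, sent_date) edges.

    Each reply contributes exp(-decay * age_in_days), so older messages
    count for less. half_life_days is a hypothetical parameter.
    """
    decay = math.log(2) / half_life_days
    actors = sorted({a for s, r, _ in replies for a in (s, r)})
    index = {a: i for i, a in enumerate(actors)}
    n = len(actors)
    matrix = [[0.0] * n for _ in range(n)]
    for sender, recipient, sent in replies:
        age = (window_end - sent).days
        matrix[index[sender]][index[recipient]] += math.exp(-decay * age)
    return actors, matrix


def dichotomize(matrix, threshold):
    """Turn a weighted matrix into a 0/1 matrix: 1 where weight >= threshold."""
    return [[1 if w >= threshold else 0 for w in row] for row in matrix]


def outdegree_centralization(binary):
    """Freeman degree centralization on out-degree (assumed measure).

    Sums the gaps between the largest out-degree and every other out-degree,
    then divides by (n - 1)^2, the maximum for a directed graph without loops.
    """
    n = len(binary)
    if n < 2:
        return 0.0
    degrees = [sum(row[j] for j in range(n) if j != i)
               for i, row in enumerate(binary)]
    top = max(degrees)
    return sum(top - d for d in degrees) / ((n - 1) ** 2)
```

With a 30-day half-life, a reply that is 60 days old contributes a weight of 0.25, so it falls below a threshold of 0.5 and drops out of the dichotomized network.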

The analysis workflow allows users to set parameter values for the size of the sliding window, the date ranges covered, and the communication venues analyzed. The analysis in the paper examined dynamics in different communication channels; the data sets included up to 90 periods (with a 90-day sliding window) and tens of thousands of email messages or forum posts. Date periods are currently only in months, and exception handling for empty periods is implemented in both the WSDL components and the RShell scripts. There are a couple of inelegant solutions involving temporary output files, which are configured for a standard Mac file structure. Future development effort could focus on solving these issues and on optimizing performance.
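
The monthly periods and the 90-day sliding window might be generated along these lines. This is a hedged Python sketch rather than the workflow's own logic: the function names are hypothetical, and it assumes that each period ends on the first day of a month and that the window looks back a fixed number of days from that date.

```python
from datetime import date, timedelta


def monthly_windows(start, end, window_days=90):
    """Return (window_start, period_end) pairs, one per month boundary.

    period_end steps through the first day of each month after `start`
    up to and including `end`; each window looks back `window_days` days.
    """
    year, month = start.year, start.month
    windows = []
    while True:
        month += 1
        if month > 12:
            year, month = year + 1, 1
        period_end = date(year, month, 1)
        if period_end > end:
            break
        windows.append((period_end - timedelta(days=window_days), period_end))
    return windows


def messages_in_window(messages, window_start, period_end):
    """Select (sender, recipient, sent_date) tuples inside the window.

    Returns an empty list for an empty period, so callers can skip it
    instead of failing on a matrix with no actors.
    """
    return [m for m in messages if window_start <= m[2] < period_end]
```

An empty list from `messages_in_window` marks a period with no traffic; a caller could record a missing value for that period rather than computing a centralization score.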



Information Run

Run this Workflow in the Taverna Workbench...

Copy and paste this link into File > 'Open workflow location...'
http://www.myexperiment.org/workflows/144/download?version=1


Information Workflow Components

Inputs (4)
Processors (22)
Beanshells (0)
Outputs (1)
Links (32)
Coordinations (0)


Information Ratings (1)

Current: 5.0 / 5 (1 rating)



Version History

Earliest Version:
[1] - FLOSS Communication Centralization Plot

Created on: Sunday 10 February 2008 @ 01:15:28 (GMT)

Created by: Andrea Wiggins

Last edited on: Sunday 10 February 2008 @ 01:41:21 (GMT)

Last edited by: Andrea Wiggins

Revision comments:

None

Previous Versions:
[2] - FLOSS Communication Centralization Plot, Exponentially Weighted

Created on: Sunday 10 February 2008 @ 01:15:28 (GMT)

Created by: Andrea Wiggins

Last edited on: Friday 15 February 2008 @ 20:02:47 (GMT)

Last edited by: Andrea Wiggins

Revision comments:

This revision includes a bit more workflow metadata. Most importantly, the description for the Project_list input now contains a list of the project data available to use with this workflow as of 2/15/2008.

[3] - FLOSS Communication Centralization Plot, Exponentially Weighted

Created on: Sunday 10 February 2008 @ 01:15:28 (GMT)

Created by: Andrea Wiggins

Last edited on: Friday 07 March 2008 @ 01:55:29 (GMT)

Last edited by: Andrea Wiggins

Revision comments:

This revision eliminates two ungainly text output files. The ~/temp.data file, previously used to write out and read back in a sociomatrix, has been replaced with an internalized read.csv(textConnection()) call. The date-centralization pairlist that was previously written to ~/Desktop/cent-values.txt is now a proper workflow output.
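
The idea behind replacing the temporary file with read.csv(textConnection()) is to parse the sociomatrix directly from a string held in memory. A rough Python analogue of that idea is sketched below; it is not the workflow's R code. The function name and the CSV layout (a header row of actor names, then one row per actor) are assumptions made for illustration.

```python
import csv
import io


def sociomatrix_from_csv_text(text):
    """Parse a CSV-formatted sociomatrix from a string, with no temp file.

    Assumed layout: the header row is an empty cell followed by actor
    names; each later row is an actor name followed by edge weights.
    """
    reader = csv.reader(io.StringIO(text))
    header = next(reader)
    actors = header[1:]
    matrix = {}
    for row in reader:
        if not row:
            continue  # skip blank lines
        matrix[row[0]] = {actor: float(value)
                          for actor, value in zip(actors, row[1:])}
    return actors, matrix
```

Wrapping the string in `io.StringIO` plays the same role as `textConnection()` in R: the parser sees a file-like object, so nothing has to be written to a path such as ~/temp.data.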

Latest Version:
[4] - FLOSS Communication Centralization Plot, Exponentially Weighted

Created on: Saturday 07 February 2009 @ 18:50:44 (GMT)

Created by: Andrea Wiggins

Revision comments:

Updated the workflow to include full metadata.






Comments (2)

  • Friday 15 February 2008 @ 20:02:00 (GMT)

    We have made this workflow available to use with some FLOSSmole data. The metadata description of the Project_list input lists the projects whose data are available for use as of 2/15/2008.

    The project names are:

    fire-tracker-feature-requests
    fire-tracker-translations
    fire-talk
    fire-core
    fire-development
    gaim-cabal
    gaim-devel
    gaim-forum-users-helping-users
    gaim-tracker-plugins
    gaim-tracker-patches
    gaim-tracker-smileys
    gaim-tracker-rej-patches

  • Friday 15 February 2008 @ 20:22:44 (GMT)

    Start and end dates are also required for using this workflow.  The "active" project date ranges that we have chosen for the projects are:

    Fire, 2001-08-16 (registry) through 2006-03-15 (one month following final release 1.5.6)

    Gaim, 1999-11-13 (registry) through 2007-04-24 (when the project changed name to Pidgin and its lists moved)

    Note that not all date ranges contain data for all venues; some venues were not active as early as others, and some were ad-hoc communication venues (e.g., gaim-cabal).




Other workflows that use similar services (1)

FLOSS Communication Centralization Plot, Unit Weighted (v2)

Created: 07/02/09 @ 18:49:47

Credits: Andrea Wiggins, Crowston, James Howison

License: Creative Commons Attribution-Share Alike 3.0 Unported License

The analysis in this workflow is the basis of the analysis in our paper, "Social dynamics of FLOSS team communication across channels." This workflow uses WSDL components to select periodized data from the FLOSSmole database and generate sociomatrices. The workflow parses the threaded list structure into a communication network based on reply-to relationships. In the analysis process, a unit weighting is applied to the edges. The weighted sociomatrices are then dichotomized according ...

Rating: 0.0 / 5 (0 ratings) | Versions: 2 | Reviews: 0 | Comments: 0 | Citations: 0

Viewed: 145 times | Downloaded: 53 times


Linked Data

Non-Information Resource URI: http://www.myexperiment.org/workflows/144/versions/1

