Workflow RDKit-pains (4)

If you like this workflow, please reference our paper (doi:10.1002/minf.201100076) and check the related workflows RDKit-pains-parallel and Indigo-pains. *** Update 20151119 - using KNIME 3 and RDKit version of PAINS queries *** Implementation of the PAINS filters[1] using the RDKit nodes in KNIME (3.0.1). Original PAINS filters were published in SLN format. This workflow contains the SMARTS form of the filters published by Greg Landrum as part of the RDKit library[2], whic...

Created: 2011-02-07 | Last updated: 2015-11-19

Credits: User sauberns

Attributions: Workflow Indigo-pains

Workflow Liliopsida Protein Alignment (6)

This workflow retrieves Liliopsida chloroplast petB gene sequences from NCBI Nucleotide, removes duplicate sequences and saves the results on the BioExtract Server. These results are then converted into GenBank format and fed into Fetch Translation, which extracts the translation from the CDS region. The translations are then used to build a multiple alignment using ClustalW.

Created: 2010-01-13 | Last updated: 2010-11-17

Credits: User Carol Lushbough
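
The duplicate-removal step described for the Liliopsida workflow can be illustrated with a minimal sketch. It assumes "duplicate" means an identical sequence string (ignoring case), which the description does not state; the FASTA records and the function names parse_fasta and remove_duplicate_sequences are hypothetical and are not part of the BioExtract Server.

```python
def parse_fasta(text):
    """Yield (header, sequence) pairs from FASTA-formatted text."""
    header, chunks = None, []
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            if header is not None:
                yield header, "".join(chunks)
            header, chunks = line[1:], []
        elif header is not None:
            chunks.append(line)
    if header is not None:
        yield header, "".join(chunks)


def remove_duplicate_sequences(records):
    """Keep the first record for each distinct sequence (case-insensitive).

    Assumption: two records are duplicates when their sequences are identical.
    """
    seen = set()
    unique = []
    for header, seq in records:
        key = seq.upper()
        if key not in seen:
            seen.add(key)
            unique.append((header, seq))
    return unique


# Hypothetical input: seq2 repeats seq1 in lower case.
example = """>seq1 hypothetical record
ATGAGTAAA
>seq2 hypothetical duplicate of seq1
atgagtaaa
>seq3 hypothetical record
ATGGCTTGG
"""
unique_records = remove_duplicate_sequences(parse_fasta(example))
```

Keeping the first occurrence preserves the retrieval order, which is convenient when the surviving records are passed on to a later alignment step.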


Workflow fetchEnsemblSeqsAndBlast (1)

This workflow allows you to configure a BioMart query to fetch the sequences you want from Ensembl. These sequences are retrieved and a BLAST database of them is created (by default, in the directory you ran Taverna from). Warning: This workflow assumes that you have blastall and formatdb installed on the machine, and that by default, these are both found or linked in /usr/local/bin. It also assumes that you have write permission to the directory you have run Taverna from. The beanshells "creat...

Created: 2008-04-18 | Last updated: 2008-04-18

Credits: User Bela

Workflow Download from ChemSpider using Accurate Mass (2)

No description

Created: 2007-11-26 | Last updated: 2008-02-05

Credits: User Egon Willighagen

Workflow Simplify a BLAST text file (2)

This workflow simplifies a BLAST text file into identifiers, descriptions and values (P, E-values). To extract the relevant ids etc., pass the relevant string into the corresponding port. For example, the default port in use is gi, and it has been passed the string "gi". For any other port, pass in a string that is the SAME as the port name, e.g. seq_id, p, per etc.

Created: 2007-10-03 | Last updated: 2009-07-28
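
As a rough illustration of the kind of simplification the BLAST workflow above describes, the sketch below splits one hit-summary line into an identifier, a description, a score and an E-value. The line layout (identifier, free-text description, score, E-value, separated by whitespace), the sample line and the function name simplify_hit_line are all assumptions made for this example; the workflow's actual parsing rules are not given in the description.

```python
import re

# Assumed layout of a hit-summary line:
#   <identifier> <description ...> <score> <e-value>
HIT_LINE = re.compile(
    r"^(?P<id>\S+)\s+(?P<desc>.*?)\s+(?P<score>\d+(?:\.\d+)?)\s+(?P<evalue>\S+)$"
)


def simplify_hit_line(line):
    """Return a dict of id, gi, description, score and E-value, or None."""
    match = HIT_LINE.match(line.strip())
    if match is None:
        return None
    fields = match.group("id").split("|")
    # For identifiers of the form gi|<number>|..., keep the number separately.
    gi = fields[1] if len(fields) > 1 and fields[0] == "gi" else None
    return {
        "id": match.group("id"),
        "gi": gi,
        "description": match.group("desc"),
        "score": float(match.group("score")),
        "evalue": float(match.group("evalue")),
    }


# Hypothetical hit line; the accession is a placeholder, not a real record.
hit = simplify_hit_line(
    "gi|12345|ref|XX_000000.0| hypothetical protein    100   1e-20"
)
```

Lines that do not fit the assumed layout return None rather than raising, so a caller can skip headers and alignment blocks in the same file.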

Workflow BioAID_ProteinDiscovery (8)

The workflow extracts protein names from documents retrieved from MEDLINE based on a user query (cf. Apache Lucene syntax). The protein names are filtered by checking whether a valid UniProt ID exists for the given protein name.

Created: 2010-05-10 | Last updated: 2013-08-16

Credits: User Marco Roos Network-member AID

Workflow KEGG pathways common to both QTL and micro... (3)

This workflow takes in two lists of KEGG pathway ids. These are designed to come from pathways found from genes in a QTL (Quantitative Trait Locus) region, and from pathways found from genes differentially expressed in a microarray study. By identifying the intersecting pathways from both studies, a more informative picture is obtained of the candidate processes involved in the expression of a phenotype. Example input for this workflow is given below (as newline separated values). qt...

Created: 2009-11-24 | Last updated: 2009-12-03
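
The intersection step of the KEGG pathway workflow above amounts to finding the ids present in both newline-separated lists. A minimal sketch follows; the function name common_pathways and the sample ids are illustrative placeholders chosen for this example, not the workflow's own example input.

```python
def common_pathways(qtl_ids, microarray_ids):
    """Return pathway ids present in both inputs, in first-seen QTL order.

    Both arguments are newline-separated strings of pathway ids.
    """
    microarray_set = {p.strip() for p in microarray_ids.splitlines() if p.strip()}
    seen = set()
    common = []
    for line in qtl_ids.splitlines():
        pid = line.strip()
        if pid and pid in microarray_set and pid not in seen:
            seen.add(pid)
            common.append(pid)
    return common


# Illustrative placeholder ids, one per line.
qtl_pathways = "path:mmu04010\npath:mmu04370\npath:mmu04210\n"
microarray_pathways = "path:mmu04210\npath:mmu04010\npath:mmu00190\n"
shared = common_pathways(qtl_pathways, microarray_pathways)
```

A plain set intersection would give the same members; the loop is used here only to keep the output order stable and free of repeats.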

Workflow FLOSS Communication Centralization Plot, E... (4)

The analysis in this workflow represents the basis of the analysis in our paper, Social dynamics of FLOSS team communication across channels. This workflow uses WSDL components to select periodized data from the FLOSSmole database and generate sociomatrices. The workflow parses the threaded list structure into a communication network based on reply-to relationships. In the analysis process, an edge weighting is applied so that older messages receive less weight using an exponential decay fun...

Created: 2009-02-07

Credits: User Andrea Wiggins User Crowston User James Howison
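
The edge weighting mentioned in the FLOSS workflow description, where older messages count for less, can be sketched as follows. This is not the authors' implementation: the half-life parameterisation, the 30-day default, the tuple layout (replier, original author, day sent) and the function name decayed_edge_weights are all assumptions made for illustration.

```python
import math
from collections import defaultdict


def decayed_edge_weights(messages, now, half_life_days=30.0):
    """Sum exponentially decayed weights per (replier, original author) edge.

    Each message contributes exp(-rate * age), where age = now - day sent
    and rate = ln(2) / half_life_days (an assumed parameterisation).
    """
    decay_rate = math.log(2) / half_life_days
    weights = defaultdict(float)
    for sender, replied_to, day in messages:
        age = now - day
        weights[(sender, replied_to)] += math.exp(-decay_rate * age)
    return dict(weights)


# Hypothetical reply-to records: (replier, original author, day sent).
messages = [
    ("alice", "bob", 0.0),
    ("alice", "bob", 30.0),
    ("carol", "alice", 30.0),
]
weights = decayed_edge_weights(messages, now=30.0)
```

With a 30-day half-life, a reply sent 30 days before the reference date contributes half the weight of one sent on the reference date.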

Workflow EBI_PICR_Sequence_to_UniParc_and_InterPro (2)

Given a protein sequence, get some information about it. Does this protein sequence occur in any of the protein databases (e.g. UniProtKB, PDB, etc.)? Using the PICR web service, map the sequence to a UniParc identifier. Which entries in the protein databases have this sequence? Using the UniParc database, get a summary of the databases and the entries in those databases which have this s...

Created: 2008-06-08 | Last updated: 2008-06-08

Credits: User Hamish McWilliam

Attributions: Workflow EBI_dbfetch_UniParc Workflow EBI_Fetch_InterPro_Matches_UniParc Workflow EBI_PICR_Sequence_to_ID

Workflow Fetch today's xkcd comic (1)

Use the local Java plugins and some filtering operations to fetch the comic strip image from xkcd. Based on the FetchDailyDilbert workflow.

Created: 2008-03-05 | Last updated: 2008-04-07

Credits: User Tomoinn User Stian Soiland-Reyes


