User: Sebastian land
|
|
Name: Sebastian land Joined: Thursday 05 November 2009 @ 09:21:14 (GMT) Last seen: Monday 16 January 2012 @ 16:29:43 (GMT) Email (public): Not specified Website: Not specified Location: Dortmund, Germany |
Sebastian land has been credited 0 times Sebastian land has an average rating of: 0.0 / 5 (0 ratings in total) for their items |
Description/summary not set
Other contact details:
Not specifiedInterests:
Not specifiedField/Industry: Data-Mining
Occupation/Role(s): Software Developer
Organisation(s):
Not specifiedNote: some items may not be visible to you, due to viewing permissions.
1. RBioconductor Tutorial Process
2. Constructing user defined linear regression model
3. Finding all Examples that have duplicate values in certain attributes
4. Preprocessing nominal data for frequent item set mining
5. Prepending common prefix to attributes
6. Combining nominal attributes with missing
7. Plot round results of a backward elimination
8. Setting an attribute value in a specific Example
9. Correct Attribute Type to Binominal
10. Defining positive class with Remap Binominal
11. Generating Example Weights depending on label
12. Looping over Examples for doing de-aggregation
13. Using Remember / Recall for "tunneling" results
|
Original Uploader |
Created: 03/01/11 @ 16:08:26 License: Creative Commons Attribution-No Derivative Works 3.0 Unported License
This Process illustrates how to create complex Data Analysis processes using R Scripts. In this case the Bioconductor Basic Tutorial is splitted into reusable components.
Rating: 0.0 / 5 (0 ratings) | Versions: 1 | Reviews: 0 | Comments: 0 | Citations: 0 Viewed: 32 times | Downloaded: 14 times This Workflow has no tags! |
View
Download (v1)
|
|
Original Uploader |
Created: 02/09/10 @ 09:31:10 | Last updated: 02/09/10 @ 09:31:11 License: Creative Commons Attribution-No Derivative Works 3.0 Unported License
This process shows how one can construct a user defined Linear Regression model using the execute script operator
Rating: 0.0 / 5 (0 ratings) | Versions: 1 | Reviews: 0 | Comments: 0 | Citations: 0 Viewed: 66 times | Downloaded: 53 times This Workflow has no tags! |
View
Download (v1)
|
|
Original Uploader |
Created: 18/06/10 @ 08:59:39 License: Creative Commons Attribution-No Derivative Works 3.0 Unported License
This process will retrieve all examples, who have identical values in a specific attribute. For testing, the following data can be writen into the file, that will be read by the Read CSV operator:
CID,Value
3596,X
4054,X
4054,X
3000,S
3000,T
3000,U
32135,S
The target of this process is to return the two examples having the same value in the CID column.
To achieve this, first a real id is generated by the generate id. After this, we have to find all duplicates: For this we first remove dupl...
Rating: 0.0 / 5 (0 ratings) | Versions: 1 | Reviews: 0 | Comments: 0 | Citations: 0 Viewed: 46 times | Downloaded: 26 times This Workflow has no tags! |
View
Download (v1)
|
|
Original Uploader |
Created: 16/06/10 @ 08:32:58 | Last updated: 16/06/10 @ 08:32:59 License: Creative Commons Attribution-No Derivative Works 3.0 Unported License
This process will first create artificial data that can be compared to usual data loaded for frequent item set mining: Nominal Data with a true and false value, but differently mapped to internal indices.
For ItemSet Mining these must be preprocessed to avoid problems: First they have to be transformed to Binominal Attributes, then it has to be defined, which is the positive and the negative value.
Rating: 0.0 / 5 (0 ratings) | Versions: 1 | Reviews: 0 | Comments: 0 | Citations: 0 Viewed: 47 times | Downloaded: 23 times This Workflow has no tags! |
View
Download (v1)
|
|
Original Uploader |
Created: 04/06/10 @ 07:22:58 License: Creative Commons Attribution-No Derivative Works 3.0 Unported License
This process shows how one can add a common prefix to a subset of attributes.
First a subset might be defined by the attribute set selection parameters of the rename by replacing operator. Then one can make use of the capturing group functionality of regular expressions to insert the original name into the replacement.
Rating: 0.0 / 5 (0 ratings) | Versions: 1 | Reviews: 0 | Comments: 0 | Citations: 0 Viewed: 21 times | Downloaded: 17 times This Workflow has no tags! |
View
Download (v1)
|
|
Original Uploader |
Created: 04/06/10 @ 07:20:31 License: Creative Commons Attribution-No Derivative Works 3.0 Unported License
This process will show how one can combine nominal values of attributes that contain missing values.
A generate attribute operator is used and hence forbidden characters must first be replaced. After this, a condition in the generation ensures that no question mark (standing for missing value) will be shown in the result, if any of the two combined attributes is known.
Rating: 0.0 / 5 (0 ratings) | Versions: 1 | Reviews: 0 | Comments: 0 | Citations: 0 Viewed: 26 times | Downloaded: 24 times This Workflow has no tags! |
View
Download (v1)
|
|
Original Uploader |
Created: 26/05/10 @ 16:52:24 License: Creative Commons Attribution-No Derivative Works 3.0 Unported License
This process will perform a backward elimination and logs all performance and deviation results of each round. This way, you can use the visualizations of rapidminer to asses the performance gain.
Rating: 0.0 / 5 (0 ratings) | Versions: 1 | Reviews: 0 | Comments: 0 | Citations: 0 Viewed: 40 times | Downloaded: 25 times This Workflow has no tags! |
View
Download (v1)
|
|
Original Uploader |
Created: 14/05/10 @ 08:25:01 License: Creative Commons Attribution-No Derivative Works 3.0 Unported License
This process will first generate an artificial dataset and tag it with ids.
Then a Filter Examples Operator is used to get a dataset with exactly one example identified by it's id. Then a value is set in this example.
Since the change in the data will be reflected in all views of the example set, a simply copy is passed by to the process' output. If you take a look at the attributes of the example with id 5, you will find the 12 there.
Rating: 0.0 / 5 (0 ratings) | Versions: 1 | Reviews: 0 | Comments: 0 | Citations: 0 Viewed: 49 times | Downloaded: 6 times This Workflow has no tags! |
View
Download (v1)
|
|
Original Uploader |
Created: 14/05/10 @ 07:27:59 License: Creative Commons Attribution-No Derivative Works 3.0 Unported License
This process will first generate some artificial data to show a commonly known problem:
Some attributes have only two values, but are not correctly stored as Binominal, instead RapidMiner recognizes them as nominal. In order to use them for some special operators, we have to change this to binominal. We can achieve this by using the Nominal to Binominal operator with the the parameter transform_binominals switched off.
Please take a look at the data before and after the Nominal to Binominal o...
Rating: 0.0 / 5 (0 ratings) | Versions: 1 | Reviews: 0 | Comments: 0 | Citations: 0 Viewed: 36 times | Downloaded: 22 times This Workflow has no tags! |
View
Download (v1)
|
|
Original Uploader |
Created: 11/05/10 @ 09:04:30 License: Creative Commons Attribution-No Derivative Works 3.0 Unported License
This process shows how one can use the Remap Binominal operator to define which label value is treated as the positive class.
Rating: 0.0 / 5 (0 ratings) | Versions: 1 | Reviews: 0 | Comments: 0 | Citations: 0 Viewed: 42 times | Downloaded: 36 times This Workflow has no tags! |
View
Download (v1)
|
|
Original Uploader |
Created: 03/05/10 @ 09:47:42 License: Creative Commons Attribution-No Derivative Works 3.0 Unported License
This process uses a Generate Attribute operator for assigning new weights to examples. It uses the if condition of this operator to distinguish between labels.
This can be especially useful if you have a strong bias in your class frequency, which can harm learning. Please note that you must use a learning algorithm that supports weighted examples.
Rating: 0.0 / 5 (0 ratings) | Versions: 1 | Reviews: 0 | Comments: 0 | Citations: 0 Viewed: 105 times | Downloaded: 46 times This Workflow has no tags! |
View
Download (v1)
|
|
Original Uploader |
Created: 29/04/10 @ 16:21:56 License: Creative Commons Attribution-No Derivative Works 3.0 Unported License
This process is based on (artificially generated) data that looks like it has been aggregated before. The integer attribute Qty specifies the quantity of the given item that is represented by the rest of the example.
The process now loops over every example and performs on each example another loop, that will append the current example to a new example set. This example set has been created as empty copy of the original example set, so that the attributes are equally. To get access to and rem...
Rating: 0.0 / 5 (0 ratings) | Versions: 1 | Reviews: 0 | Comments: 0 | Citations: 0 Viewed: 207 times | Downloaded: 267 times Tags (5):
|
View
Download (v1)
|
|
Original Uploader |
Created: 29/04/10 @ 16:07:55 | Last updated: 16/01/12 @ 16:35:12 License: Creative Commons Attribution-No Derivative Works 3.0 Unported License
This process shows how Remeber and Recall operators can be used for passing results from one position to another position in the process, when it's impossible to make a direct connection.
This process introduces another advanced RapidMiner technique: The macro handling. We have used the predefined macro a, accessed by %{a}, that gives the apply count of the operator. So we are remembering each application of the models that are generated in the learning subprocess of the Split validation. Af...
Rating: 0.0 / 5 (0 ratings) | Versions: 1 | Reviews: 0 | Comments: 0 | Citations: 0 Viewed: 154 times | Downloaded: 55 times Tags (5):
|
View
Download (v1)
|
New/Upload
Log in / Register
Need an account?
Click here to register
Popular Tags
25 tags
[All Tags]
Copyright © 2007 - 2011 The University of Manchester and University of Southampton

Log in
Register
Give us Feedback
Invite