Merge (RapidMiner Studio Core)

Synopsis

This operator merges two nominal values of the specified regular attribute.

Description

The Merge operator is used for merging two nominal values of the specified attribute of the input ExampleSet. Please note that this operator can merge only the values of regular attributes. The required regular attribute is specified using the attribute name parameter. The first value parameter is used for specifying the first value to be merged. The second value parameter is used for specifying the second value to be merged. The two values are merged in 'first_second' format where first is the value of the first value parameter and second is the value of the second value parameter. It is not compulsory for the first value and second value parameters to have values from the range of possible values of the selected attribute. However, at least one of the first value and second value parameters should have a value from the range of possible values of the selected attribute. Otherwise this operator will have no affect on the input ExampleSet.

Input

  • example set input (IOObject)

    This input port expects an ExampleSet. It is output of the Retrieve operator in the attached Example Process.

Output

  • example set output (IOObject)

    The ExampleSet with the merged attribute values is output of this port.

  • original (IOObject)

    The ExampleSet that was given as input is passed without changing to the output through this port. This is usually used to reuse the same ExampleSet in further operators or to view the ExampleSet in the Results Workspace.

Parameters

  • attribute_nameThe required nominal attribute whose values are to be merged is selected through this parameter. This operator can be applied only on regular attributes. Range: string
  • first_value This parameter is used for specifying the first value to be merged. It is not compulsory for the first value parameter to have a value from the range of possible values of the selected attribute. Range: string
  • second_valueThis parameter is used for specifying the second value to be merged. It is not compulsory for the second value parameter to have a value from the range of possible values of the selected attribute. Range: string

Tutorial Processes

Introduction to the Merge operator

The Golf data set is loaded using the Retrieve operator. The Merge operator is applied on it. The attribute name parameter is set to 'Outlook'. The first value parameter is set to 'sunny' and the second value parameter is set to 'hot'. All the occurrences of value 'sunny' are replaced by 'sunny_hot' in the Outlook attribute of the resultant ExampleSet. Now set the value of the second value parameter to 'rain' and run the process again. As 'rain' is also a possible value of the Outlook attribute, all occurrences of 'sunny' and 'rain' in the Outlook attribute are replaced by 'sunny_rain' in the resultant ExampleSet. This Example Process is just to explain basic working of the Merge operator.