What’s New in RapidMiner Radoop 8.0?

This page describes the new features of RapidMiner Radoop 8.0, as well as its enhancements and bug fixes. Note that this is the first version compatible with the RapidMiner Server 8.0 release. If you upgrade Server, you must also upgrade Radoop.

Clustering based on Spark MLlib

We have updated the K-Means clustering operator in Radoop to use Spark MLlib's algorithm instead of Mahout's. Because Mahout is no longer an active project, we have decided to deprecate our Mahout-based clustering operators (including the K-Means algorithms and Canopy) and replace them with new, more efficient Spark equivalents.

Cloudera Spark library upgrade

We keep up with changes in the major Hadoop distributions. This release adds a new option to support the Cloudera Spark library, so you don’t need to worry about technology upgrades or changes.

Support for Hive access in Spark Script

HiveContext gives a Spark script access to HiveQL, so you can use Hive’s query language and functionality from within the script. This makes Spark Script a more flexible and easier-to-use scripting tool. Note that some cluster-side configuration may be needed to use HiveContext.
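As a rough illustration of running HiveQL through a HiveContext, here is a minimal PySpark sketch. It is not taken from the Radoop documentation: the helper names (build_hiveql, run_hiveql, demo), the table name sales, and the columns category and amount are all hypothetical, and how the Spark Script operator hands a context to your script is not shown here. The only Spark calls assumed are the HiveContext(sc) constructor and its sql() method from the Spark 1.x/2.x Python API.

```python
def build_hiveql(table, min_value):
    """Build a HiveQL aggregation query.

    The table and column names are hypothetical placeholders.
    """
    return (
        "SELECT category, COUNT(*) AS n "
        "FROM {table} WHERE amount >= {min_value} "
        "GROUP BY category"
    ).format(table=table, min_value=float(min_value))


def run_hiveql(hive_context, query):
    """Run a HiveQL string through the given context.

    HiveContext.sql() returns a Spark DataFrame with the query result.
    """
    return hive_context.sql(query)


def demo():
    """Standalone usage; needs a Spark installation with Hive support."""
    # Imported lazily so the helpers above can be used without Spark.
    from pyspark import SparkContext
    from pyspark.sql import HiveContext

    sc = SparkContext(appName="hivecontext-sketch")
    hive_context = HiveContext(sc)
    result = run_hiveql(hive_context, build_hiveql("sales", 100))
    result.show()  # print the aggregated rows
    sc.stop()
```

Calling demo() on a cluster where the hypothetical sales table exists would print one row per category. Keeping query construction separate from execution makes the HiveQL string easy to inspect before it is sent to the cluster.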

Enhancements and bug fixes

The following pages describe the enhancements and bug fixes in RapidMiner Radoop 8.0 releases: