You are viewing the RapidMiner Radoop documentation for version 2024.0 - Check here for latest version
Radoop: Big Data Predictive Analytics
Radoop provides an easy-to-use graphical interface for analyzing data on a Hadoop cluster with a running Hive server. This introduction provides a quick description of the software and the capabilities of the solution for processing and analyzing big data.
Understanding the basic architecture
Radoop is client software with an intuitive graphical user interface. Radoop requires your Hadoop cluster to be accessible from the client running Altair AI Studio (and Altair AI Hub, if applicable). The diagram below shows the basic architecture of the Radoop solution on Altair AI Studio:
You can also use Radoop on Altair AI Hub for scheduling and managing client-created processes, as well as for collaboration and as a web reporting interface. The diagram below incorporates Altair AI Hub to show the basic architecture of the complete solution:
Documentation overview
This document, Radoop Overview, provides some background and resource material for using Radoop. It assumes that you are already familiar with using Altair AI Studio.
The document provides:
a quick overview of Radoop, including a description of Radoop operators and the Hadoop Data view which allows you to easily manage your data on the cluster.
a guide to importing your data if it is not already in a Hive structure on the Hadoop cluster.
explanation of more advanced features, including designing data mining processes, scoring, evaluation, and advanced data flow design.
discussion of how to modify the settings that influence Radoop.