You are viewing the RapidMiner Deployment documentation for version 9.6.
Deployment Templates for Production, with easy Hadoop connectivity
This template closely resembles the basic production template. It is relevant when you want to deploy RapidMiner processes that use big data from a Hadoop cluster through RapidMiner Radoop. The Radoop Proxy component simplifies network configuration when the Hadoop cluster is behind a firewall.
This deployment consists of the following components:
- 1 RapidMiner Server instance
- 3 RapidMiner Job Agents
- Postgres database
- Python Environment Manager
- Radoop Proxy
If you plan to deploy the RapidMiner platform to a single, large (physical or virtual) machine, we recommend the docker-compose-based template for the sake of simplicity.
If you plan to deploy the RapidMiner platform to multiple (physical or virtual) machines, we recommend the Kubernetes-based template. We support the Kubernetes services of the popular public cloud vendors, as well as your own Kubernetes cluster.
System requirements
Minimum recommended hardware configuration
Each virtual or physical machine should have at least:
- Quad-core CPU
- 16GB RAM
- >10GB free disk space
The amount of memory needed depends heavily on how much data the Server processes. If most or all of the data is processed in the Hadoop environment using Radoop, 16GB is enough for the Server. If non-Radoop processes also run on the Server, we recommend increasing the memory to 32GB or more, depending on the size of the user data.
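The sizing guidance above can be turned into a quick pre-flight check. The following Python sketch is not part of RapidMiner; the function and constant names are illustrative assumptions, and the thresholds are simply the figures listed above. It treats 32GB as the lower bound for Servers that run non-Radoop processes, although larger data sets may need more. Memory detection relies on os.sysconf, which is assumed to be available (Linux and other POSIX systems).

```python
import os
import shutil

# Thresholds copied from the requirements above; the names are illustrative.
MIN_CORES = 4
MIN_FREE_DISK_GB = 10       # the requirement is strictly more than this
RAM_GB_RADOOP_ONLY = 16     # data is processed mostly in Hadoop via Radoop
RAM_GB_NON_RADOOP = 32      # lower bound; large user data may need more


def required_ram_gb(runs_non_radoop_processes):
    """Return the recommended minimum Server memory in GB for the workload."""
    return RAM_GB_NON_RADOOP if runs_non_radoop_processes else RAM_GB_RADOOP_ONLY


def check_machine(cores, ram_gb, free_disk_gb, runs_non_radoop_processes=False):
    """Return a list of unmet requirements; an empty list means all are met."""
    problems = []
    if cores < MIN_CORES:
        problems.append(f"CPU cores: {cores} found, {MIN_CORES} needed")
    needed_ram = required_ram_gb(runs_non_radoop_processes)
    if ram_gb < needed_ram:
        problems.append(f"RAM: {ram_gb:.1f}GB found, {needed_ram}GB needed")
    if free_disk_gb <= MIN_FREE_DISK_GB:
        problems.append(
            f"Free disk: {free_disk_gb:.1f}GB found, more than {MIN_FREE_DISK_GB}GB needed"
        )
    return problems


def detect_ram_gb():
    """Total physical memory in GB (assumes a POSIX system with os.sysconf)."""
    return os.sysconf("SC_PAGE_SIZE") * os.sysconf("SC_PHYS_PAGES") / 1024**3


if __name__ == "__main__":
    detected_cores = os.cpu_count() or 0
    free_gb = shutil.disk_usage("/").free / 1024**3
    for problem in check_machine(detected_cores, detect_ram_gb(), free_gb):
        print(problem)
```

A machine sold as 16GB usually reports slightly less usable memory to the operating system, so a small tolerance may be appropriate when comparing detected values against these thresholds.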