Stop Accumulo
I am able to stop the entire cluster by running the stop-all.sh script inside the Accumulo home folder's "bin" directory.
Increase JVM Heap Space to accommodate larger Index Cache
The Tablet server heap space is defined in the file "accumulo-env.sh" located in the Accumulo home folder's "conf" directory. Inside this folder you can see the settings for tablet server Xmx and Xms at the bottom defined as an environment variable, "$ACCUMULO_TSERVER_OPTS". Depending on how much memory is available you will want to increase this value to support the increase we will make to the index cache next. Here is my setting:
ACCUMULO_TSERVER_OPTS="${POLICY} -Xmx1024m -Xms512m "
Increase Index Cache
In the Accumulo home folder's "conf" directory, you should also see a file called "accumulo-site.xml". Here you can define properties for the Accumulo cluster. I have set the cache.index.size to 512M:
I have not had any issues with tablet server memory yet, so I believe this is a good fix. Please provide feedback and comments below.
There are various other performance tweaks as well. Such as NOT using LVM with CentOS/RHEL, and ensuring any virtual machines in the cluster are running with "Independent Disk Mode" so writes are flushed straight to disk.
Nice description. Thanks. Love to see others in VA using Accumulo.
ReplyDeleteMe too! Accumulo seems to really have gained some traction in the big data game. Last time I checked, Cloudera was planning to implement an Accumulo role into their CDH5 stack. I believe it is currently in beta.
Delete