Note: This is not intended as an endorsement of either the performance nor the cost-effectiveness of using EMR to back your OpenTSDB deployment. Rather, there are definite use cases for being able to quickly bring up a cluster and install OpenTSDB on it, so if you have one of those, this should get it done.
- ssh to the master server in your cluster. If you took the defaults when creating your cluster, you can identify the master in the console by the name of the security group, which will be either ElasticMapReduce-Master or ElasticMapReduce-Slave:
ssh -i nameOfYourKeyPair.pem hadoop@emrMasterPublicDnsName - telnet to the local zookeeper port to make sure zookeeper is installed and running, e.g.:
telnet localhost 2181and you should see:
Trying 127.0.0.1...Connected to localhost.Escape character is '^]'.which means your telnet was successful. Now enter:
stats
and you should see:
Zookeeper version: 3.4.5-1392090, built 09/30/12 17:52 GMTClients:/10.XXX.XXX.XXX:50333[1](queued=0,recved=4048,sent=4048)/10.XXX.XXX.XXX:50360[1](queued=0,recved=4052,sent=4052)/127.0.0.1:37169[0](queued=0,recved=1,sent=0)/10.XXX.XXX.XXX:50343[1](queued=0,recved=12209,sent=12212)/10.XXX.XXX.XXX:50331[1](queued=0,recved=4053,sent=4054)/10.XXX.XXX.XXX:45165[1](queued=0,recved=12174,sent=12175)/10.XXX.XXX.XXX:45160[1](queued=0,recved=4079,sent=4087)/10.XXX.XXX.XXX:50350[1](queued=0,recved=4259,sent=4260)/10.XXX.XXX.XXX:45168[1](queued=0,recved=12174,sent=12175)/10.XXX.XXX.XXX:36065[1](queued=0,recved=4057,sent=4057)/10.XXX.XXX.XXX:50336[1](queued=0,recved=8114,sent=8114)/10.XXX.XXX.XXX:50332[1](queued=0,recved=4051,sent=4051)/10.XXX.XXX.XXX:50335[1](queued=0,recved=4140,sent=4153)/10.XXX.XXX.XXX:50337[1](queued=0,recved=20258,sent=24309)/10.XXX.XXX.XXX:36062[1](queued=0,recved=4077,sent=4085)Latency min/avg/max: 0/1/206Received: 101746Sent: 105832Connections: 15Outstanding: 0Zxid: 0x1ff1Mode: standaloneNode count: 34Connection closed by foreign host.
So far, so good. - Install git:
sudo yum install git - In the directory above the location you'd like to have the OpenTSDB repo live, run:
git clone https://github.com/OpenTSDB/opentsdb.git - To build OpenTSDB, we need to add some more basic dev tools to the base system. The following command is the nuclear option for adding dev tools:
sudo yum groupinstall 'Development Tools' - And we also need gnuplot:
sudo yum install gnuplot - Change directory into the directory containing OpenTSDB:
cd opentsdb - Pull all of the branches from github:
git fetch - Checkout "next":
git checkout next - Now you've got the right code and the right tools. Let 'er rip!:
./build.sh - When complete, change directory into the build directory:
cd ./build - To verify success, you're looking for the .jar file created by the build process. In this case, we've built a file named tsdb-2.0.0.jar and a script named tsdb.
ls tsdb* - And with that, we now have OpenTSDB built and ready to install. To install it, run:
sudo make install
- Now we'll create the required tables in HBase (with compression). The good folks at OpenTSDB made this easy by providing a script that does the heavy lifting for us. By default, this script will enable compression for your HBase tables. First we'll change directory to the location of the script, then execute it:
cd ../src
./create_table.sh
- With the tables created, we can configure the OpenTSDB process by editing the opentsdb.conf file and moving it into a place that the process can find it:
vi ./opentsdb.conf
and give appropriate values to the following variables (safe recommendations noted in italics, but you should provide answers appropriate for your system/configuration):
tsd.http.cachedir = /dev/shm/tsdtsd.http.staticroot = /home/hadoop/opentsdb/tsd.storage.hbase.zk_quorum = localhost
and move the file to a recognized configuration directory:
sudo mv ./opentsdb.conf /etc
- Now that you're fully configured, you can start OpenTSDB:
cd ..
./build/tsdb tsd
- You now have OpenTSDB running and receiving requests (both telnet and HTTP!) on port 4242 (assuming you took the defaults)
Excellent post, thank you. To help others, I'll add that you will probably need to create the directory in /dev/shm and open up the permissions. Also, even though the default port is 4242, you might need to set it in the openstdb.conf file anyway.
ReplyDeleteJeff G