Finally I managed to install Hadoop and start it correctly. Although at first I plan to setup one master node and three slaves as the data nodes. At last there’s only one master node and one data node running on the same machine.
I’m using Ubuntu 14.04.1 and Hadoop 2.5.2. Actually all went smoothly at first, as there’re lots of tutorials and articles about this, for example this one here. The main problem which really bothered me is that I’m using the 64-bit Ubuntu system, while the Hadoop installation only contain 32-bit libraries.
There’s only a warning saying:
1 | ...WARN util.NativeCodeLoader: Unable to load native-hadoop library for your platform... using builtin-java classes where applicable |
But actually, this caused the data nodes not really started for me. After running start-all.sh the status report shows there’s no data node running and the capacity is all 0.
1 | $ jps |
The problem is solved after building the 64-bit library following the instructions here.
Next, I’d like to still setup the three slave machines as planned. And then get something to run on it.