In this tutorial, we will install ZooKeeper and Kafka on a Raspberry Pi 3. The same steps also apply to other Linux systems. After the installation, we will verify the setup by sending a message. Shall we start?
In this tutorial, we will install Scala and SBT on a Raspberry Pi 3. Then we will tackle a memory issue and, finally, verify the setup with an example Scala program. Let’s dive in.
Pseudo-distributed mode is a special case of fully distributed mode in which the (single) host is localhost. Note that each Hadoop daemon runs in a separate Java process. Therefore, we need to configure a couple of settings and start the daemons before it works. Let’s see how it can be achieved.
In this tutorial, we will set up Hadoop in standalone mode. Standalone mode is suitable for running MapReduce programs during development, since it makes them easy to test and debug. Note that the default properties are already set for standalone mode and there are no daemons to run. Let’s look at it, shall we?
RDD actions are RDD operations that don’t generate another RDD. Instead, an action returns a value of some other type (such as a List), never an RDD[T]. Unlike transformations, which are lazy and don’t compute right away, an action triggers the computation because it has to return a value. Let’s see how it’s done.