How to install Zookeeper and Kafka in Raspberry Pi 3

In this tutorial, we will install Zookeeper and Kafka on a Raspberry Pi 3. The same steps also apply to other Linux systems. After the installation, we will verify the setup by sending a message. Shall we start?


How to install Scala and SBT in Raspberry Pi 3

In this tutorial, we will install Scala and SBT on a Raspberry Pi 3. We will then tackle a memory issue and, finally, verify the setup with an example Scala program. Let’s dive in.


How to setup Hadoop for Pseudo-distributed Mode

Pseudo-distributed mode is a special case of fully distributed mode in which the (single) host is localhost. Note that each Hadoop daemon runs in a separate Java process. Therefore, we need to configure a couple of settings and start the daemons before we can get it working. Let’s see how this can be achieved.


How to setup Hadoop for Standalone Mode

In this tutorial, we will set up Hadoop for standalone mode. Standalone mode is suitable for running MapReduce programs during development, since it is easy to test and debug them. Note that the default properties are already set for standalone mode and there are no daemons to run. Let’s look at it, shall we?


Apache Spark- RDD Actions

RDD actions are RDD operations that don’t generate another RDD. Instead, an RDD action returns a value of some other type (such as List()), never an RDD[T]. This sets actions apart from transformations, which are lazy: transformations don’t compute right away, only when an action needs to return a value. Let’s see how it’s done.
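
To make the lazy-versus-action distinction concrete, here is a minimal sketch in plain Python. It does not use Spark: `ToyRDD` and its methods are hypothetical stand-ins that only mimic the behavior described above, where `map` records work to be done and `collect` or `count` actually runs it.

```python
# Toy stand-in for an RDD. Hypothetical class for illustration, NOT Spark's API.
class ToyRDD:
    def __init__(self, source, log):
        self._source = source  # zero-argument callable that returns an iterator
        self._log = log        # shared list recording when elements are computed

    @classmethod
    def parallelize(cls, items, log):
        return cls(lambda: iter(items), log)

    def map(self, fn):
        # Transformation: returns a new ToyRDD and computes nothing yet.
        def compute():
            for x in self._source():
                self._log.append(x)  # recorded only when the pipeline runs
                yield fn(x)
        return ToyRDD(compute, self._log)

    def collect(self):
        # Action: runs the pipeline and returns a plain list, not a ToyRDD.
        return list(self._source())

    def count(self):
        # Action: runs the pipeline and returns an int.
        return sum(1 for _ in self._source())


log = []
squares = ToyRDD.parallelize([1, 2, 3], log).map(lambda x: x * x)
computed_before_action = list(log)  # snapshot taken before any action is called
result = squares.collect()          # the action triggers the computation
```

In this sketch, nothing is appended to `log` until `collect()` is called, and the value that comes back is an ordinary Python list rather than another `ToyRDD`.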