Tasks and conclusion
Post-training tasks:
- Try setting up your own three-node Hadoop cluster.
- A VM-based solution can be found here
- Write a simple Spark/MR job of your choice and understand how to generate analytics from data.
- Sample dataset can be found here