The world of Hadoop and "Big Data" has hundreds of different technologies with cryptic names that form the Hadoop ecosystem. With this course, you'll not only understand what those systems are and how they fit together, but you'll also go hands-on and learn how to use them to solve real business problems. This course is comprehensive, covering over 25 different technologies in over 14 hours of video lectures. It's filled with hands-on activities and exercises, so you get some real experience in using Hadoop; it's not just theory. By the end, you can expect to leave the course with a real, deep understanding of Hadoop and its associated distributed systems, and to be able to apply Hadoop to real-world problems.
Our final project for this course is described in the videos: imagine you work for some big website, and your manager wants a graph of the total number of sessions on your website per day. For some reason, they don't want to use Google Analytics or some other existing service, so you need to build your own!
The requirements are:
How would you design a system using the tools you've learned about in this course to meet this demand? The hard part is maintaining session data as website hits come in.
There is no single correct answer, but you will see our approach at the end of the course.
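To make the "hard part" concrete, here is a minimal single-machine sketch of session counting in plain Python. It is not the course's solution, only one way to reason about the logic before choosing distributed tools. It assumes each hit is a (visitor ID, timestamp) pair, that a session ends after 30 minutes of inactivity, and that a session is counted on the day it starts. None of these rules come from the project description, and the name `sessions_per_day` is hypothetical.

```python
from collections import Counter
from datetime import datetime, timedelta

# Assumption: a session ends after 30 minutes without a hit from the
# same visitor. The project description does not specify this value.
SESSION_TIMEOUT = timedelta(minutes=30)


def sessions_per_day(hits, timeout=SESSION_TIMEOUT):
    """Count sessions per calendar day.

    hits: iterable of (visitor_id, datetime) pairs, in any order.
    Returns a Counter mapping date -> number of sessions started that day.
    """
    last_seen = {}      # visitor_id -> timestamp of that visitor's latest hit
    counts = Counter()  # date -> sessions started on that date

    # Process hits in time order so gaps between hits are meaningful.
    for visitor_id, ts in sorted(hits, key=lambda h: h[1]):
        previous = last_seen.get(visitor_id)
        # A new session starts on a visitor's first hit, or after a gap
        # longer than the timeout.
        if previous is None or ts - previous > timeout:
            counts[ts.date()] += 1
        last_seen[visitor_id] = ts
    return counts


# Hypothetical sample data for illustration only.
hits = [
    ("10.0.0.1", datetime(2024, 1, 1, 9, 0)),
    ("10.0.0.1", datetime(2024, 1, 1, 9, 10)),   # same session
    ("10.0.0.1", datetime(2024, 1, 1, 11, 0)),   # new session after a long gap
    ("10.0.0.2", datetime(2024, 1, 1, 23, 50)),
    ("10.0.0.2", datetime(2024, 1, 2, 0, 5)),    # same session, spans midnight
    ("10.0.0.2", datetime(2024, 1, 2, 8, 0)),    # new session
]
daily = sessions_per_day(hits)
for day, total in sorted(daily.items()):
    print(day, total)
```

The `last_seen` dictionary is the session state that has to be maintained as hits come in. On a single machine it fits in memory; at website scale, deciding where that state lives and how it is updated is the core design question of the project.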