As data volumes have grown, so have the rate at which data must be processed and the complexity of the demands placed on it. Traditional tools can no longer store and process data at this scale: a single computer is limited by its I/O, CPU, and RAM. This is where a new generation of tools that run across multiple computers is required.
This is a very hands-on course that will take you from the basics to an advanced level in Big Data analysis and stream processing using Apache Spark. Apache Spark is widely regarded as one of the fastest and most efficient distributed computing tools. We will start with the basics of Big Data, study the architecture of Apache Spark, and then solve problems with it.