CS 686 Big Data

Project Retrospective - P3

Provide answers to the following questions and submit a PDF via Canvas. Be sure to answer the questions completely and explain your logic.

  1. You’ve now had the chance to work with both MapReduce and Spark. In your opinion, what are the pros and cons of each?

  2. Was there something that you expected to be easy to implement in Spark that turned out not to be?

  3. Were there any confusing or surprising aspects of working with Spark? Did you come across some functionality that made your life easier or the computations run faster?

  4. Give a rough estimate of how long you spent completing this assignment. What part of the assignment took the most time?

  5. What did you learn from completing this project?

  6. How did your group split up the workload and coordinate work on the project? Do you prefer group projects or solo projects?

  7. What feedback do you have for the instructor to help make future projects like this better/more fun/interesting/useful? This is the first iteration of the course, so your feedback is greatly appreciated!