CS 686 Big Data

Project Retrospective - P2

Provide answers to the following questions and submit a PDF via Canvas. Be sure to answer the questions completely and explain your logic.

  1. During analysis, were there any surprises you came across in the dataset or in how MapReduce works? Were there any features that you thought may be useful but ended up not helping your analysis?

  2. This project was a bit biased towards spatial analysis rather than temporal. Design a question that focuses on the timing of events.

  3. Did you use custom writables or implement any Hadoop-specific interfaces?

  4. If you had more time to complete the project, what would you improve in your design?

  5. Give a rough estimate of how long you spent completing this assignment. What part of the assignment took the most time?

  6. What did you learn from completing this project?

  7. What feedback do you have for the instructor to help make future projects like this better/more fun/interesting/useful? This is the first iteration of the course, so your feedback is greatly appreciated!