CS 677 is focused on building and leveraging distributed systems to analyze large datasets. The course will consist of large programming assignments and you will also be required to submit written reports on assigned readings from the literature.

Each assignment will include a detailed specification document with a description of the problem, breakdown of points, permitted libraries, etc. You are free to discuss the projects with your classmates, but sharing code or pseudocode is not acceptable. Please see the grading policy for more information.

Submitting Assignments: use the project links below to create a git repository for your work. To submit, check your code into your git repository before the deadline.

Late Policy:

Research Papers

Presentation Order, Fall 2022:

  1. HDFS – Matthew Malensek
  2. The rest: to be announced!