CS 686 Big Data

Course Schedule

The following is a tentative schedule for the course (subject to change).

Date Topic Readings/Materials
1 Aug 23 - 25 Introduction to Big Data
2 Aug 28 - Sep 1 Communication and Network Design Protocol Buffers
3 Sep 6 - 8 HDFS Messaging and Serialization
4 Sep 11 - 15 Distributed Hash Tables DHT Fault Tolerance, Data Models
5 Sep 18 - 22 DFS Test Cases Consistency
6 Sep 25 - 29 Byzantine Fault Tolerance Spanner
7 Oct 2 - 6 Proof-of-work and Bitcoin Big Data Programming Models
8 Oct 9 - 11 MapReduce
9 Oct 18 - 20 MapReduce Discussion
10 Oct 23 - 27 Spatiotemporal Data Storm
11 Oct 30 - Nov 3 Hadoop Tips
12 Nov 6 - 10 Counting Streams Summarizing and Sampling Spark Discussion
  • Research Paper 6: Spark
    (Due 11/9)
13 Nov 13 - 17 Spark Setup
  • Research Paper 7: RDDs
    (Due 11/16)
14 Nov 20 - 22 Project 3
15 Nov 27 - Dec 1 Machine Learning with Spark
16 Dec 4 - 6 Final Project Wrap-up