CS 677 Big Data

Course Schedule

The following is a tentative schedule for the course (subject to change).

Week Topic Materials
1 Aug 20 - 24

Introduction to Big Data

Paper Evaluations and Data Sources

2 Aug 27 - 31

Scaling Out

HDFS (and Hadoop) Setup

3 Sep 3 - 7

Network Design, Serialization

HDFS Discussion

4 Sep 10 - 14

Distributed Hash Tables

Data Models

5 Sep 17 - 21

Project 1 Checkpoint

PolarFS Discussion

6 Sep 24 - 28

Tues: Quiz 2

Thurs: Project / HDFS

7 Oct 1 - 5

Failing Gracefully

Proof of Work

8 Oct 8 - 12

Guest Speaker: David Guy Brizan
Demographic Identification (10/9)

Project / HDFS (10/11)

9 Oct 15 - 19

MapReduce

10 Oct 22 - 26

Sampling

Flink

11 Oct 29 - Nov 2

MR Tips

Thurs: Project Work Day

12 Nov 5 - 9

Counting Streams

PageRank

13 Nov 12 - 16

Sketching and Summarization

Spatiotemporal Analysis

14 Nov 19 - 23

Classes Cancelled 11/20 and no class 11/22, Happy Thanksgiving!

15 Nov 26 - 30

Distributed Machine Learning

16 Dec 3 - 7

Machine Learning Frameworks

17 Dec 10 - 14

Finals Week (No Class)