CSEE 4121 Final Exam Study Outline



Treat this guide as a summary and outline of the material covered on the final exam. A deeper understanding of recurring systems themes will improve your performance. Case studies, whether mentioned in class or found elsewhere, generally help internalize the concepts.

Students are expected to know basic SQL and Python in order to understand and interpret the code used in scenario-based questions. Completing Homework 3 and the in-class workshops should help.


1. Batch Processing at Scale

1.1 MapReduce

1.2 Spark

2. Stream Processing

3. Security Foundations

3.1 Goals and Adversaries

3.2 Cryptographic Primitives

3.3 Compliance

4. Attacks on Data Systems

4.1 Injection

4.2 Credentials, Phishing, and Denial of Service

5. Authentication and Authorization

6. The Equifax Breach

7. Access Control in Databases

8. Anonymization and Privacy Models

9. Systems for Machine Learning

9.1 Single Node

9.1.1 Landscape

9.1.2 Hardware

9.1.3 Training vs. Inference

9.2 Distributed ML

9.2.1 Training

9.2.2 Inference

9.3 Open Challenges

9.3.1 Testing and Debugging

9.3.2 Versioning and Rollback

9.3.3 Heterogeneous Deployment

9.3.4 Privacy

10. RAG and Vector Databases


Good luck with the exam!