Feb 01, 2025  
Graduate Record 2024-2025 
    
Graduate Record 2024-2025

DS 7200 - Computation III - Distributed Computing


Effective Date 08/15/2023
Learning tools and concepts for computing on big data. Learn how to use Spark for large-scale analytics and machine learning. Spark is an open-source, general-purpose computing framework that is scalable and blazingly fast. Fundamental data types and concepts will be covered (e.g., resilient distributed datasets, DataFrames) along with Tools for data processing, storage, and retrieval, including Amazon Web Services (AWS).

Credits: 3
Grading Basis Student Option
Requisites Must be a Data Science PhD student