|
Feb 01, 2025
|
|
|
|
Graduate Record 2024-2025
|
DS 7200 - Computation III - Distributed Computing Effective Date 08/15/2023 Learning tools and concepts for computing on big data. Learn how to use Spark for large-scale analytics and machine learning. Spark is an open-source, general-purpose computing framework that is scalable and blazingly fast. Fundamental data types and concepts will be covered (e.g., resilient distributed datasets, DataFrames) along with Tools for data processing, storage, and retrieval, including Amazon Web Services (AWS).
Credits: 3 Grading Basis Student Option Requisites Must be a Data Science PhD student
|
|