Big Data Analytics
Prerequisites: At least a C in the following courses (COMP 405 or COMP 453) AND (COMP 406 or COMP 479 or STAT 408 or equivalent).
In this course, large data sets will be leveraged to solve challenging analytics problems. With more samples, analytics can use more complex learning models to automate more feature combinations for more robust model tuning, selection, and validation. Parallel, distributed processing will be performed with Apache Spark and Hadoop.
Outcomes: Python or R will be used with parallel frameworks to perform proper model selection when testing large combinations of features, models, hyperparameters, and ensembles, with additional emphasis on deep learning.
In this course, large data sets will be leveraged to solve challenging analytics problems. With more samples, analytics can use more complex learning models to automate more feature combinations for more robust model tuning, selection, and validation. Parallel, distributed processing will be performed with Apache Spark and Hadoop.
Outcomes: Python or R will be used with parallel frameworks to perform proper model selection when testing large combinations of features, models, hyperparameters, and ensembles, with additional emphasis on deep learning.