2023-2024 Academic Catalog 
    
    Dec 21, 2024  
2023-2024 Academic Catalog [ARCHIVED CATALOG]

BAN 5600 - Advanced Big Data Computing and Programming


The astounding growth of data in all aspects of life in the form of emails, weblogs, tweets, sensors, videos, and text has necessitated the use of Big Data and advanced analytics techniques to support large-scale data analytics. The goal of this course is to enable students to design and build Big Data applications through highly scalable systems capable of collecting, processing, storing, and analyzing large volumes of structured and unstructured data.

 

By extending the Cross-Industry Standard Process for Data Mining (CRISP-DM) to build Big Data applications using distributed and parallel computing architecture, this course brings together key Big Data tools on Hadoop Ecosystem (such as Pig, Hive, Flume, Sqoop, and Spark). Students will learn how to efficiently manage and analyze data with three main characteristics: high volume, high velocity, and high variety.

Topics include the Hadoop Ecosystem platforms such as Hortonworks Sandbox, Amazon AWS, and Databricks; and advanced analytics techniques such as Visualization, Natural Language Processing, and streaming analytics.

 

MSF students can take this as an elective with professor permission.

Prerequisites: BAN 4550   OR MSIT 3090 & BAN 5501  

Anticipated Terms Offered: Annually