2020-2021 Academic Catalog 
    
    Apr 18, 2024  
2020-2021 Academic Catalog [ARCHIVED CATALOG]

BAN 5600 - Advanced Big Data Computing and Programming


The astounding growth of data in all aspects of life in the form of emails, weblogs, tweets, sensors, videos and text has necessitated the use of Big Data and advanced analytics techniques to support large scale data analytics. The goal of this course is to enable students to design and build Big Data applications through highly scalable systems capable of collecting, processing, storing and analyzing large volumes of structured and unstructured data. By extending the Cross Industry Standard Process for Data Mining (CRISP-DM) to build Big Data applications using distributed and parallel computing architecture, this course brings together key Big Data tools on Hadoop Ecosystem (such as pig, hive, Flume, Sqoop, and Hue) and Spark. Students will learn how to efficiently manage data with three main characteristics: volume, velocity and variety. Topics include the Hadoop platforms such as Cloudera, Teradata Aster and IBM Infosphere Streams, and analytics techniques such as social media analytics, link analysis, and stream analytics.

 

Previously Titled “MIS 5600 Data Structures and Big Data Computing”

Prerequisites: BAN 4550    and BAN 5501  

Anticipated Terms Offered: Offered annually