2019-2020 Academic Catalog 
    
    Apr 26, 2024  
2019-2020 Academic Catalog [ARCHIVED CATALOG]

MIS 5600 - Data Structures and Big Data Computing


The astounding growth of data in all aspects of life in the form of emails, weblogs, tweets, sensors, videos and text has necessitated the use of Big Data and advanced analytics techniques to support large scale data analytics. The goal of this course is to enable students to design and build Big Data applications through highly scalable systems capable of collecting, processing, storing and analyzing large volumes of structured and unstructured data. By extending the Cross Industry Standard Process for Data Mining (CRISP-DM) to build Big Data applications using distributed and parallel computing architecture, this course brings together key Big Data tools on Hadoop platforms such as pig, hive, R-Hadoop, flume, and SQL-MR. Students will learn how to efficiently manage data with three main characteristics: volume, velocity and variety. Topics include the Hadoop platforms such as Cloudera Hadoop, Teradata Aster and IBM Infosphere Streams, and analytics techniques such as social media analytics, link analysis, and stream analytics. 

Prerequisites: MIS 4550    and MIS 5501  

Anticipated Terms Offered: Offered annually