2017-2018 Academic Catalog 
    
    Apr 23, 2024  
2017-2018 Academic Catalog [ARCHIVED CATALOG]

MIS 5600 - Data Structures and Big Data Computing


The astounding growth of data in all aspects of life in the form of emails, weblogs, tweets, sensors, videos and text has necessitated the use of Big Data and advanced analytics techniques to support large scale data analytics. The goal of this course is to enable students to design and build Big Data applications through highly scalable systems capable of collecting, processing, storing and analyzing large volumes of structured and unstructured data. By extending the Cross Industry Standard Process for Data Mining (CRISP-DM) to build Big Data applications using distributed and parallel computing architecture, this course brings together key Big Data tools on Hadoop platforms such as Pig, Hive, R-Hadoop, Flume, Spark and SQL-MapReduce. Students will learn how to efficiently manage data with three main characteristics: volume, velocity and variety. Topics include the Hadoop platforms such as Cloudera Hadoop, Teradata Aster and IBM Infosphere Streams, and advanced analytics techniques such as social media analytics, link analysis, and stream analytics.

Prerequisites: MIS4550  and MIS 5501  

Anticipated Terms Offered: Offered annually