Apr 26, 2024
2019-2020 Academic Catalog [ARCHIVED CATALOG]
MIS 5600 - Data Structures and Big Data Computing
The astounding growth of data in all aspects of life, in the form of emails, weblogs, tweets, sensor readings, videos and text, has necessitated the use of Big Data and advanced analytics techniques to support large-scale data analytics. The goal of this course is to enable students to design and build Big Data applications on highly scalable systems capable of collecting, processing, storing and analyzing large volumes of structured and unstructured data. By extending the Cross Industry Standard Process for Data Mining (CRISP-DM) to Big Data applications built on distributed and parallel computing architectures, this course brings together key Big Data tools on Hadoop platforms, such as Pig, Hive, RHadoop, Flume, and SQL-MR. Students will learn how to efficiently manage data with three main characteristics: volume, velocity and variety. Topics include platforms such as Cloudera Hadoop, Teradata Aster and IBM InfoSphere Streams, and analytics techniques such as social media analytics, link analysis, and stream analytics.
Prerequisites: MIS 4550 and MIS 5501
Anticipated Terms Offered: Offered annually