Big Data Seminar

Objectives

The Big Data field is emerging as one of the transformative business processes of recent times. By using classic techniques from Business Intelligence & Analytics, along with a new set of tools, it is able to deal with the volume, velocity, and variety associate with big data.

This seminar will build on the understanding of the basic concepts of big data and analytics to provide students the background to succeed in a data centric world, not only from the point of view of the technologies required, but also in terms of management, governance, and organization.

General characterization

Code

2469

Credits

3.5

Responsible teacher

Roberto Henriques

Hours

Weekly - Available soon

Total - Available soon

Teaching language

English

Prerequisites

Available soon

Bibliography

•    Franks, Bill. Taming The Big Data Tidal Wave. 1st ed. New Jersey: Wiley, 2012
•    Morabito, Vincenzo. Big Data and Analytics. 1st ed. Cham: Springer International Publishing, 2015.
•    Pries, Kim H and Robert Dunnigan. Big Data Analytics. 1st ed. CRC Press, 2015.
•    Galit Shmueli, Peter C. Bruce, Inbal Yahav, Nitin R. Patel, Kenneth C. Lichtendahl Jr., Data Mining for Business Analytics: Concepts, Techniques, and Applications in R, Wiley, 2018

Teaching method

Lectures and class discussions (students are expected to actively participate in the discussions).

Case studies.

Team seminar works (group paper projects).

Evaluation method

The Final Exam is mandatory and must cover the entire span of the course. The weight of the final exam should not be less than 30% nor exceed 70%. The remainder of the evaluation can consist of class participation, midterm exams, in class tests, etc. Overall, written in class assessment (final exam, midterm) must have a weight of at least 50%.


Handout 10%, project 30%, final examination 60%.


Subject matter

  • CUC1. Introduction to Big Data, Definitions, Applications, Tools, and Governance;

  • CUC2. Introduction to Hadoop. Understanding the Hadoop Architecture. The Distributed File System (HDFS);

  • CUC3. Understanding MapReduce;

  • CUC4. Introduction to Data analytics;

  • CUC5. Data analytics in a big data environment;

  • CUC6. Introduction to Data Governance: security, privacy, integrity and quality.

Programs

Programs where the course is taught: