Big Data Analytics
Objectives
Introductory knowledge in Python or any other programming language.
Familiarity with structured databases and SQL.
General characterization
Code
200167
Credits
7.5
Responsible teacher
Flávio Luís Portas Pinheiro
Hours
Weekly - Available soon
Total - Available soon
Teaching language
Portuguese. If there are Erasmus students, classes will be taught in English
Prerequisites
Week |
Class |
Topics |
|
1 |
Lecture |
|
|
Lab |
|
|
|
2 |
Lecture |
|
|
Lab |
|
|
|
3 |
Lecture |
|
|
Lab |
|
|
|
4 |
Lecture |
|
|
Lab |
|
|
|
5 |
Lecture |
|
|
Lab |
|
|
|
6 |
Lecture |
|
|
Lab |
|
|
|
7 |
Lecture |
|
|
Lab |
|
|
|
8 |
Lecture |
|
|
Lab |
|
|
|
9 |
Lecture |
|
|
Lab |
|
|
|
10 |
Lecture |
|
|
Lab |
|
|
|
11 |
Lecture |
|
|
Lab |
|
||
12 |
Lecture |
|
|
Lab |
|
||
13 |
Lecture |
|
|
Lab |
|||
14 |
Lecture |
|
|
Lab |
Bibliography
- White, Tom. Hadoop: The definitive guide. " O'Reilly Media, Inc.", 2012;
- Karau, Holden, et al. Learning spark: lightning-fast big data analysis. " O'Reilly Media, Inc.", 2015
- White, Tom. Hadoop: The definitive guide. " O'Reilly Media, Inc.", 2012;
- Capriolo, Edward, Dean Wampler, and Jason Rutherglen. Programming Hive: Data warehouse and query language for Hadoop. " O'Reilly Media, Inc.", 2012;
- Additionally, students will find selected book chapters and articles in the Moodle Page of the course.
Teaching method
1)Midterm (30%): During the last Lecture of the semester students will have 90 minutes answer a sert of multiple-choice questions and open questions that cover all the material discussed during the Lectures;
2)Practical Examination (30%): During the last lab of the semester students will have two hours to write a pySpark program that solves an exercise provided by the Instructors. Rules
- You cannot access the internet during the practical examination;
- You can bring any physical support material that you deem relevant (cheat sheet, prints of book chapters, etc¿)
- You are not allowed to bring supporting material through any other means (e.g., pen drives, Kindle, Tablet, etc ¿ )
- Smartphones need to be turned off during the practical examination;
- Students that break the rules will get zero points in this evaluation element.
3)Final Exam (40%): Consists in a mix of Multiple-Choice and Open questions covering all the material of the course.
Evaluation method
Inglês
Subject matter
The Big Data curricular has a duration of 14 weeks and it will be based in a system of weekly Lectures or Labs. Lectures will focus in the main theoretical concepts, while labs will provide an environment for students to become familiar with the different techniques and methodologies associated with the Big Data ecosystem.
Programs
Programs where the course is taught:
- Specialization in Information Analysis and Management
- Specialization in Risk Analysis and Management
- Major in Business Analytics
- Major in Data Science
- Specialization in Knowledge Management and Business Intelligence
- Specialization in Information Systems and Technologies Management
- Specialization in Marketing Intelligence
- Specialization in Marketing Research and CRM
- Specialization in Knowledge Management and Business Intelligence – Working Hours Format
- Specialization in Information Systems and Technologies Management - Working Hours Format
- Specialization in Marketing Intelligence - Working Hours Format
- Post-Graduation in Information Analysis and Management
- Post-Graduation Risk Analysis and Management
- PostGraduate in Smart Cities
- PostGraduate in Data Science for Marketing
- PostGraduate in Digital Enterprise Management
- PostGraduate Digital Marketing and Analytics
- PostGraduate in Information Management and Business Intelligence in Healthcare
- Post-Graduation in Knowledge Management and Business Intelligence
- Post-Graduation Information Systems and Technologies Management
- Post-Graduation in Marketing Intelligence
- Post-Graduation Marketing Research e CRM
- PostGraduate in Enterprise Information Systems