Machine Learning
Objectives
The curricular unit of machine learning has as primary goal to allow students to gather skills in extracting information and knowledge from large databases. The success of information systems over the last decades promoted huge databases. These databases have the potential to foster the development and significantly increase the wealth of our society, allowing problems resolution. Generally, we can say that these data repositories can provide precious resources on improving the quality of decision making. Thus, this course addresses the topics and issues typically associated with designations of "Machine learning " or "Data mining ". This field has been developing rapidly and has become central in various activities. Its contribution extends the scientific marketing research, constituting a very significant body of knowledge and is still expanding. In this course, the main methodological aspects of "machine learning" and the most important tools are used, including artificial neural networks, classification trees, among others. The aim is also to provide students the opportunity to develop practical skills of problem solving. In this context the students will use Python to carry out a set of tasks modelling and extraction of knowledge.
General characterization
Code
100225
Credits
6.0
Responsible teacher
Roberto André Pereira Henriques
Hours
Weekly - Available soon
Total - Available soon
Teaching language
Portuguese. If there are Erasmus students, classes will be taught in English
Prerequisites
No requirements are needed.
Bibliography
- Dean Abbott (2014) "Applied Predictive Analytics: Principles and Techniques for the Professional Data Analyst", Wiley
- Daniel T. Larose, Chantal D. Larose (2015) "Data Mining and Predictive Analytics", 2nd Edition, Wiley
- Mitchell, T., (1997) "Machine Learning", McGraw Hill.
- Hand, D. J., Mannila, H., Smyth, P. (2001) "Principles of Data Mining (Adaptive Computation and Machine Learning)", MIT Press.
- Géron, A. (2019). Hands-on machine learning with Scikit-Learn, Keras, and TensorFlow: Concepts, tools, and techniques to build intelligent systems. O'Reilly Media.
Teaching method
The curricular unit is based on theoretical and practical lessons. A variety of instructional strategies will be applied, including lectures, slide show demonstrations, step-by-step applications (with and without software), questions and answers. The sessions include presentation of concepts and methodologies, solving examples, discussion and interpretation of results. The practical component includes exercises to consolidate the theoretical concepts covered in the theoretical classes.
Evaluation method
Regular period (1st examination period)
• Exam (40%)
• Project 1 (30%)
• Project 2 (30%)
Regular period (end examination period)
• Exam (40%)
• Project 1 (30%)
• Project 2 (30%)
Special exam period
• Exam (100%)
Minimum grade of 8.0 (in 20) for the exam
Minimum grade of 8.0 (in 20) for the projects
Subject matter
LU1. Introduction to machine learning. Predictive and descriptive models;
LU2. Main methodologies
LU3. Data exploration and Visualization tools;
LU4. Data pre-processing;
LU5. Predictive Models
a. Simple classifiers
b. Instance based classifiers
c. Classification Trees - DDT, Cart and C 4.5
d. Neural Networks - MLP
e. Additional Topics on Predictive Modelling
LU6. Descriptive Models;
a. Market basket analysis
b. RFM Analysis;
c. Clustering algorithms
I. K-Means;
II. Hierarchical Clustering;
III. Additional topics on the segmentation;
Programs
Programs where the course is taught: