Data Mining II
Objectives
The curricular unit of data mining has as primary goal to allow students to gain a fundamental understanding of the art and science of Predictive Analytics as it relates to improving business performance.
This course will cover the basics of predictive analytics and modeling data to determine which algorithms to use. It will also allow students to understand the similarities and differences and which options affect the models most.
Topics covered include predictive analytics algorithms for supervised learning, including decision trees, neural networks, k-nearest neighbor, support vector machines, and model ensembles.
At the end of the course, participants will be able to use these skills to produce a fully processed data set compatible with building powerful predictive models that can be deployed to increase profitability.
General characterization
Code
200028
Credits
7.5
Responsible teacher
Hours
Weekly - Available soon
Total - Available soon
Teaching language
Portuguese. If there are Erasmus students, classes will be taught in English
Prerequisites
Data Mining I is not a prerequisite.
Bibliography
Mitchell, T., (1997) ¿Machine Learning¿, McGraw Hill.; Berry, M.J.A. and G.S. Linoff, ¿Data Mining Techniques; for marketing, sales and customer support¿. 1997, John Wiley & Sons.; Hand, D. J., Mannila, H., Smyth, P. (2001) ¿Principles of Data Mining (Adaptive Computation and Machine Learning)¿, MIT Press; 0; 0
Teaching method
The course is mainly based on lecture and practical classes. The practical sessions include exposure of concepts and methodologies, sample resolution, discussion and interpretation of results. Practical work, which is very significant in this course is done by students outside the classroom, but is evaluated.
Evaluation method
The evaluation is done through homework (20% of final grade), a practical group (20%), and one final exam (60%)
Subject matter
The course is organized in seven Learning Units (UA):
UA1. Introduction to forecasting methods in Data Mining
UA2. Data pre-processing and error estimates
UA3. Decision theory and Bayesian
UA4 Learning and classification systems
UA5 Decision trees
UA6. Neural Networks
UA7. Ensambles
Programs
Programs where the course is taught: