Data Collection, Administrative Sources and Big Data

Objectives

This curricular unit aims to present a set of methodologies that support data gathering in the process of producing official statistics, including data collection processes (mainly in the context in which data collection takes place through surveys based on a questionnaire), the use of administrative data and the use of sources associated with what is commonly referred to as Big Data.

At the end of the curricular unit the student should achieve the following learning objectives:
1. Identify and differentiate methods of data collection.
2. Know and select the most appropriate data collection methods and modes for each situation.
3. Be able to design a data collection methodology based on a survey.
4. Identify potential administrative sources and methodological aspects in their use
5. Recognize the potentialities of the use of big data in official statistics and the methodological aspects in their use

 

General characterization

Code

400010

Credits

6.0

Responsible teacher

Pedro Miguel Pereira Simões Coelho

Hours

Weekly - Available soon

Total - Available soon

Teaching language

Portuguese. If there are Erasmus students, classes will be taught in English

Prerequisites

   Not aplicable

Bibliography

European Association of Methodology. International Handbook of Survey Methodology, Eds. Edith D. de Leeuw, Josep J. Hox, Don A. Dillman, 2008.;

European Commission. Handbook of Recommended Practices for Questionnaire Development and Testing in the European Statistical System, 2006.;

Statistics Canada. Survey Methods and Practices, 2010;

Malhotra, Naresh K., Birks, David F. (2007). Marketing research: an applied approach. Third European edition. Harlow: Prentice Hall/Financial Times.;

Vilares, M. J.; Coelho, P. A Satisfação e a Lealdade do Cliente. Metodologias de Avaliação, Gestão e Análise., 2ª Edição, Escolar Editora.,2011.

Wallgren, B. Wallgren (2014). Register-based Statistics Statistical Methods for Administrative Data. John Wiley&Sons, Ltd.

United Nations (2007). Register-based statistics in the Nordic countries – Review of best practices with focus on population and social statistics. Available online: http://unstats.un.org/unsd/dnss/docViewer.aspx?docID=2764

United Nations (2011). Using Administrative and Secondary Sources for Official Statistics: A Handbook of Principles and Practices. Available online:

http://www.unece.org/fileadmin/DAM/stats/publications/Using_Administrative_Sources_Final_for_web.pdf

P. Christen (2012). Data Matching – Concepts and Techniques for Record Linkage, Entity Resolution, and duplicate Detection. Springer.

Teaching method

 This course is based on theoretical-practical classes including the presentation of concepts and methodologies, practical aplications, methodological discussions, and project/exercise resolution.

Evaluation method

The evaluation includes a final exam (60%) and an optional  project (40%).
The project includes the development of a report that should be discussed with the professor. The project comprehends the design of a methodology to support a data collection for a real problem.
To ensure approval it is necessary to achieve a minimum score of 8.5 in each component of assessment.
 

Subject matter

  1. Introduction to data collection methodologies
  2. The collection of data with surveys
    1. Planning the steps of the data collection process
    2. Sampling designs
    3. Target populations and sampling frames
    4. Sources of error in statistical operations
    5. Methods and modes of data collection
    6. Design of questionnaires
    7. Specificities of electronic surveys
  3. The collection of data with administrative sources
    1. Statistics based on administrative information
    2. The nature of administrative data
    3. The transfer of administrative information into statistical information
    4. Joint use of data from administrative sources and statistical surveys
    5. Data linkage and integration
    6. Evaluation of the quality of administrative data
    7. Protection of privacy and confidentiality
  4. Big data in the statistical production process
    1. Big date and digital trails
    2. Overview of big data sources
    3. The use of big data in official statistics
    4. Privacy and data protection
    5. Examples of the use of big data in the production of official statistics
    6. Methodological Challenges
    7. Big Data Tools Overview