Course Overview
The course surveys methods for systematically analyzing text using statistical methods and procedures for social scientific purposes, starting with classical content analysis, dictionary-based methods and the introduction scaling methods. The course lays a theoretical foundation for text analysis but also takes a practical and applied approach, so that students learn how to apply these methods in actual research. The common focus across all methods is that they can be reduced to a three-step process: first, identifying text and units for analysis; second, organizing texts into a structured corpus in preparation for analysis; and third, extracting from the texts quantitatively measured features, such as coded content categories, word counts, word types, dictionary counts, or their authors. The course surveys these methods in a logical progression.
This course contains a lot of guided activities, and there is an expectation that you immerse yourself in the tasks and investigate additional documentation and features of the tools presented.
Learning Outcomes
This course will help you to:
- Explore the theoretical basis for Quantitative Text Analysis
- Survey methods for systematically extracting quantitative information from text for social scientific purposes
- Identify texts and units of texts for analysis
- Convert texts into matrices for quantitative analysis
- Analyze these matrices in order to generate inferences using statistical methods