ECPR

Install the app

Install this application on your home screen for quick and easy access when you’re on the go.

Just tap Share then “Add to Home Screen”

ECPR

Install the app

Install this application on your home screen for quick and easy access when you’re on the go.

Just tap Share then “Add to Home Screen”

Your subscription could not be saved. Please try again.
Your subscription to the ECPR Methods School offers and updates newsletter has been successful.

Discover ECPR's Latest Methods Course Offerings

We use Brevo as our email marketing platform. By clicking below to submit this form, you acknowledge that the information you provided will be transferred to Brevo for processing in accordance with their terms of use.

Text as Data

Course Dates and Times

Monday 1 – Friday 5 August 2022
2 hours of live teaching per day
15:00 – 17:00 CEST

Chendi Wang

chendi.wang@vu.nl

Vrije Universiteit Amsterdam

This course provides a highly interactive online teaching and learning environment, using state-of-the-art online pedagogical tools. It is designed for a demanding audience (researchers, professional analysts, advanced students) and capped at a maximum of 16 participants so that the teaching team can cater to the specific needs of each individual.

Purpose of the course

By the end of this course, you will understand, and be able to apply, various text-as-data methods for answering research questions in social science.

In addition to the theoretical material, you will gain proficiency in data analytic skills using the open-source statistical programming language R.

This course is suitable for students in various social science disciplines. It is set at an advanced level, for people who already possess basic quantitative skills.

By the end of this course, you will understand the current quantitative text analysis research landscape. You will have the skills to convert texts into quantitative matrices of features, and to analyse text data using statistical methods. You will also be able to apply these methods to a text corpus in support of a substantive research question, and to critically evaluate research that uses text analysis methods.

The course is suitable for researchers, professional analysts, and advanced students.

ECTS Credits

3 credits Engage fully with class activities 
4 credits Complete a post-class assignment


Instructor Bio

Chendi is an assistant professor in political science at the Department of Political Science and Public Administration, VU Amsterdam. He holds a PhD in political science from the EUI.

Chendi's research interests lie in political behaviour, political economy, comparative politics, and quantitative and computational methods.

He has published in the British Journal of Political Science, Western European Politics, Comparative European Politics, and in volumes published by Cambridge University Press.

To get the most out of this course, complete the required in-depth readings for each day, and skim at least one of the recommended readings, if listed.

Five pre-recorded lectures, introducing the course's major topics and concepts, supplement the readings.

Key topics covered

Monday
Introduction and overview

Tuesday
Data collection, webscraping and APIs

Wednesday
Descriptive analyses and dictionary methods

Thursday
Supervised methods and unsupervised methods

Friday
Word2Vec and embeddings

How the course will work online

The course combines pre-class assignments readings and pre-recorded videos with daily two-hour live Zoom sessions. These sessions focus on two tasks:

  • in-depth discussion of topics covered in the lecture
  • a lab session with guided hands-on exercises relevant to the lecture.

We want you to master the technical side of text analysis. To this end, we will distribute the R script prior to the live session so you can explore the code at your own pace, but we will go through the code and models together during the sessions.

We will also get to know each other, and each other's projects, and explore how we can use text data to answer your research questions. There will be problem sets after each live session. We will discuss these assignments, and any problems you may have, together the following day.

You can share thoughts and ask questions on our Slack channel, and the Instructor will host live Q&A sessions and social breaks. You will be able to sign up for a quick one-to-one consultation during designated office hours.

You should have basic knowledge of statistical analysis. If you have completed an introductory course in applied regression, this will suffice.

The course mainly uses R. Familiarity with it is preferred, but not necessary because a lab session early on in this course will refresh your knowledge.

If you are completely new to R, take Akos Mate's course Introduction to R before signing up to this one.