Help Centre Proposal Guidelines Guiding Principles and Ethical Framework Guide for Section Chairs Guidance for the Panel Chairs, Discussants and Paper Presenters Terms and Conditions Code of Conduct

The use of LLMs in analysing large-scale corpora on far-right discourse

Extremism

Gender

Parliaments

Representation

Narratives

Big Data

Presenter(s)

Vivian Stamou

National Centre for Social Research - EKKE

Author(s)

Vivian Stamou

National Centre for Social Research - EKKE

Theoni Stathopoulou

National Centre for Social Research - EKKE

Marios Christopoulos

Haris Papageorgiou

Alexia Katsanidou

GESIS Leibniz-Institute for the Social Sciences

Panel Parliamentary Speeches and Social Media Discourses Across Party Systems

Date, Time and Location

Wednesday 14:15 - 15:45 BST (17/06/2026) Building: Frederick Douglass Centre, Floor: 2nd Floor, Room: Room 2.16

To access full paper downloads, participants are encouraged to install the official Event App, available on the App Store.

Abstract

The analysis builds on recent advances in Large Language Models (LLMs) and computational text analysis to process and interpret large-scale parliamentary and social media corpora. These corpora are designed to be comparable across countries, which makes it possible to explore far-right topics cross-linguistically, examining how they are referred to and framed according to political orientation. Two complementary approaches are integrated: (a) a keyword-based retrieval pipeline, designed to capture targeted instances of relevant discourse phenomena, and (b) topic modeling, employed to automatically identify recurring themes, semantic clusters, and linguistic patterns across texts. LLMs are used both for the semantic expansion of keyword queries and for generating contextual embeddings that enhance the interpretability and coherence of topics, as well as for evaluating the coherence and validity of the topics identified. This combined strategy bridges hypothesis-driven and exploratory analysis, enabling systematic cross-linguistic and cross-platform comparisons, as well as the examination of temporal dynamics in online communication. The resulting framework offers a reproducible, multilingual, and scalable methodology for analysing large-scale textual data from parliamentary corpora and social media sources.

Install the app

The use of LLMs in analysing large-scale corpora on far-right discourse

Abstract