Multilingual Coarse Political Stance Classification of Media: Training Details | HackerNoon

Media Bias [Deeply Researched Academic Papers]
May 19, 2024
9:00 am

This paper is available on arXiv under the CC BY-NC-SA 4.0 license.

Authors:

(1) Cristina España-Bonet, DFKI GmbH, Saarland Informatics Campus.

F. Training Details

F.1 L/R Classifier

We finetune XLM-RoBERTa large (Conneau et al., 2020) for L vs. R classification as schematised in Figure 1. Our classifier is a small network on top of XLM-RoBERTa that first applies dropout with probability 0.1 to the encoder's [CLS] token, followed by a linear layer and a tanh. The output then passes through another dropout layer with probability 0.1, and a final linear layer projects onto the two classes. The whole architecture is finetuned.
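The classification head described above can be sketched in PyTorch as follows. This is a minimal illustration rather than the authors' code: the class name `StanceHead`, the width of the intermediate linear layer (set equal to the encoder's hidden size of 1024) and the random tensor standing in for the encoder output are all assumptions made for the example.

```python
import torch
from torch import nn


class StanceHead(nn.Module):
    """Dropout -> linear -> tanh -> dropout -> linear, applied to the first token."""

    def __init__(self, hidden_size: int = 1024, num_classes: int = 2, p_drop: float = 0.1):
        super().__init__()
        self.dropout = nn.Dropout(p_drop)
        # Assumption: the intermediate layer keeps the encoder's hidden width.
        self.dense = nn.Linear(hidden_size, hidden_size)
        self.out_proj = nn.Linear(hidden_size, num_classes)

    def forward(self, encoder_states: torch.Tensor) -> torch.Tensor:
        # encoder_states has shape (batch, seq_len, hidden_size);
        # position 0 holds the [CLS]-style sentence token.
        x = encoder_states[:, 0, :]
        x = self.dropout(x)
        x = torch.tanh(self.dense(x))
        x = self.dropout(x)
        return self.out_proj(x)  # unnormalised scores for the two classes


head = StanceHead()
dummy_states = torch.randn(4, 16, 1024)  # stand-in for XLM-RoBERTa large outputs
logits = head(dummy_states)
labels = torch.tensor([0, 1, 1, 0])  # hypothetical L/R labels
loss = nn.functional.cross_entropy(logits, labels)
```

In a full setup the head would sit on top of the pretrained encoder, and gradients would flow through both, since the whole architecture is finetuned.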

We use a cross-entropy loss, AdamW optimiser and a learning rate that decreases linearly. We tune the batch size, the learning rate, warmup period and the number of epochs. The best values per language and model are summarised in Table 12.
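A linearly decreasing learning rate with a warmup period is often implemented as a linear ramp up to a peak value followed by a linear decay to zero. The sketch below assumes that shape; the function name `linear_schedule` and the numeric values are illustrative only and are not taken from Table 12.

```python
def linear_schedule(step: int, peak_lr: float, warmup_steps: int, total_steps: int) -> float:
    """Learning rate at a given optimiser step: linear warmup, then linear decay to zero."""
    if step < warmup_steps:
        # Ramp up from 0 to peak_lr over the warmup period.
        return peak_lr * step / max(1, warmup_steps)
    # Decay from peak_lr at the end of warmup to 0 at total_steps.
    remaining = max(0, total_steps - step)
    return peak_lr * remaining / max(1, total_steps - warmup_steps)


# Hypothetical values: peak rate 2e-5, 100 warmup steps, 1000 steps in total.
rates = [linear_schedule(s, 2e-5, 100, 1000) for s in (0, 50, 100, 550, 1000)]
```

With these assumed values the rate starts at zero, reaches its peak at step 100, and returns to zero at step 1000.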

All training runs are performed on a single NVIDIA Tesla V100 Volta GPU with 32 GB of memory.

F.2 Topic Modelling

We use Mallet (McCallum, 2002) to perform LDA on the corpus after removing the stopwords, with the hyperparameter optimisation option activated and run every 10 iterations. All other parameters keep their default values. For each language we do one run with 10 topics and another with 15 topics, and we tag the corpus with the topic labels from both runs.
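The two runs per language could be launched along the following lines. This sketch only builds the Mallet `train-topics` command lines and does not execute them. It assumes the corpus has already been imported into Mallet's format with stopwords removed; the helper name `mallet_train_topics_cmd` and all file names are hypothetical.

```python
from typing import List


def mallet_train_topics_cmd(corpus_file: str, num_topics: int, out_prefix: str) -> List[str]:
    """Build a Mallet LDA command with hyperparameter optimisation every 10 iterations."""
    return [
        "bin/mallet", "train-topics",
        "--input", corpus_file,
        "--num-topics", str(num_topics),
        "--optimize-interval", "10",
        # Output file names below are placeholders chosen for this example.
        "--output-doc-topics", f"{out_prefix}_doc_topics_{num_topics}.txt",
        "--output-topic-keys", f"{out_prefix}_topic_keys_{num_topics}.txt",
    ]


# One run with 10 topics and one with 15 topics for a single language.
commands = [mallet_train_topics_cmd("corpus_en.mallet", k, "en") for k in (10, 15)]
```

Each command list can be passed to `subprocess.run` once Mallet is installed; the document-topic output files then provide the per-document topic proportions from which labels can be derived.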
