Search

Russian State-Controlled Propaganda and its Proxies: Pro-Russian Political Actors in Japan
Olena Kalashnikova, Fabian Schäfer
Journal:

Asia-Pacific Journal / Volume 22 / Issue 3 / March 2024

Published online by Cambridge University Press:

14 March 2025, e6
- Article
- - You have access
- PDF
- Export citation
There are two main ways Russian propaganda reaches Japan: (a) the social media accounts of official institutions, such as the Russian Embassy, or Russian state-linked media outlets, such as Sputnik, and (b) pro-Russian Japanese political actors who willingly (or unwillingly) spread disinformation and display a clear pro-Kremlin bias. These actors justify the Russian invasion of Ukraine and repeat the Russian view of the war with various objectives in mind, primarily serving their own interests. By utilizing corpus analysis and qualitative examination of social media data, this article explores how Russian propaganda and a pro-Russian stance are effectively connected with and incorporated into the discursive strategies of political actors of the Japanese Far-Right.

Lexical Multidimensional Analysis

Identifying Discourses and Ideologies
Tony Berber Sardinha, Shannon Fitzsimmons-Doolan
Published online:

07 February 2025

Print publication:

27 February 2025
- Element
- - Get access
    
    Buy the print Element
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation
Lexical Multidimensional Analysis (LMDA), an extension of Biber's (1988) Multidimensional Analysis, seeks to identify dimensions (correlated lexical features across texts in a corpus) unveiling underlying patterns of lexical co-occurrence and variation within texts that are operationalized as a variety of latent, macro-level discursive constructs. Initially developed in the 2010s, LMDA has been applied to diverse domains, including education policy, national representations, applied linguistics, music, the infodemic, religion, sustainability, and literary style. This Element introduces LMDA for the identification and analysis of discourses and ideologies, offering insights into how lexis marks discourse formations and ideological alignments. Two case studies demonstrate the application of LMDA: uncovering discourses on climate change within conservative social media and analyzing ideological discourses in migrant education.

Social Group Representation in a Diachronic News Corpus

Irene Elmerot
Published online:

06 February 2025

Print publication:

06 February 2025
- Element
- - You have access
  - Open access
- PDF
- HTML
- Export citation
Equality is a global factor of prosperity in democratic societies. In this Element, thirty years of newspapers and magazines form the basis of an intersectional study on how different social actors are described in Czechia. A bird's eye perspective points to the news being very white male-oriented, but when scrutinising further, some results differ from previous studies, giving insights on linguistic othering and stratification that may be a threat to equality. The methodology can be used for most languages with a sufficient amount of digitised, annotated and available texts. Since more and more text is being gathered to form datasets large enough to answer any question we might have, this Element helps uncover why we should be careful about which conclusions to draw if the words put into the data are not adapted to the relevant register and context. This title is also available as Open Access on Cambridge Core.

Lexical be
Philip Miller, Peter W. Culicover
Journal:

Journal of Linguistics , First View

Published online by Cambridge University Press:

15 January 2025, pp. 1-24
- Article
- - You have access
  - Open access
- PDF
- HTML
- Export citation
We explore the surprising lexical be construction in English (e.g. Why don’t you be quiet?). After an overview of previous discussions, an investigation of the use of lexical be in the COCA and SOAP corpora is provided. It is shown that its distribution is highly skewed and that it is completely felicitous only under a very limited set of conditions. An account of lexical be is then provided showing that the conditions that license it are inherited from more general constructions, most importantly the negative imperative construction and the ‘Why don’t you’ construction. In this light, it is suggested that the lexical be construction, with its special properties, provides strong evidence for a constructional approach to linguistic competence along the lines of Goldberg (1995), Culicover and Jackendoff (2005), Sag (2012).

The heart attack of the Polish health service: metaphors, arguments, and emotional appeals in political debates
Konrad Juszczyk, Barbara Konat, Małgorzata Fabiszak
Journal:

Language and Cognition / Volume 17 / 2025

Published online by Cambridge University Press:

09 January 2025, e12
- Article
- - You have access
  - Open access
- PDF
- HTML
- Export citation
Metaphors, arguments and emotional appeals have considerable persuasive power in political discourse, yet they are rarely studied together. To explore the interactions between these interrelated phenomena, we employ three methods of analysis: Metaphor Identification Procedure, Inference Anchoring Theory, and lexicon-based sentiment analysis. Our data come from Polish political debates broadcasted during the 2019 pre-election campaign. We test hypotheses about the frequency of the associations between metaphors, arguments and emotional appeals. Hypothesis 1 predicts that arguments containing metaphors are more frequent than arguments without metaphors, hypothesis 2 predicts that arguments containing emotional appeals are more frequent than arguments without them, and hypothesis 3 predicts that arguments with metaphors and emotional appeals are more frequent than any other combination. The results show that metaphorical arguments do not outnumber non-metaphorical ones (H1 is falsified), and arguments that are both metaphorical and emotional do not outnumber the sum of all other types (H3 is falsified). Emotional arguments are more common than non-emotional ones (H2 is verified). We suggest that when political actors articulate their arguments, they often choose a particular metaphor to evoke positive or negative emotions in their audience.

Gender-specific and gender-neutral language trends in the AP Stylebook and online written news: A comparative corpus analysis of prescribed vs. actual usage
Brooke James, Jacob D. Rawlins
Journal:

English Today , First View

Published online by Cambridge University Press:

16 December 2024, pp. 1-16
- Article
- - You have access
  - Open access
- PDF
- HTML
- Export citation
Contrary to traditional thought in linguistics and editing, recent studies using corpus-based evidence suggest that historical English usage patterns influenced prescriptive usage manuals’ guidelines more than the other way around. To explore the modern relationship between English language prescriptions and usage, this study focuses on the wide-reaching genre of written online news and the topic of gender-fair language. It compares changes regarding gender-specific titles in the Associated Press's stylebooks to actual usage trends as documented by the News on the Web (NOW) corpus. Results from NOW show -man title variants as the dominant form in the early 2010s, consistent with AP style at that time. However, many gender-neutral (including -person) variants saw rapid uptake in usage in the mid-2010s to become the most frequent forms by 2021, contrasting AP guidelines that only started listing -person and other neutral forms as ‘acceptable' around 2017 and as the prescribed forms more recently. These results indicate both an increased cultural consciousness for changing gender equity standards as well as a willingness of many news writers, editors, and publishers to defer to culturally significant language trends even if authoritative guides do not yet endorse them.

Topics in Public Administration

Perspectives from Computational Social Sciences and Corpus Linguistics
Richard M. Walker, Jiasheng Zhang, Yanto Chandra
Published online:

29 November 2024

Print publication:

09 January 2025
- Element
- - Get access
    
    Buy the print Element
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation
This inductive examination of the topics in the public administration literature using computational social science and corpus linguistics (17 journals, N=12,760 articles, 1991–2019) reveals a new landscape of public administration topics, changes in topics over time and their distribution: Topic modelling of the stock of the whole corpus identifies 50 topics: the top ten topics included health care, federal government, performance management, environmental regulation, HRM and networks and accounted for just over a third of scholarship between 1991–2019. Focal topics identified in individual journals identified similarities with popular topics in the whole corpus – networks, health care, HRM – and less frequently examined topics including gender and diversity and partnerships. Analysis of topics over time shows a substantial flow in topics moving from a country and practice focus in the early stages of our study period to concepts such as governance, networks and citizens in the late stages (2015–2019).

Characterizing English Preposing in PP constructions
Christopher Potts
Journal:

Journal of Linguistics , First View

Published online by Cambridge University Press:

08 October 2024, pp. 1-39
- Article
- - You have access
  - Open access
- PDF
- HTML
- Export citation
The English Preposing in PP construction (PiPP; e.g., Happy though/as we were) is extremely rare but displays an intricate set of stable syntactic properties. How do people become proficient with this construction despite such limited evidence? It is tempting to posit innate learning mechanisms, but present-day large language models seem to learn to represent PiPPs as well, even though such models employ only very general learning mechanisms and experience very few instances of the construction during training. This suggests an alternative hypothesis on which knowledge of more frequent constructions helps shape knowledge of PiPPs. I seek to make this idea precise using model-theoretic syntax (MTS). In MTS, a grammar is essentially a set of constraints on forms. In this context, PiPPs can be seen as arising from a mix of construction-specific and general-purpose constraints, all of which seem inferable from general linguistic experience.

The ‘adverb-ly adjective’ construction in English: meanings, distribution and discourse functions
MAITE TABOADA, CLIFF GODDARD, RADOSLAVA TRNAVAC
Journal:

English Language & Linguistics , First View

Published online by Cambridge University Press:

27 September 2024, pp. 1-30
- Article
- - You have access
  - Open access
- PDF
- HTML
- Export citation
We investigate a class of adjective phrases composed of a deadjectival adverb ending in -ly and an adjective head (e.g. staggeringly incompetent, absolutely terrific, fiscally responsible), a compact construction whereby two adjectives may jointly contribute to evaluative meaning. Using corpus methodologies on more than 1 million examples and relying on semantic analyses of about 1,000 instances, we propose that the construction can be divided into different semantic subtypes, including Degree (deeply disturbing), Focus (utterly ridiculous), Manner (delightfully performed), Reaction (strangely compelling), Topical (historically inaccurate) and Epistemic (intuitively obvious), among others. Using this typology, we investigate the relative distribution of each subtype across several registers of written English. We found a high frequency of the Reaction subtype in book, film and art reviews, and we suggest a discourse-functional explanation for this, linked to the perceived value of originality in expressive writing. This investigation reveals the power of semantically informed, corpus methodologies to shed light on the distribution of specific constructions.

Programming for Corpus Linguistics with Python and Dataframes

Daniel Keller
Published online:

24 May 2024

Print publication:

20 June 2024
- Element
- - Get access
    
    Buy the print Element
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation
This Element offers intermediate or experienced programmers algorithms for Corpus Linguistic (CL) programming in the Python language using dataframes that provide a fast, efficient, intuitive set of methods for working with large, complex datasets such as corpora. This Element demonstrates principles of dataframe programming applied to CL analyses, as well as complete algorithms for creating concordances; producing lists of collocates, keywords, and lexical bundles; and performing key feature analysis. An additional algorithm for creating dataframe corpora is presented including methods for tokenizing, part-of-speech tagging, and lemmatizing using spaCy. This Element provides a set of core skills that can be applied to a range of CL research questions, as well as to original analyses not possible with existing corpus software.

26 - Sociolinguistic Variation in Slavic Languages
from Part 5 - Sociolinguistic and Geographical Approaches
- By Serge Sharoff, Nenad Ivanović
Edited by Danko Šipka, Arizona State University, Wayles Browne, Cornell University, New York
Book:

The Cambridge Handbook of Slavic Linguistics

Published online:

16 May 2024

Print publication:

23 May 2024, pp 559-583
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation
Summary

This chapter discusses linguistic variation in Slavic languages by presenting an overview of the relationship between human communication in the society and the corresponding linguistic features. In this chapter we focus on the parameters of variation according to the language user, such as age or dialects, and according to the language use, such as communicative functions or communication styles, e.g. politeness. We cite both qualitative and quantitative methods for studying aspects of sociolinguistic variation. Examples are drawn from large corpora of two Slavic languages, Russian and Serbo-Croatian, with a particular focus on academic writing, news reporting, and reporting personal experience in social media, as well as from dictionaries and field studies.

35 - Natural Language Processing
from Part 6 - Experimental and Quantitative Approaches
- By Tomaž Erjavec
Edited by Danko Šipka, Arizona State University, Wayles Browne, Cornell University, New York
Book:

The Cambridge Handbook of Slavic Linguistics

Published online:

16 May 2024

Print publication:

23 May 2024, pp 732-750
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation
Summary

This chapter surveys the history and main directions of natural language processing research in general, and for Slavic languages in particular. The field has grown enormously since its beginning. Especially since 2010, the amount of digital texts has been rapidly growing; furthermore, research has yielded an ever-greater number of highly usable applications. This is reflected in the increasing number and attendance of NLP conferences and workshops. Slavic countries are no exception; several have been organising international conferences for decades, and their proceedings are the best place to find publications on Slavic NLP research. The general trend of the evolution of NLP is difficult to predict. It is certain that deep learning, including various new types (e.g. contextual, multilingual) of word embeddings and similar ‘deep’ models will play an increasing role, while predictions also mention the increasing importance of the Universal Dependencies framework and treebanks and research into the theory, not only the practice, of deep learning, coupled with attempts at achieving better explainability of the resulting models.

Introduction
- By Wayles Browne, Danko Šipka
Edited by Danko Šipka, Arizona State University, Wayles Browne, Cornell University, New York
Book:

The Cambridge Handbook of Slavic Linguistics

Published online:

16 May 2024

Print publication:

23 May 2024, pp 1-6
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation
Summary

The introduction to this volume describes its content. It also provides the rationale for including selected topics and provides comments on the manner of presentation adopted in this volume.

A quantitative exploration of the functions of auxiliary do in Middle English
LORENZO MORETTI
Journal:

English Language & Linguistics , First View

Published online by Cambridge University Press:

17 May 2024, pp. 1-21
- Article
- - You have access
  - Open access
- PDF
- HTML
- Export citation
One of the questions that still surrounds the history of auxiliary do is what function it had during the Middle English period (c.1100–1500). Scholars have put forward different hypotheses, suggesting that it could serve, among others, as a perfective marker (Denison 1985), agentive marker (Ecay 2015) and habitual marker (Garrett 1998). The present article reports on a quantitative study that aims to shed further light on this issue. By means of a collexeme analysis, this article investigates the semantic features of the infinitives that occur with auxiliary do in several Middle English corpora. The results show that auxiliary do was not connected to verbs with specific semantic profiles, but it was employed in different contexts and had various functions. Specifically, the data suggest that auxiliary do was used (i) as an accommodation tool to facilitate the use of low-frequency verbs, particularly of French origin, and (ii) as an aspectual particle to mark both perfectivity and habituality. It is argued that the multifunctionality of auxiliary do in Middle English played a crucial role in the preservation of the construction before it spread to the NICE (i.e. negation, inversion, code and emphasis) environments.

The Cambridge Handbook of Slavic Linguistics

Edited by Danko Šipka, Wayles Browne
Published online:

16 May 2024

Print publication:

23 May 2024
- Book
- - Get access
    
    Buy a print copy
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation
The linguistic study of the Slavic language family, with its rich syntactic and phonological structures, complex writing systems, and diverse socio-historical context, is a rapidly growing research area. Bringing together contributions from an international team of authors, this Handbook provides a systematic review of cutting-edge research in Slavic linguistics. It covers phonetics and phonology, morphology and syntax, lexicology, and sociolinguistics, and presents multiple theoretical perspectives, including synchronic and diachronic. Each chapter addresses a particular linguistic feature pertinent to Slavic languages, and covers the development of the feature from Proto-Slavic to present-day Slavic languages, the main findings in historical and ongoing research devoted to the feature, and a summary of the current state of the art in the field and what the directions of future research will be. Comprehensive yet accessible, it is essential reading for academic researchers and students in theoretical linguistics, linguistic typology, sociolinguistics and Slavic/East European Studies.

Chapter 1 - Formularity
Chiara Bozzone, Ludwig-Maximilians-Universität Munchen
Book:

Homer's Living Language

Published online:

11 April 2024

Print publication:

18 April 2024, pp 5-63
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation
Summary

Formularity, or the poet’s reliance on prefabricated linguistic features in the composition of his verses, has been the most debated feature of Oral-Formulaic Theory. This chapter reviews the history of Homeric formularity (Part 1), while introducing new key insights from the fields of linguistics (esp. usage-based linguistics, corpus linguistics, and language acquisition studies) and the cognitive sciences (Parts 2-5). Parts 2-3 argue that formularity is a general feature of human language and cognition. Homer’s formularity is quantitatively notable, however, in that it involves sequences that are particularly long when compared to repeated sequences in corpora of both contemporary written or spoken English and ancient prose and hexameter authors. This is interpreted as a sign of Homer’s extreme mastery of his medium, which was arguably necessitated by the oral-improvisational nature of the task. Part 4 develops a new theory of Homeric formularity, borrowing insights from connectionism, lexical priming, and construction grammar, and introduces fine-grained distinctions between conceptual associations, collocations, constructions, metrical constructions and structural formulas.

10 - Creation and Analysis of the Multimedia Russian Corpus for Gesture Research
from Part II - Ways of Approaching Gesture Analysis
- By Ekaterina Rakhilina, Alan Cienki
Edited by Alan Cienki, Vrije Universiteit, Amsterdam
Book:

The Cambridge Handbook of Gesture Studies

Published online:

01 May 2024

Print publication:

18 April 2024, pp 249-272
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation
Summary

The chapter considers gesture studies in relation to corpus linguistic work. The focus is on the Multimedia Russian Corpus (MURCO), part of the Russian National Corpus. The chapter includes a brief biography of the creator of this corpus, Elena Grishina. The compilation of the corpus out of a set of Russian classic feature films and recorded lectures is described as well as the methods of annotating it in detail. The gesture coding is not limited to manual/hand gestures, but also includes head gestures and use of eye gaze. The chapter considers the findings from the corpus, and reported in Grishina’s posthumously published volume on Russian gestures from a linguistic point of view. The categories include pointing gestures, representational gestures, auxiliary (discourse-structuring) gestures, and several cross-cutting categories, including gestures in relation to pragmatics and to grammatical categories, like verbal aspect. Additional consideration is given to other video corpora in English (and other languages) which are being used for gesture research, namely the UCLA NewsScape library being managed by the Red Hen Lab and the Television Archive.

Chapter 2 - Theoretical and Methodological Considerations
Claudia Claridge, University of Augsburg, Ewa Jonsson, Mid Sweden University, Merja Kytö, Uppsala University
Book:

Intensifiers in Late Modern English

Published online:

15 March 2024

Print publication:

28 March 2024, pp 9-34
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation
Summary

The chapter introduces the material used for the study, that is, the Old Bailey Corpus (OBC) as well as the Old Bailey Online resource and the Proceedings that the OBC has been based on. The analytical frameworks adopted are also discussed, comprising the corpuslinguistic approach, and the historical sociopragmatics, the language variation and change, and the grammaticalization and pragmatic-semantic change frameworks. Attention is also paid to the late modern courtroom and to the issues of relevance to the study of past spoken interaction based on written records.

Chapter 4 - Corpus Methodology and Overview of Data
Claudia Claridge, University of Augsburg, Ewa Jonsson, Mid Sweden University, Merja Kytö, Uppsala University
Book:

Intensifiers in Late Modern English

Published online:

15 March 2024

Print publication:

28 March 2024, pp 64-89
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation
Summary

This is the main methodology and first-results chapter. It opens with an introduction to the lexeme-based approach used for the investigation, contrasting this to previous, variationist approaches. The chapter proceeds to explain the data retrieval and screening processes and presents an overview of the data, the nearly 65,000 intensifier tokens found in the corpus, across the three main categories (maximizers, boosters, downtoners), and the descriptive results across time for the most frequent items. The word counts of the different sociopragmatic groups of speakers (divided by speakers’ role in the courtroom, gender and social class) are introduced, as well as the diachronic distribution of intensifiers across the genders and social classes. Results are presented within the descriptive statistics framework, but the chapter also briefly introduces the regression model, or the inferential, multivariate statistical method to be used in Chapters 8–11 to disentangle the complex interplay of the sociopragmatic variables of speakers on the use of intensifiers.

Contrasting the semantic space of ‘shame’ and ‘guilt’ in English and Japanese
Eugenia Diegoli, Emily Öhman
Journal:

Language and Cognition / Volume 16 / Issue 4 / December 2024

Published online by Cambridge University Press:

01 March 2024, pp. 1296-1318
- Article
- - You have access
  - Open access
- PDF
- HTML
- Export citation
This article sheds light on the significant yet nuanced roles of shame and guilt in influencing moral behaviour, a phenomenon that became particularly prominent during the COVID-19 pandemic with the community’s heightened desire to be seen as moral. These emotions are central to human interactions, and the question of how they are conveyed linguistically is a vast and important one. Our study contributes to this area by analysing the discourses around shame and guilt in English and Japanese online forums, focusing on the terms shame, guilt, haji (‘shame’) and zaiakukan (‘guilt’). We utilise a mix of corpus-based methods and natural language processing tools, including word embeddings, to examine the contexts of these emotion terms and identify semantically similar expressions. Our findings indicate both overlaps and distinct differences in the semantic landscapes of shame and guilt within and across the two languages, highlighting nuanced ways in which these emotions are expressed and distinguished. This investigation provides insights into the complex dynamics between emotion words and the internal states they denote, suggesting avenues for further research in this linguistically rich area.

Search Results

Refine search

Refine search

Actions for selected content:

162 results

Russian State-Controlled Propaganda and its Proxies: Pro-Russian Political Actors in Japan

Lexical Multidimensional Analysis

Social Group Representation in a Diachronic News Corpus

Lexical be

The heart attack of the Polish health service: metaphors, arguments, and emotional appeals in political debates

Gender-specific and gender-neutral language trends in the AP Stylebook and online written news: A comparative corpus analysis of prescribed vs. actual usage

Topics in Public Administration

Characterizing English Preposing in PP constructions

The ‘adverb-ly adjective’ construction in English: meanings, distribution and discourse functions

Programming for Corpus Linguistics with Python and Dataframes

26 - Sociolinguistic Variation in Slavic Languages

Summary

35 - Natural Language Processing

Summary

Introduction

Summary

A quantitative exploration of the functions of auxiliary do in Middle English

The Cambridge Handbook of Slavic Linguistics

Chapter 1 - Formularity

Summary

10 - Creation and Analysis of the Multimedia Russian Corpus for Gesture Research

Summary

Chapter 2 - Theoretical and Methodological Considerations

Summary

Chapter 4 - Corpus Methodology and Overview of Data

Summary

Contrasting the semantic space of ‘shame’ and ‘guilt’ in English and Japanese

Search Results

Refine search

Refine search

Actions for selected content:

Save Search

162 results

Lexical Multidimensional Analysis

Social Group Representation in a Diachronic News Corpus

Topics in Public Administration

Programming for Corpus Linguistics with Python and Dataframes

Summary

Summary

Summary

The Cambridge Handbook of Slavic Linguistics

Summary

Summary

Summary

Summary