Catalogue Search | MBRL
Search Results Heading
Explore the vast range of titles available.
MBRLSearchResults
-
DisciplineDiscipline
-
Is Peer ReviewedIs Peer Reviewed
-
Item TypeItem Type
-
SubjectSubject
-
YearFrom:-To:
-
More FiltersMore FiltersSourceLanguage
Done
Filters
Reset
11,512
result(s) for
"Corpus analysis"
Sort by:
Why do news values matter? Towards a new methodological framework for analysing news discourse in Critical Discourse Analysis and beyond
2014
This article introduces a new framework for the analysis of news discourse to scholars in Critical Discourse Analysis (CDA) and beyond. It emphasises the importance of news values for linguistic analysis and encourages a constructivist approach to their analysis. The new methodological framework is situated within what the authors call a 'discursive' approach to news values. From this perspective, news values are seen as values that exist in and are constructed through discourse, and the primary research interest is in how texts construct newsworthiness through multimodal resources. This article first introduces resources that are used to construe news values in English-language news discourse, before illustrating the framework through two case studies of a 70,000-word corpus of British news discourse. The framework itself is intended for both multimodal discourse analysis and corpus linguistic analysis, although this article focuses more on the integration of corpus linguistic techniques. Thus, the discursive approach ties in well with two recent trends in CDA – towards multimodal and towards corpus-assisted discourse analysis. More specifically, the case studies show that corpus linguistic techniques can identify conventionalised discursive devices that are repeatedly used in news discourse to construct and perpetuate an ideology of newsworthiness. They further show that such techniques can provide a useful indication of the discursive construction of newsworthiness around a specific topic, event or news actor. The article concludes with an outline of further applications of the framework for (critical) linguistic analyses of news discourse.
Journal Article
Contrasting the semantic space of ‘shame’ and ‘guilt’ in English and Japanese
2024
This article sheds light on the significant yet nuanced roles of shame and guilt in influencing moral behaviour, a phenomenon that became particularly prominent during the COVID-19 pandemic with the community’s heightened desire to be seen as moral. These emotions are central to human interactions, and the question of how they are conveyed linguistically is a vast and important one. Our study contributes to this area by analysing the discourses around shame and guilt in English and Japanese online forums, focusing on the terms shame, guilt, haji (‘shame’) and zaiakukan (‘guilt’). We utilise a mix of corpus-based methods and natural language processing tools, including word embeddings, to examine the contexts of these emotion terms and identify semantically similar expressions. Our findings indicate both overlaps and distinct differences in the semantic landscapes of shame and guilt within and across the two languages, highlighting nuanced ways in which these emotions are expressed and distinguished. This investigation provides insights into the complex dynamics between emotion words and the internal states they denote, suggesting avenues for further research in this linguistically rich area.
Journal Article
Kada se portali „zaraze” koronom: Razvoj i usporedna analiza članaka portala Index.hr 2019. i 2020. godine
2022
The goal of this paper is to present the methodology, tools and results of comparative computational analysis of newspaper online articles: from the collection of documents and the cleaning of language data for the development of specialized corpora of newspaper articles, to the presentation of the tools used and the comparative statistical analysis of the corpora. The research was conducted on two specialized corpora developed precisely for the purpose of this research, based on 500 newspaper articles in the category “News” of the Index.hr news portal. One corpus is based on articles published in the pre-pandemic year 2019, and the other is based on articles published in the pandemic year 2020. By analyzing the data, we found that the vocabulary of the pandemic corpus is significantly poorer than the pre-pandemic corpus, that in 2020 less was written about the neighboring states of the Republic of Croatia than in 2019, and that the pre-pandemic corpus mentioned domestic cities more than the foreign ones, while the opposite can be argued for the pandemic corpus. Finally, we also investigated the adequacy of automatic term extraction to identify specific topics covered in the observed corpora.
Journal Article
On the Creation of a Corpus-Derived Medical Multi-Word Term List
by
Florescu, Cosmin Mihail
,
Ohniwa, Ryosuke L.
in
Analysis
,
Artificial intelligence
,
Case reports
2025
Although several studies have succeeded in creating medical word lists using corpus analysis methods, there is currently a shortage of comprehensive lists containing medical multi-word terms (MWTs). This study attempts to fill this gap by identifying medical MWTs using a large corpus of English language medical textbooks (28,384,681 running words). The term extraction function in Sketch Engine was used to extract high-frequency MWTs and to calculate keyness and dispersion data for each MWT. The validity of the resulting list and of specific subsets was tested using a different medical corpus and a general English corpus. The resulting list comprises 3307 MWTs with 63.83% (2111 MWTs) occurring at comparable frequencies in the different medical corpus and only 0.97% (32 MWTs) occurring at comparable frequencies in the general English corpus. The study also revealed clear differences in replicability between semantic subsets, with MWTs from the Anatomy and the Disorders semantic groups displaying high replicability, while MWTs from the Concepts and Ideas semantic group showed low to moderate replicability. The list may be used to develop evidence-based materials in English for Medical Purposes courses and to further explore how information is packaged in healthcare communication settings.
Journal Article
Battles and breakthroughs: representations of dementia in the British press
by
Dening, Tom
,
Bailey, Annika
,
Harvey, Kevin
in
Biomedicine
,
Corpus analysis
,
Corpus linguistics
2021
Media coverage of dementia can influence public and professional attitudes towards the syndrome, shaping societal knowledge of dementia and impacting how people with dementia are cared for. This paper reports on a study of news articles about dementia published in the British press in the years 2012–2017. The analysis combines the tools of corpus linguistics, a methodology for quantitatively surveying a vast amount of electronic linguistic data, with the qualitative perspectives of Critical Discourse Analysis, which seeks to uncover dominant discourses and ideologies. The most salient discourse that emerged from this analysis was the portrayal of dementia in biomedical terms, with a particular focus on the pathological processes of dementia, and pharmaceutical treatments and research. Keywords relating to this discourse are interrogated in detail, illuminating the linguistic strategies through which the pathology of dementia and people with dementia are depicted. This study highlights the challenges that this type of reporting presents to people living with dementia and their families, and points to the relevance of a discursive approach to understanding societal perceptions of dementia.
Journal Article
A Corpus Analysis of Differential Object Marking in Beserman Udmurt
2020
This paper is focused on differential object marking in Beserman Udmurt. Elicitation sessions show the relevance of referential properties of the DO for the choice between accusative and zero marking; however, for Standard Udmurt animacy has been claimed to play a more significant role. To establish the exact ranking of each parameter, we conducted an analysis of corpus data involving 10 539 sentences with 2187 DOs. With human animates, the unmarked DO is only possible with some lexical classes; with non-human animates, referential properties play the central role, as predicted by elicitation; with inanimates, the frequency of the accusative is higher among DOs with certain types of modifiers.
Journal Article
Linguistic and Communicative Diversity in Inclusive Settings—Effects, Challenges, and Opportunities
2025
Diversity is understood as a significant advantage for inclusive learning. However, the questions of whether linguistic and communicative diversity in the classroom is truly beneficial and whether mutually linguistic support is effective between students with and without language impairments have not yet been answered comprehensively. Therefore, within the research project ‘LINUS—Learners in Inclusive Settings’, interactions between children (n = 48; aged 6 y 7 m to 7 y 7 m) in dyads in primary school were videotaped and micro-sequentially analyzed. The pairs of children were engaged in a 15 min playful framework within a quasi-experimental design. Based on quantitative language corpus analyses, the occurrence of linguistic elements was compared between pairs of children with (a) typical age-appropriate language acquisition, (b) developmental language disorders, and (c) heterogeneous settings with respect to backgrounds of language acquisition. Additionally, quantitative coding was used to analyze how often linguistic elements were productively imitated from one child to the other in different peer settings. The results show that there is a higher potential for mutually motivated communication and more resources for linguistic support in mixed dyads. From this study, specific principles for teachers and students can be drawn to foster linguistic support between children in the context of diversity.
Journal Article
Translating Contract Law Terminology in the Civil Code of the P.R.C.: A Corpus-Based Analysis
2025
Terminology has long been a central concern in both theoretical and practical studies of legal translation. This study conducts a comparative and mixed-method analysis of the translation of contract law terminology, based on two pairs of self-compiled corpora: 1) the English monolingual translation of the Book of Contract of the Civil Code of the People's Republic of China and the official text of UNIDROIT Principles of International Commercial Contracts 2016, and 2) the parallel bilingual corpora of these texts. Focusing on three representative pairs of legal terms (formation/conclusion of contracts, terms/clauses of contracts, and obligor/obligee and creditor/debtor), the study identifies key inconsistencies in the translated version of the Book of Contract. Notably, the terms formation and conclusion are used inconsistently, and no clear distinction is made between terms and clauses, in contrast to the usage found in the reference corpus. The study also compares the use of obligor/obligee and creditor/debtor, offering suggestions for more accurate and contextually appropriate translation choices. Ultimately, the study seeks to contribute to the standardization of legal terminology in Chinese legislative translation, enhance legal translation quality, promote mutual understanding among different legal systems, and foster international legal and commercial cooperation, while providing the global community with clearer access to Chinese legal culture as well.
Journal Article
The portrayal of the future as legitimacy construction: discursive strategies in highly ranked business schools’ external communication
2023
In this paper, we investigate how highly ranked business schools construct their legitimacy claims by analysing their online organisational communication. We argue that in the case of higher education institutions in general, and business schools in particular, the discursive formation of these legitimacy claims is strongly connected to the future. Consequently, we utilise corpus-based discourse analysis of highly ranked business schools’ website communication by focusing on sentences containing the expression ‘future’. At first, we analysed the future-related language use to reveal the general future picture in the corpus. Furthermore, by combining qualitative and quantitative textual data, we identified six typical agency frames (i.e. preparing, shaping, adjusting, exploring, personal future, responsibility) about the future. By examining the co-occurrence of these frames, we were able to identify different discursive strategies. As we connected our findings to general societal phenomena we could interpret why and how business schools utilise these discursive strategies to (re)create and maintain their legitimacy.
Journal Article
Uses of the Light Verb GET in Chinese EFL Learners’ and English L1 Undergraduates’ Writing: A Corpus-Pattern-Analysis-Based Investigation
2024
In order to facilitate the instruction and acquisition of EFL writing, this paper compares the uses of the light verb get in English writing by Chinese and L1 undergraduates by drawing on insights from the theory of norms and exploitations (TNE). Corpus pattern analysis (CPA) is conducted to analyze the data taken from two comparable corpora of Chinese EFL learners’ and L1 undergraduates’ writing, unveiling both similarities and disparities between the two groups in terms of the patterning of get. Regarding similarities, most of the prototypical patterns identified in the two target corpora are identical, and both corpora show innovative exploitations of norms. In terms of differences, Chinese undergraduates misuse get in their writing in a varied but recognizable way. In addition, Chinese undergraduates are more likely than L1 undergraduates to overuse high-frequency patterns and semantic roles but underuse low-frequency ones, which may result in a lack of diversity in the range of patterns and complements. Chinese undergraduates’ uncertainty of registers is another notable feature in their writing. Based on the findings, the paper further discusses the implications for the teaching and learning of writing in EFL contexts.
Journal Article