This enthusiastic introduction to the fundamentals of information theory builds from classical Shannon theory through to modern applications in statistical learning, equipping students with a uniquely well-rounded and rigorous foundation for further study. It introduces core topics such as data compression, channel coding, and rate-distortion theory using a distinctive finite block-length approach. With over 210 end-of-part exercises and numerous examples, students are introduced to contemporary applications in statistics, machine learning, and modern communication theory. This textbook presents information-theoretic methods with applications in statistical learning and computer science, such as f-divergences, PAC-Bayes and the variational principle, Kolmogorov's metric entropy, strong data processing inequalities, and entropic upper bounds for statistical estimation. Accompanied by a solutions manual for instructors and additional standalone chapters on more specialized topics in information theory, this is the ideal introductory textbook for senior undergraduate and graduate students in electrical engineering, statistics, and computer science.
Word embeddings are now a vital resource for social science research. However, obtaining high-quality training data for non-English languages can be difficult, and fitting embeddings to them may be computationally expensive. In addition, social scientists typically want to make statistical comparisons and do hypothesis tests on embeddings, yet this is nontrivial with current approaches. We provide three new data resources designed to address this combination of issues: (1) a new version of fastText model embeddings, (2) a multilanguage “à la carte” (ALC) embedding version of the fastText model, and (3) a multilanguage ALC embedding version of the well-known GloVe model. All three are fit to Wikipedia corpora. These materials are aimed at “low-resource” settings where analysts lack access to large corpora in their language of interest or to the computational resources required to produce high-quality vector representations. We make these resources available for 40 languages, along with a code pipeline for another 117 languages available from Wikipedia corpora. We extensively validate the materials via reconstruction tests and other proofs of concept. We also conduct human crowdworker tests of our embeddings for Arabic, French, (traditional Mandarin) Chinese, Japanese, Korean, Russian, and Spanish. Finally, we offer some advice to practitioners using our resources.
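As a rough illustration of the ALC idea (not the authors' released pipeline), the sketch below induces an embedding for a word by averaging the pretrained vectors of its context words and applying a linear transform; the matrix `A`, the vocabulary, and the dimensions are placeholder assumptions.

```python
# Minimal sketch of "a la carte" (ALC) embedding induction, assuming a
# pretrained embedding matrix and a learned linear transform A (here random
# placeholders). The released resources ship transforms fitted to Wikipedia.
import numpy as np

rng = np.random.default_rng(0)
dim = 300
vocab = {"river": 0, "bank": 1, "money": 2, "water": 3}
E = rng.standard_normal((len(vocab), dim))   # pretrained (e.g. fastText/GloVe) vectors
A = rng.standard_normal((dim, dim))          # ALC induction matrix (normally learned)

def alc_embed(context_tokens):
    """ALC embedding: transform the average of the context word vectors."""
    ids = [vocab[t] for t in context_tokens if t in vocab]
    context_avg = E[ids].mean(axis=0)
    return A @ context_avg

# Embed a target word from the contexts in which it occurs.
v = alc_embed(["river", "water", "bank"])
print(v.shape)  # (300,)
```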
Peatlands, covering approximately one-third of global wetlands, provide various ecological functions but are highly vulnerable to climate change, and their changes in space and time require monitoring. The sub-Antarctic Prince Edward Islands (PEIs) are a key conservation area for South Africa, as well as for the preservation of terrestrial ecosystems in the region. The peatlands (mires) found there are threatened by climate change, yet the factors governing their distribution are poorly understood. This study attempted to predict mire distribution on the PEIs using species distribution models (SDMs) employing multiple regression-based and machine-learning models. The random forest model performed best. The key influencing factors were the Normalized Difference Water Index and slope, with low annual mean temperature, precipitation seasonality, and distance from the coast being less influential. Despite moderate predictive ability, the model could only identify general areas of mires, not specific ones. This study therefore showed limited support for the use of SDMs in predicting mire distributions on the sub-Antarctic PEIs. We recommend refining the criteria used to select environmental factors and enhancing the geospatial resolution of the data to improve the predictive accuracy of the models.
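A minimal sketch of the kind of SDM workflow described above, using scikit-learn's random forest on fabricated presence/absence data; the predictor names echo the study's variables, but the data and the generating rule are invented for illustration.

```python
# Illustrative random-forest SDM on synthetic presence/absence data.
import numpy as np
import pandas as pd
from sklearn.ensemble import RandomForestClassifier
from sklearn.metrics import roc_auc_score
from sklearn.model_selection import train_test_split

rng = np.random.default_rng(1)
n = 500
X = pd.DataFrame({
    "ndwi": rng.uniform(-1, 1, n),            # Normalized Difference Water Index
    "slope_deg": rng.uniform(0, 40, n),
    "annual_mean_temp": rng.uniform(4, 8, n),
    "precip_seasonality": rng.uniform(5, 30, n),
    "dist_to_coast_km": rng.uniform(0, 10, n),
})
# Toy rule: mires favour wet, flat ground (for demonstration only).
y = ((X["ndwi"] > 0.2) & (X["slope_deg"] < 10)).astype(int)

X_tr, X_te, y_tr, y_te = train_test_split(X, y, random_state=1)
rf = RandomForestClassifier(n_estimators=300, random_state=1).fit(X_tr, y_tr)
print("AUC:", roc_auc_score(y_te, rf.predict_proba(X_te)[:, 1]))
print(dict(zip(X.columns, rf.feature_importances_)))  # variable importance
```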
Developing large-eddy simulation (LES) wall models for separated flows is challenging. We propose to address this issue by combining data from separated flows, for which existing theories are not applicable, with established knowledge of wall-bounded flows (such as the law of the wall) through embedded learning. The proposed features-embedded-learning (FEL) wall model comprises two submodels: one for predicting the wall shear stress and another for calculating the eddy viscosity at the first off-wall grid nodes. We train the former using wall-resolved LES (WRLES) data of the periodic hill flow and the law of the wall. For the latter, we propose a modified mixing-length model, with the model coefficient trained using the ensemble Kalman method. The proposed FEL model is assessed on separated flows with different flow configurations, grid resolutions and Reynolds numbers. Overall good a posteriori performance is observed for predicting the statistics of the recirculation bubble, wall stresses and turbulence characteristics. The statistics of the modelled subgrid-scale (SGS) stresses at the first off-wall grid nodes are compared with those calculated using the WRLES data. The comparison shows that the amplitude and distribution of the SGS stresses and energy transfer obtained using the proposed model agree better with the reference data than those from the conventional SGS model.
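The following is a hedged sketch of the two ingredients named above, under assumed functional forms: a mixing-length eddy viscosity with a tunable coefficient, and a scalar ensemble Kalman update of that coefficient against a reference value standing in for WRLES data. The paper's actual model and training procedure are more elaborate.

```python
# Mixing-length eddy viscosity with a coefficient trained by an
# ensemble Kalman update; all numbers are illustrative.
import numpy as np

KAPPA = 0.41

def eddy_viscosity(c, y, dudy):
    """Modified mixing-length model evaluated at the first off-wall node."""
    return (c * KAPPA * y) ** 2 * abs(dudy)

rng = np.random.default_rng(2)
y1, dudy = 0.01, 120.0             # first off-wall height and velocity gradient (toy)
nu_t_ref, obs_var = 0.9e-3, 1e-8   # "reference" value playing the role of WRLES data

C = rng.normal(1.0, 0.2, size=50)  # ensemble of model coefficients
for _ in range(20):                # iterative ensemble Kalman update
    h = np.array([eddy_viscosity(c, y1, dudy) for c in C])  # predicted observable
    gain = np.cov(C, h)[0, 1] / (h.var() + obs_var)
    C = C + gain * (nu_t_ref + rng.normal(0, np.sqrt(obs_var), C.size) - h)
print("trained coefficient:", C.mean())
```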
A biofilm refers to an intricate community of microorganisms firmly attached to surfaces and enveloped within a self-generated extracellular matrix. Machine learning (ML) methodologies have been harnessed across diverse facets of biofilm research, encompassing predictions of biofilm formation, identification of pivotal genes, and the formulation of novel therapeutic approaches. This investigation undertook a bibliographic analysis focused on ML applications in biofilm research, aiming to present a comprehensive overview of the field’s current status. Our exploration involved searching the Web of Science database for articles incorporating the term “machine learning biofilm,” leading to the identification and analysis of 126 pertinent articles. Our findings indicate a substantial upswing in the publication count concerning ML in biofilm research over the last decade, underscoring an escalating interest in deploying ML techniques for biofilm investigations. The analysis further disclosed prevalent research themes, predominantly revolving around biofilm formation, prediction, and control. Notably, artificial neural networks and support vector machines emerged as the most frequently employed ML techniques in biofilm research. Overall, our study furnishes valuable insights into prevailing trends and future trajectories within the realm of ML applied to biofilm research. It underscores the significance of collaborative efforts between biofilm researchers and ML experts, advocating for interdisciplinary synergy to propel innovation in this domain.
This study reveals the morphological evolution of a splashing drop by a newly proposed feature extraction method, and a subsequent interpretation of the classification of splashing and non-splashing drops performed by an explainable artificial intelligence (XAI) video classifier. Notably, the values of the weight matrix elements of the XAI that correspond to the extracted features are found to change with the temporal evolution of the drop morphology. We compute the rate of change of the contributions of each frame with respect to the classification value of a video as an importance index to quantify the contributions of the extracted features at different impact times to the classification. Remarkably, the rate computed for the extracted splashing features of ethanol and 1 cSt silicone oil is found to have a peak value at the early impact times, while the extracted features of 5 cSt silicone oil are more obvious at a later time when the lamella is more developed. This study provides an example that clarifies the complex morphological evolution of a splashing drop by interpreting the XAI.
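A toy numpy sketch of the importance-index idea, under the assumption that per-frame contributions come from a weight-matrix row applied to the extracted features; the paper's XAI architecture is more involved, so the shapes and names here are illustrative.

```python
# Per-frame contributions to a video classification score, and an
# importance index as their rate of change over impact time.
import numpy as np

rng = np.random.default_rng(3)
n_frames, n_feat = 30, 16
features = rng.standard_normal((n_frames, n_feat))  # extracted features per frame
w = rng.standard_normal(n_feat)                     # classifier weight-matrix row

contrib = features @ w          # contribution of each frame to the class score
video_score = contrib.sum()     # classification value of the whole video

# Importance index: rate of change of frame contributions with respect to the
# video's classification value (a simple normalized finite difference in time).
importance = np.gradient(contrib) / video_score
peak_frame = int(np.argmax(np.abs(importance)))
print("most influential impact time (frame):", peak_frame)
```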
Anticipating future migration trends is instrumental to the development of effective policies to manage the challenges and opportunities that arise from population movements. However, anticipation is challenging. Migration is a complex system, with multifaceted drivers, such as demographic structure, economic disparities, political instability, and climate change. Measurements encompass inherent uncertainties, and the majority of migration theories are either under-specified or hardly actionable. Moreover, approaches for forecasting generally target specific migration flows, and this poses challenges for generalisation.
In this paper, we present the results of a case study predicting Irregular Border Crossings (IBCs) through the Central Mediterranean Route and asylum requests in Italy. We applied a set of machine learning techniques in combination with a suite of traditional data sources to forecast migration flows. We then applied an ensemble modelling approach that aggregates the results of the different machine learning models to improve predictive capacity.
Our results show the potential of this modelling architecture for producing forecasts of IBCs and asylum requests over a six-month horizon. On a validation set, the explained variance of our models is as high as 80%. This study offers a robust basis for the construction of timely forecasts. In the discussion, we comment on how this approach could benefit migration management in the European Union at various levels of policy making.
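A minimal sketch of the aggregation step, assuming the base learners' forecasts are simply averaged (one common ensemble scheme); the drivers, models, and data below are placeholders.

```python
# Simple-average ensemble of two regressors, scored by explained variance
# on a held-out validation set.
import numpy as np
from sklearn.ensemble import GradientBoostingRegressor, RandomForestRegressor
from sklearn.metrics import explained_variance_score
from sklearn.model_selection import train_test_split

rng = np.random.default_rng(4)
X = rng.standard_normal((300, 8))                            # drivers (economic, political, ...)
y = X[:, 0] * 2 + np.sin(X[:, 1]) + rng.normal(0, 0.3, 300)  # toy monthly IBC counts

X_tr, X_va, y_tr, y_va = train_test_split(X, y, random_state=4)
models = [RandomForestRegressor(random_state=4),
          GradientBoostingRegressor(random_state=4)]
preds_va = np.column_stack([m.fit(X_tr, y_tr).predict(X_va) for m in models])

ensemble = preds_va.mean(axis=1)                 # aggregate the base forecasts
print("explained variance:", explained_variance_score(y_va, ensemble))
```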
Public procurement is a fundamental aspect of public administration. Its vast size makes its oversight and control very challenging, especially in countries where resources for these activities are limited. To support decisions and operations at public procurement oversight agencies, we developed and delivered VigIA, a data-based tool with two main components: (i) machine learning models to detect inefficiencies measured as cost overruns and delivery delays, and (ii) risk indices to detect irregularities in the procurement process. These two components cover complementary aspects of the procurement process, considering both active and passive waste, and help the oversight agencies to prioritize investigations and allocate resources. We show how the models developed shed light on specific features of the contracts to be considered and how their values signal red flags. We also highlight how these values change when the analysis focuses on specific contract types or on information available for early detection. Moreover, the models and indices developed only make use of open data and target variables generated by the procurement processes themselves, making them ideal to support continuous decisions at overseeing agencies.
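As a loose illustration of the risk-index component, the sketch below scores contracts with generic procurement red flags and weights; VigIA's actual flags, weights, and data schema are not given in this abstract, so everything here is assumed.

```python
# Toy weighted red-flag risk index over open contracting data.
import pandas as pd

contracts = pd.DataFrame({
    "contract_id": ["c1", "c2", "c3"],
    "n_bidders": [1, 5, 3],
    "ad_period_days": [2, 30, 15],
    "cost_overrun_pct": [45.0, 0.0, 5.0],
})

flags = pd.DataFrame({
    "single_bidder": contracts["n_bidders"] == 1,
    "short_ad_period": contracts["ad_period_days"] < 7,
    "large_overrun": contracts["cost_overrun_pct"] > 20,
})
weights = {"single_bidder": 0.4, "short_ad_period": 0.3, "large_overrun": 0.3}

contracts["risk_index"] = sum(flags[f] * w for f, w in weights.items())
print(contracts.sort_values("risk_index", ascending=False))  # prioritize reviews
```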
Focusing on methods for data that are ordered in time, this textbook provides a comprehensive guide to analyzing time series data using modern techniques from data science. It is specifically tailored to economics and finance applications, aiming to provide students with rigorous training. Chapters cover Bayesian approaches, nonparametric smoothing methods, machine learning, and continuous time econometrics. Theoretical and empirical exercises, concise summaries, bolded key terms, and illustrative examples are included throughout to reinforce key concepts and bolster understanding. Ancillary materials include an instructor's manual with solutions and additional exercises, PowerPoint lecture slides, and datasets. With its clear and accessible style, this textbook is an essential tool for advanced undergraduate and graduate students in economics, finance, and statistics.
Active flow control based on reinforcement learning has received much attention in recent years. However, the substantial data required for trial-and-error training of reinforcement learning policies has posed a significant impediment to their practical application, and also limits the training of cross-case agents. This study proposes an in-context active flow control policy learning framework grounded in reinforcement learning data. A transformer-based policy improvement operator is set up to model the reinforcement learning process as a causal sequence and to autoregressively generate actions, given a sufficiently long context, on new unseen cases. In flow separation problems, this framework demonstrates the capability to successfully learn and apply efficient flow control strategies across various airfoil configurations. Compared with general reinforcement learning, this learning mode, which requires no updates to the network parameters, is even more efficient. This study presents an effective novel technique that uses a single transformer model to address the active flow control problem of flow separation on different airfoils. Additionally, the study provides an innovative demonstration of incorporating reinforcement-learning-based flow control into aerodynamic shape optimization, leading to a collective enhancement in performance. This method efficiently lessens the training burden of the new flow control policy during shape optimization and opens up a promising avenue for interdisciplinary intelligent co-design of future vehicles.
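A hedged PyTorch sketch of the central idea: a causal transformer that consumes an RL history of (state, action, reward) tokens and emits the next control action with no parameter update at deployment. The dimensions, token layout, and architecture size are assumptions for illustration.

```python
# Causal-transformer policy that maps an in-context RL history to an action.
import torch
import torch.nn as nn

class InContextPolicy(nn.Module):
    def __init__(self, obs_dim=8, act_dim=2, d_model=64, n_layers=2):
        super().__init__()
        self.embed = nn.Linear(obs_dim + act_dim + 1, d_model)  # (s, a, r) tokens
        layer = nn.TransformerEncoderLayer(d_model, nhead=4, batch_first=True)
        self.encoder = nn.TransformerEncoder(layer, n_layers)
        self.head = nn.Linear(d_model, act_dim)

    def forward(self, context):              # context: (B, T, obs+act+1)
        T = context.shape[1]
        # Causal mask: each step attends only to earlier steps.
        mask = torch.triu(torch.full((T, T), float("-inf")), diagonal=1)
        h = self.encoder(self.embed(context), mask=mask)
        return self.head(h[:, -1])           # action for the current step

policy = InContextPolicy()
context = torch.randn(1, 50, 8 + 2 + 1)      # 50 steps of (state, action, reward)
action = policy(context)                     # no gradient update needed at deployment
print(action.shape)                          # torch.Size([1, 2])
```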
When they occur, azimuthal thermoacoustic oscillations can detrimentally affect the safe operation of gas turbines and aeroengines. We develop a real-time digital twin of the azimuthal thermoacoustics of a hydrogen-based annular combustor. The digital twin seamlessly combines two sources of information about the system: (i) a physics-based low-order model; and (ii) raw and sparse experimental data from microphones, which contain both aleatoric noise and turbulent fluctuations. First, we derive a deterministic low-order thermoacoustic model for azimuthal instabilities. Second, we propose a real-time data assimilation framework to infer the acoustic pressure, the physical parameters, and the model bias and measurement shift simultaneously. This is the bias-regularized ensemble Kalman filter, for which we find an analytical solution to the optimization problem. Third, we propose a reservoir computer, which infers both the model bias and the measurement shift to close the assimilation equations. Fourth, we build a real-time digital twin of the azimuthal thermoacoustic dynamics of a laboratory hydrogen-based annular combustor for a variety of equivalence ratios. We find that the real-time digital twin (i) autonomously predicts azimuthal dynamics, in contrast to bias-unregularized methods; (ii) uncovers the physical acoustic pressure from the raw data, i.e. it acts as a physics-based filter; and (iii) is a time-varying parameter system, generalizing existing models that have constant parameters and capture only slowly varying variables. The digital twin generalizes to all equivalence ratios, bridging a gap left by existing models. This work opens new opportunities for real-time digital twinning of multi-physics problems.
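A simplified sketch of the assimilation backbone: a stochastic ensemble Kalman update over an augmented state-and-parameter ensemble. The bias and measurement-shift regularization that define the paper's filter are omitted here for brevity.

```python
# Stochastic ensemble Kalman filter update on an augmented
# state-and-parameter vector; all values are illustrative.
import numpy as np

rng = np.random.default_rng(5)
m, n_obs, N = 6, 2, 64            # state+parameter size, observations, ensemble size
H = np.zeros((n_obs, m)); H[0, 0] = H[1, 1] = 1.0   # observe two pressure states
R = 1e-2 * np.eye(n_obs)          # measurement noise covariance

X = rng.standard_normal((m, N))   # forecast ensemble (columns are members)
y = np.array([0.8, -0.3])         # microphone data at this assimilation step

Xm = X.mean(axis=1, keepdims=True)
A = X - Xm
P = A @ A.T / (N - 1)             # sample covariance
K = P @ H.T @ np.linalg.inv(H @ P @ H.T + R)        # Kalman gain

Y = y[:, None] + rng.multivariate_normal(np.zeros(n_obs), R, N).T  # perturbed obs
Xa = X + K @ (Y - H @ X)          # analysis ensemble: updated states and parameters
print(Xa.mean(axis=1))
```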
The identification of predictors of treatment response is crucial for improving treatment outcome for children with anxiety disorders. Machine learning methods provide opportunities to identify combinations of factors that contribute to risk prediction models.
Methods
A machine learning approach was applied to predict anxiety disorder remission in a large sample of 2114 anxious youth (5–18 years). Potential predictors included demographic, clinical, parental, and treatment variables, with data collected pre-treatment, post-treatment, and at one or more follow-ups.
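A schematic sketch of this modelling step on synthetic data: fit a classifier on pre-treatment predictors and score remission with AUC. The predictor names only echo the variable families above; the data and effect sizes are fabricated.

```python
# Toy remission classifier evaluated with AUC.
import numpy as np
import pandas as pd
from sklearn.ensemble import GradientBoostingClassifier
from sklearn.metrics import roc_auc_score
from sklearn.model_selection import train_test_split

rng = np.random.default_rng(6)
n = 2000
X = pd.DataFrame({
    "age": rng.integers(5, 19, n),
    "n_anxiety_disorders": rng.integers(1, 5, n),
    "comorbid_depression": rng.integers(0, 2, n),
    "parent_anxiety_score": rng.normal(50, 10, n),
    "group_treatment": rng.integers(0, 2, n),
})
# Toy outcome: remission less likely with more disorders and higher parent anxiety.
logit = -0.3 * X["n_anxiety_disorders"] - 0.02 * (X["parent_anxiety_score"] - 50) + 0.8
y = (rng.random(n) < 1 / (1 + np.exp(-logit))).astype(int)

X_tr, X_te, y_tr, y_te = train_test_split(X, y, random_state=6)
clf = GradientBoostingClassifier(random_state=6).fit(X_tr, y_tr)
print("AUC:", roc_auc_score(y_te, clf.predict_proba(X_te)[:, 1]))
```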
Results
All machine learning models performed similarly for remission outcomes, with AUC between 0.67 and 0.69. There was significant alignment between the factors that contributed to the models predicting two target outcomes: remission of all anxiety disorders and the primary anxiety disorder. Children who were older, had multiple anxiety disorders, comorbid depression, comorbid externalising disorders, received group treatment and therapy delivered by a more experienced therapist, and who had a parent with higher anxiety and depression symptoms, were more likely than other children to still meet criteria for anxiety disorders at the completion of therapy. In both models, the absence of a social anxiety disorder and being treated by a therapist with less experience contributed to the model predicting a higher likelihood of remission.
Conclusions
These findings underscore the utility of prediction models that may indicate which children are more likely to remit or are more at risk of non-remission following CBT for childhood anxiety.
This experimental study employs Bayesian optimisation to maximise the cross-flow (transverse) flow-induced vibration (FIV) of an elastically mounted thin elliptical cylinder by implementing axial (or angular) flapping motions. The flapping amplitude was proportional to the vibration amplitude, with a relative phase angle imposed between the angular and transverse displacements of the cylinder. The control parameter space spanned the ranges of proportional gain and phase difference $0 \leqslant K_p^* \leqslant 5$ and $0 \leqslant \phi_d \leqslant 360^\circ$, respectively, over a reduced velocity range of $3.0 \leqslant {U^*} = U/({{f_{nw}}} b) \leqslant 8.5$. The corresponding Reynolds number range was $1250 \leqslant {{Re}} = (U b)/\nu \leqslant 3580$. Here, $U$ is the free stream velocity, $b$ is the major cross-sectional diameter of the cylinder, ${{f_{nw}}}$ is the natural frequency of the system in quiescent fluid (water) and $\nu$ is the kinematic viscosity of the fluid. It was found that the controlled body rotation extended the wake-body synchronisation across the entire ${U^*}$ range tested, with a larger amplitude response than the non-rotating case for all flow speeds. Interestingly, two new wake-body synchronisation regimes were identified, which have not been reported in previous studies. As this geometry acts as a ‘hard-oscillator’ for ${U^*} \geqslant 6.3$, an adaptive gain (i.e. one that varies as a function of oscillation amplitude) was also implemented, allowing the body vibration, achieved for a non-rotating cylinder using increasing ${U^*}$ increments, to be excited from rest. The findings of the present study hold potential implications for the use of FIV as a means to efficiently extract energy from free-flowing water sources, a topic of increasing interest over the last decade.
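A sketch of the optimisation loop using scikit-optimize, with a stand-in analytic objective in place of the measured vibration amplitude; in the experiment each evaluation corresponds to a water-channel run at the commanded gain and phase.

```python
# Bayesian optimisation over the two control parameters (K_p*, phi_d).
import numpy as np
from skopt import gp_minimize

def neg_amplitude(params):
    """Placeholder for the measured cross-flow vibration amplitude A*(K_p*, phi_d)."""
    kp, phi_d = params
    return -(kp * np.exp(-0.5 * (kp - 3) ** 2)
             * (1 + np.cos(np.radians(phi_d - 200))))

res = gp_minimize(
    neg_amplitude,                            # minimise the negative amplitude
    dimensions=[(0.0, 5.0), (0.0, 360.0)],    # K_p* and phi_d (degrees)
    n_calls=40,
    random_state=7,
)
print("best gain K_p*, phase phi_d:", res.x, "amplitude:", -res.fun)
```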
The performance of, and confidence in, fault detection and diagnostic systems can be undermined by data pipelines that feature multiple compounding sources of uncertainty. These issues further inhibit the deployment of data-based analytics in industry, where variable data quality and lack of confidence in model outputs are already barriers to their adoption. The methodology proposed in this paper supports trustworthy data pipeline design and transfers knowledge gained from one fully observed data pipeline to a similar, under-observed case. The transfer of uncertainties provides insight into uncertainty drivers without repeating the computational or cost overhead of fully redesigning the pipeline. A SHAP-based, human-readable explainable AI (XAI) framework was used to rank and explain the impact of each choice in a data pipeline, allowing positive and negative performance drivers to be decoupled and facilitating the selection of highly performing pipelines. This empirical approach is demonstrated in bearing fault classification case studies using well-understood open-source data.
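A hedged sketch of the SHAP-based ranking: encode each pipeline design choice as a feature of a model that predicts pipeline performance, then rank the choices by mean absolute SHAP value. The choice names and data are invented.

```python
# Rank pipeline design choices by mean |SHAP| value.
import numpy as np
import pandas as pd
import shap
from sklearn.ensemble import RandomForestRegressor

rng = np.random.default_rng(8)
n = 400
pipelines = pd.DataFrame({
    "window_length": rng.choice([256, 512, 1024], n),
    "use_envelope_spectrum": rng.integers(0, 2, n),
    "n_features_selected": rng.integers(5, 50, n),
})
# Toy performance: envelope spectrum helps, feature count helps slightly.
f1 = (0.5 + 0.3 * pipelines["use_envelope_spectrum"]
      + 0.0005 * pipelines["n_features_selected"]
      + rng.normal(0, 0.05, n)).clip(0, 1)

model = RandomForestRegressor(random_state=8).fit(pipelines, f1)
shap_values = shap.TreeExplainer(model).shap_values(pipelines)
ranking = pd.Series(np.abs(shap_values).mean(axis=0), index=pipelines.columns)
print(ranking.sort_values(ascending=False))  # impact of each pipeline choice
```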
Modern machine-learning techniques are generally considered data-hungry. However, this may not be the case for turbulence, as each of its snapshots can hold more information than a single data file in general machine-learning settings. This study asks whether nonlinear machine-learning techniques can effectively extract physical insights from as little as a single snapshot of turbulent flow. As an example, we consider machine-learning-based super-resolution analysis, which reconstructs a high-resolution field from low-resolution data, for two examples: two-dimensional isotropic turbulence and three-dimensional turbulent channel flow. First, we reveal that a carefully designed machine-learning model trained with flow tiles sampled from only a single snapshot can reconstruct vortical structures across a range of Reynolds numbers for two-dimensional decaying turbulence. Successful flow reconstruction indicates that nonlinear machine-learning techniques can leverage scale-invariance properties to learn turbulent flows. We also show that training data for turbulent flows can be cleverly collected from a single snapshot by considering the characteristics of the rotation and shear tensors. Second, we perform the single-snapshot super-resolution analysis for turbulent channel flow, showing that it is possible to extract physical insights from a single flow snapshot even in the presence of inhomogeneity. The present findings suggest that embedding prior knowledge in designing a model and collecting data is important for a range of data-driven analyses of turbulent flows. More broadly, this work hopes to stop machine-learning practitioners from being wasteful with turbulent flow data.
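A small sketch of how a training set can be harvested from one snapshot: sample many tiles from a single 2-D field and pair each high-resolution tile with an average-pooled low-resolution input. The snapshot here is synthetic.

```python
# Build (low-res, high-res) training pairs from a single 2-D snapshot.
import numpy as np

rng = np.random.default_rng(9)
snapshot = rng.standard_normal((512, 512))   # stand-in for one vorticity field

def sample_pairs(field, n_tiles=1000, tile=32, factor=4):
    hi, lo = [], []
    for _ in range(n_tiles):
        i = rng.integers(0, field.shape[0] - tile)
        j = rng.integers(0, field.shape[1] - tile)
        t = field[i:i + tile, j:j + tile]
        hi.append(t)
        # Low-resolution input: average pooling by the downsampling factor.
        lo.append(t.reshape(tile // factor, factor,
                            tile // factor, factor).mean(axis=(1, 3)))
    return np.stack(lo), np.stack(hi)

X_lo, X_hi = sample_pairs(snapshot)
print(X_lo.shape, X_hi.shape)   # (1000, 8, 8) (1000, 32, 32)
```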
Psychotic disorders are characterized by abnormalities in the synchronization of neuronal responses. A 40 Hz gamma band deficit during the auditory steady-state response (ASSR) measured by electroencephalogram (EEG) is a robust observation in psychosis and is associated with symptoms and functional deficits. However, the majority of ASSR studies focus on specific electrode sites, while whole-scalp analyses using all channels, and their association with clinical symptoms, are rare.
Methods:
In this study, we use whole-scalp 40 Hz ASSR EEG measurements—power and phase-locking factor—to establish deficits in early-stage psychosis (ESP) subjects, classify ESP status using an ensemble of machine learning techniques, identify correlates with principal components obtained from clinical/demographic/functioning variables, and correlate these measures with functional outcome after a short-term follow-up.
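For concreteness, here is a sketch of the two per-channel measures on simulated epochs: 40 Hz evoked power and the phase-locking factor (inter-trial phase coherence) computed from the FFT of each trial.

```python
# Power and phase-locking factor at 40 Hz from epoched EEG (simulated).
import numpy as np

rng = np.random.default_rng(10)
fs, n_trials, n_samples = 500, 100, 1000        # 2 s epochs at 500 Hz
t = np.arange(n_samples) / fs
# Simulated channel: 40 Hz response with jittered phase plus noise.
phases = rng.normal(0, 0.6, n_trials)
trials = np.array([np.sin(2 * np.pi * 40 * t + p) for p in phases])
trials += rng.standard_normal(trials.shape)

spectrum = np.fft.rfft(trials, axis=1)
freqs = np.fft.rfftfreq(n_samples, 1 / fs)
k = np.argmin(np.abs(freqs - 40.0))             # 40 Hz frequency bin

power = np.mean(np.abs(spectrum[:, k]) ** 2)    # mean 40 Hz power across trials
plf = np.abs(np.mean(spectrum[:, k] / np.abs(spectrum[:, k])))  # in [0, 1]
print(f"power={power:.1f}, phase-locking factor={plf:.2f}")
```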
Results:
We identified significant spatially distributed group-level differences for power and phase locking. The performance of different machine learning techniques, and interpretation of the extracted feature importance, indicate that phase locking has a more predictive and parsimonious pattern than power. Phase locking is also associated with principal components composed of measures of cognitive processes. Short-term functional outcome is associated with baseline 40 Hz ASSR signals from the FCz and other channels, in both phase locking and power.
Conclusion:
This whole-scalp EEG study provides additional evidence linking deficits in 40 Hz ASSRs with cognition and functioning in ESP, and corroborates prior studies of phase locking from a subset of EEG channels. The confirmed 40 Hz ASSR deficit serves as a candidate phenotype for identifying circuit dysfunction and as a biomarker for clinical outcomes in psychosis.
Turbulent flows are chaotic and multi-scale dynamical systems, which have large numbers of degrees of freedom. Turbulent flows, however, can be modeled with a smaller number of degrees of freedom when using an appropriate coordinate system, which is the goal of dimensionality reduction via nonlinear autoencoders. Autoencoders are expressive tools, but they are difficult to interpret. This article proposes a method to aid the interpretability of autoencoders. First, we introduce the decoder decomposition, a post-processing method to connect the latent variables to the coherent structures of flows. Second, we apply the decoder decomposition to analyze the latent space of synthetic data of a two-dimensional unsteady wake past a cylinder. We find that the dimension of the latent space has a significant impact on the interpretability of autoencoders. We identify the physical and spurious latent variables. Third, we apply the decoder decomposition to the latent space of wind-tunnel experimental data of a three-dimensional turbulent wake past a bluff body. We show that the reconstruction error is a function of both the latent space dimension and the decoder size, which are correlated. Finally, we apply the decoder decomposition to rank and select latent variables based on the coherent structures that they represent. This is useful to filter unwanted or spurious latent variables or to pinpoint specific coherent structures of interest. The ability to rank and select latent variables will help users design and interpret nonlinear autoencoders.
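A schematic numpy version of the decoder-decomposition idea with a linear stand-in decoder: decode each latent variable in isolation and rank the latents by the energy of the structure they produce. For the trained nonlinear decoders in the paper the procedure is analogous, and the names here are illustrative.

```python
# Rank latent variables by the energy of their individual decoder modes.
import numpy as np

rng = np.random.default_rng(11)
n_latent, n_grid = 4, 1024
D = rng.standard_normal((n_grid, n_latent))   # stand-in for the trained decoder
z = rng.standard_normal(n_latent)             # latent vector for one snapshot

# Decode one latent at a time (others zeroed) to get its "decoder mode".
modes = [D @ (z * np.eye(n_latent)[i]) for i in range(n_latent)]
energy = np.array([np.sum(m ** 2) for m in modes])

ranking = np.argsort(energy)[::-1]
print("latent variables ranked by contribution:", ranking)
# Low-energy latents are candidates for spurious variables to filter out.
```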
Machine learning is increasingly being utilised across various domains of nutrition research due to its ability to analyse complex data, especially as large datasets become more readily available. However, at times, this enthusiasm has led to the adoption of machine learning techniques prior to a proper understanding of how they should be applied, leading to non-robust study designs and results of questionable validity. To ensure that research standards do not suffer, key machine learning concepts must be understood by the research community. The aim of this review is to facilitate a better understanding of machine learning in research by outlining good practices and common pitfalls in each of the steps in the machine learning process. Key themes include the importance of generating high-quality data, employing robust validation techniques, quantifying the stability of results, accurately interpreting machine learning outputs, adequately describing methodologies, and ensuring transparency when reporting findings. Achieving this aim will facilitate the implementation of robust machine learning methodologies, which will reduce false findings and make research more reliable, as well as enable researchers to critically evaluate and better interpret the findings of others using machine learning in their work.
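As one concrete example of the good practices discussed, the sketch below uses nested cross-validation so that hyperparameter tuning never leaks into the performance estimate; the dataset and model are placeholders.

```python
# Nested cross-validation: tuning in the inner loop, evaluation in the outer.
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import GridSearchCV, cross_val_score

X, y = make_classification(n_samples=300, n_features=20, random_state=12)

inner = GridSearchCV(                          # tuning happens inside each fold
    RandomForestClassifier(random_state=12),
    param_grid={"max_depth": [3, 5, None]},
    cv=3,
)
scores = cross_val_score(inner, X, y, cv=5)    # outer loop estimates performance
print(f"nested CV accuracy: {scores.mean():.2f} +/- {scores.std():.2f}")
```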
We propose an unsupervised, corpus-independent method to extract keywords from a single text. It is based on the spatial distribution of words and the response of this distribution to a random permutation of words. Our method has three advantages over existing unsupervised methods (such as YAKE). First, it is significantly more effective at extracting keywords from long texts in terms of precision and recall. Second, it allows inference of two types of keywords: local and global. Third, it extracts basic topics from texts. Additionally, our method is language-independent and applies to short texts. The results are obtained via human annotators with previous knowledge of texts from our database of classical literary works. The agreement between annotators is moderate to substantial. Our results are supported via human-independent arguments based on the average length of extracted content words and on the average number of nouns in extracted words. We discuss relations of keywords with higher-order textual features and reveal a connection between keywords and chapter divisions.
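A sketch of the core mechanism on toy text: score each word by how strongly its occurrences cluster relative to random permutations of the text. The specific statistic below (normalized gap deviation minus its permutation baseline) is a common choice and an assumption, not necessarily the paper's exact formula.

```python
# Keyword scoring from the spatial distribution of word occurrences,
# benchmarked against random permutations of the text.
import numpy as np

def gap_sigma(positions):
    """Std/mean of gaps between successive occurrences (clustering measure)."""
    gaps = np.diff(np.sort(positions))
    return gaps.std() / gaps.mean() if len(gaps) > 1 and gaps.mean() > 0 else 0.0

def keyword_scores(tokens, min_count=5, n_perm=20, seed=13):
    rng = np.random.default_rng(seed)
    tokens = np.array(tokens)
    scores = {}
    for w in set(tokens.tolist()):
        pos = np.flatnonzero(tokens == w)
        if len(pos) < min_count:
            continue
        observed = gap_sigma(pos)
        # Null model: the same word count scattered by random permutation.
        null = [gap_sigma(rng.choice(len(tokens), len(pos), replace=False))
                for _ in range(n_perm)]
        scores[w] = observed - float(np.mean(null))  # excess clustering
    return sorted(scores.items(), key=lambda kv: kv[1], reverse=True)

text = ("the whale surfaced and the crew watched the whale dive "
        "while the sea was calm and the whale was gone") * 5
print(keyword_scores(text.split())[:3])
```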