Hostname: page-component-7b9c58cd5d-dlb68 Total loading time: 0 Render date: 2025-03-14T06:19:04.064Z Has data issue: false hasContentIssue false

OpenForest: a data catalog for machine learning in forest monitoring

Published online by Cambridge University Press:  27 February 2025

Arthur Ouaknine*
Affiliation:
School of Computer Science, McGill University, Montréal, QC, Canada Mila, Quebec AI Institute, Montréal, QC, Canada
Teja Kattenborn
Affiliation:
Remote Sensing Centre for Earth System Research, Leipzig University, Leipzig, Germany German Centre for Integrative Biodiversity Research (iDiv), Halle-Jena-Leipzig, Germany
Etienne Laliberté
Affiliation:
Institut de recherche en biologie végétale, Département de sciences biologiques, Université de Montréal, Montréal, QC, Canada
David Rolnick
Affiliation:
School of Computer Science, McGill University, Montréal, QC, Canada Mila, Quebec AI Institute, Montréal, QC, Canada
*
Corresponding author: Arthur Ouaknine; Email: [email protected]

Abstract

Forests play a crucial role in the Earth’s system processes and provide a suite of social and economic ecosystem services, but are significantly impacted by human activities, leading to a pronounced disruption of the equilibrium within ecosystems. Advancing forest monitoring worldwide offers advantages in mitigating human impacts and enhancing our comprehension of forest composition, alongside the effects of climate change. While statistical modeling has traditionally found applications in forest biology, recent strides in machine learning and computer vision have reached important milestones using remote sensing data, such as tree species identification, tree crown segmentation, and forest biomass assessments. For this, the significance of open-access data remains essential in enhancing such data-driven algorithms and methodologies. Here, we provide a comprehensive and extensive overview of 86 open-access forest datasets across spatial scales, encompassing inventories, ground-based, aerial-based, satellite-based recordings, and country or world maps. These datasets are grouped in OpenForest, a dynamic catalog open to contributions that strives to reference all available open-access forest datasets. Moreover, in the context of these datasets, we aim to inspire research in machine learning applied to forest biology by establishing connections between contemporary topics, perspectives, and challenges inherent in both domains. We hope to encourage collaborations among scientists, fostering the sharing and exploration of diverse datasets through the application of machine learning methods for large-scale forest monitoring. OpenForest is available at the following url: https://github.com/RolnickLab/OpenForest.

Type
Data Paper
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2025. Published by Cambridge University Press

Impact Statement

OpenForest establishes a constantly evolving catalog of open-access forest datasets. This catalog is open for contributions and aims to provide a single central hub for such datasets within the open-source community. In addition to introducing the OpenForest catalog, we provide in this article a detailed overview of complementary research topics and challenges in forest monitoring and machine learning, so as to better enable the impactful use of these datasets in interdisciplinary research. We hope this work will ultimately contribute substantially to enhancing our comprehension of global forest composition as well the development of innovative machine learning algorithms.

1. Introduction

Forests cover one third of the Earth’s land surface (The Food and Agriculture Organization of the United Nations, 2020). They provide a range of valuable ecosystem goods and services to humanity, including timber provision, water and climate regulation, and atmospheric carbon sequestration (Millennium Ecosystem Assessment, 2001; Bonan, Reference Bonan2008). They also serve as habitat for a myriad of plant, animal, and microbial species. However, human activities have had, and continue to have, a major impact on forests worldwide.

More than 3,000 ha of forests disappear every hour from deforestation (Hansen et al., Reference Hansen, Potapov, Moore, Hancher, Turubanova, Tyukavina, Thau, Stehman, Goetz, Loveland, Kommareddy, Egorov, Chini, Justice and Townshend2013; The Food and Agriculture Organization of the United Nations, 2020). Yet forests are also increasingly recognized as natural solutions to the joint climate and biodiversity crises (Griscom et al., Reference Griscom, Adams, Ellis, Houghton, Lomax, Miteva, Schlesinger, Shoch, Siikamäki, Smith, Woodbury, Zganjar, Blackman, Campari, Conant, Delgado, Elias, Gopalakrishna, Hamsik, Herrero, Kiesecker, Landis, Laestadius, Leavitt, Minnemeyer, Polasky, Potapov, Putz, Sanderman, Silvius, Wollenberg and Fargione2017, Reference Griscom, Busch, Cook-Patton, Ellis, Funk, Leavitt, Lomax, Turner, Chapman, Engelmann, Gurwick, Landis, Lawrence, Malhi, Schindler Murray, Navarrete, Roe, Scull, Smith, Streck, Walker and Worthington2020; Drever et al., Reference Drever, Cook-Patton, Akhter, Badiou, Chmura, Davidson, Desjardins, Dyk, Fargione, Fellows, Filewod, Hessing-Lewis, Jayasundara, Keeton, Kroeger, Lark, Le, Leavitt, LeClerc, Lemprière, Metsaranta, McConkey, Neilson, St-Laurent, Puric-Mladenovic, Rodrigue, Soolanayakanahally, Spawn, Strack, Smyth, Thevathasan, Voicu, Williams, Woodbury, Worth, Xu, Yeo and Kurz2021). Forest-based adaptation through avoided forest conversion, improved forest management, and forest restoration could mitigate over 2 Gt CO2-eq emissions per year by 2030 (Intergovernmental Panel On Climate Change (IPCC), 2023), with variations observed in different regions worldwide (Griscom et al., Reference Griscom, Adams, Ellis, Houghton, Lomax, Miteva, Schlesinger, Shoch, Siikamäki, Smith, Woodbury, Zganjar, Blackman, Campari, Conant, Delgado, Elias, Gopalakrishna, Hamsik, Herrero, Kiesecker, Landis, Laestadius, Leavitt, Minnemeyer, Polasky, Potapov, Putz, Sanderman, Silvius, Wollenberg and Fargione2017, Reference Griscom, Busch, Cook-Patton, Ellis, Funk, Leavitt, Lomax, Turner, Chapman, Engelmann, Gurwick, Landis, Lawrence, Malhi, Schindler Murray, Navarrete, Roe, Scull, Smith, Streck, Walker and Worthington2020; Bastin et al., Reference Bastin, Finegold, Garcia, Mollicone, Rezende, Routh, Zohner and Crowther2019; Busch et al., Reference Busch, Engelmann, Cook-Patton, Griscom, Kroeger, Possingham and Shyamsundar2019; Drever et al., Reference Drever, Cook-Patton, Akhter, Badiou, Chmura, Davidson, Desjardins, Dyk, Fargione, Fellows, Filewod, Hessing-Lewis, Jayasundara, Keeton, Kroeger, Lark, Le, Leavitt, LeClerc, Lemprière, Metsaranta, McConkey, Neilson, St-Laurent, Puric-Mladenovic, Rodrigue, Soolanayakanahally, Spawn, Strack, Smyth, Thevathasan, Voicu, Williams, Woodbury, Worth, Xu, Yeo and Kurz2021), while being limited by the climatic effects we are witnessing on forests (Zhu et al., Reference Zhu, Zhang, Niu, Chu and Luo2018).

Due to their significant economic and ecological importance, monitoring forests has attracted considerable attention. It includes the assessment of ecosystem functional properties, as well as the evaluation of forest health, vitality, stress, and diseases (see Section 2.1). However, monitoring forests presents significant challenges, especially using field-based approaches (see Section 2.2). Forests cover huge areas and can be difficult to access. Consequently, remote sensing has and continues to play an important role in forest monitoring worldwide. Nowadays, a wide array of remote sensing platforms and sensors to monitor forests are available and being used. This includes platforms such as drones (also referred to as unoccupied aerial vehicles [UAVs]), airplanes, or satellites, and sensors ranging from passive optical imagery, to active methods such as light detection and ranging (LiDAR) or synthetic aperture RADAR (SAR) (Verrelst et al., Reference Verrelst, Camps-Valls, Muñoz Marí, Rivera, Veroustraete, Clevers and Moreno2015; White et al., Reference White, Coops, Wulder, Vastaranta, Hilker and Tompalski2016).

In recent years, the applications of remote sensing data for the Earth-related purposes (Ma et al., Reference Ma, Liu, Zhang, Ye, Yin and Johnson2019; Camps-Valls et al., Reference Camps-Valls, Tuia, Zhu and Reichstein2021), such as forest monitoring (Fassnacht et al., Reference Fassnacht, Latifi, Stereńczak, Modzelewska, Lefsky, Waser, Straub and Ghosh2016; Diez et al., Reference Diez, Kentsch, Fukuda, Caceres, Moritake and Cabezas2021; Kattenborn et al., Reference Kattenborn, Leitloff, Schiefer and Hinz2021; Michalowska and Rapinski, Reference Michalowska and Rapinski2021), have gained momentum due to the adoption of machine learning methods and algorithms. It has been inspired by continuous improvements in the performance of deep learning models used in computer vision challenges in the past decade (Deng et al., Reference Deng, Dong, Socher, Li, Li and Fei-Fei2009; Lin et al., Reference Lin, Maire, Belongie, Hays, Perona, Ramanan, Dollár and Zitnick2014; Everingham et al., Reference Everingham, Eslami, Van Gool, Williams, Winn and Zisserman2015). Recent advances in deep learning model architectures have enabled the integration of remote sensing data from various sensors and resolutions—spatial, temporal, or spectral—which presents promising opportunities to enhance forest monitoring practices (Cong et al., Reference Cong, Khanna, Meng, Liu, Rozi, He, Burke, Lobell and Ermon2022; Rahaman et al., Reference Rahaman, Weiss, Träuble, Locatello, Lacoste, Bengio, Pal, Li and Schölkopf2022; Reed et al., Reference Reed, Gupta, Li, Brockman, Funk, Clipp, Funk, Candido, Uyttendaele and Darrell2022; Tseng et al., Reference Tseng, Zvonkov, Purohit, Rolnick and Kerner2023).

Numerous machine learning challenges related to forest monitoring have yet to be explored (see Figure 1), and addressing them will require diverse and large forest datasets (Liang and Gamarra, Reference Liang and Gamarra2020). While there is a wealth of remote sensing data that is freely available, these data can be difficult to access because they involve a wide variety of sensory modalities, geographies, and tasks and are spread out across many repositories. To the best of our knowledge, no comprehensive, central repository of open-access forest datasets currently exists, a gap which we fill here with OpenForest (https://github.com/RolnickLab/OpenForest). The OpenForest catalog is designed to simplify the process of accessing and highlight forest monitoring datasets for researchers in the field of machine learning and forest biology, thereby accelerating progress in these domains.

Figure 1. Overview of forest monitoring topics and challenges associated with machine learning perspectives and challenges. Note: Each forest monitoring topic and its challenges are detailed with their corresponding section number (in red). They are associated with the three main machine learning perspectives and challenge categories, namely generalization, limited data, and domain-specific objectives, along with their corresponding section number (in red).

In this article, we present the existing biological topics and challenges related to forest monitoring that scientists are currently investigating (Section 2) and which could be of interest of machine learning practitioners. Additionally, we briefly introduce several machine learning research topics, exploring their potential applications in addressing biology-related challenges (Section 3). These applications hold promise in assisting ecologists and biologists in their work. Moreover, we conduct a thorough review of open-access forest datasets across different spatial scales (Figure 2) to support both machine learning and biology research communities in their work (Section 4). Finally, we provide perspectives on the space of machine learning applications with forest datasets (Section 5).

Figure 2. Illustration of forest monitoring datasets at different scales. Note: Inventories are in situ measurements realized at the tree level. Ground-based datasets are recorded within or below the canopy of the trees. Aerial datasets are composed of recordings from sensors mounted on unoccupied (drones) or occupied aircrafts. Satellite datasets are collected from sensors mounted on satellites orbiting the Earth. Map datasets are generated at the country or world level using datasets at the aerial or satellite scales.

2. Forest monitoring: current topics and challenges

Forest monitoring is an empirical science that is increasingly based on data-driven machine learning methods and, as such, benefits by improved data access through open data (Wulder et al., Reference Wulder, Masek, Cohen, Loveland and Woodcock2012; De Lima et al., Reference De Lima, Phillips, Duque, Tello, Davies, De Oliveira, Muller, Honorio Coronado, Vilanova, Cuni-Sanchez, Baker, Ryan, Malizia, Lewis, Ter Steege, Ferreira, Marimon, Luu, Imani, Arroyo, Blundo, Kenfack, Sainge, Sonké and Vásquez2022). In particular, deep learning algorithms are widely recognized for their strong performance in diverse tasks, but their successful application often relies on large datasets to unleash their performance and enhance their generalization potential. This section seeks to emphasize the importance of open-access forest datasets for two primary purposes: first, to facilitate a more comprehensive exploration of current topics in the context of forest monitoring (Section 2.1); and second, to better assess forest monitoring-related challenges (Section 2.2), particularly for machine learning practitioners.

2.1. Forest monitoring topics

Considering the significance of forests both economically and ecologically, forest monitoring encompasses a range of trackable forest attributes. Each of them can be sensed with different sensors, platforms, and across different scales. The forest attribute itself (such as the regression of a biochemical property or the detection of tree individuals), together with the structure of the remotely sensed signals, collectively determines the appropriate machine learning algorithms to be employed. These aspects are succinctly discussed in the following sections.

2.1.1. Forest extent and forest type mapping

Tracking the extent of forests is crucial to understand the spatial distribution of forest resources, ecosystem services, and assess the role of forest in land surface dynamics (Keenan et al., Reference Keenan, Reams, Achard, de Freitas, Grainger and Lindquist2015). Thereby, forest can be classified to different management, functional, or ecosystems types (e.g., coniferous, deciduous forests) (Buchhorn et al., Reference Buchhorn, Lesiv, Tsendbazar, Herold, Bertels and Smets2020; Zhang et al., Reference Zhang, Liu, Chen, Gao, Xie and Mi2021). In this regard, Earth observation data from long-term satellite missions (e.g., Landsat or Sentinel described in Section 4.4) enable to track forest extent dynamics across past decades (Hansen et al., Reference Hansen, Potapov, Moore, Hancher, Turubanova, Tyukavina, Thau, Stehman, Goetz, Loveland, Kommareddy, Egorov, Chini, Justice and Townshend2013), enabling to assess conservation efforts and anthropogenic land cover change, such as deforestation for agricultural expansion (Curtis et al., Reference Curtis, Slay, Harris, Tyukavina and Hansen2018).

2.1.2. Tree species mapping

A fine-scaled representation of forest stands in terms of their species composition is relevant for forestry (e.g., species-specific timber supply), biogeographical assessments (e.g., climate change-induced shifts in species distributions), or biodiversity monitoring (Fassnacht et al., Reference Fassnacht, Latifi, Stereńczak, Modzelewska, Lefsky, Waser, Straub and Ghosh2016; Wang and Gamon, Reference Wang and Gamon2019; Cavender-Bares et al., Reference Cavender-Bares, Gamon and Townsend2020). Recent developments in machine learning greatly advanced the identification of tree species in high-resolution data (e.g., imagery or LiDAR point clouds from drones and airplanes) using semantic and instance segmentation methods (Schiefer et al., Reference Schiefer, Kattenborn, Frick, Frey, Schall, Koch and Schmidtlein2020; Li et al., Reference Li, Chai, Wang, Lei and Zhang2022a; Cloutier et al., Reference Cloutier, Germain and Laliberté2023). At large spatial scales, Earth observation satellite data, providing coarser spatial but high temporal and spectral resolutions, enable accurate assessments of tree species distributions using spatiotemporal machine learning methods (Ienco et al., Reference Ienco, Interdonato, Gaetano and Minh2019; Bolyn et al., Reference Bolyn, Lejeune, Michez and Latte2022).

2.1.3. Biomass quantification

Forests provide cardinal ecosystem services through their provision of timber and their role as sinks in the terrestrial carbon cycle (Regnier et al., Reference Regnier, Resplandy, Najjar and Ciais2022). Tree biomass is primarily a product of the wood volume and density, while both properties are challenging to obtain from remote sensing data. Accurate biomass estimates of individuals trees can be obtained from close-range 3D representations acquired from terrestrial or drone-based LiDAR systems (Brede et al., Reference Brede, Terryn, Barbier, Bartholomeus, Bartolo, Calders, Derroire, Moorthy, Lau, Levick, Raumonen, Verbeeck, Wang, Whiteside, van der Zee and Herold2022). More indirectly related information on crown and canopy structure derived from airborne or spaceborne LiDAR and SAR data can be used to estimate biomass at the stand scale (Le Toan et al., Reference Le Toan, Quegan, Davidson, Balzter, Paillou, Papathanassiou, Plummer, Rocca, Saatchi, Shugart and Ulander2011; Lu et al., Reference Lu, Chen, Wang, Liu, Li and Moran2016). Some studies have indicated the value of passive optical data from satellites, since forest biomass is partially correlated with foliage density (Besnard et al., Reference Besnard, Koirala, Santoro, Weber, Nelson, Gütter, Herault, Kassi, N’Guessan, Neigh, Poulter, Zhang and Carvalhais2021; Potapov et al., Reference Potapov, Li, Hernandez-Serna, Tyukavina, Hansen, Kommareddy, Pickens, Turubanova, Tang, Silva, Armston, Dubayah, Bryan Blair and Hofton2021). Given that precise large-scale biomass distributions cannot be directly revealed through a single remote sensing modality alone, deep learning may play a crucial role to simultaneously exploit the suite and high dimensionality of available data modalities (Yang et al., Reference Yang, Liang and Zhang2020).

2.1.4. Forest health, disturbance, and mortality

In many regions, forest ecosystems are under pressure as globalization facilitates the introduction of exotic pests and pathogens, climate change exceeds the resilience and resistance of trees (Hartmann et al., Reference Hartmann, Bastos, Das, Esquivel-Muelbert, Hammond, Martínez-Vilalta, McDowell, Powers, Pugh, Ruthrof and Allen2022), while nutrient and water cycles are affected by anthropogenic activities (Steffen et al., Reference Steffen, Sanderson, Tyson, Jäger, Matson, Moore, Oldfield, Richardson, Schellnhuber, Turner and Wasson2005; Trumbore et al., Reference Trumbore, Brando and Hartmann2015). A decline in tree health, for example, due to pathogen infestations or shortages of water and nutrients, can lead to a variety of symptoms, such as changing concentrations of multiple biochemical tissue properties (e.g., pigments, carbohydrates, and water content), which in turn can be sensed through multispectral or hyperspectral reflectance (Zarco-Tejada et al., Reference Zarco-Tejada, Camino, Beck, Calderon, Hornero, Hernández-Clemente, Kattenborn, Montes-Borrego, Susca, Morelli, Gonzalez-Dugo, PRJ, Landa, Boscia, Saponari and Navas-Cortes2018; Zarco-Tejada et al., Reference Zarco-Tejada, Hornero, Beck, Kattenborn, Kempeneers and Hernández-Clemente2019). In this context, deep learning algorithms are very promising, due to their capability to exploit high-dimensional data (e.g., hyperspectral) and to translate it into a suite of foliage properties relevant to vegetation health (Cherif et al., Reference Cherif, Feilhauer, Berger, Dao, Ewald, Hank, He, Kovach, Lu, Townsend and Kattenborn2023). An interconnected topic are globally increased rates of forest mortality (Allen et al., Reference Allen, Macalady, Chenchouni, Bachelet, McDowell, Vennetier, Kitzberger, Rigling, Breshears, Hogg, Gonzalez, Fensham, Zhang, Castro, Demidova, Lim, Allard, Running, Semerci and Cobb2010; Hartmann et al., Reference Hartmann, Bastos, Das, Esquivel-Muelbert, Hammond, Martínez-Vilalta, McDowell, Powers, Pugh, Ruthrof and Allen2022). In this context, a wealth of approaches was successfully employed at local scales, such as the detection of dead trees via semantic or instance segmentation techniques (Sani-Mohammed et al., Reference Sani-Mohammed, Yao and Heurich2022; Cloutier et al., Reference Cloutier, Germain and Laliberté2023), or at large scales, such as the regression of annual cover of dead tree crowns in satellite image pixels with deep learning-based time series analysis (Schiefer et al., Reference Schiefer, Schmidtlein, Frick, Frey, Klinke, Zielewska-Büttner, Junttila, Uhl and Kattenborn2023).

2.1.5. Biophysical traits and functional ecosystem properties

With accelerated biodiversity decline and environmental change, understanding functional properties, their diversity across stands and landscapes, as well as their phenology (temporal dynamics), is essential to assess the resilience and resistance of ecosystems (Thompson et al., Reference Thompson, Mackey, McNulty and Mosseler2009; Sakschewski et al., Reference Sakschewski, Von Bloh, Boit, Poorter, Peña Claros, Heinke, Joshi and Thonicke2016). Given that trees through evolution developed different strategies to interact with light, their appearance studied with optical remote sensing signals can inform on a variety of functional traits, such as the foliage density, date of green up, or contents of different pigments, and carbohydrates (Schneider et al., Reference Schneider, Morsdorf, Schmid, Petchey, Hueni, Schimel and Schaepman2017; Cherif et al., Reference Cherif, Feilhauer, Berger, Dao, Ewald, Hank, He, Kovach, Lu, Townsend and Kattenborn2023). Such functional traits determine the configuration of an ecosystem and thereby modulate functional ecosystem processes (Migliavacca et al., Reference Migliavacca, Musavi, Mahecha, Nelson, Knauer, Baldocchi, Perez-Priego, Christiansen, Peters, Anderson, Bahn, Andrew Black, Blanken, Bonal, Buchmann, Caldararu, Carrara, Carvalhais, Cescatti, Chen, Cleverly, Cremonese, Desai, El-Madany, Farella, Fernández-Martínez, Filippa, Forkel, Galvagno, Gomarasca, Gough, Göckede, Ibrom, Ikawa, Janssens, Jung, Kattge, Keenan, Knohl, Kobayashi, Kraemer, Law, Liddell, Ma, Mammarella, Martini, Macfarlane, Matteucci, Montagnani, Pabon-Moreno, Panigada, Papale, Pendall, Penuelas, Phillips, Reich, Rossini, Rotenberg, Scott, Stahl, Weber, Wohlfahrt, Wolf, Wright, Yakir, Zaehle and Reichstein2021; Gomarasca et al., Reference Gomarasca, Migliavacca, Kattge, Nelson, Niinemets, Wirth, Cescatti, Bahn, Nair, Acosta, Altaf Arain, Beloiu, Andrew Black, Bruun, Bucher, Buchmann, Byun, Carrara, Conte, da Silva, Duveiller, Fares, Ibrom, Knohl, Komac, Limousin, Lusk, Mahecha, Martini, Minden, Montagnani, Mori, Onoda, Peñuelas, Perez-Priego, Poschlod, Powell, Reich, Šigut, van Bodegom, Walther, Wohlfahrt, Wright and Reichstein2023), that is, fluxes of energy and matter between the terrestrial biosphere, pedosphere, hydrosphere, and atmosphere, including carbon, evapotranspiration, latent, and sensible heat. Due to the cardinal importance of these fluxes in the Earth system, considerable efforts have been made to monitor them using a ground-based sensor network of flux towers (e.g., FLUXNET) (Baldocchi et al., Reference Baldocchi, Falge, Gu, Olson, Hollinger, Running, Anthoni, Bernhofer, Davis, Evans, Fuentes, Goldstein, Katul, Law, Lee, Malhi, Meyers, Munger, Oechel, Paw U, Pilegaard, Schmid, Valentini, Verma, Vesala, Wilson and Wofsy2001). Given the complexity of these ecosystem processes, deep learning is assumed to greatly enlarge our capabilities to exploit local flux towers and globally available remote sensing data to spatially and temporally extrapolate and understand forest ecosystem process (Jung et al., Reference Jung, Koirala, Weber, Ichii, Gans, Camps-Valls, Papale, Schwalm, Tramontana and Reichstein2019; Reichstein et al., Reference Reichstein, Camps-Valls, Stevens, Jung, Denzler, Carvalhais and Prabhat2019; Camps-Valls et al., Reference Camps-Valls, Tuia, Zhu and Reichstein2021; ElGhawi et al., Reference ElGhawi, Kraft, Reimers, Reichstein, Körner, Gentine and WinklerWinkler2023).

2.2. Forest monitoring challenges

Forests are complex ecosystems dominated by trees. As living organisms, trees are affected by various abiotic and biotic factors, which influence their remotely sensed signal via their foliage properties and crown architecture (Kulawardhana, Reference Kulawardhana2011). Machine learning researchers wishing to develop algorithms to monitor forests using remote sensing data must be aware of these sources of biological variation and their origin. Because some of this biological variation is largely unpredictable but potentially clustered in space and time (e.g., insect outbreaks affecting tree health and random genetic variation within tree species populations), it can be seen as a challenge as it might lead to systematic errors for prediction tasks. On the other hand, part of this variation is deterministic (e.g., changes in leaf color and other phenological changes driven by seasonal fluctuations that occur every year) and could be leveraged to improve model performances (Cloutier et al., Reference Cloutier, Germain and Laliberté2023). Another major, pervasive challenge in forest monitoring are the difficulties associated with the acquisition of ground data to train or validate machine learning models using remote sensing data. Below we summarize these primary challenges.

2.2.1. Tree species

There are an estimated 73,000 tree species on Earth (Cazzolla Gatti et al., Reference Cazzolla Gatti, Reich, Gamarra, Crowther, Hui, Morera, Bastin, de Miguel, Nabuurs, Svenning, Serra-Diaz, Merow, Enquist, Kamenetsky, Lee, Zhu, Fang, Jacobs, Pijanowski, Banerjee, Giaquinto, Alberti, Almeyda Zambrano, Alvarez-Davila, Araujo-Murakami, Avitabile, Aymard, Balazy, Baraloto, Barroso, Bastian, Birnbaum, Bitariho, Bogaert, Bongers, Bouriaud, Brancalion, Brearley, Broadbent, Bussotti, Castro Da Silva, César, Češljar, Chama Moscoso, Chen, Cienciala, Clark, Coomes, Dayanandan, Decuyper, Dee, Del Aguila Pasquel, Derroire, Djuikouo, Van Do, Dolezal, Đorđević, Engel, Fayle, Feldpausch, Fridman, Harris, Hemp, Hengeveld, Herault, Herold, Ibanez, Jagodzinski, Jaroszewicz, Jeffery, Johannsen, Jucker, Kangur, Karminov, Kartawinata, Kennard, Kepfer-Rojas, Keppel, Khan, Khare, Kileen, Kim, Korjus, Kumar, Kumar, Laarmann, Labrière, Lang, Lewis, Lukina, Maitner, Malhi, Marshall, Martynenko, Monteagudo Mendoza, Ontikov, Ortiz-Malavasi, Pallqui Camacho, Paquette, Park, Parthasarathy, Peri, Petronelli, Pfautsch, Phillips, Picard, Piotto, Poorter, Poulsen, Pretzsch, Ramírez-Angulo, Restrepo Correa, Rodeghiero, Rojas Gonzáles, Rolim, Rovero, Rutishauser, Saikia, Salas-Eljatib, Schepaschenko, Scherer-Lorenzen, Šebeň, Silveira, Slik, Sonké, Souza, Stereńczak, Svoboda, Taedoumg, Tchebakova, Terborgh, Tikhonova, Torres-Lezama, Van Der Plas, Vásquez, Viana, Vibrans, Vilanova, Vos, Wang, Westerlund, White, Wiser, Zawiła-Niedźwiecki, Zemagho, Zhu, Zo-Bi and Liang2022), the majority of which are found in the tropics. While tree species show many similarities (e.g., the presence of woody stems and branches), every tree species differs from one another in their chemical and structural make-up and how they will reflect solar radiation (Asner et al., Reference Asner, Martin, Carranza-Jiménez, Sinca, Tupayachi, Anderson and Martinez2014). For example, tree foliage of different species comes in various shades of green that reflect the concentrations of pigments (e.g., chlorophylls and carotenoids) (Gates et al., Reference Gates, Keegan, Schleter and Weidner1965). Likewise, tree species differ from each other in their leaf form crown structure (Verbeeck et al., Reference Verbeeck, Bauters, Jackson, Shenkin, Disney and Calders2019), which will affect the remotely sensed signal. Such foliar biophysical and crown structural variation among tree species is the result of millions of years of evolution and of adaptations to various environmental conditions (Meireles et al., Reference Meireles, Cavender-Bares, Townsend, Ustin, Gamon, Schweiger, Schaepman, Asner, Martin, Singh, Schrodt, Chlus and O’Meara2020).

From a machine learning perspective, the biggest challenge associated with tree species diversity is that models trained on data from a given set of tree species might transfer poorly to other regions that host different species. However, the phylogeny and evolutionary distances of tree species are fairly well known (Zanne et al., Reference Zanne, Tank, Cornwell, Eastman, Smith, FitzJohn, McGlinn, O’Meara, Moles, Reich, Royer, Soltis, Stevens, Westoby, Wright, Aarssen, Bertin, Calaminus, Govaerts, Hemmings, Leishman, Oleksyn, Soltis, Swenson, Warman and Beaulieu2014), and tree species that are closer phylogenetically tend to be more similar in their traits (Ackerly, Reference Ackerly2009). As such, phylogenetic correlations and distances among tree species can potentially be leveraged to improve model transferability. Another interconnected challenge, elaborated upon in the following sections, pertains to the dynamic nature of tree species in relation to their leaf biophysical and crown structural characteristics. Instead, each individual differs according to their abiotic (e.g., microclimate and soil) and biotic environment (competition and herbivory), and as such, the expression of foliage and crown properties can overlap between species (Fassnacht et al., Reference Fassnacht, Latifi, Stereńczak, Modzelewska, Lefsky, Waser, Straub and Ghosh2016).

2.2.2. Seasons and phenology

Trees are sessile organisms, but they still respond dynamically to fluctuating seasons. In some cases, phenological properties, such as leaf onset, flowers, or seeds, might be of direct interest to monitor ecological phenomena or species (Wagner, Reference Wagner2021), in which case high-frequency multitemporal imagery might be required. Indeed, phenological changes among species, for example, changes in leaf color during autumn senescence, can help to distinguish tree species based on color, which can be used to improve species classification models (Cloutier et al., Reference Cloutier, Germain and Laliberté2023). However, phenological properties may also hinder the transferability of models across time (Kattenborn et al., Reference Kattenborn, Schiefer, Frey, Feilhauer, Mahecha and Dormann2022b). For instance, the information learnt by a machine learning model using data acquired in summer may not transfer to the same location in fall as trees may have changed in their leaf biochemical properties or the fraction of flowers and seeds in the canopy (Schiefer et al., Reference Schiefer, Schmidtlein and Kattenborn2021). In such cases, the temporal representativeness of data on individual tree species can be key (Kattenborn et al., Reference Kattenborn, Schiefer, Frey, Feilhauer, Mahecha and Dormann2022b).

2.2.3. Forest dynamics

The structure and composition of forests is strongly influenced by abiotic factors such as climate, geology, and soils, as well as water availability. For example, declining temperatures and/or growing season lengths with increasing latitude and/or elevation can filter out tree species that cannot tolerate low temperatures (e.g., low frost resistance) or that do not have enough time to produce mature tissue once the growing season becomes too short (Körner et al., Reference Körner, Basler, Hoch, Kollas, Lenz, Randin, Vitasse and Zimmermann2016). In addition, changes in soil nutrient availability driven by geomorphological processes can influence forest canopy biochemistry (Chadwick and Asner, Reference Chadwick and Asner2018). Water supply is also important: too much water favors trees that can tolerate waterlogging, whereas too little water favors trees that can resist or recover from xylem cavitation (Choat et al., Reference Choat, Jansen, Brodribb, Cochard, Delzon, Bhaskar, Bucci, Feild, Gleason, Hacke, Jacobsen, Lens, Maherali, Martínez-Vilalta, Mayr, Mencuccini, Mitchell, Nardini, Pittermann, Pratt, Sperry, Westoby, Wright and Zanne2012). Much of these environmental influences on forest composition express themselves via tree species turnover, that is, changes in tree species composition across these spatial environmental gradients or discontinuities. However, changes in forest composition and structure can also arise through intraspecific variation. Applications of machine learning methods to forest monitoring should integrate these sources of variation. In particular, incorporating environmental drivers of forest composition and structure as model inputs may help to transfer forest monitoring models from one region to the other.

Tree monitoring can also be affected by biotic factors—that is, by other organisms. First, pests and pathogens can impact tree health, foliage chemistry, and/or water content, which in turn can affect the remotely sensed signal of forest canopies (Sapes et al., Reference Sapes, Lapadat, Schweiger, Juzwik, Montgomery, Gholizadeh, Townsend, Gamon and Cavender-Bares2022). The health status of trees is often directly expressed via their foliage properties and crown architecture and therefore can cause a large variability in remote sensing signals (Zarco-Tejada et al., Reference Zarco-Tejada, Camino, Beck, Calderon, Hornero, Hernández-Clemente, Kattenborn, Montes-Borrego, Susca, Morelli, Gonzalez-Dugo, PRJ, Landa, Boscia, Saponari and Navas-Cortes2018; Kattenborn et al., Reference Kattenborn, Richter, Guimarães-Steinicke, Feilhauer and Wirth2022a). In addition, the remotely sensed signals of trees can also be “overshadowed” by other organisms that live in their crowns (epiphytes), particularly in tropical environments (Baldeck et al., Reference Baldeck, Asner, Martin, Anderson, Knapp, Kellner and Wright2015). Prominent examples are lianas or mistletoes.

Forest management activities as part of forest dynamics, such as harvesting, thinning, and pruning, can challenge the accurate mapping of forest attributes with remote sensing, as this crucial information is often unavailable but significantly impacts forest structure and composition. This lack of data therefore introduces uncertainty into remote sensing analyses and models.

2.2.4. Data collection

As previously mentioned, forests can exhibit significant diversity in terms of their composition and structure across different locations and time periods due to a variety of factors. This heterogeneity poses a particular difficulty in creating machine learning models for forest monitoring. Models developed for one region may not easily generalize to other regions that lie beyond the scope of the training data distribution. One solution for this issue would involve training these models using extensive datasets that encompass the complete spectrum of conditions present in diverse forest environments. Forest remote sensing data worldwide, especially acquired from sensors on satellites, are abundant and generally easily obtainable. In sharp contrast, there is a scarcity of ground-based data, including labels or annotations.

In contrast to other disciplines, annotating remote sensing data in the context of vegetation is often time-consuming, costly, and complex. This phenomenon arises due to the fact that vegetation of various species or conditions frequently exhibits striking similarities, often referred to as “greenery.” Moreover, vegetation communities often show smooth transitions across species or growth forms along environmental gradients. This aspect adds another layer of complexity to the task of distinguishing between individual plants, species, or growth forms (Kattenborn et al., Reference Kattenborn, Leitloff, Schiefer and Hinz2021). Often, field inventories become essential to validate annotations, such as the identification of tree species (Kattenborn et al., Reference Kattenborn, Eichel, Wiser, Burrows, Fassnacht and Schmidtlein2020; Cloutier et al., Reference Cloutier, Germain and Laliberté2023), or when the properties of interest, such as stem diameters, cannot be directly extracted from remote sensing data and require on-site measurements conducted by human observers. Gathering such field data is typically a time-intensive, expensive, and gradual process, leading to significant constraints on its accessibility. Field data such as stem diameters are very important to estimate aboveground tree biomass because most published allometric equations use stem diameter as its primary predictor (Gonzalez-Akre et al., Reference Gonzalez-Akre, Piponiot, Lepore, Herrmann, Lutz, Baltzer, Dick, Gilbert, He, Heym, Huerta, Jansen, Johnson, Knapp, Kral, Lin, Malhi, McMahon, Myers, Orwig, Rodriguez-Hernandez, Russo, Shue, Wang, Wolf, Yang, Davies and Anderson-Teixeira2022). Moreover, spatial coordinates frequently serve as the sole means of connecting field data to remote sensing data. However, GPS or GNSS geolocation in forest settings often introduces substantial uncertainties (ranging from meters to tens of meters), thereby posing challenges in accurately establishing a posteriori links between field observations and remote sensing data (Kattenborn et al., Reference Kattenborn, Leitloff, Schiefer and Hinz2021).

Therefore, integrating various datasets is a strategy aimed at addressing the scarcity of annotated data, cutting down annotation expenses, and lessening the dependency on field-based ground truthing. Nonetheless, this may result heterogenous datasets: In numerous cases, annotations vary (such as boxes, polygons, and points), as well as their quality, across different applications. Annotations are frequently customized to match particular remote sensing data characteristics, such as spatial resolution. Hence, directly merging labels from different datasets is often not feasible, or at the very least, alternative approaches are necessary. One such approach is weakly supervised learning, where the potential lack of label quality is counteracted by leveraging a substantial quantity of data (see Section 3.2.2).

The key takeaway from this section is that the development of machine learning models for forest monitoring will consistently involve a substantial surplus of unlabeled remote sensing data in comparison to labeled ground-truth data. This disparity arises due to the inherent challenges in obtaining labeled data. This scenario is not exclusive to forest monitoring; rather, it is a prevalent aspect shared with other geospatial applications using remote sensing data (Rahaman et al., Reference Rahaman, Weiss, Träuble, Locatello, Lacoste, Bengio, Pal, Li and Schölkopf2022; Mai et al., Reference Mai, Huang, Sun, Song, Mishra, Liu, Gao, Liu, Cong, Hu, Cundy, Li, Zhu and Lao2023a) (see Section 3.1.2). This has two main implications for machine learning research aimed at improving forest monitoring. First, there is a need to develop machine learning methods to forest monitoring that that can effectively utilize limited labeled data. One approach involves leveraging self-supervised learning techniques to extract valuable representations from the available data (see Section 3.2.1). Second, there exists a necessity for novel machine learning strategies, including active learning or alternative forms of model-assisted labeling. These approaches aim to expedite the process of label collection by human observers and reduce associated costs (see Section 3.2.3).

3. Machine learning perspectives and challenges

Machine learning algorithms in computer vision have gained significant capabilities over the past decade, for example, in image classification (Krizhevsky et al., Reference Krizhevsky, Sutskever and Hinton2012; Simonyan and Zisserman, Reference Simonyan and Zisserman2015; Szegedy et al., Reference Szegedy, Liu, Jia, Sermanet, Reed, Anguelov, Erhan, Vanhoucke and Rabinovich2015, Reference Szegedy, Vanhoucke, Ioffe, Shlens and Wojna2016, Reference Szegedy, Ioffe, Vanhoucke and Alemi2017; He et al., Reference He, Zhang, Ren and Sun2016; Hu et al., Reference Hu, Shen and Sun2018; Dosovitskiy et al., Reference Dosovitskiy, Beyer, Kolesnikov, Weissenborn, Zhai, Unterthiner, Dehghani, Minderer, Heigold, Gelly, Uszkoreit and Houlsby2021; Liu et al., Reference Liu, Lin, Cao, Hu, Wei, Zhang, Lin and Guo2021; Touvron et al., Reference Touvron, Cord, Douze, Massa, Sablayrolles and Jégou2021), object detection (Ren et al., Reference Ren, He, Girshick and Sun2015; Liu et al., Reference Liu, Anguelov, Erhan, Szegedy, Reed, Fu and Berg2016; Redmon et al., Reference Redmon, Divvala, Girshick and Farhadi2016; He et al., Reference He, Gkioxari, Dollar and Girshick2017; Redmon and Farhadi, Reference Redmon and Farhadi2018; Li et al., Reference Li, Zhang, Xu, Liu, Zhang, Ni and Shum2023a), and segmentation (Long et al., Reference Long, Shelhamer and Darrell2015; Ronneberger et al., Reference Ronneberger, Fischer and Brox2015; He et al., Reference He, Gkioxari, Dollar and Girshick2017; Lin et al., Reference Lin, Goyal, Girshick, He and Dollár2017; Chen et al., Reference Chen, Zhu, Papandreou, Schroff and Adam2018; Cheng et al., Reference Cheng, Misra, Schwing, Kirillov and Girdhar2022; Kirillov et al., Reference Kirillov, Mintun, Ravi, Mao, Rolland, Gustafson, Xiao, Whitehead, Berg, Lo, Dollár and Girshick2023). While many successful algorithmic paradigms have been established, different applications differ widely across sensory modalities and domain-specific constraints, necessitating the adaptation of algorithms to fit specific needs. For instance, detecting, localizing, and segmenting objects in a scene have been explored for LiDAR point cloud (Yang et al., Reference Yang, Luo and Urtasun2018), automotive RADAR (Ouaknine et al., Reference Ouaknine, Newson, Pérez, Tupin and Rebut2021a), and medical echocardiography (Leclerc et al., Reference Leclerc, Smistad, Pedrosa, Ostvik, Cervenansky, Espinosa, Espeland, Berg, Jodoin, Grenier, Lartizien, Dhooge, Lovstakken and and Bernard2019).

Machine learning algorithms for remote sensing have been the subject of extensive innovation and application (Ma et al., Reference Ma, Liu, Zhang, Ye, Yin and Johnson2019; Camps-Valls et al., Reference Camps-Valls, Tuia, Zhu and Reichstein2021) for problems involving classification (Maxwell et al., Reference Maxwell, Warner and Fang2018; Cheng et al., Reference Cheng, Xie, Han, Guo and Xia2020), object detection (Cheng and Han, Reference Cheng and Han2016; Li et al., Reference Li, Wan, Cheng, Meng and Han2020), and segmentation (Hoeser and Kuenzer, Reference Hoeser and Kuenzer2020; Yuan et al., Reference Yuan, Shi and Gu2021). In recent times, there has been a growing exploration of such techniques for forest monitoring purposes, aiming to enhance our understanding of forest composition, with a specific focus on tree species mapping (see Sections 2.2.1 and 2.1.2), that is, tree classification, tree detection, and tree segmentation (Fassnacht et al., Reference Fassnacht, Latifi, Stereńczak, Modzelewska, Lefsky, Waser, Straub and Ghosh2016; Diez et al., Reference Diez, Kentsch, Fukuda, Caceres, Moritake and Cabezas2021; Kattenborn et al., Reference Kattenborn, Leitloff, Schiefer and Hinz2021; Michalowska and Rapinski, Reference Michalowska and Rapinski2021). These tasks are accomplished using modalities from diverse sensors to gather complementary information.

Nevertheless, the study of machine learning for forest monitoring has not received as much attention as autonomous driving or medical imagery, despite the importance of forest conservation, restoration, and management as natural solutions to the joint climate and biodiversity crises (IPCC, 2023). Consequently, there are numerous unexplored machine learning challenges that need to be addressed in order to tackle climate change (Rolnick et al., Reference Rolnick, Donti, Kaack, Kochanski, Lacoste, Sankaran, Ross, Milojevic-Dupont, Jaques, Waldman-Brown, Luccioni, Maharaj, Sherwin, Mukkavilli, Kording, Gomes, Ng, Hassabis, Platt, Creutzig, Chayes and Bengio2023), including improving forest monitoring practices. Can the challenges encountered in adapting machine learning for forest monitoring be beneficial in exploring the challenges in the field of biology and ecology? This section will outline the current challenges in machine learning linked to forest monitoring, as described in Figure 1, and discuss the diverse strategies employed to tackle them.

3.1. Generalization

Generalization in machine learning refers to the ability of an algorithm to continue to perform well when evaluated on data different from that it was trained on (Zhang et al., Reference Zhang, Bengio, Hardt, Recht and Vinyals2017). One may speak of both in-distribution generalization (performance on data relatively similar to training data) and out-of-distribution generalization (performance on highly different data). Out-of-distribution generalization can be especially relevant to forest monitoring, since, as mentioned in Section 4, forest datasets have a wide range of variations in terms of geographical locations, species composition, sensors and scale (see Figure 2). Such variations introduce distinct data distribution shifts that need to be considered and addressed in forest monitoring tasks. For example, the effects of geographic variability of data have been examined in the context of tree species distributions (Dormann et al., Reference Dormann, McPherson, Araujo, Bivand, Bolliger, Carl, Davies, Hirzel, Jetz, Daniel Kissling, Kuhn, Ohlemuller, Peres-Neto, Reineking, Schroder, Schurr and Wilson2007) and biomass estimation (Ploton et al., Reference Ploton, Mortier, Réjou-Méchain, Barbier, Picard, Rossi, Dormann, Cornu, Viennois, Bayol, Lyapustin, Gourlet-Fleury and Pélissier2020). Simple algorithmic approaches to improve generalization include various forms of regularization (Zou and Hastie, Reference Zou and Hastie2005), data augmentation (Shorten and Khoshgoftaar, Reference Shorten and Khoshgoftaar2019), dropout (Srivastava et al., Reference Srivastava, Hinton, Krizhevsky, Sutskever and Salakhutdinov2014), and batch normalization (Ioffe and Szegedy, Reference Ioffe and Szegedy2015), while improving the breadth of training data, where possible, is also almost always beneficial in practice. However, generalization remains a very active field of research in machine learning. We consider two areas of work that may be of especial interest in forest monitoring.

3.1.1. Domain adaptation

Transfer learning refers to transferring information learnt by a model on one set of problems to different set of problems (Weiss et al., Reference Weiss, Khoshgoftaar and Wang2016). For example, one may pretrain a model on a large, commonly used dataset and then fine-tune it on a smaller dataset representing the specific problem in question. Transfer learning can boost generalization when there is a significant distribution shift between training and inference (Csurka, Reference Csurka2017). One particularly notable approach to transfer learning is domain adaptation, where a model must be applied to target domains that are unknown or lacking labeled data (Soltani et al., Reference Soltani, Feilhauer, Duker and Kattenborn2022). Some domain adaptation approaches have already been applied in plant identification (Ganin and Lempitsky, Reference Ganin and Lempitsky2015). Autonomous driving has witnessed significant exploration in the realm of unsupervised domain adaptation (UDA), where a model is trained on labeled data from the source domain and unlabeled data from the target domain, with the objective of improving its performance specifically on the target domain. It has been explored in the context of unlabeled or unseen source or target domains (Wilson and Cook, Reference Wilson and Cook2020) using generative (Hoffman et al., Reference Hoffman, Tzeng, Park, Zhu, Isola, Saenko, Efros and Darrell2018) or adversarial (Vu et al., Reference Vu, Jain, Bucher, Cord and Pérez2019) methods. The UDA framework has also been explored for cross-modal learning considering domains from different sensor modalities (Jaritz et al., Reference Jaritz, Vu, de Charette, Wirbel and Pérez2020). Within the context of forest monitoring, this framework could prove particularly valuable for adapting a model from one forest to another, regardless of whether they belong to the same biome or not, to identify similar species across both the source and target domains (see Sections 2.1.2 and 2.2.1). Additionally, this framework would be beneficial for adapting the model to address distribution shifts that occur between tree signature distributions (see Sections 2.1.4, 2.2.2, and 2.2.3) as well as between different sensors (see Section 2.2.4). Domain adaptation has been investigated in the field of remote sensing mostly in the context of extrapolation across time or geographical region, including approaches for both aerial and satellite data (Shi et al., Reference Shi, Du, Guo and Du2022; Wang et al., Reference Wang, Feng, Sun, Zhang, Zhang, Yang and Meng2022; Arnaudo et al., Reference Arnaudo, Tavera, Masone, Dominici and Caputo2023; Ma et al., Reference Ma, Zhang, Wang and Pun2023; Xu et al., Reference Xu, Shi, Yuan and Zhu2023). Such work holds potential for training generalizable algorithms for forest monitoring, such as adapting models from PhenoCams to satellite images (Kosmala et al., Reference Kosmala, Hufkens and Richardson2018).

3.1.2. Foundation models

Foundation models are models that can operate on diverse sets of input modalities, scales, data regimes, and downstream tasks. They refer to large-scale multimodal and multitask models, which have opened up research in generalization capacities such as increasing performances in applications unseen during training (Bommasani et al., Reference Bommasani, Hudson, Adeli, Altman, Arora, von Arx, Bernstein, Bohg, Bosselut, Brunskill, Brynjolfsson, Buch, Card, Castellon, Chatterji, Chen, Creel, Davis, Demszky, Donahue, Doumbouya, Durmus, Ermon, Etchemendy, Ethayarajh, Fei-Fei, Finn, Gale, Gillespie, Goel, Goodman, Grossman, Guha, Hashimoto, Henderson, Hewitt, Ho, Hong, Hsu, Huang, Icard, Jain, Jurafsky, Kalluri, Karamcheti, Keeling, Khani, Khattab, Koh, Krass, Krishna, Kuditipudi, Kumar, Ladhak, Lee, Lee, Leskovec, Levent, Li, Li, Ma, Malik, Manning, Mirchandani, Mitchell, Munyikwa, Nair, Narayan, Narayanan, Newman, Nie, Niebles, Nilforoshan, Nyarko, Ogut, Orr, Papadimitriou, Park, Piech, Portelance, Potts, Raghunathan, Reich, Ren, Rong, Roohani, Ruiz, Ryan, R’e, Sadigh, Sagawa, Santhanam, Shih, Srinivasan, Tamkin, Taori, Thomas, Tramèr, Wang, Wang, Wu, Wu, Wu, Xie, Yasunaga, You, Zaharia, Zhang, Zhang, Zhang, Zhang, Zheng, Zhou and Liang2021). Most of the discussed machine learning strategies can be further explored by training foundation models with diverse datasets, thereby enhancing their generalization capabilities. Forest datasets encompass a wide range of scales, ranging from field measurements to estimated world maps (refer to Figure 2), as well as varying resolutions and modalities for different tasks (as outlined in Section 4). The data diversity necessitates the utilization of generalized deep learning architectures. Influenced by the success of large language models (Devlin et al., Reference Devlin, Chang, Lee and Toutanova2019; Radford et al., Reference Radford, Wu, Child, Luan, Amodei and Sutskever2019, Reference Radford, Kim, Hallacy, Ramesh, Goh, Agarwal, Sastry, Askell, Mishkin, Clark, Krueger and Sutskever2021; Brown et al., Reference Brown, Mann, Ryder, Subbiah, Kaplan, Dhariwal, Neelakantan, Shyam, Sastry, Askell, Agarwal, Herbert-Voss, Krueger, Henighan, Child, Ramesh, Ziegler, Wu, Winter, Hesse, Chen, Sigler, Litwin, Gray, Chess, Clark, Berner, McCandlish, Radford, Sutskever and Amodei2020; Chowdhery et al., Reference Chowdhery, Narang, Devlin, Bosma, Mishra, Roberts, Barham, Chung, Sutton, Gehrmann, Schuh, Shi, Tsvyashchenko, Maynez, Rao, Barnes, Tay, Shazeer, Prabhakaran, Reif, Du, Hutchinson, Pope, Bradbury, Austin, Isard, Gur-Ari, Yin, Duke, Levskaya, Ghemawat, Dev, Michalewski, Garcia, Misra, Robinson, Fedus, Zhou, Ippolito, Luan, Lim, Zoph, Spiridonov, Sepassi, Dohan, Agrawal, Omernick, Dai, Pillai, Pellat, Lewkowycz, Moreira, Child, Polozov, Lee, Zhou, Wang, Saeta, Diaz, Firat, Catasta, Wei, Meier-Hellstern, Eck, Dean, Petrov and Fiedel2022; Hoffmann et al., Reference Hoffmann, Borgeaud, Mensch, Buchatskaya, Cai, Rutherford, Casas, Hendricks, Welbl, Clark, Hennigan, Noland, Millican, Driessche, Damoc, Guy, Osindero, Simonyan, Elsen, Rae, Vinyals and Sifre2022; Driess et al., Reference Driess, Xia, Sajjadi, Lynch, Chowdhery, Ichter, Wahid, Tompson, Vuong, Yu, Huang, Chebotar, Sermanet, Duckworth, Levine, Vanhoucke, Hausman, Toussaint, Greff, Zeng, Mordatch and Florence2023; Touvron et al., Reference Touvron, Lavril, Izacard, Martinet, Lachaux, Lacroix, Rozière, Goyal, Hambro, Azhar, Rodriguez, Joulin, Grave and Lample2023), recent advancements in computer vision have led to the development of models that incorporate multiple modalities and can perform multiple tasks simultaneously. In recent studies, researchers have explored the concept of multitask vision by utilizing natural images (Cheng et al., Reference Cheng, Schwing and Kirillov2021, Reference Cheng, Misra, Schwing, Kirillov and Girdhar2022; Kirillov et al., Reference Kirillov, Mintun, Ravi, Mao, Rolland, Gustafson, Xiao, Whitehead, Berg, Lo, Dollár and Girshick2023; Li et al., Reference Li, Zhang, Xu, Liu, Zhang, Ni and Shum2023a) or by employing text to enhance performance in vision-based tasks (Dancette and Cord, Reference Dancette and Cord2022; Xu et al., Reference Xu, De Mello, Liu, Byeon, Breuel, Kautz and Wang2022; Jain et al., Reference Jain, Li, Chiu, Hassani, Orlov and Shi2023a). In the realm of integrating image and text for performing tasks on both modalities, alternative approaches have been developed to improve performances by benefiting from their combination (Zhu et al., Reference Zhu, Zhu, Li, Wu, Wang, Li, Wang and Dai2022; Li et al., Reference Li, Zhu, Jiang, Zhu, Li, Yuan, Wang, Qiao, Wang, Wang and Dai2023b). Additionally, generalist models have been constructed to be agnostic to specific modalities and tasks (Jaegle et al., Reference Jaegle, Gimeno, Brock, Zisserman, Vinyals and Carreira2021, Reference Jaegle, Borgeaud, Alayrac, Doersch, Ionescu, Ding, Koppula, Zoran, Brock, Shelhamer, Hénaff, Botvinick, Zisserman, Vinyals and Carreira2022), enabling them to handle diverse modalities and tasks with a unified approach. In the field of computer vision, for example, the segment anything model (Kirillov et al., Reference Kirillov, Mintun, Ravi, Mao, Rolland, Gustafson, Xiao, Whitehead, Berg, Lo, Dollár and Girshick2023) has demonstrated the capability to perform instance segmentation by leveraging prompts in conjunction with input images. These architecture frameworks hold a significant value for forest monitoring tasks, enabling the detection, segmentation, and estimation of tree properties over large geographical areas, such as their canopy surface or their aboveground biomass (Tolan et al., Reference Tolan, Yang, Nosarzewski, Couairon, Vo, Brandt, Spore, Majumdar, Haziza, Vamaraju, Moutakani, Bojanowski, Johns, White, Tiecke and Couprie2023; Tucker et al., Reference Tucker, Brandt, Hiernaux, Kariryaa, Rasmussen, Small, Igel, Reiner, Melocik, Meyer, Sinno, Romero, Glennie, Fitts, Morin, Pinzon, McClain, Morin, Porter, Loeffler, Kergoat, Issoufou, Savadogo, Wigneron, Poulter, Ciais, Kaufmann, Myneni, Saatchi and Fensholt2023).

The utilization of foundation models with remote sensing data is still in its infancy. However, promising advances have been made in developing multimodal architectures (Zhang et al., Reference Zhang, Ming, Feng, Liu, He and Zhao2023) and temporal-based approaches (Garnot et al., Reference Garnot, Landrieu and Chehata2021; Garnot and Landrieu, Reference Garnot and Landrieu2021; Tarasiou et al., Reference Tarasiou, Chavez and Zafeiriou2023) specifically tailored for precise tasks in remote sensing applications. Based on the masked autoencoder pretraining method (He et al., Reference He, Chen, Xie, Li, Dollár and Girshick2022), multimodal and multitask architectures have been developed for Earth observation applications, particularly for land use and land cover (LULC) estimation (Cong et al., Reference Cong, Khanna, Meng, Liu, Rozi, He, Burke, Lobell and Ermon2022; Reed et al., Reference Reed, Gupta, Li, Brockman, Funk, Clipp, Funk, Candido, Uyttendaele and Darrell2022; Sun et al., Reference Sun, Wang, Lu, Zhu, Lu, He, Li, Rong, Yang, Chang, He, Yang, Wang, Lu and Fu2022; Tseng et al., Reference Tseng, Zvonkov, Purohit, Rolnick and Kerner2023). These architectures address the challenges posed by data collected from sensors that record diverse physical measurements, such as multispectral or SAR data in remote sensing (Reed et al., Reference Reed, Gupta, Li, Brockman, Funk, Clipp, Funk, Candido, Uyttendaele and Darrell2022; Pan et al., Reference Pan, Gao, Dong and Du2023; Yamazaki et al., Reference Yamazaki, Hanyu, Tran, Garcia, Tran, McCann, Liao, Rainwater, Adkins, Molthan, Cothren and Le2023), as well as in natural images (Themyr et al., Reference Themyr, Rambour, Thome, Collins and Hostettler2023). While different spectral, spatial, and temporal resolutions have been considered in previous works, there remains a lack of exploration regarding the resolution gap between datasets captured by aerial and satellite sensors. The integration of multimodal, multitask, and multiscale architectures is expected to significantly enhance the generalization capabilities of models for forest monitoring tasks at global scale (see Section 2.2.4). By training these algorithms with various types of datasets and specializing them for forest monitoring tasks, they could effectively deliver improved performance across different geographical regions such as for species cover estimation or aboveground biomass estimation (see Section 2.1.3).

3.2. Learning from limited data

There are a growing number of massive datasets and algorithms leveraging them, including across remote sensing (Sumbul et al., Reference Sumbul, de Wall, Kreuziger, Marcelino, Costa, Benevides, Caetano, Demir and Markl2021; Bastani et al., Reference Bastani, Wolters, Gupta, Ferdinando and Kembhavi2023; Rahaman et al., Reference Rahaman, Weiss, Träuble, Locatello, Lacoste, Bengio, Pal, Li and Schölkopf2022; Mai et al., Reference Mai, Huang, Sun, Song, Mishra, Liu, Gao, Liu, Cong, Hu, Cundy, Li, Zhu and Lao2023a). However, many of the most powerful machine learning approaches are supervised and, therefore, require labels, which can be challenging, time-consuming, and costly to obtain. There has been considerable attention given to the problem of learning from limited labeled data; we here present several families of approaches and their relevance to forest monitoring.

3.2.1. Self-supervised learning

Situated in-between supervised and unsupervised learning, the self-supervised learning paradigm involves training a model to reconstruct certain known relationships between or within the datapoints themselves. The resulting model can then be fine-tuned with actual labeled data or directly applied to solve the downstream task. Self-supervised approaches in computer vision include discriminative approaches that distinguish between positive and negative samples while separating their representation (e.g., contrastive learning) (Gidaris et al., Reference Gidaris, Singh and Komodakis2018; Chen et al., Reference Chen, Kornblith, Norouzi and Hinton2020; He et al., Reference He, Fan, Wu, Xie and Girshick2020; Caron et al., Reference Caron, Touvron, Misra, Jégou, Mairal, Bojanowski and Joulin2021; Oquab et al., Reference Oquab, Darcet, Moutakanni, Vo, Szafraniec, Khalidov, Fernandez, Haziza, Massa, El-Nouby, Assran, Ballas, Galuba, Howes, Huang, Li, Misra, Rabbat, Sharma, Synnaeve, Xu, Jegou, Mairal, Labatut, Joulin and Bojanowski2023) and generative approaches that learn representations through reconstruction (Lehtinen et al., Reference Lehtinen, Munkberg, Hasselgren, Laine, Karras, Aittala and Aila2018; He et al., Reference He, Chen, Xie, Li, Dollár and Girshick2022). The utilization of self-supervised learning in remote sensing has experienced significant growth due to the abundance of unlabeled open-access data (Tao et al., Reference Tao, Qi, Guo, Zhu and Li2023). For instance, geolocation of satellite images has been exploited with a contrastive approach (Ayush et al., Reference Ayush, Uzkent, Meng, Tanmay, Burke, Lobell and Ermon2021; Mai et al., Reference Mai, Lao, He, Song and Ermon2023b). Multispectral and SAR data have been reconstructed based on the temporal information (Cong et al., Reference Cong, Khanna, Meng, Liu, Rozi, He, Burke, Lobell and Ermon2022; Yadav et al., Reference Yadav, Nascetti, Azizpour and Ban2022), for multiscale reconstruction (Reed et al., Reference Reed, Gupta, Li, Brockman, Funk, Clipp, Funk, Candido, Uyttendaele and Darrell2022) and for denoising (Dalsasso et al., Reference Dalsasso, Denis and Tupin2021, Reference Dalsasso, Denis and Tupin2022; Meraoumia et al., Reference Meraoumia, Dalsasso, Denis, Abergel and Tupin2023). Emerging cross-modal approaches, encompassing both discriminative (Jain et al., Reference Jain, Wilson and Gulshan2022) and generative (Jain et al., Reference Jain, Schoen-Phelan and Ross2023b) techniques, are being developed to harness the complementary nature of aligned samples. Self-supervised learning will greatly unleash the potential of remote sensing data in the area of forests (Ge et al., Reference Ge, Gu, Su, Lönnqvist and Antropov2023) by learning textural and geometrical structures of forests and trees without labels (see Sections 2.2.1 and 2.2.4).

3.2.2. Weakly supervised learning

Obtaining precise and detailed annotations, for example, for tree crown instance segmentation, can be both costly and time-consuming. Although self-supervised learning aims to learn representations from pretext tasks, it still necessitates precise annotations for fine-tuning the model in a downstream task. In cases where precise annotations are not available, coarse-grained and potentially inaccurate annotations, or even single point locations, can be utilized as weak labels in weakly supervised learning approaches (Zhou, Reference Zhou2018). Given their cost-effectiveness and efficiency, computer vision methods have been developed to leverage weak annotations while addressing their inherent inaccuracies (Zhou, Reference Zhou2018). Weakly supervised learning has been explored in the realm of object location (Oquab et al., Reference Oquab, Bottou, Laptev and Sivic2015), object relationship estimation (Peyre et al., Reference Peyre, Sivic, Laptev and Schmid2017), instance segmentation (Ahn et al., Reference Ahn, Cho and Kwak2019), and contrastive learning (Zheng et al., Reference Zheng, Wang, You, Qian, Zhang, Wang and Xu2021). Obtaining high-quality annotations for remote sensing data is particularly difficult due to their poor spatial resolution or the physics of the sensors used. Weakly supervised learning has therefore been investigated for Earth observation tasks including object detection (Han et al., Reference Han, Zhang, Cheng, Guo and Ren2015; Zhang et al., Reference Zhang, Han, Cheng, Liu, Bu and Guo2015; Yao et al., Reference Yao, Feng, Han, Cheng and Guo2021), LULC semantic segmentation (Yao et al., Reference Yao, Han, Cheng, Qian and Guo2016; Wang et al., Reference Wang, Chen, Xie, Azzari and Lobell2020b), and plant traits regression (Schiller et al., Reference Schiller, Schmidtlein, Boonman, Moreno-Martínez and Kattenborn2021; Cherif et al., Reference Cherif, Feilhauer, Berger, Dao, Ewald, Hank, He, Kovach, Lu, Townsend and Kattenborn2023). Recently, weakly supervised methods have been investigated in the areas of tree classification (Illarionova et al., Reference Illarionova, Trekin, Ignatiev and Oseledets2021), tree counting (Amirkolaee et al., Reference Amirkolaee, Shi and Mulligan2023), tree detection (Aygunes et al., Reference Aygunes, Cinbis and Aksoy2021), and tree segmentation (Gazzea et al., Reference Gazzea, Kristensen, Pirotti, Ozguven and Arghandeh2022) using multispectral data (see Section 2.2.4).

3.2.3. Active learning

Even highly precise and fine-grained annotations are generally less useful if present in only small quantities. To address this limitation, active learning strategies have been developed to identify and select the optimal way to select a small set of training datapoints to label (Cohn et al., Reference Cohn, Ghahramani and Jordan1996). These strategies for sample selection often rely on estimating the uncertainty of a model (Gal et al., Reference Gal, Islam and Ghahramani2017), for instance, using variational approaches (Sinha et al., Reference Sinha, Ebrahimi and Darrell2019) or estimated with a loss function (Yoo and Kweon, Reference Yoo and Kweon2019). They have demonstrated their effectiveness in scenarios where the amount of labeled data is limited, particularly in the context of image classification (Gal et al., Reference Gal, Islam and Ghahramani2017; Sinha et al., Reference Sinha, Ebrahimi and Darrell2019; Yoo and Kweon, Reference Yoo and Kweon2019), object detection (Roy et al., Reference Roy, Unmesh and Namboodiri2019), and semantic segmentation (Siddiqui et al., Reference Siddiqui, Valentin and Niessner2020). Active learning has also been investigated for remote sensing applications, including classification (Tuia et al., Reference Tuia, Volpi, Copa, Kanevski and Munoz-Mari2011), object detection (Qu et al., Reference Qu, Du, Cao, Guan and Zhao2020), and LULC semantic segmentation with hyperspectral data (Li et al., Reference Li, Bioucas-Dias and Plaza2010; Li et al., Reference Li, Bioucas-Dias and Plaza2011; Zhang et al., Reference Zhang, Pasolli, Crawford and Tilton2016). Its application would be helpful for forest monitoring to optimize and create relevant human annotations (see Section 2.2.4).

3.2.4. Few-shot learning

Yet another approach to limited data availability is few-shot learning, which refers to efficient fine-tuning of a pretrained model using only a few annotated datapoints. Few-shot learning has been approached from different perspectives, considering the comparison between the small annotated dataset and the data used for pretraining the model—for example, by quantifying the similarities between these datasets (Vinyals et al., Reference Vinyals, Blundell, Lillicrap, Kavukcuoglu and Wierstra2016), constructing mixtures of feature embeddings (Snell et al., Reference Snell, Swersky and Zemel2017), or adapting the optimization scheme through meta-learning (Finn et al., Reference Finn, Abbeel and Levine2017). Motivated by the limited availability of annotations, applications of few-shot learning in remote sensing tasks have been investigated. For instance, methods based on feature similarity (Alajaji et al., Reference Alajaji, Alhichri, Ammour and Alajlan2020; Zhang et al., Reference Zhang, Bai, Wang, Bai and Li2020; Alosaimi et al., Reference Alosaimi, Alhichri, Bazi, Ben Youssef and Alajlan2023) and metric learning (Liu et al., Reference Liu, Yu, Yu, Zhang, Wan and Wang2019), aiming at separating representations in an embedding space, have been explored for LULC classification with either multispectral or hyperspectral data. Objects have also been detected by learning meta features (Deng et al., Reference Deng, Li and Fang2022). Metric learning techniques have also been utilized in the context of few-shot learning for semantic segmentation tasks (Jiang et al., Reference Jiang, Zhou and Li2022) or meta learning with multispectral and SAR data (Rußwurm et al., Reference Rußwurm, Wang, Körner and Lobell2020). Few-shot learning has been explored for tree species classification using feature similarity (Chen et al., Reference Chen, Tian, Chai, Zhang and Chen2021) and would be beneficial to recognize a species or estimate the characteristics of a tree with minimal manual annotations (see Sections 2.2.2 and 2.2.4).

3.2.5. Zero-shot learning

The machine learning community has also shown interest in developing methods for training algorithms to differentiate unseen classes without any explicitly annotated samples at all, which is known as zero-shot learning (Xian et al., Reference Xian, Lampert, Schiele and Akata2018). In order to categorize unseen classes, the task of zero-shot learning has been accomplished by projecting image and word embeddings (Socher et al., Reference Socher, Ganjoo, Sridhar, Bastani, Manning and Ng2013) or known semantic attributes (Lampert et al., Reference Lampert, Nickisch and Harmeling2014) into a shared space. Zero-shot learning has also been investigated by incorporating a mixture of embeddings from the source domain before computing similarities with the target domain, which includes the unseen classes (Zhang and Saligrama, Reference Zhang and Saligrama2015). Generative approaches have been developed to create visual feature embeddings of unseen classes from word embeddings for zero-shot semantic segmentation (Bucher et al., Reference Bucher, Vu, Cord and Pérez2019). Zero-shot learning has also garnered attention in remote sensing applications, including combining multispectral data and word embeddings for classification tasks (Li et al., Reference Li, Lu, Wang, Xiang and Wen2017, Reference Li, Kong, Zhang, Tan and Chen2021) and initial exploration of applying zero-shot learning to classify hyperspectral data (Freitas et al., Reference Freitas, Silva and Silva2022). Generative approaches have also been used with remote sensing data to create visual embeddings from word embeddings (Li et al., Reference Li, Zhang, Wang, Lin and Zhang2022b). Zero-shot learning presents a promising approach for forest monitoring, enabling the adaptation of models in regions where previously unseen species are encountered (see Sections 2.2.2 and 2.2.4). By leveraging tree taxonomy hierarchy and meta characteristics to align with visual embeddings (Sumbul et al., Reference Sumbul, Cinbis and Aksoy2018), a vast research potential emerges. Notably, utilizing foundation models that have demonstrated strong zero-shot learning capabilities (Brown et al., Reference Brown, Mann, Ryder, Subbiah, Kaplan, Dhariwal, Neelakantan, Shyam, Sastry, Askell, Agarwal, Herbert-Voss, Krueger, Henighan, Child, Ramesh, Ziegler, Wu, Winter, Hesse, Chen, Sigler, Litwin, Gray, Chess, Clark, Berner, McCandlish, Radford, Sutskever and Amodei2020; Radford et al., Reference Radford, Kim, Hallacy, Ramesh, Goh, Agarwal, Sastry, Askell, Mishkin, Clark, Krueger and Sutskever2021) further enhances this potential. In the following paragraph, we will explore methods concerning domain-specific objectives, focusing on the consideration of physical and biological constraints and their applications.

3.3. Domain-specific objectives

Machine learning methods commonly use a fairly limited set of metrics to evaluate success, such as (macro or micro) accuracy of labels and cross-entropy loss for classification tasks, mean squared error or mean average error for regression tasks, and so forth. However, these uniform metrics do not necessarily reflect the realities of real-world use cases (Birhane et al., Reference Birhane, Kalluri, Card, Agnew, Dotan and Bao2022), where criteria for success may be much more nuanced or domain-specific. In this section, we consider two other families of objectives that may frequently be of relevance in forest monitoring.

3.3.1. Constraints on data

Depending on the domain of application, the outputs of a machine learning pipeline may have specific constraints that must be satisfied if the answer is to be useful or even possible. For example, climate variables may need to obey physical laws such as conservation of energy, engineered systems may need to obey the laws of mechanics, and so forth. Machine learning models to work with such variables have increasingly been designed with soft constraints (Ouaknine et al., Reference Ouaknine, Newson, Pérez, Tupin and Rebut2021a; Harder et al., Reference Harder, Watson-Parris, Stier, Strassel, Gauger and Keuper2022), which impose penalties for constraint violation, or hard constraints (Donti et al., Reference Donti, Rolnick and Kolter2021; Geiss and Hardin, Reference Geiss and Hardin2021; Harder et al., Reference Harder, Ramesh, Hernandez-Garcia, Yang, Sattigeri, Szwarcman, Watson and Rolnick2023), where the constraints are strictly enforced by the design of the algorithm. Compared to physics- and engineered-based constraints, fewer authors have to date integrated biological constraints into ML algorithms. Dynamics of biological systems have been included in a deep learning optimization scheme as hard constraints from ordinary differential equations (Yazdani et al., Reference Yazdani, Lu, Raissi and Karniadakis2020). There are potential opportunities for incorporating biological constraints in forest monitoring by considering phenological (Richardson et al., Reference Richardson, Hufkens, Milliman and Frolking2018) or biophysical traits, or ecosystem properties (see Section 2.1.5), for example, by considering the ratio of tree height and canopy size. These constraints could be particularly valuable in tasks such as semantic segmentation or biomass estimation.

Domain-specific constraints on data may also pose opportunities for improving the design of machine learning models. The design of deep learning model architectures can incorporate considerations for, or reconstruction of, physical properties. For instance, a physics-informed architecture has been developed for super-resolution in turbulent flows, incorporating partial differential equations as a form of regularization (Jiang et al., Reference Jiang, Esmaeilzadeh, Azizzadenesheli, Kashinath, Mustafa, Tchelepi, Marcus, Prabhat and Anandkumar2020). Similarly, RADAR-based architectures have been created to reconstruct physical properties for scene understanding in the context of autonomous driving (Ouaknine et al., Reference Ouaknine, Newson, Pérez, Tupin and Rebut2021a; Rebut et al., Reference Rebut, Ouaknine, Malik and Pérez2022). Leveraging the properties of multiple sensors has also been employed to fuse their representations (Ouaknine, Reference Ouaknine2022) or to generate annotations from one modality to another (Ouaknine et al., Reference Ouaknine, Newson, Rebut, Tupin and Perez2021b; Schiefer et al., Reference Schiefer, Schmidtlein, Frick, Frey, Klinke, Zielewska-Büttner, Junttila, Uhl and Kattenborn2023). In remote sensing, self-supervised learning has benefited from SAR physical properties by considering a pretext denoising task (Dalsasso et al., Reference Dalsasso, Denis and Tupin2021; Meraoumia et al., Reference Meraoumia, Dalsasso, Denis, Abergel and Tupin2023), or by separating and reconstructing the real from the imaginary part of the signal (Dalsasso et al., Reference Dalsasso, Denis and Tupin2022). Such methods could also be explored by exploiting various sensors to learn representations of forests and trees (see Section 2.2.4).

3.3.2. Uncertainty quantification

Biological phenomena adhere to intricate rules that are challenging to estimate and often exhibit inherent uncertainties. The estimation of prediction uncertainty aids in obtaining a better understanding of the strengths and limitations of a machine learning model. The overall uncertainty of these models comprises both aleatoric and epistemic uncertainties (Gal, Reference Gal2016). They both can be distinguished based on their origins. Aleatoric uncertainty arises from the inherent noise present in the data and label distributions, while epistemic uncertainty is associated with the model itself, encompassing its estimated parameters and structural characteristics. Approaches have been devised to estimate the uncertainties of deep neural networks, for example, by using a Bayesian approach such as Monte Carlo dropout (Gal and Ghahramani, Reference Gal and Ghahramani2016), by using adversarial training combined with model ensembles (Lakshminarayanan et al., Reference Lakshminarayanan, Pritzel and Blundell2017), by predicting the uncertainty distribution (Malinin and Gales, Reference Malinin and Gales2018), or by learning an auxiliary confidence score from the data (Corbière et al., Reference Corbière, Thome, Bar-Hen, Cord and Pérez2019; Corbière, Reference Corbière2022). Similar methods have been applied to estimate uncertainty in remote sensing data for crop yield estimation (Ma et al., Reference Ma, Zhang, Kang and özdoğan2021b) or road segmentation (Haas and Rabus, Reference Haas and Rabus2021). The quantification of uncertainties in forest monitoring methods has been carried out to assess both aleatoric and epistemic uncertainties (see Section 2.2). This is commonly performed to evaluate the uncertainty of predictions on large-scale maps, utilizing low-resolution satellite data. The uncertainty of plant functional type has been studied for classification in Siberia (Ottlé et al., Reference Ottlé, Lescure, Maignan, Poulter, Wang and Delbart2013). Estimating the uncertainty of aboveground biomass has also been conducted to establish a range of estimated values in carbon stock maps (Patterson et al., Reference Patterson, Healey, Ståhl, Saarela, Holm, Andersen, Dubayah, Duncanson, Hancock, Armston, Kellner, Cohen and Yang2019; Santoro et al., Reference Santoro, Cartus, Carvalhais, Rozendaal, Avitabile, Araza, de Bruin, Herold, Quegan, Rodriguez-Veiga, Balzter, Carreiras, Schepaschenko, Korets, Shimada, Itoh, Martínez, Cavlovic, Cazzolla Gatti, da Conceiçao Bispo, Dewnath, Labrière, Liang, Lindsell, Mitchard, Morel, Pacheco Pascagaza, Ryan, Slik, Vaglio Laurin, Verbeeck, Wijaya and Willcock2021) (see Section 2.1.3). To quantify uncertainty, these methods utilize standard deviations or output probabilities of the model. Recent studies have taken a step further in estimating tree carbon stocks in semi-arid sub-Saharan Africa north of the Equator by combining uncertainty from both allometric equations and predicted crown segmentation, utilizing field measurements (Tucker et al., Reference Tucker, Brandt, Hiernaux, Kariryaa, Rasmussen, Small, Igel, Reiner, Melocik, Meyer, Sinno, Romero, Glennie, Fitts, Morin, Pinzon, McClain, Morin, Porter, Loeffler, Kergoat, Issoufou, Savadogo, Wigneron, Poulter, Ciais, Kaufmann, Myneni, Saatchi and Fensholt2023). There has been limited application of advanced uncertainty quantification methods, whether associated with the data or the predictive model, in the context of forest monitoring.

Despite the extensive application of the presented machine learning techniques in remote sensing, their utilization for forest monitoring has been relatively limited. This presents numerous opportunities to gain deeper insights into the composition of forests while achieving generalization at a large scale. However, it is crucial to have access to high-quality, diverse, and sufficient datasets in order to effectively explore machine learning strategies. In the following section, we will review open-access forest datasets, providing information on their size, tasks, scale, and modalities.

4. Review of open-access forest datasets

Open-access datasets are essential to drive the scientific community in general to exploring forest biology challenges, particularly by using machine learning strategies (see Section 3). Deep learning algorithms have demonstrated strong performance in various forest monitoring tasks, such as tree classification or segmentation (Kattenborn et al., Reference Kattenborn, Leitloff, Schiefer and Hinz2021). The availability of open-access datasets has played a significant role in enhancing the algorithm performances and expanding their applications on a larger scale. In this particular field, the use of data, from the tree to the country level (see Figure 2), distributed in the entire globe, must be taken into consideration. Algorithms have been trained for forest monitoring by leveraging datasets that encompass different scales, modalities, and tasks (Guimaraes et al., Reference Guimaraes, Padua, Marques, Silva, Peres and Sousa2020; Kattenborn et al., Reference Kattenborn, Leitloff, Schiefer and Hinz2021; Michalowska and Rapinski, Reference Michalowska and Rapinski2021). However, the limited availability of data sources often restricts public access, thereby impeding the progress of extended research projects. While the scientific community emphasizes the importance of reproducible experiments, it is worth noting that some datasets do not fully adhere to the fair principles (https://www.go-fair.org/fair-principles/), which encompass aspects like documentation and findability.

While there is still a considerable quantity of publicly available datasets, it is important to acknowledge that they may have certain limitations that restrict their impact in machine learning applications for forest composition analysis. These limitations can include factors such as the size of the dataset or the specific type of data that is released. This section aims to review forest monitoring datasets considering the following criteria:

  1. 1. The dataset should be open-access, that is, without any request requirement.

  2. 2. The dataset should be related to at least one published article; exceptions have been made for datasets that are available as preprints, but are considered to be must-see datasets.

  3. 3. The dataset should be focused on the composition of the forest, excluding event-based specific ones (i.e., wildfire detection).

  4. 4. An LULC dataset should contain more than a single plant functional type (i.e., conifers or deciduous) since a focus is made on better understanding the composition of the forest.

  5. 5. The dataset should be at the tree level at least, excluding datasets at the organ or cellular level considered as out of the scope of this review (e.g., leaf spectra or root scans).

  6. 6. The dataset should contained at least $ O\left(1,000\right) $ trees.

Based on these criteria, 86 datasets have been identified representing a wide range of geographical locations and spanning from 1974 to 2022. The datasets are associated with publications from 2005 to 2023, as depicted in Figure 3.

Figure 3. Distribution of the reviewed open-access forest datasets. Note: (Left) World map of the location of the reviewed datasets at the country level. Most of the datasets are regional and do not reflect the entire associated country. The datasets categorized with a “Worldwide” location or at the continent level have been excluded for visualization purposes. (Right) Distributions of the publication years and recording years used and/or released in the associated datasets.

The scope of the presented review is broad; it is likely that other datasets meeting these requirements have been missed. Based on this motivation, the study is supported by OpenForest, a dynamic catalog integrating the reviewed datasets and open to updates from the community. (The catalog contains all URLs to access the datasets that are not included in this article to ensure a temporal consistency. OpenForest is available at https://github.com/RolnickLab/OpenForest.) Updates on OpenForest will be restricted with the criteria detailed above. We hope to motivate researchers by grouping our efforts to create the largest database of open-access forest datasets and thus create synergies on forest monitoring applications.

This section will review open-access forest datasets grouped at different scales as presented in Figure 2: inventories (Section 4.1), ground-based recordings (Section 4.2), aerial recordings (Section 4.3), satellite recordings (Section 4.4), and country or world maps (Section 4.5). Datasets composed of mixed scales are finally presented (Section 4.6).

Each section will detail the overall scope of the presented datasets with the specificity of the sensors used to record the data, the information related to each dataset, and their applications. In each section, the reviewed datasets will be categorized in tables, respectively, to the scale of the released data. In these tables, the publication and recording years are differentiated to better understand the temporal scope of the datasets. The recordings years are distinguished with a new line, while time series are represented by an upper dash. Each table will relate the available modalities in the “Data” column. This one is separated with the “Spatial resolution” or “Spatial precision” column (except for inventories) with a dashed line to associate a resolution to the corresponding modality. Each section will also discuss the limits of current open-access datasets to motivate our perspectives presented in Section 5. The following section will review inventory datasets as the smallest scale of recordings that have been taken into account.

4.1. Inventories

Historically, forests have been mostly locally or regionally inventoried based on stratified plot samples acquired in the field (Jucker et al., Reference Jucker, Fischer, Chave, Coomes, Caspersen, Ali, Loubota Panzou, Feldpausch, Falster, Usoltsev, Adu-Bredu, Alves, Aminpour, Angoboy, Anten, Antin, Askari, Muñoz, Ayyappan, Balvanera, Banin, Barbier, Battles, Beeckman, Bocko, Bond-Lamberty, Bongers, Bowers, Brade, van Breugel, Chantrain, Chaudhary, Dai, Dalponte, Dimobe, Domec, Doucet, Duursma, Enríquez, van Ewijk, Farfán-Rios, Fayolle, Forni, Forrester, Gilani, Godlee, Gourlet-Fleury, Haeni, Hall, He, Hemp, Hernández-Stefanoni, Higgins, Holdaway, Hussain, Hutley, Ichie, Iida, Jiang, Joshi, Kaboli, Larsary, Kenzo, Kloeppel, Kohyama, Kunwar, Kuyah, Kvasnica, Lin, Lines, Liu, Lorimer, Loumeto, Malhi, Marshall, Mattsson, Matula, Meave, Mensah, Mi, Momo, Moncrieff, Mora, Nissanka, O’Hara, Pearce, Pelissier, Peri, Ploton, Poorter, Pour, Pourbabaei, Dupuy-Rada, Ribeiro, Ryan, Sanaei, Sanger, Schlund, Sellan, Shenkin, Sonké, Sterck, Svátek, Takagi, Trugman, Ullah, Vadeboncoeur, Valipour, Vanderwel, Vovides, Wang, Wang, Wirth, Woods, Xiang, Ximenes, Xu, Yamada and Zavala2022). Digitized and open-access inventories generally cover small areas, consisting of dozens or a few hundred trees, which limits their impact on the machine learning community (Section 3). As defined in Section 4, this section is focused on medium- to large-scale inventories with at least $ O\left(1,000\right) $ trees. A significant part of reviewed inventory datasets are mixed with modalities at different scales, which will be detailed in Section 4.6.

Inventory datasets are summarized in Table 1; the size of the datasets is quantified by the number of trees. Inventory datasets are composed of various measurements. They commonly contain tree height, canopy diameter, diameter at breast height (DBH), or diameter at soil height (DSH) (Gastauer et al., Reference Gastauer, Leyh and Meira-Neto2015; Jucker et al., Reference Jucker, Fischer, Chave, Coomes, Caspersen, Ali, Loubota Panzou, Feldpausch, Falster, Usoltsev, Adu-Bredu, Alves, Aminpour, Angoboy, Anten, Antin, Askari, Muñoz, Ayyappan, Balvanera, Banin, Barbier, Battles, Beeckman, Bocko, Bond-Lamberty, Bongers, Bowers, Brade, van Breugel, Chantrain, Chaudhary, Dai, Dalponte, Dimobe, Domec, Doucet, Duursma, Enríquez, van Ewijk, Farfán-Rios, Fayolle, Forni, Forrester, Gilani, Godlee, Gourlet-Fleury, Haeni, Hall, He, Hemp, Hernández-Stefanoni, Higgins, Holdaway, Hussain, Hutley, Ichie, Iida, Jiang, Joshi, Kaboli, Larsary, Kenzo, Kloeppel, Kohyama, Kunwar, Kuyah, Kvasnica, Lin, Lines, Liu, Lorimer, Loumeto, Malhi, Marshall, Mattsson, Matula, Meave, Mensah, Mi, Momo, Moncrieff, Mora, Nissanka, O’Hara, Pearce, Pelissier, Peri, Ploton, Poorter, Pour, Pourbabaei, Dupuy-Rada, Ribeiro, Ryan, Sanaei, Sanger, Schlund, Sellan, Shenkin, Sonké, Sterck, Svátek, Takagi, Trugman, Ullah, Vadeboncoeur, Valipour, Vanderwel, Vovides, Wang, Wang, Wirth, Woods, Xiang, Ximenes, Xu, Yamada and Zavala2022; National Ecological Observatory Network (NEON), 2023; Oliveira et al., Reference Oliveira, Farias, Perdiz, Scudeller and Imbrozio Barbosa2017;Pérez-Luque et al., Reference Pérez-Luque, Zamora, Bonet and Pérez-Pérez2015; Pérez-Luque et al., Reference Pérez-Luque, Gea-Izquierdo and Zamora2021). In specific cases, wood density, bark density, and bark thickness are also measured (Schepaschenko et al., Reference Schepaschenko, Chave, Phillips, Lewis, Davies, Réjou-Méchain, Sist, Scipal, Perger, Herault, Labrière, Hofhansl, Affum-Baffoe, Aleinikov, Alonso, Amani, Araujo-Murakami, Armston, Arroyo, Ascarrunz, Azevedo, Baker, Bałazy, Bedeau, Berry, Bilous, Bilous, Bissiengou, Blanc, Bobkova, Braslavskaya, Brienen, Burslem, Condit, Cuni-Sanchez, Danilina, del Castillo Torres, Derroire, Descroix, Sotta, d’Oliveira, Dresel, Erwin, Evdokimenko, Falck, Feldpausch, Foli, Foster, Fritz, Garcia-Abril, Gornov, Gornova, Gothard-Bassébé, Gourlet-Fleury, Guedes, Hamer, Susanty, Higuchi, Coronado, Hubau, Hubbell, Ilstedt, Ivanov, Kanashiro, Karlsson, Karminov, Killeen, Koffi, Konovalova, Kraxner, Krejza, Krisnawati, Krivobokov, Kuznetsov, Lakyda, Lakyda, Licona, Lucas, Lukina, Lussetti, Malhi, Manzanera, Marimon, Junior, Martinez, Martynenko, Matsala, Matyashuk, Mazzei, Memiaghe, Mendoza, Mendoza, Moroziuk, Mukhortova, Musa, Nazimova, Okuda, Oliveira, Ontikov, Osipov, Pietsch, Playfair, Poulsen, Radchenko, Rodney, Rozak, Ruschel, Rutishauser, See, Shchepashchenko, Shevchenko, Shvidenko, Silveira, Singh, Sonké, Souza, Stereńczak, Stonozhenko, Sullivan, Szatniewska, Taedoumg, ter Steege, Tikhonova, Toledo, Trefilova, Valbuena, Gamarra, Vasiliev, Vedrova, Verhovets, Vidal, Vladimirova, Vleminckx, Vos, Vozmitel, Wanek, West, Woell, Woods, Wortel, Yamada, Nur Hajar and Zo-Bi2019; Farias et al., Reference Farias, Silva, de Oliveira Perdiz, Citó, da Silva Carvalho and Barbosa2020; Kindermann et al., Reference Kindermann, Dobler, Niedeggen, Fabiano and Linstädter2022). These information are particularly useful to estimate the tree density, the aboveground biomass (AGB), or the tree carbon stock at large scale (Tucker et al., Reference Tucker, Brandt, Hiernaux, Kariryaa, Rasmussen, Small, Igel, Reiner, Melocik, Meyer, Sinno, Romero, Glennie, Fitts, Morin, Pinzon, McClain, Morin, Porter, Loeffler, Kergoat, Issoufou, Savadogo, Wigneron, Poulter, Ciais, Kaufmann, Myneni, Saatchi and Fensholt2023) even if the inventories have not been released with the estimated maps (Patterson et al., Reference Patterson, Healey, Ståhl, Saarela, Holm, Andersen, Dubayah, Duncanson, Hancock, Armston, Kellner, Cohen and Yang2019; Dionizio et al., Reference Dionizio, Pimenta, Lima and Costa2020).

Table 1. Review of open-access forest inventories datasets

Note: The dataset size measured in K is $ O\left({10}^3\right) $ . AGB = aboveground biomass; Classif. = classification; DBH = diameter at breast height; DSH = diameter at soil height; N/A = non-applicable; OL = object localization; Reg. = regression; Unknown = non-provided by the authors.

Species, genus, and family of the trees are generally provided. This hierarchy of labels coming alongside with the tree geo-location make inventories a very accurate datasets for understanding forest composition. However, they are geographically sparse and centered in a specific location to reduce measurement efforts (Laar and Akça, Reference Laar and Akça2007; Motz et al., Reference Motz, Sterba and Pommerening2010). As an exception, Tallo (Jucker et al., Reference Jucker, Fischer, Chave, Coomes, Caspersen, Ali, Loubota Panzou, Feldpausch, Falster, Usoltsev, Adu-Bredu, Alves, Aminpour, Angoboy, Anten, Antin, Askari, Muñoz, Ayyappan, Balvanera, Banin, Barbier, Battles, Beeckman, Bocko, Bond-Lamberty, Bongers, Bowers, Brade, van Breugel, Chantrain, Chaudhary, Dai, Dalponte, Dimobe, Domec, Doucet, Duursma, Enríquez, van Ewijk, Farfán-Rios, Fayolle, Forni, Forrester, Gilani, Godlee, Gourlet-Fleury, Haeni, Hall, He, Hemp, Hernández-Stefanoni, Higgins, Holdaway, Hussain, Hutley, Ichie, Iida, Jiang, Joshi, Kaboli, Larsary, Kenzo, Kloeppel, Kohyama, Kunwar, Kuyah, Kvasnica, Lin, Lines, Liu, Lorimer, Loumeto, Malhi, Marshall, Mattsson, Matula, Meave, Mensah, Mi, Momo, Moncrieff, Mora, Nissanka, O’Hara, Pearce, Pelissier, Peri, Ploton, Poorter, Pour, Pourbabaei, Dupuy-Rada, Ribeiro, Ryan, Sanaei, Sanger, Schlund, Sellan, Shenkin, Sonké, Sterck, Svátek, Takagi, Trugman, Ullah, Vadeboncoeur, Valipour, Vanderwel, Vovides, Wang, Wang, Wirth, Woods, Xiang, Ximenes, Xu, Yamada and Zavala2022) groups inventories from all around the world, with an unprecedented number of species reported. The latter could have an impact on estimating tree species distribution at large scale.

Considering that inventories contain annotations of trees or tree clusters, they open possibilities to segment tree canopies according to their taxonomic levels, regress continuous metrics (i.e., height and biomass), or even locate tree individuals by predicting their coordinates or crown perimeter (Tucker et al., Reference Tucker, Brandt, Hiernaux, Kariryaa, Rasmussen, Small, Igel, Reiner, Melocik, Meyer, Sinno, Romero, Glennie, Fitts, Morin, Pinzon, McClain, Morin, Porter, Loeffler, Kergoat, Issoufou, Savadogo, Wigneron, Poulter, Ciais, Kaufmann, Myneni, Saatchi and Fensholt2023). Another example could be to estimate the wood density of a tree or its carbon stock using allometric equations with information on taxonomy and height measured on the field (Zianis et al., Reference Zianis, Muukkonen, Mäkipää and Mencuccini2005). Inventories could also be combined with other modalities and used as annotations for larger-scale tasks. As an example, remote sensing datasets presented in the following sections in the same geographic locations could be associated with inventories to enhance the precision of their annotations. While establishing this connection between ground measurements and remote sensing data presents its own set of challenges. In the following section, we will review datasets of ground-based recordings.

4.2. Ground-based recordings

The fine-scaled composition of forests can be understood by visualizing the trees within or under their canopy. Ground-based datasets are composed of recordings inside the forests, under the tree canopy. Trunks and small trees, invisible from a bird’s eye view, can be captured with cameras recording red–green–blue (RGB) images per example. These data are sometimes recorded in time series, for example, PhenoCams (Klosterman et al., Reference Klosterman, Hufkens, Gray, Melaas, Sonnentag, Lavine, Mitchell, Norman, Friedl and Richardson2014; Brown et al., Reference Brown, Hultine, Steltzer, Denny, Denslow, Granados, Henderson, Moore, Nagai, SanClements, Sánchez-Azofeifa, Sonnentag, Tazik and Richardson2016). The use of data recorded by sensors by machine learning algorithms help to have a broader context and more tree information in the samples compared to inventories.

Ground-based datasets are reviewed in Table 2. The dataset size has been measured in hectares (ha) corresponding to the studied surface, in the number of trees in the area, or in the number of samples, which may differ between synthetic and real samples (Grondin et al., Reference Grondin, Fortin, Pomerleau and Giguère2022).

Table 2. Review of open-access ground-based forest datasets

Note: The dataset size measured in K is $ O\left({10}^3\right) $ . DBH = diameter at breast height; ha = hectares; IMU = inertial measurement unit; IS = instance segmentation; KD = key-point detection; N/A = non-applicable; OD = object detection; PC = point cloud; Reg. = regression; RGB = red–green–blue images; Unknown = non-provided by the authors.

Stereo cameras are parameterized to estimate the depth of a scene differentiating trees and objects from the background in the forest (Grondin et al., Reference Grondin, Fortin, Pomerleau and Giguère2022). Thermal cameras have also been used to record trees’ signature (Still et al., Reference Still, Powell, Aubrecht, Kim, Helliker, Roberts, Richardson and Goulden2019) and distinguish them from other objects (Reis et al., Reference Reis, dos Santos and Santos2020; da Silva et al., Reference da Silva, dos Santos, Sousa and Filipe2021a,Reference da Silva, dos Santos, Sousa, Filipe and Boaventura-Cunhab, Reference da Silva, Santos, Filipe, Sousa and Oliveira2022). In specific cases, camera images have been annotated with bounding boxes around trees to detect them (Tremblay et al., Reference Tremblay, Béland, Gagnon, Pomerleau and Giguère2020; Grondin et al., Reference Grondin, Fortin, Pomerleau and Giguère2022). Only two reviewed datasets located in Canada have been annotated with several species classes to combine detection and classification of trees (Tremblay et al., Reference Tremblay, Béland, Gagnon, Pomerleau and Giguère2020; Grondin et al., Reference Grondin, Fortin, Pomerleau and Giguère2022). Since these datasets also provide inertial measurement unit (IMU), a potential task could be to predict the next move of an automated agent in a forest.

Forest geometry is also being intensively studied from the ground by using LiDAR—typically referred to as terrestrial laser scanning (TLS). This active sensor records three-dimensional scenes with photon reflections and can be applied from tripods or be combined with IMUs to enable mobile laser scanning. It is not impacted by sun lighting conditions and well suited to understand the structure of forests and trees such as measuring, gap fraction, stand density, tree height, DBH, volume, or biomass (Hackenberg et al., Reference Hackenberg, Spiecker, Calders, Disney and Raumonen2015; Liang et al., Reference Liang, Kankare, Hyyppä, Wang, Kukko, Haggrén, Yu, Kaartinen, Jaakkola, Guan, Holopainen and Vastaranta2016; Tremblay et al., Reference Tremblay, Béland, Gagnon, Pomerleau and Giguère2020). The spatial resolution of ground-based LiDAR recordings is either expressed in the average number of points per meter squared, or in the precision of localization of each point, based on information provided by the authors. The generated LiDAR point clouds have been used for instance segmentation (Burt et al., Reference Burt, Disney and Calders2018; Tremblay et al., Reference Tremblay, Béland, Gagnon, Pomerleau and Giguère2020; Grondin et al., Reference Grondin, Fortin, Pomerleau and Giguère2022), that is, segment each tree independently and associate them an identification number, or key-point detection, that is, localizing points of interest for each tree.

Ground-based datasets are useful to understand the composition of forests under the tree canopy, and recordings were difficult to automatize until recently (Calders et al., Reference Calders, Brede, Newnham, Culvenor, Armston, Bartholomeus, Griebel, Hayward, Junttila, Lau, Levick, Morrone, Origo, Pfeifer, Verbesselt and Herold2023). Literature lacks large-scale annotated datasets, although they can provide information at high spatial and temporal resolution and from perspectives that aerial and satellite recordings cannot. Providing both ground-based and aerial-based recordings (Soltani et al., Reference Soltani, Feilhauer, Duker and Kattenborn2022), informing both above and below tree canopy would facilitate transfer and bridging machine learning applications between different modality scales (for details, see Section 5). The next section will review aerial recordings datasets.

4.3. Aerial recordings

Aerial datasets consist of recordings of sensors mounted on unoccupied (drones) or occupied aircrafts flying above the tree canopy, offering a broader perspective of the forest without the hindrance of obstacles impeding the automatic recording process. The diversity in aerial datasets has increased in the past few years since they are used for diverse applications such as vegetation segmentation, disease detection, fire detection, and numerous others (Guimaraes et al., Reference Guimaraes, Padua, Marques, Silva, Peres and Sousa2020). This is in part also boosted as governmental organizations are increasingly making the imagery of repeated official aerial campaigns openly available (e.g., for entire countries). Furthermore, the decreasing costs of UAVs and the miniaturization of high-quality sensors have served as strong incentives for their adoption within the community.

Aerial-based recordings are reviewed in Table 3. The dataset size is expressed in kilometer squared ( $ {\mathrm{km}}^2 $ ), or in hectares (ha) if the studied area is small. It is also quantified by the number of samples or the number of trees if applicable.

Table 3. Review of open-access aerial forest datasets

Note: The dataset size measured in K is $ O\left({10}^3\right) $ . CHM = canopy height model; Classif. = classification; DBH = diameter at breast height; DSM = digital surface model; DTM = digital terrain model (spatial or vertical); ha = hectares; MC = multi-classification; N/A = non-applicable; OD = object detection; PC = point cloud; Reg. = regression; RGB = red–green–blue images; Seg. = semantic segmentation; Unknown = non-provided by the authors.

Multiple sensors can be carried by UAVs, including RGB and thermal cameras, multispectral sensors, hyperspectral sensors, and LiDAR, which collectively contribute to a captivating array of recorded data, offering diverse perspectives and insights. Cameras mounted on UAVs facilitate the acquisition of overlapping images with a spatial resolution of a few millimeters to centimeters. Such high-resolution image datasets can be applied in concert with photogrammetric workflows, which enable a triangulation of common features found in overlapping images, enabling to precisely reconstruct camera parameters and orientations in hundreds of images automatically. Such workflows enable to reconstruct digital surface models and reprojections of the imagery to generate geocoded image mosaics with orthographic projection (Guimaraes et al., Reference Guimaraes, Padua, Marques, Silva, Peres and Sousa2020; Diez et al., Reference Diez, Kentsch, Fukuda, Caceres, Moritake and Cabezas2021). Most of the recently publicly released aerial datasets contain RGB images generated by photogrammetry since they are relatively simple and cheap to collect (Morales et al., Reference Morales, Kemper, Sevillano, Arteaga, Ortega and Telles2018; Kattenborn et al., Reference Kattenborn, Eichel and Fassnacht2019a, Reference Kattenborn, Eichel, Wiser, Burrows, Fassnacht and Schmidtlein2020; Kentsch et al., Reference Kentsch, Lopez Caceres, Serrano, Roure and Diez2020; Schiefer et al., Reference Schiefer, Kattenborn, Frick, Frey, Schall, Koch and Schmidtlein2020; Nguyen et al., Reference Nguyen, Lopez Caceres, Moritake, Kentsch, Shu and Diez2021; Galuszynski et al., Reference Galuszynski, Duker, Potts and Kattenborn2022; Reiersen et al., Reference Reiersen, Dao, Lütjens, Klemmer, Amara, Steinegger, Zhang and Zhu2022). But the original RGB point cloud carrying the height information used to generate the DSM is generally not provided with some exceptions (Brieger et al., Reference Brieger, Herzschuh, Pestryakova, Bookhagen, Zakharov and Kruse2019; van Geffen et al., Reference van Geffen, Heim, Brieger, Geng, Shevtsova, Schulte, Stuenzi, Bernhardt, Troeva, Pestryakova, Zakharov, Pflug, Herzschuh and Kruse2022). This is unfortunate because there would be opportunities for new multimodal models to leverage both the RGB and point cloud modalities to improve model performance.

An alternative method for studying the topography of both the ground and canopies, depending on the structure of the forest, involves the utilization of airborne LiDAR acquisitions (Ferraz et al., Reference Ferraz, Saatchi, Xu, Hagen, Chave, Yu, Meyer, Garcia, Silva, Roswintiart, Samboko, Sist, Walker, Pearson, Wijaya, Sullivan, Rutishauser, Hoekman and Ganguly2018; Kalinicheva et al., Reference Kalinicheva, Landrieu, Mallet and Chehata2022). In contrast to terrestrial LiDAR, these measurements commonly have lower point densities, but cover large areas. Airborne LiDAR sensors are operated with IMU sensors, which enables geo-referenced flights transect across large spatial extents. These sensors typically can record multiple returns per LiDAR pulse so that acquisitions can resemble the vertical structure of forest stands, including multiple overlapping tree layers, the understory, and even the ground topography (Kalinicheva et al., Reference Kalinicheva, Landrieu, Mallet and Chehata2022). The spatial resolution of airborne LiDAR products is estimated by the average number of points per square meter.

Multispectral and hyperspectral sensors are passive, capturing reflected or emitted photons (Mavrovic et al., Reference Mavrovic, Sonnentag, Lemmetyinen, Baltzer, Kinnard and Roy2023) from the sun across wavelength bands that extend beyond the visible spectrum, allowing for comprehensive recording of electromagnetic radiation throughout the near up to the shortwave infrared region. They are especially valuable in assessing the composition of forest canopies, enabling the differentiation of species or retrieving biochemical and structural properties based on the spectral characteristics across spectral bands (Fassnacht et al., Reference Fassnacht, Latifi, Stereńczak, Modzelewska, Lefsky, Waser, Straub and Ghosh2016; Cherif et al., Reference Cherif, Feilhauer, Berger, Dao, Ewald, Hank, He, Kovach, Lu, Townsend and Kattenborn2023). A trade-off is usually required between acquiring information with a high spectral and a low spatial resolution (Paz-Kagan et al., Reference Paz-Kagan, Caras, Herrmann, Shachak and Karnieli2017), or with a low spectral and a high spatial resolution (Garioud et al., Reference Garioud, Peillet, Bookjans, Giordano and Wattrelos2022), given that the radiation reflected by plant canopies does not suffice the acquisition at high spectral and high spatial resolution simultaneously.

Forest monitoring can be explored in different ways using aerial datasets relying on the sensors employed and the annotations provided alongside the data. For instance, semantic segmentation is a prevalent method employed to classify forest canopies into tree species (Morales et al., Reference Morales, Kemper, Sevillano, Arteaga, Ortega and Telles2018; Kattenborn et al., Reference Kattenborn, Eichel and Fassnacht2019a, Reference Kattenborn, Eichel, Wiser, Burrows, Fassnacht and Schmidtlein2020; Kentsch et al., Reference Kentsch, Lopez Caceres, Serrano, Roure and Diez2020; Schiefer et al., Reference Schiefer, Kattenborn, Frick, Frey, Schall, Koch and Schmidtlein2020; Galuszynski et al., Reference Galuszynski, Duker, Potts and Kattenborn2022). Depending on the canopy structural complexity and data quality, the classification might combined with a delineation of individual tree crowns using instance segmentation approaches. Thereby, instance segmentation captures the intricate shapes of tree crowns, unlike object detection, which typically predicts rectangular bounding boxes or centroids for individual objects (Reiersen et al., Reference Reiersen, Dao, Lütjens, Klemmer, Amara, Steinegger, Zhang and Zhu2022). Some of the reviewed datasets include a DSM, which can be utilized with tree localization to estimate canopy height. This application using deep learning algorithms and aerial data is an actual active field of research (Yue et al., Reference Yue, Yang, Li, Hu, Zhang and Li2019; Moradi et al., Reference Moradi, Javan and Samadzadegan2022; Reiersen et al., Reference Reiersen, Dao, Lütjens, Klemmer, Amara, Steinegger, Zhang and Zhu2022; Wagner et al., Reference Wagner, Roberts, Ritz, Carter, Dalagnol, Favrichon, Hirye, Brandt, Ciais and Saatchi2023).

Due to the high spatial resolution, datasets of aerial recordings enable a granular understanding of forests at the individual tree level. There are still many open challenges which could be explored at the tree level such as segmenting individual tree crowns in dense forests, classifying them between a wide range of species, or adapting algorithms from a forest to another (see Section 2). Nevertheless, the scale of aerial datasets, especially for drones, is constrained by limited battery life and recording capacities, making it challenging to regularly assess and thus monitor large forest areas. Consequently, the next section will explore satellite datasets, which are better suited for capturing a broader scope of forest landscapes at high frequencies.

4.4. Satellite recordings

Satellite imagery has been consistently recorded across the globe for many years, enabling extensive research in the field of temporal remote sensing. This abundance of data has opened up research in machine learning applied to Earth observation, in particular deep learning approaches (Camps-Valls et al., Reference Camps-Valls, Tuia, Zhu and Reichstein2021), in the past few years. The datasets generated by diverse satellite missions encompass a wide range of resolutions and employ various sensors, enabling studies of diverse phenomena over both space and time (Swain et al., Reference Swain, Paul and Behera2023).

The Landsat missions (https://www.usgs.gov/landsat-missions/landsat-satellite-missions), a collaborative endeavor started in the seventies involving the U.S. Geological Survey, U.S. Department of the Interior, National Aeronautics and Space Administration (NASA), and the U.S. Department of Agriculture, represent the earliest and pioneering attempt to utilize multispectral cameras for Earth observation (Wulder et al., Reference Wulder, Roy, Radeloff, Loveland, Anderson, Johnson, Zhu, Scambos, Pahlevan, Hansen, Gorelick, Crawford, Masek, Hermosilla, White, Belward, Schaaf, Woodcock, Huntington, Lymburner, Hostert, Gao, Lyapustin, Pekel, Strobl and Cook2022). Landsat missions 4 and 5 capture images with between four and seven spectral bands, offering spatial resolutions ranging from 30 to 120 meters. The more recent Landsat missions, namely Landsat 7 and Landsat 8, record images with eight and nine spectral bands, respectively. These missions provide spatial resolutions ranging from 15 to 60 meters for Landsat 7 and 15 to 30 meters for Landsat 8. All of the Landsat missions have a 16-day repeat cycle. Most of the reviewed datasets used 30-meter-resolution spectral bands to ensure a consistency between the bands used for their final application (Potapov et al., Reference Potapov, Tyukavina, Turubanova, Talero, Hernandez-Serna, Hansen, Saah, Tenneson, Poortinga, Aekakkararungroj, Chishtie, Towashiraporn, Bhandari, Aung and Nguyen2019, Reference Potapov, Hansen, Pickens, Hernandez-Serna, Tyukavina, Turubanova, Zalles, Li, Khan, Stolle, Harris, Song, Baggett, Kommareddy and Kommareddy2022; Robinson et al., Reference Robinson, Hou, Malkin, Soobitsky, Czawlytko, Dilkina and Jojic2019; Irvin et al., Reference Irvin, Sheng, Ramachandran, Johnson-Yu, Zhou, Story, Rustowicz, Elsworth, Austin and Ng2020; De Almeida Pereira et al., Reference De Almeida Pereira, Fusioka, Nassu and Minetto2021; Feng et al., Reference Feng, Sexton, Wang, Channan, Montesano, Wagner, Wooten and Neigh2022; Lee and Choi, Reference Lee and Choi2022).

The Sentinel missions (https://sentinel.esa.int/web/sentinel/missions), managed by the European Space Agency, have been designed to comprehensively monitor the Earth’s various domains, encompassing air, land, ocean, and atmospheric measurements. These missions employ multiple sensors, enabling a wide range of Earth observation capabilities. Sentinel-1 includes a SAR generating electromagnetic waves with wavelengths not impacted by clouds. The reviewed datasets provide or use Level-1 Ground Range Detected (GRD) products at a $ 10\times 10 $ meters resolution (Schmitt et al., Reference Schmitt, Hughes, Qiu and Zhu2019; Sumbul et al., Reference Sumbul, de Wall, Kreuziger, Marcelino, Costa, Benevides, Caetano, Demir and Markl2021; Lee and Choi, Reference Lee and Choi2022). Since two satellites (Sentinel-1A and Sentinel-1B) are recording data on the same orbit, the mission has a 6-day exact repeat cycle with less than a day of revisit frequency at high latitudes.

Sentinel-2 utilizes multispectral sensors to scan photon reflectance across multiple spectral bands. The spatial resolution of the recorded data depends on the spectral bands: four bands at 10 meters, six bands at 20 meters, and three bands at 60 meters. Released datasets kept either 10-m-resolution bands (Schmitt et al., Reference Schmitt, Hughes, Qiu and Zhu2019; Bastani et al., Reference Bastani, Wolters, Gupta, Ferdinando and Kembhavi2023; Lee and Choi, Reference Lee and Choi2022) or all the bands (Sumbul et al., Reference Sumbul, de Wall, Kreuziger, Marcelino, Costa, Benevides, Caetano, Demir and Markl2021). The revisit frequency of the combined constellation of Sentinel-2A and B is 5 days on most of the globe.

Data from the Landsat and Sentinel missions are the most commonly provided in the reviewed datasets, but other interesting satellite sources are also explored. For instance, the Moderate Resolution Imaging Spectroradiometer (MODIS) (https://modis.gsfc.nasa.gov) instrument, introduced by NASA and integrated into the Terra and Aqua missions, generates data that are also used for large-scale forest monitoring purposes. The MODIS instrument offers recordings from 36 spectral bands, each defined for diverse observations, including atmospheric gases, ocean components, land boundaries, and properties (Schmitt et al., Reference Schmitt, Hughes, Qiu and Zhu2019; Levin et al., Reference Levin, Yebra and Phinn2021). Another example is the Visible Infrared Imaging Radiometer Suite (VIIRS) instrument (https://www.earthdata.nasa.gov/learn/find-data/near-real-time/viirs), part of the NOAA-20 missions, which also have generated data contained in a forest dataset for land and atmospheric observations (Levin et al., Reference Levin, Yebra and Phinn2021). It should be noted that researchers have used recordings from PlanetLabs (https://www.planet.com/), PlanetScope (https://earth.esa.int/eogateway/missions/planetscope), or Maxar (https://www.maxar.com/) missions (e.g., GeoEye, WorldView, or QuickBird), which provide multispectral data with submeter spatial resolution (Brandt et al., Reference Brandt, Tucker, Kariryaa, Rasmussen, Abel, Small, Chave, Rasmussen, Hiernaux, Diouf, Kergoat, Mertz, Igel, Gieseke, Schöning, Li, Melocik, Meyer, Sinno, Romero, Glennie, Montagu, Dendoncker and Fensholt2020). However, these datasets are not publicly accessible due to the associated licensing restrictions.

The datasets that have been reviewed encompass satellite data obtained from various missions and products, originating from different locations, and exhibiting diverse spatial and temporal resolutions. The details of datasets published before 2020 and included, or after 2020, are provided, respectively, in Tables 4 and 5. The dataset size is expressed in kilometer squared ( $ {\mathrm{km}}^2 $ ), or in hectares (ha) if the studied area is small. It is also quantified by the number of samples, trees, or events if applicable.

Table 4. Review of open-access satellite forest datasets before 2020 (included)

Note: The dataset size measured in K is $ O\left({10}^3\right) $ and in M is $ O\left({10}^6\right) $ . CHM = canopy height model; Classif. = classification; ha = hectares; LMFC = live fuel moisture content; LULC = land use and/or land cover; N/A = non-applicable; Reg. = regression; SAR = synthetic-aperture RADAR; Seg. = semantic segmentation.

Table 5. Review of satellite recording datasets after 2021 (included)

Note: The dataset size measured in K is $ O\left({10}^3\right) $ and in M is $ O\left({10}^6\right) $ . CD = change detection; Classif. = classification; ha = hectares; LULC = land use and/or land cover; MC = multi-classification; N/A = non-applicable; NDVI = normalized difference vegetation index; Reg. = regression; SAR = synthetic-aperture RADAR; Seg. = semantic segmentation; Unknown = non-provided by the authors.

a The list of countries is detailed in the OpenForest catalog.

Satellite datasets are frequently used for classification, multiclassification, or segmentation of satellite tiles, including LULC and tree species distribution. Other tasks include regression applications for forest cover estimation (Bastani et al., Reference Bastani, Wolters, Gupta, Ferdinando and Kembhavi2023; Feng et al., Reference Feng, Sexton, Wang, Channan, Montesano, Wagner, Wooten and Neigh2022), canopy height (Forkuor et al., Reference Forkuor, Benewinde Zoungrana, Dimobe, Ouattara, Vadrevu and Tondoh2020; Lang et al., Reference Lang, Jetz, Schindler and Wegner2022a), or live fuel moisture content estimation (Rao et al., Reference Rao, Williams, Flefil and Konings2020). An intriguing application involves utilizing satellite time series data to evaluate change detection of forest covers at a large scale (Wang et al., Reference Wang, Sulla-Menashe, Woodcock, Sonnentag, Keeling and Friedl2020a). This approach enables the estimation of deforestation, afforestation, and reforestation activities (Potapov et al., Reference Potapov, Hansen, Pickens, Hernandez-Serna, Tyukavina, Turubanova, Zalles, Li, Khan, Stolle, Harris, Song, Baggett, Kommareddy and Kommareddy2022).

Satellite recordings play a crucial role in Earth observation on a large scale, as they are manually or automatically processed to estimate global maps of forest cover, among other applications. Additionally, world maps depicting aboveground biomass, land use, and land cover have been estimated and made publicly available. In the following section, datasets containing maps at the country or global level will be reviewed.

4.5. Country or world maps

Earth observation applications have been resumed into maps at the country, continent, or global level. They are estimated using machine learning algorithms that incorporate manual expert features derived from satellite data such as statistics of the data distribution or vegetation indexes. These features encompass various aspects, ranging from multispectral information (Friedl et al., Reference Friedl, Sulla-Menashe, Tan, Schneider, Ramankutty, Sibley and Huang2010; Pflugmacher et al., Reference Pflugmacher, Rabe, Peters and Hostert2019) to climatic and elevation data (Chaves et al., Reference Chaves, Zuquim, Ruokolainen, Van doninck, Kalliola, Gómez Rivero and Tuomisto2020). The majority of the released maps have been estimated using machine learning algorithms trained on satellite data, as these algorithms demonstrate excellent scalability for predicting at a large scale and low resolution. The results obtained from these algorithms have been validated using field inventories. However, it is important to note that the field inventories themselves have not been included in the reviewed datasets discussed in this section. (Open-access datasets releasing both maps and inventories are reviewed in Table 8 and in Section 4.6.) Nonetheless, the reviewed map datasets are notable for their global coverage, which adds to their significance. Despite containing inherent uncertainties in their estimations, these maps have the potential to offer valuable meta-knowledge to downstream applications in the realm of forest monitoring.

Large-scale map datasets before 2019 (included) are reviewed in Table 6, while maps datasets released after 2020 (included) are reviewed in Table 7. The dataset size is expressed in kilometer squared ( $ {\mathrm{km}}^2 $ ), or in hectares (ha) if the studied area is small. It is also quantified by the number of samples or points if applicable.

Table 6. Review of open-access map forest datasets before 2019 (included)

Note: The dataset size measured in K is $ O\left({10}^3\right) $ and in M is $ O\left({10}^6\right) $ . AGB = aboveground biomass; Classif. = classification; IFL = intact forest landscape; LULC = land use and/or land cover; N/A = non-applicable; PFT = plant functional type; Reg. = regression; Seg. = semantic segmentation; Unknown = non-provided by the authors.

a The list of recording years is detailed in the OpenForest catalog.

b The list of countries is detailed in the OpenForest catalog.

Table 7. Review of open-access map forest datasets after 2020 (included)

Note: The dataset size measured in K is $ O\left({10}^3\right) $ and in M is $ O\left({10}^6\right) $ . AGB = aboveground biomass; BGB = belowground biomass; CD = change detection; CH = canopy height; Classif. = classification; GSV = growing stock volume; LULC = land use and/or land cover; MC = multi-classification; N/A = non-applicable; Reg. = regression; SCS = soil carbon stock; Seg. = semantic segmentation; Unknown = non-provided by the authors.

a The list of recording years is detailed in the OpenForest catalog.

A significant portion of map datasets focuses on providing information about LULC (see Section 4 for the proposed definition), which plays a crucial role in distinguishing different types of forests at a large scale (Bartholomé and Belward, Reference Bartholomé and Belward2005; Friedl et al., Reference Friedl, Sulla-Menashe, Tan, Schneider, Ramankutty, Sibley and Huang2010; Griffiths et al., Reference Griffiths, Kuemmerle, Baumann, Radeloff, Abrudan, Lieskovsky, Munteanu, Ostapowicz and Hostert2014; Pflugmacher et al., Reference Pflugmacher, Rabe, Peters and Hostert2019; Thonfeld et al., Reference Thonfeld, Steinbach, Muro and Kirimi2020; Bonannella et al., Reference Bonannella, Hengl, Heisig, Parente, Wright, Herold and de Bruin2022). Within in LULC maps, the reviewed works have also estimated the extend of forest cover, including time series data. These time series are particularly valuable for quantifying forest loss, that is, deforestation detection, as well as forest gain, that is, afforestation, reforestation monitoring, or deadwood maps (Hansen et al., Reference Hansen, Potapov, Moore, Hancher, Turubanova, Tyukavina, Thau, Stehman, Goetz, Loveland, Kommareddy, Egorov, Chini, Justice and Townshend2013; Curtis et al., Reference Curtis, Slay, Harris, Tyukavina and Hansen2018; Bunting et al., Reference Bunting, Rosenqvist, Hilarides, Lucas, Thomas, Tadono, Worthington, Spalding, Murray and Rebelo2022; Verhegghen et al., Reference Verhegghen, Kuzelova, Syrris, Eva and Achard2022; Schiefer et al., Reference Schiefer, Schmidtlein, Frick, Frey, Klinke, Zielewska-Büttner, Junttila, Uhl and Kattenborn2023). Another category of maps is specifically designed to differentiate between different plant functional types, particularly distinguishing between broad-leaved and needle-leaf forests, as well as identifying summer-green and evergreen forests across tropical, boreal, and temperate regions (Ottlé et al., Reference Ottlé, Lescure, Maignan, Poulter, Wang and Delbart2013).

As mentioned in Section 2.2, accurate estimation of aboveground biomass is crucial for a comprehensive quantification of the carbon stocks that forests worldwide hold. World maps of aboveground biomass have been estimated at different resolutions (Patterson et al., Reference Patterson, Healey, Ståhl, Saarela, Holm, Andersen, Dubayah, Duncanson, Hancock, Armston, Kellner, Cohen and Yang2019; Santoro et al., Reference Santoro, Cartus, Carvalhais, Rozendaal, Avitabile, Araza, de Bruin, Herold, Quegan, Rodriguez-Veiga, Balzter, Carreiras, Schepaschenko, Korets, Shimada, Itoh, Martínez, Cavlovic, Cazzolla Gatti, da Conceiçao Bispo, Dewnath, Labrière, Liang, Lindsell, Mitchard, Morel, Pacheco Pascagaza, Ryan, Slik, Vaglio Laurin, Verbeeck, Wijaya and Willcock2021; Tang et al., Reference Tang, Ma, Lister, O’Neill-Dunne, Lu, Lamb, Dubayah and Hurtt2021; Ma et al., Reference Ma, Hurtt, Tang, Lamb, Campbell, Dubayah, Guy, Huang, Lister, Lu, O’Neil-Dunne, Rudee, Shen and Silva2021a). Quantifying the uncertainty of aboveground biomass maps is also important as it depends on multiple factors, including tree species, tree height, canopy size, and reference data distribution (Patterson et al., Reference Patterson, Healey, Ståhl, Saarela, Holm, Andersen, Dubayah, Duncanson, Hancock, Armston, Kellner, Cohen and Yang2019; Ploton et al., Reference Ploton, Mortier, Réjou-Méchain, Barbier, Picard, Rossi, Dormann, Cornu, Viennois, Bayol, Lyapustin, Gourlet-Fleury and Pélissier2020; Santoro et al., Reference Santoro, Cartus, Carvalhais, Rozendaal, Avitabile, Araza, de Bruin, Herold, Quegan, Rodriguez-Veiga, Balzter, Carreiras, Schepaschenko, Korets, Shimada, Itoh, Martínez, Cavlovic, Cazzolla Gatti, da Conceiçao Bispo, Dewnath, Labrière, Liang, Lindsell, Mitchard, Morel, Pacheco Pascagaza, Ryan, Slik, Vaglio Laurin, Verbeeck, Wijaya and Willcock2021). Canopy height maps have also been quantified in sparse boreal forests (Bartsch et al., Reference Bartsch, Höfler, Kroisleitner and Trofaier2016, Reference Bartsch, Widhalm, Leibman, Ermokhina, Kumpula, Skarin, Wilcox, Jones, Frost, Höfler and Pointner2020). These canopy height maps, both at the country (Tolan et al., Reference Tolan, Yang, Nosarzewski, Couairon, Vo, Brandt, Spore, Majumdar, Haziza, Vamaraju, Moutakani, Bojanowski, Johns, White, Tiecke and Couprie2023) and world levels (Lang et al., Reference Lang, Jetz, Schindler and Wegner2022a,Reference Lang, Kalischek, Armston, Schindler, Dubayah and Wegnerb), have been estimated using LiDAR sensors as a ground truth. Accurate estimation of canopy height is crucial for evaluating aboveground biomass, which is why a few studies have utilized data from the Global Ecosystem Dynamics Investigation (GEDI) mission (https://gedi.umd.edu/) (Patterson et al., Reference Patterson, Healey, Ståhl, Saarela, Holm, Andersen, Dubayah, Duncanson, Hancock, Armston, Kellner, Cohen and Yang2019; Tang et al., Reference Tang, Ma, Lister, O’Neill-Dunne, Lu, Lamb, Dubayah and Hurtt2021; Ma et al., Reference Ma, Hurtt, Tang, Lamb, Campbell, Dubayah, Guy, Huang, Lister, Lu, O’Neil-Dunne, Rudee, Shen and Silva2021a; Lang et al., Reference Lang, Kalischek, Armston, Schindler, Dubayah and Wegner2022b; Tolan et al., Reference Tolan, Yang, Nosarzewski, Couairon, Vo, Brandt, Spore, Majumdar, Haziza, Vamaraju, Moutakani, Bojanowski, Johns, White, Tiecke and Couprie2023). GEDI records LiDAR data from the International Space Station, allowing for the estimation of a DSM that serves as a valuable reference for canopy height estimation. The overall biomass estimation of forests also includes belowground biomass (Chen et al., Reference Chen, Feng, Fu, Ma, Zohner, Crowther, Huang, Wu and Wei2023) and soil carbon stock (Dionizio et al., Reference Dionizio, Pimenta, Lima and Costa2020), which have been quantified using SAR satellite data penetrating dense canopies.

Although open-access map datasets are subject to the limitations and uncertainties inherent in the estimation methods employed by the authors, they remain a valuable source of data for obtaining a broad-scale understanding of forests or integrating meta-knowledge into future analyses. These datasets could be helpful for conducting further research and expanding our knowledge of forest ecosystems.

In the preceding sections, the reviewed datasets were presented with a focus on different scales. However, in the forthcoming section, datasets that offer a combination of data at various scales will be discussed in detail.

4.6. Datasets mixed at different scales

Datasets that offer data at various scales play an important role in establishing a bridge between different modalities recorded by diverse sensors. By integrating information from multiple sources, these datasets facilitate a comprehensive understanding of forests and enable cross-modal analysis. Inventories, ground-based, and aerial-based datasets are available at small scale but usually come alongside precise annotations at all tree levels. Conversely, satellite and map datasets are available at a larger scale but often lack precise annotations due to their lower resolution. Integrating data from different scales can be advantageous in generalizing and extrapolating local knowledge to a larger scale, bridging the gap between detailed annotations and broader coverage (Kattenborn et al., Reference Kattenborn, Lopatin, Förster, Braun and Fassnacht2019b; Schiefer et al., Reference Schiefer, Schmidtlein, Frick, Frey, Klinke, Zielewska-Büttner, Junttila, Uhl and Kattenborn2023).

The size of each dataset is expressed in kilometer squared ( $ {\mathrm{km}}^2 $ ), or in hectares (ha) if the studied area is small. It is also quantified by the number of samples, points, or trees if applicable.

Mixed datasets composed of inventories and aerial-based recordings (IA); inventories, aerial-based, and satellite-based recordings (IAS); and inventories and maps (IM) are reviewed in Table 8. Inventories provide an additional value to imagery recordings by providing geo-located annotations, depending on the level of precision they offer. These inventories enhance the spatial context and accuracy of the annotations. Combining inventories with aerial-based recordings would be highly beneficial for accurately aligning tree measurements with aerial data, especially with LiDAR (Weiser et al., Reference Weiser, Schäfer, Winiwarter, Krašovec, Fassnacht and Höfle2022) or RGB (Brieger et al., Reference Brieger, Herzschuh, Pestryakova, Bookhagen, Zakharov and Kruse2019; van Geffen et al., Reference van Geffen, Heim, Brieger, Geng, Shevtsova, Schulte, Stuenzi, Bernhardt, Troeva, Pestryakova, Zakharov, Pflug, Herzschuh and Kruse2022) recordings. This integration enables improved estimation of carbon stocks at the aerial scale, among other applications. Field measurements have also been used to validate country or world maps; they are often released together to enable reproducibility of the results. This integration of field measurements and map data enhances the accuracy and reliability of the generated maps. Similarly to datasets presented in Section 4.5, maps of forest age (Besnard et al., Reference Besnard, Koirala, Santoro, Weber, Nelson, Gütter, Herault, Kassi, N’Guessan, Neigh, Poulter, Zhang and Carvalhais2021), carbon stocks (Tucker et al., Reference Tucker, Brandt, Hiernaux, Kariryaa, Rasmussen, Small, Igel, Reiner, Melocik, Meyer, Sinno, Romero, Glennie, Fitts, Morin, Pinzon, McClain, Morin, Porter, Loeffler, Kergoat, Issoufou, Savadogo, Wigneron, Poulter, Ciais, Kaufmann, Myneni, Saatchi and Fensholt2023), and LULC (Koskinen et al., Reference Koskinen, Leinonen, Vollrath, Ortmann, Lindquist, d’Annunzio, Pekkarinen and Käyhkö2019; Bendini et al., Reference Bendini, Fonseca, Schwieder, Rufin, Korting, Koumrouyan and Hostert2020; Shevtsova et al., Reference Shevtsova, Heim, Kruse, Schröder, Troeva, Pestryakova, Zakharov and Herzschuh2020; European Commission. Statistical Office of the European Union, 2021) have been estimated by machine learning algorithms while being calibrated and validated with inventories.

Table 8. Review of open-access mixed forest datasets, including inventories and aerial-based (IA); inventories, aerial-based, and satellite-based (IAS); and inventories and maps (IM)

Note: The dataset size measured in K is $ O\left({10}^3\right) $ , in M is $ O\left({10}^6\right) $ , and in B is $ O\left({10}^9\right) $ . AGB = aboveground biomass; CBH = crown base height; Classif. = classification; DBH = diameter at breast height; EVI = enhanced vegetation index; IA = inventories and aerial; IAS = inventories, aerial and satellite; IM = inventories and maps; IS = instance segmentation; LULC = land use and/or land cover; MC = multi-classification; N/A = non-applicable; OL = object localization; PC = point cloud; Reg. = regression; RGB = red–green–blue; Seg. = semantic segmentation; Unknown = non-provided by the authors.

a The dataset includes three LiDAR with different resolutions, which are ALS: 72.5 pts-m2; ULS: 1,029.2 pts-m2; and TLS: Unknown.

b The list of countries is detailed in the OpenForest catalog.

c Aerial recordings (3-cm resolution) are aerial RGB, SfM PC, RGB PC, RGN images, DEM, CHM, DSM, and DTM.

d The list of recording years is detailed in the OpenForest catalog.

e Field measurements (50-cm resolution) are location, crown area, wood mass, mass, root dry mass, count density, coverage density, and area density.

Mixed datasets composed of ground-based and aerial-based recordings (GA); aerial-based and satellite-based recordings (AS); aerial-based recordings and maps (AM); and satellite-based recordings and maps (SM) are reviewed in Table 9. Aligning ground-based and aerial-based imagery recordings is valuable in integrating information from both above and below the canopies of a forest. For instance, models can be trained using ground recordings sourced from citizen science-based photographs, and then effectively transferred to aerial data (Soltani et al., Reference Soltani, Feilhauer, Duker and Kattenborn2022). This alignment could enable a comprehensive understanding of the forest ecosystem by bridging the gap between ground-level and aerial-level observations.

Table 9. Review of open-access mixed forest datasets, including ground-based and aerial-based (GA); aerial-based and satellite-based (AS); aerial-based and maps (AM); and satellite-based and maps (SM)

Note: The dataset size measured in K is $ O\left({10}^3\right) $ and in M is $ O\left({10}^6\right) $ . AGB = aboveground biomass; Align. = alignment; AM = aerial and maps; AS = aerial and satellite; CHM = canopy height model; Classif. = classification; GA = ground and aerial; LULC = land use and/or land cover; MC = multi-classification; N/A = non-applicable; NDVI = normalized difference vegetation index; OD = object detection; PC = point cloud; Reg. = regression; RGB = red–green–blue; SAR = synthetic-aperture RADAR; Seg. = semantic segmentation; SM = satellite and maps; Unknown = non-provided by the authors.

Mapping aerial-based and satellite-based recordings is helpful for extrapolating high-resolution information at a small scale to a lower resolution at a larger scale. This process allows for the transfer of detailed information captured through aerial LiDAR, for example, to validate canopy height models derived from satellite imagery (Marconi et al., Reference Marconi, Graves, Gong, Nia, Le Bras, Dorr, Fontana, Gearhart, Greenberg, Harris, Kumar, Nishant, Prarabdh, Rege, Bohlman, White and Wang2019; Weinstein et al., Reference Weinstein, Marconi, Bohlman, Zare, Singh, Graves and White2021b; Lang et al., Reference Lang, Jetz, Schindler and Wegner2022a). The integration of these datasets facilitates a more comprehensive and accurate representation of forest characteristics across different spatial scales. Integrating SAR and multispectral satellite data with aerial imagery can potentially enhance model performances (Schmitt and Zhu, Reference Schmitt and Zhu2016), particularly by leveraging the varying reflection and absorption characteristics of different tree species (Ahlswede et al., Reference Ahlswede, Schulz, Gava, Helber, Bischke, Förster, Arias, Hees, Demir and Kleinschmit2022). Aerial LiDAR metrics have also been used as validation points to estimate the aboveground biomass at large scale (Hudak et al., Reference Hudak, Fekety, Kane, Kennedy, Filippelli, Falkowski, Tinkham, Smith, Crookston, Domke, Corrao, Bright, Churchill, Gould, McGaughey, Kane and Dong2020). At a larger scale, satellite data analyzed with multiclassification algorithms have also been useful to monitor and detect forest loss (Turubanova et al., Reference Turubanova, Potapov, Tyukavina and Hansen2018).

Open-access datasets featuring modalities at different scales have been made available to enable result reproducibility and promote heterogeneity in the way of observing forests. These datasets incorporate various modalities aligned at different scales, which could aim to enhance the generalization capabilities of machine learning algorithms at a larger scale. This not only facilitates research in solving tasks at different scales depending on the modality but also fosters a comprehensive understanding of forests through multimodal analysis. To date, the publications related to the reviewed datasets have not extensively explored multimodal (e.g., point clouds with raster data and point observations with spatially continuous data), multiscale, and multitask approaches. However, it is our hope that the machine learning and computer vision communities will venture into forest monitoring along this path, as it holds great potential for advancing our understanding of the composition of forests worldwide. By embracing these comprehensive approaches, we can enhance our comprehension of forests and contribute to more effective and efficient forest management strategies. The upcoming section will explore perspectives on forest datasets, shedding light on the potential challenges that researchers could prioritize and address in their work.

5. Perspectives

The enthusiasm for forest monitoring is on the rise, serving as a safeguard to protect forests and their ecological and societal significance. Proper monitoring is essential for avoided forest conversion, supporting forest management initiatives, and ensuring successful reforestation and afforestation projects by enhancing survival rates and preventing diseases (van Lierop et al., Reference van Lierop, Lindquist, Sathyapala and Franceschini2015; Martin et al., Reference Martin, Woodbury, Doroski, Nagele, Storace, Cook-Patton, Pasternack and Ashton2021). Additionally, the effects of climate change on forest dynamics (Fassnacht et al., Reference Fassnacht, White and Wulder2023) imply a growing need for heightened surveillance of these ecosystems.

As a data-driven and empirical science, respectively, forest monitoring benefits from open-access, diverse, and large datasets, coupled with advancements in machine learning research (De Lima et al., Reference De Lima, Phillips, Duque, Tello, Davies, De Oliveira, Muller, Honorio Coronado, Vilanova, Cuni-Sanchez, Baker, Ryan, Malizia, Lewis, Ter Steege, Ferreira, Marimon, Luu, Imani, Arroyo, Blundo, Kenfack, Sainge, Sonké and Vásquez2022). This endeavor seeks to address existing challenges and research strategies while extensively reviewing open-access forest datasets, with the ultimate goal of encouraging the research community to further investigate this field.

As evident from Section 2.2, forest monitoring remains an active area of research. Numerous ongoing inquiries delve into various aspects, such as tree species identification, phenology, abiotic factors, exogenous influences, and many more. Machine learning already greatly advanced our capabilities to monitor forests through novel analytical tools and capacities. This involves sensing past, current, and dynamic forest states through predictive modeling. Such models and information, build to be explainable by design, will greatly advance our understanding of forests, including insights into how diverse environmental and anthropogenic drivers impact forest dynamics, as well as the operational mechanisms of forest ecosystems. A related and cardinal interest lies in the projection of future forest dynamics to guide decision-makers, to improve management and anticipate consequences (Requena-Mesa et al., Reference Requena-Mesa, Reichstein, Mahecha, Kraft and Denzler2018). In this context, it is important to consider that ongoing and accelerated changes induced by global warming and climate change reshape the dynamics of the Earth system and its respective data (making data through time nonstationary). Therefore, it becomes essential to explore solutions that streamline the adaptability and transferability of data-driven machine learning methods, ensuring their efficiency in extrapolating from existing and historical data to future circumstances.

Machine learning and computer vision are exerting a progressively increasing influence across various domains, including forest monitoring, as elaborated in Section 3. Strategies related to model generalization, learning schemes, and forestry-based metrics are valuable for delving further into the challenges presented by forest biology.

Enhancing the generalization capabilities of models involves better adaptation to diverse spatial and temporal domains, encompassing different forests, sensors, and resolutions. To achieve this, machine learning strategies will be explored, focusing on leveraging existing datasets through weakly supervised (see Section 3.2.2) or few-shot (see Sections 3.2.4 and 3.2.5) learning approaches. Moreover, hybrid models, which integrate physical knowledge (see Section 3.3), or space-for-time substitutions, which enable to learn temporal dynamics from spatial dynamics, may greatly advance our capabilities to design robust data-driven machine learning applications for monitoring and forecasting in a nonstationary world. In line with this perspective, the OpenForest dynamic catalog could serve as a suitable reference to consistently enhance and refine models using the latest data.

Implementing active learning methods (see Section 3.2.3) can significantly optimize the process of generating annotations for future datasets. As datasets continue to grow in size, self-supervised learning methods (see Section 3.2.1) offer a valuable perspective to learn meaningful representations in deep learning algorithms for forest monitoring without relying heavily on manual annotations.

Incorporating multimodal and multitask computer vision architectures into forest monitoring presents an intriguing opportunity to capitalize on task complementarity. For instance, by predicting multiple foliage traits from hyperspectral data, a model can learn the covariance among different traits and, hence, provide more robust estimates for challenging traits based on their relation with more accurately predicted ones (Schiller et al., Reference Schiller, Schmidtlein, Boonman, Moreno-Martínez and Kattenborn2021; Cherif et al., Reference Cherif, Feilhauer, Berger, Dao, Ewald, Hank, He, Kovach, Lu, Townsend and Kattenborn2023). An additional area of potential research could involve enhancing carbon stock estimation by simultaneously predicting both tree species and height.

Foundation models (see Section 3.1.2) have demonstrated remarkable capabilities in managing various modalities, such as LiDAR, RADAR, and hyperspectral data, with varying spatial and temporal resolutions. These models remain task-agnostic and can achieve high zero-shot performances, making their pretraining a challenging yet promising endeavor for forest monitoring. Once pretrained, they can be adapted to multiple other tasks in this domain. While relying on the complementarity of large-scale multimodal and multitask datasets, research on foundation models for forest monitoring worldwide would benefit from the OpenForest catalog dynamically enriched by the community.

Section 4 provides a comprehensive review of open-source forest datasets, categorized according to specific criteria and identified scales. These datasets are grouped in OpenForest, a dynamic catalog open for updates from the community. The aim is to foster communication, inspire new applications of machine learning in forest monitoring, and motivate advancements in this field.

Datasets, as prerequisite of machine learning applications, commonly exhibit a lack of geographical representativeness, particularly noticeable in African and Asian regions, as depicted in Figure 3. Whenever feasible, ideal datasets would preferably align multimodal recordings based on their temporal and spatial resolutions while offering annotations in the highest available resolution. As demonstrated in remote sensing (Lacoste et al., Reference Lacoste, Lehmann, Rodriguez, Sherwin, Kerner, Lütjens, Irvin, Dao, Alemohammad, Drouin, Gunturkun, Huang, Vazquez, Newman, Bengio, Ermon and Zhu2023), aggregating numerous diverse forest datasets could foster research in developing specialized foundation models for effective forest monitoring.

The OpenForest catalog, in addition to providing a list of open-access datasets, will also curate information about data providers (see related information at https://github.com/RolnickLab/OpenForest), a crucial resource for generating well-structured datasets that cater to specific needs. Citizen-generated data, such as curated on OpenAerialMap (https://openaerialmap.org/) or GBIF (https://www.gbif.org/), hold significant values since they integrate information from across the globe, making them ideal for self-supervised learning. The data provider list will also be frequently updated to integrate most recent initiatives. For instance, incorporating data from the Biomass mission (https://www.esa.int/Applications/Observing_the_Earth/FutureEO/Biomass) as soon as possible into future datasets is essential. The mission’s provision of P-band SAR data will greatly benefit worldwide forest tomography (Berenger et al., Reference Berenger, Denis, Tupin, Ferro-Famil and Huang2023), advancing our comprehension of forest carbon stock and its dynamics. Exploring its potential can lead to the creation of valuable and structured datasets.

Recordings from aerial data, especially UAVs, gain momentum by offering promising prospects, with more affordable and easier-to-pilot vehicles equipped with higher-resolution sensors. Leveraging UAV technology allows for high-resolution forest analysis, even in remote or inaccessible areas, and can moreover advance large-scale assessments by its integration with Earth observation satellite missions (Schiefer et al., Reference Schiefer, Schmidtlein, Frick, Frey, Klinke, Zielewska-Büttner, Junttila, Uhl and Kattenborn2023).

As more and more datasets are released at various scales, the OpenForest catalog offers the opportunity to centralize this information with details. It will help to motivate research in bridging the gap between scales, sensors, and resolutions while hopefully motivating collaborations between researchers.

Open peer review

To view the open peer review materials for this article, please visit http://doi.org/10.1017/eds.2024.53.

Acknowledgements

The authors are grateful for the valuable feedback of O. Sonnentag.

Author contribution

Conceptualization: A.O., D.R.; Data curation: A.O.; Data visualization: A.O.; Methodology: all authors; Writing—original draft: A.O., T.K., E.L.; Writing—review and editing: all authors. All authors approved the final submitted draft.

Competing interest

The authors declare no competing interests.

Data availability statement

The OpenForest catalog is available and open to contributions in the following repository: https://github.com/RolnickLab/OpenForest. It is also archived in Zenodo at https://doi.org/10.5281/zenodo.14025443.

Funding statement

This work was funded through the IVADO program on “AI, Biodiversity and Climate Change” and the Canada CIFAR AI Chairs program. It was also funded through the German Research Foundation (DFG) under the project PANOPS (Project No. 504978936) and BigPlantSens (Project No. 444524904).

Ethical standard

The research meets all ethical guidelines, including adherence to the legal requirements of the study country.

Footnotes

This research article was awarded Open Data and Open Materials badges for transparent practices. See the Data Availability Statement for details.

References

Ackerly, D (2009) Conservatism and diversification of plant functional traits: Evolutionary rates versus phylogenetic signal. Proceedings of the National Academy of Sciences 106(supplement_2), 1969919706.CrossRefGoogle ScholarPubMed
Ahlswede, S, Schulz, C, Gava, C, Helber, P, Bischke, B, Förster, M, Arias, F, Hees, J, Demir, B and Kleinschmit, B (2022) TreeSatAI benchmark archive: A multi-sensor, multi-label dataset for tree species classification in remote sensing. Earth System Science Data.Google Scholar
Ahn, J, Cho, S and Kwak, S (2019) Weakly supervised learning of instance segmentation with inter-pixel relations. In Computer Vision and Pattern Recognition Conference.CrossRefGoogle Scholar
Alajaji, D., Alhichri, HS, Ammour, N and Alajlan, N (2020) Few-shot learning for remote sensing scene classification. In Mediterranean and Middle-East Geoscience and Remote Sensing Symposium.CrossRefGoogle Scholar
Allen, CD, Macalady, AK, Chenchouni, H, Bachelet, D, McDowell, N, Vennetier, M, Kitzberger, T, Rigling, A, Breshears, DD, Hogg, ET, Gonzalez, P, Fensham, R, Zhang, Z, Castro, J, Demidova, N, Lim, J-H, Allard, G, Running, SW, Semerci, A and Cobb, N (2010) A global overview of drought and heat-induced tree mortality reveals emerging climate change risks for forests. Forest Ecology and Management.CrossRefGoogle Scholar
Alosaimi, N, Alhichri, H, Bazi, Y, Ben Youssef, B and Alajlan, N (2023) Self-supervised learning for remote sensing scene classification under the few shot scenario. Scientific Reports 13(1), 433.CrossRefGoogle ScholarPubMed
Amirkolaee, HA, Shi, M and Mulligan, M (2023) TreeFormer: A semi-supervised transformer-based framework for tree counting from a single high resolution image. IEEE Transactions on Geoscience and Remote Sensing.CrossRefGoogle Scholar
Arino, O, Ramos Perez, JJ, Kalogirou, V, Bontemps, S, Defourny, P and Van Bogaert, E (2010) Global land cover map for 2009 (GlobCover 2009). In ESA-iLEAPS-EGU Earth Observation for Land–Atmosphere Interaction Science.Google Scholar
Arnaudo, E, Tavera, A, Masone, C, Dominici, F and Caputo, B (2023) Hierarchical instance mixing across domains in aerial segmentation. IEEE Access 11, 1332413333.CrossRefGoogle Scholar
Asner, GP, Martin, RE, Carranza-Jiménez, L, Sinca, F, Tupayachi, R, Anderson, CB and Martinez, P (2014) Functional and biological diversity of foliar spectra in tree canopies throughout the Andes to Amazon region. New Phytologist 204(1), 127139.CrossRefGoogle ScholarPubMed
Aygunes, B, Cinbis, RG and Aksoy, S (2021) Weakly supervised instance attention for multisource fine-grained object recognition with an application to tree species classification. ISPRS Journal of Photogrammetry and Remote Sensing 176, 262274.CrossRefGoogle Scholar
Ayush, K, Uzkent, B, Meng, C, Tanmay, K, Burke, M, Lobell, D and Ermon, S (2021) Geography-aware self-supervised learning. In International Conference on Computer Vision.CrossRefGoogle Scholar
Baldeck, CA, Asner, GP, Martin, RE, Anderson, CB, Knapp, DE, Kellner, JR and Wright, SJ (2015) Operational tree species mapping in a diverse tropical forest with airborne imaging spectroscopy. PLoS One 10(7), e0118403.CrossRefGoogle Scholar
Baldocchi, D, Falge, E, Gu, L, Olson, R, Hollinger, D, Running, S, Anthoni, P, Bernhofer, C, Davis, K, Evans, R, Fuentes, J, Goldstein, A, Katul, G, Law, B, Lee, X, Malhi, Y, Meyers, T, Munger, W, Oechel, W, Paw U, KT, Pilegaard, K, Schmid, HP, Valentini, R, Verma, S, Vesala, T, Wilson, K and Wofsy, S (2001) FLUXNET: A new tool to study the temporal and spatial variability of ecosystem-scale carbon dioxide, water vapor, and energy flux densities. Bulletin of the American Meteorological Society.2.3.CO;2>CrossRefGoogle Scholar
Bartholomé, E and Belward, AS (2005) GLC2000: A new approach to global land cover mapping from earth observation data. International Journal of Remote Sensing 26(9), 19591977.CrossRefGoogle Scholar
Bartsch, A, Höfler, A, Kroisleitner, C and Trofaier, A (2016) Land cover mapping in northern high latitude permafrost regions with satellite data: Achievements and remaining challenges. Remote Sensing.CrossRefGoogle Scholar
Bartsch, A, Widhalm, B, Leibman, M, Ermokhina, K, Kumpula, T, Skarin, A, Wilcox, EJ, Jones, BM, Frost, GV, Höfler, A and Pointner, G (2020) Feasibility of tundra vegetation height retrieval from Sentinel-1 and Sentinel-2 data. Remote Sensing of Environment.CrossRefGoogle Scholar
Bastani, F, Wolters, P, Gupta, R, Ferdinando, J and Kembhavi, A (2023). Satlas: A large-scale, multi-task dataset for remote sensing image understanding. International Conference on Computer Vision. Preprint, arXiv:2211.15660 [cs].Google Scholar
Bastin, J-F, Finegold, Y, Garcia, C, Mollicone, D, Rezende, M, Routh, D, Zohner, CM and Crowther, TW (2019) The global tree restoration potential. Science.CrossRefGoogle ScholarPubMed
Bendini, HN, Fonseca, LMG, Schwieder, M, Rufin, P, Korting, TS, Koumrouyan, A and Hostert, P (2020) Combining environmental and Landsat analysis ready data for vegetation mapping: A case study in the Brazilian savanna biome. The International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences.CrossRefGoogle Scholar
Berenger, Z, Denis, L, Tupin, F, Ferro-Famil, L and Huang, Y (2023) A deep learning approach for SAR tomographic imaging of forested areas. Preprint, arXiv.CrossRefGoogle Scholar
Besnard, S, Koirala, S, Santoro, M, Weber, U, Nelson, J, Gütter, J, Herault, B, Kassi, J, N’Guessan, A, Neigh, C, Poulter, B, Zhang, T and Carvalhais, N (2021) Mapping global forest age from forest inventories, biomass and climate data. Earth System Science Data.CrossRefGoogle Scholar
Birhane, A, Kalluri, P, Card, D, Agnew, W, Dotan, R and Bao, M (2022) The values encoded in machine learning research. In Proceedings of the 2022 ACM Conference on Fairness, Accountability, and Transparency, pp. 173184.CrossRefGoogle Scholar
Bolyn, C, Lejeune, P, Michez, A and Latte, N (2022) Mapping tree species proportions from satellite imagery using spectral-spatial deep learning. Remote Sensing of Environment.CrossRefGoogle Scholar
Bommasani, R, Hudson, DA, Adeli, E, Altman, R, Arora, S, von Arx, S, Bernstein, MS, Bohg, J, Bosselut, A, Brunskill, E, Brynjolfsson, E, Buch, S, Card, D, Castellon, R, Chatterji, NS, Chen, AS, Creel, KA, Davis, J, Demszky, D, Donahue, C, Doumbouya, M, Durmus, E, Ermon, S, Etchemendy, J, Ethayarajh, K, Fei-Fei, L, Finn, C, Gale, T, Gillespie, LE, Goel, K, Goodman, ND, Grossman, S, Guha, N, Hashimoto, T, Henderson, P, Hewitt, J, Ho, DE, Hong, J, Hsu, K, Huang, J, Icard, TF, Jain, S, Jurafsky, D, Kalluri, P, Karamcheti, S, Keeling, G, Khani, F, Khattab, O, Koh, PW, Krass, MS, Krishna, R, Kuditipudi, R, Kumar, A, Ladhak, F, Lee, M, Lee, T, Leskovec, J, Levent, I, Li, XL, Li, X, Ma, T, Malik, A, Manning, CD, Mirchandani, SP, Mitchell, E, Munyikwa, Z, Nair, S, Narayan, A, Narayanan, D, Newman, B, Nie, A, Niebles, JC, Nilforoshan, H, Nyarko, JF, Ogut, G, Orr, L, Papadimitriou, I, Park, JS, Piech, C, Portelance, E, Potts, C, Raghunathan, A, Reich, R, Ren, H, Rong, F, Roohani, YH, Ruiz, C, Ryan, J, R’e, C, Sadigh, D, Sagawa, S, Santhanam, K, Shih, A, Srinivasan, KP, Tamkin, A, Taori, R, Thomas, AW, Tramèr, F, Wang, RE, Wang, W, Wu, B, Wu, J, Wu, Y, Xie, SM, Yasunaga, M, You, J, Zaharia, MA, Zhang, M, Zhang, T, Zhang, X, Zhang, Y, Zheng, L, Zhou, K and Liang, (2021) On the opportunities and risks of foundation models. Preprint, arXiv.Google Scholar
Bonan, GB (2008) Forests and climate change: Forcings, feedbacks, and the climate benefits of forests. Science 320(5882), 14441449.CrossRefGoogle ScholarPubMed
Bonannella, C, Hengl, T, Heisig, J, Parente, L, Wright, MN, Herold, M and de Bruin, S (2022) Forest tree species distribution for Europe 2000–2020: Mapping potential and realized distributions using spatiotemporal machine learning. PeerJ.CrossRefGoogle ScholarPubMed
Brandt, M, Tucker, CJ, Kariryaa, A, Rasmussen, K, Abel, C, Small, J, Chave, J, Rasmussen, LV, Hiernaux, P, Diouf, AA, Kergoat, L, Mertz, O, Igel, C, Gieseke, F, Schöning, J, Li, S, Melocik, K, Meyer, J, Sinno, S, Romero, E, Glennie, E, Montagu, A, Dendoncker, M and Fensholt, R (2020) An unexpectedly large count of trees in the West African Sahara and Sahel. Nature.CrossRefGoogle ScholarPubMed
Brede, B, Terryn, L, Barbier, N, Bartholomeus, HM, Bartolo, R, Calders, K, Derroire, G, Moorthy, SMK, Lau, A, Levick, SR, Raumonen, P, Verbeeck, H, Wang, D, Whiteside, T, van der Zee, J and Herold, M (2022) Non-destructive estimation of individual tree biomass: Allometric models, terrestrial and UAV laser scanning. Remote Sensing of Environment.CrossRefGoogle Scholar
Brieger, F, Herzschuh, U, Pestryakova, LA, Bookhagen, B, Zakharov, ES and Kruse, S (2019) Advances in the derivation of northeast Siberian forest metrics using high-resolution UAV-based photogrammetric point clouds. Remote Sensing.CrossRefGoogle Scholar
Brown, T, Mann, B, Ryder, N, Subbiah, M, Kaplan, JD, Dhariwal, P, Neelakantan, A, Shyam, P, Sastry, G, Askell, A, Agarwal, S, Herbert-Voss, A, Krueger, G, Henighan, T, Child, R, Ramesh, A, Ziegler, D, Wu, J, Winter, C, Hesse, C, Chen, M, Sigler, E, Litwin, M, Gray, S, Chess, B, Clark, J, Berner, C, McCandlish, S, Radford, A, Sutskever, I and Amodei, D (2020) Language models are few-shot learners. In Conference on Neural Information Processing Systems.Google Scholar
Brown, TB, Hultine, KR, Steltzer, H, Denny, EG, Denslow, MW, Granados, J, Henderson, S, Moore, D, Nagai, S, SanClements, M, Sánchez-Azofeifa, A, Sonnentag, O, Tazik, D and Richardson, AD (2016) Using phenocams to monitor our changing earth: Toward a global phenocam network. Frontiers in Ecology and the Environment 14(2), 8493.CrossRefGoogle Scholar
Bucher, M, Vu, T-H, Cord, M and Pérez, P (2019) Zero-shot semantic segmentation. Conference on Neural Information Processing Systems.Google Scholar
Buchhorn, M, Lesiv, M, Tsendbazar, N-E, Herold, M, Bertels, L and Smets, B (2020) Copernicus global land cover layers-collection 2. Remote Sensing.CrossRefGoogle Scholar
Bunting, P, Rosenqvist, A, Hilarides, L, Lucas, RM, Thomas, N, Tadono, T, Worthington, TA, Spalding, M, Murray, NJ and Rebelo, L-M (2022) Global mangrove extent change 1996–2020: Global mangrove watch version 3.0. Remote Sensing.CrossRefGoogle Scholar
Burt, A, Disney, M and Calders, K (2018) Extracting individual trees from lidar point clouds using treeseg. Methods in Ecology and Evolution.Google Scholar
Busch, J, Engelmann, J, Cook-Patton, SC, Griscom, BW, Kroeger, T, Possingham, H and Shyamsundar, P (2019) Potential for low-cost carbon dioxide removal through tropical reforestation. Nature Climate Change.CrossRefGoogle Scholar
Calders, K, Brede, B, Newnham, G, Culvenor, D, Armston, J, Bartholomeus, H, Griebel, A, Hayward, J, Junttila, S, Lau, A, Levick, S, Morrone, R, Origo, N, Pfeifer, M, Verbesselt, J and Herold, M (2023) StrucNet: A global network for automated vegetation structure monitoring. Remote Sensing in Ecology and Conservation 9(5), 587598.CrossRefGoogle Scholar
Camps-Valls, G, Tuia, D, Zhu, XX and Reichstein, M (2021) Deep Learning for the Earth Sciences: A Comprehensive Approach to Remote Sensing, Climate Science, and Geosciences. Wiley.CrossRefGoogle Scholar
Caron, M, Touvron, H, Misra, I, Jégou, H, Mairal, J, Bojanowski, P and Joulin, A (2021) Emerging properties in self-supervised vision transformers. In International Conference on Computer Vision.CrossRefGoogle Scholar
Cavender-Bares, J, Gamon, JA and Townsend, PA (2020) Remote Sensing of Plant Biodiversity. Springer Nature.CrossRefGoogle Scholar
Cazzolla Gatti, R, Reich, PB, Gamarra, JGP, Crowther, T, Hui, C, Morera, A, Bastin, JF, de Miguel, S, Nabuurs, G-J, Svenning, J-C, Serra-Diaz, JM, Merow, C, Enquist, B, Kamenetsky, M, Lee, J, Zhu, J, Fang, J, Jacobs, DF, Pijanowski, B, Banerjee, A, Giaquinto, RA, Alberti, G, Almeyda Zambrano, AM, Alvarez-Davila, E, Araujo-Murakami, A, Avitabile, V, Aymard, GA, Balazy, R, Baraloto, C, Barroso, JG, Bastian, ML, Birnbaum, P, Bitariho, R, Bogaert, J, Bongers, F, Bouriaud, O, Brancalion, PHS, Brearley, FQ, Broadbent, EN, Bussotti, F, Castro Da Silva, W, César, RG, Češljar, G, Chama Moscoso, V, Chen, HYH, Cienciala, E, Clark, CJ, Coomes, DA, Dayanandan, S, Decuyper, M, Dee, LE, Del Aguila Pasquel, J, Derroire, G, Djuikouo, MNK, Van Do, T, Dolezal, J, Đorđević, I, Engel, J, Fayle, TM, Feldpausch, TR, Fridman, JK, Harris, DJ, Hemp, A, Hengeveld, G, Herault, B, Herold, M, Ibanez, T, Jagodzinski, AM, Jaroszewicz, B, Jeffery, KJ, Johannsen, VK, Jucker, T, Kangur, A, Karminov, VN, Kartawinata, K, Kennard, DK, Kepfer-Rojas, S, Keppel, G, Khan, ML, Khare, PK, Kileen, TJ, Kim, HS, Korjus, H, Kumar, A, Kumar, A, Laarmann, D, Labrière, N, Lang, M, Lewis, SL, Lukina, N, Maitner, BS, Malhi, Y, Marshall, AR, Martynenko, OV, Monteagudo Mendoza, AL, Ontikov, PV, Ortiz-Malavasi, E, Pallqui Camacho, NC, Paquette, A, Park, M, Parthasarathy, N, Peri, PL, Petronelli, P, Pfautsch, S, Phillips, OL, Picard, N, Piotto, D, Poorter, L, Poulsen, JR, Pretzsch, H, Ramírez-Angulo, H, Restrepo Correa, Z, Rodeghiero, M, Rojas Gonzáles, RDP, Rolim, SG, Rovero, F, Rutishauser, E, Saikia, P, Salas-Eljatib, C, Schepaschenko, D, Scherer-Lorenzen, M, Šebeň, V, Silveira, M, Slik, F, Sonké, B, Souza, AF, Stereńczak, KJ, Svoboda, M, Taedoumg, H, Tchebakova, N, Terborgh, J, Tikhonova, E, Torres-Lezama, A, Van Der Plas, F, Vásquez, R, Viana, H, Vibrans, AC, Vilanova, E, Vos, VA, Wang, H-F, Westerlund, B, White, LJT, Wiser, SK, Zawiła-Niedźwiecki, T, Zemagho, L, Zhu, Z-X, Zo-Bi, IC and Liang, J (2022) The number of tree species on earth. In Proceedings of the National Academy of Sciences.Google Scholar
Chadwick, KD and Asner, GP (2018) Landscape evolution and nutrient rejuvenation reflected in Amazon forest canopy chemistry. Ecology Letters 21(7), 978988.CrossRefGoogle ScholarPubMed
Chaves, P, Zuquim, G, Ruokolainen, K, Van doninck, J, Kalliola, R, Gómez Rivero, E and Tuomisto, H (2020) Mapping floristic patterns of trees in Peruvian Amazonia using remote sensing and machine learning. Remote Sensing.CrossRefGoogle Scholar
Chen, L, Tian, X, Chai, G, Zhang, X and Chen, E (2021) A new CBAM-P-net model for few-shot forest species classification using airborne hyperspectral images. Remote Sensing 13(7), 1269.CrossRefGoogle Scholar
Chen, L-C, Zhu, Y, Papandreou, G, Schroff, F and Adam, H (2018) Encoder–decoder with Atrous separable convolution for semantic image segmentation. In European Conference on Computer Vision.CrossRefGoogle Scholar
Chen, T, Kornblith, S, Norouzi, M and Hinton, G (2020) A simple framework for contrastive learning of visual representations. In International Conference on Machine Learning.Google Scholar
Chen, Y, Feng, X, Fu, B, Ma, H, Zohner, CM, Crowther, TW, Huang, Y, Wu, X and Wei, F (2023) Maps with 1 km resolution reveal increases in above- and belowground forest biomass carbon pools in China over the past 20 years. Earth System Science Data.CrossRefGoogle Scholar
Cheng, B, Misra, I, Schwing, AG, Kirillov, A and Girdhar, R (2022) Masked-attention mask transformer for universal image segmentation. In Computer Vision and Pattern Recognition Conference.CrossRefGoogle Scholar
Cheng, B, Schwing, AG and Kirillov, A (2021) Per-pixel classification is not all you need for semantic segmentation. In Conference on Neural Information Processing Systems.Google Scholar
Cheng, G and Han, J (2016) A survey on object detection in optical remote sensing images. ISPRS Journal of Photogrammetry and Remote Sensing 117, 1128.Google Scholar
Cheng, G, Xie, X, Han, J, Guo, L and Xia, G-S (2020) Remote sensing image scene classification meets deep learning: Challenges, methods, benchmarks, and opportunities. IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing 13, 37353756.CrossRefGoogle Scholar
Cherif, E, Feilhauer, H, Berger, K, Dao, PD, Ewald, M, Hank, TB, He, Y, Kovach, KR, Lu, B, Townsend, PA and Kattenborn, T (2023) From spectra to plant functional traits: Transferable multi-trait models from heterogeneous and sparse data. Remote Sensing of Environment.CrossRefGoogle Scholar
Choat, B, Jansen, S, Brodribb, TJ, Cochard, H, Delzon, S, Bhaskar, R, Bucci, SJ, Feild, TS, Gleason, SM, Hacke, UG, Jacobsen, AL, Lens, F, Maherali, H, Martínez-Vilalta, J, Mayr, S, Mencuccini, M, Mitchell, PJ, Nardini, A, Pittermann, J, Pratt, RB, Sperry, JS, Westoby, M, Wright, IJ and Zanne, AE (2012) Global convergence in the vulnerability of forests to drought. Nature 491(7426), 752755.CrossRefGoogle ScholarPubMed
Chowdhery, A, Narang, S, Devlin, J, Bosma, M, Mishra, G, Roberts, A, Barham, P, Chung, HW, Sutton, C, Gehrmann, S, Schuh, P, Shi, K, Tsvyashchenko, S, Maynez, J, Rao, A, Barnes, P, Tay, Y, Shazeer, N, Prabhakaran, V, Reif, E, Du, N, Hutchinson, B, Pope, R, Bradbury, J, Austin, J, Isard, M, Gur-Ari, G, Yin, P, Duke, T, Levskaya, A, Ghemawat, S, Dev, S, Michalewski, H, Garcia, X, Misra, V, Robinson, K, Fedus, L, Zhou, D, Ippolito, D, Luan, D, Lim, H, Zoph, B, Spiridonov, A, Sepassi, R, Dohan, D, Agrawal, S, Omernick, M, Dai, AM, Pillai, TS, Pellat, M, Lewkowycz, A, Moreira, E, Child, R, Polozov, O, Lee, K, Zhou, Z, Wang, X, Saeta, B, Diaz, M, Firat, O, Catasta, M, Wei, J, Meier-Hellstern, K, Eck, D, Dean, J, Petrov, S and Fiedel, N (2022). PaLM: Scaling language modeling with pathways. Preprint, arXiv.Google Scholar
Cloutier, M, Germain, M and Laliberté, E (2023) Influence of temperate forest autumn leaf phenology on segmentation of tree species from UAV imagery using deep learning. Ecology.Google Scholar
Cohn, DA, Ghahramani, Z and Jordan, MI (1996) Active learning with statistical models. Journal of Artificial Intelligence Research 4, 129145.CrossRefGoogle Scholar
Cong, Y, Khanna, S, Meng, C, Liu, P, Rozi, E, He, Y, Burke, M, Lobell, DB and Ermon, S (2022) SatMAE: Pre-training transformers for temporal and multi-spectral satellite imagery. In Conference on Neural Information Processing Systems.Google Scholar
Corbière, C (2022) Robust Deep Learning for Autonomous Driving. PhD dissertation, Conservatoire National des Arts et Métiers.Google Scholar
Corbière, C, Thome, N, Bar-Hen, A, Cord, M and Pérez, P (2019) Addressing failure prediction by learning model confidence. In Conference on Neural Information Processing Systems.Google Scholar
Csurka, G (2017) Domain Adaptation for Visual Applications: A Comprehensive Survey. Springer Series: Advances in Computer Vision and Pattern Recognition.CrossRefGoogle Scholar
Curtis, PG, Slay, CM, Harris, NL, Tyukavina, A and Hansen, MC (2018) Classifying drivers of global forest loss. Science.Google ScholarPubMed
da Silva, DQ, dos Santos, FN, Sousa, AJ and Filipe, V (2021a) Visible and thermal image-based trunk detection with deep learning for forestry Mobile robotics. Journal of Imaging.CrossRefGoogle ScholarPubMed
da Silva, DQ, dos Santos, FN, Sousa, AJ, Filipe, V and Boaventura-Cunha, J (2021b) Unimodal and multimodal perception for forest management: Review and dataset. Computation 9, 127.CrossRefGoogle Scholar
da Silva, DQ, Santos, d, Filipe, V, Sousa, AJ and Oliveira, PM (2022) Edge AI-based tree trunk detection for forestry monitoring robotics. Robotics.CrossRefGoogle Scholar
Dalsasso, E, Denis, L and Tupin, F (2021) SAR2SAR: A semi-supervised despeckling algorithm for SAR images. IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing 14, 43214329.CrossRefGoogle Scholar
Dalsasso, E, Denis, L and Tupin, F (2022) As if by magic: Self-supervised training of deep despeckling networks with MERLIN. IEEE Transactions on Geoscience and Remote Sensing 60, 113.CrossRefGoogle Scholar
Dancette, C and Cord, M (2022). Dynamic query selection for fast visual perceiver. In Computer Vision and Pattern Recognition Conference Workshop.Google Scholar
De Almeida Pereira, GH, Fusioka, AM, Nassu, BT and Minetto, R (2021) Active fire detection in Landsat-8 imagery: A large-scale dataset and a deep-learning study. ISPRS Journal of Photogrammetry and Remote Sensing.CrossRefGoogle Scholar
De Lima, RAF, Phillips, OL, Duque, A, Tello, JS, Davies, SJ, De Oliveira, AA, Muller, S, Honorio Coronado, EN, Vilanova, E, Cuni-Sanchez, A, Baker, TR, Ryan, CM, Malizia, A, Lewis, SL, Ter Steege, H, Ferreira, J, Marimon, BS, Luu, HT, Imani, G, Arroyo, L, Blundo, C, Kenfack, D, Sainge, MN, Sonké, B and Vásquez, R (2022) Making forest data fair and open. Nature Ecology & Evolution 6(6), 656658.CrossRefGoogle Scholar
Deng, J, Dong, W, Socher, R, Li, L-J, Li, K and Fei-Fei, L (2009) Imagenet: A large-scale hierarchical image database. In Conference on Computer Vision and Pattern Recognition.Google Scholar
Deng, J, Li, X and Fang, Y (2022) Few-shot object detection on remote sensing images. IEEE Transactions on Geoscience and Remote Sensing 60, 114.CrossRefGoogle Scholar
Devlin, J, Chang, M-W, Lee, K and Toutanova, K (2019) BERT: Pre-training of deep bidirectional transformers for language understanding. Preprint, arXiv.Google Scholar
Diez, Y, Kentsch, S, Fukuda, M, Caceres, MLL, Moritake, K and Cabezas, M (2021) Deep learning in forestry using UAV-acquired RGB data: A practical review. Remote Sensing 13(14), 2837.CrossRefGoogle Scholar
Dionizio, EA, Pimenta, FM, Lima, LB and Costa, MH (2020) Carbon stocks and dynamics of different land uses on the Cerrado agricultural frontier. PLoS One.CrossRefGoogle ScholarPubMed
Donti, PL, Rolnick, D and Kolter, JZ (2021) DC3: A learning method for optimization with hard constraints. In International Conference on Learning Representations.Google Scholar
Dormann, CF, McPherson, JM, Araujo, MB, Bivand, R, Bolliger, J, Carl, G, Davies, RG, Hirzel, A, Jetz, W, Daniel Kissling, W, Kuhn, I, Ohlemuller, R, Peres-Neto, PR, Reineking, B, Schroder, B, Schurr, FM and Wilson, R (2007) Methods to account for spatial autocorrelation in the analysis of species distributional data: A review. Ecography.Google Scholar
Dosovitskiy, A, Beyer, L, Kolesnikov, A, Weissenborn, D, Zhai, X, Unterthiner, T, Dehghani, M, Minderer, M, Heigold, G, Gelly, S, Uszkoreit, J and Houlsby, N (2021) An image is worth 16 × 16 words: Transformers for image recognition at scale. In International Conference on Learning Representations.Google Scholar
Drever, CR, Cook-Patton, SC, Akhter, F, Badiou, PH, Chmura, GL, Davidson, SJ, Desjardins, RL, Dyk, A, Fargione, JE, Fellows, M, Filewod, B, Hessing-Lewis, M, Jayasundara, S, Keeton, WS, Kroeger, T, Lark, TJ, Le, E, Leavitt, SM, LeClerc, M-E, Lemprière, TC, Metsaranta, J, McConkey, B, Neilson, E, St-Laurent, GP, Puric-Mladenovic, D, Rodrigue, S, Soolanayakanahally, RY, Spawn, SA, Strack, M, Smyth, C, Thevathasan, N, Voicu, M, Williams, CA, Woodbury, PB, Worth, DE, Xu, Z, Yeo, S and Kurz, WA (2021) Natural climate solutions for Canada. Science Advances.CrossRefGoogle ScholarPubMed
Driess, D, Xia, F, Sajjadi, MSM, Lynch, C, Chowdhery, A, Ichter, B, Wahid, A, Tompson, J, Vuong, Q, Yu, T, Huang, W, Chebotar, Y, Sermanet, P, Duckworth, D, Levine, S, Vanhoucke, V, Hausman, K, Toussaint, M, Greff, K, Zeng, A, Mordatch, I and Florence, P (2023) PaLM-E: An embodied multimodal language model. Preprint, arXiv.Google Scholar
ElGhawi, R, Kraft, B, Reimers, C, Reichstein, M, Körner, M, Gentine, P and WinklerWinkler, AJ (2023) Hybrid modeling of evapotranspiration: Inferring stomatal and aerodynamic resistances using combined physics-based and machine learning. Environmental Research Letters.CrossRefGoogle Scholar
European Commission. Statistical Office of the European Union (2021) LUCAS: The EU’s Land Use and Land Cover Survey: 2021 Edition. Publications Office.Google Scholar
Everingham, M, Eslami, SMA, Van Gool, L, Williams, CKI, Winn, J and Zisserman, A (2015) The Pascal visual object classes challenge: A retrospective. International Journal of Computer Vision.CrossRefGoogle Scholar
Farias, HLS, Silva, WR, de Oliveira Perdiz, R, Citó, AC, da Silva Carvalho, LC and Barbosa, RI (2020) Dataset on wood density of trees in ecotone forests in Northern Brazilian Amazonia. Data in Brief.CrossRefGoogle ScholarPubMed
Fassnacht, FE, Latifi, H, Stereńczak, K, Modzelewska, A, Lefsky, M, Waser, LT, Straub, C and Ghosh, A (2016) Review of studies on tree species classification from remotely sensed data. Remote Sensing of Environment 186, 6487.CrossRefGoogle Scholar
Fassnacht, FE, White, JC and Wulder, MA (2023) Remote sensing in forestry: Current challenges, considerations and directions. Forestry.Google Scholar
Feng, M, Sexton, J, Wang, P, Channan, S, Montesano, P, Wagner, W, Wooten, M and Neigh, C (2022) Arctic-Boreal Vulnerability Experiment (ABoVE)ABoVE: Tree Canopy Cover and Stand Age from Landsat, Boreal Forest Biome, 1984–2020. ORNL DAAC.Google Scholar
Ferraz, A, Saatchi, S, Xu, L, Hagen, S, Chave, J, Yu, Y, Meyer, V, Garcia, M, Silva, C, Roswintiart, O, Samboko, A, Sist, P, Walker, S, Pearson, TRH, Wijaya, A, Sullivan, FB, Rutishauser, E, Hoekman, D and Ganguly, S (2018) Carbon storage potential in degraded forests of Kalimantan, Indonesia. Environmental Research Letters.CrossRefGoogle Scholar
Finn, C, Abbeel, P and Levine, S (2017) Model-agnostic meta-learning for fast adaptation of deep networks. In International Conference on Machine Learning.Google Scholar
Forkuor, G, Benewinde Zoungrana, J-B, Dimobe, K, Ouattara, B, Vadrevu, KP and Tondoh, JE (2020) Above-ground biomass mapping in West African dryland forest using Sentinel-1 and 2 datasets—A case study. Remote Sensing of Environment 236, 111496.CrossRefGoogle Scholar
Freitas, S, Silva, H and Silva, E (2022) Hyperspectral imaging zero-shot learning for remote marine litter detection and classification. Remote Sensing 14(21), 5516.CrossRefGoogle Scholar
Friedl, MA, Sulla-Menashe, D, Tan, B, Schneider, A, Ramankutty, N, Sibley, A and Huang, X (2010) MODIS collection 5 global land cover: Algorithm refinements and characterization of new datasets. Remote Sensing of Environment.CrossRefGoogle Scholar
Gal, Y (2016) Uncertainty in Deep Learning. PhD Thesis, University of Cambridge.Google Scholar
Gal, Y and Ghahramani, Z (2016) Dropout as a Bayesian approximation: Representing model uncertainty in deep learning. In Proceedings of the 33rd International Conference on Machine Learning.Google Scholar
Gal, Y, Islam, R and Ghahramani, Z (2017) Deep Bayesian active learning with image data. In Proceedings of the 34th International Conference on Machine Learning.Google Scholar
Galuszynski, NC, Duker, R, Potts, AJ and Kattenborn, T (2022) Automated mapping of Portulacaria afra canopies for restoration monitoring with convolutional neural networks and heterogeneous unmanned aerial vehicle imagery. PeerJ.CrossRefGoogle ScholarPubMed
Ganin, Y and Lempitsky, V (2015) Unsupervised domain adaptation by backpropagation. In International Conference on Machine Learning.Google Scholar
Garioud, A, Peillet, S, Bookjans, E, Giordano, S and Wattrelos, B (2022) FLAIR #1: Semantic segmentation and domain adaptation dataset. Preprint, arXiv.Google Scholar
Garnot, VSF and Landrieu, L (2021) Panoptic segmentation of satellite image time series with convolutional temporal attention networks. In International Conference on Computer Vision.Google Scholar
Garnot, VSF, Landrieu, L and Chehata, N (2021) Multi-modal temporal attention models for crop mapping from satellite time series. Preprint, arXiv.Google Scholar
Gastauer, M, Leyh, W and Meira-Neto, J (2015) Tree diversity and dynamics of the forest of Seu Nico, Viçosa, Minas Gerais, Brazil. Biodiversity Data Journal.CrossRefGoogle ScholarPubMed
Gates, DM, Keegan, HJ, Schleter, JC and Weidner, VR (1965) Spectral properties of plants. Applied Optics 4(1), 11.CrossRefGoogle Scholar
Gazzea, M, Kristensen, LM, Pirotti, F, Ozguven, EE and Arghandeh, R (2022) Tree species classification using high-resolution satellite imagery and weakly supervised learning. IEEE Transactions on Geoscience and Remote Sensing 60, 111.CrossRefGoogle Scholar
Ge, S, Gu, H, Su, W, Lönnqvist, A and Antropov, O (2023) A novel semisupervised contrastive regression framework for forest inventory mapping with multisensor satellite data. IEEE Geoscience and Remote Sensing Letters 20, 15.Google Scholar
Geiss, A and Hardin, JC (2021) Strict enforcement of conservation laws and invertibility in CNN-based super resolution for scientific datasets. Preprint, arXiv.Google Scholar
Gidaris, S, Singh, P and Komodakis, N (2018) Unsupervised representation learning by predicting image rotations. In International Conference on Learning Representations.Google Scholar
Gomarasca, U, Migliavacca, M, Kattge, J, Nelson, JA, Niinemets, ü, Wirth, C, Cescatti, A, Bahn, M, Nair, R, Acosta, AT, Altaf Arain, M, Beloiu, M, Andrew Black, T, Bruun, HH, Bucher, SF, Buchmann, N, Byun, C, Carrara, A, Conte, A, da Silva, AC, Duveiller, G, Fares, S, Ibrom, A, Knohl, A, Komac, B, Limousin, J-M, Lusk, CH, Mahecha, MD, Martini, D, Minden, V, Montagnani, L, Mori, AS, Onoda, Y, Peñuelas, J, Perez-Priego, O, Poschlod, P, Powell, TL, Reich, PB, Šigut, L, van Bodegom, PM, Walther, S, Wohlfahrt, G, Wright, IJ and Reichstein, M (2023) Leaf-level coordination principles propagate to the ecosystem scale. Nature Communications.CrossRefGoogle Scholar
Gonzalez-Akre, E, Piponiot, C, Lepore, M, Herrmann, V, Lutz, JA, Baltzer, JL, Dick, CW, Gilbert, GS, He, F, Heym, M, Huerta, AI, Jansen, PA, Johnson, DJ, Knapp, N, Kral, K, Lin, D, Malhi, Y, McMahon, SM, Myers, JA, Orwig, D, Rodriguez-Hernandez, DI, Russo, SE, Shue, J, Wang, X, Wolf, A, Yang, T, Davies, SJ and Anderson-Teixeira, KJ (2022) Allodb: An R package for biomass estimation at globally distributed extratropical forest plots. Methods in Ecology and Evolution 13(2), 330338.CrossRefGoogle Scholar
Griffiths, P, Kuemmerle, T, Baumann, M, Radeloff, VC, Abrudan, IV, Lieskovsky, J, Munteanu, C, Ostapowicz, K and Hostert, P (2014) Forest disturbances, forest recovery, and changes in forest types across the Carpathian ecoregion from 1985 to 2010 based on Landsat image composites. Remote Sensing of Environment.CrossRefGoogle Scholar
Griscom, BW, Adams, J, Ellis, PW, Houghton, RA, Lomax, G, Miteva, DA, Schlesinger, WH, Shoch, D, Siikamäki, JV, Smith, P, Woodbury, P, Zganjar, C, Blackman, A, Campari, J, Conant, RT, Delgado, C, Elias, P, Gopalakrishna, T, Hamsik, MR, Herrero, M, Kiesecker, J, Landis, E, Laestadius, L, Leavitt, SM, Minnemeyer, S, Polasky, S, Potapov, P, Putz, FE, Sanderman, J, Silvius, M, Wollenberg, E and Fargione, J (2017) Natural climate solutions. Proceedings of the National Academy of Sciences.CrossRefGoogle ScholarPubMed
Griscom, BW, Busch, J, Cook-Patton, SC, Ellis, PW, Funk, J, Leavitt, SM, Lomax, G, Turner, WR, Chapman, M, Engelmann, J, Gurwick, NP, Landis, E, Lawrence, D, Malhi, Y, Schindler Murray, L, Navarrete, D, Roe, S, Scull, S, Smith, P, Streck, C, Walker, WS and Worthington, T (2020) National mitigation potential from natural climate solutions in the tropics. Philosophical Transactions of the Royal Society B: Biological Sciences.CrossRefGoogle ScholarPubMed
Grondin, V, Fortin, J-M, Pomerleau, F and Giguère, P (2022) Tree detection and diameter estimation based on deep learning. Forestry: An International Journal of Forest Research.Google Scholar
Guimaraes, N, Padua, L, Marques, P, Silva, N, Peres, E and Sousa, JJ (2020) Forestry remote sensing from unmanned aerial vehicles: A review focusing on the data, processing and potentialities. Remote Sensing 12(6), 1046.CrossRefGoogle Scholar
Haas, J and Rabus, B (2021) Uncertainty estimation for deep learning-based segmentation of roads in synthetic aperture radar imagery. Remote Sensing 13(8), 1472.CrossRefGoogle Scholar
Hackenberg, J, Spiecker, H, Calders, K, Disney, M and Raumonen, P (2015) SimpleTree—An efficient open source tool to build tree models from TLS clouds. Forests 6(11), 42454294.CrossRefGoogle Scholar
Han, J, Zhang, D, Cheng, G, Guo, L and Ren, J (2015) Object detection in optical remote sensing images based on weakly supervised learning and high-level feature learning. IEEE Transactions on Geoscience and Remote Sensing 53(6), 33253337.CrossRefGoogle Scholar
Hansen, MC, Potapov, PV, Moore, R, Hancher, M, Turubanova, SA, Tyukavina, A, Thau, D, Stehman, SV, Goetz, SJ, Loveland, TR, Kommareddy, A, Egorov, A, Chini, L, Justice, CO and Townshend, JRG (2013) High-resolution global maps of 21st-century forest cover change. Science.CrossRefGoogle ScholarPubMed
Harder, P, Ramesh, V, Hernandez-Garcia, A, Yang, Q, Sattigeri, P, Szwarcman, D, Watson, C and Rolnick, D (2023) Physics-constrained deep learning for climate downscaling. Preprint, arXiv.CrossRefGoogle Scholar
Harder, P, Watson-Parris, D, Stier, P, Strassel, D, Gauger, NR and Keuper, J (2022) Physics-informed learning of aerosol microphysics. Preprint, arXiv.CrossRefGoogle Scholar
Harris, N, Goldman, ED and Gibbes, (2019) Spatial Database of Planted Trees Version 1.0. World Resources Institute.Google Scholar
Hartmann, H, Bastos, A, Das, AJ, Esquivel-Muelbert, A, Hammond, WM, Martínez-Vilalta, J, McDowell, NG, Powers, JS, Pugh, TA, Ruthrof, KX and Allen, CD (2022) Climate change risks to global forest health: Emergence of unexpected events of elevated tree mortality worldwide. Annual Review of Plant Biology.CrossRefGoogle ScholarPubMed
He, K, Chen, X, Xie, S, Li, Y, Dollár, P and Girshick, R (2022) Masked autoencoders are scalable vision learners. In Computer Vision and Pattern Recognition Conference.CrossRefGoogle Scholar
He, K, Fan, H, Wu, Y, Xie, S and Girshick, R (2020) Momentum contrast for unsupervised visual representation learning. In Computer Vision and Pattern Recognition Conference.CrossRefGoogle Scholar
He, K, Gkioxari, G, Dollar, P and Girshick, R (2017) Mask R-CNN. In International Conference on Computer Vision.CrossRefGoogle Scholar
He, K, Zhang, X, Ren, S and Sun, J (2016) Deep residual learning for image recognition. In Conference on Computer Vision and Pattern Recognition.CrossRefGoogle Scholar
Hoeser, T and Kuenzer, C (2020) Object detection and image segmentation with deep learning on earth observation data: A review-part I: Evolution and recent trends. Remote Sensing 12(10), 1667.CrossRefGoogle Scholar
Hoffman, J, Tzeng, E, Park, T, Zhu, J-Y, Isola, P, Saenko, K, Efros, AA and Darrell, T (2018) CyCADA: Cycle-consistent adversarial domain adaptation. In International Conference on Machine Learning.Google Scholar
Hoffmann, J, Borgeaud, S, Mensch, A, Buchatskaya, E, Cai, T, Rutherford, E, Casas, L, Hendricks, LA, Welbl, J, Clark, A, Hennigan, T, Noland, E, Millican, K, Driessche, G v, Damoc, B, Guy, A, Osindero, S, Simonyan, K, Elsen, E, Rae, JW, Vinyals, O and Sifre, (2022) Training compute-optimal large language models.Google Scholar
Hu, J, Shen, L and Sun, G (2018) Squeeze-and-excitation networks. In Conference on Computer Vision and Pattern Recognition.CrossRefGoogle Scholar
Hudak, AT, Fekety, PA, Kane, VR, Kennedy, RE, Filippelli, SK, Falkowski, MJ, Tinkham, WT, Smith, AMS, Crookston, NL, Domke, GM, Corrao, MV, Bright, BC, Churchill, DJ, Gould, PJ, McGaughey, RJ, Kane, JT and Dong, J (2020) A carbon monitoring system for mapping regional, annual aboveground biomass across the northwestern USA. Environmental Research Letters.CrossRefGoogle Scholar
Ienco, D, Interdonato, R, Gaetano, R and Minh, DHT (2019) Combining Sentinel-1 and Sentinel-2 satellite image time series for land cover mapping via a multi-source deep learning architecture. ISPRS Journal of Photogrammetry and Remote Sensing.CrossRefGoogle Scholar
Illarionova, S, Trekin, A, Ignatiev, V and Oseledets, I (2021) Tree species mapping on Sentinel-2 satellite imagery with weakly supervised classification and object-wise sampling. Forests 12(10), 1413.Google Scholar
Intergovernmental Panel On Climate Change (IPCC) (2023) Climate Change 2022—Impacts, Adaptation and Vulnerability: Working Group II Contribution to the Sixth Assessment Report of the Intergovernmental Panel on Climate Change. Cambridge: Cambridge University Press.Google Scholar
Ioffe, S and Szegedy, C (2015) Batch normalization: Accelerating deep network training by reducing internal covariate shift. In International Conference on Machine Learning.Google Scholar
Irvin, J, Sheng, H, Ramachandran, N, Johnson-Yu, S, Zhou, S, Story, K, Rustowicz, R, Elsworth, C, Austin, K and Ng, AY (2020) ForestNet: Classifying drivers of deforestation in Indonesia using deep learning on satellite imagery. In Conference on Neural Information Processing Systems Workshop.Google Scholar
Jaegle, A, Borgeaud, S, Alayrac, J-B, Doersch, C, Ionescu, C, Ding, D, Koppula, S, Zoran, D, Brock, A, Shelhamer, E, Hénaff, O, Botvinick, MM, Zisserman, A, Vinyals, O and Carreira, J (2022) Perceiver IO: A general architecture for structured inputs & outputs. In International Conference on Learning Representations.Google Scholar
Jaegle, A, Gimeno, F, Brock, A, Zisserman, A, Vinyals, O and Carreira, J (2021) Perceiver: General perception with iterative attention. In International Conference on Machine Learning.Google Scholar
Jain, J, Li, J, Chiu, M, Hassani, A, Orlov, N and Shi, H (2023a) OneFormer: One transformer to rule universal image segmentation. In Computer Vision and Pattern Recognition Conference.CrossRefGoogle Scholar
Jain, P, Schoen-Phelan, B and Ross, R (2023b) Multi-modal self-supervised representation learning for earth observation. In IEEE International Geoscience and Remote Sensing Symposium.Google Scholar
Jain, U, Wilson, A and Gulshan, V (2022) Multimodal contrastive learning for remote sensing tasks. Preprint, arXiv.Google Scholar
Jaritz, M, Vu, T-H, de Charette, R, Wirbel, É and Pérez, P (2020). xMUDA: Cross-modal unsupervised domain adaptation for 3D semantic segmentation. In Computer Vision and Pattern Recognition Conference.Google Scholar
Jiang, CM, Esmaeilzadeh, S, Azizzadenesheli, K, Kashinath, K, Mustafa, M, Tchelepi, HA, Marcus, P, Prabhat, and Anandkumar, A (2020) MeshfreeFlowNet: A physics-constrained deep continuous space-time super-resolution framework. In Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis.CrossRefGoogle Scholar
Jiang, X, Zhou, N and Li, X (2022) Few-shot segmentation of remote sensing images using deep metric learning. IEEE Geoscience and Remote Sensing Letters 19, 15.Google Scholar
Jucker, T, Fischer, FJ, Chave, J, Coomes, DA, Caspersen, J, Ali, A, Loubota Panzou, GJ, Feldpausch, TR, Falster, D, Usoltsev, VA, Adu-Bredu, S, Alves, LF, Aminpour, M, Angoboy, IB, Anten, NPR, Antin, C, Askari, Y, Muñoz, R, Ayyappan, N, Balvanera, P, Banin, L, Barbier, N, Battles, JJ, Beeckman, H, Bocko, YE, Bond-Lamberty, B, Bongers, F, Bowers, S, Brade, T, van Breugel, M, Chantrain, A, Chaudhary, R, Dai, J, Dalponte, M, Dimobe, K, Domec, J-C, Doucet, J-L, Duursma, RA, Enríquez, M, van Ewijk, KY, Farfán-Rios, W, Fayolle, A, Forni, E, Forrester, DI, Gilani, H, Godlee, JL, Gourlet-Fleury, S, Haeni, M, Hall, JS, He, J-K, Hemp, A, Hernández-Stefanoni, JL, Higgins, SI, Holdaway, RJ, Hussain, K, Hutley, LB, Ichie, T, Iida, Y, Jiang, H-s, Joshi, PR, Kaboli, H, Larsary, MK, Kenzo, T, Kloeppel, BD, Kohyama, T, Kunwar, S, Kuyah, S, Kvasnica, J, Lin, S, Lines, ER, Liu, H, Lorimer, C, Loumeto, J-J, Malhi, Y, Marshall, PL, Mattsson, E, Matula, R, Meave, JA, Mensah, S, Mi, X, Momo, S, Moncrieff, GR, Mora, F, Nissanka, SP, O’Hara, KL, Pearce, S, Pelissier, R, Peri, PL, Ploton, P, Poorter, L, Pour, MJ, Pourbabaei, H, Dupuy-Rada, JM, Ribeiro, SC, Ryan, C, Sanaei, A, Sanger, J, Schlund, M, Sellan, G, Shenkin, A, Sonké, B, Sterck, FJ, Svátek, M, Takagi, K, Trugman, AT, Ullah, F, Vadeboncoeur, MA, Valipour, A, Vanderwel, MC, Vovides, AG, Wang, W, Wang, L-Q, Wirth, C, Woods, M, Xiang, W, Ximenes, F, Xu, Y, Yamada, T and Zavala, MA (2022) Tallo: A global tree allometry and crown architecture database. Global Change Biology.CrossRefGoogle Scholar
Jung, M, Koirala, S, Weber, U, Ichii, K, Gans, F, Camps-Valls, G, Papale, D, Schwalm, C, Tramontana, G and Reichstein, M (2019) The FLUXCOM ensemble of global land-atmosphere energy fluxes. Scientific Data.CrossRefGoogle ScholarPubMed
Kalinicheva, E, Landrieu, L, Mallet, C and Chehata, N (2022) Multi-layer modeling of dense vegetation from aerial LiDAR scans. In Computer Vision and Pattern Recognition Conference Workshop.Google Scholar
Kampe, TU (2010) NEON: The first continental-scale ecological observatory with airborne remote sensing of vegetation canopy biochemistry and structure. Journal of Applied Remote Sensing 4(1), 043510.CrossRefGoogle Scholar
Kattenborn, T, Eichel, J and Fassnacht, FE (2019a) Convolutional neural networks enable efficient, accurate and fine-grained segmentation of plant species and communities from high-resolution UAV imagery. Scientific Reports.CrossRefGoogle ScholarPubMed
Kattenborn, T, Eichel, J, Wiser, S, Burrows, L, Fassnacht, FE and Schmidtlein, S (2020) Convolutional neural networks accurately predict cover fractions of plant species and communities in unmanned aerial vehicle imagery. Remote Sensing in Ecology and Conservation.CrossRefGoogle Scholar
Kattenborn, T, Leitloff, J, Schiefer, F and Hinz, S (2021) Review on convolutional neural networks (CNN) in vegetation remote sensing. ISPRS Journal of Photogrammetry and Remote Sensing.CrossRefGoogle Scholar
Kattenborn, T, Lopatin, J, Förster, M, Braun, AC and Fassnacht, FE (2019b) UAV data as alternative to field sampling to map woody invasive species based on combined Sentinel-1 and Sentinel-2 data. Remote Sensing of Environment.CrossRefGoogle Scholar
Kattenborn, T, Richter, R, Guimarães-Steinicke, C, Feilhauer, H and Wirth, C (2022a) AngleCam: Predicting the temporal variation of leaf angle distributions from image series with deep learning. Methods in Ecology and Evolution.CrossRefGoogle Scholar
Kattenborn, T, Schiefer, F, Frey, J, Feilhauer, H, Mahecha, MD and Dormann, CF (2022b) Spatially autocorrelated training and validation samples inflate performance assessment of convolutional neural networks. ISPRS Open Journal of Photogrammetry and Remote Sensing.CrossRefGoogle Scholar
Keenan, RJ, Reams, GA, Achard, F, de Freitas, JV, Grainger, A and Lindquist, E (2015) Dynamics of global forest area: Results from the FAO Global Forest Resources Assessment 2015. Forest Ecology and Management.CrossRefGoogle Scholar
Kentsch, S, Lopez Caceres, ML, Serrano, D, Roure, F and Diez, Y (2020) Computer vision and deep learning techniques for the analysis of drone-acquired forest images, a transfer learning study. Remote Sensing.CrossRefGoogle Scholar
Kindermann, L, Dobler, M, Niedeggen, D, Fabiano, EC and Linstädter, A (2022) Dataset on woody aboveground biomass, disturbance losses, and wood density from an African savanna ecosystem. Data in Brief.CrossRefGoogle ScholarPubMed
Kirillov, A, Mintun, E, Ravi, N, Mao, H, Rolland, C, Gustafson, L, Xiao, T, Whitehead, S, Berg, AC, Lo, W-Y, Dollár, P and Girshick, R (2023) Segment anything. Preprint, arXiv.CrossRefGoogle Scholar
Klosterman, ST, Hufkens, K, Gray, JM, Melaas, E, Sonnentag, O, Lavine, I, Mitchell, L, Norman, R, Friedl, MA and Richardson, AD (2014) Evaluating remote sensing of deciduous forest phenology at multiple spatial scales using PhenoCam imagery. Biogeosciences 11(16), 43054320.CrossRefGoogle Scholar
Körner, C, Basler, D, Hoch, G, Kollas, C, Lenz, A, Randin, CF, Vitasse, Y and Zimmermann, NE (2016) Where, why and how? Explaining the low-temperature range limits of temperate tree species. Journal of Ecology. 104(4), 10761088CrossRefGoogle Scholar
Koskinen, J, Leinonen, U, Vollrath, A, Ortmann, A, Lindquist, E, d’Annunzio, R, Pekkarinen, A and Käyhkö, N (2019) Participatory mapping of forest plantations with open Foris and Google earth engine. ISPRS Journal of Photogrammetry and Remote Sensing.CrossRefGoogle Scholar
Kosmala, M, Hufkens, K and Richardson, AD (2018) Integrating camera imagery, crowdsourcing, and deep learning to improve high-frequency automated monitoring of snow at continental-to-global scales. PLoS One. 13(12), e0209649.CrossRefGoogle ScholarPubMed
Krizhevsky, A, Sutskever, I and Hinton, GE (2012) ImageNet classification with deep convolutional neural networks. In Conference on Neural Information Processing Systems.Google Scholar
Kulawardhana, RW (2011) Remote sensing of vegetation: Principles, techniques and applications. By Hamlyn G. Jones and Robin A Vaughan: Book review. Journal of Vegetation Science 22(6), 11511153.CrossRefGoogle Scholar
Laar, AV and Akça, A (2007) Forest Mensuration. Springer Dordrecht: Managing Forest Ecosystems.CrossRefGoogle Scholar
Lacoste, A, Lehmann, N, Rodriguez, P, Sherwin, ED, Kerner, H, Lütjens, B, Irvin, JA, Dao, D, Alemohammad, H, Drouin, A, Gunturkun, M, Huang, G, Vazquez, D, Newman, D, Bengio, Y, Ermon, S and Zhu, XX (2023) GEO-bench: Toward foundation models for earth monitoring. Preprint, arXiv.Google Scholar
Lagomasino, D, Fatoyinbo, L, Castaneda, E, Cook, B, Montesano, P, Neigh, C, Corp, L, Ott, L, Chavez, S and Morton, D (2020) Storm surge, not wind, caused mangrove dieback in Southwest Florida following hurricane Irma. EarthArXiv.Google Scholar
Lakshminarayanan, B, Pritzel, A and Blundell, C (2017) Simple and scalable predictive uncertainty estimation using deep ensembles. In Conference on Neural Information Processing Systems.Google Scholar
Lampert, CH, Nickisch, H and Harmeling, S (2014) Attribute-based classification for zero-shot visual object categorization. IEEE Transactions on Pattern Analysis and Machine Intelligence 36(3), 453465.CrossRefGoogle ScholarPubMed
Lang, N, Jetz, W, Schindler, K and Wegner, JD (2022a) A high-resolution canopy height model of the earth. Preprint, arXiv.CrossRefGoogle Scholar
Lang, N, Kalischek, N, Armston, J, Schindler, K, Dubayah, R and Wegner, JD (2022b) Global canopy height regression and uncertainty estimation from GEDI LIDAR waveforms with deep ensembles. Remote Sensing of Environment 268, 112760.CrossRefGoogle Scholar
Le Toan, T, Quegan, S, Davidson, M, Balzter, H, Paillou, P, Papathanassiou, K, Plummer, S, Rocca, F, Saatchi, S, Shugart, H and Ulander, L (2011) The BIOMASS mission: Mapping global forest biomass to better understand the terrestrial carbon cycle. Remote Sensing of Environment 115(11), 28502860.CrossRefGoogle Scholar
Leclerc, S, Smistad, E, Pedrosa, J, Ostvik, A, Cervenansky, F, Espinosa, F, Espeland, T, Berg, EAR, Jodoin, P-M, Grenier, T, Lartizien, C, Dhooge, , Lovstakken, L and and Bernard, O (2019) Deep learning for segmentation using an open large-scale dataset in 2D echocardiography. IEEE Transactions on Medical Imaging 38(9), 21982210.CrossRefGoogle ScholarPubMed
Lee, D and Choi, Y (2022) MultiEarth 2022 deforestation challenge proceedings—ForestGump. In Computer Vision and Pattern Recognition Conference Workshop.Google Scholar
Lehtinen, J, Munkberg, J, Hasselgren, J, Laine, S, Karras, T, Aittala, M and Aila, T (2018) Noise2Noise: Learning image restoration without clean data. In International Conference on Machine Learning.Google Scholar
Levin, N, Yebra, M and Phinn, S (2021) Unveiling the factors responsible for Australia’s black summer fires of 2019/2020. Fire.CrossRefGoogle Scholar
Li, A, Lu, Z, Wang, L, Xiang, T and Wen, J-R (2017) Zero-shot scene classification for high spatial resolution remote sensing images. IEEE Transactions on Geoscience and Remote Sensing 55(7), 41574167.CrossRefGoogle Scholar
Li, F, Zhang, H, Xu, H, Liu, S, Zhang, L, Ni, LM and Shum, H-Y (2023a) Mask DINO: Towards a unified transformer-based framework for object detection and segmentation. In Computer Vision and Pattern Recognition Conference.CrossRefGoogle Scholar
Li, H, Zhu, J, Jiang, X, Zhu, X, Li, H, Yuan, C, Wang, X, Qiao, Y, Wang, X, Wang, W and Dai, J (2023b) Uni-perceiver v2: A generalist model for large-scale vision and vision-language tasks. In Computer Vision and Pattern Recognition Conference.CrossRefGoogle Scholar
Li, J, Bioucas-Dias, JM and Plaza, A (2010) Semisupervised hyperspectral image segmentation using multinomial logistic regression with active learning. IEEE Transactions on Geoscience and Remote Sensing.CrossRefGoogle Scholar
Li, J, Bioucas-Dias, JM and Plaza, A (2011) Hyperspectral image segmentation using a new Bayesian approach with active learning. IEEE Transactions on Geoscience and Remote Sensing 49(10), 39473960.CrossRefGoogle Scholar
Li, K, Wan, G, Cheng, G, Meng, L and Han, J (2020) Object detection in optical remote sensing images: A survey and a new benchmark. ISPRS Journal of Photogrammetry and Remote Sensing 159, 296307.CrossRefGoogle Scholar
Li, Y, Chai, G, Wang, Y, Lei, L and Zhang, X (2022a) Ace R-CNN: An attention complementary and edge detection-based instance segmentation algorithm for individual tree species identification using UAV RGB images and lidar data. Remote Sensing.Google Scholar
Li, Y, Kong, D, Zhang, Y, Tan, Y and Chen, L (2021) Robust deep alignment network with remote sensing knowledge graph for zero-shot and generalized zero-shot remote sensing image scene classification. ISPRS Journal of Photogrammetry and Remote Sensing 179, 145158.CrossRefGoogle Scholar
Li, Z, Zhang, D, Wang, Y, Lin, D and Zhang, J (2022b) Generative adversarial networks for zero-shot remote sensing scene classification. Applied Sciences 12(8), 3760.CrossRefGoogle Scholar
Liang, J and Gamarra, JGP (2020) The importance of sharing global forest data in a world of crises. Scientific Data 7(1).CrossRefGoogle Scholar
Liang, X, Kankare, V, Hyyppä, J, Wang, Y, Kukko, A, Haggrén, H, Yu, X, Kaartinen, H, Jaakkola, A, Guan, F, Holopainen, M and Vastaranta, M (2016) Terrestrial laser scanning in forest inventories. ISPRS Journal of Photogrammetry and Remote Sensing 115, 6377.CrossRefGoogle Scholar
Lin, T-Y, Goyal, P, Girshick, R, He, K and Dollár, P (2017) Focal loss for dense object detection. In International Conference on Computer Vision.CrossRefGoogle Scholar
Lin, T-Y, Maire, M, Belongie, S, Hays, J, Perona, P, Ramanan, D, Dollár, P and Zitnick, CL (2014) Microsoft COCO: Common objects in context. In European Conference on Computer Vision.CrossRefGoogle Scholar
Liu, B, Yu, X, Yu, A, Zhang, P, Wan, G and Wang, R (2019) Deep few-shot learning for hyperspectral image classification. IEEE Transactions on Geoscience and Remote Sensing.Google Scholar
Liu, W, Anguelov, D, Erhan, D, Szegedy, C, Reed, SE, Fu, C-Y and Berg, AC (2016) SSD: Single shot MultiBox detector. In European Conference on Computer Vision.CrossRefGoogle Scholar
Liu, Z, Lin, Y, Cao, Y, Hu, H, Wei, Y, Zhang, Z, Lin, S and Guo, B (2021) Swin transformer: Hierarchical vision transformer using shifted windows. In International Conference on Computer Vision.CrossRefGoogle Scholar
Long, J, Shelhamer, E and Darrell, T (2015) Fully convolutional networks for semantic segmentation. In Conference on Computer Vision and Pattern Recognition.CrossRefGoogle Scholar
López-Jiménez, E, Vasquez-Gomez, JI, Sanchez-Acevedo, MA, Herrera-Lozada, JC and Uriarte-Arcia, AV (2019) Columnar cactus recognition in aerial images using a deep learning approach. Ecological Informatics.CrossRefGoogle Scholar
Lu, D, Chen, Q, Wang, G, Liu, L, Li, G and Moran, E (2016) A survey of remote sensing-based aboveground biomass estimation methods in forest ecosystems. International Journal of Digital Earth.CrossRefGoogle Scholar
Ma, L, Hurtt, G, Tang, H, Lamb, R, Campbell, E, Dubayah, R, Guy, M, Huang, W, Lister, A, Lu, J, O’Neil-Dunne, J, Rudee, A, Shen, Q and Silva, C (2021a) High-resolution forest carbon modelling for climate mitigation planning over the RGGI region, USA. Environmental Research Letters.Google Scholar
Ma, L, Liu, Y, Zhang, X, Ye, Y, Yin, G and Johnson, BA (2019) Deep learning in remote sensing applications: A meta-analysis and review. ISPRS Journal of Photogrammetry and Remote Sensing 152, 166177.CrossRefGoogle Scholar
Ma, X, Zhang, X, Wang, Z and Pun, M-O (2023) Unsupervised domain adaptation augmented by mutually boosted attention for semantic segmentation of VHR remote sensing images. IEEE Transactions on Geoscience and Remote Sensing 61, 115.Google Scholar
Ma, Y, Zhang, Z, Kang, Y and özdoğan, M (2021b) Corn yield prediction and uncertainty analysis based on remotely sensed variables using a Bayesian neural network approach. Remote Sensing of Environment 259, 112408.CrossRefGoogle Scholar
Mai, G, Huang, W, Sun, J, Song, S, Mishra, D, Liu, N, Gao, S, Liu, T, Cong, G, Hu, Y, Cundy, C, Li, Z, Zhu, R and Lao, N (2023a) On the opportunities and challenges of foundation models for geospatial artificial intelligence. Preprint, arXiv.Google Scholar
Mai, G, Lao, N, He, Y, Song, J, and Ermon, S (2023b) CSP: Self-supervised contrastive spatial pre-training for geospatial-visual representations. In International Conference on Machine Learning.Google Scholar
Malinin, A and Gales, M (2018) Predictive uncertainty estimation via prior networks. Advances in Neural Information Processing Systems.Google Scholar
Marconi, S, Graves, SJ, Gong, D, Nia, MS, Le Bras, M, Dorr, BJ, Fontana, P, Gearhart, J, Greenberg, C, Harris, DJ, Kumar, SA, Nishant, A, Prarabdh, J, Rege, SU, Bohlman, SA, White, EP and Wang, DZ (2019) A data science challenge for converting airborne remote sensing data into ecological information. PeerJ.CrossRefGoogle ScholarPubMed
Martin, MP, Woodbury, DJ, Doroski, DA, Nagele, E, Storace, , Cook-Patton, SC, Pasternack, R and Ashton, MS (2021) People plant trees for utility more often than for biodiversity or carbon. Biological Conservation.CrossRefGoogle ScholarPubMed
Mavrovic, A, Sonnentag, O, Lemmetyinen, J, Baltzer, JL, Kinnard, C and Roy, A (2023) Reviews and syntheses: Recent advances in microwave remote sensing in support of terrestrial carbon cycle science in Arctic-boreal regions. Biogeosciences 20(14), 29412970.CrossRefGoogle Scholar
Maxwell, AE, Warner, TA and Fang, F (2018) Implementation of machine-learning classification in remote sensing: An applied review. International Journal of Remote Sensing 39(9), 27842817.CrossRefGoogle Scholar
Meireles, JE, Cavender-Bares, J, Townsend, PA, Ustin, S, Gamon, JA, Schweiger, AK, Schaepman, ME, Asner, GP, Martin, RE, Singh, A, Schrodt, F, Chlus, A and O’Meara, BC (2020) Leaf reflectance spectra capture the evolutionary history of seed plants. New Phytologist 228(2), 485493.CrossRefGoogle ScholarPubMed
Meraoumia, I, Dalsasso, E, Denis, L, Abergel, R and Tupin, F (2023) Multitemporal speckle reduction with self-supervised deep neural networks. IEEE Transactions on Geoscience and Remote Sensing 61, 114.CrossRefGoogle Scholar
Michalowska, M and Rapinski, J (2021) A review of tree species classification based on airborne LiDAR data and applied classifiers. Remote Sensing 13(3), 353.CrossRefGoogle Scholar
Migliavacca, M, Musavi, T, Mahecha, MD, Nelson, JA, Knauer, J, Baldocchi, DD, Perez-Priego, O, Christiansen, R, Peters, J, Anderson, K, Bahn, M, Andrew Black, T, Blanken, PD, Bonal, D, Buchmann, N, Caldararu, S, Carrara, A, Carvalhais, N, Cescatti, A, Chen, J, Cleverly, J, Cremonese, E, Desai, AR, El-Madany, TS, Farella, MM, Fernández-Martínez, M, Filippa, G, Forkel, M, Galvagno, M, Gomarasca, U, Gough, CM, Göckede, M, Ibrom, A, Ikawa, H, Janssens, IA, Jung, M, Kattge, J, Keenan, TF, Knohl, A, Kobayashi, H, Kraemer, G, Law, BE, Liddell, MJ, Ma, X, Mammarella, I, Martini, D, Macfarlane, C, Matteucci, G, Montagnani, L, Pabon-Moreno, DE, Panigada, C, Papale, D, Pendall, E, Penuelas, J, Phillips, RP, Reich, PB, Rossini, M, Rotenberg, E, Scott, RL, Stahl, C, Weber, U, Wohlfahrt, G, Wolf, S, Wright, IJ, Yakir, D, Zaehle, S and Reichstein, M (2021) The three major axes of terrestrial ecosystem function. Nature.CrossRefGoogle ScholarPubMed
Millennium Ecosystem Assessment (2001) Millennium Ecosystem Assessment. Millennium Ecosystem Assessment.Google Scholar
Moradi, F, Javan, FD and Samadzadegan, F (2022) Potential evaluation of visible-thermal UAV image fusion for individual tree detection based on convolutional neural network. International Journal of Applied Earth Observation and Geoinformation 113, 103011.CrossRefGoogle Scholar
Morales, G, Kemper, G, Sevillano, G, Arteaga, D, Ortega, I and Telles, J (2018) Automatic segmentation of Mauritia flexuosa in unmanned aerial vehicle (UAV) imagery using deep learning. Forests.CrossRefGoogle Scholar
Motz, K, Sterba, H and Pommerening, A (2010) Sampling measures of tree diversity. Forest Ecology and Management 260(11), 19851996.CrossRefGoogle Scholar
National Ecological Observatory Network (NEON) (2023). Vegetation structure (DP1.10098.001): RELEASE-2023.Google Scholar
Nguyen, HT, Lopez Caceres, ML, Moritake, K, Kentsch, S, Shu, H and Diez, Y (2021) Individual sick fir tree (Abies mariesii) identification in insect infested forests by means of UAV images and deep learning. Remote Sensing.CrossRefGoogle Scholar
Oliveira, R, Farias, H, Perdiz, R, Scudeller, V and Imbrozio Barbosa, R (2017) Structure and tree species composition in different habitats of savanna used by indigenous people in the northern Brazilian Amazon. Biodiversity Data Journal.CrossRefGoogle ScholarPubMed
Oquab, M, Bottou, L, Laptev, I and Sivic, J (2015) Is object localization for free?—Weakly-supervised learning with convolutional neural networks. In Computer Vision and Pattern Recognition Conference.CrossRefGoogle Scholar
Oquab, M, Darcet, T, Moutakanni, T, Vo, H, Szafraniec, M, Khalidov, V, Fernandez, P, Haziza, D, Massa, F, El-Nouby, A, Assran, M, Ballas, N, Galuba, W, Howes, R, Huang, P-Y, Li, S-W, Misra, I, Rabbat, M, Sharma, V, Synnaeve, G, Xu, H, Jegou, H, Mairal, J, Labatut, P, Joulin, A and Bojanowski, P (2023) DINOv2: Learning robust visual features without supervision. Preprint, arXiv.Google Scholar
Ottlé, C, Lescure, J, Maignan, F, Poulter, B, Wang, T and Delbart, N (2013) Use of various remote sensing land cover products for plant functional type mapping over Siberia. Earth System Science Data.CrossRefGoogle Scholar
Ouaknine, A (2022) Deep Learning for Radar Data Exploitation of Autonomous Vehicle. PhD thesis, Institut polytechnique de Paris.Google Scholar
Ouaknine, A, Newson, A, Pérez, P, Tupin, F and Rebut, J (2021a) Multi-view radar semantic segmentation. In International Conference on Computer Vision.CrossRefGoogle Scholar
Ouaknine, A, Newson, A, Rebut, J, Tupin, F and Perez, P (2021b) CARRADA dataset: Camera and automotive radar with range-angle-Doppler annotations. In Conference in Pattern Recognition Pattern Recognition.CrossRefGoogle Scholar
Pan, H, Gao, F, Dong, J and Du, Q (2023) Multi-scale adaptive fusion network for hyperspectral image denoising. Preprint, arXiv.CrossRefGoogle Scholar
Patterson, PL, Healey, SP, Ståhl, G, Saarela, S, Holm, S, Andersen, H-E, Dubayah, RO, Duncanson, L, Hancock, S, Armston, J, Kellner, JR, Cohen, WB and Yang, Z (2019) Statistical properties of hybrid estimators proposed for GEDI-NASA’s global ecosystem dynamics investigation. Environmental Research Letters.CrossRefGoogle Scholar
Paz-Kagan, T, Caras, T, Herrmann, I, Shachak, M and Karnieli, A (2017) Multiscale mapping of species diversity under changed land use using imaging spectroscopy. Ecological Applications.CrossRefGoogle ScholarPubMed
Pérez-Luque, AJ, Gea-Izquierdo, G and Zamora, R (2021) Land-use legacies and climate change as a double challenge to oak forest resilience: Mismatches of geographical and ecological rear edges. Ecosystems.CrossRefGoogle Scholar
Pérez-Luque, AJ, Zamora, R, Bonet, FJ and Pérez-Pérez, R (2015) Dataset of MIGRAME project (global change, altitudinal range shift and colonization of degraded habitats in Mediterranean Mountains). PhytoKeys.CrossRefGoogle ScholarPubMed
Peyre, J, Sivic, J, Laptev, I and Schmid, C (2017) Weakly-supervised learning of visual relations. In International Conference on Computer Vision.CrossRefGoogle Scholar
Pflugmacher, D, Rabe, A, Peters, M and Hostert, P (2019) Mapping pan-European land cover using Landsat spectral-temporal metrics and the European LUCAS survey. Remote Sensing of Environment.CrossRefGoogle Scholar
Ploton, P, Mortier, F, Réjou-Méchain, M, Barbier, N, Picard, N, Rossi, V, Dormann, C, Cornu, G, Viennois, G, Bayol, N, Lyapustin, A, Gourlet-Fleury, S and Pélissier, R (2020) Spatial validation reveals poor predictive performance of large-scale ecological mapping models. Nature Communications.CrossRefGoogle ScholarPubMed
Potapov, P, Hansen, MC, Laestadius, L, Turubanova, S, Yaroshenko, A, Thies, C, Smith, W, Zhuravleva, I, Komarova, A, Minnemeyer, S and Esipova, E (2017) The last frontiers of wilderness: Tracking loss of intact forest landscapes from 2000 to 2013. Science Advances.CrossRefGoogle ScholarPubMed
Potapov, P, Hansen, MC, Pickens, A, Hernandez-Serna, A, Tyukavina, A, Turubanova, S, Zalles, V, Li, X, Khan, A, Stolle, F, Harris, N, Song, X-P, Baggett, A, Kommareddy, I and Kommareddy, A (2022) The global 2000–2020 land cover and land use change dataset derived from the Landsat archive: First results. Frontiers in Remote Sensing.CrossRefGoogle Scholar
Potapov, P, Li, X, Hernandez-Serna, A, Tyukavina, A, Hansen, MC, Kommareddy, A, Pickens, A, Turubanova, S, Tang, H, Silva, CE, Armston, J, Dubayah, R, Bryan Blair, J and Hofton, M (2021) Mapping global forest canopy height through integration of GEDI and Landsat data. Remote Sensing of Environment.CrossRefGoogle Scholar
Potapov, P, Tyukavina, A, Turubanova, S, Talero, Y, Hernandez-Serna, A, Hansen, M, Saah, D, Tenneson, K, Poortinga, A, Aekakkararungroj, A, Chishtie, F, Towashiraporn, P, Bhandari, B, Aung, K and Nguyen, Q (2019) Annual continuous fields of woody vegetation structure in the lower Mekong region from 2000–2017 Landsat time-series. Remote Sensing of Environment.CrossRefGoogle Scholar
Qu, Z, Du, J, Cao, Y, Guan, Q and Zhao, P (2020) Deep active learning for remote sensing object detection. Preprint, arXiv.Google Scholar
Radford, A, Kim, JW, Hallacy, C, Ramesh, A, Goh, G, Agarwal, S, Sastry, G, Askell, A, Mishkin, P, Clark, J, Krueger, G and Sutskever, I (2021). Learning transferable visual models from natural language supervision. In International Conference on Machine Learning.Google Scholar
Radford, A, Wu, J, Child, R, Luan, D, Amodei, D and Sutskever, I (2019) Language models are unsupervised multitask learners. Preprint, arXiv.Google Scholar
Rahaman, N, Weiss, M, Träuble, F, Locatello, F, Lacoste, A, Bengio, Y, Pal, C, Li, LE and Schölkopf, B (2022). A general purpose neural architecture for geospatial systems. In Conference on Neural Information Processing Systems Workshop.Google Scholar
Ramaswami, G, Sidhu, S and Quader, S (2020) Using citizen science to build baseline data on tropical tree phenology. BioRxiv.CrossRefGoogle Scholar
Rao, K, Williams, AP, Flefil, JF and Konings, AG (2020) SAR-enhanced mapping of live fuel moisture content. Remote Sensing of Environment.CrossRefGoogle Scholar
Rebut, J, Ouaknine, A, Malik, W and Pérez, P (2022) Raw high-definition radar for multi-task learning. In Computer Vision and Pattern Recognition Conference.CrossRefGoogle Scholar
Redmon, J, Divvala, S, Girshick, R and Farhadi, A (2016) You only look once: Unified, real-time object detection. In Conference on Computer Vision and Pattern Recognition.CrossRefGoogle Scholar
Redmon, J and Farhadi, A (2018) YOLOv3: An incremental improvement. Preprint, arXiv.Google Scholar
Reed, CJ, Gupta, R, Li, S, Brockman, S, Funk, C, Clipp, B, Funk, C, Candido, S, Uyttendaele, M and Darrell, T (2022) Scale-MAE: A scale-aware masked autoencoder for multiscale geospatial representation learning. Preprint, arXiv.CrossRefGoogle Scholar
Regnier, P, Resplandy, L, Najjar, RG and Ciais, P (2022) The land-to-ocean loops of the global carbon cycle. Nature.CrossRefGoogle ScholarPubMed
Reichstein, M, Camps-Valls, G, Stevens, B, Jung, M, Denzler, J, Carvalhais, N and Prabhat, f (2019) Deep learning and process understanding for data-driven earth system science. Nature.CrossRefGoogle ScholarPubMed
Reiersen, G, Dao, D, Lütjens, B, Klemmer, K, Amara, K, Steinegger, A, Zhang, C and Zhu, X (2022) ReforesTree: A Dataset for Estimating Tropical Forest Carbon Stock with Deep Learning and Aerial Imagery. Association for the Advancement of Artificial Intelligence.CrossRefGoogle Scholar
Reis, R, dos Santos, FN and Santos, L (2020) Forest robot and datasets for biomass collection. In Robot 2019: Fourth Iberian Robotics Conference.CrossRefGoogle Scholar
Ren, S, He, K, Girshick, R and Sun, J (2015) Faster R-CNN: Towards real-time object detection with region proposal networks. In Conference on Neural Information Processing Systems.Google Scholar
Requena-Mesa, C, Reichstein, M, Mahecha, M, Kraft, B and Denzler, J (2018) Predicting landscapes as seen from space from environmental conditions. In International Geoscience and Remote Sensing Symposium, Valencia.CrossRefGoogle Scholar
Richardson, AD, Hufkens, K, Milliman, T and Frolking, S (2018) Intercomparison of phenological transition dates derived from the PhenoCam dataset V1.0 and MODIS satellite remote sensing. Scientific Reports 8(1).CrossRefGoogle ScholarPubMed
Robin, C, Requena-Mesa, C, Benson, V, Alonso, L, Poehls, J, Carvalhais, N and Reichstein, M (2022) Learning to forecast vegetation greenness at fine resolution over Africa with ConvLSTMs. In Conference on Neural Information Processing Systems Workshop.Google Scholar
Robinson, C, Hou, L, Malkin, K, Soobitsky, R, Czawlytko, J, Dilkina, B and Jojic, N (2019) Large scale high-resolution land cover mapping with multi-resolution data. In Computer Vision and Pattern Recognition Conference.CrossRefGoogle Scholar
Rolnick, D, Donti, PL, Kaack, LH, Kochanski, K, Lacoste, A, Sankaran, K, Ross, AS, Milojevic-Dupont, N, Jaques, N, Waldman-Brown, A, Luccioni, AS, Maharaj, T, Sherwin, ED, Mukkavilli, SK, Kording, KP, Gomes, CP, Ng, AY, Hassabis, D, Platt, JC, Creutzig, F, Chayes, J and Bengio, Y (2023) Tackling climate change with machine learning. ACM Computing Surveys 55(2), 196.CrossRefGoogle Scholar
Ronneberger, O, Fischer, P and Brox, T (2015) U-net: Convolutional networks for biomedical image segmentation. In International Conference on Medical Image Computing and Computer Assisted Intervention.CrossRefGoogle Scholar
Roy, S, Unmesh, A and Namboodiri, VP (2019) Deep active learning for object detection. In British Machine Vision Conference.Google Scholar
Rußwurm, M, Wang, S, Körner, M and Lobell, D (2020) Meta-learning for few-shot land cover classification. In Computer Vision and Pattern Recognition Conference Workshop.CrossRefGoogle Scholar
Sakschewski, B, Von Bloh, W, Boit, A, Poorter, L, Peña Claros, M, Heinke, J, Joshi, J and Thonicke, K (2016) Resilience of Amazon forests emerges from plant trait diversity. Nature Climate Change.CrossRefGoogle Scholar
Sani-Mohammed, A, Yao, W and Heurich, M (2022) Instance segmentation of standing dead trees in dense forest from aerial imagery using deep learning. ISPRS Open Journal of Photogrammetry and Remote Sensing.CrossRefGoogle Scholar
Santoro, M, Cartus, O, Carvalhais, N, Rozendaal, DMA, Avitabile, V, Araza, A, de Bruin, S, Herold, M, Quegan, S, Rodriguez-Veiga, P, Balzter, H, Carreiras, J, Schepaschenko, D, Korets, M, Shimada, M, Itoh, T, Martínez, M, Cavlovic, J, Cazzolla Gatti, R, da Conceiçao Bispo, P, Dewnath, N, Labrière, N, Liang, J, Lindsell, J, Mitchard, ETA, Morel, A, Pacheco Pascagaza, AM, Ryan, CM, Slik, F, Vaglio Laurin, G, Verbeeck, H, Wijaya, A and Willcock, S (2021) The global forest above-ground biomass pool for 2010 estimated from high-resolution satellite observations. Earth System Science Data.CrossRefGoogle Scholar
Sapes, G, Lapadat, C, Schweiger, AK, Juzwik, J, Montgomery, R, Gholizadeh, H, Townsend, PA, Gamon, JA and Cavender-Bares, J (2022) Canopy spectral reflectance detects oak wilt at the landscape scale using phylogenetic discrimination. Remote Sensing of Environment 273, 112961.CrossRefGoogle Scholar
Schepaschenko, D, Chave, J, Phillips, OL, Lewis, SL, Davies, SJ, Réjou-Méchain, M, Sist, P, Scipal, K, Perger, C, Herault, B, Labrière, N, Hofhansl, F, Affum-Baffoe, K, Aleinikov, A, Alonso, A, Amani, C, Araujo-Murakami, A, Armston, J, Arroyo, L, Ascarrunz, N, Azevedo, C, Baker, T, Bałazy, R, Bedeau, C, Berry, N, Bilous, AM, Bilous, SY, Bissiengou, P, Blanc, L, Bobkova, KS, Braslavskaya, T, Brienen, R, Burslem, DFRP, Condit, R, Cuni-Sanchez, A, Danilina, D, del Castillo Torres, D, Derroire, G, Descroix, L, Sotta, ED, d’Oliveira, MVN, Dresel, C, Erwin, T, Evdokimenko, MD, Falck, J, Feldpausch, TR, Foli, EG, Foster, R, Fritz, S, Garcia-Abril, AD, Gornov, A, Gornova, M, Gothard-Bassébé, E, Gourlet-Fleury, S, Guedes, M, Hamer, KC, Susanty, FH, Higuchi, N, Coronado, ENH, Hubau, W, Hubbell, S, Ilstedt, U, Ivanov, VV, Kanashiro, M, Karlsson, A, Karminov, VN, Killeen, T, Koffi, J-CK, Konovalova, M, Kraxner, F, Krejza, J, Krisnawati, H, Krivobokov, LV, Kuznetsov, MA, Lakyda, I, Lakyda, PI, Licona, JC, Lucas, RM, Lukina, N, Lussetti, D, Malhi, Y, Manzanera, JA, Marimon, B, Junior, BHM, Martinez, RV, Martynenko, OV, Matsala, M, Matyashuk, RK, Mazzei, L, Memiaghe, H, Mendoza, C, Mendoza, AM, Moroziuk, OV, Mukhortova, L, Musa, S, Nazimova, DI, Okuda, T, Oliveira, LC, Ontikov, PV, Osipov, AF, Pietsch, S, Playfair, M, Poulsen, J, Radchenko, VG, Rodney, K, Rozak, AH, Ruschel, A, Rutishauser, E, See, L, Shchepashchenko, M, Shevchenko, N, Shvidenko, A, Silveira, M, Singh, J, Sonké, B, Souza, C, Stereńczak, K, Stonozhenko, L, Sullivan, MJP, Szatniewska, J, Taedoumg, H, ter Steege, H, Tikhonova, E, Toledo, M, Trefilova, OV, Valbuena, R, Gamarra, LV, Vasiliev, S, Vedrova, EF, Verhovets, SV, Vidal, E, Vladimirova, NA, Vleminckx, J, Vos, VA, Vozmitel, FK, Wanek, W, West, TAP, Woell, H, Woods, JT, Wortel, V, Yamada, T, Nur Hajar, ZS and Zo-Bi, IC (2019) The forest observation system, building a global reference dataset for remote sensing of forest biomass. Scientific Data.CrossRefGoogle ScholarPubMed
Schiefer, F, Kattenborn, T, Frick, A, Frey, J, Schall, P, Koch, B and Schmidtlein, S (2020) Mapping forest tree species in high resolution UAV-based RGB-imagery by means of convolutional neural networks. ISPRS Journal of Photogrammetry and Remote Sensing.CrossRefGoogle Scholar
Schiefer, F, Schmidtlein, S, Frick, A, Frey, J, Klinke, R, Zielewska-Büttner, K, Junttila, S, Uhl, A and Kattenborn, T (2023) UAV-based reference data for the prediction of fractional cover of standing deadwood from sentinel time series. ISPRS Open Journal of Photogrammetry and Remote Sensing 8, 100034.CrossRefGoogle Scholar
Schiefer, F, Schmidtlein, S and Kattenborn, T (2021) The retrieval of plant functional traits from canopy spectra through RTM-inversions and statistical models are both critically affected by plant phenology. Ecological Indicators.CrossRefGoogle Scholar
Schiller, C, Schmidtlein, S, Boonman, C, Moreno-Martínez, A and Kattenborn, T (2021) Deep learning and citizen science enable automated plant trait predictions from photographs. Scientific Reports 11(1).CrossRefGoogle ScholarPubMed
Schmitt, M, Hughes, LH, Qiu, C and Zhu, XX (2019) SEN12MS—A curated dataset of georeferenced multi-spectral Sentinel-1/2 imagery for deep learning and data fusion. ISPRS Annals of the Photogrammetry, Remote Sensing and Spatial Information Sciences.CrossRefGoogle Scholar
Schmitt, M and Zhu, XX (2016) Data fusion and remote sensing: An ever-growing relationship. IEEE Geoscience and Remote Sensing Magazine 4(4):623.CrossRefGoogle Scholar
Schneider, FD, Morsdorf, F, Schmid, B, Petchey, OL, Hueni, A, Schimel, DS and Schaepman, ME (2017) Mapping functional diversity from remotely sensed morphological and physiological forest traits. Nature Communications.CrossRefGoogle ScholarPubMed
Seo, B, Bogner, C, Poppenborg, P, Martin, E, Hoffmeister, M, Jun, M, Koellner, T, Reineking, B, Shope, CL and Tenhunen, J (2014) Deriving a per-field land use and land cover map in an agricultural mosaic catchment. Earth System Science Data.CrossRefGoogle Scholar
Shevtsova, I, Heim, B, Kruse, S, Schröder, J, Troeva, EI, Pestryakova, LA, Zakharov, ES and Herzschuh, U (2020) Strong shrub expansion in tundra-taiga, tree infilling in taiga and stable tundra in central Chukotka (north-eastern Siberia) between 2000 and 2017. Environmental Research Letters.CrossRefGoogle Scholar
Shi, Y, Du, L, Guo, Y and Du, Y (2022) Unsupervised domain adaptation based on progressive transfer for ship detection: From optical to SAR images. IEEE Transactions on Geoscience and Remote Sensing 60, 117.Google Scholar
Shorten, C and Khoshgoftaar, TM (2019) A survey on image data augmentation for deep learning. Journal of Big Data 6(1).CrossRefGoogle Scholar
Siddiqui, Y, Valentin, J and Niessner, M (2020) ViewAL: Active learning with viewpoint entropy for semantic segmentation. In Computer Vision and Pattern Recognition Conference.CrossRefGoogle Scholar
Simonyan, K and Zisserman, A (2015) Very deep convolutional networks for large-scale image recognition. In International Conference on Learning Representations.Google Scholar
Sinha, S, Ebrahimi, S and Darrell, T (2019) Variational adversarial active learning. In International Conference on Computer Vision.CrossRefGoogle Scholar
Snell, J, Swersky, K and Zemel, RS (2017) Prototypical networks for few-shot learning. In Conference on Neural Information Processing Systems.Google Scholar
Socher, R, Ganjoo, M, Sridhar, H, Bastani, O, Manning, CD and Ng, AY (2013) Zero-shot learning through cross-modal transfer. In Conference on Neural Information Processing Systems.Google Scholar
Soltani, S, Feilhauer, H, Duker, R and Kattenborn, T (2022) Transfer learning from citizen science photographs enables plant species identification in UAV imagery. Open Journal of Photogrammetry and Remote Sensing.CrossRefGoogle Scholar
Srivastava, N, Hinton, G, Krizhevsky, A, Sutskever, I and Salakhutdinov, R (2014) Dropout: A simple way to prevent neural networks from overfitting. Journal of Machine Learning Research.Google Scholar
Steffen, W, Sanderson, RA, Tyson, PD, Jäger, J, Matson, PA, Moore, B, Oldfield, F, Richardson, K, Schellnhuber, H-J, Turner, BL and Wasson, RJ (2005) Global Change and the Earth System: A Planet under Pressure. Springer Science & Business Media.CrossRefGoogle Scholar
Still, C, Powell, R, Aubrecht, D, Kim, Y, Helliker, B, Roberts, D, Richardson, AD and Goulden, M (2019) Thermal imaging in plant and ecosystem ecology: Applications and challenges. Ecosphere 10(6).CrossRefGoogle Scholar
Sumbul, G, Cinbis, RG and Aksoy, S (2018) Fine-grained object recognition and zero-shot learning in remote sensing imagery. IEEE Transactions on Geoscience and Remote Sensing 56(2), 770779.CrossRefGoogle Scholar
Sumbul, G, de Wall, A, Kreuziger, T, Marcelino, F, Costa, H, Benevides, P, Caetano, M, Demir, B and Markl, V (2021) BigEarthNet-MM: A large-scale, multimodal, multilabel benchmark archive for remote sensing image classification and retrieval [software and data sets]. IEEE Geoscience and Remote Sensing Magazine.CrossRefGoogle Scholar
Sun, X, Wang, P, Lu, W, Zhu, Z, Lu, X, He, Q, Li, J, Rong, X, Yang, Z, Chang, H, He, Q, Yang, G, Wang, R, Lu, J and Fu, K (2022) RingMo: A remote sensing foundation model with masked image Modeling. IEEE Transactions on Geoscience and Remote Sensing 61, 122.Google Scholar
Swain, R, Paul, A and Behera, MD (2023) Spatio-temporal fusion methods for spectral remote sensing: A comprehensive technical review and comparative analysis. Tropical Ecology 65(3), 356375.CrossRefGoogle Scholar
Szegedy, C, Ioffe, S, Vanhoucke, V and Alemi, AA (2017) Inception-v4, inception-ResNet and the impact of residual connections on learning. In Association for the Advancement of Artificial Intelligence Conference on Artificial Intelligence.CrossRefGoogle Scholar
Szegedy, C, Liu, W, Jia, Y, Sermanet, P, Reed, S, Anguelov, D, Erhan, D, Vanhoucke, V and Rabinovich, A (2015) Going deeper with convolutions. In Conference on Computer Vision and Pattern Recognition.CrossRefGoogle Scholar
Szegedy, C, Vanhoucke, V, Ioffe, S, Shlens, J and Wojna, Z (2016) Rethinking the inception architecture for computer vision. In Conference on Computer Vision and Pattern Recognition.CrossRefGoogle Scholar
Tang, H, Ma, L, Lister, A, O’Neill-Dunne, J, Lu, J, Lamb, RL, Dubayah, R and Hurtt, G (2021) High-resolution forest carbon mapping for climate mitigation baselines over the RGGI region, USA. Environmental Research Letters.Google Scholar
Tao, C, Qi, J, Guo, M, Zhu, Q and Li, H (2023) Self-supervised remote sensing feature learning: Learning paradigms, challenges, and future works. IEEE Transactions on Geoscience and Remote Sensing.CrossRefGoogle Scholar
Tarasiou, M, Chavez, E and Zafeiriou, S (2023) ViTs for SITS: Vision transformers for satellite image time series. In Computer Vision and Pattern Recognition Conference.CrossRefGoogle Scholar
The Food and Agriculture Organization of the United Nations (2020) Global Forest Resources Assessment 2020. Food and Agriculture Organization of the United Nations.Google Scholar
Themyr, L, Rambour, C, Thome, N, Collins, T and Hostettler, A (2023) Full Contextual Attention for Multi-Resolution Transformers in Semantic Segmentation. In Winter Conference on Applications of Computer Vision.CrossRefGoogle Scholar
Thompson, I, Mackey, B, McNulty, S and Mosseler, A (2009). Forest Resilience, Biodiversity, and Climate Change. A Synthesis of the Biodiversity/Resilience/Stability Relationship in Forest Ecosystems. Technical Series. Montreal, QC: Secretariat of the Convention on Biological Diversity.Google Scholar
Thonfeld, F, Steinbach, S, Muro, J and Kirimi, F (2020) Long-term land use/land cover change Assessment of the Kilombero catchment in Tanzania using random forest classification and robust change vector analysis. Remote Sensing.CrossRefGoogle Scholar
Tolan, J, Yang, H-I, Nosarzewski, B, Couairon, G, Vo, H, Brandt, J, Spore, J, Majumdar, S, Haziza, D, Vamaraju, J, Moutakani, T, Bojanowski, P, Johns, T, White, B, Tiecke, T and Couprie, C (2023) Sub-meter resolution canopy height maps using self-supervised learning and a vision transformer trained on aerial and GEDI lidar. Preprint, arXiv.Google Scholar
Touvron, H, Cord, M, Douze, M, Massa, F, Sablayrolles, A and Jégou, H (2021) Training data-efficient image transformers & distillation through attention. In International Conference on Machine Learning.Google Scholar
Touvron, H, Lavril, T, Izacard, G, Martinet, X, Lachaux, M-A, Lacroix, T, Rozière, B, Goyal, N, Hambro, E, Azhar, F, Rodriguez, A, Joulin, A, Grave, E and Lample, G (2023) LLaMA: Open and efficient foundation language models. Preprint, arXiv.Google Scholar
Tremblay, J-F, Béland, M, Gagnon, R, Pomerleau, F and Giguère, P (2020) Automatic three-dimensional mapping for tree diameter measurements in inventory operations. Journal of Field Robotics.CrossRefGoogle Scholar
Trumbore, S, Brando, P and Hartmann, H (2015) Forest health and global change. Science.CrossRefGoogle ScholarPubMed
Tseng, G, Zvonkov, I, Purohit, M, Rolnick, D and Kerner, H (2023) Lightweight, pre-trained transformers for remote sensing timeseries. Preprint, arXiv.Google Scholar
Tucker, C, Brandt, M, Hiernaux, P, Kariryaa, A, Rasmussen, K, Small, J, Igel, C, Reiner, F, Melocik, K, Meyer, J, Sinno, S, Romero, E, Glennie, E, Fitts, Y, Morin, A, Pinzon, J, McClain, D, Morin, P, Porter, C, Loeffler, S, Kergoat, L, Issoufou, B-A, Savadogo, P, Wigneron, J-P, Poulter, B, Ciais, P, Kaufmann, R, Myneni, R, Saatchi, S and Fensholt, R (2023) Sub-continental-scale carbon stocks of individual trees in African drylands. Nature.CrossRefGoogle ScholarPubMed
Tuia, D, Volpi, M, Copa, L, Kanevski, M and Munoz-Mari, J (2011) A survey of active learning algorithms for supervised remote sensing image classification. IEEE Journal of Selected Topics in Signal Processing. 5(3), 606617CrossRefGoogle Scholar
Turubanova, S, Potapov, PV, Tyukavina, A and Hansen, MC (2018) Ongoing primary forest loss in Brazil, Democratic Republic of the Congo, and Indonesia. Environmental Research Letters.CrossRefGoogle Scholar
van Geffen, F, Heim, B, Brieger, F, Geng, R, Shevtsova, IA, Schulte, L, Stuenzi, SM, Bernhardt, N, Troeva, EI, Pestryakova, LA, Zakharov, ES, Pflug, B, Herzschuh, U and Kruse, S (2022) SiDroForest: A comprehensive forest inventory of Siberian boreal forest investigations including drone-based point clouds, individually labeled trees, synthetically generated tree crowns, and Sentinel-2 labeled image patches. Earth System Science Data.CrossRefGoogle Scholar
van Lierop, P, Lindquist, E, Sathyapala, S and Franceschini, G (2015) Global forest area disturbance from fire, insect pests, diseases and severe weather events. Forest Ecology and Management.CrossRefGoogle Scholar
Verbeeck, H, Bauters, M, Jackson, T, Shenkin, A, Disney, M and Calders, K (2019) Time for a plant structural economics Spectrum. Frontiers in Forests and Global Change 2.CrossRefGoogle Scholar
Verhegghen, A, Kuzelova, K, Syrris, V, Eva, H and Achard, F (2022) Mapping canopy cover in African dry forests from the combined use of Sentinel-1 and Sentinel-2 data: Application to Tanzania for the year 2018. Remote Sensing.CrossRefGoogle Scholar
Verrelst, J, Camps-Valls, G, Muñoz Marí, J, Rivera, JP, Veroustraete, F, Clevers, JG and Moreno, J (2015) Optical remote sensing and the retrieval of terrestrial vegetation bio-geophysical properties—A review. ISPRS Journal of Photogrammetry and Remote Sensing.CrossRefGoogle Scholar
Vinyals, O, Blundell, C, Lillicrap, T, Kavukcuoglu, K and Wierstra, D (2016) Matching networks for one shot learning. In Conference on Neural Information Processing Systems.Google Scholar
Vu, T-H, Jain, H, Bucher, M, Cord, M and Pérez, P (2019) ADVENT: Adversarial entropy minimization for domain adaptation in semantic segmentation. In Computer Vision and Pattern Recognition Conference.CrossRefGoogle Scholar
Wagner, FH (2021) The flowering of Atlantic Forest Pleroma trees. Scientific Reports.CrossRefGoogle ScholarPubMed
Wagner, FH, Roberts, S, Ritz, AL, Carter, G, Dalagnol, R, Favrichon, S, Hirye, MC, Brandt, M, Ciais, P and Saatchi, S (2023) Sub-meter tree height mapping of California using aerial images and LiDAR-informed U-net model.CrossRefGoogle Scholar
Wang, JA, Sulla-Menashe, D, Woodcock, CE, Sonnentag, O, Keeling, RF and Friedl, MA (2020a) Extensive land cover change across Arctic-boreal northwestern North America from disturbance and climate forcing. Global Change Biology 26(2), 807822.CrossRefGoogle ScholarPubMed
Wang, R and Gamon, JA (2019) Remote sensing of terrestrial plant biodiversity. Remote Sensing of Environment.Google Scholar
Wang, S, Chen, W, Xie, SM, Azzari, G and Lobell, DB (2020b) Weakly supervised deep learning for segmentation of remote sensing imagery. Remote Sensing 12(2), 207.CrossRefGoogle Scholar
Wang, Y, Feng, L, Sun, W, Zhang, Z, Zhang, H, Yang, G and Meng, X (2022) Exploring the potential of multi-source unsupervised domain adaptation in crop mapping using Sentinel-2 images. GIScience & Remote Sensing 59(1), 22472265.CrossRefGoogle Scholar
Weinstein, BG, Graves, SJ, Marconi, S, Singh, A, Zare, A, Stewart, D, Bohlman, SA and White, EP (2021a) A benchmark dataset for canopy crown detection and delineation in co-registered airborne RGB, LiDAR and hyperspectral imagery from the national ecological observation network. PLoS Computational Biology.CrossRefGoogle ScholarPubMed
Weinstein, BG, Marconi, S, Bohlman, SA, Zare, A, Singh, A, Graves, SJ and White, EP (2021b) A remote sensing derived data set of 100 million individual tree crowns for the national ecological observatory network. eLife.CrossRefGoogle ScholarPubMed
Weiser, H, Schäfer, J, Winiwarter, L, Krašovec, N, Fassnacht, FE and Höfle, B (2022) Individual tree point clouds and tree measurements from multi-platform laser scanning in German forests. Earth System Science Data.CrossRefGoogle Scholar
Weiss, K, Khoshgoftaar, TM and Wang, D (2016) A survey of transfer learning. Journal of Big Data 3(1).CrossRefGoogle Scholar
White, JC, Coops, NC, Wulder, MA, Vastaranta, M, Hilker, T and Tompalski, P (2016) Remote sensing technologies for enhancing forest inventories: A review. Canadian Journal of Remote Sensing.CrossRefGoogle Scholar
Wilson, G and Cook, DJ (2020) A survey of unsupervised deep domain adaptation. ACM Transactions on Intelligent Systems and Technology.CrossRefGoogle ScholarPubMed
Wulder, MA, Masek, JG, Cohen, WB, Loveland, TR and Woodcock, CE (2012) Opening the archive: How free data has enabled the science and monitoring promise of Landsat. Remote Sensing of Environment 122, 210.CrossRefGoogle Scholar
Wulder, MA, Roy, DP, Radeloff, VC, Loveland, TR, Anderson, MC, Johnson, DMS, Zhu, Z, Scambos, TA, Pahlevan, N, Hansen, M, Gorelick, N, Crawford, CJ, Masek, JG, Hermosilla, T, White, JC, Belward, AS, Schaaf, C, Woodcock, CE, Huntington, JL, Lymburner, L, Hostert, P, Gao, F, Lyapustin, A, Pekel, J-F, Strobl, P and Cook, BD (2022) Fifty years of Landsat science and impacts. Remote Sensing of Environment 280, 113195.CrossRefGoogle Scholar
Xian, Y, Lampert, CH, Schiele, B and Akata, Z (2018) Zero-shot learning—A comprehensive evaluation of the good, the bad and the ugly. IEEE Transactions on Pattern Analysis and Machine Intelligence.Google ScholarPubMed
Xu, J, De Mello, S, Liu, S, Byeon, W, Breuel, T, Kautz, J and Wang, X (2022) GroupViT: Semantic segmentation emerges from text supervision. In Computer Vision and Pattern Recognition Conference.CrossRefGoogle Scholar
Xu, Q, Shi, Y, Yuan, X and Zhu, XX (2023) Universal domain adaptation for remote sensing image scene classification. In IEEE International Geoscience and Remote Sensing Symposium.CrossRefGoogle Scholar
Yadav, R, Nascetti, A, Azizpour, H and Ban, Y (2022) Unsupervised flood detection on SAR time series. Preprint, arXiv.Google Scholar
Yamazaki, K, Hanyu, T, Tran, M, Garcia, A, Tran, A, McCann, R, Liao, H, Rainwater, C, Adkins, M, Molthan, A, Cothren, J and Le, N (2023) AerialFormer: Multi-resolution transformer for aerial image segmentation. Preprint, arXiv.Google Scholar
Yang, B, Luo, W and Urtasun, R (2018) PIXOR: Real-time 3D object detection from point clouds. In Conference on Computer Vision and Pattern Recognition.CrossRefGoogle Scholar
Yang, L, Liang, S and Zhang, Y (2020) A new method for generating a global forest aboveground biomass map from multiple high-level satellite products and ancillary information. IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.CrossRefGoogle Scholar
Yao, X, Feng, X, Han, J, Cheng, G and Guo, L (2021) Automatic weakly supervised object detection from high spatial resolution remote sensing images via dynamic curriculum learning. IEEE Transactions on Geoscience and Remote Sensing 59(1), 675685.CrossRefGoogle Scholar
Yao, X, Han, J, Cheng, G, Qian, X and Guo, L (2016) Semantic annotation of high-resolution satellite images via weakly supervised learning. IEEE Transactions on Geoscience and Remote Sensing 54(6), 36603671.CrossRefGoogle Scholar
Yazdani, A, Lu, L, Raissi, M and Karniadakis, GE (2020) Systems biology informed deep learning for inferring parameters and hidden dynamics. PLoS Computational Biology 16(11), e1007575.CrossRefGoogle ScholarPubMed
Yoo, D and Kweon, IS (2019) Learning loss for active learning. In Computer Vision and Pattern Recognition Conference.CrossRefGoogle Scholar
Yuan, X, Shi, J and Gu, L (2021) A review of deep learning methods for semantic segmentation of remote sensing imagery. Expert Systems with Applications 169, 114417.CrossRefGoogle Scholar
Yue, K, Yang, L, Li, R, Hu, W, Zhang, F and Li, W (2019) TreeUNet: Adaptive tree convolutional neural networks for subdecimeter aerial image segmentation. ISPRS Journal of Photogrammetry and Remote Sensing 156, 113.CrossRefGoogle Scholar
Zanne, AE, Tank, DC, Cornwell, WK, Eastman, JM, Smith, SA, FitzJohn, RG, McGlinn, DJ, O’Meara, BC, Moles, AT, Reich, PB, Royer, DL, Soltis, DE, Stevens, PF, Westoby, M, Wright, IJ, Aarssen, L, Bertin, RI, Calaminus, A, Govaerts, R, Hemmings, F, Leishman, MR, Oleksyn, J, Soltis, PS, Swenson, NG, Warman, L and Beaulieu, JM (2014) Three keys to the radiation of angiosperms into freezing environments. Nature 506(7486), 8992.CrossRefGoogle Scholar
Zarco-Tejada, P, Camino, C, Beck, P, Calderon, R, Hornero, A, Hernández-Clemente, R, Kattenborn, T, Montes-Borrego, M, Susca, L, Morelli, M, Gonzalez-Dugo, V, PRJ, North, Landa, BB, Boscia, D, Saponari, M and Navas-Cortes, JA (2018) Previsual symptoms of Xylella fastidiosa infection revealed in spectral plant-trait alterations. Nature Plants.CrossRefGoogle ScholarPubMed
Zarco-Tejada, P, Hornero, A, Beck, P, Kattenborn, T, Kempeneers, P and Hernández-Clemente, R (2019) Chlorophyll content estimation in an open-canopy conifer forest with sentinel-2A and hyperspectral imagery in the context of forest decline. Remote Sensing of Environment 223, 320335.CrossRefGoogle Scholar
Zhang, B, Ming, Z, Feng, W, Liu, Y, He, L and Zhao, K (2023) MMFormer: Multimodal transformer using multiscale self-attention for remote sensing image classification. Preprint, arXiv.CrossRefGoogle Scholar
Zhang, C, Bengio, S, Hardt, M, Recht, B and Vinyals, O (2017) Understanding deep learning requires rethinking generalization. In International Conference on Learning Representations.Google Scholar
Zhang, D, Han, J, Cheng, G, Liu, Z, Bu, S and Guo, L (2015) Weakly supervised learning for target detection in remote sensing images. IEEE Geoscience and Remote Sensing Letters 12(4), 701705.CrossRefGoogle Scholar
Zhang, P, Bai, Y, Wang, D, Bai, B and Li, Y (2020) Few-shot classification of aerial scene images via meta-learning. Remote Sensing 13(1), 108.CrossRefGoogle Scholar
Zhang, X, Liu, L, Chen, X, Gao, Y, Xie, S and Mi, J (2021) GLC_fcs30: Global land-cover product with fine classification system at 30 m using time-series Landsat imagery. Earth System Science Data.Google Scholar
Zhang, Z, Pasolli, E, Crawford, MM and Tilton, JC (2016) An active learning framework for hyperspectral image classification using hierarchical segmentation. IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing 9(2), 640654.CrossRefGoogle Scholar
Zhang, Z and Saligrama, V (2015) Zero-shot learning via semantic similarity embedding. In International Conference on Computer Vision.CrossRefGoogle Scholar
Zheng, M, Wang, F, You, S, Qian, C, Zhang, C, Wang, X and Xu, C (2021) Weakly supervised contrastive learning. In International Conference on Computer Vision.CrossRefGoogle Scholar
Zhou, Z-H (2018) A brief introduction to weakly supervised learning. National Science Review 5(1), 4453.CrossRefGoogle Scholar
Zhu, K, Zhang, J, Niu, S, Chu, C and Luo, Y (2018) Limits to growth of forest biomass carbon sink under climate change. Nature Communications 9(1).CrossRefGoogle ScholarPubMed
Zhu, X, Zhu, J, Li, H, Wu, X, Wang, X, Li, H, Wang, X and Dai, J (2022) Uni-perceiver: Pre-training unified architecture for generic perception for zero-shot and few-shot tasks. In Computer Vision and Pattern Recognition Conference.CrossRefGoogle Scholar
Zianis, D, Muukkonen, P, Mäkipää, R and Mencuccini, M (2005) Biomass and Stem Volume Equations for Tree Species in Europe. Silva Fennica Monographs.Google Scholar
Zou, H and Hastie, T (2005) Regularization and variable selection via the elastic net. Journal of the Royal Statistical Society. Series B (Statistical Methodology).Google Scholar
Figure 0

Figure 1. Overview of forest monitoring topics and challenges associated with machine learning perspectives and challenges. Note: Each forest monitoring topic and its challenges are detailed with their corresponding section number (in red). They are associated with the three main machine learning perspectives and challenge categories, namely generalization, limited data, and domain-specific objectives, along with their corresponding section number (in red).

Figure 1

Figure 2. Illustration of forest monitoring datasets at different scales. Note: Inventories are in situ measurements realized at the tree level. Ground-based datasets are recorded within or below the canopy of the trees. Aerial datasets are composed of recordings from sensors mounted on unoccupied (drones) or occupied aircrafts. Satellite datasets are collected from sensors mounted on satellites orbiting the Earth. Map datasets are generated at the country or world level using datasets at the aerial or satellite scales.

Figure 2

Figure 3. Distribution of the reviewed open-access forest datasets. Note: (Left) World map of the location of the reviewed datasets at the country level. Most of the datasets are regional and do not reflect the entire associated country. The datasets categorized with a “Worldwide” location or at the continent level have been excluded for visualization purposes. (Right) Distributions of the publication years and recording years used and/or released in the associated datasets.

Figure 3

Table 1. Review of open-access forest inventories datasets

Figure 4

Table 2. Review of open-access ground-based forest datasets

Figure 5

Table 3. Review of open-access aerial forest datasets

Figure 6

Table 4. Review of open-access satellite forest datasets before 2020 (included)

Figure 7

Table 5. Review of satellite recording datasets after 2021 (included)

Figure 8

Table 6. Review of open-access map forest datasets before 2019 (included)

Figure 9

Table 7. Review of open-access map forest datasets after 2020 (included)

Figure 10

Table 8. Review of open-access mixed forest datasets, including inventories and aerial-based (IA); inventories, aerial-based, and satellite-based (IAS); and inventories and maps (IM)

Figure 11

Table 9. Review of open-access mixed forest datasets, including ground-based and aerial-based (GA); aerial-based and satellite-based (AS); aerial-based and maps (AM); and satellite-based and maps (SM)

Author comment: OpenForest: a data catalog for machine learning in forest monitoring — R0/PR1

Comments

Dear Editors.

We are writing to submit our manuscript “OpenForest: A data catalogue for machine learning in forest monitoring” to the Environmental Data Science journal.

In the context of a climate emergency, there is an urgent need to monitor forests worldwide.

This is essential for maintaining ecological equilibrium, as it helps mitigate human impacts and enhances our comprehension of forest composition.

This work aims to foster interest among both the machine learning and the forest biology communities regarding ongoing research topics and challenges for forest monitoring.

Forest biology research topics and their current challenges are discussed to target potential areas for future research within the community.

Machine learning methods are also introduced bringing the potential to explore and tackle forest biology challenges.

As both biology challenges and machine learning methods require a large source of available data, a clear review of open source datasets is also proposed.

To highlight and increase the research trend in these fields, the OpenForest dynamic catalog is publicly released to centralize all open source available datasets for forest monitoring while being open to updates by the community.

The overall objective of this work is to foster communication, inspire new applications of machine learning in forest monitoring, and motivate advancements in this field.

Review: OpenForest: a data catalog for machine learning in forest monitoring — R0/PR2

Conflict of interest statement

Reviewer declares none.

Comments

The authors present a new dataset of open data on forests that can be useful for machine and deep learning applications. This effort is excellent and highly needed as forest are becoming a key element of discussion in climate mitigation and adaptation.

The article is very well written and will be quite well received by the remote sensing and forestry community.

I have only few minor comments on the introductory part and second section:

Line 47 – it should be mentioned that the projected carbon sequestration related to the use of forest as nature based solution can be limited by the climate effects on forest that we are witnessing, so those numbers require constant updates.

https://www.nature.com/articles/s41467-018-05132-5

Figure 1 – I wonder if within the challenges one should merge biotic and abiotic factors into 1 category “disturbance type detection”

Figure 1 – The forest biomass is not listed as challenge. Although at a certain spatial resolution this is true, biomass estimation at high resolution, and most important biomass stock changes are key challenges for the remote sensing community, and very important both scientifically but also for monitoring reporting and verification activities in policy and carbon credit context

Figure 1 – monitoring forest management is also a key challenge in my opinion missing

I would also suggest to highlight these challenges in the section 2.1

Page 7 line 4

Clarify what do you mean by “these” are traits or ecosystem functional properties?

I would merge section 2.2.3 and 2.2.4 into one section and call it monitoring forest stress and attribution to biotic and abiotic factors. In my opinion, monitoring the effects of these factors on forest functioning is relatively straightforward, but it is the attribution of the observed stress to the cause the main area we need to develop, also to be able to map the type of disturbance at large scale

I found the machine learning section very well structured.

For the Inventories the authors can consider to access some inventories such as the one from Sweden and Catalonia, which can be freely requested

Recommendation: OpenForest: a data catalog for machine learning in forest monitoring — R0/PR3

Comments

Dear authors,

you probably saw my earlier message about having problems finding reviewers. We finally got one back and for data papers we only need one review. The reviewer is very enthusiastic about the paper and I personally agree with this assessment. I would encourage you to address the minor issues raised so that we can proceed with the publication of this paper.

Best regards,

Miguel Mahecha

Decision: OpenForest: a data catalog for machine learning in forest monitoring — R0/PR4

Comments

No accompanying comment.

Author comment: OpenForest: a data catalog for machine learning in forest monitoring — R1/PR5

Comments

Dear Editors.

We are writing to submit our manuscript “OpenForest: A data catalogue for machine learning in forest monitoring” to the Environmental Data Science journal.

In the context of a climate emergency, there is an urgent need to monitor forests worldwide.

This is essential for maintaining ecological equilibrium, as it helps mitigate human impacts and enhances our comprehension of forest composition.

This work aims to foster interest among both the machine learning and the forest biology communities regarding ongoing research topics and challenges for forest monitoring.

Forest biology research topics and their current challenges are discussed to target potential areas for future research within the community.

Machine learning methods are also introduced bringing the potential to explore and tackle forest biology challenges.

As both biology challenges and machine learning methods require a large source of available data, a clear review of open source datasets is also proposed.

To highlight and increase the research trend in these fields, the OpenForest dynamic catalogue is publicly released to centralize all open source available datasets for forest monitoring while being open to updates by the community.

The overall objective of this work is to foster communication, inspire new applications of machine learning in forest monitoring, and motivate advancements in this field.

Review: OpenForest: a data catalog for machine learning in forest monitoring — R1/PR6

Conflict of interest statement

Reviewer declares none.

Comments

The authors addressed the comments and I am fine with the manuscript

Recommendation: OpenForest: a data catalog for machine learning in forest monitoring — R1/PR7

Comments

Dear Dr. Ouaknine: The reviewer and myself are happy with the revisions; from my side the paper is ready to go to production. Many thanks for submitting the paper to us.

Best wishes, Miguel Mahecha

Decision: OpenForest: a data catalog for machine learning in forest monitoring — R1/PR8

Comments

No accompanying comment.