RG-CAT: Detection pipeline and catalogue of radio galaxies in the EMU pilot survey

Nikhel Gupta; Ray P. Norris; Zeeshan Hayder; Minh Huynh; Lars Petersson; X. Rosalind Wang; Andrew M. Hopkins; Heinz Andernach; Yjan Gordon; Simone Riggi; Miranda Yew; Evan J. Crawford; Bärbel Koribalski; Miroslav D. Filipović; Anna D. Kapińska; Stanislav Shabala; Tessa Vernstrom; Joshua R. Marvil

doi:10.1017/pasa.2024.25

RG-CAT: Detection pipeline and catalogue of radio galaxies in the EMU pilot survey

Published online by Cambridge University Press: 01 April 2024

Minh Huynh ,

Yjan Gordon and

Nikhel Gupta*: Affiliation:
CSIRO Space & Astronomy, Bentley, WA, Australia
Ray P. Norris: Affiliation:
Western Sydney University, Penrith, NSW, Australia CSIRO Space & Astronomy, Epping, NSW, Australia
Zeeshan Hayder: Affiliation:
CSIRO Data61, Black Mountain, ACT, Australia
Minh Huynh: Affiliation:
CSIRO Space & Astronomy, Bentley, WA, Australia International Centre for Radio Astronomy Research, The University of Western Australia, Crawley, WA, Australia
Lars Petersson: Affiliation:
CSIRO Data61, Black Mountain, ACT, Australia
X. Rosalind Wang: Affiliation:
Western Sydney University, Penrith, NSW, Australia
Andrew M. Hopkins: Affiliation:
Western Sydney University, Penrith, NSW, Australia School of Mathematical and Physical Sciences, Macquarie University, Sydney, NSW, Australia
Heinz Andernach: Affiliation:
Thüringer Landessternwarte, Tautenburg, Germany Departamento de Astronomía, Universidad de Guanajuato, Guanajuato, GTO, Mexico
Yjan Gordon: Affiliation:
Department of Physics, University of Wisconsin-Madison, Madison, WI, USA
Simone Riggi: Affiliation:
INAF-Osservatorio Astrofisico di Catania, Catania, Italy
Miranda Yew: Affiliation:
Western Sydney University, Penrith, NSW, Australia
Evan J. Crawford: Affiliation:
Western Sydney University, Penrith, NSW, Australia
Bärbel Koribalski: Affiliation:
Western Sydney University, Penrith, NSW, Australia CSIRO Space & Astronomy, Epping, NSW, Australia
Miroslav D. Filipović: Affiliation:
Western Sydney University, Penrith, NSW, Australia
Anna D. Kapińska: Affiliation:
National Radio Astronomy Observatory, Socorro, NM, USA
Stanislav Shabala: Affiliation:
School of Natural Sciences, University of Tasmania, Hobart, Australia
Tessa Vernstrom: Affiliation:
CSIRO Space & Astronomy, Bentley, WA, Australia International Centre for Radio Astronomy Research, The University of Western Australia, Crawley, WA, Australia
Joshua R. Marvil: Affiliation:
INAF-Osservatorio Astrofisico di Catania, Catania, Italy
*: Corresponding author: Nikhel Gupta; Email: [email protected]

Article contents

Abstract
Introduction
Data
Detection pipeline for radio galaxies
Catalogue construction pipeline
Catalogue descprition
Summary
Data availability
Footnotes
References

Rights & Permissions

Abstract

We present source detection and catalogue construction pipelines to build the first catalogue of radio galaxies from the 270 $\rm deg^2$ pilot survey of the Evolutionary Map of the Universe (EMU-PS) conducted with the Australian Square Kilometre Array Pathfinder (ASKAP) telescope. The detection pipeline uses Gal-DINO computer vision networks (Gupta et al. 2024, PASA, 41, e001) to predict the categories of radio morphology and bounding boxes for radio sources, as well as their potential infrared host positions. The Gal-DINO network is trained and evaluated on approximately 5 000 visually inspected radio galaxies and their infrared hosts, encompassing both compact and extended radio morphologies. We find that the Intersection over Union (IoU) for the predicted and ground-truth bounding boxes is larger than 0.5 for 99% of the radio sources, and 98% of predicted host positions are within $3^{\prime \prime}$ of the ground-truth infrared host in the evaluation set. The catalogue construction pipeline uses the predictions of the trained network on the radio and infrared image cutouts based on the catalogue of radio components identified using the Selavy source finder algorithm. Confidence scores of the predictions are then used to prioritise Selavy components with higher scores and incorporate them first into the catalogue. This results in identifications for a total of 211 625 radio sources, with 201 211 classified as compact and unresolved. The remaining 10 414 are categorised as extended radio morphologies, including 582 FR-I, 5 602 FR-II, 1 494 FR-x (uncertain whether FR-I or FR-II), 2 375 R (single-peak resolved) radio galaxies, and 361 with peculiar and other rare morphologies. Each source in the catalogue includes a confidence score. We cross-match the radio sources in the catalogue with the infrared and optical catalogues, finding infrared cross-matches for 73% and photometric redshifts for 36% of the radio galaxies. The EMU-PS catalogue and the detection pipelines presented here will be used towards constructing catalogues for the main EMU survey covering the full southern sky.

Keywords

Galaxies: active galaxies: peculiar radio continuum: galaxies Galaxy: evolution methods: data analysis

Type: Research Article
Information: Publications of the Astronomical Society of Australia , Volume 41 , 2024 , e027

DOI: https://doi.org/10.1017/pasa.2024.25 [Opens in a new window]

NASA ADS Abstract Service [Opens in a new window]
Creative Commons: This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright: © The Author(s), 2024. Published by Cambridge University Press on behalf of Astronomical Society of Australia

1. Introduction

Radio galaxies remain enigmatic subjects in the realm of astronomy, with much still to be uncovered. The majority of radio galaxies host Active Galactic Nuclei (AGN), which commonly emit more energy in the radio part of the electromagnetic spectrum than in other wavelengths, such as optical or infrared. While progress has been made in understanding some aspects, there are key questions that elude us. For instance, the precise triggers that activate their powerful radio emission or jets, their interplay with the intergalactic medium, their magnetic field structure, as well as how they influence the broader cosmic environment, remain unresolved. While in the majority of previous radio surveys, the sources appear as unresolved, with better angular resolution and sensitivity of radio telescopes leads to the detection of a greater number of radio galaxies characterised by intricate, extended structures (see e.g. Norris Reference Norris2017). These structures often consist of multiple components, each displaying distinctive peaks in radio emission. The morphologies of equatorially symmetrical extended radio emission from galaxies are broadly classified into two categories: Fanaroff–Riley Class I (FR-I)- and Class II (FR-II)-type radio galaxies (Fanaroff & Riley Reference Fanaroff and Riley1974). These radio galaxies produce highly collimated jets emerging in opposite directions from AGN at the centre of the host galaxy. As the distance from the host galaxy increases, the surface brightness of FR-I radio galaxies decreases. In contrast, FRII radio galaxies typically feature linear jets terminating in high-brightness hotspots of large radio lobes. In some instances, a bipolar jet from an FR-II-type radio galaxy may transition into two distinct radio lobes unconnected by radio emission to the host galaxy. Consequently, FR-I and FR-II radio galaxies are commonly referred to as edge-darkened and edge-brightened AGN, respectively.

The emergence of new technologies such as phased array feed receivers (PAF; Hay et al. Reference Hay, O’Sullivan, Kot, Granet, Lacoste and Ouwehand2006) has enabled swift scanning of extensive portions of the sky, facilitating rapid surveys of large sky areas at radio wavelengths. Such advancements have opened up new avenues for detecting millions of radio galaxies. For example, the ongoing Evolutionary Map of the Universe (EMU; Norris et al. Reference Norris2021a) survey, conducted with the Australian Square Kilometre Array Pathfinder (ASKAP; Johnston et al. Reference Johnston2007; DeBoer et al. Reference DeBoer2009; Hotan et al. Reference Hotan2021) telescope, aims to discover over 40 million compact and extended radio sources within five years (Norris et al. Reference Norris2021a). Similarly, the omnidirectional dipole antennas employed in the Low-Frequency Array (LOFAR; van Haarlem et al. Reference van Haarlem2013) survey, which spans the entire northern sky, are expected to detect over 15 million radio sources. Additional cutting-edge radio survey telescopes include Murchison Widefield Array (MWA; Wayth et al. Reference Wayth2018), MeerKAT (Jonas & MeerKAT Team 2016), and the Karl G. Jansky Very Large Array (JVLA Perley et al. Reference Perley, Chandler, Butler and Wrobel2011). The Very Large Array Sky Survey (VLASS; Lacy et al. Reference Lacy2020) conducted by JVLA is expected to detect around 5 million radio galaxies. The forthcoming Square Kilometre Array (SKAFootnote a) radio telescope is expected to further escalate the number of galaxy detections, potentially reaching hundreds of millions. Such a vast dataset will significantly impact our understanding of the physics underlying galaxy evolution. In essence, the detection of radio galaxies at various stages of their existence holds immense potential for uncovering concealed dimensions of their behaviour, leading to new insights into their evolutionary processes. To fully harness the potential of these radio surveys, it is necessary to redesign the techniques employed in constructing catalogues of radio galaxies.

Grouping the associated components of extended radio galaxies is a necessary step in creating catalogues of radio galaxies. This is essential for estimating the actual number density and total integrated flux density of radio galaxies. Incorrectly grouping associated components or erroneously grouping unassociated components can lead to the misestimation of number density and total flux density, resulting in inaccurate models. While some analytical approaches are being developed (e.g. Gordon et al. Reference Gordon2023), currently, the cross-identification of associated radio galaxy components primarily relies on visual inspections (e.g. Banfield et al. Reference Banfield2015). The widely used source extraction algorithms, such as Selavy (Whiting & Humphreys Reference Whiting and Humphreys2012) and AEGEAN (Hancock et al. Reference Hancock, Trott and Hurley-Walker2018), are prone to potential confusion when dealing with detached lobes of extended radio galaxies, as well as neighbouring unassociated compact radio galaxies. However, visual inspections are time-consuming, require scientific expertise, and are not scalable to the millions of radio galaxies expected to be discovered in the next few years.

This underscores the vital need to develop computer vision methods for grouping associated radio galaxy components. The nature of available data typically determines the trajectory of computer vision tasks, categorising them into four primary methods: self-supervised, semi-supervised, weakly supervised and supervised. Self-supervised learning involves the utilisation of unsupervised techniques to train models on the underlying data structure, thereby eliminating the necessity for explicit annotations. This has proven effective in discovering new radio morphologies in radio surveys (e.g. Galvin et al. Reference Galvin2020; Mostert et al. Reference Mostert2021; Gupta et al. Reference Gupta2022). Semi-supervised learning combines labelled and unlabelled data during the training process, as demonstrated in the classification of radio galaxies (Slijepcevic et al. Reference Slijepcevic2022). In weakly supervised learning, indirect labels are leveraged for the entire training dataset, serving as a supervisory signal. This specific approach has found utility in the classification and detection of extended radio galaxies (Gupta et al. Reference Gupta2023). In supervised learning, the model undergoes training using image-label pairs, where these labels provide complete information required for the model to make specific predictions. Recently, machine learning (ML) techniques, as exemplified by studies, such as Lukic et al. (Reference Lukic2018), Alger et al. (Reference Alger2018), Wu et al. (Reference Wu2019), Bowles et al. (Reference Bowles, Scaife, Porter, Tang and Bastien2020), Maslej-Krešňáková et al. (Reference Maslej-Krešňáková, El Bouchefry and Butka2021), Becker et al. (Reference Becker, Vaccari, Prescott and Grobler2021), Brand et al. (Reference Brand2023), Riggi et al. (Reference Riggi2023), Sortino et al. (Reference Sortino2023), Lao et al. (Reference Lao2023), and Gupta et al. (Reference Gupta, Hayder, Norris, Huynh and Petersson2024), have found application in the morphological classification and detection of radio galaxies.

This paper builds upon the RadioGalaxyNET dataset (Gupta et al. Reference Gupta, Norris, Huynh, Hayder and Petersson2023b) and computer vision algorithms (Gupta et al. Reference Gupta, Hayder, Norris, Huynh and Petersson2023a) designed to address the challenge of associating radio galaxy components (Gupta et al. Reference Gupta, Hayder, Norris, Huynh and Petersson2024). The RadioGalaxyNET dataset was curated by professional astronomers through multiple visual inspections and includes multimodal images of the radio and infrared sky, along with pixel-level labels on associated components of extended radio galaxies and their infrared hosts. In addition to the 2 800 extended radio galaxies in RadioGalaxyNET, we visually inspected and labelled approximately 2 100 compact radio galaxies and 99 sources with peculiar and other rare morphologies in the present work. The annotations comprise class information on the radio-morphological class for these radio galaxies, bounding box information to capture associated components of each radio galaxy, segmentation masks for radio emission, and positions of their host galaxies in infrared images. Using this comprehensive dataset of extended and compact radio galaxies and their infrared hosts, we train the Gal-DINO multimodal model (Gupta et al. Reference Gupta, Hayder, Norris, Huynh and Petersson2024) to simultaneously predict bounding boxes for radio galaxies and potential keypoint positions of their infrared hosts, where a keypoint in ML refers to a specific point or landmark in images. These detections are subsequently employed to generate the first catalogue of compact and extended radio galaxies observed in the first EMU pilot survey (EMU-PS) conducted with the ASKAP telescope.

The structure of the paper is outlined as follows. In Section 2, we explain the radio and infrared observations, image characteristics, and the labels utilised for training and assessing the computer vision networks. Section 3 presents the Gal-DINO network, encompassing training and evaluation specifics, along with the outcomes of network evaluation. Details about the catalogue construction pipeline are presented in Section 4. Section 5 provides detailed information regarding the catalogue. We summarise our findings in Section 6 and provide directions for future work.

2. Data

In this section, we describe the radio and infrared observations, as well as the annotations developed and used for training the computer vision model to construct a consolidated catalogue.

2.1. ASKAP observations

ASKAP, situated at Inyarrimahnha Ilgari Bundara, the Murchison Radio-astronomy Observatory (MRO), is a radio telescope equipped with PAF technology, enabling high survey speed through its wide instantaneous field of view. Comprising 36 antennas with various baselines, the majority are concentrated within a 2.3 km diameter region, while the outer six extend the baselines up to 6.4 km (Hotan et al. Reference Hotan2021). Recently, ASKAP concluded the first all-sky Rapid ASKAP Continuum Survey (RACS) (McConnell et al. Reference McConnell2020), covering the entire sky south of Declination $+41^{\circ}$ with a median RMS of about 250 $\unicode{x03BC}$ Jy/beam. This has paved the way for subsequent deeper surveys using ASKAP.

The EMU, designed to observe the entire Southern Sky and potentially catalogue up to 40 million radio sources, is underway (EMU; Norris Reference Norris2011). A significant step in this direction was the completion of the first EMU-PS (Norris et al. Reference Norris2021a) in late 2019. Covering 270 square degrees of the sky within $301^{\circ}< \mathrm{RA} < 336^{\circ}$ and $-63^{\circ}< \mathrm{Dec} < -48^{\circ}$ , EMU-PS employed 10 tiles, each observed for approximately 10 hours. Achieving an RMS sensitivity between $25-35~\unicode{x03BC}$ Jy/beam and a beamwidth of $13^{\prime\prime} \times 11^{\prime\prime}$ FWHM, the survey operated in the frequency range of 800–1 088 MHz, centred at 944 MHz.

The raw data from EMU-PS underwent processing using the ASKAPsoft pipeline (Whiting et al. Reference Whiting, Voronkov, Mitchell, Team, Lorente, Shortridge and Wayth2017; Norris et al. Reference Norris2021a). Since the survey comprised ten overlapping tiles, additional steps were taken for value-added processing to create a unified image and source catalogue. This involved merging the tiles by performing a weighted average of overlapping data regions and convolving the unified image to a standardised restoring beam size of $18^{\prime\prime}$ FWHM to address variations in the point spread function (PSF) from beam to beam (Norris et al. Reference Norris2021a). The creation of a catalogue of islands and components was accomplished using the Selavy source finder (Whiting & Humphreys Reference Whiting and Humphreys2012) applied to the convolved image, resulting in a compilation of 198 216 islands with 220 102 components, of which 90.3% are single-component islands and the rest are multiple component islands. Note that the Selavy catalogues are not designed to provide radio galaxy catalogues. Selavy initially detects pixels above a certain threshold, then groups those pixels into islands. Subsequently, components are fitted to each island, resulting in the creation of the component catalogue. Note that throughout this work, we use unconvolved images with native resolution for tiles. The convolved image was only employed to generate the Selavy catalogue (see Norris et al. Reference Norris2021a, for details).

2.2. Infrared observations

For the EMU-PS, we obtain infrared images from the AllWISE observations of the Wide-field Infrared Survey Explorer (WISE) (Wright et al. Reference Wright2010; Cutri et al. Reference Cutri2013). WISE conducted an all-sky infrared survey in the W1, W2, W3, and W4 bands, corresponding to wavelengths of 3.4, 4.6, 12, and 22 $\unicode{x03BC}$ m. Our study focuses on utilising the W1 band from AllWISE, which has a 5 $\sigma$ point source detection limit of 28 $\unicode{x03BC}$ Jy and angular resolution of $8.5^{\prime \prime}$ . In addition to images, we utilise the CatWISE catalogue (Marocco et al. Reference Marocco2021) of infrared sources to cross-match with the predicted infrared positions for constructing the consolidated catalogue. The CatWISE catalogue is constructed from unWISE coadds (Lang Reference Lang2014), which have an angular resolution of $6.1^{\prime \prime}$ . The CatWISE Catalogue surpasses AllWISE by using six times as many exposures, resulting in approximately twice as many sources. Furthermore, CatWISE exhibits enhanced precision, especially for faint sources, resulting in a 12-fold improvement over AllWISE.

2.3. Radio and infrared images

The RadioGalaxyNET dataset (Gupta et al. Reference Gupta, Hayder, Norris, Huynh and Petersson2024) comprises 2 800 extended radio galaxies and their infrared host galaxies identified in the EMU-PS and CatWISE through independent visual inspections, as detailed in Gupta et al. (Reference Gupta, Hayder, Norris, Huynh and Petersson2024) and Yew et al. (in preparation). In addition to these extended radio galaxies, we incorporate compact ones into the dataset. Compact radio galaxies refer to unresolved radio sources that do not show extended emission. We begin by initially selecting sources randomly from the Selavy catalogue, wherein the algorithm identifies only one component corresponding to an island. Subsequently, we utilise the CARTA visualisation tool (Comrie et al. Reference Comrie2021) to validate their compact nature and identify their respective infrared hosts in the EMU-PS and AllWISE images, respectively. These infrared hosts are determined as the nearest galaxies to the radio peaks identified during visual inspections.

A visual inspection of 2 700 single-component radio sources in Selavy reveals that 146 of them are associated with extended radio sources, suggesting that approximately 95% of these single-component radio sources identified by Selavy are genuinely compact. Furthermore, we refine this dataset of compact radio sources by crossmatching it with the CatWISE catalogue to confirm the visual identifications of infrared hosts. In line with the methodology proposed by Norris et al. (Reference Norris2021a), we employ a maximum cross-matching separation of $3^{\prime \prime}$ between visually identified infrared positions and CatWISE positions to ensure that the false identification rate remains below 8%. Our ultimate dataset of compact radio galaxies includes 2 090 verified infrared host galaxies. In the present study, we exclusively utilise these compact radio galaxies to train and test the deep learning computer vision model, excluding compact sources with unconfirmed infrared hosts. It is important to note that the Selavy catalogue of EMU-PS includes approximately 180 000 sources with single components. However, a small randomly selected subset is included here to maintain the balance between other categories.

In addition to the compact radio galaxies, we also introduce peculiar and other rare radio morphologies into the dataset. Instances of these morphologies, as illustrated in Gupta et al. (Reference Gupta2022), encompass peculiar-shaped extended emission, diffuse emission from galaxy clusters, very nearby, thus large and resolved star-forming galaxies, and peculiars like Odd Radio Circles (ORCs; Norris et al. Reference Norris2021b). Initially, we seek out these sources in the EMU-PS image to identify any similar morphologies. Subsequently, our exploration extends to identify other morphologies that significantly deviate from the common types. Examples of these include oddly bent extended radio galaxies and neighbouring extended radio galaxies with merging emissions, presenting challenges in classification within conventional radio morphologies. Our investigation yields a total of 99 such peculiar and other rare morphologies, all integrated into the training and evaluation dataset. Note that not all of these morphologies are radio galaxies; some of them form a group with multiple hosts. Therefore, for simplicity, we utilise the terms ‘galaxy’ or ‘morphology’ interchangeably for all these sources. Additionally, while some of these morphologies, such as ORCs, fall under the category of peculiar sources, others like resolved star-forming, oddly bent galaxies, etc., are rare but not considered peculiar. However, for brevity, we use the abbreviation ‘Pec’ for all of these 100 morphologies. With a composition of 2 800 extended radio galaxies in RadioGalaxyNET, along with 2 090 compact radio galaxies and 100 sources with peculiar and other rare morphologies, our dataset comprises approximately 5 000 radio galaxies, forming a comprehensive set for training and testing our object detection deep learning model.

We create image cutouts with dimensions of $8^{\prime} \times 8^{\prime}$ in the sky, resulting in images of $240 \times 240$ pixels, where each pixel corresponds to 2 $^{\prime \prime} \times$ 2 $^{\prime \prime}$ . Examples of these radio images are illustrated in the left panels of Fig. 1. It is important to note that the cutouts of radio images in RadioGalaxyNET are initially $15^{\prime} \times 15^{\prime}$ , designed to detect several distinct instances of these multi-component radio galaxies in the same image. However, for our current objective of grouping multiple components of central galaxies to construct the catalogue, we reduce the cutout size, as detailed in Section 4. This is motivated by the fact that only 10 out of the 5 000 extended radio galaxies in our dataset have a total extension larger than $7^{\prime}$ . Future work should incorporate these very large extended radio galaxies into the dataset as we identify more such galaxies in the ongoing main EMU survey. However, for our current study, we have not included them in the training. In contrast to RadioGalaxyNET images, we refrain from preprocessing our radio maps in this study. Instead, the network is adapted to handle FITS files for radio images, as explained in Section 3. At the identical sky locations of radio images, we acquire infrared images from AllWISE, sourced from the WISE W1 band. We create cutouts of the same size as the radio images and then reproject the infrared images onto the radio images using the world coordinate system. In contrast to radio images, infrared images undergo noise reduction processing using the method detailed in Gupta et al. (Reference Gupta2023, Reference Gupta, Hayder, Norris, Huynh and Petersson2024). Examples of these processed infrared images are showcased in the right columns of Fig. 1.

Figure 1. Examples of the radio (left panels) and corresponding infrared (right panels) images, as described by the column titles. Each of these images has a frame size of $8^{\prime} \times 8^{\prime}$ in the sky ( $240 \times 240$ pixels). In the radio images, we display classes and bounding boxes for radio galaxies encapsulating all their components. Here, the ‘FR-X’ type is positioned between the FR-I and FR-II categories, ‘R’ denotes resolved radio sources with one visible peak, ‘C’ represents compact unresolved radio sources, and ‘Pec’ refers to peculiar or other rare radio morphologies (see Section 2.4 for details). On the infrared images, circles indicate the positions of host galaxies.

Figure 2. Shown are the dataset split distributions, depicting the distributions of extended radio galaxies in a single cutout (left), their respective categories (middle), and the occupied area per radio galaxy (A; right). The tables below the figures provide detailed counts of radio morphologies in the training, validation, and test sets. See Section 2.5 for more details. Note that each radio morphology has a corresponding infrared host, so the counts here also represent the number of corresponding infrared hosts.

2.4. Radio and infrared annotations

Annotations for 2 800 extended radio galaxies and their corresponding infrared hosts are already present in the RadioGalaxyNET dataset (Gupta et al. Reference Gupta, Hayder, Norris, Huynh and Petersson2024). These annotations encompass radio galaxy categories, bounding boxes encapsulating all radio components of each radio galaxy, pixel-level segmentation masks, and the keypoint locations of corresponding infrared hosts that are cross-matched with the CatWISE catalogue. The identification of these extended radio galaxies and their infrared hosts is independently achieved through visual inspections, where the infrared host for each radio galaxy is identified in the infrared image Yew et al. (in prep.). The dataset is categorised into FR-I, FR-II, FR-x, and R galaxies based on measurements of their total extent and the distance between peak positions. This classification follows the criteria set by Fanaroff & Riley (Reference Fanaroff and Riley1974), where the ratio between the peak distance and total extent is employed to differentiate between FR-I and FR-II galaxies. Specifically, the ratio is below 0.45 for FR-I and above 0.55 for FR-II. Owing to image resolution limitations, some galaxies cannot be conclusively classified as either FR-I or FR-II, leading to their categorisation as FR-x galaxies, with the ratio between peak distance and total extent falling between 0.45 and 0.55. Note that, in this work, certain subtypes of morphologies, which are commonly associated with FR-I-type radio galaxies, such as bent-tailed, narrow, and wide-angled-tailed sources (Miley Reference Miley1980), are not differentiated. The R category (radio galaxies with resolved radio emission) pertains to those galaxies where only one central peak is visible, resulting in a ratio set to zero.

For compact radio galaxies, denoted by ‘C’, we initially apply the island segmentation method (e.g. Gupta et al. Reference Gupta2022) to mask pixels exceeding $3\sigma$ for compact radio galaxies. Subsequently, we generate bounding boxes for these galaxies. The positions of infrared hosts for these compact radio galaxies are identified through visual inspections and included in the dataset as keypoints. Note that R and C radio galaxies used for model training and evaluation are visually distinguished in this work and do not rely on the peak-to-total flux criterion (e.g. Condon et al. Reference Condon1998). In the case of peculiar and other rare radio morphologies, denoted as ‘Pec’, we employ CARTA to obtain bounding boxes and subsequently generate segmentation masks for all pixels larger than $3\sigma$ within these bounding boxes. Due to the diffuse nature of radio emission in these Pec morphologies, the hosts for these sources cannot be identified unambiguously with a unique infrared object, sometimes involving multiple potential host galaxies. Consequently, we select one of these infrared galaxies, closest to the bounding box centroid, as the host for the Pec radio morphologies. Future work, incorporating a larger sample of Pec radio morphologies, should include all such infrared galaxies to enable multiple host galaxy detection for these sources. Fig. 1 provides examples of FR-I, FR-II, FR-x, R, and C radio galaxies and Pec morphologies. It is worth noting that the FR-I radio galaxies depicted in the middle left and bottom right panels may indeed be FR-II-type radio galaxies at higher resolution. Nevertheless, we adhere to the mechanism described above and classify them based on the present image resolution. Following the RadioGalaxyNET dataset structure, we furnish annotations for the radio images, encompassing ‘categories’, ‘bbox’, and ‘segmentation’, along with ‘keypoints’ for the infrared. All annotations adhere to the COCO dataset format (Lin et al. Reference Lin, Fleet, Pajdla, Schiele and Tuytelaars2014a), simplifying the streamlined evaluation of object detection methods.

2.5. Radio source statistics

Our dataset encompasses approximately 5 000 radio galaxies and their corresponding infrared hosts. We generate 5 000 cutouts centred at each radio galaxy, each with a size of $8^{\prime} \times 8^{\prime}$ , and store them in FITS image format with sky information in the headers. Similarly, for infrared host galaxies, we generate cutouts of the same size and process them following the methodology outlined in Section 2.3. As other radio galaxies are present close to the central radio galaxy within an $8^{\prime} \times 8^{\prime}$ cutout (e.g. top right panel of Fig. 1), there are a total of 6 080 instances of radio galaxies in these 5 000 cutouts. The dataset comprises 371 FR-I, 1 331 FR-II, 401 FR-x, 698 R, and 2 090 compact radio galaxies as well as 99 other rare and peculiar morphologies. Adhering to the commonly used strategy in ML, we randomly split our dataset into train, validation, and test sets in the ratio of $0.9:0.05:0.05$ for the object detection modelling.

The specific counts of radio galaxies within each radio-morphological category, along with the split ratios, are illustrated in Fig. 2 and detailed in the table below the figure. The left panel of the figure showcases the number of $8^{\prime} \times 8^{\prime}$ cutouts with radio galaxy instances ranging from single to three radio galaxies, indicating that the majority of our cutouts feature either single or double instances of radio galaxies. Note that while these cutouts with multiple instances are utilised for training and testing the network, catalogue construction is based solely on central radio galaxies and their infrared host galaxies, as discussed in Section 4. The middle panel of the figure presents the number of radio galaxies within the six categories, with the corresponding counts for the split sets displayed in the table below. Lastly, the third panel reveals the number of radio galaxies categorised by bounding box area in pixels, highlighting that the majority of our radio galaxies are small-scale structures, with bounding box area below $48^2$ pixels.

3. Detection pipeline for radio galaxies

The radio images exhibit distinct features for extended radio galaxies compared to their counterparts in infrared images, where the latter predominantly resemble point sources (with a few exceptions, such as resolved star-forming galaxies). Examples of these disparities are illustrated in Fig. 1. A multimodal approach, introduced by Gupta et al. (Reference Gupta, Hayder, Norris, Huynh and Petersson2024), facilitates the detection of both radio sources and their potential infrared host positions. This approach incorporates models like Gal-DETR, Gal-Deformable DETR, and Gal-DINO, all capable of concurrently identifying radio galaxies and their potential infrared hosts. These models employ two fundamental detection schemes: the base networks handle class and bounding box predictions for radio galaxies, while the keypoint detection module is utilised for predicting potential infrared host positions. Table 2 in Gupta et al. (Reference Gupta, Hayder, Norris, Huynh and Petersson2024) demonstrates that the Gal-DINOFootnote b model, outperforms other networks in our context of small object detection in radio and infrared images. For a comprehensive understanding of the modelling strategy, we direct readers to Gupta et al. (Reference Gupta, Hayder, Norris, Huynh and Petersson2024); here, we provide a brief overview of the Gal-DINO model.

The Gal-DINO model is based on the DEtection TRansformers (DETR) (Carion et al. Reference Carion2020), which adopts the Transformer architecture (Vaswani et al. Reference Vaswani2017). Initially designed for natural language processing, this architecture is utilised to address the intricate task of object detection in images. In contrast to conventional methods relying on region proposal networks (e.g. Faster RCNN; Ren et al. Reference Ren, He, Girshick and Sun2015), DETR introduces an end-to-end approach to object detection using Transformers. The DETR with Improved deNoising anchOr boxes (DINO; Zhang et al. Reference Zhang2023) incorporates enhanced anchor boxes, predefined boxes crucial for object detection. DINO introduces refined strategies for selecting and placing these anchor boxes, improving the model’s capability to detect objects of varying sizes and aspect ratios. During training, DINO employs an improved mechanism for matching anchor boxes to ground-truth objects, enhancing accuracy in localisation and classification. Additionally, DINO utilises adaptive convolutional features, enabling the model to concentrate on informative regions of the image, thereby enhancing both efficiency and accuracy. Gal-DINO (Gupta et al. Reference Gupta, Hayder, Norris, Huynh and Petersson2024) integrates keypoint detection into the DINO algorithm, which already features improved de-noising anchor boxes. By mitigating the impact of noise and outliers, Gal-DINO yields more resilient and precise bounding box predictions. This results in improved localisation of extended radio galaxies and their corresponding infrared hosts within these bounding boxes.

3.1. Radio galaxy class and bounding box predictions

The Gal-DINO model utilises the DINO architecture, integrating a ResNet-50 (He et al. Reference He, Zhang, Ren and Sun2015) Convolutional Neural Network backbone to process input images and extract feature maps. Positional encodings are introduced to incorporate spatial information into the Transformer architecture, enhancing the model’s ability to understand relative object positions. DINO uses learned object queries, departing from fixed anchor boxes (e.g. Tian et al. Reference Tian, Shen, Chen and He2019), to represent classes of objects and are refined during training. The model’s decoder produces two output heads for class and bounding box predictions, facilitating simultaneous processing of the entire image and capturing contextual relationships between regions. Each head refers to a specific sub-network for a particular aspect of the overall task. This approach results in robust and precise bounding box predictions, improving the localisation of extended radio galaxies. DINO adopts the Hungarian loss function to establish associations between predicted and ground-truth bounding boxes, ensuring a one-to-one mapping. The comprehensive loss function for DINO integrates the cross-entropy loss for class predictions $\mathcal{L}_{\mathrm{c}}$ and the smooth L1 loss for bounding box predictions $\mathcal{L}_{\mathrm{b}}$ . This loss function is expressed as the sum of the classification and bounding box losses, contributing to the overall training objective of the model.

(1)

\begin{equation}\mathcal{L}_{\mathrm{DINO}} = \mathcal{L}_{\mathrm{c}} + \mathcal{L}_{\mathrm{b}}.\end{equation}

The L1 loss, used for bounding box regression, measures the absolute difference between predicted and true bounding box coordinates, penalising the model for deviations and promoting accurate localisation.

3.2. Infrared host galaxy keypoint prediction

Gal-DINO introduces keypoint detection techniques as a complement to the bounding box-based object detection method for identifying extended radio galaxies and their potential infrared hosts. Keypoints, representing distinctive features within images, offer precise spatial information for accurately locating the host galaxy. Unlike bounding boxes, keypoints allow for fine-grained localisation, particularly beneficial for complex radio emission morphologies. The keypoint detection in Gal-DINO, leveraging the transformer-based architecture to capture global and local details. It utilises self-attention mechanisms to localise and associate keypoints for infrared host galaxies. The loss function for Gal-DINO combines DINO loss for class and bounding box predictions with keypoint detection loss, expressed as

(2)

\begin{equation}\mathcal{L}_{\mathrm{Gal-DINO}} = \mathcal{L}_{\mathrm{DINO}} + \mathcal{L}_{\mathrm{k}}\end{equation}

where $\mathcal{L}_{\mathrm{k}}$ is the L1 loss for keypoint detection, calculating the absolute difference between predicted and ground-truth keypoint coordinates, such as the x and y position of the host.

3.3. Network training

The dataset is split into training, validation, and test sets as illustrated in Fig. 2 and explained in Section 2.5. The training set is utilised to train the networks, while the validation and test sets function as inference datasets during and after training, respectively. As mentioned in Section 2.3, the Gal-DINO network employed in the present work, processes radio images in FITS format and infrared images in PNG format, diverging from the RadioGalaxyNET dataset where 3-channel radio-radio-infrared PNG format files are used for both training and inference. In the present approach, 8-channel images are generated, comprising 7 channels from the radio FITS file and 1 channel representing the processed infrared sky. Each radio channel encompasses clipped data from FITS files, where clipping is done between the 50th percentile level and 7 distinct maximum levels. These maxima correspond to the 95th, 99th, 99.2nd, 99.5th, 99.7th, 99.9th, and 99.99th percentile levels. This methodology utilises radio images in FITS format directly, eliminating the need for preprocessing and conversion to PNG files. Furthermore, it imparts information to the computer vision model in a manner akin to the visual inspection performed by expert astronomers for the classification and grouping of multi-component radio galaxies. Future work should explore alternative scaling approaches and, if necessary, fine-tune the number of channels while taking into account potential GPU memory constraints. An illustrative example of an FR-II radio galaxy, featuring 7 channels and its corresponding infrared channel, is presented in Fig. 3.

Figure 3. Shown is an example of an 8-channel image used for the training and evaluation of the Gal-DINO network. The first 7 channels contain data from the radio FITS file, representing the extended radio galaxy with clipping between the 50th percentile level and 7 specific maxima, corresponding to the 95th, 99th, 99.2th, 99.5th, 99.7th, 99.9th, and 99.99th percentile levels. The 8th channel, in the bottom right, displays the corresponding pre-processed infrared image. The bounding box and keypoint annotations are not depicted here for brevity; examples of these annotations are shown in Fig. 1 on the radio (with maxima at the 95th percentile level) and infrared images, respectively.

In line with the training methodology of Gal-DINO in Gupta et al. (Reference Gupta, Hayder, Norris, Huynh and Petersson2024), the training dataset undergoes diverse random augmentations during each epoch, where an epoch corresponds to a single pass through the entire dataset in the model’s training process. These augmentations encompass horizontal flipping, random rotations (–180 to 180 degrees), random resizing within $400\times400$ to $1\,300\times1\,300$ pixels, and random cropping of a randomly selected set of 8-channel training images. Horizontal flipping and rotations ensure varied orientations, crucial for handling radio galaxies with diverse sky orientations while resizing and cropping introduce scale and spatial variety. These augmentations, consistently applied, bolster the model’s ability to generalise effectively and excel on unseen data. Gal-DINO, with a parameter count of 47 million, underwent a training duration of approximately 35 hours using a single Nvidia Tesla P100 GPU for 100 epochs. We utilise the original hyperparameters from Gal-DINO with specific adjustments. These hyperparameters, set before training, encompass critical aspects such as learning rate, batch size, architecture details, dropout rate, activation functions, and optimiser. Although a comprehensive list of hyperparameters for each network is not provided here for brevity, detailed network architecture, including hyperparameters, is available in the associated repositories for reference.

3.4. Evaluation metrics

The evaluation metrics, based on Lin et al. (Reference Lin, Fleet, Pajdla, Schiele and Tuytelaars2014b), employ the Intersection over Union (IoU) to assess algorithm performance on the test dataset. IoU is calculated as the ratio of the overlap area between predicted ( $B_{\mathrm{P}}$ ) and ground-truth ( $B_{\mathrm{GT}}$ ) bounding boxes to their union area.

(3)

\begin{equation} \mathrm{IoU}(B_{\mathrm{P}}|B_{\mathrm{GT}}) = \frac{\text{Overlap between } B_{\mathrm{P}} \text{ and } B_{\mathrm{GT}}}{\text{Union between } B_{\mathrm{P}} \text{ and } B_{\mathrm{GT}}},\end{equation}

In bounding box prediction, each predicted box is categorised as true positive (TP), false positive (FP), or false negative (FN) based on its area compared to the ground-truth box. A TP bounding box accurately identifies an object with high IoU overlap, while an FP bounding box fails to correspond to any ground-truth object. An FN bounding box occurs when an object in the ground-truth data is not successfully detected by the algorithm, representing a missed opportunity to identify a genuine object. To detect keypoints, we use the Object Keypoint Similarity (OKS) metric, which gauges the similarity between predicted and ground-truth keypoints. OKS computes the Euclidean distance for each predicted keypoint relative to its corresponding ground-truth keypoint, normalising it based on the size of the object instance. The Euclidean distance (Ed) between the ground-truth and predicted keypoints is subjected to a Gaussian function, defined as follows:

(4)

\begin{equation}\mathrm{OKS} = \exp \left(-\frac{Ed^2}{2l^2c^2}\right),\end{equation}

where l represents the ratio of the bounding box’s area to the image cutout area, and c is a keypoint constant set to 0.107. This adjustment differs from the 10 values used in Lin et al. (Reference Lin, Fleet, Pajdla, Schiele and Tuytelaars2014b), as each bounding box in our case corresponds to a single infrared host. The resulting OKS score falls within the range of 0 to 1, with 1 signifying perfect keypoint localisation.

We evaluate the network’s performance using the average precision metric, a widely used standard in object detection model assessment (Lin et al. Reference Lin, Fleet, Pajdla, Schiele and Tuytelaars2014b). Precision, the ratio of TPs to the total number of objects identified as positive, measures detection and classification precision. Recall, the ratio of TPs to the total number of objects with ground-truth labels, assesses the model’s ability to identify all relevant objects. The precision-recall curve illustrates the trade-off between precision and recall across varying detection thresholds. The area under the curve (AUC) is calculated, representing the average AUC value across all classes or objects. Higher values of average precision, which ranges from 0 to 1, signify superior model performance. We compute average precision using standard IoU and OKS thresholds for bounding boxes and keypoints. IoU and OKS thresholds determine the correctness of predictions based on overlap and similarity scores, respectively. We calculate average precision at IoU (or OKS) thresholds from 0.50 to 0.95 (AP), as well as specific thresholds of 0.50 (AP $_{50}$ ) and 0.75 (AP $_{75}$ ) for radio galaxies of all sizes. Additionally, we assess performance across different structure scales by computing Average Precision for small (AP $_{\mathrm{S}}$ ), medium (AP $_{\mathrm{M}}$ ), and large (AP $_{\mathrm{L}}$ ) area ranges defined by pixel areas $\rm A < 24^2$ , $\rm 24^2 < A < 48^2$ , and $\rm A > 48^2$ , as shown in the right panel of Fig. 2.

3.5. Model evaluation results

We assess the performance of the Gal-DINO model using 8-channel radio and infrared images that were not part of the training dataset. The model is designed to predict the categories (FR-I, FR-II, FR-x, R, Pec, and C) of these galaxies, generate bounding boxes to capture their extended emission structures, and identify their corresponding infrared host galaxies. Details about the radio and infrared images, annotations, and data statistics can be found in Sections 2.3, 2.4, and 2.5, respectively. The model’s training and evaluation strategy is explained in Sections 3.3 and 3.4, respectively. Evaluation is performed on a combined dataset of validation and test sets. This combined evaluation is justified by the need for larger sample sizes; for example, the test set only contains 2 Pec morphologies, but combining the validation set increases this count to 10. It is important to note that the validation set plays no role in training the network; it is solely utilised for evaluating the model at each epoch without model back-propagation applied using the epoch’s validation results. Thus, it is appropriate to merge both the validation and test sets for the evaluation of the trained model.

Table 1 presents the outcomes obtained from the combined validation and test dataset for both bounding box and keypoint detections. The Gal-DINO model, after training, attains an AP of 65.1% for bounding boxes associated with radio galaxies. Its effectiveness extends across various IoU thresholds, achieving an AP $_{50}$ of 73.2% and an AP $_{75}$ of 70.0%, demonstrating robustness in detecting radio galaxies at different IoU thresholds. Furthermore, the model consistently performs well for radio galaxies of small, medium, and large sizes (AP $_{\mathrm{S}}$ , AP $_{\mathrm{M}}$ , and AP L). In keypoint detection as well, our trained model demonstrates strong performance, attaining an AP of 69.3% and an AP $_{50}$ of 71.7%. These results substantiate the model’s efficacy in predicting bounding boxes and keypoints for both radio galaxies and their corresponding infrared hosts, showcasing strong performance across diverse IoU thresholds and bounding box areas.

Table 1. Results for bounding box and keypoint detection using the trained Gal-DINO network are presented on a combination of the test and validation datasets with 8-channel images (see Fig. 3). The columns, from left to right, showcase various metric types: average precision for IoU (or OKS) thresholds ranging from 0.50 to 0.95 (AP), a specific IoU (or OKS) threshold of 0.5 (AP $_{50}$ ), IoU (or OKS) threshold of 0.75 (AP $_{75}$ ), and average precision for small-sized (AP $_{\mathrm{S}}$ ), medium-sized (AP $_{\mathrm{M}}$ ), and large-sized (AP $_{\mathrm{L}}$ ) radio galaxies. Further details on the training and evaluation can be found in Sections 3.3 and 3.4, respectively.

Fig. 4 presents the confusion matrices for the combined validation and test set, computed based on IoU for bounding boxes (and OKS for keypoints) at confidence thresholds of 0.5 and 0.25, respectively. It is crucial to highlight the distinction between confusion matrices used in object detection and classification. Object detection involves the possibility of multiple instances of the same or different classes within an image, resulting in several TP, FP, and FN values for each class. In contrast, classification usually assumes only one label per image, yielding a confusion matrix with a single TP, FP, and FN value for each image.

Figure 4. Presented is the normalised confusion matrix for the Gal-DINO detection model. Each matrix is normalised based on the total number of galaxies within its corresponding class. The diagonal entries denote true positive (TP) instances, representing objects correctly detected with an IoU and OKS threshold surpassing 0.5 compared to the ground-truth instances, and a confidence threshold of 0.25. False positive (FP) instances correspond to model detections lacking corresponding ground-truth instances, while false negative (FN) instances signify objects that the model failed to detect at the same IoU and OKS thresholds, along with a confidence threshold of 0.25.

Precision in object detection depends on precise localisation and accurate detection of object boundaries, evaluated using an IoU (and OKS) threshold of 0.5. The confusion matrix contains TP, FP, and FN values, offering insights into the model’s performance. For the FR-II class, the model achieved a TP value of 0.79, indicating correct detection of 79% of FR-II galaxy instances. However, there were significant false positives (FP = 0.40), where the model predicted 40% of instances as FR-II when they did not correspond to FR-II instances in the ground truth. Additionally, a moderate number of false negatives (FN = 0.07) suggests that the model missed or failed to detect 7% of actual FR-II instances. Similar patterns are observed for the FR-I, FR-x, and Pec classes. For the FR-I class, the detected source extent depends on sensitivity to diffuse extended structure (e.g. see Figure 14 in Turner et al. Reference Turner, Rogers, Shabala and Krause2018), and hence, some FR-I sources may be classified as FR-IIs due to limited surface brightness sensitivity.

Table 2. The bounding box and keypoint detection results achieved through the Gal-DINO network on a merged dataset comprising both test and validation sets. The columns correspond to those outlined in Table 1. The PNG results reflect outcomes obtained from 3-channel images, while the All-10% results signify a scenario where 10% of the entire training dataset is intentionally corrupted, aiming to assess the model’s robustness to potential noisy labels. The Ext-10% results specifically involve introducing 10% noise in annotations exclusively for extended radio galaxies within the training dataset (see Section 3 for details).

Peculiar and other rare morphologies exhibit around 40% misclassification as FR-II galaxies, primarily due to similarities in some rare morphologies to FR galaxies but with unique diffuse emission or wide-angled-tailed characteristics. It is important to highlight that although this percentage may seem high, it corresponds to only four Pec morphologies. Compact radio galaxies achieve a high TP rate of 85%, with a few misclassified as R galaxies due to only slight differences. Although the FN rates in the detections are not optimal, it is important to emphasise that these result from the application of a confidence threshold of 0.25. Lowering this threshold, eliminates FN rates for all categories, indicating the detection of all radio galaxies in the ground truth. Nonetheless, this leads to higher FP rates, a topic that will be revisited in Section 4 for catalogue construction.

As outlined in Section 2.3, the images utilised in Gupta et al. (Reference Gupta, Hayder, Norris, Huynh and Petersson2024) for both training and model evaluation consist of 3-channel PNG images with dimensions of $450\times450$ pixels ( $15^{\prime}\times 15^{\prime}$ ). These images involve two pre-processed radio channels, containing 0–8 and 8–16 bit information, and one corresponding pre-processed infrared channel, with 8-16 bit information. In this work, we diverge from this approach by training the network with 8-channel images sized at $240\times240$ pixels ( $8^{\prime}\times 8^{\prime}$ ), as illustrated in Fig. 3 and detailed in Section 3.3. In Table 2 of Gupta et al. (Reference Gupta, Hayder, Norris, Huynh and Petersson2024), it is evident that the AP $_{50}$ for bounding boxes is 60.2%, whereas in our current work, the AP $_{50}$ attains 73.2%, as demonstrated in Table 1. To understand this difference, we conduct an experiment where the Gal-DINO model is trained using our dataset (Section 2.5) and the same set of hyperparameters for 100 epochs. However, in this case, the network is trained with 3-channel PNG images containing radio-radio-infrared channels, mirroring the approach in Gupta et al. (Reference Gupta, Hayder, Norris, Huynh and Petersson2024). The evaluation results using these PNG images are presented in Table 2. When considering the combined validation and test sets, the AP $_{50}$ is measured as 69.4% for bounding boxes. This comparison shows that employing 8-channel images in our current work marginally improves model performance in comparison to the utilisation of 3-channel PNG images, as indicated by the evaluation results. Additionally, it reaffirms the anticipated outcome that reducing image dimensions from $450\times450$ pixels to $240\times240$ pixels significantly contributes to improved model performance when compared with the findings in Gupta et al. (Reference Gupta, Hayder, Norris, Huynh and Petersson2024).

Figure 5. An overview of the catalogue construction pipeline. The process initiates with obtaining predictions from the Gal-DINO model for all radio and infrared cutouts centred at the components in the Selavy catalogue. Subsequently, a dictionary of predictions is generated for the central sources within these cutouts. The consolidated catalogue is then formed by calibrating the confidence scores in the dictionary, organising them in descending order, and systematically consolidating and removing entries from the Selavy catalogue based on decreasing score values. We refer the readers to Section 4 for further details.

Given that visual inspections are employed for annotating the dataset used in training the network, we assess the model’s robustness to possible errors introduced during the manual labelling process. To achieve this, we undertake an experiment wherein the model is trained on noisy labels that are intentionally corrupted with noise. In this experiment, we introduce noise by randomly altering the bounding boxes for 10% of the training set. Specifically, we modify the bounding box positions by displacing them away from the full extent of the radio galaxy. This ensures that these noisy bounding boxes and keypoints neither coincide with the galaxy nor maintain a consistent placement in the image; instead, they are randomly positioned elsewhere within the image. This process is repeated ten times, each time selecting different galaxies at random. The model is trained in the same manner and for the same number of epochs as before, with five distinct training datasets, each with 10% randomly selected galaxies carrying noisy labels. Note that this noise is only applied to the training dataset, keeping the validation and test datasets unchanged for a direct comparison with our main results outlined in Table 1. Table 2 presents the outcomes when the entire training set is employed to randomise the bounding boxes, denoted as All-10%. The median AP $_{50}$ for the five models trained with random noisy data at a 10% level is $71.9\pm 3.1$ for bounding box detections. Here, the error is computed as the standard deviation across the results of five model evaluations on the validation and test datasets. This observation suggests that the model remains unaffected even when 10% of the annotations in the training dataset are erroneous. Comparable results are observed for AP, AP $_{75}$ , AP $_{S}$ , AP $_{M}$ , and AP $_{L}$ metrics. It’s worth noting that the larger error bars in APL may be attributed to the smaller validation and test sample size, as illustrated in the right panel of Fig. 2.

In another analogous experiment, we again altered the positions of the bounding boxes and keypoints by randomly selecting 10% of the extended radio galaxies. This selection encompasses all five categories of radio galaxies, excluding compact radio galaxies this time. This is conducted to assess the model’s robustness to possible noise in annotations related to extended radio galaxies. In this procedure, we randomly select 10% of these extended radio galaxies and proceed to randomly displace their bounding boxes and keypoint positions, following the same methodology as before. Over five such iterations, the model is trained in a manner consistent with the previous experiment. The outcomes for these models, denoted as Ext-10%, utilising noisy annotations for extended radio galaxies during model training, are presented in Table 2. Notably, the results for both bounding boxes and keypoints consistently align with the primary evaluation findings reported in Table 1. This demonstrates the model’s robustness against potential manual errors in the annotation process, such as incorrect radio morphology, radio bounding boxes, and infrared host galaxies.

4. Catalogue construction pipeline

The Gal-DINO model, once trained, is employed to generate a catalogue of radio galaxies within the EMU-PS. Fig. 5 provides an overview of this process. This section will delve into the specifics of the catalogue construction pipeline. Note that, unless the existence of confirmed infrared or optical counterparts is established, these entities are generally referred to as radio sources rather than AGN or radio galaxies. However, as our pipeline identifies potential hosts for all radio sources, we use the term ‘radio galaxies’ for brevity. The cross-matching process of counterpart identification is elaborated in Section 5.3.

4.1. Images for catalogue construction

We employ the Selavy catalogue of the EMU-PS to extract cutouts, which, in conjunction with the trained Gal-DINO model, provide radio galaxy and its potential infrared host predictions. As outlined in Section 2.1, the primary purpose of Selavy catalogues is not to generate radio galaxy catalogues; rather, their design only focuses on grouping pixels into islands and fitting components to each island. Utilising the Selavy source finder on the EMU-PS image resulted in identifying 220 102 components. We create cutouts, sized at $8^{\prime}\times8^{\prime}$ , at the positions of these components, where radio cutouts are saved as FITS files, and corresponding pre-processed infrared cutouts are saved as PNG files. These cutouts are saved in a directory and then processed by the trained Gal-DINO model to obtain predictions for each cutout. It is important to note that components in the Selavy catalogue can appear in multiple images due to their proximity. Consequently, the Gal-DINO model may generate multiple predictions for each component across different cutouts. As detailed in Section 4.3, only central radio source instances in each image are utilised for catalogue construction. Since each image has a unique Selavy catalogue component at its centre, this allows us to make predictions for each source individually.

4.2. Model predictions for images

The directory containing 220K radio and infrared image cutouts is input into the trained Gal-DINO model through its data loader. The model then provides predictions for each detected radio source, including a category assignment, a bounding box encompassing any extended emission or multiple components, and a prediction score or confidence level. Additionally, the model predicts potential infrared hosts for all identified radio sources within the provided cutouts. For 220K cutouts, the model requires approximately 10 hours for these predictions, when running on a single Nvidia Tesla P100 GPU. While the network produces several FP predictions, many of these predictions possess very low detection scores. To filter out less reliable predictions, we establish a detection score threshold, only retaining predictions with a score greater than 0.05. While this choice reduces the FP detection rate, it also maintains the FN rate at a minimal level. Employing this score threshold yields a maximum of 30 predictions per image. Out of 220K image cutouts, only 39 have no radio morphology detections above this score.

4.3. Predictions for central sources in images

As we acquire predictions for all 220K cutouts centred on the positions of the Selavy catalogue components, we selectively utilise predictions from central sources in cutouts to compile the consolidated catalogue. For each image, we initially generate a dictionary for every cutout, containing prediction details such as radio categories, bounding boxes, keypoints, and scores for all detected instances. Subsequently, we identify predictions where bounding boxes with the highest scores overlap with the centre of each cutout. Remarkably, only 1.1% of cutouts lack predicted bounding boxes at the centre, totalling approximately 2 336 cutouts without central predictions. A visual examination of 200 of these cutouts reveals that they all feature a faint compact radio galaxy at the centre, resulting in confidence scores below our 0.05 threshold. Utilising the prediction dictionary from the remaining 98.9% of central predictions, we proceed to update the consolidated catalogue, a process detailed in the subsequent sections. As outlined in Table 1, the AP $_{50}$ for the combined validation and test sets stands at 73.2%, taking into account multiple predictions within an image. Nonetheless, upon examining these images, we observe that 99% of central radio galaxies exhibit an IoU greater than 0.5, and 97.2% of central radio galaxies boast an IoU exceeding 0.7. Additionally, in terms of keypoint detections, we observe that 98% of central radio galaxies have a keypoint position within $<3^{\prime \prime}$ of the CatWISE host in the evaluation set, and 77% of keypoints are $<1^{\prime \prime}$ away from the CatWISE host location. This indicates that, for the majority of central radio galaxies, the predicted bounding boxes and keypoint positions align well with the ground truth.

4.4. Cataloguing galaxies based on detection scores

The prediction dictionary encompasses categories, bounding boxes, keypoints, and scores for central sources. These galaxies are incorporated into the consolidated catalogue based on their detection scores, with higher-scoring galaxies being added first. To achieve this, we initiate the process by calibrating the scores for all central sources in the dictionary. Calibration ensures that the predicted probabilities represent reliable estimates of the true probabilities. Following the methodology outlined in Gupta et al. (Reference Gupta2021), we employ a spline-based calibration approach. This approach introduces a binning-free calibration measure inspired by the comparison of cumulative probability distributions in the classical Kolmogorov–Smirnov (KS) statistical test. Unlike traditional binning, it approximates the empirical cumulative distribution using a differentiable function achieved through splines. The calibration process involves finding a mapping $\unicode{x03B3}\,:\, [0,1] \rightarrow [0,1]$ such that $\unicode{x03B3} (f_k(x))$ is calibrated. This mapping is established through a direct correlation from the score $f_k(x)$ to $P(k|f_k(x))$ for all classes k. We refer the readers to Gupta et al. (Reference Gupta2021) for more details. The spline fitting is executed using a held-out calibration set, and the resulting calibration function is evaluated on an unseen test set.

We utilise scores and class predictions extracted from the training dataset for establishing a calibration function. The effectiveness of this calibration function is then assessed using scores and classes from the combined validation and test sets. The graphs depicting uncalibrated and calibrated scores are illustrated in Fig. 6. Here, (a) presents a plot depicting the cumulative score and probability against the fractile of the test set. In (b), the same information is displayed, but the horizontal axis is distorted, resulting in a cumulative score graph that forms a straight line. This is visualised through scatter plots of cumulative (score, score) in blue and (score, probability) in orange. In the case of a perfectly calibrated network, the probability line will align as a straight line coinciding with the (score, score) line. In (c) and (d), plots of non-cumulative scores and probabilities are presented against the fractile or score. The upper panel illustrates that the network is substantially overestimating the probability of the detection based on the score. The bottom panel displays the outcomes of the calibration, revealing a notable improvement compared to the upper panel. This is evident from the lower KS error of 4.2%, in contrast to 7.3%. It is also important to highlight that the enhanced calibration is achieved without compromising accuracy (or probability).

Figure 6. Uncalibrated and calibrated scores for combined validation and test sets: (a) depicts the cumulative score and probability with the fractile. In (b), the same data is shown, but with a distorted horizontal axis, resulting in a cumulative score graph forming a straight line. This is visualised through scatter plots of cumulative (score, score) in blue and (score, probability) in orange. In the case of a perfectly calibrated network, the probability line will coincide as a straight line with the (score, score) line. (c) and (d) showcase plots of non-cumulative scores and probabilities against the fractile or score. The upper panel demonstrates that the network significantly overestimates the probability of detection based on the score with a KS error of 7.3%. The calibration reduces the KS error to 4.2%. See Section 4.4 for more details.

We employ the calibration function on the scores of central sources within the prediction dictionary. Subsequently, we arrange the scores and their associated category, bounding box, and keypoint predictions in descending order. The bounding box predictions are then utilised in a descending order of scores to group all Selavy catalogue components within each box and collectively add them to the consolidated catalogue. Simultaneously, we remove the corresponding rows of these components from the Selavy catalogue. Since boxes with higher scores are prioritised, the components already removed from the Selavy catalogue do not duplicate in the consolidated catalogue. This process is iterated for all central source predictions in the dictionary. Ultimately, when no components remain in the Selavy catalogue, the cataloguing process concludes.

4.5. Predictions for large extended radio galaxies

In a few cases, there are compact radio galaxies in the proximity of a large extended radio galaxy. The Gal-DINO model would generally predict a large bounding box encompassing all the components of the extended radio galaxy, but would also predict bounding boxes for the neighbouring radio galaxies. In such a scenario, the cataloguing process explained in the previous sections will tend to automatically add components with higher confidence scores to the consolidated catalogue but as it also removes those from the Selavy catalogue, this leads to separating the neighbouring radio galaxies from the large extended radio galaxy. As an example, Fig. 7 shows such a possible scenario, where an extended radio galaxy has two compact radio galaxies in its proximity. All Selavy catalogue components in the figure are represented by blue circles. The confidence score for the extended radio galaxy is 0.5 (with the biggest bounding box), however, as the two compact radio galaxies have higher scores, the galaxy with a score of 0.8 is consolidated first and removed from the Selavy catalogue. This is followed by the second compact radio galaxy with a score of 0.6. As both compact radio galaxies are consolidated and removed from Selavy catalogue, only the three remaining components are consolidated for the extended radio galaxy with the biggest box with a score of 0.5 encompassing them. Thus the consolidated catalogue records three radio galaxies in this scenario.

Figure 7. Shown is a potential scenario in which an extended radio galaxy is accompanied by two adjacent compact radio galaxies. Blue circles represent all Selavy catalogue components. The extended radio galaxy has a confidence score of 0.5, but the two compact radio galaxies with higher scores (0.8 and 0.6) are consolidated first, removing them from the Selavy catalogue. Subsequently, only the three remaining components of the extended radio galaxy, encompassed by the biggest bounding box with a score of 0.5, are consolidated. As a result, the consolidated catalogue registers three radio galaxies in this particular case.

Note that the predictions with scores 0.2 and 0.15 do not matter here as these components are already consolidated and removed from the Selavy catalogue due to their presence in the bigger box with a higher score of 0.5. A visual inspection of several such extended radio galaxies shows that this process works for most of the extended radio galaxies with neighbouring compact radio galaxies. However, for a handful of extended radio galaxies, the neighbouring compact radio galaxy gets consolidated due to its lower score as compared to the bigger box’s score that encloses both extended and compact radio galaxies. Future work with the EMU main survey should revisit this issue using a larger sample size, which can be addressed separately with a different computer vision strategy. Additionally, addressing the issue posed by very large extended radio galaxies with extensions surpassing $8^\prime$ should be a focus of future work with the EMU main survey. These galaxies, characterised by their limited representation in the present work due to their scarcity in the EMU-PS, were not incorporated into the training of the Gal-DINO model.

5. Catalogue descprition

The final catalogue comprises 211 625 radio galaxies, and 73% of these have a counterpart in the CatWISE infrared catalogue within $3^{\prime \prime}$ . Additionally, cross-matched counterparts from the optical surveys have been incorporated into the catalogue. Table 3 provides detailed descriptions of the catalogue columns. This section elaborates on the specifics of the catalogue.

Table 3. Description of columns in the catalogue (best viewed in a PDF reader).

5.1. Consolidated catalogue

The upper panel in Fig. 9 displays a histogram depicting the prediction confidence scores for all radio galaxies. Out of the 211 625 radio galaxies in the catalogue, 91.1% of them exhibit a confidence score of 1, and 99.1% are identified with a score greater than 0.5. As detailed in Section 4.3, there are 2 336 cutouts without central source predictions surpassing the confidence score of 0.05. Among these, 262 are incorporated through consolidation. As all the remaining 2 074 correspond to single-component islands in the Selavy catalogue as well, we include them as compact radio galaxies at the end of the consolidated catalogue. The catalogue comprises 201 211 compact radio galaxies and 10 414 extended radio galaxies. Among the extended radio galaxies, there are 582 FR-I, 5 602 FR-II, 1 494 FR-X, and 2 375 R radio galaxies as well as 361 Pec morphologies. Among the 0.9% (1 877) of galaxies with confidence scores below 0.5, the majority are associated with C (898) and FR-II (838) galaxies, as depicted in the lower panel of Fig. 9. This corresponds to only 0.05% of the total compact radio galaxies and 15% of the total FR-II galaxy predictions below a score of 0.5. Future investigations should focus on a more detailed examination of these FR-II galaxies with scores below 0.5. In our visual examinations of 100 systems classified as FR-II with scores below 0.5, we noted that, in several cases, their extensive bounding boxes encapsulate multiple compact radio galaxies, each with a higher score than the score assigned to the larger bounding box. As described in Section 4.5 and illustrated in Fig. 7, galaxies with higher scores are prioritised for consolidation in the catalogue. We observe that several bounding boxes classified as FR-II encompass adjacent compact radio galaxies rather than genuine FR-II galaxies; however, 30 out of 100 are indeed real FR-II galaxies. In Fig. 8, an instance of this situation is demonstrated, where the larger box prediction, with a score of 0.48, proves to be redundant. The presence of smaller boxes within suggests that these encompass three compact radio galaxies, each with higher scores, rather than indicating a single FR-II galaxy. Also after examining 100 randomly chosen predictions corresponding to 85% of FR-II classifications with scores above 0.5, we found that 90 of them are indeed authentic FR-II galaxies. However, 9 are FR-I or FR-X and one bounding box encompasses multiple compact radio sources. This indicates that the classification of FR-II galaxies can be relied upon for specific tasks, with more reliable classifications having scores above 0.5. It is crucial to emphasise that this does not impact the overall catalogue, as bounding boxes for low-scoring FR-II galaxies often have high-scoring compact radio galaxies, and these are already accounted for in the consolidated catalogue (as explained in Section 4.5). The lower panel of Fig. 9 illustrates the distribution of galaxies across the six prediction class types. Following this, we identify 200 313 compact radio galaxies, 576 FR-I galaxies, 4,764 FR-II galaxies, 1 425 FR-x galaxies, 2 335 R galaxies, and 335 Pec morphologies with model confidence scores above 0.5. Note that the accuracy of predictions relies on the appropriate selection of the confidence score, which should align with the specific requirements of the scientific case.

Figure 8. An example where the larger box prediction, scoring 0.48, is deemed redundant. The existence of smaller boxes within implies that these contain three compact radio galaxies, each with higher scores, rather than a single, bent FR-II radio galaxy within the larger bounding box.

Figure 9. The top panel depicts the distributions of all radio galaxies (solid bars) and subsets categorised by classification types (coloured contours) based on prediction confidence scores. The bottom panel displays the distribution across different types as predicted by the Gal-DINO model above and below scores of 0.5. Approximately 99.1% of the radio galaxies in our consolidated catalogue have a score larger than 0.5. Notably, among the galaxy types, FR-II exhibits the largest fraction of galaxies with scores below 0.5; for further insights, refer to Section 5.1.

The consolidated catalogue includes the following columns: EMU_PS_source_name (e.g. EMU PS J203428.9-583922Footnote c where ‘P’ stands for pilot survey and ‘S’ stands for source), derived from the RA and Dec of the consolidated radio galaxies. The RA (EMU_PS_ra_deg) and Dec (EMU_PS_dec_deg) represent the detected keypoint positions. It is important to note that we use keypoint positions for RA and Dec, rather than the position of the peak radio emission inside the predicted bounding box encompassing extended radio galaxies. This choice is made to avoid using the positions of radio lobes for RA and Dec, as they can have higher flux compared to the central radio source. As keypoint detection is performed on cutouts with 7 radio and 1 infrared channels, as done for locating bounding boxes. Consequently, the keypoints denote positions where distinctive features in both radio and infrared images contribute to their predictive capabilities. While keypoints in the training data correspond to known CatWISE sources, typically situated near the central core or the ridge of collimated radio emission connecting the lobes, the model predictions rely on features extracted from both radio and infrared images. In scenarios where no infrared emission is present near the radio components, the model tends to predict a keypoint near the central radio peak or the ridge, rather than artificially placing it randomly in the infrared image. Therefore, we utilise keypoint positions for RA and Dec instead of radio peak positions for extended radio galaxies. Despite there being only one radio peak for compact radio galaxies, for consistency, we use keypoint positions for both extended and compact radio galaxies.

The flux density columns, EMU_PS_flux_peak and EMU_PS_ flux_int, are sourced from the Selavy catalogue, where a signal-to-noise ratio of $\geq5$ is applied to select radio sources. The peak flux density is obtained from the maximum flux density of consolidated components, while the integrated flux density is the sum of these components. EMU_PS_img_id and EMU_PS_filename serve as identifiers and names for the cutouts from which the galaxy was consolidated into our catalogue. These identifiers correspond to component IDs in the Selavy catalogue, facilitating the viewing of consolidated galaxy cutouts. EMU_PS_radio_bbox provides the dimensions of the detected bounding box for the radio galaxy in pixels, including information about the minimum x and y coordinates, height, and width. The height and width, measured in pixels and parallel to RA and Dec, can be used to determine the total extent of the radio galaxies, with each pixel equivalent to 2 $^{\prime \prime}$ . Above an integrated flux of 10 mJy for extended radio galaxies, we measure the largest angular size (LAS) using the height and width, yielding a median LAS of $93.6^{\prime \prime}$ with a standard deviation of $50.4^{\prime \prime}$ . This is consistent with Williams et al. (Reference Williams2019), where employing the same flux threshold at 944 MHz (assuming a spectral index of -0.7) and selecting radio sources featuring multiple components, we find a median LAS of $99.6^{\prime \prime}$ with a standard deviation of $74.9^{\prime \prime}$ . EMU_PS_ra_cen_bbox and EMU_PS_dec_cen_bbox denote the box centroids in RA and Dec, serving as box positions for specific tests. EMU_PS_type and EMU_PS_score indicate the predicted classification types and confidence scores, respectively. Fig. 10 illustrates examples from the consolidated catalogue overlaid on EMU-PS radio images. It showcases RA and Dec positions with blue circles, extended radio galaxy bounding boxes with blue rectangles, and classification types with confidence scores for different radio galaxies exhibiting various extents. The last column, EMU_PS_n_selavy_components, provides information about the number of Selavy components for each consolidated radio galaxy. Table A1 displays the initial 60 rows of the catalogue, showcasing the EMU-PS fields.

Figure 10. Examples of the catalogue galaxies overlaid on radio images, showcasing their host positions, bounding boxes, and classifications derived from both the Gal-DINO and catalogue construction pipelines. Blue rectangles and accompanying text denote the bounding boxes and classification types with confidence scores for the extended radio galaxies. The positions of these extended radio galaxies are marked by blue circles. For brevity, we omit the presentation of bounding boxes for compact radio galaxies, which are solely indicated by blue circles.

5.2. Comparison to $\mathrm{Selavy}$

The compilation of the consolidated catalogue relies on the sources identified by Selavy. As mentioned in Section 2.1, the purpose of the Selavy algorithm is not to generate radio galaxy catalogues. In the Selavy process, pixels above a specific brightness threshold are initially detected and grouped into islands. Following this, components are fitted to each island, ultimately forming the component catalogue. Out of 198 216 Selavy islands, 178 921 are single-component ones, and the remaining 19 295 are multi-component with a total of 41 181 components. We record the selavy_island_id and selavy_component_id for each galaxy in the consolidated catalogue. As detailed in Section 5.1, the consolidated catalogue comprises 211 625 radio galaxies, representing an increase of 13 409 radio galaxies compared to the 198 216 islands in Selavy. Among these, the selavy_island_id in our consolidated catalogue consists of 204 533 single-component and 7 092 multi-component Selavy island IDs. This indicates the presence of 204 533 distinct islands, compared to the 178 921 islands with a single component in the consolidated catalogue. In essence, Selavy seems to underestimate the number of single-component islands by 12.5%. Regarding the 7 092 multi-component island IDs, 6 901 share the same set of 15 161 components as identified by Selavy, indicating correct associations for these islands. The remaining 191 multi-component island IDs have 407 components all from different Selavy islands. Overall, Selavy correctly associates multiple components for 6 901 out of 19 295 islands, with a correct association rate of 35.7% for the multi-component islands.

Furthermore, we incorporate additional attributes about the Selavy catalogue. Each Selavy component is uniquely identified by its assigned name, denoted as selavy_component_name. The celestial coordinates, namely Right Ascension (RA) and Declination (Dec), are encapsulated in the fields selavy_ra_deg_cont and selavy_dec_deg_cont, respectively. Information regarding the peak and integrated flux density of the radio signal emitted by these Selavy components can be found in selavy_flux_peak and selavy_flux_int. Additionally, the spectral index, characterising the frequency dependence of the radio emission, is detailed in selavy_spectral_index (see Figure 11 in Norris et al. Reference Norris2021a also).

Figure 11. Displayed are the numbers of DES, DESI, and SCos counterparts within 2 $^{\prime \prime}$ of the CatWISE position (top panel). The photometric redshift distributions are based on the DESI legacy surveys for compact and extended radio galaxy counterparts (bottom panel). Both these plots include galaxies with a Gal-DINO confidence score greater than 0.5.

5.3. Multiwavelength counterparts

We cross-identify radio galaxies in the consolidated catalogue with the infrared and optical catalogues. To do this, we first cross-match the EMU_PS_ra_deg and EMU_PS_dec_deg coordinates with the CatWISE catalogue. As explained in Section 5.1, these coordinates originate from the keypoint positions, and we do not utilise radio peak flux or bounding box centroids for this cross-identification. We use a 3 $^{\prime \prime}$ search radius around radio galaxies to identify counterparts. At this 3 $^{\prime \prime}$ search radius, the false identification rate is expected to be around 8% (see Figure 12 in Norris et al. Reference Norris2021a). Table 4 displays the number of radio galaxies within this search radius. We find that 73% of radio galaxies (including the 8% with possible false identifications) have counterparts in the CatWISE catalogue. Consequently, the consolidated catalogue lacks multiwavelength information for the remaining 27% of radio sources. Among the radio galaxies classified by the Gal-DINO model as compact and all extended, we find cross-matches for 74% and 44%, respectively. Future work should investigate whether a larger search radius could be more optimal for identifying counterparts for extended radio galaxies, aiming to increase completeness while keeping the false identification rate low. In another scenario, when looking only for counterparts of galaxies with a GalDINO confidence score greater than 0.5, we find similar percentages of cross-matches when applying a minimum score threshold of 0.33. Note that we have chosen this straightforward approach to identify radio galaxy counterparts, which yields acceptable false identification rates. Subsequent research should explore the cross-matching of radio galaxies by leveraging photometric properties, such as utilising stellar mass estimates for the matching process (see, e.g. Gupta et al. Reference Gupta2020), employing other likelihood ratio-based methods (e.g. McAlpine et al. Reference McAlpine, Smith, Jarvis, Bonfield and Fleuren2012; Weston et al. Reference Weston2018; Williams et al. Reference Williams2019; Gordon et al. Reference Gordon2023), filtering out contaminating stars using WISE colours or proper motion data and understanding the contamination in identification caused by gravitational lensing.

Figure 12. The distributions of multiwavelength counterparts for compact and extended radio galaxies, as well as FR-I- and FR-II-type radio galaxies, are shown in three panels. The top panel displays contours for radio luminosity and CatWISE magnitudes, the middle panel illustrates the infrared colour-colour plot for AllWISE counterparts, emphasising the dominance of AGN above an integrated radio flux of 5 mJy. The bottom panel exhibits the colour-colour plot for DES radio galaxy counterparts. All plots include galaxies with a Gal-DINO confidence score greater than 0.5, and contours describe 5, 25, 50, 75, and 95 percentile levels. Further details and discussion can be found in Section 5.3.

Next, we cross-match CatWISE sources that are within a 3 $^{\prime \prime}$ radius of the radio galaxies with the optical catalogues. We use photometric catalogues from the Dark Energy Survey Data Release 2 (DES DR2 Abbott et al. Reference Abbott2021), the Dark Energy Spectroscopic Instrument Legacy Survey catalogue from Data Release 8 (DESI DR8 Zou et al. Reference Zou, Gao, Zhou and Kong2019), and the WISE x SuperCOSMOS photometric catalogue (SCos Bilicki et al. Reference Bilicki2016) to cross-match CatWISE sources. Following (Figure 13 in Norris et al. Reference Norris2021a), we search for counterparts in these optical catalogues within a 2 $^{\prime \prime}$ radius of CatWISE, where the false identification rate is approximately 7%. Table 4 and the top panel of Fig. 11 show the number of CatWISE counterparts in DES, DESI, and SCos. The median separation distance for these galaxies with the CatWISE sources is found to be 0.34 $^{\prime \prime}$ , 0.27 $^{\prime \prime}$ , and 0.3 $^{\prime \prime}$ for DES, DESI, and SCos catalogues. We find that 65%, 36%, and 11% of CatWISE sources within a 3 $^{\prime \prime}$ radius of radio galaxies have counterparts in DES, DESI, and SCos catalogues, respectively. The same percentage of counterparts is found when a confidence score limit of 0.5 is applied. The bottom panel of Fig. 11 shows the number of compact (grey) and extended radio galaxy (green) counterparts with a score larger than 0.5 as a function of photometric redshift from the DESI legacy survey. The median redshift (z) for compact and extended radio galaxy counterparts is 0.46 and 0.53, respectively, with a maximum redshift of $1.79\pm0.28$ for compact and $1.26\pm0.22$ for extended. We find 4 525 compacts and 82 extended radio galaxy counterparts above a redshift of 1.

The three panels in Fig. 12 depict the relationships between compact (brown contours) and extended radio galaxies (blue contours) and their counterparts in infrared and optical, with Gal-DINO confidence scores larger than 0.5. In the top panel, radio luminosity contours are displayed at 5, 25, 50, 75, and 95 percentile levels relative to the CatWISE W1 magnitude. Extended radio galaxies exhibit higher radio luminosities, with a median of $10^{24.6} ~\rm W Hz^{-1}$ , compared to compact radio galaxies, which have a median of $10^{23.4} ~\rm W Hz^{-1}$ . This observation aligns with the findings in Figure 21 of Gordon et al. (Reference Gordon2023), indicating higher 3 GHz luminosities for extended sources compared to single-component ones. The CatWISE W1 magnitudes are similar, with median values of 15.2 and 15.4 for extended and compact radio galaxies, respectively. For a random sample of CatWISE sources, the median W1 magnitude is 16.8, underscoring that the infrared counterparts of radio galaxies are brighter than other infrared sources. Additionally, data points for FR-I- and FR-II-type radio galaxies are plotted, revealing overlap between the two populations in luminosity, consistent with the findings in Mingo et al. (Reference Mingo2019). The median luminosities for FR-I- and FR-II-type radio galaxies are $10^{24.6}$ and $10^{24.3} ~\rm W Hz^{-1}$ , respectively, while the median W1 magnitudes are both 15.2.

Table 4. Number of CatWISE, DES, DESI and SCos counterparts for radio galaxies. The cross-matching process involves CatWISE sources matched with radio galaxies within a 3 $^{\prime \prime}$ search radius while cross-matching with DES, DESI, and SCos catalogues utilises these CatWISE sources within a 2 $^{\prime \prime}$ radius. The upper and lower section of the table presents the number of galaxies and the percentage of cross-matches with confidence scores exceeding 0.33 and 0.5, respectively. The numbers and percentages refer to the sources remaining after applying all of the previous criteria. The columns display these statistics for all, compact, and extended radio galaxies.

The middle panel of Fig. 12 shows contours on the AllWISE W1-W2 and W2-W3 colour-colour diagram for compact and extended radio galaxies. The AllWISE counterparts are selected as the closest CatWISE counterparts of radio galaxies with a maximum separation of $1^{\prime \prime}$ . The plot displays contours at 5, 25, 50, 75, and 95 percentile levels for compact and extended radio galaxy counterparts. Following the criteria outlined in Mingo et al. (Reference Mingo2016) for identifying infrared host galaxies, we find a 63% and 58% AGN population (W1-W2>0.5) when a radio galaxy flux limit of 5 mJy is applied and AllWISE magnitudes above a signal-to-ratio larger than 3 are chosen. Applying the same flux and magnitude cuts, passive elliptical galaxies (W1-W2<0.5 and W2-W3<1.6) serve as hosts for 5% and 16% of compact and extended radio galaxies, respectively. Star-forming galaxies (W1-W2<0.5 and 1.6<W2-W3<3.4) host 10% and 12% of compact and extended radio galaxies, respectively. Ultra-luminous infrared galaxies are hosts to 14% and 22% of compact and extended radio galaxies. These proportions align with Gordon et al. (Reference Gordon2023), who find a majority of AGN hosts for compact and extended radio galaxies in the 3 GHz VLASS survey. Additionally, data points for FR-I- and FR-II-type radio galaxies are shown, indicating substantial overlap between the two categories, consistent with the findings of Mingo et al. (Reference Mingo2019).

The bottom panel of Fig. 12 illustrates the colour-colour diagram (g-r) versus (r-i) for DES counterparts. The plot depicts distributions for compact and extended radio galaxy counterparts with contours at 5, 25, 50, 75, and 95 percentile levels. The hosts exhibit consistent colours, with median values for (g-r, r-i) of (0.94, 0.74) and (1.2, 0.8) for compact and extended radio galaxies, respectively. For a random sample of DES sources, the median (g-r, r-i) colours are 3.7 and 1, indicating that optical hosts for radio galaxies have distinct colours compared to other optical sources. As before, we also plot data points for FR-I- and FR-II-type radio galaxies, revealing significant overlap in their host galaxy colours.

6. Summary

With ongoing and upcoming experiments poised to identify tens of millions of radio galaxies, it becomes crucial to develop more efficient pipelines for constructing radio galaxy catalogues. This is essential to fully harness the potential insights offered by these observations and reduce the considerable human effort currently required. In this study, we introduce a comprehensive detection pipeline designed to build radio galaxy catalogues through the application of cutting-edge computer vision algorithms. The pipeline employs a two-step process: initially detecting compact and extended radio galaxies along with their potential host galaxies in both radio and infrared images using computer vision networks, and subsequently utilising the network predictions in the second stage to construct a catalogue. Leveraging this detection and catalogue construction pipeline, we create the first catalogue of radio galaxies for the pilot survey of the Evolutionary Map of the Universe (EMU-PS), conducted with the Australian Square Kilometer Array Pathfinder (ASKAP) telescope.

In developing the detection pipeline, we build upon the GalDINO network initially introduced in RadioGalaxyNET by Gupta et al. (Reference Gupta, Hayder, Norris, Huynh and Petersson2024) for radio galaxy detection. Expanding on the 2 800 FR-I, FR-II, FR-x, and R-type radio galaxies present in RadioGalaxyNET, we conduct visual inspections and label 2 090 compact radio galaxies and 99 sources with peculiar and other rare radio galaxies. This results in a dataset comprising approximately 5 000 radio galaxies, along with their corresponding infrared counterparts, utilised for training and evaluating the network. The network is specifically trained to predict both the categories and bounding boxes for the radio galaxies, as well as the keypoint positions of their potential infrared host galaxies. Evaluation metrics are based on the Average Precision (AP) at a specified IoU concerning the ground truth. We achieve an AP $_{50}$ of 73.2% for the radio galaxy bounding box predictions and 71.7% for the infrared host keypoint positions when considering the combined validation and test datasets. These results encompass predictions for all radio galaxy instances in each image. However, focusing solely on the radio galaxy located in the central part of the image reveals that 99% of the central radio galaxy predictions exhibit an IoU above 0.5 with respect to the ground-truth bounding box in the evaluation set. For keypoint detection in the evaluation set, we observe that 98% of keypoint locations have $<3^{\prime \prime}$ separation from the ground-truth CatWISE infrared galaxy. This observation suggests a strong alignment between the predicted bounding boxes and keypoints with the ground truth for the majority of the radio galaxies.

The pipeline for constructing the catalogue utilises predictions from the Gal-DINO model for radio and infrared cutouts generated using the Selvay-based components catalogue. This process results in 220 102 cutouts, for which bounding box, category, and keypoint predictions for the central source in the image are employed to assemble the consolidated catalogue. Catalogue construction relies on the prediction confidence scores from the GalDINO network. For the 220 102 cutouts, the central source with the highest score is added to the consolidated catalogue first and is subsequently removed from the Selavy catalogue. In the case of an extended radio galaxy, multiple components are grouped and added to the consolidated catalogue, while for a compact radio galaxy, one component is added. The final catalogue comprises a total of 211 625 radio galaxies, including 201 211 compact radio galaxies and 10 414 extended radio galaxies. Among the extended radio galaxies, 582 are classified as FR-I, 5 602 as FR-II, 1 494 as FR-x, and 2 375 as R radio galaxies and 361 as Pec morphologies. Along with the GalDINO predictions, encompassing RA, DEC, peak flux density, integrated flux density, galaxy category, confidence score, etc., we also include corresponding Selavy component identifiers in the catalogue.

Subsequently, a cross-matching process is employed to match radio galaxies with infrared (CatWISE) and optical catalogues (DES DR2, DESI Legacy Surveys DR8 and SuperCosmos) to identify their multi-wavelength counterparts. We observe that 75% of the radio galaxies have counterparts in the CatWISE catalogue within a 3 $^{\prime \prime}$ search radius. Further cross-matching of these CatWISE counterparts with optical catalogues, using a 2 $^{\prime \prime}$ search radius, reveals that 64%, 37%, and 11% of radio galaxies are matched with DES, DESI, and SCos catalogues, respectively. The photometric redshifts obtained from the DESI legacy survey indicate a median redshift of 0.46 and 0.53 for compact and extended radio galaxies with a confidence score above 0.5. The farthest compact and extended radio galaxies are found at redshifts of 1.79 and 1.26, respectively. Future research should concentrate on devising improved cross-matching methodologies to identify infrared and optical counterparts for the potential host positions detected by the computer vision pipeline. Subsequent research efforts should leverage the EMU-PS catalogue and the computer vision methodologies developed for cataloguing radio galaxies and their potential infrared hosts in ongoing and future radio surveys. The existence of this catalogue will enable the advancement of more sophisticated ML techniques tailored for the detection of radio galaxies in the next generation of radio surveys, including the EMU main survey. To augment the variety of training data across the entirety of the southern sky, future studies should investigate the potential integration of active learning with a human-in-the-loop approach for the EMU main survey. This exploration should aim to enhance the diversity and quality of training data, contributing to the improvement of applied ML methodologies.

Acknowledgements

We thank Ian Smail for his kind remarks on the draft and Matt Whitting for clarifying the purpose of the Selavy source finder. The Australian SKA Pathfinder is part of the Australia Telescope National Facility, which is managed by CSIRO. The operation of ASKAP is funded by the Australian Government with support from the National Collaborative Research Infrastructure Strategy. ASKAP uses the resources of the Pawsey Supercomputing Centre. The establishment of ASKAP, the Murchison Radio-astronomy Observatory and the Pawsey Supercomputing Centre are initiatives of the Australian Government, with support from the Government of Western Australia and the Science and Industry Endowment Fund. We acknowledge the Wajarri Yamatji people as the traditional owners of the Observatory site. This publication makes use of data products from the Wide-field Infrared Survey Explorer, which is a joint project of the University of California, Los Angeles, and the Jet Propulsion Laboratory/California Institute of Technology, funded by the National Aeronautics and Space Administration. The DESI Legacy Survey catalogues used in this paper were produced thanks to funding from the U.S. Department of Energy Office of Science and Office of High Energy Physics via grant DE-SC0007914. The DES data management system is supported by the National Science Foundation under Grant Numbers AST-1138766 and AST-1536171. NG acknowledges support from CSIRO’s Machine Learning and Artificial Intelligence Future Science Impossible Without You (MLAI FSP IWY) Platform.

Data availability

The catalogue is accessible at https://data.csiro.au/collection/62055,Footnote d and the Gal-DINO network can be found at https://github.com/Nikhel1/Gal-DINO.

Appendix

Table A1. First few rows of the consolidated catalogue, excluding the remaining columns described in Table 3 for brevity.

Footnotes

^a https://www.skatelescope.org/the-ska-project/.

^b https://github.com/Nikhel1/Gal-DINO.

^c Nomenclature criterion details: https://cdsarc.cds.unistra.fr/viz-bin/Dic?/4848739.

^d If access is restricted due to an embargo period, please contact the lead author for a copy.

References

Abbott, T. M. C., et al. 2021, ApJS, 255, 20CrossRef Google Scholar

Alger, M. J., et al. 2018, MNRAS, 478, 5547CrossRef Google Scholar

Banfield, J. K., et al. 2015, MNRAS, 453, 2326Google Scholar

Becker, B., Vaccari, M., Prescott, M., & Grobler, T. 2021, MNRAS, 503, 1828CrossRef Google Scholar

Bilicki, M., et al. 2016, ApJS, 225, 5CrossRef Google Scholar

Bowles, M., Scaife, A. M. M., Porter, F., Tang, H., & Bastien, D. J. 2020, MNRAS, 501, 4579 CrossRef Google Scholar

Brand, K., et al. 2023, MNRAS, 522, 292CrossRef Google Scholar

Carion, N., et al. 2020, in Computer Vision–ECCV 2020: 16th European Conference, Glasgow, UK, August 23–28, 2020, Proceedings, Part I 16 (Springer), 213CrossRef Google Scholar

Comrie, A., et al. 2021, CARTA: The Cube Analysis and Rendering Tool for Astronomy (Zenodo), doi: 10.5281/zenodo.3377984CrossRef Google Scholar

Condon, J. J., et al. 1998, AJ, 115, 1693CrossRef Google Scholar

Cutri, R. M., et al. 2013, VizieR Online Data Catalog, II/328Google Scholar

DeBoer, D. R., et al. 2009, IEEE Proc., 97, 1507CrossRef Google Scholar

Fanaroff, B. L., & Riley, J. M. 1974, MNRAS, 167, 31PCrossRef Google Scholar

Galvin, T. J., et al. 2020, MNRAS, 497, 2730CrossRef Google Scholar

Gordon, Y. A., et al. 2023, ApJS, 267, 37CrossRef Google Scholar

Gupta, K., et al. 2021, in International Conference on Learning RepresentationsGoogle Scholar

Gupta, N., Hayder, Z., Norris, R., Huynh, M., & Petersson, L. 2023a, Radio Galaxy Detection - Computer Vision Algorithms, Software Collection, doi: http://hdl.handle.net/102.100.100/602494?index=1Google Scholar

Gupta, N., Hayder, Z., Norris, R. P., Huynh, M., & Petersson, L. 2024, PASA, 41, e001 CrossRef Google Scholar

Gupta, N., et al. 2022, PASA, 39, e051Google Scholar

Gupta, N., Norris, R., Huynh, M., Hayder, Z., & Petersson, L. 2023b, RadioGalaxyNET Dataset - Extended Radio Galaxies, Data Collection, doi: 10.25919/btk3-vx79CrossRef Google Scholar

Gupta, N., et al. 2020, MNRAS, 494, 1705Google Scholar

Gupta, N., et al. 2023, PASA, 40, e044Google Scholar

Hancock, P. J., Trott, C. M., & Hurley-Walker, N. 2018, PASA, 35, e011CrossRef Google Scholar

Hay, S., O’Sullivan, J., Kot, J., & Granet, C. 2006, in ESA Special Publication, Vol. 626, The European Conference on Antennas and Propagation: EuCAP 2006, ed. Lacoste, H., & Ouwehand, L., 663Google Scholar

He, K., Zhang, X., Ren, S., & Sun, J. 2015, arXiv e-prints, arXiv:1512.03385Google Scholar

Hotan, A. W., et al. 2021, PASA, 38, e009Google Scholar

Johnston, S., et al. 2007, PASA, 24, 174CrossRef Google Scholar

Jonas, J., & MeerKAT Team. 2016, in MeerKAT Science: On the Pathway to the SKA, 1Google Scholar

Lacy, M., et al. 2020, PASP, 132, 035001Google Scholar

Lang, D. 2014, AJ, 147, 108CrossRef Google Scholar

Lao, B., et al. 2023, A&C, 44, 100728CrossRef Google Scholar

Lin, T.-Y., et al. 2014a, in Computer Vision – ECCV 2014, ed. Fleet, D., Pajdla, T., Schiele, B., & Tuytelaars, T. (Cham: Springer International Publishing), 740Google Scholar

Lin, T.-Y., et al. 2014b, in Computer Vision – ECCV 2014, ed. Fleet, D., Pajdla, T., Schiele, B., & Tuytelaars, T. (Cham: Springer International Publishing), 740Google Scholar

Lukic, V., et al. 2018, MNRAS, 476, 246CrossRef Google Scholar

Marocco, F., et al. 2021, ApJS, 253, 8CrossRef Google Scholar

Maslej-Krešňáková, V., El Bouchefry, K., & Butka, P. 2021, MNRAS, 505, 1464CrossRef Google Scholar

McAlpine, K., Smith, D. J. B., Jarvis, M. J., Bonfield, D. G., & Fleuren, S. 2012, MNRAS, 423, 132CrossRef Google Scholar

McConnell, D., et al. 2020, PASA, 37, e048CrossRef Google Scholar

Miley, G. 1980, ARA&A, 18, 165CrossRef Google Scholar

Mingo, B., et al. 2016, MNRAS, 462, 2631CrossRef Google Scholar

Mingo, B., et al. 2019, MNRAS, 488, 2701Google Scholar

Mostert, R. I. J., et al. 2021, A&A, 645, A89CrossRef Google Scholar

Norris, R. P. 2011, JApA, 32, 599CrossRef Google Scholar

Norris, R. P. 2017, NatAs, 1, 671CrossRef Google Scholar

Norris, R. P., et al. 2021a, PASA, 38, e046Google Scholar

Norris, R. P., et al. 2021b, PASA, 38, e003Google Scholar

Perley, R. A., Chandler, C. J., Butler, B. J., & Wrobel, J. M. 2011, 739, L1CrossRef Google Scholar

Ren, S., He, K., Girshick, R., & Sun, J. 2015, Advances in Neural Information Processing Systems, 28Google Scholar

Riggi, S., et al. 2023, A&C, 42, 100682CrossRef Google Scholar

Slijepcevic, I. V., et al. 2022, MNRAS, 514, 2599 CrossRef Google Scholar

Sortino, R., et al. 2023, ExA, 56, 293CrossRef Google Scholar

Tian, Z., Shen, C., Chen, H., & He, T. 2019, in Proceedings of the IEEE/CVF International Conference on Computer Vision, 9627Google Scholar

Turner, R. J., Rogers, J. G., Shabala, S. S., & Krause, M. G. H. 2018, MNRAS, 473, 4179CrossRef Google Scholar

van Haarlem, M. P., et al. 2013, A&A, 556, A2Google Scholar

Vaswani, A., et al. 2017, Advances in Neural Information Processing Systems, 30Google Scholar

Wayth, R. B., et al. 2018, PASA, 35, e033Google Scholar

Weston, S. D., et al. 2018, MNRAS, 473, 4523CrossRef Google Scholar

Whiting, M., & Humphreys, B. 2012, PASA, 29, 371CrossRef Google Scholar

Whiting, M., Voronkov, M., Mitchell, D., & Team, Askap. 2017, in Astronomical Society of the Pacific Conference Series, Vol. 512, Astronomical Data Analysis Software and Systems XXV, ed. Lorente, N. P. F., Shortridge, K., & Wayth, R., 431Google Scholar

Williams, W. L., et al. 2019, A&A, 622, A2Google Scholar

Wright, E. L., et al. 2010, AJ, 140, 1868Google Scholar

Wu, C., et al. 2019, MNRAS, 482, 1211Google Scholar

Zhang, H., et al. 2023, in The Eleventh International Conference on Learning RepresentationsGoogle Scholar

Zou, H., Gao, J., Zhou, X., & Kong, X. 2019, ApJS, 242, 8CrossRef Google Scholar

Figure 1. Examples of the radio (left panels) and corresponding infrared (right panels) images, as described by the column titles. Each of these images has a frame size of $8^{\prime} \times 8^{\prime}$ in the sky ($240 \times 240$ pixels). In the radio images, we display classes and bounding boxes for radio galaxies encapsulating all their components. Here, the ‘FR-X’ type is positioned between the FR-I and FR-II categories, ‘R’ denotes resolved radio sources with one visible peak, ‘C’ represents compact unresolved radio sources, and ‘Pec’ refers to peculiar or other rare radio morphologies (see Section 2.4 for details). On the infrared images, circles indicate the positions of host galaxies.

Table 1. Results for bounding box and keypoint detection using the trained Gal-DINO network are presented on a combination of the test and validation datasets with 8-channel images (see Fig. 3). The columns, from left to right, showcase various metric types: average precision for IoU (or OKS) thresholds ranging from 0.50 to 0.95 (AP), a specific IoU (or OKS) threshold of 0.5 (AP$_{50}$), IoU (or OKS) threshold of 0.75 (AP$_{75}$), and average precision for small-sized (AP$_{\mathrm{S}}$), medium-sized (AP$_{\mathrm{M}}$), and large-sized (AP$_{\mathrm{L}}$) radio galaxies. Further details on the training and evaluation can be found in Sections 3.3 and 3.4, respectively.

Figure 5. An overview of the catalogue construction pipeline. The process initiates with obtaining predictions from the Gal-DINO model for all radio and infrared cutouts centred at the components in the Selavy catalogue. Subsequently, a dictionary of predictions is generated for the central sources within these cutouts. The consolidated catalogue is then formed by calibrating the confidence scores in the dictionary, organising them in descending order, and systematically consolidating and removing entries from the Selavy catalogue based on decreasing score values. We refer the readers to Section 4 for further details.

Figure 7. Shown is a potential scenario in which an extended radio galaxy is accompanied by two adjacent compact radio galaxies. Blue circles represent all Selavy catalogue components. The extended radio galaxy has a confidence score of 0.5, but the two compact radio galaxies with higher scores (0.8 and 0.6) are consolidated first, removing them from the Selavy catalogue. Subsequently, only the three remaining components of the extended radio galaxy, encompassed by the biggest bounding box with a score of 0.5, are consolidated. As a result, the consolidated catalogue registers three radio galaxies in this particular case.

Table 3. Description of columns in the catalogue (best viewed in a PDF reader).

Figure 11. Displayed are the numbers of DES, DESI, and SCos counterparts within 2$^{\prime \prime}$ of the CatWISE position (top panel). The photometric redshift distributions are based on the DESI legacy surveys for compact and extended radio galaxy counterparts (bottom panel). Both these plots include galaxies with a Gal-DINO confidence score greater than 0.5.

Table 4. Number of CatWISE, DES, DESI and SCos counterparts for radio galaxies. The cross-matching process involves CatWISE sources matched with radio galaxies within a 3$^{\prime \prime}$ search radius while cross-matching with DES, DESI, and SCos catalogues utilises these CatWISE sources within a 2$^{\prime \prime}$ radius. The upper and lower section of the table presents the number of galaxies and the percentage of cross-matches with confidence scores exceeding 0.33 and 0.5, respectively. The numbers and percentages refer to the sources remaining after applying all of the previous criteria. The columns display these statistics for all, compact, and extended radio galaxies.

Table A1. First few rows of the consolidated catalogue, excluding the remaining columns described in Table 3 for brevity.

Article contents

RG-CAT: Detection pipeline and catalogue of radio galaxies in the EMU pilot survey

Abstract

Keywords

1. Introduction

2. Data

2.1. ASKAP observations

2.2. Infrared observations

2.3. Radio and infrared images

2.4. Radio and infrared annotations

2.5. Radio source statistics

3. Detection pipeline for radio galaxies

3.1. Radio galaxy class and bounding box predictions

3.2. Infrared host galaxy keypoint prediction

3.3. Network training

3.4. Evaluation metrics

3.5. Model evaluation results

4. Catalogue construction pipeline

4.1. Images for catalogue construction

4.2. Model predictions for images

4.3. Predictions for central sources in images

4.4. Cataloguing galaxies based on detection scores

4.5. Predictions for large extended radio galaxies

5. Catalogue descprition

5.1. Consolidated catalogue

5.2. Comparison to $\mathrm{Selavy}$

5.3. Multiwavelength counterparts

6. Summary

Acknowledgements

Data availability

Appendix

Footnotes

References

Save article to Kindle

Save article to Dropbox

Save article to Google Drive

Reply to: Submit a response

Your details

You have entered the maximum number of contributors

Conflicting interests