Genomic selection (GS) is becoming the state-of-the-art selection technology in many breeding programmes around the world. GS relies on direct genomic values (DGV), also often termed genomic estimated breeding values, that are estimated from a reference or training population with known genotypes and phenotypes. The process of estimating DGV is known as genomic prediction. Two main factors that determine the accuracy of GS are the size of the reference population (Meuwissen et al., Reference Meuwissen, Hayes and Goddard2001) and the relationship between the reference population and the predicted animals (Pszczola et al., Reference Pszczola, Strabel, Mulder and Calus2012). It has been shown that there is an interaction between both factors; when the reference population is small it is especially important to achieve a high relationship between the reference population and the predicted animals (Wientjes et al., Reference Wientjes, Veerkamp and Calus2013).
Obtaining a sufficiently large reference population, typically comprising at least a few thousand animals, may be a challenge for numerically small populations and for breeding programmes with limited resources. Centred around the topic of improving the accuracy of GS for small populations, seven papers are presented here that build on presentations during the 10th World Conference of Genetics Applied to Livestock Production in August 2014 in Vancouver, Canada.
The first paper provides a review of theory and practical results on strategies and methods to increase the accuracies of DGV in numerically small dairy populations, by increasing the reference population by adding animals from other reference populations (Lund et al., Reference Lund, van den Berg, Ma, Brøndum and Su2016). When added animals originated from the same breed, considerable gains can be expected and the most promising strategy was reported to be increasing the relationship between genotypes of predicted animals and phenotypes in the reference populations. When added animals originated from a different breed, expected gains diminish when the added breed becomes more distantly related and the most promising strategy seems to be to gear models and strategies to focus genomic predictions on markers closer to the causative variants. The second paper extends on this by reviewing strategies to develop sufficiently accurate genomic predictions for hard to measure traits in dairy cattle that typically also are faced with limited number of phenotypes (Schöpke and Swalve, Reference Schöpke and Swalve2016). In particular, the potential benefit of contract herds that employ extensive phenotyping is emphasized.
In dairy cattle, one commonly envisaged way to increase the size of the reference population is to extend common bull reference populations with genotyped cows. Koivula et al. (Reference Koivula, Strandén, Aamand and Mäntysaari2016) showed a 2% to 4% unit increase in reliability for production traits, when adding up to 7143 genotyped cows to a reference population comprising of 4413 bulls. As indicated by Koivula et al. (Reference Koivula, Strandén, Aamand and Mäntysaari2016), it should be avoided to only add genotyped bull dams and their genotyped daughters, because this represents a selected group of animals and created some bias in their single-step evaluation.
International exchange of information has been common practice through Interbull for several decades. The relative importance of international information was often limited when evaluations relied on pedigree information, owing to limited connectedness between different (groups of) countries. With GS, genetic connectedness is much less of an issue, because in principle all animals within a breed can have some genomic relationship with each other. To capitalize on this, Su et al. (Reference Su, Ma, Nielsen, Aamand, Wiggans, Guldbrandtsen and Lund2016) investigated the potential to improve genomic predictions in Danish Jerseys relying on 1050 to 1250 Danish bulls, by adding 1150 US Jersey bulls, 4800 Danish Jersey cows or both. On average, adding the Danish Jersey cows was slightly more advantageous than adding the US Jersey bulls. As expected, adding both additional sources of information was the best strategy.
Updating the reference population in time is very important to keep the accuracy at a constant level across generations. In the context of performing GS for a hard to measure trait, relying on a cow reference population, Pszczola and Calus (Reference Pszczola and Calus2016) investigate the required number of cows that should be added every generation to keep the accuracy constant. When starting with 2000 cows in the initial reference population, on average 600 cows needed to be added per generation to achieve this. Of course GS is also promising for other livestock species than dairy cattle, although the economic return on GS may be more limited, because the relative benefit compared with pedigree-based selection is smaller, and therefore does not justify the costs to compile a large reference population. Shumbusho et al. (Reference Shumbusho, Raoul, Astruc, Palhiere, Lemarié, Fugeray-Scarbel and Elsen2016) investigated the potential of GS in the meat sheep industry, considering different combinations of source of information used to select males. Assuming that an initial reference population was already available, carefully chosen GS strategies were able to yield up to 15% higher response than traditional selection, despite the higher costs of the selection process. Lillehammer et al. (Reference Lillehammer, Sonesson and Meuwissen2016) investigated the impact of GS in a pig breeding scheme for a trait not measured on the nucleus animals. Across several considered strategies, the genetic gain was not increased with GS if the trait had an economic value of 10% of the breeding goal, but was increased if this was 30%. In conclusion, the studies published in this issue collectively show that for scenarios where the size of reference populations is limited, there are still several solutions to achieve acceptable levels of accuracy, and thereby provide opportunities to improve accuracy of GS in small populations in practice.