OneMap: software for genetic mapping in outcrossing species
Abstract
OneMap is an environment for constructing linkage maps of outcrossing plant species, using full‐sib families derived from two outbred parents. The analyses are performed using a novel methodology based on the maximum likelihood approach for simultaneous estimation of linkage and linkage phases (Wu et al. 2002), which has been successfully applied to sugarcane (Garcia et al. 2006). It is implemented as a set of functions for the freely distributed software R, and handles pairwise marker analysis, marker ordering and map refinement. The software is freely available at http://www.ciagri.usp.br/∼aafgarci/OneMap/.
Genetic maps are of great use to breeding programs, because they allow the localization of QTL's, i.e. genome regions containing loci affecting a quantitative trait (Doerge 2002). For many species, due to their allogamous nature, it is difficult to obtain homozygous inbred lines, and therefore the traditional approaches used to construct linkage maps for F2 and backcross populations cannot be used (Grattapaglia and Sederoff 1994).
Traditionally, the construction of genetic maps in F1 populations has been achieved using the double “pseudo‐testcross” strategy (Grattapaglia and Sederoff 1994), in which markers segregating in a 1:1 ratio (testcross configuration) are analyzed with traditional statistical approaches and softwares, such as MapMaker/EXP (Lander et al. 1987). However, these are not suitable for markers with different segregation patterns and have the disadvantage of resulting in two separate linkage maps, one for each parent. Presently, the most widely available software for genetic mapping in F1 populations is JoinMap (Stam 1993). It handles the analysis of markers showing different segregation patterns in both parents. However, the methods used for marker ordering in linkage groups and recombination fraction estimation are based on minimum squares approximations for multipoint estimates of distances, and not on the maximum likelihood approach, which has several advantages (Wu et al. 2002). Recently, Garcia et al. (2006) compared the results of analysis from JoinMap and using Wu's methodology for sugarcane mapping, and found better results for the latter. Actually, it was the sugarcane mapping that led us to the development of OneMap.
OneMap software was developed to facilitate the linkage analyses in outcrossing species, using the methodology proposed by Wu et al. (2002). It is implemented as a set of functions for the freely distributed software R (Ihaka and Gentleman 1996). Algorithms were coded in C language, while the data manipulation functions were coded directly in the R language. It is currently available as source code for Windows and Unix.
Linkage and linkage phase estimation
The core part of the software is the simultaneous maximum likelihood estimation of linkage and linkage phases between markers, according to Wu et al. (2002). This methodology allows the analysis of a mixed set of different marker types containing various segregation patterns, such as 1:1:1:1, 1:2:1, 3:1 and 1:1. OneMap is capable of handling all possible cross types and, therefore, any data set can be analyzed, besides the more complex F1 populations derived from two outbred parent plants.
To our knowledge, no other software has been implemented to handle these analyses. Thus, although being very promising, this method has currently been underused by geneticists.
Current Features
The first version of OneMap covers various steps from genotype data reading to genetic map refinement. There are functions to perform the basic two‐point analysis between markers via the EM algorithm (Dempster et al. 1977) and, based on its results, assign markers to different linkage groups.
Next, marker ordering in linkage groups is done using the Rapid Chain Delineation algorithm (Doerge 1996), which is not computationally intensive and only requires a matrix with recombination fractions between markers. Finally, the more informative multipoint (three‐point) analysis may be used to check the local order of markers in a mapped linkage group and also to refine the map distance between adjacent markers.
There are also some data manipulation functions that make OneMap usage more flexible. The criteria used in all steps by the software are completely user‐defined and it is assumed that the user has some experience with genetic mapping and the R language. Nonetheless, there is also a tutorial available, intended to get users started with OneMap. It comes with a hypothetical simulated data set, used as an example to illustrate the usage of OneMap functions.
Perspectives
OneMap is free software and can be changed and redistributed by any user under the regulations of the GNU General Public License (Free Software Foundation 1991). Furthermore, it will be continuously updated and improved. Currently, work is being conducted in order to make available another ordering algorithm, called Seriation (Buetow and Chakravarti 1987). Besides that, the refinement algorithm Ripple (Lander et al. 1987) will also be implemented and released.
When the mapped markers are subjected to three‐point analysis, two estimates of the recombination fraction for the same interval are obtained. These estimates can be combined using a weighted mean, with the weights being the reciprocals of the variances of the two separate estimates (Ridout et al. 1998, Wu et al. 2002). This feature is already under development and is expected to be available soon.
In the future, we expect to release OneMap as an add‐on package to R, including help files, new functions for graphic representation of the results and more example data sets. In this sense, critics and suggestions are welcome.
References
Citing Literature
Number of times cited according to CrossRef: 19
- Jérôme Bartholomé, André Mabiala, Régis Burlett, Didier Bert, Jean‐Charles Leplé, Christophe Plomion, Jean‐Marc Gion, The pulse of the tree is under genetic control: eucalyptus as a case study, The Plant Journal, 10.1111/tpj.14734, 103, 1, (338-356), (2020).
- Matthew M. Marshall, David L. Remington, Elizabeth P. Lacey, Two reproductive traits show contrasting genetic architectures in Plantago lanceolata, Molecular Ecology, 10.1111/mec.15320, 29, 2, (272-291), (2019).
- Forest M. Dussault, Keng Pee Ang, James. A. K. Elliott, Brian D. Glebe, Steven Leadbeater, Anthony J. Manning, Frank Powell, Elizabeth Grace Boulding, Mapping quantitative trait loci for infectious salmon anaemia resistance in a North American strain of Atlantic salmon, Aquaculture Research, 10.1111/are.14349, 51, 1, (80-90), (2019).
- Jun Zhang, Fengqi Zhang, Baojun Tang, Yong Ding, Laikun Xia, Jianshuang Qi, Xinyuan Mu, Limin Gu, Daowen Lu, Yanhui Chen, Molecular mapping of quantitative trait loci for grain moisture at harvest and field grain drying rate in maize ( L.), Physiologia Plantarum, 10.1111/ppl.13048, 169, 1, (64-72), (2019).
- Wenbo Chen, Xiaowei Yang, Guillaume Tetreau, Xiaozhao Song, Cathy Coutu, Dwayne Hegedus, Gary Blissard, Zhangjun Fei, Ping Wang, A high‐quality chromosome‐level genome assembly of a generalist herbivore, Trichoplusia ni, Molecular Ecology Resources, 10.1111/1755-0998.12966, 19, 2, (485-496), (2019).
- Songtao Gui, Jing Peng, Xiaolei Wang, Zhihua Wu, Rui Cao, Jérôme Salse, Hongyuan Zhang, Zhixuan Zhu, Qiuju Xia, Zhiwu Quan, Liping Shu, Wedong Ke, Yi Ding, Improving Nelumbo nucifera genome assemblies using high‐resolution genetic maps and BioNano genome mapping reveals ancient chromosome rearrangements, The Plant Journal, 10.1111/tpj.13894, 94, 4, (721-734), (2018).
- Marcial Escudero, Marlene Hahn, Andrew L. Hipp, RAD‐seq linkage mapping and patterns of segregation distortion in sedges: meiosis as a driver of karyotypic evolution in organisms with holocentric chromosomes, Journal of Evolutionary Biology, 10.1111/jeb.13267, 31, 6, (833-843), (2018).
- Miyako Kodama, Kerry A. Naish, Robert H. Devlin, Influence of a growth hormone transgene on the genetic architecture of growth‐related traits: A comparative analysis between transgenic and wild‐type coho salmon, Evolutionary Applications, 10.1111/eva.12692, 11, 10, (1886-1900), (2018).
- Miaomiao Zhang, Wenhao Bo, Fang Xu, Huan Li, Meixia Ye, Libo Jiang, Chaozhong Shi, Yaru Fu, Guomiao Zhao, Yuejiao Huang, Kirk Gosik, Dan Liang, Rongling Wu, The genetic architecture of shoot–root covariation during seedling emergence of a desert tree, Populus euphratica, The Plant Journal, 10.1111/tpj.13518, 90, 5, (918-928), (2017).
- Yun Li, Shikai Liu, Ruijia Wang, Zhenkui Qin, Zhanjiang Liu, Bioinformatics Considerations and Approaches for High‐Density Linkage Mapping in Aquaculture, Bioinformatics in Aquaculture, 10.1002/9781118782392, (356-379), (2017).
- Briana E. Mittleman, Brenda Manzano‐Winkler, Julianne B. Hall, Katharine L. Korunes, Mohamed A. F. Noor, The large X‐effect on secondary sexual characters and the genetics of variation in sex comb tooth number in Drosophila subobscura, Ecology and Evolution, 10.1002/ece3.2634, 7, 2, (533-540), (2016).
- M. L. Fritz, S. Paa, J. Baltzegar, F. Gould, Application of a dense genetic map for assessment of genomic responses to selection and inbreeding in Heliothis virescens, Insect Molecular Biology, 10.1111/imb.12234, 25, 4, (385-400), (2016).
- S. Liu, Y. Li, Z. Qin, X. Geng, L. Bao, L. Kaltenboeck, H. Kucuktas, R. Dunham, Z. Liu, High‐density interspecific genetic linkage mapping provides insights into genomic incompatibility between channel catfish and blue catfish, Animal Genetics, 10.1111/age.12372, 47, 1, (81-90), (2015).
- C. Plomion, J. Bartholomé, I. Lesur, C. Boury, I. Rodríguez‐Quilón, H. Lagraulet, F. Ehrenmann, L. Bouffier, J. M. Gion, D. Grivet, M. Miguel, N. María, M. T. Cervera, F. Bagnoli, F. Isik, G. G. Vendramin, S. C. González‐Martínez, High‐density SNP assay development for genetic analysis in maritime pine (inus pinaster), Molecular Ecology Resources, 10.1111/1755-0998.12464, 16, 2, (574-587), (2015).
- Jason D. Fiedler, Christina Lanzatella, Miki Okada, Jerry Jenkins, Jeremy Schmutz, Christian M. Tobias, High‐Density Single Nucleotide Polymorphism Linkage Maps of Lowland Switchgrass using Genotyping‐by‐Sequencing, The Plant Genome, 10.3835/plantgenome2014.10.0065, 8, 2, (1-14), (2015).
- Isabel A. Blanco, Joshua Hegarty, L. W. Gallagher, B. W. Falk, G. Brown‐Guedira, E. Pellerin, J. Dubcovsky, Mapping of QTL for Tolerance to Cereal Yellow Dwarf Virus in Two‐rowed Spring Barley, Crop Science, 10.2135/cropsci2013.11.0781, 54, 4, (1468-1475), (2014).
- G.S. Pereira, E.S. Nunes, L.D.C. Laperuta, M.F. Braga, H.A. Penha, A.L. Diniz, C.F. Munhoz, R. Gazaffi, A.A.F. Garcia, M.L.C. Vieira, Molecular polymorphism and linkage analysis in sweet passion fruit, an outcrossing species, Annals of Applied Biology, 10.1111/aab.12028, 162, 3, (347-361), (2013).
- S. R. Mcdermott, M. A. F. Noor, Mapping of within‐species segregation distortion in rosophila persimilis and hybrid sterility between . persimilis and . pseudoobscura, Journal of Evolutionary Biology, 10.1111/j.1420-9101.2012.02581.x, 25, 10, (2023-2032), (2012).
- P. R. Laborda, R. Gazaffi, A. A. F. Garcia, A. P. de Souza, A molecular linkage map for Drosophila mediopunctata confirms synteny with Drosophila melanogaster and suggests a region that controls the variation in the number of abdominal spots, Insect Molecular Biology, 10.1111/j.1365-2583.2011.01117.x, 21, 1, (89-95), (2011).




