Yam (Dioscorea spp.) is a monocotyledonous, an annual or perennial stem tuber belonging to the family Dioscoreaceae of flowering plants. Dioscorea has been described as the largest genus with an estimated 600 species, 10 of which are cultivated and of economic importance    . It is the second most important crop after cassava in West Africa    . Important and cultivatable species of this vital crop include D. cayenensis Lam., D. alata L., D. rotundata Poir., D. trifida L. f., D. bulbifera L., D. pentaphylla L., D. opposita Thunb., D. transversa R. Br., D. nummularia Lam. and D. esculenta (Lour.) Burkill.  . Within Africa, the common species cultivated include D. rotundata (white yam), D. alata (water yam) and D. cayenensis (yellow yam), some of which have been reported to possess medicinal and ornamental values  .
The crop ranks fourth after potato, sweet potato and cassava as the most important food tuber crop in the world  . Yam is important in the economic and social life of people in West Africa   . As a starchy food, it provides a major source of cheap caloric energy food for millions of people in the tropical and sub-tropical regions of the world particularly in West Africa, the Caribbean, parts of Asia, South and Central America and the Pacific   . Yam tubers are rich sources of energy, vitamin C, musin (glycoprotein), minerals (K, P, Ca, Mg, Fe, Cu, Co), phytosterols and steroidal saponins  . They are converted into different types of food products such as pounded yam, boiled yam, roasted yam, fried yam slices, yam balls, mashed yams, yam chips, and yam flakes  . Fresh yam tubers are also peeled, chipped, dried, and milled into flour that is used to prepare dough called “amala” or “telibowo”  .
Yams are widespread in the tropics and subtropics. Nigeria is the leading producer of yam with 71% of the world production    . West Africa accounts for over 92% of the world’s production (54.2 million tonnes)  . In Ghana and Nigeria, 26.2% and 31.8% of people, respectively rely on yam species for income generation and food security  . Despite the increasing demand for local consumption and export of yams, there has been a marginal decline in its production due to lack of proper identification of unique species for biodiversity diversification for resistance to drastic changes in climate, introgression, cross hybridization and conservation processes to reduce genetic erosion  -  . To have adequate knowledge of these yam accessions, characterization to the species level, genetic richness and assessment of phylogenetic diversity (PD) are of utmost importance following the genetic resource preservation roles of PD in crop extinction  , functionality in ecosystems   and abiotic variability  and these can be achievable with accurate, sensitive and reliable methods.
Morphotaxonomy, the use of morphological characters to identify and classify plants, is currently the most widely used in yams in Nigeria. It entails using traits such as size, form and number of tubers per plant, bulbil formation, presence of spines on the stem, twining direction, fruit shape, and aerial bulbils, which could lead to misidentification of yam species    . Further, morphotaxonomy-based method requires cumbersome assessment of whole plants and the importance of this approach declines when specimens/tissue materials are utilized  . Use of molecular markers has become significant for accurate identification of these yams to the species level and to harness the genetic diversity inherent in them. Different markers including Restriction fragment length polymorphism (RFLP)  , Random amplified polymorphic DNA (RAPD)  , Simple sequence repeat (SSR)  , Inter-simple repeat (ISSR)  and Amplified fragment length polymorphism (AFLP)  and gene sequencing   have been applied in the characterization of yam species. The use of molecular tools to support morphotaxonomy-based identification is important to clear ambiguous species classification.
A DNA barcode facilitates taxonomic identification through the use of a standardized short genomic segment that is generally found in target lineages with adequate variations capable of discriminating living animals to the species level  . DNA barcoding techniques are useful tools in characterization as they allow more objective and rapid specimen identification, which can be cost-effective in providing a central catalog of species diversity. In general, DNA barcoding can improve biodiversity and genetic resource databases   . Also, a phylogenetic diversity (PD) method possesses the merits of ease of reconstruction of phylogenetic relationships of species and as such it has resultant potential to enlighten effective taxonomic challenges  . MatK and rbcL which are the two plant barcode loci have been chosen for phylogenetic studies of Dioscorea   . In this study, a barcoding marker of rbcL was used for identification and genetic characterization of Dioscorea accessions cultivated in southern Nigeria.
2. Materials and Methods
2.1. Sample Collection
Different yams were sampled from different locations across Eastern and Western Nigerian, including the ones in the germplasm collection at the International Institute of Tropical Agriculture (IITA), Ibadan, Nigeria (Table 1). A total of
Table 1. List of yam samples collected from different locations and used for DNA barcoding.
IITA = International Institute of Tropical Agriculture; LGA = Local Government Area.
eleven Local Government Areas (LGAs), cutting across three States including Oyo (where IITA, Ibadan is located), Enugu and Ebonyi States were used for the yam collection (Figure 1). The IITA, Ibadan, has Genetic Resources Unit that contains many yam species from other parts of Nigeria.
2.2. DNA Extraction
Fresh young leaves of yam species weighing from 0.1 - 0.2 g were collected for DNA extraction using Silica resin method standardized by the DNA Learning Center (http://www.dnabarcoding101.org/lab/protocol-2.html)  In brief, fresh young yam leaf samples were weighed and homogenized in 300 µL of lysis solution using sterile mortar and pestle followed by incubation in a heat block at 65˚C for 10 minutes. Next, samples were centrifuged in a balanced configuration at maximum speed (13,000 rev/min) for 1 min to pellet debris. A 150 μL sample of the supernatant was transferred to fresh micro centrifuge tubes, being careful not to disturb the debris pellet. A 3 μL silica resin, was subsequently added to the respective supernatants, mixed well by pipetting up and down, and placed for 5 minutes in a heat block at 57˚C. The silica resin is a DNA binding matrix which in the presence of lysis solution binds readily to nucleic acids. After incubation, tubes were subject to centrifugation, with cap hinges pointing outward, for 30 seconds at maximum speed to pellet the silica resin, which was now bound to nucleic acid. Using a micropipette with fresh tip the supernatant was removed
Figure 1. Map of Nigeria showing geographical areas for collection of yam accessions.
and 500 μL of ice cold wash buffer added to the pellet. The silica resin bound to nucleic acid was re-suspended by vortexing and centrifuged to repeat the wash procedure. The wash buffer removes contaminants from the samples while nucleic acids remain bound to the resin. A dry spin step after wash was performed to remove any remnant drops of supernatant with a micropipette. Finally, 100 μL of distilled water was added to the silica resin, mixed well by vortexing and incubated at 57˚C for 5 minutes. Samples were then centrifuged for 30 seconds at maximum speed to pellet the resin. This time 90 μL of the supernatant was transferred to fresh tubes as the nucleic acids eluate from the resin. The eluted DNA was stored to proceed to PCR step.
2.3. Polymerase Chain Reaction, Agarose Gel Electrophoresis and DNA Sequencing
PCR amplification was performed using Ready-To-Go PCR beads in a total volume of 25 µL: 2 µL of ~100 ng DNA and 23 µL of primer/loading dye mix for plant cocktail with rbcL primers (rbcLaf: 5'-TGTAAAACGACGGCCAGTATGTCACCACAAACAGAGACTAAAGC-3' and rbcLa-revM13: 5'-CAGGAAACAGCTATGACGTAAAATCAAGTCCACCRCG-3'). The PCR tubes were placed in a thermal cycler that had been programmed with the appropriate PCR protocol with initial step at 94˚C for 1 min., 35 cycles of 94˚C for 15 sec, 54˚C for 15 sec, and 72˚C for 30 sec., and 8 min final extension at 72˚C was maintained. The PCR products or amplicons were electrophoresed in a 1.5% agarose gel containing 0.5 mg/ml ethidium bromide and photographed on Transilluminator UV light (Omega G). The generated PCR amplicons sent to Genewiz LLC, New Jersey, USA, for DNA sequencing.
2.4. Data Analysis
The sequencing results generated from the Applied Biosystems Genetic automated sequencer (ABI Prism 3130X1, Froster City, CA 94404, USA) at Genewiz LLC were uploaded in the blue line of DNA Subway (https://dnasubway.cyverse.org/), which is an intuitive interface for analysing DNA barcodes. Using the Blue Line, the assembled sequences were end-trimmed, paired in their respective forward and reverse sequences to build consensus sequences. The consensus sequences from DNA subway were further edited, filtered and assembled for polymorphism detection using BioEdit software (BioEdit sequence aligner editor, version 22.214.171.124). Sequence alignment and percentage similarity searches were compared with GeneBank databases using NCBI web-based site, BLAST. Multiple alignments were done using the ClustalW   . Phylogenetic tree reconstruction was performed using MEGA 6 software  . Phylogenies were constructed using the Maximum Parsimony and Maximum Likelihood options   and the effectiveness of the trees was determined by bootstrapping up to 1000 replicates  .
3.1. Sequence Alignment of Sequences Generated from Dioscorea Spp. Using rbcL Barcoding Marker
A total length of sequence alignment, conserved sites, and variable sites of 525, 534 and 7 were respectively identified among the sequenced yam species. Different regions of polymorphisms and conserved regions at nucleotide level across the sequences exhibited variations among them. At a position of 335, 62_3LeavedYam_Ono and 76_Ona_TDd possessed a transversional mutation by having G nucleotide, while other samples had a T nucleotide (Additional file 1: Figure S1). At a consensus position of 362, yam species such as 3_TDa3050, 4_TDb3050, 5_TDb3044, 6_TDb2857, 7_TDb3058, 8_TDb3690, 61_6-EDO, 83_WaterYam-_Mbuna and 85_AerialYam_Edugbe showed a transitional mutation of A nucleotide, while the rest of the accessions had a G nucleotide. At a position of 368, accessions such as 1_TDa85.00250, 3_TDa3050, 4_TDb3050, 5_TDb3044, 6_TDb2857, 7_TDb3058, 8_TDb3690, 61_B-6-EDO, 83_WaterYam-_Mbuna, 85_AerialYam_Edugbe had a transitional mutation of A in place of G nucleotide possessed by other accessions at the same consensus position.. Also at 371 position, accessions including 35_Pepa, 36_Ke-emi, 37_Ame, 38_TDr.89.002665, 39_AlataTda98.01176, 40_TDa00.00.94, 42_OgojaVariety.1, 43_Gbangu_Variety.1, 44_ObioturuguVariety, 45_AmolaVariety.1, 46_OginiVariety, 47_Damieha, 48_Aloshivariety.1, 57_2-WhiteYam_Iyo, 61_6-EDO, 68_9ENEGBE, 78_Obella, 80_UtekpeVariety_2, 81_WhiteYam-Nwoopoko, 82_Yellowyam_Akpukpu, 83_WaterYam-Mbuna, 85_AerialYam_Edugbe, 89_WhiteYam_Nwoopoko and 93_YellowYam_TDes exhibited a transitional mutation by possessing a C nucleotide, while the remaining species had a T nucleotide. Also at a position of 391, 76_Ona_TDd possessed C, while other remaining yam species had T nucleotide.
3.2. Phylogenetic Tree Reconstruction (PTR) and Phylogenetic Diversity (PD)
Out of the 75 nucleotide sequences used for the analyses, a total of 270 codon positions including 1st, 2nd, 3rd, and non-coding regions as well as 4.3582% invariable (monomorphic) sites were found in the final dataset. From the phylogenetic tree analysis, the yam accessions were resolved into ten groups with variable phylogenetic diversities (PDs) (Figure 2). Group I with PD in the range of 0-27 consisted of twenty five accessions including 43_Gbangu_variety, 82_Yellowyam-Akpukpu, 81_Whiteyam-Nwopoko, 89_WhiteYam-Nwopoko, 24_TDm3052, 23_TDm3053, 20_TDc03-5, 19_TDc2792, 80_Utekpevariety, 17_TDc2813, 21_TDc04-71-2, 93_Yellowyam-TDes, 18_TDc2796, 68_9ENEGBE, 25_TDm3055, 15_TDc0471-2, 46_Oginivariety, 57_2-Whiteyam-Iyo, 45_Amolavariety, 40_TDa00.00.94, 38_TDr89.002665, 16_TDc0497-4, 78_Obella, 37_Ame and 35_Pepa grouping with D. rotundata obtained from NCBI data with a reference sequence accession of KR072483. The yam accessions were
Figure 2. Phylogenetic tree of different yam species as revealed by rbCL barcoding marker.
collected from different locations including Enugu, Ebonyi and International Institute of Tropical Agriculture (IITA), Ibadan, Nigeria. Group II (with PD of 2-49) contained four accessions such as 47_Damieha, 48_Aloshivariety, 39_Alata TDa98-01176, 44_Obioturuguvariety grouped together KJ629251-D. abyssinica, KJ629254-D. cayenensis and KJ629260-D. praehensillis. Group III (with PD of 1) contained only 42_Ogojavariety.
Groups IV (PD = 6) and V (PD = 31) had 36_Ke-emi and 22_TDm2938, respectively. Group VI (PD = 20 - 79) consisted fourteen including 59_10-Whiteyam-Nwopoko-Adaka, 90_YellowYam-Oku, 33_TDaNwopoko, 41_TDa00.00600, 71_D1WaterYam-Nbana2, 1_TDa85.00250, 73_WaterYam-Mbala, 72_1WaterYam-_Nbana, 87_WaterYam-Mbana, 60_D1WaterYam-Nbana 1, 92_ChineseYam-TDes, 51_Alata2, 34_AdakavarietyIITA, and 65_WaterYam-Nbana that grouped together with D. alata retrieved from NCBI database with an accession number of HQ637868. Group VII with PD value in the range of 18-86, had nine accessions including 6_TDb2857, 4_TDb3050, 5_TDb3044, 83_WaterYam-_Mbana, 85_AerialYam- Edugbe, 8_TDb3690, 61_6-Edo, 3_TDa3050 and TDb3058 grouped together with a known D. bulbifera species (with an accession No: KR072458) that was retrieved from NCBI database. Yam accessions 28_TDes3033, 30_TDes3030, 31_TDesculenta, 27_TDes3035 and 29_TDes3027 were in the same group VIII (PD = 17 - 51) identified as D. esculenta using a reference of D. esculenta (KR072467) obtained from the NCBI database. In group IX (PD = 2 - 60), 86_3leavedYam-Ona, 91_TrifoliateYam-TDd, 53_Ighu, 52_Ighu-Dumenturum, 9_TDd3101, 12_TDd08-38-53, 14_TDd3100, 49_IghuUna, 84_BitterYam-Iwu-obe, 11_TDd3935, 13_TDd-yellow, 10_TDd3829 and 54_Ighu-Una-2 were found grouping with D. hispicia (HQ637815), D. dregeana (JQ025042) and D. dumenturum (JF705531). Group X with PD of 88 had only 62_3leavedYam-Ono and 76_Ona-TDd, while outgroups (PD = 89 - 100) contained two Ipomoea triloba (trilobed (white potatoes), Colocasia esculenta (taro) (cocoyam) and Coccinia quinqueloba (96_unknown_sample) grouped together with Solanum vermiculata and S. lycopersicum with NCBI accession numbers KR057204 and KM008705, respectively.
3.3. Genetic Diversity Analysis
The analysis involved 75 nucleotide sequences between different groups. The highest inter-group genetic distance calculated based on K2P was 5.0560 ± 2.5760, while the lowest was 0.5000 ± 0.4770 (Table 2). The increment in genetic diversity started from the group combinations in ascending order: 0.5000 ± 0.4770 (groups, gps: I and II, I and III, I and IV, I and V) < 0.6700 ± 0.5500 (gps: II and VI, III and VI, V and VI) < 0.7510 ± 0.4240 (gps: II and IX) < 0.8100 ± 0.5500 (gps: III and IX, V and IX) < 0.8820 ± 0.5550 (gps: I and VI) < 0.9210 ± 0.4970 (gps: I and IX) < 1.0090 ± 0.9870 (gps: IV and VI) < 1.1390 ± 0.9170 (gps: IV and IX) < 1.2540 ± 0.6540 (gps: II and VIII) < 1.2770 ± 0.6800 (gps: III and VIII) < 1.3810 ± 0.8090 (gps: IV and VIII) < 1.5090 ± 1.4360 (gps: VII and VIII) < 1.5350 ± 0.8200 (gps: I and VIII) < 1.5820 ± 0.8660 (gps: VI and VIII) < 1.9060
Table 2. Genetic distances based on Kimura 2-parameter (K2P) between different groups of yam species.
Group I = 43_Gbangu_variety, 82_Yellowyam-Akpukpu, 81_Whiteyam-Nwopoko, 89_Whiteyam-Nwopoko, 24_TDm3052, 23_TDm3053, 20_TDc03-5, 19_TDc2792, 80_Utekpevariety, 17_TDc2813, 21_TDc04-71-2, 93_Yellowyam-TDes, 18_TDc2796, 68_9ENEGBE, 25_TDm3055, 15_TDc0471-2, 46_Oginivariety, 57_2-Whiteyam-Iyo, 45_Amolavariety, 40_TDa00.00.94, 38_TDr89.002665, 16_TDc0497-4, 78_Obella, 37_Ame and 35_Pepa; Group II = 47_Damieha, 48_Aloshivariety, 39_Alata TDa98-01176, 44_Obioturuguvariety; Group III = 42_Ogojavariety; Group IV = 36_Ke-emi; Group V = 22_TDm2938; Group VI = 59_10-Whiteyam-Nwopoko-Adaka, 90_YellowYam-Oku, 33_TDaNwopoko, 41_TDa00.00600, 71_D1WaterYam-Nbana2, 1_TDa85.00250, 73_WaterYam-Mbala, 72_1WaterYam-_Nbana, 87_WaterYam-Mbana, 60_D1WaterYam-Nbana 1, 92_ChineseYam-TDes, 51_Alata2, 34_AdakavarietyIITA, and 65_YaterYam-Nbana; Group VII = 6_TDb2857, 4_TDb3050, 5_TDb3044, 83_WaterYam-_Mbana, 85_AerialYam-Edugbe, 8_TDb3690, 61_6-Edo, 3_TDa3050 and TDb3058; Group VIII = 28_TDes3033, 30_TDes3030, 31_TDesculenta, 27_TDes3035 and 29_TDes3027; Group IX = 86_3leavedYam-Ona, 91_TrifoliateYam-TDd, 53_Ighu, 52_Ighu-Dumenturum, 9_TDd3101, 12_TDd08-38-53, 14_TDd3100, 49_IghuUna, 84_BitterYam-Iwu-obe, 11_TDd3935, 13_TDd-yellow, 10_TDd3829 and 54_Ighu-Una-2; and Group X = 62_3leavedYam-Ono and 76_Ona-TDd, N/C = Not computable.
± 0.8140 (gps: VI and IX) < 1.6640 ± 1.4910 (gps: VIII and X) < 1.7460 ± 1.4790 (gps: 2.2760 ± 1.7920 (gps: VI and X) < 2.4560 ± 1.4580 (gps: VIII and IX) < 2.7180 ± 2.0760 (gps: I and X) < 2.7900 ± 2.1300 (gps: II and X, III and X, IV and X, V and X) < 3.0200 ± 1.7000 (gps: I and VII, II and VII, III and VII, IV and VII, V and VII, VII and IX) < 3.1070 ± 2.390 (gps: VII and X) < 5.0560 ± 2.5760 (gps: VI and VII). The intra-group genetic distance ranged from 0.5250 ± 0.5000 - 2.0103 ± 1.2579 and some intra-group genetic distances were not computable which were denoted by n/c (Table 3). Groups I, VI and X were found computable with their respective values of 0.5250 ± 0.5000, 0.5616 ± 0.4788, and 2.0103 ± 1.2579. The mean genetic diversity within entire population was 0.7970 ± 0.06910, while the transitional to transversional distances per site from mean interpopulational diversity calculations was 2.1478 × 108 ± 4.5300. Also, the coefficient of differentiation of transitional to transversional distances per site was 1.1947 × 108 ± 6.9419 × 107.
3.4. BLAST Analysis of the Sequences Generated from the Yam Accessions Using rbcL Barcoding Gene
The output of the BLAST computations of the grouped sequences produced significant hits and some of the previously unknown sequences were fully identified (Table 4). The analysis identified ten putative species of yams including Dioscorea alata, D. bulbifera, D. cayenensis, D. rotundata, D. wallichii, D. aspersa,
Table 3. Genetic distances based on Kimura 2-parameter (K2P) within different groups of yam species.
Group I = 43_Gbangu_variety, 82_Yellowyam-Akpukpu, 81_Whiteyam-Nwopoko, 89_Whiteyam-Nwopoko, 24_TDm3052, 23_TDm3053, 20_TDc03-5, 19_TDc2792, 80_Utekpevariety, 17_TDc2813, 21_TDc04-71-2, 93_Yellowyam-TDes, 18_TDc2796, 68_9ENEGBE, 25_TDm3055, 15_TDc0471-2, 46_Oginivariety, 57_2-Whiteyam-Iyo, 45_Amolavariety, 40_TDa00.00.94, 38_TDr89.002665, 16_TDc0497-4, 78_Obella, 37_Ame and 35_Pepa; Group II = 47_Damieha, 48_Aloshivariety, 39_Alata TDa98-01176, 44_Obioturuguvariety; Group III = 42_Ogojavariety; Group IV = 36_Ke-emi; Group V = 22_TDm2938; Group VI = 59_10-Whiteyam-Nwopoko-Adaka, 90_YellowYam-Oku, 33_TDaNwopoko, 41_TDa00.00600, 71_D1WaterYam-Nbana2, 1_TDa85.00250, 73_WaterYam-Mbala, 72_1WaterYam-_Nbana, 87_WaterYam-Mbana, 60_D1WaterYam-Nbana 1, 92_ChineseYam-TDes, 51_Alata2, 34_AdakavarietyIITA, and 65_YaterYam-Nbana; Group VII = 6_TDb2857, 4_TDb3050, 5_TDb3044, 83_WaterYam-_Mbana, 85_AerialYam-Edugbe, 8_TDb3690, 61_6-Edo, 3_TDa3050 and TDb3058; Group VIII = 28_TDes3033, 30_TDes3030, 31_TDesculenta, 27_TDes3035 and 29_TDes3027; Group IX = 86_3leavedYam-Ona, 91_TrifoliateYam-TDd, 53_Ighu, 52_Ighu-Dumenturum, 9_TDd3101, 12_TDd08-38-53, 14_TDd3100, 49_IghuUna, 84_BitterYam-Iwu-obe, 11_TDd3935, 13_TDd-yellow, 10_TDd3829 and 54_Ighu-Una-2; and Group X = 62_3leavedYam-Ono and 76_Ona-TDd.
Table 4. BLAST outputs of total score, query coverage, e-value, percentage identity and accession number obtained from different yam accessions.
D. trifida, D. dregeana, and D. mangenotiana. The total bit score obtained in all ranged from 411 - 1011. The query coverage spanned between 99 and 100%, while the expected values (e-values) were 9e-111 or less. The percentage sequence identity ranged from 97% - 100%. Some accessions with acronyms including TDa, TDc and TDm denoting D. alata, D. cayenensis and D. manganotiana were found to be D. bulbifera, D. rotundata or cayenensis, respectively. Some of the sequences had NCBI hits ranging from two to four sequences with synonymous values of total bit score, query coverage, e-value, percentage identity but different accession numbers. For instance, 16_TDc04-97-4, 22_TDm2938, 35_Pepa, 36_Ke-emi, 37_Ame, 38_TDr.89.002665 and many others in this category had hits of D. cayenensis and D. rotundata. For accessions of 19_TDc2792, 25_TDm3055, 46_OginiVariety and 80_UtekpeVariety_2 had D. wallichii and D. rotundata as their hits with similar values in all the BLAST indices. Also, three species of yam including D. praehensilis, D. cayenensis and D. rotundata were obtained with a yam accession of 43_Gbangu_Variety.1 in the process of BLAST analysis, while 62_3LeavedYam-Ono produced D. aspersa, D. petelotii and D. daunea that had same values of total bit score, query coverage, e-value, percentage identity but different accession numbers. The yam accession, 59_D10 White-Nwopoko-Adaka, had five different NCBI hits of D. spicata, D. intermedia, D. wallichii, D. rotundata and D. oppositifolia with three having similar accession number, while the remaining two had a separate accession number as revealed by BLAST analysis.
DNA barcoding has become an effective method for species discrimination of flowering plants in the Polygonaceae   and Fabaceae families  , and other land plant species     . While mitochondrial cytochrome oxidase 1 (CO1) has proven a standardized animal DNA barcoding for necessary discrimination, no single barcode sequence works across all plants  . In the present work, the candidate barcoding marker, rbcL satisfied the DNA barcoding process, regarding the ease of amplification and sequencing Hollingsworth et al.  . However, this barcoding marker, rbcL, was not able to achieve the basic quality of discriminating different yam species in this study. Sequence alignment showed low degree of polymorphisms among the sequences. This study of genetic diversity in yam accessions is also dependent on the nucleotide variations occurring within the genome that are informative for the identification of different species. The discriminatory level of the rbcL marker has been linked to other researches, which contradict its potential for use as a universal DNA barcode for plants    . This low resolution of different accessions of yams into their respective species level could be attributed to the poor efficiency of rbcL marker when not jointly applied with other plastid markers. It has been reported that the joint application of rbcL+matK as a marker of choice in species resolution was based on clear recovery of the region of rbcL and discriminatory efficiency of fast evolving coding region of matK  .
In this study, 525 bp distinct total lengths of sequence alignment, 534 conserved sites, and variable sites of 7 were identified in the sequenced yam species. The alignment of 525 bp out of the total lengths of 568 bp, followed by the existence of similar regions (conserved sites) and low points of variations (variable areas) among the sequences demonstrate the low level of informativeness of rbcL in DNA barcoding of yam species. These findings are not in complete agreement with a previous report on yam species  , where 568 bp, 538, and 30 as total lengths of sequence alignment, conserved sites and variable sites were identified among accessions. Also, the sequence alignment length, conserved sites excluding the variable sites detected in this work correlate with the findings of Sun et al.  in which 553 bp, 522 bp and 31 of alignment length, conserved sites and variable sites were found among the accessions of Dioscorea species. The difference in the variable sites could have emanated from the number of samples studied.
Phylogenetic reconstruction of the generated Dioscorea species using rbcL marker resolved them into ten groups and this indicates different existing isolated groups inherent in the accessions. The existence of these different accessions among the collections could be attributable to lack of exchange of yam tubers by farmers among villages thereby resulting in a stronger heterozygosity among species compared to wild ones as reported by Ngo Ngwe et al.  . A contribution of evolutionary biology regarding conservation is the knowledge of diverse phylogenetic diversities, defined by the sums of branch lengths of the evolutionary trees connecting a set of taxa or individuals  . In this present work, group X had the highest PD value of 88, followed by groups VII, VI, IX, and VIII with their respective PDs of 86, 79, 60 and 51. The highest PD was identified in a group containing wild species of D. aspersa and this is in agreement with a previous report though in a different wild species wild called D. praehensilis  . When compared with other unrelated crops, the highest was observed in Cocoyam and other crops which were deliberately included to access the accuracy of this marker. The group with the lowest PD value D. rotundata clustered with other species and they were collected from a given single region. In this way, a given set of taxa will have a greater PD if they are widely spread out on a phylogenetic tree. Lack of or total loss of PD is generally assumed as a declining signal in the degree of biodiversity  . Furthermore, PD is associated with functional diversity since it is a measure of features also due to the fact that evolutionarily distant species are more likely to possess variable molecular functions in an ecosystem    . Also in group I, some accessions including 45_Amolavariety, 40_TDa00.00.94, 38_TDr89.002665, 16_TDc0497.4, 78_Obella, 37_Ame, and 35_Pepa had a PD value of 0 and this could be attributed to lack of sequence divergence. It could also be attributable to occurrence of common ancestral sequence homology  or poor resolving power of the rbcL DNA barcoding marker in yams  . Most of the accessions were accurately grouped according their species. For instance in group VIII, all the D. esculenta species was grouped with a known reference sequence from NCBI database. Also, group VII had all the accessions classified as D. bulbifera thereby identifying correctly an accession, 3_TDa3050, which was regarded as D. alata. A particular yam species was given different names as Ighu or Una (Ona) at a village in Enugu State but it was found to be just one species called D. dumetorum through the use of rbcL thereby resolving the issue of multiple names for the plant. Group IX had three reference sequences as D. dregeana, D. hispida and D. dumetorum but most of the accessions in the group are D. dumetorum and this could be as a result of their genetic relatedness  . However, accessions in group I of the trees were not correctly resolved following the existence of different yam species in various distinct subclades and non-grouping of any of the retrieved yam sequences from NCBI database. This may possibly be linked to sample contamination or a deficiency on the part of the rbcL resolution.
The identified genetic distances (0.5000 ± 0.4770 - 5.0560 ± 2.5760) based on K2P model regarding the inter-groups were in agreement with the previous works of other researchers in yams   and in authentication of native plants  . High genetic diversity indices were obtained from between group calculations, producing 5.0560 ± 2.5760 with the highest in two combined groups (groups VI and VII) and this demonstrates higher interspecific diversity than intraspecific one within the yam accessions as obtained in an earlier report involving ornamental plants with interspecific value of 3.080  . Assessment of genetic diversity within the groups (intra-group genetic diversity) could not be computed in most of the groups except three groups (groups I, VI and X), where group I had the lowest value of 0.5250 ± 0.5000, while X had the highest value of 2.0103 ± 1.2579. These values are higher than the ones obtained by Sun et al.  . The mean genetic diversity within entire population was 0.7970 ± 0.06910 and this is higher than the one (0.00266 ± 0.0044) obtained by Sun et al.  .
BLAST hits obtained in this study showed some degrees of similarity matches to the ones already annotated and deposited in NCBI database and some were not purely specific. The percentage sequence identity ranged from 97% - 100%, demonstrating low efficiency of this tool in identification of unknowns in yam species. However, some of the yams sampled from different regions were differently identified from what they were previously known to be using this method, indicating the potential of rbcL barcoding marker to resolve misclassification encountered via morphotaxonomy based approach despite the low discriminatory power. For instance, yam accessions with acronyms including TDa, TDc and TDm denoting D. alata, D. cayenensis and D. manganotiana were found to be D. bulbifera, D. rotundata or cayenensis, respectively. Furthermore, 62_3LeavedYam-Ono and 76_Ona_TDd sequences were correctly identified as D. aspersa. In the community where the two species (D. aspersa and D. dumetorum) were collected, they were misclassified by the villagers who generally called them D. dumetorum due to their similar morphological features. According to the villagers, the ones in group X which were later identified as D. aspersa are normally boiled and eaten directly, while the other ones (D. dumetorum, which had similar values of NCBI hits with D. hispida) are usually boiled, processed to remove bitterness in them before they are consumed. The discriminatory level of the rbcL marker in plants as a potential universal DNA barcode is demonstrated in this study as reported in other researches   . However, some of the yam sequences had two, three or five NCBI hits of different species of yams with synonymous values of total bit score, query coverage, -value and percentage identity with different accession numbers except in one that had five BLAST outputs with three having similar accession numbers and two with different accession numbers. For instance, the yam accession, 59_D-10-White-Nwopoko-Adaka, had five different NCBI hits of D. spicata, D. intermedia, D. wallichii, D. rotundata and D. oppositifolia with three (D. wallichii, D. rotundata, D. oppositifolia) having similar accession number (KY679569), while the remaining two (D. spicata and D. intermedia) had separate accession numbers of KY457460 and KY457459, respectively, after the BLAST analysis. Also, sequences generated from accessions 45_Amolavariety, 40_TDa00.00.94, 38_TDr89.002665, 16_TDc0497.4, 78_Obella, 37_Ame, and 35_Pepa hit two (D. cayenensis and D. rotundata) sequences with similar values of query coverage, e-value and percentage identity, while total bit score ranged from 1000-1005. This is possible due to existence of common ancestral homology as opined by Pearson  or due to redundancy, which in bioinformatics is observed when one or more homologous or synonymous sequences are found in the same set of data  . It could also be attributable to the low discriminatory potency of rbcL marker to correctly resolve species as previously reported in yams  and ornamental plants  .
The candidate barcoding marker, rbcL, was found to be ambiguously discriminatory in DNA barcoding process of yam accessions. Some of the accessions were not correctly identified to the species level and low polymorphisms were detected and this further demonstrates the low distinguishing potency of rbcL barcoding marker. The use of phylogenetic diversity (PD), which is associated with functionality in biodiversity and which was applied in the computational processes for the estimation of phylogenetic groups with lowest and largest collections in terms of diversity was of great potential. The highest phylogenetic diversity was in D. aspersa, while some were not computable due to the low efficacy of the marker. The group with the lowest PD value, D. rotundata clustered with other indistinguishable species and they were collected from a given single region. The accessions with high PD within the yam accessions should be considered for use in breeding programme to enhance biodiversity of Dioscorea species within the studied region. However, the rbcL could not resolve the yam accessions well following some noted discrepancies in the detected number of species from phylogenetic groupings and NCBI BLAST hits possibly due to inefficiency of the marker. Therefore, the rbcL may not be a marker of choice for species identification, discrimination and estimation of genetic diversity of yam accessions. The marker should be used in combination with other chloroplast markers for accurate DNA barcoding of yams for their improvement and germplasm conservation.
The authors are grateful to International Institute of Tropical Agriculture (IITA), Ibadan for providing part of the accessions used in the study. We thank National Science Foundation (NSF) for the Targeted Infusion HBCU_UP funding that supported this undergraduate student’s research project. We are also grateful to Dr. Dave Micklos of the Cold Spring Harbor Laboratory, DNA Learning Centre, for the technical and research assistance offered to us.
National Science Foundation (NSF) for the Targeted Infusion HBCU_UP funding was received to conduct this study
Ethics Approval and Consent to Participate
Consent was obtained from farmers before using their individual farms for sample collection.
Consent for Publication
Availability of Data and Materials
All data generated during this study are included in this published article. Sequence data were deposited in NCBI GenBank with accession numbers ranging from MH078114 to MH078188 to match the individual yam accessions in the list of supplementary Table S1.
All authors were involved in project design. GNU, DOI, JM, OO, JH, DB, CA and OC did the literature search process, extracted data elements, and carried out study compilation. Data analyses were performed by DOI, MO, CE, VC, MU and CO and reviewed by GNU, GA, JO and AD. DOI developed the first draft of the manuscript. All authors read the manuscript and approved the final copy of it.
Supplementary File 1
Figure S1. Consensus sequence of rbcL gene for Dioscorea species and its associated consensus points of polymorphisms (variations). Note that the dotted line (…) in the sequence alignment indicates similarity of nucleotide to the nucleotide of TDa-85.00250 that serves as a reference sequence of TDa-85.00250.
Supplementary File 2
Table S1. List of sequenced yam species collected from different locations and their GenBank accession numbers.
IITA = International Institute of Tropical Agriculture; LGA = Local Government Authority.
 Wilkin, P., Schols, P., Chase, M.W., Chayamarit, K., Furness, C.A., Huysmans, S., Rakotonasolo, F., Smets, E., Thapyai, C. and Meerow, A.W. (2005) A Plastid Gene Phylogeny of the Yam Genus, Dioscorea: Roots, Fruits and Madagascar. Systematic Botany, 30, 736-749.
 Scarcelli, N., Tostain, S., Mariac, C., Agbangla, C., Ogoubi, D., Julien, B. and Pharm, J.L. (2006) Genetic Nature of Yams (Dioscorea sp.) Domesticated by Farmers in Benin (West Africa). Genetic Resource and Crop Evolution, 53, 121-130.
 Arnau, G., Abraham, K., Sheela, M. N., Chair, H., Sartie, A. and Asiedu, R. (2010) Yams. In: Bradshaw, J.E., Ed., Root and Tuber Crops, Springer, New York, 127-148.
 Nweke, F.I., Ugwu, B.O., Asadu, C.L.A. and Ay, P. (1991) Production Costs in the Yam-Based Cropping Systems of South-Western Nigeria. Resource and Crop Management Division. Research Monograph No 6, IITA Ibadan, 29 p.
 Zhou, Y., Zhou, C., Yao, H., Liu, Y. and Tu, R. (2008) Application of ISSR Markers in Detection of Genetic Variation among Chinese Yam (Dioscorea opposita Thunb) Cultivars. Life Science Journal, 5, 4.
 Djeri, B., Tchobo, P.F., Adjrah, Y., Karou, D.S., Ameyapoh, Y., Soumanou, M.M. and Souza, C. (2015) Nutritional Potential of Yam Chips (Dioscorea cayenensis and Dioscorea rotundata Poir) Obtained Using Two Methods of Production in Togo. African Journal of Food Science, 9, 278-284.
 Mahalakshmi, V., Ng, Q. and Atalobhor, K. (2007) Development of West African Yam Dioscorea spp. Core Collection. Genetic Resource and Crop Evolution, 54, 1817-1825.
 Kenyon, L., Lebas, B.S.M. and Seal, S.E. (2008) Yams (Dioscorea spp.) from the South Pacific Islands Contain Many Novel Badnaviruses: Implications for International Movement of Yam Germplasm. Archives of Virology, 153, 877-889.
 Mignouna, H.D., Dansi, A. and Zok, S. (2002) Morphological and Isozymic Diversity of the Cultivated Yams (Dioscorea cayenensis/Rotundata Complex) of Cameroun. Genetic Resource and Crop Evolution, 49, 21-29.
 Schwenk, K., Brede, N. and Streit, B. (2008) Introduction. Extent, Processes and Evolutionary Impact of Interspecific Hybridization in Animals. Philosophical Transactions of the Royal Society B, 363, 2805-2811.
 Paun, O., Forest, F., Fay, M.F. and Chase, M.W. (2009) Hybrid Speciation in Angiosperms: Parental Divergence Drives Ploidy. New Phytologist, 182, 507-518.
 Dansi, A., Orobiyi, A., Dansi, M., Assogba, P., Sanni, A. and Akpagana, K. (2013) Sélection de sites pour la conservation in situ des ignames sauvages apparentées aux ignames Cultivées: Cas de Dioscorea praehensilis Au Bénin. International Journal of Biological and Chemical Sciences, 7, 60-74.
 Ngo Ngwe, M.F.S., Omokolo, D.N. and Joly, S. (2015) Evolution and Phylogenetic Diversity of Yam Species (Dioscorea spp.): Implication for Conservation and Agricultural Practices. PLoS ONE, 10, e0145364.
 Winter, M., Schweiger, O., Klotz, S., Nentwig, W., Andriopoulos, P., Arianoutsou, M., Basnou, C., Delipetrou, P., Didziulis, V., Hejda, M., Hulm, P.E., Lambdon, P.W., Perglh, J., Pysek, P., Royl, D.B. and Kuhn, I. (2009) Plant Extinctions and Introductions Lead to Phylogenetic and Taxonomic Homogenization of the European Flora. Proceeding of National Academy of Science, 106, 21721-21725.
 Faith, D.P., Magallón, S., Hendry, A.P., Conti, E., Yahara, T. and Donoghue, M.J. (2010) Ecosystem Services: An Evolutionary Perspective on the Links between Biodiversity and Human Well-Being. Current Opinion in Environmental Sustainability, 2, 66-74.
 Srivastava, D.S. and Vellend, M. (2005) Biodiversity-Ecosystem Function Research: Is It Relevant to Conservation? Annual Review of Ecology and Evolution System, 36, 267-294.
 Tamiru, M., Becker, H.C. and Maass, B.L. (2011) Comparative Analysis of Morphological and Farmers Cognitive Diversity in Yam Landraces (Dioscorea spp.) from Sothern Ethiopia. Tropical Agriculture and Development, 55, 28-43.
 Terauchi, R., Chikaleke, V.A. and Thottappilly, G. (1992) Origin and Phylogeny of Guinea Yams as Revealed by RFLP Analysis of Chloroplast DNA and Nuclear Ribosomal DNA. Theoretical and Applied Genetics, 83, 743-751.
 Mignouna, H.D., Abang, M.M. and Fagbemi, S.A. (2003) A Comparative Assessment of Molecular Marker Assays (AFLP, RAPD and SSR) for White Yam (Dioscorea rotundata) Germplasm Characterization. Annals of Applied Biology, 142, 269-276.
 Saski, C.A., Bhattacharjee, R., Scheffler, B.E. and Asiedu, R. (2015) Genomic Resources for Water Yam (Dioscorea alata L.): Analyses of EST Sequences, de Novo Sequencing and GBS Libraries. PLoS ONE, 10, e0134031.
 Akakpo, R., Scarcelli, N., Chaïr, H., Dansi, A., Djedatin, G., Thuillet, A.-C., Rhoné, B., François, O., Alix, K. and Vigouroux, Y. (2017) Molecular Basis of African Yam Domestication: Analyses of Selection Point to Root Development, Starch Biosynthesis, and Photosynthesis Related Genes. BMC Genome, 18, 782.
 Kress, W.J. and Erickson, D.L. (2007) A Two-Locus Global DNA Barcode for Land Plants: The Coding rbcL Gene Complements the Non-Coding trnH-psbA Spacer Region. PLoS ONE, 2, e508.
 Asahina, H., Shinozaki, J., Masuda, K., Morimitsu, Y. and Satake, M. (2010) Identification of Medicinal Dendrobium Species by Phylogenetic Analyses Using Matk and rbcL Sequences. Journal of the National Medical Association, 64, 133-138.
 Gao, X., Zhu, Y.-P., Wu, B.-C., Zhao, Y.-M., Chen, J.-Q. and Hang, Y.Y. (2008) Phylogeny of Dioscorea Sect. Stenophora Based on Chloroplast matK, rbcL and trnL-F Sequences. Journal of Systematic Evolution, 46, 315-321.
 Bousalem, M., Durand, O., Scarcelli, N., Lebas, B.S.M., Kenyon, L., Marchandm, J.L., Lefort, F. and Seal, S.E. (2009) Dilemmas Caused by Endogenous Pararetroviruses Regarding the Taxonomy and Diagnosis of Yam (Dioscorea spp.) Badnaviruses: Analyses to Support Safe Germplasm Movement. Archives of Virology, 154, 297-314.
 Chen, S., Yao, H., Han, J., Liu, C., Song, J., Shi, L., Zhu, Y., Ma, X., Gao, T., Pang, X., Luo, K., Li, Y., Li, X., Jia, X., Lin, Y. and Leon, C. (2010) Validation of the ITS2 Region as a Novel DNA Barcode for Identifying Medicinal Plant Species. PLoS ONE, 5, e8613.
 Tamura, K., Peterson, D., Peterson, N., Stecher, G., Nei, M. and Kumar, S. (2013) MEGA6: Molecular Evolutionary Genetics Analysis Using Maximum Likelihood, Evolutionary Distance, and Maximum Parsimony Methods. Molecular Biology Evolution, 28, 2731-2739.
 Steel, M. and Penny, D. (2000) Parsimony, Likelihood, and the Role of Models in Molecular Phylogenetics. Molecular Biology Evolution, 17, 839-850.
 Kuck, P., Mayer, C., Wagele, J.W. and Misof, B. (2012) Long Branch Effects Distort Maximum Likelihood Phylogenies in Simulations Despite Selection of the Correct Model. PLoS ONE, 7, e36593.
 Soininen, E.M., Valentini, A., Coissac, E., Miquel, C., Gielly, L., Brochmann, C., Brysting, A.K., Sønstebø, J.H., Ims, R.A., Yoccoz, N.G. and Taberlet, P. (2009) Analysing Diet of Small Herbivores: The Efficiency of DNA Barcoding Coupled with High-Throughput Pyrosequencing for Deciphering the Composition of Complex Plant Mixtures. Frontier Zoology, 6, 16.
 Fazekas, A.J., Burgess, K.S., Kesanakurti, P.R., Graham, S.W., Newmaster, S.G., Husband, B.C., Percy, D.M., Hajibabaei, M. and Barrett, S.C. (2008) Multiple Multilocus DNA Barcodes from the Plastid Genome Discriminate Plant Species Equally Well. PLoS ONE, 3, e2802.
 Clement, W.L. and Donoghue, M.J. (2012) Barcoding Success as a Function of Phylogenetic Relatedness in Viburnum, a Clade of Woody Angiosperms. BMC Evolutionary Biology, 12, 73.
 Dong, W., Xu, C., Li, C., Sun, J., Zuo, Y., Shi, S., Cheng, T., Guo, J. and Zhou, S. (2015) ycf1, the Most Promising Plastid DNA Barcode of Land Plants. Scientific Reports, 5, Article No. 8348.
 Sun, X.Q., Zhu, Y.J., Guo, J.L., Peng, B., Bai, M.M. and Hang, Y.Y. (2012) DNA Barcoding the Dioscorea in China, a Vital Group in the Evolution of Monocotyledon: Use of matK Gene for Species Discrimination. PLoS ONE, 7, e32057.
 Faith, D.P. and Baker, A.M. (2006) Phylogenetic Diversity (PD) and Biology Conservation: Some Bioinformatics Challenges. Evolutionary Bioinformatics, 2, 70-77.
 Faith, D.P. (2016) The Phylogenetic Diversity Framework: Linking Evolutionary History to Feature Diversity for Biodiversity Conservation. Biodiversity Conservation and Phylogenetic Systematics, Topics in Biodiversity and Conservation, 14.
 Maloukh, L., Kumarappan, A., Jarrar, M., Salehi, J., El-waki, H. and Lakshmi, T.V.R. (2017) Discriminatory Power of rbcL Barcode Locus for Authentication of Some of United Arab Emirates (UAE) Native Plants. 3 Biotech, 7, 144.
 Elansary, O.H., Ashfaq, M., Ali, H.M. and Yessoufou, K. (2017) The First Initiative of DNA Barcoding of Ornamental Plants from Egypt and Potential Applications in Horticulture Industry. PLoS ONE, 12, e0172170.