PBG/MCB 621 – Class Exercise 3 Friday Nov 7th 2014 Today’s lab will build on the work from last week’s lab, using the Joinmap file with the groupings that you created from the data in the spreadsheet ‘DxB BOPA1 Genotypes.xlsx’. You will need to access this spreadsheet again during the lab but we have created a version that contains the edits made during Lab 2 and have also replaced the 4 heterozygotes with missing values in the sheet to speed up one of the operations. Briefly, the Joinmap file that you created last week should have 7 groupings that were created from the genotypic data for 156 barley DH lines produced from the F1 of a cross between Derkado and B-8312/21/5. All the genotypic data in the Joinmap project file should be scored as ‘a’, ‘b’ or ‘-‘. The object this week is to take one of the 7 linkage groups that we created last week and order the loci within it to obtain a map of one barley chromosome. We will see how errors in data coding can affect mapping and how they can be corrected and detected and will also explore some of the capabilities of Joinmap to detect irregularities in maps as well as comparing maps. Task 1. Move markers from one group to another. 1. Click on the Grouping Node in the LHS window. This should produce a list of all the loci ordered by the groups that you created last week. 2. Double click on the ‘Node’ header to produce a list in descending order of Node names. Note that the group number of all the loci on at least the first page is ‘0’, indicating that they do not belong to a group. 3. The top 3 loci should belong to a group called ‘ungrouped’ and these are three of the 5 loci that did not group with the rest at LOD 2.0. The other two are identical with two of the three so are in the ‘identical’ node. 4. Look at the ‘SCL-Group’ column and note which group the majority of the markers belong to (6 for 2/3), suggesting that these chromosome 2 markers might belong to Group 6. 5. Highlight the 3 loci and select ‘Move Selected Loci’ from the ‘Grouping’ menu. This will open a dialog box asking for the name of the group to move the loci to. Enter 6 and press return or ‘OK’ 6. This will reorder the list. Repeat step 2 and verify that the group number for the ungrouped loci is now ‘6’ QUESTION 1. Is the ‘SCL-Group’ for the 3 ungrouped markers the same after re-assigning them? If not, briefly suggest why it might have changed. No, two are now cross-linked to SCL Group 3 and one to Group 1. As we assigned all three to Group 6, those that were cross-linked to Group 6 now have to be assigned to a new Group as they can no longer be cross-linked to Group 6. Task 2. Producing a map using Maximum Likelihood ordering (the default option for Joinmap). 1. Click on Group 1 to select that group and press the ‘Calculate map’ icon (Seventh from the right and next to ‘Calculate’ icon). 2. This will begin map construction that should take maybe 2 minutes with this group. You should see a green bar at the bottom of the screen with some text at bottom LHS indicating progress. 3. This will create a ‘Mapping’ Node in the LHS window with a ‘Map’ node (highlighted in magenta) underneath. The main window should have a session log that reports mapping at various stages and whether the order from the first round could be improved or not. 4. Click on the ‘Map’ node to bring up a MapChart view of the order of the loci within the group. Note that the map is long with large distances between markers at one end. QUESTION 2. How many distances between adjacent marker pairs exceed 40cM and what markers are involved in each? Three, 11_10221 & 11_10793, 72.7cM; 11_10793 & 11_11292, 67.4 cM; 11_11066 & 11_21506, 96.2 cM Task 3. Attempt to produce a map using the Regression Mapping algorithm. 1. With the grouping node highlighted, click on the ‘Calculation options’ icon (10th icon from right and next to the hammer and spanner icon. 2. Click on the ‘Group’ tab in the dialog box that opens up. At the bottom left, there are two options in the ‘Mapping algorithm’ box. Select ‘Regression mapping’ by clicking the radio button to its left and then press ‘OK’. 3. Use the ‘Calculate map’ icon (see Task 2.1) to order the loci by the regression approach. 4. This should generate a ‘Mapping node’ but no ‘Map’ node. Check the session log and you should see that this algorithm detects a lack of linkage between two groups of markers Task 4. Trouble-shooting the genotypic data. 1. Click on the ‘Grouping node’ again. This should bring up a series of tabs that allow you to inspect the data. 2. Click on the ‘Suspect Linkages’ tab and press the ‘Calculate’ icon to populate the table. 3. Click on the ‘Nr1’ column header to sort the table by individual loci involved in the suspect linkages. 4. Note that these are all linkages with a recombination frequency of greater than 0.6 and some are very close to 1. This should not happen and suggests a change in phase of one of the markers in the pair. 5. Inspect the table and note that all the high values involve marker numbers 137, 221, 377 and 477 (markers 11_11066, 11_10221, 11_10793, and 11_11292 respectively). 6. Open the ‘DxB BOPA1 Genotypes.xlsx’ spreadsheet and locate the base calls for Excel row numbers 138, 222, 378 and 478 and reverse the parental base calls for each of these 4 markers. 7. This will flip the ‘a’ and ‘b’ calls for each marker and we will need to re-import the data. Task 5. Importing the corrected data and create a grouping from it. (Note: you could also preserve the original dataset and use ‘Create New Dataset’ and work from that). 1. Click on the ‘Dataset 1’ node to open up the table of markers and individuals. Click in the top LHS of the datasheet to highlight the whole table. 2. Copy the ‘ab-‘ scores from Excel and paste into the Joinmap datasheet. 3. Click ‘Create Population Node’ from the Datset menu which should open up another population node below all the other nodes. 4. Click on the new Population Node and produce a grouping from it by opening the ‘Groupings (tree) tab and clicking on the ‘Calculate’ icon. 5. Highlight the first group at node 5.0/4(72) by right clicking on it. Then click on ‘Create Groups Using the Groupings Tree’ item from the ‘Population’ menu to create a new Group 1. Task 6. Produce a map with the Regression Mapping algorithm. 1. Repeat Task 3.3. This should produce a ‘Mapping Node’ with a ‘Map’ node underneath this time. 2. Check the ‘Session Log’ Note that it reports ‘sufficient linkage to start the map’ 3. Click the ‘Map’ node (Note that it has a yellow background to show that it is from Regression Mapping). This shows a more reasonable map. QUESTION 3. What is the largest distance between adjacent marker pairs in this map? 32.9 cM between 11_20145 & 11_10221 QUESTION 4. Look at the Session Log for the Regression Mapping and note the names of the first two markers to be mapped. Take the recombination fraction as an estimate of the distance between them (it is much less than 10cM) and report the difference in distance between that value and the distance calculated from the difference in the final map positions. Mapping starts with 11_20361 and 11_10527 where r=0.0064 therefore d=0.64. From the map, d=0.669 so the difference is 0.029cM QUESTION 5. Which three markers give the highest chi-square contributions to the overall model? mlo, 11_20044, 11_10510, all with mean chi-square >0.4 Task 7. Produce a map with the Maximum Likelihood algorithm and compare to Regression Mapping 1. Repeat Tasks 3.1. and 3.2 to change back to Maximum Likelihood mapping and press the ‘Calculate Map’ icon. 2. This should produce a much shorter map than the ML mapping attempt. 3. Check that the orientation of the maps produced at Tasks 6.3 and 7.1 are the same and that market mlo appears closer to the bottom. Use the ‘Invert Map’ option from the ‘Map’ menu with the ‘Map Chart’ sheet open to make any necessary corrections. 4. Select the ‘Combine Maps’ option from the ‘Join’ menu to open the ‘Combine maps into a single chart’ dialog box and select the two ‘Map’ nodes created in Tasks 6.3 and 7.1. Press OK. 5. This produces a new ‘Map’ node combining both names with the maps side by side on the sheet. 6. Format the maps by clicking on the ‘Map chart options’ icon (9th from the right). Select the ‘Homol-1’ tab,check the ‘Show Homologs’ box and press ‘OK’. 7. This should produce dots at each locus represented in both maps (i.e. all in this case). 8. Change the colour of the dots and, if necessary, join them by opening the ‘Map chart options’ dialog box again, selecting the ‘Homol -2’ tab and changing the colour to 2 (red) QUESTION 6. What is the largest distance between adjacent marker pairs in this map and which markers are involved? 29.8 cM between 11_20145 and 11_10221. Task 8. Compare your maps to a Reference Map. 1. The text file ‘BOPA1 Map.map’ contains the BOPA1 markers listed on the reference map produced by Munoz-Amritriran et al. with a header identifying the chromosomes followed by a list of marker names and positions, separated by a space or tab, one on each line. 2. Click the ‘Load data into project’ icon (3rd from Left) or choose ‘Load Data’ from the file menu. 3. Browse to the directory containing your map file and it should be the only file available. Slect it and then click ‘Open’. 4. This will create another ‘Map’ node under your current population named after the file name. 5. Repeat Task 7.4 but click on all 3 nodes or the new node and the previous combined node. Ensure you select the Reference map node first. 6. This creates a combined map but the reference map is too dense to produce as one segment on the page at the default settings. 7. Reduce the size of the marker names by opening the ‘Map chart options’ again and selecting the ‘Loci’ tab. Click on ‘Locus Font’ to open the font dialog and change the size to 4. Press OK x 2. 8. This produces a graphic of the three maps side by side. Repeat Tasks 7.6 to 7.8 to compare the orders. Task 9. Checking your map. 1. Highlight the ‘Map’ node that you produced successfully by Regression Mapping. Select the ‘Locus Genot. Freq.’ tab and press the Calculate icon to generate a list of the frequencies of parental alleles and chi-square test for significant deviation from a 1:1 ratio in map order. 2. Click on the ‘Genotype Probabilities’ tab and press ‘Calculate’ to populate the table. This list triplets that are genotypically ‘aba’ or vice versa and tests whether or not they are significant. QUESTION 7. How many triplets involve the marker ‘mlo’? Briefly, why might these occur? 12/14 reported triplets involve mlo, either as the central or a flanking marker. mlo is a disease resistance locus that was characterized phenotypically and so some ‘resistant’ types might have been escapes whereas a slight amount of disease might have led to a genuine resistant type being classified as ‘susceptible. QUESTION 8. What are the three markers causing the greatest stress in the Maximum Likelihood map? Are any of them in common with those you identified in Question 5 and, if so, which ones? mlo, 11_10221 and 11_ 21122 cause the greatest stress. Only mlo is in common with the results from the mean chi-squares of the regression map. Congratulations. You have now constructed a robust linkage map. We will base the homework exercise around mapping the rest of the groups……..