Appendix N: The Grand Inversion 1 Appendix N: Grand Inversion Implementation and Exploration of Logic-Tree Branches Morgan Page (USGS), Ned Field (USGS), and Kevin Milner (USC) Introduction The purpose of the “grand inversion” is to solve for the long-term rate of all possible (“on-fault”) ruptures in California. The rates of these ruptures are constrained by fault slip rates, paleoseismic event rates observed in trenches, magnitude-frequency distributions (MFDs) observed in seismicity (and adjusted to account for background seismicity), as well as other a priori and smoothing constraints described below. The inversion methodology we use is based on a methodology proposed by Andrews and Schwerer (2000) and further developed by Field and Page (2011). Below we describe 1) the constraints used in the inversion, 2) the simulated annealing algorithm, 3) the rules used to define the set of ruptures on the system of faults, and 4) details of a suite of models produced by the inversion algorithm. 1. Setting up the Inversion: Data and Constraints The inversion methodology solves for the rates of ruptures that are consistent with the data and constraints described below. Some of these constraints differ depending on the inversion model branch; for example the CHAR (characteristic) branch solution is constrained to be as close to UCERF2 rates as possible, while the G-R (Gutenberg-Richter) model branch is designed to have a spatially-smooth, approximately Gutenberg-Richter magnitude distribution in space. Constraints used for all branches: 1) Slip Rates. The average slip in each rupture a section participates in, multiplied by the rate of each rupture, must sum to the long-term slip rate for that section, as given by the deformation model. This constraint is applied to each fault subsection. Each slip-rate constraint is normalized by the target slip-rate (so that misfit is proportional to the fractional difference, rather than the absolute difference, between the model slip rates and the target slip rates), with the exception of slip rates below 10-4 m/yr, which are normalized by 10-4 m/yr. This prevents some extremely low slip rates (some deformation models have slip rates as low as 10-13 m/yr) from dominating the misfit. The slip rates on fault subsections with zero or NaN deformation-model slip rates are minimized (with a weight of 10-4). The slip rates used in this constraint have been reduced from the slip rates specified in the deformation models to account for subseismogenic-thickness ruptures and aseismicity (more details on this are given in the main report). 2) Paleoseismic Event Rates. The total rate of all ruptures a section participates in, multiplied by the probability each rupture is paleoseismically visible (Appendix I), must sum to the mean paleoseismic event rate for that section. This constraint is applied to a total of 34 fault subsections that are in the paleoseismic database (see Appendix G). Appendix N: The Grand Inversion 2 Additional CHAR branch constraints: The CHAR (characteristic / close to UCERF2) branch is designed to have rates that are as close as possible to UCERF2 ruptures rates, while satisfying the slip-rate and paleoseismic event rates better and eliminating the magnitude “bulge” (overprediction relative to seismicity MFD) that existed in the UCERF2 model. 1) A priori Rupture Rates. For this branch, we apply an a priori rupture-rate constraint to force rupture rates to be close to UCERF2 rates. Of the solutions that come closest to satisfying the other constraints, we choose the one that is closest (L2 norm) to equivalent UCERF2 rates (the mapping of UCERF2 ruptures to the equivalent inversion ruptures is far from trivial; details are available upon request, but we are confident that conclusions here do not depend on any subjectivity in this mapping). This constraint is only applied to ruptures that have a mapping to a UCERF2 rupture, so it does not affect new faults that have been added to the fault database since UCERF2. 2) Magnitude Distribution. The on-fault target magnitude distribution is found by reducing the total magnitude distribution (for which we use the 1850-present magnitude distribution found from seismicity, see Appendix L) by the amount of off-fault moment in each deformation model. For the CHAR branch, this reduction of the total magnitude distribution is accomplished by reducing the b-value below magnitude 7.6 (which corresponds to the maximum “background” earthquake allowed in UCERF2 – in C-zones), as shown in Figure 1. For each branch, the magnitude-distribution constraint is applied as an equality constraint up to and including magnitude 7.85, and applied as an inequality constraint (with the target on-fault MFD serving as an upper bound) above M7.85. Thus the inversion is not constrained to exactly equal the target MFD at magnitudes for which the true magnitude-frequency distribution may “roll-off”. As it is not well known what the MFD looks like at high magnitudes, we allow the inversion to choose whatever “roll-off” at high magnitudes is consistent with the other constraints. For each branch, this constraint is applied for 2 regions: Northern California and Southern California. Appendix N: The Grand Inversion 3 Figure 1. Schematic of magnitude-frequency distributions for the CHAR (Characteristic) branch. The total target magnitude frequency distribution (found from seismicity, see Appendix L) is shown in black. The total on-fault MFD is shown in purple. It is found by reducing the total target MFD below the bilinear transition magnitude of 7.6 to account for off-fault moment (the moment-rate difference between the black and purple lines is set to be equal to the seismic off-fault moment for each deformation model). This MFD is further reduced to account for subseismogenic ruptures (more details on this are given in the main report) to give the cyan line, which is the MFD target for the inversion, and represents all on-fault ruptures that rupture the full seismogenic width. This MFD target is applied as an equality constraint below the MFD transition magnitude of 7.85 and an inequality (upper-bound) constraint above M7.85. Finally, the implied background MFD, which includes both off-fault and subseismogenic-thickness on-fault ruptures, is defined as the black minus the cyan line, and is shown in grey. 3) Minimum Rupture Rates (waterlevel). To avoid assigning zero rates to ruptures, a minimum “waterlevel” is assigned to each rupture rate. The original inverse problem Ax=d with x ≥ xmin, where xmin are the minimum rupture rates, is mapped to a new inverse problem with simple nonnegativity constraint, Ax=d, x ≥ 0. The “new” data vector is given by d=d - Axmin and the rupture-rate mapping is given by x+xmin=x. The nonnegativity constraint for the mapped inversion problem is strictly enforced in the simulated annealing algorithm, which does not search any solution space that contains negative rates. The minimum rupture rates are defined as 1% of the smooth starting solution, which is a magnitude-dependent waterlevel described below (as it is used as the starting model for the G-R branch). Appendix N: The Grand Inversion 4 Additional G-R branch constraints: This branch is designed to have faults that have Gutenberg-Richter-distributed (with b=1) magnitudes on as small as scale as possible. 1) Magnitude Participation Distribution Smoothness. The purpose of the G-R branch is to make a model that, while fitting the other constraints, has a magnitude-distribution that, at a small scale on faults, is as close to a Gutenberg-Richter distribution as possible. A GutenbergRichter distribution of nucleation’s with a b-value of 1.0 implies a uniform (in magnitude) participation probability for each point on a fault (assuming constant stress drop). Thus, this constraint is implemented by constraining the magnitude-participation distribution for each fault subsection to be uniform. This constraint is applied for each fault subsection magnitude-bin: the rate of ruptures for each magnitude bin, for each fault subsection, is constrained to be close to the average (over all magnitude bins) participation rate for that subsection. This constraint is computationally the most demanding (in terms of both memory and computational time) of all the constraints; for this reason a G-R branch inversion uses 28 GB while the CHAR branch only requires 1.8 GB of memory. The processor time per simulatedannealing iteration scales approximately the same as the memory usage. 2) Magnitude Distribution. The on-fault magnitude-frequency distribution is found by uniformly reducing the total target MFD defined from seismicity by the percentage of moment in the background, for each magnitude bin. Thus the faults are constrained, for Northern and Southern California, to have a total MFD that is Gutenberg-Richter, allowing for a “roll-off” at high magnitudes. 3) Minimum Rupture Rates (waterlevel). The minimum rupture rates are defined as 1% of the smooth starting solution, which is a magnitude-dependent waterlevel described below (as it is used as the starting model for the G-R branch). Additional UNCONSTRAINED branch constraints: This branch is not entirely unconstrained, but rather not over-constrained (as it lacks the regularizing rupture-rate constraint of the CHAR branch, and magnitude participation smoothness constraint of the G-R branch). Due to stochastic nature of the simulated annealing algorithm, this model will give a different solution each time as there are multiple models that fit the data. This branch allows for the exploration of which features on the long-term rates are robust outcomes of the data constraints. 1) Magnitude Distribution. The magnitude-distribution constraint is applied the same way as for the G-R branch (see above). 2) Nonnegativity Constraint. Rupture rates cannot be negative. This is a strict constraint that is not included in the system of equations but is strictly enforced in the simulated annealing algorithm, which does not search any solution space that contains negative rates. Appendix N: The Grand Inversion 5 With the exception of the nonnegativity constraint and the MFD inequality constraint, these constraints are linear and can be combined to form a matrix equation of the form Ax=d. Our task is to set up this matrix equation and solve for x, the rate of all ruptures, given A and d. Starting models for branches In addition to the inversion constraints described above, we also specify a starting model, which is a set of initial ruptures rates given to the simulated annealing algorithm. For the CHAR (characteristic) branch, the starting rates are the UCERF2 ruptures rates (or more accurately, UCERF2 rates mapped onto the inversion rupture set – we find the closest analogues to UCERF2 ruptures given our discretization of the fault system). This means that for most ruptures in the UCERF3 rupture set the starting rate for the CHAR branch will be zero (since there are no corresponding UCERF2 ruptures for most ruptures in the UCERF3 rupture set, since UCERF2 had less than 10,000 on-fault ruptures and the UCERF3 rupture set has 205,254 ruptures for Fault Model 3.1). For the G-R branch, we use a “smooth starting solution” whose rates sum to a Gutenberg-Richter distribution (with a b-value of 1.0 and an a-value set from the total deformation model moment). Rupture rates within each 0.1 magnitude-unit bin are proportional to the minimum-slip-rate subsection in each rupture. As a final processing step before the simulated annealing, the starting solutions for both the CHAR and G-R branches are modified for any magnitude bins where they exceed the MFD constraint rate (all rates within any magnitude bin that exceeds this constraint are uniformly reduced to that the total rate in that bin will equal the constraint rate). This step speeds up the annealing time required to find acceptable solutions. The starting model for the UNCONSTRAINED branch is a rate of zero for all ruptures. Table 1. Inversion Parameter Settings. Setting for CHAR branch Setting for GR branch Setting for UNCONSTRAINED branch Details relative Paleo Rate Weight 1 1 1 Weight of paleoseismic rate constraint relative to slip-rate constraint. Normalized by event-rate standard deviation. relative MFD Equality Constraint Weight 10 10 10 Weight of MFD-equality constraint relative to slip-rate constraint. Normalized by target rate for each magnitude bin. relative MFD Inequality Constraint Weight 1000 1000 1000 Weight of MFD-inequality constraint relative to slip-rate constraint. Normalized by target rate for each magnitude bin. Bilinear Transition Magnitude 7.6 N/A N/A Below this magnitude, the b-value for the on-fault MFD is reduced to allow for background seismicity. This represents the maximum moment of the background for the CHAR branch. Parameter name Appendix N: The Grand Inversion 6 relative Participation Smoothness Constraint Weight 0 1000 0 Weight of magnitude-participation constraint relative to slip-rate constraint. Units of rate/magnitude-bin/yr. participation Constraint Magnitude Bin Size N/A 0.1 N/A Width, in magnitude units, of magnitude bins for magnitude-participation constraint. relative Rupture Rate Constraint Weight 100 0 0 Weight of rupture-rate constraint relative to slip-rate constraint. Units of rate/yr. Applied only to nonzero rates. a Priori Rupture Constraint UCERF2 solution N/A N/A Rates used for rupture-rate constraint. minimum Rupture Rate Fraction 0.01 0.01 0 Fraction of minimum rupture-rate basis that is applied as a minimum rupture-rate (waterlevel). minimum Rupture Rate Basis Smooth starting solution Smooth starting solution N/A Rates used for minimum rupture rates (waterlevel). initial Rupture Model UCERF2 starting solution Smooth starting solution Zero rate for all ruptures Starting set of rupture rates for simulated annealing algorithm. Magnitude-Frequency Distribution (MFD) Constraints Bilinear distribution G-R G-R Target MFD for on-fault ruptures: Total region MFD target less off-fault seismicity. MFD Transition Magnitude 7.85 7.85 7.85 Magnitude to switch from MFD equality constraint to MFD inequality constraint. 2. The Simulated Annealing Algorithm The simulated annealing algorithm solves the nonnegative least squares problem Ax=d with the additional constraint Aineqx ≤ dineq (this last constraint is due to the MFD inequality constraint). This algorithm simulates the slow cooling of a physical material to form a crystal. It has the following steps: 1. Set x equal to initial solution x0. For the CHAR branch, the starting solution is mapped UCERF2 ruptures rates; for the G-R branch, it is the “smooth starting solution” described above, and for the UNCONSTRAINED branch it is a vector of all zeros. 2. Lower the “temperature”, T, from 1 to 0 over a specified number of iterations. We lower the temperature linearly, although different approaches to annealing specify different functions for the temperature. The more iterations (i.e., the more slowly the temperature is lowered), the better the final solution will be. Over each simulated annealing iteration, the following is done: a. One element of x (one rupture rate) is chosen at random. This element is then perturbed randomly. It is here that the nonnegativity constraint is applied – the perturbation function is a function of the rupture rate and will not perturb the rate to a Appendix N: The Grand Inversion 7 negative value. Unlike some simulated annealing algorithms, our algorithm does not use smaller perturbations as the temperature is lowered (this was tested but did not result in faster convergence times). b. The misfit for the perturbed x, xnew, is calculated, and from this the “energy” of that solution: Enew = (Axnew- d)2+Eineq, where Eineq is additional energy from the MFD inequality constraint: Eineq = (min (Aineqxnew- dineq, 0))2. The weight on the inequality constraint is set quite high so that a solution that violates it faces a significant penalty; thus in practice this is a strict inequality constraint. c. The transition probability, P, is calculated based on the change in energy (between the previous state and the perturbed state) and the current temperature T. If the new model is better, P=1. Therefore, a new model is always kept if it is better. If the new model is worse, it is sometimes kept, and this is more likely early in the annealing process when the temperature is high. If E < Enew, P = e(E-Enew)/T. 3. Once the annealing schedule is completed, the best solution x found during the search (the solution with the lowest energy) is returned. (Note that this is a departure from “pure” simulated annealing, which returns the last state found. In some cases the final state will not be the best solution found, since occasionally solutions are discarded for worse solutions). Simulated annealing works similarly to other nonlinear algorithms such as the genetic algorithm. One advantage of simulated annealing is that there is a mathematical basis for it: besides the analogy to annealing a physical material, the simulated annealing algorithm will find the global minimum given infinite cooling time (Granville et al. 1994). There are several advantages of this algorithm in contrast to other approaches such as the nonnegative least-squares algorithm. First, the simulated annealing algorithm scales well as the problem size increases. (In fact, it would not be computationally feasible for us to use the nonnegative least-squares algorithm to solve a problem of this size.) It is designed to efficiently search a large parameter space without getting stuck in local minima. Next, quite importantly, for an underdetermined problem the simulated annealing algorithm gives multiple solutions (at varying levels of misfit depending on the annealing schedule). Thus both the resolution error (the range of models that satisfy one iteration of the data) and the data error (the impact of parameter uncertainty on the model) can be sampled. Finally, simulated annealing can allow us to include other nonlinear constraints in the inversion apart from nonnegativity; in our case we incorporate the MFD inequality constraint, which is a nonlinear constraint that cannot be easily incorporated into the perturbation function. Parallelization of the Simulated Annealing Algorithm In order to tackle the additional computational demands of the statewide inversion, a parallel version of the simulated annealing algorithm has been implemented. This algorithm runs the serial version of simulated annealing (as described above) over a number of processors for a given number of subiterations. Then the best solution among these is kept and redistributed over the processors; this process repeats until convergence criteria (a target misfit, a given number of total iterations, or an allotted annealing time) is satisfied. Appendix N: The Grand Inversion 8 The algorithm scales well up to 20-50 nodes, but adding nodes beyond this does not improve performance. Using the parallelized algorithm on a cluster results in average speedups of 6-20 relative to the serial algorithm. Graphs of simulated annealing “energy” vs. time (lower energy corresponds to a lower misfit) for a range of models and cluster configurations are shown in Figure 2. The solutions presented here were run on the HPCC cluster at the University of Southern California; CHAR-branch models are annealed on 8-core machines for 8 hours, UNCONSTRAINED-branch models for 1 hour. The G-R-branch models require significantly more memory and annealing time; they are run on a cluster of 23 24-core machines for 8 hours. Figure 2. Speedup results for the parallelized simulated annealing algorithm depend on problem size and the amount of constraints on the solutions. Each line shows the average of 5 runs using the number of nodes shown in the legend. More negatively-sloped lines indicate faster convergence speed. 3. Defining the set of possible fault-based ruptures A final a priori step that must be done before running the inversion is defining the set of possible ruptures. The faults are first discretized into subsections, as described in the main report. These subsections are for numerical tractability and do not represent geologic segments. Subsections are linked together to form ruptures, the rates of which will be solved for by the inversion. It is worth noting that here a “rupture” is defined as an ordered list of fault subsections it includes; it has no defined hypocenter. All viable ruptures are generated from the digitized fault sections that satisfy the following rules: 1) All fault sections connect within 5 km or less. This rule is consistent with the maximum jump distance in the Wesnousky database of earthquake surface ruptures (Wesnousky, 2008). 2) Ruptures cannot include a given subsection more than once. 3) Ruptures must contain at least 2 subsections on any involved fault section. Each fault section is discretized into subsections that have a length of at least half the seismogenic width, so this rule results in ruptures with an along-strike length at least as large as the seismogenic-thickness. 4) Ruptures can jump between 2 different fault sections only at their closest point in 3D. This prevents ruptures from “leap-frogging” back-and-forth between two parallel faults. 5) The maximum azimuth change between neighboring subsections is 60 degrees. Appendix N: The Grand Inversion 9 6) The maximum azimuth change between the first and last subsection is 60 degrees. 7) The maximum cumulative rake change (summing over each neighboring subsection pair) is 180 degrees. Note: The current setting of the rake filter allows for Garlock to Northern San Andreas ruptures, which is a departure from the filtering rules used in previous UCERF3 reports. 8). The maximum cumulative azimuth change (summing over each neighboring subsection pair) is 560 degrees. This filter, known colloquially as the “squirrelly-ness” filter, prevents ruptures that have highly non-linear traces. This filter is needed in the UCERF3 fault models in particular because of the high density of faults that could otherwise link in the Ventura and LA basins. An example of the type of rupture excluded by this filter is shown in Figure 3. Without this filter, the number of ruptures would reach hundreds of millions and the inversion would be computationally intractable. Increasing the maximum cumulative azimuth change significantly increases the number of ruptures; the current value is set to keep the number of ruptures low while being high enough to allow near “wall-to-wall” San Andreas ruptures. 9) Coulomb filter: At each branching point (defined as place where a rupture could “jump” from one fault sections to another, whether or not it takes this jump) the Coulomb probability (defined as the ratio of a given stress change at a branching point relative to the cumulative stress change of all possible jumps at that branching point) must be greater than 0.1, unless the sum of Coulomb stresses at each branching point for that rupture is greater than 1.5. (For more details on the Coulomb methodology, see Appendix J.) We are applying the Coulomb modeling to branches only because otherwise some ruptures with no jumps at all between different parent sections (some of which were included in UCERF2) were excluded. The minimum stress exclusion ceiling (the value of 1.5 mentioned above) was added to prevent exclusion of kinematically-favored jumps at branching points where many possible linking-options led to a low Coulomb probability. Examples of ruptures allowed by the above filters but excluded by the Coulomb filter are shown in Figure 4. Some ruptures allowed by the above rules will be highly improbable due to slip-rate differences along strike (as the slowest sections limit how often different fault sections can rupture together). The inversion will naturally satisfy this consideration since it is constrained to satisfy the deformation-model slip rates on each fault subsection. Appendix N: The Grand Inversion 10 Figure 3. Example of rupture excluded by maximum cumulative azimuth change filter (subsections in rupture are shown in red). Figure 4. Examples of ruptures excluded by the Coulomb filter. The above rules define a total of 205,254 ruptures for Fault Model 3.1. By comparison, our UCERF2 mapping has (for a smaller set of faults) 7773 on-fault ruptures. One important remaining question is to what extent ruptures allowed by the above rules may require further penalty. One could impose an “improbability” constraint to any ruptures that are deemed possible but improbable; these constraints a numerically equivalent to an a priori rupture-rate constraint with a rupture rate of zero. These constraints can be weighted as well, so some rupture rates could be penalized more harshly (that is, contribute more to the misfit if they are nonzero) than others. Currently, this improbability constraint is not implemented for the UCERF3 rupture set as we do not have a viable model (that is, a model ready for implementation and agreed to be useful by Working Group members) for the constraint. It is also not clear that Appendix N: The Grand Inversion 11 this constraint is needed. It is not even clear what constitutes a “multi-fault” rupture in the case of a complex, possibly fractal, connected fault network. One way to quantify the amount of “multi-fault-ness” that may not be ideal but at least lacks ambiguity is to simply define “multi-fault” in the context of the names assigned to faults in the database. For this definition of multi-fault, we count all sections of a fault such as the San Andreas fault as one “named fault” even though the individual sections have different names (e.g., Carrizo, Mojave North, etc.). Using the inverted rupture rates for the UCERF3 reference model that we describe below in Section 4, 13% are on multiply-named faults. This increases to 17% for ruptures with M≥6.5 and to 45% for ruptures with M≥7. We can compare this number to the ruptures in the Wesnousky database of surface ruptures (Wesnousky, 2008); of these, 50%, or 14 out of 28, are on multiply-named faults. So by this albeit simple metric, the solutions given by the inversion algorithm are not producing more multi-fault ruptures than are seen in nature. It is important to note that many of the truly multi-fault ruptures seen in nature may in fact not be part of our model at all because they could link up known, mapped faults with unknown faults (or faults that are known but not sufficiently studied as to be included in our fault model). Our model completely separates “on-fault” ruptures from background earthquakes; there are no ruptures that are partly “on-fault” and partly “off-fault”. Without the improbability constraint, the inversion is not likely to give a lower rate to a rupture that is only moderately kinematically compatible (but allowed by our filters) relatively to a more kinematically-favored rupture, unless kinematically less-favored jumps are also disfavored by slip-rate changes. This can be seen in the Figure 5, which compares Coulomb probabilities to normalized subsection pair rates (the rate at which a pair of subsections ruptures together divided by the total rate at which they rupture) for neighboring subsection pairs. There is a positive correlation between these variables (which would imply that the inversion is naturally doing something similar to Coulomb penalties), but it is not particularly strong. Then again, the most egregious subsection pairs that have very low Coulomb probabilities at branching points have already been excluded. Appendix N: The Grand Inversion 12 Figure 5. The probability that 2 subsections will rupture together given by Coulomb modeling vs. the probability given by the UCERF3 reference model. There is some positive correlation, but the inversion is currently not directly constrained to minimize the rates of ruptures with low Coulomb-probability subsections, beyond branching points (those ruptures are excluded from the rupture set entirely). 4. Modeling Results In order to make sense of the inversion results and understand how and why they are different from UCERF2, we present a series of models between UCERF2 and our new UCERF3 models. First, we show the UCERF2 model mapped into our rupture-set framework, and show how well this model matched its own constraints. Next, we show a model that uses UCERF2 ingredients but is generated with UCERF3 inversion methodology – we will see how this model fits the UCERF2 constraints and what changes to the model are a result of methodological changes (rather than changes to data). Finally, we will show models generated with UCERF3 ingredients and UCERF3 methodology. These models use the newer fault models (which have more faults than UCERF2), new deformation models, different magnitude-area and slip-length relations, and updated paleoseismic constraints and seismicity constraints. UCERF2 Reference Model The first model we present uses mean UCERF2 on-fault rupture rates, mapped onto our rupture set. We set the magnitudes of ruptures to their mean UCERF2 values and assume a tapered along-strike slip distribution for each rupture. In the plots below we compare the UCERF2 model to data that was used to constrain the model: slip rates from Deformation Model 2.1, paleoseismic event rates from UCERF2, and the total statewide magnitude-frequency distribution found from 1850-2007 seismicity rates. UCERF2 matches slip rates well on a faultaveraged basis; however, on a smaller scale the model overpredicts slip rate in the center of sections and underpredicts slip rate at fault ends. The UCERF2 deformation model had uniform slip rates for each fault section, yet the model does not fit these uniform rates. This is caused by two factors: 1) the “floating” ruptures on each fault section, which overlap more in the center of the sections, and 2) the tapered slip distribution we assume for the ruptures. In addition, the Appendix N: The Grand Inversion 13 UCERF2 on-fault magnitude distribution, when combined with the background MFD, led to an overprediction of events around M6.5. UCERF2 ingredients, annealed with the UCERF3 inversion algorithm We next apply UCERF3 inversion methodology to the data that was used in the UCERF2 model. This model is able to fit the slip-rate data, paleoseismic event-rate data, and magnitudefrequency distribution constraints better than the UCERF2 model. In particular, we are able to eliminate the “bulge” problem in UCERF2 – and overprediction of earthquakes around M6.5. Through testing changes to the rupture set, we determined that fitting the MFD constraint and thus eliminating this “bulge” required multi-fault ruptures not included in UCERF2. By allowing faults to “link up”, moment that was leading to the M6.5 bulge could be put in largermagnitude ruptures. Figure 6. Slip-rate misfit for UCERF2 (left) and UCERF2 ingredients fit with the inversion algorithm (right). The UCERF3 methodology is able to better-fit the slip rates on a subsection-by-section basis. The UCERF2 solution tends to underpredict at fault section ends and overpredict the section centers, although it is moment-balanced on a section-averaged basis. Note that Mendocino and the San Andreas creeping section were not part of the UCERF2 on-fault solution, so those misfits should be disregarded for the UCERF2 reference model. One can see that the annealed model fits UCERF2 slip rates significantly better than the UCERF2 reference model (more particularly, the inversion can fit these slip rates on a smaller scale). The inversion simultaneously is constrained to fit the paleoseismic event rates as well, and improves the squared residuals by a factor of 2 relative to UCERF2 (Although it should be noted, the model does have more outliers than UCERF2. This could be improved by increasing the event-rate constraint weight, but it does not seem to be a problem in models with UCERF3 ingredients.) Appendix N: The Grand Inversion 14 Appendix N: The Grand Inversion 15 Figure 7. Paleoseismic Event Rates for the UCERF2 Reference Model (top) and UCERF2 Annealed Model (bottom). This plot shows paleoseismically visible participation rates along all fault sections that have a paleoseismic trench in red; the mean event rate and 95% confidence bounds is shown in black. The inversion is able to improve upon the paleoseismic fits of UCERF2; the sum of squared residuals for the annealed model is half that given by UCERF2. Figure 8. Comparison of statewide magnitude-frequency distributions for the UCERF2 reference model (left) and UCERF2 ingredients inverted with UCERF3 methodology (right). Note that the UCERF2 reference model has aftershocks added back in, and neglects aleatory magnitude variability, which was a part of the UCERF2 model. The UCERF2 model overpredicted rates in some magnitude bins (as the on-fault MFD and background together will overpredict the total seismicity MFD shown in black); by contrast, the annealed model on the right is explicitly constrained to match the cyan target MFD which has been constructed so that, when background seismicity is added, the total MFD of the model will match a G-R distribution with a-value and b-value constrained by historical seismicity (black line). The on-fault MFD distribution for the inverted model shown above shows considerable difference from UCERF2. This is by construction; however, this difference leads to a significant difference in the on-fault event rates in the M6.5-7.5 range. This can be seen in the participation plots below. These plots show the rate at which each fault subsection participates (rather than nucleates) in a rupture of a given magnitude; we compare this to UCERF2 participation rates and see considerable differences stemming from the different magnitude-frequency distributions fit by the models. Appendix N: The Grand Inversion 16 Figure 9. Participation rates for the model applying UCERF3 methodology to UCERF2 ingredients divided by participation rates for the UCERF2 reference model for M6-7(left), M7-8 (center), and M≥8 (right). UCERF2 has higher event rates, particularly for M6.5-7.5, which is reflective of the magnitude “bulge” problem in this model. UCERF3 CHAR-branch Reference Model Finally we present our UCERF3 reference branch. This branch is not intended to represent the preferred branch, but rather a reference with which to compare models using other branches on the logic tree. The logic-tree choices for the reference model are shown in the table below. Importantly, the reference branch assumes that 50% of off-fault deformation is aseismic. As discussed in the main report, this is one of several moment adjustments we test in order to bring the deformation model moments into alignment with the historical seismicity rates. Table 2. Logic Tree Settings for UCERF3 Reference Model. Fault Model 3.1 Deformation Model Average of Geologic and Average Block Model (ABM) Inversion Branch CHAR (Characteristic / close to UCERF2) Magnitude-Area Relation Ellsworth-B Slip-Length Relation Ellsworth-B Shape of Slip Distribution Along Strike Tapered Moment Rate Adjustment 50% of off-fault deformation modeled as aseismic Appendix N: The Grand Inversion 17 As shown below, this model fits its constraints quite well. Slip-rate fits are on the whole good, although the model does underpredict the total on-fault target moment (this is the moment given by the deformation model, less creep and subseismogenic-rupture moment) by 6.5%. This underprediction is primarily concentrated on the Imperial and Cerro Prieto faults near the California-Mexico border. The slip rates on these faults are typically underfit by the inversion models due to the MFD constraint (we have tested that removing this constraint leads to good fits on these faults). Both faults have high slip rates and would require many moderately-sized ruptures to satisfy these slip rates due to their isolation (the Cerro Prieto is not close enough to other faults to link with them, and the Imperial fault, while it can link to the north with the Brawley seismic zone, has a much higher slip rate than this fault). The rates of moderately-sized earthquakes are limited strongly by the MFD constraint. Figure 10. Slip-rate misfit for the UCERF3 reference model. The paleoseismic fits for the UCERF3 reference model are shown in Figure 11. All 32 constraints are fitting within their 95% confidence bounds (in fact, we are perhaps overfitting these data due to the generous error bars). Appendix N: The Grand Inversion 18 Figure 11. Paleoseismic event-rate misfit for the UCERF3 reference model. This plot shows paleoseismically visible participation rates along all fault sections that have a paleoseismic trench in red; the mean event rate and 95% confidence bounds is shown in black. On-fault and implied off-fault magnitude distributions for this model are shown in Figure 12. The model fits its target MFD quite well; at high magnitudes it tapers off (which does not contribute to the MFD misfit since the MFD constraint becomes an inequality constraint above M7.85). The background MFD is similar, but smoother, than the background MFD in UCERF2. Appendix N: The Grand Inversion 19 Figure 12. California-wide magnitude-frequency distributions for the UCERF3 reference model. The MFDs shown are as follows: Black – the total target MFD defined from seismicity, Blue – the on-fault MFD found by the inversion, Cyan – the target on-fault MFD used in the inversion MFD constraints, Grey – the implied background MFD (black total target minus blue on-fault MFD), Pink – the UCERF2 background MFD (for comparison). As this is a CHAR (Characteristic / Close to UCERF2) branch model, the on-fault MFD is constrained to deviate from the total MFD below M7.6 in order to leave enough moment for the background seismicity. As with the UCERF2-ingredients model, this model differs in terms of participation rates from UCERF2. This is primarily because the on-fault MFD has been constrained to be quite different, in order to allow for background seismicity while remaining below the total target MFD defined by historical seismicity rates. Appendix N: The Grand Inversion Figure 13. Participation rates for the CHAR-branch UCERF3 reference model (top panels); ratios of these participation rates for faults in UCERF2 (bottom panels). 20 Appendix N: The Grand Inversion 21 Figure 14. Rates at which section pairs rupture together; each subsection pair that can link in a rupture is connected by a line, the color of this line gives the rate that they rupture together normalized by the total rate of ruptures involving those subsections. We also compare the rates at which subsections rupture together in Figure 14. UCERF3 allows for many more fault connections than UCERF2; however, due to slip-rate incompatibilities many ruptures that include these connections have low rates relative to ruptures on single, contiguous faults. G-R branch We now present a G-R-branch model; this model is constrained to have a spatially smooth magnitude distribution on the faults that is as close as possible to a Gutenberg-Richter distribution. All other parameter settings for this branch are set as they are in the reference model presented above. Below we show fits to the data constraints for this model. Appendix N: The Grand Inversion 22 Figure 15. Slip-Rate misfit of UCERF G-R branch model. This branch fits the slip rates somewhat worse than the CHAR branch due to the MFD participation smoothness constraint (which constrains the participation MFD for each subsection to be approximately uniform in magnitude). It underpredicts the on-fault target moment by 13.4%. This underprediction is concentrated on the Cerro Prieto and Imperial faults (as happens with the CHAR branch) as well as the Mendocino section of the Northern San Andreas Fault. The slip rate on the Mendocino fault is difficult for the G-R branch to fit because it has a very high slip rate (twice the slip rate as San Andreas - Offshore) with no tapering at the end or connections to other faults to the west; therefore the MFD constraints limit the rates of smaller earthquakes that could be used at the western end of Mendocino to fit the slip rate there. Appendix N: The Grand Inversion Figure 16. Paleoseismic event-rate fit for G-R branch model. 23 Appendix N: The Grand Inversion 24 Figure 17. California-wide magnitude-frequency distributions for the UCERF3 G-R model. The MFDs shown are as follows: Black – the total target MFD defined from seismicity, Blue – the on-fault MFD found by the inversion, Cyan – the target on-fault MFD used in the inversion MFD constraints, Grey – the implied background MFD (black minus blue), Pink – the UCERF2 background MFD (for comparison). As this is a G-R (Close to GutenbergRichter) branch model, the on-fault MFD is constrained to deviate from the total MFD uniformly for each magnitude bin. Note that the implied off-fault MFD (grey line) plotted here is simply the total target (black) minus the on-fault MFD (blue) and will be truncated at high magnitudes when the background model is constructed (Appendix O). The paleoseismic constraints and magnitude-frequency distribution constraints are fit quite well by the G-R branch. The participation rates for this model, with UCERF2 comparisons, are shown in Figure 17. Many of the differences between this model and UCERF2 are controlled by the MFD. This model assumes that faults are Gutenberg-Richter (and for the fault system as a whole this constraint is matched quite well as shown in the MFD plots above), so this leads to higher rates for many faults in the M6-7 range (particularly fast, through-going faults that were given more moment in larger characteristic ruptures in UCERF2). Appendix N: The Grand Inversion 25 Figure 18. Participation Rates for the G-R model (top) and compared to UCERF2 (bottom) for different magnitude ranges. How “Gutenberg-Richter” is the model? If nucleation rates for a fault section are GutenbergRichter with a b-value of 1 and stress drops are constant with magnitude, incremental participation rates for that section will be uniform. We thus show participation rates for selected fault sections; deviations from uniformity represent deviations from G-R behavior. We also contrast these same sections with our more characteristic reference branch model that is not constrained to have smooth participation MFDs; the reader will notice that these plots show stark differences between these two branches. Appendix N: The Grand Inversion 26 Appendix N: The Grand Inversion Figure 19. Participation rates for selected fault subsections for the Char-branch reference model (left) and G-R branch model (right). Incremental distributions are shown in blue and cumulative distributions are shown in black. Smooth incremental distributions represent behavior that is closer to Gutenberg-Richter behavior. Unconstrained branch The unconstrained branch lacks some of the regularization present in the CHAR and G-R branches as it has no rupture-rate constraint or magnitude participation smoothness constraint. Due to the stochastic nature of the simulated annealing algorithm and the underdetermined nature of the inversion problem for this branch, each unconstrained model will have a different set of rupture rates. We can use this branch to probe which elements of the inversion are robust to this resolution uncertainty. We generate hundreds of models for this branch; the data inputs are the same and the logic-tree parameter choices are the same as the reference model. Thus differences between these models is not due to the data inputs or parameter choices. Each model fits the constraints well; however individual models find different combinations of rupture rates to satisfy these data. Figure 20. Standard deviation of participation rates divided by mean participation rates, for a suite of 1000 Unconstrained-branch models. 27 Appendix N: The Grand Inversion 28 As shown in Figure 20, on average participation rates on fast-moving, long and contiguous faults such as the San Andreas and Garlock have less spread among the Unconstrained-branch inversions than participation rates elsewhere in the model. Moment-Rate Adjustments All of the UCERF3 deformation models have significantly higher moment rates than both the UCERF2 model and historical seismicity (more details on the moment-rate inconsistencies are given in the main report). Thus the inversion is not able to simultaneously satisfy the fault slip rates, the off-fault moment, and the historical seismicity a-value. We have tested several “moment-rate adjustments” to deal with these inconsistencies in the inversion inputs. The models we presented above have a reference moment-rate adjustment, which is a 50% reduction in off-fault aseismicity. Even this 50% reduction is not sufficient to match UCERF2 or historical seismicity moment rates; thus the models presented above still underpredict fault slip rates. For the Geologic plus Average Block Model deformation model, an 80% reduction in off-fault moment rate (that is, assuming that 80% of the off-fault deformation is aseismic (!)) is required to bring moment rates into alignment with UCERF2 moment rates. This more drastic reduction, which will better satisfy the fault slip rates of this deformation model, is another branch in our logic tree. We also have three other options in our logic tree: one is to “do nothing”, that is, invert our inconsistent inputs and obtain a model that does not satisfy these inputs very accurately. As our default MFD constraint is quite strict, in practice this branch results in underpredicting the fault slip rates while leaving the background seismicity rates untouched. Therefore our reference model, with a 50% reduction in off-fault moment and some misfit to slip rates, is something of an average of our “do nothing” and 80% off-fault moment reduction branches. We also have two logic tree branches that assume that the deformation models are correct and reconcile the data by adjusting the total target magnitude-frequency distribution. One of these branches assumes that the historically observed seismicity rate (since 1850) is below the true, long-term seismicity rate, and thus increases the total target MFD by 30%. The other branch relaxes the MFD constraint weights and allows the inversion model to have a “bulge” – this model also violates the observed seismicity rates seen since 1850. In addition, the “relax MFD” bulge will not be Gutenberg-Richter for California as a whole; for this reason we do not prefer this model, although it is interesting to see what the inversion “prefers” in the absence of a strong MFD constraint. Appendix N: The Grand Inversion 29 Appendix N: The Grand Inversion 30 Figure 21. Magnitude frequency distributions (left, in the format of Figure 12) and slip-rate misfits (right) for different moment-rate adjustments. It is important to note that our “increase MFD” and “relax MFD” branches, unlike the other moment-rate adjustments discussed, may significantly increase hazard relative to UCERF2 since these two branches result in overpredicting historically observed moment rates. Other Logic-Tree Branches We have scanned solution misfits for other logic tree branches and found that some branch choices are “preferred” by the inversion, meaning that they consistently have better misfits. For example, uniform slip along the rupture gives better slip-rate fits than the tapered-slip model. This appear to be true regardless of the choices on the other branches of the logic tree. Similarly, the Hanks-Bakun and Shaw-09 magnitude-area relationships perform better than the Ellsworth-B magnitude-area relationship. This is because they allow for larger slips in high-magnitude ruptures, which makes it easier to fit the high (relative to seismicity) moment rates in the deformation models. Some branches do not have a large impact on misfit; the average slip per rupture model, for example, does not have a big impact on this metric. Links to more results We have generated hundreds of models corresponding to different choices in the UCERF3 logic tree; not all models can be presented here. Plots from these models, as well as solution sets that can loaded into SCEC-VDO and viewed can be found at the following links: Characteristic and G-R branch models: http://opensha.usc.edu/ftp/kmilner/ucerf3/2012_03_28-lots-of-combos/ http://opensha.usc.edu/ftp/kmilner/ucerf3/2012_03_28-production-for-realz/ Mean Unconstrained Model & Statistics of Unconstrained Models: http://opensha.usc.edu/ftp/kmilner/ucerf3/2012_03_28-unconstrained-run-like-crazy/ UCERF2 reference solutions (for comparison): http://opensha.usc.edu/ftp/kmilner/ucerf3/2012_03_26-ucerf2_comparisons/ Concluding Remarks The inversion relies on data inputs and models from many other project tasks to determine the rates of on-fault ruptures. Many of these inputs will be uncertain and subject to debate. However, it is important to note that all these uncertainties existed in the old methodology for determining rupture rates as well. The inversion methodology described here eliminates the need for voting directly on rupture rates. It provides a means to easily update the model as new data becomes available. Importantly, the inversion methodology provides a mechanism to simultaneously fit all the constraints concurrently. This was lacking in UCERF2 – expert opinion did not simultaneously satisfy slip rates and event rates, and the final model also presented magnitude distribution problems. The inversion allows all these constraints to be satisfied to the extent that they are compatible. Appendix N: The Grand Inversion 31 The inversion can also be used as a tool to determine when a set of constraints is not compatible, as with the UCERF3 deformation models and historical seismicity rates. This remains one of the most important uncertainties in our inversion in terms of its impact on hazard. To what extent more aseismicity is needed in the deformation models or the historical seismicity rates may underpredict the true long-term rates will largely determine how similar UCERF3 models are to past models such as UCERF2. Acknowledgement Computation for the work described in this manuscript was supported by the University of Southern California Center for High-Performance Computing and Communications (www.usc.edu/hpcc). References Andrews, D. J. and E. Schwerer (2000), Probability of Rupture of Multiple Fault Segments, Bull. Seis. Soc. Am. 90 (6), 1498-1506. doi: 10.1785/0119990163. Field, E. H. and M. T. Page (2011), Estimating Earthquake-Rupture Rates on a Fault or Fault System, Bull. Seis. Soc. Am. 101 (1), 79-92. doi: 10.1785/0120100004. Granville, V., M. Krivanek, and J.-P. Rasson (1994), Simulated annealing: A proof of convergence, IEEE Transactions on Pattern Analysis and Machine Intelligence 16 (6), 652– 656. doi:10.1109/34.295910. Wesnousky, S. G, Displacement and Geometrical Characteristics of Earthquake Surface Ruptures: Issues and Implications for Seismic-Hazard Analysis and the Process of Earthquake Rupture (2008), Bull. Seis. Soc. Am. 98 (4), 1609-1632. doi: 10.1785/0120070111