AIAA 2010-8699 AIAA SPACE 2010 Conference & Exposition 30 August - 2 September 2010, Anaheim, California Comparison of Supportability Analysis Techniques for Lunar Surface Systems Elements Jennifer L. Green * Casitair Consulting, Boston, MA, 02114-2206 Tovey C. Bachman, Ph.D. †, Robert C. Kline ‡, Julie A. Castilho § and David K. Peterson, Ph.D. ** LMI, McLean, VA, 22102-7805 Since 2004, NASA’s Constellation Program’s Lunar Surface Systems (LSS) Project Office has been assessing the supportability of future surface elements in order to quantify the spares mass and volume contribution to the overall resupply demand and to better understand what technology development investments are needed to significantly reduce the supply chain from Earth. Three different analytical techniques were used to obtain initial estimates for LSS availability and spares mass across varying mission durations. Multiple models were used to better understand the relative merits of each analysis tool, to compare availability estimates, and determine which factors contributed to any dissimilar results. We present results of LSS supportability analyses, compare the different analytical methods, discuss complementary capabilities these methods provide, and explain the practical significance of the results for determining future space logistics needs. Nomenclature DC DFR EnvFactor FPHR Kfactor MApm MAran MAtotal MTBF MTBFadj MTBMA MTBMAran MTBMAtotal MTTF MTTFadj Op PMFREQ QPA λ = the active duty cycle = dormant failure rate factor as a percentage of the active duty cycle =increase failures due to in the way removals, infant mortality, etc. = the number of failures per hour (1/MTBF) = multiplier to account for non-inherent failures induced by the environment or operations procedures = preventative maintenance activities = unscheduled maintenance activities = total maintenance activities = mean time between failure in hours = adjusted mean time between failure in hours = mean time between maintenance actions in hours = mean time between unscheduled maintenance actions in hours = total mean time between maintenance actions in hours = mean time to failure in hours = adjusted mean time to failure in hours = operational duty cycle = the frequency for scheduled maintenance in days = the quantity per application (flight quantity). = random failure rate * Consultant, Casitair Consulting, 5 Grove St. #3, Senior Member AIAA † Senior Consultant, Supply Chain Management, 2000 Corporate Ridge, Senior Member AIAA. ‡ Senior Consultant, Supply Chain Management, 2000 Corporate Ridge. § Consultant, Supply Chain Management, 2000 Corporate Ridge. ** Senior Consultant, Supply Chain Management, 2000 Corporate Ridge. 1 American Institute of Aeronautics and Astronautics Copyright © 2010 by the American Institute of Aeronautics and Astronautics, Inc. The U.S. Government has a royalty-free license to exercise all rights under the copyright claimed herein for Go I. Introduction N ASA’s Constellation Program Office has conducted numerous studies related to lunar surface operations since 2004, ranging from high-level campaign analyses to detailed studies of systems and supportability. While NASA is currently shifting to a flexible view of destinations for human space exploration, there is still much that can be learned from lunar studies that is relevant to other space missions. This is particularly true for supportability analyses, which are a key element of successful mission planning, although historically they have been deferred until late in the planning process, driven by limited budgets 1. It is the authors’ intent that lessons learned from the work described here will be useful to either governmental or commercial organizations who are in the early stages of mission planning (when changes to mission architecture are less costly). We begin with an overview of supportability concepts to provide a context for our work. The remainder of the paper then focuses on alternative methodologies for estimating the spare parts mass required to achieve a given system availability. We also show how the lack of spares significantly degrades the probability of mission success. The motivation for performing a sparing-to-availability analysis early in the design cycle is to ensure that, as the mission design evolves, the spares mass does not grow to the point where it becomes an unacceptably large fraction of the total launch mass, thereby negatively impacting mission goals (e.g. reducing the science payload or supplies for the crew) and costs. If analysis by any of the methods we consider indicates an unacceptably large spares mass, these methods can also identify the subsystems or components driving that mass, and help focus engineering efforts (e.g., increasing reliability or redundancy of the items) to remediate this problem. We have found that each methodology discussed has unique strengths and weaknesses, and that using multiple approaches yields greater insight into the nature of the sparing-to-availability relationship than any single approach. This remains true whether we are analyzing supportability of lunar surface operations or an entirely different mission. II. Supportability Supportability includes the concepts of Reliability, Availability, and Maintainability (RAM), as well as integrated logistics support (ILS). To provide a context for our work, we first define the various elements of RAM and ILS, and then provide a supportability concept for lunar surface operations 2. We then focus on the type of maintenance activity where spare parts are required to repair random failures (unscheduled replacement of components). A. Reliability, Availability, and Maintainability We define the components of RAM as in Ref. 2: • Reliability: Reliability is the probability that an item will perform its intended function without failure for a specified mission time under specified environmental and operating conditions. • Availability: Availability is the degree to which a system or subsystem is operable and in a committable state at the start of a mission, when the mission is called for at an unknown (i.e., a random) time. Thus, availability is the proportion of time that a system is in a functioning condition and available for its intended purpose. We will refer to this as functional availability. • Maintainability: Maintainability is the measure of the ability of an item to be retained in or restored to specified conditions when maintenance is performed by personnel having specified skill levels, using prescribed procedures and resources, at each defined level of maintenance. For maximum effectiveness, maintainability must be made inherent during a system’s early design stages. B. Integrated Logistics Support While the ILS concepts we describe here were developed for lunar surface systems (LSS), they are broadly applicable to other space missions. Depending on mission types or phases, not all of the following elements need be present. For example, during a mission to Mars, the phase of inter-planetary travel would not involve a supply support plan. For lunar surface operations, ILS involves the following planning elements (see Ref.2): • maintenance plan • personnel and training plan • technical data plan • supply support plan • test and support equipment plan 2 American Institute of Aeronautics and Astronautics • • transport and handling plan facilities plan. Since our focus is on maintenance and spare parts and their relationship to system availability, we will next consider the maintenance concept in terms of nominal operations (the normal state) and contingency operations (when unexpected failures occur). C. Nominal Maintenance Operations During nominal maintenance operations, maintenance is performed on a continuous basis by the ground crew, surface crew and surface robotic assets; this is described in more detail in Ref.2. Even if the surface crew is not present, maintenance operations can continue in an autonomous state—especially in the areas of predictive and proactive maintenance where continuous monitoring of the hardware is important. There are five areas within the nominal maintenance operations: • maintenance types • maintenance infrastructure • Corrective and Preventative Maintenance (CM & PM) tools and techniques • Predictive and Proactive Maintenance (PdM & ProM) tools and techniques • maintenance resources. CM includes all of the activities to replace or repair a hardware item after a failure has occurred. This involves troubleshooting to verify and isolate the failure, followed by some amount of diagnostics and testing to verify it is fully functional. PM is focused on interval-based maintenance (e.g., 90-day servicing or preemptively replacing a worn part with a 730 day life limit at 700 days), cleaning and servicing, and inspection by the surface crew. PdM & ProM Tools and Techniques include Diagnostic and Test equipment, In-Flight Vehicle Health Management technologies, embedded sensors, dust mitigation techniques, and tools for root cause fault (or failure) assessment (RCFA). RCFA technologies are particularly important when there is no capability to return failed hardware to Earth in order to determine the cause of the failure. PdM focuses on Condition-Based Maintenance where systems and components are continuously monitored to determine whether there are any signs of degraded performance. When the condition of the hardware reaches a predetermined level, a maintenance action is scheduled and performed to minimize system downtime. ProM identifies the root cause of failure in the hardware so that a maintenance plan can be designed that will eliminate or reduce those causes (e.g. restrict the dust contamination that leads to the filter failure that brings down the CO2 Removal System). In this example, PdM would focus on monitoring the status of the filter and schedule a changeout after the dust particulate count exceeded a certain threshold value. Maintenance infrastructure describes the physical locations and environment where maintenance will be performed (e.g., on the lunar surface). Maintenance resources include crew time, robot time (which may also consume crew time if the robots are teleported), and distributed systems such as power, thermal, data, and communications. Crew time is one of the most critical resources for supportability since maintenance time competes directly with science and exploration crew time and therefore must be reduced as much as possible. D. Contingency Maintenance Operations Despite the best attempts to prevent random failures from occurring through nominal maintenance operations, there is always the risk of unanticipated events that could bring down one or more systems. Such events include the failure of redundant systems, human error, radiation events, failure of adjacent systems due to unrelated failures, power surges, and the failure of the monitors and sensors (allowing system degradation to go undetected), as described in Ref.2. The first step in contingency maintenance operations is to determine whether there is an imminent danger to the crew. If there is, it may be necessary to evacuate the crew and abort the mission. If there is no imminent threat, the surface and ground crews will first determine whether there is a true fault, and if so, then attempt to isolate it to the subsystem, line-replaceable-unit (LRU), shop-replaceable-unit (SRU), or even sub-SRU level. The criticality of the hardware will determine how long the crew has to perform test and inspection. After fault-isolation, ground control will determine whether the item should be fixed, depending on the criticality of the item. 3 American Institute of Aeronautics and Astronautics If the item is of low criticality, it is possible that the ground and surface crew will just finish the mission in a degraded capability mode. If the item is mission-critical but not life threatening, the repair could be handled through the normal depot maintenance procedures. If the item is life critical, but not imminently so, (i.e., the failure removes a level of fault tolerance for a critical function), the ground and surface crew may decide to take the system offline and fix the part in real time. In this situation, the “clock is ticking” on system downtime and the sequence in which maintenance procedures are considered would likely be based on how much time each repair requires. The number of possible procedures may vary over the life of a mission, since capabilities may be ramped up over time. We now focus on the spare parts (“spares”) required for contingency maintenance and their relationship to system availability. In the context of contingency maintenance, we define “system availability” as the percentage of time that a system is not down for lack of a spare part 3. This is consistent with the use of system availability in Ref.1, but more restrictive than the earlier definition of functional availability (which requires resources other than spare parts, such as crew time, tools, and test equipment). As shown in Ref.3, system availability is derived from component reliabilities (e.g. MTBFs), the degree and types of redundancy in the system (e.g. hot or cold redundant components, number of parallel channels, etc.), the number of common components 4 employed by the system, the number of spare parts, and the maintenance strategy employed (e.g., what level of repair is possible; whether cannibalization is employed). It is sometimes useful to compare the system availability attained with a given number of spares and a particular maintenance strategy to the availability attained from no spares and no repair— this availability is sometimes referred to inherent availability. III. Problem Definition Lunar surface system operations require a high degree of system availability to ensure mission success, and this is true of space missions in general. As noted earlier, even with the best nominal maintenance strategy (section II. C.), and designs including extensive redundancy, there is always the risk of component failures that require contingency maintenance and replacement with a spare component. The challenge is to quantify the spare’s mass requirement early enough in the design process, so that alternative designs and associated maintenance concepts can be evaluated in terms of sparing. This permits design decisions at a stage when impacts on cost and schedule are less severe than they will be later. The spares requirement is stated in terms of overall spares mass because early in the design phase, system attributes are not definitive— therefore, specific numbers of spares for each type of component cannot be estimated with a high degree of confidence. The question then becomes—given a system’s design, attributes, and maintenance strategy—what is the minimum spares mass required to attain a target level of system availability? Factors that would affect the sparing-to-availability relationship, but not treated here, include the level of repair possible (e.g. replacement of LRUs only, repair of LRUs by replacing SRUs, repair of SRUs, etc.), whether or not cannibalization is allowed, and tradeoffs between reduced spares mass (by virtue of lower level repair) versus increased crew time (to perform that lower level of repair). Some of these factors are considered in Refs.1,3, and 4. IV. Objectives For systems employed in lunar surface operations, and more generally for any space systems where replacement of failed parts is feasible, we would like to determine: • • • • • • • What system availability is attainable for a specified operating duration with no spare parts (inherent availability)? What mass (kg) of spare parts would be required to attain target system availability for a specific operating duration and contingency maintenance strategy? How sensitive is the spares mass required for a target availability to the level of redundancy, the type of redundancy (hot or cold), and the configuration of redundant components? How sensitive is the spares mass required for a target availability to the degree that common components are employed? How does the use of different sparing-to-availability methodologies affect the answers to the above questions? What are the advantages and disadvantages of each methodology used individually? What are the advantages, if any, of using multiple methodologies? 4 American Institute of Aeronautics and Astronautics V. Methodologies E. Overview Over the past several years, lunar supportability analysis has been performed using several different methodologies, as shown in Figure 1 and described in Ref.1. Because the designs of lunar surface elements are still in the conceptual phase, the emphasis has been on developing estimates for spares requirements that are then incorporated into the higher level strategic analysis of the overall campaign. Historically, spares requirements are generated by taking the list of equipment for any given design and breaking it down into a list of maintenance significant items, which in the case of the International Space Station (ISS) program is the LRU or Orbital Replacement Unit (ORU) level. Then attributes such as MTBF, duty cycle, criticality, spares mass, and maintenance crew time are assigned to each item. This data is fed into whichever modeling tool is employed. The two general types of modeling tools are 1) Monte Carlo simulations, which use random number generators as a source of uncertainty and require running multiple trials to examine a range of possible outcomes for a single spares requirements problem; and 2) analytical models, which solve equations that account for uncertainty via probabilities or expected values, but are run only once for each spares requirements problem. Figure 1. Modeling Approaches The main challenge for supportability analysis of the LSS elements is that in order to perform this type of “bottoms-up” analysis, detailed data is required, which is difficult to obtain when the designs are in the conceptual phase and the overall architecture is rapidly changing. Therefore, the process of creating (and re-creating) the element-level supportability dataset is time-consuming and may be counter-productive in the early design phases. However, the “Catch-22” in this situation is that the detailed data is required in order to generate better estimates for spares and crew time, which are required to support the strategic-level analysis (which is subsequently used to help determine the optimal designs of the lunar surface elements). For these reasons, there is a need to simplify the data wherever possible and to use different modeling approaches to try to bracket any associated uncertainties. Three main modeling approaches have been used to perform spares requirements analyses: a simple pipeline model which is used to determine the average annual spares requirement; a “sparing-to-availability” model (such as LMI’s Spacecraft Sustainability Model [SSM]), which optimizes the spares mix in order to obtain a specified system availability given limits in the amount of resources (e.g., spares mass) available; and a Monte Carlo simulation model (such as ARINC’s Raptor). While differing in their methodologies, each of these models requires some form of reliability modeling, whether only at the individual component level or at the subsystem level, which consists of various configurations of 5 American Institute of Aeronautics and Astronautics components (e.g., parallel vs. series). The next section compares several approaches to reliability modeling and contrasts the strengths and limitations of these processes. We will refer to this section later when discussing the various spares requirements models. F. Alternative Reliability Models A system, over its life cycle, will typically exhibit a changing set of reliability and maintainability (R&M) characteristics. To model these dynamically changing characteristics, a variety of R&M modeling approaches have developed. Static models (such as fault tree analysis [FTA] and reliability block diagrams [RBD]) capture a system’s configuration at a point in time; dynamic models (such as Markov chains or Stochastic Petri Nets) can capture a system’s configuration as it changes over time. 1. Fault Tree Analysis (FTA) and Reliability Block Diagrams (RBD) Originally, both FTA and RBD modeling employed Boolean algebra to evaluate the chances of a system’s failure based on properties of its components 5. FTA is an analytical method, based on logic diagrams, which identifies and analyzes likely modes of system failure. FTA starts with the failure of a system (represented by the top of the tree) and relates it to component failures by means of two basic alternatives – represented by the logical operations AND and OR. In contrast, RBDs will diagram the way in which the reliability of individual components, within the context of their system’s structure, contributes to the overall system reliability. RBDs focus on the system’s availability (i.e., reliability) with interconnected blocks representing system components forming an uninterrupted path from the start (source) to the finish (sink). A failed component is represented by a break in the path at the corresponding block. Despite their visual differences, the basic RBD and FTA methodologies are equivalent, with series and parallel connections in an RBD corresponding to the alternatives OR and AND in FTA, respectively. One advantage of these approaches is their simplicity and focus on a single, top-level-oriented objective (such as identifying potential causes of accidents or predicting system reliability). However, this simplicity requires separate models be built for each objective, which can become overwhelming if there are a number of possible scenarios to examine. In addition, while the simplicity of these approaches can lead to compact models, they are not well-suited to modeling systems whose configuration may change over time (e.g., a multi-component system with periodic inspections and repairs). Over time, both FTA and RBD have evolved and now often accommodate dynamic scenarios (such as individual component repairs) through either a Markovian analysis (see below) or discrete-event simulation. Still, neither FTA nor RBD models provide convenient visualization for examining the interactions and dependency among system components. 2. Markov chains An alternative to FTA and RBD’s static, binary representation of a system’s configuration is directly describing the relevant possible states of a system and the transition rules among those states. A continuous time Markov chain is the simplest type of such a state-space representation 6. Markov chains form state-space diagrams where the likely states of a system are identified along with the probabilities of transitioning from one state to another. Since a Markov chain represents a discrete random process which is “memory-less,” the probability of transitioning from one system state to another is independent of the system’s earlier states. Markov chain-based reliability models are typically homogeneous with respect to time; therefore, the transition rates between system states are constant 7. The advantages of Markov chains are that they allow for true dynamic modeling of dependent events, and conceptually they are very simple - consisting of states and transitions between states. In addition, the system’s dynamics can be described by a set of linear differential equations which can be solved in a precise and efficient manner. However, Markov chains have some distinct disadvantages too, such as the size of the model (which exponentially increases with the number of modeled system states), and component transition rates (e.g. failure and repair rates) that are constant over time (e.g., no aging). For example, at the component level, Markov chains do not provide a natural means to capture physically meaningful transition rate variability, such as the replacement of an aged component with a new one. 3. Stochastic Petri Nets (SPN) Similar to Markov chains, SPN structures are state-space based, so the dynamics of a system can be fully captured. However, unlike Markov chains, where each state represents the system as a whole, in SPN the states of individual components are described, and the state of the system is inferred from the states of its components. This “local” view facilitates modeling system aging and often mitigates the state-explosion problem as well. Solutions to 6 American Institute of Aeronautics and Astronautics SPN can be obtained either by converting the problem to a Markov chain (in the case of smaller systems with constant transition rates) or via Monte Carlo simulation. SPN models a system in terms of its static (structural) and dynamic (entities) components. In SPN, entities are represented as tokens and places represent the possible states of those entities. The places that tokens occupy define a particular state of the system, and tokens move between places, simulating changes in the system state. Transitions between places are described by rules for token movements, and transitions only fire when they are enabled (i.e., if certain conditions are satisfied 8). SPN models offer distinct advantages for R&M modeling. They provide a visual means for dynamic changes in system configuration (i.e., moving tokens). They can model concurrent events and a token can have continuous counters assigned, which keep track of the token’s age. Representing aging effects is a natural extension of colored Petri nets (labels can change discretely upon a token’s transition or continuously if a token stays at the same place, as discussed in Ref.8.).In terms of modeling disadvantages, given that SPN models are graphical in nature, they are subject to visualization limits when trying to model complex systems. In addition, their apparent simplicity belies the degree of sophistication required on the part of the R&M modeler when portraying complex models with multiple interrelationships (e.g., modeling a system of systems). 4. Conclusions There are many situations where classical reliability tools, such as FTA and RBD, are sufficient and adequate for modeling a system. However, modeling dynamic R&M effects can be crucial to understanding a system’s reliability. Of course, dynamic models are generally more complex than static models since more R&M details and component information are required by the models to gain this additional insight into a system’s dynamic behavior. In addition, this detail and insight comes at a cost. Markov chain modeling is prone to state-space explosion (although this can be addressed via hierarchical modeling and taking advantage of symmetry to reduce possible system states). SPN models, on the other hand, rely on Monte Carlo simulation to provide a more flexible and realistic modeling environment for R&M scenarios, but the resulting SPN models can be very sophisticated and may also require hierarchical modeling to compensate for higher levels of system complexity. G. Simple Pipeline Model There have been two main variations on the simplified pipeline model (developed in Microsoft® Excel® ††) that have been used to obtain the average spares estimates for LSS elements: A simple spreadsheet calculates maintenance demands based on simplified formulas as shown below: where DC is the active duty cycle; FPHR is the number of failures per hour (1/MTBF), the Kfactor and EnvFactor increase failures (due to in-the-way removals, infant mortality, etc.,) and QPA is the quantity per application (flight quantity). The simplest version of this spreadsheet calculates an average number of failures per year and has no way to account for life limit failures, preventative maintenance, outpost buildup or varying duty cycles based on extended periods of down time. This simple version works well with deterministic demand processes such as estimating the number of air filters required when their replacement is on a set schedule. However when FPHR is assumed to be the mean of a distribution, this approach drastically underestimates spares unless other factors, such as redundancy, compensate. A more complex spreadsheet is based on a verification tool that was developed by the ISS Program in the late 1990s, which was used to validate results from the ISS Reliability & Maintenance Assessment Tool (RMAT) model. RMAT is a Monte Carlo simulation tool that predicts maintenance demands for long-term operations and was used †† Microsoft and Excel are registered trademarks of Microsoft Corporation. 7 American Institute of Aeronautics and Astronautics by the ISS program for its spares predictions. This verification spreadsheet was adapted for lunar supportability analysis and is used to calculate average maintenance demands per year. As compared to the simpler spreadsheet, it adds in a capability to account for life limit failures, dormant failure rates, preventative maintenance, and distributions around a given MTBF or life limit. The main formulas are shown below. In the RMAT, after the MAtotal/year is known for each ORU type, the kg/year, m3/year, crew hours/year and cost/year can be easily determined by multiplying the MAtotal/year by the ORU weight, volume, MTTR (mean time repair) or cost for each ORU type. If the MTTR is different for corrective or preventative maintenance, the crew hours can be calculated separately for each type of action and then added together. While both versions of the spreadsheet model have been used during this analysis over the past five years, only the simpler model was used for the latest LSS Supportability analysis presented in this report. In comparing results from the two spreadsheets from previous analysis, it was determined that the difference between the projected average annual spares mass and maintenance crew time estimates was well within the margin of error associated with the input supportability data. Therefore, the simpler method was determined to be sufficient until the LSS input data reached a higher maturity level. H. Raptor Model Raptor is a software tool created by ARINC to simulate the operation of complex systems, based on operating scenario data and a RBD decomposition of the system into subsystems and their components. For each component in the RBD, the analyst enters attributes such as a probability distribution for failures, maintenance concept, number of spare parts, and cost. The analyst can model hot or cold redundancy, as well as parallel or serial component configurations. Hierarchical decompositions of subsystems are also possible. Raptor uses a Monte Carlo simulation to model the effects of uncertain failures, together with the effects of maintenance and sparing strategies, on system downtime and degraded mode time. I. Spacecraft Sustainability Model 1. Overview LMI developed the Spacecraft Supportability ModelTM (SSMTM) ‡‡ sparing model for estimating the minimum required mass and volume of spare parts for future human space missions 910; this is discussed in detail in Ref.3. The model produces an estimate of these requirements as well as a resource-versus-system availability curve. From this curve, the analyst immediately sees the increase in resources necessary to move to a higher performance level, or, alternatively, the decrease in performance that would arise if resources were constrained. The curve enables the analyst to judge the sensitivity of performance and resource requirements to reliability improvements, miniaturization, or changes in a variety of mission factors. ‡‡ Spacecraft Sustainability Model and SSM are trademarks of the Logistics Management Institute. 8 American Institute of Aeronautics and Astronautics The model estimates the mass and volume of spare parts for future human space missions via a hybrid parametric-analytical approach. At its core, the hybrid model has a notional item database and an analytical optimization engine. The notional item database contains the characteristics of a representative set of items. The analytical optimization engine takes the item data and mission parameters (such as operating durations) and solves stochastic backorder equations, distributes backorders over systems, and computes system availability as a function of the chosen set of spare parts. The model’s optimizer then finds the set of spare parts that either minimizes the spares resources needed to attain a specific availability goal or maximizes availability for a given set of resources. The model’s core is surrounded by a “parametric shell” that allows the analyst to choose key mission elements, adjust the characteristics of notional items to better represent a particular set of hardware, and analyze the sensitivity of results to changes in item characteristics. As system designs mature, notional items can be replaced by real items without changing the model or analytical technique. The model can estimate the minimum spares cost to attain an availability goal, as an alternative to minimizing spares mass and volume. It can also estimate total maintenance time as well as how the repair of lower indenture items affects maintenance time and spares mass and volume, although we did not use those capabilities in this work. 2. Modeling Redundancy in the SSM In order to establish that the SSM sparing model can be used to model certain kinds of system redundancy with a high level of fidelity, we used the SPN@ R&M Monte Carlo simulation to benchmark SSM results with a variety of redundancy configurations. Techniques for accurately modeling commonality, k of n items in parallel, and channels were established and will be discussed in detail below. Commonality Part A k of n Channels Part A Part B Part A Part B Part C Part A Part B Part C Part A Part A Part A Part C Figure 2. Modeling Redundancy in the SSM a. Commonality First, commonality of items across systems in a serial configuration was considered. As shown in Figure 3, the SSM results in blue align with the SPN@ results in red. This is akin to the Raptor “spare pool” method of specifying spares for like items. When the economies of scale are taken into account for items common to several systems, the total spares requirement is less than it would be if each system were considered as separate and independent. 100.0 90.0 80.0 Availability 70.0 60.0 50.0 40.0 30.0 20.0 10.0 0.0 0 10 20 30 40 50 60 70 Day SSM SPN Figure 3. Several Systems with Commonality 9 American Institute of Aeronautics and Astronautics 80 90 b. K of N Second, a k of n configuration was modeled (see Figure 2). This refers to n parallel components, k of which are required for the system to be up. The SSM models cold redundancy of items in parallel by assuming two free spares and one operating component. When the component fails, the spare is assumed to have been switched automatically. Switching failures are not considered. With these assumptions, we again were able to accurately align SSM system availability results to SPN@. c. Channels Lastly, channels were modeled. This refers to two parallel configurations of several items in series, sometimes called “high-level redundancy”. Again, cold redundancy was assumed, the items in one channel do not operate until the other channel has failed (i.e., one of the items in the channel fails). To model this, it was necessary to create a subsystem to represent one entire channel and then identify the items in series within the channel. Though this is an approximation, Figure 4 and Figure 5 demonstrate that the SSM and SPN@ models’ system availability estimates were very close. 100.0 90.0 80.0 Availability 70.0 60.0 50.0 40.0 30.0 20.0 10.0 0.0 0 10 20 30 40 50 60 70 80 90 Day SSM Cold SPN Cold Figure 4. Channels, Cold Redundancy, No Spares 100.0 90.0 80.0 Availability 70.0 60.0 50.0 40.0 30.0 20.0 10.0 0.0 0 10 20 30 40 50 60 70 Day SSM Cold w/Spare SPN Cold w/Spare Figure 5. Channels, Cold Redundancy, With Spares 10 American Institute of Aeronautics and Astronautics 80 90 J. Comparison of Methodologies The simple spreadsheet model is quick to set up and requires little data—it only needs component failure rates or MTBFs and operating durations, along with several factors used to adjust these quantities (as shown in section V.G.). It makes no assumptions regarding redundancy or commonality, so it does not require the user to set up diagrams that describe parallel or series configurations. However, the spreadsheet model only computes spares required to cover mean failures for each component in its operating duration. Because no probability distribution is assumed, it does not consider variability of failures around the mean—there is a significant probability that failure in excess of the mean will occur, and these will not be covered. Also, given the fact that all of the components’ pipeline means are much less than 1, one is faced with the decision to bring either no spares or one spare for each type of component, and the model cannot determine which choice to make. Yet another disadvantage is that the spreadsheet model cannot link spares mass to system availability (but if it did, the system availability rapidly approaches zero as the number of system components increases). Raptor is well-suited for assessments of the combined effects of maintenance and sparing strategies on system uptime, degraded mode time, and downtime, and system availability. This availability is an average across simulation trials. It enables the user to model a component’s failure distributions, which may be parameterized by a mean and variance. Raptor also allows the user to specify detailed configurations of redundant components or channels. However, Raptor is not as well suited for determining the optimal mix of spare parts for a given maintenance strategy, and requires exhaustive testing of spares mixes across numerous simulation trials (on the order of 1,000), to determine a good, if not optimal, set of spares. It also requires detailed data on configuration of subsystems and components, which are not typically available early in the system design phase. Thus Raptor is most useful for assessing overall system availability as a function of spares mass and maintenance strategy once a system is well-defined. SSM determines the spare parts mass required to achieve expected (average) system availability, computed in a single pass via stochastic equations rather than through numerous simulation trials. The model enables the user to model components’ failure distributions either with a Poisson or negative binomial distribution, the latter parameterized by a mean and variance. As discussed in section V.I., the model can be configured to approximately treat redundancy and commonality of components without building reliability block diagrams. In test cases, the availability of several key redundant component configurations agrees closely with that obtained through explicit treatment of the configuration of channels and redundant components with the reliability model SPN@. Analyses with SSM are less time-consuming than those with Raptor, since it is not necessary to build reliability block diagrams that explicitly describe the network of components. The model produces spares mass estimates in a single pass in minutes, rather than numerous simulation trials, which can take hours. Thus SSM can rapidly analyze the spares mass for a variety of missions and hardware designs early in the design phase—this is its strength. The limitations of SSM are that, once definitive system designs are known, it does not model certain complex redundant configurations of components, nor does it model multiple failure modes or degraded system operation. Thus for mature system designs, SSM does not provide as detailed a view of the range of possible system events as does Raptor. VI. Analyses K. Scenario and Systems This analysis is based on a specific lunar surface mission described in Ref.2. This scenario does not represent any specific, currently planned mission. It serves only to illustrate, using real lunar surface systems data, the tradeoffs of using each analytical technique under consideration. 1. Operating Scenario The Malapert Massif is of interest from both a scientific and operational standpoint. Malapert Mountain is an uplifted section of the lunar crust and therefore may have rocks that include some of the earliest and most catastrophic events of lunar basin formation. It is also the highest point in the region, which will be important for communications, lighting and observations of the surrounding terrain. In the scenario used for this analysis, a 15-day mission is performed using a limited number of surface elements. The Malapert excursion of approximately 350 km roundtrip is shown below. 11 American Institute of Aeronautics and Astronautics Figure 6. Malapert Excursion 2. Mission Elements The Malapert mission elements included in this analysis are as follows: two Lunar Electric Rovers (LERs); two Chariot Mobility Chassis (CMCs); two Active-Active Mating Adapters (AAMAs); two Portable Utility Platforms (PUPs); four Suitports; and four EVA suits. Each of these elements will be described in turn. A top-level RBD of the entire set of elements is shown below. Figure 7. Malapert Mission Elements Configuration a. Lunar Electric Rover (LER) The Lunar Electric Rover (LER) provides a pressurized environment for a crew of two to conduct extended-range exploration of the moon, and can carry four crew members for contingency operations. The LER consists of a Chariot Mobility Chassis (CMC) and a pressurized crew cab (PCC). The LER uses two suitports to facilitate quick egress for EVA activities and includes externally mounted manipulators to allow crew to interact with the surface from within the pressurized environment. The LER incorporates a common hatch to facilitate docking with a habitat element, and, with its built-in shielding, the LER fulfills the safe-haven role by providing a radiation shelter for the crew that is accessible from the habitat or while roving on the surface. Figure 8. Lunar Electric Rover (LER) 12 American Institute of Aeronautics and Astronautics b. Chariot Mobility Chassis (CMC) The Crew Mobility Chassis is a roving vehicle designed to carry up to four crewmembers (nominally two) in an unpressurized environment or in a pressurized cab. The chassis can support up to 3000 kg at nominal speeds and greater payloads at reduced speeds. The chassis has interfaces to connect tools for outpost support operation. The chassis can be controlled directly through the chassis driving kit or the pressurized crew cab, telerobitcally or autonomously. Figure 9. Chariot Mobility Chassis (CMC) c. Pressurized Crew Cab (PCC) The Pressurized Crew Cab provides a pressurized environment for a crew of two to conduct extended-range exploration of the moon and can carry four crew members for contingency operations. The PCC carries sufficient consumables for three day operations without resupply or battery recharge. The PCC uses two suit ports, encompassed by an environmental protective cover, to facilitate quick egress for EVA operations and includes controls to operate externally mounted work packages for interacting with the surface from within the pressurized environment, or for use when no crew are present on the surface. The PCC incorporates two common hatches to facilitate docking with habitat and logistics elements, and fulfills, with its built-in shielding, the safe-haven role by providing a radiation shelter for the crew that is accessible from the habitat or while roving on the surface. Figure 10. Pressurized Crew Cab (PCC) d. Active-Active Mating Adapter (AAMA) The Active-Active Mating Adapter enables docking and pressure equalization with other surface systems. It will allow LERs, Hab, and Logistics Modules to dock to each other. The Active-Active Mating Adapter is a new design that is very conceptual in nature. Figure 11. Active-Active Mating Adapter (AAMA) e. Portable Utility Pallet (PUP) The Portable Utility Pallet is designed to attach to mobility chassis with or without a crew cab. It is designed to enable a two-crew, 14-day excursion in an LER. A single 5 m solar is used to provide 4.385 kW net. A PUP is 13 American Institute of Aeronautics and Astronautics carried with each LER and provides power generation and storage capability as well as the ability to recharge the LER batteries. The PUP also has gas and water tanks, and avionics and communication systems. Figure 12. Portable Utility Pallet (PUP) f. Portable Communications Terminal (PCT) The portable communications terminal provides a communication hub for the lunar surface. The services of the PCT are gateway services (low and high data rates) data delivery to Earth via DTE; SWN services; hard wire communication, data storage for retransmission and file sharing; local time and store and forward service and routing. The PCT includes the Ka-Band, S-Band and WLAN systems for the lunar elements. Figure 13. Portable Communications Terminal (PCT) g. Suit port The suit port is a pressure sealing interface between the rear hatch of the EVA Suit and the habitable volume. When the suit is attached (external to the LER aft bulkhead or rear wall of LER pressure vessel), it is continuously exposed to a pressure differential, as the exterior of the suit is at vacuum while the internal volume of the suit is kept at a predetermined pressure to maintain thermal conditioning of the suit. The suit port and suit pressures are raised to nominal LER pressure (8 psi) for donning and doffing suits. During non-EVA periods, the suits are protected from the lunar environment by an environmental protective cover. Figure 14. Suit Port h. Suits The Extravehicular Activity (EVA) System includes the elements necessary to protect crewmembers and allow them to work effectively in environments that exceed the human capability during all crewed mission phases. These elements provide protection from pressure and thermal environments. The EVA System elements include spacesuits, umbilicals, portable life support systems (PLSS), spacesuit servicing equipment, and EVA tools, and stability aids. 14 American Institute of Aeronautics and Astronautics Many factors will affect the spares requirements for the EVA Systems, including the amount of usage, the amount of time that the suit is left pressurized, and exposed to the lunar environment (particularly important when used in a Suit Port configuration). as well as the effect of dust, thermal cycling, and radiation of the suit components, especially the soft materials. Figure 15. EVA Suit L. Analyses with Spreadsheet The spreadsheet model calculates the maintenance demands to obtain the average spares estimate for LSS elements. It performs these calculations assuming a random failure rate λ, based on 1/MTBF (memory-less, no wear out, no event-driven failures). It is also assumed that maintenance actions take place at the end of the mission, so neither repair nor preventative maintenance is considered. The spreadsheet model assumes a 100 percent duty cycle for all elements. In other words, it assumes continuous operation for 15 days, or 360 hours. This is then multiplied by the failure rate and item mass to produce the total spares mass by element. As mentioned previously, the spreadsheet analysis does not calculate inherent or achieved availability for the system, and therefore does not treat degraded modes of operation. It is used to find data anomalies and to identify “heavy hitters”. M. Analyses with Raptor The Raptor analysis also assumed a 15 day mission with a 100 percent duty cycle. It included the elements’ complex redundancy configurations. It assumes everything is locally dependent and all items are critical. Each item’s failure affects the overall availability, unless they are in a redundant path, which is often the case. The analysis covers random failures and does not include preventative maintenance, which is expected to greatly increase the spares and crew time requirement. The results of this analysis are based on the average spares used over 1,000 runs, where common components share a spares pool. Spares values include 30 percent growth. The availability results include the consideration of degraded modes of operation. The Raptor analysis allows for a more fine-grained look at the individual elements, in particular it can assess how redundancy impacts the spares mass requirements. To do this, components or subsystems are modeled as blocks that contain item specific failure, mass, commonality, capacity and dependency data. Nodes are used in conjunction with links to model connectivity logic for complex redundancy structures. Events, which can be used to model discrete event-driven failures such as human error (e.g., running into an unseen rock or crater) and environmental effects (e.g., wear and tear due to dust contamination, thermal cycling, or radiation events), were not employed for this particular analysis. Hierarchies were used to model redundancy within subsystems of each element. N. Analyses with SSM Three types of SSM analyses were conducted. All were designed to model the 15-day Malapert excursion noted earlier. These runs assumed that no repairs or cannibalization were possible, but did consider commonality across sub-elements. a. No Redundancy (pessimistic) The first run made no item-level redundancy assumptions, it considered all items to be in a serial configuration (if one item fails, the system is down) with the given failure rates. The system was modeled as a “fleet” of two LERs and was run to a 49 percent availability target to account for the high-level redundancy of the system. The resultant total spares mass was, as expected, higher than the Raptor results which modeled the complex redundancies within the system to a high level of fidelity. The results of the first run were analyzed by subsystem. The subsystem driving the spares requirement was identified: 53 percent of the spares for the first run were for items in the CMC element. b. General CMC Redundancy (optimistic) The second run made a broad redundancy assumption for the CMC element. The same kit was run with a one-oftwo parallel redundancy assumption for every item in the CMC element (the serial configuration assumption made 15 American Institute of Aeronautics and Astronautics in the first run still held for the other elements). The resultant spares mass was lower than the Raptor results but provided a floor for the spares mass estimates, bracketing the results with minimal configuration assumptions. c. Select CMC Redundancy The third run more precisely modeled the redundancy configuration by using the Raptor reliability block diagrams to identify items within the CMC frame which displayed k of n redundancy. The broad assumption of redundancy made in the second run was fine-tuned to only take into account actual redundant items. VII. Results Results for spares mass, and where applicable, the expected system availability, are displayed in Table 1 below. For all models, the scenario was a 15-day Malapert excursion with two LERs, whose constituent elements are displayed in column 1 (Element). For all methods, the repair strategy was limited to removal and replacement of LRUs—there was no repair of LRUs or replacement of SRUs. The required spares mass to cover the pipeline requirements (essentially mean failures in an operating duration) was computed via spreadsheet for each element and is displayed in column 2 (Simple Pipeline Model). Note that the simple pipeline model does not have information on distributions of failures, commonality, and redundancy. It also does not link the spares requirement to an attained availability for the two LERs. To estimate the availability of the pipeline model, we converted roughly 15 kg solution into a set of spares for the largest pipeline and evaluated that mix in the SSM. Raptor estimated spares mass for each element, required to obtain an overall 50 percent availability (at least one LER operational over the 15 days), is displayed in column 3 (Raptor). Raptor results did account for common components and the complex redundant configuration of components assumed in its reliability block diagrams. Furthermore the results are averages over 1,000 simulation trials, where failure variations across trials reflect component’s failure distributions—a Poisson distribution was used in this case for simplicity, but several other distributions are available. The fact that the spares mass was less for the Raptor analysis (in fact, less for each element) than for the simple pipeline analysis in column 2 is due to Raptor’s ability to model the effects of common components and complex redundancy, as well as to link failures to system availability. The proximity of the totals in columns 2 and 3 is coincidental; there is no reason to expect that for other mission durations, hardware characteristics, or maintenance strategies, the results would still be close. The SSM results in column 4 (No redundancy), targeting 50 percent system availability, are a near-worst case analysis, in the sense that they assume no redundant components; they do however reflect the use of common components. In this case SSM was configured to treat variability of failures with a Poisson distribution as did Raptor (column 3). While in general the spares masses for elements are higher in column 4 than in columns 3 (Raptor) and 2 (pipeline model), that is not always the case, due to the optimization, which selects the spares that produce the most benefit for system availability, given their mass. For example, the SSM did not select any spares for the PUP, where as both Raptor and the simple pipeline model did so. The higher spares mass in the Total row for column 4, relative to the other columns primarily reflects the fact that the SSM was configured for no redundancy. The SSM results in column 5 (CMC redundancy) are close to a best case scenario for spares mass, in the sense that the CMC, which was found to be the overall driver for overall spares mass, was assumed to have a redundant unit for every component. This is responsible for reducing the total required spares mass by more than a factor of three, relative to the Raptor results in column 3, which assume a more limited level of redundancy for the CMC (as well as for other elements). For the results in column 6 (selected CMC redundancy), SSM was configured to approximately model redundancy for each component on the CMC that was treated as redundant in the Raptor RBD. The resultant spares mass was close to that of Raptor, with only a small difference for the same availability (blue line in Figure 16), although the methodologies of Raptor and SSM are quite different. 16 American Institute of Aeronautics and Astronautics Element PCC CMC PUP AAMA PCT SUIT Total Simple Pipeline Model 4.08 4.88 3.04 0.44 0.53 1.56 14.53 Avail 25(estimate) §§ Raptor 4.02 4.69 2.35 0.34 0.41 1.21 13.02 SSM: no redundancy 5.85 19.80 0.00 4.60 2.20 5.06 37.51 SSM: CMC redundancy 1.80 0.00 0.00 0.00 0.40 1.81 4.01 SSM: CMC selected redundancy 3.95 0.00 0.00 4.60 0.40 4.09 13.04 49.50 50.18 49.03 49.05 Table 1. Model results (mass in kg) for 15 day Malapert Excursion The SSM analyses produced spares mass-to-availability tradeoff curves as well as the point estimates for spares mass required to attain the target system availability. The availability-to-resource curves are shown in Figure 16, along with the Raptor point estimate of spares mass, shown in black. Availability vs. Mass 100.00 90.00 Availability (%) 80.00 70.00 Total CMC redundancy Select CMC redundancy No redundancy Raptor SSM Solution 60.00 50.00 40.00 30.00 20.00 10.00 0.00 0 10 20 30 40 50 Mass (kg) Figure 16. SSM and Raptor results The red tradeoff curve was produced by the SSM analysis that assumed no redundancy (but did have common components). The SSM solution on that curve shows the spares mass required to have the mean number of LERs not down for lack of parts equal to one out of two, for the entire 15-day scenario. For comparison with simulations, such as Raptor, this would be equivalent to the average number of LERs up, across an infinite number of trials, being one out of two. The fact that the SSM solution on the red curve is significantly to the right of the black point (Raptor result) illustrates the benefits of redundancy for reducing required spares mass. One should remember of course that moving from non-redundant to redundant systems increases total launch mass. §§ As noted earlier, the simple pipeline model cannot assess system availability. To estimate availability, items were ranked in decreasing order of their pipeline means, and then, moving down the list from the top, we allocated one spare (since all pipeline means were less than one) to each item until the spares mass reached 14.5 kg. We then assessed the system availability for these spares with the SSM—the result was 25 percent. 17 American Institute of Aeronautics and Astronautics The blue tradeoff curve shows results for the SSM analysis where we assumed one redundant component for each CMC component for which the Raptor RBD indicated some level of redundancy. The fact that the SSM solution on the blue curve is nearly coincident with the black point shows that the SSM, using its approximate treatment of CMC redundancy, produced a spares mass similar to that of Raptor, with its more detailed treatment of CMC redundancy. If the SSM analysis included redundancy for all systems, the spares mass would be less than that of Raptor, since Raptor does not make tradeoffs between parts based upon mass. Usually the SSM’s spares mass is roughly 20 percent less than that of non-optimizing resource tradeoff models. The green curve shows the results of the SSM analysis that assumed one out of two redundancy for every component on the CMC. The fact that the green curve is significantly above the blue curve for every spares mass (on the horizontal axis) illustrates the effects of the greater degree of redundancy on LER availability. The three SSM runs together show the sensitivity of availability with respect to the degree of redundancy. These results agree with the more detailed RBD-based Raptor simulation in the case where the redundancy assumptions of the two models’ analyses most closely align. The results of the SSM analyses bracket the results of the more detailed simulation in the case of the spares mass produced by Raptor (green curves above and below black point). The green and red curves estimate the range of outcomes that might be expected with varying degrees of detailed redundancy assumptions. VIII. Findings While all of the spares models produce a required spares mass to attain a mean availability of one out of two LERs for the hypothetical 15-day Malapert mission, spares mass ranges from about 4 kg to almost 38 kg, depending on the methodology employed and the assumptions concerning the degree of commonality and redundancy. In this illustrative case, spares mass estimates can vary by as much as an order-of-magnitude. This wide range would not have been observed had we used only a single methodology and a single set of assumptions regarding the system configuration. Thus we find there is a benefit to using multiple approaches for sparing-to-availability analyses. The simple spreadsheet pipeline analysis has benefit only for checking the pipeline computations and providing a lower bound for mass for a more complex model such as Raptor or SSM. It does not consider variability of failures around the pipeline mean, does not project the effect of a particular set of spares on availability, and does not optimize the spares to produce the greatest availability for least spares mass. The fact that the simple pipeline model produced a spares mass that was close to that of Raptor is coincidental, and we do not expect that this would be true in other cases. In the SSM analyses with approximate treatment of redundancy for the CMC, the results were surprisingly close to the results of Raptor, which had detailed RBDs (Table 1, Total row, columns 3 and 6). The differences in spares mass were at the element level—this is not surprising since the SSM optimizes spares across parts (Raptor does not), and uses a simplified treatment of redundancy (Raptor does not). That means there is benefit to using an optimization model to focus on which elements of the LER should receive most of the spares mass. The SSM results for the no redundancy case (Figure 16, green curve) and the case with complete redundancy for components of the CMC (Figure 16, red curve) bracket the results for the other approaches. Thus the SSM analyses were able to bound the spares mass requirement with a relatively simple treatment of redundancy and commonality; it did not require precise system configurations or the time and detailed data to build RBDs or other system configuration diagrams. IX. Conclusion For future space missions, the overall logistics supportability should be an integral part of mission development and hardware designs, as it interacts with hardware design decisions and plans for deploying that hardware. We were able to show how the supportability concept could be developed in the early stages of planning for hypothetical lunar surface operations. Supportability can be thought of in terms of a nominal maintenance strategy with emphasis on monitoring system health and providing preventative maintenance, and putting in place contingency maintenance strategies, which involved unplanned replacement of components and may involve component repair. For contingency maintenance, sparing-to-availability is a key part of developing the strategy. Furthermore the spare parts required to attain a target system availability is strongly related to the level of redundancy and commonality in the hardware, as well as the maintenance concept. Thus it is important to make sparing-toavailability a part of early system design and mission planning. 18 American Institute of Aeronautics and Astronautics Resource Constraints For sparing-to-availability analyses, there is benefit to using multiple methodologies, as results for the spares mass required to attain a target availability may vary by as much as an order of magnitude, depending on the methodology and assumptions. There is a benefit to using an analytical model such as SSM early in the design phase, because of the ability to rapidly explore the trade space for spares mass vs. availability while the design is not yet well-defined. Detailed simulation models, such as Raptor or SPN@, can provide additional insight into degraded operations and multiple failure modes once more definitive system designs and component characteristics become available. We believe that the use of the varied methodologies, with analytical models such as SSM used more in the early design stages, and increased use of detailed reliability simulations, such as Raptor or SPN@ later in the process would provide the most insight into sparing vs. availability. This phased use of complementary methodologies and models is depicted in Figure 17. Raptor Simulation SSM Optimization Conception Planning Launch Operations Life Cycle / Data Certainty Figure 17. Complimentary models over life cycle Acknowledgments The authors would like to acknowledge Dr. Vitali Volovoi of the Georgia Institute of Technology for his assistance with the R&M modeling and sparing versus redundancy discussions. References 1 Green, J.L. and Watson, K.J., “Supportability and Operability Planning for Lunar Missions,” AIAA-2008-7779, AIAA SPACE 2008 Conference and Exposition, San Diego, California, Sep. 9-11, 2008 2 Green, J.L. and Spexarth, G.R., “A Mars-Forward Approach to Lunar Supportability Planning,” AIAA-2009-6427, AIAA SPACE 2009 Conference and Exposition, Pasadena, California, Sep. 14-17, 2009. 3 Kline, R.C. and Bachman, T.C., “Estimating Spare Parts Requirements with Commonality and Redundancy,” Journal of Spacecraft and Rockets, 44, no. 4, 2007, pp. 977-984. 4 Siddiqi, A., and de Weck, O.L., “Spare Parts Requirements for Space Missions with Reconfigurability and Commonality”, Journal of Spacecraft and Rockets, Vol. 44, No. 1, AIAA, 2007. 5 Rausand, M. and A. Høyland. System Reliability Theory: Models, Statistical Methods, and Applications. 2nd ed. Hoboken, NJ: Wiley-Interscience, 2004. 6 Kijima, M. Markov Processes for Stochastic Modeling. London: Chapman & Hall, 1997. 7 Volovoi, Vitali, Dynamic Approaches to Risk and Reliability Modeling in Design and Operations, Tutorials, Annual Reliability and Maintainability Symposium, San Jose, CA, January 2010. 8 Volovoi, V.V., “Modeling of System Reliability Using Petri Nets with Aging Tokens,” Reliability Engineering and System Safety, 84, no. 2, 2004, pp. 149–161. 9 Bachman T. C., and Kline, R. C., “Model for Estimating Spare Parts Requirements for Future Missions,” AIAA2004-5978, AIAA SPACE 2004 Conference and Exposition, San Diego, California, 2004. 19 American Institute of Aeronautics and Astronautics 10 Kline, R.C. and Bachman, T.C., “Estimating Spare Parts Requirements with Commonality for Human Space Missions,” AIAA-2006-7233, AIAA SPACE 2006 Conference and Exposition, San Jose, California, 2006 20 American Institute of Aeronautics and Astronautics