Draft Modeling Protocol Emissions and Air Quality Modeling for SEMAP Contract # S-2010-04-01 Prepared by: Georgia Institute of Technology University of North Carolina Colorado State University Prepared for: Souteastern States Air Resource Managers, Inc. 526 Forest Parkway, Suite F Forest Park, GA 30297-6140 February 17, 2011 Project Manager: Associate Manager: Talat Odman Zac Adelman School of Civil & Environmental Engineering Georgia Institute of Technology 311 Ferst Drive, Atlanta, GA 30332-0512 odman@gatech.edu Phone: 404-894-2783 Fax: 404-894-8266 Center for Environmental Modeling University of North Carolina–Chapel Hill 137 E. Franklin St., Chapel Hill, NC 27599 zac@unc.edu Phone: 919-966-2206 SEMAP Modeling Protocol Page Left Intentionally Blank SEMAP Modeling Protocol Table of Contents Section 1: Introduction ....................................................................................................................... 1-1 Section 2: Project Overview .............................................................................................................. 2-1 2.1 Management Structure .................................................................................................................. 2-1 2.2 Communication Procedures .......................................................................................................... 2-4 2.3 Participating Organizations .......................................................................................................... 2-4 2.4 Project Schedule ........................................................................................................................... 2-5 2.5 Conceptual Description ................................................................................................................ 2-8 Section 3: Models and Model Inputs ................................................................................................. 3-1 3.1 Meteorology, Emissions and Air Quality Models ........................................................................ 3-1 3.2 Modeling Domains ..................................................................................................................... 3-13 3.3 Episode Selection........................................................................................................................ 3-16 3.4 Input Databases........................................................................................................................... 3-16 3.5 Quality Assurance and Error Correction Protocol ...................................................................... 3-26 Section 4: Technical Approach .......................................................................................................... 4-1 4.1 Simulation Identification .............................................................................................................. 4-1 4.2 General Modeling Procedures ...................................................................................................... 4-1 4.3 Initial Base Case Simulation ......................................................................................................... 4-5 4.4 Diagnostic Model Sensitivities ..................................................................................................... 4-8 4.5 Final Base Case Simulation ........................................................................................................ 4-12 4.6 Typical Base Case Simulation .................................................................................................... 4-12 4.7 Future OTB and Emissions Control Scenarios ........................................................................... 4-13 4.8 Emissions Sensitivities ............................................................................................................... 4-13 4.9 Interactive Database Tool ........................................................................................................... 4-14 Section 5: Air Quality Model Evaluation Protocol ............................................................................ 5-1 5.1 Ambient Air Quality Data ............................................................................................................ 5-1 5.2 Evaluation Tools ........................................................................................................................... 5-4 5.3 Evaluation Procedures .................................................................................................................. 5-5 Section 6: Supplemental Analyses ..................................................................................................... 6-1 6.1 Future Attainment Demonstration Analysis Protocol ................................................................... 6-1 Section 7: Data Backup and Archival Protocol.................................................................................. 7-1 7.1 Georgia Tech ................................................................................................................................ 7-1 7.2 UNC .............................................................................................................................................. 7-1 Section 8: Computer Resources ......................................................................................................... 8-1 8.1 Georgia Tech Computing Resources ............................................................................................ 8-1 8.2 UNC Computing Resources ......................................................................................................... 8-2 Section 9: Project Team Roles and Responsibilities .......................................................................... 9-1 Section 10: Documentation ................................................................................................................ 10-1 Appendix A. SEMAP Emissions QA Worksheets ................................................................................ A-1 Appendix B. SMOKE Emissions QA Checklist and Log for Warnings and Notes .............................. B-1 Appendix C. References ........................................................................................................................ C-1 i SEMAP Modeling Protocol List of Tables Table 2-1. Project schedule ........................................................................................................................ 2-5 Table 3-1. Projection parameters for the CMAQ and CAMx modeling domains ................................... 3-14 Table 3-2. SEMAP WRF and CMAQ Vertical Layer Configurations .................................................... 3-15 Table 3-3. Emissions processing sectors for the SEMAP project ............................................................ 3-19 Table 3-4. 2007 Actual inventory data..................................................................................................... 3-22 Table 3-5. 2007 Typical inventory data ................................................................................................... 3-23 Table 3-6. 2017 OTB inventory data ....................................................................................................... 3-23 Table 3-7. 2017 OTW inventory data ...................................................................................................... 3-24 Table 3-8. 2020 OTB inventory data ....................................................................................................... 3-25 Table 3-9. 2020 OTW inventory data ...................................................................................................... 3-25 Table 4-1. CMAQ and CAMx configuration options ................................................................................ 4-6 Table 5-1. Surface Air Quality Data available for model performance evaluation and input to receptor models for the Southeast ............................................................................................................................ 5-3 Table A-1. Configuration settings worksheet for SMOKEv2.7b .............................................................. A-2 Table A-2. Inventory file recording worksheet for SMOKEv2.7b (sample entry shown in gray) ........... A-7 Table A-3. Ancillary input file recording worksheet for SMOKEv2.7b (sample entry shown in gray) ... A-8 Table A-4. Binary meteorology input and SMOKE output file recording worksheet for SMOKEv2.7b (sample entries shown in gray) ................................................................................................................. A-9 Table A-5. Emissions simulation tracking worksheet (sample entries shown in gray) ......................... A-10 List of Figures Figure 2-1. SEMAP project organization chart.......................................................................................... 2-3 Figure 3-1. SMOKE modeling system ....................................................................................................... 3-5 Figure 3-2. SEMAP 36-km (grey) and 12-km (white) modeling domains .............................................. 3-14 Figure 3-3. Regional Planning Organization map.................................................................................... 3-18 Figure 4-1. Modeled (red solid line) and observed (blue dots) 24-hr average organic carbon concentrations (gm-3) at South DeKalb, GA from May 1st to July 15th, 2009. ...................................... 4-11 Figure 6-1. Location of various PM2.5 monitors in the U.S. for SMAT..................................................... 6-2 ii SEMAP Modeling Protocol List of Acronyms AIRS AMET AOD AQM BC BEIS3 CAMx CB05 CEM CEMPD CENRAP CIRA CoST CMAS CMAQ CTM DDM DVC DVF EC EGU EMF EMS EMP FAQS FB FE GIS GIT HAP IC IDT I/O API IMPROVE JFSP MAQSIP MAT MCIP MEGAN MSA NAAQS NASA NATA NCDAQ Atmospheric Infrared Sounder Atmospheric Model Evaluation Tool Aerosol Optical Depth Air Quality Model Boundary Conditions Biogenic Emission Inventory System, version 3 Comprehensive Air quality Model with extension Carbon Bond—Version 2005 Continuous Emissions Monitor Center for Environmental Modeling for Policy Development Central Regional Air Planning (association) Cooperative Institute for Research in the Atmosphere Control Strategy Tool Community Modeling and Analysis System Community Multiscale Air Quality Chemistry-Transport Model Decoupled Direct Method Current Year Design Value Future Year Design Value Elemental Carbon Electrical Generating Unit Emissions Modeling Framework Emissions Modeling System Emissions Modeling Platform Fall-line Air Quality Study Fractional Bias Fractional Error Geographic Information System Georgia Institute of Technology Hazardous Air Pollutant Initial Conditions Interactive Database Tool Input/Output Applications Programming Interface Interagency Monitoring of Protected Visibility Environments Joint Fire Science Program Multiscale Air Quality Simulation Platform Modeled Attainment Test Meteorology-Chemistry Interface Processor Model of Emissions and Gases from Nature Metropolitan Statistical Area National Ambient Air Quality Standards National Aeronautics and Space Administration National-scale Air Toxics Assessment North Carolina Division of Air Quality iii SEMAP Modeling Protocol NEI netCDF NIF NMIM NMOC OAQPS OC ORD ORL OSAT OTAQ PAMS PAVE PM PPTM PSAT QA QAPP QMP QC RDBMS RFP RHR ROSES RPO RRF SAMI SANDWICH SDBMS SEARCH SEMAP SERDP SIP SLAMS STN TAWG UNC VIEWS VISTAS WRAP WRF National Emissions Inventory network Common Data Form National Inventory Format National Mobile Inventory Model Non-Methane Organic Compounds Office of Air Quality Planning and Standards Organic Carbon Office of Research and Development One-Record-per-Line Ozone Source Apportionment Technology Office of Transportation and Air Quality Photochemical Assessment Monitoring Stations Package for Analysis and Visualization of Environmental Data Particulate Matter Particle and Precursor Tagging Methodology PM Source Apportionment Technology Quality Assurance Quality Assurance Project Plan Quality Management Program Quality Control Relational Database Management System Request for Proposal Regional Haze Rule Research Opportunities in Space and Earth Sciences Regional Planning Organization Relative Reduction Factor Southern Appalachian Mountains Initiative Sulfates, Adjusted Nitrates, Derived Water, Inferred Carbonaceous Mass and estimated aerosol acidity (H+) Spatial Database Management System Southeastern Aerosol Research and Characterization Study Southeastern Modeling Analysis and Planning Strategic Environmental Research and Development Program State Implementation Plan State and Local Air Monitoring Stations Speciated Trends Network Technical Analysis Work Group University of North Carolina Visibility Information Exchange Web System Visibility Improvement State and Tribal Association of the Southeast Western Regional Air Partnership Weather Research and Forecasting model iv SEMAP Modeling Protocol Section 1: Introduction In 2009, SESARM initiated a new Southeastern Modeling, Analysis and Planning (SEMAP) project to produce technical analyses to aid member states, in the development of SIPs for O3 and PM2.5, and in the demonstration of reasonable progress for the RHR, required under the Clean Air Act Amendments. Specifically, since the precursor emissions and some of the atmospheric chemical processes are interlinked in the way one can control the chemical formation and transport for fine particles, ozone and regional haze, an integrated assessment using a oneatmosphere modeling approach is needed to address the air pollution problems in the SESARM states. It is anticipated that this analysis will help the SESARM state agencies to protect human health as well as the environment from the impacts of air pollutants. To address the air quality and emissions modeling needs of the SEMAP project, Georgia Institute of Technology (Georgia Tech), the University of North Carolina Institute for the Environment (UNC) and the Colorado State University Cooperative Institute for Research in the Atmosphere (CSU) formed a team. A contract (Contract # S-2010-04-01) was signed on April 13, 2010 between SESARM and Georgia Tech, the lead organization of the team. Dr. Talat Odman from Georgia Tech is the project principal investigator. Mr. Zac Adelman from UNC and Mr. Shawn McClure from CSU are co-investigators. They are joined by a large number of investigators and staff from all three institutions to perform the tasks of this project. The objective of the SEMAP project is to provide data and technical support to the SESARM members in the development of state implementation plans (SIPs) for O3 and PM2.5, and in the demonstration of reasonable progress for the regional haze rule (RHR). The Project Team will pursue this objective through the collection and preparation of meteorology and emissions data, the development and analysis of air quality modeling simulations, and the distribution and documentation of all modeling results. The SEMAP project team will meet this objective using a modeling system configuration chosen in collaboration with SESARM. The following modeling systems will be used to accomplish the goals of the SEMAP project: Weather Research and Forecasting (WRF) model – meteorological model Meteorology-Chemistry Interface Processor (MCIP) – prepare Weather Research and Forecasting (WRF) model data for emissions and air quality modeling Sparse Matrix Operator Kernel Emissions (SMOKE) System – convert emissions inventories into formats for air quality modeling MOtor Vehicle Emissions System (MOVES) – on-road mobile source emissions model Biogenic Emissions Inventory System (BEIS) – biogenic emissions model Community Multiscale Air Quality (CMAQ) model – Eulerian air quality model Comprehensive Air Quality Model with eXtensions (CAMx) – Eulerian air quality model This document presents the modeling protocol for the SEMAP project. Its intent is to detail the data collection, modeling approaches, and quality assurance/quality control (QA/QC) processes that will be used in the project. Following from U.S. EPA guidance documents on air quality GIT/UNC/CIRA-SEMAP 004.v3 1-1 April 29, 2011 SEMAP Modeling Protocol modeling and analysis (EPA, 2007; EPA, 1999; EPA, 1991), this protocol is a comprehensive approach to the acquisition, production, assessment, archival, and documentation of all data used for the SEMAP project. After presenting the modeling periods and domains this protocol details the modeling systems that will be used during the SEMAP project, how the input databases will be compiled. Protocols for model performance evaluation, future attainment demonstration, quality assurance and error correction, and data backup and archival are then followed by descriptions of the computer resources and the project team. The project schedule is then followed by a description of the documentation that will be produced for the project. This document concludes with two appendices of emissions QA/QC worksheets that will be used during the SEMAP project. GIT/UNC/CIRA-SEMAP 004.v3 1-2 April 29, 2011 SEMAP Modeling Protocol Section 2: Project Overview 2.1 Management Structure The SEMAP tasks present a large and complex work effort with significant data throughput and an aggressive schedule of deliverables. To maintain coordinated workflow, meet the project milestones, and ensure adherence to the project Quality Assurance Project Plan (QAPP), a tiered management structure is proposed that focuses on the primary data elements of the project. The project Principal Investigator (PI), Dr. Talat Odman, has the ultimate responsibility for ensuring that the Project Team properly works toward the project goals and objectives. The project Quality Assurance (QA) Manager, Dr. Adel Hanna, is independent of the PI and will act with the Co-Investigators, Shawn McClure and Zac Adelman, to ensure that the work and deliverables produced during this project meet all requirements of the QAPP. Ms. Jeanne Eichinger, as the Documentation and Deliverables Manager, will serve as an intermediary between the project CoInvestigators and the data managers to ensure that all deliverables are generated on time and in accordance with the project QA/QC requirements. Dr. Sarav Arunachalam, Ms. Uma Shankar, Dr. Aijun Xiu, and Mr. Zac Adelman will serve as the Air Quality Data, Observational Data, Meteorology Data, and Emissions Data Managers, respectively; as data managers they will supervise the acquisition, storage, generation, transfer, and documentation of all project data relevant to their data elements. Figure 2-1 summarizes the project organizational structure and the communication lines for all key project staff. In general, the flow of information and communication for the project will be downward, initiated at SESARM and going through the project PI, Dr. Odman. The CoInvestigators will be responsible for managing the work at their organizations and enforcing the QA/QC requirements at the individual level. The project QA Manager will oversee and audit the QA/QC process for the entire Project Team. The data mangers will be responsible for performing and documenting QA/QC on individual data elements in accordance with the QAPP. The Documentation and Deliverables Manager will ensure proper record keeping of all QA/QC processes and that all products delivered as part of the SEMAP project meet the requirements of the QAPP. Note the central role played by the Documentation and Deliverables Manager as the coordinator between the technical staff/data managers and the project investigators/QA manager. The QA Manager is responsible for the development, assessment, implementation, and approval of the Quality Management Program (QMP). In the event that the QA Manager believes the project activities do not meet program QA/QC policies or requirements, he may recommend, following consultation with the project PI and the SESARM Project Coordinator, suspension of projects and request corrective actions. Remedies and actions will be identified to ensure that project activities meet QA/QC policies. Mr. John Hornback, SESARM Contract Officer will oversee all activity on the contract, and supervise managerial aspects of the project. He will help facilitate communications among State/local agencies, the Georgia Tech/UNC/CSU project team, and the SESARM Project Coordinator. GIT/UNC/CIRA-SEMAP 004.v3 2-1 April 29, 2011 SEMAP Modeling Protocol Mr. Ron Methier, SESARM Project Coordinator will work with the SESARM Contract Officer and State/local Agency Coordinators to define the air quality and emissions modeling activities needed to support regional haze modeling and planning activities. He will be the responsible official for this project overseeing the overall project and budget, as well as tasking Georgia Tech with work required to complete this project. He will communicate project needs to Georgia Tech's project coordinator. GIT/UNC/CIRA-SEMAP 004.v3 2-2 April 29, 2011 SEMAP Modeling Protocol Figure 2-1. SEMAP project organization chart GIT/UNC/CIRA-SEMAP 004.v3 2-3 April 29, 2011 SEMAP Modeling Protocol 2.2 Communication Procedures The components of the SEMAP project are implemented using a task structure and approach that can very effectively complete the work. Each task/subtask is measured and documented for QC, and corrective action will be taken immediately if a problem is found. Prior to delivery to SESARM and other interested parties, all outputs from the SEMAP project tasks will be assessed against the objectives stated in the project work plan to ensure that the work has been completed and quality assured. If SESARM requests that the work products are to be distributed to specific parties, a usability assessment will be completed to ensure that the products are commensurate with their needs. It is necessary to have feedback at all steps in the workflow to provide specific QC at the task level, assessment of the program objectives, and determination of usability to assure that the air quality modeling needs of the states, tribes, and stakeholders in the SESARM region (henceforth referred to as SESARM members) are being fulfilled. Each task in the work plan will have appropriate entries for objectives, subtasks, and deliverables. The formal review and reporting is the project PI’s responsibility. The QA Manager is responsible for verifying that all tasks are appropriately documented and reported, and will report any discrepancies to the PI along with recommendations for correction. The project PI will note the QA Manager’s recommendations in the monthly report to SESARM along with the corrective actions taken. A monthly report containing the following elements for each of the 16 tasks in the SEMAP project will be distributed by the 15th of each month to the SESARM Project Coordinator: 2.3 Work completed the previous month Expected work during the next month Difficulties encountered and corrective actions taken Actions awaiting SESARM authorization Status of project deliverables Participating Organizations The following organizations are participating in the SEMAP project: Southeastern States Air Resource Managers, Inc. (SESARM) – SEMAP project sponsor and technical advisor The Georgia Institute of Technology (Georgia Tech) – SEMAP project lead contractor, air quality modeling and analysis, project management The University of North Carolina (UNC) – emissions and meteorology data preparation, air quality modeling and analysis The Cooperative Institute for Research in the Atmosphere (CIRA) – web-based data display, distribution, and analysis GIT/UNC/CIRA-SEMAP 004.v3 2-4 April 29, 2011 SEMAP Modeling Protocol 2.4 Project Schedule The original project schedule can be found in Section 2.2 of the Project Work Plan. Since then, the schedule of deliverables has been revised. Those revisions are reflected in the schedule of deliverables below. Table 2-1. Project schedule Projected completion date Deliverable Task 1: Project Management Draft project work plan to SESARM 4/28/2010 Draft QAPP to SESARM 4/27/2010 Revised/final QAPP incorporating reviewers comments 6/11/2010 Revised/final project work plan 6/11/2010 Participation on conference calls with the SEMAP Project Coordinator, TAWG, and other contractors (monthly) Ongoing function Prompt responses to questions raised by SESARM and other contractors (as needed) Ongoing function Monthly project reports summarizing work performed on each task, progress on deliverables, and expected progress during the next month (monthly) Ongoing function Participation in up to two project meetings TBD Task 2: Modeling Protocol Initial version of the emissions and air quality modeling protocol 6/13/2010 Wiki version of the reviewed protocol accessible through project website 2/21/2011 Task 3: Data Acquisition/Dataset Preparation for Modeling and Evaluation MCIP outputs for SMOKE and CMAQ models 7/16/2010 Observational datasets in formats that are suitable for the model evaluation software and receptor models to be used in Task 10 7/16/2010 CMAQ-ready ICs/BCs generated from the GEOS-Chem model 8/13/2010 SMOKE-ready 2007 SEMAP emission inventories and ancillary data 2/18/2011 SMOKE-ready future-year SEMAP emission inventories and ancillary data 4/15/2011 SMOKE-ready inventory data for the non-SEMAP regions, including Canada and Mexico, to use for the 2007 modeling 2/18/2011 SMOKE-ready inventories for the non-SEMAP regions to use for the future-year modeling 4/15/2011 Latest BELD and MEGAN land use and emissions factor data gridded to the SEMAP modeling domains 2/18/2011 MCIP outputs for SMOKE and CMAQ models 1/14/2011 GIT/UNC/CIRA-SEMAP 004.v3 2-5 April 29, 2011 SEMAP Modeling Protocol Photolysis rate inputs for CAMx model 3/4/2011 WRFCAMx outputs for CAMx model 3/4/2011 Gridded satellite data for column NO2 and HCHO from OMI and extinction AOD from MODIS and OMI for selected periods 2/18/2011 Task 4: Conceptual Description Draft interim report prepared according to EPA guidance and submitted to the SEMAP TAWG 2/4/2011 Revised/final interim report incorporating reviewers’ comments 3/4/2011 Task 5: 36-km/12-km Emissions/AQ Annual Actual Base-Year Simulations Emissions and air quality model results on the SEMAP 36-km and 12-km modeling grids for the following simulations: Initial “2007 Actual” SMOKE and CMAQ 5/6/2011 Final “2007 Actual” SMOKE and CMAQ 9/2/2011 Final “2007 Actual” SMOKE and CAMx 9/2/2011 Task 6: Model Configuration Diagnostic Sensitivity Modeling CMAQ-ready emissions with 34 vertical layers 4/15/2011 Evaluation of the CMAQ predictions and comparison to the model performance between the 19- and 34-layer simulations 5/27/2011 CMAQ-ready emissions with biogenic CO, NO, and VOC estimates calculated with MEGAN 4/22/2011 Evaluation of the MEGAN and BEIS CMAQ predictions and recommendations on which model to use to estimate biogenic emissions for the SEMAP project 6/3/2011 Diagnosis of vertical mixing/mixing height problems through analysis of long-range plume transport 6/10/2011 Comparison of the new SOA mechanism by Baek (2009) to the standard CMAQ mechanism and evaluation with OC data 6/17/2011 Comparison of ACM cloud with RADM cloud and evaluation with wet deposition data 6/24/2011 Final recommendation on model configuration, with supporting evidence 7/1/2011 Task 7: Emissions and Air Quality Benchmark Simulations Document summarizing the technical transfer protocol and all model utilities and scripts used by the project team for the modeling tasks 12/2/2011 Task 8: 36-km/12-km Annual Typical/Future-Year Emissions AQ Modeling Emissions and air quality model results on the SEMAP 36-km and 12-km modeling grids for the following simulations: “2007 Typical” SMOKE, CMAQ, and CAMx “2017 OTB” SMOKE, CMAQ, and CAMx GIT/UNC/CIRA-SEMAP 004.v3 2-6 9/2/2011 12/2/2011 April 29, 2011 SEMAP Modeling Protocol “2020 OTB” SMOKE, CMAQ, and CAMx 12/2/2011 “2017 Control” SMOKE, CMAQ, and CAMx 3/2/2012 “2020 Control” SMOKE, CMAQ, and CAMx 3/2/2012 Task 9: Emission Sensitivity Modeling Annual CAMx PSAT simulation on the SEMAP 36-km modeling grid with analysis products posted on the project website 11/4/2011 Emissions sensitivities for daily and annual PM2.5 obtained from the brute force CMAQ simulations on the 36-km grid, processed in a similar way to VISTAS 11/4/2011 Emissions sensitivities for 8-hour maximum ozone obtained from the DDM method on the 12-km grid, processed at selected SEMAP sites Emissions sensitivities for 8-hour maximum ozone obtained from the brute force method on the 12-km grid, processed at selected SEMAP sites 11/18/2011 12/2/2011 Task 10: Model Performance Evaluation All standard analyses specified in RFP Attachment A for (a) ozone and precursors; (b) fine PM, coarse PM, and constituent mass concentrations; and (c) regional haze metrics 9/30/2011 Diagnostic interpretation of ozone performance regarding the impact of NOx- and VOC-limited regimes on the model performance using “photochemical day” analysis 9/30/2011 Interpretation of the modeled AOD vs. those reported from OMI and MODIS, using complementary analyses of model performance at the surface 9/30/2011 Task 11: Future Year Model Projections Relative Reduction Factors and Future-Year Design Values for O3, PM2.5, and reasonable rate of progress for regional haze goals at all monitored locations, MATS inputs and outputs, and UAA for O3 and PM2.5, from CAMx and CMAQ model outputs for 2017 and 2020 future-year scenarios 3/16/2012 Task 12: Interactive Database Tool Analyze and understand SEMAP data and metadata 3/18/2011 Enhance and extend VIEWS/TSS database 5/20/2011 Develop import routines and procedures 6/17/2011 Test and refine the import system 8/12/2011 Develop Interactive Database Tool(s) (IDT) by leveraging existing ones 8/12/2011 Test and refine the IDT(s) Ongoing function Collaborate with UNC to import and integrate modeling and emissions data Ongoing function Task 13: Receptor Modeling Apply CMB and PMF to speciated PM2.5 data from 20 locations, and compare results to each other and to related studies; prepare displays of daily source contributions using CMB and PMF, as well as seasonal and annual averages, along with GIT/UNC/CIRA-SEMAP 004.v3 2-7 9/30/2011 April 29, 2011 SEMAP Modeling Protocol uncertainties as provided by both techniques Apply the chosen CMB approach to up to 20 locations (depending upon data availability) to provide estimates of source contributions to ozone increases and how those contributions vary with ozone level and time of year 12/2/2011 Task 14: Development of Technical Web Site Create FTP site for organizing and disseminating data products 8/31/2010 Develop technical website by leveraging existing resources 5/20/2011 Integrate IDT(s) with technical website 3/25/2011 Package IDT(s) and technical website for delivery to SESARM 5/6/2011 Collaborate with SESARM to transfer and configure finished system 5/20/2011 Test and refine website and FTP site Ongoing function Reliable methods for transferring data between modeling contractors and SEMAP members (as needed) Ongoing function Archive of all SEMAP modeling input and output data (as needed) Ongoing function Task 15: Other Tasks as Assigned (Deliverables will be determined when tasks are assigned) TBD Task 16: Reporting Interim task reports documenting each major task milestone (as needed) 2.5 Ongoing function Draft final project report 3/30/2012 One hard copy of final project report, and PDF version posted on project website 5/25/2012 Conceptual Description A qualitatively description of the ozone, PM2.5, and regional haze problem in the SEMAP region was prepared. This description was derived from the air quality data provided by the SEMAP states, the answers by the states to the questions in Chapter 11 of EPA’s Guidance for preparation of conceptual descriptions and relevant results of recent modeling efforts in the SEMAP region. A summary of this conceptual description is included here. Statewide average ozone design values trend downward for all SEMAP states after the peaks in 1999 and 2000. This trend is most likely due to the reductions in ozone precursor emissions. The ozone trends of urban, suburban and rural sites are generally very similar with a few exceptions. On the other hand, geographic grouping of the sites as coastal, inland and mountain showed that ozone values at coastal sites of Georgia and South Carolina and inland sites of Virginia have not been decreasing as rapidly as at other sites. In the last decade, annual PM2.5 has been decreasing in all SEMAP states. A slight increase has been observed between 2005 and 2007 but this is most likely due to the influence of adverse GIT/UNC/CIRA-SEMAP 004.v3 2-8 April 29, 2011 SEMAP Modeling Protocol weather conditions during those years. Except in Florida where annual PM2.5 at rural sites have been decreasing at a much slower pace, the trends among urban and rural sites are very similar. The downward trend is more noticeable at the mountain sites of Georgia than inland or coastal sites of the state and the coastal sites of Virginia experience a slower decrease than inland and mountain sites of the same state. The statewide average 24-hr PM2.5 has also been decreasing in all SEMAP states during the last decade. However, unlike the trends in annual PM2.5, there are significant differences between the rates of decrease at urban and rural sites. The rural sites experience a slower decrease rate in Florida, Kentucky, South Carolina, Tennessee and Virginia. In Mississippi and North Carolina, on the other hand, the rate of decrease of the 24-hr PM2.5 design values is faster at rural sites compared to urban sites. The 24-hr PM2.5 at coastal sites has been decreasing at a much slower rate compared to inland and mountain sites, and has been actually increasing at one coastal location in North Carolina. Both local and regional factors play a role in the SEMAP region’s 8-hour ozone nonattainment problem. As emissions are being reduced in individual SEMAP states, regional factors become increasingly more important. The concentrations of ozone and its precursors are also high at higher elevation sites and aloft based on a limited number of observations aloft. In general, violations of the 8-hour ozone National Ambient Air Quality Standard occur at several monitoring sites throughout nonattainment areas. While the exceedances are frequent in some states (e.g., AL, TN, VA) they are less frequent (e.g., KY, MS) or rare in other states (e.g., FL, SC, NC, WV), especially in recent years. There are some characteristic spatial patterns associated with the exceedances such as grouping in the downwind vicinity of metropolitan areas. Typically, violations do not occur because of mesoscale wind patterns as high temperatures and low winds are the most general accompanying meteorological patterns, except in locations like Birmingham AL, Hampton Roads VA, Ohio and Kanawha River Valleys in WV. There have been significant reductions in NOx emissions in or near the 8-hour ozone nonattainment areas, except in MS and TN, as well as some VOC reductions, as a result of many controls on both industry and motor vehicles and some plant shutdowns. The most notable reductions in EGU emissions came after the NOx SIP Call and other regulations such as North Carolina’s Clean Smokestacks Act. As a result of these emission reductions, the ozone design values in all s SEMAP states have been trending downward. The biggest decreases were observed at some inner city monitors. The coastal areas of the region have decreased the least over the last 5 years. The trends of the precursor concentrations are mostly unknown but VOC concentrations have been trending downward at a few speciation sites where they are measured with no apparent changes in VOC profiles. The results of past modeling studies in the region suggest that there is both a local and a regional component to ozone, each playing a role in nonattainment. Reducing NOx emissions is much more effective than VOC reduction at reducing ozone. In general, biogenic VOC reductions are more beneficial than anthropogenic VOC reductions. High temperature, light winds and clear skies are the most common meteorological conditions coinciding with 8-hour ozone exceedances. In coastal regions, the position of pressure ridges matter because they determine how far inland the see breeze can penetrate and break up the ozone buildup. GIT/UNC/CIRA-SEMAP 004.v3 2-9 April 29, 2011 SEMAP Modeling Protocol The annual PM2.5 nonattainment in the SEMAP region is caused both by local and regional factors. While regional factors are believed to play a bigger role in general, sometimes local factors such as wood burning can lead to exceedances. The relative importance of primary and secondary components of PM2.5 is not known for sure. Measurements at a limited number of sites suggest that sulfates and nitrates, both secondary, are significant while primary components such as soils are small. Sulfates and organics are the most prevalent component of measured PM2.5. Sulfate concentrations are similar between urban and rural areas while organic and elemental carbon concentrations are generally higher in urban areas. This suggests that sulfate is widely distributed throughout the region while urban areas see additional incremental PM2.5 loadings from local emissions sources. The measured PM2.5 components roughly agree with the mix of emissions surrounding the monitoring site. For example, metals are high in Birmingham AL. There are also large gradients of primary PM2.5 around monitors near urban centers, industrial areas, and rail yards. There are some indications that SO2 emissions limit the formation of secondary sulfate formation. Violations at monitors located in valleys and surrounded by mountains are believed to be caused by mesoscale wind patterns. Economically driven plant shutdowns, consent decrees, and SO2 and NOx controls significantly reduced PM and precursor emissions. The downward trend in PM2.5 levels are believed to be tied directly to these emission reductions. Regional SO2 emission reductions are believed to play the biggest role in this decrease. The decrease in design values is observed throughout the SEMAP region with no apparent spatial patterns. Recent modeling studied by VISTAS and ASIP suggested that, in general, regional SO2 sources and local primary PM2.5 sources contribute to the PM2.5 problem. Ammonium sulfate is the predominant contributor to PM2.5. The models predict that regional SO2 emission reductions along with local elemental and organic carbon emission reductions will have the greatest benefits. In general light winds associated with high pressure centered overtop lead to high PM2.5 concentrations. During the warm season, PM2.5 increases with moisture. PM2.5 concentrations are also higher ahead of approaching cold fronts. During the cold season, strong nocturnal inversions often lead to high PM2.5 concentrations. Especially in summer, high PM2.5 concentrations often correspond with high ozone concentrations. Ammonium sulfate is the major contributor to visibility impairment followed by nitrates and organic carbon. Visibility, good or poor, appears to track well among nearby class I areas of the SEMAP region. There is a clear distinction between coastal and mountainous Class I areas. Poor visibility generally occurs at stagnant high pressure conditions when it is hot and humid. Poor visibility generally coincides with high observed concentrations of ammonium sulfate and sometimes organic carbon. The meteorological conditions leading to good visibility are cool temperatures, low humidity, cold frontal passage and high wind speeds. On good visibility days ammonium sulfate is still the largest constituent of particulate matter. Recently, Visibility Improvement States and Tribal Association on the Southeast (VISTAS), the regional planning organization for the SEMAP region, supported regional scale modeling and assessment of the regional haze problem. VISTAS projected responses to emissions reductions expected by the year 2018 under existing federal and state regulations for the southeastern United States. This analysis determined that ammonium sulfate, predominantly from sulfur dioxide emissions from electric generating utilities and industrial sources, contributed 60 to 70% to the visibility impairment on the 20% haziest days; particulate organic matter, predominantly from biogenic emissions and biomass burning, was the second largest contributor (Brewer and GIT/UNC/CIRA-SEMAP 004.v3 2-10 April 29, 2011 SEMAP Modeling Protocol Moore, 2009). As for controls, sulfur dioxide emission reductions are projected to result in nearly one-to-one regional sulfate reductions; oxides of nitrogen emission reductions have minimal impact on nitrate concentrations and haze. One modeling study supported by VISTAS estimated PM and visibility at Class I areas of Southeastern U.S. in the year 2018 (Odman et al, 2007). According to the 2018 on-the-way scenario, which included the Clean Air Interstate Rule, no Class I area in the region is expected to experience visibility degradation on 20% best visibility days. However, SO2 controls and increasing ammonia emissions may increase nitrate levels at a number of sites. Additional controls may be necessary for reasonable progress towards the goal of achieving natural visibility conditions on 20% worst visibility days by the year 2064. In the same study, using 2018 emission projections as the base case, various emission reduction options were simulated to evaluate each option in terms of visibility improvement on 20% worst visibility days and degradation, if any, on 20% best visibility days. Domain-wide SO2 reductions showed the largest benefits at the majority of Class I areas on worst 20% visibility days followed by NOx reductions at mountain sites and PC reductions at other sites. NH3 reductions were somewhat effective at coastal sites but not at mountain sites and VOC reductions were ineffective. None of the reductions made visibility worse than current conditions at any Class I area. When more detailed analysis of the SO2 reductions was conducted, it was found that reductions in elevated SO2 emissions were significantly more effective than ground-level SO2 reductions. Of these elevated SO2 sources, coal-fired power plants generally had the largest effect on visibility. When the coal-fired power plants of the region were broken down by state visibility at each Class I area was affected at a different level by emissions from different states. In general, the largest impacts could be attributed to the closest states however the contributions of distant states were often significant. When the non-EGU emissions were broken down by state, the contribution from distant sources became more obvious. This result points to the regional nature of the haze problem and the need for regional control strategies. Another VISTAS modeling study estimated the responses and sensitivities (responses normalized by emissions reduced) of O3, PM2.5, and Bext (calculated from PM2.5 components) to SO2, NOx, NH3, VOC, and PC emissions in the Southeastern US for the year 2009 (Odman et al., 2009). The analysis of the modeling results focused on areas of concern (AOC) and Class I areas in the VISTAS region. The findings of this study are as follows. The sensitivity of O3 to VOC emissions is 2-3 orders of magnitude smaller than the sensitivities to elevated and ground-level NOx; therefore, VOC controls would be much less effective than NOx controls for reducing O3 levels in Southeastern US. Although sensitivities to ground-level NOx emissions from AOC are larger, state-wide reductions of elevated NOx emissions by the same ratio (30%) are generally more effective in reducing summertime O3. Reducing one state’s ground-level NOx emissions generally produces the largest effect in that state. Elevated NOx reductions have a less localized and more regional effect. Responses are larger in southern VISTAS states where O3 forming potentials are greater. It is possible to achieve as much as 3.5 to 5 ppb decreases in 2009 O3 levels by reducing ground-level or elevated NOx emissions by 30% throughout the VISTAS region. In the VISTAS states, sensitivity of PM2.5 is generally largest to PC emissions (sensitivity to ground-level sources in AOCs being larger than elevated sources statewide), followed by SO2 emissions (sensitivity to non-EGU being larger than EGU emissions), and NOx emissions GIT/UNC/CIRA-SEMAP 004.v3 2-11 April 29, 2011 SEMAP Modeling Protocol (sensitivity to ground-level being larger than elevated sources). There are several exceptions to this rule depending on the season of the year, magnitude of emissions from different source categories in each state, and the proximity of the sources to the sampling sites. The sensitivities provide a means of comparing the controls of different pollutants on a per-unit-mass basis but, ultimately, the amounts of controllable emissions determine the achievable PM2.5 reductions. Since PC emissions are limited but there are ample SO2 emissions that can be controlled, responses to 30% SO2 reductions are generally larger than responses to 30% PC reductions. In this respect, SO2 reductions are ultimately more “beneficial” in summer. As much as 1.12 g/m3 decrease in 2009 PM2.5 levels can be achieved by reducing EGU SO2 emissions by 30% throughout the VISTAS region. The benefits generally decrease from north to south and from west to east. The second most beneficial summertime strategy in the region is the reduction of non-EGU SO2 emissions, followed by ground-level and elevated PC controls, respectively. The impacts of PC reductions are highly localized; as much as 0.69 g/m3 decrease in PM2.5 levels can be expected. The impacts of elevated and ground-level NOx controls are generally smaller. In winter, ground-level PC controls are generally more beneficial than EGU SO2 controls. PM2.5 levels can be decreased by as much as 1.61 g m-3 by reducing the ground-level PC emissions by 30%. The maximum response to 30% EGU SO2 reductions is a PM2.5 decrease of 0.32 g/m3. Colder temperatures in northern VISTAS states limit secondary sulfate formation and the potential benefits of SO2 emissions reductions. It is possible to achieve greater impacts in some states by reducing elevated NOx emissions than by reducing EGU SO2 emissions (however note that the uncertainties in sensitivities to NOx are larger than those to SO2). When PM2.5 responses are normalized by total emissions reductions over the VISTAS region, sensitivity to NH3 emissions ranks second after sensitivity to ground-level PC in winter and is larger than sensitivities to ground and elevated NOx emissions in summer. In the Southeastern US, NO3NH4 levels are generally limited by the availability of NH3, so controlling NH3 emissions can be a more effective strategy for PM2.5 reduction than controlling NOx emissions. Wintertime PM2.5 levels can be reduced by as much as 0.75 g/m3 on average in VISTAS states by reducing NH3 emissions from those states by 30%. VOC controls would have no significant impact on PM2.5 levels in this region. At the mountain Class I areas, which have their worst visibility days in summer, the greatest visibility benefits would likely be from EGU SO2 reductions within the VISTAS states. At the costal Class I areas, which generally have their worst visibility days in winter, controlling NH3 emissions would be more beneficial for reducing NO3NH4 contributions to visibility impairment than controlling NOx emissions from either ground-level or elevated sources. This is consistent with Odman et al. (2007), which suggests NH3 control can be a very effective strategy for reducing regional haze in winter. Controlling anthropogenic VOC emissions or PC emissions from elevated or ground-level sources would have little if any visibility benefits at Class I areas of the Southeastern US. GIT/UNC/CIRA-SEMAP 004.v3 2-12 April 29, 2011 SEMAP Modeling Protocol Section 3: Models and Model Inputs 3.1 Meteorology, Emissions and Air Quality Models This section describes the modeling systems that will be used in the modeling and analysis of the air quality data for the SEMAP project. We have selected the following modeling systems because they present the best combination of scientific and computational formulations to satisfy the requirements of the SEMAP project. The air quality, meteorological, and emissions models selected for this project follow the recommendation in the EPA Modeling Guidance (i.e., Guidance on the Use of Models and Other Analyses for Demonstrating Attainment of Air Quality Goals for Ozone, PM2.5, and Regional Haze) and Appendix W to Part 51 of 40 Code of Federal Regulations: Guideline on Air Quality Models. 3.1.1 WRF 3.1.1.1 Description The WRF v. 3.1.1 mesoscale meteorology model is a fully compressible, nonhydrostatic model with an Eulerian mass dynamical core (Skamarock et al. 2008). Two dynamical cores are available: the Advanced Research WRF (ARW) core, developed and supported by NCAR, and the Nonhydrostatic Mesoscale Model (NMM) core, whose development is centered at NCEP’s Environmental Modeling Center (EMC) and support is provided by NCAR’s Development Testbed Center (DTC). The WRF-NMM is designed to be a real-time forecast model so physics schemes are preset. SEMAP chose WRF-ARW (hereinafter WRF) because it includes multiple physics options for turbulence/diffusion, radiation (long and shortwave), land surface, surface layer, planetary boundary layer, cumulus, and microphysics. The major physics options used are: microphysics – Morrison(option 10); cumulus – Kain-Fritsch(1); PBL – ACM2/PleimXiu(7); surface layer – Pleim-Xiu(7); surface – Pleim-Xiu(7); longwave and shortwave radiation – RRTMG(4). Vertical velocity damping was used to help prevent excessive vertical motion which may cause the model to crash. Per US EPA, 6th order diffusion was used to prevent gridscale artifacts when using the Pleim-Xiu PBL Scheme. Additional details can be found in “Protocol for WRF Meteorological Modeling in Support of Regional SIP Air Quality Modeling in the SEMAP” (SEMAP, 2011). WRF was selected for use in upcoming SIP air quality modeling by a consensus of several RPOs because the model is: Generally considered the most technically advanced public-domain prognostic model available for operational use in preparing inputs to urban- and regional- scale photochemical air quality models. Suitable for a broad spectrum of applications across scales ranging from meters to thousands of kilometers. Flexible and efficient computationally, while offering the advances in physics, numerics, and data assimilation contributed by the research community. GIT/UNC/CIRA-SEMAP 004.v3 3-1 April 29, 2011 SEMAP Modeling Protocol In addition, EPA expects that air quality modelers will also shift from MM5 to WRF (http://www.epa.gov/scram001/metmodel.htm ). The computational domain for the WRF annual simulation consists of a coarse and fine grid nested domains. The coarse domain consists of a Lambert conic conformal projection with specifications used in previous regional modeling studies, which is centered at 40° N and 97° W with true latitudes of 33 and 45° N. The coarse grid structure consists of an array of 165 cells in the east-west direction and 129 cells in the north-south direction with a grid spacing of 36 kilometers (km). This domain was designed to maximize the usage of the Eta analysis region. A nested domain is implemented to resolve finer scale meteorological features. The nested domain grid structure is a 250 by 250 array of grid cells with a grid spacing of 12 km. This domain begins at node (66, 18) of the course domain. The nested domain shares the northern, eastern, and southern boundaries of the EPA 12 km Eastern domain and western boundary of the Iowa/LADCO 12 km domain. This 35-layer vertical structure was designed to provide increased resolution in the PBL and near the tropopause. The second level is placed at ~20 meters so that the midpoint of the grid cells in the vertical direction is located at 10 meters and corresponds to the standard National Weather Service (NWS) anemometer height. This avoids the use of a surface layer parameterization to vertically interpolate horizontal wind components to the standard NWS observational height. The top of the model is fixed at 50 millibars (mb). The geographic data for the outer domain was at 10 m resolution and for the inner nest was 2 m resolution. The North American Mesoscale (NAM) model fields with 40 km horizontal resolution, 27 vertical levels, and 3-hour intervals were used for initial and boundary conditions. The map projection was Lambert conformal conic projection with the first true latitude of 33° N and second true latitude of 45°N with 97°W as the longitude parallel to the conic axis. The model was reinitialized after each segment of 5.5 days (132 hours). The first 12-hour period is for the model spin-up (i.e., the first 12 hours of data were discarded in the application of the datasets). 3.1.2 MCIP 3.1.2.1 Description The Meteorology-Chemistry Interface Processor (MCIP) (Otte and Pleim, 2010) is the software program linking the meteorological model and the air quality model. It takes MM5/WRF outputs and processes them into CMAQ ready data sets. MCIP is designed to maintain dynamic consistency between the meteorological model and the chemical transport model as much as possible. MCIP reads in meteorological model outputs in their native formats and then transforms horizontal (CMAQ domains is smaller than MM5/WRF domains) and vertical coordinates (vertical collapsing) according to the user’s definition. Gridding parameters are defined accordingly. MCIP not only pass through all the necessary meteorological variables but also derive variables needed by CMAQ, such as Jacobian and contravariant vertical velocity. Certain additional atmospheric fields such as cloud information are diagnosed in MCIP. Final products from MCIP are the meteorological variables in IO/API format required by the CMAQ modeling system. GIT/UNC/CIRA-SEMAP 004.v3 3-2 April 29, 2011 SEMAP Modeling Protocol We will use the latest version of MCIP (Version 3.6, release on March 2010) to process the provided WRF model outputs to prepare meteorological fields for CMAQ simulations. The horizontal and vertical coordinates are defined according to SESARM’s guidance. 3.1.2.2 Limitations In WRF model, users have the option to add or reduce output variables through the Registry system (Skamarock et al., 2008), but MCIP code need to be modified to ingest these changes. Also when new physics packages added into WRF model, MCIP needs to be changed as well to reflect the new schemes. Nevertheless, these limitations should not affect this project. 3.1.2.3 Input Requirements MCIP needs WRF model outputs in netcdf format. Also it requires WRF model outputs these variables specific for CMAQ model: fractional land use (LANDUSEF), leaf area index (LAI), Monin-Obukhov length (RMOL), aerodynamic resistance (RA), stomatal resistance (RS), and vegetative fraction (VEGF_PX). These variables are in the WRF Registry and need to be output. Availability: MCIP is freely available for download from http://www.cmascenter.org 3.1.3 WRFCAMx 3.1.3.1 Description The WRFCAMx program generates CAMx meteorological input files from WRF (ARW core) hourly output files. A single WRFCAMx run provides meteorological fields for a single CAMx grid, whether defined as a CAMx master coarse grid, or a nested grid. The program generates the files for any duration (from a single hour to several days). For a single day, 25 hours of meteorology must be present (midnight through midnight, inclusive) as these fields represent hourly instantaneous conditions and CAMx internally time interpolates these fields to each model time step. Precipitation fields are not time-interpolated, but rather time-accumulated, so cloud/precipitation files contain one less hour than other met files (e.g., 24 hours of cloud/precipitation vs. 25 hours other met). The CAMx physical height layer structure (meters AGL) is defined from the geopotential heights at each of WRF's "eta" levels, and thus varies in space and time. Vertical averaging of WRF layers to a coarser CAMx layer structure is allowed. Variables in each WRF layer are mass-weighted to each bulk CAMx layer. Several options are available to derive Kv (diffusivity) fields from WRF output. If TKE is available, Kv can be derived from this; otherwise, other options diagnose Kv from wind, temperature, and PBL parameters. WRFCAMx outputs can be converted into a format viewable in PAVE using the METPAVE utility. The development of photolysis rate inputs for CAMx is crucial for the photochemical mechanisms. The AHOMAP program (version 3) prepares albedo/haze/ozone column input files for CAMx, and must be run prior to running the Total Ultraviolet (TUV) radiative transfer model as it defines the albedo, haze, and ozone column intervals based on input data. A CAMx-ready landuse file (from WRFCAMx) and total integrated ozone column (TOMS) data files (available at http://jwocky.gsfc.nasa.gov/) must be supplied as input. The program assumes a constant haze turbidity value for the entire grid. It should be noted that gaps in day-specific TOMS data at low GIT/UNC/CIRA-SEMAP 004.v3 3-3 April 29, 2011 SEMAP Modeling Protocol latitudes, and high latitudes during winter, may result in zero ozone column values over portions of the modeling grid. These gaps result from the inability of polar-orbiting satellite platforms to cover the entire globe on a daily basis. Seasonal-average TOMS input files (no data gaps) may be used instead; alternatively, AHOMAP provides an option to fill these day-specific gaps with an average determined from valid data processed for the extraction domain. Additionally, the Total Ultraviolet (TUV) radiative transfer program (version 4.0.1) develops photolysis rate inputs for all CAMx CB4, CB05, and SAPRC99 photochemical mechanisms via a deltaEddington radiative transfer model. TUV originates from UCAR (http://cprm.acd.ucar.edu/Models/TUV/). The program uses the albedo, ozone column, and haze turbidity intervals defined in AHOMAP to generate a look-up table of photolysis rates to be used in CAMx. The table defines photolysis rates for several CB05 photolytic reactions over a range of solar zenith angles, altitudes, ozone column, surface UV albedo, and haze turbidity. CAMx internally adjusts the photolysis rates for cloud cover according to the cloud inputs provided to CAMx (from WRFCAMx). We will use the latest WRFCAMx (version 2.2), AHOMAP program (version 3), and TUV program (version 4) to process the provided WRF outputs to CAMx ready meteorological fields. The horizontal and vertical structures will be defined by SESARM. 3.1.3.2 Input Requirements WRFCAMx reads in WRF outputs in netcdf format. In addition to the WRF outputs, the AHOMAP and TUV programs need TOMS data from NASA web site (http://jwocky.gsfc.nasa.gov). The sequence of running these programs should be: WRFCAMx, AHOMAP, and TUV. Availability: All programs and utilities described above can be downloaded at http://www.camx.com/down/support.php. TOMS data can be obtained from http://jwocky.gsfc.nasa.gov 3.1.4 SMOKE 3.1.4.1 Description The Sparse Matrix Operator Kernel Emissions (SMOKE) modeling system is a set of programs that is used by the U.S. EPA, Regional Planning Organizations (RPOs), and State environmental agencies to prepare emissions inventory data for input to an air quality model such as CAMx. SMOKE converts annual or daily estimates of emissions at the state or county level to hourly emissions fluxes on a uniform spatial grid that are formatted for input to an air quality model. SMOKE integrates annual county-level emissions inventories with source-based temporal, spatial, and chemical allocation profiles to create hourly emissions fluxes on a predefined model grid. For elevated sources that require allocation of the emissions to the vertical model layers, SMOKE integrates meteorology data to derive dynamic vertical profiles. In addition to its capacity to simulate emissions from stationary area, stationary point, and on-road mobile sectors, SMOKE is also instrumented with the Biogenic Emissions Inventory System, version 3 (BEIS3) for estimating biogenic emissions fluxes (U.S. EPA, 2004) and the both MOBILE6 and the Motor Vehicle Emission Simulator (MOVES) 2010 model for estimating on-road mobile GIT/UNC/CIRA-SEMAP 004.v3 3-4 April 29, 2011 SEMAP Modeling Protocol emissions fluxes from county-level vehicle activity data. SMOKE can additionally be used to calculate future-year emissions estimates, if the user provides data about how the emissions will change in the future (Figure 3-1). Figure 3-1. SMOKE modeling system SMOKE uses C-Shell scripts as user interfaces to set configuration options and call executables. SMOKE is designed with flexible QA capabilities to generate standard and custom reports for checking the emissions modeling process. After modeling all of the emissions source categories individually, SMOKE creates two files per day for input into CMAQ or CAMx: (1) an elevated point source file for large stationary sources, and (2) a merged gridded source file of low-level point, mobile, non-road, area, and biogenic emissions. The efficient processing of SMOKE makes it an appropriate choice for handling the large processing needs of regional and seasonal emissions processing, as described in more detail by Houyoux et al. (1996, 2000). We will use the latest version of SMOKE (version 2.6 released on October 2009) for the SEMAP modeling. Additional technical details about this version of SMOKE are available from CMAS website (CMAS, 2010). 3.1.4.2 Limitations SMOKE is a software tool and not a set of data files; therefore, SMOKE relies on user-provided data files for emission inventories and factors to apply to those emissions. The factors assign the annual inventory data to the hours, grid cells, and model species and can be adjusted by the user in a way that is the most appropriate for the inventory sources included in the air quality GIT/UNC/CIRA-SEMAP 004.v3 3-5 April 29, 2011 SEMAP Modeling Protocol modeling domain. In addition SMOKE requires meteorology data in the Input/Output Application Programmers Interface (I/O API) format to process meteorology-dependent emissions sectors. The temporal and spatial extents of the SMOKE modeling periods are dictated by the input meteorology. SMOKE can neither interpolate between different grid resolutions nor project/backcast to dates that are not covered in the input meteorology. SMOKE has strict requirements for the nature and formats of the inventory data that it can use. 3.1.4.3 Input Requirements SMOKE primarily uses two types of input file formats: ASCII files and I/O API files. Input files are files that are read by at least one core SMOKE program, but are not written by a core program. SMOKE uses strict rules that define the format and content of the input files. These rules are explicitly laid out in the SMOKE Users Manual (CMAS, 2010). All data input to SMOKE must be either formatted to one of the prescribed input file types or converted to an intermediate form, such as a gridded I/O API inventory file, before it can be input to SMOKE. In general SMOKE requires an emissions inventory, temporal allocation, spatial allocation, and chemical allocation data to prepare emissions estimates for an air quality model. For some source categories, such as on-road mobile and stationary point sources, SMOKE also requires meteorology data to calculate emissions. SMOKE calculates biogenic emissions estimates with gridded land use, vegetative emissions factors, and meteorology data. Further details about the SMOKE input requirements are available at CMAS website. We will use the latest version of BEIS (version 3.14 released on October, 2008 as a part of SMOKE model) for the SESARM modeling. Additional technical details about this version of BEIS are available from CMAS website (CMAS, 2010). Availability: SMOKE is freely available for download from http://www.smoke-model.org 3.1.5 BEIS 3.1.5.1 Descriptions The Biogenic Emissions Inventory System (BEIS) model was developed to estimate emissions of VOCs from biological activity from land-based vegetative species and nitric oxide emissions from microbial activity in certain soil types. BEIS version 3 (BEIS3) was deployed within the SMOKE modeling system and included support for coupling with predicted hourly meteorology data. BEIS3 uses land use data projected to an air quality modeling grid to compute normalized gridded biogenic emissions for different land use categories. The normalized emissions are adjusted for temporal variability using gridded, hourly meteorology data and speciated using precomputed VOC profiles for different photochemical mechanisms. BEIS outputs air quality model-ready hourly, gridded biogenic emissions files. The primary inputs to BEIS3 are: Spatially and temporally resolved meteorological data including temperatures, solar radiation and surface pressures. GIT/UNC/CIRA-SEMAP 004.v3 3-6 April 29, 2011 SEMAP Modeling Protocol Species-specific leaf area indices (LAI) Spatially resolved, species-specific vegetation categories Species-specific biogenic emissions factors for NO and VOCs Chemical speciation profiles to chemically allocate biogenic VOC emissions to species recognized by the chemistry mechanism in the desired air quality model. Overall, BEIS3 uses spatially and temporally resolved meteorology data to estimate hourly emissions factors of NO and VOCs using species-specific LAI for each land use type. The user can assign the units for the output emissions as gram-moles per hour or gram- moles per second. The output is in I/O API format and can be input into both CMAQ and CAMx modeling systems. To support CAMx model, SMOKE needs to use an utility program (CMAQ2CAMx) that converts the BEIS3 I/O API output files into a binary format recognized by the CAMx. Availability: BEIS3 is freely available for download as part of the SMOKE distribution from http://www.smoke-model.org 3.1.6 MEGAN The model of Emissions of Gases and Aerosols from Nature (MEGAN) has been developed to estimate biogenic emissions of reactive gases and aerosols needed for both regional air quality models and global chemistry transport models (Guenther et al., 2006). MEGAN uses an approach similar to previous terrestrial biogenic emission models, like BEIS3 and the Global Biogenic Emissions Investigation System (GLOBEIS) (Guenther et al., 1999) but is easier to update, use, and expand to other compounds. MEGAN is designed to be continually updated as understanding of chemical emissions from plants expands, and further, to be useable by both research scientists and regulatory air quality modelers. One of the biggest changes in MEGAN as compared to the BEIS family of models is in its treatment of plant species area coverage. In the BEIS family of models, plant species are mostly treated explicitly (e.g., California live oak, corn row crop, loblolly pine) where in MEGAN plants are grouped by the following six plant functional types (PFTs): 1) broadleaf trees, 2) fine leaf evergreen trees, 3) fine leaf deciduous trees, 4) shrubs, 5) grass, and non-vascular plants and other ground cover and 6) crops. The MEGAN biogenic emissions estimates are based on data prepared by Guenther et al. (2006): Emission factors available for six PFTs; Soil wilting point data; PFT area coverages; Leaf area index (LAI) for vegetated surfaces. These data sets are currently available for much of the planet in ARC/Grid format and are gridded on a scale of 30 seconds by 30 seconds (approximately 1,000 meters by 1,000 meters). GIT/UNC/CIRA-SEMAP 004.v3 3-7 April 29, 2011 SEMAP Modeling Protocol The MEGAN chemical speciation processing step for VOCs is similar to BEIS3. MEGAN estimates emission factors from biogenic activities for fifty- seven chemicals (e.g., isoprene, ethanol, α-cedrene) using a speciation profile file that supports the speciation of the chemicals from biogenic activities into the species recognized by the CB-IV, CB05 and SAPRC chemical mechanisms. We will use a GIS to convert version 2.1 of the MEGAN emission factors and plant functional types and version 2.0 of the MEGAN leaf area index data to the SEMAP grids and combine these with the SEMAP meteorology data to estimate biogenic emissions for the summer and winter sensitivity periods. Availability: MEGAN can be downloaded at http://bai.acd.ucar.edu/Megan/index.shtml 3.1.7 MOVES In December 2009, a new version of MOVES was released: the Motor Vehicle Emission Simulator 2010 model (MOVES2010) (OTAQ, 2009). MOVES is the next generation of mobile source emissions model developed by U.S. EPA. In the modeling process, the user specifies vehicle types, time periods, geographical areas, pollutants, vehicle operating characteristics, and road types to be modeled. The model then performs a series of calculations, which have been carefully developed to accurately reflect vehicle operating processes (such as cold start or extended idle) and provide estimates of bulk emissions or emission rates. An important feature of MOVES2010 is that it allows users to choose between (1) the Inventory calculation type, which provides emission rates in terms of total quantity of emissions for a given time, and (2) the Emission Rate calculation type, which gives emission rates in terms of grams/mile or grams/vehicle/hour. For large-scale emissions modeling such as that needed for regional and national-scale air quality modeling projects, it is desirable to use the Emission Rate calculation type, which populates emission rate lookup tables that can then be applied to many times and places, thus reducing the total number of MOVES runs required. The approach for running MOVES for SMOKE relies on the concept of reference counties. These are counties that are used during the creation and use of emission factors to represent a set of counties called a “county group.” All counties in a county group share the same fuel parameters, fleet age distribution, and inspection/maintenance (I/M) programs. The use of modeling emission factors for one reference county to represent an entire county group avoids unnecessary duplicative MOVES runs. Each reference county is modeled at a range of speeds and temperatures to produce emission rate lookup tables (grams/mile or grams/vehicle/hour, depending on emission process). This approach allows any county with unique distributions of vehicle miles traveled (VMT), vehicle population, roadway speed, and grid cell temperatures to be modeled in SMOKE without having to rerun MOVES. In addition to the emission factor tables for the reference counties, SMOKE requires other input data in order to use the MOVES-based emission factor tables; these additional data include county- and SCC-specific VMT and vehicle population, average speed by roadway type, and gridded temperature. Finally, some inputs used to create the MOVES emission factor tables, such as the mapping of counties to their reference counties, are also used as input to SMOKE. GIT/UNC/CIRA-SEMAP 004.v3 3-8 April 29, 2011 SEMAP Modeling Protocol The outputs (emissions estimates) from MOVES can be used as inputs to larger emissions modeling systems that model many different types of emissions and provide results that are used in performing air quality modeling. In SMOKE version 2.7b, the new feature has been developed to facilitate the process of using MOVES to create emissions estimates appropriate for air quality modeling. Additional technical details of how to process SMOKE-MOVES integration approach will be available from CMAS website (CMAS, 2010). Availability: MOVES-SMOKE tool is currently not available for distribution, but will be released with SMOKE at http://www.smoke-model.org. 3.1.8 CMAQ Modeling System The Community Multiscale Air Quality (CMAQ) modeling system (Byun and Ching, 1999; Byun and Schere, 2006) has been under development for over a decade by the U.S. EPA and other members of the community, and has been the primary model used in several regulatory applications both by the U.S. EPA and local/state/regional agencies for air quality management. CMAQ includes treatment of ozone, particulate matter, visibility, acid deposition and several air toxics to study air pollution in a ‘one atmosphere’ mode at multiscales from local to regional to intercontinental. The treatment of aerosols in CMAQ and the initial evaluation is described by Binkowski et al (2003) and Mebust et al (2003). CMAQ has been evaluated extensively for several applications both within the U.S. and outside the U.S. (Hanna and Benjey, 2006). CMAQ consists of a core Chemistry Transport Model (CCTM), and a suite of preprocessors that prepare various other inputs for the model. These include the a) Meteorology-Chemistry Interface Preprocessor (MCIP), b) Initial Conditions Preprocessor (ICON), c) Boundary Conditions Preprocessor, and d) Photolysis Rates Preprocessor (JPROC). Emissions are prepared using SMOKE discussed above. The very first release of CMAQ came out in 1998. Starting in 2001 with the release of CMAQ v4.1, EPA has released several versions of CMAQ via the CMAS Center at UNC, with incremental updates and bug fixes along the way. For the SEMAP project, we will use the latest version of CMAQ (v4.7.1) expected to be released by the CMAS Center during June 2010. The changes and new features of this release are given below. Foley et al (2010) describe the results from incremental testing of the various enhancements made in CMAQ v4.7. 3.1.8.1 Features of CMAQ V4.7.1 Aqueous chemistry o Mass conservation improvements o Imposed one second minimum timestep for remainder of the cloud lifetime after 100 "iterations" in the solver o Force mass balance for the last timestep in the cloud by limiting oxidized amount to mass available o Implemented steady state assumption for OH o Only allow sulfur oxidation to control the aqueous chemistry solver timestep (previously, reactions of OH, GLY, MGLY, and Hg for multipollutant model also controlled the timestep) Advection o Added additional divergence-based constraint on advection timestep GIT/UNC/CIRA-SEMAP 004.v3 3-9 April 29, 2011 SEMAP Modeling Protocol o Vertical advection now represented with PPM to limit numerical diffusion Model time step determination o Fixed a potential advection time step error reported by Talat Odman The sum of the advection steps for a given layer time step might not equal the output time step duration in some extreme cases Ensured that the advection steps sum up to the synchronization ste Horizontal diffusion o Fixed a potential error reported by Talat Odman o Concentration data may not be correctly initialized if multiple sub-cycle time steps are required o Fix to initialize concentrations with values calculated in the previous sub-time step Emissions o Bug fix in EMIS_DEFN.F to include point source layer 1 NH3 emissions o Bug fix to calculate soil NO "pulse" emissions in BEIS o Remove excessive logging of cases where ambient air temperature exceeds 315.0 Kelvin. When this occurs, the values are just slightly over 315 Multipollutant model updates o Bug fix to facilitate in-line emissions option o Mercury Dicarboxylic acid reduction mechanism applied to aqueous phase oxidized mercury species Cl oxidation pathway based on Donohoue et al 2005 J. Phys. Chem. A 109(34) o Lowered CB05 mechanism unit yields for acrolein from 1,3-Butadiene tracer reactions to be consistent with laboratory measurements Possible parallel processing problem o Processor 0 creates the NetCDF, I/O-API file header when opening new output files o On some parallel platforms, message-passing latency causes other processors to not have imminent access to the file header, resulting in a "hung" execution or a crash. o Fixed by deferring the other processor's access until later in the computation Photolysis o JPROC/phot_table and phot_sat options Expanded lookup tables to facilitate applications across the globe and vertical extent to 20km Updated temperature adjustments for absorption cross sections and quantum yields Revised algorithm that processes TOMS datasets for OMI data format o In-line option Asymmetry factor calculation updated using values from Mie theory integrated over log normal particle distribution; added special treatment for large particles in asymmetry factor algorithm to avoid numerical instabilities Instrumented Models GIT/UNC/CIRA-SEMAP 004.v3 3-10 April 29, 2011 SEMAP Modeling Protocol o DDM-3D o Sulfur Tracking o Primary Carbon Apportionment 3.1.8.2 Known Issues in CMAQ version 4.7.1 When using the inline option for biogenic emissions, only one hour output timesteps are supported DDM-3D is currently only operational with the "eddy" vertical diffusion module (DDM for the acm2 or acm2_inline modules may be available in the next release) Availability: CMAQ is freely available for download from http://www.cmaq-model.org 3.1.9 CAMx The Comprehensive Air quality Model with extensions (CAMx) is also an Eulerian photochemical dispersion model that includes an integrated “one-atmosphere” assessment of gaseous and particulate air pollution (ozone, PM2.5, PM10, air toxics, mercury) over many scales ranging from sub-urban to continental (Morris et al, 2003a). The CAMx model and the associated pre- and post-processors are also in the public domain, and developed and supported by Environ, Inc. In addition to the PM treatment available in this release that is used by most regulatory applications, CAMx also has an alternate sectional treatment of PM that is used for several research applications (Morris et al, 2003b). The CAMx model will be used to supplement the CMAQ modeling and help provide a weight-of-evidence model attainment demonstration. The CAMx extensions include: a) Ozone Source Apportionment Technology (OSAT) b) Particulate Source Apportionment Technology (PSAT) (Yarwood et al, 2004) c) Decoupled Direct Method (DDM) and Higher Order Decoupled Direct Method (HDDM) for ozone (Dunker et al, 2002) d) Process Analyses (PA) and e) Reactive Tracer (RTRAC) Source Apportionment (Morris et al, 2003c) 3.1.9.1 Overview of Version 5.20 V5.20 provides two updates and several modifications and bug fixes from the previous release (v5.10). The new capabilities in CAMx include the linkage of PM chemistry to the SAPRC99 gas-phase mechanism, and the addition of mineral nitrate. While there are important chemistry changes for v5.20, there are no changes to other input file formats from versions 4.3 through 5.1. There are no changes to core model output file formats from versions 4.4 through 5.1, and no changes to Probing Tool outputs from version 5.1. Since v5.0, CAMx is capable of distributed multi-processing using MPI. To use MPI one must have the MPICH or MPICH2 utility installed on the system. Guidance for running MPI is provided in the README file located in the source code directory, and in the CAMx User's Guide. GIT/UNC/CIRA-SEMAP 004.v3 3-11 April 29, 2011 SEMAP Modeling Protocol 3.1.9.2 Features of Version 5.20 PM chemistry has been linked to the SAPRC99 gas-phase mechanism (mechanism 5). CAMx now determines the uptake of nitrate by calcium in fine mineral (crustal) dust particles. Calcium in soil dust particles reacts with nitric acid to form calcium nitrate. Since the current ISORROPIA implementation in CAMx does not consider mineral cations other than sodium, nitrate uptake by calcium in soil dust is calculated external to ISORROPIA. CAMx outputs total particulate nitrate, i.e., the sum of particle nitrate determined by ISORROPIA and calcium nitrate. Numerous modifications were made to improve the vertical transport solution. o The vertical velocity diagnostic algorithm and associated vertical advection solver were altered to improve the vertical transport solution. The vertical solver changed from an upstream-donor scheme to a centered scheme in space (continues to be backwards-Euler implicit in time). The top boundary condition was changed to a "zero-gradient" approach, alleviating the need for a top boundary condition input file. o The manner in which the "super-stepping" driving time step is calculated has been modified to improve coupling between the horizontal and vertical advection algorithms. o The order of X and Y advection is now constant. o Nested grid boundary winds are no longer set from their parents. All CB and SAPRC gas-phase chemistry mechanisms were improved by removing the "NxOy" species and adding explicit species for NO3 and N2O5. Oxidation of SOA precursors was added to PiG chemistry. The wet deposition routine was improved to more correctly track accumulated gas-phase mass within falling precipitation through a grid column. The gas scavenging rate into precipitation was simplified. Fixed several bugs in the OMP section of the PiG chemistry module. Fixed a bug in the OMP section of the core chemistry driver that affected the chemical apportionment of ozone production in OSAT. Improved the accumulation of mass summary fluxes within several OMP sections of the model. Fixed bugs in the application of OMP for RTCMC. Changed some data structures that caused large applications of source apportionment (OSAT/PSAT) to crash with segmentation faults. Implemented a major update to source apportionment timing tracers. Fixed bug in reading boundary conditions for purposes of calculating the OH reactivity of OSAT boundary tracers. Fixed error in routine that updates boundary condition DDM sensitivities while advancing the timestep. Converted large data structures used in calculating kOH reactivity for OSAT/PSAT from static to dynamically allocated. Fixed bug in reading the boundary conditions for RTRAC. Improved memory management of temporary arrays that are used for message passing in MPI. Availability: CAMx is freely available for download from http://www.camx.com GIT/UNC/CIRA-SEMAP 004.v3 3-12 April 29, 2011 SEMAP Modeling Protocol 3.1.10 MATS The Modeled Attainment Test Software (MATS) is a tool made available by the U.S. EPA to perform the modeled attainment tests for ozone and fine particulate matter as discussed in the U.S. EPA guidance for demonstrating attainment of the NAAQS for ozone and PM2.5. (U.S. EPA, 2007). While the attainment test for ozone is relatively straight forward, the test for PM2.5 is slightly more complicated. MATS includes the Speciated Modeled Attainment Test (SMAT) for PM2.5 by taking speciated components of PM2.5 from CMAQ for each of the base and future years and computes species specific relative reduction factors and uses them in conjunction with ambient data from FRM, STN and IMPROVE monitors to compute future year design values. MATS thus applies modeling data in a relative sense rather than an absolute sense to investigate air quality changes between two scenarios. The various calculations with MATS using the Sulfate, Adjusted Nitrate, Derived Water, Inferred Carbonaceous material balance approach (SANDWICH) are described in Frank (2006). SMAT has been used in a number of EPA policy relevant studies, such as regulatory impact analyses performed to support the Clear Skies, the former Clean Air Interstate Rule (CAIR), and Low Sulfur Diesel Rule, and for modeling performed by the various Regional Planning Organizations including the Visibility Improvement State and Tribal Association of the Southeast (VISTAS). Furthermore, the EPA requires states to apply SMAT in their SIP in conjunction with air quality modeling to demonstrate attainment of the National Ambient Air Quality Standards (NAAQS) for criteria air pollutants, including PM2.5, as part of the Clean Air Act (U.S. EPA, 2007). SMAT results are available as point estimates and spatial estimates. Point estimates are calculated at CMAQ grid cells containing Federal Reference Method (FRM) monitoring sites while spatial estimates are calculated at each grid cell in the CMAQ domain. MATS is needed because of its previous use in policy relevant work and is considered best practice by the EPA, its ability to produce speciated PM2.5 fields, and its combination of ambient data and modeling results. The publicly available version V2.2.1 can perform attainment tests for ozone, annual PM2.5, and visibility. As part of a group of beta testers, UNC has access to a beta version of MATS (V2.2.3) that can perform attainment test for daily average PM2.5 as well, but is being worked on for a few bug fixes. The installer package from EPA includes all ambient datasets that are necessary for performing the attainment tests. MATS is a PC-based software, and takes CMAQ model outputs that are post-processed in an ASCII format suitable for MATS. Availability: MATS is freely available for download from: http://www.epa.gov/ttn/scram/modelingapps_mats.htm 3.2 Modeling Domains This section describes the horizontal and vertical modeling domains that will be simulated for the SEMAP project. 3.2.1 Horizontal Domain The SEMAP modeling domain is on a Lambert Conformal Conic projection centered on 40°N and 97°W. A one-way nested 36-km grid resolution parent domain covering the continental United States will provide boundary conditions to a 12-km grid resolution domain covering the GIT/UNC/CIRA-SEMAP 004.v3 3-13 April 29, 2011 SEMAP Modeling Protocol Southeastern U.S. The projection and grid definition information for the two domains is given in Table 3-1, and Figure 3-2 shows the nested SEMAP modeling domains. Table 3-1. Projection parameters for the CMAQ and CAMx modeling domains Parameter Map Projection P-alpha P-beta P-gamma X-cent Y-cent X-orig Y-orig dx dy Columns Rows Layers 36-km domain 12-km domain Lambert Conformal Conic 33°N 45°N 97°W 97°W 40°N -2,736,000 m 108,000 m -2,088,000 m -1,620,000 m 36,000 m 12,000 m 36,000 m 12,000 m 148 168 112 177 24 Figure 3-2. SEMAP 36-km (grey) and 12-km (white) modeling domains GIT/UNC/CIRA-SEMAP 004.v3 3-14 April 29, 2011 SEMAP Modeling Protocol 3.2.2 Vertical Domain Table 3-2 is the vertical layer structures of the WRF and CMAQ/CAMx modeling domains. The initial configuration collapses 34 layers from the WRF results into 22 layers for the air quality modeling. Table 3-2. SEMAP WRF and CMAQ Vertical Layer Configurations WRF CMAQ Height Pressure Depth Level (m) (mb) (m) Sigma Height (m) Pressure (mb) Depth (m) 23 0 18663 50 5264 22 0.1056 13399 150 3775 21 0.2378 9624 276 2870 20 0.3895 6754 420 2150 19 0.5406 4604 564 1100 18 0.6323 3504 651 880 17 0.7133 2624 728 700 16 0.7828 1924 794 552 15 0.841 1372 849 228 14 0.8659 1144 873 200 13 0.8882 944 894 174 12 0.9079 770 913 150 11 0.9252 620 929 128 10 0.9401 492 943 108 Level Sigma 35 0 18663 50 2034 34 0.0332 16629 82 1715 33 0.0682 14914 115 1515 32 0.1056 13399 150 1375 31 0.1465 12024 189 1255 30 0.1907 10769 231 1145 29 0.2378 9624 276 1045 28 0.2871 8579 323 955 27 0.3379 7624 371 870 26 0.3895 6754 420 790 25 0.4409 5964 469 715 24 0.4915 5249 517 645 23 0.5406 4604 564 580 22 0.5876 4024 608 520 21 0.6323 3504 651 465 20 0.6742 3039 690 415 19 0.7133 2624 728 370 18 0.7494 2254 762 330 17 0.7828 1924 794 293 16 0.8133 1631 823 259 15 0.841 1372 849 228 14 0.8659 1144 873 200 13 0.8882 944 894 174 12 0.9079 770 913 150 11 0.9252 620 929 128 10 0.9401 492 943 108 GIT/UNC/CIRA-SEMAP 004.v3 3-15 April 29, 2011 SEMAP Modeling Protocol 9 0.9528 384 955 90 8 0.9635 294 965 74 7 0.9723 220 974 60 6 0.9796 160 981 48 5 0.9854 112 986 38 4 0.99 74 991 30 3 0.994 44 994 24 2 0.9974 20 998 20 0 1000 1 3.3 1 9 0.9528 384 955 90 8 0.9635 294 965 74 7 0.9723 220 974 60 6 0.9796 160 981 48 5 0.9854 112 986 38 4 0.99 74 991 30 3 0.994 44 994 24 2 0.9974 20 998 20 1 1 0 1000 Episode Selection This section describes the modeling periods that will be simulated for the SEMAP project. The base year modeling and analysis period for the SEMAP project is annual 2007. Calendar year 2007 was selected by the SEMAP states after careful consideration of (1) regional air quality measurements and trends, (2) meteorological conditions, (3) emission trends, and (4) available emission inventories. For additional details on the selection of CY2007, please refer to “Selection of CY2007 as Base Year for SEMAP Modeling Project” (GA DNR, 2011). As has been general practice in the air quality modeling community, in the interest of being able to complete the numerous simulations on time, we propose to model the year in 4 quarters as follows with a 2-week spin-up period for each quarter Quarter 1: 12/16/2006 to 3/31/2007 Quarter 2: 3/16/2007 to 6/30/2007 Quarter 3: 6/16/2007 to 9/30/2007 Quarter 4: 9/16/2007 to 12/31/2007 For each of the 4 quarters, we will exclude the first 2 weeks to avoid the effects of initial conditions, and include only the full 3 months for all analysis purposes. Although future year inventories will be simulated for the SEMAP project, the modeling periods are determined by the meteorology data used for the simulations, and these periods will not change across the SEMAP simulations. 3.4 Input Databases This section describes the input data that will be collected and processed to support the SEMAP project air quality modeling tasks. 3.4.1 Initial and Boundary Conditions This section describes how we will create the necessary initial and boundary conditions (IC/BCs) for the CMAQ 36-km domain from GEOS-Chem (Bey et al, 2001). Recently, GA EPD has GIT/UNC/CIRA-SEMAP 004.v3 3-16 April 29, 2011 SEMAP Modeling Protocol performed GEOS-Chem simulations for the year 2007, and once the model vertical resolution has been finalized, has agreed to extract CMAQ-ready IC/BCs for our use. UNC will review the GEOS-Chem based 36-km initial and boundary condition profiles generated for SEMAP and quality assure them. We will first run the I/OAPI utility m3stat on each day’s file to obtain summary statistics for each day, and plot them to visually verify the ranges of IC and BCs obtained from GEOS-Chem. In addition to the above, we will perform the following QA of the boundary conditions: Create timeseries plots (by hour and by day and by month for the entire year) of all chemical species for each of the model layers and for each of the 4 edges of the domain to verify the possible inflow conditions coming into the domain. Create a profile based ICON file and compare with the GEOS-Chem based ICON file to assess the effects of using initial conditions from a global model such as GEOS-Chem. Review satellite data for the year 2007, to specifically see if there are any dust storms that impacted the Southeastern U.S., and see if GEOS-Chem captured those events, and further verify if they are reflected in the CMAQ-ready IC/BCs. If we find any issues with the GEOS-Chem based IC/BCs, we will alert SESARM to these, and suggest alternate approaches to address them. 3.4.2 Emission Inventories and Ancillary Data 3.4.2.1 General Approach The principal objective of the SEMAP emissions modeling tasks is to generate emissions inputs for simulating air quality in the SESARM region for multiple modeling scenarios. This objective will be fulfilled through the development of emissions modeling platforms (EMPs) that represent North American air pollutant emissions for the SEMAP modeling periods. An EMP is a collection of data and modeling tools needed to simulate emissions in support of air quality modeling studies. In addition to being useful for organizing emissions modeling tasks, EMPs provide a clean method for packaging and distributing all of the data and scripts needed to reproduce an emissions simulation. The following eight EMPs will be developed to produce annual emissions input files for CMAQ and CAMx: 1. 2007 Actual 2. 2007 Typical 3. 2017 OTB 4. 2017 OTW 5. 2020 OTB 6. 2020 OTW Each of these EMPs will include the following items: Emissions inventory data with coverage for the entire SEMAP 36-km modeling domain Land use data gridded to the SEMAP 36-km and 12-km modeling domains and formatted for input to both BEIS3.14 and MEGAN GIT/UNC/CIRA-SEMAP 004.v3 3-17 April 29, 2011 SEMAP Modeling Protocol Temporal, speciation, and spatial allocation data General SMOKE input data, such as SCC description and FIPS codes identification files SMOKE scripts and binary executables The data needed to build the EMPs for the SEMAP project will include a combination of new inventories and ancillary data for the SESARM region and existing datasets that are available from different federal, state, and private organizations. Since many of the emissions datasets outside of the SESARM region will not be available for the four modeling years identified for the SEMAP project (2007, 2017, 2020) and/or the development of these data are not on the same schedule as the SEMAP project, different approaches will be used to obtain the best available emissions data at the time the SEMAP emissions modeling tasks begin. Figure 3-3 shows a map of the regional planning organizations (RPOs) and will be used in the following sections to identify different regional sources of emissions data that will be collected for the SEMAP project modeling. Figure 3-3. Regional Planning Organization map 3.4.3 Emissions Processing Sectors We will refine the inventory categories from the standard four operational categories (area, mobile, point, biogenic) to more specific source sectors. Refinements to the inventory sectors provide flexibility in the design of emissions sensitivities, including source apportionment studies, and to the approaches for performing quality assurance on the emissions data. As the stationary-area-source emissions sector has traditionally been a catch-all for many types of sources, this is the inventory sector that will be refined the most. GIT/UNC/CIRA-SEMAP 004.v3 3-18 April 29, 2011 SEMAP Modeling Protocol The six sectors listed in the SEMAP RFP (EGU point, non-EGU point, area, non-road mobile, on-road mobile, and biogenic) provide the bare-minimum inventory categorization needed for optimizing the SMOKE configuration and probing model performance in CMAQ. We will develop a more detailed set of inventory categories to provide flexibility to the diagnostic modeling and the emissions sensitivity modeling tasks. Given the impact that wildfires had on Southeastern air quality in 2007 (http://alg.umbc.edu/usaq/archives/2007_05.html), there should at least be one explicit fire inventory category that is processed separately from the rest of the area and non-EGU point emissions for the initial CMAQ simulation. Additional inventory sectors can be determined through discussion with the SEMAP Project Coordinator and TAWG following analysis of the results from the initial Actual 2007 annual simulation. Table 3-3 lists the emissions processing sectors that will use for the initial Actual 2007 SEMAP simulation and includes the operational category applicable to the sector and the temporal processing approach (described in Section 5) applied to each sector. Table 3-3. Emissions processing sectors for the SEMAP project ID Processing Sector Operational Category ar arfire nusar nr nusnr US Non-point area sources US Fire area sources Canada/Mexico area sources US Off-road mobile sources Canada/Mexico off-road mobile sources On-road mobile sources Canada/Mexico on-road mobile sources Electricity generating unit point sources Non-electricity generating unit point sources Fire point sources Canada/Mexico point sources Commercial marine vessels Biogenic sources Area Area Area Area Area Temporal Approach MWSS Daily MWSS MWSS MWSS Mobile Area Hourly MWSS Point Hourly Point Daily Point Point Gridded area Biogenic Hourly MWSS Single day Hourly mb nusmb egupt negupt ptfire nuspt cmv bg 3.4.4 SEMAP Project Emissions Modeling Platforms This section describes the EMPs that will be developed for the different modeling scenarios under the SEMAP project. In general, only the inventory data will differ between the EMPs; the ancillary data, including the spatial, temporal, and chemical allocation data, and the modeling software will be constant. Upon receipt of all inventory data they will be formatted for input to SMOKE and summary reports will be generated for use during the QA/QC of the emissions processing. The SMOKE-ready inventories will be installed in directory locations on the UNC computers that are consistent with the simulations that will use the data. The SMOKE scripts, including the Assigns and run scripts, will be configured to process the inventories. The ancillary GIT/UNC/CIRA-SEMAP 004.v3 3-19 April 29, 2011 SEMAP Modeling Protocol emissions data will be installed in the appropriate locations in the SMOKE directory structure on the UNC computers and SMOKE will be configured to process these data. Because the available inventories from RPOs for the non-SEMAP regions of the modeling domain do not exactly match the SEMAP project modeling years, we will explore a few of options for populating the base and future year EMPs for the other regions of the modeling domain: Option 1 – For regions with emissions inventories that fall on both sides of a SEMAP modeling year, interpolate the existing inventories to the actual SEMAP year. For example, LADCO is distributing 2008, 2012, and 2018 inventories for the MWRPO states. We could develop a projection trajectory from these three years to develop 2017, and 2020 inventories for the MWRPO states. Option 2 – Use existing inventory data that match the SEMAP modeling periods as closely as possible. For example, the LADCO 2008 inventory could be used for 2007 and their 2018 inventory for 2017 and 2020. Option 3 – Keep the inventory data constant, using the most recent, best quality data. For inventories, such as the non-mobile WRAP data, which are only available for 2002 and 2018, applying a high quality alternative like the final version of the NEI 2005 to this part of the modeling domain may be the best option. We will contact the data providers for the non-SEMAP inventories to determine the best available data for each region. The approaches that will be used to compile and prepare the general emissions data are presented below followed by the approaches for preparing data specific to the different SEMAP modeling scenarios. 3.4.4.1 General Emissions Data The general emissions data in the SEMAP EMPs include data that will not change between simulations. All of the non-inventory data, including the temporal, spatial, and chemical allocation data, will be held constant across all of the SEMAP simulations. The land use data used to compute biogenic emissions with BEIS and MEGAN will also remain constant across the SEMAP simulations. The U.S. EPA 2005 emissions modeling platform version 4 (http://www.epa.gov/ttn/chief/emch/index.html#2005) (2005EMPv4) will be used as the starting point for the ancillary emissions data that will be used for the SEMAP project. In particular, the following data will be used directly from the 2005EMPv5: Chemical speciation profiles for the Carbon Bond 05 photochemical mechanism and the CMAQ and CAMx aerosol mechanisms Temporal profiles by state, county, and source classification code Cross referencing files for associating speciation, temporal, and spatial allocation data to inventory sources Stack defaults for elevated point sources Federal Implementation Standards (FIPS) codes (i.e., country/state/county codes) definitions SCC definitions Pollutant definitions GIT/UNC/CIRA-SEMAP 004.v3 3-20 April 29, 2011 SEMAP Modeling Protocol Temporally allocating annual, daily, or hourly emissions inventories in SMOKE involves combining a temporal cross-reference file and a temporal profiles file. Temporal cross-reference files associate monthly, weekly, and diurnal temporal profile codes with specific inventory sources, through a combination of a FIPS (country/state/county) code, an SCC, and sometimes for point sources, facility and unit identification codes. Temporal profiles files contain coded monthly, weekly, and diurnal profiles in terms of a percentage of emissions allocated to each temporal unit (e.g., percentage of emissions per month, weekday, or hour). As a starting point for the temporal allocation data for the SEMAP simulations, we will use files from the 2005EMPv5. Based on guidance from the developers of the SEMAP inventory files and analysis of these inventories by UNC, we may update the temporal profiles and assignments for some source categories. Chemical speciation follows a similar approach of using profiles and cross-reference data to split inventory pollutants to the emissions species required by CMAQ and CAMx. These data will also be taken directly from the 2005EMPv5 and updates will be made as needed to accommodate new sources or guidance provided with the SEMAP project inventories. SMOKE uses spatial surrogates and SCC cross-reference files to allocate county-level emissions inventories to model grid cells. Geographic information system (GIS)-calculated fractional land use values define the percentage of a grid cell that is covered by standard sets of land use categories. For example, spatial surrogates can define a grid cell as being 50% urban, 10% forest, and 40% agricultural. In addition to land use categories, demographic or industrial units, such as population or commercial area, can also define spatial surrogates. Similar to the temporal and chemical allocation data, an accompanying spatial cross-reference file associates the spatial surrogates (indexed with a numeric code) to SCCs in the inventory. Spatial allocation with surrogates is applicable only to area and mobile sources. Point sources are located in the model grid cells by SMOKE based on the latitude-longitude coordinates of each source. Biogenic emissions are estimated based on gridded land use information that is mapped to the model grid using geographic information system (GIS). Spatial surrogates for allocating area and mobile source inventories to the SEMAP project modeling grids will be developed using the best available North American GIS Shapefile data. The U.S. EPA Shapefile database version 2004 (http://www.epa.gov/ttn/chief/emch/spatial/newsurrogate.html) will be used as the starting point for the SEMAP project surrogates. None of the census-based surrogates, such as housing, residential heating, and population, will be updated since 2010 census data will not be available during the time frame of this project. To represent the significant changes in population density patterns in the SESARM region between 2000 and 2007, we have explored the availability of updated population data for 2007. The projected population data available from the U.S. Census are based on an extrapolation of Census data collected in 1990 and 2000. The projected data are only accurate if the population change trajectory observed from 1990 to 2000 continued into the 2000’s. As extrapolated data, the projected population data don’t consider the impacts of actual GIT/UNC/CIRA-SEMAP 004.v3 3-21 April 29, 2011 SEMAP Modeling Protocol economic trends on population changes. Despite these issues with the projected data, since the 2010 U.S. Census data will not be available in time for this project, we will use the extrapolated 2007 population data as it presents the best alternative to actual 2007 census tract population information. Updates will be made to the surrogates calculated from land use/land cover and rail/roadway data using geospatial data that have been updated since the release of the EPA 2004 Shapefiles. National Land Cover Database (NLCD) information from the year 2006 are available to characterize the location of forest, agricultural, urban, water, and other land use/land cover data that are relevant to emissions. The NLCD is a very high-resolution (30-m) land cover database for North America that will result in a significant improvement over the land use/land coverbased surrogates in the EPA 2004 Shapefile database. All of the land area, water area, and land use classification surrogates will be updated to 2006 levels for the SEMAP project. The Census Bureau TIGER data for 2007 will be used to create surrogates for road, rail, and waterways for the SEMAP project. Biogenic emissions will be calculated using the best available land use and emissions factors data for BEIS and MEGAN. The BELD3 land use tiles and BEIS3.14 emissions factors will be used to compute biogenic emissions for the SEMAP project modeling. The BELD3 tiles will be projected and regridded to the SEMAP modeling domains using the Spatial Allocator. For the biogenic emissions sensitivity simulation, MEGAN gridded input files (plant functional types 2.1, leaf area index 2.0, emissions factors 2.1) will be projected to the SEMAP modeling domains using a GIS. Meteorology data processed through MCIP as part of this project will be used in the calculation of all biogenic emissions. The biogenic emissions will be held constant through all of the SEMAP project simulations. 3.4.4.2 2007 Actual Table 3-4 shows the inventory configuration that will be used for the 2007 Actual SEMAP project SMOKE simulations. These inventories represent the best available information for this simulation year that we anticipate being available at the beginning of this simulation. This table will be update if new inventory data become available prior to beginning the simulation. Table 3-4. 2007 Actual inventory data Region SESARM MANE-VU Inventory Year 2007 2007 CENRAP 2005 MWRPO WRAP 2007 2002 typical Fires 2007 GIT/UNC/CIRA-SEMAP 004.v3 Description Actual 2007 inventory provided by SESARM Initial 2007 modeling will use the screening 2007 inventory developed for OTC modeling by NY DEC; final 2007 modeling will use the 2007 MARAMA inventory if available The NEI2005v4 will be used in the absence of an updated regional inventory Base C provided by LADCO WRAP planning 2002 inventory version C as recommended by Tom Moore 2007 SESARM inventory and 2007 Blue Sky inventory for the rest of the CONUS domain 3-22 April 29, 2011 SEMAP Modeling Protocol Mexico Canada Offshore 1999 2010 2001,2002 Mexico NEI 1999v3 available from WRAP Canada NEI 2010 available from U.S. EPA 2001 offshore point inventory from the Gulf of Mexico; 2002 gridded 36-km commercial shipping lanes for the Pacific, Gulf, and Atlantic 3.4.4.3 2007 Typical Table 3-5 shows the inventory configuration that will be used for the 2007 Typical SEMAP project SMOKE simulations. These inventories represent the best available information for this simulation year that we anticipate being available at the beginning of this simulation. This table will be update if new inventory data become available prior to beginning the simulation. Table 3-5. 2007 Typical inventory data Region SESARM Inventory Year 2007 MANE-VU CENRAP 2007 2005 MWRPO WRAP 2007 2002 typical Fires 2007 Mexico Canada Offshore 1999 2010 2001,2002 Description Typical 2007 inventory provided by SESARM The NEI2005v4 will be used in the absence of an updated regional inventory Base C provided by LADCO WRAP planning 2002 inventory version C as recommended by Tom Moore 2007 SESARM inventory and 2007 Blue Sky inventory for the rest of the CONUS domain Mexico NEI 1999v3 available from WRAP Canada NEI 2010 available from U.S. EPA 2001 offshore point inventory from the Gulf of Mexico; 2002 gridded 36-km commercial shipping lanes for the Pacific, Gulf, and Atlantic 3.4.4.4 2017 OTB Table 3-6 shows the inventory configuration that will be used for the 2017 OTB SEMAP project SMOKE simulations. These inventories represent the best available information for this simulation year that we anticipate being available at the beginning of this simulation. This table will be update if new inventory data become available prior to beginning the simulation Table 3-6. 2017 OTB inventory data Region SESARM MANE-VU CENRAP GIT/UNC/CIRA-SEMAP 004.v3 Inventory Year 2017 2017 2015 Description 2017 OTB inventory provided by SESARM The NEI2005v4 projected to 2015 will be used in the absence of an updated regional inventory 3-23 April 29, 2011 SEMAP Modeling Protocol MWRPO 2017 WRAP 2018 Fires 2007 Mexico Canada Offshore 2018 2020 2001,2002 2007->2017 growth factors provided by LADCO WRAP preliminary reasonable progress PRP2018cmv inventory as recommended by Tom Moore 2007 SESARM inventory and 2007 Blue Sky inventory for the rest of the CONUS domain Mexico NEI 1999v3 available from WRAP Canada NEI 2010 available from U.S. EPA 2001 offshore point inventory from the Gulf of Mexico; 2002 gridded 36-km commercial shipping lanes for the Pacific, Gulf, and Atlantic 3.4.4.5 2017 OTW Table 3-7 shows the inventory configuration that will be used for the 2017 OTW SEMAP project SMOKE simulations. These inventories represent the best available information for this simulation year that we anticipate being available at the beginning of this simulation. This table will be update if new inventory data become available prior to beginning the simulation. Table 3-7. 2017 OTW inventory data Region SESARM MANE-VU CENRAP Inventory Year 2017 2017 2015 MWRPO 2017 WRAP 2018 Fires 2007 Mexico Canada Offshore 2018 2020 2001,2002 Description 2017 OTW inventory provided by SESARM The NEI2005v4 projected to 2015 will be used in the absence of an updated regional inventory 2007->2017 growth factors provided by LADCO WRAP preliminary reasonable progress PRP2018cmv inventory as recommended by Tom Moore 2007 SESARM inventory and 2007 Blue Sky inventory for the rest of the CONUS domain Mexico NEI 1999v3 available from WRAP Canada NEI 2010 available from U.S. EPA 2001 offshore point inventory from the Gulf of Mexico; 2002 gridded 36-km commercial shipping lanes for the Pacific, Gulf, and Atlantic 3.4.4.6 2020 OTB Table 3-8 shows the inventory configuration that will be used for the 2020 OTB SEMAP project SMOKE simulations. These inventories represent the best available information for this GIT/UNC/CIRA-SEMAP 004.v3 3-24 April 29, 2011 SEMAP Modeling Protocol simulation year that we anticipate being available at the beginning of this simulation. This table will be update if new inventory data become available prior to beginning the simulation. Table 3-8. 2020 OTB inventory data Region SESARM MANE-VU CENRAP Inventory Year 2020 2020 2020 MWRPO 2020 WRAP 2018 Fires 2007 Mexico Canada Offshore 2018 2020 2001,2002 Description 2020 OTB inventory provided by SESARM The NEI2005v4 projected to 2020 will be used in the absence of an updated regional inventory 2007->2020 growth factors provided by LADCO WRAP preliminary reasonable progress PRP2018cmv inventory as recommended by Tom Moore 2007 SESARM inventory and 2007 Blue Sky inventory for the rest of the CONUS domain Mexico NEI 1999v3 available from WRAP Canada NEI 2010 available from U.S. EPA 2001 offshore point inventory from the Gulf of Mexico; 2002 gridded 36-km commercial shipping lanes for the Pacific, Gulf, and Atlantic 3.4.4.7 2020 OTW Table 3-9 shows the inventory configuration that will be used for the 2020 OTW SEMAP project SMOKE simulations. These inventories represent the best available information for this simulation year that we anticipate being available at the beginning of this simulation. This table will be update if new inventory data become available prior to beginning the simulation. Table 3-9. 2020 OTW inventory data Region SESARM MANE-VU CENRAP Inventory Year 2020 2020 2020 MWRPO 2020 WRAP 2018 Fires 2007 Mexico 2018 GIT/UNC/CIRA-SEMAP 004.v3 Description 2020 OTW inventory provided by SESARM The NEI2005v4 projected to 2020 will be used in the absence of an updated regional inventory 2007->2020 growth factors provided by LADCO WRAP preliminary reasonable progress PRP2018cmv inventory as recommended by Tom Moore 2007 SESARM inventory and 2007 Blue Sky inventory for the rest of the CONUS domain Mexico NEI 1999v3 available from WRAP 3-25 April 29, 2011 SEMAP Modeling Protocol Canada Offshore 2020 2001,2002 Canada NEI 2010 available from U.S. EPA 2001 offshore point inventory from the Gulf of Mexico; 2002 gridded 36-km commercial shipping lanes for the Pacific, Gulf, and Atlantic 3.4.5 Landuse Data Different landuse data sets will be collected for the SEMAP project. The landuse data required for estimating biogenic emissions are described in Section 4.1. Landuse data needed for the air quality modeling will be provided by SESARM as part of the WRF meteorology data. The WRF model uses landuse data from NCAR and outputs them in the WRF output files. These landuse data will be processed by MCIP and WRFCAMx and used by CMAQ and CAMx for calculating dry deposition parameters. 3.4.6 Meteorology Data All meteorology data required for the emissions processing and air quality modeling will be provided by SESARM. All the necessary meteorological variables will be processed through MCIP and WRFCAMx along with the two programs (AHOMAP and TUV). The CMAQ and CAMx ready meteorology data will be use by SMOKE, CMAQ, and CAMx. 3.5 Quality Assurance and Error Correction Protocol 3.5.1 Quality assurance and Quality control Quality assurance in the context of emissions and air quality modeling refers to ensuring that the results of the simulations are of the highest quality possible with respect to the data. Quality control, on the other hand, refers to ensuring that the model simulations completed successfully and without errors. QA focuses on the quality of the data; QC focuses on the process of modeling, archiving, and documenting the data. Some analysis products, like emissions spatial plots and simulation web sites, are used for both QA and QC. Other products, such as documents of model settings are used for QC only. This section describes both the QA and QC products that we will use during the emissions and air quality modeling for the SEMAP project to ensure high quality model results and that the modeling software performed as we expected. Specific examples showing how to apply QA/QC concepts in the application of the modeling software provide a framework for guiding the QA process during the production of model-ready emissions and air quality model results. The QA/QC framework begins with the installation of the modeling software and concludes with compiling QA summaries and notes into a final report. The result of following the framework will be highly scrutinized data that were generated by well-documented QA procedures. All quality assurance and quality control procedures applied during the SEMAP project will adhere to the SEMAP quality assurance project plan (QAPP). GIT/UNC/CIRA-SEMAP 004.v3 3-26 April 29, 2011 SEMAP Modeling Protocol 3.5.2 Meteorology Data and Modeling Quality assurance and quality control for meteorological data processing are needed to ensure the meteorological inputs into air quality models are correct and consistent with what the meteorological model generated. This especially important when vertical collapsing is performed. We will compare the results from MCIP and WRFCAMx on full 34 WRF layers and the collapsed vertical layers, especially near the model top where potential problems arise (Emery et al. 2009). We will make certain all the necessary variables CMAQ needs are correctly output from the WRF model. We will generate runtime log files when we run MCIP and WRFCAMx programs to QA the variables in printouts and routinely ensure no error messages were generated by the programs. If errors were generated, we will diagnose the problems and address them. 3.5.3 Emissions Quality Assurance Adelman (2004) presented the first systematic QA/QC protocols for emissions modeling. The concepts presented there will provide the basis for the QA/QC procedures that we will use for the emissions processing for this project. 3.5.3.1 QA Classifications This protocol’s quality assurance plan for emissions modeling focuses on the following four classifications of QA, to ensure that datasets are both of high quality and reproducible: (1) accuracy assurance and problem identification (termed “modeling QA”); (2) software and data tracking (“system QA”); (3) independent data review (“gatekeeping and outside review”); and (4) documentation. Described in detail and presented as actual QA procedures in Adelman (2004), these four areas cover all aspects of emissions modeling. Splitting the QA into different classifications is not an attempt to prioritize the QA effort; instead, it is a way of presenting the information that allows those performing QA to focus on each area individually as it arises in the modeling process. 3.5.3.1.1 Modeling QA Modeling QA involves performing data quality checks, assuring simulation accuracy, and recognizing and identifying problems as they happen; it is the process of looking for glaring faults in the model input and output data (I/O) and determining whether the input data are producing the desired results. Scrutiny of the I/O using standard statistical analyses can reveal problems in the data and/or the model setup. Using a standard approach for analyzing emissions model I/O establishes reference points to use when scrutinizing the data. Seeking these indicators of correct model performance allows QA personnel to determine the accuracy of the simulations and whether faults in the data or model configuration exist. Several indicators of model performance are presented in Adelman (2004). 3.5.3.1.2 System QA System QA addresses model installation and configuration issues, data accounting, and ensures that the modeling systems are producing results that are expected and can be reproduced in the future. Properly benchmarking a model and assuring that the installation is complete is the first step in the system QA process. Confirmation of configuration settings, compile options, and GIT/UNC/CIRA-SEMAP 004.v3 3-27 April 29, 2011 SEMAP Modeling Protocol other system-related parameters must occur and be documented prior to producing any model results. Archiving the model installation at set “freeze points” in the project is also required in order to keep accurate records of the modeling installation and run scripts. A key feature of system QA is facilitating the reproduction of model results in the future and the ability to revert back to a previous configuration or installation after the model has been revised or updated. In addition to thorough documentation, version control software is required for archiving model executables, run and configuration scripts, and important data files. Combining documentation of the modeling procedures with archives of the model installation and data files will ensure that the model installation and configuration are correct and that past simulations are sufficiently documented and archived to allow their reproduction. 3.5.3.1.3 Gatekeeping and Outside Review Gatekeeping and outside review is the process of ensuring that the data entering and leaving the SMOKE modeling process meet a predetermined quality level. A gatekeeper screens model data before they pass from one major step of the modeling process to the next. In emissions QA, gatekeeping is applied to the emissions inventory and SMOKE input files on the front end of the modeling process, and to the SMOKE output files at the back end. A gatekeeper (or gatekeeping team) is responsible for performing a series of “sanity checks” on both the input and output data streams. Outside reviewers are emissions experts who are not part of the RMC modeling team. They periodically review the entire process—the emissions data, the modeling, and the QA steps taken—using their judgment as experts to decide what to review and how to review it. 3.5.3.1.4 Documentation Documentation, a component that is common to all of the other three QA classifications, provides the record of the QA process. Establishing a detailed set of requirements for documenting every step in the QA process will ensure not only that the documentation is created as expected but that the processes recorded by the documentation are completed correctly. In addition to records or lists of completed QA steps, documentation refers to summaries and interpretations of emissions inventory reports and analyses. Covering the entire realm of the modeling process, QA documentation will include records of model configuration, details about data files, simulation records, and final report generation. 3.5.3.2 Implementing the QA Framework With the QA classifications conceptually defined in the previous section, this section focuses on the specifics of how to implement them. Beginning with details about the modeling team requirements of the QA framework and the sources of the QA information, this section then presents specific step-by-step instructions on performing emissions QA. This section concludes with discussion about how and where to document the QA process. 3.5.3.2.1 The Emissions QA Team The level of effort required to implement this QA framework requires a team consisting of a data gatekeeper(s), a production modeler(s), a QA manager, outside reviewers, and a project manager. Each member of the team contributes different perspectives on the data and modeling. GIT/UNC/CIRA-SEMAP 004.v3 3-28 April 29, 2011 SEMAP Modeling Protocol The gatekeeper (or gatekeeping team) is responsible for reviewing the SMOKE I/O data streams for correctness. Before new datasets are used in SMOKE modeling, or new SMOKE output emissions datasets are used in air quality modeling, they must be reviewed by the gatekeeper. For all SMOKE input and output files, the gatekeeper will ensure that the data are complete, are formatted correctly, and pass sanity checks. Checking completeness entails examining the files for the necessary data elements, spatial coverage, and temporal extent. Data formats will be confirmed using the SMOKE manual to check ASCII files and using PAVE and/or VERDI to check binary netCDF files, such as the meteorology inputs. Sanity checks look for glaring errors in the file contents and ensure that the data make sense in the context of how they will be used and relative to similar or reference datasets. Issues that arise during the gate keeping process will be reported to the project manager for resolution. UNC staff will serve as the gatekeepers for the SEMAP emissions data. The production modeler is responsible for receiving input data, maintaining the model run scripts, running the model per the work plan, producing default QA reports, delivering model output data, and assisting with compiling QA reports into summaries for documentation purposes. At least two people are needed in this role in all emissions modeling projects because of its broad scope; a single person cannot complete all of the production modeler functions. A lead modeler will oversee the entire modeling process, perform the majority of the SMOKE modeling, and receive and archive input and output data per the procedures. Secondary modelers will organize the SMOKE QA reports into emissions summaries for data QA and reporting, and will generate custom QA summaries and reports for troubleshooting any problems encountered during the modeling process. The QA manager enforces adherence to this QA protocol. Consistent with the protocol, the QA manager performs and documents all of the checks required for determining model accuracy and revealing errors. The QA manager also ensures that the software is configured correctly and that the data and model are being archived consistently and correctly. Overseeing the production modelers, the QA manager will verify that SMOKE is being applied correctly and that all QA summaries are consistent with the relevant input data. Leading the documentation efforts, the QA manager ensures that all the necessary QA summaries are generated and certified and that they are compiled into a final project report. Zac Adelman from UNC will serve as the QA manager for the SEMAP emissions modeling. Outside reviewers will be required to both systematically confirm and periodically audit the work of the emissions modeling team for consistency with the modeling data and adherence to the QA protocol. An outside perspective of the emissions modeling process will be used to subject the emissions modeling data and process to screening procedures for correctness. SESARM technical staff will serve as outside reviewers of the SEMAP emissions data. The final member of the modeling team is the project manager. The project manager is ultimately responsible for the quality of the final products of the modeling process. Providing technical assistance to the production modelers and clarifying any uncertainties about model input, configuration, and operation for the QA manager, the project manager will be able to address any questions in the modeling process. Working with the QA manager, the project manager approves the results of all QA procedures, and thoroughly scrutinizes the QA summaries for problems to determine whether the data are ready to be sent to the gatekeepers GIT/UNC/CIRA-SEMAP 004.v3 3-29 April 29, 2011 SEMAP Modeling Protocol and outside reviewers for final review before delivery to the air quality modelers. The project manager is responsible for keeping the project on schedule and within budget. For each of the QA classifications presented in Section 5.1.2.1 (modeling QA, system QA, gatekeeping and outside review, and documentation), the following sections provide procedural information and describe the role of each member of the modeling team. These sections are preceded by an outline of the types of information and the components of the SMOKE modeling environment that are involved in the QA procedures. 3.5.3.2.2 Information Required for Performing Quality Assurance Various sources of QA information are available from SMOKE. The nature of the information, how it is generated and where to obtain it, and what it will be used for are explained in Adelman (2004). The following data will be used in the SMOKE QA process. Log files SMOKE QA reports Run Scripts Assigns files Inventory summaries QA Documents Emission QA plots 3.5.3.2.3 Modeling QA Modeling QA involves procedures for determining whether SMOKE is working correctly and producing expected results. These procedures must be completed with every new emissions inventory modeled with SMOKE, with new base-year simulations, new sensitivity simulations, reruns to correct errors in past simulations, and changes to the base SMOKE configuration. This section applies these components to a detailed set of procedures for checking SMOKE modeling. There are several indicators of model performance that must be reviewed to confirm that SMOKE is configured and applied correctly. In general these indicators will be used to confirm the following items: Confirming SMOKE configuration and input files Confirming that SMOKE executed without errors or unexpected warnings Checking the SMOKE ancillary inputs Emission modeling QA checks Confirming that SMOKE processed the emissions inventory correctly Compare the SMOKE QA reports and model-ready emissions for consistency The QA steps detailed in this section coincide with those in the SMOKE emissions QA checklist in Appendix B (Table B-1), and are to be performed by the production modelers. After the production modeler completes the entire checklist, it will be passed to the QA manager for verification. The QA manager will re-check all of the individual items in the checklist. GIT/UNC/CIRA-SEMAP 004.v3 3-30 April 29, 2011 SEMAP Modeling Protocol When all of the steps have been completed for a new inventory or sensitivity and the checklist is filled out in its entirety, the production modeler and QA manager must sign off on the checklist and deliver it to the project manager. As an important last step in the SMOKE QA procedure, the project manager validates the checklist using the accompanying documentation to determine whether the emission modeling results are ready to pass to the gatekeepers and outside reviewers. He/she then certifies the model-ready emissions as final, interim, or not ready for air quality modeling. The emissions may be deemed interim if resource availability (time, computing, funding) or scheduling requirements outweigh the severity of the emissions issues. The project manager will document the caveats in the interim emissions dataset and ensure that they are either resolved in a subsequent emissions modeling simulation and/or presented as caveats in the emissions data to air quality modelers. If the project manager deems that the emissions are not ready for air quality modeling, he/she will also document the problem(s) that lead to this determination and a solution will be worked out by the modeling team. After the problem is corrected, the emissions category that contained the problem will be subjected to the appropriate QA procedures and the model-ready emissions regenerated. Complete details of the model performance indicators are provided in Adelman (2004). 3.5.3.2.4 System QA System QA refers to the accounting process for modeling software, tools, and data, beginning with installation or import onto the computing system of choice and completing with archival using an online version control system. Version control in the context of emissions modeling is the concept of enumerating every instance of a model configuration or data file and storing the configuration information and data with associated version numbers. All changes made to the model configuration or installation and updates made to the data are documented and tracked using the project website. The QA checklist in Appendix B (Table B-1) contains entries for system QA. Each of the major steps detailed in this section correspond to entries in the QA checklist. This protocol requires that all of the steps outlined in this section be completed and verified by the QA manager in the checklist in Table B-1. 3.5.3.2.5 Gatekeeping and Outside Review The third major classification of QA is gatekeeping and outside review. As significant time, effort, and funding are involved in emissions modeling, air quality modeling, and the subsequent analysis of the results, it is very important to ensure that the data are correct before they enter the modeling and analysis process. To supplement the above QA procedures performed by the production modelers and the QA manager, perspectives from people outside of the modeling process are needed and must be part of the QA protocol. These external participants are the gatekeeper and the outside reviewers. We will make QA products available through the Web to facilitate review by the gatekeepers. The gatekeeper’s function is to review the SMOKE input and output data streams before they are used in modeling. A gatekeeper will be designated by the project manager of the emissions modeling team. The gatekeeper should be a person who GIT/UNC/CIRA-SEMAP 004.v3 3-31 April 29, 2011 SEMAP Modeling Protocol understands the data requirements and the operational aspects of SMOKE, has worked with the same or similar datasets as those entering and leaving the SMOKE modeling process, and a person involved in the modeling project and who knows the modeling domain and the nature of the inventory data. The gatekeeper will perform a high-level review of SMOKE inventory files for formatting errors and for data “sanity” (defined below). He or she will also review the CMAQ-ready SMOKE output files for correctness and completeness. While the gatekeeper focuses on the I/O data streams, outside reviewers are called upon to periodically review the entire process: the emissions data, the modeling, and the QA steps taken. 3.5.3.2.6 Documentation Documentation is the fourth key component of the emissions modeling QA protocol; it encompasses all aspects of the QA process. From recording the results, progress, and key findings of the accuracy assurance and problem identification tasks to embodying the central issue of the software and data tracking tasks, documentation is important for ensuring the accuracy of the model results and providing tangible records of how they were obtained and how they can be reproduced. Three levels of documentation are required in this protocol: 1. The first level consists of worksheets for recording SMOKE configuration settings and details about the input and output data. These are working documents that change as the project evolves and new data become available. 2. The second level of documentation involves forms for recording the QA process; these are also working documents. Recording this information ensures that the primary QA goals are fulfilled. This second level of documentation records both the completion of procedures and the creation of documentation as defined by this protocol. 3. The last level of documentation is the generation of a final report. All of the information recorded in the previous two levels of documentation must be compiled into a final report at the end of a modeling project. These documentation levels are described in detail in Adelman (2004). 3.5.4 Air Quality Data and Modeling We will adopt rigorous QA procedures to verify the quality and accuracy of all model inputs and outputs that will be developed for the project. This will be mainly accomplished using statistical and graphical approaches. 3.5.4.1 AQM Input Files IC/BC files: Once the IC/BC files are created, we will create graphical outputs – tile plots for initial conditions and time-series plots of boundary conditions for every model day for all 4 edges of the domain. We will also compute min and max for each hour for each edge of the domain and verify if these lie within reasonable bounds for the entire modeling period. A sample GIT/UNC/CIRA-SEMAP 004.v3 3-32 April 29, 2011 SEMAP Modeling Protocol time-series plot of boundary conditions is shown in Figure xx. We will archive all run logs from the processing of the data and screen them for any error messages. 3.5.4.2 AQM Output Files We will use a rigorous naming convention (see Section 4.1) for distinguishing between all model scenarios and between the CAMx and CMAQ simulations. We will create individual directories for each model scenario, and retain unique scripts for each scenario, each month, each year, and never reuse any script for multiple scenarios. We will retain all runtime logs from the models and routinely parse them for any error/warning messages. We will flag runs/scenarios/days that have error messages and take appropriate action to address them. Once the model simulations are complete, we will use automated post-processing tools that UNC has used extensively in numerous projects to perform time aggregation (daily maxima, daily average, etc.), create tile plots, timeseries plots and standard min/max statistics for all species to QA them for reasonableness. We will create automated ways to parse these min/max values for each and every simulation and report anomalies to the AQM data manager for possible corrective action. All plots will be posted to the interactive project website for SEMAP and the QA manager will be informed for review. The model executable, run scripts and log directories will be periodically synchronized to the archival system at GIT and UNC (mass storage system) in near real-time. Thus, in case of disk failure, etc., we can easily reproduce these runs as necessary. The following general procedures will be used to QA all air quality modeling simulations completed for this project. Verification that correct configuration and science options are used in compiling and running each component of the modeling system, where these components include the meteorology pre-preprocessor, initial/boundary conditions processors, photolysis rate calculators, chemistry-transport model (CTM), and any other preprocessor used to prepare data for the CTM. Verification that correct input data sets are used when running each model. Evaluation of model results to verify that the output is reasonable and consistent with general expectations. Processing of ambient monitoring data for use in the model performance evaluation. Evaluation of the air quality model results against observations. Backup and archiving of critical model input and output data. GIT/UNC/CIRA-SEMAP 004.v3 3-33 April 29, 2011 SEMAP Modeling Protocol Section 4: Technical Approach 4.1 Simulation Identification We will use a naming scheme for the CMAQ and CAMx simulations to identify the versions of the input data, inventory year, and inventory version used in the simulation. The model used to generate the results with version number (i.e., CMAQ4.7.1) will be included in the output file names along with the julian date of the simulation. A three letter input data versioning prefix in the simulation identification (ID) will be used to track the version of the meteorology (digit 1), emissions (digit 2), and air quality data (digit 3). A two-digit number will then denote the inventory year used in the simulation. The last digit(s) will identify the version of the inventory, i.e. actual (A), typical (T), on the books (OTB), on the way (OTW). For example simulation ID adb07T can be interpreted as: Meteorology data/configuration version a Emissions data/configuration version d Air quality data/configuration version b 2007 typical (07T) inventory The justification for using different version IDs for the input data is based on our experience with similar modeling projects that required the ability to track numerous iterations of the input data used in the air quality modeling simulations. When one of the input streams to the air quality model changes, either through changes to the data or model configuration, the ID for that data will be incremented. The version IDs and simulation IDs will be documented and tracked on the SEMAP project website. 4.2 General Modeling Procedures This section describes the general procedures that we will use to process data during the SEMAP project. The modeling procedures described here will be used throughout all of the modeling applications in the SEMAP project. Alternative procedures that diverge from those documented in this section are described in the applicable task descriptions below. 4.2.1 Meteorology Data Preparation 4.2.1.1 MCIP We will acquire the namelist and registry files and WRF outputs for the year 2007 from the SESARM meteorological modeling contractor. We will select two ten day periods from both summer and winter months to test the MCIP make certain the WRF model was run with all the physics options and outputted necessary variables needed for CMAQ simulations. While we run MCIP we will window out the CMAQ domain mentioned in Section 2 from the WRF domain which has been chosen by SESARM and the meteorological modeling team. We will also acquire the WRF modeling protocol and evaluation/analysis documentations and then carefully review the evaluations already done by the WRF modeling team and perform extra evaluation/analysis if needed. GIT/UNC/CIRA-SEMAP 004.v3 4-1 April 29, 2011 SEMAP Modeling Protocol We will have conference calls and emails with the metrological modeling team regarding the CMAQ vertical layers. According to EPA scientists, layer collapsing in the first 1km near the surface is not recommended. After the vertical layer structure is determined we will run MCIP for the two testing periods using vertical collapsing. Since the vertical collapsing has the potential to cause problems at the model top which can be propagate to lower model layers (Emery et al. 2009), we will compare the metrological variables from the original WRF outputs and the MCIP outputs to assure the quality of the met data going into SMOKE and CMAQ. After we are satisfied with the MCIP outputs for the test cases, we will perform MCIP processing with the year 2007 WRF output data. We will report our findings to SESARM and external sources. 4.2.1.2 WRFCAMx We will test the WRFCAMx program with the same testing cases mentioned in 4.1. We will also examine the outputs after vertical collapsing and compare them to the original WRF outputs. We will also run the two programs (AHOMAP and TUV) to prepare photolysis rate inputs for CAMx model. After we make certain the results from WRFCAMx, AHOMAP, and TUV are satisfactory, we will process the whole year of 2007 WRF output data through these programs. 4.2.2 Emissions Data Preparation 4.2.2.1 SMOKE UNC will conduct all emissions modeling for the SEMAP project using the SMOKE version 2.7beta (May 2010), which supports the processing of MOVES-based lookup tables. Before beginning operational SMOKE modeling to develop annual, multi-scale emissions for CMAQ and CAMx, UNC will build an emissions modeling infrastructure on UNC computers to support the SEMAP project. The sequential process of developing this infrastructure encompasses the installation of SMOKE on the UNC computers and creating the 2007 SEMAP emissions modeling platform. SMOKE installation requires that after the binary executables are copied to the system, that the program be benchmarked on the UNC computers before it is used for operational modeling. Benchmarking is the process of confirming that a model is installed on a new computing system correctly by comparing the results from a test simulation run on the new system against results from the same simulation that are distributed with the model. UNC will use the preliminary 2006 version A (Pre06a) RoMANS emissions modeling platform as the starting point for the development of all of the SEMAP emissions modeling (Adelman, 2007). The scripts and SMOKE configuration used for RoMANS Pre06a will be transferred directly to the UNC computing systems for the 2007, 2017 and 2020 SEMAP modeling. After SMOKE is benchmarked on the UNC computers, we will build an emissions modeling platform for developing emissions for the SEMAP project on UNC computing system. GIT/UNC/CIRA-SEMAP 004.v3 4-2 April 29, 2011 SEMAP Modeling Protocol 4.2.2.2 MEGAN We will use a GIS to convert version 2.1 of the MEGAN emission factors and plant functional types and version 2.0 of the MEGAN leaf area index data to the SEMAP grids and combine these with the SEMAP meteorology data to estimate biogenic emissions for the summer and winter sensitivity periods. 4.2.2.3 Blue Sky Framework The Blue Sky Framework (http://blueskyframework.org/) is a modeling system designed to provide estimates of wildfires and other large fire events using high-resolution satellite-based data and ground-based observations. We will use the Blue Sky Framework in conjunction with the Fire Emission Production Simulator (FEPS) to generate 2007 annual fire inventories for input to the SMOKE processing system. These fires data will be used to simulate fire emissions for the regions of the modeling domain outside of the area covered by the SEMAP fire inventories provided by SESARM. The Blue Sky Framework generates hourly fire and plume rise parameters that SMOKE can use to produce hourly, gridded, 3-D emissions for input to CMAQ and CAMx. 4.2.2.4 Spatial Allocator The Spatial Allocator is an open-source GIS that we will use to prepare emissions and air quality input data. The Surrogate Tool component of the Spatial Allocator will be used to convert GIS Shapefiles to spatial surrogates for the SEMAP modeling domains. These spatial surrogates will be used to allocate the county-level inventory data to the 36-km and 12-km modeling grid cells. We will also use the Spatial Allocator to project the BELD tiles to the SEMAP modeling grids for input to BEIS3. Finally, we will use the Spatial Allocator to generate an ocean mask input file to CMAQ which is used to calculate surf zone and open ocean sea salt emissions. We will explore the options for calculating offline sea salt emissions to use in CAMx. 4.2.3 Air Quality Modeling 4.2.3.1 CMAQ CMAQ version 4.7.1 will be used for conducting the SEMAP air quality modeling. As described above, meteorology and emissions inputs to CMAQ are prepared using MCIP and SMOKE, respectively. Initial and boundary conditions for the CMAQ 36-km domain are extracted from GEOS-Chem outputs. Here we will use CMAQ’s preprocessors ICON and BCON, respectively, to prepare initial and boundary conditions for the CMAQ 12-km domain. We will use JPROC to prepare the photolysis rates tables. Finally, the air quality modeling will be conducted using the CMAQ’s core Chemistry-Transport Model CCTM. CMAQ version 4.7.1 system will be installed on Georgia Tech’s computer system and will be benchmarked against release data. We choose ifort as the Fortran compiler for the system. 4.2.3.1.1 Initial and Boundary Conditions We will use ICON and BCON of CMAQ version 4.7.1 to prepare initial and boundary conditions for the CMAQ 12-km domain. The input datasets to ICON and BCON will be the output fields of instantaneous concentrations from the corresponding CMAQ modeling of the 36-km domain. GIT/UNC/CIRA-SEMAP 004.v3 4-3 April 29, 2011 SEMAP Modeling Protocol Initial conditions will be prepared for the beginning hour of each simulation period and boundary conditions will be prepared for every hour during each simulation period. Run scripts, log files and all outputs will be archived and the generated initial and boundary conditions will be visualized using graphics tools for quality assurance. 4.2.3.1.2 Photolysis Rates Photolysis rates tables will be prepared by using JPROC of CMAQ version 4.7.1. The same chemical mechanism to be used in CCTM (i.e. CB05 with Chlorine) will be used here to prepare photolysis rates tables for every day of each simulation period. These photolysis rates tables will be first reviewed and then used for both the 36- and 12-km CCTM modeling. Photolysis rates tables will be prepared only once for the entire SEMAP project. Run scripts, log files and photolysis rates tables will be archived for quality assurance. 4.2.3.1.3 Chemistry-Transport Model We will use CCTM of CMAQ version 4.7.1 to conduct the air quality modeling. All the CMAQ ready inputs will be prepared as described previously and will be reviewed for quality control purposes. CCTM build-it script and run-together scripts will be developed for the SEMAP project. The CCTM configuration for the initial base case simulation will be determined by recommendation using all available information in the literature, consultation with EPA, and our own experiences and by approval of the SEMAP Project Coordinator and TAWG. The final CCTM configuration will be determined after the diagnostic model sensitivities tests and by the SEMAP approval. The source code of each set of CCTM configuration including those for the diagnostic tests will be kept for quality assurance. Run scripts, log files and most importantly the CCTM output files will also be archived. Quality review will be made during each CCTM simulation and after the simulation is finished. Log file will be first reviewed for checking if appropriate input files are used in the simulation and then checked for any suspicious total mass reporting. Simulated concentration fields from output files will be statistically analyzed using m3stat tool after the simulation finished and will be processed for necessary temporal averaging and species merging. The processed concentration fields will be visualized using PAVE and other graphic tools as a standard procedure for further quality assurance. 4.2.3.2 CAMx 4.2.3.2.1 Initial and Boundary Conditions We will use the initial and boundary condition files generated for CMAQ (from GEOS-Chem) and convert them using the CMAQ2CAMx utility to CAMx-ready files for the 36-km domain. We will then use the CAMx-based 36-km concentration files to create the ICON/BCON for the 12-km CAMx modeling applications. Run scripts, log files and all outputs will be archived and the generated initial and boundary conditions will be visualized using graphics tools for quality assurance. 4.2.3.2.2 Other Inputs We will use the CAMx utility AHOMAP to generate additional inputs for CAMx, which include UV albedo, haze turbidity, and ozone column. While recent versions of CAMx can support the use of additional inputs such as snow cover, surface roughness, land-ocean mask, and drought GIT/UNC/CIRA-SEMAP 004.v3 4-4 April 29, 2011 SEMAP Modeling Protocol stress, ahomap does not support the creation of these since they are not essential. We will look into available data from the National Land Cover Database (NLCD) and WRF outputs to see if we can create these. 4.2.3.2.3 Chemical Mechanism Unlike CMAQ which uses a specific chemical mechanism module during compilation, CAMx needs a chemical parameter file as an input to the run script. From the various options, we will use the mechanism file CAMx5.2.chemparam.6_CF, included with the distribution. This mechanism includes the Carbon Bond 2005 gas-phase chemical mechanism with PM treatment. 4.2.3.2.4 Chemistry-Transport Model We will use CAMx V5.20 for conducting the SEMAP air quality modeling. As described above, meteorology and emissions inputs to CAMx are prepared using WRFCAMx and SMOKE, respectively. All other inputs such as IC, BC and photolysis rate files will be developed as discussed above. Finally, the air quality modeling will be conducted using CAMx V5.20. UNC has already installed and configured this model on the UNC computer cluster. We are in the process of performing comparative performance evaluation using OpenMP and MPI to obtain the best performance. We will benchmark the final configuration outputs against available test data provided by the developers before starting modeling for SEMAP. We will use the Intel Fortran Compiler ifort as the Fortran compiler for the system. 4.3 Initial Base Case Simulation The objective of this task is to develop an initial annual “Actual” 2007 base year air quality model simulation (herein referred to as Base2007) on the SEMAP 36-km and 12-km resolution modeling grids (Section 2). Many of the modeling approaches discussed in this section are applicable to the other modeling tasks in the project. The modeling approaches for tasks that differ from those presented in this section will be presented with the applicable task. We will divide the work on this task between Georgia Tech and UNC. UNC will conduct the modeling that prepare all inputs to CMAQ, including IC, BC, JPROC, MCIP and SMOKE modeling; Georgia Tech will conduct all CMAQ modeling. External drives will be used for data transfer between UNC and Georgia Tech. We will use SMOKE version 2.7beta with the SESARM “Actual” 2007 emissions inventory to prepare gridded emissions inputs, GEOS-CHEM to prepare BCs, and the latest versions of ICON, JPROC and MCIP to prepare the other inputs for CMAQ. Details of the input databases that will be compiled to prepare inputs to simulation Base2007 are provided in Section 4, including the compilation of inventories outside of the SESARM region. To select a specific configuration of CMAQ for the SEMAP project, we will start with the standard configuration of the CMAQ version 4.7.1 released by EPA through the CMAS Center at UNC. Features of this version were announced at the CMAS Conference in October 2009, which include updated SOA and cloud modules, support for biogenic sesquiterpene emissions, and a fully incorporated DDM module for both gas and aerosol phases. The initial Base2007 CMAQ simulations, which will be identified as CMAQ4.7.1.aaa07ACT will be performed using a configuration based on a literature review of the relevant air quality model sensitivity simulations including the latest evaluation articles on the most recent GIT/UNC/CIRA-SEMAP 004.v3 4-5 April 29, 2011 SEMAP Modeling Protocol developments in CMAQ (i.e. Foley et al (2009)) and through communication with the EPA Office of Research and Development (ORD) scientists who developed CMAQ. In consideration that CMAQ version 4.7.1, due to its recent release, will not have accompanying publications in the primary literature, we will use a combination of literature reviews of modeling studies that used previous CMAQ versions (i.e. Foley et al., 2009, Wilczak et al., 2009, Byun and Shere, 2006) and discussion with EPA ORD to determine the recommended optimal configuration for the SEMAP project. In general, the base model configuration specifications outlined in the SEMAP RFP will be used for the initial CMAQ simulation with the following modifications: As of version 4.7 the RADM cloud module is no longer an option in CMAQ. We will use the ACM cloud routine that includes extensions for the AERO5 heterogeneous chemistry mechanism. The module specified in the RFP, AERO4, has been replaced by AERO5 in CMAQ version 4.7. We will use the CMAQ AERO5 module for the SEMAP project. We will select a CMAQ configuration using all available information in the literature, consultation with EPA, and our own experiences to provide a recommendation to the SEMAP technical analysis working group (TAWG) along with the rationales used to recommend the configuration. After approval by the SEMAP Project Coordinator and TAWG, we will finalize the initial air quality model configuration and provide our insights on conducting the necessary diagnostic model configuration tests. After completing the CMAQ simulations we will wait for guidance from SESARM about how to proceed with the CAMx simulations. The CAMx simulation will use a model configuration and input data that are as close to the CMAQ simulation as possible. Table 12 shows the proposed configuration options for CMAQ and CAMx that will produce results that are normalized as much as possible. Table 4-1. CMAQ and CAMx configuration options Model Parameter Horizontal Advection CMAQ_v4.7.1 Yamartino (hyamo) Vertical Advection Yamartino (vyamo) Horizontal Diffusion Vertical Diffusion Multiscale Advanced Convective Method (ACM2) CB05 with Chlorine (cb05cl_ae5_aq) Euler Backward Iterative (ebi_cb05cl_ae5) CMAQ 5th Generation (aero5) ACM clouds with aero5 (cloud_acm_ae5) none Gas Chemistry Mechanism Gas Chemistry Solver Aerosol Mechanism Clouds/Aqueous Chemistry Plume in Grid CAMx_v5.20 Piecewise Parabolic Method (PPM) Piecewise Parabolic Method (PPM) Implicit Advanced Convective Method (ACM2) CB05 with Chlorine (Mechanism 6) Euler Backward Iterative (EBI) Two static modes (CF) Implicit none We will use the “Actual 2007” emissions modeling platform (EMP) with SMOKE version 2.7beta to prepare all of the emissions data for input to CMAQ. The inventory and ancillary emissions data used in the Actual 2007 EMP are detailed in Section 4. We will generate both the CMAQ and CAMx emissions directly with SMOKE. The CMAQ2CAMx utility will only be used to process certain inventory sectors and the GEOS-CHEM BCs for input to CAMx. Fire GIT/UNC/CIRA-SEMAP 004.v3 4-6 April 29, 2011 SEMAP Modeling Protocol inventories that are processed through SMOKE do not have the typical stack parameters needed to select the elevated versus low level sources needed to support CAMx. In the case of fire emissions, 3-d CMAQ-ready fire emissions files will be processed with the CMAQ2CAMx utility to split the 3-d CMAQ emissions into elevated and low level CAMx emissions files. The vertically split fire emissions will be merged with the rest of the CAMx-ready emissions files using SMOKE. For the rest of the emissions data, the conversion of the CMAQ emissions files to CAMx is an incremental change that happens during the SMOKE processing sequence. Minor speciation changes, output units, and file format conversions to produce CAMx-ready emissions will be handled by SMOKE for all but the fire inventories. We will also use the CMAQ2CAMx utility to convert the GEOS-CHEM BC data into the CAMx format. Because processing emissions for CAMx with SMOKE and using the CMAQ2CAMx utility is redundant, we are proposing only a small amount of work for the CMAQ2CAMx subtasks to support the conversion of the BCs and a few of the inventory sectors to CAMx format. The combination of SMOKE and CMAQ2CAMx will allow us to produce input data for both CMAQ and CAMx that are nearly identical. Because SMOKE-generated CAMx emissions and CMAQ2CAMx converted emissions are identical, we do not see the need for pursuing the full CMAQ2CAMxbased simulations requested in the RFP. Efficiencies in the emissions processing will be gained by minimizing the number of modelready emissions files required to represent the temporal variability of individual sources. For example, commercial marine emissions use flat temporal profiles across the months of the year and days of the week. Since this configuration results in uniform emissions for every day of the year, only a single day of processed emissions is required to represent this sector in an annual simulation. In addition to a single representative day for the entire year, there are several other temporal configurations for emissions, such as representative weeks (7 days/month), representative Monday-Weekday-Saturday-Sunday (MWSS) days (4 days/month), representative days per month (1 day/month), etc. The different temporal configurations used for simulated emissions for this project and referenced in Table 2 are described below. Single day – one representative day per year; applies to sources with flat monthly, weekly, and daily temporal profiles MWSS – Representative Monday, weekday, Saturday, Sunday per month; includes representing holidays as Sundays Daily – one emissions file per simulation day Hourly – hourly inventories or hourly-meteorology-based emissions We will conduct systematic and automated quality assurance/quality control (QA/QC) in the emissions modeling process. The SMOKE QA sequence focuses on identifying errors in the input data or use of the input data, such as the incorrect assignment of temporal profiles. The SMOKE QC sequence focuses on ensuring that the emissions modeling process is well documented, reproducible, and did not insert errors into the modeling results. Details of the QA/QC procedures that we will use for the SEMAP project emissions processing are presented in Section 8.2. We will use the automated analysis approaches described in Section 6 “Air Quality Model Evaluation Protocol” to produce all of the evaluation products needed to diagnose model biases. The techniques that we will use to identify biases in the initial CMAQ and CAMx simulations GIT/UNC/CIRA-SEMAP 004.v3 4-7 April 29, 2011 SEMAP Modeling Protocol are detailed in Section 6. We will target these biases in the design of the sensitivity simulations that will be performed under SEMAP project Task 6 (Model Configuration Diagnostic Sensitivity Modeling). All model inputs, including BCs, meteorology, and emissions and model configuration options will be analyzed for their contribution to both model biases and errors. If we identify potential biases that result from the meteorology inputs, we will provide our analysis results to the SEMAP Project Coordinator to initiate dialogue with the SESARM meteorology contractor about how the meteorology could be improved. Biases and errors that appear to be resulting from the emissions and BCs will be addressed in the design of the sensitivity simulations conducted under the diagnostic modeling task. The Base2007 simulations will produce emissions and air quality model output on the SEMAP 36-km and 12-km modeling grids for CMAQ and CAMx that will be used for model performance evaluation and in the design of sensitivity simulations for diagnosing air quality model performance. 4.4 Diagnostic Model Sensitivities The objective of this task is to determine the final model configuration that will be used through the rest of this project, including the 2007 “actual” and “typical” simulations, and future year simulations as well as emissions sensitivity modeling. The necessary diagnostic tests will be decided according to prediction deficits revealed by the model performance evaluation on the initial annual “actual” 2007 base year simulation. All the tests to be conducted here will use the initial “actual” 2007 base year simulation as the base case and will be conducted for selected month-long periods during winter and summer times. This task may involve model modification as well as re-processing of emissions and meteorological inputs depending on specific sensitivity tests. Both Georgia Tech and UNC have tremendous experience with tests that require model development, meteorological modeling, and emissions processing capabilities. Our team has proven experience in testing and applying CMAQ in various configurations, developing new modules, and finding and fixing bugs in existing modules of CMAQ that we will use to recommend the most effective and absolutely necessary diagnostic tests. Following discussion with the SEMAP Project Coordinator, we will execute the carefully selected sensitivity tests. After thorough investigations on all the tests results, we will recommend a final optimized model configuration and/or input data improvements for the SEMAP 2007 final simulation and the rest simulations. The determination of the most necessary diagnostic tests will be based on our interpretation of the model performance after evaluation of the initial annual simulation results. We will conduct the following model configuration tests, if necessary, as suggested in the RFP and as based on our past experiences using CMAQ to simulation in the Southeast. 4.4.1 Number of Vertical Layers (19 vs. 34) More vertical layers in CMAQ lead to better resolution of emissions plumes and pollutant transport and can result in better predictions at the surface. The sacrifice of using more vertical layers is increased computational costs, both in terms of slower model speeds and a larger data storage burden. Therefore, CMAQ applications are typically run with less than 20 layers, with higher resolution (more layers) inside the mixed layer (0-3 km). While this type of configuration was believed to be working well for capturing most of the emission impacts on the ground level GIT/UNC/CIRA-SEMAP 004.v3 4-8 April 29, 2011 SEMAP Modeling Protocol concentrations, a recent SERDP project conducted by Georgia Tech to enhance the capability of simulating prescribed forest fires, has found that increasing the number of vertical layers can significantly improve the prediction of the air quality impacts of fire plumes. A recent modeling study at Georgia Tech comparing CMAQ results using 34 and 80 layers showed significant improvement going from 13 to 34 layers for the February 28, 2007 Atlanta smoke incidence. The May 21-22, 2007 period during Georgia-Florida wildfires is an ideal case to test vertical resolution, as well as fire emissions and vertical mixing. The fire plume, which southeasterly winds carried to northern Alabama on May 21, entered Georgia under westerly winds on May 22 and hit Atlanta. Our first modeling attempt using our 12-km forecasting grid was not successful due to limited vertical resolution (13 layers only). Of note, an attempt by Yang et al. (2009) focusing on Tallahassee, FL did not track the long-range transport either, although it used 20 vertical layers. We are currently re-simulating this event with 34 layers as part of the JFSP project. We expect to see improvement in the long-range transport of the wildfire plume due to increased vertical resolution. Team members Georgia Tech and UNC are experienced in applying CMAQ with different number of vertical layers (e.g. 9, 13, 19, 21, 34 and 80 vertical layers in various past and ongoing projects). It should be noted that switching between different numbers of vertical layers is limited by the layering in the original WRF or MM5 simulations. Supported by the layer configuration of the SEMAP project WRF data, we will conduct a test with an increased number of vertical layers 34 in CMAQ by reprocessing the meteorology through MCIP, reprocessing the elevated emissions sectors through SMOKE, and requesting reprocessed GEOS-Chem IC/BC data from the SEMAP Project Coordinator using the new layer configuration. We will focus our typical analyses of vertical layer sensitivities on the impacts of fire and power plant plumes on surface layer model performance. 4.4.2 MEGAN vs BEIS biogenic emissions MEGAN and BEIS are two biogenic emissions models that predict fluxes of the VOC pollutants that are critical to O3 and SOA formation in the atmosphere. MEGAN and BEIS use different datasets to characterize land use and also differ in the algorithms that they use to compute emissions. The biogenic emissions predictions from the two models are quite different and can cause significant difference in ozone and SOA predictions in both spatial patterns and magnitudes ((Sakulyanontvittaya, Duhl et al. 2008; Sakulyanontvittaya, Guenther et al. 2008)). The latest versions of both MEGAN and BEIS estimate monoterpene and sesquiterpene emissions, which are important precursors to SOA formation in the Southeast. In this sensitivity we will prepare the 30-second resolution ESRI GRID emission factor and landuse data to run MEGAN for the 36-km and 12-km SEMAP modeling grids. We will use a GIS to convert version 2.1 of the MEGAN emission factors and plant functional types and version 2.0 of the MEGAN leaf area index data to the SEMAP grids and combine these with the SEMAP meteorology data to estimate biogenic emissions for the summer and winter sensitivity periods. After merging the MEGAN emissions in with the anthropogenic and pyrogenic emissions using SMOKE they will be used to simulate air quality with CMAQ for the two sensitivity periods. Evaluation of the BEIS and MEGAN biogenic emissions will be conducted by comparing the CMAQ non-methane hydrocarbon (NMHC) predictions of biogenic species to ground-based GIT/UNC/CIRA-SEMAP 004.v3 4-9 April 29, 2011 SEMAP Modeling Protocol ambient monitoring data from the Photochemical Assessment Monitoring Stations (PAMS). We will also use the predicted formaldehyde (HCHO) from CMAQ to calculate tropospheric HCHO columns for comparison to remote sensed HCHO data from the Ozone Monitoring Instrument (OMI) on the Aura satellite (Stavrakou, T., et al, 2009). Along with the standard model performance evaluation metrics for ozone and PM, we will use these additional metrics to determine which biogenic emissions model produces the most accurate estimates of biogenic VOC’s for the SEMAP project. Our analysis will also focus on differences in O3 and SOA predictions between the CMAQ simulations that used biogenic emissions estimated with BEIS and MEGAN. 4.4.3 Vertical Mixing/Mixing Heights Model parameters that describe vertical mixing, such as the vertical diffusivity coefficient and mixing height, are critical for accurately predicting surface pollutant concentrations in air quality models. Knowledge of the nocturnal planet boundary layer (PBL) and of the characteristic changes in PBL during the transition times at sunset and sunrise is limited. As a result, the vertical mixing parameters are poorly predicted in meteorology models during these periods. The inability of meteorology models to accurately simulate the nocturnal PBL is linked to poor nighttime air quality model performance ((Hogrefe, Rao et al. 2001; Hogrefe, Rao et al. 2001)). This weakens the accuracy of not only the 24-hr PM2.5 predictions but also the 8-hour ozone predictions. In this sensitivity we will diagnose potential vertical mixing and mixing height problems in the initial CMAQ run. For example, the May 21-22 period with its remarkable long-range transport of the Georgia-Florida wildfire plume will be scrutinized. We will analyze how diagnosed problems are impacting model performance. Depending on the nature of the identified problems, we will test a different model configuration, conduct further diagnostics, reiterate if necessary, and recommend the most appropriate configuration for the final CMAQ simulation. 4.4.4 Updated SOA formation It is well known that insufficient SOA formation mechanisms in CTMs contribute to underpredictions of organic carbon (OC) ((Eder, Kang et al. 2006; Tesche, Morris et al. 2006)). The modifications that EPA added to CMAQ version 4.7 to include SOA formation from sesquiterpene emissions and to consider aged SOA, among other modifications, had small impacts on SOA predictions ((Napelenok, Carlton et al. 2008; Bhave, Carlton et al. 2009)). Recent field and laboratory studies have provided new information on SOA formation that may be useful for enhancing CTM SOA mechanisms ((Jimenez, Canagaratna et al. 2009)). Georgia Tech recently developed a new SOA mechanism that includes all SOA formation pathways and also considers multi-generations of aged SOA ((Baek 2009) ). We have tested this new mechanism using 2007 forecast simulations and found improvements in 24-hr PM2.5 and OC predictions compared to the original forecasts that used default CMAQ SOA mechanism ((Hu, Baek et al. 2008)). We implemented this new SOA mechanism in our forecasting system for the Southeast starting in the 2009 O3 season. Figure 4-1 illustrates how the model performance in predicting 24-hr PM2.5 and OC concentration was significantly improved in the Atlanta, GA area in comparison to the forecasts for previous years ((Hu, Baek et al. 2009)). GIT/UNC/CIRA-SEMAP 004.v3 4-10 April 29, 2011 SEMAP Modeling Protocol Figure 4-1. Modeled (red solid line) and observed (blue dots) 24-hr average organic carbon concentrations (gm-3) at South DeKalb, GA from May 1st to July 15th, 2009. In this sensitivity we will conduct a full comparison of CMAQ SOA predictions using the SOA mechanism in CMAQv4.7.1 and SOA mechanism developed by Georgia Tech. We will use both the standard observational data as well as other observed information in the Southeast that we have access to, such as WSOC, molecular marker observations, and, if available, Aerosol Mass Spectrometer (AMS) and C14 data. 4.4.5 Updated cloud parameters Past applications of CMAQ in the Southeast, such as the VISTAS modeling, favored the RADM module over the ACM module. The recent updates to ACM in CMAQ warrant testing the impacts of the new features on model performance in the Southeast and comparison to the old RADM cloud module. We will apply the new ACM cloud module in the initial 2007 annual simulation and comparing the results to a sensitivity simulation with the RADM cloud module. Georgia Tech will execute the test simulations and, if necessary, will update the CMAQ RADM cloud module accordingly. The evaluation will focus on not only the wet deposition but also aerosol and related gaseous model species. In this sensitivity we will conduct the simulations for each selected diagnostic test, and as stated above use our expertise along with the standard model performance evaluation procedures described in section 6 “Air Quality Model Evaluation Protocol” to assess the different configuration choices. The performance evaluation, however, will emphasize specially on the individual subject in each test. Note that in the “Number of Vertical Layers (19 vs. 34)” test, we will re-process the IC, BC, meteorology and emissions inputs since the change of the vertical layers. In the “MEGAN vs BEIS biogenic emissions” test we will prepare the additional MEGAN biogenic emissions. And in the other three tests we will integrate corresponding modules or fixes into the standard CMAQ 4.7.1 codes. After finishing all of the tests, we will combine the assessment information to recommend a final model configuration to be used for the final annual 2007 and subsequent CMAQ simulations. GIT/UNC/CIRA-SEMAP 004.v3 4-11 April 29, 2011 SEMAP Modeling Protocol 4.5 Final Base Case Simulation The objective of this task is to develop a final annual “actual” 2007 base year air quality model simulation on 36-km and 12-km resolution modeling grids using the final recommendation on model configurations from the previous diagnostic modeling. After optimizing the model configuration and input data for the “Actual 2007” simulations using the sensitivity results from the previous task, we will produce final annual SMOKE, CMAQ and CAMx simulations of 2007 air quality. The selection of configuration options in CAMx will be based on the final recommendation on model configurations for CMAQ and will be discussed with SEMAP TAWG before application. After preparing the final “Actual 2007” emissions data and GEOS-CHEM BCs for both CMAQ and CAMx, we will run annual simulations with both models on the 36-km and 12-km SEMAP grids and prepare performance metrics for comparing the predictions from the simulations, as described in section 6 “Air Quality Model Evaluation Protocol”. All products will be posted to the project web site 4.6 Typical Base Case Simulation The objective of this task is to develop an annual “typical” 2007 base year air quality model simulation on 36-km and 12-km resolution modeling grids using the final recommendation on model configurations. The results from this task and future year simulations will provide air quality simulations in support of SIPs by SEMAP states. We will divide the modeling and analysis work on this task between Georgia Tech and UNC. Georgia Tech will serve as the overall task lead and perform the CMAQ modeling and UNC will conduct the SMOKE and CAMx modeling. The emissions modeling platforms (EMPs) for the typical year emissions data prepared by UNC will be combined with the actual 2007 meteorology and BCs used in the “Final Base Case Simulation” to simulate typical year air quality with CMAQ and CAMx. We will use the same final model configurations applied in the “Final Base Case Simulation” task for CMAQ and CAMx to perform the annual simulations for this task on both SEMAP modeling grids. The same SMOKE configuration, including the use of representative days to optimize the emissions processing tasks, will be applied to the typical year simulations. We will plan on producing daily emissions files for the meteorology-dependent and episodic emissions sectors, such as elevated point sources and fires. We don’t anticipate rerunning the biogenic emissions for this task. Similar to the previous task, we do not recommend using CMAQ2CAMx to produce separate emissions data for explicit CAMx simulations. We will use the same approach of using CMAQ2CAMx to process the fire emissions only, otherwise producing the non-pyrogenic CAMx emissions with SMOKE. The typical year simulations conducted here will take advantage of the SMOKE, CMAQ, and CAMx configurations developed under the “Diagnostic Model Sensitivities” task and applied in the “Final Base Case Simulation” task. We are able to offer cost savings for these simulations because they represent incremental changes to the modeling set up. By only requiring the replacement of certain emissions sectors, the production of the typical year simulations relative to the actual year simulation require less resources to complete, an important consideration given the large number of model runs requested for this Task. GIT/UNC/CIRA-SEMAP 004.v3 4-12 April 29, 2011 SEMAP Modeling Protocol All analysis products outlined in section 6 “Air Quality Model Evaluation Protocol” will be produced for the simulations conducted here and posted to the project web site. 4.7 Future OTB and Emissions Control Scenarios The objective of this task is to develop annual future year air quality model simulations on 36km and 12-km resolution modeling grids using the final configuration used in the base year modeling using both CAMx and CMAQ modeling systems. The results from this task will provide air quality simulations in support of the development of O3 and PM2.5 SIPs by SEMAP states. We will perform the following model simulations for this task. “2017 OTB” SMOKE, CMAQ, and CAMx “2017 Control” SMOKE, CMAQ, and CAMx “2020 OTB” SMOKE, CMAQ, and CAMx “2020 Control” SMOKE, CMAQ, and CAMx We will divide the modeling and analysis work for this task between Georgia Tech and UNC. Georgia Tech will serve as the overall task lead and perform the CMAQ modeling and UNC will conduct the SMOKE and CAMx modeling. The emissions modeling platforms (EMPs) for the future year emissions data prepared by UNC will be combined with the actual 2007 meteorology used in the “Final Base Case Simulation” to simulate future year air quality with CMAQ and CAMx. We will use the same final model configurations applied in the “Final Base Case Simulation” task for CMAQ and CAMx to perform the annual simulations for this task on both SEMAP modeling grids. The same SMOKE configuration, including the use of representative days to optimize the emissions processing tasks, will be applied to the future year simulations. We will plan on producing daily emissions files for the meteorology-dependent and episodic emissions sectors, such as elevated point sources and fires. We don’t anticipate rerunning the biogenic emissions for this task. All of the EMPs used for the future year simulations are detailed in Section 4.1. The emission inventory contractor will keep track of the emission reductions, control factors and efficiencies that will be applied in the emissions modeling for controls that are applied either through on-thebooks federal and state controls and any new state controls and the shutdowns of emission sources. Similar to the base year modeling, we do not recommend using CMAQ2CAMx to produce separate emissions data for explicit CAMx simulations for the various future year scenarios. We will use the same approach of using CMAQ2CAMx to process the fire emissions only, otherwise producing the non-pyrogenic CAMx emissions with SMOKE. 4.8 Emissions Sensitivities To provide useful information in determining the most effective control strategies for specific non-attainment areas, a series of emission sensitivities will be performed. A CAMx PSAT simulation will be performed using 20 categories, selected from a combination of source regions and emissions sectors, to generate results for an annual simulation on the 36-km SEMAP modeling grid. The PSAT categories will be selected based on discussions with the SEMAP Project Coordinator. In addition, five brute force CMAQ simulations of emissions sensitivities GIT/UNC/CIRA-SEMAP 004.v3 4-13 April 29, 2011 SEMAP Modeling Protocol for daily and annual PM2.5 using the 36-km SEMAP grid will be performed. For ozone, CMAQ will be used to perform five DDM and five brute force simulations of emissions sensitivities for 8-hour maximum O3 using the 12-km SEMAP grid during the 5-month ozone season. Addition sensitivities may be performed under the direction of the SEMAP Project Coordinator. 4.9 Interactive Database Tool This section describes the products that will be made available through the Interactive Database Tool (IDT). Ambient Data Display 2006-08, 2007-09, and 2008-10 (when available) design values for ozone and PM2.5 for all monitors. o Bar Charts o Table Display measured daily maximum 1-hour ozone values, daily maximum 8-hour ozone values, daily W126 ozone values, daily PM2.5 values (total and by species), quarterly PM2.5 values (total and by species), annual PM2.5 values (total and by species), and visibility (total and by species) for each monitor for years 2006-2010. o Bar Charts o Table Emissions Data Display emissions by pollutant (SO2, NOx, VOCs, NH3, OC, EC, Soils) and source category (EGU, non-EGU point, area sources, on-road mobile, non-road mobile, and fire). Determine for annual and ozone season (summer day). o Spatial Plots o Tables Modeling Results Display modeled daily maximum 1-hour ozone values, daily maximum 8-hour ozone values, daily W126 ozone values, daily PM2.5 values (total and by species), quarterly PM2.5 values (total and by species), and visibility (total and by species) for all days in modeled year (base year or future year). o Spatial Plots (grid cells) o Bar Charts (monitors) o Table (monitors) Capable of showing the differences between the base year and the future year projections on an absolute and relative (percentage) basis. o Spatial Plots (grid cells) o Bar Charts (monitors) o Table (monitors) Display quarterly and annual RRFs for each scenario. o Spatial Plots (grid cells) GIT/UNC/CIRA-SEMAP 004.v3 4-14 April 29, 2011 SEMAP Modeling Protocol o Bar Charts (monitors) o Table (monitors) Capable of displaying the in-state and out-of-state impacts of sensitivity (e.g. 30% NOx reduction, IC/BC, anthropogenic/biogenic) and source apportionment runs. o Spatial Plots (grid cells) o Bar Charts (monitors) o Table (monitors) Introduce a metric that accounts for the overall impact of emission reductions over all grid cells over a prescribed area as opposed to a single grid cell associated with a monitor location. (e.g., the sum over all grid cells of ozone reduction in ppb in each grid cell for a specific scenario or sensitivity in county X). o Spatial Plots (grid cells) o Bar Charts (user defined area) o Table (user defined area) Model Performance Capable of displaying model performance statistics and graphics o Spatial Plots (observations overlaid on model results) o Scatter Plots o Time Series Plots o Q-Q Plots o Stacked Bar Charts o Bugle Plots o Soccer Plots o Summary Statistical Tables General All of the above functions should be capable of analyzing the results by monitor, county, nonattainment area, or other group of counties (as defined by the user). All of the above functions should be capable of doing sub-portions of the year (e.g., date1 to date2). Ability to export the data to spreadsheets for additional analysis or processing. Ability to save plots and charts to PC. Ability for user to import their own data sets into the tool. GIT/UNC/CIRA-SEMAP 004.v3 4-15 April 29, 2011 SEMAP Modeling Protocol Section 5: Air Quality Model Evaluation Protocol The model evaluation described in this section applies to the 2007 actual CMAQ modeling. All products, including performance statistics and graphics, will be prepared for the 12 km domain only. 5.1 Ambient Air Quality Data We will collect the available surface air quality measurement data, measured atmospheric deposition data, and satellite data for the 2007 SEMAP modeling periods and domains. Particularly, we will collect surface measurement data of air quality from the following sources: EPA database AIRS (including STN, PAMS, SLAMS and Air Toxics networks) and networks CASTNet, IMPROVE and SEARCH, and will collect atmospheric deposition data from the network NADP and NASA’s satellite air quality data from OMI and MODIS. The following sections describe the data that we will collect for this project and the types of evaluation that can be performed with the data. 5.1.1 AIRS AIRS compiles and provides access to datasets from multiple national observational networks/programs including STN, PAMS, SLAMS and Air Toxics. The STN network provides 24-hr (midnight to midnight) measurements of PM2.5 and its composition including trace metals at every 3rd day or every 6th day. The SLAMS network provides hourly measurements of criteria air pollutants including PM2.5, PM10, O3, SO2, NO2, CO etc. The PAMS network measures photochemical smog related species such as O3, NO, NO2, NOy, Total NMOC, and 60 PAMS target VOC compounds in non-attainment areas. The AIRS datasets are spatially dense in populated areas with intensive sampling frequency in the SEMAP states and will be the backbone for model performance evaluation and receptor modeling. In addition the monitoring data from the Air Toxics program provides a critically important role by characterizing HAPs concentrations to support exposure assessments, and can be a supplement for air quality model evaluation and input for receptor modeling. The AMET evaluation tool developed at UNC can take all AIRS datasets in their original AQS engineering format directly. In addition, AIRS includes meteorological data at STN and PAMS sites that can be used for quality assurance of the MCIP files and as an independent evaluation of WRF performance, especially if the meteorological modeling contractor is not using this dataset. 5.1.2 IMPROVE The IMPROVE network was developed to monitor visibility degradation at wildness and remote areas in the nation. IMRPOVE includes measurements of chemical constituents including trace metals of PM2.5 as well as related gaseous species at each site. From these measurements, light extinction values are also calculated and provided. Note that different carbonaceous analysis methods have been used among the major national networks with IMPROVE using TOR and STN using TOT in measuring EC/OC. GIT/UNC/CIRA-SEMAP 004.v3 5-1 April 29, 2011 SEMAP Modeling Protocol 5.1.3 CASTNet CASTNet was developed to monitor dry and wet acid deposition. Monitoring site locations are predominantly rural by design to assess the relationship between regional pollution and changes in regional patterns in deposition. CASTNet also includes measurements of rural ozone and the chemical constituents of PM2.5. 5.1.4 NADP NADP provides atmospheric deposition data. The weekly cumulative deposition data will be extracted from CMAQ/CAMx outputs between the hours of data collection and assigned to the measured species. 5.1.5 SEARCH SEARCH provides continuous and speciated PM data in one urban and one rural location in each one of the following four SEMAP states: Alabama, Georgia, Florida and Mississippi. In addition to 24-hr particulate matter mass (fine and coarse) and composition (EC, OC, sulfate, nitrate, ammonium and metals) SEARCH also measures hourly PM mass and ion components (BC, OC, sulfate, nitrate, ammonium) and a wide range of complementary gaseous species (O3, NO, NO2, NOy, HNO3, SO2, CO, CO2). This dataset will be extremely useful to evaluate how well the models reproduce the diurnal variability (which is not possible with the daily averaged data), spatial variability (urban versus rural), and if the secondary PM formation processes were simulated correctly. We can speciate the SEARCH OC data (and OC data from ASACA sites which are operated by Georgia Tech) into individual compounds for use as molecular markers for periods of the February 2007 prescribed burning event and the May-June 2007 FloridaGeorgia wildfire episodes ((Yan, Zheng et al. 2008)). These additional datasets will be very useful for evaluating the model performance in capturing the important Southeastern fire events during 2007. In addition, the HAP and other VOC data collected at the SEARCH sites in the Atlanta, GA metropolitan area are valuable for both model performance evaluation and receptor modeling. 5.1.6 Satellite Data Air quality data collected by satellites provide a powerful supplement to ground-based air quality observations. Where ground-based observations are finite spatial measurements at the earth’s surface, satellite data provide atmospheric measurements on spatial grids that are commensurate with regional air quality modeling grids. Further, satellite data observe three-dimensional columns of pollutants. Satellite air quality data can be processed to match the spatial and vertical resolution of air quality model predictions, providing an observational dataset that is more spatially commensurate than ground-based measurements. We will use satellite observations of gases and particulate matter to provide supplemental evaluation data for diagnosing CMAQ and CAMx model performance. While these data will not be used to replace the ground-based monitoring networks, they will provide an additional dimension of atmospheric measurements to use in the model evaluations as they observe the entire tropospheric column of pollution. In particular satellite data will be useful in the SEMAP project to diagnose the ability of CMAQ and CAMx to simulate the impacts of the southern Georgia wildfire events in May-July 2007, the impacts of dust transport into the region, and other important episodic pollution events. These data are not intended for use in the SIP process, GIT/UNC/CIRA-SEMAP 004.v3 5-2 April 29, 2011 SEMAP Modeling Protocol they are only going to be used to lend insight into the skill of the air quality models to simulate air quality in the Southeast. We will acquire ambient air quality data, including satellite data from the OMI instrument on tropospheric NO2 and formaldehyde (HCHO), and aerosol optical depth (AOD) data from the OMI and MODIS instruments. Advanced post-processing tools developed by UNC are available to process the satellite data products, the Level 2 swath, or gridded data (Level 2 and Level 3), from their native lat-long grid to the Lambert conformal conic grid used by CAMx and CMAQ, for a user-specified domain and grid resolution. The UNC satellite data toolkit also consists of an advanced version of AMET that allows statistical comparisons of satellite data against CMAQ for selected sub-domains of the data. CMAQ output is not directly comparable with columnintegrated trace-gas concentrations, e.g., tropospheric column measurements of NO2, and HCHO. We will download the satellite data from the data warehouses (GIOVANNI or VIEWS as appropriate) for the base year 2007, re-grid them to the model grid, and post-process CMAQ and CAMx output for comparison with these data for the specified sub-yearly periods. For detailed information, the surface ambient air quality measurement data that will be used for evaluating model performance and input to receptor models are summarized in Table 5-1. Table 5-1. Surface Air Quality Data available for model performance evaluation and input to receptor models for the Southeast Variable PM10 PM2.5 Sulfate Nitrate Ammonium EC (BC) OC Trace Metals in PM2.5 NO, NO2 and NOy SO2 VOCs, HAPs Total Nitrate (c) Averaging time daily, hourly daily, hourly daily, hourly, weekly daily, hourly daily, hourly, weekly daily, hourly daily, hourly daily hourly hourly, daily, weekly hourly daily, weekly Database/Networks AIRS, IMPROVE, SEARCH AIRS, IMPROVE, SEARCH AIRS, IMPROVE, SEARCH, CASTNet AIRS, IMPROVE, SEARCH AIRS, SEARCH, CASTNet (a) AIRS, IMPROVE, SEARCH AIRS, IMPROVE, SEARCH AIRS, IMPROVE, SEARCH AIRS, SEARCH AIRS, CASTNet, IMPROVE, SEARCH AIRS, SEARCH (b) CASTNet, SEARCH (a) Ammonium data from IMPROVE are estimated from sulfate and nitrate based on complete neutralization of those ions. Independent measurements from CASTNet are more appropriate for model evaluation purposes (b) ARIES program measures HAPs and other VOCs at the SEARCH sites in Atlanta, GA area. (c) CASTNet weekly measurements of PM nitrate and HNO 3 will be evaluated as total nitrate due to issues relating to filter-based methods to collect volatile nitrate species. As existing data, surface measurements from the above specific networks or from EPA AIRS database are already quality-checked and well documented. Use of these existing data will be fully documented with data’s original documentation archived. In addition we will use the provided measuring uncertainty information to further screen the datasets and to decide excluding particular data points. Spatial plots and time series plots will be made for visual check of outliers that seems less reasonable. Additional documentation will be provided when necessary to address the issue of particular data points that seem less reliable than others. Also, descriptive statistics (mean, standard deviation, frequency distributions, etc.) will be calculated and be used for data presentation and cross comparison and to facilitate peer review. CrossGIT/UNC/CIRA-SEMAP 004.v3 5-3 April 29, 2011 SEMAP Modeling Protocol checking among the networks will be made too if applicable for further assurance of the data reliability. Special cautions will be used to double-check the quality of the acquired satellite data from OMI and MODIS instruments. The cloud cover information will be used in combination of other information to decide uncertainty for each individual data point. This uncertainty information will then be used for screening the final data to be used and be used to weight the reliability of each data point. Descriptive statistics (mean, standard deviation, frequency distributions, etc.) of the data will be also calculated and compared to the surface measurement, which is considered more reliable. Spatial plots will be made for satellite data and further compared with surface measurements for data screening purpose. All of the above procedures will be documented for quality assurance. 5.2 Evaluation Tools This section describes the software that will be used to conduct the analyses of the meteorology, emissions, and air quality data for the SEMAP project. 5.2.1 Atmospheric Model Evaluation Tool (AMET) The Atmospheric Model Evaluation Tool (AMET) (Appel et al., 2005) is a suite of software designed to facilitate the analysis and evaluation of meteorological and air quality models. AMET matches the model output for particular locations to the corresponding observed values from one or more networks of monitors. These pairings of values (model and observation) are then used to statistically and graphically analyze the model’s performance. More specifically, AMET is currently designed to analyze outputs from the PSU/NCAR Mesoscale Model (MM5), WRF, CMAQ, and MCIP-postprocessed meteorological data (surface only). The basic structure of AMET consists of two fields and two processes. The two fields (scientific topics) are MET and AQ, corresponding to meteorology and air quality data. The two processes (actions) are database population and analysis. Database population refers to the underlying structure of AMET; after the observations and model data are paired in space and time, the pairs are inserted into a database (MySQL). Analysis refers to the statistical evaluation of these pairings and their subsequent plotting. Practically, a user may be interested in using only one of the fields (either MET or AQ), or maybe interested in using both fields. That decision is based on the scope of the study. The three main software components of AMET are MySQL (an open-source database software system), R (a free software environment for statistical computing and graphics), and perl (an open-source cross-platform programming language). 5.2.2 Package for Analysis and Visualization of Environmental (PAVE) data PAVE is a flexible open source application to visualize multivariate gridded environmental datasets. It will be used for the SEMAP project to create 2-d spatial plots of meteorology, emissions, and air quality data. GIT/UNC/CIRA-SEMAP 004.v3 5-4 April 29, 2011 SEMAP Modeling Protocol 5.3 Evaluation Procedures 5.3.1 Ozone and precursors (AQS and PAMS). 5.3.1.1 Data Specifications. For each SEMAP and non-SEMAP state, we will create a table of statistics including mean normalized bias, mean normalized gross error, mean observation, mean prediction, ratio of predicted to observed means, mean bias, normalized mean bias, mean fractional bias, mean error, normalized mean error, mean fractional error, and correlation coefficient. All statistics will focus on the grid cell of the monitor, not the array of grid cells near the monitor. Separate summary tables will be created for 1-hour, 8-hour, 1-hour maximum, and 8-hour maximum ozone values using (1) no threshold and (2) a threshold of ≥60 ppb (observed value). In addition, separate summary tables will be created for daily ozone W126 and seasonal ozone W126. By state, monthly values will be calculated for all sites/all days, all sites/one day, and all days/one site. Depending on the availability of observations, these same statistics will be calculated for hourly (1-hour) ozone precursors and related gas-phase oxidants (e.g., CO, HNO3, H2O2, NOx, NOy, VOCs, and VOC species). The individual hourly and daily measured/modeled values used to perform these calculations will be delivered for all states in the 12 km domain in a separate text file for each state. States can then calculate additional statistics appropriate to their nonattainment areas. 5.3.1.2 Approach For each state, we will create monthly scatter plots of observed versus predicted values for 1hour ozone/8-hour maximum ozone values (all values, no threshold) and daily ozone W126 for March 1 through October 31 for all sites/all days, all sites/one day, and all days/one site. For each site, we will create monthly time series plots of observed versus predicted values for 1hour ozone/8-hour maximum ozone values (no threshold) and daily ozone W126 for March1 to October 31 for all days/one site. We will address whether the y-axis scale can be specific to each site so that observed values cover the full range of the axis even if some outlier data points are lost. For each state, we will create monthly Q-Q plots of observed versus predicted values for 1-hour ozone/8-hour maximum ozone values (no threshold) and daily ozone W126 for March 1 to October 31 for all sites/all days. For each state, we will create scatter plots, time series, and Q-Q plots as described above for available hourly (1-hour) ozone precursors and related gas-phase oxidants from AQS and PAMS sites (e.g., CO, HNO3, H2O2, NOx, NOy, VOCs, and VOC species). Daily and monthly average tile plots of monitored and modeled 8-hour maximum ozone values for March 1 through October 31. GIT/UNC/CIRA-SEMAP 004.v3 5-5 April 29, 2011 SEMAP Modeling Protocol 5.3.1.3 Diagnostic Evaluation As data are available (from PAMS and possibly SEARCH sites for continuous NOy, VOCs, etc.), we will develop ratios of indicator species to determine if ozone nonattainment areas are NOx- or VOC-limited. For each site, we will provide monthly scatter plots and monthly time series plots for all days. Plots will be accompanied by an interpretation of results (e.g., does the data suggest that areas are NOx- or VOC-limited or transitional?). 5.3.2 Fine Particulate Matter (FRM, STN, CASTNET) For FRM and STN sites, we will provide statistics for each state. For IMPROVE sites, we will provide statistics for all monitors in the 12 km domain, and for three subcategories: coastal states, interior SEMAP states, and neighboring SEMAP states. For the CASTNET, NADP (wet deposition), and SEARCH networks, we will provide statistics for all monitors collectively in the 12 km domain. 5.3.2.1 Approach We will create monthly tables of statistics including mean normalized bias, mean normalized gross error, mean observation, mean prediction, ratio of predicted to observed means, mean bias, normalized mean bias, mean fractional bias, mean error, normalized mean error, mean fractional error, correlation coefficient. We will present separate tables for PM2.5 and speciated components of PM by network and month. The values will be calculated for all sites/all days, all sites/one day, and all days/one site where “all sites” are defined above for each monitoring network. The individual hourly and daily measured/modeled values used to perform these calculations will be delivered for all states in the 12 km domain in a separate text file for each state. States can then calculate additional statistics appropriate to their nonattainment areas. We will create the following plots: monthly scatter plots of observed versus modeled PM2.5 and speciated components of PM2.5, for all sites/all days, all sites/one day, and all days/one site, by state for the FRM and STN networks, and for the 12 km domain for other networks. annual time series plots for PM2.5 and speciated components of PM, for all days/one site. monthly Q-Q plots of observed versus predicted PM2.5 and speciated components of PM for all sites/all days, by state for the FRM and STN networks, and for 12 km domain for other networks. tile plots of monitored and modeled PM2.5 and speciated components of PM, on a daily basis for the FRM, IMPROVE, and STN networks, and on a monthly basis for all networks separately. 5.3.2.2 Soccer and Bugle Plots. We will create the following plots and post to the project website: GIT/UNC/CIRA-SEMAP 004.v3 5-6 April 29, 2011 SEMAP Modeling Protocol For all ten SEMAP states together, soccer plots by speciated components of PM2.5 and total PM2.5 for all monitoring networks (CASTNET, IMPROVE, NADP, SEARCH, and STN with, each network having a distinct color or shape), by month with 3 months per plot. For each of the ten SEMAP states, soccer plots for PM2.5 mass for FRM monitors, by month, with all 12 months per plot. For all ten SEMAP states together, bugle plots by speciated components of PM2.5 and total PM2.5 including all monitoring networks, by month with 12 months per plot. For all ten SEMAP states together, bugle plots for all speciated components of PM2.5 on one plot, with separate plots for the IMPROVE network and STN network. For each of the ten SEMAP states, bugle plots for PM2.5 mass for FRM monitors, by month with 12 months per plot. 5.3.2.3 STN and IMPROVE Data. For each STN and IMPROVE site, we will create stacked bar charts of observed versus modeled speciated components of PM2.5 for each monitored day in a month, separate plots for each month. 5.3.2.4 Diagnostic Evaluation EPA Modeling Guidance recommends evaluating “indicator species ratios” to determine if NH4NO3 formation is limited by NH3 or NO3. Because CASTNET data is reliable for total N, but not components of particulate NO3 versus HNO3, and because NH3 measurements are not available in 2007 (with the exception of SEARCH sites), this analysis may not be feasible. Additional alternatives for this evaluation could be sought. 5.3.3 Regional Haze (IMPROVE). For the 12 km domain (21 IMPROVE monitors in SEMAP and neighboring states), we will use the new IMPROVE equation with monthly f(RH) values for both observed and modeled data. 5.3.3.1 Table of Statistics. We will create a table of statistics including mean normalized bias, mean normalized gross error, mean observation, mean prediction, ratio of predicted to observed means, mean bias, normalized mean bias, mean fractional bias, mean error, normalized mean error, mean fractional error, and correlation coefficient. We will present separate tables for total bext and speciated components of bext for 20% worst days/one site and 20% best days/one site. 5.3.3.2 IMPROVE Monitor Data. The individual daily values used to perform these calculations will be included in a single text file for all IMPROVE monitors. We will complete the following tasks: Create scatter plots of observed vs. predicted total bext and speciated components of bext for 20%W days/one site and 20%B days/one site. GIT/UNC/CIRA-SEMAP 004.v3 5-7 April 29, 2011 SEMAP Modeling Protocol Create bugle plots for total bext and speciated components of bext, using the average of all IMPROVE sites in the SEMAP 12 km domain and monthly average values (12 month averages for all sites by 6 components). See reference to Boylan and Russell 2006, Atmos. Environ. Create bugle plots for total bext and speciated components of bext, using the IMPROVE sites in the SEMAP 12 km domain and site average values for the 20% worst days by component. (21 site averages for all months by 6 components). Create bugle plots for total bext and speciated components of bext, using IMPROVE sites in the SEMAP 12 km domain and site averages for the 20% best days by component. (21 site averages for all months by 6 components). For the individual IMPROVE sites in the SEMAP states and for 6 neighboring IMPROVE sites, we will create: For each of the 20% worst and 20% best days in 2007 (defined by the new IMPROVE equation), a stacked bar chart for speciated components of PM2.5 (including sea salt) plus coarse mass for observed versus modeled 2007 actual results versus modeled 2007 typical results. 5.3.4 Model Performance Goals. While there are no bright line statistical measures to determine if model performance is acceptable for regulatory application, SEMAP will compare ozone and PM2.5 modeled statistics to some commonly used evaluation criteria. For ozone model performance, the traditional ≤ ±15% and ≤ 35% performance goals (Tesche, 1990) for mean normalized bias (MNB) and mean normalized gross error (MNGE) using hourly predicted and observed ozone pairs with the observed ozone value greater than 60 ppb will be used. For PM2.5, performance statistics of <±30% mean fraction bias and <50% mean fraction error indicate good performance, while performance statistics of <±60% mean fraction bias and <75% mean fraction error indicate average performance (Boylan, 2006). Recommended performance goals and criteria will be used to help identify areas that can be improved upon in future modeling. Exceeding these levels of performance may indicate fundamental concerns with the modeling system. Extended diagnostic evaluation and sensitivity tests should be performed to address extremely poor performance. GIT/UNC/CIRA-SEMAP 004.v3 5-8 April 29, 2011 SEMAP Modeling Protocol Section 6: Supplemental Analyses 6.1 Future Attainment Demonstration Analysis Protocol The overall goal of this task is to analyze future year model outputs to compute future year design values to demonstrate modeled attainment of the NAAQS. We will rely on guidance provided by the U.S. EPA (U.S. EPA, 2007) for demonstrating modeled attainment of air quality goals for O3, PM2.5 and regional haze to analyze the CMAQ and CAMx outputs from the future year simulations, where the emphasis is on using model outputs in a relative sense. In this guidance, the attainment test methodology uses model outputs and ambient data to estimate future year concentrations. Specifically, 1. Relative Reduction Factor (RRF) = Model predicted change (as a ratio) from base year to future year 2. Future Year Design Value (DVF) = Base Year Design Value (DVC) X RRF This procedure is fairly straightforward for O3, since there is only one component, and no speciation is involved. However, for PM2.5 and the regional haze reasonable progress, the attainment test needs to use all the PM2.5 species, where the RRFs need to be calculated for each individual PM2.5 speciated component, and total PM2.5 is reconstructed from the sum of all PM2.5 components. This procedure called the Speciated Modeled Attainment Test (SMAT) can be directly applied where speciated PM2.5 information is available. Since FRM monitors are the only ones that can be used to demonstrate attainment, the SMAT process can be applied relatively easily if the FRM monitors have collocated STN monitors. However, speciated information is not available at most FRM locations. Further, the measurements collected at the STN and IMPROVE monitors are not directly comparable to FRM measurements. To give a rough idea of this discrepancy, there are ~1200 FRM monitors in the country, but only ~250 STN and ~165 IMPROVE monitors, and over 75% of the FRM monitors do not have a collocated STN monitor. Figure 6-1 shows a map of the U.S. with the locations of the various monitors. GIT/UNC/CIRA-SEMAP 004.v3 6-1 April 29, 2011 SEMAP Modeling Protocol Figure 6-1. Location of various PM2.5 monitors in the U.S. for SMAT. While there is a need to perform interpolation of the STN data to the FRM locations, there are issues with using speciated data from monitoring networks due to issues with sampling methodologies. For example, sampling filters do not retain portions of volatile compounds, such as ammonium nitrate, and therefore lead to negative sampling artifacts. To alleviate this and other issues, SMAT uses the Sulfates, Adjusted Nitrates, Derived Water, Inferred Carbonaceous Mass and estimated aerosol acidity (H+) (SANDWICH) approach developed by Frank (2006). The speciation of SO4, NO3, EC, crustal material and salt are relatively straight forward and the information from the speciated monitors (with the exception of an adjustment for nitrate) are used. For nitrate, the speciated monitor input data is adjusted to account for volatilization. Because volatility is also an issue for monitor measurements for ammonium, ammonium is interpolated using the Degree of Neutralization (DON). DON is defined as DON = NH4,so4 / SO4, where NH4,so4 is ammonium associated with sulfate and SO4 is sulfate. To help with the SANDWICH application and to perform the interpolation of STN data to the FRM locations, we will use the latest version of the U.S. EPA Modeled Attainment Test Software (MATS) (Abt Associates, 2009) with the post-processed CMAQ outputs. The interpolation is done using the Voronoi Neighborhood Averaging (VNA) scheme, which is also used in the BenMAP modeling system used for performing health and economic benefits GIT/UNC/CIRA-SEMAP 004.v3 6-2 April 29, 2011 SEMAP Modeling Protocol analyses of the air quality modeling. We will use all FRM monitors with valid data in the modeling domains. We will also repeat these calculations using Excel and compare the results. For each future year modeling scenario (at the two model grid resolutions of 36k and 12k, and for each of CAMx and CMAQ), we will analyze the model outputs using MATS and create various analyses products. We will develop routines to develop visualization outputs of the RRFs and DVFs for O3, PM2.5 and the glide slopes for demonstrating rate of progress for addressing regional haze goals using the best 20% and worst 20% visibility days. The output products for this task are given below. Tile-plots showing daily, monthly, quarterly and annual average changes in key pollutant species (including daily maximum 8h O3, daily average PM2.5 and the speciated components) Tables of Relative Reduction Factors (RRFs) and Future Year Design Values (DVFs) for 8hO3, PM2.5 for all sites in the SEMAP states Graphics showing progress towards goals for regional haze Spatial fields of RRFs and DVFs (which will address the unmonitored area analyses using the gradient adjusted option in SMAT. Comparison of DVFs between 36k and 12k model outputs, and between CMAQ and CAMx. We will post all analyses products on the interactive website that will be developed for this project. We will enable the interface to be queried from the web based upon a specific state or monitor that may be of interest to the SESARM team. GIT/UNC/CIRA-SEMAP 004.v3 6-3 April 29, 2011 SEMAP Modeling Protocol Section 7: Data Backup and Archival Protocol 7.1 Georgia Tech We pack our data (typically emissions are zipped, meteorology and air quality data are not) in 1 TB external hard drives and shelve them in a locked faculty office of the Ford Environmental Science and Technology (ES&T) Building in steel bookcases. Each drive is labeled properly and the list of contents is inserted in the package. Electronic copies of the contents are saved on regularly backed-up drives. External drives containing a second copy of critical data (e.g., irreproducible or costly to reproduce data) are stored in steel drawers in a separate locked office on a different wing of the same building. In case of fire, this wing is separated from the wing where the first office is located. 7.2 UNC At the end of each model simulation (MCIP, SMOKE and CAMx), we will archive all model inputs and outputs on the UNC ITS mass storage system and retrieve as necessary later. Every file that is copied to UNC’s MS is copied onto 2 tapes, and one tape is stored off site in a remote location for redundancy. Thus, in the event of one tape failing, there is always a backup option for accurate and timely retrieval of the data. All files copied to MS are also automatically compressed during the archival process. The copy process from the compute servers’ scratch disks to MS and back to the disks is done seamlessly with a simple UNIX ‘cp’ command, thus making the data available in near real-time during the entire process. At the end of SMOKE and MCIP processing, we will also make a copy of the outputs for CMAQ onto external hard drives, make a list of contents on the drive, label them accordingly and ship to GIT for CMAQ modeling. We will maintain a catalog of all data drives that will be created along with their contents on the project website for easy tracking and retrieval if necessary. At the end of each AQ simulation, we will run a series of post-processors to create layer one extractions along with aggregate metrics such as daily average, daily maxima, and these postprocessed AQ outputs will also be copied to an external hard drive and shared with GIT, for model inter-comparisons and QA. We will also create an archive of all model code and run scripts used in the modeling along with all logs that will be generated during the modeling. We will set up an automated way of rsyncing the run directories periodically with the mass storage system for continued archival of the code and scripts. GIT/UNC/CIRA-SEMAP 004.v3 7-1 April 29, 2011 SEMAP Modeling Protocol Section 8: Computer Resources Since the modeling activities are shared by both UNC and GIT, we describe below the specific resources that are available at each of these 2 institutions. 8.1 Georgia Tech Computing Resources The computational resources of LAMDA consist of one 8-node cluster (each node has dual 32bit 2.8 GHz Intel Xeon processors with 4 Gigabytes memory); one SunFire X4100 server (dual dual-core 64-bit AMD Opteron processors with 16 Gigabytes memory); two SunFire X4150 servers (each has dual quad-core 64-bit Intel Xeon X5450 processors with 16 Gbytes memory); one Penguin Computing Altus 1704 server (quad AMD Opteron 8431 6-core processors with 32 Gbytes memory); fifteen 2.2 to 3.0 GHz dual-processor Linux workstations (each has 2G-4G memory); and three Sun UltraSPARC-III Unix workstations. The total disk space on these systems is approximately 5 Tera-bytes. A separate data storage facility, consisting of six RAID systems, has the capacity of holding over 20 Terabytes of data. All these systems are supported and secured by the Information Systems Group (ISG), an independent unit of the School of Civil and Environmental Engineering at Georgia Tech. LAMDA is housed in the Environmental Science and Technology Building, a 287,000-square foot facility devoted only to environmental research. This 5-year old facility brings together, under one roof, researchers from the School of Earth and Atmospheric Sciences, the School of Chemical Engineering, and programs in Environmental Biology, Environmental Chemistry, and Environmental Engineering. The building is located in the main Georgia Tech campus within easy access of other facilities. The ES&T Building is equipped with 1-Gbps Ethernet which forms the backbone of LAMDA’s computational network. Georgia Tech is connected to the outside world through the Internet 2 (I2) network. The following server was ordered, in part using SERDP funds ($6,000) to be used for SEMAP modeling. [1] Relion 2712 Server Qty: 1 Unit Price: $8,255.06 Ext. Price: $8,255.06 Standard Features: - One or Two Intel Xeon 5500 Series ('Nehalem') Processors - QuickPath Interconnect, up to 6.4GT/s (GigaTransfers per second) - Up to 96GB of DDR3 RAM - Up to 12 Hot Swap 3.5" SATA or SAS Hard Drives, optional hardware RAID - Dual Integrated 10/100/Gigabit Ethernet, Dedicated 10/100 Management NIC - Integrated IPMI 2.0 Management Processor - Redundant 2 x 720W Redundant Power Supplies Selected Features: - Relion 2712, Base System GIT/UNC/CIRA-SEMAP 004.v3 8-1 April 29, 2011 SEMAP Modeling Protocol - Dual Intel Xeon E5520 Quad Core 2.26GHz 8MB max RAM speed 1066MHz - 6GB, DDR3-1333 ECC (3 x 2GB) - Hardware RAID 0/1/5/6/10/50: Adaptec 5445 w/ 512MB cache and BBU - RAID 6 Volume: 17166 GB (11+1 x 2TB SATA2) - Preload, CentOS, Version 5 - Standard 3-Year Warranty 8.2 UNC Computing Resources In addition to each UNC staff member having his or her own desktop workstation, UNC staff members have access to very high-end computing resources to support a project of this scope. The most important of these is access at no cost on a first-come first-served basis to UNC’s Information Technology Services (ITS) resources for research computing services. This includes an Applications Support Group (ASG) with excellent support for user’s problems and helping with benchmarking, optimizing models on different platforms. However, for some of the work that needs to be performed for the SEMAP project, we have budgeted to purchase dedicated resources (computer blades and scratch disk space to add to the computing cluster, tapes for mass storage, and disk space for dedicated web server) as necessary to ensure timely project deliverables. The most relevant computing resources that ITS maintains and supports, and that are available for this project are: Dell Cluster (topsail) with 4,168 CPUs o Login node : 8 CPUs @ 2.3 GHz Intel EM64T with 2x4M L2 cache (Model E5345/Clovertown), 12 GB memory o Compute nodes : 4,160 CPUs @ 2.3 GHz Intel EM64T with 2x4M L2 cache (Model E5345/Clovertown), 12 GB memory o Shared Disk : (/ifs1) 39 Terabyte IBRIX Parallel File System o Interconnect: Infiniband 4x SDR o Resource management is handled by LSF v.6.2 for serial and parallel batch jobs Linux Beowulf Cluster (Emerald) with a total of 340 processors o o o o o o o o o o o 2 Login Nodes and 168 Compute Nodes 18 Dual-CPU AMD Athlon 1600+ 1.4 GHZ MP nodes (2.0GB memory) 6 Dual-CPU AMD Athlon 1800+ 1.6 GHz MP nodes (2.0 GB memory) 25 Blade Centers, each having Dual-CPU Xeon 2.4 GHz/2.5GB MP nodes 96 Blade Centers, each having Dual-CPU Xeon 2.8 GHz/2.5GB MP nodes 15 Blade Centers, each having Dual-CPU Xeon 3.2 GHz/4.0GB MP nodes 8 Blade Centers, each having Dual-Core Xeon 2.0 GHz/4.0GB MP nodes 2GB DDR RAM on each node (total of 352 GB memory) 20GB EIDE Hard Drive for OS Myrinet Fiber PCI Adapter with 2MB, Myrinet Switch with 64 port enclosure Each master node installed with RAID 5 SCSI Controller connected to 3 76GB hard drives for scratch o Supports Load Sharing Facility (LSF) for serial and parallel batch jobs o Myrinet Switch with 64 port enclosure o OS: Redhat Enterprise Linux (RHEL 3) with XFS filesystem GIT/UNC/CIRA-SEMAP 004.v3 8-2 April 29, 2011 SEMAP Modeling Protocol o 20.0 TeraByte RAID array for scratch space Linux Beowulf Cluster (kure) with a total of 480 processors o 2 Login nodes and 62 Compute Nodes o 60 HP Quad-core blade servers each with 64 MB memory o 2 HP Quad-core blade servers each with 192 MB memory o 34.0 TeraByte RAID array for scratch space (Fiber Channel) o 40. 0 TeraByte RAID array for scratch space (SATA disks) o Additional 25.0 TeraByte space on GPFS Mass Storage silo o 175.0 Terabyte capacity using a SAM-FS file system o Provides near-line access for archived data o Access speeds from 19 to 30 MB/sec (much faster than LTOs or DLTs) o StorageTek Powderhorn 9840 tape cartridges of 20GB capacity o Dual backup of all data (redundancy in storage helps in data recovery) Software Resources o Portland Group Fortran Compilers o Intel Fortran Suite of Compilers o Complete Installation of ArcGIS, MATLAB, SPlus, SAS, etc. Additional details of UNC’s ITS hardware and software resources can be found at: http://its.unc.edu/Research/hardware/index.htm. Given the available resources above, we will perform all SMOKE and MCIP runs on the Emerald cluster, and the CAMx runs and all post-processing on the new Kure cluster at UNC. At the time of release of this protocol, the scratch disks on the two clusters are being reconfigured, and will all share the same disks facilitating easy access of the SMOKE outputs for the CMAQ simulations. GIT/UNC/CIRA-SEMAP 004.v3 8-3 April 29, 2011 SEMAP Modeling Protocol Section 9: Project Team Roles and Responsibilities Task 1. Project Management Task Leader: T. Odman (Georgia Tech) Other staff: Z. Adelman, S. Arunachalam, J. Eichinger, A. Hanna Task 2. Modeling Protocol Task Leader: S. Arunachalam (UNC) Other staff: Z. Adelman, J. Eichinger, A. Hanna, T. Odman Task 3. Data Acquisition/Dataset Preparation for Modeling and Evaluation Emissions Task Leader: Z. Adelman (UNC) Other staff: B. Baek, T. Odman, Research Assistant Meteorology Task Leader: A. Xiu (UNC) Other staff: K. Talgo Initial and Boundary Conditions Task Leader: S. Arunachalam (UNC) Other staff: E. Adams Ambient Air Quality Data Task Leader: Y. Hu (Georgia Tech) Other staff: N. Davis, T. Odman, U. Shankar, A. Valencia Task 4. Conceptual Description Task Leader: T. Odman (Georgia Tech) Other staff: Research Assistant Task 5. 36 km/12 km Emissions/AQ Annual Actual Base Year Simulations Task Leader: T. Odman (Georgia Tech) Other staff: E. Adams, Z. Adelman, S. Arunachalam, B. Baek, Y. Hu, K. Talgo, A. Valencia, Research Assistant Task 6. Model Configuration Diagnostic Sensitivity Modeling Task Leader: T. Odman (Georgia Tech) Other staff: E. Adams, Z. Adelman, B. Baek, Y. Hu, B. Naess GIT/UNC/CIRA-SEMAP 004.v3 9-1 April 29, 2011 SEMAP Modeling Protocol Task 7. Emissions and Air Quality Benchmark Simulations Task Leader: S. Arunachalam (UNC) Other staff: B. Baek, K. Talgo, A. Zubrow Task 8. 36 km/12 km Annual Typical/Future Year Emissions AQ Modeling Task Leader: T. Odman (Georgia Tech) Other staff: E. Adams, Z. Adelman, S. Arunachalam, B. Baek, Y. Hu, K. Talgo, A. Valencia, Research Assistant Task 9. Emission Sensitivity Modeling Task Leader: T. Odman (Georgia Tech) Other staff: Z. Adelman, S. Arunachalam, B. Baek, Y. Hu, A. Valencia, Research Assistant Task 10. Model Performance Evaluation Task Leaders: T. Odman (Georgia Tech) and S. Arunachalam (UNC) Other staff: N. Davis, U. Shankar, A. Valencia, Research Assistant Task 11. Future Year Model Projections Task Leader: S. Arunachalam (UNC) Other staff: N. Davis, T. Odman, A Valencia Task 12. Interactive Database Tool Task Leader: S. McClure (CIRA) Other staff: N. Davis, D. Del Vecchio, J. Huddleston, T. Odman, R. Ross, A. Zubrow Task 13. Receptor Modeling Task Leader: T. Russell (Georgia Tech) Other staff: Y. Hu, T. Odman, Research Assistant Task 14. Development of Technical Web Site Task Leaders: S. McClure (CIRA) and Z. Adelman (UNC) Other staff: J. Huddleston, B. Naess, T. Odman Task 15. Other Tasks as Assigned Task Leader: T. Odman (Georgia Tech) Other staff: TBD, based on types of tasks assigned by SESARM Task 16. Reporting Task Leader: J. Eichinger (UNC) Other staff: Z. Adelman, S. Arunachalam, Y. Hu, T. Odman, T. Russell, U. Shankar, A. Zubrow GIT/UNC/CIRA-SEMAP 004.v3 9-2 April 29, 2011 SEMAP Modeling Protocol Section 10: Documentation This project will generate the following reports: Draft project work plan to SESARM Final project work plan that incorporates SESARM Contract Officer’s comments and follows review of the draft project work plan by SESARM and its participating agencies Draft QAPP to SESARM Final QAPP that incorporates SESARM Contract Officer’s comments and follows review of the draft QAPP by SESARM and its participating agencies An initial version of the emissions and air quality modeling protocol. An interim report with a conceptual description of the O3, PM2.5, and regional haze problem in the SEMAP region will be prepared according to EPA guidance and submitted to the SEMAP TAWG, then finalized in response to their comments Document summarizing the technical transfer protocol and all model utilities and scripts used by the Project Team for the modeling tasks Results of all standard analyses specified in Attachment A of the RFP for (a) O3 and precursors; (b) fine PM, coarse PM, and constituent mass concentrations; and (c) regional haze metrics, as well as other analyses performed Monthly project reports summarizing the work performed on each task, progress on the deliverables, and expected progress during the next month Interim task reports documenting each major task milestone Draft of the final project report Fifteen hard copies of the final project report and a Microsoft Word or PDF version of the report posted to the project website Reports on tasks that use or develop specific data sets will include descriptions of data source(s) and methodologies used in their preparations and any information that might be critical to their use in the project (for examples, data gaps or errors). If possible, the documentation of the data will also include references (reports or publications) that may have further information on the data validity and usability. Copies of all project reports (including monthly reports) will be retained in electronic format through the duration of the project and sent to SESARM for archival as they are generated. The three universities participating in this project have access to advanced computer backup systems. For example, UNC utilizes a 175-terabyte mass-storage silo for data backup and near-line access of archived data. The availability of such a system is critical to keeping large volumes of data available in near real-time during the progress of the project. SESARM stores both financial and programmatic files for the appropriate length of time as determined in SESARM's Federal Assistance Agreements. 10-1 SEMAP Modeling Protocol Appendix A. SEMAP Emissions QA Worksheets These four worksheets are filled out as part of Level 1 of the QA documentation process (see Section 5.1.2.2.3). Table A-1 is the SMOKE configuration settings worksheet. All configuration options for each SMOKE program are contained in Table A-1. Fill in the configuration settings for each source category in the appropriate columns in the worksheet. Before SMOKE is run for the first time, these configuration settings must be confirmed by the project manager. Additional details about completing and archiving Table A-1 are provided in the body of the protocol in Section 5.1.2.2.3.1. Table A-2 is the inventory file recording worksheet. Used for recording SMOKE inventory files, Table A-2 requires the creation/receipt date, data source, file name, number of records, brief file description, and a listing of the variables/file contents for all inventory files used. Additional details about completing and archiving Table A-2 are provided in the body of the protocol in section 5.1.2.2.6.1. Table A-3 is the ancillary input file recording worksheet. Used for recording SMOKE ancillary input files, Table A-3 requires the file name, the SMOKE logical name, the creation/receipt date, data source, and a brief description of the file contents. Additional details about completing and archiving Table A-3 are provided in the body of the protocol in section 5.1.2.2.6.1. Table A-4 is the binary meteorology input and SMOKE output file recording worksheet. This worksheet requires the file type, creation date and source, file name, location on the RMC file servers, file ID, and any comments about the file. Additional details about completing and archiving Table A-4 are provided in the body of the protocol in section 5.1.2.2.6.1. Each worksheet is to be completed in full electronically and then printed for obtaining the various signatures. The completed soft copies of these worksheets will be kept in the centralized emissions QA documentation directory on the RMC compute servers. The completed and signed hard copies will be archived by the QA manager to include in the project final report. A-1 SEMAP Modeling Protocol Table A-1. Configuration settings worksheet for SMOKEv2.7b SMOKE Simulation ID: Checked?* Environment variable Description Setting to use Comments/exceptions Time-independent programs RUN_SMKINVEN Run inventory import program RUN_SPCMAT Run speciation matrix program RUN_GRDMAT Run gridding matrix program RUN_CNTLMAT Run control matrix program Area and point only RUN_ELEVPOINT Run elevated/PinG sources selection Point only RUN_MBSETUP Run on-road mobile setup for MOBILE6 On-road only RUN_NORMBEIS3 Runs normalized emissions program Biogenic only RUN_MET4MOVES Run temperature preprocessing for MOVES On-road only RUN_PREMOBL Run temperature preprocessing On-road only RUN_EMISFAC Run MOBILE6 emission factor generation On-road only RUN_LAYPOINT Run layer fractions program Point only RUN_TEMPORAL Run temporal allocation program RUN_TMPBEIS3 Runs temporal adjustments and speciation RUN_MOVESMRG Run MOVES merging program RUN_SMKMERGE Run merging program Time-dependent programs Biogenic only Quality assurance RUN_SMKREPORT Run for quality assurance reports QA_TYPE Produces particular reports Will run this several times to get different QA reports all Will also probably use custom as well Program-specific controls For Smkinven FILL_ANN_WSEAS Fill annual emissions based on seasonal values Setting HOURLY_TO_DAILY Y reads daily total only from hourly file Point only HOURLY_TO_PROFILE Y converts hourly data to source-specific profiles Point only A-2 A F RD NR M P B SEMAP Modeling Protocol SMOKE Simulation ID: Checked?* Environment variable Description Setting to use Comments/exceptions IMPORT_AVEINV_YN Y for importing annual-average inventories SMK_ARTOPT_YN Y uses area-to-point conversion IMPORT_GRDIOAPI_YN Y for imported pregridded emissions in I/O API format IMPORT_VMTMIX_YN Y imports VMT mix – EMS-95 inputs Mobile only SMK_VMTMIX_FIXFMT Y uses fixed-format VMT Mix file Mobile only RAW_DUP_CHECK Y for errors on duplicate records SMK_NHAPEXCLUDE_YN Y uses NonHAP exclusions file SMKINVEN_FORMULA Formula for calculating an additional pollutant SMK_BASYR_OVERRIDE Override value for base year of all inventories WKDAY_NORMALIZE Y normalizes weekly profiles for weekdays WEST_HSPHERE Y converts all stack coordinates to western hemisphere 0=Laypoint sets elev srcs; 1=use PELVCONFIG UNIFORM_STIME -1 or HHMMSS for uniform start hour for daily emissions days Setting For Cntlmat Set to VOC or ROG For Temporal UNIFORM_TPROF_YN Y makes all temporal profiles uniform ZONE4WM Y uses time zones for start of day/month This assumes that area- and pointsource input formats will be provided as average-weekday emissions and VMT will be provided as averageday VMT. This should be confirmed. Point only POLLUTANT_CONVERSION Y uses ROG-to-TOG file Y renormalizes temporal profiles M P N N N N 0 Default surrogate code RENORM_TPROF NR PMC=PM10-PM2_5 Not used for mobile VMT For Spcmat REACTIVITY_POL RD Point only For Grdmat SMK_DEFAULT_SRGID F Setting For Elevpoint SMK_ELEV_METHOD A A-3 N B SEMAP Modeling Protocol SMOKE Simulation ID: Checked?* Environment variable Description Setting to use Comments/exceptions For Tmpbeis3 BG_CLOUD_TYPE Method used to calculate PAR Biogenic only BIOG_SPRO Speciation profile code to use for biogenics Biogenic only BIOMET_SAME Y uses temperature and rad/cld data in same file Biogenic only BIOSW_YN Y for using seasons file in Tmpbeis3 Biogenic only OUTZONE Output time zone Biogenic only RAD_VAR Name of radiation/cloud variable Biogenic only TMPR_VAR Name of temperature variable Biogenic only PRES_VAR Name of surface pressure variable Biogenic only REP_LAYER_MAX Layer number for reporting high plume rise Point only SMK_SPECELEV_YN Y uses Elevpoint output file to set elevated sources Point only HOUR_PLUMEDATA_YN PHOUR file contains hourly plume data For Laypoint For Met4moves EPI_STDATE Define start dates of MOVES lookup table processing MOVES only EPI_ENDATE Define end dates of MOVES lookup table processing MOVES only TVARNAME Define the name of temperature variable MOVES only AVERAGING_METHOD Define the averaging method (Episode/Monthly/Dayily) MOVES only For Movesmrg MRG_TEMPORAL_YN Y merges with hourly emissions MRG_SPCMAT_YN Y merges with speciation matrix MRG_GRDOUT_YN Y outputs gridded file MRG_REPSTA_YN Y outputs for state totals MRG_REPCNY_YN Y outputs for county totals MODE_RPD Y process MOVES rate-per-distance lookup table MODE_RPV MOVES only MOVES only Y process MOVES rate-per-vehicle lookup table A-4 A F RD NR M P B SEMAP Modeling Protocol SMOKE Simulation ID: Checked?* Environment variable MODE_RPP Description Setting to use Comments/exceptions MOVES only MRG_GRDOUT_UNIT Y process MOVES rate-per-profile lookup table Units of gridded output file moles/s MRG_TOTOUT_UNIT Units of report file tons/day For Smkmerge MRG_TEMPORAL_YN Y merges with hourly emissions MRG_SPCMAT_YN Y merges with speciation matrix MRG_LAYERS_YN Y merges with layer fractions MRG_GRDOUT_YN Y outputs gridded file MRG_REPSTA_YN Y outputs for state totals MRG_REPCNY_YN Y outputs for county totals SMK_ASCIIELEV_YN Y outputs ASCII elevated file MRG_GRDOUT_UNIT Units of gridded output file moles/s MRG_TOTOUT_UNIT Units of report file tons/day MRG_REPORT_TIME Hour in OUTZONE for reporting emissions 230000 MRG_MARKETPEN_YN Apply reactivity controls market penetration Point only Reports will be generated by Smkreport So daily totals will be from 0-23 EDT Apply reactivity controls market penetration AREA_SURROGATE_NUM Number for land-area surrogate Biogenic only Multiple-program controls DAY_SPECIFIC_YN Y imports day-specific inventory EXPLICIT_PLUME_YN Y for special wildfire processing or UAM/REMSAD HOUR_SPECIFIC_YN Y imports hour-specific inventory REPORT_DEFAULTS Y reports default profile application SMK_AVGDAY_YN Y uses average day emissions instead of annual SMK_PING_METHOD 0=no PinG source, 1=use PELVCONFIG file SMK_EMLAYS Number of emissions layers SMK_MAXWARNING Maximum number of warnings in log file SMK_MAXERROR Maximum number of errors in log file OUTZONE Output time zone of emissions Not processing wildfires Setting No PinG sources to be run GMT A-5 A F RD NR M P B SEMAP Modeling Protocol SMOKE Simulation ID: Checked?* Environment variable Description Setting to use Comments/exceptions SMK_DEFAULT_TZONE Time zone to fix if missing COSTCY file VELOC_RECALC Y recalculates velocity from diameter and flow Point only USE_SPEED_PROFILES Y uses speed profiles instead of inventory speeds Mobile only A F RD NR M P Script settings SRCABBR Abbreviation for naming log files Setting QA_TYPE None, all, part1-part4, or custom PROMPTFLAG Never set to Y for batch processing AUTO_DELETE Y deletes SMOKE I/O API output files AUTO_DELETE_LOG Y automatically deletes logs without asking DEBUGMODE Y changes script to use debugger DEBUG_EXE Sets debugger to use when DEBUGMODE = Y NONROAD Y resets area files to nonroad files Episode settings G_STDATE Julian start date G_STTIME Start time (HHMMSS) G_TSTEP Time step (HHMMSS) G_RUNLEN Run length (HHMMSS) Daily files are from 0-23Z, and they need an extra hour! *A=Area; F=Fire, RD=Road dust, NR=Nonroad mobile, M=On-road mobile, P=Point, B = Biogenic We certify that all of the configuration settings listed in the worksheet above for the specified SMOKE simulation have been completed and verified. Production Modeler: QA Manager: Project Manager: __________________________________ __________________________________ __________________________________ Signature Signature Signature Date Date A-6 Date B SEMAP Modeling Protocol Table A-2. Inventory file recording worksheet for SMOKEv2.7b (sample entry shown in gray) SMOKE Simulation ID: Date, source 10/29/01, PES # of records File name pt96_wrap_102901a.ida 64,999 Description Species/contents Full 1996 point source inventory; WRAP, Tier1, Tier2 states (1 of 4) CO, NOx, SO2, VOC, NH3, PM10, PM2.5 We certify that the worksheet above lists all of the emissions inventory files used for the specified SMOKE simulation. Production Modeler: QA Manager: Project Manager: __________________________________ __________________________________ __________________________________ Signature Signature Signature Date Date A-7 Date SEMAP Modeling Protocol Table A10-3. Ancillary input file recording worksheet for SMOKEv2.7b (sample entry shown in gray) SMOKE Simulation ID: SMOKE logical name AGREF File name amgref.m3.us+mex.txt Creation date, source Date unknown, CEP Description Spatial cross-reference distributed with SMOKEv2.7b We certify that the worksheet above lists all of the ancillary SMOKE input files used for the specified SMOKE simulation. Production Modeler: QA Manager: Project Manager: __________________________________ __________________________________ __________________________________ Signature Signature Signature Date Date A-8 Date SEMAP Modeling Protocol Table A-4. Binary meteorology input and SMOKE output file recording worksheet for SMOKEv2.7b (sample entries shown in gray) SMOKE Simulation ID: File type* P Creation date, source 10/31/03, CEPAdelman Met 11/01/01, RMC-Wang File name Location on RMC servers File ID Comments pgts.wrap96h.$jdate.ncf /home/aqm4/edss2/cmaq/data/wrap96h/s moke/output wrap96ptV3 Merged point source file; based off of inventory file wrap96pointV3.022803.ida METCRO2D.wrap96.ncf /home/aqm4/edss2/cmaq/data/met/1996 wrap96 Converted from MM5 to Models-3 with MCIPv2.1 by Zion Wang (RMC) *A=Area, B=Biogenic, M=On-road mobile, P=Point, NR=Non-road mobile, RD=Road dust, WB=Windblown dust, F=Fire, Met=Meteorology, Mrg=Model-ready/merged We certify that all of the meteorology and SMOKE output files listed in the worksheet above have been documented and verified. Production Modeler: QA Manager: Project Manager: __________________________________ __________________________________ __________________________________ Signature Signature Signature Date Date A-9 Date SEMAP Modeling Protocol Table A-5. Emissions simulation tracking worksheet (sample entries shown in gray) SMOKE Simulation ID: Pre2002v1 Date 01/02/10 Months Completed Jan, Feb, Mar, Apr…. Source Category: Ar,Mb,MbvPt… Modeler AH QA BH EI Type: Annual, seasonal, monthly, daily Comments Reprocessed to fix a problem with the formatting of the raw inventory file. A-10 SEMAP Modeling Protocol Appendix B. SMOKE Emissions QA Checklist and Log for Warnings and Notes Table B-1 is the emissions QA checklist. Detailed explanations of the features to examine in each QA step in the table are provided in Section 5.1.2.2.3 and 5.1.2.2.4 The production modeler checks off and initials each QA step in Table B-1 as it is completed. Comments about the QA step or answers to the questions asked in each QA step can be added to the comments field in the checklist. After completion of the QA checklist, the production modeler will pass it off to the QA manager who will then verify and initial each step. Table B-2 is the gatekeepers QA checklist. Detailed explanations of the features to examine in each QA step in the table are provided in Sections 5.1.2.2.5.1. The gatekeeper checks off and initials each QA step in Table B-2 for each source category as it is completed. Table B-3 is the log for recording SMOKE warnings and notes. The purpose of this table is to ensure that the emissions modeling team is aware of all messages reported by SMOKE. The production modeler and QA manager are responsible for filling out Table B-2 and confirming all of the SMOKE messages (see procedure in Section 5.1.2.2.3.2). The QA checklists and the log are to be completed in full electronically and then printed. On the checklist only, the last step is to obtain the various signatures. The completed soft copies of the checklist and log will be kept on the RMC compute servers. The completed hard copies will be archived by the QA manager to include in the project final report. B-1 SEMAP Modeling Protocol Table B-1. Step-by-step emissions QA checklist for SMOKEv2.7b; subheadings list the section numbers of the protocol where detailed explanations of each step are provided. PM* QA* QA step Comments Model QA Configuration settings worksheet (Table A-1) Inventory files worksheet (Table A-2) Ancillary input files worksheet (Table A-3) Meteorology/Output files worksheet (Table A-4) Look for unexpected errors, warning, and notes Warnings and notes worksheet (Table B-1) Spatial surrogate/cross-reference checks Temporal profile/cross-reference checks Speciation profile/cross-reference checks Inventory import (Smkinven) Inventory data check Inventory coverage Annual, daily or hourly emissions Smkinven formula Resetting negative formula values Weekday normalization Missing rule fields Inventory records dropped Duplicate sources Missing or negative inventory records B-2 SEMAP Modeling Protocol PM* QA* QA step Comments Bad stack coordinates Missing stack parameters Missing/bad SIC codes Chemical allocation (Spcmat) Pollutant conversion Default profile assignment Bad profile assignment No speciation cross-reference/no default Incorrect NOx speciation Horizontal spatial allocation (Grdmat) Default surrogate Bad surrogate assignment No gridding cross-reference/no default Temporal allocation (Temporal) Output time zone Bad profile assignment No temporal cross-reference/no default Inventory base year inconsistent with start Redundant activity and inventory data Inventory vehicle types not in MOBILE B-3 SEMAP Modeling Protocol PM* QA* QA step Comments Elevated source selection (Elevpoint) Is Elevpoint needed Elevated source selection method PinG source selection method Vertical layer allocation (Laypoint) Hourly plume data method Bad plume values Special source selection file Vertical layer fractions sum to unity Emissions control and growth (Cntlmat) Control type Bad control packets Duplicate control packets Packet/inventory mismatch Bad output years in control packets Control packet years inconsistent with inventory Badly named control packet Pollutant-specific entries skipped MOBILE6 Meteorology variables MOBILE6 environment variables Group type MOBILE6 temperatures B-4 SEMAP Modeling Protocol PM* QA* QA step Comments Freeway ramp modeling MOBILE6 speeds MOVES Preprocessing Meteorology data : Met4moves Met data averaging method/Temperature variables Reference county fuel month Cross reference county MOVES rate-per-distance lookup table MOVES rate-per-vehicle lookup table MOVES rate-per-profile lookup table BEIS3 Meteorology variables Speciation profile code Biogenic season switch Area surrogate number Fire emissions Spatial and temporal locations Layer allocation Fire temporal and chemical profiles Fugitive/windblown dust emissions Merging emissions components (Smkmerge/Mrggrid/Movesmrg) Emissions units Elevated/PinG source selection B-5 SEMAP Modeling Protocol PM* QA* QA step Comments Plume rise method Emissions components Relative magnitude of component emissions files Comparisons to base inventory Types of MOVES lookup tables B-6 SEMAP Modeling Protocol PM* QA* QA step Comments Report generation (Smkreport) Qualitative Analyses Check spatial coverage of emissions Check temporal variability in emissions Check vertical distribution of emissions Create inventory regressions for all source categories Compare SMOKE reports vs. model-ready emissions values Quantitative Analyses Create Emissions regressions Create annual county/state total using daily modelready input files System QA: Software and Data Tracking Run scripts checked into CVS Assigns file checked into CVS SMOKE binary executables checked into CVS SMOKE source code checked into CVS Installation freeze point created Final files in $GE_DAT directory added to CVS Final inventory files added to CVS *The “PM” column is initialed by the production modeler after the QA step has been completed. The “QA” column is initialed by the QA manager after he/she has verified completion of the step. B-7 SEMAP Modeling Protocol We certify that all of the emissions QA steps listed in the checklist above have been completed and verified. Production Modeler: QA Manager: Project Manager: __________________________________ __________________________________ __________________________________ Signature Signature Signature Date Date B-8 Date SEMAP Modeling Protocol Table B-2. Gatekeepers emissions QA checklist for SMOKEv2.7b GK* QA step A RD SMOKE Inputs Inventory file formats Emissions inventory sanity checks SMOKE Outputs Create and review total domain hourly plots Create and review total domain daily layer fractions Compare daily domain totals from ASCII to netCDF files *The “GK” column is initialed by the gatekeeper after the QA step has been completed. B-9 NR M P Ag B SEMAP Modeling Protocol Table B-3 . Warnings and notes log for SMOKEv2.7b SMOKE Simulation Name: Modeler: Date: Comments: Program Warnings Notes Smkinven: Grdmat: Spcmat: B-10 SEMAP Modeling Protocol Program Warnings Notes Temporal: Cntlmat: Elevpoint: Emisfac: B-11 SEMAP Modeling Protocol Program Warnings Notes Grwinven: Laypoint: Mbsetup: Met4moves: B-12 SEMAP Modeling Protocol Program Warnings Notes Movesmrg: Mrggrid: Normbeis3: Premobl: B-13 SEMAP Modeling Protocol Program Warnings Notes Rawbio: Smkmerge: Tmpbeis3: Tmpbio: B-14 SEMAP Modeling Protocol Appendix C. References Abt Associates, Inc. (2009) Modeled Attainment Test Software (MATS) User’s Guide prepared for the U.S. EPA Office of Air Quality Planning and Standards, Bethesda, MD, March 2009. Adelman, Z., Quality Assurance Protocol – WRAP RMC Emissions Modeling with SMOKE, Prepared for the WRAP Modeling Forum by the WRAP RMC, Riverside, CA, 2003. Aneja, V. P., D. R. Nelson, P. A. Roelle, J. T. Walker, and W. Battye, Agricultural ammonia emissions and ammonium concentrations associated with aerosols and precipitation in the southeast United States, J. Geophys. Res., 108(D4), 4152, doi:10.1029/2002JD002271, 2003. Appel, K.W, R.C. Gilliam, N. Davis, A. Zubrow, S.C. Howard, Overview of the atmospheric model evaluation tool (AMET) v1.1 for evaluating meteorological and air quality models, Environmental Modelling & Software, In Press, Corrected Proof, Available online 28 October 2010, ISSN 1364-8152, DOI: 10.1016/j.envsoft.2010.09.007. Bey, I., D.J. Jacob, R.M. Yantosca, J.A. Logan, B.D. Field, A.M. Fiore, Q. Li, H. Liu, L.J. Mickley, and M.G. Schultz, 2001. Global modeling of tropospheric chemistry with assimilated meteorology: Model description and evaluation. Journal of Geophysical Research, Vol. 106, No. D19: 23,07323,095. Binkowski, F.S. and S.J. Roselle, 2003, Models-3 Community multiscale air quality (CMAQ) model aerosol component, 1 Model description. Journal of Geophysical Research, 108(D6) 4183, doi:10.1029/2001JD001409. Boylan, J.W. and Russell, A.G. (2006) PM and Light Extinction Model Performance Metrics, Goals, and Criteria for Three-Dimensional Air Quality Models; Atmos. Environ., Volume 40, pp. 4946-4959. Byun, D.W., and Ching, J.K.S., 1999. Science algorithms of the EPA Models-3 Community Multiscale Air Quality (CMAQ) modeling system. EPA/600/R-99/030, Office of Research and Development, U.S. EPA, Washington, D.C. Byun, D.W., and Schere, K.L., 2006. Review of the Governing Equations, Computational Algorithms, and Other Components of the Models-3 Community Multiscale Air Quality (CMAQ) Modeling System. J. Applied Mechanics Reviews 59 (2):51-77. CMAS, 2010, SMOKEv2.6 Release Notes. http://www.smokemodel.org/version2.6/html/Release_Notes_v26.html GA DNR, 2011 “Selection of CY2007 as Base Year for SEMAP Modeling Project”. Southeastern States Air Resource Managers, Inc., Forest Park, GA. Dunker, A.M., G. Yarwood, J.P. Ortmann, and G.M. Wilson (2002) The Decoupled Direct Method for Sensitivity Analysis in a Three-Dimensional Air Quality Model — Implementation, Accuracy, and Efficiency, Environ. Sci. Technol. 2002 (36). EPA, 2007, “Guidance on the Use of Models and Other Analyses for Demonstrating Attainment of Air Quality Goals for Ozone, PM2.5, and Regional Haze”, (April 2007) U.S. Environmental Protection Agency, Office of Air Quality Planning and Standards, Research Triangle Park, NC, EPA-454/B-07-002. EPA, 1999. “Draft Guidance on the Use of Models and Other Analyses in Attainment Demonstrations for the 8-hr Ozone NAAQS”. Draft (May 1999), U.S. Environmental Protection Agency, Office of Air Quality Planning and Standards, Research Triangle Park, NC. C-1 SEMAP Modeling Protocol EPA, 1991, “Guideline for Regulatory Application of the Urban Airshed Model”, U.S. Environmental Protection Agency, Office of Air Quality Planning and Standards, Research Triangle Park, NC, EPA-450/4-91-013. Frank, Neil. Retained Nitrates, Hydrated Sulfates, and Carbonaceous Mass in Federal Reference Method Fine Particulate Matter for Six Eastern U.S. Cities. Journal of the Air and Waste Management Association, Volume 56, pp 500-511, 2006 Foley, K. M., Roselle, S. J., Appel, K. W., Bhave, P. V., Pleim, J. E., Otte, T. L., Mathur, R., Sarwar, G., Young, J. O., Gilliam, R. C., Nolte, C. G., Kelly, J. T., Gilliland, A. B., and Bash, J. O.: Incremental testing of the Community Multiscale Air Quality (CMAQ) modeling system version 4.7, Geosci. Model Dev., 3, 205-226, doi:10.5194/gmd-3-205-2010, 2010. Guenther, A., T. Karl, P. Harley, C. Wiedinmyer, P. Palmer and C. Geron (2006). Model of Emissions of Gases and Aerosols from Nature (MEGAN). bai.acd.ucar.edu/Megan/index.shtml Guenther, A., B. Baugh, G. Brasseur, J. Greenberg, P. Harley, L. Klinger, D. Serca, L. Vierling. 1999. Isoprene emission estimates and uncertainties for the Central African EXPRESSO study domain. Journal of Geophysical Research- Atmospheres, 104 (D23):30625-30639. Hanna, Adel and William Benjey, Editors, 2006: Special issue on Model Evaluation: Evaluation of Urban and Regional Eulerian Air Quality Models, Atmospheric Environment, Vol. 40, Issue 26, pp 4809-5110. Houyoux, M. R.; Vukovich, J. M.; Coats, C. J. Jr.; Wheeler, N. M.; Kasibhatla, P. S. “Emission inventory development and processing for the Seasonal Model for Regional Air Quality (SMRAQ) project”, J. Geophys. Res., 2000, 105, D7, pp 9079-9090. Houyoux, M. R., C. J. Coats Jr., A. Eyth, and S. Lo, Emissions modeling for SMRAQ: A seasonal and regional example using SMOKE, paper presented at Computing in Environmental Resource Management, Air and Waste Manage. Assoc., Research Triangle Park, N. C., Dec. 2-4, 1996. Mebust, M.R., B.K. Eder, F.S. Binkowski, and S.J. Roselle, 2003, Models-3 Community multiscale air quality (CMAQ) model aerosol component, 2. Model evaluation. Journal of Geophysical Research, 108(D6) 4184, doi:10.1029/2001JD001410. Morris, R.E., G. Yarwood, C.A. Emery, B. Koo, and G. M. Wilson (2003a) Development of the CAMx One-Atmosphere Air Quality Model to Treat Ozone, Particulate Matter, Visibility and Air Toxics and Application for State Implementation Plans (SIPs), Presented at the A&WMA Specialty Conference, Guideline on Air Quality Models: The Path Forward, Mystic, CT, October 2003. Morris, R.E., G. Yarwood, C. Emery, Spyros Pandis, and F. Lurmann (2003b), Implementation of Stateof-Science PM Modules into the PMCAMx Photochemical Grid Model, Presented at the 96th Annual Conference and Exhibition of the A&WMA, San Diego, California, June 2003. Morris, R.E., S. Lau, and G. Yarwood (2003c), Development and Application of an Advanced Air Toxics Hybrid Photochemical Grid Modeling System Presented at 96th Annual Conference and Exhibition of the A&WMA, San Diego, California, June 2003. Office of Transportation and Air Quality (OTAQ), “Motor Vehicle Emission Simulator (MOVES) 2010: User Guide,” EPA-420-B-09-041, U.S. Environmental Protection Agency, Research Triangle Park, NC, December 2009. Otte, T. L. and Pleim, J. E.: The Meteorology-Chemistry Interface Processor (MCIP) for the CMAQ modeling system: updates through MCIPv3.4.1, Geosci. Model Dev., 3, 243-256, doi:10.5194/gmd-3-243-2010, 2010. C-2 SEMAP Modeling Protocol Tesche, T.W., Georgopoulos, P., Seinfeld, J.H., Lurmann F., Roth, P.M., 1990. Improvements in procedures for evaluating photochemical models. Report A832-103, California Air Resources Board, Sacramento, CA. Yarwood, G., R. Morris, and G. Wilson (2004) Particulate Matter Source Apportionment Technology (PSAT) in the CAMx Photochemical Model Presented at the NATO International Technical Meeting, Banff, Canada, October 2004. Baek, J. (2009). Improving aerosol simulations: Assessing and improving emissions and secondary organic aerosol formation in air quality modeling. School of Civil and Environmental Engineering. Atlanta, GA, Georgia Institute of Tecnology. Ph. D.: 140. Bhave, P., A. M. Carlton, et al. (2009). Evaluation of CMAQ for the Precursor-Specific Contributions to SOA in the Southeastern U.S. 8th Annual Community Modeling & Analysis System (CMAS) Conference. 19-21 October, 2009, Chapel Hill, NC. Eder, B., D. Kang, et al. (2006). "An operational evaluation of the Eta-CMAQ air quality forecast model." Atmospheric Environment 40: 4894-4905. Hogrefe, C., S. T. Rao, et al. (2001). "Evaluating the performance of regional-scale photochemical modeling system: Part II - ozone predictions." Atmospheric Environment 35(4175-4188). Hogrefe, C., S. T. Rao, et al. (2001). "Evaluating the performance of regional-scale photochemical modeling system: Part I - meteorological predictions." Atmospheric Environment 35(4159-4174). Hu, Y., J. Baek, et al. (2008). Improving PM2.5 forecasting ability of Hi-Res in southeastern United States. 7th Annual Community Modeling & Analysis System (CMAS) Conference, 6-8 October, 2008, Chapel Hill, NC. Hu, Y., J. Baek, et al. (2009). Forecasting PM2.5 in the Southeastern United States: An Evaluation of Recent Algorithmic Changes. International Aerosol Modeling Algorithms (IAMA) Conference, December 9-11, 2009, Davis, CA. Jimenez, J. L., M. R. Canagaratna, et al. (2009). "Evolution of Organic Aerosols in the Atmosphere." Science 326: 1525, doi:1510.1126/science.1180353. Napelenok, S. L., A. G. Carlton, et al. (2008). Updates to the Treatment of Secondary Organic Aerosol in CMAQv4.7 7th Annual Community Modeling & Analysis System (CMAS) Conference, 6-8 October, 2008, Chapel Hill, NC. Odman, M. T., Y. Hu, et al. (2007). " Determining the sources of regional haze in the southeastern United States using the CMAQ model." Journal of Applied Meteorology and Climatology 46: 17311743. Sakulyanontvittaya, T., T. Duhl, et al. (2008). "Monoterpene and Sesquiterpene Emisison Estimates for the United States." Environ. Sci. Technol. 42(5): 1623-1629, doi:1610.1021/es702274e. Sakulyanontvittaya, T., A. Guenther, et al. (2008). "Secondary Organic Aerosol from Sesquiterpene and Monoterpene Emissions in the United States." Environ. Sci. Technol. 42(23): 8784-8790, doi:8710.1021/es800817r. Tesche, T. W., R. Morris, et al. (2006). "CMAQ/CMAx annual 2002 performance evaluation over the eastern US." Atmospheric Environment 40(4906-4919). Yan, B., M. Zheng, et al. (2008). "Organic composition of carbonaceous aerosols in an aged prescribed fire plume." Atmos. Chem. Phys. 8: 6381-6394. C-3