Data-Distribution-Reqs - Computer Science and Engineering

advertisement
This is a two-phase study to address the high rates of infection by a specific resistant staphylococcal
bacterium in children who live in the Atlanta area. First, we will look at children who received care from
Children’s Healthcare of Atlanta (CHOA) at Egleston, Hughes Spalding, and Scottish Rite hospitals and
have a positive culture due to methicillin resistant Staphylococcus aureus (MRSA). We will compare these
children with those who have a methicillin sensitive Staphylococcus aureus culture (MSSA). The outcome
of the study in this phase consists of three components: (1) We will develop the methodology to extract
and integrate the variables of interest from the medical information and laboratory databases at these
locations so that comparisons can be made between those children who are infected by the resistant bacteria
to those children who are not. (2) We will develop efficient information monitoring techniques to
prospectively identify patients at the time of admission to the hospital, who have had a history of resistant
Staphylococcal infection. (3) Additionally, we will provide automatic integration of relevant medical and
laboratory variables in real-time for these patients. Another result of this phase of investigation is to
develop an automated system of determining whether or not patients are infected by a MRSA strain
originating from the hospital or from the community at large. This will be based on whether or not these
patients become infected within a specified time of their admission to the hospital, e.g., 48 hours, and the
specific antimicrobial susceptibility pattern of the bacterium isolated from the patient (phenotype of the
bacterium). In the second phase of this research, we plan to geo-code databases on all children who have
positive cultures for Staphylococcus aureus from January 2002 through December 2006 from the hospitals
and use geographic information system tools to determine which risk factors may contribute to infection by
these resistant staphylococcal strains. For example, presence of clustering of patients infected by CAMRSA will be identified based on the mapping of these patients’ residences. Incidence of infection and
other population information to determine certain risk factors can be derived by applying GIS to data
obtained from US Census Bureau. This population information can be grouped according to certain
boundaries, e.g., block groups, blocks, or census tracts. This research project will serve to link
communication and information technologies at three different hospitals in order to have a better
understanding of the epidemiology of children infected by CA-MRSA. The results generated by this
research will directly and indirectly impact the delivery of healthcare and assist with the development of
health policy guidelines as it pertains to infection control and healthcare quality within these three children
hospital sites. This research can be seen as a step towards the HSI vision of shaping the next health care
transformation (1).
Development of a system to combine data and information on children infected with certain resistant
bacteria from the three children’s hospitals servicing Atlanta area has the advantage of providing a
seamless access of information and may serve as a testbed towards enabling children health care
transformation to disease- prevention and health-focused (1). Concretely, this powerful tool can be used to
look at the impact and costs associated with hospitalization of these children. Infection control and quality
assurance programs can also use the information from such a system to critically look and address some of
the health guidelines and specific costs associated with containing the spread of resistant bacteria. This
integration of medical information with laboratory information is also useful from a research standpoint as
it creates the infrastructure needed to obtain data more efficiently. Although the existing technology at
Egleston and Scottish Rite sites provides for identification of patients who have a history of “multiple
resistant organisms”, this project will refine that process so that real time identification, specifically of
children infected by MRSA, can be linked to a database for collection of relevant medical and laboratory
information for prospective data analysis. Currently, there is no integration or method of “flagging,”
prospectively or retrospectively, those patients with MRSA at Hughes Spalding. Moreover, there is no
direct linkage of information between Hughes Spalding and the other two hospitals. Because many
physicians and other healthcare providers will provide services for all three locations, developing a linkage
of information systems will facilitate communication and, hence, better delivery of healthcare at all three
sites.
2. Broader impacts
Analysis of the data will provide valuable information regarding the epidemiology of those children who
are infected by CA-MRSA. Specifically, the information gained from using GIS will determine what risk
factors lead to the acquisition of this particular MRSA which originated from the community. The real time
aspect of the project will define the changing epidemiology of this bacterium as it moves from the
community into hospitals. This information gained has a significant impact on our healthcare system as we
strive to develop infection control policies to contain the transmission of resistant bacteria within the
hospital setting. The data can also provide insights as to the costs of treating such bacterial infections in our
hospitalized children. Development of this system will also improve the timing in which these children are
identified and result in a more efficient delivery of treatment. Perhaps more importantly, development of
real time data collection and integration across different informational systems will provide the backdrop
needed, so that this technology can be applied to population-based databases involving other emerging
infectious disease pathogens. One area of surveillance might include the Active Bacterial Core Surveillance
of the Emerging Infections Program Network, a collaboration between CDC and state health departments
and universities participating in the Emerging Infections Program Network(2). This collaboration was
developed in response to the increasing number of emerging infectious diseases of which CA-MRSA is one
of them. Developing the capability to efficiently integrate epidemiologic and clinical data across different
surveillance systems is an extremely valuable tool in the battle to decrease the spread of these pathogens
within our communities. Although GIS tools have been used in other medical areas, there are few studies
using GIS to look at clinical and molecular epidemiology of resistant bacteria in the US. There are major
health policies that can be impacted by findings based upon population-based studies using GIS
technology. For instance, communities identified to be at high risk for a specific phenotype of MRSA can
be targeted to receive services including empiric treatment tailored against that particular circulating
phenotype.
3. Background and Significance
Antibiotic resistant bacteria are becoming an increasing problem in the United States and worldwide as
they cause serious and, often times, life-threatening infections in the face of a limited armaterium of
antibiotic agents. Resistant bacteria usually originate from hospitals whereby selection pressure from
frequent antibiotic exposure leads to the development of different mechanisms of resistance. However, one
resistant bacterium today is believed to have originated from the community and not from the hospital—
community-associated methicillin resistant Staphylococcus aureus (CA-MRSA). Nationally, the incidence
of CA-MRSA infections is increasing at an alarming rate (3, 4). Morbidity associated with infection
caused by this bacterium range from minor skin or soft tissue infections (SSTI) to severe illnesses
(pneumonia, septic shock, bone infections) resulting often times in death (3, 5-8). The clinical presentation
is believed to be related to virulent factors of CA-MRSA (3). CA-MRSA reflect a particular strain of
resistant Staphylococcus aureus, namely, USA clone 300. These strains have a typical antimicrobial
susceptibility pattern which is characteristic of the phenotype which originated from the community-resistance to the antibiotics oxacillin or resistance to both oxacillin and erythromycin. We propose to
develop a model to systematically merge existing microbiology laboratory database information with
electronic medical record information to identify patients with resistant Staphylococcus aureus. From this
centralized database, we begin to differentiate which patients have infection by bacterial strains that
originated from the community versus those whose strains are acquired from within the hospitals. There are
few studies that have identified which risk factors are associated with infection by CA-MRSA. For
example, few studies have systematically addressed whether or not race/ethnicity, younger age, geographic
location (i.e., specific neighborhoods), crowding, or socioeconomic status increase the risk for infection.
Paramount to improving the healthcare delivery to children who are infected with this resistant bacterium is
quick identification of those children who are potentially infected so that proper and correct antibiotic
therapy can be given. We propose to use geographic information system tools to identify potential risk
factors for the acquisition of this resistant bacterium. Additionally, we propose to develop a system of
rapid identification of children who may have these risk factors for resistant bacterial infection at the time
they present for healthcare. Identifying those children who have risk factors for infection by CA-MRSA
based on area of residence or other demographic information can improve the delivery of preventative or
empiric treatment for clinical diseases caused by such resistant bacterial infections.
Using two approaches will provide key information regarding the epidemiology of those children who
are infected by CA-MRSA.
Aims
• To merge databases from laboratory, medical information databases, and electronic medical records
and identify children who have been infected with both resistant and non-resistant Staphylococcus
aureus through automated information update monitoring and tracking techniques.
• To identify risk factors for CA-MRSA using geographic information system tools and visual and
automated data mining methods.
Data Collection Assumptions:
Data variables of interest. For patients who had Staphylococcus aureus isolated from a clinical culture, the
following information will be obtained from the laboratory (See Appendix I for list of laboratory variables):
Patient medical record number, body site where culture was obtained, hospital location of patient at the
time of specimen collection, date of culture collection, time for culture to become positive, bacterial isolate
specimen number and complete antimicrobial susceptibility pattern. Demographic information on all
patients from whom Staphylococcus aureus was isolated will be obtained through the Medical Information
Department at each children’s hospital. (See Appendix I for list of medical information variables.) In
addition, information on patient days, by hospital unit, will be obtained for hospital administrative
databases.
Merging of medical information with laboratory information systems (See Appendix II for Timeline for
Proposal). It is anticipated that this phase of the proposal will be completed during the first quarter of the
funding period.
 At Scottish Rite and Egleston Hospitals. All clinical culture results from the period January 2002
through December 2006 will be obtained by downloading information from computer systems
used by Egleston and Scottish Rite campuses. Duplicate isolates will be excluded. This
information will be translated into a format by which GIS technology can be applied.
 At Hughes Spalding Hospital. Similarly, at Hughes Spalding, all clinical culture results from the
same period will be obtained. Methodology to extract and merge data from laboratory and
medical information systems at Hughes Spalding Hospital will involve the following:
1. The design of a consolidated database that will combine laboratory database of Hughes Spalding
Children’s Hospital with its medical information database in an effective manner. This requires
the design of a consolidated database schema such that the consolidated database can be used
efficiently for several purposes: (1) timely collecting and integrating of medical data and
laboratory data of those patients who have a positive culture; (2) timely detecting and flagging of
those patients who have a history of methicillin resistant Staphylococcus aureus (MRSA)
infection; (3) timely tracking of all patients and the interested variables that present with positive
culture for MRSA; and (4) providing efficient translation of consolidated data into a format by
which GIS tools can be applied.
2. The concrete methods for combining and linking both laboratory and medical information systems
at Hughes Spalding Hospital include (1) techniques for creating and maintaining a clinical cultures
positive database for Staphylococcus aureus from 1/2002 to 12/2006; (2) techniques for dynamic
joins of laboratory and medical information of the patients in real time with user-friendly graphical
interfaces for pediatric doctors and researchers to use; and (3) techniques for automated extraction
of interesting data items from the consolidated database for performing statistical analysis and data
mining.
Data Distribution Requirements:
Establishing a database prospectively at all three hospitals. Active surveillance for Staphylococcus aureus
infections will be performed on all patients with positive cultures. This phase of the project will be started
during the second quarter of the funding period and will be ongoing thereafter. This collection will occur in
real-time and involve the following measures:
1. Timely and efficient tracking of significant updates on patients with positive clinical cultures,
including the list of variables we have identified (see Appendix I). An example of such monitoring
services is to track patients whose cultures became positive within 48 hours of admission.
2. Providing end-user level facilities, such as GUI, visualization tools, data mining tools, to allow
pediatricians and pediatric researchers to analyze patients’ data and perform statistical data
analysis in real time.
3. Providing system-level facilities to support complex triggers for monitoring significant updates of
laboratory and clinical information.
4. The engineering and development of a real-time database update monitoring system on top of the
consolidated database, which not only will link laboratory database of Hughes Spalding Children’s
Hospital to its medical information database automatically in real time, but also will provide
efficient and reliable methods and techniques for identifying, tracking and alerting in real time the
significant updates on both laboratory data and clinical data of patients of interest.
Geo-coding databases of all children infected with Staphylococcus aureus based on antibiotic resistance
pattern. We will geo-code all patients with positive Staphylococcus aureus cultures for the study period
and will begin this process during the second half of the funding period. This geo-coded dataset will then
be sorted based on patients’ antibiotic resistance pattern as defined in Appendix I. Specifically, we will
group patients into those with CA-MRSA phenotype, non CA-MRSA phenotype and MSSA case controls.
Using ArcView 9.1 (Environmental Systems Research Institute (ESRI), Redlands, California) and
statistical software (e.g., SPSS, SAS), we will determine if certain demographic traits are associated with
risk for infection for the respective groups. Population information to look at possible risk factors including
housing conditions (e.g., overcrowding), race, and household income will come from census data. For
example, Fridkin et al suggested in their population-based study the risks in Atlanta associated with
increased incidence of infection included being younger than two years or black (9). Similarly, risks
associated with either infection or colonization by CA-MRSA in studies of adults include history of
incarceration or HIV infection(10). Spatial clustering of isolates will also be assessed to determine whether
specific strains of resistant MRSA occur in specific communities.
Analysis plan. Data will be collected both retrospectively and prospectively in order to obtain complete
data for the time period of January 2002 through December 2006. The data will be used to determine
monthly rates of infection or colonization due to CA-MRSA. The incidences and rates of CA-MRSA, HAMRSA and nosocomial-MRSA will be determined. The denominator for each of these determinations is
based on the total number of cultures positive for Staphylococcus aureus. We will also determine the
incidence and rate of patients who presented to the emergency department for evaluation and treatment of
SSTI. The denominator for this will be based on the total monthly number of patients seen in the
emergency department. Finally, we will determine rates of infection or colonization from MRSA isolates
that are CA-MRSA phenotype but cultures were obtained and became positive after 48 hours of admission.
These results will be compared to other MRSA phenotypes, which were collected from patients after 48
hours of admission. Comparisons of clinical diagnoses, length of hospital stay, and demographic
information will be made between those patients who had cultures positive for CA-MRSA by phenotype
and those who culture for HA-MRSA. Additionally, similar information will be obtained from a similar
group of patients with cultures positive for MSSA. The laboratory and medical information variables will
be merged into a single database, which will be analyzed using SPSS version 11 (SPSS, Inc.) and SAS V.8
(SAS Institute, Cary, NC). Prevalence and overall incidence for positive cultures due to CA-MRSA, nonCA-MRSA, MSSA per year will be determined. Rates of CA-MRSA, MSSA, Nosocomial-MRSA and HAMRSA skin and soft tissue infections will be determined and analyzed using time series analyses. Study
protocol will be submitted for approval by Georgia Institute of Technology, Children’s Healthcare of
Atlanta, Grady Health System, and Morehouse School of Medicine Institutional Review Boards.
Potential Use of Findings
Results from this study will inform the healthcare community about incidence of CA-MRSA and identify
specific risk factors for infection. This information will be necessary in order for us to develop specific
health policy guidelines within communities at high risk for infection. Recommendations on which
antibiotics should be selected for empiric therapy can be based on the information gained from this study.
For example, if a patient presents for a skin infection and has one of the identified risk factors, treatment
with antimicrobial agents targeting CA-MRSA can be made. This will minimize the overuse and misuse of
broad spectrum antibiotics. The information gained by implementing this proposal will help develop
specific infection control measures which should be adopted by hospitals and other healthcare settings in
order to prevent the spread of this resistant bacterium. Developing and refining the methodology for this
project has significance in applying the technology towards other population based datasets. For example,
the Centers for Disease Control and Prevention have established an active ongoing surveillance system of
specific harmful pathogens through its Active Bacterial Core Surveillance. These are population based
surveillance systems from across the US. Seamless integration of the various information systems can
simplify the process of determining which pathogens pose the greatest infectious disease threat as well as
provide a way to systematically look for risks associated with infection by these pathogens.
Download