Plasmid-Mediated Restriction Evasion Mechanisms Thesis subm itted for the degree of Doctor of Philosophy at the U niversity of Leicester Nicola Jayne A lthorpe BSc D epartm ent of Genetics U niversity of Leicester September 1999 UMI Number: U594550 All rights reserved INFORMATION TO ALL USERS The quality of this reproduction is dependent upon the quality of the copy submitted. In the unlikely event that the author did not send a complete manuscript and there are missing pages, these will be noted. Also, if material had to be removed, a note will indicate the deletion. Dissertation Publishing UMI U594550 Published by ProQuest LLC 2013. Copyright in the Dissertation held by the Author. Microform Edition © ProQuest LLC. All rights reserved. This work is protected against unauthorized copying under Title 17, United States Code. ProQuest LLC 789 East Eisenhower Parkway P.O. Box 1346 Ann Arbor, Ml 48106-1346 Abstract Plasm id-M ediated R estriction Evasion M echanism s N icola Jayne A lthorpe Plasm id ColIb-P9 (Incll) encodes m echanism s w hich allow it to avoid d e stru ctio n by type I and type II restriction enzym es d u rin g transfer by A genetic system was co n ju g atio n b etw een strain s of Escherichia coli. developed to analyse these m echanism s. The system relied on m easuring C ollb-m ediated rescue of the restriction-sensitive plasm id R751 (IncPp) from destruction by EcoKI (type I) and EcoRI (type II). One Collb m echanism was k n o w n to involve a plasm id-encoded antirestriction gene k n o w n as ar dA , the p ro d u ct of w hich is active against type I enzym es. Tests for alleviation of EcoKI restriction of R751, show ed strong protection by a co-transferring C ollb (A rd+) p lasm id, slight protection w hen Collb w as resid en t in the recip ien t and no effect w h en Collb w as im m obilised in th e d o n o r by rem o v al of its n ic site. H ence, expression of a r d A is activated in the recipient cell follow ing transfer; no detectable transfer of the A rdA protein occurs from the donor to the recipient. The a r d A gene is found in the leading region of C ollb, w hich is defined as the first segm ent of the plasm id to enter the recipient cell during conjugation. N ucleotide sequencing of 11.7 kb of this region identified ten open reading fram es. Furtherm ore, the region also contains three dispersed rep e at sequences hom ologous to a novel single-stranded DN A prom oter described by Masai and A rai (1997, Cell 89, 897-907). It is proposed that these secondary structures form in the transferring T-strand of Collb and function as prom oters for tran scrip tio n of genes encoded on the u n iq u e plasm id stran d transferred d u rin g conjugation. A nother m echanism , w hich acts independently of a r d A , alleviates restriction of both type I and type II enzym es in the recipient in second or su b seq u en t ro u n d s of transfer. Two separate m echanism s ap p ear to be o p e ra tin g since a llev iatio n of type I restriction occurs in trans and is constitutive. In contrast, alleviation of type II restriction is by a cis- acting m echanism . The 's u b stra te sa tu ratio n ' h y pothesis, w h e reb y increasing am o u n ts of tran sferred D N A saturates the restriction system (Read et a l 1992, Mol Microbiol 6, 1933-1941) is ruled out by data presented. Acknowledgements I w o u ld like to thank the following; Brian W ilkins, A ngela Thomas, M atthew Thrower, Emm a 'C hick' Sw anston, Lynn Cook, Kerry 'K esser' Byrne, Pat W ilson, Stevey Bates, Bill Bram m ar, Steve Saville, The M edical Research Council, and all the Yeasty Boys, I w o u ld also like to thank any friends and colleagues not m entioned above and m y fam ily for p u ttin g up w ith me during the past four years. Abbreviations A la Arg A sn Asp ATP A R Ap bp BSA Collb CCC CmR C lm R Cys DNA D N A se dA TP dCTP dGTP dTTP drd ds EDTA e.o.p G in G lu Gly HGT H is HTH lie Inc IPTG IR kb kDa KmR K anR A la n in e A rg in in e Aspartic acid A sparagine A denosine trip h o sp h ate A m picillin resistance Base pair Bovine Serum A lbum in ColIb-P9 Covalently closed circle Plasm id-encoded chloram phenicol resistance C hrom osom al-encoded chloram phenicol resistance Cysteine Deoxyribonucleic acid D eoxyribonuclease D eoxyribo-adenosine triphosphate Deoxyribo-cytosine triphosphate D eoxyribo-guanidine triphosphate D eoxyribo-thim idine triphosphate D erepressed for transfer D ouble-stranded D iam inoethanetetra-acetic acid Efficiency of plating Glutam ic acid G lu tam in e Glycine H igh gelling tem perature H istidine H elix-turn-helix Iso-leucine Incom patibility group Isopropylthio- p-D-galactoside Inverted repeat Kilobase K ilodalton Plasm id-encoded kanam ycin resistance C hrom osom e-encoded kanam ycin resistance Leu LPS Lys m + m~ M et N a lR nt OD ORF PCR P he Pro r r rbs RifR *+■ - Leucine Lipopolysaccharide Lysine M odification proficient M odification deficient M e th io n in e Chrom osom e-encoded Naladixic acid resistance N ucleotide Optical density O pen reading fram e Polym erase chain reaction P heny lalan in e P roline Restriction proficient Restriction deficient Ribosome binding site C hrom osom e-encoded rifam picin resistance SDS Ser Sm R SpR Ribonucleic acid R ibonuclease Rotations per m inute Sodium dodecyl sulphate Serine Plasm id-encoded streptom ycin resistance Plasm id-encoded spectom ycin resistance ss SSB ssi StrR Single-stranded Single-stranded D N A -binding protein Single-stranded initiation sequence Chrom osom e-encoded streptom ycin resistance TAE T cr TetR Tris-acetate EDTA buffer Plasm id-encoded tetracycline resistance Chrom osom e-encoded tetracycline resistance Thr Tn Trp Tyr UV V al T h reo n in e T ransposon T ryptophan RNA R N A se rp m X-Gal Tyrosine U ltraviolet V alin e 5-brom o-4-chloro-3-indolyl-p-D -galactoside Contents Abstract A cknow ledgem ents A bbreviations General Intro d u ctio n 1 1.1 Prologue 1 1.2 Enterobacterial Plasm ids 3 1 .2.1 Incll plasm ids 4 1.3 Enterobacterial Conjugation 6 1.3.1 Conjugative cell-cell interactions 6 1.3.2 DNA processing reactions 9 1.3.3 Plasm id-encoded exclusion 15 1.4 Barriers to H orizontal DNA Transfer by 18 C hapter One Plasm ids 1.5 Bacterial Restriction-M odification Systems 20 1.5.1 Type I R-M systems 21 1.5.2 Type I enzym e activity 24 1.5.3 Regulation of expression of type I R-M genes 25 1.5.4 Type II R-M system s 29 1.5.5 Regulation of type II R-M genes 31 1.5.6 Ecological roles of bacterial R-M system s 32 1.6 Collb Leading Region Genes and Their 34 Roles in Plasm id Establishm ent 1 . 6.1 Single-stranded D N A -binding protein (SSB) 35 1. 6.2 Plasm id SOS inhibition (PsiB) 36 1.6.3 Alleviation of restriction of DNA (ArdA) 37 1.6.4 Zygotic Induction of leading region genes 38 1.6.5 Hypotheses for zygotic Induction 40 1.6.5. Modes of positive regulation 40 1.6.5.2 Modes of negative regulation 41 1.7 Restriction-Avoidance by Plasm ids 41 1.8 Aims and D irection of this Study 43 C hapter Two M aterials and M ethods 2.1 Plasm ids and Bacterial Strains 45 2.2 Media and Chem icals 46 2.2.1 M edia 46 2.2.2 Antibacterial agents 48 2.2.3 11.1 x PCR buffer 48 2.3 Phenotypic Characterisation of Bacterial 48 Strains 2.3.1 Colicin production 48 2.3.2 recA m utants 49 2.3.3 recD m utants 49 2.3.4 Sensitivity to bacteriophage 49 2.4 Strain M anipulations 50 2.4.1 Bacterial conjugation 50 2.4.2 Transform ation of bacterial strains 50 2.4.3 Bacteriophage propagation 51 2.4.4 Transposon insertion m utagenesis 51 2.5 DNA M anipulations 52 2.5.1 Small scale plasm id DNA isolation 52 2.5.2 Large scale plasm id DNA isolation 53 2.5.3 Analysis of the concentration and pu rity 53 of DNA samples 2.5.4 Restriction enzym e analysis of DNA 53 2.5.5 Isolation of DNA restriction fragm ents 54 2.5.6 D ephosphorylation of DNA fragm ents 54 2.5.7 Ligation of DNA fragm ents 54 2.5.8 Prim ers 55 2.5.9 DNA sequencing reactions 55 2.6 55 Standard and Inverse PCR 2.6.1 O ptim isation of the inverse-PCR protocol 56 for am plification of tem plate pN A 2 using prim er 5 and prim er 6 Genetic Analysis of a Plasm id-M ediated C h ap ter Three Restriction E vasion M echanism 3.1 In tro d u ctio n 58 3.2 Results and D iscussion 59 3.2.1 Effects of restriction on the conjugative 59 transm ission of plasm ids R751 and Collbdrd 3.2.2 Effects of co-transferring R751 w ith C o U b d r d 61 to a restricting recipient 3.2.3 Rescue of R751 from restriction by C o U b d r d 63 resident in the restricting recipient 3.2.4 Is conjugative transfer of CoUbdrd a 65 prerequisite for alleviation of restriction? 3.3 S u m m ary 68 C onstruction of an oriT M utant of C h ap ter Four Collbdrd and its Conjugative Properties 4.1 In tro d u ctio n 70 4.2 R esults 71 4.2.1 Cloning and determ ination of the nucleotide 71 sequence of the Collb oriT locus 4.2.2 Strategy em ployed to delete the nic sequence 74 from the cloned Collb oriT 4.2.3 C onstruction of pN A 8 containing the om ega 75 (Q) intersposon 4.2.4 Construction of a Collbdrd m utant lacking the nic reg io n 76 4.2.5 C onjugative properties of p N A ll and 76 pN A 12 4.3 C h a p te r Five D iscussion 78 A nalysis of the A lleviation of Type I and Type II Restriction U sing N on-T ransfer able M utants p N A ll an d pN A 12 5.1 In tro d u c tio n 80 5.2 R esults 80 5.2.1 Rescue of R751 from EcoRI restriction by 80 C o l l b d r d oriT m u ta n t p N A ll 5.2.2 M obilisation of EcoRI pro d u cin g plasm id 81 NTP14 by C o l l b d r d 5.2.3 A non-transferable Collbdrd confers no 82 EcoKI protection on R751 5.2.4 Collbdrd transfer does not m ediate 83 retrotransfer of NTP14 5.3 D iscussion 84 C onstruction of a mob M utant C h a p te r Six of Plasm id NTP14 6.1 In tro d u c tio n 86 6.2 R esults 6.2.1 Restriction analysis of plasm id NTP14 88 6.2.2 C onstruction of pNA 16 and pNA17 89 6.2.3 C haracterisation of pN A 16 and pN A 17 89 6.2.4 Collb does not alleviate EcoRI 91 restriction in trans. 6.2.5 Conjugative transfer of the EcoRI R-M 92 genes causes som e killing of recipient cells 6.3 D iscussion 93 D eterm ination and Analysis of the N ucleotide Chapter Seven Sequence Of the Collb Leading Region 7.1 In troduction 96 7.2 Results and D iscussion 97 7.2.1 Strategy for determ ining the nucleotide 97 sequence of the Collb leading region 7.2.2 Identification of putative protein coding 98 regions (or/s) in the Collb leading region 7.2.3 Similarities betw een DNA and p rotein 99 sequences found in the leading region to those present in the databases 7.2.4 G + C content of the Collb leading region 103 7.2.5 Similarity of the Collb leading region to 104 other conjugative plasm ids 7.2.6 Examination of the Collb leading region for 104 type I restriction enzym e cleavage sites 7.2.7 Repeat sequences w ithin the leading region 105 7.3 S um m ary 106 C hapter Eight General Discussion 108 A ppendices A ppendix I Literature cited Chapter One General Introduction 1.1 Prologue Extrachrom osom al genetic elem ents know n as plasm ids are comm only found in bacteria and determ ine functions involved in their stable replicationm aintenance. M any plasm ids also possess transfer genes (tra ) th at sup p o rt h o riz o n ta l tra n sfe r of the elem ent b e tw ee n b acteria by the process of conjugation. The first conjugative plasm id to be recognised was the F fertility factor of Escherichia coli (Lederberg and Tatum , 1946). Since the discovery of F, the m ajority of research has focused on developing a m olecular understanding of the conjugation system of F and other enterobacterial plasm ids (W ilkins and Lanka, 1993). To date, the m ost docum ented transfer system s belong to p lasm id s of the incom patibility (Inc) com plexes F, I, P, N and W. Essential fea tu res of th ese tra n sfe r system s, in clu d in g strateg ies for m atin g -p air form ation and tran sm issio n of genetic m aterial, are conserved betw een an extensive collection of plasm ids representing m ore than 25 Inc groups isolated from G ram -negative bacteria (Wilkins and Lanka, 1993). These co n serv ed features apply less to the conjugation system s of p lasm id s isolated from G ram -positive bacteria. In fact, the sim ilarities that exist are that they require physical contact betw een donor and recipient cells and th at conjugation is m ediated by plasm ids or a special class of conjugative transposon. T hus, a global definition of conjugation is that it is a "process w hereby a DNA m olecule is transferred from a donor to a recipient bacteria via a specialised protein com plex called the conjugation apparatus" (Zechner et ah, 1999). In a d d itio n to h o u sek eep in g fu n ctio n s in v o lv e d in rep licatio n m aintenance and transfer, m any plasm ids carry an array of specialised genes of hig h environm ental and m edical im portance. 1 The m ost interesting of these g en es are th o se th a t encode resistan ce to a n tib io tics an d v iru le n ce d eterm in an ts. O ne feature of plasm id transfer th at has b ro u g h t it m uch attention is its so-called prom iscuity; th at is, the ability of certain types of plasm ids to m ediate transfer betw een bacteria exceeding their m aintenancerange. D uring such transfer, there is the potential for the exchange of genetic m aterial by the process of recom bination. T herefore, p lasm id s m ake a significant contribution to the gene pool and hence, play an im portant role in the ecology of bacteria (Datta and H ughes, 1983; M azodier and Davis, 1991; Am abile-Cuevas and Chicurel, 1992). O ne potential barrier to horizontal transfer of DNA is that im posed by bacterial restriction-m odification (R-M) system s. These are u b iq u ito u s in n atu re and are therefore likely to be encountered frequently by transferring DNA. The restriction endonuclease com ponent operates to cleave infecting DNA recognised as foreign by its lack of cognate m ethylation at target sites for the enzym e. Plasm ids transferring and replicating w ithin a single host strain w ill acq u ire the a p p ro p ria te m ethylation an d avoid d e stru c tio n by the restriction enzym es of the host. In contrast, plasm ids transferring to new host strains w ill not carry the appropriate m ethylation. Therefore, prom iscuous transfer raises the intriguing question of how plasm ids avoid destruction from new ly encountered R-M systems. B acteriophages, like plasm ids, are also subject to d estru ctio n by hostencoded restriction enzym es during infection of a new strain. H ow ever, bacteriophages are know n to encode a variety of anti-restriction m echanism s. It is now becom ing clear that certain plasm ids also encode such protective m echanism s. This thesis focuses on one such plasm id, ColIb-P9 of the In cll g ro u p of plasm ids, and attem pts to characterise fu rth er the p lasm id 's tw o restriction-evasion m echanism s. Data described in th is thesis have been published in the papers Althorpe et ah (1999) and Bates et al. (1999). 2 1.2 Enterobacterial Plasmids The m ajority of large plasm ids (> 20 kb) isolated from G ram -negative bacteria carry transfer genes that prom ote transm ission of the elem ent to a new bacterial cell by the process of conjugation. conjugative or self-transm issible. Such plasm ids are described as M ost plasm id s are generally classified according to incom patibility (Inc) relationships based initially on the inability of tw o closely related or identical plasm id replicons to be p ropagated in the sam e cell line. The phenom enon of incom patibility reflects the id en tity betw een the two plasm ids and their replication control and active partitioning system s (Couturier et al., 1988). Further classification of plasm ids m ay involve c riteria such as i) rep lico n -ty p in g , ii) D N A h y b rid is a tio n tests, iii) m orphological and serological typing of conjugative pili and iv) sensitivity to pilus-specific bacteriophages (Bradley, 1980). As a result of these tests, it is the tren d th a t p lasm ids from the sam e Inc g ro u p often share sim ilar if n o t identical features betw een their transfer systems. The subject of this thesis is the 93-kb enterobacterial plasm id ColIb-P9 (Fig. 1.1), which along w ith closely related plasm id R64 represent the In c ll group of enterobacterial plasm ids. ColIb-P9 was originally isolated in Shigella s o n n e i P9 and like F w as one of the first conjugative plasm ids discovered (Fredericq, 1965). C onsequently, ColIb-P9 has featured extensively in the literature regarding the fundam entals of conjugation. H ow ever, the complex details regarding the conjugation system of this plasm id and F are beyond the scope of this thesis. Therefore, the reader is directed to a num ber of extensive review s that exist for further details (Willetts and W ilkins, 1984; W illetts and Skurray, 1987; W ilkins and Lanka, 1993; Lanka and W ilkins, 1995; Firth et al., 1996; Pansegrau and Lanka, 1996 and Zechner et al., 1999). To this end, only a general overview of the details regarding Il-like conjugation will be described in the follow ing sections. W here appropriate, analogies w ith F- and P-like conjugation will be m ade. 3 Fig. 1.1 M ap of In c ll plasm id Collb. Regions T ral and Tra2 contain the plasm ids transfer genes. Loci w ithin T ra l in clu d e the sh u fflo n (s h f ), shufflon recom binase (rci), th e flexible p ilu s stru c tu ra l gene (pilV ) and genes encoding an EDTA -resistant nuclease (n u c ). Tra2 includes DN A prim ase (sog), entry exclusion (eex ) and genes involved in rigid p ilu s assem bly. A djacent to the leading region is the origin of tran sfer (oriT) an d genes encoding relaxosom e p roteins (n i k A , nikB). re g io n c o n ta in s th e o rig in of v e g e ta tiv e re p lic a tio n The C ollb Rep (o r i V ), I l - ty p e incom patibility locus (me) and replication controlling factor (repZ). L eading region genes show n are those that encode the single-stranded D N A b in d in g p ro tein (s sb ), plasm id SO S-inhibition (psiB) and alleviation of restric tio n of DNA (ardA). O ther loci include the colicin lb gene (cib) a n d colicin lb im m u n ity gene (im m ), fertility inhibition of F (FinQ) and genes d e te rm in in g abortive phage infection (ibfA, ibfB). Indicated on the m ap by internal lines are EcoRI cleavage sites. Triangles (A) indicate Sail cleavage sites. M ap rep re se n ts data from studies w ith bo th Collb and closely related Incll plasm id R64 (Rees et al., 1987; H ow land et al., 1989; K om ano et al., 1990; F uruya et al., 1991 an d Chilley and W ilkins, 1995). M ap is reproduced from Read (1993). Fig-1-1 cib u m n w * Rep ,oriV repZ inc . imp ssb Tral 90 psiB flexible pilus j r ardA 60 pilV shf rci oriT nikA.B nuc finQ ee x sog rigid pilus? Tra2 Fig. 1.1 repZ inc . oriV flexible pilus imp ssb 90 80 Collb 93 kb psiB 20 ard A 60 40 o riT nikA.B nuc finQ sog eex rigid pilus? Tra2 1.2.1 In c ll plasm ids Plasm ids belonging to the I complex are typified by those w hich specify ho m o lo g o u s thin conjugative pili an d include plasm ids from the six Inc g ro u p s II (la ), 12, 15, B, K and Z. Some of these plasm ids can be fu rth er classified into one of tw o groups based on replicon-typing and serological relationships that exist betw een the conjugative apparatus they specify. These in clu d e the plasm ids from the Inc g ro u p s Il-B-K and the IncI2 plasm ids. D espite the relationships betw een these pili, plasm ids from the I com plex display different patterns of sensitivity to I-pilus specific bacteriophages Ifl and PR64FS (Walia and D uckworth, 1986; Bradley, 1984). All I-type plasm ids determ ine two types of conjugative pilus. The thick rig id p ili are one type, w hich are all m orphologically sim ilar an d b e ar com parisons to those specified by plasm ids of IncP and IncN g ro u p s as they su p p o rt efficient conjugation betw een cells on semi-solid surfaces. H ow ever, these thick rigid pili are also required in liquid m atings and are th o u g h t to be essen tial for DN A transfer (Bradley, 1980; Bradley, 1984; Rees et ah, 1987; H oriuchi and Kom ano, 1998). The second type is the thin flexible pilus, w hich is req u ired to su p p o rt efficient conjugation betw een cells in liq u id m edia. T h rough DNA sequence analysis of the R64 genes responsible for th in pili biosynthesis, these pili w ere inferred to belong to the type IV pilus fam ily (Kim and Komano, 1997). R64 thin pili consist of the major subunit PilS and one of seven PilV products (Horiuchi and Komano, 1998). The Il-B-K plasm ids have a narrow replication-m aintenance range and can be only m aintained in strains of E s c h e r i c h i a , S a l m o n e l l a , S h i g e l l a and Kle bsi ell a (Walia and D uckw orth, 1986; W ilkins, 1995). T hrough replicon typing tests, the replication region of the Incll plasm ids along w ith those of the g ro u p s IncFII, IncFI and IncFIV, have been show n to belong to the RepFIC fam ily of replicons. A pparently, the n arro w m aintenance ran g e of this replicon-type is due to the requirem ent for host encoded enzym es (Jones, 1991; C outurier et al., 1988; Saadi et ah, 1987). Despite the narrow transm ission range 4 of the Incll plasm ids they are still able to prom ote low-level transfer to a broad range of bacteria (Wilkins, 1995). A large proportion of the backbone of the Incll plasm id is assigned to the transfer system of the plasm id (-50 kb). The transfer genes of Collb are organised into two separate m odules, T ral and Tra2, as determ ined by genetic m apping studies of the plasm id [Fig. 1.1] (Uemura and M izobuchi, 1982; Rees et ah , 1987). T ral is located betw een Tra2 an d the rep region an d w as show n by transposon m utagenesis to carry genes that are required for the assem bly of the thin flexible conjugative pilus (Rees et ah, 1987). The com position of the thin pilus can be changed by rearrangem ents in a DNA sequence know n as the shufflon (Komano et ah, 1990). This highly mobile DNA sequence consists of three 300 bp segments separated by six 19 bp repeat sequences and is located at the 5' term inus of the thin-pilus gene, p i l V . Site-specific reco m b in atio n m ediated by the product of the associated rci gene selects a different C-term inal se g m en t of p i l V . C o n seq u en tly , th e conjugation efficiency in liq u id en v iro n m en ts is changed, possibly by affecting the recognition of surface co m p o n en ts of d iffe re n t recipients (K om ano et ah, 1994; H o riu ch i and Kom ano, 1998). The Tra2 segm ent contains genes involved in the assem bly of the thick rigid conjugative p ilus, en try exclusion (eex) and a gene encoding a DNA prim ase known as sog (H artskeerl and H oekstra, 1984; Rees et ah, 1987). Expression of these transfer genes, w ith the exception of eex, are subject to the p lasm id 's negative t r a n s -acting fertility inhibition system w hich is analogous to that of F-like plasm ids (Meynell and Datta, 1967). Finally, the oriT and o r iT operon, w hich are both hom ologous to IncP co u n terp arts can be found located adjacent to the p lasm id 's leading region (Rees et ah, 1987; Furuya and Kom ano, 1991). This and the corresponding regions of other II, B and K plasm ids are found to carry highly conserved zygotically inducible genes w hich are n o t essential for conjugation b u t are 5 believed to be involved in establishm ent of the plasm id in the new ly infected recipient cell. These genes are discussed in more detail in section 1.6. 1.3 Enterobacterial Conjugation The m odel of enterobacterial conjugation is divided into three crucial stages. The first stage involves the form ation of stable contacts betw een donor and recipient bacteria and is com m only referred to as m ating-pair form ation. The second stage involves a series of reactions that result in the transfer of a unique strand of the plasm id to the recipient cell. The final stage involves the estab lish m en t of the im m igrant plasm id in the recipient, w h ich m ay be facilitated by plasm id proteins transferred from the donor to recipient or those synthesised early in the infected cell (Wilkins and Lanka, 1993). 1.3.1 C onjugative cell-cell interactions Interactions betw een donor and recipient cells at surface levels are an essential first stage to conjugation. The pilus specified by all conjugative p lasm id s isolated from Escherichia coli is an essential feature in in itiatin g contact betw een m ating bacteria. These donor-specific surface ap pendages, w hich are also know n as sex pili, tend to fall into one of three m orphological g ro u p s in cluding th in flexible, thick flexible and rigid filam ents or ro d s (Bradley, 1980). The biological significance of the conjugative pilus as an organelle for m ediating contact betw een donor and recipient bacteria is established and its im portance is em phasised by the num ber of transfer genes that are assigned to its assembly. H ow ever, certain aspects of this initial stage of conjugation have yet to be resolved, the m ost interesting of which is the exact route for ssDNA transfer from the donor to the recipient bacterium after contact is initiated. From studies w ith F-mediated conjugations in liquid m edia, the process of conjugative cell-cell interactions is proposed to proceed in several stages (A nthony et al., 1994). First, the donor cells interact w ith recipients via the tip 6 of the pilus and a receptor on the surface of the recipient cell. The exact n ature and identity of these cell surface receptors are not yet known. H ow ever, early ex p erim en ts w ith ConF and ConI m u tan ts (defective in receiving F or I plasm ids by conjugation, respectively) of E. coli K-12 suggested that the surface lipopolysaccharide (LPS) and outer m em brane protein O m pA are the m ost likely candidates (Havekes et al ., 1977; A chtm an et al., 1978b; A nthony et al., 1994). However, w hether LPS or O m pA are the actual receptors recognised by conjugative pili or are involved in the biosynthesis or stru ctu ral integrity of the authentic receptor is not yet understood (Havekes et al., 1977; A nthony et al., 1994). A fter contact is initiated, the second stage of conjugative cell-cell interactions involves the retraction of the pilus w hich brings the donor and recipient cells into w all-w all contact (A chtm an et al., 1978a). A ssem bly of the extended F-pilus is a complicated process that is poorly understood. H ow ever, w hat is know n is that the process involves the products of 15 transfer genes, w h ich form a m em b ran e-sp an n in g com plex trav ersin g the bacterial cell envelope from the w hich the pilus is extended and retracted (M anning et al., 1981; S c h a n d el et al., 1992). R e tra ctio n of th e F -p ilu s in v o lv e s depolym erisation of its subunits into the cell m em brane (D iirrenberger et al., 1991; Frost et al., 1994). M ating aggregates in F-m ediated conjugations are at first sensitive to shear forces. H ow ever, once the m ating cells are in w all-w all contact the aggregate is rapidly stabilised by a process requiring the products of traG a n d traN (M anning et al., 1981; Panicker and M inkley, 1985). TraN can be found w ithin the outer m em brane of the cell envelope of the donor and contains an exposed surface dom ain, which may interact w ith the lipopolysaccharide of the recipient (Diirrenberger et al., 1991). The exact route supporting transfer of single-stranded DNA from the donor to the cytoplasm of the recipient is unknow n. Early studies w ith F- m ediated conjugations suggested that DNA transfer m ight occur through the 7 extended pilus (Ou and A nderson, 1970). Such a view was supported by the findings of H arrin g to n and Rogerson (1990) w ho found th at Hfr-C do n o rs separated from F recipients by a filter 6 pm thick w ith a pore size of 0.01-0.1 pm w ere still able to produce transconjugants. H ow ever, the favoured m odel to date is that the donor and recipient cell envelopes, once w ithin intim ate contact, fuse form ing a transm em brane pore which allows the transfer of DNA an d any proteins involved in DNA m etabolism to the recipient. In m atings m ed iate d by p lasm id F, TraD is fo u n d trav ersin g the in n er and o u ter m em brane of the donor (Panicker and M inkley, 1985) and appears to interact w ith O m pA form ing a p u tativ e pore betw een the d o n o r and the o u te r m em brane of the recipient for DNA transfer (D iirrenberger et al., 1991). No evidence of a plasm a bridge form ed betw een the two cells has yet been found, w h ich is a notion su p p o rted by D iirrenberger et al. (1991) w ho exam ined m atin g cells w ith in w all-w all contact by electron m icroscopy (W illetts an d W ilkins, 1984; D iirrenberger et al., 1991). The m odel is lacking in that there is n o ev id en ce for h o w the plasm id D N A translocates across the in n e r m em brane of the recipient. However, it has been proposed that this m ay be a passiv e process sim ilar to th at of infecting bacteriophage X, w hich u p o n injection into the periplasm ic space, X DNA is transported across the in n er m em brane by the sugar uptake m echanism for m annose (D iirrenberger et al., 1991). The putative signal for initiation of DNA transfer after contact betw een the donor and recipient has been established is proposed to be the product of the F gene traM. TraM which is found at the base of the F pilus is required to be in close contact w ith transfer replication proteins and binds extensive regions of the origin of transfer [oriT] (D iirrenberger et al., 1991; Zechner et al., 1999). TraM also provides the physical link betw een the TraD protein form ing p a rt of the m ating bridge and the relaxase protein Tral that is covalently bound to the transferring DNA. Finally, once DNA transfer has occurred conjugation is term inated and m ating aggregates separate (Achtman et al., 1978a). 8 From studies w ith IncP plasm ids, the assem bly of RP4 surface-specific pili is reasonably well understood (for review see Z echner et al., 1999). In addition, the transfer genes required for RP4-m ediated m atings betw een E. coli and from £. coli to strains of yeast are know n (Lessl et al., 1993; Haase et al., 1995; W ilkins and Bates, 1997; Bates et al., 1998). H ow ever, interactions at the cellular level betw een donor and recipient bacteria on surfaces have been neglected and to date no Con m utants have been isolated. This is unfortunate considering that m ost natural com m unities of bacteria are attached to surfaces or grow as m icro-colonies (Simonsen, 1990). P revious stu d ies w ith ConF” m u ta n ts isolated from liquid m atings w ere show n to d isp lay w ild ty p e behaviour w hen m ated w ith appropriate donors on surfaces, w hich suggests that the interactions that occur during surface m atings are different from those that occur in liquid (Achtman et al., 1978b) 1.3.2 DNA processing reactions The reactions required for tran sfer of a unique DNA stran d of the plasm id to the recipient cell during enterobacterial conjugation have been best studied for plasm id F, IncP plasm id RP4 and some m obilisable plasm ids from the IncQ group. Transfer of the plasm id strand to the recipient cell d u rin g conjugation is pro p o sed to occur in num ber of stages (Lanka and W ilkins, 1995). The first stage of plasm id transfer is initiated at the nic site w ithin the origin of transfer (oriT) by specific endonucleolytic cleavage of a single strand of the plasm id DNA (Lanka and W ilkins, 1995). C leavage is m ediated by a plasm id-encoded transfer gene product called relaxase. The relaxase enzym e b in d s to the o r i T in the presence of accessory pro tein s to form a com plex k n o w n as the relaxosom e. This com plex can be iso lated and u p o n d en atu ratio n w ith SDS releases an o p en circular plasm id th at is an ideal su b strate for determ ining the nic site of the o r i T by nucleotide sequencing (Fiirste et a l , 1989; Furuya and Komano, 1991; Lanka and W ilkins, 1995). 9 Cleavage of o r iT breaks a specific phosphodiester bond betw een tw o chem ically d ifferent nucleotides p resent at the nic site. From studies w ith p lasm id F, RP4, R64, RSF1010 and C olE l, the 3' term inus of nic has been sh o w n to carry an unm odified hydroxyl group th at is susceptible to p rim er extension by DNA polym erase I of E. coli and a 5' term inus that rem ains covalently attached to the relaxase enzym e via the hydroxyl side of either a seryl, threonyl or tyrosyl residue of the enzym e. The location of these active centre am ino acids in the relaxase v ary and can be found w ithin different d om ains of the enzym e depending on w hich plasm id encodes them (Lanka an d W ilkins, 1995). The biochem istry of these reactions at the o r iT is best understood for RP4 relaxosomes (Pansegrau et al ., 1990b). W hether cleavage of the oriT is influenced by an unidentified signal synthesised d uring cellular interactions betw een the donor and recipient cell is still w idely debated. From studies w ith plasm id F, the p ro d u ct of the t r a M gene has been im plicated in this signalling process. TraM is able to interact w ith TraD of the m ating bridge and the orzT-protein com plex (section 1.3.1) suggesting th a t the p rotein is involved in initiating the transfer process (Di Laurenzio et al., 1991). An alternative view to that described above is that cleavage of the o r i T is a reversible process th a t occurs in the absence of recipient cells. It is envisaged th at the relaxase enzym e covalently bound to the 5' term inus of the p lasm id stran d is also able to m ediate a cleaving-joining reaction involving th e hydroxyl free en d of the 3' term inus. Such a p roperty of the relaxase suggests that the enzym e is also involved in recircularisation of the transferred p la sm id stra n d in the recipient cell and consequently in term in atio n of conjugation (Wilkins and Lanka, 1993; Pansegrau et al., 1994b). Exam ination of RP4 relaxosom es in non-conjugating cells rev ealed th a t T ral p o ssesses cleaving-joining activity, w hich resem bles th at of a type I topoisom erase (Pansegrau et al., 1990; Pansegrau et al., 1994b). The m odel proposes that the energy p roduced from the cleaved phosphodiester bond and the form ation of 10 the covalent Tral-orzT interm ediate is conserved and utilised in the re-joining reaction. The superhelical covalently closed circle (CCC) state of the plasm id, w hich is believed to be required in the nicking reaction, is retained by the relaxase form ing a non-covalent clamp to the 3' end at the nic site (Pansegrau et al. , 1990a; M atson and Morton, 1991). In addition, this CCC state m ay also be required so that w hen the plasmid is unw o u n d the energy is provided to drive transfer of the plasm id strand to the recipient cell. The o r i T is the only czs-acting site required for D N A transfer. The location of the oriT is usually at one end of the transfer com plex of the plasm id and can take up to 500 bp in length. The oriT can generally be divided into four dom ains including, i) the nicking dom ain, which is also know n as the nic sequence, ii) an AT-rich region, w hich m ay facilitate stra n d sep aratio n in n egatively supercoiled DNA, iii) one pro m o ter, resp o n sib le for o u tw a rd read in g tran scrip tio n of either the tra or mob genes and iv) a region w ith p a tte rn s of direct and indirect repeats that m ay facilitate recognition and bin din g of specific proteins associated w ith the oriT (Di Laurenzio et al., 1991). O ne interesting feature of the oriT of plasm id F and RP4 is the requirem ent for the form ation of secondary structures th at alter the topology of the associated region and ap p ear to enhance protein-orzT complex form ation (W ilkins and Lanka, 1993; Pansegrau et al., 1994b). Extensive nucleotide sequence sim ilarities are only found betw een the o r i T of closely related plasm ids, although significant sim ilarities are found betw een the nic regions (-10 bp) of a m ore diverse set of conjugative plasm ids (Furuya and Komano, 1991; Fig. 1.2). There are four fam ilies of a particular type of nic region that are identifiable, one of w hich includes the nic sequences from m obilisable plasm ids (Pansegrau and Lanka, 1991; Lanka and W ilkins, 1995; Zechner et al., 1999). The sequence of the Collb o r i T w as unavailable at the start of this research. How ever, nucleotide sequences of Incll plasm id R64 suggests that the Il-orzT bears m ore sim ilarity to those of IncP plasm ids RP4 and R751 than that 11 of plasm id F. Both the II and P oriT have a 19-bp and 17-bp inverted repeat sequence dow nstream of the nic site. In addition, the 12-bp nic region of R64 shares six conserved nucleotides w ith the nic region of RP4 and R751 (Komano et ah, 1988; Furuya and Komano, 1991; Fig. 1.2). The relaxase of R64 is the product of the nikB gene, w hich together w ith n i k A is found directly adjacent to the o r i T as the or iT operon (Furuya et ah, 1991). NikB is responsible for cleavage possibly via the N -term inal dom ain of th e p ro tein , alth o u g h the entire p ro te in is req u ired for relax atio n and m obilisation of R64. NikA (15 kDa) is involved in recognition of oriT a n d directing relaxosome assem bly (Furuya et al., 1991; Furuya and Komano, 1991). In agreem ent w ith the finding that the am ino acid sequences of certain DNA relaxases share striking similarities, both NikA and NikB have been show n to resem ble transfer counterparts of IncP plasm ids RP4 and R751. NikA shares 30% sim ilarity to TraJ and the N -term inal region of NikB shares 26% identity in a 276 amino acid overlap with Tral, w hich is required for relaxation (Furuya and Komano, 1991; Pansegrau and Lanka, 1991). The form ation of relaxosom es at the In cll oriT is a poorly understood process. H o w ever, th ro u g h in vitro stu d ies involving IncP p lasm id s the relaxation of RP4 is proposed to occur in a sequential order, w hich is enhanced by the form ation of a bend upstream of the nic site involving TraK (Pansegrau et at., 1990b; W ilkins and Lanka, 1993). RP4 TraJ binds to the cognate or iT by recognising the right arm of the 19-bp inverted repeat sequence. RP4 Tral, w hich is the relaxase, binds only to the TraJ-orzT complex and not to the or iT alone (Pansegrau et ah, 1990b). The w hole structure is subsequently stabilised by p ro tein -p ro te in interactio n s betw een RP4 TraH and the p ro te in -o n T com plex. M utations in the tral gene suggest that binding of Tral to the o r iT has an additional effect of reducing relaxase gene expression (Balzer et ah, 1994). For a m ore detailed review of RP4 and R751 relaxosom e assem bly see Z echner et al. (1999). The sim ilarities betw een NikA and NikB to their RP4 and R751 counterparts and betw een the In cll or iT and IncP o r i T suggests a 12 Fig. 1.2. S im ilarities b e tw ee n the n ic regions of conjugative p lasm id s. Figure show s the align m ent of several n ic regions from the o r i T s of various conjugative plasm ids. The nucleotide sequences of these nic regions generally fall into one of three groups. The nic cleavage site is indicated (▼). Identical nucleotides are indicated by a black box, conserved nucleotides by a grey box. The consensus sequence generated for each group is also sh o w n . rep ro d u ced from Lanka and W ilkins, 1995) (Figure is Fig. 1.2 Plasm id RP4 R751 pTF-FC2 R64 pTiC58 RB pTiC58 LB N ucleotide sequence of nic region C T T C T T c A A A A T C A C G C C C C C T A A A A G G A A C C G C T T C A T A A A | T C C C T T SI fl T T T T G G T T T u T RSF1010 R1162 p T Fl pTiC58 oriT pSC lO l pIP501 pG O l A A T C A T T A C c G G A C c G G T A c T C G A G T A A A G T C G C G T A T c G C A T T T A T c T G T G C G W G A A C T A N G G G G A T D G ▼G G T G G GG T G G G G T G G G G G A A T A G G A T C A G C R K D R C S T T T T T A C j II G G T G G T K G T B Y G T A G T G G G T A A D T T T T T m A W G T C A A G G C A Y A T C C T G F P307 R100 pED208 R46 R388 c G G C c G C C T C C A II 9 1 T C C T C c T A T T C T C c ▼ T c c G C C C T C C C C T G T G G A T A T A C T A C sim ilar m echanism of relaxosome form ation and cleavage reaction operating for this group of plasm ids. Recent studies w ith R64 NikA have show n th at in a sim ilar w ay to RP4 TraJ, NikA recognises and binds to a repeat sequence 17bp in length found 8-bp from the nic site. Such binding of NikA w as also sh o w n to in d uce DNA bending w ith in the o r i T seq u en ce (F uruya and Kom ano, 1995; Furuya and Komano, 1997). The next stage of DNA transfer involves the u n w in d in g of the cleaved p lasm id DNA strand and transfer to the recipient cell in a unidirectional m anner (H ow land and Wilkins, 1988; Lanka and W ilkins, 1995). U nw inding is m ediated by one or m ore DNA helicases, some of w hich m ay be encoded by the plasm id. Studies w ith plasmid F have show n that the Tral protein has the ad d itio n al p roperties of a helicase, how ever, the function of F T ral as the helicase responsible for unw inding the strand destined for transfer has y et to be show n genetically (Wilkins and Bates, 1997). F Tral unw inds the DNA in a 5'-3' direction relative to the strand that is bound to the enzym e (M atson and M orton, 1991; W ilkins and Lanka, 1993). From com parisons betw een the in vitro rate of unw inding by DNA helicase I (1 kb per sec at 37°C) and the rate of chrom osom al DN A transfer from Hfr donor cells (0.75 kb per sec at 37°C), it is p roposed th at unw in d in g m ay provide the m otive force for DNA transfer as long as the tran sfer com plex is anchored to the cell e n v elo p e allow ing displacem ent of the u n w o u n d strand. Possible candidates for anchoring of p lasm id F T ral-D N A com plex to the cell envelope are TraD and TraM (Silverm an, 1987). It is generally believed that nicking at the o r i T relaxes the plasm id com pletely. H ow ever, exam ination of plasm id F after ran d o m nicking by irradiation show ed that dom ains of the plasm id retain negative supercoiling (Wilkins and Lanka, 1993). If the DNA strand that is nicked by the relaxase is restrain ed from free rotation, DNA helicase activity w o u ld o v erw in d the d u p lex ahead of the enzym e and th u s p rev en t fu rth e r u n w in d in g . This problem could be overcome by a topoisom erase that prevents accum ulation of 13 positive torsional stress. One possible candidate is DNA gyrase, w hich is re q u ire d d u rin g bacterial conjugation as show n th ro u g h a n u m b er of experim ents involving naladixic acid, an inhibitor of the A subunit of the enzym e, and the use of tem p eratu re sensitive g y r m u ta n ts (M atson and M orton, 1991; W ilkins and Lanka, 1993). D uring transfer of plasm id F only one stran d is tran sferred to the recipient cell (Cohen et ah, 1968; V apnek and Rupp, 1971). U pon exam ination of w hich X p rophage strand is transferred d uring H fr m atings, it w as found th at the plasm id F DNA strand is transm itted in the 5'-3' orientation (Willetts and W ilkins, 1984). The orientation of transfer for other plasm ids has n o t yet been dem onstrated. H ow ever, the sim ilarities betw een plasm id protein-onT interactions to those of plasm id F and the orientation of oriT and gene entry to recipient cells suggests that m ost enterobacterial plasm ids transfer in the sam e orientation (H ow land and Wilkins, 1988; Lanka and W ilkins, 1995). The th ird stage in DNA transfer involves the recircularisation of the p lasm id strand and the regeneration of a replacem ent strand in the donor by a rolling circle m ode of replication and synthesis of a com plem entary strand in the recipient (Traxler and M inkley, 1987; W aters and G uiney, 1993; W ilkins and L anka, 1993). In fo rm a tiv e e x p erim e n ts in v o lv in g m a n ip u la te d m obilisable plasm ids w ere able to show that the 3' term inus created at the nic site can be elongated continuously by a DNA polym erase p resent in the donor cell (see Wilkins and Bates, 1997). The m odel further suggests that the plasm id strand destined to be transferred is an interm ediate of greater than unit length, w hich u p o n exposure of the regenerated nic site to the relaxase m olecule linked covalently to the 5' term inus of the concatom er is cleaved (Wilkins and Bates, 1997). Experim ents involving Collb and plasm id F provide no evidence th at the p lasm id is tran sfe rred to the recipient as a concatom er; since sed im en tatio n -rate stu d ies indicate tran sfe rred DNA accum ulates in the recipient cell as u n it length m olecules (Falkow et al., 1971; Boulnois and W ilkins, 1978). 14 G eneration of the com plem entary strand in the recipient is believed to involve the product of the E. coli dnaE gene, DNA polym erase III holoenzym e (Boulnois and W ilkins, 1978). H ow ever, due to the orientation of transfer of the plasm id strand (5'-3'), replication proceeds discontinuously and requires a series of prim ers synthesised de novo (Willetts and W ilkins, 1984). A feature found for Incll and IncP plasm ids is that the transferred strand is escorted to the recipient w ith m ultiple copies of plasm id-specific DNA prim ase m olecules generated by the plasm id-encoded prim ase genes sog and tra C, respectively (Boulnois and Wilkins, 1978; Lanka and Barth, 1981; Chatfield et al., 1982; Rees and W ilkins, 1990). Both the Sog and TraC proteins can be transferred along w ith the plasm id DNA from the donor to the recipient cell (Rees and W ilkins, 1989; Rees and W ilkins, 1990). Plasm id F relies on host-encoded replication m achinery for the generation of these prim ers (M erryw eather et al., 1986; W ilkins and Lanka, 1993). 1.3.3 Plasmid-encoded exclusion Transm issible plasm ids usually carry entry- or surface-exclusion genes, the products of w hich reduce the ability of the host cell to receive plasm ids of the sam e or related type by conjugation. Exclusion should not be confused w ith incom patibility, w hich is m ediated from an entirely separate locus on the plasm id and p rev en ts the co-existence of tw o sim ilar p lasm id s w ith in the sam e cell (Finlay and Paranchych, 1986). Stages of a p lasm id 's conjugation cycle that can be influenced by exclusion are m ating-pair form ation a n d /o r DNA transfer (Rashtchian et al., 1983). One possible ecological role of exclusion m ay be sim ilar to th at of p lasm id fertility inhibition, as a m eans of reducing w astefu l and energy consum ing conjugations. In addition, exclusion m ay also serve to reduce entry of plasm ids that are likely to displace the resident elem ent by the process of incom patibility (Zechner et al., 1999). 15 The effects of exclusion w ere first noticed w hen the viability of F recipient cells was greatly reduced in com parison to F+ recipients w hen m ated w ith an excess of Hfr donors (Alfoldi et al., 1957; Skurray and Reeves, 1973). The phenom enon requires the p ro d u c tio n of pili by the donor and close physical contact betw een the donor and recipient cells (Skurray and Reeves, 1973). This recipient killing event w as nam ed lethal zygosis w hich is believed to be the result of intense m ating conditions and m ultiple conjugation events. Lethal zygosis is associated w ith m em brane dam age and leakage of m em brane constituents and cytoplasm ic com ponents, w hich p resu m ab ly kill the cell (Zechner et al., 1999). The role of exclusion in p reventing lethal zygosis is highlighted by the fact that F recipient cells are not killed w hen m ated w ith an excess of F+ donor cells. This is due to a functional exclusion system encoded by the F plasm id (Zechner et al., 1999). The m ost extensively studied exclusion system s (sfx) belong to the F-like plasm ids. Two genes, traS and traT, have been show n to provide exclusion by tw o in d ep e n d en t m echanism s. Both traS and t r a T are located w ithin the m ajor F transfer operon betw een traG and tr a D . TraS (TraSp) is a 16.9 kDa protein associated w ith the inner m em brane of the donor cell and functions by blocking DNA entry either by affecting the entry system of the donor or by interfering w ith the initiation signal generated follow ing successful m atingp air form ation (M anning et al., 1981; Finlay and Paranchych, 1986; Frost et al., 1994; Firth et al., 1996). TraT (TraTp) is a 26 kDa lip o p ro tein found in abundance in the ou ter m em brane of the donor cell, w hich serves to im pair the form ation of stable m ating aggregates, possibly by blocking access of the Fp ilu s tip to its p utative receptor or by interfering w ith the F products TraG a n d /o r TraN which are required to stabilise the m ating-aggregates (M anning et al., 1981; M inkley and W illetts, 1984; D reiseikelm ann, 1994; Frost et al., 1994; Firth et al., 1996). TraT alone is not sufficient for full expression of the exclusion system (Rashtchian et al., 1983). 16 Extensive sequence hom ologies exist betw een the exclusion genes of plasm id F and the functionally related genes of other F-like plasm ids (Finlay and Paranchych, 1986). The exclusion genes of R100 have been show n to have th eir ow n strong prom oters w hich are n o t subject to regulation by the tran scrip tio n al activator TraJ (Rashtchian et al., 1983). Also, dow nstream of t r a T a term ination signal can be found. H ow ever, it is believed that these genes are only transcribed from such prom oters under certain conditions that have yet to be characterised. Otherwise, they are transcribed from the m ain F Tra operon prom oter (Py) and are therefore subject to control by TraJ (Ham et al., 1989; Frost et al., 1994; Firth et al., 1996). The first exclusion determ inant studied for an I-type plasm id w as that of p lasm id R144. The exclusion (e x c ) locus w as show n to consist of tw o overlapping open reading frames that encode two polypeptides, 13 kDa and 19 kDa in size (H artskeerl et al., 1986). H ow ever, it w as later found th at the sm aller p o ly p ep tid e w as the result of translation being reinitiated in the reading fram e of the larger polypeptide. M utagenesis of the larger polypeptide revealed th at this p ro tein is essential for exclusion (H artskeerl et al., 1985). These polypeptides are now referred to as ExcA (19 kDa) and ExcB (13 kDa) and the exclusion locus as eex. C om parisons betw een the physical and genetic m aps of In cll plasm ids R144, R64 and Collb indicate that the exclusion locus of each is located at a sim ilar corresponding position on the plasm id (Furuya and K om ano, 1994; Rees et al., 1987). The nucleotide sequences of the eex genes of R144 and R64 h av e b een d e te rm in e d and extensive hom ology ex ists b e tw e e n th em (H artsk eerl et al., 1986; Furuya and Komano, 1994). The nucleotide sequence for th e exclusion genes of Collb only recently becam e available w hen the com plete sequence of the plasm id w as determ ined. These genes are 100% identical co those of plasm id R144 (EMBL: AB021078). From cell fractionation experim ents the larger p o ly p ep tid e (ExcA) encoded by R144 w as found bound to the inner m em brane of the donor on the 17 periplasm ic side bu t can also be found in a soluble form in the cytoplasm (H a rtsk e e rl et al., 1985a; H a rtsk eerl an d H o ek stra, 1985). p o ly p ep tid e ExcB can be found in the in n er m em brane. The second The stage of conjugation that is inhibited by I-type exclusion is unknow n, although m atinga g g re g a te fo rm atio n is u n affe cte d (H a rtsk e e rl a n d H o e k stra , 1984). M easurem ents of the am ount of radio-labelled DNA transferred from a donor carrying an I-type plasm id to an R144 excluding recipient suggest that I-type exclusion operates by inhibiting D N A tran sfe r ra th e r th a n m atin g -p a ir form ation (Hartskeerl and Hoekstra, 1985). 1.4 Barriers to Horizontal DNA Transfer of Plasmids Productive plasm id exchange not only requires a m ode of transfer b u t also establishm ent of the DNA in the recipient cell. Such exchange is achieved p rim a rily by the process of co n ju g atio n and su b se q u e n t a u to n o m o u s replication in the new host. This section focuses on barriers th at m ay be encountered by plasm ids transferring by the process of conjugation. A barrier can be described as any biological process, active or passive, w hich can lim it the level of gene exchange betw een organism s (Matic et al., 1996). One b a rrie r is the physical b o u n d a ry th at exists b etw een bacteria sep arated in n a tu re into different m icrohabitats. A nother b arrier, w hich operates betw een different species of bacteria, is referred to as m aintenance- or host-range (Matic et al., 1996). The replication-m aintenance range of a plasm id is subject to a num ber of factors, which will affect the process of establishm ent of the plasm id and as a consequence influence w hether the plasm id persists w ithin a given population of bacteria (Summers, 1996). Some of these factors include, first, the ability of the au to n o m o u s plasm id to replicate at a rate that m atches the grow th and division rate of its host cell. If the plasm id replicates too slowly then plasm id free cells will begin to appear in the population. If the plasm id replicates too fast its copy num ber 18 w ill rise and the m etabolic load im posed on its host will have detrim ental affects. The plasm id generally overcom es this difficulty by encoding a copy nu m b er control m echanism (Summers, 1996). Second, if the plasm id has a low copy num ber the requirem ent for an active partitioning system is essential to ensure even distribution of plasm ids to d a u g h te r cells u p o n cell division. T hird, the range of host replication enzym es th at can be utilised by the p lasm id will influence w hether the p lasm id is m aintained. It sho u ld be n o ted th at m ost n a tu ra l p lasm id s com m only em ploy ad d itio n al m echanism s th at serve to m ain tain th e ir presence w ithin the bacterial population. One such m echanism involves a plasm id-encoded toxin, which is neutralised by a short lived plasm id-encoded antidote. Typically, these toxins have a long half-life. If the cell continues to divide following the loss of the plasm id the antidote becom es insufficient to n e u tra lise the lethal effects of the toxin and the cell becom es exposed (Sum m ers, 1996). C om m on exam ples include colicin pro d u ctio n by colicin p ro d u cin g plasm ids (reviewed in Luria and Suit, 1987). The transfer range of a plasm id is generally m uch broader than its host ran g e, as show n by either the recovery of plasm id-borne transposons in recipient cells after a ro u n d of conjugation or the use of shuttle vectors as reporters for the transfer range of a plasm id (Wilkins, 1995). Evidently, the potential for exchange of genetic m aterial am ong a different variety of bacterial species is enorm ous. Finally, another barrier to plasm id transfer is that im posed by bacterial restriction-m odification system s. These are ubiq u ito u s in n a tu re and are therefore likely to be encountered frequently by tran sferrin g DNA. R-M system s are centrally im portant to this thesis and will be discussed in m ore detail below. 19 1.5 Bacterial Restriction-Modification Systems R-M system s are com m only d etected in b acteria fo u n d in every ecological niche and taxonom ic g roup (W ilson and M urray, 1991). These sy stem s generally consist of tw o en zy m es w ith o p p o sin g in tra ce llu lar activities, able to recognise and react w ith short specific sequences of DNA. The targ e t sequences of these en zy m es v ary in len g th , co n tin u ity , and sy m m etry and w hether they are u n iq u e or d eg en erate. The restriction endonuclease recognises its target sequence and catalyses cleavage of DNA either w ithin the sequence or away from it. The cognate m odification enzym e recognises the sam e targ et sequence to its endonuclease c o u n te rp a rt and catalyses the addition of a methyl group to either an adenosyl or cytosyl residue in each strand of the sequence. M ethylation renders D N A refractory to cleavage by the restriction endonuclease (Wilson and M urray, 1991). Both stra n d s of the target sequence are m eth y la te d . H ow ever, hem im ethylation, w here only one stran d of the sequence is m ethylated, is usually adequate to prevent cleavage. Such a property is especially im portant d u rin g replication of the bacterial chrom osom e where the d aughter strand of the duplex will be unm ethylated (Wilson and M urray, 1991). The phenom enon of restriction and m odification w as first described 50 years ago when it was observed that certain strains of bacteria could restrict the grow th of infecting bacterial viruses. However, an explanation of events at the m olecular level w as not available for another 10 years (A rber and Dussoix, 1962; Bickle and K ruger, 1993). It w as these early findings w hich led to the view that the biological function and ecological role of R-M system s was to protect the host cell from invasion by foreign DNA, nam ely viruses. O ver recent years, the ecological role of R-M system s has come u n d er debate. The discovery that m any bacteriophages often escape the effects of restriction, particularly if they posses few target sites or encode antirestriction systems, has led to a num ber of conflicting views as to w hat the biological role of these system s are (Matic et al, 1996; See section 1.5.6). 20 The classic R-M system s can be classified into one of three distinct g ro u p s based on co-factor requirem ent, su b u n it com position of enzym es, structure of recognition sequence and the position of cleavage by the restriction endonuclease enzym e (Yuan and H am ilton, 1984). H ow ever, it should be n o ted th at several R-M system s identified do not m eet these criteria for classification. One such exam ple includes restriction enzym es th at only recognise and cleave m odified DNA (Redaschi and Bickle, 1996). It is beyond the scope of this thesis to discuss all types of R-M system s that exist, therefore, the reader is directed to review s by W ilson and M urray (1991), Bickle and K ruger (1993) and Redaschi and Bickle (1996) for further details. The classic type I and type II R-M system s are of relevance to this thesis and therefore the follow ing sections will focus on these two types only. 1.5.1 Type I R-M systems Type I R-M system s are the m ost complex system s discovered so far (Redaschi and Bickle, 1996). Three subunits, R (restriction), M (m odification) a n d S (specificity) encoded by the genes hsdR, h s d M and h s d S , form a m ultifunctional enzym e th at can catalyse both cleavage and m ethylation. In ad d itio n , these enzym es have also been show n to have DNA topoisom erase activity (Yuan and H am ilton, 1984; Redaschi and Bickle, 1996). Co-factors req u ire d by type I enzym es for activity include S - a d e n o s y lm e th io n in e (A doM et), M g2+ and ATP. A lthough the S subunit dictates the sequence recognised in both the m ethylation and restriction reactions, an enzym e can be isolated containing only S and M subunits. This p ro d u ct has been found to fun ctio n as a m odification m ethylase. The R subunit, on the other hand, cannot function independently of either of the other tw o subunits (Hadi et al., 1975; Redaschi and Bickle, 1996). Type I enzym es recognise asym m etric bipartite sequences that include an internal non-specific spacer of six to eight bp (Yuan and Ham ilton, 1984). In addition, parts of the recognition sequence outside the internal spacer m ay also 21 be degenerate. The nature of the ends of the restriction fragm ents created by type I enzym es are elusive, although they are resistant to 5' end phosphate labelling by polynucleotide kinase (Endlich and Linn, 1985b). Cleavage by type I restriction enzymes usually occurs at variable distances of 400 to 7000 bp aw ay from the recognition sequence (Redaschi and Bickle, 1996). D ue to the nature of cleavage reactions, detection of type I R-M system s by in vitro digestion of DNA using bacterial cell extract is difficult. Such system s are generally detected in vivo by a series of m olecular tech n iq u es often in c lu d in g restrictio n sensitive bacteriophages (Wilson and M urray, 1991). O ther criteria used to characterise type I R-M system s include chrom osom al location of hsd genes, enzym e subunit structure, genetic organisation, co-factor requirem ent, reaction m echanism and recognition sequence (Bickle and Kruger, 1993). T ype I R-M sy ste m s w e re c la ssic ally a s s o c ia te d w ith th e E nterobacteriaceae fam ily of bacteria as m ost system s d isco v ered w ere id en tified in strains of E. coli and species of C i t r o b a c t e r , Salmonella a n d K le bs ie ll a (Redaschi and Bickle, 1996). The discovery of type I system s in M y c o p l a s m a p ulm on is (Dybvig and Yu, 1994) and Haemophilus influenzae R d (Fleischm ann et al ., 1995) suggests that these systems m ight be m ore prevalent in n atu re than original findings im ply (Barcus and M urray, 1995; Chilley and W ilkins, 1995). U nfortunately, the complex characteristics of these system s h in d er their identification. Type I system s can be organised into one of four genetically related fam ilies based on a series of m o le cu la r ap p ro ach es involving com plem entation tests, DNA hybridisations, im m unological tests and DNA sequence com parisons (Daniel et al., 1988; C ow an et al., 1989). These families include IA (K); EcoBI, EcoKI, EcoDI, S t y SBI, S t y SPI, S t y S Q l and S t y SJIfa; IB (A), EcoAI, EcoEl and C/f Al; the plasm id-encoded IC (R124) fam ily including EcoR124 and EcoDXXI and finally ID; StySBLI (Bickle and K ruger, 1993; T itheradge et al., 1996). The h s d genes of each fam ily have been c h arac terise d by D N A sequencing and DNA hybridisation studies. 22 H om ologies only exist w ith in families betw een either the h s d M or hsdR gene. No hom ology exists betw een the genes of different families apart from short sequence m otifs comm on to DNA adenine m ethylases and ATP-binding proteins (Bickle and Kruger, 1993; Redaschi and Bickle, 1996). A shared feature of all these hsd genes is that they are arranged into tw o continuous transcriptional units. The h s d M and h s dS genes form an operon, w hich is transcribed from the P m o d prom o ter and h sdR is transcribed from its own prom oter, Rres. Systems belonging to the IA and IB families have the sam e gene organisation w ith h s d R gene proceeding h s d M and hsdS. These genes also have the same location w ithin the bacterial chromosome. M ost of the type IC system s are encoded by plasm ids w here the gene o rg an isatio n is sim ilar to th at of the ID fam ily, h s d M and h s d S proceeding hsdR (Redaschi and Bickle, 1996; Titheradge et al., 1996). The S subunit of the type I enzym e can be divided into three functional dom ains; the am ino term inus, w hich recognises the trinucleotide half of the b ip a rtite recognition sequence; the central dom ain, w hich recognises and determ ines the num ber of nucleotides w ithin the non-specific spacer; and the carboxy term inus, w hich recognises the other half of the recognition sequence. A certain degree of hom ology exists betw een the hsdS genes from different fam ilies, p articu larly the region thought to be responsible for S-m ediated protein-protein interactions w ith M and R subunits. The hsdS gene contains tw o regions of non-hom ology, which are believed to be responsible for specific sequence recognition. Hom ology is only ever detected in these regions if the recognition sequences of the enzym es are the sam e (C ow an et al., 1989; Redaschi and Bickle, 1996) Rearrangem ent of p arts of the hsdS genes by recom bination has been d e m o n stra te d in th e lab o ra to ry and is b eliev ed to occur in n a tu re . Recom binations alter the three dom ains of the S subunit involved in sequence recognition, w hich in tu rn results in new sequence specificities being form ed. Such a phenom enon is believed to pu t type I enzym es at an advantage over 23 other R-M system s as they can random ly change their sequence specificity (Bickle and Kruger, 1993). 1.5.2 Type I enzyme activity M ost of the following inform ation about type I enzym e activity comes from stu d ies w ith EcoKI and EcoBI. The type I three-co m p o n en t enzym e com plex can function as either an endonuclease, a DNA m ethylase, an ATP hydrolase coupled to restriction or a DNA helicase (B urckhardt et al., 1981a). The type I enzym e devoid of any co-factors has little affinity for DNA. H ow ever in the presence of Ado-Met the enzym e becom es activated and will bind DNA regardless of its sequence or if it is m ethylated. If the recognition sequence is present, then the enzyme form s a m ore stable recognition com plex th at can react to three different m ethylation states including fully m odified, fully u n m o d ified or hem im ethylated (heteroduplex). If the reco g n itio n sequence is fully m odified, the presence of ATP leads to the dissociation of the enzym e from the DNA (Burckhardt et ah, 1981b). In contrast, if the sequence is u n m eth y lated th en the enzym e b in d s to the DNA as a tig h tly b o u n d com plex, a process th at requires the presence of ATP (W ilson and M urray, 1991). This tightly bound complex is different from that enzym e state w hich binds any sequence of DNA suggesting that a further conform ational change in the enzym e has occurred (Bickle et ah, 1978). In fact, this conform ational change is due to the enzym e losing bound Ado-Met in the presence of ATP. One hypothesis to explain why these enzym es cleave aw ay from their reco g n itio n sequences is th at once the enzym e is b o u n d tig h tly to its u n m eth y lated recognition sequence it translocates DNA in both directions sim ultaneously. The EcoBI enzyme has only ever been show n to translocate DNA in one direction (Endlich and Linn, 1985a). D uring translocation, the enzym e rem ains attached to its recognition sequence and DNA is looped past in a process requiring ATP hydrolysis. In fact, ATP hydrolysis continues after cleavage has gone to completion, which in circular m olecules is believed to be 24 associated w ith stalled translocating enzym es, due to the topological barriers these enzym es face if tw o sites are far apart and the DNA m olecule has not been cleaved by other enzym es (Bickle et al., 1978). C leavage occurs som e distance from the recognition sequence w hen two sim ilar com plexes collide (S tu d ier and B andyopadhyay, 1988; W ilson and M urray, 1991). F u rth er ev id en ce to su p p o rt this hypothesis is th at w hen cleavage reactions are sy n ch ro n ised , the sizes of resu ltin g restriction frag m en ts co rresp o n d to cleavage occurring m idw ay betw een tw o adjacent reco g n itio n sequences (B urckhardt et al., 1981a; B urckhardt et al., 1981b; W ilson and M urray, 1991; Redaschi and Bickle, 1996). Finally, if the sequence is hem im eth y lated then ATP induces the form ation of the m ethylase complex using free Ado-Met as a m ethyl donor to m ethylate the other strand. In the case of type I enzymes, the base m ethylated is alw ays an adenosyl residue. N ot only does Ado-Met function as an allosteric effector b ut is also the m ethyl donor and only co-factor req u ire d in the m ethylation reaction. Once m ethylation is completed, the enzym e dissociates from DNA in a process that does not require ATP hydrolysis. Interestingly, the m ethyltransferases of the EcoKI enzym e are the only prokaryotic enzym es to show a strong preference for hem im ethylated DNA. Such enzym es are often referred to as m aintenance m ethylases as their presence ensures m ethylation follow ing DNA replication. In contrast, the E c o A l m e th y ltra n s fe ra s e m ethylates DNA de novo (Kelleher et al., 1991). 1.5.3 Regulation of expression of type I R-M genes T here are c irc u m sta n c e s w hen re g u la tio n of re s tric tio n m odification activities w ithin the bacterial cell is essential. and These include eith er during establishm ent of an R-M system in a new stra in or d u rin g changes in the physiology of the cell, w hich m ight lead to underm odification of the host genom e. The latter situ atio n is know n to arise d u rin g cell starvation (Bickle and Kruger, 1993; Redaschi and Bickle, 1996). 25 As already m entioned, the genetic organisation of the type I hsd genes is such th at h s d M and h s d S form an operon and h s d R is expressed from a separate prom oter. However, exam ination of specific hsdk-lacZ operon fusions u n d e r co nditions sim ilar to those d escrib ed above sh o w ed no a ltered e x p re ssio n p a tte rn s at the tra n sc rip tio n a l level for th ese genes w h en transferred to a naive cell (Prakash-Cheng et al., 1993). D espite there being no obvious control of these h s d genes at the transcriptional level, transm ission of novel type I R-M genes is surprisingly efficient (Suri and Bickle, 1985; Redaschi and Bickle, 1996). s u g g e s t th a t d u rin g tra n sfe r of th e hsd g e n es, th e Such findings m o d ific a tio n m eth y ltra n sfe ra se is functional in the recip ien t b efore th e re stric tio n endonuclease. As early as 1965, Glover and Coulson described experim ents w hich show ed th at the restriction phenotype of the EcoKI R-M system is d elay ed follow ing conjugative transfer. These experim ents invo lv ed the conjugative transfer of the EcoKI R-M genes from an E. coli K-12 H fr donor to an E. coli B F" recipient. In other experiments, the appearance of the restriction phenotype w as delayed in the recipient for 15 generations w hilst m odification w as observed im m ediately (Prakash-Cheng and Ryu, 1993). Since there is no evidence for transcriptional control of expression of hsdk genes, these findings su g g est th at control m u st be at the translational or p o st-tran slatio n al level (Prakash-Cheng et al., 1993). A spontaneous m u tan t of E. coli C has been isolated, w hich is sensitive to the receipt of h s d k genes. The m u tan t gene resp o n sib le, h s d C , m u st som ehow m ediate control of type I R-M systems in E. coli K-12 (Prakash-C heng et al., 1993). It was postulated that the HsdC peptide m ight be required to influence assembly of the EcoKI enzyme (Dryden et al., 1997). Analyses of the pathw ay responsible for the assembly of EcoKI suggests that at a particular stage the HsdR subunit m ay become susceptible to proteases (D ryden et al., 1997). In a screening of strains of E. coli K-12 that are deficient in proteases, M akovets et al. (1998) identified two possible candidates ClpX and ClpP that m ake up the E. 26 coli ClpXP protease. At least one of these products, ClpX or ClpP is required in the recipient for the efficient transm ission of EcoKI. These results im ply that the ClpXP protease is involved in m o d u latin g restriction activity and that delayed restriction during conjugative transfer occurs at the post-translational level (Makovets et al ., 1998). Over recent years m uch em phasis has been placed on understan d in g the com plexities of transcriptional and translational control in the regulation of p ro tein availability w ithin the cell. The roles of certain p ro teases in the regulation of gene expression in both prokaryotic and eukaryotic organism s is becom ing increasingly m ore evident. It has long been know n th at proteases, p resen t w ithin all cells, degrade p ro tein s th at are u n sta b le or abnorm al. H ow ever, during regulatory proteolysis, specific proteases have the ability to adjust and regulate the am ounts of available protein w ith in the cell by targeting specific proteins for degradation, in particular, reg u lato ry proteins and key m etabolites. Therefore, in ad d itio n to control at the level of transcription and translation, regulatory proteolysis plays an essential role in allow ing the cell to respond to external stresses and developm ental signals. Cleavage by the protease renders the protein biologically inactive and m ay also resu lt in exposing the p ro tein to fu rth er d eg rad atio n by o th er cellular proteases. Cells th at are m utants in such initiating p ro te a se s te n d to accum ulate an abundance of the target protein (Gottesman and M aurizi, 1992; Gottesm an, 1996). Proteins susceptible to degradation by proteases generally fall into three classes. The first class includes those pro tein s th at are eith er unstable, dam ag ed or abnorm al d u e to m utations w ith in the co rre sp o n d in g gene. External stresses that lead to the induction of a p articu lar stress response w ith in the cell often resu lt in the accum ulation of abnorm al or dam aged proteins. Therefore, it is not surprising th at a large p ro p o rtio n of genes in d u ced d u rin g such stress responses encode proteases. particularly true for the heat shock response (Gottesman, 1996). 27 This finding is The second class includes proteins that are required w ithin the cell for lim ited tim es and are quite often involved in complex regulatory pathw ays. These proteins have often been referred to in the literature as 'tim ing proteins' (Gottesm an, 1996). The final class includes proteins th at form p a rt of a m ulti-com plex s tru c tu re , w h ere u n d e r certain c o n d itio n s ra p id d e g ra d a tio n of the uncom plexed protein m ay occur (Gottesm an, 1996). This is believed to be the case during assembly of the type I restriction enzym e, EcoKI (M akovets et al., 1998). It is envisaged that under norm al conditions the H sdM subunit of the type I restriction enzym e protects the H sdR subunit from attack by proteases. H ow ever, one idea is that during transm ission of type I R-M genes, the ClpXP protease com petes w ith the HsdM subunit for interaction w ith H sdR, w hich delays the production of the active endonuclease complex (M akovets et al., 1998). The effect of a c l p X m utation on the transm ission of the EcoKI h s d genes by conjugation is more extreme than the effect of a clpP m utation. Such resu lts im ply th at ClpX m ay be involved in initially targeting the H sdR com ponent, which in tu rn m akes it a substrate for further degradation by the ClpXP protease. ClpP can associate w ith two com ponents to form a protease. These com ponents include either ClpA or ClpX as molecular chaperones. Both ClpA and ClpX form the ATPase subunit of the complex, which is essential, as m ost proteolytic reactions req u ire ATP hydrolysis. W hen com plexed w ith the protease, these ATPase com ponents regulate protein degradation by binding, unfolding and translocating substrates into the proteolytic cham ber of ClpP. Follow ing proteolysis, the products of the degradation reaction diffuse out of the ClpXP or ClpAP com plex (W ickner and M aurizi, 1999). The genes responsible for ClpX and ClpP lie w ithin the same operon, w hich is regulated by a heat shock prom oter. In addition to recent findings that the ClpXP protease is involved in degrading the H sdR com ponent of EcoKI and EcoAI restric tio n en d o n u clease enzym es, the p ro tease is also resp o n sib le for 28 d eg rad atio n of protein, PI addiction protein, Mu repressors, and RpoS, w hich is a sigma factor required for gene expression du rin g stationary phase (Gottesman, 1996). 1.5.4 Type II R-M systems The type II R-M systems are the least complex and have been discovered in virtually every class of bacteria over 150 of w hich w ere isolated in E. coli and Salmonella strains (Roberts and Macelis, 1994; Redaschi and Bickle, 1996). Type II system s typically com prise separate restriction and m odification enzym es that act independently of each other. The only co-factors required are M g2+ ions by the endonucleases and A do-M et by the m ethyltransferases. The ability of these enzym es either to cleave or to m ethylate DNA at fixed positions w ith in their recognition sequences has led to their exploitation in m any branches of m olecular biology and to the w ide screening of bacteria for new system s. Several thousand type II R-M systems have been discovered, bu t only a few have been fully characterised. It should be n o ted th at m ost new specificities discovered are duplicates of other systems and are referred to as isoschizom ers. The recognition sequences of type II enzym es generally consist of betw een four to eight specific nucleotides, quite often th e sequences are sym m etric and m ay be continuous or interrupted (Wilson and M urray, 1991). M ost endonucleases act as hom odim ers w here each subunit interacts w ith one half of the recognition sequence and cleavage of both DN A stran d s is co­ o rd in ated . In contrast, the m ethyltransferases norm ally act as m onom ers, w hich transfer one m ethyl group to one stra n d per D N A b in d in g event (N ew m an et al., 1981). The EcoRI endonuclease is com m only isolated as a hom odim er. Exam ination of the EcoRI-DNA com plex in d icated th a t the active sites of the enzym e are placed directly against the phosphodiester bonds th at are hydrolysed d u rin g the cleavage reaction (M cClarin et al., 1986). M ethylation at these sites presum ably prevents such a complex form ing due to 29 steric hindrance. Cleavage of DNA by type II enzymes can produce either 5' or 3' overhangs or blunt ends. Plasm ids transferring by the process of enterobacterial conjugation do so in single-stranded DNA form. W hether type II restriction enzym es are able to cleave single-stranded DNA is still debatable. Evidence has been reported to su g g est th at some type II restriction enzym es cleave single-stranded DNA, alth o u g h w hether such enzymes act on single-stranded DNA or tem porary duplex structures that form in the DNA strand is unknow n (N ishigaki et al., 1985). In addition, it has been found that EcoRI is capable of cleaving ssDNA that is im m obilised on cellulose by ligation. How ever, the cleavage reaction is less efficient than that which cleaves dsD N A (Bischofberger et al., 1987). M any ty pe II R-M system s have been cloned and the n u cle o tid e sequence of the structural genes determ ined. Surprisingly, no hom ologies exist betw een the amino acid sequences of cognate restriction and m odification enzym es, which suggests that they m ust have evolved independently of each o th er (C handrasegaran and Smith, 1988; W ilson, 1991; Redaschi and Bickle, 1996). Furtherm ore, it is rare for any hom ology to exist betw een restriction endonucleases and their isoschizomers, despite the two enzym es recognising the sam e target sequences (Wilson and M urray, 1991; Bickle and Kruger, 1993). It is envisaged th at the m ethyltransferase enzym e evolved first, m ethylating the bacterial genom e, u p o n w hich the restriction endonuclease could then invade (Bickle and K ruger, 1993). The endonuclease and m ethyltransferase of a cognate type II system possess separate target recognition dom ains (TRD) and as a consequence, each enzym e em ploys a different strategy for recognition of their target sequences. Base analogue substitution experim ents w ere able to show that each enzym e recognises a different base in the target sequence that is com m on to bo th enzym es (N ew m an et aL, 1981). Such a feature of type II R-M system s probably lim its these enzym es from developing new specificities during their evolution (Wilson and M urray, 1991). 30 A subclass, classified as type II system s due to co-factor requirem ent consists of those restriction enzym es th at recognise and cleave at a precise distance one-20 base pairs from their recognition sequence. These are referred to as type IIS systems, w here the S stands for shifted cleavage (Szybalski et al., 1991). 1.5.5 Regulation of type II R-M systems For reasons described in section 1.5.3, expression of type II restriction and m odification activities need to be regulated. Type II R-M system s vary in their genetic organisation and their gene expression patterns. C onsequently, it is difficult to generalise; therefore, the m echanism s described can only be used as examples, which m ay apply to some know n systems. One way in w hich type II R-M reactions can be m odulated is by a passive m echanism that relies on the restriction enzym e functioning in a m ultim eric form . Most type II restriction enzymes function as hom odim ers and cleavage re q u ire s th e accu m u latio n of R su b u n its. In c o n tra st, m o st of the corresponding m ethyltransferases function as monomers. C onsequently, there is a lag betw een the tw o opposing enzym e activities. This m echanism of control can be applied to the EcoRI system (Greene et al., 1981). Evidence that the restriction genes of some type II R-M system s can be cloned w ithout the corresponding m odification gene suggests th at E. coli is able to repair a certain am ount of DNA dam age created by these enzymes. The restriction genes that have been successfully cloned include those of P ae R 7 l, T a q l , Av a il , Ha ell, H i n f l , P s t l and X ba l. H ow ever, colonies of strains differ from others in th at they appear tran slu cen t an d are susceptible to high frequency loss of the plasm id which carries the restriction gene. In addition, synthesis of the restriction enzyme is greatly reduced com pared to that in a m odified strain (Lunnen et a l , 1988; Redaschi and Bickle, 1996). Regulation of restriction and m odification activities by som e type II system s involves the product of another tightly linked ORF (Tao et al., 1991). 31 The p red icted am ino acid sequences of these C (controller) proteins have p o te n tia l DN A b in d in g p ro p e rtie s su ch th a t th ey m ay fu n ctio n as transcriptional repressors or activators. M utations in the C gene of the P v u l l sy stem resu lts in a restriction-less p h e n o ty p e , w h ic h in tu rn can be co m p lem en ted in trans by the w ild-type gene. D isruption of the C gene of BamYLI results in decreased restriction and increased m ethylase activity (Bickle and Kruger, 1993). Nine R-M systems that have been sequenced have one of these ORFs; five share sequence hom ology and are able to com plem ent each other in trans. Such findings suggest that these C genes m ay have evolved as a fam ily in dependently of RM genes as no hom ology is found betw een them (Redaschi and Bickle, 1996). 1.5.6 Ecological roles of bacterial R-M systems Early studies w ith R-M systems and the finding that they are w idespread led to the m odel that their m ain ecological role was to provide bacteria w ith protection against infection by bacteriophages. Such a concept is now referred to as the 'cellular defence' hypothesis. There are situations in n a tu re w here the carriage of a novel R-M system is seen as advantageous to the host cell. O ne of these situ atio n s is w hen bacteria specifying a R-M system in v ad e m icrohabitats containing phages that can infect the strain (Korona and Levin, 1993; Bickle and Kruger, 1993; Korona et al ., 1993). A problem w ith the 'cellular defence' hypothesis is that restriction is n o t an absolute barrier to transferring DNA and that a fraction will escape destruction and acquire m ethylation. Hence the R-M barrier is only transient and consequently, the 'cellular defence' is not the only role that can be assigned to R-M system s (Korona and Levin, 1993). A further short com ing in the 'cellular defence' hypothesis is that it does not explain restriction enzym es that have long recognition sequences of 8 bp, w hich are not likely to be present in the genom es of m any bacterial viruses (Naito et al., 1995). Also, the 'cellular defence' hypothesis does not explain the extrem e diversity and specificity of 32 sequence recognition (O'Neill et al., 1997). Hence, restriction-m odification is reg ard ed as being able to slow the rate at w hich DNA transm ission occurs rather than stop it (Matic et al., 1996). Bacteria probably develop m ore long term p ro tectio n from phage infection by sp o n tan eo u s m u ta tio n of genes encoding phage receptor sites (Korona et al., 1993). A n o th er p o ten tial role assigned to bacterial R-M system s is th at restriction potentiates genetic recom bination in n atural populations of bacteria by cleaving entrant DNA molecules into fragm ents th at have recom binogenic properties (Wilson and M urray, 1991; K orona and Levin, 1993). M ost DNA fragm ents generated by restriction enzym e cleavage in vivo w ill be subject to degradation by the RecBCD nuclease [exonuclease V] (Dixon and Kowalczyski, 1993). H ow ever, recom bination is possible if these fragm ents contain chi sites (x), w hich are short specific 8-bp sequences able to stim ulate the recom bination p ro p erties of the RecBCD enzym e. F urtherm ore, ex p erim en ts have been conducted to show that recom bination of fragm ents generated by restriction can take place in the absence of RecBCD, rely in g c o m p letely on the recom bination p ath w ay s of either Red of phage X or RecE of the cryptic prophage rac. H ow ever, these results are not indicative of events in nature. R ecom bination during interspecific DNA tran sm issio n is unlikely as the extent and degree of sequence divergence that exists will not only delim it the activity of enzym es that control the initial stage of recom bination, strand separation, b u t recom bination will also be inhibited by the m ism atch-repair system (Matic et al., 1996). Nevertheless, 0.1-1% of n atu ral populations of E. coli carry m utations in their m ism atch-repair system s. E vidence to suggest th at restriction-m odification m ay influence recom bination in E. coli is from the mosaic patterns observed in the nucleotide sequences of trp operons of n atu rally occurring strains of Escherichia coli (ECOR-collection) (M ilkm an, 1997; Matic et al., 1996). A m ore recent role, originally assigned to type II R-M system s, is that these paired R-M genes act as selfish sym bionts, w hich serve to force their 33 m aintenance on the host cell. Plasm ids carrying type II R-M system s have increased stability in E. coli and attem pts to displace the resident plasm id by another th at does not carry the same R-M or M genes results in host cell death (Naito et al., 1995; K ulakauskas et al., 1995). Such data are consistent with the 'toxin-antidote' hypothesis, Where the restriction enzym e acts as the toxin and the m ethylase as the antidote. Any cells that continue to divide after losing the p lasm id are unable to protect them selves from attack by rem aining restriction enzym e (Kusano et al., 1995). It is envisaged th at the ability of these plasm id-borne type II R-M system s to act as selfish sym bionts probably contributed to their evolution as gene pairs (Naito et al., 1995). M utations in both hsdR and h s d M genes of type I R-M system s are easily isolated w ithout detrim ental effects on the host cell. Such findings suggest that the 'toxin-antidote' hypothesis applied to type II R-M system s cannot be applied to type I system s. In addition, the ability of the host cell to m odulate restriction activity by type I enzymes also suggests that this particular type of RM system is not selfish in nature (O'Neill et al., 1997; section 1.5.3). 1.6 Collb Leading Region Genes and their Functions As described in section 1.3.2, conjugative transfer of a plasm id starts at oriT by specific cleavage of the nic site by the relaxase enzym e. A unique DNA strand is released called the T-strand, which is transferred to the recipient cell w ith a 5'-3' polarity (W ilkins and Lanka, 1993). The o r iT is orientated such that the transfer genes of the plasm id enter the recipient cell last (Howland and W ilkins, 1988). W hether delayed transfer of the Tra genes to the recipient cell reflects the need for th eir continued expression in the donor cell or their delayed expression in the recipient is unknow n. H ow ever, one possible explanation is that their delayed transfer gives priority to entry of genes that are expressed early in the recipient and prom ote establishm ent of the plasm id in the new ly infected cell. Such genes are located w ithin the leading region of 34 the plasm id, which can be defined as the first segm ent of the plasm id to enter the recipient cell during conjugation. L eading region genes th at have been identified on Collb include ssb, psiB and a r d A , the functions of w hich are described in the following sections. The leading regions of plasmid F and Collb are believed to be inessential for conjugation as disruption of know n genes w ithin this sector result in only a slight decrease in conjugation efficiency u n der laboratory conditions (Loh et al., 1989; Jones et al.f 1992). The length of the C ollb lead in g region is unknow n. H ow ever, it is estim ated to extend 15 kb from the nic site of the plasm id, although these num bers are purely arbitrary (Rees et al., 1987). 1.6.1 Single-stranded DNA-binding protein (SSB) Single-stranded DNA-binding protein (SSB) found in E. coli is know n to play an im portant role in DNA replication, recom bination an d repair, in p a rticu la r m ism atch repair (Chase et al., 1983; Golub and Low, 1986). SSB proteins have a high affinity for single-stranded DNA and bind predom inantly in a tetram eric form w ith no sequence specificity (H ow land et al., 1989). In v i t r o studies w ith SSB have show n th at som e of the roles of the p ro tein include i) p rev enting reassociation of unw ound DNA, ii) protecting ssD N A from nucleases, iii) enhancing the processivity of D N A p olym erases, iv) stabilisation and organisation of replication origins, and v) prom oting binding of DNA polym erase to tem plate DNA (reviewed in M eyer and Laine, 1990). Furtherm ore, in the presence of SSB, the helix-coil transition tem perature of dsD N A is lowered enabling the DNA m olecule to m elt m ore easily (Williams et al., 1983). Collb and a n u m b er of other enterobacterial p lasm id s including F, encode SSB p ro tein s w hich are very closely related in th eir am ino acid composition to the E. coli counterpart (Chase et al., 1983; Golub and Low, 1985; H ow land et al., 1989). 19 plasm ids from 12 incom patibility groups are know n to carry ssb genes that can complement defects caused by an ssb-1 m utation in 35 the corresponding E. coli gene (Golub and Low, 1985; Golub and Low, 1986). In ad d itio n , the ssb genes of these p lasm id s w ere found to share extensive sequence hom ology w ith the ssb gene of plasm id F. These ssb genes w ere found to be co-ordinately regulated w ith the plasm ids fertility genes (Golub and Low, 1986). Such findings suggest th at the role of SSB d uring conjugation is possibly to prevent drainage of the cellular counterpart d uring replication of the com plem entary plasm id strand. H ow ever, m utations in plasm id ssb genes have very little effect on the ability of the plasm id to form transconjugants (Golub and Low, 1986; H ow land et al., 1989). The Collb ssb gene m aps in the leading region of the plasm id 11 kb from the oriT (see Fig. 1.1). The gene was originally isolated by its ability to rescue the tem perature and UV sensitivity of E. coli chrom osom al ssb-1 m utant. In addition, the gene has been show n to be hom ologous to the F ssb gene by cross­ hybridisation studies and sequence com parisons. Both share 84% sequence sim ilarity at the nucleotide sequence level and 83% at the predicted amino acid level. The am ino acid sequence of Collb SSB contains two signature m otifs characteristic of sin g le-stra n d ed DNA b in d in g proteins. O ne of these signatures is the DN A binding motif that has been characterised for the E. coli SSB protein, which is perfectly conserved in Collb SSB (H ow land et al., 1989). 1.6.2 Plasmid SOS inhibition (PsiB) The product of the Collb psiB gene acts during conjugation to prevent induction of the bacterial E. coli SOS response in recipient cells as a function of its intracellular concentration (Jones et al., 1992). The SOS response is triggered by certain conditions th at either cause dam age to DNA or interfere w ith the process of DNA replication. Long-lived regions of single-stranded DNA that are created during such dam age or those that exist near a stalled replication fork result in activation of RecA. Activated RecA in the presence of co-factors such as ATP cleaves the LexA transcriptional repressor and a num ber of phage 36 repressors, which leads to the induction of a diverse set of specialised genes know n collectively as the SOS regulon (Walker, 1987). O ne p o ssibility is th a t the SOS response is in d u ce d by ssD N A transferring in conjugation or short tracts of ssDNA that are exposed w hen the seco n d stra n d of the p lasm id is g e n e ra te d in the re c ip ie n t d u rin g discontinuous lagging strand DNA synthesis (see section 1.3.2). Plasm id psiB genes, therefore, m ay have evolved to perm it conjugative transfer in single­ stra n d e d form w ith o u t the additional com plication of in d u c in g the SOS response and host physiology (Jones et ah, 1992). The PsiB p ro tein of IncFII plasm id R6-5 has been show n to prevent induction of the SOS response by in h ib itin g activ atio n of the RecA co-p ro tease ra th e r th a n p re v e n tin g expression of genes associated with the SOS regulon (Bailone et al., 1988). The Collb psiB gene w as identified through its cross-hybridisation to the psiB gene detected on R6-5 (Bagdasarian et al., 1986; Jones et ah, 1992). Collb PsiB shares 85.4% and 84.6% identity to the corresponding p roteins of F and R6-5, respectively (Jones et al., 1992). T hrough cross-hybridisation stu d ies conducted by Chilley and Wilkins (1995) it was found that plasm ids carrying ssb genes more often than not also carry a psiB gene. Transposon m utagenesis of plasm id ssb genes leads to the generation of a strong Psi+ phenotype, w hich is presum ably the resu lt of the polar effects on transcription caused by the antibiotic resistance gene prom oter present in the Tn insertion (Dutreix et al., 1988; Jones et al., 1992). Such findings suggest that expression and location of bo th ssb and p s iB are linked and that no transcriptional term in ato r exists betw een the two (Jones et al., 1992). 1.6.3 Alleviation of restriction of DNA (ArdA) The product of the Collb ardA gene acts as an anti-restriction protein, w hich is active in alleviating restriction by three families of type I restriction enzym es (Kotova et a l , 1988; Delver et al., 1991; Read et al., 1992). The A rdA protein alleviates restriction mediated by type I restriction enzym es, possibly by 37 interfering w ith the functional integrity of the R-M enzym e complex. The ArdA protein of Collb is unable to alleviate the effects of restriction m ediated by type II and type III enzymes (Read et al., 1992; Belogurov and Delver, 1995). The ard A gene like ssb and psiB is also know n to have a counterpart in the leading regions of plasm ids belonging to other Inc groups. H om ologues h av e been identified in the lead in g reg io n of e n tero b acterial p lasm id s belonging to the IncFV group, the B-Il-K set of the I com plex and the IncN group. All ardA + plasm ids identified so far are enterobacterial plasm ids, w hich m ay be significant as type I restriction enzym es are classically associated w ith m em bers of the Enterobacteriaceae (Chilley and W ilkins, 1995). Furtherm ore, these a r d A genes also have the same transcriptional orientation w ithin their respective leading regions (Delver et al., 1991; Read et al., 1992; Belogurov et al., 1993; Chilley and W ilkins, 1995), although Collb does n o t contain the equivalent ardB antirestriction gene or the a r d K and ardR regulators th at are found in the leading region of IncN plasm id pKM lOl (Belogurov et al., 1993). The ArdA protein operates specifically during conjugation to protect the im m igrant plasm id from destruction by type I restriction enzym es that m ight be present in the recipient cell. The function of ArdA and its interaction w ith type I restriction enzym es will be discussed further in section 1.7. 1.6.4 Zygotic induction of leading region genes Detectable levels of expression of both ssb and psiB is only observed w hen such genes are carried by plasm ids that are derepressed for transfer functions or if the cell is exposed to SOS-inducing treatm ent (Golub and Low, 1986; Jones et al., 1992). Such findings suggest that these genes are un d er the control of the plasm id's fertility inhibition system (H ow land et al., 1989). W hen Collb is derepressed for transfer functions th ro u g h m utation of the Fin system, ssb and psiB are expressed at low levels in an established strain. Transcriptional activity of the leading region of naturally derepressed plasm id F was also only detected at low levels, w hich suggests that genes contained 38 w ithin this region are only fully expressed under unique conditions (Cram et al., 1984). During the process of conjugation, the products of these Collb genes accum ulate in a transient b u rst in the conjugatively infected cell. Such enhanced expression was originally dem onstrated by a technique involving the in sertio n of a prom oterless lacZ, operon fusion into p s i B and ss b on ColVodrd. Conjugation experim ents w ere perform ed u sin g the specially constructed Collb::lacZ plasm ids and ssb or psiB gene product w as quantified by m easuring the level of P-galactosidase specific activity. Shortly after initiation of conjugation a 10-20 fold burst of expression of bo th s s b and p s i B w as observed. Rapid expression starts w ithin 10 m inutes and begins to plateau out at 20-30 m inutes (Jones et al., 1992). To determ ine w hether enhanced expression of ssb and psiB occurs in the recipient or donor cell a selective phage lysis technique w as used. D uring a series of conjugation experim ents using either T6 bacteriophage sensitive or resista n t donor or recipient cells, it w as found that accum ulation of (3galactosidase could be found only in recipient cells. Such results clearly show that both psiB and ssb genes are induced upon entry to the recipient cell during conjugation. It w as these findings w hich led to the application of the term zygotic induction (Jones et al., 1992). However, it should be noted th at the term zygotic induction w as originally used to describe the in d u ctio n of X p ro p h ag e d u rin g H fr m atin g experim ents, w hich resu lte d in lysis of F’ recipient cells (Hayes, 1968). The ardA gene has not yet been show n to be zygotically induced during conjugation. H ow ever, indications th at a r d A expression is also enhanced d u rin g conjugation comes from the findings that optim al alleviation of type I restriction occurs during the process of conjugative transfer of C o U b d r d A rdA + as opposed to transfer by transform ation (Read et al., 1992). 39 1.6.5 Hypotheses for zygotic induction The potential roles of ssb, psiB and a r d A in the establishm ent of Collb shortly after conjugative transfer are understood. H ow ever, one intriguing question that rem ains to be answ ered is how are these leading region genes regulated so that they are expressed early and transiently in the recipient cell. Several hypotheses have been generated, som e of w hich have been exam ined ex p erim en tally (Jones et al., 1992; Roscoe, 1996). The follow ing tw o lists (1.6.5.1, 1.6.5.2) sum m arise these hypotheses. 1.6.5.1 Possible modes of positive regulation i) Activation is by transfer-associated plasm id topology changes. Expression of genes from certain types of p rom oter can be initiated du rin g changes in localised topology of DNA, such that the angle of the -35 and -10 sites of the prom oter form an optim al alignm ent that is recognised by RNA polym erase (W ang and Syvanen, 1992; Roongta, 1990). D uring the process of conjugation, transfer of DNA to the recipient cell is associated w ith significant changes in the topology of the plasm id. H ow ever, treatm ent of Collb containing cells w ith an inhibitor of DNA gyrase, coum erm ycin-A l, w hich reduces the negative supercoiling density of the plasm id does not result in any significant enhancem ent of ssbv.lacZ or psiB::lacZ expression (Roscoe, 1996). ii) Activation occurs w hen genes are transferred in single-stranded form. There is evidence th at during enterobacterial conjugation the plasm id D N A is transferred to the recipient cell in single-stranded form (see section 1.3.2). The usual action of RNA polym erase on single-stranded DNA is the p ro d u ctio n of prim ers for DNA replication. H ow ever, it is envisaged that transcription of leading region genes m ight be initiated on the transferring single plasm id-strand during conjugation. iii) Activation is by a plasm id- or chrom osom al-encoded positive regulator. 40 Leading region gene expression m ay be controlled by a regulator that is induced only during the process of conjugation. Subjection of E. coli to any stim uli that alter the physiological state of the cell often result in the induction of a co-ordinated set of genes, know n collectively as a stress-response. If conjugation induces any such responses, the regulators or gene products could act to control leading region gene expression. In E. coli the heat shock stress response can be induced by stimuli other than heat. H ow ever, conjugative tran sfer of Collb does not stim ulate this stress response as show n by its inability to induce a heat shock reporter construct in recipient cells during a 60 m inute conjugation (Roscoe, 1996). 1.6.5.2 Possible modes of negative regulation i) Transcription is regulated by a tran s -acting repressor th at accum ulates in the new ly infected recipient cell. It is proposed that such a repressor inhibits transcription of leading reg io n genes in the new ly infected cell. This hypothesis w as tested by transferring a Collb plasm id that carries a prom oterless lacZ operon fusion in psiB to a recipient cell harbouring a Collb plasm id that is defective in encoding a functional exclusion system. However, the hypothesis w as ruled out because u n d e r such conditions zygotic induction of psiB was still observed (Roscoe, 1996). 1.7 Restriction-Avoidance Strategies Employed by Plasmids It is known that bacteriophages have evolved a num ber of strategies for avoiding the effects of restriction during their life cycles. Exam ples of phage anti-restriction m echanism s include i) inhibition of restriction enzym es, ii) virus-coded DNA m odifying enzym es, iii) stim ulation of host m odification functions, iv) in co rp o ratio n of u n u su al nucleotides into p hage DNA, V) coinjection of p hage restriction inhibitory head pro tein s w ith DNA, vi) d estru ctio n of restriction endonuclease co-factors and vii) counterselection 41 against restriction sites in phage (review ed in K ruger and Bickle, 1983 and Bickle and Kruger, 1993). P lasm id s that have been fo u n d to encode re stric tio n -av o id a n ce m echanism s include m em bers of the Incl and IncF group of enterobacterial plasm ids. Restriction-avoidance is n o t a feature com m on to all transferring p lasm id s. For exam ple, the IncP p la sm id s are ex trem ely sensitive to restriction. The lack of a restriction-evasion m echanism m ay explain w hy IncP replicons are deficient in restriction enzym e recognition sites. In a sim ilar w ay to som e phages, these plasm ids have presum ably been exposed d u rin g their evolution to strong selection for the elim ination of restriction enzym e target sequences by m utation and selection (Wilkins et al., 1996). Plasmid Collb is rem arkably resistant to destruction by type I and type II restrictio n enzym es d u rin g its tran sfer by conjugation, even th o u g h the plasm id carries m ultiple enzyme target sites (Read et al., 1992). The resistance of Collb to type I restriction enzyme involves a specialised plasm id-encoded an tirestrictio n m echanism know n as A rdA (see section 1.6.3). H ow A rdA specifically acts to alleviate restriction by type I enzymes is not clear. H ow ever, it seem s unlikely that the protein interacts w ith the recognition sites of the enzym es as the protein is highly acidic and therefore unlikely to bind to DNA (Delver et al., 1991). It seem s more likely that ArdA interferes w ith the type I en zy m e com plex in a w ay th at p re v e n ts restric tio n a ctiv ity an d n o t m ethylation (Belogurov and Delver, 1995). The A rdA protein of Collb and a n u m b er of other plasm ids share a com m on am ino acid sequence m otif also found in the T7 0.3 anti-restriction protein. The m otif is only nine am ino acid residues in length and is believed to be the interaction site for anti-restriction proteins and restriction endonucleases (Belogurov et al., 1993). The m otif also shares similarity with a conserved sequence know n as the Argos repeat, w hich is found in the DNA sequence specificity (S) polypeptide of type I enzym es. These motifs in anti-restriction proteins m ight allow the protein to com pete 42 w ith the S subunit, w hich plays a role in the assem bly of the type I enzym e com plex (Belogurov and Delver, 1995). C ollb encodes another restric tio n -av o id a n ce m echanism th a t acts indepen d en tly of ardA. The process becom es m anifest after several m inutes of conjugation and alleviates restriction by both type I and type II enzym es during second or subsequent rounds of transfer (Read et al., 1992). The m olecular basis of this second evasion process has yet to be elucidated. 1.8 Aims and Direction of this Study The initial aim of my studies w as to further the u n d e rstan d in g of the restriction-avoidance m echanism s encoded by Collb w hich are active against type I and type II restriction enzymes and operates independently of ardA. The first hypothesis tested is that transfer of m ultiple copies of C ollb d u rin g co n ju g atio n overw helm s the restriction system in the rec ip ie n t cell by su b strate saturation (Read et al., 1992). A second hypothesis is that m ultiple transfers induce a change in the physiology of the recipient cell, w hich in turn leads to a transient breakdow n of the restriction barrier. A third possibility is th at Collb possesses a gene that encodes an anti-restriction function active against both classes of restriction enzyme. An experim ental priority in the testing of these h y p o th eses w as to establish w hether Collb could confer cross-protection on a restriction-sensitive p lasm id w hen co-transferred together. The first tw o h y p o th eses p red ict a llev iatio n of restriction by Collb in trans. C onsequently, a restriction- sen sitiv e plasm id th a t is co-transferred w ith Collb sh o u ld acquire som e protection from destruction by restriction. For this purpose a genetic system w as developed, w hich relied on m easuring Collb-m ediated rescue of plasm id R751 (IncP(3) from destruction by EcoKI and EcoRI. e x p erim e n ts w as th e discovery of a czs-acting phenom enon. 43 One outcom e of these re s tric tio n p ro te c tio n A n im p o rtan t experim ental tool w as the construction of a specific m u tan t of Collb that was deleted of 14 bp from the nic region. The resulting plasm id w as completely defective in self-transfer by conjugation, yet proficient in form ing m ating-pairs and in expressing a full com plem ent of plasm id p ro te in s . This m u ta n t h as p ro v e d in v a lu a b le in v a rio u s g enetic investigations. Also examined was the classic A rdA system . The u n d erly in g question w as w hether ar dA is transiently overexpressed (zygotically induced) in the recipient cell during conjugation like other leading region genes such as psiB and ssb. This question was addressed using the R751 system and the Collb nic m u tan t to m easure A rdA -m ediated alleviation of restriction d u rin g different stages of conjugation. Data obtained suggested that ardA is subject to zygotic in d u ctio n . To in vestigate p o ten tial reg u la to ry m ech an ism s g o v e rn in g en h an c ed expression of leading reg ion genes d u rin g co n ju g atio n , the n u cleo tid e sequence of 11.7 kb of the leading region w as determ in ed in a co llab o ratio n w ith Dr. Steven Bates w orking in our laboratory. Such an approach was invaluable in the generation of a m odel of zygotic induction and the developm ent of the concept that plasm ids carry 'early genes' expressed by transcription of the incom ing single-stranded DNA. 44 Chapter Two Materials and Methods 2.1 Plasmids and Bacterial Strains The g e n o ty p e s, so u rces an d re le v a n t re s tric tio n -m o d ific a tio n phenotypes of bacterial strains and plasm ids used in this w ork are described in Tables 2.1.1 and 2.1.2. Bacterial strains used w ere all Escherichia coli K-12 derivatives. Plasm ids constructed during this w ork are described in Table 2.1.3. For the p u rp o se of publication, some recom binant plasm ids w ere assigned alternate nam es to those used in this thesis. W here this has occurred, the alternative nam e has been indicated in Table 2.1.3. Plasm id pHP45Q contains an o m eg a (Q ) in te rs p o s o n ; th is 2.0 kb fra g m e n t c o n ta in s a streptom ycin/spectom ycin resistance gene flanked by inverted repeats carrying transcription and translation term ination signals and polylinkers (Prentki and Krisch, 1984). S pontaneous naladixic acid and rifam picin variants of strains GI65, NM654 and NM816 are designated w ith the suffix N or R. All w ere used in conjugation experim ents and are not show n in Table 2.1.1. These resistant strains w ere isolated by plating 1.0 ml of overnight culture, concentrated to 0.1 ml, onto n utrient agar plates containing either naladixic acid or rifam picin at 25 jag ml’1. Plasmids R751 and pHP45Q and bacterial strain DL307 w ere gifts courtesy of Peter Thorsted, C hris Thom as and D avid Leach, respectively. Bacterial strains NM654, NM816 and NM840 were kindly provided by N oreen M urray. 45 Table. 2.1.1 Escherichia coli Strains S train Description or genotype Source or reference D H 5a F" f80A L a c Z A M I 5 r e c A l e n d A l H anahan (1983) g y r A 9 6 thi-1 h s d R 1 7 (rKi" m Ki+) s u p E 44 r e l A l deoR A il a c Z Y A - a r g F ) U169 DL307 (FS1585) GI65 (C600) C600 recD1009 TetR Stahl et al. (1986) thr-1 leu-6 thi-1 l a c Y l f h u A 2 1 A rber and Dussoix (1962) M erryw eather et a l (1986) M akovets et al. (1998) N. M urray (rKi+ m Ki+) J0 8 (BW103) d n a + leu deoB r p sL cir re cA l NM654 C 6 0 0 A h s d R M (r^i u ik i) NM 816 NM 840 C600AhsdR (rKf itiki+) C600 g y r A 9 6 A h s d R M (rKf mKf) W3110 Prototrophic (rKi+mKi+) M akovets et al. (1998) Backman, (1972) Table. 2.1.2 Plasmids Plasm id D escription Source or reference pLG221 Incll derepressed for transfer C o l l b d r d l cib::Tn5 Km R Laboratory stock H ow land et al. (1989) pLG272 Collb cib::Tn5 KmR H ow land et al. (1989) pLG292 C o l l b d r d l a r dA :: a ph A -l KmR Read et al. (1992) O ther plasm ids pBR328 4.9 kb ApR CmR TcR Soberon et al. (1980) Rees et al. (1987) Incll plasm ids Collbdrdl pCRS3 pCRS4 pHP45Q pBR328Q(ColIb oriT+ Sail 15.9 kb) ApR CmR pBR328Q(ColIb Sail 10.1 kb) ApR CmR 4.3 kb ApRSmRSpR pIC19H pJH16 PACYC184n(£coRI m RI+) CmR pLG290 pUCl9Q(ColIb ar dA + Sall-Pstl 2.7 kb) NTP14 ApR 17 kb cea+ rR]+mRi+ A pR pR R 2-l pRR4 R751 pSBCPl 2.7 kb ApR pRRlQ[ColIbdrdl E l (20.4 kb) + E16 (2kb)]A p+ pBR328Q(ColIb eex+EcoRl 3.4 kb) ApR CmR 56 kb IncPP, Tn402/5090, T n 43 21Q (RK2 terR, ter A , Bglll -3.8 kb) Tcr TpR pICl9HQ(ColIb ssb Pstl-Clal 1.1 kb) A R Ap Rees (1986) Prentkl and Krisch (1984) M arsh et al. (1984) H eitm an et al. (1989) Read et al. (1992) Sm ith et al. (1976) Roscoe (1996) Roscoe (1996) T horsted et al. (1998) Laboratory stock pSBCP2 pSBSP2 pIC19HI2(ColIb Clal-Pstl 1.6 kb) A pR pIC19Hn(ColIb psiB Pstl-Sall 1.6 kb) R AP 2 .7 k b A p R Laboratory stock Laboratory stock a pUC19 Y anisch-Perron et a l (1985) Table. 2.1.3 Plasmids constructed during this work N am e D escription Published as p N A l/p N A la pN A 2 pUC19C2(ColIb or iT + nic+ Psfl 1.6 kb) ApR pIC19HD(ColIb o r iT + nic+ Pstl-Clal l.lk b ) A R Ap pNA3 pIC19H£2(ColIb o r i T + rtic+ Sall-EcoRl 8.0 kb) A R Ap pNA4 p N A 2 A n i c : : S p h l - K p n l A pR pNA5 pNA2Anic::Kpnl-Sphl A pR pNA6 pN A 7 pIC19HD(ColIb a r d A + Bglll-Sall 6.1 kb) A pR pNA2Amc::EcoRI A pR pNA8 pNA7Q(Anzc::omega intersposon EcoRI 2.0 kb) A pRSmRSpR pIC19HQ(ColIb B g in - P s tl 0.62 kb) A pR pIC19Hn(ColIb Pstl 2.0 kb) A pR pLG2061 pLG221Anic::£J KmR SmRSpR pLG2062 pLG292A nic::Q KmR SmRSpR pIC19H£2(ColIb Ps tl 0.76 kb) A pR pLG2063 pNA9 pNAlO p N A ll pNA 12 pNA13 pNA14 pNA15 pLG2059 pLG2060 N T P14fl(n intersposon Clal 2.0 kb) (rR]+mRj+) A pR SmRSpR pIC19H O (n intersposon Clal 2.0 kb) A pR SmRSpR pNA16 pBR328D(NTP14 Sall-Clal 9.7 kb) (r^'mRi") A pR CmR pNA17 pBR328D(NTP14 Sall-Clal 7 3 kb) (rRI+m Ki+) A pR Cm R P lasm id s pLG2059, pLG2060, pLG 2061, pLG2062, pL G 2063 are also described in A lthorpe et al. (1999). Table 2.1.4 Oligonucleotide primers used during this work Primer Sequence (5’-3’) Template Reference CCCAGTCACG ACGTT GTAAAACG M13/pUC GIBCOBRL (1994) M13/pUC Universal Forward M13/pUC Universal Reverse Primer B CTGC ATA AG ACTATG AT GC Primer C Primer 5 Primer 6 Lead 1 Lead 2 Lead 3 Lead 33 Lead 331 Lead 332 Lead 333 Lead 334 Lead 335 Lead 4 Lead 44 Lead 43 Lead 432 Lead 443 Lead 433 Lead 431 Lead 434 Lead 5 Lead 55 Lead 6 Lead 66 Lead 7 Lead 77 Lead 771 Lead 772 Lead 8 Lead 88 Lead 881 Lead 882 GCGTATTCCTTGTCACCG CGGAATTCCGTTTTCAGGCCATTATAGCC " CGGAATTCCGGC A ATT GT AAT ACCGTCCC " CAGCCAGGAAAGATGGCATCG pNA9 // CATCA ATT GTTGTTCAT ATCG ATGC GGTACATCACCTTCCAGTAGCCG pLG290 // CCTTCTGCCGTCCACTTCTTCG // GACATCGGACCGTAGCGGCAGC // GACCGTGCAATAAAGGTGAAGG // GGCAGGCGGCG AAAA AGTCG // GAGGGACCGGGTGACCGC AGC // GATATCGCGGTCGTGCTCTGC it GCGTAAATCCGTCGCACAGACG CTCCGCTTTTACCCGGCCATGC a CCACTGAAAGCACTGAAGAGGTGC // CGCTCTATTTTGACT ATG AGG // CATCGCCGAAGCCT GGTTTGACC // GGACAGCTGCCCCGTGACGTGG CGACAGGCAGCTGCTGGCGCACC // GTCTCTACCCTCCAACTCCC GGAACT GCATCTTCCTGCGG pNA13 // CCGCTGGTTGAT AT CACT GTCG // CGTATAAACATAACCATCTTCACCC // CTTCTGCTCACTGC AGGTCGACG GCACAAATCTGTCAAGCTGACC pNAlO // CGTGCCATGTTCTCTGTGTCACG // GTAAGGCCACTGCGCCTGATGG // CAAT GTGAT GA ATCAG ACTTTACC // CGTATCCGTCATCAGGCGGACC // GTTTTGTGGCTGGTACATCTCTCC // CTTTTTCA.AAGGTGTTTTA.TGA.CC // GCATTTTGCCGGGCCTTTTGG AGCGGATAACAATTTCACACAGG // pNA2 // Althorpe et al., (1999) // // // Chapter 7 // // // // // // // // // // tt // // // // // // // // // // // // // // // // // 2.2 Media and Chemicals 2 .2.1 Media U nless otherw ise stated, bacterial strain s w ere ro u tin ely grow n in nutrient broth (NB) and on nutrient agar (NA) plates. Suspensions of bacterial strains w ere m ade in phosphate buffer (PB). B acteriophage X w as alw ays resuspended and diluted in X buffer (^.B). For long term storage of strains, overnight cultures of bacteria were streaked onto NA p lates supplem ented w ith the appropriate antibacterial agent and incubated at 37°C for 12 hours. Frozen stocks w ere m ade by resuspending the bacteria in NB + 20% glycerol, w hich were then stored in vials at -20°C. Frozen cultures m ade in this w ay have a shelf-life of -10 years. All m edia and chem icals w ere resu spended in distilled water unless indicated otherwise. Nutrient agar (NA) Oxoid No.2 nutrient broth 25 g I"1 Bio-gene agar 16 g I’1 Luria agar (LA) Oxoid tryptone 10 g I_1 Oxoid yeast extract 5 g 1" NaCl Bio-gene agar Phosphate buffer (PB) k h 2p o 4 N aH P04 N aCl M gS 04.7FI20 46 X buffer (XB) Tris-HCl 0.06 M M gS 04.7H20 2.5 g I”1 G elatin 0.05 g I'1 p H 7.2 Soft nutrient agar (SNA) BBL trypticase 10 g F1 N aC l 5 g I'1 Bio-gene agar 6.5 g I"1 Bottom layer agar (BLA) BBL trypticase 10 g F1 N aC l 5 g I'1 Bio-gene agar 10 g F -I M inimal media (MM) M inim al salts* 100 ml F1 20% glucose 50 ml F1 0.1% Thiamine HC1 1 ml F1 Ca-Mg salts** 10 ml F1 Bio-gene agar 16.2 g F1 *Minimal salts: N a 2H P 0 4 (anhydrous) 79 g F1, KH2P 0 4 (anhydrous) 30 g F1, NaCl 5 g F1, N H 4C1 10 g F1. ** Ca-Mg salts: CaCl2, 1.47 g F1 (0.01M), M gS04.7H20 4.6 g F1 (0.1M). Thiamine HC1 w as prepared in distilled w ater, filter sterilised and stored at 4°C. 47 2.2.2 A ntibacterial agents Unless otherwise stated, antibacterial agents were ad d ed to media at the follow ing concentrations: ampicillin (Ap), 100 pg ml" ; chloram phenicol (Cm), 25 1 jllg 1 1 ml" ; kanam ycin (Km), 25 p g m l' ; naladixic acid (Nal), 25 pg ml" ; rifam picin (Rif), 25 pg m l'1; streptom ycin (Sm), 50 pg ml"1; spectom ycin (Sp), 50 1 _1 pg m l’ and tetracycline (Tc), 7.5 pg ml" . Chemicals used for a-com plem entation tests (blue an d w hite colony selection) were added to LA Ap plates at the following concentrations; X-gal, 40 pg ml"1 and IPTG, 0.1 mM ml"1 (Sambrook et al., 1989). Stock solutions of X-gal w ere always m ade up in dim ethylform am ide. 2.2.3 11.1 xPCR buffer Tris-HCl pH 8.8 476 mM m l'1 (NH4)2S04 121.9 mM ml' MgCl2 49 mM ml"1 2-m ercaptoethanol (100%) 5 pi ml"1 EDTA pH 8.0 50 pM ml"1 dATP 11.1 mM ml"1 dCTP 11.1 mM ml"1 dGTP 11.1 mM ml"1 dTTP 11.1 mM ml"1 BSA (DNAse free) 1.25 pg ml"1 2.3 Phenotypic Characterisation of Bacterial Strains 2.3.1 Colicin production Strains to be tested for colicin production w ere grow n overnight in NB. O ne pi of culture w as spotted onto a NA plate w hich w as incubated for a fu rth er 12 hours at 37°C. The bacteria w ere then killed by exposure to ch lo ro fo rm v a p o u r for 10 m inutes, allo w in g a n o th e r 15 m in u tes for ev ap o ratio n of the vapour. The p late w as overlaid w ith 3.5 m l of SNA 48 containing 0.1 m l of a 1:20 dilution of an overnight culture of colicin sensitive cells. After 5 hours incubation at 37°C, colicin-producing strains w ere indicated by a clear zone of killing of the indicator strain around the test strain. 2.3.2 recA mutants To test strains for the carriage of a r e c A m u tatio n , the bacteria w ere streak ed onto N A plates and treated w ith a series of tim ed exposures to •I “2 shortw ave UV light (0.5 J sec’ m’ ). The grow th of the test strain that had been irradiated for 0, 10, 15, 30 and 60 seconds was com pared to th at of a wild type strain, GI65 and a recA strain, J08. RecA m utants have increased sensitivity to UV irradiation. 2.3.3 recD mutants The RecD p henotype of DL307 w as tested usin g the bacteriophage U\4M5885(%0 Red' G am ’) and XMM5659(%+ R ed’ Gam’) supplied as gifts from D avid Leach. The screen relies on the ability of both the %° and %+ phage to form indistinguishable large plaques on recD strains w hilst on rec+ strains, the %° phage form sm all plaques and the %+ phage large plaques (Russel et al., 1989) 2.3.4 Sensitivity to bacteriophage Bacterial strains w ere routinely tested for their ability to restrict a virulent strain of bacteriophage X. Overnight cultures of strains to be tested w ere diluted in NB and 10 mM M gS04 to an A60o absorbance value of 0.05. The bacteria were then grow n at 37°C w ith vigorous shaking to an A600 of 0.5 (m id­ exponential phase). The bacteria were resuspended in 3 m l of SNA (~2 x 10s cells) and used to overlay a BLA plate. Xvir.0, propagated on NM654, Xvir.K on GI65 and Xvir.Rl on GI65(NTP14) w ere used, d ep en d in g on the restriction phenotype of the strain being tested. The phage w ere diluted in XB and 10 pi spotted onto the top layer of the BLA plate. The phage suspension was allowed to adsorb into the agar for 20 m inutes and the plate then incubated at 37°C, 49 overnight. Com parison of the efficiency of plating (e.o.p) of the phage on the test strain to that of a control strain allow ed the restriction phenotype to be d eterm in ed . 2.4 Strain Manipulations 2.4.1 Bacterial conjugation Overnight cultures were diluted in NB to an A 60o of 0.05 using a Bausch and Lomb 501 spectrophotom eter and grow n w ith vigorous shaking at 37°C to 8 1 an A60o of 0.5 (2 x 10 cells ml" ). C onjugation m ix tu res co n tain ed equal volum es of donor and recipient bacteria. All in te rru p te d filter m atings involved filtering 0.5 m l of m ating m ixture onto a 25 m m cellulose acetate filter (W hatm an® , pore size 0.45 Jim) for each tim e point, w hich w as then placed on the surface of a prew arm ed n u trient agar plate for the conjugation period. Bacteria w ere released from each filter by vigorous blending in 5 ml phosphate buffer. The titre of the transconjugants w ere determ ined by serial d ilution and 100 jil sam ples spread onto n u trien t agar p lates containing the appropriate antibacterial agent for selection. For liquid m atings equal am ounts of donor and recipients were added to a 200 ml conical flask and incubated at 37°C in a w ater bath w ith gentle rotation (49 rpm ). A 0.3 ml sam ple w as recovered and added to 2.7 ml of PB in a glass tube. The tube w as agitated violently in a blender for 10 seconds to term inate conjugation. A gain, the titres of transconjugants w ere determ ined in the sam e w ay as for filter m atings. 2.4.2 Transformation of bacterial strains The m ethod u sed to transform b acterial cells w ith DNA w as by a chemical m ethod based on that of Cohen et al. (1972). Com petent cells w ere m ade by diluting overnight cultures of bacteria in NB to an o A 60o of 0.05 and grown with vigorous shaking at 37 C to an of 0.5 (m id-exponential phase). A 60o value The cells w ere pelleted at 4°C in an MSE 50 M icrocentaur and then w ashed twice by centrifugation in 5 ml of ice cold 0.1 M C a C l2. The pellet was finally resuspended in 1 ml of 0.1 M CaCl2 giving a concentration of approxim ately 2 x 109 cells m l'1. C om petent cells m ade this w ay can be stored for up to 24 hours at 4°C. 1-5 p g of DNA was added to 250 p i of com petent cells in a sterile E ppendorf tube. Incubation of the cells for 1 h o u r on ice w as followed by a 4 m inute heat-shock at 42°C. The cells w ere then returned to ice for a further 2 m inutes before being transferred to 5 m l of p rew arm ed NB. After 1 ho u r incubation at 37°C w ith vigorous shaking the cells w ere pelleted, resuspended in PB, and diluted to give single colonies w hen sam ples w ere spread onto selective NA plates. 2.4.3 Bacteriophage propagation The bacteriophage to be p ropagated w as plated on the a p p ro p riate bacterial strain at a titre that form ed a num ber of p laq u es after overnight grow th. The SNA overlay was scraped from the BLA plate and transferred to a Sorvall centrifuge tube. A ddition of 1.5 ml ^.B and 0.5 m l of chloroform to the tube w as followed by the mix being thoroughly m ashed w ith a sterile spatula. The tube was allow ed to stand for 30 m inutes at 37°C before being centrifuged at 8000 g in a Sorvall SS-34 fixed angle rotor for 5 m inutes. The supernatant w as rem oved to a sterile test-tube containing 0.1 ml of chloroform , vortexed, then incubated for a fu rth er 15-20 m inutes at 37°C. The aqueous layer was rem oved to a screw cap tube and the phage suspension stored at 4°C. 2.4.4 Transposon insertion mutagenesis Tn5 m utants w ere obtained in Escherichia coli strain W3110 (W inans and Walker, 1983). 51 2.5 DNA Manipulations 2.5.1 Small scale plasmid DNA isolation For isolation of high q u ality D N A sam ples for use in sensitive applications such as DNA sequencing, cloning or PCR, a Q IA prep spin m inikit (Qiagen) w as used according to the m anufacturers instructions. For standard small scale preparations of plasm id D N A an alkaline lysis m ethod based on that of Birmboim and Doly (1979) w as used. 1.5 ml of an overnight culture of plasm id containing cells, grow n w ith appropriate antibiotic selection, were centrifuged at 13,000 g for 3 m inutes in a MSE M icrocentaur. The pellet was resuspended in 100 jll1 of lysis buffer (25 mM Tris-HCl, pH 8.0; 10 mM EDTA, 50 mM sucrose) and incubated on ice for 5 m inutes. A ddition of 200 pi of alkaline SDS (0.2 M N aO H ; 1 % Sodium dodecyl sulphate) to the cells was followed by gentle m ixing and 5 m inutes incubation on ice. 150 pi of 5 M potassium acetate (pH 4.8) w as then added to the tube, the contents mixed carefully before a final 5 m in u te incubation period on ice. The lysate was centrifuged for 10 m inutes to rem ove the cell debris and the su p ern atan t transferred to a clean E ppendorf tube containing 400 pi of a p h e n o l/c h lo ro fo rm m ix tu re [50% phenol, 48% ch lo ro fo rm , 2% isoam yl alcohol] (Sambrook et al., 1989). After mixing, the tube w as centrifuged for 3 m inutes, the upper phase was removed and transferred to an equal volum e of iso-propanol. The m ixture was incubated on ice for 15 m in u tes and then centrifuged for 30 m inutes to pellet the DNA. The su p ern atan t w as discarded an d 500 pi of 70 % ethanol was added to w ash the pellet. The tube w as centrifuged for 5 m inutes and the pellet air dried before being resuspended in 20 pi distilled water. The concentrations of DNA isolated by this m ethod were estim ated by electrophoresis on an agarose gel along side a control DNA sam ple of know n concentration. RNA w as rem oved from DNA sam ples by o incubating at 37 C for 1 hour in the presence of RNAseA (0.25 pg). 52 2.5.2 Large scale plasmid DNA isolation All large scale p rep aratio n s of plasm id DNA w ere carried out using Q iagen mini kits according to the instructions. 2.5.3 Analysis of the concentration and purity of DNA sam ples DNA yields and p u rity w ere d eterm in ed from the A260/ A 280 values m easured in a Hitachi U-2000 spectrophotom eter. DNA concentrations were calculated where: -50 pg m l’1 of double-stranded DNA, - 40 jig m l'1 of single­ stran d ed DNA or RNA and -20 jig ml"1 of sin g le-stran d ed oligonucleotide have an A260 value of 1. The purity of DNA sam ples w as confirm ed by the A 260/ A 2so values of the sam ple having a ratio of 1.8 to 2.0 (Sam brook et al., 1989). 2.5.4 Restriction enzyme analysis of DNA R estriction en donuclease enzym es, su p p lied by GIBCOBRL, w ere ro u tin ely used to analyse DNA. C onditions and reaction buffers used in digests w ere as recom m ended by the m anufacturers. Reactions w ere generally carried out in a total reaction volume of 20 pi consisting of restriction enzym e, 1 x reaction buffer, -1 pg of DNA and if necessary 0.25 pg of RNAseA. Samples, to be analysed by gel electrophoresis, w ere m ixed w ith 2/10 loading dye (5 mM Tris-H Cl (pH 7.5), 100 m g ml"1 glycerol, 0.01 m g ml"1 brom ophenol blue). Restriction fragm ents w ere separated on 0.6-1.0% (w /v ) Seakem HGT agarose (FMC Corp.) horizontal gels in TAE buffer (40 mM Trisacetate (pH 7.4), 1 m M EDTA) containing ethidium brom ide (10 pg m l’1), at 20100 volts for 2-8 h o u rs. DNA fragm ents w ere v isu a lise d on a short w avelength UV transillum inator and p h o to g rap h ed using either a Polaroid MP4 land camera w ith 15 seconds exposure or a UV visual analyser (Sony im agestore 5000, v e rsio n 7.2). The sizes of the D N A frag m en ts w ere 53 determ ined by com parisons w ith X H i n d Ill m olecular w eight m arkers (23.13, 9.4, 6.56, 4.36, 2.32,2.03, 0.56 kb). 2.5.5 Isolation of DNA restriction fragments All DNA bands w ere cut w ith a scalpel from a 0.8% Seakem HGT agarose gel (FMC Corp.) and transferred to an Eppendorf tube. Purification of the DNA fragm ents from the agarose w as achieved u sin g a QIAEX II Gel extraction kit (Qiagen) according to the m anufacturers instructions. The yield and purity of DNA isolated was calculated as described in section 2.5.3. 2.5.6 Dephosphorylation of DNA fragments To prevent re-ligation of single-stranded 5' p ro tru d in g ends created by som e restriction enzym es, the 5' p h o sp h ate resid u es of the D N A w ere rem o v ed by treatm en t w ith calf intestinal alkaline p h o sp h a ta s e (CIAPP h arm acia B iotech Ltd.) according to the m a n u fa c tu re rs in stru c tio n s. Rem oval of the CIAP enzym e w as by phenol : chloroform extraction and isolation of the DNA was by ethanol precipitation. 2.5.7 Ligation of DNA fragments Vector and p lasm id DNA to be ligated w ere d ig e ste d w ith the appropriate restriction enzymes. The corresponding DNA fragm ents from the d ig estio n w ere iso lated from an agarose gel after se p a ra tio n by gel electrophoresis (section 2.5.5). Ligations w ere carried out in a total reaction volum e of 20 pi including 5 units of T4 DNA ligase an d T4 ligation buffer (GIBCOBRL). The concentration of vector: insert DNA w as adjusted so th at a ratio of 1 to 3 was achieved. The ligation reaction was incubated overnight at room tem perature. For blunt-end ligations the same conditions w ere used except the units of T4 DNA ligase enzym e w as increased to 25 per reaction and the vector: insert DNA ratio was changed to 1 to 8. 54 2.5.8 Primers Oligonucleotide primers w ere m ade by the PNACL facility, University of Leicester, using an A pplied Biosystems 394-8 synthesiser. Prim ers arrived in a 35% am m onia solution and w ere precipitated on ice for of M N H 4 C 2H 3O 2 (pH 7.0) and 3 volum es of ethanol. 3 2 h o u rs in 1 volum e The prim ers w ere pelleted by centrifugation in a MSE M icrocentaur for 30 m inutes and w ashed in 70 % ethanol, dried and dissolved in distilled H 2O. The concentrations of single-stranded oligonucleotide w ere calculated as described in section 2.5.3. The num ber of pico moles of prim er w ere calculated and in general 20 pm ol m l ' 1 of prim er used in PCR reactions and 3.2 pm ol m l ’1 u se d in DNA sequencing reactions. 2.5.9 DNA sequencing reactions All DNA sequencing reactions were carried out using the ABI PRISM™ D ye T erm inator cycle sequencing read y reaction kit acco rd in g to th e m anufacturers instructions. Samples w ere analysed by the PNACL facility, Leicester University, on a PE-ABI 377 autom ated DNA sequencer. Sequence data obtained was th en analysed using the abi (A pplied B iosystem s Inc.) sequence analysis program "FACTURA", version 1 .2 . Further analysis of the sequence data w as carried out using the Genetics C om puter Group (GCG) W isconsin Package Suite of program s (version 9.1). FASTA (Pearson and Lipm an, 1988) and BLAST (Altschul et al., 1990) searches w ere carried out th ro u g h the European Bioinform atics In stitu te site on the W orld Wide Web (h t tp : / /w w w .ebi.ac.uk). 2.6 Standard and inverse PCR Primers used for PCR reactions are as described in section 2.5.8. All PCR- amplifications were carried out in a total reaction volum e of 50 pi containing, 5 jil of 11.1 x PCR buffer; 50 ng of tem plate DNA; 20 pm ol ml"1 of each prim er and 5 units of Taq DNA polym erase (Advanced Biotechnologies Ltd.) or w hen the PCR p roduct was to be used in further applications, 5 units of the proof­ reading polym erase BIO X-Act™ (Bioline (UK) Ltd) w as used. w ere carried out in a hybaid om nigene PCR machine. All reactions Exam ple program s include: 95°C-30 sec (Hot start), 1 cycle; 95 C-45 sec, 49-60 C-30 secs, 72 C - 1 m in / kb amplified, 20-35 cycles; 72 C - 1 m in / 1 kb, 1 cycle. 2.6.1 Optimisation of the inverse-PCR protocol for am plification of template pNA2 using primer 5 and primer 6 The first step involved establishing the optim al tem p eratu re at w hich prim er 5 and prim er 6 anneal to tem plate pNA2. Two sets of 25 cycle PCRam plifications w ere set up involving reactions w ith prim er 5 an d universal forw ard (UF) and w ith prim er 6 and universal reverse (UR). The annealing sites for each of these prim ers are indicated on the genetic m ap of pN A 2 presented in Fig. 4.3 (A). A range of annealing tem peratures (49-56°C) w ere tested and the resulting DNA fragm ents produced in each reaction exam ined by agarose gel electrophoresis. H igh annealing tem p eratu res red u ce the am ount of non-specific binding of prim ers to their tem plate w hich in tu rn increases the am ount of specific DNA product yielded (Saiki, 1989). Prim er 5 and prim er 6 w ere inferred to have an optim al annealing tem perature of 53°C as only the desired DNA p ro d u ct w as detected in each reaction at this tem perature (Fig. 4.3, B lane num bers 3 and 4). Further optim isation of the PCR protocol involved exam ining the tim e req u ired for both p rim er 5 and prim er 6 to be fully extended by the T a q polym erase enzym e and result in the am plification of the 3.8 kb pN A 2 tem plate. It is generally accepted that one m inute is the m axim um tim e required for Taq polym erase to extend 1 kb of DNA (Saiki, 1989). Based on this principle, a 35 cycle PCR-amplification of pNA2 was set up w ith prim er 5 and prim er 6, allowing 4 m inutes extension time. The extension p roduct produced 56 by this reaction was analysed by agarose gel electrophoresis, the result of w hich is presented in Fig. 4.3 (B), lane n um ber 5. These findings indicate th a t 4 m in u te s is ad eq u a te tim e for th e am p lific atio n of 3.8 kb of D N A . 57 Chapter Three Genetic Analysis of a Plasmid-Mediated Restriction Evasion Mechanism 3.1 Introduction This chapter describes a series of genetic experim ents aim ed at analysing the m echanism em ployed by ColJbdrd d u rin g conjugation, w hich enables the plasm id to avoid destruction by two different classes of restriction enzym e. C o llb drd is rem arkably resistant to destruction by type I and type II restrictio n enzym es du rin g its transfer by conjugation, even th o u g h th e plasm id carries m ultiple restriction enzym e target sites [~7 EcoKI sites; 20 EcoRI sites] (Read et ah, 1992; Wilkins et ah, 1996). Such resistance is no t a feature com m on to all transferring plasm ids. For exam ple, IncP p lasm id s are extrem ely sensitive to these enzymes during their transfer, despite a distinct lack of restriction enzym e recognition sites in the IncP replicon [R751: 12 EcoKI sites; 5 EcoRI sites] (R751 EMBL accession num ber U67194). C ollb's resistance to ty p e I enzym es involves a specialised plasm id encoded a n ti-restrictio n m echanism know n as ArdA. This system is incapable of alleviating restriction by type II enzymes (Delver et ah, 1991; Read et ah, 1992). Interestingly, after the a r d A gene has been insertionally inactivated, ColJbdrd still displays m arked resistance to type I and type II restriction enzym es during its transfer (Read et ah, 1992). Evidence suggesting a second flnE4-independent restriction evasion process encoded by Collb has been provided by Read et al. (1992). This process becom es m anifest after several m in u te s of conjugation an d allev iates restric tio n of C ollb d r d by type I and type II enzym es d u rin g second or subsequent rounds of transfer. The m olecular basis of this second evasion process has yet to be elucidated. 58 O ne hypothesis proposed by Read et al. (1992) to explain this second restriction-avoidance system of ColJb dr d is that transfer of m ultiple copies of the p lasm id m ay overwhelm the restriction enzym es in the recipient cell by substrate saturation. A second hypothesis is that m ultiple transfers of ColJbdrd induce a change in the physiology of the recipient cell, w hich in tu rn leads to a tran sie n t breakdow n of the restriction barrier. A nother possibility is th at C ollb d r d possesses a gene(s) that encodes an anti-restriction function active against both classes of restriction enzyme. In all cases, the expected phenotype w ould be alleviation of restriction allow ing the next round of plasm id transfer to be successful. Four independent yet related conjugation experim ents are described in this chapter, which are all based on exam ining w hether plasm id ColJbdrd can confer cross-protection on IncPp plasm id R751 from destruction by type I and type II restriction enzymes. To dem onstrate the relative resistance of ColJbdrd to th ese enzym es d u rin g conjugation the first experim ent involved a re ­ exam ination of the effects of type I and type II restriction on the transm ission of plasm ids R751 and ColJbdrd. The second experim ent analysed the effect of co-transferring R751 w ith ColJbdrd from the same donor strain to a restricting recipient. T hird, the effect of tran sferrin g unm odified R751 DNA to a restricting recipient harbouring ColJbdrd w as explored. Finally, Collb w ild type and the cloned Collb exclusion genes (eex ) w ere utilised to investigate w hether expression of the plasm ids restriction-evasion m echanism is triggered by the act of Collb transfer. 3.2 Results and Discussion 3.2.1 Effects of restriction on the conjugative transmission of plasm ids R751 and Collb drd To dem onstrate that ColJbdrd encodes an ardA -independent restriction evasion m echanism , the effects of restriction on the conjugative transm ission of unm odified R751 and ColJbdrd w ere exam ined. The stan d ard experim ent 59 involved an interrupted m ating of a donor strain harbouring R751 or Collb drd and a restricting recipient strain. All conjugations w ere done on filters as plasm id R751 specifies only a surface obligatory m ating system. The effects of restriction on the transm ission of each plasm id w as m onitored by com paring the n u m b er of R751 or ColJbdrd transconjugants produced in the restricting recipients to those produced in the non-restricting counterpart. To sim plify the genetic selections, a rifam picin-resistant (RifR) donor strain and a naladixic acid-resistant (NalR) recipient strain w ere used. One advantage of using a N alR recipient strain is th at naladixic acid inhibits conjugative DNA transfer by sensitive donor cells. Therefore, any additional conjugations that m ight occur on the surface of the NA plates are prevented by the addition of Nal. The donors and recipients used in each m ating experim ent are described w ith in the figure legends. The R751 plasm id used carries tw o selectable antibiotic resistance genes one of which confers resistance to tetracycline (Tc ), the o th er trim eth o p rin (TpR). Col Jb dr d plasm ids used w ere pLG221 and pLG292 b o th of w hich confer resistance to kanam ycin (KmR) th ro u g h the carriage of a Tn5 insertion in the cib (colicin lb) locus of pLG221 and a nontran sp o sab le Tn903 a p h A - 1 insertion in the a rd A gene of pLG292 (Rees et al., 1987; Read et a l , 1992). The tw o R-M system s chosen for stu d y w ere EcoKI and EcoRI as exam ples of type I and type II systems, respectively. The type I system is encoded by the E. coli K-12 chromosome and the type II system is encoded by the n atu ral ColEl-like plasm id NTP14 (Arber and Dussoix, 1962; Sm ith et a l , 1976). Both these R-M system s were originally isolated in strains of Escherichia coli and are therefore presum ed to be native to the Enterobacteriaceae family. Therefore, any plasm id w hose transfer range overlaps this family of bacteria is likely to encounter these R-M systems frequently. The results of the m ating experim ents are presented in Fig. 3.1 and Fig. 3.2. It is noted that data in Fig. 3.2 (A and B) were obtained by repeating p art of the w o rk d escrib ed by Read et a l (1992), w ith the exception th at the 60 Fig. 3.1. K inetics o f transm ission of u n m od ified R751 to recipient cells specifying EcoRI or EcoKI. (A) R751 tran sco n ju g an ts from conjugations of GI65R(R751) d o n o r cells and e ith e r re c ip ie n ts G I65N G I 6 5 N ( N T P 1 4 ) ( • ) . (B ) (O, c o n tro l) o r E coR I r e s t r i c ti n g R751 fro m tra n s c o n ju g a n ts re c ip ie n ts c o n ju g a tio n s of NM654R(R751) d o n o r cells and either recipient cells N M 816N (□ , control) or EcoKI restricting recipients GI65N (■). p la te s co n ta in in g N a l a n d Tc. ex p erim en ts. T ransconjugants w e re selected on NA D ata are th e m e a n v a lu e s fro m th ree Fig. 3.1 Log^g transconjugants per ml EcoRI 6- 4- 0 10 20 30 40 Time (min) Log^o transconjugants per ml EcoKI 0 10 20 Time (min) 30 40 Fig. 3.2. K inetics of transm ission of unm odified pLG221 (Ard+) and pLG292 (Ard") to recipient cells specifying EcoRI or EcoKI. (A) pLG221 transco n ju g an ts from conjugations of GI65R(pLG221) d onor cells an d either recipient cells GI65N ( O , control) or EcoRI re s tric tin g recip ien t G I65N (N TP14) ( • ) . (B) pLG221 transconjugants from NM 654R(pLG221) donor cells an d e ith er rec ip ie n t cells N M 816N ( O , control) o r EcoKI re stric tin g recip ien t GI65N ( • ) . pLG292 tran sco n ju g an ts from NM 654R(pLG292) d o n o r cells an d e ith e r re c ip ie n t cells N M 8 1 6 N (D , control) or EcoKI re s tric tin g recip ien t GI65N (■). T ransconjugants w ere selected o n N A p lates containing N al and Km. D ata are the m ean values of three experm ients Fig. 3.2 k ° 8 l 0 transconjugants per ml EcoRI 4- 0 10 20 30 40 30 40 Time (min) Logio transconjugants per ml EcoKI 6 - 4- 10 20 Time (min) conjugations w ere perform ed on the surface of filters rather than in liquid. In a d d itio n , W ilkins et al. (1996) has described the effects of type II restriction (EcoRI) on the transm ission of R751. H ow ever, the data presented in Fig. 3.1 (A) are m ore com prehensive as they describe the effects of type II restriction on R751 transm ission at intervals over a 40 m inute period. T ransm ission of R751 w as considerably m ore sensitive to restriction th an ColJbdrd. After 40 m inutes, transm ission of R751 w as reduced 10,000-fold by EcoRI restricting recipients and 100-fold by EcoKI restricting recipients (Fig. 3.1 A and B). In contrast, productive transfer of pLG221 to the EcoRI restricting recipient su rg ed after an initial delay w ith only a 10-fold red u ctio n in the n u m b er of pLG221 transconjugants form ed by 40 m in u tes (Fig. 3.2 A ). In terestin g ly , a sim ilar result w as achieved for the tran sm issio n of th e C o llb drd a r d A m utant (pLG292) to the EcoKI restricting recipient (Fig. 3.2 B). This phenom enon w as originally reported by Read et al. (1992). EcoKI restriction had very little effect on the transm ission of pLG221 (Fig. 3.2 B), w ith transconjugant yields reaching the sam e value as those p ro d u c e d in the n on-restricting recipients after 40 m inutes. This EcoK I resistance of pLG221 can largely be attributed to the a rd A gene present on the plasm id (Read et al., 1992). Results in Fig. 3.1 and Fig. 3.2 confirm the findings of W ilkins et al. (1996) and R ead et al. (1992), respectively. Taking into co n sid eratio n the n um ber of restriction enzym e target sites that ColJbdrd and R751 carry (section 3.1) and the effect that restriction has on each plasm ids' transm ission, it is clear th at ColJbdrd encodes an ardA -independent restriction evasion m echanism . 3.2.2 Effects of co-transferring R751 with Collb drd to a restricting recipient The events th at lead to the alleviation of type I and type II restriction a fter m u ltip le tran sfers of Collbdrd are unknow n. The three h y potheses described in section 3.1 predict that restriction in the recipient will be alleviated in trans. This notion w as investigated by testing w h eth er C ollbdrd could 61 Fig. 3.3. Co-transfer of plasm ids R751 and pLG221 to recipient cells specifying EcoRI. (A) Yield of N alR, TcR R751 transconjugants from conjugations of GI65R(R751) d o n o rs an d either recipients GI65N (O , control) or EcoRI restric tin g recipients GI65N(NTP14) ( • ) , GI65R(R751, pLG221) donors an d recipients GI65N(NTP14) (■ ). (B) Yield of N a lR, Km R (pLG221) tra n sc o n ju g a n ts fro m GI65R(R751, pLG221) donors and either recipients GI65N (A, control) or GI65N(NTP14) (A). (C) Yield of N alR, TcR, KmR (R751, pLG221) transconjugants form GI65R(R751, pLG221) donors an d either recipients GI65N (V, control) or GI65N(NTP14) (▼). T ransconjugants w ere selected on N A plates containing N al, Tc, Km. D ata are th e m ean values of three experim ents. Fig. 3.3 pLG221 Logio transconjugants per m l R751 R751 and pLG221 4- !---------- [— 20 30 Time (min) — 0 10 20 30 40 40 Fig. 3.4. Co-transfer of R751 and pLG292 (Ard") to a recipient specifying EcoKI. (A ) Yield of N a lR, TcR (R751) tra n s c o n ju g a n ts fro m c o n ju g a tio n s o f NM654R(R751) d o n o rs and eith er recip ien ts N M 816N ( O , control) or EcoKI restricting recipients GI65N (# ), NM654R(R751, pLG292) d o n o rs an d recipients G I65N (■ ). (B) Y ield of N a lR, K m R (pLG 292) tr a n s c o n ju g a n ts fro m NM654R(R751, pLG292) donors and eith er recipients N M 816N (A, control) o r GI65N (A). (C) Yield of N alR, TcR, Km R (R751, pLG292) tran sco n ju g an ts from NM654R(R751, pLG292) donors and eith er recipients N M 816N (V, control) o r G I65N (Y). T ransconjugants w ere selected on N A p la te s c o n tain in g N al, Tc, Km. Data are the m ean value of three experim ents. Log^o transconjugants per m l <3> NJ 4* oo •n CTQ U> 4l *1 *<l O l K3 NJ ► £» Os oo O H O N3 N» > n CL N3 4^ On 00 on N> ftj 3 CL ^3 r< O N> KJ > i-t CL 4* O Fig. 3.5. Co-transfer of R751 and pLG221 (Ard+) to a recipient specifying EcoKI. (A ) Yield of N a lR, TcR (R751) tra n s c o n ju g a n ts fro m c o n ju g a tio n s of NM654R(R751) d o n o rs and eith er recip ien ts N M 816N ( O , control) or EcoKI restricting recipients GI65N ( • ) , NM654R(R751, pLG221) d o n o rs an d recipients G I65N (■ ). (B) Y ield of N a lR, K m R (pLG 221) tr a n s c o n ju g a n ts fro m NM654R(R751, pLG221) donors and either recipients N M 816N (A, control) or G I65N (A). (C) Yield of N alR, TcR, KmR (R751, pLG221) tran sco n ju g an ts from NM654R(R751, pLG221) donors and either recipients N M 816N (V, control) or G I65N (▼). T ransconjugants w ere selected on N A p la te s co n ta in in g N al, Tc, Km. D ata are the m ean values of three experim ents. Fig. 3.5 pLG221(Ard+) R751 and pLG221(Ard+) per ml R751 6 o to S 4- - 4- or-H W) O hJ 10 20 30 40 10 20 30 Time (min) 40 2Y0 10 20 30 40 confer p rotection on restriction-sensitive plasm id R751 w hen the two plasm ids are co-transferred together to a restricting recipient. The sta n d a rd experim ent involved m ating a donor strain h arb o u rin g b o th R751 a n d Collb drd (pLG221 or pLG292) w ith either an EcoRI or EcoKI re stric tin g recip ien t. R estriction in the recipient strains w as m o n ito red th ro u g h o u t each m ating by exam ining the num ber of R751 transconjugants form ed. A n y alleviation of restriction by transferring Collb drd w o u ld be in d icated b y a n increase in the n u m b er of R751 transconjugants p ro d u ce d c o m p a red to th o se form ed w h en R751 is tran sferred in d e p e n d e n tly of C o llb drd. The n u m b ers of C ollbdrd transconjugants an d R 7 5 1 /C o lIb d rd tran sco n ju g an ts form ed th ro u g h o u t the experim ents w ere also m ea su re d . Such results w ere indicative of the efficiency of the m ating and the activity of the C ollbdrd-m ediated restriction evasion mechanism. The resu lts of these experim ents are presented in Figures 3.3, 3.4 and 3.5. Figure 3.3 (A) show s that w hen R751 is co-transferred w ith pLG221 to an EcoRI re s tric tin g rec ip ie n t the n u m b e r of R751 transconjugants fo rm ed increases only m arginally. This finding provides no com pelling evidence of a tr a n s -acting system operating to alleviate EcoRI restriction. H ow ever, Fig. 3.3 (A) does show a delay in the num ber of R751 transconjugants form ed d u rin g the first 6-10 m inutes of m ating, which is characteristic of that seen in Collbdrd transconjugant pro d u ctio n (Fig 3.3 B), suggesting that the protection conferred on R751, a lth o u g h lim ited, is a ttrib u te d to the C ollb-m ediated restrictio n evasion process. Figure 3.4 show s the data obtained w hen R751 is co-transferred w ith the C o llb d rd a r d A m u tan t (pLG292) to an EcoKI restricting recipient. As w as the case for EcoRI restriction (Fig. 3.3), transferring Collbdrd (A rdA ') was only able to m arginally alleviate destruction of transferring R751 from EcoKI restriction. This fin d in g is in d icated by the 10-fold increase in the num ber of R751 tran sco n ju g an ts form ed in EcoKI restricting recipients w hen co-transferred w ith Collbdrd (Fig. 3.4 A). 62 In contrast, Fig. 3.5 (A) show s th at w hen R751 is co-transferred w ith the a r d A + Collbdrd plasm id (pLG221) to an EcoKI restricting recipient, the yield of R751 transconjugants form ed in the restricting recipient w as equal to those form ed in the non-restricting recipient. Com parisons w ith Fig. 3.4 (A) indicate th at such protection can be attributed to the functional a r d A gene present on pLG221. These results provide a good exam ple of the effects of alleviating restriction in the recipient in trans. Overall, the yield of all three types of transconjugant form ed (A, B and C) in these m atings involving donors carrying R751 and a Collbdrd p la s m id w as reduced b y ~5-fold, w hich m ay be a consequence of using a donor strain su p p o rtin g th e tran sfer of tw o conjugative plasm ids. The reduced yields cannot be ex p lain ed by killing of recipients, since the n u m b er of cells w as m onitored d u rin g each m ating experim ent and found to increase slightly, as expected for a grow ing culture (data not shown). The d ata raise the im portant question of w hether the relative tim e of e n try of R751 an d C ollbdrd into the restricting recipient cells is im portant. C ollb-m ediated alleviation of restriction is delayed by 6-10 m inutes and it m ay be possible th at R751 transfer is already completed by this time. This question is addressed in the follow ing sections. 3.2.3 Rescue of R751 from restriction by C ollbdrd resid en t in the restricting recip ien t The p rev io u s section show ed no significant protection of R751 from EcoRI or EcoKI restriction w hen the donor also transm itted the a rd A m u ta n t of Collbdrd. This section further explores the a r d A -in d e p e n d e n t restrictio n ev asio n m echanism by asking w h e th er C ollbdrd plasm id resident in the recipient can protect transferring R751. The experim ents involved m ating a donor strain harbouring R751 w ith either a EcoKI or EcoRI restricting recipient strain that also harboured Collbdrd plasm ids pLG221 (A rdA +) or pLG292 (ArdA') (Fig. 3.6). Samples of the m ating 63 Fig. 3.6. Transfer of plasm id R751 to a restricting recipient that harbours C ollb. (A) R751 transconjugants from conjugations of GI65R(R751) d o n o rs and either recipients GI65N (O, control), G I65N (N TP14) (# , control) o r G I65N pLG221, N T P 14) (■). (B) R751 tran sco n ju g an ts from N M 654R (R751) d o n o rs a n d recipients N M 816N (O, control), GI65N (# , control) a n d G I65N (pLG292) (■). (C ) R751 tra n sc o n ju g a n ts from N M 816N (O, c o n tr o l ) , NM 654R(R751) d o n o rs a n d re c ip ie n ts G I6 5 N (# , c o n tro l) and G I6 5 N (p L G 2 2 1 ) (■). T ransconjugants w ere selected on N A plates containing N al a n d Tc. the m ean value of three experim ent. D ata are Log^o transconjugants per ml O N> GO ►n CTQ CO ON tn Ci O *3 HH ► •-t CL + KJ On O 00 M o otn o *>—I N3 O > Cl O oJ o K> O M O 4L ON 00 cells w ere rem o v e d at intervals a n d exam ined for the num ber of R751 tran sco n ju g an ts. F igure 3.6 (A) show s the resu lts of tran sferrin g R751 to an EcoR I restricting recipient harbouring pLG221. In such a m ating, the possibility of tran sferrin g R751 entering a restricting recipient w ith o u t Collb present w as elim inated. Therefore, the issue of tim ing of plasm id transfer is not a factor. At 40 m inutes, the yield of R751 transconjugants is 1,000-fold higher than the n u m b er p ro d u c e d w h en the m ating system lacked pLG221. Such results suggest that C o l l b d r d resident in the recipient does alleviate restriction by type II restrictio n en zy m es in trans. The results are very different from those obtained w h en R751 co-transfers w ith pLG221, w here no significant levels of EcoRI protection w ere observed (Fig. 3.3 A). In addition, these results rule out the h y p o th esis th a t the evasion m echanism operates by overw helm ing the EcoRI restriction enzym es by substrate saturation, since such a phenom enon w o u ld req u ire m u ltip le ro u n d s of transfers of C o l l b d r d to the restricting recipient. Figure 3.6 (B and C) also show s that both pLG292 and pLG221 conferred lim ited p ro tectio n on incom ing R751 from EcoKI. It is noted that pLG221 conferred m ore protection than pLG292, particularly during early stages. This p resum ably reflects the constitutive level of A rdA activity in an established C ollbdrd containing strain. Figure 3.6 (B) show s that a resident pLG292 plasm id in the recipient conferred only m odest protection on a transferring R751 plasm id. The 10-fold level of protection is sim ilar to that observed w hen the tw o plasm ids are co­ tran sferred from the sam e donor strain. Therefore, expression of the a r d A - in d e p e n d e n t re stric tio n evasion m echanism ap p ea rs to be unaffected by tra n sfe r of the p lasm id . Such a finding suggests th at the m echanism is expressed co nstitutively by Collbdrd and is not d ependent on transfer of the p lasm id. 64 The m o d est protection conferred on R751 by pLG221 w hen resident in the EcoKI restricting recipient (Fig. 3.6 C) com pared to the complete protection conferred w hen R751 and pLG221 are co-transferred (Fig. 3.5 A), suggests that the A rdA system operates optim ally w hen specified by a transferring plasm id. Such findings are indicative that the a r d A gene of Collb is subject to zygotic induction (C hapter One, section 1.6.4). 3.2.4 Is conjugative transfer of C ollb d r d a prerequisite for alleviation of restriction? A potential problem in the R751 protection experim ents (section 3.2.3) is that Collbdrd transfers from the EcoRI restricting recipients to the R751 donor. E xperim ents su m m arised in this section aim ed to establish w h eth er su ch transfers are a prerequisite for alleviation of type II restriction. Two strategies w ere adopted. The first involved a m ating experim ent w here u nm odified R751 w as tran sferred to a restricting recipient harb o u rin g w ild type C ollb (pLG272). W ild type Collb is naturally repressed for transfer functions an d therefore tran sfer of the plasm id is a rare event. The transfer efficiency of pLG272 is 40,000-fold low er than that of pLG221. The second strategy involved the exploitation of the phenom enon of plasm id-encoded entry exclusion. The m atings described in section 3.2.3 were repeated substituting pLG221 for pLG272. Like pLG221, pLG272 also carries a Tn5 insertion that confers resistance to kanam ycin although the plasm id does not carry a drd m u tatio n (H o w lan d et al., 1989). A ppropriate recipients w ere m ade by transferring pLG272 DNA into each strain by transform ation. th eir resistance to kanam ycin. Strains w ere identified by Filter m ating experim ents w ere then set u p b etw een a donor strain harbouring R751 and the restricting recipient strain h arb o u rin g pLG272. The num bers of R751 transconjugants w ere m onitored th ro u g h o u t the experim ent (Fig. 3.7). Figure 3.7 (A) show s that w hen R751 is transferred to an EcoRI restricting rec ip ie n t h a rb o u rin g pLG272, tran sco n ju g an t p ro d u c tio n w as negligible, 65 Fig. 3.7. Transfer of R751 to a restricting recipient harbouring w ild type Collb. (A) R751 transconjugants from conjugations of GI65R(R751) d o n o rs and eith er recipients GI65N (o , control), GI65N(NTP14) (#, control) o r GI65N (N TP14, pLG272) (■). (B) R751 tra n sc o n ju g a n ts from N M 654R(R751) d o n o rs a n d recipients N M 816N (O, control), GI65N ( • , control) a n d GI65N(pLG272) (■). T ransconjugants w ere selected on N A plates containing N al a n d Tc. the m ean values of three experm ients. D ata are Fig. 3.7 EcoRI 6- 4o 0 10 20 30 Time (min) EcoKI <D Oh 6 - c bC .B, 'o' ou </> c rj j- i bC o hJ 10 Time (min) 40 a m o u n tin g to 10’5 per cell at 40 m inutes. Clearly, pLG272 does not confer p ro tectio n on R751, d em onstrating th at the restriction-evasion m echanism active against type II enzymes is determ ined only by a ColTbdrd plasm id. Figure 3.7 (B) shows the result of transferring R751 to an EcoKI restricting recipient harbouring pLG272. A sm all am ount of protection was conferred on R751 by pLG272, since at 40 m inutes the num ber of R751 transconjugants w as 8-fold h ig h er th a n those form ed in restrictin g stra in s lacking pLG272. Interestingly, protection is initiated early, as found w hen pLG221 (ArdA+) w as resident in the recipient cell (Fig. 3.6 C). These findings suggest that ardA is not u n d er the com plete control of the Collb Fin system an d is expressed at a low level. Read et ah (1992), also described sim ilar findings w here unm odified X.O plated w ith a greater efficiency on an EcoKI restricting strain h arbouring pLG272 (e.o.p of 3.7 x 10' ) com pared to a restricting strain lacking a Collb plasm id (e.o.p of 1.4 x 10’4). A lthough use of pLG272 confirm ed that conjugative transfer of C o U b d r d is a prerequisite for alleviation by type II restriction, the data does not reveal the stage of C ollb's conjugation cycle w hen alleviation occurs. Possibilities are m ating pair form ation; DNA transfer and expression of the plasm id's transfer genes. To explore this question further, the phenom enon of plasm id-encoded entry exclusion w as exploited. The strategy here involved using the cloned Collb exclusion genes (eex) in the donor to prevent transfer of pLG221 from the restricting recipient to the R751 donor. The exclusion genes of Collb have been cloned giving reco m b in an t p lasm id pRR4 (Roscoe, 1996). The level of exclusion conferred by pRR4 on pLG221 is show n in Table 3.2.1. This displays the results of a 40-m inute m ating experim ent w here pLG221 was transferred to a recipient strain harbouring pRR4. The yield of pLG221 transconjugants w as red u ced 1,090-fold relative to the non-excluding control. H ow ever, quite surprisingly, w hen the recipient strain also harboured plasm id R751 as well as pRR4, the level of exclusion conferred on pLG221 was only 12-fold. 66 Table 3.2.1. Exclusion of pLG221 by the cloned Collb exclusion (eex) genes in pRR4 Donor Recipient pLG221 transconjugants/ ml *Level of exclusion GI65N(pLG221) GI65R 1.2 xlO 7 GI65N(pLG221) GI65R(pRR4) 1.1 x 104 1090-fold GI65N(pLG221) GI65R(pRR4, R751) 1.0 x 106 12-fold 0 Equal numbers of donor and recipient cells were mixed together to give a concentration of 108 cells of each strain per ml. A 40 minute filter mating was performed at 37°C. The numbers of pLG221 transconjugants were determined by plating the mating mixture on NA plates containing Km (25 pg mb1) and Rif (25 pg ml4 ). Plasmid pRR4 is a recombinant plasmid carrying the cloned Collb exclusion (eex) genes. *The level of exclusion conferred on transferring plasmid pLG221 by pRR4 was determined by comparing the number of pLG221 transconjugants produced in the recipient harbouring pRR4 to those produced in non-excluding strain GI65R. Fig. 3.8. Transfer of R751 from a donor harbouring the clon ed Collb exclusion genes to a restricting recipient harbouring C ollb drd. (A) R751 tran sco n ju g an ts from co n jugations of GI65R(R751) d o n o rs an d either recipients GI65N (O, control), GI65N(NTP14) ( • , control), GI65N(NTP14, pLG221) (■), and GI65R(R751, pRR4) donors an d G I65N (N TP14, pLG221) (A) recip ien ts. (B) R751 tra n sc o n ju g a n ts from N M 654R(R751) d o n o rs a n d recipients NM 816N (O, control), GI65N ( • , control) an d GI65N(pLG292) (■) and NM654R(R751, pRR4) donors and rec ip ie n t G I65N (pLG 292) (A). (C ) R751 tra n sc o n ju g a n ts from NM 654R(R751) d o n o rs a n d re c ip ie n ts N M 816N (O, c o n tro l), G I65N (•, c o n tro l) and G I65N (pL G 221) (■ ), and d o n o rs NM654R(R751, pRR4) an d recipient GI65N(pLG221) (A). T ransconjugants w ere selected on N A plates containing N a l a n d Tc. th ree experim ent. D ata are th e m ean v alu e of Logio transconjugants per m l N> Os 00 era U> 00 O nin o 53 H H NJ > ft CL + ON N3 00 tn Ci o *I—H H > ft CL ►■ o NJ 4L as oo cn r> o 5* fH > ft CL + 4L O This effect of pLG221 partially overcom ing its ow n exclusion barrier is not due to plasm id R751 displacing pRR4 from the recipient cell. T hroughout the m ating, the fraction of recipient cells harbouring both R751 and pRR4 w as m easured by plating some of the m ating m ixture on NA plates containing Rif, Tc and Ap. The num ber increased slightly by 3-fold as expected for a growing culture. One possible explanation is th at ColJbdrd is able to gain entry to the defined recip ien t via the m ating a p p a ra tu s of R751. A nother is that the presence of R751 in the pRR4 recip ien t interferes w ith the Collb exclusion system. Figure 3.8 (B and C) show s the effects of transferring R751 from a donor strain expressing the Collb exclusion system to an EcoKI restricting recipient harbouring pLG292 (D rd+, A rdA ') (B) or pLG221 (Drd+, A rdA +) (C). In both cases, the ex clu sio n sy stem in th e donor had no strik in g effect on the protection conferred on R751. Such findings indicate that conjugative transfer of Collb is not required for the ardA -independent function that alleviates type I restriction. The sam e conclusion w as draw n by the data in Fig. 3.6 (B) as discussed previously. In co n tra st, w h e n R751 tran sfe rred from the d o n o r to an E coR I restricting recipient harbouring pLG221, the level of R751 protection is reduced by ~10-fold (Fig. 3.8 A ). In the sam e experim ent, pLG221 transconjugant p ro d u ctio n in the R751 d o n o r w as also reduced by 10-fold. Such results su p p o rt the findings th at alleviation of type II restriction requires conjugative transfer of Collb. W hether the C ollb exclusion system w orks by p reventing m ating-pair fo rm atio n or D N A tra n sfe r is n o t know n (C hapter O ne, section 1.3.3). C onsequently, experim ents involving the Collb exclusion genes could not be u sed to identify the stage of the conjugation cycle at w hich the plasm idm e d ia te d re stric tio n e v asio n m echanism is ex p ressed . H ow ever, the experim ents w ere useful in indicating that the anti-type II m echanism is not co-ordinately expressed w ith the transfer genes. 67 3.3 Summary Four im portant conclusions can be draw n from the data in this chapter. First, the data are consistent w ith Read et al. (1992) in th at transfer of m ultiple copies of Collbdrd does alleviate restriction by type I and type II enzym es (Fig. 3.2). H ow ever, alleviation is ap parently not due to the restriction enzym es being overw helm ed by substrate saturation (Read et al., 1992), since, Collb drd alleviated restriction by both type I and type II enzym es w hen resident in the restricting recipient (Fig. 3.6). Second, the a r d A - i n d e p e n d e n t anti-type I system is apparently different from that w hich alleviates type II restriction. This conclusion w as draw n from the o b servation th a t type II restriction alleviation by C o l l b d r d only occurs d u rin g conjugative transfer of the plasm id. In contrast, alleviation of type I restriction occurred independently of Collbdrd transfer (Fig. 3.7). Third, the C ollb a r d A gene appears to be subject to zygotic induction. A rd A -m ed iated re stric tio n alleviation was observed to be optim al w h en encoded by a transferring Collbdrd plasm id rather than an equivalent plasm id resident in the restricting recipient (Fig. 3.5; Fig. 3.6 C). Such data suggest that a r d A expression is enhanced in conjugation but do not distinguish the cell in w hich gene ex pression is am plified. This dilem m a is ad d ressed in C hapter Five. Fourth, Collb partially overcomes its own exclusion barrier w hen IncPp plasm id R751 is resident in the excluding recipient strain. One hypothesis that m ay explain this phenom enon is that Collb utilises the m ating apparatus of R751 to gain entry to the excluding recipient. Such a scenario w ould indicate th a t the Collb exclusion m echanism operates by p re v e n tin g m atin g -p air form ation betw een tw o cells harbouring the same plasm id. How ever, such a hypothesis is inconsistent w ith the w ork of H artskeerl an d H oekstra (1985). They found evidence to suggest that the Collb exclusion system w orks by inhibiting DNA transfer betw een two donor cells (C hapter One, section 1.3.3). 68 D ue to tim e constraints, this concept of interactions betw een conjugation system s w as not pursued further. One w ay of testing the hypothesis w ould be to construct a specific m utant of R751 unable to express conjugative pili. Such a m u tan t w o u ld be expected to no longer provide the route of entry for the Collb plasm id to the excluding recipient cell. These sum m ary points are follow ed up in su b seq u en t chapters and general discussion. 69 Chapter Four Construction of an oriT Mutant of Collb drd and its Conjugative Properties 4.1 Introduction Data presented in the previous chapter suggests that Collbdrd encodes a restrictio n -allev iatio n function, w h ic h is active against type II restrictio n enzym es and is expressed d uring conjugative transfer of the plasm id. There are three m ain stages of C ollb's conjugation cycle w hen such a function could be expressed: d u rin g m ating-pair form ation, DNA transfer or th ro u g h o u t the cycle. To distinguish betw een these three stages, it was necessary to construct a m u ta n t of Collb defective in self-transfer by conjugation, yet proficient in form ing m atin g p a irs an d in expressing the full com plem ent of p lasm id proteins. The p u rp o se of this chapter is to describe the strategy em ployed in creating such a m utant. As described in chapter one, the origin of transfer (oriT ) of a plasm id is a c is -acting elem ent re q u ire d for DN A transfer by conjugation. T ransfer is initiated at the nic site w ithin the oriT by specific endonucleolytic cleavage of a single strand of the plasm id (Lanka and Wilkins, 1995). The m ain strategy for creating a m u tan t of C ollbdrd, defective in self-transfer, w as to delete the nic region from the o r i T of the plasm id. Collbdrd is -93 kb in size and specific alterations of sequences stretching only a few base pairs in length, internal of such a sizeable DN A genom e, are not practical w ith o u t using interm ediate sm all recom binant plasm ids. Therefore, before any alterations to the sequence of th e Collbdrd oriT could be m ade, the locus had to be cloned and its n ic sequence determ ined. The latter w as achieved by DNA sequencing. The nic sequence w as deleted from the cloned o r i T by a p ro ced u re involving PCR-am plification. A selectable m arker gene w as then inserted into 70 th e sequence and the disrupted o r i T transferred back to the Collbdrd plasm id b y hom o lo g o us recom bination. The m ethod used to replace the o r i T of C ollbdrd w ith a selectable m utation w as based on a gene replacem ent strategy described by Russell et al. (1989). The recom binant plasm id, carrying the d is ru p te d o r i T and selectable m ark e r, w as first lin earised by restriction endonuclease digestion. This D N A w as then in troduced by transform ation into an E. coli recD strain harb o u rin g Collbdrd. U nder norm al circum stances, linear DNA is subject to degradation by intracellular nucleases. H ow ever, the advantage of using an E. coli rec D m u ta n t is that the exonuclease activities of the RecBCD enzym e are abolished w ith o u t affecting the enzym es ability to act as a DNA helicase in initiating R ecA -m ediated hom ologous recom bination. The disrupted oriT, w ithin the recom binant plasm id is integrated by a double crossover event sp a n n in g the selectable m arker. recom binant p lasm id is eventually destroyed. The rem a in d e r of th e The m inim um am o u n t of hom ology that is required for efficient recom bination is 44-90 nucleotides for RecBC and RecF dependent pathw ays (Shen and H uang, 1986). O th er experim ents described in this chapter include exam ining the conjugative properties of the ColIbdrdAnzc plasm ids. 4.2 Results 4.2.1 Cloning and determination of the nucleotide sequence of the Collb o riT locus The Collb oriT is located adjacent to the leading region of the plasm id on a 1.6 kb P s t l fragm ent [C hapter O ne, Fig 1.1] (Rees et al., 1987). This fragm ent has already been cloned in this laboratory and inserted into cloning vector pUC19 (Roscoe, 1996). The resulting recom binant plasm id (pCL2) w as u sed initially as a tem plate for determ ining the nucleotide sequence of Collb oriT. Sequence reactions w ere perform ed as described in Chapter Two using M 1 3 /p U C u n iv ersal forw ard an d reverse prim ers. H ow ever, the initial sequence data obtained w ere very am biguous. Consequently, the integrity of 71 the pCL2 tem plate was re-exam ined, firstly by restriction enzym e analysis an d secondly by PCR-amplification. A series of 20-cycle PCR reactions w as set up using universal forw ard and rev erse p rim e rs w ith tem p late pCL2. The tem p eratu re at w hich the p rim ers w ere annealed to the pCL2 tem plate was varied from 49-60°C. The resulting extension products p ro d u ced in each reaction w ere exam ined by gel electrophoresis and in all cases unpredictable DNA bands of varying sizes w ere observed (data n o t show n). Such results are indicative of either non-specific a n n ealin g of p rim e rs or of a m ixed p o p u latio n of tem plate. H ow ever, restriction data for pCL2 w ere not indicative of the latter possibility. To overcom e the problem , the 1.6 kb Ps tl insert w as rem oved from pCL2 by restriction enzym e digestion an d the fragm ent isolated from a 0.8% agarose gel after separation from pUC19 by electrophoresis. The fragm ent was purified from the agarose using the QIAEX II gel extraction kit (Qiagen) and ligated back into the P stl site of a separate batch of pUC19 molecules form ing recom binant p N A l. Isolation of p N A l w as achieved by transferring the ligated DNA by tran sfo rm a tio n in to the E. coli L ac ' strain , D H 5a. T ransform ant colonies containing p N A l w ere d istinguished from those that had gained re-ligated pUC19 vector by their w hite appearance on LA plates containing X-gal, IPTG and Ap (Sam brook et al, 1989; see C hapter Two). In order to establish the orientation of the 1.6 kb P stl insert of p N A l w ith in th e pUC19 v ector, the recom binant plasm ids w ere exam ined by restriction enzym e digestion. All four of the recom binant plasm ids isolated w ere digested w ith Clal and EcoRI, w here the Clal site is internal to the 1.6 kb insert and the EcoRI site adjacent to one of the P stl sites. Three of the four recom binants tested h a d the orientation show n in Fig. 4.1 A, and one the opposite orientation. The latter plasm id was subsequently designated p N A la. R ecom binant p lasm id s p N A l and p N A la w ere both tested for th eir ability to be m obilised by C o ll b d r d plasm id pLG221 from a recA donor. A recA d o n o r strain w as used in all m obilisation experim ents sum m arised in Table 72 Fig. 4.1. Schematic diagram of the construction of pN A 2 (A ) C leavage of p N A l w ith P s t l a n d C l a l to rem o v e th e 1.15 kb of D N A containing C ollb oriT . The rem aining 3.1 kb P s t l - C l a l fra g m e n t is discarded. (B) Cleavage of cloning vector pIC 19H at its Pstl and C l a l sites to obtain the 2.6 kb linearised vector. The sm all P s t l - C l a l fragm ent fro m the polylinker is lost th ro u g h p u rification of the larger fragm ent. (C) In se rtio n of the 1.15 kb P s t l C l a l frag m en t from p N A l into 2.6 kb pIC 19H v e cto r to give pN A 2. (—^ ) rep resen ts the o rien tatio n (5' - 3') an d annealing site for M 1 3 /p U C universal forw ard (UF) an d reverse (UR) prim ers. Fig. 4.1 EcoRI Bam H l H in d lll Sphl Pstl pIC19H 2686 bp P vu I Clal EcoRVXbal Bglll Xhol Sacl Nrul Hindlll H in d lll pNA2 3786 bp H indlll Nru I Sac I Xho I Bgl II Xfcal C la l EcoRI Sma I BamHl S a il P stl Hindlll Table 4.2.1. Mobilisation of recombinant plasmids carrying the Collb oriT locus Donor Stain Transconjugants / ml Recombinant Helper Plasmid Mobilisation Frequency of Recombinant J08(pLG221, pN A l) 2.0 x 106 3.0 x 106 0.66 J08(pLG221, pN A la) 1.1 x 106 2.0 x 106 0.55 J08(pLG221, pNA2) 8.0 x 106 1.1 x 107 0.73 J08(pLG221, pNA3) 4.0 x 106 1.4 xlO 7 0.28 J08(pLG221, pNA7) <10 7.0 x 107 < 1 0 '4 J08(pLG221, pNA8) <10 1.3 x 107 <10‘4 J08(pN A ll, pNA2) 1.9 xlO 7 <10 1.0 J08(pNA12, pNA2) 1.1 x 107 <10 1.0 The frequency of transfer of each plasmid was determined after a 40 minute filter mating at 37°C, using J08 as the donor host strain and GI65N as the recipient. Recombinant plasmids pN A l, pN A la, pNA2, pNA3, pNA7 and pNA8 all confer resistance to ampicillin. Helper plasmids pLG221, p N A ll and pNA12 all confer resistance to kanamycin. Plasmids p N A ll and pNA12 both carry a cis-acting lesion within the oriT which prevents their self-transfer. However, both plasmids still specify all other transfer functions. Transfer of recombinant plasmids were measured by plating the mating mixture on NA plates containing Ap (100 pg ml-1) and Nal (25 pg ml4 ) and transfer of the helper plasmid measured by plating on NA plates containing Km (25 pg ml4 ) and Nal (25 pg ml4 ). ^Mobilisation frequency is the number of mobilisable plasmids transferred per conjugative plasmid. 4.2.1, to p re v e n t co-integrate fo rm a tio n b e tw ee n C ollb p lasm id s a n d recom binants carrying the Collb o r i T sequence. Test strains w ere m ade by tra n sfe rrin g the recom binant p la sm id s by tran sfo rm atio n to J08(pLG221) com petent cells. D onor strains J08(pLG221, p N A l) and J08(pLG221, p N A la) w ere m ated w ith N alR recipient strain, GI65N for 40 m inutes on filters. The results of these transfers are p resented in Table 4.2.1. Both p N A l and p N A la w ere found to be m obilised w ith a n efficiency of 0.66 and 0.55, respectively, relative to the transfer of pLG221. Such results confirm that both recom binant plasm ids contain the Collb or iT an d th at its activity as a czs-acting site for the initiation of DNA transfer is unaffected by orientation. To reduce the num ber of reactions required to determ ine the com plete nucleotide sequence of the 1.6 kb P stl insert w ithin p N A l, the Collb oriT w as subcloned further. The 1.15 kb P s t l - C l a l fragm ent found w ithin p N A l w as isolated and lig ated into cloning vector pIC19H, form ing pN A 2 (Fig. 4.1). Cloning vector pIC19H is identical to pUC19 except it has a different set of cloning sites available w ithin the polylinker (Marsh et al., 1984). M obilisation of pNA 2 from a recA host (J08) w ith a frequency of 0.73 relative to the transfer of pLG221 confirm ed that the plasm id w as oriT+ (Table 4.2.1). The suitability of pNA2 as a tem plate for DNA sequencing was further confirm ed by PCR-am plification and restriction enzym e analysis. Initial sequence reactions w ere set up using u n iv ersal forw ard an d reverse prim ers. H ow ever, to obtain sequence data for both DNA strands, tw o new prim ers w ere designed (prim er B and prim er C; see C hapter Two). The sequence d ata o b tain ed from pN A 2 using u niversal forw ard and reverse prim er and prim er B and prim er C, are presented in Fig. 4.2. The location of the nic site in In cll plasm id R64 is know n, as is the n ucleotide sequence of the su rro u n d in g region (Furuya and Komano, 1991). A lignm ent of nucleotide sequences of a num ber of plasm ids show s that R64 nic region belongs to a family w ith a consensus sequence of 5'-YATCCTGTY-3', w here ▼ indicates the specific cleavage site (Lanka and W ilkins, 1995). The 73 Fig. 4.2. 1,151-bp D N A sequence containing the Collb o r i T and part of the o r iT operon Features sh o w n include the nic region lnnnn w here A rep resen ts the site of cleavage by th e relaxase enzym e; the tw o start co d o n s of n i k A an d nikB |GTG| an d their associated ribosom e binding sites (rbs ) w hich are u n d erlin ed ; an d the TGA stop co d on of n i k A . A m ino acid sequences d e d u c e d from the D N A sequence of th e nikA an d nikB reading fram es are p re se n te d , w here the sta rt of each tra n sla te d am ino acid is in uppercase. in d ic ate d . R elevant restrictio n sites are In v e rte d re p e a t sequences are sh o w n (■----------- *----------------- • ). In v erted rep eats A an d B represent the site at w hich th e n i k A p o ly p e p tid e b in d s to the DNA. P utative prom oter sequences (-35 an d -10 regions) are boxed L. i N u cleo tid es 1-437, p resen ted in low er case, re p re se n t th e le a d in g region stran d th at is transferred to the recipient cell first d u rin g conjugation. A ll features sh o w n , except the n ic sequence, w ere id e n tifie d by se q u en ce com parisons to the p u b lish ed oriT sequence of the In c ll p lasm id R64 (F uruya a n d K om ano, 1991; EMBL d a tab ase accession n u m b e r D90273). The n i c sequence w as d eterm in ed by com paring the sequence d a ta to the consensus 5'YATCCTGt Y-3’ (Lanka and W ilkins, 1995; Fig. 1.2). Fig. 4.2 1 61 121 c c g a c tta g a c a g a a a a a tc ta c g a c a a a a tc tc c ttc ttg c c g c c c ta c c g a a a a a g c t 60 a a g a c g a a g g g c tta tta c a g tg c ttc a c g g a a g a g a g g a tg a a g ta tg g g a c g ta a a a a c tc g c a c a g tg c ttg ta g a c g g tc a c a ta ttg g g a a a c a g g a a tg g tc ttg tta tta c a g 12 0 180 Bglll 181 ta g tg c tta c c g ta g ttc tttta g tg c g tta c a a a a g g c a a c tc g c c a c |tc ta g ^ c c g tt 2 40 2 41 a a g g c ta tttc g tc g a a a a c g g tc g a c c ta c a g ta g ta g a c tc a tttta a a a a a tc ttc g 3 00 3 01 g ta a c c g tg a a a g a g a g g c a a a g ta c a a c ttc a g a c ta a a a ta g ta g tttttg a c tg a g g 3 60 3 61 a c c g tc c c g a c g c g g g a c g g a g a c g g c a g a c g ta ttc tg a ta c ta c g tg ttttta ttg tc 42 0 421 T G C A A lH g^T A C C G T C C C A C G C G A C G C +----------------------i----------------------+ 4 8 0 c g a t a t t a c c g g a c t t t t '^GebCTGTG<lTAOACGTTAACATTATGGCAGGGTGCGCTGCG -3 5 A %'Mk.y nic -1 0 repeat A rfos IR lnikA- 4 8 1 --------------------■+■----------h -------------- h---------------------- f* i-----------------------f 5 4 0 ATAATGTTAACGGTAGACCAGTCCCGAAGCGGGGCTGTGGGGCATTCCTCGGACTTCACT ---------------------------- 1 1 ^ IR Z 1 V a lS e r rep e a t B GTGATAGTGCGGTAAGAAAGAAAAGTGAGGTACGGCAAAAGACAGTTGTCAGGACACTCA 5 4 1 -------------- +------------- +------------- +------------- +------------- +-------------- + 6oo C A C T A T C A C G C C A TT C TT T C TT T T C A C TC C A T G C C G TT T TC T G T C A A C A G T C C T G T G A G T A sp S e rA la V a lA rg L y sL y sS e rG lu V a lA rg G ln L y sT h rV a lV a lA rg T h rL e u A rg GATTCTCTCCTGTTGAGGATGAAACCATCAGGAAAAAAGCTGAAGATTCCGGACTTACTG 601 + ------------------- + ------------------ + ------------------ + ------------------ + ------------------- + 660 CTAAGAGAGGACAACTCCTACTTTGGTAGTCCTTTTTTCGACTTCTAAGGCCTGAATGAC P h e S e rP ro V a lG lu A sp G lu T h rlle A rg L y sL y sA la G lu A sp S e rG ly L e u T h rV a l TATCTGCTTACATCCGAAATGCAGCACTGAATAAGCGAATTAACTCCCGTACAGATGATG 66 1 ------------------+ --------------------+ ------------------ + ------------------ + ------------------ + -------------------+ 72 0 A TAGACGAATGTAGGCTTTACGTCGTGACTTATTCGCTTAATTGAGGGCATGTCTACTAC S e rA la T y rlle A rg A sn A la A la L e u A sn L y sA rg lle A sn S e rA rg T h rA sp A sp A la CTTTTCTGAAGGAGTTAATGAGACTGGGAAGGATGCAGAAGCATCTGTTTGTTCAGGGAA 721 ------------------+ --------------------+ ------------------ + ------------------ + ------------------ + -------------------+ 7 8 0 G AA AA GACTTCCTCAATTACTCTGACCCTTCCTACGTCTTCGTAGACAAACAAGTCCCTT P h e L e u L y s G lu L e u M e tA rg L e u G ly A rg M e tG ln L y sH is L y s P h e V a lG ln G ly L y s AAAGAACCGGTGACAAGGAATACGCAGAAGTGCTTGTTGCCATAACCGAACTTACAAATA 781 ------------------ + ------------------ + ------------------ + ------------------ + ------------------ + ------------------ + 8 4 0 TTTCTTG G C C A C TG TTC CTTA TG CG TCTTC A C G A A C A A C G G TA TTG G C TTG A A TG TTTA T A rg T h rG ly A s p L y s G lu T y rA la G lu V a lL e u V a lA la lle T h rG lu L e u T h rA s n T h r rbs nikB — ► c a t t g c g t a a a c a g t t a a t g g a a g g t (t g ^ g g a t t c t c c t § tg ]a a t g c a g t c a t t c c g a a a 841 G TA ACG CA TTTG TCA ATTACCTTCCAA CTCCTA AG AG G ACACTTACGTCAG TAA GG CTTT L e u A rg L y s G ln L e u M e tG lu G ly * V a lA s n A la V a llle P ro L y s AAGAGAAGGGACGGCAAGTCTTCGTTCGAAGACCTAGTATCCTACGTCTCTGTCCGGGAT 901 ------------------ + ------------------ + ------------------ + ------------------+ -------------------+ + 9 60 TTCTCTTCCCTGCCGTTCAGAAGCAAGCTTCTGGATCATAGGATGCAGAGACAGGCCCTA L y s A rg A rg A s p G ly L y s S e r S e rP h e G lu A s p L e u V a lS e rT y rV a lS e rV a lA rg A s p GACATGACCGATGAGGAACTGGATTTATCTTCTTCTTCGCAGGCTGAACAACCGCACCGA 961 1 f 1 + 1_ 1 0 2 0 CTGTACTGGCTACTCCTTGACCTAAATAGAAGAAGAAGCGTCCGACTTGTTGGCGTGGCT A s p M e tT h r A s p G lu G lu L e u A s p L e u S e rS e rS e r S e rG ln A la G lu G ln P ro H is A rg AGCCGTTTCA GTCGCCTTG TTG ATTATGCG ACACG TCTTCG AA A TG AG TCA TTCG TG G CG 1021 ------------------- (--------------------- h------------------ + ------------------h------------------ h-------------------- + 1 0 8 0 TCGGCAAAGTCAGCGGAACAACTAATACGCTGTGCAGAAGCTTTACTCAGTAAGCACCGC S e rA rg P h e S e rA rg L e u V a lA s p T y rA la T h rA rg L e u A rg A s n G lu S e rP h e V a lA la CCGGTTGATGTCATGAAGGACGGCTGCGCATGGGTCAACTTTTATGGTGTCACCTGCCCT 1081 ------------------ + -------------------- + ------------------ + ----------------- + ------------------+ ------------------- + 1 1 4 0 GGCCAACTACAGTACTTCCTGCCGACGCGTACCCAGTTGAAAATACCACAGTGGACGGGA P ro V a lA s p V a lM e tL y s A s p G ly C y s A la T rp V a lA s n P h e T y rG ly V a lT h rC y s P ro C A TA A C TG TA T 1141 ------------------+ G TA TTG A C A TA H is A s n C y s 1151 nucleotide sequence determ ined for the 1.15 kb Ps tl-Clal fragm ent of pNA2 was found to h av e 97% identity over about 924 bp w ith the corresponding region su rro u n d in g oriT of R64 (EMBL database accession num ber D90273). The nic region of Collb w as identified by alignm ent of the sequence w ith the consensus sequence described by Lanka and W ilkins (1995). In fact, the 12-bp nic region identified for Collb is identical to the nic region of plasm id R64. Other features identified w ith in the sequence of the 1.15 kb Pstl -C lal fragm ent of pNA2 were also identified w ith in the published sequence of R64 (Fig. 4.2). 4.2.2 Strategy em ployed to delete the nic sequence from the cloned Collb o riT Once the sequence of the Collb nic site had been determ ined, a strategy involving PC R -am plification w as em ployed to delete it from the cloned or iT (pNA2) as su m m arised in Fig. 4.3. First, tw o inverse oligonucleotide prim ers w ere designed w hich w ere h om ologous to th e regions of DNA flanking the nic region (prim er 5 and prim er 6, Fig. 4.3, A). A unique EcoRI restriction enzym e target sequence w as added onto the 5' term inus of each prim er. Hence, during amplification of the tem plate by an inverse PCR reaction, 14 bp overlapping the nic region are lost and the targ et sequence for the EcoEl restriction enzym e is incorporated. In ad d itio n , these p rim e rs w ere also designed to avoid d isru p tio n of the o r i T operon prom oter found 12-bp upstream of the nic site (Fig. 4.3, A). Before any am plification of the pNA2 tem plate w as carried out the PCR protocol w as optim ised as described in Chapter Two, section 2.6.1. The results of this optim isation process are presented in Fig. 4.3 (B). Specific com m ents w ere p rim er 5 and p rim er 6 have an optim al annealing tem perature of 53°C (lane 3 and 4) a n d th at 4 m inutes is required for the w hole of the pN A 2 tem plate to be am plified by Taq polymerase (lane 5). The final PC R -am plification of pN A 2 and the subsequent rem oval of the Collb nic reg io n from the plasm id w as achieved using 50 ng of pN A 2 74 Fig. 4.3. Schematic diagram of the key stages in the construction of a Collb o r iT sequence deleted o f the nic region (A) R elatio n sh ip b etw een p rim e rs u sed for inverse P C R -am plification of pN A 2 and the nic region. ▼ indicates the nic site; the boxed sequence is the -35 region of the p u tativ e prom oter of the relaxase operon. into the lead in g region. P rim er 5 extends Prim er n ucleotides show n in lo w er case are non- ho m o lo g o u s to the Collb region and contain the EcoRI ta rg e t h ex an u cleo tid e (5'-gaattc-3'). (—a ) represents the direction (5' - 3') a n d a n n ealin g sites of p rim e rs 5 and 6 a n d the M 13/pU C universal forw ard (UF) and reverse (UR) p rim e rs. (B) Gel electrophoresis of PC R -products of pNA2. M olecular w eig h t m ark ers (X x H i n d III) are in lanes 1 and 6. Prim ers used (2) UF and UR; (3) 6 and UR; (4) 5 and UF; (5) 5 and 6. The size of each extension product is indicated in kb. (C) Cleavage of th e linear PCR p ro d u c t w ith EcoRI an d ligation gave pN A 7 d eleted of the nic region. The om ega (Q) in tersp o so n w as lig ated into the u n iq u e EcoRI site, form ing pNA8. Fig 4.3 pNA2 P stl Primer 5 -35 5 ' cggaacCacgTTTTCAGGCCATTATAGCC 5 ’ GGGACj GTATTACAATTGCACATCCTGTCCCGTTTTTCAGGCCATTATAGCC 3 ■CCCTC-GCA’JA.-.TGTHAACGTGTAGGACAGGGCAA.AAAGTCCGGTAATATCGG 3 • CCCTGCCATAATGTTAACGgccc-aaggc 5 ' P r im e r 6 - 3 .8 B 1.1 — -0 .7 0.4 — Ap] pN A 7 3772 bp P stl C lal EcoRI Sm R/S p R Q 2 kb EcoRI EcoRI tem p late; 20 p m o l of b o th p rim e r 5 and 6; 1 u n it of proof-reading T a q polym erase (BIO-X-ACT, Bioline UK Ltd); 5 jxl of 11.1 x PCR buffer; 40 pi of H 20 and the follow ing PCR program : 95°C - 3 m inutes (1 cycle); 95°C - 45 seconds, 53°C - 30 seconds, 72°C - 4 m inutes (35 cycles) and 72°C - 4 m inutes (1 cycle). The resu ltin g 3.8 kb extension p roduct was isolated and purified from a 0.8% agarose gel u sing the QIAEX II gel extraction kit (Qiagen). The next stage in isolating a recom binant plasm id containing the oriT deletion m utant was to re-ligate the linear PCR product. O ne option w as to digest and re-ligate the EcoRI restriction targ et sites th at w ere incorporated on to the ends of the PCRp ro d u ct. H o w ever, restriction targ et sites at the ends of PC R -products are difficult to digest. To avoid this com plication another approach was used. One key feature of u sing a polym erase enzym e w ith 5'-3' proofreading exonuclease activity is th at a large p o p u latio n of the linear product have flush ends. By exploiting th is p ro p e rty , the P C R -product w as re-circularised by b lu n t-en d ligation. The D N A w as then transferred by transform ation to DH 5a. Colonies th at h a d gained a circularised plasm id w ere detected by their resistance to am picillin. The deletion of the nic sequence from the resulting recom binant plasm id, pN A 7, w as confirm ed by the inability of the plasm id to be m obilised by plasm id pLG221 (Table 4.2.1). 4.2.3 Construction of pNA8 containing the omega (Q) intersposon The next step involved the insertion of a selectable antibiotic resistance gene carried by the om ega (D) intersposon into pNA7. The 2.0 kb fragm ent carries a streptom ycin-spectom ycin resistance gene flanked by inverted repeats w ith tran scrip tio n a n d tran slatio n term ination signals (Prentki and Krisch, 1984). The Q frag m en t w as released from pHP45Q by EcoRI digestion and ligated into the EcoRI site of plasm id pNA 7 to give pNA8 (Fig. 4.3, C). Like pNA7, pNA 8 is also non-m obilisable by pLG221 (Table 4.2.1). Finally, further to confirm the stru c tu re of the m u ta n t o r iT locus, pN A 7 and pNA 8 w ere exam ined by restriction enzym e analysis (Fig. 4.4). 75 Fig. 4.4. Gel electrophoresis of restriction fragments of pNA7 and pNA8. Sm R/S PR EcoRl Ec oR l 2 kb EcoRI B C lal An i c P stl pNA7 3772 b p (A) Samples in lanes are (1) X x Hzndffi; (2) pNA7 x P s t l / C l a l ; (3) pNA7 x P s t l / C l a l / E c o R l ; (4) pN A 7 x EcoRI; (5) pN A 8 x P s t l / C l a l / ; (6) pNA8 x Pst l/Clal/EcoRL; (7) pN A 8 x EcoRI; (8) X x H in d ll l. Fragments sizes are presented in kb. (B) Physical m ap of pN A 7 and the 2.0 kb Q. fragment, which on ligation into the EcoRI site form s pNA8. 4.2.4 Construction of a Collb d rd mutant lacking the nic region The oriTAnicuQ. m utation w as transferred by recom bination from pNA 8 to the C o l l b d r d plasm ids pLG221 (A rdA +) and pLG292 (ArdA'). The 3.1 kb Pstl Clal fragm ent w as released from pN A 8 by restriction enzym e cleavage and the D N A tra n s fe rre d by tra n s fo rm a tio n into E. coli recD c o m p e te n t c e lls , DL307(pLG221) and DL307(pLG292). T ransform ant colonies that w ere KmR, Sm R, SpR and A ps w ere inferred only to contain Collbdrd plasm ids th at had acquired the oriT An ic :: Q m u tatio n from pNA 8 and w ere chosen for further ch aracterisation. The rest of p N A 8 in clu d in g the region th a t encodes ampicillin resistance w as presum ably degraded intracellularly. R ecom binant C ollb p la sm id s p N A ll ( p L G 2 2 1 Anic: :Q ) and pN A 12 (pLG292Am c::Q) w ere isolated by th eir resistance to Km, Sm and Sp. n a tu re of th e o r i T am plification. w ith in The each p la s m id w as c h arac terise d by PCR- P rim ers B and C w ere used to am plify the o r iT regions of plasm ids p N A ll, pNA12, pLG221, pN A 2 and pNA8 (Fig. 4.5). Any insertions w ithin this region w ere detected d u rin g PCR-reactions by an increase in the size of the am plification product. Results presented in Fig. 4.5 (B) show th at p N A ll, pNA 12 and pN A 8 all have an additional 2.0 kb of DNA in the o r i T locus, which is not present in pLG221 and pNA2 w hich carry the w ild type oriT locus. A lternatively, the n atu re of the o r iT locus w ithin these recom binant p lasm id s could h av e been confirm ed by a S o uthern blot h y b rid isatio n experim ent. H o w ev er, d u e to the tim e constraints of this project, this additional w ork w as not undertaken. 4.2.5 Conjugative properties of p N A ll and pNA12 Recom binant plasm ids p N A ll and pNA12 w ere tested for self-transfer by conjugation. Test strains w ere m ade by transferring p N A ll and pNA12 by transform ation into GI65N com petent cells and selecting for KmR, SmR and 76 Fig. 4.5. Analysis of the nature of the oriT within p N A ll and pNA12 by PCR-amplification SmR/SpR E c° m Clal i B ' J p nic p E coR 1 P sfl (not to scale) (A) Relationship betw een prim ers B an d C used for PCR-amplification of the Collb oriT locus. (-») represents the direction (5-3') and annealing sites of primer B and C. Plasm id pNA2 and pLG221 carry an un d isru p ted oriT . Plasmids p N A ll, pNA12 and pN A 8 are deleted of the nic region a n d contain the 2.0 kb Q insertion. Also show n is the oriT operon prom oter (p) and nikA and nikB. The region between the annealing sites of prim er B and C is 0.4 kb. (B) PCR reactions w ere perform ed u sin g prim er B and prim er C with plasm ids p N A ll, pNA12, pLG221, pNA 2 and pN A 8. Gel electrophoresis of the resulting extension products and their sizes in kb are presented. Lanesl and 7 contain X x H i n d U l m olecular w eight m arkers. Tem plates used w ere (2) p N A ll; (3) pNA12; (4) pLG221; (5) pNA2; (6) pNA8. Faint b an d s at the ends of lane 2,3 and 6 are unused prim er. SpR. C onjugation experim ents w ere then set up on filters using a RifR recipient strain (GI65R). A fter 40 m inutes, the num bers of recipients that had acquired p N A ll or pN A 12 w ere scored by p latin g on NA plates containing Rif and Km. No transconjugants w ere detected (< 2 x 10'6 relative to the transfer efficiency of pLG221). H ence, bo th p N A ll a n d pNA12 w ere found to be com pletely defective in self-transfer by conjugation. To confirm th a t p N A ll an d pN A 12 specify all t r a n s -acting tran sfer functions test conjugations w ere set up to exam ine w h eth er each plasm id could m obilise the sm all recom binant plasm id carrying Collb or iT+ (pNA2). D onor strain s (J08) w ere m ade th a t harb o u red recom binant plasm id pN A 2 and either p N A ll or pNA 12. T hese donors w ere then m ated w ith a RifR recipient strain for 40 m inutes on filters. After such tim e, the num bers of pN A 2 transconjugants w ere exam ined by plating on NA plates containing Rif an d Ap. Both p N A l l and pN A 12 w ere able to m obilise pNA2 w ith an efficiency of 1.0 from a r ec A h o st relative to pLG221 directed m obilisation (Table 4.2.1). In su m m ary p N A ll an d pNA12 are inferred to specify all of the tran s -acting p roteins required for conjugation but to contain a cis -acting lesion p rev en tin g self-transfer. These resu lts also confirm that each plasm id has retained its D rd + phenotype. Finally, the A rd A p h en o ty p es of p N A ll and pNA 12 w ere exam ined. The test involved infecting an EcoKI restricting E. coli strain (GI65N) w ith a X, v ir lacking EcoKI m odification (X v i r . 0) and exam ining the num ber of plaques form ed. In strains h a rb o u rin g p N A ll, the num ber of plaque form ing units p ro d u ced by X vir.O w as 3.8 x 108 m l’1 com pared to 1.7 x 106 m l'1 form ed on GI65N alone. In contrast, the n u m b er of plaque form ing units produced on Y 1 strain s h a rb o u rin g pN A 12 w as 8.0 x 10 m l' (A ppendix I). These results confirm that p N A ll has an A rdA + and pNA12 an A rdA ' phenotype. 77 4.3 Discussion R esults in this chapter detail the successful construction of a specific m u tatio n w ith in the or iT of Collbdrd. The resulting m utants, p N A ll (D rd+, A rd A +) and pN A 12 (D rd+, ArdA'), w ere completely defective in self-transfer by conjugation. H ow ever, the ability of p N A ll and pNA12 to specify all other transfer functions w as confirm ed by the m obilisation of an oriT+ recom binant p lasm id w ith a m axim al efficiency (Table 4.2.1). In sum m ary, p N A ll and pNA12 specify all the t r a n s - acting p ro tein s required for conjugation, yet, contain a cis- acting lesion w ithin the o r iT preventing self-transfer. Several n o tab le observations w ere m ade du rin g the construction of these m utants an d they are described below . N u c leo tid e seq u en ce sim ilarities are only found betw een the o r i T stru ctu res of closely related plasm id s (Furuya and Komano, 1991). This is co nfirm ed as th e n u c le o tid e sequence of the o r i T of Collb only shares significant seq uence sim ilarities w ith the published or iT sequence of closely related In cll p lasm id R64 (section 4.2.2). Significant sequence similarities exist betw een the nic regions of different conjugative plasm ids (Lanka and W ilkins, 1995). These sim ilarities have been organised into three groups presented in C hapter O ne, Fig. 1.2. These findings are further supported here, as the nic region of C ollb b e ars 100% identity w ith the corresponding region of R64, w hich in tu rn belongs to a family of nic regions harbouring the consensus 5'Y A T C C T G ^ Y -3 '. O th er m em bers of this group include plasm ids from the incom patibility g ro u p P and sequences from the Ti plasm id of Agrobacterium, tumefaciens (Lanka a n d W ilkins, 1995). The location and sequence of the Collb oriT nic site was confirmed by an ap p ro ach alternative to m ore conventional m ethods described by Furuya and K om ano (1991) (see C h ap ter One, section 1.3.2). First, the location of the p u tativ e Collb nic region w as identified by alignm ent of the oriT sequence data w ith the consensus sequence 5'-YATCCTG^Y-3'. Secondly, the legitimacy of the p red icted nic sequence w as confirm ed by the inability of recom binant 78 plasm id s, w h ich h ad been relieved of their nic sequence, to be m obilised by C o llb d rd (Table 4.2.1). This w hole approach w as enabled by the extensive database on or iT regions. It h as b e en p ro p o se d th a t the In c ll conjugation system m ay have evolved follow ing fusion of two distinct types of conjugative plasm id (Rees et al., 1987). Such a concept arose from the findings th at In cll plasm ids and plasm id F carry genes w hich h av e alm ost identical functions. These genes in clu d e p s i B an d s s b , w hich are fo u n d w ith in the leading regions of both plasm id F an d Collb (C hapter O ne, sections 1.6.1 and 1.6.2). In addition, the nucleotide sequences of these Collb genes share significant sim ilarity to those of plasm id F, w hich indicates a com m on ancestry. In contrast, the oriT of Incll plasm ids Collb and R64 share sim ilar properties to the oriT of IncP plasm ids RP4 and R751 (Lanka and W ilkins, 1995). The oriT of both Collb and R64 can be found situ ated adjacent to the leading region of each plasm id. If the hybrid hypothesis is correct th en there m u st be a point w ithin the leading region of these tw o In c ll p lasm id s w here sequence sim ilarities w ith plasm id F diverge. U nfortunately, the lack of sequence data available for the leading region of plasm id F has p rev e n ted this point from being determ ined. How ever, w ork is currently u n d e rw ay to determ ine the com plete nucleotide sequence of F. Plasm id C ollbdrd only specifies one particular species of oriT. Removal of 14 b p from a ro u n d the nic site elim inated the p la sm id 's ability to self­ transfer by conjugation. These results rule out the possibility of another o r iT stru c tu re b ein g p re se n t on Collb other th an that d eterm ined by Rees et al. (1987). If a second o r iT w ere present on Collbdrd, then transfer of the plasm id by conjugation w o u ld not be elim inated by a lesion in the orthodox oriT locus, unless the second o r iT operates u n d er alternative conditions to those induced d u rin g intraspecific conjugations betw een E. coli. 79 Chapter Five Analysis of the Alleviation of Type I and Type II Restriction Using Non-Transferable Mutants p N A ll and pNA12 5.1 Introduction C o n ju g a tiv e tra n s fe r of C o l l b d r d is an essential req uirem ent for expression of the p lasm id 's anti-type II restriction alleviation function. The p u rp o se of th is c h ap ter is to determ in e the stage d u rin g conjugation w hen restrictio n -allev iatio n occurs; is it m atin g -p air form ation, DNA transfer or th ro u g h o u t th e cycle? To test this, the C o l l b d r d o r i T m u tan t described in C hapter Four w as exam ined for its ability to confer protection on transferring R751. O th er e x p erim en ts described include exam ining w hether the C o l l b d r d or iT m u ta n ts im m o b ilised in d o n o r strains could protect transferring R751 from EcoKI an d EcoRI restriction. 5.2 Results 5.2.1 Rescue of R751 from EcoRI restriction by C ollb d rd o r i T m utant p N A ll This section is aim ed at determ ining the stage in the Collb conjugation cycle w h en EcoRI restrictio n is alleviated. An interrupted m ating experim ent was p erfo rm ed on filters using a donor strain harbouring R751 and an EcoRI restricting recipient h a rb o u rin g p N A ll. The num bers of R751 transconjugants form ed in the restricting recipients w ere m easured throughout the experim ent and the results are p resen te d in Fig. 5.1 (A). Over a 40-m inute period the yield of R751 tran sco n ju g an ts p ro d u ced in recipients harbouring p N A ll w as very sim ilar to th at p ro d u c e d in a recipients harbouring pLG221. Such results rule out the possibility th a t the alleviation function is expressed during the act of DNA transfer. 80 Fig. 5.1. Transfer of R751 to an EcoRI restricting recipient harbouring p N A ll. (A) R751 tran sconjugants from conjugations of GI65R(R751) d o n o rs and either recip ien ts GI65N (O, control), GI65N (N TP14) (#, c o n tro l), GI65N (N TP14, pLG221) (■, control) or GI65N(NTP14, p N A ll) (♦). (B) R751 tran sco n ju g an ts from conjugations of GI65R(R751) d on ors and recipients G I65N (O, control) or GI65N(NTP14) ( • , control) and GI65R(R751, p N A ll) d o n o rs an d recip ien t GI65N(NTP14) (■). T ransconjugants w ere selected on N A p la te s containing N al and Tc. D ata are the m ean values of three experim ents. Log^Q transconjugants per ml K> 4^ Log^Q transconjugants per ml ctq’ CO N> o 3 n> B Time (min) H O o o 4* On Qo * To explore the potential role of m ating-pair form ation in the alleviation of EcoRI restriction, it w as exam ined w hether p N A ll could confer the sam e degree of p ro tec tio n on transferring R751 w hen p resen t in the donor rather th a n th e rec ip ie n t. The p re d ic tio n is th at R751 sh o u ld be pro tected if restriction is alleviated by the act of form ing m ating pairs. To this end, a donor strain h a rb o u rin g R751 and p N A ll w as m ated w ith an EcoRI restric tin g recipient. P lasm id p N A ll is transfer defective and will n o t be able to transfer itself. H ow ever, the plasm id encodes all other transfer functions including the ability to form m a tin g pairs. The yields of R751 transconjugants form ed d uring such experim ents are presented in Fig. 5.1 (B). These results show quite clearly th a t n o p ro te c tio n is co n ferred on R751 by p N A ll in the donor, suggesting th at restriction-alleviation is not a product of the act of m ating-pair fo rm atio n . In a d d itio n , this re su lt show s th at alleviation of restrictio n req u ire s th e p re se n c e of the C o l l b d r d plasm id in the restricting strain. T herefore, th e p o te n tia l for the genetic pathw ays involved in m ating-pair form ation b eing im plicated in restriction-alleviation cannot be ruled out. O ne im p o rta n t discovery, w hich w as m ade d u rin g these experim ents, w as that the EcoRI p ro d u cin g plasm id NTP14 was m obilised to the R751 donor stra in b y th e C o llb d rd plasm id in the restricting recipient. This issue is exam ined fu rth er in the follow ing section. 5.2.2 M obilisation o f the EcoRI producing plasmid NTP14 by Collbdrd It w as o b se rv e d d u rin g experim ents described in section 5.2.1 th at plasm id NTP14, w h ich encodes the type II R-M system EcoRI, w as m obilised from a stra in also h a rb o u rin g a C ollbdrd plasm id. To test this further, an in te rru p te d m atin g experim ent w as set up on filters betw een a donor strain h arb o u rin g b o th NTP14 and Collbdrd and a RifR recipient strain. The num bers of NTP14 tran sco n ju g an ts pro d u ced w ere m easured by selecting for RifR and A p R cells (Fig. 5.2). It is clear from these data that Collbdrd plasm ids pLG221 and p N A ll are able to m obilise plasm id NTP14 efficiently. A lthough plasm id 81 Fig. 5.2. M obilisation of NTP14 by Collb. NTP14 tran sco n ju g an ts from conjugations of G I65N (N TP14, pLG221) (□) or G I65N (N TP14, p N A ll ) (O) and G I65N (N TP14, pLG 272) (A) d o n o rs a n d recip ien t GI65R. T ransconjugants w ere selected on N A p la te s containing Rif a n d Ap. D ata are the m ean values of tw o experim ents. Log^Q transconjugants per ml hrj N3 O H • *1• 3n> 3M* 3 OJ o o <X\ 00 p N A ll cannot tran sfe r itself, it still specifies all functions required in trans for transfer. A t 40 m in u te s, p N A ll m obilised NTP14 w ith an efficiency of 1.0 relative to pLG221 directed m obilisation (Fig. 5.2). In contrast, the num ber of NTP14 tran sco n ju g an ts w as red u ced by 141-fold w hen the mobilising plasm id was Collb w ild typ e. This red u ctio n can be attributed to the repression of the Collb conjugation system . The p o ten tial im plications of these findings is that during experim ents described in section 5.2.1 C ollbdrd directed m obilisation of NTP14 to R751 donors m ay result in R751 being m odified at its EcoRI target sites before entry to the EcoRI restrictin g recipients. Therefore, the R751 plasm id w ould acquire EcoRI protection before entry. This notion is exam ined further in C hapter Six. 5.2.3 A n o n -tra n sfe ra b le C ollbdrd confers no EcoKI protection on R751 In C h a p te r T hree, the genetic pro ced u re involving plasm id R751 w as used to show th at th e A rdA system of Collb operates optim ally w hen specified by a tra n sfe rrin g p la sm id (Fig. 3.5 A and Fig. 3.6 C). H ow ever, from such results it w as im possible to determ ine in w hich strain enhanced production of the A rdA p ro te in o ccu rred . It m ig h t occur in the donor strain if the A rdA protein is tra n sfe rre d to the recipient d u rin g conjugation, as is the Collb Sog pro tein (Rees a n d W ilkins, 1989). To test this idea, an in terru p ted m ating experim ent w as p e rfo rm e d on filters u sing a donor strain harbouring bo th R751 an d p N A ll( A r d A + ) w ith an EcoKI restricting recipient. Samples of the m atin g m ix w e re re m o v e d transconjugants (Fig. 5.3 A). and e x am in e d for th e n u m b e r of R751 R esults clearly show th at p N A ll is unable to confer any p ro tec tio n on tran sferrin g R751 DNA from destruction by EcoKI. Such a finding ru les o u t the possibility th at the A rdA protein is transm issible by conjugation. A sim ilar m a tin g ex p erim en t w as carried out using an R751 donor strain h a rb o u rin g pN A 12(A rdA ') (Fig. 5.3 B). Again, no significant increase in the n u m b er of R751 tran sco n ju g an ts w as observed. 82 Like ArdA , the second Fig. 5.3. T ransfer of R751 from a donor harbouring p N A ll(A r d A +) or pNA12(ArdA") to an EcoKI restricting recipient. (A) R751 tran sco n ju g an ts from co njugations of NM 654R(R751) d o n o rs an d e ith e r re c ip ie n ts N M 816N (O, c o n tro l) or G I65N NM 654R(R751, p N A ll ) d o n o rs a n d re c ip ie n t G I65N (# , (■). c o n tro l) (B ) and R751 tra n s c o n ju g a n ts from c o n ju g atio n s of NM 654R(R751) d o n o rs a n d e ith e r rec ip ie n ts N M 816N (O, control) or GI65N (#, control) a n d NM 654R(R751, pN A 12) d o n o rs an d recipient GI65N (■). N A p la te s c o n ta in in g N al and Tc. ex p erim en ts. T ransconjugants w ere selected on D ata are the m e a n v a lu e s of th re e Fig. 5.3 Log^p transconjugants per ml EcoKI, ArdA+ -O ' 6 - 4- 0 10 30 20 Tim e (min) 40 LogiQ transconjugants per ml EcoKI, ArdA 6 - 4- 0 10 20 30 Time (min) 40 EcoKI allev iatio n function encoded b y Collb is not transm issible through the donor. In stead , the alleviation process requires the presence of Collb in the restricting strain. 5.2.4 C ollb d rd transfer does not m ediate retrotransfer of NTP14 D ata p re s e n te d in section 5.2.2 clearly show s th at Collbdrd is able to efficiently m obilise plasm id NTP14. It is proposed th at Collbdrd m ight itself evade d e stru ctio n by EcoRI d u rin g conjugation by m ediating retrotransfer of NTP14 from th e recip ien t cell to th e d onor cell; such that transm ission of NTP14 to the d o n o r cell resu lts in th e im m ediate expression of the gene encoding the EcoRI m ethylase en zy m e w hich in tu rn m odifies all 20 EcoRI target sites p re se n t on the backbone of Collbdrd before the plasm id enters the restrictin g re c ip ie n t by conjugation. The phenom enon of retrotransfer has been described as the transm ission of chrom osom al or plasm id DNA from the recipient cell to th e d onor cells su p p o rtin g conjugation (Sia et al., 1996). If this is how Collb ev ad e s type II restriction then it m ight explain the 6-10 m inute delay in C ollbdrd tran sco n ju g an t form ation in an EcoRI restricting recipient harbouring NTP14 (Fig. 3.2 A). The delay being attributed to the time taken for transm ission of N TP14 to the d onor and m ethylation of the 20 EcoRI targ e t sites present on C ollb. Also, the requirem ent for m ultiple rounds of transfer of Collbdrd to the restrictin g recipient could be explained in that this is required to initiate retro tran sfer of NTP14. To test th is idea an in te rru p te d filter m ating experim ent was perform ed betw een a d o n o r stra in h arb o u rin g Collbdrd plasm id pLG221 and a recipient strain h a rb o u rin g NTP14. The n u m b er of pLG221 transconjugants produced an d th e n u m b e r of d o n o r cells th a t acq u ired NTP14 w ere m e a su re d th ro u g h o u t the m a tin g (Table 5.2.1). No donor cells that acquired plasm id NTP14 w ere d e te c te d th ro u g h o u t th e 40-m inute d u ratio n of the m atin g experim ent d esp ite successful transm ission of pLG221 to the EcoRI restricting recipient. These resu lts suggest th at transfer of Collbdrd cannot potentiate the 83 Table 5.2.1. Analysis of Collb-mediated retrotransfer of mobilisable plasmid NTP14 Donor Recipient pLG221 NTP14 Transconjugants ml"1 Transconjugants ml-1 GI65R(pLG221) GI65N 2.6 x 107 GI65R(pLG221) GI65N(NTP14) 1.0 x 107 <10 2.0 x 107 1.1 xlO 7 GI65N(pLG221/ NTP14) GI65R 1 Equal numbers of donors and recipients cells were mixed together to give a concentration of 108 cells of each strain per ml. A 40 minute filter mating was performed at 37°C. The numbers of pLG221 (KmR) and NTP14 (ApR) transconjugants were determined by plating the mating mix on NA plates containing the appropriate antibiotic selection. Plasmid NTP14 is mobilised with a frequency of 0.55 relative to the transfer of pLG221, where mobilisation frequency is the number of mobilised plasmids transferred per conjugative plasmid. retrotransfer of NTP14. This is therefore not how C o l l b d r d evades destruction by type II restriction enzym e, EcoRI. 5.3 Discussion Three im p o rta n t conclusions can be d raw n from data in this chapter. First, the Collb A rd A p ro tein is n o t transm issible from the donor. Evidence is th at a C ollbdrd (p N A ll) plasm id im m obilised in a donor strain could confer no protection on tran sfe rrin g R751 from EcoKI restriction in the recipient (Fig. 5.3 A). In contrast, C ollbdrd (pLG221) itself confers com plete protection of R751 (C hapter 3, Fig. 3.5 A). These experim ents indicate that enhanced expression of the a r d A gene occurs in the recipient cell shortly after entry of the C ollbdrd plasm id. Second, like A rdA , the com ponents m aking up the unidentified a r d A independent anti-ty p e I system are n o t transm issible by conjugation (Fig. 5.3 B). In sum m ary, a llev iatio n of EcoKI restrictio n by Collb (ArdA') can only occur w ithin the cell th a t contains the plasm id. The m echanism by which restriction is alleviated is ex p re ssed constitutively by a drd+ and drd Collb plasm id (see C hapter Three). T hird, C o llb d rd -m ed ia te d alleviation of EcoRI restriction appeared to occur d u rin g the fo rm atio n of m ating-pairs (Fig. 5.1 A). Further experim ents indicated th at th e m ech an ism has to be expressed in the strain encoding the restrictio n sy stem (Fig. 5.1 B). T hese experim ents led to the finding th at C o llb d rd is able to m obilise efficiently the EcoRI p ro d u cin g plasm id NTP14 (Fig. 5.2). It m ay be possible th at m obilisation of NTP14 to the R751 donor results in the m o d ifica tio n of R751 EcoRI target sites before the plasm id is transferred to the restricting recipient. If such a hypothesis were correct, then it w ould explain a n u m b e r of observations m ade du rin g experim ents described in C h a p te r T hree. T hese include th e initial lag in R751 transconjugant p ro d u ctio n in re stric tin g recipients h arb o u rin g Collbdrd (Fig. 3.6 A). The lag reflects the tim e ta k e n for NTP14 to be m obilised to R751 donors and for 84 com plete m e th y la tio n of the R751 replicon. This issue w ill be ad d ressed furth er in C h ap ter Six. A n o th e r o u tco m e of th is fin d in g w as th a t it p ro v id e d a possible explanation as to how Collbdrd itself m ight avoid destruction by EcoRI during conjugation. The idea being th a t C ollbdrd m ediates retrotransfer of NTP14 from the recipient to the Collbdrd d onor resulting in m odification of the EcoRI target sites of th e plasm id. H ow ever, results in Table 5.2.1 clearly show this not to be the case. The p o ssib ility th at a r d A -in d e p e n d e n t alleviation of EcoKI by Collb is m ediated by m obilisation of the EcoKI R-M genes can be ruled out by the fact that these genes are located on the E. coli chrom osom e. One o th er dilem m a that needs to be addressed is that m obilisation of the EcoRI R-M genes sh o u ld result in som e killing of recipient cells. All these issues involving the m obilisation of NTP14 are addressed in C hapter Six. 85 Chapter Six Construction of a mob Mutant of Plasmid NTP14 6.1 Introduction D ata in C h a p te r Five clearly show ed that C ollb d rd is able to mobilise efficiently th e EcoRI p ro d u c in g p la sm id NTP14 w hen resident in the same strain (Fig. 5.2). The aim of this chapter is to establish w hether the protection conferred on tra n sfe rrin g R751 (Fig. 3.6 A) from EcoRI restriction is CoXSbdrdm ed iated or the co n seq u en ce of N TP14 m obilised to the R751 donor. To address this d ilem m a it w as necessary to construct a m utant of NTP14, w hich was unable to be m obilised by a conjugative plasm id. D etails c o n cern in g the n a tu re and organisation of NTP14 are lim ited. The only p u b lish e d d a ta available is in Smith et al. (1976), w ho reported that NTP14 w as isolated as a recom binant plasm id present in a strain of E. coli after an u n id en tified f i + R -factor encoding the EcoRI R-M system and a num ber of antibiotic resistan ce d e te rm in a n ts w as m obilised by F-like plasm id R1-19K*. NTP14 is 17 kb in size an d has a copy num ber of -14 per chromosome. The plasm id encodes th e ty p e II R-M system EcoRI, resistance to am picillin and colicin E l. From D N A -D N A re-association experim ents, the com position of NTP14 w as d e te rm in e d to be m ade u p of 33% NTP1, 33% ColEl and 34% Rl19k* (Smith et al., 1976). The e x p e rim e n ta l d esig n for creating and isolating a mob m utant of NTP14 w as m ad e difficu lt d u e to the lim ited data available for the plasm id. C onsequently, several attem p ts w ere m ade to isolate a m utant. The first was based on the fact th a t NTP14 shares 33% of its genom e w ith ColEl. The mob region of C olE l consists of a nic site also know n as bom (basis of mobility) and four related m o b genes, m b e A , mbeB, m b e C and m b e D (Boyd et al., 1989). Like NTP14, C olEl also possess a unique Clal cleavage site, which in ColEl is located in the largest m o b gene, m b e A (Boyd et al., 1989). Based on the fact that each 86 plasm id co n tains a un iq u e Cl a l cleavage site, it w as assum ed that NTP14 had inherited its m o b region from C olE l. The Clal site of NTP14 was utilised for the insertion of an antibiotic resistance m arker gene w ith the aim of disrupting the p u ta tiv e m o b region. H o w ev e r, none of the recom binant p lasm id s isolated, typified by pNA 14, dem onstrated a Mob' phenotype. The se co n d a tte m p t to iso late a M ob' m u ta n t of NTP14 involved ra n d o m tr a n s p o s o n m u ta g e n e sis of the p la sm id u sin g Tn5. Several N T P 1 4 ::T n 5 in se rtio n s w ere iso lated ; yet, again n o n e determ ined a M ob' phenotype. The failure of these tw o attem pts m ay be the consequence of the p lasm id 's copy n u m b e r such th at th e recom binant plasm ids m ay be p a rt of a m ixed p o p u la tio n w h ere the original NTP14 plasm ids are able to provide the m issing function. This prevents the M ob' phenotype from being expressed. The fin al a tte m p t to iso late a. m u ta n t invo lv ed cloning restrictio n fragm ents of N TP14 into a non-m obilisable cloning vector w ith the aim of isolating a reco m b in an t plasm id w ith an R ri+, Mob" phenotype. The choice of cloning vector for these experim ents w as critical. It w as im portant to choose a vector th at could n o t be m obilised by Collbdrd yet had a sim ilar copy num ber to th at of NTP14. C lo n in g vector pBR322 has a copy num ber of -2 2 p er chrom osom e b u t w as unsuitable for these experim ents as it carries the ColEl b o m reg io n a n d can th erefo re be m obilised by a n u m b er of conjugative p lasm ids (Finnegan an d Sherratt, 1982). Instead, cloning vector pBR328 w as chosen, w h ich is a d eriv ativ e of pBR322 b u t does not carry the ColEl b o m sequence (Soberon et al., 1980; Sam brook et al., 1989). To th is e n d , e x p e rim e n ts d escrib ed in this c h ap ter include; first, restrictio n en zy m e an aly ses of p lasm id NTP14; Second, the details of the cloning of restriction fragm ents of NTP14 into pBR328 as well as the tests used to d eterm in e w h ic h reco m b in an ts w ere R ri+, Mob'; T hird, the transm ission kinetics of R751 a n d C ollbdrd to a recipient cell harb o u rin g the Rr i+, M ob' recom binant are described to confirm the restriction phenotype, and finally the c o n ju g a tio n e x p e rim e n ts are d e sc rib e d , w h ich d e te rm in e w h e th er the 87 protection conferred on transferring R751 from EcoRI restriction is m ediated by C o llb dr d in th e recipient or the consequence of NTP14 being mobilised to the R751 donor. A d d itio nal experim ents described in this chapter include exam ining the viability of u n m o d ified recipient cells th at acquired the transm itted EcoRI R-M genes by conjugation. 6.2 Results 6.2.1 Restriction analysis of plasm id NTP14 As a p re re q u isite to the cloning of restriction fragm ents of NTP14 a physical m ap of the p lasm id h a d to be generated. C onstruction of the m ap involved restrictio n enzym e analysis of NTP14 using enzym es that cleave the plasm id no m ore th a n twice. These enzym es included H m dlll, Clal, Sail, Bgll l and Bam H l. To g enerate useful d ata for m apping the cleavage sites on NTP14, single, double an d trip le digests w ere perform ed. The combinations used and the resu ltin g re stric tio n d ata o b tain ed are p resen ted in Table 6.2.1. The restriction m ap of NTP14 is presented in Fig. 6.1 A. The location of the EcoRI R-M genes on NTP14 is unknow n. How ever, the location of th ese genes is k n o w n for E. coli plasm id R113. The EcoRI re stric tio n e n d o n u c le a s e an d m e th y la se genes are fo u n d w ith in close proxim ity of each o th er an d span 2.2 kb of R113. The nucleotide sequence of these genes a n d su rro u n d in g regions has been d eterm ined and contains an in tern al 1.8-kb H m d lll fragm ent 1.4-kb upstream of a Sail site (Greene et a l , 1981; EMBL: J01675). Interestingly, NTP14 also has a 1.8 kb H m dlll fragm ent located 1.4-kb from a u n iq u e Sail site (Fig. 6.1 A). From these similarities it is p rop o sed th a t the EcoRI R-M genes of NTP14 lie in the vicinity of the 1.8 kb H zndill fragm ent. C onsequently, the strategy for cloning fragm ents of NTP14 w ith the aim of iso la tin g recom binants th at are Rri+ w as based around this observation. 88 Table 6.2.1 Restriction data for NTP14 (17 kb) Gel No. Lane No. Digest 1 2 HindUl 15.2 3 Bglll 17 4 Clal 17 5 Sail 17 6 Hindll/BglU 15.2 1.6 0.2 7 HindU l/Clal 11.1 4.1 8 H indU l/C lal/ Bglll 11.1 4.1 1.8 1.6 0.2 9 Clal/Bglll 11.3 5.7 10 11 S a il / C la l 9.7 7.3 13.8 1.8 9.7 4.1 2 Sail/HindUl 12 Sail/H indU l/ Clal 2 Ba m H l 3 Bam H l/H indlU 4 BamHl/Sail 5 Bam H l/H indU l/ Sail Fragment sizes (kb) 1.8 1.4 1.8 17 9.5 8.9 5.7 1.8 8.1 8.1 5.7 1.8 1.4 Gel 2 G e ll ■ 1 2 3 4 5 6 7 8 9 101112 1.4 1 2 3 4 5 Restriction fragm ents of NTP14 w ere separated on 0.8% agarose gels by electrophoresis. X x H i n d U l m olecular weight markers were run in lane 1 of each gel. The size of each restriction fragm ent of NTP14 was calculated and is shown in kb next to each gel and in the above table. Faint bands running at 3.8 kb in gel 1, lanes 2-5 and 10-11 and gel 2, lanes 2-5 are uncut NTP14. Fig. 6.1. Diagram depicting the construction of recom binant plasm ids pN A 16 and pNA17. (A ) P h y sical m a p of 17 kb p la sm id NTP14 sh o w in g th e ta rg e t sites for restrictio n e n zy m es H m d lll, B g ll l, C la l , B a m H l and S ail. ru n from th e u n iq u e C la l site. EcoRI R-M genes (' C o-ordinates in bp A lso show n is the su sp ec ted location of the as fo u n d on plasm id R113 on a 2.2 kb fra g m e n t o v erlap p in g the 1.8 kb H m dlll fragm ent (Greene et ah, 1981; EMBL database acc. no. J01675). M ap w as com piled using the restriction d ata in Table 6.2.1. (B) NTP14 w as digested w ith Sail and Clal yielding tw o fragm ents of 7.3 kb and 9.7 kb in size. T hese tw o frag m en ts w ere ligated in to the 4.2 kb S a l l - C l a l frag m en t of cloning vector pBR328 (th in line), form ing pN A 16 a n d pN A 17, resp ectiv ely . p N A 17 carries the EcoRI system ; pN A 16 does not. recom binants are non-m obilisable. B oth Fig. 6.1 Clal N T P14 H m d lll 12900 17000 bp Bglll 11330 H m d lll 11100 Sal I 9700 B EcoRI Ball PstI EcoRI Bal l Clal Pstl Clal Ava Ava I S a il ' on pN A 17 11585 bp p cS Sal I H m d lll H m d lll N ot To Scale Bglll pN A 16 13985 bp 6.2.2 Construction of pNA16 and pNA17 It is p ro p o se d th at the EcoRI R-M genes of NTP14 overlap the 1.8 kb H indU l frag m ent of the plasm id. To isolate recom binant plasm ids that carry these genes an d n o t the mob region of NTP14, a strategy w as em ployed, which involved cloning fragm ents of NTP14 avoiding d isru p tin g the 1.8 kb H m d lll fragm ent an d s u rro u n d in g region. To this end, the 7.3- and 9.7-kb Cl al -S al l frag m en ts of N T P14 w ere rem o v e d by restriction enzym e digestion and isolated from a 0.8% agarose gel after separation by electrophoresis. Both the 7.3- and 9.7-kb fragm ents w ere p urified from the agarose using the QIAEX II gel extraction kit (Q iagen) and ligated into the 4.2 kb Clal -Sa ll portion of pBR328 form ing reco m b in an ts pN A 17 an d pN A 16, respectively (Fig. 6.1 B). Isolation of these reco m b in an t plasm ids w as achieved by transferring the ligated DNA by tran sfo rm atio n to D H 5 a com petent cells. T ransform ant colonies that w ere A p R, C m R and Tcs w ere inferred to carry the vector and a cloned insert at the Tc gene. C onstructs w ere confirm ed initially by restriction enzym e analysis. 6.2.3 Characterisation of pNA16 and pNA17 C h a ra c te ris a tio n of re c o m b in a n ts pN A 16 a n d pN A 17 in v o lv e d exam ining th e m o b ilisatio n and restriction phenotypes of each plasm id. The Rri p h en o ty p es of pN A 16 and pN A 17 w ere exam ined by infecting their E. coli host strain (GI65N) w ith a X vir lacking EcoRI m odification and exam ining the n u m b er of p la q u e s form ed. The e.o.p of X v i r . K w as 1.0 in GI65N(pNA16) relative to its p la sm id free host a n d 0.001 in GI65N(NTP14). In contrast, the e.o.p of X v i r . K on strain s h arb o u rin g pNA 17 w as 0.001 (A ppendix I). These resu lts co n firm th a t p N A 17 h as a Rr i + p h e n o ty p e and pN A 16 an Rr i‘ p h en o ty p e, as p re d ic te d from restrictio n enzym e cleavage com parisons of NTP14 w ith R113 (Fig. 6.1 A). It w as assum ed that pN A 17 m ust also carry the EcoRI m ethylase gene to p re v e n t the host strain from attack by the EcoRI endonuclease c o m p o n en t. 89 Table 6.2.2. Mobilisation of recombinant plasmids pNA16 and pNA17 by Collbdrd Transconjugants per ml Donor GI65N(pLG221, NTP14) GI65N(pLG221, pBR328) GI65N(pLG221, pNA16) GI65N(pLG221, pNA17) ’'Mobilisation frequency Recipient pLG221 GI65R 2.7 x 107 2.4 x 107 It 4.3 x 107 <10 < 2.3 x 10’7 H 3.1 x 107 <10 < 3.2 x 10-7 H 1.5 xlO 7 <10 < 6.7 x 10‘7 Recombinant plasmid 0.88 The frequency of transfer of each plasmid was determined after a 40 minute filter mating at 37°C, using GI65N as the donor host strain and GI65R as the recipient. Plasmids pNA16, pNA17, pBR328 and NTP14 all confer resistance to ampicillin. Helper plasmid pLG221 confers resistance to kanamycin. Transfer of pNA16, pNA17, pBR328 and NTP14 was measured by plating the mating mixture on NA plates containing Ap (100 pg ml"1) and Rif (25 pg m l'1) and transfer of the helper plasmid measured by plating on NA plates containing Km (25 pg m l'1) and Rif (25 pg m l'1). ^Mobilisation frequency is the number of mobilisable plasmids transferred per conjugative plasmid. R ecom binants pNA 16 and pN A 17 were then tested for their ability to be m obilised by C ollbdrd. Test strains w ere m ade by transferring pNA16 an d pNA17 DN A by transform ation to GI65N(pLG221) and selecting for KmR, A pR and C m R. C o n ju g atio n e x p erim e n ts w ere set u p on filters using d onors described in T able 6.2.2 an d a recip ien t strain (GI65R). At 40 m inutes, the num bers of recipients that h a d acquired either pNA16 or pNA17 were scored by plating on N A p lates containing Rif and Ap (Table 6.2.2). Despite efficient transfer of pLG221, n either pNA16 n o r pNA17 w as detected in recipient cells. These results co nfirm th at the cloning strategy described in section 6.2.2 w as successful in iso lating a recom binant plasm id that w as Mob’. In addition, these results also su g g est th at the mob region of NTP14 m ust overlap either the Cla l site or the S a l I site w h e re NTP14 w as cleaved as a p re lu d e to m ak in g constructs. To e x a m in e f u r th e r th e re s tric tio n p ro p e rtie s of pN A 17, th e tran sm issio n k in e tic s of pLG221 a n d R751 to a recipient h arb o u rin g the recom binant w ere m easu red . Two separate conjugation experim ents w ere set up on filters u sin g either GI65R(pLG221) or GI65R(R751) as donors and a N alR recipient h a rb o u rin g pNA 17. The yields of pLG221 and R751 transconjugants are show n in Fig. 6.2 A and B, respectively. EcoRI restriction still has the sam e effect on the tra n sm issio n of pLG221 and R751 if it is encoded by NTP14 or pNA17. These resu lts confirm th at pN A 17 is a suitable non-m obile source of the EcoRI R-M genes. Finally, th e colicin E l p roducing phenotypes of pNA16 and pNA17 w ere exam ined. E xperim ents w ere perform ed as described in C hapter Two, section 2.3.1. Test strains GI65N, GI65N(pNA16), GI65N(pNA17) and GI65N(NTP14) were exam ined for colicin pro d u ctio n using GI65N as an indicator strain. Only strain GI65N(NTP14) w as found to be colicin producing (data not show n), suggesting th at th e colicin E l gene, cea , of NTP14 overlaps either the Cl a l or Sail cleavage site of the plasm id. A dditionally, pNA16 and pNA17 w ere also exam ined for th e ir ability to confer im m unity against the lethal effects of 90 Fig. 6.2. Transfer o f pLG221 and R751 to a recip ien t strain harbouring recom binant plasm id pNA17. (A) pLG221 tran sco n ju g an ts from conjugations of GI65R(pLG221) donors an d recipients G I65N (O, control), GI65N(NTP14) (#, control) a n d G I65N (pN A 17) (■). (B) R751 tran sco n ju g an ts from conjugations of GI65R(R751) d o n o rs an d e ith e r re c ip ie n ts GI65N (pN A17) (■). G I65N (O, co n tro l); G I65N (N T P 14) (# , c o n tro l) or T ransconjugants w ere selected on N A p la te s contain in g eith er N a l/K m or N a l/T c. ► n era L°gio transconjugants per ml NJ ^ <T> Logio transconjugats per ml KJ 00 N3 O M_ 3M1 O Time (min) H * i# 9 (D a\ NJ O w o o ^ ON 00 colicin E l on th eir ho st strain. GI65N(NTP14) w as used as the test strain and GI65N, G I65N (pN A 16), G I65N (pN A 17) and GI65N(NTP14) as the indicator strains. O nly pN A 16 and NTP14 w e re able to confer any protection on their hosts strains in d icatin g th at the colicin E l im m unity gene, z'ram, of NTP14 can be found lo cated so m ew h ere w ith in the 9.7 kb C l a l - S a l l fragm ent of the plasm id (Fig. 6.1 A). 6.2.4 Collb cannot alleviate EcoRI restriction in trans To d e te rm in e w h e th e r the p ro te c tio n conferred on transferring R751 against EcoRI restriction (Fig. 3.6 A) is C ollb-m ediated or due to m obilisation of NTP14 to th e R751 d o n o r, a c o n ju g atio n experim ent w as carried out using pN A 17. M a tin g e x p e rim e n ts w e re set u p on filters betw een a d o n o r harbouring R751 an d a recipient h a rb o u rin g pLG221 and pNA17. Sam ples of the m ating cells w ere rem oved at intervals and exam ined for the num bers of R751 tra n sc o n ju g a n ts p ro d u c e d . F igure 6.3 clearly show s that no EcoRI protection is co nferred on tran sferrin g R751 by ColVodrd w hen the R-M genes cannot be m o b ilise d to the R751 d o n o r. These resu lts confirm th a t the protection o rig in a lly conferred on R751 from EcoRI restriction, as show n in Fig. 3.6 A, is n o t C ollb-m ediated. Such protection m ust be due to m obilisation of NTP14 to th e R751 d o n o r strain an d subsequent m ethylation of R751 before transfer. Therefore, it seem s likely th a t any evasion of EcoRI restriction observed d u rin g tran sfer of C o U b d r d (Fig. 3.2 A) m ust be due to a m echanism that can only operate on ColVodrd rather th an a trans -acting m echanism as suggested in C hapter Three (section 3.2.3). Such findings explain w hy no significant EcoRI protection is co n ferred on R751 by C ollb d r d w hen the tw o plasm ids are co­ transferred together (Fig. 3.3 A). 91 Fig. 6.3. T ransfer of R751 to a recip ient harbouring b o th C o llb d r d and recombinant plasm id pNA17 (R-M r i + ). R751 tra n sc o n ju g a n ts from co njugations of GI65R(R751) d o n o rs a n d e ith e r recipients G I65N (O, control); GI65N(pNA17) (•); GI65N(pLG221, pN A 17) (■) or G I6 5 N (p N A ll, pNA17) (A). containing N al an d Tc. T ransconjugants w ere selected on N A p lates LogiQ transconjugants per ml Fig. 6.3 6 - 4- 0 10 20 30 T im e (m in) 40 6.2.5 C onjugative transfer of the EcoRI R-M genes causes some k illin g of recipient cells Type II R-M genes are considered to be selfish as they tend to im pose selective p re s s u re fo r the m ain ten a n ce of the plasm id that encodes them (Naito et ah, 1995; O 'N eill et ah, 1997). H ow ever, this is not necessarily true for type I R-M system s. As described in C hapter One, section 1.5.3, the restriction activity of the EcoKI enzym e in a new E. coli host strain is m odulated by the ClpXP p ro tease en zy m e leaving its m odification activity in tact. The result of EcoKI re stric tio n a ctiv ity b e in g m o d u la te d is very little cell death after tran sm issio n of th e R-M genes to a new unm odified host (M akovets et ah, 1998). It has b e en sh o w n in experim ents described in this thesis that plasm id NTP14, w hich encodes the type II R-M system EcoRI, is mobilisable by Collbrfri (Fig. 5.2). This o bservation raises the question of how recipients of the plasm id su rv iv e th e a c q u isitio n of the n ew R-M genes. This section is aim ed at exam ining th e consequences for an unm odified E. coli strain of receiving the EcoRI R-M genes by transfer of the plasm id NTP14. The e x p erim e n t involved m ating donors harbouring p N A ll and NTP14 and an u n m o d ified recip ien t strain. The conjugations w ere set up on filters and the n u m b e rs of NTP14 tran sco n ju g an ts p ro d u ced w ere determ ined at intervals d u rin g th e m ating. The yield of NTP14 transconjugants w ere then co m p ared to th e in p u t n u m b er of recipient cells in o rder to evaluate the viability of th e rec ip ie n ts after th e acquisition of the EcoRI R-M genes. As a u sefu l c o n tro l, a sim ila r e x p e rim e n t w as p e rfo rm ed u sing a recip ien t h a rb o u rin g pJH 16. T his is a recom binant plasm id th at carries the EcoRI m ethylase gene (H e itm an et ah, 1989). The results of these experim ents are p re se n te d in T ab le 6.2.3. It is clear from th e percentage of NTP14 transconjugants p ro d u c e d in an unm odified host com pared to those produced in a host carrying pJH 16 that conjugative transfer of the EcoRI R-M genes on a plasm id m u st h av e resu lted in som e recipient cell death, although the effect is 92 Table 6.2.3. Analysis of the viability of an unmodified E. coli strain during the acquisition of the EcoRI R-M genes by conjugation Donor GI65N(pNAll, NTP14) GI65N(pNAll, NTP14) NTP14 transconjugants per ml Recipient Time (min) GI65R 0 7.5 x 107 0 ND 5 5.3 4.0 x 106 10 7.3 x 107 4.5 x 107 20 1.5 xlO 7 30 3.4 x 107 20 1.5 x 107 40 2.4 x 107 16 1.2 xlO7 0 6.5 x 106 0 <10 5 5.2 x 106 15 1.0 x 106 10 4.6 x 106 30 2.0 x 106 30 9.0 x 106 46 3.0 x 106 40 8.0 x 106 81 5.3 x 106 GI65R(pJH16) Recipients per ml *% transconjugants Equal numbers of donors and recipient cells were mixed together and a filter mating was performed at 37°C for 40 minutes. Samples of the mating cells were removed at intervals and examined for the number of NTP14 transconjugants by plating on NA plates containing Rif (25 pig ml4 ) and Ap (100 pg ml4 ). Throughout the experiment the number of recipient cells were monitoured by plating samples of the mating mix on Rif plates. Helper plasmid p N A ll carries a cis-acting lesion within the oriT which prevents its self­ transfer. However, the plasmid still specifies all other transfer functions in trans. Plasmid pJH16 is a recombinant plasmid carrying the EcoRI methylase gene. * % transconjugants were calculated by comparing the input number of recipient cells/m l to the number of NTP14 transconjugants/m l produced. Numbers are the mean value of three experiments. m inim al. T hese resu lts suggest th at, like other type II plasm id-encoded R-M system s te s te d (N a ito et al., 1995), the EcoRI g en es also display selfish behaviour. The fact th at n o t every recipient cell is killed by EcoRI could be due to either the activ ity of rep air system s em ployed by th e E. coli host strain to m end D N A cleavages or p ro d u c tio n of the m ethylase p rio r to synthesis of the endonuclease. 6.3 Discussion R esults in th is ch ap ter d e ta il the successful isolation of a recom binant plasm id w ith an R ri+, M ob' p h enotype. The plasm id, pNA17, was isolated by cloning th e 7.3 kb C l a l - S a l l fra g m e n t of NTP14 in to vector pBR328. In addition, pN A 17 w as inferred to specify a sim ilar R r i+ phenotype to NTP14 as determ in ed d u rin g ex p erim en ts exam in in g the transm ission kinetics of bo th R751 and ColT bdrd to a recipient h arb o u rin g either NTP14 or pNA17 (Fig. 6.2). D u rin g th e c o n stru c tio n of reco m b in an t pN A 17, several other notable o b se rv a tio n s w e re m ade w h ic h f u rth e r c h a ra c te rise NTP14. T hese observations are described below. The cloning of th e tw o C l a l - S a l l fragm ents of NTP14 resulted in the generation of tw o recom binant p lasm id s, pNA16 and pNA17, neither of w hich w ere m obilisable b y Col Vo dr d (Table 6.2.2). These findings indicate that the mob region of N TP14 m u st overlap either the Clal or Sail site of the plasm id. If the m o b reg io n of N TP14 is h o m o lo g o u s to th at of plasm id C olEl, then com parisons b e tw ee n the restriction data for the tw o plasm ids w ould suggest that d isru p tio n is d u e to cleavage by Clal (Section 6.1). Such cloning ex p erim en ts w ere also useful in indicating that the colicin El gene (cea) of NTP14, w hich m u st have been inherited from ColEl, can be found o v erlap p in g eith er the Cl a l or Sail cleavage sites of the plasm id (section 6.2.3). H ow ever, e x am in a tio n of the nucleotide sequence of ColEl revealed that there are no C l a l or S ai l cleavage sites present w ith in cea. It seems m ore likely th a t th e ce a g en e of N TP14 is n o t o v e rla p p in g one of these tw o 93 restriction sites a n d th at som e o th er explanation is necessary. Furtherm ore, it w as show n b y S m ith et al. (1976) th at about 1 /3 p o rtio n of NTP14 (~ 5.6 kb) consists of C o lE l D N A . For b o th the mob region an d cea gene of NTP14 to be inherited from C o lE l and be d isru p te d by Clal and Sail is impossible as these two sites are 7.3- a n d 9.7 kb a p a rt (Fig. 6.2). One possibility is that the ColEl derived D N A in NTP14 carries an insertion or is interspersed throughout the NTP14 genom e. D espite these com plications, the colicin E l im m unity gene imm w as fo u n d located w ithin the 9.7 kb Clal-Sall fragm ent of NTP14 (Fig. 6.1). F urther w o rk is re q u ire d to locate the exact locations of these three loci on NTP14. D espite the in terestin g observations that w ere m ade during the course of the w ork d e scrib e d in this ch ap ter, only one im p o rtan t conclusion could be draw n, w hich is rele v an t to the m ain b o d y of this thesis. That is, that C o l l b d r d does n o t allev iate EcoRI restrictio n in trans d u rin g its conjugative tran sfer (section 6.2.4, Fig. 6.3). It ap p ears th at Collb itself evades EcoRI restriction by a cis- acting a n ti-restrictio n function rath er than a tr an s - acting one as previously su g g ested (C h a p te r T hree, section 3.2.3). Such fin d in g s explain w h y no significant p ro tec tio n is conferred on R751 by C o l l b d r d against EcoRI restriction w hen the tw o p la sm id s are co-transferred together (Fig. 3.3 A). In addition, the h y potheses th a t co n ju g ativ e tran sfe r of Collbdrd alleviates EcoRI restriction either by alterin g th e physiological state of the recipient cell or by substrate saturation of the restrictio n enzym es can now be ruled out as both w ould have a trans- acting effect. H ow ever, the possibility that Collb carries another anti­ restriction gene c an n o t be ru led out, although the p ro d u ct of this gene w ould have to act in-czs. T hese a ssu m p tio n s w ill be discussed further in C hapter Eight. O ne o th er o b se rv atio n m ade d u rin g this w ork w as that transfer of the EcoRI R-M g en es b y co n ju g atio n decreases, only slightly, the viability of recipient cells. P resu m ab ly , any killing is due to the detrim ental effects of the endonucleolytic a ctiv ity of the EcoRI restriction enzym e on the unm odified 94 h o st genom e of the recipient cell. C onstruction of a restriction m u tan t of NTP14 for use in a control experim ent w ould have been useful in confirm ing th at killing of recipient cells is due to the EcoRI restriction enzym e and no o th er p la sm id -en c o d ed function. The results of tran sfe rrin g this m u tan t p lasm id to un m odified recipient cells could have been used to compare w ith the results obtained using w ild type NTP14 (Table 6.2.3). U nfortunately, due to the tim e co n straints of the project this useful NTP14 m u ta n t w as not m ade. As an alternative control, a strain harbouring recom binant plasm id pJH16 w as used as a source of a recipient w ith a m odified genom e. H ow ever, to w hat ex ten t the stra in s genom e is m ethylated as a result of h arb o u rin g such a p lasm id is n ot know n. Despite this, results in Table 6.2.3 suggest that the slight decrease in viability of recipient cells acquiring plasm id NTP14 is due to the acq u isitio n of g en es encoding the EcoRI restriction endonuclease. results will not be discussed further in this thesis. 95 These Chapter Seven Determination and Analysis of the Nucleotide Sequence Of the Collb Leading Region 7.1 Introduction This c h a p te r is concerned w ith the determ ination and analysis of the n u cleo tid e seq uence of the Collb leading region. A lthough peripheral to the m ain b o d y of th is thesis, such an investigation w as useful in pro v id in g an in te re stin g in sig h t in to a relatively uncharacterised region of a conjugative p lasm id . The lea d in g reg io n is defined as the first segm ent of a conjugative p lasm id to en ter th e recipient cell du ring bacterial conjugation. To date, only th ree genes h av e b e e n identified w ithin this region of C ollb, w hich are ssb, p s iB an d a r d A . A lth o u g h not essential for conjugation, the products of these genes h av e been sh o w n to play an im portant ancillary role in plasm id DNA m etab o lism (Loh et al., 1989; Jones et al., 1992; Read et ah, 1992; see C hapter O ne, section 1.6). Each gene is expressed early in a transient b u rst in the new ly in fe c te d tra n s c o n ju g a n t cell p ro m o tin g estab lish m en t of the im m ig ran t p la sm id (H ow land et al., 1989; Jones et al., 1992; Althorpe et al., 1999). H ow the e x p re ss io n of th e se g en es is enhan ced d u rin g c o n ju g atio n is u n k n o w n . H o w ev er, a n u m b er of hypotheses exist (Chapter One, section 1.6.5). The aim of th is ch ap ter is to use the sequence d ata for a num ber of p u rp o s e s in c lu d in g d e te rm in in g the genetic o rg a n isa tio n of the region, id e n tify in g m ore p o te n tia l pro tein coding regions and, th ro u g h sequence co m p ariso n s, d e te rm in in g their possible function or evolutionary ancestry. It w a s also h o p e d th a t ind icatio n s of the possible m echanism for enhanced ex p ressio n of these genes during conjugation m ight be revealed. The w ork described in this chapter is the result of a joint effort w ith Dr S. B ates, a p o st-d o c to ra l researcher in this laboratory. 96 The recom binant p lasm id s co n structed by m yself and S. Bates for sequencing the Collb leading region are described in Fig. 7.1. Of the 11.7 kb of sequence data generated, 6.4 kb w as o b tain ed d u rin g this w ork and the other 5.3 kb by S. Bates. Analysis of the sequence d ata w as a joint effort. o b ta in e d d u r in g th e W ith the perm ission of Dr S. Bates, all data c o lla b o ra tio n is p re se n te d w ith in this ch ap ter. E xperim ents described include the strategy used to clone and sequence 11.7 kb of th e C ollb le a d in g region and the com puter-based analyses em ployed to identify in terestin g features w ithin the resulting sequence data. N o attem p ts w ere m ade to identify prom oters w ithin the leading region by exam ining the sequence data for the classical -10 and -35 RNA polym erase b in d in g and re c o g n itio n c o n sen su s sequences. S tu d ies of n u m e ro u s collections of p ro m o te rs identified in £. coli has led to a divergence from the o rig in al co n sen su s sequences p ro p o sed by Pribnow (-10) and M aniatis (-35) w here a base su b stitu tio n at each position is possible. Such degeneracy has led to the conclusion th a t a p rom oter can be found every 200 bp of sequence. Use of the consensus sequences to identify prom oters has to be treated w ith caution an d an y c a n d id a te s m u st be considered arbitrary until functional data are obtained (H enaut an d D anchin, 1996). 7.2 Results and Discussion 7.2.1 Strategy for determ ining the nucleotide sequence of the Collb leading reg io n The leading reg io n of Collb is found located adjacent to the oriT w ithin the larg est EcoRI (E l) frag m en t of the plasm id (Fig. 7.1 A). The length of the lead in g region is estim ated to be ~ 17 kb from oriT to the genes encoding UV p ro tectio n functions (imp). To generate a library of subclones suitable for DNA se q u en cin g , sm aller restric tio n fragm ents of the region w ere isolated from e ith e r pCRS4 or pCRS3 (Rees, 1986; Rees et al., 1987) an d ligated into the ap p ro p ria te cleavage sites of cloning vector pIC19H (Fig. 7.1 C). Recombinants 97 Fig. 7.1. Schem e sh ow ing the construction of a sm all library of subclones used to generate the 11.7 kb nucleotide sequence of the Collb lead in g region. (A) Physical m ap of the large EcoRI (E l) fragm ent of C ollb, w hich encom passes the leading region an d p art of the tran sfer region in c lu d in g or iT and the o r i T o peron. R estriction enzym e cleavage sites are indicated by B (BglTL), C {Clal), E (EcoRI), P (Psfl) and S {Sail). Also indicated are the p o sitio n s of ssb, psiB, a r d A a n d the nic site w ith in oriT. (B) S chem atic d ia g ra m sh o w in g th e stra te g y d e v is e d to d e te rm in e th e n u cleo tid e seq uence of both DNA stra n d s of the C ollb lead in g region. O ne a rro w (5'—► 3 ') m ay represent m ore th an one sequencing reaction to g enerate d a ta for one stra n d of the c o rresp o n d in g subclone. P rim ers u se d in th ese sequencing reactions are in C hapter Two, Table 2.1.4. (C) L ibrary of subclones constructed to generate sequence data for the C ollb lead in g region. R estriction enzym e cleavage sites are indicated. Subclones generated from pCRS4 (Rees, 1986) an d the resulting sequence data w ere by S. Bates. Subclones gen erated from recom binant pN A 3 a n d resulting sequence d a ta was p a rt of this w ork w ith the exception of pLG290 (Read et al., 1992). The source of the in sert for pNA 3 w as pCRS3 (Rees et al., 1987). Subclones pR R 2-l (Roscoe, 1996) pN A 6 and pCRS4 w e re used to g en erate sequence d a ta over restriction enzym e cleavage sites. The A rdA p h en o ty p es of pN A 6 an d pLG290 w ere exam ined and the results are sh o w n in A ppendix I. Fig. 7.1 kb 8 10 12 16 14 18 El fragment 20 Transfer operon Leading region “ I------- II------nic EPS l LL M r PCB_ P i psiB ssb ~ 1 P I llT I i ardA oriT 5' 3' 3' 5' B pRR2-l pCRS3 SS pCRS4 pNA3 pNA6 pSBCPl I f pSBCP2 ! pLG290 P s |pSBSP21 | P P I I pNA13 P P I pNAlO | u pNA9 C pNA2 J pLG 290 (Read et al., 1992) an d pN A 2 (A lthorpe et al., 1999; C hapter Four, section 4.2.1) w ere isolated during other studies. A ttem p ts w ere m ade by S. Bates to subclone the region upstream of ssb. H o w ev er, all recom binants isolated resulted in rearrangem ents of the Sall-Pstl insert. Such a p h en o m e n o n has also been described by H ow land et al. (1989) d u rin g the cloning of S4, w hich is a Sail fragm ent internal of the E l fragm ent of C ollb (Fig. 7.1 A ). U nfortunately, d u e to time constraints this problem w as n o t reso lv ed an d consequently no sequence data are available beyond 400 bp u p stre a m of ssb. Sequence reactions w ere perform ed as described in C hapter Two using eith er M 1 3 /p U C u n iv ersal forw ard or reverse prim ers or a specially designed p rim er as described in Table 2.1.4. Each subclone was sequenced completely on b o th stra n d s a cc o rd in g to the strategy show n in Fig. 7.1 B. G eneration of se q u en ce d a ta o v e r restrictio n enzym e cleavage sites w as achieved u sin g specifically d esig n ed prim ers and one of the larger subclones, pCRS4, pNA6 or p R R 2 -l. The seq u en ce data generated from these reactions, com bined w ith the d ata g enerated for the oriT region (C hapter Four, Fig. 4.2) w ere com piled using the GEL set of p ro g ram s from the GCG package to give a contig of 11,712 bp in length. The raw sequence data was then annotated and is show n in Fig. 7.3. In a d d itio n , this d a ta can also be found in the EMBL database accession num ber AJ238399. M uch of this w o rk is also published in Bates et al. (1999). 7.2.2 Id e n tific a tio n of p u ta tiv e p ro te in coding reg io n s (or/s) in the C ollb le a d in g region To identify p u ta tiv e protein coding regions w ithin the leading region of C ollb th e seq u en ce d a ta generated w as analysed u sing the GCG p ro g ram CO D O N PREFEREN CE a n d the £. coli codon usage table of highly expressed genes (G ribskow et al., 1984; H enaut and D anchin, 1996). Furtherm ore, the tra n sla tio n table w a s altered so that as well as ATG, GTG start codons w ere 98 Fig. 7.2. Protein coding regions encoded by Collb leading region. P utative p ro tein coding regions (or/s) and previously id en tife d genes (ssb, psiB an d ardA) w ere identified by analysis of the Collb leading region sequence data u sin g the GCG p ro g ram CODONPREFERENCE (w indow 25 bp) and the E. coli codon usage table of highly expressed genes. Only or/s w ith ATG or GTG start codons w ere searched for. Panel A sh o w s the analysis ru n n in g from ssb tow ards th e nic site of the oriT. P an el B show s the analysis ru n n in g in the opposite o rie n ta tio n from the n ic site to ssb. N ote th at the potential or/s identified, and ssb, psiB and a r d A are all transcribed tow ards oriT. Codon preference Codon preference OJ N> o T ■■3 aj iJ Or4 o o ij (ft cr ir> il w in co rp o rated in to the analyses. Both orientations of the 11.7 kb sequence w ere an aly sed an d the CODONPREFERENCE statistical plots and orfs identified are sh o w n in Fig. 7.2. In total, ten orfs (Fig. 7.2 and Table 7.2.1) w ere identified, th ree of w h ic h in clu d e p rev io u sly identified genes ssb, psiB and a r d A . The m ost com m on sta rt codon w as ATG found for eight of the ten orfs, w here the GTG sta rt co d o n w as found for or fl and or/3. TGA w as the only stop codon fo u n d . A d d in g to th e authenticity of these potential orfs, good m atches to the Shine-D algarno consensus sequence (3'-AUUCCUCCACUGCAU-5') (M athew s and v an H olde, 1990) for ribosom e binding were located for each orf betw een 48 b p u p stre a m of each sta rt codon. These ribosom e-binding sites (r b s ) are indicated in Fig. 7.3. The m o st strik in g feature of these new ly identified orfs is that like ssb, psiB an d a r d A , th ey are all orientated in the same direction tow ards oriT. Such a fin d in g raises the possibility th at these genes m ight be transcribed as p a rt of an o p e ro n ra th e r th a n as single separate units. In addition, the biological significance of th is o rganisation of the leading region m ay be th at the D N A stra n d d e stin e d for tran sfer (T -strand) form s the tem plate strand for m R N A synthesis. This m ay be im p o rtan t to ensure that each gene is expressed soon after infection of th e recipient cell by the im m igrant plasm id. The co d in g regions of all the orfs identified w ere translated u sing the co m p u ter p ro g ra m 'G ene Jockey IT (BIOSOFT). Both the nucleotide and am ino acid seq u en ces of th ese coding regions w ere com pared to other sequences p re s e n t in th e d a ta b a se s EMBL, GenBank and Sw issProt, available via the W orld W ide W eb (C hapter Two). 7.2.3 Sim ilarities b etw een D N A and protein sequences found in the leading region to those present in the databases A tte m p ts w e re m a d e to d e te rm in e the p o te n tia l fu n ctio n and ev o lu tio n ary h isto ry of the orfs identified w ithin the leading region of Collb. 99 A n aly ses in v o lv e d co m p arin g specific regions of eith er the nucleotide or p red ic te d am ino acid sequences of the leading region to those available w ithin the d a tab ases EMBL, GenBank and Sw issProt. The GCG program s used for these co m p ariso n s w ere FASTA, BLAST and TFASTA (Pearson and Lipm an, 1988; A ltsc h u l et al., 1990; H e n a u t a n d D anchin, 1996). H ow ever, once p o te n tia l h o m o lo g u e s had b een id en tified , the GCG p ro g ram BESTFIT w as often u sed to d eterm ine the am ount of sim ilarity and identity betw een the tw o seq u en ces. P o te n tia l seco n d ary stru c tu re s w ithin p ro tein sequences w ere identified u sin g the GCG p rogram MOTIF. In th e first p lace, seq u en ce com parisons in volved the p re v io u sly id en tified genes ssb, psiB and a r d A to establish if any new hom ologues could be fo u n d w ith in th e d atab ases to th o se found w hen the genes w ere first sequenced (H o w land et al., 1989; Jones et al., 1992; Delver et al., 1991). N o new hom ologues w ere fou nd for both ssb and psiB, despite the fact th at a n u m b e r of p lasm id s are know n to carry ssb- and psz'B-like sequences (Golub et al., 1988). H ow ever, a n u m b er of errors w ere found in the original sequence of p s i B , w h ich on correction resulted in both the nucleotide and p red icted am ino acid sequence form ing a better alignm ent w ith the tw o other seq u en ced psiB genes of plasm id F an d R6-5 (Dutreix et al., 1988). F and R6-5 PsiB p ro tein s are th e sam e size as Collb PsiB (144 residues) and display 85.4% and 84.6% id en tity to the Collb protein respectively. Such alignm ents suggest th at a stro n g er relationship exists betw een the genes of Collb, F and R6-5 than p reviously suggested by Jones et al. (1992). T hro u g h cross-hybridisation studies hom ologues of the Collb ar dA gene have b een found on a n u m b er of enterobacterial plasm ids including m em bers of th e I com plex (Il-B-K ), IncN and IncFV plasm ids (Chilley and W ilkins, 1995). Sequence sim ilarities have also been dem onstrated betw een Collb A rd A an d those of p lasm id s R16 (IncB), pKM lO l (IncN) and F0lac (IncFV), displaying 94.0%, 60.5% an d 56.4% identity, respectively. D uring our analyses w e found tw o n ew h o m o lo g u es of Collb A rdA , w hich w ere Orf79 of plasm id pM T l of 100 Yersinia pestis (causative agent of bubonic plaque) and O rfl8 of the conjugative tr a n s p o s o n Tn916. The p re d ic te d am ino acid sequences of these A rdA - h o m o lo g u es sh are 55% and 25% identity w ith that of Collb ArdA, respectively (H u et ah, 1998; EMBL accession n u m b er AF053947; Clewell et al., 1995). A nalysis of the or f found w ith in the sequence at base positions 8002-7283 (Fig. 7.3) rev ealed a sim ilarity to the p s i A genes of plasm id F and R6-5 (Loh et al., 1989). Like p s i A of F and R6-5, Collb p s i A also overlaps the TGA stop cod o n of the p s i B gene, w hich suggests that these tw o orfs are co-ordinately expressed from th e sam e transcript. The Collb PsiA protein, w hich is predicted to b e 239 a m in o acid resid u e s in length, shares 63.4% id en tity to the co rresp o n d in g p ro te in s of F and R6-5 w ith the m ajority of am ino acid changes lo cated in th e C -term in a l region of the peptide. The function of PsiA of p lasm id F an d R6-5 is unknow n although unlike PsiB it is not believed to be in v o lv ed in SOS in h ib itio n as PsiB confers the m ajor Psi phenotype (Bailone et al., 1988). The la rg e st o r f identified w ith in the sequence w as or/6, which can be fo u n d located b e tw ee n ssb and psiB at base positions 10527-8488 (Fig. 7.3). The p re d ic te d am in o acid sequence of Orf6 is 679 am ino acid residues w ith a m olecular w eig h t of 74 kDa (Table 7.2.1). Com parisons of the sequence of Orf6 w ith those fo u n d in the databases identified only one full-length hom ologue, Orf81 of Yersinia pestis plasm id pM T l (53% identity). The function of Orf81 is u n k n o w n (H u et al., 1998). O ther hom ologues identified included m em bers of the SpoOJ fam ily of pro tein s, w hose functions include DNA partitioning, and the KorB tran scrip tio n a l repressor of IncP plasm id RP4. These proteins share 20-30% id e n tity w ith O rf6 over 200 am ino acids resid u es. From these c o m p a riso n s, it is p re d ic te d th a t Orf6 m ight be a DN A b in d in g p ro tein , a lth o u g h th e p ro te in exhibits an overall net negative charge (Table 7.2.1). H o w e v e r, fu rth e r e v id e n ce to su p p o rt this notion is th at the am ino acid sequence of O rf6 contains a potential Helix-Turn-Helix (HTH; residues 186-205) of the A raC fam ily type. Furtherm ore, a potential P-loop structure is identified 101 Table 7.2.1. Orfs identified in the Collb leading region. ORF Co-ordinates Amino acid Mr from nic Residues (kDa) t Net charge Motifs § N otes ssb 11316-10789 175 19.2 3 ssDNA binding ssDNA -binding protein orf6 10529-8490 679 74.3 -18 P-loop HTH motif coiled coil DNA binding? p s iB 8435-8001 144 15.9 -8 - Inhibitor of SOS induction psiA 8004-7285 239 27.7 3 - U nknow n orf5 7155-5980 391 43.9 40 ardA 5531-5030 166 19.2 -25 orf4 4301-3867 144 16.5 2 orf3 3775-3590 88 10.5 6 orf2 3444-2503 313 35.8 -9 orfl 1409-561 282 32.2 -3 Possible transposase A ntirestriction protein - U nknow n U nknow n ya d D hom ologue U nknow n / m ethy lase? t Mr was predicted using the PEPTIDESORT program from the GCG package version 9.2 (Genetics com puter group, 1997) § Motifs were detected using the GCG program MOTIFS searching against the PROSITE database. Predicted helix turn helix (HTH) using the GCG program s HTHSCAN and COILSCAN respectively. Table is taken from Bates et al., (1999). at resid u es 178-185, w hich is indicative of an A TP/G TP biding site motif. Both these regions of Orf6 have a net positive charge (Table 7.2.1). T here is suggestive evidence th at expression of or/6 is enhanced during co n ju g ativ e tra n sfe r of Collb, since a protein of sim ilar size (82-kDa) to the p red ic te d Orf6 w a s found to increase in relative concentration during bacterial co n ju g atio n (Jones et al., 1992). The or/5 locus can be found at base position 7153-5978 in the sequence (Fig. 7.3). The p re d ic te d am ino acid sequence of Orf5 is 391 residues w ith a h ig h n e t p o sitiv e charge (Table 7.2.1). C om parisons of the sequence of Orf5 w ith o th er p ro te in s p resen t in the databases revealed extensive hom ologies to the p ro te in -c o d in g reg io n s found w ith in a n u m b er of u n u su al in sertio n elem en ts (IS) all rela te d to IS892. In fact, sim ilarities w ere detected over the full le n g th of O rf5 to th e probable transposases of IS elem ents IS! 253 of Dichelobacter n o d o s u s (41% identity, 52% sim ilarity; Billington et al., 1996; EMBL: U34772); IS 1 3 4 1 of therm ophillic bacterium PS3 (39% identity, 51% s im ila r ity , M u ra i et al., 1995); IS605 (t n p B ) of He lio cb a c te r pylori cag p ath o g en icity isla n d (34% identity, 45% sim ilarity, Censini et al., 1996; EMBL: U95957) a n d I S 8 9 1 of C y an o b acteriu m A n a b a e n a sp. (39% identity, 44% sim ilarity; B ancroft a n d W olk, 1989; EMBL: M24855). Hence, the p ro d u ct of or/5 ap p ea rs to be a transposase (Table 7.2.1). O ur analysis also revealed that like th e IS elem ents m en tio n ed above, the N -term inal 120 am ino acids of Orf5 h a v e a h ig h d e g re e of hom ology to VirE (81% identity) and VsdF (79% id e n tity ) , w h ic h Salmonella a re e n c o d e d by v iru le n c e -a sso c ia te d t y p h i m u r i u m and Salmonella p lasm id s from d u b l i n , respectively (Gulig et al., 1992; K rause et al., 1991). N o in v e rte d re p e a t sequences th at are classically associated w ith the en d s of IS elem ents could be found su rrounding or/5. H ow ever, as found for 1ST253, an im p e rfec t d irec t repeat sequence w ith internal dyad sym m etry, p o ssib ly d irectin g tran sp o sase recognition, was found flanking orf5 (Fig. 7.3). Interestingly, th e location of or/5 corresponded to a region of the sequence that 102 Fig. 7.3. 11.7 kb D N A sequence of the C ollb leading region. F e a tu re s sh o w n in clude the / lGTG sta rt a n d stop codons of ssb, p s i B , p s i A , a r d A , o r f l , orf2, orf3, orf4, o r f 5, a n d or/6. T he associated ribosom e b in d in g site (rbs) of each o r f is u n d e rlin e d . A m ino acid sequences d ed u ced fro m th e D N A sequences of all orfs a re p re s e n te d w h e re th e start of each tra n s la te d am ino acid is in uppercase. All orfs are tran scrib ed in the rightw ards d ire c tio n (5 '—► 3 '). Restriction enzym e cleavage sites are B g ll l, Clal, P s t l and Sail. T he s s i sites (indicated by single lines) are p u ta tiv e p ro m o te rs for rig h tw ard s tra n sc rip tio n of the transferring D N A stra n d . T hese ssi sites w ere identified by th e ir sim ila rity in sequence to F r p o , w h ic h fu n c tio n s as a sin g le-stra n d ed p ro m o te r in p lasm id F (Masai a n d A rai, 1997; EMBL: D90178). T he n ic site is indicated (A) an d the d ire c tio n of tra n sfe r of the plasm id to the re c ip ie n t d u rin g conjugation from this site is from rig h t to left (5-3'). Im p e rfe c t ( Z . _______ d ir e c t re p e a ts fla n k in g orfS in th e T -s tra n d and are: 5945 5' -TGCTCCCCGCCCTG - TCGGGCGAGGCTTCCC-3' 5974 7233 5 ’- TCCTCCCCACCCGCATTGGGCGAGGGTTCTC-3' 7263 are in d ic a te d Fig. 7.3. 11712 AGACAGCGCGGCACTGGACGCGAGGGACGTGTGGACTGCCACACACTCCTTTGACGCGGA 1 1 6 b 5 1 1 6 5 4 CTACCGCGTCGTCAGAAGGGAGGACGGCGGTGGAGACGCCGGACAAAGGGCGACACCTGA 1 1 5 9 5 1 1 5 94 AAGGCCCCCCGTCTGCATTGGGGGAGGGCAAGGTAGACAGGTCCCACTCTACGTGGCCGG 1 1 5 3 5 1 1 5 3 4 TGTTTTAAAAATCGGTACTTGGGCCGTTTTTTAAAACGCAGGCGGGGGCCTGTCGACAGG 1 1 4 7 5 1 1 4 7 4 GCACTCCGCCATTTGCCCGGGGGTAGCCTTTCCAGGTGACGCCCCTCTTTGGGGACACTT 1 1 4 1 5 AAGGGGCACATGAACGTCCCGTCTCCGGT ----------------- + --------------------+ --------------------- + H 3 5 5 11414 CACCTGCCGTCCTCCTCGAAAGCGAGGAGACTTCCCCGTGTACTTGCAGGGCAGAGGCCA 11354 ATCCGGTTTATCTCACATCTTTCAGTCAGGAGGTTCT'ljATGlAGTGCACGTGGTATCAACA -------------------- + ---------------------- + -------------------- + ------------------- + ------------------- + --------------------- + 1 1 2 9 5 TAGGCCAAATAGAGTGTAGAAAGTCAGTCCTCCAAGAATACTCACGTGCACCATAGTTGT rbs ssb M e tS e r A la A r g G ly lle A s n L y s 11294 AGGTCATCCTCGTCGGGCGTCTGGGCAATGATCCGGAGGTCCGTTACATCCCCAACGGTG -------------------- + ---------------------- + -------------------- + -------------------+ ------------------- + --------------------- + 1 1 2 3 5 TCCAGTAGGAGCAGCCCGCAGACCCGTTACTAGGCCTCCAGGCAATGTAGGGGTTGCCAC V a l 1 1 e L e u V a l G l y A r g L e u G l y A s n A s p P r o G l u V a l A r g T y r I l e P r o A s n G ly G l y PstI g c g c a g t g g c a a a c c t g c a g Ig t g g c c a c g t c a g a a a g c t g g c g c g a c a a a c a g a c g g g g g 11234 --------------------+ ----------------------+ --------------------+ ------------------- + -------------------- + ---------------------+ 1 1 1 7 5 CGCGTCACCGTTTGGACGTCCACCGGTGCAGTCTTTCGACCGCGCTGTTTGTCTGCCCCC A la V a lA la A s n L e u G ln V a lA la T h r S e r G lu S e r T r p A r g A s p L y s G ln T h r G ly G lu AGATGCGGGAGCAGACGGAATGGCACCGTGTTGTGCTGTTCGGCAAGCTCGCGGAAGTGG 1 1 1 7 4 -------------------- + --------------------- + -------------------- + -------------------+ -------------------- + + 11115 TCTACGCCCTCGTCTGCCTTACCGTGGCACAACACGACAAGCCGTTCGAGCGCCTTCACC M e t A r g G lu G lu T h r G lu T r p H is A r g V a lV a lL e u P h e G ly L y s L e u A la G lu V a lA la CAGGTGAATATCTGCGCAAGGGCGCGCAGGTCTACATCGAGGGGCAACTCCGTACCCGTA 1 1 1 1 4 -------------------- + --------------------- + -------------------- + -------------------+ -------------------- + + 11055 GTCCACTTATAGACGCGTTCCCGCGCGTCCAGATGTAGCTCCCCGTTGAGGCATGGGCAT G ly G lu T y r L e u A r g L y s G ly A la G ln V a lT y r lle G lu G ly G ln L e u A r g T h r A r g S e r 11054 GCTGGGACGACAACGGCATCACCCGCTACATCACTGAAATTCTTGTTAAGACCACGGGCA --------------------+ ----------------------+ --------------------+ -------------------+ -------------------- + + 10995 CGACCCTGCTGTTGCCGTAGTGGGCGATGTAGTGACTTTAAGAACAATTCTGGTGCCCGT T r p A s p A s p A s n G ly lle T h r A r g T y r lle T h r G lu I le L e u V a lL y s T h r T h r G ly T h r 10994 CCATGCAGATGCTGGGGAGTGCACCACAGCAGAACGCTCAGGCGCAACCGAAGCCTCAGC -------------------- + ---------------------+ -------------------- + -------------------- + --------------------+ + 10935 GGTACGTCTACGACCCCTCACGTGGTGTCGTCTTGCGAGTCCGCGTTGGCTTCGGAGTCG M e tG ln M e tL e u G ly S e r A la P r o G ln G ln A s n A la G ln A la G ln P r o L y s P r o G ln G ln 10934 AGAATGGGCAGCCACAGAGTGCTGACGCGACGAAAAAAGGTGGCGCGAAAACGAAAGGCC --------------------+ ----------------------+ -------------------- + -------------------- + ------------------- + + 10875 TCTTACCCGTCGGTGTCTCACGACTGCGCTGCTTTTTTCCACCGCGCTTTTGCTTTCCGG A s n G ly G ln P r o G ln S e r A la A s p A la T h r L y s L y s G ly G lv A la L y s T h r L y sG ly A r g 10874 GTGGACGTAAGGCCGCGCAGCCCGAGCCTCAGCCGCAAACGCCGGAGGGGGAGGATTACG --------------------+ ----------------------+ -------------------- + -------------------- + ------------------- + + 10815 CACCTGCATTCCGGCGCGTCGGGCTCGGAGTCGGCGTTTGCGGCCTCCCCCTCCTAATGC G ly A r g L y s A la A la G ln P r o G lu P r o G ln P r o G ln T h r P r o G lu G ly G lu A s p T y r G ly g g t t t t c a g a c g a c a t c c c g t t c Ftga It c g g g c t g a c t g t g a c a a c c g c c c c g c c c t g t g c 10814 --------------------+ ----------------------+ -------------------- + -------------------- + ------------------- + ---------------------+ 1 0 7 5 5 CCAAAAGTCTGCTGTAGGGCAAGACTAGCCCGACTGACACTGTTGGCGGGGCGGGACACG P h e S e r A s p A s p I le P r o P h e * 10754 GGGGCATCACCGGAGATATGAGGATGAGCGAATATTTCAGAATACTTCAGGGACTGCCGG --------------------+ ---------------------- + -------------------- + ------------------- + -------------------- + ---------------------+ 1 0 6 9 5 CCCCGTAGTGGCCTCTATACTCCTACTCGCTTATAAAGTCTTATGAAGTCCCTGACGGCC 10694 ACGGCTCCTTTACCCGCGAACAGGCGGAAGCCGTTGCCGCACAGTACCGGAACGTCTTTA --------------------+ ---------------------- + -------------------- + ------------------- + -------------------- + ---------------------+ 1 0 6 3 5 TGCCGAGGAAATGGGCGCTTGTCCGCCTTCGGCAACGGCGTGTCATGGCCTTGCAGAAAT 10634 TCGAGGATGACCACGGAGAACATTTTCGCCTGGTTGTCCGTAATAACGGAACAATGGTCT --------------------+ ---------------------- + -------------------- + ------------------- + -------------------- + ---------------------+ 1 0 5 7 5 AGCTCCTACTGGTGCCTCTTGTAAAAGCGGACCAACAGGCATTATTGCCTTGTTACCAGA 10574 GGGGTACCTGGAATTTTGAGGACGGTGCCGGGTACTGGATGAACdATGhcATCCGTGATT --------------------+ ---------------------- + -------------------- + ------------------- + -------------------- + ---------------------+ 1 0 5 1 5 CCGCATGGACCTTAAAACTCCTGCCACGGCCCATGACCTACTTGGTACGGTAGGCACTAA rbs orf 6 M e tP r o S e r V a llle 10514 TCGGGATTCTTAAGTAAGAGAGGTGCCGGACGCGCGAACGTCCGGCAGGACATAAGCAAT --------------------+ ---------------------- + -------------------- + ------------------- + -------------------- + ---------------------+ 1 0 4 5 5 AGCCCTAAGAATTCATTCTCTCCACGGCCTGCGCGCTTGCAGGCCGTCCTGTATTCGTTA S e r G ly P h e L e u S e r L y s A r g G ly A la G ly A r g A la A s n V a lA r g G ln A s p I le S e r A s n 10454 TATAAGGGGATGATTATGCCTGTAACGAAGTGTGAACCAGAAACCACCCGCAAAGCAAGC --------------------+ ---------------------- + -------------------- + ------------------- + -------------------- + ---------------------+ 1 0 3 9 5 ATATTCCCCTACTAATACGGACATTGCTTCACACTTGGTCTTTGGTGGGCGTTTCGTTCG T y r L y s G ly M e tlle M e t P r o V a lT h r L y s C y s G lu P r o G lu T h r T h r A r g L y s A la S e r 10394 CGTAAATCTGCAAAAACGCAGGAAACTGCCCTGTCTGCCCTGCTGGCGCAGACGGAGGAA --------------------+ ---------------------- + -------------------- + ------------------- + -------------------- + ---------------------+ 1 0 3 3 5 GCATTTAGACGTTTTTGCGTCCTTTGACGGGACAGACGGGACGACCGCGTCTGCCTCCTT A rgL y s S e r A la L y s T h r G ln G lu T h r A la L e u S e r A la L e u L e u A la G ln T h r G lu G lu 10334 GTGAGCGTGCCGCTGGCCTCACTGATTAAATCACCGCTGAATGTGCGCACGGTGCCGTAT --------------------+ ---------------------- + -------------------- + ------------------- + -------------------- + --------------------- + 2 0 2 7 5 CACTCGCACGGCGACCGGAGTGACTAATTTAGTGGCGACTTACACGCGTGCCACGGCATA 10274 TCTGTGGAGTCCGTCAGCGAACTGGCTGGGTCCATTAAGGGCGTTGGCCTGTTGCAGAAT ------------------- + ---------------------- + --------------------+ -------------------+ -------------------- + -------------------- +1 0 2 1 5 AGACACCTCAGGCAGTCGCTTGACCGACCCAGGTAATTCCCGCAACCGGACAACGTCTTA V a lS e r V a lP r o L e u A la S e r L e u I le L y s S e r P r o L e u A s n V a lA r g T h r V a lP r o T y r S e r V a lG lu S e r V a lS e r G lu L e u A la G ly S e r lle L y s G ly V a lG ly L e u L e u G ln A s n CTGGTCGTGCATACCCTGCCTGGTGACCGTTACGGTGTCGCCGCAGGCGGTCGCCGACTG •• 1 0 2 1 4 -------------------- + --------------------- + -------------------- + ------------------- + -------------------- + -------------------- +1 0 1 5 5 GACCAGCACGTATGGGACGGACCACTGGCAATGCCACAGCGGCGTCCGCCAGCGGCTGAC L e u V a lV a lH is T h r L e u P r o G ly A s p A r g T y r G ly V a lA la A la G ly G ly A r g A r g L e u GCAGCACTCAACATGCTGGCAGAGCGCAACATCCTTCCGGCTGACTGGCCTGTCCGCGTG 1 01 54 ---------------- +----------------- +---------------- +---------------- +---------------- +----------------- + 10095 CGTCGTGAGTTGTACGACCGTCTCGCGTTGTAGGAAGGCCGACTGACCGGACAGGCGCAC A la A la L e u A s n M e tL e u A la G lu A r g A s n lle L e u P r o A la A s p T r p P r o V a lA r g V a l Clal I AAAATTATTCCGCAGGCACTGGCGACGGCTGcETCGATbACCGAGAACGGTCATCGTCGG 10094 -------------- +-------------- +--------------+__L-------- J_+------------- +-------------- +1003 5 TTTTAATAAGGCGTCCGTGACCGCTGCCGACGTAGCTACTGGCTCTTGCCAGTAGCAGCC Ly s I le lle P r o G ln A la L e u A la T h r A la A la S e r M e tT h r G lu A s n G ly H is A r g A r g GATATGCACCCAGCCGAACAGATTGCCGGATTCCGCGCAATGGCGCAGGAAGGCAAAACA 10034 +----------------+------------- +---------------+--------------- +--------------- + 99?5 CTATACGTGGGTCGGCTTGTCTAACGGCCTAAGGCGCGTTACCGCGTCCTTCCGTTTTGT A s p M e t H is P r o A la G lu G ln lle A la G ly P h e A r g A la M e tA la G ln G lu G ly L y s T h r CCTGCGCAGACTGGCGACCTGCTGGGCTATTCACCCCGCCACGTCCAGCGAATGCTGAAA 9974 ------------ +------------- +-----------+------------ +-------------+-------------+ 99i5 GGACGCGTCTGACCGCTGGACGACCCGATAAGTGGGGCGGTGCAGGTCGCTTACGACTTT P r o A la G ln T h r G ly A s p L e u L e u G ly T y r S e r P r o A r g H is V a lG ln A r g M e t L e u L y s CTGGCAGACCTTGCACCCGTCATCCTTGATGCGCTGGCAGAAGACCGCATCACCACCGAA 9914 +---------------- +------------- +---------------+----------------+--------------- + 9855 GACCGTCTGGAACGTGGGCAGTAGGAACTACGCGACCGTCTTCTGGCGTAGTGGTGGCTT L e u A l a A s p L e u A l a P r o V a 1 1 1 e L e u A s p A l a L e u A l a G lu A s p A r g 1 1 e T h r T h rG l u CACTGTCAGGCGCTGGCGCTGGAGAACGACACCGCGCGTCAGGTGCAGGTGTTTGAAGCC 9854 i----------------- + |_ ).---------------- ^--------------- 1_9795 GTGACAGTCCGCGACCGCGACCTCTTGCTGTGGCGCGCAGTCCACGTCCACAAACTTCGG H is C y s G ln A la L e u A la L e u G lu A s n A s p T h r A la A r g G ln V a lG ln V a lP h e G lu A la GCCTGCCAGTCAGGATGGGGCGGTAAACCGGATGTGCGGGTTATCCGCAACCTGATTACC 9794 ---------------- +----------------- +----------------+-----------------+------------------+----------------- + 9 7 3 5 CGGACGGTCAGTCCTACCCCGCCATTTGGCCTACACGCCCAATAGGCGTTGGACTAATGG A la C y s G ln S e r G ly T r p G ly G ly L y s P r o A s p V a lA r g V a llle A r g A s n L e u I le T h r GAAAGTGAAGTGGCGGTGGCGGGGAACAGTAAATTCCGCTTCGTGGGGGCTGATGCCTTC 9734 -------------- +---------------+------------- +-------------- +--------------- +---------------+ 9675 CTTTCACTTCACCGCCACCGCCCCTTGTCATTTAAGGCGAAGCACCCCCGACTACGGAAG G lu S e r G lu V a lA la V a lA la G ly A s n S e r L y s P h e A r g P h e V a lG ly A la A s p A la P h e TCGCCAGACGAACTGCGCACCGATTTGTTCAGCGACGACGAGGGTGGCTATGTGGACTGC 9674 -------------- + + h --------------- 1------------------ 1--------------- k9615 AGCGGTCTGCTTGACGCGTGGCTAAACAAGTCGCTGCTGCTCCCACCGATACACCTGACG S e r P r o A s p G lu L e u A r g T h r A s p L e u P h e S e r A s p A s p G lu G ly G ly T y r V a lA s p C y s GTGGCGCTCGATGCCGCCCTGCTGGAAAAACTCCAGGCTGTCGCTGAACACCTTCGGGAA 9614 ---------------- +------------------ +---------------+-----------------+----------------- +----------------- + 9555 CACCGCGAGCTACGGCGGGACGACCTTTTTGAGGTCCGACAGCGACTTGTGGAAGCCCTT V a lA la L e u A s p A la A la L e u L e u G lu L y s L e u G ln A la V a lA la G lu H is L e u A r g G lu GCCGAAGGCTGGGAATGGTGCGCCGGGCGCATGGAGCCTGTCGGTGAGTGCCGTGAGGAT 9554 ---------------- +------------------ +---------------+-----------------+----------------- +----------------- + 9495 CGGCTTCCGACCCTTACCACGCGGCCCGCGTACCTCGGACAGCCACTCACGGCACTCCTA A la G lu G ly T r p G lu T r p C y s A la G ly A r g M e t G lu P r o V a lG ly G lu C y s A r g G lu A s p GCCGGAACATACCGCTGTCTGCCGGAGCCGGAAGCGGTGCTGACGGAGGCAGAAGAAGAA 9494 ---------------- +------------------ +---------------+-----------------+----------------- +----------------- + 9435 CGGCCTTGTATGGCGACAGACGGCCTCGGCCTTCGCCACGACTGCCTCCGTCTTCTTCTT A la G ly T h r T y r A r g C y s L e u P r o G lu P r o G lu A la V a lL e u T h r G lu A la G lu G lu G lu CGCCTGAACGAACTGATGACGCGTTACGACGCGCTGGAAAACCAGTGTGAGGAATCCGAC 9434 ---------------- +------------------ +---------------+-----------------+----------------- +----------------- + 9 3 7 5 GCGGACTTGCTTGACTACTGCGCAATGCTGCGCGACCTTTTGGTCACACTCCTTAGGCTG A r g L e u A s n G lu L e u M e t T h r A r g T y r A s p A l a L e u G l u A s n G l n C y s G lu G lu S e r A s p 9374 CTGCTGGAAGCAGAAATGAAGCTGATGCGCTGCATGGCGAAGGTCAGAGCGTGGACGCCG --------------------+ ----------------------+ --------------------+ -------------------- + -------------------+ ---------------------+ 9 3 1 5 GACGACCTTCGTCTTTACTTCGACTACGCGACGTACCGCTTCCAGTCTCGCACCTGCGGC L e u L e u G lu A l a G lu M e t L y s L e u M e t A r g C y s M e t A l a L y s V a lA r g A la T r p T h r P r o 9314 GAGATGCGTACCGGAAGCGGTGTGGTGGTGTCCTGGCGTTACGGTAATGTGTGTGTCCAG --------------------+ ---------------------- + -------------------- + -------------------- + -------------------+ ---------------------+ 9 2 5 5 CTCTACGCATGGCCTTCGCCACACCACCACAGGACCGCAATGCCATTACACACACAGGTC G lu M e t A r g T h r G ly S e r G ly V a lV a lV a lS e r T r p A r g T y r G ly A s n V a lC y s V a lG ln 9254 CGTGGTGTGCAGTTGCGCAGTGAAGATGACGCGGCTGACCGCACGGAACAGGTGCAGGAG --------------------+ ----------------------+ -------------------- + -------------------- + -------------------+ ---------------------+ 9 1 9 5 GCACCACACGTCAACGCGTCACTTCTACTGCGCCGACTGGCGTGCCTTGTCCACGTCCTC A r g G ly V a l G l n L e u A r g S e r G l u A s p A s p A l a A l a A s p A r g T h r G l u G l n V a l G l n G l u 9194 AAAGCATCAGTGGAGGAAATCAGTCTGCCACTGCTGACGAAAATGTCTTCCGAACGCACG ------------------- + ---------------------- + -------------------- + -------------------- + -------------------+ ---------------------+ 9 1 3 5 TTTCGTAGTCACCTCCTTTAGTCAGACGGTGACGACTGCTTTTACAGAAGGCTTGCGTGC L y s A la S e r V a lG lu G lu I le S e r L e u P r o L e u L e u T h r L y s M e tS e r S e r G lu A r g T h r 9134 CTGGCAGTCCAGGCGGCACTCATGCAGCAACCGGACAAATCTCTGGCACTGCTGGCATGG ------------------- + ---------------------- + -------------------- + -------------------- + -------------------+ ---------------------+ 9 0 7 5 GACCGTCAGGTCCGCCGTGAGTACGTCGTTGGCCTGTTTAGAGACCGTGACGACCGTACC L e u A la V a lG ln A la A la L e u M e t G ln G ln P r o A s p L y s S e r L e u A la L e u L e u A la T r p 9074 ACGCTCTGCCTGAATGTGTTTGGCAGCGGAGCGTACAGTAAACCAGCACAAATCAGCCTG ------------------- + ---------------------- + -------------------- + -------------------- + -------------------+ ---------------------+ 9 0 1 5 TGCGAGACGGACTTACACAAACCGTCGCCTCGCATGTCATTTGGTCGTGTTTAGTCGGAC T h r L e u C y s L e u A s n V a lP h e G iy S e r G ly A la T y r S e r L y s P r o A la G ln lle S e r L e u 9014 GAATGTAAACATTATTCGCTGACCAGCGATGCGCCGTCAGGGAAGGAAGGTGCCGCGTTC ------------------- + ---------------------- + -------------------- + -------------------- + -------------------+ ---------------------+ 8 9 5 5 CTTACATTTGTAATAAGCGACTGGTCGCTACGCGGCAGTCCCTTCCTTCCACGGCGCAAG G lu C y s L y s H i s T y r S e r L e u T h r S e r A s p A l a P r o S e r G l y L y s G l u G l y A l a A l a P h e 8954 ATGGCGCTGATGGCAGAAAAAGCCCGTCTTGCGGCCCTGTTACCGGAGGGATGGTCACGG ------------------- + ---------------------- + -------------------- + -------------------- + -------------------+ ---------------------+ 8 8 9 5 TACCGCGACTACCGTCTTTTTCGGGCAGAACGCCGGGACAATGGCCTCCCTACCAGTGCC M e tA la L e u M e t A la G lu L y s A la A r g L e u A la A la L e u L e u P r o G lu G ly T r p S e r A r g 8894 GACATGACGACGTTCCTGTCGCTCAGCCAGGAGGTGCTGTTATCCCTGCTCAGTTTCTGC ------------------- + -----------------------+ -------------------+ -------------------- + -------------------- + ---------------------+ 8 8 3 5 CTGTACTGCTGCAAGGACAGCGAGTCGGTCCTCCACGACAATAGGGACGAGTCAAAGACG A s p M e tT h r T h r P h e L e u S e r L e u S e r G ln G lu V a lL e u L e u S e r L e u L e u S e r P h e C y s 8834 ACCGCGTGCAGCATCCACGGCGTTCAGACCCGCGAGTGTGGTCACACGTCACGCAGCCCG ------------------- + -----------------------+ -------------------+ -------------------- + -------------------- + ---------------------+ 8 7 7 5 TGGCGCACGTCGTAGGTGCCGCAAGTCTGGGCGCTCACACCAGTGTGCAGTGCGTCGGGC T h r A la C y s S e r l l e H i s G ly V a lG ln T h r A r g G lu C y s G ly H is T h r S e r A r g S e r P ro 8774 CTGGATACGCTGGAGAGCGCCATCGGTTTTCACATGCGCGACTGGTGGCAGCCGACAAAA ------------------- + -----------------------+ -------------------+ -------------------- + -------------------- + ---------------------+ 8 7 1 5 GACCTATGCGACCTCTCGCGGTAGCCAAAAGTGTACGCGCTGACCACCGTCGGCTGTTTT L e u A s p T h r L e u G lu S e r A la lle G ly P h e H is M e tA r g A s p T r p T r p G ln P r o T h r L y s 8714 GCAAACTTCTTCGGACACCTGAAAAAGCCACAAATTATCGCAGCCCTGAATGAGGCCGGA ------------------- + -----------------------+ ------------------- + -------------------- + -------------------- + ---------------------+ 8 6 5 5 CGTTTGAAGAAGCCTGTGGACTTTTTCGGTGTTTAATAGCGTCGGGACTTACTCCGGCCT A la A s n P h e P h e G ly H is L e u L y s L y s P r o G ln lle lle A la A la L e u A s n G lu A la G ly 8654 CTGTCCGGTGCCGCACGGGACGCGGAGAAGATGAAGAAAGGTGATGCGGCTGAACATGCA -------------------- h--------------------- + --------------------- 1--------------------- + *■ 8 5 9 5 GACAGGCCACGGCGTGCCCTGCGCCTCTTCTACTTCTTTCCACTACGCCGACTTGTACGT L e u S e r G ly A la A la A r g A s p A la G lu L y s M e tL y s L y s G ly A s p A la A la G lu H is A la 8594 GAGCACCATATGAAAGACAACCGCTGGGTTCCAGGCTGGATGTGTGCTCCACATCCACAG --------------------+ --------------------- + -------------------- + ------------------- + ------------------ + + 85 35 CTCGTGGTATACTTTCTGTTGGCGACCCAAGGTCCGACCTACACACGAGGTGTAGGTGTC G lu H is H is M e t L y s A s p A s n A r g T r p V a lP r o G ly T r p M e t C y s A la P r o H is P r o G ln 8534 ACAGATGCCACTGAACGCACCGATAACCTGGCTGATGCCGcdrGAtTGA^.CAACCACACCG --------------------+ --------------------- + -------------------- + ------------------- + ------------------ + + 8475 TGTCTACGGTGACTTGCGTGGCTATTGGACCGACTACGGCGGACTACTTGTTGGTGTGGC T h r A s p A la T h r G lu A r g T h r A s p A s n L e u A la A s p A la A la J l r\jS p siB * CCCCGCCGGAGACGGGGCGGCAGCAAGGGAGATACCGTGkTG kAAACTGAACTGACCGTG 8474 ------------ +------------- +------------ +------------ +----------- +-------------- + 8415 GGGGCGGCCTCTGCCCCGCCGTCGTTCCCTCTATGGCACTACTTTTGACTTGACTGGGAC M e t L y s T h r G lu L e u T h r L e u PstI AATGCC^TGCAG|rCCATGAACGCACAGGAATATGAAGATATCCGTGCTGCGGGAAGCGAT 8414 ------------ +------------- +------------ +------------ +----------- +-------------- + 8355 TTACGGGACGTCAGGTACTTGCGTGTCCTTATACTTCTATAGGCACGACGCCCTTCGCTA A s n A la L e u G ln S e r M e tA s n A la G ln G lu T y r G lu A s p I le A r g A la A la G ly S e r A s p ATGCGCCGTAATCTCACTCACGAGGTGATGCGTGAAGTGGACGCACCGGCTAACTGGATG 8354 ---------------- +------------------ +-----------------+---------------- +--------------- +------------------- + 8295 TACGCGGCATTAGAGTGAGTGCTCCACTACGCACTTCACCTGCGTGGCCGATTGACCTAC M e t A r g A r g A s n L e u T h r H is G lu V a lM e tA r g G lu V a lA s p A la P r o A la A s n T r p M e t ATGAATGGCGAGTATGGCAGTGAGTTCGGGGGCTTTTTCCCCGTCCAGGTTCGTTTCACG 8294 i- + h----------------- 1- !--------------------- + 8235 TACTTACCGCTCATACCGTCACTCAAGCCCCCGAAAAAGGGGCAGGTCCAAGCAAAGTGC M e tA s n G ly G lu T y r G ly S e r G lu P h e G ly G ly P h e P h e P r o V a lG ln V a lA r g P h e T h r CCAGCCCACGAACGTTTCCACCTGGCATTATGTTCGCCGGGAGACGTCTCTCAGCTCTGG 8234 ---------------- +------------------ +-----------------+----------------+----------------+------------------ + 8175 GGTCGGGTGCTTGCAAAGGTGGACCGTAATACAAGCGGCCCTCTGCAGAGAGTCGAGACC P r o A la H is G lu A r g P h e H is L e u A la L e u C y s S e r P r o G ly A s p V a lS e r G ln L e u T r p ATGCTGGTTCTGGTGAATTGTGGTGGACAGCCTTTCGCCGTCGTTCAGGTGCAACATATC 8174 ---------------- +-------------------+-----------------+----------------+----------------+------------------ + 8115 TACGACCAAGACCACTTAACACCACCTGTCGGAAAGCGGCAGCAAGTCCACGTTGTATAG M e t L e u V a lL e u V a lA s n C y s G ly G ly G ln P r o P h e A la V a lV a lG ln V a lG ln H is I le TTCACGCCTGTCGCTATCAGTCACACGCTGGCGCTTGCCGCGACACTTGATGCGCAGGGG 8114 ---------------- +-------------------+-----------------+----------------+----------------+------------------ + 8055 AAGTGCGGACAGCGATAGTCAGTGTGCGACCGCGAACGGCGCTGTGAACTACGCGTCCCC P h e T h r P r o V a lA la lle S e r H is T h r L e u A la L e u A la A la T h r L e u A s p A la G ln G ly rbs pjsi£. TACAGTGTTAACGACATCATCCATATCCTGATGGCAGAAGGAGGTCAGGC12T' 8054 JCGCAC ■+ 7995 ATGTCACAATTGCTGTAGTAGGTATAGGACTACCGTCTTCCTCCAGTCCGTACTCGCGTG T y r S e r V a lA s n A s p I le lle H is I le L e u M e t A la G lu G ly G ly G ln A la * M e tS e r A la A r g 7994 GTTCACGGGCACTGATCCCCCTCAGCGCAGAGCAACAGGCCGCCATGCAGGCTGTGGCTG ------------------- + --------------------- + -------------------- + -------------------- + -------------------- + ---------------------+ 7 9 3 5 CAAGTGCCCGTGACTAGGGGGAGTCGCGTCTCGTTGTCCGGCGGTACGTCCGACACCGAC S e r A r g A la L e u I le P r o L e u S e r A la G lu G ln G ln A la A la M e tG ln A la V a lA la V a l 7934 TCACAGAACAACGTCGTCGTCAGGGACGCACACTTTCAGCATGGCCTTATGCCAGCGCTT --------------------+ --------------------- + -------------------- + -------------------- + -------------------- + ---------------------+ 7 8 7 5 AGTGTCTTGTTGCAGCAGCAGTCCCTGCGTGTGAAAGTCGTACCGGAATACGGTCGCGAA T h r G lu G ln A r g A r g A r g G ln G lu A r g T h r L e u S e r A la T r p P r o T y r A la S e r A la P h e 7874 TCTTTCGCTGCCTGAATGGCAGCCGCCGGATTTCGCTGACCGATCTCCGCTTTTTTGCTC ------------------- + --------------------- + -------------------- + -------------------- + -------------------- + ---------------------+ 7 8 1 5 AGAAAGCGACGGACTTACCGTCGGCGGCCTAAAGCGACTGGCTAGAGGCGAAAAAACGAG P h e A r g C y s L e u A s n G ly S e r A r g A r g lle S e r L e u T h r A s p L e u A r g P h e P h e A la P r o CTGCGCTGACGAAGGAGGAATTTCATGGCAACCGTCTCCTGTGGCTGGCTGCCGTGGATA + 7814 + + + + + 7755 GACGCGACTGCTTCCTCCTTAAAGTACCGTTGGCAGAGGACACCGACCGACGGCACCTAT A la L e u T h r L y s G lu G lu P h e H is G ly A s n A r g L e u L e u T r p L e u A la A la V a lA s p L y AACTGATTGAAAGTTTTGGTGAAGTCTGTGTTCTTCCCCTGCCATCCGATGCTGGGCATC +----------------+------------- +---------------- +-------------- +-------------- + 7695 7754 TTGACTAACTTTCAAAACCACTTCAGACACAAGAAGGGGACGGTAGGCTACGACCCGTAG L e u I le G lu S e r P h e G lu G lu V a lC y s V a lL e u P r o L e u P r o S e r A s p A la G ly H is A r g GTCTGTTCCCGTCCGTTCCTTTTCGTGAAGGTGAGCGGCGTCGTCAGAAAACCACGCTGA +---------------- +--------------+---------------- +-------------- +-------------- + 7635 7694 CAGACAAGGGCAGGCAAGGAAAAGCACTTCCACTCGCCGCAGCAGTCTTTTGGTGCGACT L e u P h e P r o S e r V a lP r o P h e A r g G lu G ly G lu A r g A r g A r g G ln L y s T h r T h r L e u T h r CAGAGCAGAAATACAGCCGCCAGCGGGAACGTGAGGCAGAACGACGGGAACTGGAATACC +--------------- +-------------- +----------------+---------------+------------- + 7575 7634 GTCTCGTCTTTATGTCGGCGGTCGCCCTTGCACTCCGTCTTGCTGCCCTTGACCTTATGG G lu G l n L y s T y r S e r A r g G l n A r g G l u A r g G l u A l a G l u A r g A r g G l u L e u G l u T y r G l n AGACATGTTTTGCTCAGGCGCAGATTGACCTTGCGTTTCATACTCCCGCCACGGTCGGAA +----------------+-------------- +----------------+---------------+------------- + 7515 7574 TCTGTACAAAACGAGTCCGCGTCTAACTGGAACGCAAAGTATGAGGGCGGTGCCAGCCTT T h r C y s P h e A la G ln A la G ln lle A s p L e u A la P h e H is T h r P r o A la T h r V a lG ly S e r GCTGGTTGTCCCGCTGGTCTGGTGTTGTTGAGGAGCATGATCTGGAAACGATTTTCTGGG 7514 ---------------- +------------------ +----------------+------------------+----------------+----------------- + 7455 CGACCAACAGGGCGACCAGACCACAACAACTCCTCGTACTAGACCTTTGCTAAAAGACCC T r p L e u S e r A r g T r p S e r G ly V a lV a lG lu G lu H is A s p L e u G lu T h r lle P h e T r p G ly 7454 GGTGGTGCGGGCGTTTTCCATCACTGTCATCATTTGACCGGTTTTTCTGGCAGGAGGAAC ---------------- +------------------- h----------------+----------------- +----------------+------------------h7395 CCACCACGCCCGCAAAAGGTAGTGACAGTAGTAAACTGGCCAAAAAGACCGTCCTCCTTG T r p C y s G ly A r g P h e P r o S e r L e u S e r S e r P h e A s p A r g P h e P h e T r p G ln G lu G lu P r o CACTCTGGCGGCTGATTTTTGAAGCCGGTGAGGCCGGTCGTGGTGCACCGGTACAGATAC 7394 ---------------- +------------------ +----------------+------------------+----------------+----------------- + 7 3 3 4 GTGAGACCGCCGACTAAAAACTTCGGCCACTCCGGCCAGCACCACGTGGCCATGTCTATG L e u T r p A r g L e u I le P h e G lu A la G ly G lu A la G ly A r g G ly A la P r o V a lG ln lle A r g GTGCACTTGAGCAGTGGATGATCCCGAACAAGCTGGAGAACGCAATAfTGAfTGA^ATCAGC 7334 ---------------------+ ------------------------ + -------------------- + -----------------------+ -------------------- + ---------------------- + 7 2 7 5 CACGTGAACTCGTCACCTACTAGGGCTTGTTCGACCTCTTGCGTTATACTACTTTAGTCG A la L e u G lu G ln T r p M e tlie P r o A s n L y s L e u G lu A s n A la lle * * GAAAAGCCGCCGAGAACCCTCGCCCAATGCGGGTGGGGAGGAAAGGCGGAAAAGCCCGAA 7274 --------------------- + -----------------------* --------------------- + ----------------------+ ----------------------+ ---------------------- + 7215 CTTTTCGGCGGCTCTTGGGAGCGGGTTACGCCCACCCCTCCTTTCCGCCTTTTCGGGCTT direct repeat rJys orf 5 GGGCTTTAAAAATATTATTACGGTGTTAGCATAAAAACCTGTACCAATGAGCGGTGACT|A 7214 ---------------- +------------------+-----------------+-----------------+-----------------+----------------- + 7 1 5 5 CCCGAAATTTTTATAATAATGCCACAATCGTATTTTTGGACATGGTTACTCGCCACTGAT ^ M et TGjTTAAGAGCAACAAAAATACGCATCTATCCTACTCCTGAACAGGCTGCATTTCTCAACG 7154 ---------------- +------------------+-----------------+-----------------+-----------------+----------------- + 7 0 9 5 ACAATTCTCGTTGTTTTTATGCGTAGATAGGATGAGGACTTGTCCGACGTAAAGAGTTGC L e u A r g A la T h r L y s I le A r g lle T y r P r o T h r P r o G lu G ln A la A la P h e L e u A s n A la CCCAGTTCGGTGCGGTACGTTTTGCGTACAACAAAGCTCTTCACATCAAAAAACACGCTT 7094 ---------------- +------------------+-----------------+-----------------+-----------------+----------------- + 7 0 3 5 GGGTCAAGCCACGCCATGCAAAACGCATGTTGTTTCGAGAAGTGTAGTTTTTTGTGCGAA G ln P h e G ly A la V a lA r g P h e A la T y r A s n L y s A la L e u H is I le L y s L y s H is A la T y r ACCAGAGACACGGAGTAAATTTAAGTCCGCGTAACGATCTGAAACCGTTGTTGTCTGTGG 7034 ---------------- +------------------+-----------------+-----------------+---------------- + + 6975 TGGTCTCTGTGCCTCATTTAAATTCAGGCGCATTGCTAGACTTTGGCAACAACAGACACC G ln A r g H is G ly V a lA s n L e u S e r P r o A r g A s n A s p L e u L y s P r o L e u L e u S e r V a lA la CCAAAAAGTCCCGTAAATATGCCTGGCTCAAAGAATTTGACTCAATGGCATTGCAACAGG 6974 ---------------- +------------------+-----------------+-----------------+---------------- + + 6915 GGTTTTTCAGGGCATTTATACGGACCGAGTTTCTTAAACTGAGTTACCGTAACGTTGTCC Ly s L y s S e r A r g L y s T y r A la T r p L e u L y s G lu P h e A s p S e r M e tA la L e u G ln G ln A la CGGTAATCAATATTGATGTGGCATTTTCCAACTTTTTTAATCCGAAGCTGAAAGCGCGGT 6914 ---------------- +-------------------+----------------+-----------------+----------------+------------------ + 6855 GCCATTAGTTATAACTACACCGTAAAAGGTTGAAAAAATTAGGCTTCGACTTTCGCGCCA V a llle A s n lle A s p V a lA la P h e S e r A s n P h e P h e A s n P r o L y s L e u L y s A la A r g P h e TCCCGACGTTTAAACGCAAACACGGAAAACAATCGAGCTATCACTGTGTCGGGATTAAGG 6854 ---------------- +------------------ +----------------+---------------- +----------------+------------------ + 6795 AGGGCTGCAAATTTGCGTTTGTGCCTTTTGTTAGCTCGATAGTGACACAGCCCTAATTCC P r p T h r P h e L y s A r g L y s H is G ly L y s G ln S e r S e r T y r H is C y s V a lG ly lle L y s V a l TGCTGGATAATGCCATCAAAATCCCCAAACTGGCCCCGATAGAAGCACGCCTTCATCGTG 6794 ---------------- +------------------ +----------------+-----------------+----------------+------------------ + 6735 ACGACCTATTACGGTAGTTTTAGGGGTTTGACCGGGGCTATCTTCGTGCGGAAGTAGCAC L e u A s p A s n A la lle L y s I le P r o L y s L e u A la P r o I le G lu A la A r g L e u H is A r g G lu AACTTCATGGAAAGCTGAAAAGCATCACTGTCACCCGTTCTGCAACGGGGAAATACTACG 6734 ---------------- +------------------ +----------------+-----------------+----------------+------------------ + 6675 TTGAAGTACCTTTCGACTTTTCGTAGTGACAGTGGGCAAGACGTTGCCCCTTTATGATGC L e u H is G ly L y s L e u L y s S e r lle T h r V a lT h r A r g S e r A la T h r G ly L y s T y r T y r A la CTTCCATTCTTTGTGACGATGGCGCTTCCGCACCAGAGAAACCCACGTACCTGAAGGAAG 6674 ----------------- h------------------ 1------------------- 1----------------- 1-----------------h + 6615 GAAGGTAAGAAACACTGCTACCGCGAAGGCGTGGTCTCTTTGGGTGCATGGACTTCCTTC S e r lle L e u C y s A s p A s p G ly A la S e r A la P r o G lu L y s P r o T h r T y r L e u L y s G lu G lu AAAAGATTACAGGTCTGGATATGGGACTGGAGCACTATGCCATCACTTCTGATGCTGTAA 6614 ---------------- +------------------+-----------------+-----------------+--------------- + + 6555 TTTTCTAATGTCCAGACCTATACCCTGACCTCGTGATACGGTAGTGAAGACTACGACATT L y s I le T h r G ly L e u A s p M e tG ly L e u G lu H is T y r A la lle T h r S e r A s p A la V a lL y s AAGTTCCGAATCCTCGCCATCTTATCAATGTAAGTCGTAATCTGCGACGCAAACAAAAAG 6554 ---------------- +------------------+-----------------+-----------------+--------------- + + 6495 TTCAAGGCTTAGGAGCGGTAGAATAGTTACATTCAGCATTAGACGCTGCGTTTGTTTTTC V a lP r o A s n P r o A r g H is L e u I le A s n V a lS e r A r g A s n L e u A r g A r g L y s G ln L y s A la CATTATCCCGTAAGCAAAAGGGAAGCGCGAATCGTAAAAAGGCCAGAATCCGTCTTGCGG 6 4 9 4 ----------------- +-----------------+-----------------+-----------------+--------------- + + 6435 GTAATAGGGCATTCGTTTTCCCTTCGCGCTTAGCATTTTTCCGGTCTTAGGCAGAACGCC L e u S e r A r g L y s G ln L y s G ly S e r A ia A s n A r g L y s L y s A la A r g lle A r g L e u A la A la CCTTACACGAACGGGTGGCGAATGCCCGTGCTGATTTTCAACACAAGCTCTCTTGCGCGA 6434 ---------------- +------------------+-----------------+-----------------+----------------+ + 6375 GGAATGTGCTTGCCCACCGCTTACGGGCACGACTAAAAGTTGTGTTCGAGAGAACGCGCT L e u H is G lu A r g V a lA la A s n A la A r g A la A s p P h e G ln H is L e u S e r C y s A la lle Sail TTpTCGACpAAAACCAAGCGGTAATTGTGGAGACGCTGAAAACAGCCAACATGATGAAAA 6374 ---------- +------------------+-----------------+-----------------+----------------+ + 6315 AACAGCTGCTTTTGGTTCGCCATTAACACCTCTGCGACTTTTGTCGGTTGTACTACTTTT V a lA s p G lu A s n G ln A la V a llle V a lG lu T h r L e u L y s T h r A la A s n M e tM e tL y s A s n ACCACAATCTGGCAAGAGCGATAGGTGATGCTGGCTGGCACAGCTTCATCACAAAGCTGG 6314 ---------------- +------------------+-----------------+-----------------+----------------+ + 6255 TGGTGTTAGACCGTTCTCGCTATCCACTACGACCGACCGTGTCGAAGTAGTGTTTCGACC H is A s n L e u A la A r g A la lle G ly A s p A la G ly T r p H is S e r P h e lle T h r L y s L e u G lu AGTACAAGGCAGCGACAAAAGGCGTTCACCTGGTAAAACTCGACCAGTGGTTTGCCAGTT +---------------- +---------------+------------- +---------------+--------------- + 6195 6254 TCATGTTCCGTCGCTGTTTTCCGCAAGTGGACCATTTTGAGCTGGTCACCAAACGGTCAA T y r L y s A la A la T h r L y s G ly V a lH is L e u V a lL y s L e u A s p G ln T r p P h e A la S e r S e r CGAAAACGTGCCATTGTTGTGGTTACAAAATGCCGGAAATGCCACTTCATAAACGCATCT +----------------+---------------+------------- +---------------+--------------- + 6135 6194 GCTTTTGCACGGTAACAACACCAATGTTTTACGGCCTTTACGGTGAAGTATTTGCGTAGA L y s T h r C y s H is C y s C y s G ly T y r L y s M e tP r o G lu M e tP r o L e u H is L y s A r g lle T r p GGCGATGCCCTGAGTGTGGAGCAGAGCACGACCGCGATATCAATGCGGCACTCAACATCC +----------------+---------------+------------- +---------------+--------------- + 6Q75 6134 CCGCTACGGGACTCACACCTCGTCTCGTGCTGGCGCTATAGTTACGCCGTGAGTTGTAGG A r g C y s P r o G lu C y s G ly A la G lu H is A s p A r g A s p I le A s n A la A la L e u A s n l le A r g GGCAAAAAGGAATACTGGAACTAAAGGCGGCGGGACTCGTCGTCTCTGCCCATGGAGGCC -i----------------|----------------1---------------+ 6074 |--------------- 1. 6015 CCGTTTTTCCTTATGACCTTGATTTCCGCCGCCCTGAGCAGCAGAGACGGGTACCTCCGG G ln L y s G ly lle L e u G lu L e u L y s A la A la G ly L e u V a lV a lS e r A la H is G ly G ly G ln AGCGTAAATCCGTCGCACAGACGGTTGCGGCCrrGAlGAAGTGGGAAGCCTCGCCCGACAGG +--------------- +----------------- + 6014 5955 TCGCATTTAGGCAGCGTGTCTGCCAACGCCGGACTCTTCACCCTTCGGAGCGGGCTGTCC A r g L y s S e r V a lA la G ln T h r V a lA la A la * direct repeat GCGGGGAGCAGTCACGGGTGGTCTGGAGGTCATGCGGCGTGTCCTCTGCACTCGCCGGAA 5954 ---------------- +----------------- +---------------- +-----------------+---------------- + + 5895 CGCCCCTCGTCAGTGCCCACCAGACCTCCAGTACGCCGCACAGGAGACGTGAGCGGCCTT TAAGGAAGTCGCCGGCGGCTCCGCTTTTACCCGGCCATGCGGGGCATGGCCTTGTGGGTT 5894 ---------------- +----------------- +---------------- +-----------------+---------------- + + 5835 ATTCCTTCAGCGGCCGCCGAGGCGAAAATGGGCCGGTACGCCCCGTACCGGAACACCCAA TTCAGCTCTGTGGCCTCAGCGTCGTGTGCGGGCTGTGCCGTGCCTCCATCTTAGCCGGGC 5834 ---------------- +----------------- +-----------------+-----------------+----------------+ + 5775 AAGTCGAGACACCGGAGTCGCAGCACACGCCCGACACGGCACGGAGGTAGAATCGGCCCG TGGCAGGGATGCAAGGGTACGCTTCGCCGCTGCGGTCACCCGGTCCCTCCTTCCCGTCTG 5774 --------------------------- + ---------------------------- + --------------------------- + ----------------------------+ -------------------------- + + 5715 ACCGTCCCTACGTTCCCATGCGAAGCGGCGACGCCAGTGGGCCAGGGAGGAAGGGCAGAC CCGTGATTTTCCGGTCGTTCACCCGCTCAGATTTCGCGGTCTCACCCTCCAACTCCCGC 5714 -------------- +--------------- +---------------+---------------+-------------- + + 5555 GGCACTAAAAGGCCAGCAAGTGGGCGAGTCTAAAGCGCCAGAGTGGGAGGTTGAGGGCG CAGCCGTGTGCGTCAGGCTGCGGCTTCCCTTGCGCCCTGTGCATCCCCGCCTTATCGCCC 5654 ---------------- +------------------+-----------------+-----------------+-----------------+---------------- + 5595 GTCGGCACACGCAGTCCGACGCCGAAGGGAACGCGGGACACGTAGGGGCGGAATAGCGGG rbs GGCTTTTATGGAGGCACGGCACCGCCCGGTGCCGCAGAACATG?GACTATGGAG£LATTCG 5594 ---------------- +------------------+-----------------+-----------------+-----------------+---------------- + 5535 CCGAAAATACCTCCGTGCCGTGGCGGGCCACGGCGTCTTGTACACTGATACCTCCTAAGC ardA GG^ATGjrCTGTTGTTGCACCTGCTGTATACGTTGGAACCTGGCACAAATACAACTGTGGA 5534 _ +---------------- +-----------------+-----------------+---------------- +----------------- + 5475 CCTTACAGACAACAACGTGGACGACATATGCAACCTTGGACCGTGTTTATGTTGACACCT M e tS e r V a lV a lA la P r o A la V a lT y r V a lG ly T h r T r p H is L y s T y r A s n C y s G ly AGCATCGCCGAACGCTGGTTTGACCTGACCACGTTTGATGATGAGCGCGACTTTTTCGCC -------------- +--------------- +---------------+--------------- +-------------- +--------------- + 5415 TCGTAGCGGCTTGCGACCAAACTGGACTGGTGCAAACTACTACTCGCGCTGAAAAAGCGG S e r lle A la G lu A r g T r p P h e A s p L e u T h r T h r P h e A s p A s p G lu A r g A s p P h e P h e A la GCCTGCCGTGCTCTTCACCAGGATGAAGCCGATCCTGAACTGATGTTTCAGGATTATGAG 5414 ---------------- +------------------ +----------------+-----------------+-----------------+ + 5355 CGGACGGCACGAGAAGTGGTCCTACTTCGGCTAGGACTTGACTACAAAGTCCTAATACTC A la C y s A r g A la L e u H is G ln A s p G lu A la A s p P r o G lu L e u M e tP h e G ln A s p T y r G lu 5354 GGATTCCCGGGGAATATGGCCTCTGAATGCCATATCAACTGGGCCTGGGTTGAAGGCTTC --------------------+ ---------------------- + ------------------- + -------------------- +-------------------- + + 5295 CCTAAGGGCCCCTTATACCGGAGACTTACGGTATAGTTGACCCGGACCCAACTTCCGAAG G ly P h e P r o G ly A s n M e t A la S e r G lu C y s H is I le A s n T r p A la T r p V a lG lu G ly P h e 5294 CGCCAGGCACGGGATGAAGGCTGCGAAGAGGCTTATCGTCTCTGGGTGGAGGATACCGGT ■“ h---------------------- 1--------------------------------------------+ -------------------- h--------------------- +■ 5 2 3 5 GCGGTCCGTGCCCTACTTCCGACGCTTCTCCGAATAGCAGAGACCCACCTCCTATGGCCA A r g G ln A la A r g A s p G lu G ly C y s G lu G lu A la T y r A r g L e u T y r V a lG lu A s p T h r G ly 5234 GAGACGGATTTTGACACCTTCCGCGATGCCTGGTGGGGCGAGGCTGACAGTGAGGAGGCT --------------------+ ---------------------- + ------------------- + -------------------- + -------------------- + ---------------------+ 5 1 7 5 CTCTGCCTAAAACTGTGGAAGGCGCTACGGACCACCCCGCTCCGACTGTCACTCCTCCGA G lu T h r A s p F P h e A s p T h r P h e A r g A s p A l a T r p T r p G l y G l u A l a A s p S e r G l u G l u A l a 5174 TTTGCGGTTGAGTTCGCCAGTGATACCGGCCTGCTGGCTGACGTGCCGGAGACGGTGGCG --------------------+ ---------------------- + ------------------- + -------------------- + -------------------- + ---------------------+ 5 1 1 5 AAACGCCAACTCAAGCGGTCACTATGGCCGGACGACCGACTGCACGGCCTCTGCCACCGC P h e A la V a lG lu P h e A la S e r A s p T h r G ly L e u L e u A la A s p V a lP r o G lu T h r V a lA la CTCTATTTTGACTATGAGGCGTATGCGCGGGATTTATTCCTGGACTCCTTCACCTTTATT 5114 ------------ +--------------+------------+------------ +------------ +-------------+5055 GAGATAAAACTGATACTCCGCATACGCGCCCTAAATAAGGACCTGAGGAAGTGGAAATAA L e u T y r P h e A s p T y r G lu A la T y r A la A r g A s p L e u P h e L e u A s p S e r P h e T h r P h e lle GACGGTCATGTGTTCCGTCGG|TGA|rTTACTCCCCCGCTCCGGCGGGGGTATGGCCTGCCA +---------------- +---------------- + 5054 + 4995 CTGCCAGTACACAAGGCAGCCACTAAATGAGGGGGCGAGGCCGCCCCCATACCGGACGGT A s p G ly H is V a lP h e A r g A r g * GGCCGCCGGAAAACCCTGGCAGGCTTACGCCTGCCTGAAATGCAGGAGGGCCGCGCCTGC 4994 ---------------- +----------------- +---------------- +-----------------+---------------- + + 4935 CCGGCGGCCTTTTGGGACCGTCCGAATGCGGACGGACTTTACGTCCTCCCGGCGCGGACG GGCGCTGCCCTTCCTGCCCGTCATGGTGTGAGGCATTTTTCTGTTATGTCGTTTCTGTTG 4934 ---------------+--------------- +---------------+---------------+-------------- + + 4875 CCGCGACGGGAAGGACGGGCAGTACCACACTCCGTAAAAAGACAATACAGCAAAGACAAC 4874 TTCTGATTTCCGGAGGCGGGGACACCTTCGGCTCCCGGCTGCCTGGCGGCATCCGGTCGT --------------------+ -------------------- + --------------------+ -------------------- + ------------------- + + 4815 AAGACTAAAGGCCTCCGCCCCTGTGGAAGCCGAGGGCCGACGGACCGCCGTAGGCCAGCA 4814 CAGGCCCGACTCCGACAGGCAGCTGCTGGCGCACCCGCCAGTCTCCGCCGAACCCGCTGG --------------------+ -------------------- + -------------------- + -------------------- + ------------------- + + 4755 GTCCGGGCTGAGGCTGTCCGTCGACGACCGCGTGGGCGGTCAGAGGCGGCTTGGGCGACC CGCGGTTTCGGGCTGCCGCTACGGTCCGATGTCTGAAAAACCAGGCTTGCAGCGACCGTT 4754 -------------- +--------------- +---------------+---------------+-------------- + + 4695 GCGCCAAAGCCCGACGGCGATGCCAGGCTACAGACTTTTTGGTCCGAACGTCGCTGGCAA CCGGTCGCGCCGCCCCTGTCGCGCCGGAAACGGCGC 4694 GGCCAGCGCGGCGGGGACAGCGCGGCCTTTGCCGCGAGGGACGTGTGGACTGCAACACAT 4 6 3 5 ssi2 4 6 3 4 TCCGGTGACGCGGACTACCGCGTCAGTTGGGAAGGCGGCGGTGGGGACGCCGGACAAAGG 4 5 7 5 4 5 7 4 GCGACACCTGAAAGGCCCCCGACCTCATTGGTGTAGGGCAAAGTCGTCAGGTCCCCTTCT 4 5 1 5 4 514 ATATGGCAGGTGTTTTAAAAATCGGTATTTGGGCCGTTTTTTAAAACGCAGGCGGGGGCC 4 4 55 4 4 5 4 TGTCGACGGGGCACTGCACCACTGCCTGCGGGGTAGCCTTTCCAGGTGACGCCCCTCTTT 43 9 5 GGGAGCAACCGGTCGGGCC -----------------+ + 4335 4 3 9 4 GGGGCTTCTTCACCTGCCGTCTTCCCCGAAAGCGGGGAGGCCCCTCGTTGGCCAGCCCGG rbs orf 4 ► c g g c g a c c t g a t t c a g c a g a a g g a g a t g t t t t c Ia t g Kc a g t c a g c a g c a c g a t t t c c g t t 4334 --------------------+ ---------------------+ --------------------- + ------------------+ --------------------- + ---------------------+ 4 2 7 5 GCCGCTGGACTAAGTCGTCTTCCTCTACAAAAGTACTGTCAGTCGTCGTGCTAAAGGCAA M e tT h r V a lS e r S e r T h r lle S e r V a l 4274 TTTTGCCGCGACGGGGTGTTCCGTACCGTTTACTGCCACCTGCACGGGGAGCCGACCTGG --------------------+ ---------------------+ --------------------- + ------------------+ --------------------- + ---------------------+ 4 2 1 5 AAAACGGCGCTGCCCCACAAGGCATGGCAAATGACGGTGGACGTGCCCCTCGGCTGGACC P h e C y s A r g A s p G ly V a lP h e A r g T h r V a lT y r C y s H is L e u H is G ly G lu P r o T h r T r p AACGGTCGCATCCTGCATACTCACTATGCCACCGGTCAGCAGGCCGAAGCCCTGGTTGAA 4 2 1 4 -------------------- + -------------------- + --------------------- + ------------------+ --------------------- + ---------------------+ 4 1 5 5 TTGCCAGCGTAGGACGTATGAGTGATACGGTGGCCAGTCGTCCGGCTTCGGGACCAACTT A s n G ly A r g lle L e u H is T h r H is T y r A la T h r G ly G ln G ln A la G lu A la L e u V a lG lu 4154 CACGGTGATATCCGTTGCCTCGGTCCCCGTTGTGACAAACCCGCCGGGCATACGCTTCAG --------------------+ --------------------+ ---------------------- + ------------------+ --------------------- + ---------------------+ 4 0 9 5 GTGCCACTATAGGCAACGGAGCCAGGGGCAACACTGTTTGGGCGGCCCGTATGCGAAGTC H is G ly A s p I le A r g C y s L e u G ly P r o A r g C y s A s p L y s P r o A la G ly H is T h r L e u G ln 4094 AACCCGGTGGACGGTGTGACGGCTTATTACGGACGTGACAGTGGTTTCCGGATGGACAGT --------------------+ --------------------+ ---------------------- + ------------------+ --------------------- + ---------------------+ 4 0 3 5 TTGGGCCACCTGCCACACTGCCGAATAATGCCTGCACTGTCACCAAAGGCCTACCTGTCA A s n P r o V a lA s p G ly V a lT h r A la T y r T y r G ly A r g A s p S e r G ly P h e A r g M e t A s p S e r 4034 GAGGCGCGTGAGTACCGTTCTTTCAGGGAGGCTATTGCCACTGAAAGCACTGAAGAGGTG --------------------+ --------------------+ ---------------------- + ------------------+ --------------------- + ---------------------+ 3 9 7 5 CTCCGCGCACTCATGGCAAGAAAGTCCCTCCGATAACGGTGACTTTCGTGACTTCTCCAC G lu A la A r g G lu T y r A r g S e r P h e A r g G lu A la lle A la T h r G lu S e r T h r G lu G lu V a l 3974 CGCTTCCATTATGTGTTCATCGACGGCTACTGGAAGGTGATGTACCGCACGCCGGAAGGC ------------------- + --------------------+ ---------------------- + ------------------+ --------------------- + ---------------------+ 3 9 1 5 GCGAAGGTAATACACAAGTAGCTGCCGATGACCTTCCACTACATGGCGTGCGGCCTTCCG A r g P h e H is T y r V a lP h e lle A s p G ly T y r T r p L y s V a lM e tT y r A r g T h r P r o G lu G ly 3914 TGGAAGATGAAAGCGCTCGCGCTGGCACTGCGTCGCTGTCCGAAAIGAAAAAAAGGCGGG ------------------- + ---------------------+ -------------------- + --------------------+ ---------- 1------ \--+ -------------------- + 3 8 5 5 ACCTTCTACTTTCGCGAGCGCGACCGTGACGCAGCGACAGGCTTTACTTTTTTTCCGCCC T r p L y s M e tL y s A la L e u A la L e u A la L e u A r g A r g C y s P r o L y s * 3854 GCGATAAACCCTGCCTCTCTCCGCCGTTACTTCCCCGCCAGGAAGAGCGGCAGTCATCAA --------------------+ --------------------+ ---------------------- + ------------------+ --------------------- + + 3795 CGCTATTTGGGACGGAGAGAGGCGGCAATGAAGGGGCGGTCCTTCTCGCCGTCAGTAGTT o r f 3 —► 3 7 9 4 CAACATAAGGAGTAATATTpTGAGACCATCAATTATCTTCGCAACCGCCGAGTATGTAAA -------------------- + -------------------W - --------------- + ------------------+ --------------------- + + 3735 GTTGTATTCCTCATTATAACACTCTGGTAGTTAATAGAAGCGTTGGCGGCTCATACATTT V a lA r g P r o S e r I le lle P h e A la T h r A la G lu T y r V a lL y s 3734 GCGTCTGCGTGAAGAGTGCCTGCGGGAGAATAAACCCCTGCACCGCCATACCCGCTTCAG --------------------+ --------------------- + -------------------- + -------------------- + -------------------- + -------------------+ 3675 CGCAGACGCACTTCTCACGGACGCCCTCTTATTTGGGGACGTGGCGGTATGGGCGAAGTC A r g L e u A r g G lu G l u C y s L e u A r g G lu A s n L y s P r o L e u H i s A r g H i s T h r A r g P h e A r g ACGTCAGGAGCTGGCACAGGATGAGATTAACCCGGACGTCCTGGCGATGAGCGGCCATAT 3674 +---------------+---------------+---------------+---------------+-------------- + 3615 TGCAGTCCTCGACCGTGTCCTACTCTAATTGGGCCTGCAGGACCGCTACTCGCCGGTATA A r g G lu G ln L e u A la G ln A s p G lu I le A s n P r o A s p V a lL e u A la M e tS e r G ly H is I le P stI cgccagacgE tg caS tgagcagaagcgggtgcgtatccccgctatgaaagtcagcgaatg 3614 -------------------- hr------- J ---------- + -------------------+ -------------------- + -------------------+ + 3555 GCGGTCTGCGACGTCACTCGTCTTCGCCCACGCATAGGGGCGATACTTTCAGTCGCTTAC A la A r g A r g C y s S e r G lu G ln L y s A r g V a lA r g lle P r o A la M e tL y s V a lS e r G lu T r p GGGCCACCTGCTCCGCGCGCTTGAAATAGAGCGGGTCTGCCAC|rGAtrTAACCCTTGCCGT 3554 --------------------+ ----------------------+ -------------------+ --------------------+ -------------------+ ----------------------+ 3 4 9 5 CCCGGTGGACGAGGCGCGCGAACTTTATCTCGCCCAGACGGTGACTAATTGGGAACGGCA G ly H is L e u L e u A r g A la L e u G lu I le G lu A r g V a lC y s H is * rbs orf 2 —► CTGCCTGTGCAGGCGGCAGGATACGTTCAGACATCTCACCAGGAGGAACCjGTGAGTAAGA 3494 + 3435 GACGGACACGTCCGCCGTCCTATGCAAGTCTGTAGAGTGGTCCTCCTTGGCACTCATTCT V a lS e r L y sL y s 3434 AGAAAAACACCACAACGCCCACGCCGCATGATGCCGCGTTCCGGTCGTTCCTGGCGAATC -------------------- h---------------------- H--------------------- H--------------------- H---------------------+ h3375 TCTTTTTGTGGTGTTGCGGGTGCGGCGTACTACGGCGCAAGGCCAGCAAGGACCGCTTAG L y s A s n T h r T h r T h r P r o T h r P r o H is A s p A la A la P h e A r g S e r P h e L e u A la A s n P r o 3374 CCGACGTCGCCAGAGATTTTCTGGAACTGCATCTTCCTGCGGAGTACCGGCAGTTGTGCG --------------------+ ---------------------+ -------------------- + -------------------- + -------------------- + ---------------------+ 3315 GGCTGCAGCGGTCTCTAAAAGACCTTGACGTAGAAGGACGCCTCATGGCCGTCAACACGC A s p V a lA la A r g A s p P h e L e u G lu L e u H is L e u P r o A la G lu T y r A r g G ln L e u C y s A s p 3314 ACCTGTCCACGCTGAAGCTGGAACCCGCCACCTTTGTTGAGCCGGACCTGCATCAGTACG --------------------+ --------------------- + -------------------- + -------------------- + -------------------- + ---------------------+ 3 2 5 5 TGGACAGGTGCGACTTCGACCTTGGGCGGTGGAAACAACTCGGCCTGGACGTAGTCATGC L e u S e r T h r L e u L y s L e u G lu P r o A la T h r P h e V a lG lu P r o A s p L e u H is G ln T y r A la CCAGCGATATCCTCTGGAGCGTGAAAACCACCGGGGGTGAAGATGGTTATGTTTATACGC 3254 ---------------------------+ -----------------------------+ --------------------------- + ----------------------------+ --------------------------- + ---------------------------- + 3195 GGTCGCTATAGGAGACCTCGCACTTTTGGTGGCCCCCACTTCTACCAATACAAATATGCG S e r A s p I le L e u T r p S e r V a lL y s T h r T h r G ly G ly G lu A s p G ly T y r V a lT y r T h r L e u TCATCGAGCACCAGAGCACCGAAAATCTGTACATGCCTTTTCGCATGTTACGCTACAGTG 3194 --------------------------- + ------------------------------ + -------------------------- + --------------------------- + --------------------------- + ---------------------------- + 3135 AGTAGCTCGTGGTCTCGTGGCTTTTAGACATGTACGGAAAAGCGTACAATGCGATGTCAC I le G lu H is G ln S e r T h r G lu A s n L e u L y s M e tP r o P h e A r g H e tL e u A r g T y r S e r V a l 3134 TGGCGGCGATGCAGAGACATCTGGAGCAGCACAAAACGTTGCCACTGGTCATTCCGGTGC --------------------+ ----------------------+ ------------------- + -------------------- + -------------------- + ---------------------+ 3 0 7 5 ACCGCCGCTACGTCTCTGTAGACCTCGTCGTGTTTTGCAACGGTGACCAGTAAGGCCACG A la A la M e t G ln A r g H is L e u G lu G ln H is L y s T h r L e u P r o L e u V a llle P r o V a lL e u 3074 TGTTCTATCACGGTGAGCGCAGCCCGTACCCGTACAGCATGAACTGGCTGGACTGTTTTG --------------------+ ----------------------+ ------------------- + -------------------- + -------------------- + ---------------------+ ACAAGATAGTGCCACTCGCGTCGGGCATGGGCATGTCGTACTTGACCGACCTGACAAAAC 3015 P h e T y r H is G ly G lu A r g S e r P r o T y r P r o T y r S e r M e tA s n T r p L e u A s p C y s P h e G lu 3014 AGAATCCGGCGCTTGCGGCTAAAATATACACAAAGCCGTTTCCGCTGGTTGATATCACTG --------------------+ ----------------------+ -------------------+ -------------------- + -------------------- + ---------------------+ 2 9 5 5 TCTTAGGCCGCGAACGCCGATTTTATATGTGTTTCGGCAAAGGCGACCAACTATAGTGAC A s n P r o A la L e u A la A la L y s I le T y r T h r ly s P r o P h e P r o L e u V a lA s p I le T h r V a l 2954 TCGTTGATGACAATGAAATCATGAACCATCGCCGGATGGCCGCACTGACGCTGCTGATGA -------------------- + ---------------------+ -------------------- + --------------------+ --------------------+ + 2895 AGCAACTACTGTTACTTTAGTACTTGGTAGCGGCCTACCGGCGTGACTGCGACGACTACT V a lA s p A s p A s n G lu I le M e tA s n H is A r g A r g M e tA la A la L e u T h r L e u L e u M e tL y s P stI 2894 AGCATATCCGTCAGCGGGATATGCTGATGTGTCTGGACAACCTGGTCAGAGCACTGCAGp -------------------- + ---------------------+ -------------------- + --------------------- + ------------------+ y =*+ 2 8 3 5 TCGTATAGGCAGTCGCCCTATACGACTACACAGACCTGTTGGACCAGTCTCGTGACGTCC H is I l e A r g G l n A r g A s p M e t L e u M e t C y s L e u A s p A s n L e u V a l A r g A l a L e u G l n A s p 2834 ACATCCAGGATGAAGAGCAGATTACAGTTCTTTTCAACTATCTGCTTAATGGCAGCGAAC -------------------- + ---------------------+ -------------------- + --------------------- + ------------------+ ---------------------- + 2 7 7 5 TGTAGGTCCTACTTCTCGTCTAATGTCAAGAAAAGTTGATAGACGAATTACCGTCGCTTG I le G ln A s p G lu G lu G ln lle T h r V a lL e u P h e A s n T y r L e u L e u A s n G ly S e r G lu H is 2774 ATGTAACGGTTGAATTTTTGCAGACGCTGGCGCAGCGACTGCCGCAGCACGAGGACAGCA -------------------- + ---------------------+ -------------------- + --------------------- + ------------------+ ---------------------- + 2 7 1 5 TACATTGCCAACTTAAAAACGTCTGCGACCGCGTCGCTGACGGCGTCGTGCTCCTGTCGT V a lT h r V a lG lu P h e L e u G ln T h r L e u A la G ln A r g L e u P r o G ln H i s G l n A s p S e r l l e 2714 TTATGACACTGGCAGAACGTCTTAAGCAGGAAGGTATCCAGCAAGGTATCCAGCAGGGTA -------------------- + --------------------- + -------------------- + --------------------- + ------------------+ ---------------------- + 2 6 5 5 AATACTGTGACCGTCTTGCAGAATTCGTCCTTCCATAGGTCGTTCCATAGGTCGTCCCAT M e t T h r L e u A la G l u A r g L e u L y s G l n G lu G l y l l e G l n G l n G l y l l e G l n G l n G l y l i e 2654 TCCAGCAGGGTATCCAGCAGGGAGTTCAGCAGGGGGCGCTTCAAAAAGCACGGGAAATCG -------------------- + ---------------------+ -------------------- + --------------------- + ------------------+ ---------------------- + 2 5 9 5 AGGTCGTCCCATAGGTCGTCCCTCAAGTCGTCCCCCGCGAAGTTTTTCGTGCCCTTTAGC G ln G ln G ly I le G ln G ln G ly V a lG ln G ln G ly A la L e u G ln L y s A la A r g G lu I le A la 2594 CCCGGGAGCTCCGGAACGCGGGAATGCCGGCAGCACAAATCTGTCAGCTGACCGGACTTT -------------------- + ---------------------+ -------------------- + --------------------- + ------------------+ ----------------------+ 2 5 3 5 GGGCCCTCGAGGCCTTGCGCCCTTACGGCCGTCGTGTTTAGACAGTCGACTGGCCTGAAA A r g G lu L e u A r g A s n A la G ly M e t P r o A la A la G ln lle C y s G ln L e u T h r G ly L e u S e r 2534 C TGAAGC TGAACTGAAG AACATC AC TC A cflG A rGA GAGCCAAAAGGCCCGGCAAAATGCC -------------------- + -------------------- + -------------------- + --------------------- + ------------------+ + 2475 GACTTCGACTTGACTTCTTGTAGTGAGTGACTACTCTCGGTTTTCCGGGCCGTTTTACGG G lu A la G lu L e u L y s A s n lle T h r H is * 2474 * GGGCCTTTTGCCAGCAGGAGCAGAGAAAACAGTATGACGATTGCAGAACGTCTTATACAG -------------------- + -------------------- + -------------------- + --------------------- + ------------------+ ---------------------- + 2 4 1 5 CCCGGAAAACGGTCGTCCTCGTCTCTTTTGTCATACTGCTAACGTCTTGCAGAATATGTC AAAGGTTTTGAAAAAGGTGCTCTTGAAGTGGCGCGGGAAATAGCCTGCCGGCTTCGGGAT 2 4 1 4 --------------------- + -------------------- h---------------------+ ---------------------+ ------------------+ + 2355 TTTCCAAAACTTTTTCCACGAGAACTTCACCGCGCCCTTTATCGGACGGCCGAAGCCCTA 2354 ATGGGCTGGACGCCGGAACGGATTCAGGAGGCCACGGGACTTTCCGGTGAAGAACTGAAA --------------------+ --------------------- + -------------------- + ---------------------+ ------------------+ + 2295 TACCCGACCTGCGGCCTTGCCTAAGTCCTCCGGTGCCCTGAAAGGCCACTTCTTGACTTT 2294 AAACTGTTTCCTGATGAGCAGTAGCCTGGCATTCTGCCAGGCTGTGAACCACGCTCACAC --------------------+ --------------------- + -------------------- + ---------------------+ ------------------+ + 2235 TTTGACAAAGGACTACTCGTCATCGGACCGTAAGACGGTCCGACACTTGGTGCGAGTGTG 2234 ATCCTCTGCCGTTTCCCCCAGTTCATTTTCCAGACGCCTGATGATATCTGCCTTGCGCAT --------------------+ --------------------- + -------------------- + ---------------------+ ------------------+ + 2175 TAGGAGACGGCAAAGGGGGTCAAGTAAAAGGTCTGCGGACTACTATAGACGGAACGCGTA CCCCTGCATGTGAGGCAGCAGGTCACGGACTGAAAGTGCCTGGGTGGTCCTCCAGACGGT 2 1 7 4 --------------------- + --------------------+ -------------------- + ---------------------+ ------------------+ + 2115 GGGGACGTACACTCCGTCGTCCAGTGCCTGACTTTCACGGACCCACCAGGAGGTCTGCCA 2114 GACATCCACCTGTTCCGCATTTTCCCAGGCGTGCCATGTTCTCTGTGTCACGCCAAATTC -------------------- + -------------------- + -------------------- + --------------------+ -------------------- + + 2055 CTGTAGGTGGACAAGGCGTAAAAGGGTCCGCACGGTACAAGAGACACAGTGCGGTTTAAG 2054 TCTGGCCGCTTTTTCGCGTGACCAGAGCATGCTTTTTCGCCAGATTCGCAGTTCCCAGCC -------------------- + -------------------- + -------------------- + --------------------+ --------------------+ + 1995 AGACCGGCGAAAAAGCGCACTGGTCTCGTACGAAAAAGCGGTCTAAGCGTCAAGGGTCGG GGTCATAAAACACCTTTGAAAAAGTGAAAAAACTTCACTTTATTTTATCACCATAACCAC 1 9 9 4 ---------------+---------------+---------------+---------------+---------------+ + ig35 CCAGTATTTTGTGGAAACTTTTTCACTTTTTTGAAGTGAAATAAAATAGTGGTATTGGTG 1934 CAAATCAGCGCGAAAAATAAGTGTGAAAAAGTGAAGGAATTTCACTTTTTTTGATTGTAG -------------------- + -------------------- + --------------------+ --------------------+ --------------------+ + 1875 GTTTAGTCGCGCTTTTTATTCACACTTTTTCACTTCCTTAAAGTGAAAAAAACTAACATC 1874 GTGTCGTGCTGTTTTCTCTGTTTTCACCTGTGCGGGAGTGGTGTGCCAGTGACCACCACA -------------------- + -------------------- + -------------------- + --------------------+ --------------------+ ---------------------+ 1 8 1 5 CACAGCACGACAAAAGAGACAAAAGTGGACACGCCCTCACCACACGGTCACTGGTGGTGT TGCCATTGTTGCGCCGTCGTTTTTCTTC S S il 1 8 1 4 -------------------- + -------------------- + -----------------ACGGTAACAACGCGGCAGCAAAAAGAAGTCAGCGCGGCCTACGCCGCGAGGGACGTGTGG 1 7 5 5 1 7 5 4 AATGCCACACATTCCGGTGACGCGGACTACCGCGTCGACAGAGACGGACAGACGGCGGCG 1 6 95 16 94 TGACGCCGGACAAGAGGCGTCTACCTGAAAGGCCCCCATGTATGTGCTGGTGTAGGGCAA 1 6 3 5 1 6 3 4 AGTCAGCAGGTCCCCTCTCTACATGGTCGGTGTTTTGAAAAACGGCGTCCGCAGTTTTTG 1 5 7 5 1574 AAAACGCAGACGGGGGCCTGTTGACGGGGCACTGCACCACTCGCAGGGGGCAGCCTTTCC 1 5 1 5 1514 GGAA + 1455 AGGTGACGCCCCTTTAGGGGCTTCTGGGCCTGCCGTCTTCCTCGAAGCGAGGAGGCCCTT rbs a t g a c c t g g c t g a c c c g g c a g t c g t g a c g a t g a g g a g a c a a t g t g Ia t g Ra t c a g a c t t t a 1454 --------------------+ --------------------- + -------------------- + -------------------- + -------------------- + ------------------- + 1 3 9 5 TACTGGACCGACTGGGCCGTCAGCACTGCTACTCCTCTGTTACACTACTTAGTCTGAAAT M e t A s n G ln T h r L e u 1394 CCCACTGCTGACCTGAATACTGCCGGCACGACAGATGTTATTCCGTCTGTGGCCATCGAC --------------------+ --------------------- + -------------------- + -------------------- + -------------------- + ------------------- + 1 3 3 5 GGGTGACGACTGGACTTATGACGGCCGTGCTGTCTACAATAAGGCAGACACCGGTAGCTG P r o T h r A la A s p L e u A s n T h r A ia G ly T h r T h r A s n V a llle P r o S e r V a lA la lle A s n 1334 CGCATCATCGCGCAGCGTAACGAAGGTATTGCACTGTTCATGCAGGCGATGGAATGCCTG --------------------+ --------------------- + -------------------- + -------------------- + -------------------- + ------------------- + 1 2 7 5 GCGTAGTAGCGCGTCGCATTGCTTCCATAACGTGACAAGTACGTCCGCTACCTTACGGAC A r g lle lle A la G ln A r g A s n G lu G ly lle A la L e u P h e M e tG ln A la M e tG lu C y s L e u 1274 GCGACAGCGCGCAGGATTCTGCTTGATGCGTCAGGTGATATTTTTCTTTACGGTTTTGAA --------------------+ --------------------- + -------------------- + -------------------- + -------------------- + ------------------- + 1 2 1 5 CGCTGTCGCGCGTCCTAAGACGAACTACGCAGTCCACTATAAAAAGAAATGCCAAAACTT AlaThr A l a A r g A r g l l e L e u L e u A s p AlaSe rGlyAspIlePheLeuTyrGlyPheGlu GACTGCGTGACCGACTCCGTTCGGTGCATGGATAAACCGGAAGAAGCGAAAAGGAATATC 1 2 1 4 --------------- H -------------------- 1------------------+------------------H ----------------- h----------------+ 1155 CTGACGCACTGGCTGAGGCAAGCCACGTACCTATTTGGCCTTCTTCGCTTTTCCTTATAG A s p C y s V a lT h r A s p S e r V a lA r g C y s M e tA s p L y s P r o G lu G lu A la L y s A r g A s n lle ACCCGTCTTGCCGACCGGAAAATCTGGGACCGCCTGATGACGGATACGGGGATGTACACC 1 1 5 4 --------------- +------------------ +-----------------+-----------------+-----------------+----------------+ 1095 TGGGCAGAACGGCTGGCCTTTTAGACCCTGGCGGACTACTGCCTATGCCCCTACATGTGG T h r A r g L e u A l a A s p A r g L y s I le T r p A s p A r g L e u M e t T h r A s p T h r G l y M e t T y r T h r TTCATGAGTTCATGTCAGCGTGATGAGTGGAACAGCCAGCTGATGAGCGACACCTGTCCT 1094 ---------------- +------------------+-----------------+-----------------+-----------------+----------------+ 1035 AAGTACTCAAGTACAGTCGCACTACTCACCTTGTCGGTCGACTACTCGCTGTGGACAGGA P h e M e t S e r S e r C y s G l n A r g A s p G lu T r p A s n S e r G l n L e u M e t S e r A s p T h r C v s P r o GAAATCACCCTGGACAATGTACTGGCAACTTTTCGCCATCTGAATGCCAGTAAGATGCAG 1034 ---------------- h------------------- +-----------------+-----------------+-----------------+----------------+ 975 CTTTAGTGGGACCTGTTACATGACCGTTGAAAAGCGGTAGACTTACGGTCATTCTACGTC G lu I le T h r L e u A s p A s n V a lL e u A la T h r P h e A r g H is L e u A s n A la S e r L y s M e tG ln ACATTTGAACAGGGACTGATTGATGTCTACCGGAAATTGTCATGGGATTACAGAACCAAT 974 ----------------- h------------------- h------------------ h----------------+---------------- +----------------+ 915 TGTAAACTTGTCCCTGACTAACTACAGATGGCCTTTAACAGTACCCTAATGTCTTGGTTA T h r P h e G lu G ln G ly M e t lle A s p V a lT y r A r g L y s L e u S e r T r p A s p T y r A r g T h r A s n AATCCCTGCCGTCTGGGCAAGAGAATCATTATTGAAAACCTGCTGTACCGCTGGAGTAAC 9 ! 4 --------------------- + -------------------- + --------------------- + -------------------+ -------------------- + ------------------- + 855 TTAGGGACGGCAGACCCGTTCTCTTAGTAATAACTTTTGGACGACATGGCGACCTCATTG A s n P r o C y s A r g L e u G ly L y s A r g lle lle lle G lu A s n L e u L e u T y r A r g T r p S e r A s n P stI 8 5 4 -- GGGCGTGTTACGCTGG^CTGCAGCGGACGGGAGGCACTGGATGACCTGGTACGTCCGTTT + -----------------------------+ --------------------+ --------------------+ -------------------- + 795 CCCGCACAATGCGACCTGACGTCGCCTGCCCTCCGTGACCTACTGGACCATGCAGGCAAA G ly A r g V a lT h r L e u A s p C y s S e r G ly A r g G lu A la L e u A s p A s p L e u V a lA r g P r o P h e 7 9 4 TATCTGCTGGAGGGGCGCAACGTTCCTGACTTCCGGAGCAGCATCGGGGCGCAGTACGGT --------------------- + -------------------- + -------------------- + -------------------- + -------------------- + ------------------- + 735 ATAGACGACCTCCCCGCGTTGCAAGGACTGAAGGCCTCGTCGTAGCCCCGCGTCATGCCA T y r L e u L e u G lu G ly A r g A s n V a lP r o A s p P h e A r g S e r S e r lle G ly A la G ln T y r G ly 7 3 4 GAATTTATCGGGAACGGCGACAATGTCGGTAAGCTGTTAGAAGGGGAATATTTTACGGTG --------------------- + -------------------- + -------------------- + -------------------- + -------------------- + ------------------- + 675 CTTAAATAGCCCTTGCCGCTGTTACAGCCATTCGACAATCTTCCCCTTATAAAATGCCAC G lu P h e lle G ly A s n G ly A s p A s n V a lG ly L y s L e u L e u G lu G ly G lu T y r P h e T h r V a l 674 CGTGGCTACCAGAAAGGGACCGTACACATTGTCTTTAAGCGTCCTGACCTTGTTGAAAAA +--------------- +---------------+---------------+---------------+-------------- + 615 GCACCGATGGTCTTTCCCTGGCATGTGTAACAGAAATTCGCAGGACTGGAACAACTTTTT A r g G ly T y r G ln L y s G ly T h r V a lH is I le V a lP h e L y s A r g P r o A s p L e u V a lG lu L y s CTGAATGATATTATTGCACGGCATTATCCTGGTTCATTGCCGCCACGAGTCjTGAjrTAAAC 6 1 4 ----------------- +---------------- +----------------- +----------------+---------------- +---------------- + 555 GACTTACTATAATAACGTGCCGTAATAGGACCAAGTAACGGCGGTGCTCAGACTAATTTG L e u A s n A s p I le lle A la A r g H is T y r P r o G ly S e r L e u P r o P r o A r g V a l* 5 5 4 AGAAAAGCCCTGGTATTTATGCCGGGGCTTTTTTTCTGTAATCATCATAATATCAGAACA --------------------- + -------------------- + --------------------- + -------------------+ --------------------+ --------------------+ 495 TCTTTTCGGGACCATAAATACGGCCCCGAAAAAAAGACATTAGTAGTATTATAGTCTTGT TGATGTGACATTGTCACATCAATTGTTGTTCATATCGATGCCATCTTTCCTGGCTGAATC 494 — i----------------1---------------- — —h--------------- 1- h 435 ACTACACTGTAACAGTGTAGTTAACAACAAGTA^AG|CT^CGGTAGAAAGGACCGACTTAG CM TGTCTTTTTAGATGCTGTTTTAGAGGAAGAACGGCGGGATGGCTTTTTCGATTCTGCTTC 434 --------------+-------------- +-------------- +-------------- +--------------+ + 375 ACAGAAAAATCTACGACAAAATCTCCTTCTTGCCGCCCTACCGAAAAAGCTAAGACGAAG CCGAATAATGTCACGAAGTGCCTTCTCTCCTACTTCATACCCTGCATTTTTGAGCGTGTC 374 --------------- 1--------------- _|--------------- 1---------------1_---------------(--------------- 1. 315 GGCTTATTACAGTGCTTCACGGAAGAGAGGATGAAGTATGGGACGTAAAAACTCGCACAG ACGAACATCTGCCAGTGTATAACCCTTTGTCCTTACCAGAACAATAATGTCATCACGAAT 314 -------------- +-------------- +-------------- +-------------- +--------------+-------------- + 255 TGCTTGTAGACGGTCACATATTGGGAAACAGGAATGGTCTTGTTATTACAGTAGTGCTTA BgUI GGCATCAAGAAAATCACGCAATGTTTTCCGTTGAGCGGTGAGATCTpGCAATTCCGATAA 254 + + + +■=- + + 195 CCGTAGTTCTTTTAGTGCGTTACAAAAGGCAACTCGCCACTCTAGACCGTTAAGGCTATT AGCAGCTTTTGCCAGCTGGATGTCATCATCTGAGTAAAATTTTTTAGAAGCCATTGGCAC 194 --------------------------- + ---------------------------- + ----------------------------+ --------------------------- + -------------------------- + ---------------------------- + 135 TCGTCGAAAACGGTCGACCTACAGTAGTAGACTCATTTTAAAAAATCTTCGGTAACCGTG TTTCTCTCCGTTTCATGTTGAAGTCTGATTTGATCATCAAAAACTGACTCCTGGCAGGCT 134 -------------- +-------------- +-------------- +-------------- +--------------+-------------- + 75 AAAGAGAGGCAAAGTACAACTTCAGACTAAACTAGTAGTTTTTGACTGAGGACCGTCCGA GCGCCCTGCCTCTGCCGTCTGCATAAGACTATGATGCACAAAAATAACAGGCTATAATGG ---------------+--------------- +---------------+---------------+-------------- + + 15 CGCGGGACGGAGACGGCAGACGTATTCTGATACTACGTGTTTTTATTGTCCGATATTACC CCTGAAAAACGGGA 1 4 ------------ +------- i GGACTTTTTGCCCT w as d istin g u ished by its low GC content (see section 7.2.4; Fig. 7.5). H ow ever, it rem ains to be determ ined if such a p u tativ e IS elem ent is still active. C om parisons of the predicted am ino acid sequence of orf2 to sequences p resen t in the databases revealed th at Orf2 shares hom ology (47-50% identity) to a fam ily of hypothetical proteins th a t have been identified in E. coli through the genom e sequencing project (B lattner et al., 1997). The functions of these p ro tein s YADD, YFAD, YHGA and YJIP are unknow n. O ne other hom ologue (55.6% identity) to Orf2 is the plasm id-encoded Orf73 of Yersinia pestis plasm id pM T l. Again, the function of this p ro tein of pM T l is unknow n. The p red icted am ino acid sequence of orfl displayed a w eak hom ology (29% identity, 35% sim ilarity) over 197 am ino acid residues of the C -term inal region to a p lasm id -en co d ed restriction m ethylase gene of IncH plasm id R478 (EMBL: U60283). H ow ever, no other sim ilar hom ologues w ere found. Finally, the p red icted am ino acid sequence of orf4 show ed no significant hom ology to o th er p ro tein s av ailable in the d atab ases and or/3 w ith E. coli h y p o th etica l p ro tein YDFB. The p o ten tial functions of all these orfs identified w ithin the leading region of Collb are sum m arised in Table 7.2.1. 7.2.4 G + C content of the Collb leading region The overall G + C content of the 11.7 kb leading region of Collb w as 55.7% (Fig. 7.5), w h ich is higher th an the predicted value of 51-52% for the genom e th at the p lasm id w as originally isolated in, nam ely Shigella sonnei P9 (O chm an and Law rence, 1996). The G + C content drops significantly below the average at base positions 7500-6000 to 45 % (Fig. 7.5). On the basis that the GC contents of closely related organism s ten d to be sim ilar, such a finding is indicative of an insertion into the backbone of the lea d in g region du rin g its evolution. Interestingly, this region corresponds to the position of orf5 and the putative IS elem ent w e have identified (Section 7.2.3). 103 Fig. 7.4. A lingm ent of the nucleotide sequences of s s i elem ents in the C ollb Tstrand and Frp o. L ine 1, s s i 3 ; line 2, ssz‘2; line 3, s s i l ; line 4, F r p o . N u c le o tid e s th a t a re hom ologous are boxed. C oordinates in the leading region sequence are ssi3, 2331; ssi2, 7036-7361; s s i l , 9928-10256. F rpo is taken from M asai and Arai (1997). Figure is taken from Bates et ah, (1999). Fig.7.4 2 0 ___________________ 30 10 A o a a a a c a a c T o c a A A a c C a c a a a a c T c c t a G a a a a G a a a a T T T T| 0 a a c c ' t | c | c t o c c o t c c A C lT C C T T C T O C C O T C C a [c]t C C T T C T a C C q T C C| a a a c c t t PF } t q | 'r t c c j c f r 70 T T T CC 0 A C a a a a a A - C 0 C T C A C C A C a T C AC g [c r~ ale a T | q a q a C A T C A C T o le C A c o T A T - T A G - T A C A T T T T T T T T T a: t A C t a a 00^ o A A C 0 O a c |o T O O COT COT COT COT O £ A G •r A o p r jT o a 0 T o a c a a c a 0 a G T a a c a a c G a a - c a a c a a c A a c A c a" a [ c ] a a [ £ a a 310 aaaa c tJ c a c c 120 C A O C T O T C C o a a o a c 0 a A a a A - C A o A 0 A - cl a a c [7 |t c |c [t 110 C A O C T O T C C O O O O O C o t c c o o o o o c T O T C C o a a a a c - c 0 c A A A - c a c A A A - c a c A A A A A G A C c 0 c A A A 180 t o 260 G A A O A G A G C A C A - Aj C C C T O O A C A C C C C T O O A C T 220 210 OcT [ tt_| oa C j c0 - C C C C C O O A C C A 0 - C C C C C O O A oA t J tc ao t T g A C C C C C 0 0 A q | c t | t a| qCt cC c c c | o o t | o a 0 o o a - - A A - - T T c A O A c A C C 320 a a a c c a c a c o le a T T A C c t c c t c A gIc - -j ole A |C c Aa 0 c A A A A OT C C A OT C C A A A CAa o c a c a E1 o|o o ( c [ 7 'T * A c a a a o c c A T C A a 0 a c c A T C A a o o fjvjc o lT f o |A j A ] a a t 340 a [o A O O T O T O C A O O O A O C O C 0 T C A c o o c o c o a c T T T C c a a c o c o A c A O a T a T O C A O O O A O C O C c o o c o c o a c T a CATC 240 230 A A a c c 330 G T OT C C A 280 a o o a C C C C T O O A C C C C C T O O A C A A 270 O a a a A o a T O T a c A a a a A O c a c o c c c 'c ] t J t * a a c ]c |t t t t t o a o a a T T T T O T O O A A A A A A A A A G T T T T o jc ]o O O A 200 0 A |o fo ]G |o 250 C O C A O c a c a a c a c a a a c | c J o [<j aaaa A A A A A T I T T O T O O T T T T O T O O A A A A A A C O O O A T O T O O T TJ A C a a c q o a a t a t a a |o a 60 150 T T O C C T T O C C T T q[a]c T T Q C C 190 T O a 0 0 [ g1 c | c g | o | a 130 A T t SO T T C T C C C C O C A Q T Q O A C C T T C T C C C C a C A O T O O A C C T T - T C C C C q C A O T q q A CC| 100 T T T CC 0 tJ c 40 a a a a c A C c T T c T T T A T 90 A T a a a a a C C C a T T T a c c |o fc |c |T c a c a q a A T a a a o[c 0 T C C G T C A C C A C O T C A C O Q O T T T CC 0 ' 80 T T T a A 0 a t* g |a |a O C A a T T T C C T a c a a t G G C C T a c a a t a a C C T a cfu ]a t T T o le ja q ac 300 290 c c c c a t |o 0 c C T T - A C A C A C A C A C A C ± F liL £ Fig. 7.5. G + C content of the Collb leading region. P anel show s G+C% profile of the 11.7 kb of the C ollb lea d in g region. h o rizo n tal line indicates the m ean v a lu e of 55.7% G+C. The The GC profile w as calculated u sin g the program m e W INDOW (w indow = 500; shift = 25) follow ed b y STATPLOT. A bove the p an el is a genetic m ap of th e lea d in g reg io n sh o w in g its o rganisation and position of genetic loci in relatio n to the G+C% co n ten t. In d icated on the m ap is th e nic site of the o r i T , the p u ta tiv e o r f s id e n tifie d , the s s i sequences and s s b , p s i B and a r d A . R estrictio n e n zy m e cleavage sites are also indicated; B, Bglll; C, Clal; P, Psfl; S, Sail. N ote th at at base po sitio n s 7500 - 6000 the G+C content d ro ps significantly to 45% w hich w o u ld be indicative of an insertion into the backbone of the leading region d u rin g its e v o lu tio n (orf5). Fig. 7.5 ss i2 s s il BC P n ic orf1 P orf2 ssi3 P c orf4 ardA orf5 psiA psiB orf3 0+0% 60 - 50 - 40 Position from nic (kb) orf6 P ssb 7.2.5 Sim ilarity of the Collb leading region to other conjugative plasmids Sequence d ata are only available for the leading regions of IncP plasm ids RP4 an d R751 (P ansegrau et al., 1994a; Thorsted et al., 1998) and regions of p la s m id F. T h ro u g h our co m p ariso n s, no significant hom ologies w ere detected b etw een the leading region of Collb and the genomes of RP4 and R751. E xtensive nucleotide sequence sim ilarities betw een Collb and plasm id F w ere fo u n d ex te n d in g from the Shine-Dalgarno sequence upstream of ssb to 2b p d o w n s tre a m of p s i A . O ver this region, there is approxim ately 80% co n serv atio n b e tw een Collb and F at the nucleotide sequence level. H ow ever, it is im p o rta n t to n ote that the sequence of the corresponding region of F has n o t b een p u b lis h e d in full particularly the region betw een ssb and psiB. N o hom ology to F w as detected beyond the ps iA gene. It rem ains to be determ ined if any hom ology exists further upstream of Collb ssb to plasm id F. The sim ilarities betw een Collb and plasm id F has led us to conclude that th e reg io n from s s b to p s i A m ay exist as a conserved cassette-like m odule, w hich m ay also be p resent on a num ber of other plasm ids. Cross-hybridisation to p lasm id s from Inc groups FI, FII, FIV, FVI, K, II, Y, 9 and B shows that ssb an d psiB genes exist on a num ber of conjugative plasm ids all of which display a h ig h d eg ree of id en tity w ith each other (Golub et al., 1988). The proposed ssbp s i A m o d u le is se p a ra te d from the rest of the leading region of Collb by a p u ta tiv e IS elem ent (or/5). The only other p a rt of the Collb leading region that bears sim ilarity w ith th e lead in g reg io n s of other conjugative plasm ids is the a r d A gene (Section 7.2.3). 7.2.6 Exam ination of the Collb leading region for type I restriction enzym e cleavage sites The lead in g region of Collb has been exam ined for type II restriction en zy m e cleav ag e sites th ro u g h restriction enzym e analysis of the p lasm id 104 Table 7.2.2 Type I restriction enzyme cleavage sites found within 11.7 kb of the leading region of Collb. Family Enzyme K EcoKI EcoB EcoD Sh/SB A C Recognition sequence a a c n 6g t g c Number sites 1 Position in sequence 9646-9658 2 1 2438-2452; 7286-7299 StySF TGAN8TCCT t t a n 7g t c y g a g n 6r t a y g a a c n 6g t r c EcoA g a g n 7g t c a 2 2144-2163; 7643-7657 EcoE g a g n 7g t c a - EcoR124 g a a n 6r t c g 3 4248-4266;6461-6473; 10796-108 EcoDXXI t c a n 7r t t c 1 7086-8000 - - 9646-9658 Type I restriction enzyme cleavage sites in the leading region of Collb were determined by comparing the sequence data available with the recognition sequence of the enzyme using the GCG program FINDPATTERNS. Restriction enzyme recognition sequences were obtained from REBASE via the World Wide Web (http ://reb ase.n eb .co m /reb ase/rebase.html). N = unspecified base, R = A /G , Y = C /T. (Rees et al., 1987). Several of these sites are indicated in Fig. 7.3. H ow ever, in v i t r o an aly sis of any DNA for typ e I cleavage sites is m ade difficult by the n a tu re of the cleaving reactions for this particular class of enzym e (C hapter O ne, section 1.5.2) an d their lim ited availability. The m ost appropriate w ay to determ in e the n u m b er of type I cleavage sites is to probe any sequence data that is available for the enzym es recognition sequence. C ollb h a s b e e n estim ated to have approxim ately 7 EcoKI restrictio n en zy m e cleav ag e sites (B. M. W ilkins, p erso n al com m unication). From stu d ie s w ith a r d A an d the arrfA -in d e p en d e n t restric tio n evasion p ro cess m ed iated b y C ollb, it is of interest to this project to determ ine how m any of these sites can be fo u n d w ithin the leading region of Collb. Consequently, the se q u e n c e d a ta w as p ro b e d fo r th e E coK I re c o g n itio n sequence A A C N N N N N N G T G C (Wilson and M urray, 1991). O ur analyses revealed that on ly one of the seven EcoKI cleavage sites can be found w ithin the sequenced reg io n of the C ollb leading region at base positions 9646-9658 in orf6 (Table 7.2.2; Fig. 7.3). Finally, the sequence data of the Collb leading region was also probed for th e cleavage sites of a num ber of other type I restriction cleavage sites. The enzym es chosen an d the results obtained are in Table 7.2.2. 7.2.7 Repeat sequences w ithin the leading region T hree copies of a ~328-bp repeat sequence w ere identified w ithin the seq u en ce d a ta av aila b le for the leading region of Collb. Each rep e at is p o sitio n ed in the sam e orientation approxim ately 60 nucleotides upstream of eith er ssb, orf4 or orfl and the three repeats share betw een 80 and 85% identity to each o th er over th e ir entire length. The repeats w ere labelled s s i l , s s i l or ssz‘3 a n d are fo u n d at base p o sitio n s 1787-1459, 4678-4353, 11711-11381, respectively (Fig. 7.3). C om parisons of the nucleotide sequence of each ssi to those p resen t in the databases identified one hom ologue belonging to plasm id 105 F, originally term ed ssi D, w hich has now been renam ed F rpo. Each ssi shares -60% id e n tity over 160-bp of th eir central p ortion to the F rpo sequence of p lasm id F, as sh o w n in Fig. 7.4 (N om ura et al., 1991; M asai and A rai, 1997; EMBL: D90178). The F rpo sequence w as originally identified in the leading region of the p lasm id F as a single-stranded D N A synthesis initiation sequence, hence the n am e s s i D . Early studies w ith the single-stranded phage M13 carrying s s i D sh o w ed th a t the locus plays an active role in prim in g DNA synthesis on sin g le-stran d ed viral DNA (N om ura et al., 1991). It w as later found by M asai and A rai (1997) th at the repeat locus also had additional properties that allow it to fu n c tio n as a n ovel type of p ro m o te r active in sin g le-stran d ed form . C onsequently, the locus was renam ed F rpo. The p ro m o te r form s in single-stranded DNA th ro u g h the form ation of secondary structure. The close identity that exists betw een the ssi sequences found in the leading region of Collb and Frpo of plasm id F suggests a sim ilar role in C ollb. It is envisaged that such sequences m ay act as single-stranded p ro m o ters th at form on the transferring plasm id strand and facilitate transient expression of the leading region genes in the recipient before regeneration of a com plem entary stra n d through DN A synthesis. The fo rm ation of secondary structure and action of these ssi sequences as p ro m o ters involved in enhancing expression of leading region genes d u rin g conjugative transfer of the plasm id is discussed further in C hapter Eight. 7.3 Summary G eneration and analysis of the 11.7 kb of nucleotide sequence data for the Collb leading region led to the identification of the following features: * Identification of ten p o tential pro tein coding regions, three of w hich have been prev io u sly characterised (ssb, psiB, ardA). 106 * Three of the novel or/s identified include an F-like p s i A gene, another w hich encodes a p o ten tial DNA binding protein (or/6) and one w hich is a m em ber of the E. coli fam ily of genes know n as YHGA (or/2). * Id en tificatio n of an u n u su al IS elem ent and a related O rf (or/5), the am ino acid seq u en ce of w hich bears sim ilarity to a num ber of putative transposase enzym es. * All or/s id e n tifie d w ith in the lead in g region are orientated in the sam e d irectio n to w a rd s o r i T (Fig. 7.2). Such an observation raises the question of w h eth er the genes p resen t in the leading region of Collb are transcribed as p art of an operon. * The or/s id e n tifie d from ssb to ps iA of Collb and plasm id F appear to be conserved as a cassette-like m odule that m ay be present on a num ber of other conjugative p lasm id s. * Three of th e or/s ( a r d A , orf6 and or/2) identified had hom ology to three or/s p re s e n t on Y e r s in ia p es ti s p lasm id pM T l. H ow ever, there is no general sim ilarity in the genetic organisation of these two plasm ids. F inally, th e m o st in terestin g finding was the identification of th ree rep eat sequences (-328 bp) present in the leading region of Collb which appear to be analogous to F rpo. It is envisaged that the Collb ssi sequences, like F rpo, act as sin g le-stranded prom oters. Such findings have led us to assum e a m odel ex p lan atio n as to h o w expression of the leading region genes on Collb are enhanced d u rin g conjugative transfer of the plasm id (C hapter Eight). 107 Chapter Eight General Discussion T he fact th a t C ollb h as a n a rro w rep lica tio n -m ain ten a n ce ran g e e n co m p assin g g e n era such as Escherichia, S a lm o n el la , Shigella and Kle bsiella from the E nterobacteriaceae fam ily raises the question of w hy the plasm id has e v o lv ed tw o se p a ra te strategies for restriction avoidance. H ow ever, the fin d in g th a t stra in s of Escherichia coli encode more th an a h u n d red know n RM system s (h ttp ://re b a se .n e b .c o m /re b a se /re b a se .h tm l) not only highlights the u b iq u ity of these system s in natu re b u t suggests th at the ability of Collb to overcom e ty p e I a n d type II restriction barriers m u st be a significant if no t essential aid to transm ission. W hether Collb has recognition sites for all these enzym es an d w h e th e r the restriction-avoidance m echanism s encoded by Collb are active against all these R-M system s has yet to be determ ined. The resistan ce of Collb to both type I and type II restriction enzym es d u rin g its tran sfer b y conjugation involves two distinctive m echanism s. O ne m e c h a n ism in v o lv e s a sp ecialised p lasm id -en co d ed a n tire stric tio n gene k n o w n as a r d A , th e p ro d u c t of w hich is active ag ain st type I restrictio n enzym es. A n o th e r m echanism , w hich acts independently of a r d A , alleviates restrictio n of C ollb from bo th type I and type II enzym es in the recipient in second or su b seq u e n t rounds of transfer. The m ain th ru st of this thesis w as to in v estig ate fu rth e r these tw o Collb restriction-evasion m echanism s. The use of a specially d e sig n ed genetic system that exploited the sensitivity of IncPp p la s m id R751 to re stric tio n w as in stru m en tal in allo w in g a n u m b er of observ atio n s to be m ad e about these tw o m echanism s and helped to rule out h y p o th eses w h ic h w ere m ade for the ard A -in d ep en d en t m echanism . The genetic sy stem re lie d on m easu rin g C ollb-m ediated rescue of R751 from d e stru c tio n by EcoKI (I) and EcoRI (II) during conjugation. co n c lu sio n s a n d For sim plicity, o b se rv a tio n s m ad e for the C ollb a r d A - i n d e p e n d e n t 108 m ech an ism w ill be discussed first follow ed by those m ade for the A rdA system . O ther w o rk described in this thesis involved an investigation into the re g u la to ry m ec h a n ism g o v e rn in g lead in g region gene expression d u rin g conjugation. The p ro p o sed m odel w ill be discussed last. The C ollb restriction-avoidance m echanism w hich was identified using an a r d A m u ta n t of Collb w as originally described by Read et al. (1992). It was p o stu lated by R ead et al. (1992) th at am plified transfer of 93-kb Collb m ight o v erw h elm the restric tio n enzym es in the recipient by substrate saturation, reg ard less of w h e th e r they are type I or type II enzym es. Such a scenario should result in alleviation of restriction in trans. H ow ever, data presented in this thesis are n o t in agreem ent w ith this 'substrate sa tu ratio n ' hypothesis. F u rth e rm o re , re s u lts su g g e st th a t one m echanism is n o t responsible for a llev iatin g re stric tio n from b o th type I and type II restriction enzym es as p ro p o se d by R ead et al. (1992). o b s e rv a tio n th a t u n lik e th e Such a conclusion w as d raw n from the a n ti-ty p e II m ech an ism , C o llb -m e d ia te d alleviation of ty p e I restriction occurs independently of any plasm id transfer and ap p ears to be expressed constitutively and in trans (Fig. 3.6 B). For clarity, observations m ad e for these tw o m echanism s will be discussed separately. The p o ssibility th at Collb alleviates type II restriction by overw helm ing the enzym es by su b strate saturation during amplified transfer w as ruled out by d ata in C hapter T hree (Fig. 3.3 A). The key finding w as that CoWodrd confers no significant p ro te c tio n on restriction-sensitive plasm id R751 w hen the tw o p lasm id s are co -tra n sferre d together to a restricting recipient. If am plified tra n s fe r of C ollb fro m the d o n o r to the recip ien t re su lte d in su b stra te satu ratio n of restrictio n enzym es, then restriction should be alleviated in trans an d the tran sm ission frequency of R751 should approach that observed in n o n ­ restricting m atings. H ow ever, data in Fig. 3.3 (A) clearly show s this not to be the case. It also seem s unlikely from these results th at am plified transfer of Q o W b d r d changes the physiology of the recipient such that there is a transient 109 b re a k d o w n of restriction. A gain, such a scenario should result in alleviation of restriction in the recipient in trans. The fin d in g th at Collb cannot alleviate ty p e II restriction in trans su ggests th a t the plasm id m ust avoid destruction from such enzym es by a cisacting m echanism (see C hapter Six). A num ber of phages are know n to encode a n ti-re s tric tio n m ec h an ism s th a t are czs-acting. O ne su ch m ech an ism involves th e ra n d o m incorporation of unusual bases into the genom e of the p h ag e, such th a t th e DNA is n o t recognised as unm ethylated, the signal for cleavage b y m an y restriction enzym es (Kruger and Bickle, 1983; Bickle and K ruger, 1993). T hese unu su al bases are quite often m odified versions of either a d en in e or cytosine residues as these are com m only the substrates for m ost m eth y ltra n sfe ra se enzym es w ithin restriction enzym e recognition sequences. S u ch b a ses are m a d e at th e level of n u cleo tid e m etab o lism a n d are in c o rp o ra te d in to the genom e of the phage only. C onsequently, p h a g es d e m o n strate resistance to restriction; yet, their presence in a cell has no effect o n re s tric tio n in trans. It seem s unlikely that C ollb em ploys a sim ilar m ech an ism of restriction-avoidance as the plasm id is sensitive to restriction en zy m e cleavage in vitro. Furtherm ore, R751 DNA should also be m odified w h e n p resen t in th e sam e cell as Collb by using the sam e nucleotides. The p o ssib ility th at Collb carries another antirestriction gene(s) cannot be ru led o u t by an y data in this thesis. One idea is that such a gene(s) encodes a non-specific m ethyltransferase enzym e that random ly m ethylates the genom e of th e p lasm id. Such a gene m u st only be expressed d u rin g conjugation, as C ollb DN A isolated from vegetatively growing cells is sensitive to the effects of restric tio n in vitro and in vivo du rin g transform ation (Rees et al., 1987; Read et al., 1992). If the gene is lim ited to expression d u rin g conjugation, it seem s likely th at the gene should be situated w ithin the leading region of Collb. E x am in atio n of th e sequence data generated for the leading region of C ollb (C h a p te r Seven) id e n tifie d only one p o ten tial orf th a t m ig h t e n co d e a m eth y lase. The p ro d u c t of o r f l , displays w eak sim ilarity to a p u ta tiv e 110 restrictio n m eth y lase of IncH plasm id R478 (U60283) of Serratia marcescens. H o w ev e r, the reg io n of hom ology (29% identity, 35% sim ilarity o v er Cterm in al 197 am in o acids) does n o t encom pass the N-6 adenosine m ethylase sig n a tu re ty p ical of DN A m ethylases. No other or/s identified w ith in the com plete n u cleo tid e sequence of Collb w ere found to have hom ology to other restriction m ethylase genes (Fig. 8.1). It seem s unlikely from these predictions m ade for O rfl that the putative p ro te in m e th y la te s C ollb d u rin g conjugation to p ro tect the plasm id from d e stru c tio n b y ty p e II restrictio n enzym es. F u rth erm o re, if the enzym e m eth y lates in trans, tim ing of expression of orfl and subsequent m ethylation of C ollb w o u ld b e an im p o rtan t factor to consider as no cross-protection is conferred on R751 d u rin g co-transfer of the two plasm ids (see C hapter Three). F u rth er analyses of orfl are required before its role in the avoidance of type II restrictio n en zy m es by Collb can be ruled out. E xperim ents that m ig h t be in stru ctiv e w o u ld involve cloning orfl into a vector w here it can be expressed co n stitu tiv ely a n d exam ining the ability of the resulting construct to rescue eith er tran sferrin g R751 or a restriction sensitive strain of phage X from the in v iv o effects of ty p e II restriction enzym es. Such a construct could also be used to determ in e if o r f l has a role in protecting Collb A rdA ' from type I restriction enzym es. D esign of an e x p erim en t to reveal details ab o u t an anti-restrictio n m echanism th at is believed to be czs-acting is difficult. F urther com plications are th a t such a m echanism appears to only be active d u rin g the process of conjugation, as sh o w n by the inability of Collb to evade type II restriction d u r in g th e p ro c e ss of tra n sfo rm a tio n (Read et a l ., 1992). One line of in v e s tig a tio n c o u ld in v o lv e cloning restrictio n frag m e n ts of C ollb an d e x am in in g th em for th eir ability to confer protection against the effects of E c o R l u p o n the resu ltin g construct. The vector chosen w ould require EcoRI cleavage sites a n d the Collb oriT locus. Such an analysis could only be utilised if after its m obilisation by the Collb nic m utant (p N A ll) the vector is sensitive 111 Fig. 8.1. Genetic organisation of In cll plasm id Collb. The genetic m ap w as deduced from the com plete n u c le o tid e sequence of the p lasm id , w h ich w as recently m ade available (Sam pei a n d M izobuchi; EMBL: AB021078). The genom e is represented by the lines m ark e d in kb coordinates. A ll o p e n re a d in g fram es (or/s) id e n tifie d are sh o w n as rectan g les. The tra n s c rip tio n a l d irec tio n s of each o r / is indicated b y an arro w an d th o se ru n n in g from left to rig h t are below the line and those from rig h t to left are above the line. The or/s that are p a rt of the T ral region of the plasm id are S B rectangles, w ith the exception of the shufflon, rci locus a n d E D T A -resistant n u c le a s e n u c . The or/s th at form th e Tra2 region of th e p la sm id are I 1 rectangles, w ith the exception of s o g and eex. The T ra l an d Tra2 regions are indicated on the m ap, broken lines h ig h lig h t or/s that encode pili; these are the flexible p ilu s in T ral an d the rigid p ilu s in Tra2. Loci w ith defined functions are indicated by I H rectangles. The or/s com prising the leading region of the plasm id , som e of w hich w ere identified d u rin g this w ork, are indicated by C53 rectangles. P u tative or/s w ith no defin ed function are unlabelled. Locations of o r i T and o r i V are in d ic ate d . T he lo catio n s of p o te n tia l sin g le -s tra n d e d p ro m o ters {s s il , ssi2 and ssi 3 ) identified d u rin g this w ork are also show n Fig. 8.1 0 kb I________ I 10 kb ▼ L—I II .. ■ *- I□ □ - > > — ^ -> cib repZ ibfA ibfC imm <- x6 repeat resolvase 20 kb 10 kb □> iz z o resA parA parB ; impB ssi3 30 kb 20 kb K3KSSS3S3 ESSSHES > > CS KS CSS CSSSS — > ssb ssil ssi2 oriT T 30 kb w w v or/6 w \\V \ ES SSH orf 5 ardA orf4 orf3 psiB psiA trbB trbC e s s > orfl orf2 trbA 1 40 kb ■ -> n ikA x3 repeat ~ 40 kb r 50 kb pnd nikB exc I eex I r r traY_______ traX traW traV traU traT traR i==1RI Pi=n I 60 kb 50 kb Tra2 sogS traQ traP 1^-1 < — traO II < ■■ traN —... < traM trau± : - 1< —— i 3 S08 l nuc traK tral 1 70 kb 60 kb tral traH traG traF traE rri cVniffinr. ____21 rilU vilT 80 kb 70 kb Tral pilS vilR ilO yiip _______ I________I________I_______ I_______ !_______ 1 90 kb 80 kb traC 90 kb (ifi traB traA 93.399 kb to the effects of EcoRI in the recipient. M obilisation of constructs that carry a p a rticu la r fragm ent of Collb m ight be instructive in determ ining if a particular gene or locus of th e plasm id can be im plicated in the anti-type II restriction e v asio n m ech an ism . H ow ever, the possibility th at m ore than one locus is in v o lv ed m ay com plicate such an investigation further. Finally, it is possible that Collb avoids the effects of type II restriction by its m o d e of e n try du rin g conjugation. It is generally assum ed that restriction enzym es are lo cated w ithin the cytoplasm of their host cell. This assum ption arises from early attem pts to isolate new R-M systems, w here host cell extract is able to frag m ent D N A (Wilson and M urray, 1991). The fin d in g th at restriction-m odification system s can be isolated from bacteria fo u n d in every ecological niche suggests that the random dispersal of all restrictio n enzym es throughout the cell is a too sim plistic view to ap p ly to every k n o w n system w ithout further investigation. It m ight be possible th at certain restriction enzym es are com partm entalised w ithin the cell, rather th an ev e n ly d is trib u te d th ro u g h o u t. K ohring and M ayer (1987) sh o w ed by im m u n o -lo calisation that betw een 70-90% of total EcoRI restriction enzym e is found in the cell envelope of E. coli, w hilst 60-70% of the m ethyltransferase is p resen t w ith in th e cytoplasm . Such results are in agreem ent w ith the 'cellular d e fe n ce ' h y p o th e s is described in C h ap ter One, section 1.5.6, w h e re the m eth y lase p ro tec ts the cells ow n DN A from d estruction and the restriction enzym e p a rt cleaves any foreign DNA entering the cell. If certain restrictio n enzym es are com partm entalised w ithin the cell, p articu larly in the periplasm ic space, then such findings by Kohring and M ayer (1987) could ex p lain w hy som e cloned R-M genes in E. coli are viable in the r+m~ configuration. Furtherm ore, it m ay be possible that Collb itself avoids the effects of EcoRI restriction d uring conjugation by determ ining a DNA tran sp o rt sy stem th at e n su res transferring DNA does not come into contact w ith the EcoRI-rich p eriplasm ic space. First, this could be the outcom e of the plasm id m atin g -p air form ation system , w hich ensures direct entry of transferring DN A 112 into the cytoplasm of the recipient. Bacteriophage X is sensitive to restriction by EcoRI. D u rin g the initial stages of infection by the phage, the genom e is injected as d u p lex DNA into the periplasm ic space before it traverses the inner m em b ran e b y a passive sugar u p tak e m echanism (D iirrenberger et a l ., 1991). P resum ably d u rin g transfer of R751, the plasm id enters the periplasm ic space. A ccording to the m odel proposed by D iirrenberger et al. (1991) described in C h a p te r O ne, section 1.3.1, p lasm id DNA transferring by the process of e n tero b acterial conjugation enters the periplasm of the recipient cell before trav ersin g the in n er m em brane by a sim ilar m echanism to th at of p h ag e X. H o w ev er u n lik e X, plasm id DNA at this stage is still in single-stranded form . R estriction en zy m es are tho u g h t to cleave single-stranded DN A only w h e n canonical d u p le x stru ctu res are form ed (Nishigaki et al., 1985; C hapter O ne, section 1.5.4). P erh ap s du rin g C ollb-m ediated conjugation conversion of the p la sm id into d u p le x DN A is delay ed until entry to the cytoplasm of the recipient. In contrast, perhaps com plem entary strand synthesis of R751 occurs concurrently d u rin g transfer w ithin the periplasm ic space, m aking the plasm id susceptible to cleavage. Such hypotheses are difficult to test. If synthesis of the second stran d is delayed until entry to the cytoplasm , this m ight explain w h y the plasm id carries a psiB gene. Single-stranded DNA entering the cytoplasm w ill induce the SOS response (Jones et al., 1992). R751 does not carry a sim ilar psiB gene and it has yet to be tested if transfer of the plasm id induces the SOS response. If the findings by Kohring and M ayer (1987) are correct and Collb does in fact inject its D N A d irectly in to the cytoplasm of the recipient, th e n m o b ilisatio n of any restrictio n sensitive vector by the Collb n ic m u ta n t (p N A ll) sh o uld avoid the effects of EcoRI restriction in the recipient. D uring w o rk described in this thesis (Chapter Four), p N A ll w as often used to m obilise the cloned C ollb o riT plasm id (pNA2). M obilised pN A 2 w as not sensitive to the effects of EcoRI restriction d u rin g its transfer (data not shown). H ow ever, w h eth er this finding is due to protection by the Collb transport system or to the 113 relative in sensitivity of pN A 2 to EcoRI through carriage of only one cleavage site is n o t kn o w n . These experim ents need to be repeated using a sim ilar v ecto r c a rry in g e n o u g h EcoRI cleavage sites to m ake it a suitable assay of restriction. U nfortunately, tim e w as not available during this work to p u rsu e th is in v estig atio n further. The 'su b strate saturation7 hypothesis as a m eans of explaining how Collb A rd A ' av o ids th e effect of type I restriction enzym es can be ruled out by data p re se n te d in th is thesis (C hapter Three). The substrate saturation scenario relies o n a m p lifie d transfer of C o llb d r d from the d o n o r to the restricting recipient. H o w ev er, C oW odrd A rdA ' plasm id is still able to alleviate the effects of type I en zym es w hen already established in the restricting recipient (Fig. 3.6 B). O ne h y p o th e sis that has yet to be ruled out is th at either transfer of C ollbdrd A rdA ' or presence of the plasm id in the restricting recipient induces a change in cell physiology, w hich in tu rn leads to a breakdow n of restriction. In te re stin g ly as early as 1965, G lover and C oulson described experim ents involving the b reak d o w n of type I restriction due to changes in cell physiology. Such e x p erim en ts involved the conjugative transfer of the EcoKI R-M genes from an E. coli K-12 H fr donor to an E. coli B F recipient. The restrictio n p h e n o ty p e is d e la y e d in the re c ip ie n t for 15 g e n eratio n s w h ilst th e m o d ificatio n p h e n o ty p e is observed im m ediately (Prakash-C heng and Ryu, 1993). This delay ed expression of restriction is due to the activity of ClpX and C lp P , w h ich m ak e u p the ClpXP protease of E. coli. ClpXP m o d u la tes restrictio n activity by operating post-transcriptionally d u rin g assem bly of the EcoKI com plex (see C hapter One, section 1.5.3). ClpX and ClpP form p art of the E. coli h eat shock response that is associated w ith changes in cell physiology. It m ay be possible th at either conjugative transfer of Collb A rd A ' (Fig. 3.4) or just the presence of the plasm id in E. coli (Fig. 3.6 B) is able to stim ulate e ith er ClpXP or an o th er protease, w hich in tu rn m odulates type I enzym e restriction activity. In contrast, Collb m ay encode its ow n protease, although 114 e x am in atio n of the com plete nucleo tid e sequence of the plasm id, w hich is cu rren tly available (AB021078), identified no ORFs that have potential protease activity. H ow ever, it should be noted that several of the ORFs identified are still w ith o u t any predicted function (Fig. 8.1). U n fo rtu n ately , the hypothesis th at Collb stim ulates activity of ClpXP w as n o t tested d u rin g this work. It is predicted that if Collb stim ulates ClpXP activity, w h ich in tu rn m odulates restriction by EcoKI, transfer of the plasm id to c lp X and c lp P m u tan t strains of E. coli K-12 sho u ld resu lt in a significant red u c tio n in th e plasm ids transm ission, com pared to that show n in Fig. 3.2 B. Such a h y p o th esis should be easily tested by obtaining the appropriate strains described. Finally, it rem ains possible that Collb A rdA ' encodes in addition to a rd A a n o th er anti-restriction gene(s) active against type I enzym es and is expressed co nstitutively. Recent findings by Rastorguev et al. (1998) show ed th at In cll p lasm id R64 carries a gene that encodes a m oderate anti-restriction phenotype a ctiv e sp ecifically ag ain st type I enzym es. The gene identified is n o t h o m o lo g o u s to a r d A , although the protein does share a sim ilar nine am ino acid m otif com m on to A rdA and other anti-restriction proteins (see C h ap ter O n e, sectio n 1.7). The R64 gene is located ad jacen t to the p la sm id s stre p to m y c in a n d tetracycline resistance genes, d o w n stre am from o r iV . C om p ariso n of the nucleotide sequence of the gene w ith those present in the datab ases revealed only one hom ologue, that of the arsR gene of IncFI plasm id R773. ArsR is a rep resso r of the arsenical resistance operon arsR D A B C . H o w ev er, R64 carries no such operon as show n by the sensitivity of cells carry in g the plasm id to treatm ent w ith arsenical chem icals (Rastorguev et al., 1998). A nalysis of the com plete nucleotide sequence of Collb revealed th at the p lasm id does n o t carry a sim ilar gene. The arsR gene of R64 was identified by cloning fragm ents of the plasm id and exam ining the resulting recom binants for their ability to confer protection on restriction sensitive phage X. It w ould be of interest to this work to apply a 115 sim ilar strateg y to Collb, w ith the aim of identifying a region of the plasm id th a t is responsible for constitutive alleviation of restriction by type I enzym es. Zygotic ind u ctio n of Collb leading region genes, psiB and ssb, has been d e m o n s tra te d (Jones et al., 1992; C hapter One, section 1.6.4). Only recently, evidence h as b een obtained to show that transcription of a rd A is enhanced in th e recipient cell d u rin g the process of conjugation. The technique used was a com petitive RT-PCR assay, which is able to quantify mRNA transcript levels of a p a rticu la r gene. A full description of the details relating to this technique is a v a ila b le in A lth o rp e et al. (1999). D uring conjugation, a r d A mRNA tran scrip ts w e re m easured and com pared to those from vegetatively grow ing cells. T ran script levels of a rd A increased in a short pulse during conjugation, w h ich b eg an a t ab o u t five-six m inutes. Transcript levels later dim inished as tra n sc rip tio n is tu rn e d d ow n an d then existing tran scrip ts w ere d eg rad ed . C om parisons of these results w ith the tim e of entry of ColTbdrd to the recipient cell, nine m inutes, suggest that transcription of a rd A begins before com pletion of the first ro u n d of plasm id transfer (Althorpe et al., 1999). The b u rs t of a r d A tran scrip tio n during conjugation in com parison to th at detected in vegetatively grow ing cells suggests that like psiB and ssb , a rd A is also subject to zygotic induction. This view is further supported b y data p resen ted in this thesis (C hapter Three and Chapter Five), w hich has also been p u b lish ed in conjunction w ith the transcriptional analysis of the ardA gene in A lth o rp e et al. (1999). In experim ents involving the use of the R751 restriction assay, A rd A -m ediated restriction alleviation was observed to be optim al w hen enco d ed by a transferring Collbdrd plasm id rather than an equivalent plasm id resid en t in the restricting recipient (Fig. 3.5; Fig. 3.6 C). Such results im ply that expression of a rd A is activated d uring conjugation. T ran scrip tio n al activation of a rd A in the recipient w as indicated by the stro n g p ro tectio n conferred by a co-transferring C o lJ b d rd plasm id (Fig. 3.5 A) b u t lack of even basal protection by the C oU bdrd nic m utant (Chapter Four and C h ap ter Five, Fig. 5.3 A) im m obilised in the donor. 116 Thus, it is inferred that n e ith e r a r d A m R N A or A rdA p ro te in are tra n sp o rte d from the d o n o r to recipient cells. R751 is protected to an interm ediate extent by a co-transferring C o l l b d r d A rd A m u tan t (Fig. 3.4 A ), b u t this effect is attributed to the a r d A - in d ep e n d en t restriction-avoidance response described already in this chapter. Some in terestin g observations about the tim ing of expression of a r d A can be m ad e from the studies co nducted on a r d A tra n sc rip tio n d u rin g co n ju g atio n as m easu red by the com petitive RT-PCR assay (A lthorpe et al., 1999). T ran sc rip tio n of a r d A , in the recipient, extends over 10 m inutes or m ore in the cell population. Transfer of Collb from the donor to the recipient is n o t sy n ch ro n ised and transcription in a single pair of conjugating cells is likely to be less th a n 10 m inutes. From m easuring and analysing the tim e of en try of the p lasm id , the onset of a rd A transcription can be estim ated. The rate of D N A tra n s fe r in H fr m atings is estim ated to be 45-kb m in '1 at 37°C (B achm ann et al., 1976). A ssum ing transfer of Collb occurs at the sam e rate, tra n sfe r of th e 100-kb plasm id (pLG221) should be com pleted w ith in 2.5 m inutes. C om pletion of the first round of transfer in a bacterial population is m ark ed by a su rg e of transconjugant production. su rg e occurs after nine m inutes. In the case of Collb, this The level of a r d A tran scrip ts increase in abundance after five-six m inutes, w hich suggests that transcription of this gene is initiated w ith in a m inute of entry of the plasm id. Such tim ing indicates that tran scrip tio n occurs shortly after entry of the leading region to the recipient cell. One in trig u in g question is w h at is the regulatory m echanism em ployed by Collb to allow enhanced expression of leading region genes, psiB , ssb and a rd A , in a transient b u rst shortly after conjugative transfer of the plasm id. It is kno w n that plasm id D N A is transferred to the recipient cell in single-stranded form , yet, it seem s likely from the observations m ade above that these genes are expressed before transfer of the plasm id has finished. By sequencing the lead in g region of C ollb, the aim of C hapter Seven w as to use such data not 117 only to m ap genetic loci, b u t to look for regulatory sequences that m ight give an in d icatio n of the m echanism governing leading region gene expression. For all p lasm id s exam ined, a rd A and p siB -ssb genes are orientated in th e ir le a d in g reg io n s such th a t the DNA stra n d th a t is th o u g h t to be tra n sfe rre d to the recipient cell d u rin g conjugation also corresponds to the te m p la te s tr a n d for tran scrip tio n of these genes (Fig. 7.5). A nu m b er of p u ta tiv e o rfs w ere identified from the sequence data generated, the predicted functions of w h ic h are described in C hapter Seven, section 7.2.3. Interestingly, all the o rfs id en tifie d w ere orientated in the sam e transcriptional orientation as a rd A , psiB a n d ssb (Fig. 7.5). P e rh a p s th e m ost interesting finding w as th e identification of three rep e at seq u en ces of ~328-bp in length, which w ere nam ed s s i l , ssi2 and ssi3 . The repeats sh are betw een 80-85% sequence identity w ith each other over their en tire length. R em arkably, each of these repeats aligns w ith only one other sequence in th e databases, a sim ilar sequence found w ithin the leading region of plasm id F, originally called ssiD , now renam ed F rpo (Fig. 7.4). The F locus w as originally isolated by N om ura et al. (1991) and show n to be a sin g le -stra n d e d initiation sequence capable of su p p o rtin g synthesis of p rim ers for D N A replication, hence the nam e s s iD . Later w ork carried out by M asai and A rai (1997) show ed th at the ssiD locus has additional pro p erties w h ich allow it to function as a novel type of prom oter active only in single­ stra n d e d form . C onsequently, the locus was renam ed F rpo. The elaborate stem -loop s tru c tu re p ro p o se d by M asai and A rai (1997) extends over -300 n u cleo tid es in the single strand and includes a -35 and -10 sequence and a tran scrip tio n start site (Fig. 8.3). M asai and Arai (1997) found that Frpo is only active as a p ro m o te r in single-stranded DNA. H ow ever, dsD N A can be activated for tran scrip tio n from such prom oters by denaturation. The Collb ssi sequences h a v e p red icted secondary structure sim ilar to that of F rpo (Bates et al., 1999; Fig. 8.2). The putative -10 regions of the Collb ssi sequences contain n u m e ro u s m is-m atch es, w hich m ay aid p ro m o ter function by facilitating 118 Fig. 8.2. Com parison of the proposed secondary structure for Fr p o w ith the sequences of the Collb ssi sites. Figure is tak en from Bates et al. (1999). The F rpo stru c tu re is from M asai an d A rai (1997) w ith the exception th a t o p posing G and T residues are paired. The a rro w in d icates leftw ard extension from the 5' end of the RNA. th e rm o d y n a m ic a lly fav o u red v a ria n t of th e sequence, F rpo* is a s s i stru c tu re s w e re calculated u sin g the m fo ld 3.0 RN A folding p ro g ram (M athew s et a l ., 1999), which was obtained via (h ttp ://w w w .ib c .w u s tl.e d u /z u k e r /). the World Wide Web U p p e r case letters indicate n u cleo tid es th a t conform w ith the consensus sequences of the -35 an d -10 reg io n s (5’TTGACA-3' and 5-TATAAT-3'). Figure w as draw n by B. M. W ilkins. Fig. 8.2. F rpo + -1 0 ^ -3 5 ____ i c a a 5 ' - c c g 3 7 - g g c . . . . g g g g c t c a c c c c g t t a c T t c g a a a g c c a c g g c c g T g c g g tg t t g g c T a g a g a e g t . c g t c c g c g g t CA G g g c g g g g g a c a a c c c c t t T a c a a c a a e g e a t t t t t g e e g tg t t a c a 3 a a a a a e g g g % a g t c t t g t e g c g t fc F x p o * 4 5 -1 0 c a a ' - c c g 3 ' - g g c . . . -3 5 t g a a a a c g c c a c g g c c g e g t c g t c c g g g g T g c g g tg t t g g c T g c g CA G g c c c c c g t g g c g tt t tg a a g o . a g a g a g . sFa. . t c g g g g c a c c c c g T t t c t T . g c a a c c g c a a a a c g c t t t t t g c c t a t a a a a a c g g g t ^ ctt: s s il -1 0 a c g 5 7 - e g g g g g 3 ' - g c c c c c -3 5 . . . g c tc a g g g a c c a c g tc e g c a g ttg tc c g g g g g a c t g t g g t g t a g a tA T g T g c g c g a a a . a c a g g tc a g C A G g T c c c c t c t c g tc g t a c t a e g e a a a a g t t t t t g a e g g t g t t t t g a a a a a c g g c g c c t g ss±2 -1 0 a t g tc c -3 5 c a 5 ' - t c c g g g g g c g t a c c a c g t c g g g e g c a g e 3 '- a g g c c c c c g cA a c c t t g g t g t a g g c T g g tc g a a a t g tc c g g g g g e g g a e g e a a a a t tt tt C A G g T cccc t fct g c g t g tt tt a a a a a STfca g c t g g g c c g t e g g t t t a t a t a t ssi3 -1 0 a t 5 ' - e g 3 ' - g c . . c g g g g g c c c c c c g t c g t g cA t c t T -3 5 t a c c tg g . g c c t c g g a g g a e g g c g g g a g c a g t a a g g a t c tg tc c g g g g e g g e g e a a a a t tt tt gA C A G gT ccc g e e g t g tt tt a a a a a a a g g t c t V <? t a t t g g g c c g e g g t t t a c Fig. 8.3. Schematic diagram depicting the proposed secondary structure o f the Frpo locus. The figure is b a sed on the p ro p o sed stru ctu re form ed in single-stranded D N A at the F rpo locus of plasm id F (M asai an d Arai, 1997). The stru ctu re acts as a novel type of p ro m o te r capable of directing tran scrip tio n by RNA p olym erase from ssD N A tem plates. Sites of in terest include the -35, -10 and tran scrip tio n sta rt site. The stem loop stru c tu re cannot form in dsD N A as it w o u ld be an im p erfect d u p lex . The loops, d u e to m ispairing are seen to be esential for function. Figure is derived from th at in M asai and A rai (1997) an d w as d ra w n by Chris Cane. Fig. 8.3. CO m eltin g of the d uplex structure and form ation of the open complex necessary for tran scrip tio n (Bates et al., 1999). Im p o rta n tly , F rpo and the Collb ssi sequences are orientated in their resp ectiv e lea d in g regions such th at the prom oter structure will form in the D N A stra n d th a t is tran sferred to the recipient. This feature leads to the ap p ea lin g m o d el th at enhanced expression of leading region genes is due to activity of Frpo-like sequences in directing transcription of the tran sferrin g plasm id stran d . The m odel described shows a strategy for supplying proteins early in the recipient before the second strand of the plasm id is generated. Early expression allow s A rd A anti-restriction protein to accum ulate in the new ly infected cell before the D N A stra n d is converted to duplex DNA, the substrate of ty p e I re stric tio n en zy m es. The m odel allow s for rapid expression of PsiB as a fu n ctio n th a t p re v e n ts induction of the bacterial SOS response follow ing tran sfer of p la sm id D N A in single-stranded form. Furtherm ore, the m odel allow s for elev ated levels of SSB protein, which m ay prevent drainage of the cellular c o u n te rp a rt, an essential com ponent of the chrom osom e replication ap p aratu s. In a d d itio n , SSB has been show n to stim ulate transcription from Vrpo, p resu m ab ly by facilitating binding of RNA polym erase through protein- p ro tein in teractions on tem plate DNA (Masai and Arai, 1997). It is envisaged th a t th ese p ro m o te rs are silenced upon synthesis of the co m p lem en tary p lasm id stra n d in the recipient, w hich w ould explain how leading region genes are expressed in a transient burst. D esign of an experim ent to test if the Collb ssi sequences are able to act as sin g le-stran d ed p ro m o ters is possible. A Collb plasm id carrying a prom oterless la c Z o p ero n fusion w ithin the ssb gene has already been created in this lab o rato ry (Jones et al., 1992). The plasm id w ould provide a useful source of a D N A fragm ent carrying the ssbv.lacZ fusion and the upstream ssi3 sequence for cloning into a single-stranded vector based on M13. Confirm ation that ssi3 acts as a p ro m o ter w o u ld be achieved if elevated levels of p-galactosidase activity 119 w ere detected in cells carrying the single-stranded DN A construct. As a useful control, a sim ilar vector carrying the Collb fragm ent inserted in the opposite o rie n ta tio n s h o u ld re su lt in n o e le v ated levels of p - g a l a c t o s i d a s e . A ltern a tiv e ly , the Q fragm ent described in C hapter Four, could be cloned u p stre am of a r d A in a construct carrying only the leading region (see Fig. 7.5). It is p re d ic te d th a t if zygotic in d u ctio n of a rd A is governed by ssi3 found u p stre am of the gene, then the presence of D will d isru p t transcription and p re v e n t e n h a n c e d expression of a r d A . C onsequently, m obilisation of the construct w o u ld resu lt in no A rdA -m ediated alleviation of type I restriction in th e recipient cell. H ow ever, this experim ent does n o t confirm that ssi3 is the p ro m o te r. In c o n c lu sio n , d e sp ite c o m p licatio n s e n c o u n te re d , a n u m b e r of h y p o th eses for the C ollb anti-type I and anti-type II restriction m echanism s h av e been tested and rejected by this work. U nfortunately, it w as beyond the scope of th is thesis to d eterm ine the m olecular basis of these two plasm id evasion processes, although a n u m b er of new hypotheses have been generated w h ich are described in this chapter. In addition, data in this thesis w ere not o n ly u sefu l in p ro v id in g su p p o rtin g evidence th a t expression of a r d A is enhanced by m atin g (A lthorpe et al., 1999) but has played a m ajor role in the g en eratio n of a m o d el th at explains how leading region genes m ig h t be zygotically in d u ced in E. coli du rin g conjugation (Bates et al., 1999). If the Collb s s i regions are confirm ed as being prom oters active in single-stranded DNA form , then it w o u ld be interesting to see if they are also active in strains other th an E. coli. 120 Appendix I Determination of the RKI and Rri phenotypes of E. coli strains using X vir coli X vir pfu ml"1 e.o.p X vir .0 5.0 x 108 1.0 II 1.7 xlO 6 0.003 ft 1.2 x 108 0.24 II 5.0 x 107 0.10 II 3.5 x 108 0.70 GI65N(pNA12) II 8.0 x 107 GI65N(pNA6) II 5.3 x 108 0.16 1.06 GI65(pLG290) II 4.9 x 108 0.98 GI65N GI65N(NTP14) X vir. K E . host NM816N GI65N GI65N(pLG221) GI65N(pLG292) GI65N(pNAll) GI65N(pNA16) 1.0 xlO 12 1.0 if 1.0 xl O9 0.001 II 2.1 x 1012 2.1 ll 1.0 xl O9 0.001 GI65N(pNA17) 100 jliI of E. coli cells (107 m l'1) were mixed with 3 ml of SNA and overlayed onto a BLA plate. 10 pi of each diluted bacteriophage suspension was spotted onto the surface of the plate and allowed to adsorb. The plates were then incubated overnight at 37°C. The number of plaques formed were examined and the plaque forming units (pfu) per ml and efficiency of plating (e.o.p) were calculated. An e.o.p value of 1.0 is equivalent to 100% plating efficiency. X vir is a virulent strain of X, where X vir. 0 carries no methylation, X vir .K carries EcoKl methylation. Literature Cited A c h t m a n , M ., M o r e lli, G ., a n d S c h w u c h o w , S. (1 9 7 8 a ) C e ll-c e ll in t e r a c tio n s in c o n ju g a tin g Escherichia coli: role o f F p ili a n d fate o f m a tin g a g g r e g a te s. / Bacteriol 135: 1 0 5 3 -1 0 6 1 . A c h tm a n , M ., S c h w u c h o w , S., H e lm u t h , R ., M o r e lli, G ., a n d M a n n in g , P. A . (1 9 7 8 b ) C e ll-c e ll in te r a c tio n s in c o n ju g a tin g E scherichia coli: C o n - m u ta n ts a n d s ta b liz a tio n o f m a tin g a g g re g a tes. M ol Gen Genet 164: 171-183. A l f o l d i , L., Ja cob , F., a n d W o llm a n , E. L. (1957) Z y g o s e le ta le d a n s les c r o ise m e n ts en tre s o u c h e s c o lic in o g e n e s e t n o n c o lic o g e n e s. A cad Sci 244: 2974-2976. A lth o r p e , N . J., C h ille y , P. M ., T h o m a s , A . T ., B ram m ar, W . J., a n d W ilk in s , B. M . (1 9 9 9 ) T ra n sie n t tr a n sc r ip tio n a l a c tiv a tio n o f th e I n c ll p la s m id a n ti-r e str ic tio n g e n e (ardA ) an d SO S in h ib itio n g e n e (psiB) e a r ly in c o n ju g a tin g r e c ip ie n t b acteria. M ol M icrobiol 31: 133-142. A lts c h u l, S. F., G is h , W ., M ille r , W ., M y e r s, E. W ., a n d L ip m a n , D . J. (1990) B a sic lo ca l a lig n m e n t sea r ch to o l. J M ol Biol 215: 403-410. A m a b ile -C u e v a s , C. F., a n d C h icu rel, M . E. (1992) B acterial p la s m id s a n d g e n e flu x . Cell 70: 189-199. A n th o n y , K . G ., S h e r b u r n e , C., S h e r b u r n e , R ., a n d F rost, L. S. (1994) T he r o le o f th e p ilu s in r e c ip ie n t c e ll r e c o g n itio n d u r in g b a c te r ia l c o n ju g a tio n m e d ia te d b y F -lik e p la s m id s . M o l M icrobiol 13: 939-953. A r b e r, W ., a n d D u s s o ix , D . (1962) H o s t sp e c ific ity o f D N A p r o d u c e d b y Escherichia coli I. H o s t c o n tr o lle d m o d ific a tio n o f b a c te r io p h a g e X. J M ol Biol 5: 18-36. B a c k m a n n , B. J. (1972) P e d ig r e e s o f s o m e m u ta n t strain s o f Escherichia coli K-12. Bacteriol R e v 36: 5 2 5 -5 5 7 . B a c k m a n n , B. J., L o w , K . B ., a n d T a y lo r , A . L. (1976) R e c a lib r a te d lin k a g e m a p o f Escherichia coli K -12. Bacteriol R ev 40: 116-167. B a g d a s a r ia n , M ., B a ilo n e , M . M ., M a n n in g , P. A ., L u r z, R ., T im m is , K. M ., a n d D e v o r e t , R . (1986) A n in h ib ito r o f S O S in d u c tio n , s p e c ifie d b y a p la s m id lo c u s in Escherichia coli. Proc N a tl A ca d Sci U S A 83: 5723-5726. B a g d a s a r ia n , M ., B a ilo n e , A ., A n g u lo , J. F., S c h o lz , P., B a g d a s a r ia n , M ., a n d D e v o r e t, R. (1992) P siB , an a n ti-S O S p ro te in , is tr a n sien tly e x p r e s s e d b y th e F se x factor d u r in g its tr a n sm issio n to a n Escherichia coli K -12 recip ien t. M ol M icrobiol 6: 885-893. B a ilo n e , A ., B a c k m a n , A ., S o m m e r , S ., C e le r ie r , J., B a g d a sa r ia n , M . M ., B a g d a sa r ia n , M ., a n d D e v o r e t, R. (1 9 8 8 ) P siB p o ly p e p t id e p r e v e n ts a c tiv a tio n o f R ecA p r o te in in Escherichia coli. M ol Gen G enet 214: 389-395. B a lz e r , D ., P a n s e g r a u , W ., a n d L a n k a , E. (1994) E ssen tia l m o tifs o f rela x a se (T ral) a n d TraG p r o te in s in v o lv e d in c o n ju g a tiv e transfer o f p la s m id RP4. / Bacteriol 176: 4285-4295. B a n c ro ft, I., a n d W o lk , C . P. (1989) C h a ra cterisa tio n o f an in se r tio n se q u e n c e (IS892) o f n o v e l stru ctu re fr o m th e C y a n o b a c te r iu m Anabaena sp. S train M -131. / Bacteriol 171: 59495954. B a r c u s , V . A ., a n d M u rr a y , N . E. (1 9 9 5 ) B arriers to r e c o m b in a tio n : r e str ic tio n . In P o p u la tio n G en etics o f Bacteria. B a u m b e rg , S., Y o u n g , J. P. W ., W e llin g to n , E. M . H . a n d S a u n d e r s, J. R. (e d s). C am b rid ge: C a m b rid g e U n iv e r sity P ress, p p . 31-58. B a te s , S ., C a s h m o r e , A . M ., a n d W ilk in s , B. M . (1998) In cP p la s m id s are u s u a lly e f f e c t iv e in m e d ia t in g c o n ju g a tio n o f Escherichia coli a n d S acch arom yces c e r e v is ia e : I n v o lv e m e n t o f th e T ra2 m a tin g s y s te m . J Bacteriol 180: 6538-6543. B a te s , S ., R o s c o e , R. R ., A lth o r p e , N . J., B ram m ar, W . J., a n d W ilk in s , B. M . (1 9 9 9 ) E x p r e s s io n o f le a d in g r e g io n g e n e s o n I n c ll p la sm id ColIb-P9: G en etic e v id e n c e for s in g le ­ s tr a n d e d D N A tr a n sc rip tio n . M icrobiology 145: 2655-2662. B e lo g u r o v , A . A ., D e lv e r , E. P ., a n d R o d z e v ic h , O . V . (1992) In c N p la s m id p K M lO l a n d I n c ll p la s m id C olIb -P 9 e n c o d e h o m o lo g o u s an tirestriction p r o te in s in th eir le a d in g r e g io n s . / B acteriol 174: 5079-5085. B e lo g u r o v , A . A ., D e lv e r , E. P., a n d R o d z e v ic h , O . V . (1993) P la sm id p K M lO l e n c o d e s t w o n o n - h o m o lo g o u s a n tir e str ic tio n p r o te in s (A rd A a n d A rd B ) w h o s e e x p r e s s io n is c o n tr o lle d b y h o m o lo g o u s reg u la to ry seq u en ces. / Bacteriol 175: 4843-4850. B e l o g u r o v , A . A ., a n d D e lv e r , E. P. (1995) A m o tif c o n s e r v e d a m o n g th e ty p e I r e s t r ic t io n - m o d if ic a t io n e n z y m e s a n d a n tir e str ic tio n p r o te in s: a p o s s ib le b a s is fo r m e c h a n is m o f a c tio n o f p la s m id -e n c o d e d an tirestriction fu n c tio n s. N ucleic A cid s Res 23: 7 8 5 -7 8 7 . B ic k le , T. A ., B rack , C ., a n d Y u an , R . (1978) A T P -in d u ce d c o n fo r m a tio n a l c h a n g e s in th e r e str ic tio n e n d o n u c le a s e fro m Escherichia coli K-12. Proc N atl A cad Sci U S A 75: 3 0 9 9 3103. B ic k le , T . A ., a n d K r u g e r , D . H . (1993) B io lo g y o f D N A restrictio n . M icrobiol R ev 57: 4 3 4 -4 5 0 . B i ll i n g t o n , S. J., S in is t a j , M ., C h e e th a m , B. F., A y r e s, A ., M o s e s , E. K ., K a tz , M . E ., a n d R o o d , J. I. (1 9 9 6 ) I d e n tific a tio n o f a n a tiv e D ichelobacter nodosus p la s m id a n d im p lic a tio n s for th e e v o lu tio n o f th e vap reg io n . Gene 172: 111-116. B ir m b io m , H . C ., a n d D o ly , J. (1979) A rap id alkalin e extraction p ro ce d u re for s c r e e n in g r e c o m b in a n t p la s m id D N A . N u cl A cid s Res 7: 1513-1523. B is h o f b e r g e r , N ., N g , P. G ., W e b b , T . R ., an d M a tte u c c i, M . D . (1987) C le a v a g e o f sin g le -s tr a n d e d o lig o n u c le o tid e s b y EcoRI restriction e n d o n u c le a se . N ucl A cids Res 15: 709716. B la ttn e r , F. R ., P lu n k e t t , G ., B lo c h , C. A ., F ern a, N . T ., B u r la n d , V ., R ile y , M ., C o lla d o v id e s , J., G la s n e r , J. D ., R o d e , C. K ., M a y h e w , G . F., G reg o r , J., D a v is , N . W ., K ir k p a tr ic k , H . A ., G o e d e n , M . A ., R o s e , D . J., M a u , B ., a n d S h a o , Y. (1 9 9 7 ) T h e c o m p le te g e n o m e s e q u e n c e o f Escherichia coli K-12. Science 277: 1453-1470. B o u ln o is , G . J., a n d W ilk in s , B. M . (1978) A C o ll-s p e c ifie d p r o d u c t, s y n th e s is e d in n e w l y in fe c te d r e c ip ie n ts , lim its th e a m o u n t o f D N A tra n sferred d u r in g c o n ju g a tio n o f Escherichia coli K -12. / Bacteriol 133: 1-9. B o y d , C. A ., A r c h e r , J. A . K ., a n d S h erratt, D . J. (1989) C h a ra c terisa tio n o f th e C o lE l m o b iliz a tio n r e g io n a n d its p r o te in p ro d u c ts. M ol Gen Genet 217: 488-498. B r a d le y , D . E. (1980) M o r p h o lo g ic a l a n d s e r o lo g ic a l r e la tio n s h ip s o f c o n ju g a tiv e p ili. P la s m id 4: 155- 176. B r a d le y , D . E. (1983) D e r e p r e s s e d p la s m id s o f in c o m p a tib ility g ro u p Ix d e te r m in e t w o d iffe r e n t m o r p h o lo g ic a l form s o f p ilu s . Plasm id 9: 331-334. B r a d le y , D . E. (1 9 8 4 ) C h a ra c teristic s a n d fu n c tio n o f th ic k a n d th in c o n ju g a tiv e p ili d e te r m in e d b y tran sfer d e r e p r e sse d p la s m id o f in c o m p a tib ility g r o u p s l lf I2,1 5/ B, K a n d Z. / Gen Microbiol 130: 1489-1502. B u k h a r i, A . I., S h a p ir o , J. A ., a n d A d h y a , S. L. (1977) B a cte ria l p la s m id s . In D N A insertion elements, and episomes. N e w York: C o ld S prin ger H arb or, p p .621-624. B u r c k h a r d t, J., W e is e m a n n , J., a n d Y u a n , R . (1981a) C h a r a c te r is a tio n o f th e D N A m e t h y la s e a c t iv ity o f th e re strictio n e n z y m e fro m Escherichia coli K*. / Biol Chem 256: 4 0 2 4 -4 0 3 2 . B u r c k h a r d t, J., W e is e m a n n , J., H a m ilt o n , D . L., a n d Y u a n , R. (1 9 8 1 b ) C o m p le x e s fo r m e d b e t w e e n th e restriction e n d o n u c le a se EcoK an d h e te r o d u p le x D N A . } Mol Biol 153: 4 2 5 -4 4 0 . C e n s in i, S ., L a n g e , C ., X ia n g , Z., C rab tree, J. E., G h ia ra , P ., B o r o d o v s k y , M ., R a p u o li, R ., a n d C o v a c c i, A . (1996) Cag, a p a th o g e n ic ity isla n d o f Helicobacter pylori, e n c o d e s ty p e I -s p e c ific a n d d is e a s e -a s s o c ia t e d v ir u le n c e factors. Proc N atl Acad Sci USA 93: 1464814653. C h a b b e r t, Y. A ., R o u s s e l, A ., W itc h itz , J. L., S a n -S o n -L e P o r s, M . J., a n d C o u r v a lin , P. (1 9 7 9 ) R e s t r ic t io n e n d o n u c le a s e g e n e r a te d p a tte r n s o f p la s m id s b e l o n g i n g to i n c o m p a t ib ilit y g r o u p s I I , C, M a n d N : A p p lic a t io n to p la s m id t a x o n o m y a n d e p id e m io lo g y . In Plasmids of Medical, Environmental and Commercial importance. T im m is, K. N ., a n d P iih le r , A . (e d s). N o r th H o lla n d / A m ste r d a m / N e w York: E lsev ier, p p . 183193. C h a n d r a se g a r a n , S ., a n d S m ith , H . O . (1988) A m in o acid s e q u e n c e h o m o lo g ie s a m o n g t w e n t y - f iv e re str ic tio n e n d o n u c le a s e s a n d m e th y la se s. In Structure and Expression. From proteins to ribosomes. Sarm a, R. H ., a n d Sarm a, M . H . (eds). N e w York: A d e n in e p r e ss Inc, p p . 1 4 9 -1 5 6 . C h a s e , J. W ., M e r r ill, B. M ., a n d W illia m s , K. R. (1983) F -se x factor e n c o d e s a s in g le str a n d e d D N A b in d in g -p r o te in (SSB) w ith e x te n siv e seq u en ce h o m o lo g y to Escherichia coli SSB. Proc Natl Acad Sci USA 80: 5480-5484. C h a t f ie ld , L. K ., O rr, E., B o u ln o is , G . J., a n d W ilk in s , B. M . (1982) D N A p r im a se o f p la s m id C o llb is in v o lv e d in co n ju g a l D N A sy n th e sis in d o n o r a n d re c ip ie n t b acteria. / Bacteriol 152: 118 8 -1 1 9 5 . C h i l l e y , P. M ., a n d W i l k in s , B . M . (1995) D is tr ib u tio n o f th e a rd A a n tirestrictio n g e n e s o n c o n ju g a tiv e p la sm id s. Microbiology 141: 2157-2164. fa m ily o f C l e w e l l, D . B ., F la n n a g a n , S. E., a n d J a w o r sk i, D . D . (1995) U n c o n s tr a in e d b a c te r ia l p r o m is c u ity - th e T n 926 - T n 1545 fa m ily o f co n ju g a tiv e tr a n sp o s o n s. Trends Microbiol 3: 2 2 9 -2 3 6 . C o h e n , A ., F is h e r , W . D ., C u r tiss III, R ., a n d A d le r , H . (1 9 6 8 ) D N A is o la te d fr o m Escherichia coli m in ic e lls m a te d w ith F+ cells. Proc Natl Acad Sci USA 61: 61-68. C o h e n , S . N ., C h a n g , A . C. Y., a n d H s u , L. (1 9 7 2 ) N o n - c h r o m o s o m a l a n t ib io t ic r e s is ta n c e in b a cte ria l g e n e tic tr a n sfo r m a tio n o f Escherichia coli R -factor D N A . Proc Natl Acad Sci USA 69: 2 1 1 0 -2 1 1 4 . C o u tu r ie r , M ., B e x , F., B e r g q u is t, P. L., a n d M a a s, W . K. (1 9 8 8 ) I d e n tific a tio n a n d c la s s ific a tio n o f b a cteria l p la sm id s. Microbiol Rev 52: 375-395. C o w a n , G . M ., G a n n , A . A . F., a n d M u rra y , N . E. (1989) C o n s e r v a tio n o f c o m p le x D N A r e c o g n itio n d o m a in s b e tw e e n fa m ilies o f restriction e n z y m e s . Cell 56:103-109. C r a m , D ., R a y , A ., O 'G o r a m , L., a n d S k u rr a y , R. (1984) T ra n scrip tio n a l a n a ly s is o f le a d in g r e g io n in F p la s m id transfer. Plasmid 11: 221-233. C u r tis s , R ., C a r o , L. G ., A l li s o n , D . P ., a n d S ta llio n s , D . R. (1969) E arly s ta g e s o f c o n ju g a tio n in Escherichia coli. J Bacteriol 100: 1091-1104. D a n i e l , A . S ., F u lle r -P a c e , F. V ., L e g g e , D . M ., an d M u rra y , N . E. (1988) D istr ib u tio n a n d d iv e r s it y o f hsd g e n e s in Escherichia coli a n d oth er e n te ric b acteria. J Bacteriol 170: 1 7 7 5 -1 7 8 2 . D a tta , N ., a n d H u g h e s , V . (1983) P la s m id s o f th e sa m e Inc g r o u p s in E n terob acteria b e fo r e a n d a fter th e m e d ic a l u se o f a n tib io tics. Nature 306: 616-617. D a v i s , E. J., a n d H e n r y , J. (1982) C o n ju g a l tran sfer r e p lic a tio n o f R 6 M r d ll p la s m id D N A in th e d o n o r c e lls o f Escherichia coli K -12. Mol Gen Genet 187: 305-309. D e lv e r , E. P ., K o to v a , V . U ., Z a v ilg e ls k y , G . B ., a n d B e lo g u r o v , A . A . (1991) N u c le o t id e s e q u e n c e o f th e g e n e (ard) e n c o d in g th e a n tirestrictio n p r o te in o f p la sm id C olIb -P 9. / Bacteriol 173: 5 8 8 7 -5 8 9 2 . D i L a u r e n z io , L ., F ro st, L. S., F in la y , B. B., a n d P a ra n ch y ch , W . (1991) C h a ra c terisa tio n o f th e oriT r e g io n o f th e IncFV p la s m id p E D 208. Mol Microbiol 5: 780-785. D ix o n , D . A ., a n d K o w a lc z y k o w s k i, S. C. (1993) T h e r e c o m b in a tio n h o ts p o t % is a r e g u la to r y s e q u e n c e th a t acts b y a tte n u a tin g th e n u c le a se a ctiv ity o f th e E. coli R e cB C D e n z y m e . Cell 73: 87-96. D r e is e ik e lm a n n , B. (1994) T ra n slo ca tio n o f D N A across b acterial m em b ran es. Microbiol Rev 58: 2 9 3 -3 1 6 . D r y d e n , D . T . F., C o o p e r , L. P ., T h o r p e , P. H ., a n d B y r o n , O . (1997) T h e in vitro a s s e m b ly o f th e E coK I ty p e I D N A r e s tr ic t io n /m o d ific a tio n e n z y m e a n d its in vivo im p lic a tio n s . Biochemistry 5: 1065-1076. D iir r e n b e r g e r , M . B ., V illig e r , W ., a n d B a e c h i, T. (1 9 9 1 ) C o n ju g a tio n a l ju n c tio n s: m o r p h o lo g y o f s p e c ific c o n ta c ts in c o n ju g a tin g Escherichia coli b acteria. J Struct Biol 107: 1 4 6 -1 5 6 . D u tr e ix , M ., B a c k m a n , A ., C e le r ie r , J., B a g d a sa ria n , M . M ., S o m m e r , S ., B a ilo n e , A ., D e v o r e t, R ., a n d B a g d a s a r ia n , M . (1988) Id en tifica tio n o f psiB g e n e s o f p la s m id s F a n d R 6-5. M o le c u la r b a s is fo r psiB e n h a n c e d ex p r e ssio n in p la s m id R6-5. Nucl Acids Res 16: 1 0 6 6 9 -1 0 6 7 9 . D y b v i g , K ., a n d Y u , H . (1994) R e g u la tio n o f a restriction a n d m o d ific a tio n s y s te m v ia D N A in v e r s io n in Mycoplasma pulmonis. Mol Microbiol 12: 547-560. E n d lic h , B ., a n d L in n , S . (1985a) T h e D N A restriction e n d o n u c le a s e o f Escherichia coli B. I. S tu d ie s o f th e D N A tr a n slo c a tio n a n d th e A T P ase a c tiv itie s. / Biol Chem 260: 5 7 2 0 5728. E n d lic h , B ., a n d L in n , S . (1985b) T h e D N A restriction e n d o n u c le a s e o f Escherichia coli B. II. F u rth er s tu d ie s o f th e stru ctu re o f D N A in term ed ia tes a n d p ro d u cts. / Biol Chem 260: 5 7 2 9 -5 7 3 8 . F a lk o w , S ., T o m p k in s , L. S., S ilv e r , R. P., G u erry, P., a n d Le B la n c , D . J. (1971) T h e r e p lic a tio n o f R -factor D N A in Escherichia coli K-12. Ann N Y Acad Sci 182: 153-171. F a lk o w , S ., H e d g e s , R . W ., G u e rry , R ., a n d D a tta , N . (1974) P o ly n u c le o tid e s e q u e n c e r e la tio n s h ip s a m o n g p la s m id s o f th e I co m p a tib ility c o m p le x . J Gen Microbiol 85: 65-76. F in la y , B . B ., a n d P a ra n ch y ch , W . (1986) N u c le o tid e s e q u e n c e o f th e su rfa ce e x c lu s io n g e n e s traS a n d traT from the IriFJac p la s m id pE D 208. J Bacteriol 166: 713-721. F in n e g a n , J., a n d S h erratt, D . (1982) P la s m id C o lE l c o n ju g a l m o b ility : T h e n a tu r e o f bom, a r e g io n re q u ir e d in cis for transfer. Mol Gen Genet 185: 344-351. F irth , N ., I p p e n - I h le r , K ., a n d S k u r a y , R . A . (1996) S tru ctu r e a n d fu n c tio n o f th e F fa cto r a n d m e c h a n is m o f co n ju g a tio n . In E sch erich ia c o li and S a lm o n e lla ty p h im u r iu m . Cellular and Molecular Biology. 2nd e d n . N e id h a r d t, F.C. (e d .-in -c h ie f). W a s h in g to n , DC: A m e r ic a n S o c ie ty for M icr o b io lo g y P ress, p p . 2377-2401. F le is c h m a n n , R . D ., A d a m s , M . D ., W h it e , O ., C la y t o n , R . A ., K ir k n e s s , E. F., K e r la v a g e , A . R ., B u lt, C. J., T o m b , J-F., D o u g h e r ty , B. A ., M e r r ic k , J. M ., M c K e n n y , K ., S u tto n , G ., F itz h u g h , W ., F ie ld s , C ., G o c a y n e , J. O ., S c o tt, J., S h ir le y , R ., L iu , L. I., G lo d e k , A ., K e lle y , J. M ., W e id m a n , J. F., P h illip s , C. A ., S p r ig g s , T., H e d b lo m , T ., C o tt o n , M . D ., U tte r b a c k , T. R ., H a n n a , M . C ., N g u y e n , D . T ., S a n d e k , D . M ., B r a n d o n , R. C ., F in e , L. D ., F ritc h m a n , J. L., F u h r m a n n , J. L ., G e o g h a g e n , N . S. M ., G r e h m , C. L., M c D o n a ld , L. A ., S m a ll, K . V ., Fraser, C. M ., S m ith , H . O ., a n d V e n te r , J. C. (1995) W h o le -g e n o m e r a n d o m s e q u e n c in g an d a s s e m b ly o f Haemophilus influenzae R d. Science 269: 496-512. F r e d e r ic q , P. (1965) G en etic s o f c o lic in o g e n ic factors. Zentbl Bakt Parasit Kde 196: 142151. F ro st, L. S ., I p p e n - I h le r , K ., a n d S k u rr a y , R. A . (1994) A n a ly s is o f th e s e q u e n c e a n d g e n e p r o d u c ts o f th e transfer r e g io n o f th e F se x factor. Microbiol Rev 58 :1 6 2 -2 1 0 . F iir s te , J. P ., P a n s e g r a u , W ., Z i e g e l i n , G ., K r o e g e r , M ., a n d L a n k a , E. ( 1 9 8 9 ) C o n ju g a tiv e tr a n sfe r o f p r o m is c u o u s In cP p la sm id s: in te r a c tio n s o f p la s m id -e n c o d e d p r o d u c ts w ith th e transfer origin . Proc Natl Acad Sci USA 86: 1771-1775. F u r u y a , N ., a n d K o m a n o , T. (1991) D e te r m in a tio n o f th e n ic k s ite at oriT o f I n c l l p la s m id R64: g lo b a l sim ila rity o f oriT str u c tu r es o f I n c ll a n d In cP p la sm id s. / Bacteriol 173: 6 6 1 2 -6 6 1 7 . F u ru y a , N ., N i s i o k a , T ., a n d K o m a n o , T. (1991) N u c le o tid e se q u e n c e a n d fu n c tio n s o f th e oriT o p e r o n in I n c ll p la sm id R64. / Bacteriol 173: 2231-2237. F u r u y a , N ., a n d K o m a n o , T. (1994) S u r fa c e e x c lu s io n g e n e o f I n c ll p la s m id R64: N u c le o t id e s e q u e n c e an d a n a ly sis o f d e le tio n m u tan ts. Plasmid 32: 80-84. F u ru y a , N ., a n d K o m a n o , T. (1995) S p e cific b in d in g o f th e N ik A p ro te in to o n e a rm o f 1 7 -b a se-p a ir in v e r te d rep ea t s e q u e n c e s w ith in th e oriT r e g io n o f p la sm id R64. / Bacteriol 177: 4 6 -5 1 . F u r u y a , N ., a n d K o m a n o , T . (1997) M u ta tio n a l a n a ly s is o f th e R 64 oriT r e g io n : R e q u ir e m e n t for p r e c ise lo ca tio n o f th e N ik A -b in d in g se q u e n c e . / Bacteriol 179: 7291-7297. G lo v e r , S. W ., a n d C o ls o n , C. (1965) T h e b r e a k d o w n o f th e re strictio n m e c h a n is m in z y g o t e s o f Escherichia coli. Genet Res 6: 153-155. G o lu b , E. I., a n d L o w , B. K. (1985) C o n ju g a tiv e p la sm id s o f e n te ric b acteria fr o m m a n y d iffe r e n t in c o m p a tib ility g r o u p s h a v e sim ila r g e n e s for s in g le -s tr a n d e d D N A -b in d in g p r o te in s. J Bacteriol 162: 235-241. G o lu b , E. I., a n d L o w , B. K. (1 9 8 6 ) D e r e p r e s s io n o f s in g le -s tr a n d e d D N A - b in d in g p r o te in g e n e s o n p la s m id s d e r e p r e sse d for con ju gation , an d c o m p le m e n ta tio n o f a n E. coli ssb' m u ta tio n b y th e s e g en es. Mol Gen Genet 204: 410-416. G o lu b , E. I., B a ilo n e , A ., a n d D e v o r e t, R . (1988) A g e n e e n c o d in g an SO S in h ib ito r is p r e s e n t in d iffe r e n t co n ju g a tiv e p la s m id s . / Bacteriol 170: 4392-4394. G o tte s m a n , S. (1996) P ro tea ses a n d th eir ta r g ets in Escherichia coli. Ann Rev Genet 30: 4 6 5 -5 0 6 . G o tte s m a n , S ., a n d M a u r iz i, M . R. (1992) R e g u la tio n b y p r o te o ly sis: E n e r g y -d e p e n d e n t p r o te a s e s a n d th eir targets. Microbiol Rev 56: 592-621. G r e e n e , P. J., G u p ta , M ., B o y e r , H . W ., B r o w n , W . E., a n d R o s e n b e r g , J. M . (1 9 8 1 ) S e q u e n c e a n a ly s is o f th e D N A e n c o d in g th e EcoRI e n d o n u c le a s e m e th y la se . J Biol Chem 2 5 6 : 2 1 4 3 -2 1 5 3 . G r ib s k o v , M ., D e v e r e u x , J., a n d B u r g e s s , R . R. (1984) T h e c o d o n p r e fe r e n c e p lot: g r a p h ic a n a ly s is o f p r o te in c o d in g s e q u e n c e s a n d p r e d ic tio n o f g e n e e x p r e ss io n . Nucl Acids Res 12: 539-5 4 9 . G u lig , P. A ., C a ld w e ll, A . L., a n d C h io d o , V . A . (1992) Id e n tific a tio n , g e n e tic a n a ly s is a n d D N A s e q u e n c e o f a 7.8 kb v ir u le n c e r e g io n o f th e Salmonella typhimurium v ir u le n c e p la s m id . Mol Microbiol 6: 1395-1411. H a a s e , J., L u r z , R ., G r a h n , A . M ., B a n fo r d , D . H ., a n d L a n k a , E. (1 9 9 5 ) B a c te r ia l c o n ju g a tio n m e d ia t e d b y p la s m id R P 4 - R SF1010 m o b ilz a tio n , d o n o r s p e c ific p h a g e p r o p a g a tio n , a n d p ilu s p r o d u c tio n req u ire th e sa m e Tra2 co r e c o m p o n e n ts o f a p r o p o s e d D N A tr a n sp o rt c o m p le x . / Bacteriol 177: 4779-4791. H a d i, S. M ., B ic k le , T. A ., an d Y u a n , R . (1975) T he role o f S -a d e n o s y lm e th io n in e in th e c le a v a g e o f d e o x y r ib o n u c le ic acid b y th e restriction e n d o n u c le a se fro m Escherichia coli K. J Biol Chem 2 50: 4 1 5 9 -4 1 6 4 . H a m , L. M ., C ram , D ., a n d S k u rra y , R . (1989) T ran scrip tion al a n a ly sis o f th e F p la s m id su rfa c e e x c lu s io n region : m a p p in g o f traS, traT a n d traD tran scrip ts. Plasmid 2 1 :1 -8 . H a m a , C ., T a k iz a w a , T ., M o r iw a k i, H ., U r a s a k i, Y., a n d M iz o b u c h i, K . ( 1 9 9 0 ) O r g a n isa tio n o f th e re p lic a tio n co n tr o l r e g io n o f p la sm id C olIb -P 9. / Bacteriol 172: 19831991. H a n a h a n , D . (1983) S tu d ie s o f tr a n sfo r m a tio n o f Escherichia coli w ith p la sm id s. / M ol Biol 166: 5 5 7 -5 8 0 . H a r r in g to n , L. C ., a n d R o g e r s o n , A . C. (1990) T he F p ilu s o f E. coli a p p e a rs to s u p p o r t sta b le D N A tr a n sfe r in th e a b s e n c e o f w a ll-w a ll con tact b e tw e e n c e lls. / Bacteriol 172: 7 2 6 3 -7 2 6 4 . H a r ts k e e r l, R . A ., B e r g m a n s , J. E. N ., K a m p , M . C., a n d H o e k s tr a , W . P. M . (1 9 8 3 ) C lo n in g o f a n e x c lu s io n -d e te r m in in g fr a g m e n t o f th e In cl p la s m id , R144. Plasmid 10: 11- 20. H a r ts k e e r l, R . A ., a n d H o e k str a , W . P. M . (1984) E x clu sio n in I n c ll-ty p e Escherichia coli c o n ju g a tio n s : th e s ta g e o f c o n ju g a tio n a t w h ic h e x c lu s io n o p e r a te s . A ntonie van leeuwenhoek, J Microbiol 50: 113-124. H a r ts k e e r l, R ., T o m m a s s e n , J., a n d H o e k s tr a , W . P. M . (1985a) R e la tio n sh ip b e t w e e n th e p r o te in s e n c o d e d b y th e e x c lu s io n d e te r m in in g lo cu s o f th e In cl p la sm id R144 a n d th e c e llu la r lo c a liz a tio n o f th e se p r o te in s in Escherichia coli K -12. Mol Gen Genet 2 0 0 :1 3 8 -1 4 4 . H a r ts k e e r l, R ., Z u id w e g , E., G e f f e n , M ., a n d H o e k str a , W . P. M . (1985b) T h e I n c ll p la s m id s R 144, R 64 a n d C o llb b e lo n g to o n e e x c lu s io n g r o u p . / Gen Microbiol 131: 13051311. H a r ts k e e r l, R . A ., a n d H o e k str a , W . P. M . (1985) E x c lu sio n b y th e In cl p la s m id R 144 d e te r m in e d b y m e a s u r in g D N A tran sfer in Escherichia coli c o n ju g a tio n s. J Gen Microbiol 131: 2 2 6 3 -2 2 6 7 . H a r ts k e e r l, R ., O v e r d u in , P., H o e k s tr a , W ., a n d T o m m a s s e n , J. (1 9 8 6 ) N u c l e o t id e se q u e n c e o f th e e x c lu sio n -d e te r m in in g lo c u s o f In cl p la sm id R144. Gene 42 :1 0 7 -1 1 1 . H a v e k e s , L., T o m m a s s e n , J., H o e k s tr a , W . P. M ., an d L u g te n b e r g , B. (1977) I s o la tio n a n d ch a ra c ter isa tio n o f Escherichia coli K -12 F" m u tan ts d e fe c tiv e in co n ju g a tio n w ith an Ity p e d o n o r . / Bacteriol 129: 1-8. H a y e s , W . (1968) T em p er a te b a c te r io p h a g e s a n d ly so g e n y . In The Genetics of Bacteria and their Viruses: Studies in Basic Genetics and Molecular Biology. (2nd e d ). O xford : B la c k w e ll, p p . 4 4 7 -4 7 9 . H e itm a n , J., T r e is m a n , J., D a v is , N . G ., a n d R u s s e l, M . (1 9 8 9 ) C a s s e tte s o f th e f l in terg en ic reg io n . Nucl Acids Res 17: 4413. H e n a u t, A ., a n d D a n c h in , A . (1996) A n a ly s is an d p r e d ic tio n s fr o m Escherichia coli s e q u e n c e s, or E. coli In silico. In E sch erich ia co li and S a lm o n ella ty p h im u r iu m . Cellular and Molecular Biology. 2nd ed n . N e id h a r d t, F.C . (e d .-in -ch ief). W a s h in g to n , DC: A m e r ic a n S o c ie ty for M ic r o b io lo g y P ress, p p . 2047-2066 H o r iu c h i, T ., a n d K o m a n o , T . (1998) M u ta tio n a l a n a ly sis o f p la s m id R 64 th in p ilu s p r e p ilin : th e e n tir e p r e p ilin se q u e n c e is r e q u ired for p r o c e s s in g b y ty p e IV p r e p ilin p e p tid a s e . J Bacteriol 180: 4613-4620. H o w la n d , C. J., a n d W ilk in s , B. M . (1988) D irectio n o f c o n ju g a tiv e tran sfer o f I n c ll p la s m id C olIb -P 9. / Bacteriol 170: 4958-4959. H o w la n d , C. J, R e e s , E. E. D ., B arth , P. T ., a n d W ilk in s , B. M . (1989) T he ssb g e n e o f p la s m id C olIb -P 9. J Bacteriol 171: 2466-2473. H u , P ., E llio tt, J., M c C r e a d y , P ., S k o w r o n s k i, E., G a m e s , J., K o b a y a s h i, A ., B r u b a k e r , R . R ., a n d G a rcia , E. (1998) S tru ctu ral o r g a n isa tio n of v ir u le n c e -a s s o c ia te d p la s m id s o f Yersinia pestis. J Bacteriol 180: 5192-5202. J o n e s , A . L. (1991) T h e le a d in g r e g io n o f I n c ll p la sm id C olIb -P 9. P h .D . th e sis, U n iv e r s ity o f L eicester, L eice ster U K . J o n e s , A . L., B a rth , P. T ., a n d W ilk in s , B. M . (1992) Z y g o tic in d u c tio n o f p la s m id ssb a n d psiB g e n e s f o llo w in g c o n ju g a tiv e transfer of In c ll p la s m id C olIb -P 9. Mol Microbiol 6: 6 0 5 -6 1 3 . K e lle h e r , J. E., D a n ie l, A . S ., a n d M u rra y , N . E. (1991) M u ta tio n s th a t co n fe r de novo a c tiv ity u p o n a m a in te n a n c e m e th y ltra n sfera se . / Mol Biol 221: 431-440. K o h r in g , G -W ., a n d M a y e r , F. (1987) In situ d istr ib u tio n o f E coR I m e t h y la s e a n d restrictio n e n d o n u c le a s e in ce lls o f Escherichia coli Bs 5. FEBS Letters 216: 207-210. K im , S -R ., a n d K o m a n o , T. (1997) T he p la s m id R64 th in p ilu s id e n tifie d as a ty p e IV p ilu s . J Bacteriol 179: 3594-3603. K o b a y a s h i, I., N a it o , T ., H a n d a , N ., a n d K u sa n o , K. (1995) S e lfish g e n e s - r e str ic tio n m o d ific a tio n s y s te m s force ce lls to m a in ta in th em . J Cell Biochem S 19A : 110. K o m a n o , T ., T o y o s h im a , A ., M o r ita , K , a n d N is io k a , T. (1988) C lo n in g a n d n u c le o tid e se q u e n c e o f th e oriT r e g io n o f th e I n c ll p la s m id R64. / Bacteriol 170: 4385-4387. K o m a n o , T ., F u n a y a m a , N ., K im , S . R ., a n d N is io k a , T . (1990) T ran sfer r e g io n o f I n c ll p la s m id R64 a n d r o le o f sh u fflo n in R 64 transfer. J Bacteriol 172: 2230-2235. K o m a n o , T., K im , S. R., Y o sh id , T ., a n d N is io k a , T. (1994) D N A rea rra n g em en ts o f th e sh u fflo n d e te r m in e s recip ien t sp e c ific ity in liq u id m a tin g s o f I n c ll p la s m id R64. / Mol Biol 243: 6-9. K o r o n a , R ., K o r o n a , B ., a n d L e v in , B . R . (1993) S e n s it iv it y o f n a tu r a lly o c c u r r in g c o lip h a g e s to ty p e I a n d ty p e II r e str ic tio n a n d m o d ific a tio n . J Gen Microbiol 139: 12831290. K o r o n a , R ., a n d L e v in , B. R. (1993) P h a g e -m e d ia t e d s e le c tio n a n d th e e v o lu tio n a n d m a in ten a n ce o f r e strictio n -m o d ifica tio n . Evolution 47: 556-575. K o to v a , V . Y., Z a v ilg e ls k il, G . B ., a n d B e lo g u r o v , A . A . (1988) W e a k e n in g o f ty p e I r e strictio n in th e p r e se n c e o f p la s m id s o f th e In cl g r o u p - G en era l ch a ra c ter istic s a n d c lo n in g o f ard g en e. / Mol Biol 22: 227-232. K r a u se , M ., R o u d ie r , C., F ierer, J., H a r w o o d , J., a n d G u in e y , D . (1 9 9 1 ) M o le c u la r a n a ly sis o f th e v ir u le n c e lo c u s o f th e Salmonella dublin p la s m id p S D L 2. M ol Microbiol 5: 3 0 7 -3 1 6 . K ru ger, D . H ., a n d B ic k le , T. A . (1983) B a cte rio p h a g e su rvival: M u ltip le m e c h a n is m s for a v o id in g the d e o x y r ib o n u c le ic acid re strictio n sy s te m s o f th eir h o st. Microbiol Rev 47: 345360. K u la k a u s k a s , S ., L u b y s , A ., a n d E h r lic h , S. D . (1995) D N A r e s tr ic t io n -m o d ific a t io n s y s te m s m e d ia te p la sm id m a in te n a n c e . J Bacteriol 177: 3451-3454. K u s a n o , K., N a ito , T ., H a n d a , N ., a n d K o b a y a s h i, I. (1 9 9 5 ) R e s tr ic tio n -m o d ific a tio n s y s te m s as g e n o m ic p a r a site s in c o m p e titio n for sp e c ific se q u e n c e s. Proc Natl Acad Sci USA 92: 1 1 0 9 5-11099. L a n k a , E., a n d B a rth , P. T. (1981) P la s m id R P4 s p e c ifie s a d e o x y r ib o n u c le ic a c id p rim a se in v o lv e d in its co n ju g a l tran sfer a n d m a in ten a n ce. / Bacteriol 148: 769-781. L a n k a, E., a n d W ilk in s , B. M . (1995) D N A p r o c e ss in g re a ctio n s in b a cteria l co n ju g a tio n . Ann Rev Biochem 64: 141-169. L ed erb erg , J., a n d T a tu m , E. L. (1946) G e n e re co m b in a tio n in E. coli. Nature 158: 558. L e s s l, M ., B a lzer , D ., W e y r a u c h , K ., a n d L an k a, E. (1993) T h e m a tin g -p a ir fo r m a tio n s y s te m o f p la s m id R P4 d e fin e d b y R S F 1010 m o b iliz a tio n a n d d o n o r -s p e c ific p h a g e p r o p a g a tio n . J Bacteriol 175: 6415-6425. L o h , S ., C ram , D ., a n d S k u rr a y , R . (1989) N u c le o t id e s e q u e n c e o f th e le a d in g r e g io n a d ja cen t to th e o r ig in o f tran sfer o n p la s m id F a n d its c o n se r v a tio n a m o n g c o n ju g a tiv e p la s m id s . Mol Gen Genet 219: 177-186. L u n n e n , K. D ., B a r s o m ia n , J. M ., C a m p , R. R ., C ard , C. O ., C h e n , S. Z ., C r o ft, R ., L o o n e y , M . C., M e d a , M . M ., M o r a n , L. S ., N w a n k w o , D . O ., S la tk o , B. E., V a n C ott, E. M ., a n d W ils o n , G . G . (1988) C lo n in g ty p e II restrictio n an d m o d ific a tio n g e n e s. Gene 74: 25 -3 2 . L u ria, S. E., a n d S u it, J. L. (1987) C o lic in s a n d C o l p la s m id s . In E sch erich ia c o li and S a lm o n e lla ty p h im u r iu m : Cellular and Molecular Biology. In g ra h a m , J., L o w , K. B., M a g a sa n ik , B., S ch a ech ter, M ., U m b a rg e r, H . E., a n d N e id h a r t, F. C. (ed s). W a sh in g to n : A S M P re ss, p p . 1615-1624. M a k o v e t s , S ., T it h e r a d g e , A . J. B ., a n d M u rra y , N . E. (1998) C lp X a n d C lp P are e s s e n tia l fo r th e e f fic ie n t a c q u is itio n o f g e n e s s p e c ify in g ty p e IA a n d IB r e str ic tio n s y s te m s . Mol Microbiol 28: 25-35. M a n n in g , P. A ., M o r e lli, G ., a n d A c h tm a n , M . (1981) TraG p ro te in o f th e F se x factor o f Escherichia coli K -12 a n d its role in co n ju g a tio n . Proc Natl Acad Sci USA 78: 7487-7491. M a r sh , J. L., E r fle , M ., a n d W y k e s, E. J. (1984) T he pIC p la s m id a n d p h a g e v e c to r s w ith v e r s a tile sites for r e co m b in a n t s e le c tio n b y in se rtio n a l in a c tiv a tio n . Gene 32: 481-485. M a s a i, H ., a n d A r a i, K . (1997) F rpo: a n o v e l s in g le -s tr a n d e d D N A p r o m o t e r fo r tra n scrip tio n a n d for p rim er R N A s y n th e s is o f D N A re p lica tio n . Cell 89: 897-907. M a th e w s , D . H ., S a b in a , J., Z u k e r , M ., a n d T u rn er, D . H . (1999) E x p a n d e d s e q u e n c e d e p e n d e n c e o f th e r m o d y n a m ic p a r a m e te r s im p r o v e s p r e d ic tio n o f R N A s e c o n d a r y structure. / M ol Biol 288: 911-940. M a th e w s , C . K ., a n d v a n H o ld e , K . E. (1990) In B iochem istry. R e d w o o d C ity : T h e Benjam in / C u m m in g s P u b lish in g C om p an y. M a tic , I., T a d d e i, F., a n d R a d m a n , M . (1996) G en etic b arriers a m o n g b a cteria . Trends Microbiol 4: 69-73. M a ts o n , S. W ., a n d M o r to n , B. S. (1991) Escherichia coli D N A h e lic a s e I c a ta ly s e s a site a n d str a n d -sp e c ific n ic k in g reaction at th e F p la sm id oriT. J Biol Chem 266: 109-114. M a z o d ie r , P ., a n d D a v is , J. (1991) G e n e tra n sfer b e tw e e n d is ta n tly r e la te d b a c te r ia . Annu Rev Genet 25: 147-171. M c C la r in , J. A ., F r e d e r ic k , C. A ., W a n g , B. C ., G ree n e , P ., B o y er, H . W ., G r a b le , J., a n d R o s e n b e r g , J. M . (1986) T he stru ctu re o f th e D N A -E coR I e n d o n u c le a s e c o m p le x at 3 A r e so lu tio n . Science 234: 1526-1541. M e r r y w e a th e r , A ., R e e s , C. E. D ., S m ith , N . M ., an d W ilk in s , B. M . (1986) R o le o f sog p o ly p e p t id e s s p e c if ie d b y p la s m id C o lIb -P 9 a n d th eir tra n sfer b e t w e e n c o n ju g a tin g b a cteria . EMBO J 5: 3 0 07-3012. M e y e r , R ., a n d L a in e , P. S. (1 9 9 0 ) T h e s in g le -s tr a n d e d D N A b in d in g p r o t e in o f Escherichia coli. Microbiol Rev 54: 342-380. M e y n e ll, E., a n d D a tta , N . (1967) M u ta n t d r u g resistan t factors o f h ig h tr a n sm issib ility . N ature 259: 4 5 5 -4 5 8 . M ilk m a n , R . (1997) R e c o m b in a tio n a n d p o p u la tio n stru ctu re in Escherichia coli. Genetics 1 46: 7 4 5 -7 5 0 . M i n k l e y , E. G ., a n d W ille t t s , N . S. (1 9 8 4 ) O v e r p r o d u c t io n , c h a ra c ter isa tio n o f th e F traT p ro tein . Mol Gen Genet 196: 225-235. p u r if ic a t io n and M u r a i, N ., K a m a ta , H ., N a g a s h im a , Y., Y a g isa w a , H ., a n d H ira ta , H . (1995) A n o v e l in s e r tio n s e q u e n c e (IS )-lik e e le m e n t o f th e th e r m o p h ilic b a c te r iu m PS3 p r o m o t e s e x p r e s s io n o f th e a la n in e carrier p r o te in -e n c o d in g gen e. Gene 1 6 3:103-107. N a i t o , T ., K u s a n o , K ., a n d K o b a y a s h i, I. (1995) S e lfis h b e h a v io u r o f r e s tr ic t io n m o d ific a tio n s y s te m s . Science 267: 897-899. N e w m a n , A . K ., R u b in , R. A ., K im , S. H ., a n d M o d r ic h , P. (1981) D N A s e q u e n c e s o f str u ctu ra l g e n e s for EcoRI D N A re strictio n an d m o d ific a tio n e n z y m e s . J Biol Chem 256: 2 1 3 1 -2 1 3 9 . N is h ig a k i, K ., K a n e k o , Y., W a k u d a , H ., H u s im i, Y., a n d T a n a k a , T. (1985) T y p e II r e strictio n e n d o n u c le a s e s c le a v e s in g le -s tr a n d e d D N A s in g en er a l. Nucl Acids Res 13: 5 7 4 7 -5 7 6 0 . N o m u r a , N ., M a s a i, H ., I n u z u k a , M ., M iy a z a k i, C ., O h t s u b o , E., Ito h , T ., S a s a m o to , S., M a ts u i, M ., I s h iz a k i, R ., a n d A r a i, K . (1991) Id e n tifica tio n o f e le v e n sin g le -str a n d e d in itia tio n s e q u e n c e s (ssi ) for p r im in g o f D N A re p lica tio n in th e F, R6K, R100 a n d C olE 2 p la s m id s . Gene 108: 15-22. O c h m a n , H ., a n d L a w r e n c e , J. G . (1996) P h y lo g e n e tic s a n d th e a m elio ra tio n o f bacterial g e n o m e s. In E sch erich ia co li and S a lm o n e lla ty p h im u r iu m . Cellular and Molecular Biology. 2 nd e d n . N e id h a r d t , F.C . ( e d .- in - c h ie f ) . W a s h in g to n , D C : A m e r ic a n S o c ie ty for M icr o b io lo g y P r e ss, p p . 2627-2637. O 'N e ill, M ., C h e n , A ., a n d M u rra y , N . E. (1997) T h e r e str ic tio n -m o d ific a tio n g e n e s o f Escherichia coli K -12 m a y n o t b e selfish : T h e y d o n o t re sist lo s s a n d are re a d ily rep la ced b y a lle le s c o n fe r r in g d iffer en t s p e c ific itie s . Proc Natl Acad Sci USA 94: 14596-14601. O u , J. T ., a n d A n d e r s o n , T. F. (1970) R o le o f p ili in b acterial co n ju g a tio n . / Bacteriol 102: 6 4 8 -6 5 4 . P a n ic k e r , M . M ., a n d M in k le y , E. G . Jr. (1985) D N A tr a n sfe r o cc u r s d u r in g a cell su rfa ce co n ta ct s ta g e o f F s e x fa c to r -m e d ia te d b acterial co n ju g a tio n . J Bacteriol 162: 584590. P a n seg ra u , W ., Z ie g e lin , G ., a n d L a n k a , E. (1990a) C o v a le n t a sso cia tio n o f th e tral g en e p r o d u c t o f p la s m id R P4 w ith th e 5 '-te r m in a l n u c le o tid e at th e relaxation N ic k site. / Biol Chem 265: 1 0 6 3 7 -1 0 6 4 4 . P a n se g r a u , W ., B a lz e r , D ., K ru ft, V ., L u r z, R ., a n d L a n k a , E. (1990b) In vitro a ss e m b ly o f r e la x o s o m e s a t th e tra n sfer o r ig in o f p la s m id RP4. Proc Natl Acad Sci USA 87: 65556559. P a n s e g r a u , W ., L a n k a , E ., B a rth , P. T ., F ig u r s k i, D . H ., G u in e y , D . G ., H a a s, D ., H e lin s k i, D . R ., S c h w a b , H ., S ta n is ic h , V . A ., an d T h o m a s , C. M . (1994a) C o m p le te n u c le o tid e s e q u e n c e o f B ir m in g h a m I n c P a p la sm id s: c o m p ila tio n a n d c o m p a r a tiv e a n a ly sis o f s e q u e n c e d ata. / Mol Biol 239: 623-663. P a n se g r a u , W ., S c h r o d e r , W ., a n d L a n k a , E. (1994b) C o n ce rted a ctio n o f th re e d istin ct d o m a in s in th e D N A c le a v in g -jo in in g re a ctio n ca ta ly zed b y rela x a se (Tral) o f co n ju g a tiv e p la s m id RP4. J Biol Chem 269: 2782 -2 7 8 9 . P a n s e g r a u , W ., a n d L a n k a , E. (1991) C o m m o n se q u e n c e m o tifs in D N A re la x a se s and n ic k re g io n s fr o m a v a r ie ty o f D N A tran sfer sy stem s. Nucl Acids Res 19: 3455. P a n s e g r a u , W ., a n d L a n k a , E. (1996) M e c h a n ism o f in itia tio n a n d term in a tio n rea ctio n s in c o n ju g a tiv e D N A p r o c e s s in g - In d e p e n d e n c e o f tig h t su b str a te b in d in g a n d catalytic a c tiv ity o f r e la x a se (Tral) o f IncP a lp h a p la s m id RP4. / Biol Chem 271: 13068-13076. P e a r s o n , W . R ., a n d L ip m a n , D . J. (1 9 8 8 ) I m p r o v e d to o ls fo r b io lo g ic a l s e q u e n c e c o m p a r iso n . Proc Natl Acad Sci USA 85: 2444-2448. P r a k a s h -C h e n g , A ., C h u n g , S. S., a n d R y u , J. (1993) T he e x p r e ss io n a n d r e g u la tio n o f hsdk g e n e s after co n ju g a l transfer. Mol Gen Genet 241: 491-496. P r a k a s h -C h e n g , A ., an d R y u , J. (1993) D e la y e d e x p r e ss io n o f in vivo re strictio n a c tiv ity f o llo w in g c o n ju g a l tra n sfer o f Escherichia coli hsdK (r e str ic tio n -m o d ific a tio n ) g e n e s . / Bacteriol 175: 4 9 0 5 -4 9 0 6 . P r e n tk i, P ., a n d K r isc h , H . M . (1984) In vitro in sertio n a l m u ta g e n e sis w ith a s e le c ta b le D N A fr a g m en t. Gene 19: 303-313 R a s h tc h ia n , A ., C r o o k s, J. H ., a n d L e v y , S. B. (1983) Tra] in d e p e n d e n c e in e x p r e s s io n o f traT o n F. / Bacteriol 154: 1009-1012. R a s to r g u e v , S. M ., Z a v ilg e ls k y , G . B ., a n d T c h u r ik o v , N . A . (1998) I n c ll p la s m id R64 e n c o d e s th e A r sR p r o te in th at a lle v ia te s ty p e I restriction. FEBS Letters 426: 21-23. R e a d , T. D . (1 9 9 3 ) E v a sio n o f re str ic tio n s y s te m s b y p la s m id C olIb -P 9 d u r in g b a cteria l co n ju g a tio n . P h .D th e sis, U n iv e r sity o f L eicester, L eicester U K . R e a d , T. D ., T h o m a s , A . T ., a n d W ilk in s , B. M . (1992) E v a sio n o f ty p e I a n d ty p e II re strictio n s y s t e m s b y I n c ll p la s m id C olIb -P 9 d u r in g tra n sfer b y b acterial co n ju g a tio n . Mol Microbiol 6: 1933-1941. R e d a s c h i, N ., a n d B ic k le , T. A . (1996) D N A r e strictio n a n d m o d ific a tio n s y s te m s . In E sh erich ia c o li and S a lm o n e lla ty p h im u r iu m . Cellular and Molecular Biology. 2nd ed n . N e id h a r d t, F. C. (e d .-in -c h ie f). W a s h in g to n , DC: A m e r ic a n S o c ie ty for M ic r o b io lo g y P re ss, p p . 7 7 3 -7 8 1 . R e e s , C. E. D . (1986) A g e n e tic a n a ly sis o f th e transfer g e n e s o f th e I n c ll p la s m id C o llb P9. P h .D th e s is , U n iv e r s ity o f L eicester, L eicester UK. R e e s , C. E. D ., B r a d le y , D . E., a n d W ilk in s , B. M . (1987) O rg a n isa tio n an d r e g u la tio n o f th e c o n ju g a tio n g e n e s o f I n c ll p la s m id C olIb-P9. Plasmid 18: 223-236. R e e s , C. E. D ., a n d W ilk in s , B. M . (1989) T ransfer o f tra p r o te in s in to the r e c ip ie n t ce ll d u r in g b a cte ria l c o n ju g a tio n m e d ia te d b y p la sm id C olIb-P9. J Bacteriol 171: 3152-3157. R e e s , C. E. D ., a n d W ilk in s , B. M . (1990) P ro tein transfer in to th e re cip ie n t c e ll d u r in g b a cteria l co n ju g a tio n : s tu d ie s w ith F a n d RP4. Mol Microbiol 4: 1199-1205. R o b e r ts , R . J., a n d M a c e lis , D . (1994) REBASE - restrictio n e n z y m e s an d m e th y la s e s . Nucl Acids Res 22: 3628-3639. R o o n g ta , V . A ., J o n e s , C. R ., an d G o r e n s te in , D . G. (1990) E ffect o f d is to r tio n s in th e d e o x y r ib o s e p h o s p h a t e b a c k b o n e c o n fo r m a tio n o f d u p le x o lig o d e o x y r ib o - n u c le o t id e d o d e c a m e r s c o n ta in in g G T, GG, G A , A C a n d G U b a se-p a ir m ism a tc h e s o n P -31 N M R sp ectra . Biochemistry 29: 5245-5258. R o s c o e , R. A . (1996) R e g u la tio n o f le a d in g re g io n g en es o n I n c ll p la s m id C olIb -P 9. P h .D . th e sis. U n iv e r s ity o f L eicester, L eicester, U K . R u s s e ll, C. B ., T h a le r , D . S ., an d D a h lq u is t, F. W. (1989) C h ro m o so m a l tr a n sfo r m a tio n o f Escherichia coli recD strain s w ith lin e a r ise d p la sm id s. J Bacteriol 171: 2609- 2613. S a a d i, S ., M a a s , W . K ., H ill, D . F., a n d B e r g q u ist, P. L. (1987) N u c le o t id e s e q u e n c e a n a ly s is o f R epF IC , a b a sic re p lico n p r e se n t in IncFI p la sm id s P307 a n d F, an d its re la tio n to th e R e p A r e p lic o n o f IncFII p la s m id s. J Bacteriol 169: 1836-1846. S a ik i, R. K . (1989) T h e d e s ig n an d o p tim isa tio n of th e PCR. In PCR Technology: Principles and Applications for D NA Amplification. E rlich, H . A . (ed). N e w York / L o n d o n / T o k y o / M e lb o u r n e / H o n g K ong: M S tock ton P ress, p p . 7-16. S a m b r o o k , J., F ritc h , E. F., an d M a n ia tis , T. M . (1989) Molecular Cloning: A Laboratory M anual- 2 nd e d itio n . C o ld S p rin g H a rb o r, N e w York: C o ld S p rin g H arb or L a b o r a to ry P re ss. S c h a n d e l, K . A ., M u lle r , M . M ., a n d W e b s te r , R. E. (1992) L o c a liz a tio n o f T raC , a p ro te in in v o lv e d in a sse m b ly o f th e F c o n ju g a tiv e p ilu s. J Bacteriol 174: 3800-3806. S h e n , P., a n d H u a n g , H . V . (1986) H o m o lo g o u s r e c o m b in a tio n in Escherichia coli: d e p e n d e n c e o n su b stra te le n g th an d h o m o lo g y . Genetics 112: 441-457. S ia , E. A ., K u e h n e r , D . M ., a n d F ig u r s k i, D . H . (1996) M e c h a n is m o f retro tra n sfer in co n ju g a tio n - p rio r tran sfer o f th e p la s m id is req u ired . / Bacteriol 178: 1457-1464. S ilv e r m a n , P. M . (1987) T he stru c tu r a l b a s is o f p ro k a r y o tic D N A transfer. In Bacterial outer membranes as model systems. I n o u y e , M . (ed ). N e w York: Joh n W ile y a n d S o n s, p p . 2 7 7 -3 0 9 . S im o n s e n , L. (1990) D y n a m ic s o f p la s m id tra n sfer o n s u r fa c e s. / Gen Microbiol 136: 1 0 0 1 -1 0 0 7 . S kurray, R. A ., a n d R e e v e s , P. (1973) C h a ra c terisa tio n o f le th a l z y g o s is a ss o c ia te d w ith con ju g a tio n in Escherichia coli K-12. J Bacteriol 113: 58-70. S m ith , H . R ., H u m p h r e y s , G . O ., W i ls h a w , G . A ., a n d A n d e r s o n , E. S . ( 1 9 7 6 ) C h aracterisation o f p la sm id s co d in g for th e restriction e n d o n u c le a s e EcoRl. M ol Gen Genet 143: 319 -3 2 5 . S o b e r o n , X., C o v a r r u b ia s, L., a n d B o liv a r , F. (1980) C o n str u c tio n a n d c h a r a c te r isa tio n o f n e w c lo n in g v e h ic le s .IV. D e le tio n d e r iv a tiv e s o f pBR 322 a n d pB R 325. Gene 9: 287-305. S ta h l, F. W ., K o b a y a s h i, I., T h a le r, D ., a n d S ta h l, M . M . (1986) D ire ctio n o f tr a v e l o f RecBC r e c o m b in a se th r o u g h b a c te r io p h a g e la m b d a D N A . Genetics 113: 215-227. S tu d ier , F. W ., a n d B a n d y o p a d h y a y , F. W . (1988) M o d e l for h o w ty p e I e n z y m e s s e le c t c lea v a g e s ite s in D N A . Proc Natl Acad Sci USA 85: 4677-4681. S u m m e r s, D . K. (1996) P la sm id d is s e m in a tio n . In The Biology of Plasmids. S u m m e r s, D . K. (ed). O xford : B la c k w e ll S cien ce L td , p p . 92-110. S u r i, B ., a n d B ic k le , T . A . (1985) EcoA: th e first m e m b e r o f a n e w fa m ily o f ty p e I restrictio n m o d ific a tio n s y s te m s - g e n e o r g a n isa tio n a n d e n z y m a tic a ctiv itie s. / Mol Biol 186: 77-85. S z y b a ls k i, W ., K im , S. C ., H a sa n , N ., a n d P o d h a jsk a , A . J. (1991) C lass-IIS re str ic tio n e n z y m e s - a r e v ie w . Gene 100: 13-26. T a o , T ., P a te l, J., F ir m a n , K ., a n d K n e a le , G . (1991) A fa m ily o f r e g u la to r y g e n e s a sso cia te d w ith ty p e II r e str ic tio n -m o d ific a tio n sy ste m s. J Bacteriol 173: 1367-1375. T a r d if, G ., a n d G ra n t, R . B . (1983) T ra n sfer o f p la s m id s fr o m Escherichia coli t o Pseudomonas aeruginosa - c h a r a c te r isa tio n o f a Pseudomonas aeruginosa m u ta n t w it h e n h a n c e d r e c ip ie n t a b ility for e n te ro b a cte ria l p la s m id s . Antimicrobiol Agents Chemo 24: 2 0 1 -2 0 8 . T h o r s te d , P . B ., M a c C a r tn e y , D . P ., A k h ta r , P., H a in e s , A . S ., A li, N ., D a v id s o n , P ., S ta ffo r d , T ., P o c k lin g t o n , M . J., P a n s e g r a u , W ., W ilk in s , B . M ., L a n k a , E ., a n d T h o m a s , C. M . (1998) C o m p le te s e q u e n c e o f th e IncPfJ p la s m id R751: im p lic a tio n s for e v o lu tio n a n d o r g a n isa tio n o f th e In cP b a ck b o n e. / Mol Biol 282: 969-990. T ith e r a d g e , A . J. B ., T e r n e n t, D ., a n d M u rra y , N . E. (1996) A th ird fa m ily o f a lle lic hsd g e n e s in Salmonella enterica: s e q u e n c e c o m p a r is o n s w it h r e la te d p r o te in s id e n t if y c o n s e r v e d re g io n s im p lic a te d in re strictio n o f D N A . Mol Microbiol 22: 437-447. T ra x ler, B. A ., a n d M in k le y , E. G . Jr. (1987) R e v ise d g en etic m a p o f th e d ista l e n d o f th e F tran sfer o p e r o n : im p lic a tio n s for D N A h e lic a s e I, n ic k in g a t oriT , an d c o n ju g a l D N A tra n sp o rt. / B acteriol 169: 3251-3259. U e m u r a , H ., a n d M iz o b u c h i, K. (1 9 8 2 ) G e n e tic a n d p h y s ic a l ch a ra c ter isa tio n o f th e C o llb p la s m id u s in g C olIb-R 222 h y b r id s. M ol Gen Genet 185: 1-12. V a p n e k , D ., a n d R u p p , W . D . (1 9 7 1 ) I d e n tific a tio n o f in d iv id u a l se x -fa c to r D N A str a n d s a n d th e ir r e p lic a tio n d u r in g c o n ju g a tio n in th e r m o s e n s itiv e D N A m u ta n ts o f Escherichia coli. J M o l Biol 60: 413-424. V ie r a , J., a n d M e s s in g , J. (1982) T h e p U C p la s m id s , a n d M 1 3 m p & -d e r iv e d s y s te m for in sertio n m u ta g e n e s is a n d se q u e n c in g w it h sy n th etic u n iv er sa l p rim ers. Gene 19: 259-268. W a lia , S. K ., a n d D u c k w o r t h , D . H . (1986) T h e r e la tio n s h ip o f th e d e lta fa c to r a n d K co llb to th e C o llb p la sm id . / Gen M icrobiol 132: 3261-3268. W a lk er, G . (1987) T h e SO S re sp o n se o f Escherichia coli. In E sch erich ia coli and S a lm o n e lla ty p h im u r iu m . C ellu lar an d M olecu lar B iology. 2nd e d n . N e id h a r d t, F.C. (e d .-in -c h ie f). W a sh in g to n , D C : A m e r ic a n S ociety for M ic r o b io lo g y P ress, p p . 1346-1357. W a n g , J. Y ., a n d S y v a n e n , M . (1 9 9 2 ) D N A tw is t as a tr a n s c r ip tio n a l s e n s o r for e n v ir o n m e n ta l c h a n g e s. M o l M icrobiol 6: 1861-1866. W a ters, V . L ., a n d G u in e y , D . G. (1993) P ro c esses at th e n ick r e g io n lin k c o n ju g a tio n , TD N A tra n sfer a n d r o llin g circle re p lica tio n . M ol M icrobiol 9: 1123-1130. W ic k n e r , S ., a n d M a u r iz i, M . R. (1999) H ere 's th e h ook: sim ila r su b strate b in d in g site s in th e c h a p e r o n e d o m a in s o f C lp a n d L on . Proc N a tl A cad Sci U S A 96: 8318-8320. W ille t t s , N . S , a n d S k u r r a y , R. A . (1987) S tru ctu re an d fu n c tio n o f th e F fa c to r a n d m e c h a n is m o f co n ju g a tio n . In E sch erich ia coli and S a lm o n ella ty p h im u r iu m . N e id h a r d t, F.C. (e d .-in -c h ie f). W a sh in g to n , DC: A m e r ic a n S o c ie ty for M ic r o b io lo g y P ress, p p . 692696. W ille tts , N ., a n d W ilk in s , B. M . (1 9 8 4 ) P r o c e s sin g o f p la s m id D N A d u r in g b a cte ria l c o n ju g a tio n . M icrobiol R ev 48: 24-41. W illia m s , K . R ., S p ic e r , E. K. A . L. R ., a n d G u g g e n h e im e r , R. A . (1 9 8 3 ) L im ite d p r o te o ly s is s tu d ie s o n th e Escherichia coli sin g le -stra n d ed b in d in g p rotein . / Biol Chem 258: 3 3 4 6 -3 3 5 5 . W ilk in s , B . M . (1975) P artial su p p r e s s io n o f th e p h e n o ty p e o f Escherichia coli K -12 dnaG m u ta n ts b y s o m e I-lik e c o n ju g a tiv e p la sm id s. J Bacteriol 122: 899-904. W ilk in s , B. M . (1995) G e n e transfer b y b acterial con ju gation : d iv e r sity o f s y s te m s a n d fu n c tio n a l sp e c ia liz a tio n s. In Population Genetics o f Bacteria. B au m b erg, S., Y o u n g , J. P. W ., S a u n d e r s, S. R., a n d W e llin g to n , E. M . H . (eds) C am bridge: C a m b rid g e U n iv e r sity P ress, p p . 5 9 -8 8 . W i l k i n s , B . M ., a n d B a te s , S. (1 9 9 7 ) G e n e tr a n sfe r b y b a c te r ia l c o n ju g a tio n : E s ta b lis h m e n t o f th e im m ig r a n t p la s m id in th e recip ien t cell. In M olecular M icrobiology. N A T O A S I se r ie s V o l. H 103. B u sb y , S. J. W ., T h om as, C. M ., a n d B row n , N . L. (e d s). B erlin H e id en b e rg : S prin ger-V erlag, p p . 43-64. W ilk in s , B. M ., a n d L a n k a , E. (1993) D N A p r o c e ssin g a n d re p lic a tio n d u r in g p la s m id tran sfer b e tw e e n G ra m -n e g a tiv e b acteria. In Bacterial Conjugation. C lew ell, D .B. (ed ). N e w York: P le n u m P re ss, p p . 105-136. W ilk in s , B . M ., C h i ll e y , P. M ., T h o m a s , A . T ., a n d P o c k lin g t o n , M . J. ( 1 9 9 6 ) D istr ib u tio n o f r e str ic tio n e n z y m e r e c o g n itio n se q u e n c e s o n b r o a d h o st r a n g e p la sm id RP4: m o lec u la r a n d e v o lu tio n a r y im p lic a tio n s. J Mol Biol 258: 447-456. W ils o n , G . G . (1991) O rg a n isa tio n o f r e strictio n -m o d ifica tio n s y s te m s. Nucl Acids Res 19: 2 5 3 9 -2 5 6 6 . W ils o n , G . G ., a n d M u rra y , N . E. (1991) R estrictio n a n d m o d ific a tio n sy ste m s. Ann Rev Genet 25: 5 8 5 -6 2 7 . W in a n s, S. C ., a n d W a lk e r , G . C. (1983) G e n e tic lo c a lis a tio n a n d ch a ra c ter isa tio n o f a p K M lO l-c o d e d e n d o n u c le a s e . / Bacteriol 154: 1117-1125. W in a n s , S. C ., E lle d g e , S. J., K r u e g e r , J. H ., a n d W a lk e r , G . C. (1985) S ite -d ir e c te d in sertion a n d d e le tio n m u ta g e n e sis w ith c lo n e d fra g m en ts in Escherichia coli. J Bacteriol 161: 1 2 1 9 -1 2 2 1 . Y a n isc h -P e r r o n , C ., V ie ir a , J., a n d M e s s in g , J. (1985) I m p r o v e d M 13 p h a g e c lo n in g v ecto rs a n d h o s t strains: n u c le o tid e se q u e n c e s o f th e M 1 3 m p l8 a n d p U C 19 v e c to r s. Gene 33: 103-119. Y u a n , R ., a n d H a m ilt o n , D . L. (1 9 8 4 ) T y p e I a n d ty p e III r e s tr ic tio n -m o d ific a tio n e n z y m e s . In D N A methylation. R a z in , A ., C edar, H ., a n d R ig g s, A . D . (ed s). N e w York: S p rin g er-V erlag, p p . 11-37. Z e c h n e r , E. L., d e la C r u z , F., E ise n b r a n d t, R ., G ra h n , A . M ., K o r a im a n n , G ., L a n k a , E., M u th , G ., P a n s e g r a u , W ., T h o m a s , C. M ., W ilk in s , B. M ., a n d Z a ty k a , M . (1 9 9 9 ) C o n ju g a tiv e D N A tran sfer p r o c e sse s. In T he H o rizo n ta l G en e P ool: Bacterial Plasmids and Gene Spread. T h o m a s, C. M . (ed). A m sterd a m : H a r w o o d A c a d e m ic P u b lish ers, C h ap ter 3.