LOCUS upaG_pcr 5392 bp ds-DNA linear 31-JAN-2009 DEFINITION Escherichia coli CFT073, complete genome. ACCESSION NC_004431 REGION: 4200977..4219997 VERSION NC_004431.1 GI:26245917 PROJECT GenomeProject:313 KEYWORDS . SOURCE Escherichia coli CFT073 ORGANISM Escherichia coli CFT073 Bacteria; Proteobacteria; Gammaproteobacteria; Enterobacteriales; Enterobacteriaceae; Escherichia. REFERENCE 1 (bases 1 to 19021) AUTHORS Welch,R.A., Burland,V., Plunkett,G.D. III, Redford,P., Roesch,P., Rasko,D.A., Buckles,E.L., Liou,S.-R., Boutin,A., Hackett,J., Stroud,D., Mayhew,G.F., Rose,D.J., Zhou,S., Schwartz,D.C., Perna,N.T., Mobley,H.L.T., Donnenberg,M.S. and Blattner,F.R. TITLE Extensive mosaic structure revealed by the complete genome sequence of uropathogenic Escherichia coli JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 17020-17024 (2002) PUBMED 12471157 REFERENCE 2 (bases 1 to 19021) CONSRTM NCBI Genome Project TITLE Direct Submission JOURNAL Submitted (10-SEP-2004) National Center for Biotechnology Information, NIH, Bethesda, MD 20894, USA REFERENCE 3 (bases 1 to 19021) AUTHORS Welch,R.A., Burland,V., Plunkett,G.D. III, Redford,P., Roesch,P., Rasko,D.A., Buckles,E.L., Liou,S.-R., Boutin,A., Hackett,J., Stroud,D., Mayhew,G.F., Rose,D.J., Zhou,S., Schwartz,D.C., Perna,N.T., Mobley,H.L.T., Donnenberg,M.S. and Blattner,F.R. TITLE Direct Submission JOURNAL Submitted (20-JUN-2002) Genetics Laboratory, University of Wisconsin - Madison, 445 Henry Mall, Madison, WI 53706, USA COMMENT PROVISIONAL REFSEQ: This record has not yet been subject to final NCBI review. The reference sequence was derived from AE014075. COMPLETENESS: full length. COMMENT ApEinfo:methylated:1 FEATURES Location/Qualifiers misc_feature 28..1167 /label=Invasin domain /ApEinfo_fwdcolor=#c0c0c0 /ApEinfo_revcolor=#c0c0c0 misc_feature 28..186 /label=prepro /ApEinfo_fwdcolor=#800080 /ApEinfo_revcolor=#800080 misc_feature 5098..5361 /label=L1 L2 beta /ApEinfo_fwdcolor=#dd2c22 /ApEinfo_revcolor=#dd2c22 misc_feature 1168..5097 /label=Him region /ApEinfo_fwdcolor=#8080c0 /ApEinfo_revcolor=#8080c0 misc_feature 5..10 /label=XhoI /ApEinfo_fwdcolor=#00ff00 /ApEinfo_revcolor=#00ff00 misc_feature 5382..5387 /label=XhoI(1) /ApEinfo_label=XhoI /ApEinfo_fwdcolor=#00ff00 /ApEinfo_revcolor=#00ff00 misc_feature 798..803 /label=EcoRI /ApEinfo_fwdcolor=#00ff00 /ApEinfo_revcolor=#00ff00 misc_feature 286..291 /label=SpeI /ApEinfo_fwdcolor=#00ffff /ApEinfo_revcolor=#00ffff misc_feature 1076..1081 /label=PstI /ApEinfo_fwdcolor=#0080ff /ApEinfo_revcolor=#0080ff misc_feature 1522..1527 /label=PstI(1) /ApEinfo_label=PstI /ApEinfo_fwdcolor=#0080ff /ApEinfo_revcolor=#0080ff ORIGIN 1 CGCGCTCGAG ATAATAAGGA ATCAATAATG AACAAAATAT Ttaaagttat ctggaatccg 61 gcaacaggca gttacaccgt tgccagcgaa acggcgaaga gccgtggtaa aaaaagcggg 121 cgcagtaagc tgttaatttc tgcactggtt gcgggtgggt tgttgtcgtc gtttggggca 181 agtgcagata attacactgg gcagccaact gattatggcg atggctcagc aggtgacggc 241 tgggttgcta tcggtaaagg ggcaaaagca aataccttta tgaacactag tggcgcgagt 301 acagctttag gatatgacgc gatagccgaa ggtgagtaca gttctgccat cgggtcaaaa 361 acccttgcaa ctggtggagc atccatggcg ttcggggtta gtgcaaaagc aatgggtgac 421 agaagtgtcg cgctaggtgc atcgtcagta gcaaatggcg atcgttcgat ggcttttggt 481 cgttacgcaa agacgaatgg ttttacatct cttgctattg gggactcctc ccttgccgat 541 ggtgaaaaaa ctattgcgtt aggaaatacg gctaaagctt acgaaattat gagcatcgcc 601 ctcggtgata atgccaatgc gtcaaaagag tatgcaatgg cgctgggagc aagtagcaaa 661 gctggcggtg ctgatagcct cgcattcggc agaaaatcta cagctaatag cactggctca 721 ctggcaatag gtgctgacag tagcagttcg aacgataacg ccatcgcgat agggaacaaa 781 acgcaagccc tgggagtgaa ttcgatggcc ctgggtaatg caagtcaggc atctggcgaa 841 tccagtattg cattaggtaa caccagtgaa gccagcgaac aaaatgcgat tgcgctgggg 901 caaggtagca ttgcaagcaa agtgaactca atcgcgttgg gaagtaacag tttgtcctcg 961 ggagagaatg ccatcgcatt gggagagggt agtgccgctg gtggcagcaa cagccttgct 1021 ttcggtagcc agtccagggc aaacggcaat gattctgtcg ccatcggtgt aggggctgca 1081 gcagcgaccg acaattctgt cgctatcggc gcaggatcga ccacagatgc aagcaatacg 1141 gtttcagttg gcaacagcgc aacaaaacgc aaaattgtta atatggctgc tggtgccata 1201 agcaacacca gtaccgatgc catcaacggc tcacagcttt atacgatcag tgattcagtc 1261 gccaagcgac tcggaggagg cgctactgta ggcagcgatg gcaccgtaac cgcagtaagc 1321 tacgcgttga gaagcggaac ctataataac gtgggtgatg ctctgtcagg aatcgacaat 1381 aataccctac aatggaataa aaccgcgggg gcgttcagcg ccaatcacgg tgcaaatgcc 1441 accaacaaaa tcactaatgt tgctaaaggt acggtttctg caaccagcac cgatgtagta 1501 aacggctctc aattgtacga cctgcagcag gatgctctgt tgtggaacgg cacagcattc 1561 agtgccgcac acggcaccga agccaccagc aaaatcacta acgtcaccgc tggcaacctg 1621 actgccggca gcactgacgc cgttaacggc tctcagctca aaaccaccaa cgacaacgtg 1681 acgaccaaca ccaccaacat cgccactaac accaccaata tcaccaacct gactgacgct 1741 gttaacggtc tcggtgacga ctccctgctg tggaacaaag cagctggcgc attcagcgcc 1801 gcgcacggca ccgaagccac cagcaaaatc accaacgtca ccgctggcaa cctgactgcc 1861 ggtagcactg acgccgttaa cggctcccag ctcaaaacca ccaacgacaa cgtgacgacc 1921 aacaccacca acatcgccac taacaccacc aatatcacca acctgactga cgctgttaac 1981 ggtctcggtg acgactccct gctgtggaac aaaacagctg gcgcattcag cgccgcgcac 2041 ggcactgacg ccaccagcaa gatcaccaac gtcaccgctg gcaacctgac tgccggcagc 2101 actgacgccg ttaacggctc ccagctcaaa accaccaacg acaacgtgac gaccaacacc 2161 accaacatcg ccactaacac caccaatatc accaacctga ctgacgctgt taacggtctc 2221 ggtgacgact ccctgctgtg gaacaaaaca gctggcgcat tcagcgccgc gcacggcact 2281 gacgccacca gcaagatcac caatgtcaaa gccggtgacc tgacagctgg cagcactgac 2341 gccgttaacg gctctcagct caaaaccacc aacgataacg tgtcgaccaa caccaccaac 2401 atcaccaacc tgactgacgc tgttaacggt ctcggtgacg actccctgct gtggaacaaa 2461 acagctggcg cattcagcgc cgctcacggc actgacgcca ccagcaagat caccaatgtc 2521 aaagccggtg acctgacagc tggcagcact gacgccgtta acggctccca gctcaaaacc 2581 accaacgata acgtgtcgac caacaccacc aacatcacta acctgacgga ttccgttggc 2641 gaccttaagg acgattctct gctgtggaac aaagcggctg gcgcattcag cgccgcgcac 2701 ggtaccgaag ctaccagcaa gatcaccaac ttactggctg gcaagatatc ttctaacagc 2761 actgatgcca ttaatggctc acaactttat ggcgtagcgg attcatttac gtcatatctt 2821 ggtggtggtg ctgatatcag cgatacgggt gtattaagtg ggccaaccta cactattggt 2881 ggtactgact acactaacgt cggtgatgct ctggcagcca ttaacacatc atttagcaca 2941 tcactcggcg acgccctact ttgggatgca accgcaggca aattcagcgc caaacacggc 3001 attaataatg ctcccagtgt aatcactgat gttgcaaacg gtgcagtctc gtccaccagc 3061 agcgacgcca ttaacggttc acaactttat ggtgttagtg actacattgc cgatgctctg 3121 ggcgggaatg ctgtggtgaa cactgacggc agtatcacta caccaactta tgccatcgct 3181 ggcggcagtt acaacaacgt cggtgacgcg ctggaagcga tcgataccac gctggatgat 3241 gctctgctgt gggatacaac agccaatggc ggtaacggtg catttagcgc cgctcacggg 3301 aaagataaaa ctgccagtgt aatcactaac gtcgctaacg gtgcagtctc tgccaccagc 3361 aacgatgcca ttaatggctc acagctctat agcactaata agtacatcgc tgatgcgctg 3421 ggtggtgatg cagaagtcaa cgctgacggt actatcactg caccgactta caccattgca 3481 aataccgatt acaacaacgt cggtgaagcc ctggatgcgc tcgataataa cgcgctgctg 3541 tgggatgaag acgcaggtgc ctacaacgcc agccatgatg gcaatgccag caaaatcacc 3601 aacgttgcgg ctggtgatct ctccacaacc agtaccgatg ctgttaacgg ttcccagtta 3661 aacgcaacca atattctggt tacgcaaaat agccaaatga ttaaccagct tgctggtaac 3721 actagcgaaa cctacatcga ggaaaacggt gcgggtatta actatgtacg taccaacgac 3781 agcggcttag cgttcaacga tgccagcgct tcaggtattg gcgctacagc tgtaggttat 3841 aacgcagttg cctctcatgc cagcagtgta gccatcggtc aggacagcat cagcgaagtt 3901 gatacgggta tcgctctggg tagcagttcc gtttccagcc gtgtaatagt taaagggact 3961 cgtaacacca gcgtatcgga agaaggtgtt gtgattggtt atgacaccac ggatggcgaa 4021 ctgcttggcg cgttgtcgat tggtgatgac ggtaaatatc gtcaaatcat caacgtcgcg 4081 gatggttctg aagcccatga tgcggtcact gttcgccagt tgcaaaacgc cattggtgca 4141 gtcgcaacca caccaaccaa atactatcac gccaactcaa cggctgaaga ctcactggca 4201 gtcggtgaag actcgctggc aatgggcgcg aaaaccatcg ttaatggtaa tgcgggtatt 4261 ggtatcggcc tgaacacgct ggttctggct gatgcgatca acggtattgc tatcggttct 4321 aacgcacgcg caaatcatgc cgacagcatt gcaatgggta atggttctca gactacccgt 4381 ggtgcgcaga ccaactacac tgcctacaac atggatgcac cgcagaactc tgtgggtgag 4441 ttctctgtcg gcagtgaaga cggtcaacgt cagatcacca acgtcgcagc aggttcggcg 4501 gataccgatg cggttaacgt gggtcagttg aaagtaacgg acgcgcaggt ttcccagaat 4561 acccagagca ttactaacct gaacactcag gtcactaatc tggatactcg cgtgaccaat 4621 atcgaaaacg gcattggcga tatcgtaacc accggtagca ctaagtactt caagaccaac 4681 accgatggcg cagatgccaa cgcgcagggt aaagacagtg ttgcgattgg ttctggttcc 4741 attgctgccg ctgacaacag cgtcgcactg ggcacgggtt ccgtagcaga cgaagaaaac 4801 accatctctg tgggttcttc taccaaccag cgtcgtatca ccaacgttgc tgccggtgtt 4861 aatgccaccg atgcggttaa cgtttcgcaa ctgaagtctt ctgaagcagg cggcgttcgc 4921 tacgacacca aagctgatgg ctctatcgac tacagcaaca tcactctcgg tggcggcaat 4981 agcggtacga ctcgcatcag caacgtttct gctggcgtga acaacaacga cgcagtgaac 5041 tatgcgcagt tgaagcaaag tgtgcaggaa acgaagcaat acaccgatca gcgcatggtt 5101 gagatggata acaaactgtc caaaactgaa agcaagctga gtggtggtat cgcttctgca 5161 atggcaatga ccggtctgcc gcaggcttac acgccgggtg ccagcatggc ctctattggt 5221 ggcggtactt acaacggtga atcggctgtt gctttaggtg tgtcgatggt gagcgccaat 5281 ggtcgttggg tctacaaatt acaaggtagt accaatagcc agggtgaata ctccgccgca 5341 ctcggtgccg gtattcaGTG GTAATCATCC ATTAACAAAT GCTCGAGCGC CG //