1 FASTA searches a protein or DNA sequence data bank
2 version 3.3t08 Jan. 17, 2001
4 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
6 t/data/cysprot1.fa: 343 aa
8 vs /data_2/jason/blastdb/ecoli.aa library
9 searching /data_2/jason/blastdb/ecoli.aa library
13 22 0 0: one = represents 7 library sequences
20 36 130 127:==================*
21 38 252 210:=============================*======
22 40 310 293:=========================================*===
23 42 405 359:===================================================*======
24 44 401 396:========================================================*=
25 46 386 403:======================================================== *
26 48 348 386:================================================== *
27 50 360 352:==================================================*=
28 52 290 309:========================================== *
29 54 264 264:=====================================*
30 56 215 221:===============================*
31 58 145 181:===================== *
32 60 144 147:====================*
33 62 119 118:================*
34 64 96 94:=============*
46 88 2 5:* inset = represents 1 library sequences
63 1358987 residues in 4289 sequences
64 Expectation_n fit: rho(ln(x))= 5.9493+/-0.00202; mu= 2.7408+/- 0.115
65 mean_var=77.5610+/-17.011, 0's: 0 Z-trim: 0 B-trim: 2 in 1/41
67 Kolmogorov-Smirnov statistic: 0.0234 (N=29) at 44
69 FASTA (3.36 June 2000) function [optimized, BL50 matrix (15:-5)] ktup: 2
70 join: 37, opt: 25, gap-pen: -12/ -2, width: 16
72 The best scores are: opt bits E(4289)
73 gi|1787478|gb|AAC74309.1| (AE000221) nitrate redu ( 512) 92 29 1.2
74 gi|1790635|gb|AAC77148.1| (AE000491) putative DEO ( 251) 84 27 2.1
75 gi|1786590|gb|AAC73494.1| (AE000145) orf, hypothe ( 94) 78 26 2.1
76 gi|1790853|gb|AAC77345.1| (AE000509) soluble lyti ( 654) 84 28 4.8
77 gi|1789307|gb|AAC75975.1| (AE000377) biosynthetic ( 658) 83 27 5.6
78 gi|1788174|gb|AAC74937.1| (AE000280) orf, hypothe ( 199) 74 25 7.4
79 gi|1789138|gb|AAC75818.1| (AE000361) putative kin ( 492) 79 26 7.8
80 gi|1789427|gb|AAC76084.1| (AE000386) orf, hypothe ( 354) 76 26 9.1
82 >>gi|1787478|gb|AAC74309.1| (AE000221) nitrate reductase (512 aa)
83 initn: 35 init1: 35 opt: 92 Z-score: 109.2 bits: 29.2 E(): 1.2
84 Smith-Waterman score: 92; 23.936% identity (26.012% ungapped) in 188 aa overlap (125-305:2-181)
86 100 110 120 130 140 150
87 CYS1_D NKEAIFTDDLPVADYLDDEFINSIPTAFDWRTRGAVTPVKNQGQCGSCWSFSTT-GNV--
88 . :. : : : .: .: . :.: ::
89 gi|178 MKIRSQVGMVLNLDKCIGCHTCSVTCKNVWT
93 CYS1_D --EGQHFISQNKLVSLSEQNLVDCDHECME-YEGEEACDEGCNGGLQPNAYNYIIKNGGI
94 :: .. :.. . :.. : : .: :.: . :: ::: : . : :
95 gi|178 SREGVEYAWFNNVETKPGQGF-PTDWENQEKYKGGWI--RKINGKLQPRMGNRAMLLGKI
98 210 220 230 240 250 260
99 CYS1_D QTESSYPYTAETGTQCNFNSANIGAKISNFTMIPKNETVMAGYIVSTGP-LAIAADAVEW
100 .. : . .:. :. . . :.. . . :: .: . .:
101 gi|178 FANPHLPGIDDYYEPFDFDYQNLHTAPEG----SKSQPIARPRSLITGERMAKIEKGPNW
102 90 100 110 120 130 140
104 270 280 290 300 310 320
105 CYS1_D QFYIGGVFDIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRR
106 . .:: :: . ...:. : . :: .. : .:
107 gi|178 EDDLGGEFDKLAKDKNFDN-IQKAMYSQFENTFMMYLPRLCEHCLNPACVATCPSGAIYK
108 150 160 170 180 190 200
111 CYS1_D GKNTCGVSNFVSTSII
113 gi|178 REEDGIVLIDQDKCRGWRMCITGCPYKKIYFNWKSGKSEKCIFCYPRIEAGQPTVCSETC
114 210 220 230 240 250 260
116 >>gi|1790635|gb|AAC77148.1| (AE000491) putative DEOR-typ (251 aa)
117 initn: 46 init1: 46 opt: 84 Z-score: 104.9 bits: 27.4 E(): 2.1
118 Smith-Waterman score: 84; 22.078% identity (23.288% ungapped) in 77 aa overlap (99-171:119-195)
121 CYS1_D HKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIPTAFDWRTRG
122 :.:. ::.:.:: :. .:. ...
123 gi|179 QLVNPGESVVINCGSTAFLLGREMCGKPVQIITNYLPLANYLIDQEHDSVIIMGGQYNKS
124 90 100 110 120 130 140
126 130 140 150 160 170 180
127 CYS1_D AVTPVKNQGQCGSC----WSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEE
128 .. ::. .: : :.. .. .. . . . :....::....
129 gi|179 QSITLSPQGSENSLYAGHWMFTSGKGLTAEGLYKTDMLTAMAEQKMLSVVGKLVVLVDSS
130 150 160 170 180 190 200
132 190 200 210 220 230 240
133 CYS1_D ACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGAKISNFTMIPKN
135 gi|179 KIGERAGMLFSRADQIDMLITGKNANPEILQQLEAQGVSILRV
138 >>gi|1786590|gb|AAC73494.1| (AE000145) orf, hypothetical (94 aa)
139 initn: 37 init1: 37 opt: 78 Z-score: 104.8 bits: 25.9 E(): 2.1
140 Smith-Waterman score: 78; 36.842% identity (43.750% ungapped) in 38 aa overlap (242-278:42-74)
142 220 230 240 250 260 270
143 CYS1_D SSYPYTAETGTQCNFNSANIGAKISNFTMIPKNETVMAGYIVSTGPLAIAADAVEWQFY-
144 :.. ::..: . : ::..:: :
145 gi|178 VKSIGFSSSSTGRASVGVMVEGEYTFSTAEPEEMTVISGALNVLLP-----DATDWQVYE
148 280 290 300 310 320 330
149 CYS1_D IGGVFDIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKN
151 gi|178 AGSVFNVPGHSEFHLQVAEPTSYLCRYL
154 >>gi|1790853|gb|AAC77345.1| (AE000509) soluble lytic mur (654 aa)
155 initn: 61 init1: 61 opt: 84 Z-score: 98.5 bits: 27.6 E(): 4.8
156 Smith-Waterman score: 84; 32.692% identity (34.694% ungapped) in 52 aa overlap (104-152:104-155)
158 80 90 100 110 120 130
159 CYS1_D KFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIPTAFDWRTRGAVTPV
160 :: : :...:.: . ::: : .:
161 gi|179 YPYLEYRQITDDLMNQPAVTVTNFVRANPTLPPARTLQSRFVNELARREDWRGLLAFSPE
162 80 90 100 110 120 130
164 140 150 160 170 180 190
165 CYS1_D K---NQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEACDEGC
167 gi|179 KPGTTEAQCNYYYAKWNTGQSEEAWQGAKELWLTGKSQPNACDKLFSVWRASGKQDPLAY
168 140 150 160 170 180 190
170 >>gi|1789307|gb|AAC75975.1| (AE000377) biosynthetic argi (658 aa)
171 initn: 41 init1: 41 opt: 83 Z-score: 97.3 bits: 27.4 E(): 5.6
172 Smith-Waterman score: 83; 23.913% identity (24.176% ungapped) in 92 aa overlap (178-268:315-406)
174 150 160 170 180 190 200
175 CYS1_D TGNVEGQHFISQNKLVSLSEQNLVDCDHECMEYEGEEA-CDEGCNGGLQPNAYNYIIKNG
176 ..::: .. : . : ::. : : : :
177 gi|178 TGVRESARFYVELHKLGVNIQCFDVGGGLGVDYEGTRSQSDCSVNYGLNEYANNIIWAIG
178 290 300 310 320 330 340
180 210 220 230 240 250 260
181 CYS1_D GIQTESSYPYTAETGTQCNFNSANIGAKISNFTMIPKNETVMAGYIVSTGPLAIAADAVE
182 :.. :. . . .:. . .::. . .:: .. . .: :. .
183 gi|178 DACEENGLPHPTVITESGRAVTAHHTVLVSNIIGVERNEYTVPTAPAEDAPRALQSMWET
184 350 360 370 380 390 400
186 270 280 290 300 310 320
187 CYS1_D WQFYIGGVFDIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLR
189 gi|178 WQEMHEPGTRRSLREWLHDSQMDLHDIHIGYSSGIFSLQERAWAEQLYLSMCHEVQKQLD
190 410 420 430 440 450 460
192 >>gi|1788174|gb|AAC74937.1| (AE000280) orf, hypothetical (199 aa)
193 initn: 46 init1: 46 opt: 74 Z-score: 95.2 bits: 25.2 E(): 7.4
194 Smith-Waterman score: 74; 43.750% identity (50.000% ungapped) in 32 aa overlap (308-335:110-141)
196 280 290 300 310 320 330
197 CYS1_D PCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRR-GKNT---CG
198 :.: .::: .: . . ::: : .: ::
199 gi|178 PVDAPSPAKVLPENWWQHPAALGATDSDIEIIKRQWGAFYGTDLELQLRRRGIDTIVLCG
200 80 90 100 110 120 130
205 gi|178 ISTNIGVESTARNAWELGFNLVIAEDACSAASAEQHNNSINHIYPRIARVRSVEEILNAL
206 140 150 160 170 180 190
208 >>gi|1789138|gb|AAC75818.1| (AE000361) putative kinase [ (492 aa)
209 initn: 36 init1: 36 opt: 79 Z-score: 94.7 bits: 26.5 E(): 7.8
210 Smith-Waterman score: 84; 19.136% identity (21.233% ungapped) in 162 aa overlap (34-192:165-313)
213 CYS1_D ILLFVLAVFTVFVSSRGIPPEEQSQFLEFQDKFNKKYSHEEYLERFEIFKSNLGKIEELN
215 gi|178 GEFKDNIANYFGQWPVDYKSWAWSEDAAVMDKFNIP---RHMLFDVQMPGTVLGHITPQA
216 140 150 160 170 180 190
219 CYS1_D LIAINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFINSIPTAFD
220 .: . : : .: . . :... :... .: ... . . . :.:.
221 gi|178 ALATHFPAGLPV-VCTTSDKPVEALGAGLLDDETAVISLGTYIALMMNGKALPKDPVAY-
224 130 140 150 160 170 180
225 CYS1_D WRTRGAVTPV---KNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHECMEY
226 : ... . .. : . :. : .. :. .:.. . .:: ..:.. :.
227 gi|178 WPIMSSIPQTLLYEGYGIRKGMWTVSWLRDMLGESLIQDARAQDLSPEDLLNKKASCVP-
228 250 260 270 280 290 300
230 190 200 210 220 230 240
231 CYS1_D EGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGAKISNFTM
233 gi|178 -------PGCNGLMTVLDWLTNPWEPYKRGIMIGFDSSMDYAWIYRSILESVALTLKNNY
234 310 320 330 340 350 360
236 >>gi|1789427|gb|AAC76084.1| (AE000386) orf, hypothetical (354 aa)
237 initn: 65 init1: 40 opt: 76 Z-score: 93.5 bits: 25.8 E(): 9.1
238 Smith-Waterman score: 76; 22.619% identity (23.899% ungapped) in 168 aa overlap (141-303:81-244)
240 120 130 140 150 160 170
241 CYS1_D DDEFINSIPTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNL
243 gi|178 GDKIWQSSEYFMNVFCNNALPGPSPGEEYPSAWANIMMLLASGQDFYNQNSYTFGVTYNG
247 CYS1_D VDCDHECMEYEGEEACDEGCNGGLQPNAYNY-IIKNGGIQTESSYPYTAETGTQCNFNSA
248 :: : . .: . ..: :.:. . .:: . . : . ... : .. :
249 gi|178 VDYDSTSPLPIAAPVCIDIKGAGTFGNGYKKPAVCSGGPEPQLSVTFPVRV--QLYIKLA
252 230 240 250 260 270 280
253 CYS1_D NIGAKISNFTMIPKNETVMAGYIVSTGPLAIAADAVEWQFYIGGVFDI---PCNPN-SLD
254 . . :... ..: .: . . .: :: .: . : : :. .: : : .:.
255 gi|178 KNANKVNKKLVLP-DEYIALEFKGMSGAGAIEVDK-NLTFRIRGLNNIHVLDCFVNVDLE
256 170 180 190 200 210 220
258 290 300 310 320 330 340
259 CYS1_D HGILIVGYSAKNTIFRKNMPYWIVKNSWGADWGEQGYIYLRRGKNTCGVSNFVSTSII
261 gi|178 PADGVVDFGKINSRTIKNTSVSETFSVVMTKDPGAACTEQFNILGSFFTTDILSDYSHLD
262 230 240 250 260 270 280
266 343 residues in 1 query sequences
267 1358987 residues in 4289 library sequences
269 start: Sat Dec 8 11:43:36 2001 done: Sat Dec 8 11:43:37 2001
270 Scan time: 1.110 Display time: 0.090
272 Function used was FASTA [version 3.3t08 Jan. 17, 2001]