CN1849391A - 腈水解酶、编码腈水解酶的核酸,以及制备和使用它们的方法 - Google Patents
腈水解酶、编码腈水解酶的核酸,以及制备和使用它们的方法 Download PDFInfo
- Publication number
- CN1849391A CN1849391A CNA038166828A CN03816682A CN1849391A CN 1849391 A CN1849391 A CN 1849391A CN A038166828 A CNA038166828 A CN A038166828A CN 03816682 A CN03816682 A CN 03816682A CN 1849391 A CN1849391 A CN 1849391A
- Authority
- CN
- China
- Prior art keywords
- residue
- polypeptide
- sequence
- nucleic acid
- places
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P7/00—Preparation of oxygen-containing organic compounds
- C12P7/40—Preparation of oxygen-containing organic compounds containing a carboxyl group including Peroxycarboxylic acids
- C12P7/42—Hydroxy-carboxylic acids
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/14—Hydrolases (3)
- C12N9/78—Hydrolases (3) acting on carbon to nitrogen bonds other than peptide bonds (3.5)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P13/00—Preparation of nitrogen-containing organic compounds
- C12P13/002—Nitriles (-CN)
- C12P13/004—Cyanohydrins
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P13/00—Preparation of nitrogen-containing organic compounds
- C12P13/04—Alpha- or beta- amino acids
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P41/00—Processes using enzymes or microorganisms to separate optical isomers from a racemic mixture
- C12P41/006—Processes using enzymes or microorganisms to separate optical isomers from a racemic mixture by reactions involving C-N bonds, e.g. nitriles, amides, hydantoins, carbamates, lactames, transamination reactions, or keto group formation from racemic mixtures
Abstract
本发明涉及腈水解酶和编码腈水解酶的核酸。并且,也提供了设计新颖腈水解酶的方法和使用它们的方法。腈水解酶在增高的pH和温度下具有增加的活性和稳定性。
Description
相关申请的交叉参考文献
【0001】本申请要求基于在2002年9月9日申请的美国专利申请序列号No.(USSN)10/241,742和在2002年5月15日申请的USSN 10/146,772的优先权权益,这两个申请要求于2002年1月22日申请的USSN 60/351,336,于2001年7月30日申请的USSN 60/309,006,和于2001年6月21日申请的USSN 60/300,189的优先权权益;并且,本申请是于2000年12月28日申请的USSN 09/751,299的延续申请,该申请要求于2000年12月7日申请的USSN 60/254,414和于1999年12月29日申请的USSN 60/173,609的每一个申请的优先权权益。此处完整引入这些申请的所有目的是作为主题申请的参考。
版权宣告
【0002】依照37 C.F.R.§1.71(e),本专利文件的一部分包括受版权保护的材料。对任意一份专利文件或专利公开的影印复制,如同它出现在专利商标局的专利文件或记录中时,版权所有人对其没有异议,但无论如何保留所有版权权利。
发明领域
【0003】本发明总的来说涉及分子生物学、生物化学和化学领域,尤其是涉及具有腈水解酶(nitrilase)活性的酶蛋白(enzymatic proteins)。本发明也涉及编码该酶的多核苷酸,涉及这些多核苷酸和酶的用途。
发明背景
【0004】天然存在的酶在将腈转化为大量有用的产品和中间产物的工业化学加工中具有极大应用潜力。这样的酶包括腈水解酶,该酶能将腈直接转化为羧酸。腈水解酶存在于大范围的嗜温微生物中,包括如下种类:芽孢杆菌(Bacillus)、诺卡氏菌(Norcardia)、Bacteridium、红球菌属(Rhodococcus);以及存在于包括如下物种的微生物中:芽孢杆菌、Norcardia、Bacteridium、红球菌属(Rhodococcus)、微球菌(Micrococcus)、短杆菌属(Brevibacterium)、产碱杆菌(Alcaligenes)、不动细菌属(Acinetobacter)、棒状杆菌(Corynebacterium)、镰刀菌(Fusarium)和克雷伯氏菌(Klebsiella)。此外,还有存在于细菌中的嗜热腈水解酶(thermophilicnitrilases)。
【0005】从腈到类似酸有两种主要途径:(1)腈水解酶催化腈直接水解为羧酸,同时伴随着释放氨;或者(2)腈水合酶通过碳氮键合体系添加水分子,以产生相应的酰胺,然后该酰胺能作为酰胺酶的底物,该酶水解碳-氮键以产生羧酸产物,同时伴随着释放氨。因此,腈水解酶提供了产生酸的更直接途径。
【0006】腈基团在合成途径的设计中有很多优点,它通常更容易被引入到分子结构中,而且作为掩蔽酸或酰胺基团,能被携带通过许多过程。然而,如果腈在合成中的相应步骤能被去掉掩蔽,这是反而有用的。氰化物代表具有广泛应用价值的C1-合成纤维(氰化物是少数几个在水中稳定的负碳离子之一),它可以在碳骨架的合成中被使用。然而,由于使用正常化学合成程序需要苛刻的反应条件来进行腈的水解,所以如此得到的腈的进一步转化受到了阻碍。使用酶来催化腈的反应是很有吸引力的,这是因为腈水解酶能完成反应,与许多传统化学方法相比较,该反应具有更少的对环境有害的试剂和副产物。实际上,腈的化学选择性的生物催化水解代表一个有价值的选择方案,原因在于它可以在室温(ambienttemperature)和接近生理pH的条件下发生。
【0007】药物设计和发现中的不对称有机合成的重要意义推动了对新合成方法和手性前体(chiral precursors)的寻找,手性前体可以在具有生物学重要性的复杂分子开发中被使用。手性分子(chiral molecules)的一个重要种类是α-取代的羧酸,其包括α-氨基酸。这些分子长期以来已经被公认是多种复杂的具有生物活性的分子的重要的手性前体,大量研究努力已经被投入到对映体纯的α-氨基酸和手性药物的合成方法的开发上。
【0008】对于制造手性药物的合成化学家来说,尤其有用的将是一种在未消毒条件下有用的酶系统,该系统在非生物实验室里有用,它可以以便于存储和使用的形式获得;该系统具有广泛的底物特异性,可以在水溶性极差的底物上发挥作用;该系统具有可预知的产物结构;该系统可以提供对酸或酰胺产物的选择;并且该系统能够手性区分(chiral differentiation)。因此,存在对于有效的、不昂贵的、高产量的合成方法的需求,该方法用于产生对映异构体纯的α-取代的羧酸,诸如,例如α-氨基酸和α-羟基酸。
【0009】此外,通常可以通过利用一种简便的高通量筛选或选择方法来帮助发现或演变进行特定转化的酶。尽管在不能得到有效的超高通量(ultra high throughout,UHTP)筛选之时,可以使用一种替代底物,但直接筛选一种特定地进行期望转化的酶还是需要的。设计UHTP筛选的前景是显然的,例如,当发现或演变计划的目的在于揭示立体选择性转化,以产生仅仅一种立体异构体或对映异构体时。在这种情况下,可以利用的高通量筛选方法是极其缺乏的。尽管最简单的方法是使用手性液相或气相分离法来分离两种正在讨论的对映异构体,但该方法通常不能提供所要求的极高通量容量。通过使用质谱学(MS)技术,极高通量筛选是可能的。然而,当以传统方式应用时,MS不能提供有关手性或对映体选择性的信息。
【00010】另一种方法是用一种单一对映异构体化合物化学方法衍生对映异构混合物,从而产生化合物的非对映体混合物,该混合物能在非手性固定相上分离的。而且,这是一种很麻烦的方法,不能很好地参与高通量筛选。
【00011】在整个申请中,作者参考了各种公开文献并且注明了日期。此处将这些公开文献的公开内容完整地引入本申请中,以便完整地描述本技术领域的技术人员所已知的现有技术的状况,其中现有技术至所描述的和所要求的本发明的日期为止。
发明概述
【00012】本发明关于一种分离核酸或重组核酸,包括核苷酸,具有一个与如下序列至少有大约50%相同的序列,如下序列为SEQ ID NO:1,3,5,7,9,11,13,15,17,19,21,23,25,27,29,31,35,37,39,41,43,45,47,49,51,53,55,57,59,61,63,65,67,69,71,73,75,77,79,81,83,85,87,89,91,93,95,97,99,101,103,105,107,109,111,113,115,117,119,121,123,125,127,129,131,133,135,137,139,141,143,145,147,149,151,153,155,157,159,161,163,165,167,169,171,173,175,177,179,181,183,185,187,189,191,193,195,197,199,201,203,205,207,209,211,213,215,217,219,221,223,225,227,229,231,233,235,237,239,241,243,245,247,249,251,253,255,257,259,261,263,265,267,269,271,273,275,277,279,281,283,285,287,289,291,293,295,297,299,301,303,305,307,309,311,313,315,317,319,321,323,325,327,329,331,333,335,337,339,341,343,345,347,349,351,353,355,357,359,361,363,365,367,369,371,373,375,377,379,381,383,385,或其变体,其中核酸编码具有腈水解酶活性的多肽。在本发明的另一个方面,核酸包括核苷酸,具有一个与SEQ ID NO:或其变体具有至少大约50%,51%,52%,53%,54%,55%,56%,57%,58%,59%,60%,61%,62%,63%,64%,65%,66%,67%,68%,69%,70%,71%,72%,73%,74%,75%,76%,77%,78%,79%,80%,81%,82%,83%,84%,85%,86%,87%,88%,89%,90%,91%,92%,93%,94%,95%,96%,97%,98%,99%或更高,或完全同一性(100%相同)的序列。例证性的变体可能包括,例如,SEQ ID NO:195,205,207,209或237的下述变体,在如下位点具有一个或更多突变:位点163-165 AAA,AAG,GGT,GGC,GGA,GGG,CAA或CAG;位点178-180 GAA或GAG;位点331-333TCT,TCC,TCA,TCG,AGT或AGC;位点568-570 CAT,CAC,TCT,TCC,TCA,TCG,AGT,AGC,ACT,ACC,ACA,TCA,TAT,TAC,ATG或ACG;位点571-573 TTA,TTG,CTT,CTC,CTA,CTG,GTT,GTC,GTA,GTG,ATG,ACT,ACC,ACA,GAT,GAC,GGT,GGC,GGA,GGG,GAA,GAG,TAT,TAC或ACG;位点595-597 GAA,GAG,TTA,TTG,CTT,CTC,CTA或CTG;位点646-666 TTA,TTG,CTT,CTC,CTA或CTG;或其任意组合。在本发明的一个方面,相比于由SEQ ID NO编码的多肽,变体编码具有提高的或者降低的对映体选择性的多肽,例如,在3-羟基戊二酰基腈(3-hydroxyglutarylnitrile,HGN)到(R)-4-氰基-3-羟基丁酸酯的转化中。
【00013】在本发明的一个方面,核酸包括核苷酸,具有一个与SEQ ID NO:或其变体基本上相同的序列。在另一个方面,本发明提供了一种分离核酸或重组核酸,该核酸包括连续核苷酸,该核苷酸具有一个与SEQ ID NO:33具有至少79%同一性的序列,其中所述核酸编码具有腈水解酶活性的多肽。本发明提供了所述核酸的片段,其中所述片段编码具有腈水解酶活性的多肽。本发明也提供了与所述核酸中任何一个互补的分离核酸或重组核酸。本发明也提供了与所述核酸中任何一个在严格条件下杂交的分离核酸或重组核酸。一方面,严格条件包括至少50%甲酰胺,和大约37℃到大约42℃。
【00014】本发明提供了一个核酸探针,包括从大约15个核苷酸到大约10,15,20,25,30,35,40,45,50,55,60,65,70,75,80,85,90,95,100,150,200,250,300,350,400,450,500个或更多个核苷酸,其中至少10,11,12,13,14,15,16,17,18,19,20个或更多个连续核苷酸与如下序列所描述的核酸序列内的核酸靶区域有至少50%互补性:SEQ ID NO:1,3,5,7,9,11,13,15,17,19,21,23,25,27,29,31,35,37,39,41,43,45,47,49,51,53,55,57,59,61,63,65,67,69,71,73,75,77,79,81,83,85,87,89,91,93,95,97,99,101,103,105,107,109,111,113,115,117,119,121,123,125,127,129,131,133,135,137,139,141,143,145,147,149,151,153,155,157,159,161,163,165,167,169,171,173,175,177,179,181,183,185,187,1 89,191,193,195,197,199,201,203,205,207,209,211,213,215,217,219,221,223,225,227,229,231,233,235,237,239,241,243,245,247,249,251,253,255,257,259,261,263,265,267,269,271,273,275,277,279,281,283,285,287,289,291,293,295,297,299,301,303,305,307,309,311,313,315,317,319,321,323,325,327,329,331,333,335,337,339,341,343,345,347,349,351,353,355,357,359,361,363,365,367,369,371,373,375,377,379,381,383,385,其变体,或它们的补体。一方面,核酸探针包括连续核苷酸,这些核苷酸与核酸靶区域具有至少55%的互补性。一方面,本发明提供了一个核酸探针,其中连续核苷酸与核酸靶区域具有至少50%,51%,52%,53%,54%,55%,56%,57%,58%,59%,60%,61%,62%,63%,64%,65%,66%,67%,68%,69%,70%,71%,72%,73%,74%,75%,76%,77%,78%,79%,80%,81%,82%,83%,84%,85%,86%,87%,88%,89%,90%,91%,92%,93%,94%,95%,96%,97%,98%,99%或更高或者100%互补性。另一方面,核酸主要包括大约20到大约50个核苷酸。在其它方面,核酸的长度可以是至少大约20,25,30,35,40,45,50,75,100,150个核苷酸。
【00015】本发明提供了一种核酸载体,该核酸载体能在宿主细胞中复制,其中所述载体包括本发明的核酸。本发明也提供了包括所述核酸的宿主细胞。本发明也提供了一种包括该宿主细胞的宿主生物体。一方面,该宿主生物体包括革兰氏阴性细菌、革兰氏阳性细菌或真核生物。另一方面,革兰氏阴性细菌包括大肠杆菌(Escherichia coli)或荧光假单胞菌(Pseudomonas fluorescens)。在进一步的一方面,革兰氏阳性细菌包括戴弗萨链霉菌(Streptomyces diversa)、加氏乳杆菌(Lactobacillus gasseri)、乳酸乳球菌(Lactococcus lactis)、乳酸乳球菌乳脂亚种(Lactococcus cremoris)或枯草杆菌(Bacillus subtilis)。在进一步的一方面,真核生物包括酿酒酵母(Saccharomyces cerevisiae)、非洲粟酒裂殖酵母(Schizosaccharomyce pombe)、巴斯德毕赤酵母(Pichia pastoris)、乳酸克鲁维酵母(Kluyveromyces lactis)、Hansenula plymorpha或黑曲霉(Aspergillus niger)。
【00016】本发明提供了一种分离核酸或重组核酸,该核酸编码多肽,该多肽包括氨基酸,所述氨基酸具有一个与如下序列具有至少50%同一性的序列,SEQ ID NO:2,4,6,8,10,12,14,16,18,20,22,24,26,28,30,32,34,36,38,40,42,44,46,48,50,52,54,56,58,60,62,64,66,68,70,72,74,76,78,80,82,84,86,88,90,92,94,96,98,100,102,104,106,108,110,112,114,116,118,120,122,124,126,128,130,132,134,136,138,140,142,144,146,148,150,152,154,156,158,160,162,164,166,168,170,172,174,176,178,180,182,184,186,188,190,192,194,196,198,200,202,204,206,208,210,212,214,216,218,220,222,224,226,228,230,232,234,236,238,240,242,244,246,248,250,252,254,256,258,260,262,264,266,268,270,272,274,276,278,280,282,284,286,288,290,292,294,296,298,300,302,304,306,308,310,312,314,316,318,320,322,324,326,328,330,332,334,336,338,340,342,344,346,348,350,352,354,356,358,360,362,364,366,368,370,372,374,376,378,380,382,384,386或其变体,其中多肽具有腈水解酶活性。一方面,多肽包括与SEQ ID NO或其变体具有至少大约50%,51%,52%,53%,54%,55%,56%,57%,58%,59%,60%,61%,62%,63%,64%,65%,66%,67%,68%,69%,70%,71%,72%,73%,74%,75%,76%,77%,78%,79%,80%,81%,82%,83%,84%,85%,86%,87%,88%,89%,90%,91%,92%,93%,94%,95%,96%,97%,98%,99%或更高或100%同一性的氨基酸。例证性的变体可以包括,例如,SEQ ID NO:196,206,208,210或238的下述变体,具有一个或多个突变:在残基55处赖氨酸、甘氨酸或谷氨酰胺;在残基60处谷氨酸;在残基111处丝氨酸;在残基190处丝氨酸、组氨酸、酪氨酸或苏氨酸;在残基191处亮氨酸、缬氨酸、蛋氨酸、天冬氨酸、甘氨酸、谷氨酸、酪氨酸或苏氨酸;在残基199处谷氨酸或亮氨酸;在残基222处亮氨酸;或其任意组合。
【00017】本发明也提供了一种分离核酸或重组核酸,该核酸编码包括至少10个连续氨基酸的多肽,所述氨基酸具有一个与如下序列的一个氨基酸序列的一部分相同的序列,SEQ ID NO:2,4,6,8,10,12,14,16,18,20,22,24,26,28,30,32,34,36,38,40,42,44,46,48,50,52,54,56,58,60,62,64,66,68,70,72,74,76,78,80,82,84,86,88,90,92,94,96,98,100,102,104,106,108,110,112,114,116,118,120,122,124,126,128,130,132,134,136,138,140,142,144,146,148,150,152,154,156,158,160,162,164,166,168,170,172,174,176,178,180,182,184,186,188,190,192,194,196,198,200,202,204,206,208,210,212,214,216,218,220,222,224,226,228,230,232,234,236,238,240,242,244,246,248,250,252,254,256,258,260,262,264,266,268,270,272,274,276,278,280,282,284,286,288,290,292,294,296,298,300,302,304,306,308,310,312,314,316,318,320,322,324,326,328,330,332,334,336,338,340,342,344,346,348,350,352,354,356,358,360,362,364,366,368,370,372,374,376,378,380,382,384,386或其变体。
【00018】一种分离多肽或重组多肽,该多肽包括氨基酸,所述氨基酸具有一个序列,该序列至少大约50%与如下序列相同,SEQ ID NO:2,4,6,8,10,12,14,16,18,20,22,24,26,28,30,32,34,36,38,40,42,44,46,48,50,52,54,56,58,60,62,64,66,68,70,72,74,76,78,80,82,84,86,88,90,92,94,96,98,100,102,104,106,108,110,112,114,116,118,120,122,124,126,128,130,132,134,136,138,140,142,144,146,148,150,152,154,156,158,160,162,164,166,168,170,172,174,176,178,180,182,184,186,188,190,192,194,196,198,200,202,204,206,208,210,212,214,216,218,220,222,224,226,228,230,232,234,236,238,240,242,244,246,248,250,252,254,256,258,260,262,264,266,268,270,272,274,276,278,280,282,284,286,288,290,292,294,296,298,300,302,304,306,308,310,312,314,316,318,320,322,324,326,328,330,332,334,336,338,340,342,344,346,348,350,352,354,356,358,360,362,364,366,368,370,372,374,376,378,380,382,384,386或其变体,其中所述多肽具有腈水解酶活性。在本发明的一方面,多肽包括氨基酸,所述氨基酸具有一个序列,该序列至少大约50%,51%,52%,53%,54%,55%,56%,57%,58%,59%,60%,61%,62%,63%,64%,65%,66%,67%,68%,69%,70%,71%,72%,73%,74%,75%,76%,77%,78%,79%,80%,81%,82%,83%,84%,85%,86%,87%,88%,89%,90%,91%,92%,93%,94%,95%,96%,97%,98%,99%或更高或100%与SEQ ID NO:或其变体相同。
【00019】本发明提供了一种分离核酸或重组核酸,该核酸包括核苷酸,所述核苷酸具有一个序列,所述序列以如下的SEQ ID NO:1,3,5,7,9,11,13,15,17,19,21,23,25,27,29,31,35,37,39,41,43,45,47,49,51,53,55,57,59,61,63,65,67,69,71,73,75,77,79,81,83,85,87,89,91,93,95,97,99,101,103,105,107,109,111,113,115,117,119,121,123,125,127,129,131,133,135,137,139,141,143,145,147,149,151,153,155,157,159,161,163,165,167,169,171,173,175,177,179,181,183,185,187,189,191,193,195,197,199,201,203,205,207,209,211,213,215,217,219,221,223,225,227,229,231,233,235,237,239,241,243,245,247,249,251,253,255,257,259,261,263,265,267,269,271,273,275,277,279,281,283,285,287,289,291,293,295,297,299,301,303,305,307,309,311,313,315,317,319,321,323,325,327,329,331,333,335,337,339,341,343,345,347,349,351,353,355,357,359,361,363,365,367,369,371,373,375,377,379,381,383,385及其变体中的任何一个所阐明(下文称作“A组核酸”)。本发明也涉及与A组核酸序列中的任意序列具有具体说明的最小百分比的序列同一性的核酸。
【00020】另一方面,本发明提供了一种分离(纯化)多肽或重组多肽,该多肽包括氨基酸残基,该残基具有一个如下述序列中的任意一个序列所示的序列,如下述序列为SEQ ID NO:2,4,6,8,10,12,14,16,18,20,22,24,26,28,30,32,34,36,38,40,42,44,46,48,50,52,54,56,58,60,62,64,66,68,70,72,74,76,78,80,82,84,86,88,90,92,94,96,98,100,102,104,106,108,110,112,114,116,118,120,122,124,126,128,130,132,134,136,138,140,142,144,146,148,150,152,154,156,158,160,162,164,166,168,170,172,174,176,178,180,182,184,186,188,190,192,194,196,198,200,202,204,206,208,210,212,214,216,218,220,222,224,226,228,230,232,234,236,238,240,242,244,246,248,250,252,254,256,258,260,262,264,266,268,270,272,274,276,278,280,282,284,286,288,290,292,294,296,298,300,302,304,306,308,310,312,314,316,318,320,322,324,326,328,330,332,334,336,338,340,342,344,346,348,350,352,354,356,358,360,362,364,366,368,370,372,374,376,378,380,382,384,386或其变体,(下文称作“B组氨基酸序列”)。本发明也涉及纯化的多肽,该多肽与B组氨基酸序列中的任意一个序列具有具体说明的最小百分比的序列同一性。
【00021】本发明提供了一个多肽片段,该片段的长度至少是50个氨基酸,并且其中所述片段具有腈水解酶活性。进一步,本发明提供了一种多肽或其片段的肽模拟体(peptidomimetic),该肽模拟体具有腈水解酶活性。本发明提供了一种密码子最优化的多肽或其片段,该片段具有腈水解酶活性,其中密码子使用被最优化,以适合特定生物或细胞。Narum等人Infect.Immun.2001年12月,69(12):7250-3描述了小鼠系统中的密码子最优化。Outchkourov等人Protein Expr.Purif.2002年2月;24(1):18-24描述了酵母系统中的密码子最优化。Feng等人Biochemistry 2000年12月19,39(50):15399-409描述了大肠杆菌(E.coli)中的密码子最优化。Humphreys等人Protein Expr.Purif.2000年11月,20(2):252-64描述了密码子使用如何影响大肠杆菌中的分泌作用。
【00022】一方面,生物体或细胞包括革兰氏阴性细菌、革兰氏阳性细菌或真核生物。在本发明的另一个方面,革兰氏阴性细菌包括大肠杆菌(Escherichia coli)或荧光假单胞菌(Pseudomonas fluorescens)。在本发明的另一个方面,革兰氏阳性细菌包括戴弗萨链霉菌(Streptomyces diversa)、加氏乳杆菌(Lactobacillus gasseri)、乳酸乳球菌(Lactococcus lactis)、乳酸乳球菌乳脂亚种(Lactococcus cremoris)或枯草杆菌(Bacillus subtilis)。在本发明的进一步的一方面,真核生物包括酿酒酵母(Saccharomyces cerevisiae)、非洲粟酒裂殖酵母(Schizosaccharomyce pombe)、巴斯德毕赤酵母(Pichia pastoris)、乳酸克鲁维酵母(Kluyveromyces lactis)、Hansenulaplymorpha或黑曲霉(Aspergillus niger)。
【00023】另一方面,本发明提供了一种纯化抗体,该抗体与本发明的多肽或其具有腈水解酶活性的片段特定地结合。一方面,本发明提供了一个抗体片段,该抗体片段与具有腈水解酶活性的多肽特定地结合。
【00024】本发明提供了一种酶制剂,该酶制剂包括至少一种本发明的多肽,其中所述制剂是液体或干燥的。该酶制剂包括缓冲剂、辅因子或第二或额外的蛋白质。一方面,该制剂被固定在固体载体上。在本发明的一方面,该固体载体可以是凝胶、树脂、聚合物、陶瓷制品、玻璃、微电极及其任意组合。另一方面,该制剂可以被封装在凝胶或珠子中。
【00025】本发明进一步提供了一种组合物,该组合物包括至少一种本发明的核酸,所述核酸包括至少一种本发明的多肽或其具有腈水解酶的片段,或其肽模拟体,或其任意组合。
【00026】本发明提供了一种方法,用于将腈水解为羧酸,包括将该分子在适合腈水解酶活性的条件下与至少一种本发明的多肽或其具有腈水解酶活性的片段,或其肽模拟体接触。一方面,所述条件包括含水条件。另一方面,所述条件包括pH为大约8.0,和/或温度为大约37℃到大约45℃。
【00027】本发明提供了一种方法,用于水解一个分子的羟腈部分(cyanohydrinmoiety)或氨基腈部分(aminonitrile moiety),所述方法包括在适合腈水解酶活性的条件下将该分子与本发明的至少一种多肽,或其具有腈水解酶活性的片段,或其肽模拟体接触。
【00028】本发明提供了一种方法,用于产生手性α-羟基酸分子、手性氨基酸分子、手性β-羟基酸分子、或手性γ-羟基酸分子,所述方法包括将具有羟腈部分或氨基腈部分的分子与至少一种具有对映体选择性的腈水解酶活性的多肽混和,所述多肽具有一个氨基酸序列,该序列与B组氨基酸序列或其片段具有至少50%的同一性,或其肽模拟体。一方面,手性分子是(R)-对映异构体。另一方面,手性分子是(S)-对映异构体。在本发明的一方面,特定酶可以对特定底物具有R-特异性,并且该酶可以对不同特定底物具有S-特异性。
【00029】本发明也提供了一种方法,用于产生组合物或其中间体,该方法包括将该组合物或中间体的前体与本发明的至少一种多肽,或其具有腈水解酶活性的片段或肽模拟体混和,其中所述前体包括羟腈部分或氨基腈部分;水解前体中的羟腈部分或氨基腈部分,从而制备其组合物或中间体。一方面,该组合物或中间体包括(S)-2-氨基-4-苯基丁酸。在进一步的一方面,其组合物或中间体包括L-氨基酸。在进一步的一方面,该组合物包括一种食品添加剂或药用药物。
【00030】本发明提供了一种方法,用于产生(R)-乙基-4-氰基-3-羟基丁酸,该方法包括将羟基戊二酰基腈与至少一种多肽,或其具有腈水解酶活性的片段或肽模拟体接触,选择性地产生(R)-对映异构体,从而制备(R)-乙基-4-氰基-3-羟基丁酸,所述多肽具有一个B组氨基酸序列的氨基酸序列。一方面,效率(ee)是至少95%或至少99%。另一方面,羟基戊二酰基腈包括1,3-二-氰基-2-羟基-丙烷或3-羟基戊二酰基腈。在进一步的一方面,多肽具有B组氨基酸序列的任意一个中的一个氨基酸序列,或其具有腈水解酶活性的片段或肽模拟体。
【00031】本发明也提供了一种方法,用于产生(S)-乙基-4-氰基-3-羟基丁酸,该方法包括将羟基戊二酰基腈与至少一种多肽,或其具有腈水解酶活性的片段或肽模拟体接触,选择性地产生(S)-对映异构体,从而产生(S)-乙基-4-氰基-3-羟基丁酸,所述多肽具有B组氨基酸序列的一个氨基酸序列。
【00032】本发明提供了一种方法,用于产生(R)-扁桃酸,该方法包括将扁桃腈与至少一种多肽,或其具有适当腈水解酶活性的任意片段或肽模拟体混和,所述多肽具有B组氨基酸序列的任意一个氨基酸序列。一方面,(R)-扁桃酸包括(R)-2-氯扁桃酸。另一方面,(R)-扁桃酸包括在邻-、间-或对-位的一个芳香环取代;(R)-扁桃酸的1-萘基衍生物,(R)-扁桃酸的吡啶基衍生物,或(R)-扁桃酸的噻吩基衍生物,或其任意组合。
【00033】本发明提供了一种方法,用于产生(S)-扁桃酸,该方法包括将扁桃腈与至少一种多肽,或其具有腈水解酶活性的任意片段或肽模拟体混和,所述多肽具有B组序列的氨基酸序列。一方面,(S)-扁桃酸包括(S)-甲基苄基氰化物,扁桃腈包括(S)-甲氧基-苄基氰化物。一方面,(S)-扁桃酸包括在邻-、间-或对-位的芳香环取代;(S)-扁桃酸的1-萘基衍生物,(S)-扁桃酸的吡啶基衍生物,或(S)-扁桃酸的噻吩基衍生物,或其任意组合。
【00034】本发明也提供了一种方法,用于产生(S)-苯基乳酸衍生物或(R)-苯基乳酸衍生物,该方法包括将苯基乳腈与至少一种多肽或其任意活性片段或肽模拟体混和,选择性地产生(S)-对映异构体或(R)-对映异构体,从而产生(S)-苯基乳酸衍生物或(R)-苯基乳酸衍生物,所述多肽选自B组氨基酸序列。
【00035】本发明提供了一种方法,用于产生本发明的多肽或其片段,该方法包括(a)在允许宿主细胞产生多肽的条件下将编码多肽的核酸引入宿主细胞,和(b)回收如此所产生的多肽。
【00036】本发明提供了一种方法,用于产生编码具有腈水解酶活性的多肽的核酸变体,其中所述变体具有相对于自然发生者具有改变了的生物活性,该方法包括(a)通过如下步骤修饰核酸:(i)用一个不同的核苷酸替换一个或多个核苷酸,其中核苷酸包括天然或非天然核苷酸;(ii)缺失一个或多个核苷酸,(iii)增加一个或多个核苷酸,或(iv)其任意组合。一方面,非天然核苷酸包括肌苷。另一方面,该方法进一步包括,分析由所修饰的核酸编码的多肽的腈水解酶活性改变,从而识别编码具有改变的腈水解酶活性的多肽的修饰核酸(或者多种核酸)。一方面,步骤(a)的修饰是由如下方法实现的:PCR、易错PCR(error-prone PCR)、改组(shuffling)、寡核苷酸指导的诱变(oligonucleotide-directed mutagenesis)、装配PCR(assembly PCR)、有性PCR诱变(sexual PCR mutagenesis)、体内诱变(invivo mutagenesis)、盒式诱变(cassette mutagenesis)、递归集合诱变(recursiveensemble mutagenesis)、指数集合诱变(exponential ensemble mutagenesis)、位点特异性诱变(site-specific mutagenesis)、基因重装配(gene reassembly)、基因位点饱和诱变(gene site saturated mutagenesis)、连接酶链式反应(ligase chain reaction)、体外诱变(in vitro mutagenesis)、连接酶链式反应(ligase chain reaction)、寡核苷酸合成(oligonucleotide synthesis)、任何产生DNA的技术及其任意组合。另一方面,该方法进一步包括至少重复一次修饰步骤(a)。
【00037】本发明进一步提供了一种方法,用于从两个或更多个核酸产生多核苷酸,该方法包括:(a)识别两个或更多个核酸之间的相同区域和不同区域,其中至少一个核酸包括本发明的核酸;(b)提供一组寡核苷酸,其与所述两个或更多个核酸中的至少两个核酸在序列上相符合;和(c)用聚合酶延伸该寡核苷酸,从而产生多核苷酸。
【00038】本发明进一步提供了一种用于鉴定腈水解酶的筛选分析法,该分析方法包括:(a)提供多个核酸或多肽,包括至少一种本发明的核酸,或至少一种本发明的多肽;(b)从上述多个之中,获得将要测试腈水解酶活性的多肽候选者;(c)测试候选者的腈水解酶活性;和(d)鉴定那些是腈水解酶的多肽候选者。一方面,该方法进一步包括在测试候选者的腈水解酶活性之前,修改至少一种核酸或多肽。另一方面,步骤(c)的测试进一步包括测试包括在宿主细胞或宿主生物中的改良表达。在进一步的一方面,步骤(c)的测试进一步包括在pH值为大约3到大约12的范围内测试腈水解酶活性。在进一步的一方面,步骤(c)的测试进一步包括在pH值为大约5到大约10的范围内测试腈水解酶活性。另一方面,步骤(c)的测试进一步包括在温度为大约4℃到大约80℃的范围内测试腈水解酶活性。另一方面,步骤(c)的测试进一步包括在温度为大约4℃到大约55℃的范围内测试腈水解酶活性。另一方面,步骤(c)的测试进一步包括测试腈水解酶活性,这导致产生了对映体选择性反应产物(enantioseletive reaction product)。另一方面,步骤(c)的测试进一步包括测试腈水解酶活性,这导致产生了区域选择性反应产物(regio-selective reaction product)。
【00039】本发明提供了在如下的方法中使用本发明的核酸,或其具有腈水解酶活性的片段或肽模拟体,该方法是被设计用来优化基因的某一方面或由该基因编码的多肽的某一方面。一方面,该方法包括将修饰引入核酸的核苷酸序列。另一方面,修饰是由如下方法引入的:PCR、易错PCR、寡核苷酸指导的诱变、装配PCR、有性PCR诱变、体内诱变、盒式诱变、递归集合诱变、指数集合诱变、位点特异性诱变、基因重装配、基因位点饱和诱变、连接酶链式反应、体外诱变、连接酶链式反应、寡核苷酸合成、任何其它产生DNA的技术及其任意组合。在进一步的一方面,该方法可以被重复。
【00040】本发明提供了在工业方法中使用本发明的多肽,或其具有将水解酶活性的片段或肽模拟体。一方面,该方法用来产生药物组合物,该方法用来产生化学制品,该方法用来产生食品添加剂,该方法用来催化废物的分解,或该方法用来产生药物中间体。在进一步的一方面,该方法包括使用所述多肽来水解羟基戊二酰基腈底物。在进一步的一方面,该方法用来产生LIPITORTM。另一方面,所用的多肽包括一个多肽,该多肽具有序列SEQ ID NO:44,196,208,210或238或其具有腈水解酶活性的片段的连续氨基酸。另一方面,该方法用来产生洗涤剂。另一方面,该方法用来产生食物产品。
【00041】本发明提供了在制备转基因生物中使用本发明的核酸,或其编码具有腈水解酶活性的多肽的片段。
【00042】本发明提供了一个试剂盒,该试剂盒包括(a)本发明的核酸,或其编码具有腈水解酶活性的多肽的片段,或(b)本发明的多肽,或其具有腈水解酶活性的片段或肽模拟体,或其组合;盒(c)一种缓冲剂。
【00043】本发明提供了一种方法,用来修饰分子,该方法包括:(a)将本发明的多肽或其具有腈水解酶活性的片段或肽模拟体与起始分子混和,以产生一种反应混合物;(b)将起始分子与多肽反应,以产生修饰的分子。
【00044】本发明提供了一种方法,用来鉴定修饰的化合物,该方法包括:(a)将本发明的多肽或其具有腈水解酶活性的片段或肽模拟体与起始化合物混和,以产生一种反应混合物,然后产生修饰的起始化合物的文库;(b)测试该文库,以确定该文库中是否存在表现出期望活性的修饰的起始化合物;(c)鉴定表现出期望活性的修饰化合物。
【00045】本发明提供了一种筛选分析法,用来分析对映体选择性转化,该方法包括:(a)提供具有两个前手性或局部对映部分的分子;(b)标记该分子的至少一个前手性或局部对映部分;(b)通过选择性催化剂修饰两个部分中的至少一个;和(c)通过质谱分析检测结果。筛选分析法可以被用来确定或监控对映体过量百分比(ee),或确定非对映体过量百分比(de)。在该分析方法中有用的一个例证性标记是重同位素或轻(liter)同位素。在该分析方法中有用的选择性催化剂可以是酶。可以采用两个部分予以标记来完成筛选分析。筛选分析法可以在两个方向进行,即从反应物到产物,以及从产物到反应物。
【00046】本发明提供了一种可机读的存储介质,其上已经存储了本发明的核酸,例如包括至少一个核苷酸序列的核酸,选自如下序列:SEQ ID NO:1,3,5,7,9,11,13,15,17,19,21,23,25,27,29,31,35,37,39,41,43,45,47,49,51,53,55,57,59,61,63,65,67,69,71,73,75,77,79,81,83,85,87,89,91,93,95,97,99,101,103,105,107,109,111,113,115,117,119,121,123,125,127,129,131,133,135,137,139,141,143,145,147,149,151,153,155,157,159,161,163,165,167,169,171,173,175,177,179,181,183,185,187,189,191,193,195,197,199,201,203,205,207,209,211,213,215,217,219,221,223,225,227,229,231,233,235,237,239,241,243,245,247,249,251,253,255,257,259,261,263,265,267,269,271,273,275,277,279,281,283,285,287,289,291,293,295,297,299,301,303,305,307,309,311,313,315,317,319,321,323,325,327,329,331,333,335,337,339,341,343,345,347,349,351,353,355,357,359,361,363,365,367,369,371,373,375,377,379,381,383,385及其变体,和/或至少一个选自如下序列的氨基酸序列,SEQ ID NO:2,4,6,8,10,12,14,16,18,20,22,24,26,28,30,32,34,36,38,40,42,44,46,48,50,52,54,56,58,60,62,64,66,68,70,72,74,76,78,80,82,84,86,88,90,92,94,96,98,100,102,104,106,108,110,112,114,116,118,120,122,124,126,128,130,132,134,136,138,140,142,144,146,148,150,152,154,156,158,160,162,164,166,168,170,172,174,176,178,180,182,184,186,188,190,192,194,196,198,200,202,204,206,208,210,212,214,216,218,220,222,224,226,228,230,232,234,236,238,240,242,244,246,248,250,252,254,256,258,260,262,264,266,268,270,272,274,276,278,280,282,284,286,288,290,292,294,296,298,300,302,304,306,308,310,312,314,316,318,320,322,324,326,328,330,332,334,336,338,340,342,344,346,348,350,352,354,356,358,360,362,364,366,368,370,372,374,376,378,380,382,384,386或其变体。
【00047】本发明提供了一种计算机系统,该系统包括一个处理器和一个数据存储设备,其中数据存储设备上已经存储了本发明的核酸,例如包括至少一个核苷酸序列的核酸,选自如下序列:SEQ ID NO:1,3,5,7,9,11,13,15,17,19,21,23,25,27,29,31,35,37,39,41,43,45,47,49,51,53,55,57,59,61,63,65,67,69,71,73,75,77,79,81,83,85,87,89,91,93,95,97,99,101,103,105,107,109,111,113,115,117,119,121,123,125,127,129,131,133,135,137,139,141,143,145,147,149,151,153,155,157,159,161,163,165,167,169,171,173,175,177,179,181,183,185,187,189,191,193,195,197,199,201,203,205,207,209,211,213,215,217,219,221,223,225,227,229,231,233,235,237,239,241,243,245,247,249,251,253,255,257,259,261,263,265,267,269,271,273,275,277,279,281,283,285,287,289,291,293,295,297,299,301,303,305,307,309,311,313,315,317,319,321,323,325,327,329,331,333,335,337,339,341,343,345,347,349,351,353,355,357,359,361,363,365,367,369,371,373,375,377,379,381,383,385,及其变体,和/或至少一个选自如下序列的氨基酸序列,SEQ ID NO:2,4,6,8,10,12,14,16,18,20,22,24,26,28,30,32,34,36,38,40,42,44,46,48,50,52,54,56,58,60,62,64,66,68,70,72,74,76,78,80,82,84,86,88,90,92,94,96,98,100,102,104,106,108,110,112,114,116,118,120,122,124,126,128,130,132,134,136,138,140,142,144,146,148,150,152,154,156,158,160,162,164,166,168,170,172,174,176,178,180,182,184,186,188,190,192,194,196,198,200,202,204,206,208,210,212,214,216,218,220,222,224,226,228,230,232,234,236,238,240,242,244,246,248,250,252,254,256,258,260,262,264,266,268,270,272,274,276,278,280,282,284,286,288,290,292,294,296,298,300,302,304,306,308,310,312,314,316,318,320,322,324,326,328,330,332,334,336,338,340,342,344,346,348,350,352,354,356,358,360,362,364,366,368,370,372,374,376,378,380,382,384,386或其变体。一方面,计算机系统进一步包括一个序列比较算法和一个已经存储了至少一个参考序列的数据存储设备。另一方面,序列比较算法包括识别多态性的计算机程序。
【00048】本发明提供了一种方法,用于识别序列中的特征,该方法包括:(a)将序列输入到计算机中;(b)在计算机上运行序列特征识别程序,以便在序列中识别特征;和(c)识别序列中的特征,其中所述序列包括本发明的核酸,例如包括SEQ ID NO:1-386中至少一个序列的核酸,它的变体,或其任意组合。
【00049】本发明提供了一种测定方法,用于识别多肽的功能片段,该方法包括:(a)获得本发明的至少一个多肽的片段;(b)将步骤(a)的至少一个片段与具有一个羟腈部分或氨基腈部分的底物接触,条件是适合腈水解酶活性;(c)测量由步骤(b)中至少一个片段中的每一个所产生的反应产物的量;和(d)识别能产生腈水解酶反应产物的至少一个片段;从而识别多肽的功能片段。一方面,步骤(a)的片段是通过合成片段得到的。另一方面,步骤(a)的片段是通过断裂多肽获得的。
【00050】本发明提供了一种测定方法,用于识别多肽的功能变体,该方法包括:(a)获得本发明的至少一个多肽的至少一个变体;(b)将步骤(a)的至少一个变体与具有一个羟腈部分或氨基腈部分的底物接触,条件是适合腈水解酶活性;(c)测量步骤(b)的至少一个变体中的每一个所产生的反应产物的量;和(d)识别能产生腈水解酶反应产物的至少一个变体;从而识别多肽的功能变体。
附图简述
【00051】图1显示了化学反应示意图,其中立体选择性腈水解酶水解羟腈或氨基腈,以产生手性α-羟基酸或α-氨基酸。
【00052】图2图解说明了一种基于OPA(邻苯二醛)的氰化物检测分析法,用于鉴定腈水解酶活性的存在。
【00053】图3是用于检测和量化α-羟基酸的光谱系统的图例,所述检测和量化基于立体选择性乳酸脱氢酶。
【00054】图4是用于检测和量化α-氨基酸的光谱系统的图例,所述检测和量化基于立体选择性氨基酸氧化酶。
【00055】图5是一个流程图,举例说明了腈水解酶筛选方法的步骤。
【00056】图6A-6E是用于D-苯基甘氨酸的底物和产物结合的色谱特征,显示了空白样品(图6A),酶反应样品(图6B);在缓冲液中包括细胞溶解产物的阴性对照组(图6C);苯基甘氨酸的手性分析(图6D);和具有D-对映异构体的腈峰的共洗脱(图6E)。
【00057】图7A-7E说明了色谱图,其表征(R)-2-氯扁桃酸的底物和产物结合。图7A显示了缓冲液中仅有2-氯扁桃腈;图7B显示了氯扁桃酸标准。图7C的色谱图显示出现了产物,并且底物峰值表现降低。
【00058】图8A-8B举例说明了(S)-苯基乳酸的底物和产物结合的色谱特征。
【00059】图9A-9B举例说明了L-2-甲基苯基甘氨酸的底物和产物结合的色谱特征。
【00060】图10A-10C举例说明了L-叔-亮氨酸的底物和产物结合的色谱特征。
【00061】图11A-11C举例说明了(S)-2-氨基-6-羟基己酸的底物和产物结合的色谱特征。
【00062】图12A-12D举例说明了4-甲基-D-亮氨酸和4-甲基-L-亮氨酸的底物和产物结合的色谱特征。
【00063】图13A-13B举例说明了(S)-环己基扁桃酸的底物和产物结合的色谱特征。
【00064】图14A-14B举例说明与本发明的筛选分析法有关的、用于定量的两个例证性标准曲线。
【00065】图15举例说明了所选择的化合物,所述化合物可以使用本发明的酶和/或方法从腈水解酶催化的反应产生。
【00066】图16举例说明了所选择的化合物,所述化合物可以使用本发明的酶和/或方法从腈水解酶催化的反应产生。
【00067】图17举例说明了本发明的例证性腈水解酶反应。
发明详述
【00068】本发明涉及腈水解酶、编码腈水解酶的核酸,及其用途。正如此处所用,术语“腈水解酶”包括具有任意腈水解酶活性的任意多肽,例如,能将腈催化为它们的相应羧酸和氨的能力。腈水解酶具有商业用途,如作为生物催化剂用于对映体选择性的芳族和脂族氨基酸或羟基酸的合成。
【00069】腈水解酶化学如下所述:
【00070】制备羟基酸的腈水解酶反应如下:
【00071】制备氨基酸的腈水解酶反应如下:
【00072】此外,在前述水解反应的每一个反应中,消耗了两个水分子,释放了一个氨分子。
【00073】有几种类型的分析方法,用来测试样品中腈水解酶活性的存在,或者用来测试特定多肽是否表现出腈水解酶活性。例如,这些分析方法可以检测由腈水解酶催化的化学反应中存在还是不存在产物或副产物。例如,腈水解酶活性的存在可以通过分别从羟腈部分或氨基腈部分中产生α-羟基酸或α-氨基酸来检测,腈水解酶活性水平可以通过测量所产生的反应产物的相对量来量化。图1显示了化学反应示意图,使用立体选择性腈水解酶以高产量产生手性α-羟基酸或α-氨基酸。起始材料是醛或亚胺,亚胺是醛与氨反应产生的。醛或亚胺与氰化氢的反应产生了相应的羟腈和氨基腈的对映异构体混合物。然后可以用立体选择性腈来立体选择性地将一个对映异构体转化为相应的α-羟基酸或α-氨基酸。图3示意性地说明了α-羟基酸的立体选择性腈水解酶依赖性产生和分光光度测定,基于α-羟基酸乳酸脱氢酶转化α-羟基酸为相应的α-酮酸,和伴随的可检测染料的氧化-还原作用。图4示意性地说明了α-氨基酸的立体选择性腈水解酶依赖性产生和分光光度测定,基于氨基酸脱氢酶转化α-氨基酸为相应的α-酮酸,和伴随的可检测染料的氧化-还原作用。
【00074】考虑在本发明的实践中使用的腈水解酶包括,那些将腈或羟腈立体选择性地水解为它们的相应酸和氨的腈水解酶。一方面,本发明的腈水解酶可以立体选择性地将腈或羟腈水解为它们的相应酸和氨。腈水解酶包括,例如本发明的腈水解酶,如那些在B组氨基酸序列中所示的腈水解酶。下边的表中所示为一些立体选择性地水解它们的底物的腈水解酶。
【00075】本发明的腈水解酶共有下述额外的特征:
(1)大约333个氨基酸到大约366个氨基酸的全长氨基酸序列,
(2)作为由大约2个亚单位到大约16个亚单位的同多聚体的集合和活性,
(3)连续氨基酸谷氨酸-赖氨酸-半胱氨酸(Glu-Lys-Cys)的催化性三联体(catalytic triad)的存在,
(4)从大约pH5到大约pH9的最适pH,和
(5)从大约0℃到大约100℃,或者从大约40℃到大约50℃的最适温度。
在新腈水解酶中的共有序列
【00076】此处公开的腈水解酶是使用生物信息学和序列比较程序研究的,收集到下述共有序列信息。在腈水解酶多肽中鉴别到三个区域的保守基序(conservedmotifs)。这相当于腈水解酶中存在的催化性三联体(E-K-C)。(H.Pace和C.Brenner(2001年1月15日)“The Nitrilase Superfamily:classification,structure andfunction(腈水解酶超家族:分类,结构和功能)”Genome Biology第2卷,第1期,第1-9页。)
【00077】此处所用的缩写是用于氨基酸的传统的单字母代码:A代表丙氨酸;B代表天冬酰胺或天冬氨酸;C代表半胱氨酸;D代表天冬氨酸;E代表谷氨酸盐,谷氨酸;F代表苯基丙氨酸;G代表甘氨酸;H代表组氨酸;I代表异亮氨酸;K代表赖氨酸;L代表亮氨酸;M代表蛋氨酸;N代表天冬酰胺;P代表脯氨酸;Q代表谷氨酰胺;R代表精氨酸;S代表丝氨酸;T代表苏氨酸;V代表缬氨酸;W代表色氨酸;Y代表酪氨酸;Z代表谷氨酰胺或谷氨酸。参见L.Stryer,Biochemistry(生物化学),1988,W.H.Freeman and Company,纽约。
【00078】在本发明的腈水解酶多肽序列中所进行的计算机序列比较,导致在每一个氨基酸序列中鉴别到这些基序:
F | P | E | t | f |
r | R | K | L | . | P | T |
L | . | C | W | E | h | . | . | P |
【00079】下述残基(那些带下划线的残基)在所有被鉴别的腈水解酶中是完全保守的:第一个基序或区域中的第三个氨基酸(E代表谷氨酸);第二个基序中的第二个残基(R代表精氨酸);第二个基序中的第三个残基(K代表赖氨酸);第三个基序中的第三个残基(C代表半胱氨酸);和第三个基序中的第五个残基(E代表谷氨酸)。
【00080】在这些方框中,大写字母表示本发明的腈水解酶中的90%或更高的共有性,而小写字母表示50%或更低的共有性。斜体字母表示在本发明的腈水解酶中具有30%或更高的共有性。方框中的圆点表示非保守的残基。
【00081】在Pace和Brenner文章中,腈水解酶超家族的腈水解酶分支中的腈水解酶序列被描述为具有催化性三联体(Genome Biology,2001,第二卷,第一期,第1-9页)。然而,本发明的腈水解酶的催化性三联体区域与先前那些在Pace和Brenner参考文献中所鉴别的催化性三联体在下述方面不同:
【00082】在第一个基序中的区别:第一个基序的第一个方框中的F在本发明腈水解酶中保守性为90%,而不是那些先前所鉴别的腈水解酶所具有的仅仅50%。第一个基序的第四个残基是一个“t”,在本发明的腈水解酶中的苏氨酸,它被发现具有50%或更高的共有性。然而,该残基被Pace和Brenner鉴别为“a”(丙氨酸)。第一个基序的最后一个残基被鉴别是“f”(苯基丙氨酸),被表明以50%或更高的共有性发生。然而,本发明的腈水解酶仅显示“f”(苯基丙氨酸),以30%的共有性发生。
【00083】在第二个基序中的区别:在本发明的腈水解酶的第二个基序的第一个方框中有一个“r”(精氨酸)。然而,Pace和Brenner共有性表明在那个位置有一个“h”(组氨酸)。第二个方框中的“R”(精氨酸)在本发明的腈水解酶中是完全保守的,然而该残基在Pace和Brenner参考文献中仅表现出90%的共有性。第二个基序的第四个方框中的“L”(亮氨酸)在90%或更高百分比的本发明的腈水解酶中是保守的。然而,Pace和Brenner腈水解酶仅仅表明该残基在50%的序列中具有保守性。同样地,第二个基序的第六个方框中的“P”(脯氨酸)在90%或更高百分比的本发明的腈水解酶中是保守的。然而,Pace和Brenner腈水解酶仅仅表明该残基在50%的序列中具有保守性。
【00084】第三个基序中的区别:第一个方框中的“L”在90%或更高百分比的本发明的腈水解酶中是保守的。然而,Pace和Brenner参考文献仅表明该残基当时显示50%。最后,本发明的腈水解酶中的第三个基序中的第六个方框当时表明组氨酸为50%。然而,Pace和Brenner参考文献表明,该位置当时表明天冬酰胺为50%。
【00085】本发明提供了一种具有腈水解酶活性的分离多肽,该多肽包括三个区域,其中第一个区域包括五个氨基酸,其中第一个区域的第一个氨基酸是F,第一个区域的第四个氨基酸是T。本发明也提供了一种具有腈水解酶活性的分离多肽,该多肽包括三个区域,其中第二个区域包括七个氨基酸,其中第二个区域的第一个氨基酸是R,其中第二个区域的第二个氨基酸是R,其中第二个区域的第六个氨基酸是P。本发明也提供了一种具有腈水解酶活性的多肽,该多肽包括三个区域,其中第三个区域包括九个氨基酸,其中第三个区域的第一个氨基酸是L,第三个区域的第六个氨基酸是H。
【00086】本发明也提供了一种具有腈水解酶活性的分离多肽,该多肽包括三个共有子序列,其中第一个共有子序列是FPETF,其中第二个共有子序列是RRKLXPT,其中第三个共有子序列是LXCWEHXXP。
【00087】本发明也提供了一种具有腈水解酶活性的分离多肽,该多肽包括三个共有子序列,其中第一个共有子序列是FPEXX,其中第二个共有子序列是XRKLXPT,其中第三个共有子序列是LXCWEXXXP。
【00088】根据本发明,提供了产生对映体纯的α-取代羧酸的方法。由本发明的方法产生的对映体纯的α-取代羧酸具有如下结构:
其中:
R1≠R2,另外R1和R2独立地是-H,取代或未取代的烷基、烯基、炔基、芳基、杂芳基、环烷基或杂环,其中所述取代基是低级烷基、羟基、烷氧基、氨基、巯基、环烷基、杂环、芳基、杂芳基、芳氧基或卤素,或任意地R1和R2是直接或非直接共价结合,以形成一个功能环部分,E是-N(RX)2或-OH,其中每一个RX独立地是-H或低级烷基。
【00089】正如此处所用,术语“烷基”指1到24个碳原子的直链或支链或环烃基,包括甲基、乙基、n-丙基、异丙基、n-丁基、异丁基、叔-丁基、n-戊基、n-己基及其类似基团。术语“低级烷基”指1个到大约6个碳原子的单价直链或支链或环基。
【00090】正如此处所用,“烯基”指具有一个或多个碳碳双键,并且具有大约2个到大约24个碳原子的直链或支链或环烃基。
【00091】正如此处所用,“炔基”指具有至少一个碳碳三键,并且具有大约2个到大约24个碳原子的直链或支链或环烃基。
【00092】正如此处所用,“环烷基”指含有大约3到大约14个碳原子的环烃基。
【00093】正如此处所用,“杂环”指具有一个或多个杂环原子(例如N,O,S,P,Se,B等等)作为环结构的一部分,并且具有大约3个到大约14个碳原子的环状基团。
【00094】正如此处所用,“芳基”指具有大约6个到大约14个碳原子的芳香基团(即具有共轭双键系统的环状基团)。
【00095】正如此处所用,关于化学基团或部分,术语“取代的”指这样的一个基团或部分进一步具有一个或多个非氢取代基。这样的取代基的实例包括,但不限于,氧(例如在酮、醛、醚或酯中)、羟基、烷氧基(低级烷基的)、氨基、硫基、巯基(低级烷基的)、环烷基、取代的环烷基、杂环、取代的杂环、芳基、取代的芳基、杂芳基、取代的杂芳基、芳氧基、取代的芳氧基、卤素、三氟甲基、氰基、硝基、硝酮基、氨基、酰氨基、-C(O)H、酰基、氧酰基、羧基、氨基甲酸酯、磺酰基、磺酰胺、硫酰基及其类似基团。
【00096】在优选的方面,由本发明的方法所产生的对映体纯的α-取代的羧酸是α-氨基酸或α-羟基酸。在一些方面,对映体纯的α-氨基酸是D-苯基丙氨酸、D-苯基甘氨酸、L-甲基苯基甘氨酸、L-叔-亮氨酸、D-丙氨酸或D-羟基正亮氨酸((S)-2-氨基-6-羟基己酸)、R-泛内酯、2-氯扁桃酸或(S)-或(R)-扁桃酸,对映体纯的α-羟基酸是(S)-环己基扁桃酸。正如此处所用,“小分子”包括分子量为至少25道尔顿的任意分子。
【00097】此处所用的术语“大约”意思是大概、粗略地、左右或某一个范围内。当术语“大约”与数字范围联合使用时,它通过扩展所罗列的数值的上界和下界修改该范围。总的来说,此处使用术语“大约”在所标明的数值的上下修改该数值,范围是上下20%(更高或更低)。
【00098】正如此处所用,单词“或者”指特定列表中的任意一个成员,也包括该列表中的成员的任意组合。
【00099】此处所用的短语“核酸”指自然发生或合成的寡核苷酸或多核苷酸,是DNA或RNA或DNA-RNA杂交体,单链或双链,有义或反义,能通过沃森-克里克碱基配对(Watson-Crick base-pairing)杂交到一个互补核酸上。本发明的核酸也包括核苷酸类似物(例如BrdU),非磷酸二酯核苷酸间键合(例如肽核酸(PNA)或硫二酯键合)。尤其是,核酸可以包括,但不限于,DNA,RNA,cDNA,gDNA,ssDNA或dsDNA或其任意组合。在一些方面,本发明的“核酸”包括,例如编码B组氨基酸序列中所示的多肽及其变体。此处所用的短语“核酸序列”指缩写词、字母、字符或单词的连续列表,其代表核苷酸。一方面,核酸可以是“探针”,其是相对短的核酸,其长度通常少于100个核苷酸。通常,核酸探针的长度通常是从大约50个核苷酸到大约10个核苷酸。核酸的“靶区域”是被鉴定为相关的核酸中的一部分。
【000100】核酸的“编码区”是核酸的一部分,所述核酸当被置于适当调节序列的控制之下时,以序列特异性方式被转录和翻译,以产生特定多肽或蛋白质。编码区指示编码这样的多肽或蛋白质。
【000101】术语“基因”指可操作性地连接到适当调节序列上的编码区,所述调节序列能以某种方式调节多肽的表达。基因包括DNA的非转录调节区(例如启动子、增强子、阻遏子等等),编码区(开放式阅读框,ORF)的前面区域(上游区)和后面区域(下游区),以及,如果适用,个体编码区(即外显子)之间的间插序列(即内含子)。
【000102】此处所用的“多肽”指如何肽、寡肽、多肽、基因产物、表达产物或蛋白质。多肽包括连续氨基酸。术语“多肽”包括自然发生分子或合成分子。
【000103】此处,正如此处所用,术语“多肽”指通过肽键或修饰的肽键互相结合的氨基酸,例如肽等排物,可以包括除了20个由基因编码的氨基酸以外的修饰氨基酸。多肽可以用天然方法来修饰,如翻译后加工,或通过本技术领域熟知的化学修饰技术进行修饰。修饰可以在多肽中的任意位置发生,包括肽骨架、氨基酸侧链和氨基或羧基末端。应该意识到,相同类型的修饰可以在给定多肽中的几个位点以相同或可变程度存在。而且,给定多肽可以有许多种类型的修饰。这些修饰包括,但不限于,乙酰化、酰化、ADP-核糖基化、酰胺化、共价交联或环化、黄素的共价连接、血红素部分的共价连接、核苷酸或核苷酸衍生物的共价连接、脂质或脂质衍生物的共价连接、磷脂酰肌醇(phosphytidylinositol)的共价连接、二硫键形成、去甲基化、半胱氨酸或焦谷氨酸酯的形成、甲酰化、γ-羧化、糖基化、GPI锚的形成、羧基化、碘化、甲基化、十四酰基化、氧化、pergylation、蛋白酶解加工、磷酸化、异戊二烯化、外消旋作用、硒化作用(selenoylation)、硫酸盐化、转运-RNA介导的添加氨基酸到蛋白质中的作用,如精氨酰化。(参考Proteins-Structure and Molecular Properties 2nd Ed.,T.E.Creighton,W. H.Freemanand Company,Ed.,Academic Press,New York,pp.1-12(1983))。
【000104】正如此处所用,术语“氨基酸序列”指代表氨基酸残基的缩略语、字母、字符或单词的列表。
【000105】正如此处所用,术语“分离的”指已经从其原始环境中离开的材料。例如,在活的动物体中存在的自然发生的多核苷酸或多肽就不是分离的,但从自然体系中的一些或所有共存材料中分离的相同多核苷酸或多肽就是分离的。这样的多核苷酸可以是载体的一部分,和/或这样的多核苷酸或多肽可以是组合物的一部分,多核苷酸是分离的,其原因在于这样的载体或组合物不是其原始环境的一部分。
【000106】关于核酸,正如此处所用,术语“重组的”指核酸与一个核酸是共价连接且邻接的,在其自然环境中,这两个核酸不是邻接的。此外,正如此处所用,关于核酸群体中的特定核酸,术语“富集的”指核酸代表分子群体中的核酸数量的5%或更多。典型地,富集核酸代表分子群中的15%或更多数量的核酸。更典型地,富集核酸代表分子群中的50%、90%或更多数量的核酸。
【000107】“重组的”多肽或蛋白指重组DNA技术产生的多肽或蛋白,即由编码目标多肽或蛋白的外源重组DNA构建物所转化的细胞产生的。“合成的”多肽或蛋白是用化学合成法制备的多肽或蛋白(例如固相肽合成法)。化学肽合成法在本技术领域是已知的(例如参见Merrifield(1963),Am.Chem.Soc.85:2149-2154;Geysen等人(1984),Proc.Natl.Acad.Sci.,USA81:3998),合成试剂盒以及自动肽合成仪可以通过商业途径获得(例如Cambridge Research Biochemicals,Cleveland,United Kingdom;来自Applied Biosystems,Inc.,Foster City,CA的Model 431 A合成仪)。该设备提供了即刻得到本发明的肽的途径,或者通过直接合成,或者通过合成一系列用其它已知技术可以连接的片段。
【000108】正如此处所用,关于核酸或氨基酸序列对,“同一性”指两个序列在序列内的能够比对的位置上不予变化的程度。两个给定序列之间的同一性百分比可以使用算法来计算,如BLAST(Altschul等人(1990),J.Mol.Biol.215:403-410)。参见
www.ncbi.nlm.nih.gov/Education/BLASTinfo。当使用BLAST算法时,对于不超过250个核苷酸或大约80个氨基酸的序列(“短查询”),搜索参数可以如下:过滤器为关闭,打分矩阵为PAM30,字符大小为3或2,E值为1000或更高,空位成本为11,1。对于长度超过250个核苷酸或80个氨基酸残基的序列,可以使用缺省搜索参数。BLAST网站提供了在这样的情况下遵循的用于特定情况的建议。
【000109】正如此处所用,“同源性”在核苷酸序列的上下文中与“同一性”代表相同的意思。然而,关于氨基酸序列,“同源性”包括同样的氨基酸取代和保守性氨基酸取代的百分比。同源性百分比可以根据Smith和Waterman(1981),Adv.Appl.Math.2:482计算。
【000110】正如此处所用,在两个或更多个核酸序列的情形中,两个序列是“基本上同一的”,是在当使用上面所描述的已知的序列比较算法进行测量,进行比较和比对,以达到最大对应时,它们具有至少99.5%的核苷酸同一性之时。此外,为了确定序列是否是基本上同一的,编码区中的同义密码子可以被认为是同一的,原因在于遗传密码子的简并性。典型地,确定基本上同一性的区域必须至少跨过20个残基,在最通常情况下,这些序列越过至少大约25-200个残基时是基本上同一的。
【000111】正如此处所用,在两个或更多个氨基酸序列的情形中,两个序列是“基本上同一的”,是在当使用上面所描述的已知的序列比较算法进行测量,比较和比对,以达到最大对应时,它们具有至少99.5%的同一性之时。此外,为了确定序列是否是基本上同一的,如果多肽基本上保持其生物学功能,那么保守性氨基酸取代可以被认为是同一的。
【000112】“杂交”指一种过程,通过该过程,核酸链通过氢键在互补碱基处与互补链结合。杂交分析法可以是灵敏的和有选择性的,以便相关的特定序列可以被鉴别,即使在以低浓度存在的样品中也可以被鉴别。严格条件是通过盐或者甲酰胺在预杂交和杂交溶液中的浓度,或者通过杂交温度来定义的,这些条件在本技术领域是已知的。可以通过降低盐的浓度,增加甲酰胺的浓度,或者升高杂交温度来增加严格性。尤其是,正如此处所用,“严格的杂交条件”包括42℃,在50%甲酰胺,5X SSPE,0.3%SDS,和200ng/ml剪接和变性鲑精DNA,及其等价物中。上述范围和条件的变化在本技术领域是已知的。
【000113】术语“变体”是指在一个或多个核苷酸或氨基酸残基上被修饰的本发明的多核苷酸或多肽(分别地),其中被编码的多肽或多肽保持了腈水解酶活性。变体可以通过任意数量的方法产生,例如易错PCR、改组、寡核苷酸指导的诱变、装配PCR、有性PCR诱变、体内诱变、盒式诱变、递归集合诱变、指数集合诱变、位点特异性诱变、基因重装配、基因位点饱和诱变或其任意组合。
【000114】对基于已知序列产生肽模拟体的方法已经有所描述,例如在美国专利号5,631,280;5,612,895;和5,579,250。肽模拟体的用途可以包括在给定位置引入一个具有非酰胺连接的非氨基酸残基。本发明的一方面是肽模拟体,其中该化合物具有一个键、一个肽骨架或一个氨基酸成分,被用一个合适的模拟型(mimic)替换。可以是合适的氨基酸模拟型的非天然氨基酸的实例包括β-丙氨酸、L-α-氨基丁酸、L-γ-氨基丁酸、L-α-氨基异丁酸、L-e-氨基己酸、7-氨基庚酸、L-天冬氨酸、L-谷氨酸、N-ε-Boc-N-α-CBZ-L-赖氨酸、N-ε-Boc-N-α-Fmoc-L-赖氨酸、L-蛋氨酸砜、L-正亮氨酸、L-正缬氨酸、N-α-Boc-N-δCBZ-L-鸟氨酸、N-δ-Boc-N-α-CBZ-L-鸟氨酸、Boc-p-硝基-L-苯丙氨酸、Boc-羟基脯氨酸、Boc-L-硫代脯氨酸。
【000115】正如此处所用,“小分子”包括分子量在大约20道尔顿到大约1.5千道尔顿之间的分子。
【000116】分子生物学技术,如亚克隆,是使用常规方法进行的,这些方法对本技术领域的熟练技术人员是已知的。(Sambrook,J.Fritsch,EF,Maniatis,T.(1989)分子克隆:实验室手册(第二版)(Molecular Cloning:A Laboratory Mannual(2nded.),Cold Spring Harbor Laboratory Press,Plainview NY.)。
计算机系统
【000117】在本发明的一方面,本发明的任意核酸序列和/或多肽序列可以在任何可以通过计算机阅读和访问的介质上被保存、记录和操作。正如此处所用,单词“被记录”和“被保存”指在计算机介质上保存信息的过程。本发明的另一方面是一种计算机可读的介质,其上已经存储了至少2、5、10、15或20个如SEQ IDNOS:1-386中所示的核酸序列,以及与其基本上同一的序列。在进一步的一方面,另一方面是通过计算机在本发明的核酸序列或多肽序列之中和之间进行的比较,以及在本发明的序列和其它序列之中和之间进行的比较。计算机可读的介质包括磁性可读介质、光学可读介质、电子可读介质和磁性/光学介质。例如,计算机可读介质可以是硬盘、软盘、磁带、CD-ROM、数字化视频光盘(DVD)、随机存取存储器(RAM)或只读存储区(ROM),以及本技术领域熟练技术人员已知的其它类型的其它介质。
【000118】本发明的一些方面包括一些系统(例如基于因特网的系统),尤其是存储和操作此处描述的序列信息的计算机系统。正如此处所用,“计算机系统”指硬件部分、软件部分和用于分析序列的数据存储部分(或者是核酸,或者是多肽),所述序列如SEQ ID NOS:1-386中的至少任意一个序列,以及与其基本上同一的序列。计算机系统通常包括一个用于处理、访问和操作序列数据的处理器。处理器可以是任意已知类型的中央处理单元,例如英特尔公司的Pentium III,或Sun、Motorola、Compaq、AMD或国际商用机器公司(International Business Machines)的类似处理器。
【000119】通常计算机系统是一个普通目的的系统,该系统包括处理器和用于存储数据的一个或多个内部数据存储部件,和用于检索存储在数据存储部件上的数据的一个或多个数据检索设备。
【000120】在一个特定的方面,计算机系统包括一个连接到总线上的处理器,所述总线连接到主存储器上(优选地以RAM实现),和一个或多个内部数据存储设备,如硬盘驱动器和/或其它已经存储了数据的计算机可读介质。在一些方面,计算机系统进一步包括一个或多个数据检索设备,用于读取存储在内部数据存储设备上的数据。
【000121】数据检索设备可以代表,例如软盘驱动器、高密度磁盘驱动器、磁带驱动器或能够连接到远程数据存储系统上的调制解调器(例如通过因特网)等等。在一些方面,内部数据存储设备是一个可移动的计算机可读介质,如软盘、高密度磁盘、磁带等等,包括控制逻辑和/或其上存储的数据。计算机系统可以有利地包括适当的软件或通过适当的软件被编程,所述软件用于从已经插入到数据检索设备上的数据存储部件上阅读控制逻辑和/或数据。
【000122】计算机系统包括用于向计算机用户显示输出信号的显示器。也应该注意到,计算机系统可以被连接到网络或广域网中的其它计算机系统上,以提供到计算机系统的集中式访问。在一些方面,计算机系统可以进一步包括序列比较算法。“序列比较算法”指在计算机系统上执行(本地或远程)的一个或多个程序,以便将核苷酸序列与数据存储设备上存储的其它核苷酸序列和/或化合物进行比较。
腈水解酶的用途
【000123】腈水解酶已经被鉴别为产生手性α-羟基酸的关键酶,所述α-羟基酸在精细化学工业中是有用的中间体,并且可以作为药物中间体。本发明的腈水解酶可以用于催化羟腈或氨基腈的立体选择性水解,以分别产生手性α-羟基酸和α-氨基酸。
【000124】立体选择性酶相对于化学拆分方法(chemical resolution methods)提供了一个主要优点,因为它们不需要苛刻的条件并且能更好地与环境兼容。尤其相关的腈水解酶的用途是产生手性氨基酸和α-羟基酸。使用立体选择性腈水解酶,可以构建动态拆分条件,这是由于在含水条件下底物的外消旋作用。从而可以获得100%的理论产量。
【000125】本发明涉及腈水解酶,所述腈水解酶已经被发现并且从天然存在的原料中分离出来。本发明也涉及从多种和极端环境来源中演变出新颖基因和基因途径。在开发最广泛种类的可以利用的酶的努力中,DNA是从样品中直接提取的,所述样品是从地球上的不同栖息地中收集的样品。依靠这些努力,开发了世界上最大的环境基因文库的集合。通过这些文库的广泛的高通量筛选,到目前为止已经发现了192种序列独特的新腈水解酶。在本发明之前,在文献和公开数据库中已经报道了不到20种微生物来源的腈水解酶。
【000126】生物催化剂,如腈水解酶在催化活的生物体中的代谢反应中起着重要作用。此外,已经发现生物催化剂在化学工业中有用,在这些化学工业中生物催化剂可以催化许多不同的反应。在腈水解酶用途中的优点的一些实例是:它们提供了高对映的、化学的和区域选择性;它们在温和的反应条件下起作用;它们提供对产物的直接获得途径-具有最小保护措施;它们具有高催化效率;与化学催化剂相比它们产生减少的废物;它们更容易以酶或细胞的形式被固定化;它们是可回收的、可循环的,能通过分子生物学技术被操纵;它们可以在整个细胞过程中被再生;它们可以忍受有机溶剂;重要地是,它们可以被进化或最优化。此处将这些最优化的腈水解酶展示在这里,作为本发明的工作实例。
【000127】腈水解酶催化产生相应羧酸的腈部分的水解。腈的传统化学水解需要强酸或强碱和高温。然而,本发明的一个优点是,提供了在温和条件下进行这一反应的腈水解酶。可以通过具有高对映的、化学的和区域选择性的腈水解酶转化宽范围的腈底物。
表1-本发明的腈水解酶的一些特征
以前发现的腈水解酶 | 新腈水解酶 | |
局限性 | 新特征 | 优点 |
所报道的少于20种 | 新发现的多于180种 | 可得到更广泛的底物范围 |
同源性 | 独特的腈水解酶,这些酶中的许多与先前已知的腈水解酶具有极小的同源性 | |
窄底物活性谱 | 广底物活性谱 | |
极少显示出对映选择性 | 对映选择性;两种对映异构体均可得到 | 具有高对映异构体过量和最小废物产生的产物 |
有限的稳定性曲线 | 在多种条件下都稳定 | 在大范围的处理条件中的潜在用途 |
不一致的供应 | 一致的供应 | 产物的可靠来源 |
是不可应用的 | 可以最优化 | 良好来源材料产生较好产物 |
【000128】动态动力学拆分:腈水解酶的使用允许在两个快速平衡的对映异构体之间实现区别,以便产生单一产物,是以100%的理论产量产生。腈水解酶被用于关键的羟腈或氨基腈的动态拆分,以产生对映异构体纯的α-羧酸和α-氨基酸。此处公开的最新发现的腈水解酶产生了具有>95%对映异构体过量(ee)和具有>95%产量的产物。腈水解酶在含水溶液中或在存在有机溶剂的情况下,在温和的条件下,有效地进行这一转化。
【000129】上面所示的这些产物也包括相对的对映异构体,尽管它们没有被显示出来。一方面,本发明提供了一种分离的核酸,所述核酸具有一个如A组核酸序列中的任意一个序列所示的序列,具有一个与其基本上同一的序列,或具有一个与其互补的序列。
【000130】另一方面,本发明提供了一种分离的核酸,包括至少20个与如A组核酸序列中所示的核苷酸序列的一部分同一的连续核苷酸,具有一个与其基本上同一的序列,或具有一个与其互补的序列。
【000131】另一方面,本发明提供了一种分离的核酸,其编码具有一个如B组氨基酸序列中所示的序列的多肽,或具有一个与其基本上同一的序列。
【000132】另一方面,本发明提供了一种一种分离的核酸,其编码一个多肽,该多肽具有至少10个与B组氨基酸序列中所示的序列的一部分同一的连续氨基酸,具有一个与其基本上同一的序列。
【000133】仍然在另一方面,本发明提供了一个基本上纯化的多肽,所述多肽包括连续氨基酸残基,其具有一个如B组氨基酸序列中所示的序列,或具有一个与其基本上同一的序列。
【000134】另一方面,本发明提供了一种分离的抗体,该抗体与本发明的多肽特定地结合。本发明也提供了该抗体的一个片段,其保留了与多肽特定地结合的能力。
【000135】另一方面,本发明提供了一种方法,用于产生具有一个如B组氨基酸序列和与其基本上同一的序列中所示的序列的多肽。该方法包括引导编码多肽的核酸进入宿主细胞,其中该核酸可操作性地连接到启动子上,以及包括在允许核酸表达的条件下培养宿主细胞。
【000136】另一方面,本发明提供了一种方法,用于从B组氨基酸序列和与其基本上同一的序列中所示的序列,产生具有至少10个连续氨基酸的多肽。该方法包括引导编码多肽的核酸进入宿主细胞,其中核酸可操作性地连接到启动子上,以及包括在允许核酸表达的条件下培养宿主细胞,从而产生多肽。
【000137】另一方面,本发明提供了一种方法,用于产生腈水解酶的变体,包括选择如A组核酸序列中所示的核酸序列,并且将序列中的一个或多个核苷酸变化为另一个核苷酸,删除该序列中的一个或多个核苷酸,或将一个或多个核苷酸加入到该序列中。
【000138】另一方面,本发明提供了用于鉴别B组氨基酸序列的功能变体的分析方法,这些功能变体保留了B组氨基酸序列的多肽的酶功能。这些分析方法包括将包括连续氨基酸残基的多肽与底物分子在允许多肽发挥作用的条件下接触,所述残基具有一个与B组氨基酸序列或其部分的序列同一的序列,具有一个与B组氨基酸序列或其部分的序列基本上同一的序列,或具有一个是B组氨基酸序列的一个序列变体的序列,该变体保留了腈水解酶活性;检测底物水平的降低,或者多肽和底物之间反应的特定反应产物水平的增加;从而鉴别这些序列的功能变体。
本发明所述多肽的修饰
【000139】酶是高度选择性催化剂。它们的特点是催化具有精美的立体-选择性、区域-选择性和化学-选择性的反应,这一特点是传统合成化学方法所无法匹敌的。而且,这些酶是显著多用途的。它们可以被改装,以便在有机溶剂中发挥作用,在极端pH(例如酸性或碱性条件)、极端温度(例如高温和低温)、极端盐度水平(例如高盐度和低盐度)下操作,并且催化与它们的天然的、生理学底物在结构上没有关系的化合物的反应,但是在酶活性位点上无关的化合物除外。
【000140】本发明提供了一些方法,用于修饰具有腈水解酶活性的多肽,或编码这些多肽的多核苷酸,以便获得新颖多肽,所述新颖多肽保留了腈水解酶活性,但一些期望特征有所改进。这些改进可以包括:在有机溶剂中发挥作用的能力(即表现出腈水解酶活性),在极度或非典型pH下操作,在极端或非典型温度下操作,在极端或非典型盐度水平下操作,催化与不同底物的反应,等等。
【000141】本发明涉及使用腈水解酶以便利用这些酶的独特的催化特征的方法。尽管在化学转化中使用生物催化剂(即纯化或粗酶)通常需要鉴别与特定起始化合物反应的特定生物催化剂,但本发明使用了选择的生物催化剂和反应条件,它们对于许多起始化合物中存在的官能基团是特异的。每一种生物催化剂对于一种官能基团或者与其相关的官能基团是特异的,能与含有这一官能基团的许多起始化合物反应。
【000142】酶在起始化合物内的特定位点反应,而不影响分子的其余部分,这是一个用传统化学方法很难实现的过程。这一高度特异性提供了在化合物文库内鉴别单一活性化合物的方法。该文库的特征在于用于产生该文库的生物催化反应系列,即所谓的“生物合成历史(biosynthetic history)”。筛选文库的生物学活性和跟踪生物合成历史鉴别产生活性化合物的特定反应序列。重复反应序列,并且确定合成的化合物的结构。不像其它合成和筛选方法,这种鉴别模式不需要固定化技术,并且几乎可以使用任何类型的筛选分析法,不需要溶液就可以合成和测试化合物。重要的是应该注意到,酶在官能基团反应的高度特异性允许“跟踪”形成生物催化产生的文库的特定酶反应。(关于分子的修饰的进一步教导,包括小分子,参见PCT申请号PCT/US94/09174,此处完整地引入作为参考)。
【000143】在一个例证中,本发明提供了相关腈水解酶基因家族,和它们的相关产物的编码家族的嵌合化作用(chimerization)。从而根据本发明的一方面,多个腈水解酶核酸的序列(例如A组核酸序列)作为腈水解酶“模板”,是用序列比较算法,如上面所描述的那些算法,进行比对的。然后,在比对的模板序列中,确定一个或多个分界点,这些分界点位于一个或多个同源区域。可以用这些分界点来勾画核酸结构单元(nucleic acid building blocks)的边界,这些核酸结构单元被用来产生嵌合腈水解酶。因此,在腈水解酶模板分子中确定和选择的分界点作为嵌合腈水解酶分子装配中的潜在嵌合化作用点。
【000144】通常,有用的分界点是至少两个祖先模板之间的局部同一性区域,但优选地,分界点是由至少一半的模板,至少三分之二的模板,至少四分之三的模板,或几乎所有的模板共有的同一性区域。
【000145】然后,由分界点定义的结构单元,可以被混和(或者照字面意思在溶液中,或者理论上在纸上或计算机中),重装配以形成嵌合腈水解酶基因。一方面,基因重装配过程被彻底地进行,以便产生所有可能组合的完备文库。换句话说,在最终嵌合的核酸分子集合中,表达了核酸结构单元的所有可能的有序组合。然而,同时,设计每一组合中5’到3’方向上的每一结构单元的装配次序,以反映模板中的次序,并且降低不需要的、不可操作的产品的产生。
【000146】在一些方面,基因装配过程被系统地进行,以便产生区室化文库(compartmentalized library),所述区室化文库具有可以被系统地筛选的区室,例如逐个地筛选。换句话说,本发明提供了,通过有选择性地和明智地使用特定核酸结构单元,与有选择性地和明智地使用有序的分段装配反应相结合,可以在几个反应容器中的每一个中产生嵌合产物的特定集合情况下实现实验设计。这允许进行系统的检查和筛选程序。因此,这允许对潜在地非常大数量的嵌合分子在较小的小组中进行系统地检查。
【000147】在一些方面,产生或重装配结构单元的步骤的合成性质允许核苷酸序列的设计和引入(例如密码子或内含子或调节序列),随后这些核苷酸序列在体外过程(例如通过突变)或体内过程中(例如通过使用宿主生物体的基因剪接能力)能被任选地去除。引入这些核苷酸可能是有好处的,原因有很多,包括产生有用的分界点的潜在益处。
【000148】本发明的合成基因在组装方法使用了多个核酸结构单元,每一个结构单元有两个可连接末端。每一个核酸结构单元上的两个可连接末端的一些实例包括,但不限于,两个钝末端,或一个钝末端和一个粘末端,或两个粘末端。在一个进一步的非限定性实例中,粘末端可能包括一个碱基对,两个碱基对,三个碱基对,四个碱基对或更多碱基对。
【000149】双链核酸结构单元的大小是可以变化的。结构单元的优选大小范围在从大约1个碱基对(bp)(不包括任何粘末端)到大约100,000个碱基对(不包括任何粘末端)。也提供了其它优选的大小范围,其下限为大约1bp到大约10,000bp(包括其中的每一个整数数值),上限为大约2bp到大约100,000bp(包括其中的每一个整数数值)。
【000150】根据一个方面,产生了一个双链核酸结构单元,方法是首先产生两个单链核酸,允许它们退火从而形成双链核酸结构单元。双链核酸结构单元的两个链在除形成粘末端的任何核苷酸之外的每一个核苷酸上可以是互补的;这样除任何粘末端之外不会出现错配。可以选择地,双链核酸结构单元的两个链在少于除任何粘末端之外的每一个核苷酸的少数核苷酸上可以是互补的。尤其是,这些链之间的错配可以被用来引导密码子简并,使用的方法是,如此处描述的位点饱和诱变。
【000151】核苷酸的体内改组在提供变体中也是有用的,可以使用细胞的自然特性来进行,以重组多聚体。当体内重组已经提供了达到分子多样性的主要自然途径时,基因重组保持了相对复杂的过程,提供了(1)同源性的识别;(2)链分裂、链侵入和代谢步骤,导致产生重组交叉;和最后地,(3)交叉分解为离散的重组分子。交叉的形成需要识别同源序列。
【000152】因此,本发明包括一种方法,用于从至少第一个多核苷酸和第二个多核苷酸体内产生嵌合或重组多核苷酸。本发明可以被用来产生重组多核苷酸,通过引导至少第一个多核苷酸和第二个多核苷酸进入适当的宿主细胞,其中所述第一个多核苷酸和第二个多核苷酸共有具有部分序列同源性的至少一个区域(例如A组核酸序列,及其组合)。部分序列同源性区域促进了导致产生重组多核苷酸的序列再组织的过程。这样的杂交多核苷酸可以从分子间重组事件产生,这些事件促进了DNA分子之间的序列整合。此外,这样的杂交多核苷酸可以从分子内还原重配过程产生,这些过程使用重复序列来改变DNA分子内部的核苷酸序列。
【000153】本发明提供了一种方式,用于产生编码生物活性变体多肽(例如腈水解酶变体)的重组多核苷酸。例如,多核苷酸可以编码来自某种微生物的特定酶。由来自一种生物体的第一种多核苷酸编码的酶可以,例如在特定环境条件下有效地发挥作用,例如高盐度条件下。由来自另一种生物体的第二种多核苷酸编码的酶可以在另一种环境条件下有效地发挥作用,例如极度高温条件下。含有来自第一种和第二种原始多核苷酸的序列的重组多核苷酸编码变体酶,该变体酶表现出由原始多核苷酸编码的两种酶的特征。因此,由重组多核苷酸编码的酶可以在环境条件下有效地发挥作用,所述环境条件为由第一种和第二种多核苷酸编码的每一种酶共有的条件,例如高盐度和极高温。
【000154】变体多肽可以表现出原始酶中没有显示出的特定的酶活性。例如,在编码腈水解酶活性的多核苷酸的重组和/或还原重配,可以对所产生的由重组多核苷酸编码的变体多肽进行筛选,以筛选从原始酶的每一种获得的特定腈水解酶活性,即腈水解酶发挥作用的温度或pH。原始多核苷酸的来源可以从个体生物体(“分离物”)分离,从收集的已经在已知成分培养基中生长的生物体(“富集培养物”)分离,或者从未培养的生物体(“环境样品”)分离。使用不依赖培养基的方法来获得来自环境样品的编码新颖生物活性的多核苷酸是最优选的,原因是该方法允许人们使用具有生物多样性的未利用资源。制备多核苷酸的微生物体包括原核微生物,如黄色杆菌属(Xanthobacter)、真细菌(Eubacteria)和古细菌(Archaebacteria),和低级真核微生物,如真菌、一些藻类和原生动物。多核苷酸可以从环境样品分离,在这种情况下可以不培养生物体来回收核酸,或者从一个或多个培养的生物体中回收核酸。在一个方面,这样的微生物可以是极端条件生物(extremophiles),如超嗜热菌(hyperthermophiles),嗜冷微生物(psychrophiles),冷育微生物(psychrotrophs),喜盐植物、嗜压生物和嗜酸菌。编码从极端条件微生物分离的酶的多核苷酸是尤其优选的。这样的酶可以在如下条件下发挥作用,在温度高于100℃的陆地温泉和深海热出口,在温度低于0℃的北极水域中,在饱和盐环境的死海中,在pH值大约为0的煤沉淀物中和富硫的地热温泉中,或者在pH值大于11的污水污泥中。
【000155】可以被用来表达重组蛋白的哺乳动物表达系统的实例包括,COS-7、C127、3T3、CHO、HeLa和BHK细胞系。哺乳动物表达载体包括复制起点、适当的启动子和增强子,也包括任何必须的核糖体结合位点、聚腺苷酸位点、剪接供体和受体位点、转录终止序列和5’侧翼非转录序列。来自SV40剪接和聚腺苷酸位点的DNA序列可以被用来提供所需的非转录遗传元件。此处完整地引入美国专利号6,054,267作为参考。
【000156】含有相关多核苷酸的宿主细胞可以在传统的营养基中培养,这些传统营养培养基被修饰,以适合激活启动子、选择转化体或扩增基因。培养条件是先前选择用于表达的宿主细胞所使用的,如温度和pH等等,这些条件对于普通的熟练技术人员是显而易见的。然后,对被鉴别出具有期望的酶活性或其它特性的克隆进行测序,以识别编码具有期望活性或特性的酶的重组多核苷酸序列。
【000157】在一个方面,本发明提供了分离的腈水解酶,或者作为分离的核酸,或者作为分离的多肽,其中核酸或多肽是通过从DNA群体回收DNA,并且用回收的DNA转化宿主而制备的,从而产生用来筛选特定蛋白如腈水解酶活性的克隆文库,所述DNA群体来源于至少一种未培养的微生物。美国专利号6,280,926,Short提供了对这些方法的描述,此处将其完整地引入,其全部目的是作为参考。
【000158】因此,在一个方面,本发明涉及一种方法,用于产生生物活性的重组腈水解酶多肽,并且筛选具有期望活性或特性的这样的多肽,步骤如下:
1)引导至少第一种腈水解酶多核苷酸和第二种腈水解酶多核苷酸进入适当的宿主细胞,所述至少第一种腈水解酶多核苷酸和第二种腈水解酶多核苷酸共有至少一个序列同源性区域;
2)培养宿主细胞,条件是能促进导致重组腈水解酶多核苷酸的序列再组织;
3)表达由重组腈水解酶多核苷酸所编码的重组腈水解酶多肽;
4)筛选具有期望活性或特性的重组腈水解酶多肽;
5)分离编码重组腈水解酶多肽的重组腈水解酶多核苷酸。
【000159】可以使用的载体的实例包括病毒颗粒、杆状病毒、噬菌体、质粒、噬菌粒、粘粒、fosmid、细菌人工染色体、病毒DNA(例如牛痘病毒、腺病毒、禽痘病毒、假狂犬病和SV40的衍生物),基于P1的人工染色体、酵母质粒、酵母人工染色体和对相关宿主特异的任何其它载体(例如杆菌属(Bacillus)、曲霉菌(Aspergillus)和酵母)。大量合适的载体对本技术领域的普通技术人员是已知的,并且可以通过商业途径获得。细菌载体的实例包括pQE载体(Qiagen,Valencia,CA);pBluescript质粒、pNH载体和λ-ZAP载体(Stratagene,La Jolla,CA);和pTRC99a、pKK223-3、pDR540和pRIT2T载体(Pharmacia,Peapack,NJ)。真核载体的实例包括pXTl和pSG5载体(Stratagene,La Jolla,CA);和pSVK3、pBPV、pMSG和pSVLSV40载体(Pharmacia,Peapack,NJ)。然而,也可以使用任何其它质粒或其它载体,只要它们是可以复制的,并且可以在宿主中存活。
【000160】在本发明中使用的一种优选类型的载体含有f-因子(或致育因子)复制起点。大肠杆菌中的f-因子是质粒,该质粒能影响其本身在接合过程中的高频转移,和其本身的细菌染色体的低频转移。尤其优选的一个方面是使用被称作“fosmid”的克隆载体或细菌人工染色体(BAC)载体。这些是从大肠杆菌f-因子得到的,能稳定地整合大的基因组DNA片段。
【000161】表达载体中的DNA序列可操作性地连接到适当地表达控制序列上,包括启动子,以指导RNA合成。有用地细菌启动子包括lacI、lacZ、T3、T7、gpt、λPR、PL和trp。有用的真核启动子包括CMV立即早期启动子、HSV胸苷激酶、早期和晚期SV40、来自逆转录酶病毒的LTR和小鼠金属硫蛋白-I。适当载体和启动子的选择在本技术领域的普通技术人员的水平内。表达载体也含有翻译起始和转录终止子的核糖体结合位点。载体也包括用于扩增表达的适当序列。可以使用CAT(氯霉素转移酶)载体或具有选择性标记物的其它载体从任何期望的基因选择启动子区域。
【000162】此外,表达载体可以含有一个或多个选择性标记基因,以提供选择转化宿主细胞的表型特性。有用的选择性标记物包括对真核细胞培养物具有抗性的二氢叶酸还原酶或新霉素,或对大肠杆菌中有抗性的四环素或氨卡青霉素。
【000163】可以使用多种技术将载体引入宿主细胞,包括转化、转染、转导、病毒感染、基因枪或Ti介导的基因转移。特定方法包括磷酸钙转染、DEAE-葡聚糖介导的转染、脂质转染、或电穿孔。
【000164】还原重配-在其它方面,变体腈水解酶多核苷酸可以通过还原重配过程产生。尽管重组是一个“分子间”过程,该过程在细菌中通常被看作是“recA依赖的”现象,还原重配的过程通过“分子内”不依赖recA的过程发生。在这个方面,本发明依赖于细胞介导还原过程的能力,以通过删除降低细胞中的准重复序列的复杂性。该方法包括产生含有连续重复或准重复序列(原始编码序列)的构建物,将这些序列插入到适当的载体中,随后将载体引入适当的宿主细胞中。个体分子同一性的重配通过拥有构建物的同源性区域中的连续序列之间,或准重复单元之间的组合过程产生。重配过程重组和/或降低重复序列的复杂性或程度,并且导致产生新颖的分子种类。可以使用多种处理来增加重配率,如紫外线光或DNA损伤化学药品。此外,可以使用显示出增强水平的“遗传不稳定性”的宿主细胞系。
【000165】重复序列-重复或“准重复”序列在遗传不稳定性中起着重要作用。在本发明中,“准重复”是结构上不完全相同的重复,但代表一组具有高度相似性或同一性序列的连续序列。细胞中的还原重配或删除过程通过删除准重复序列内的位置之间的序列降低了所获得的构建物的复杂性。因为删除(和潜在地插入)事件可以事实上在准重复序列内的任何地方发生,所以这些序列提供了潜在变体的大量的全部组成成分。
【000166】当准重复序列被完全以相同方向连接时,例如头-到-尾连接或相反,对于大部分而言,删除的端点在准重复序列内的任何地方等概率地发生。相反,当这些单元以头-到-头或尾-到-尾存在时,所插入的准重复序列可以形成双链体,该双链体描绘了邻接单元的端点,从而有助于删除不连续的单元。因此,在本发明中优选的是准重复序列以相同方向被连接,这是因为准重复序列的随机方向导致重配效率的损失,而序列的一致方向将提供最高的效率。但是,尽管在相同方向具有较少的邻接序列降低了效率或还原重配,仍然可以为有效回收新颖分子提供足够的变化。
【000167】可以使用多种方法中的任意一种方法将序列以头-到-尾方向装配,包括如下:
a)可以使用引物,包括聚腺苷酸头和聚胸腺嘧啶核苷酸尾,当产生单链时,聚腺苷酸头和聚胸腺嘧啶核苷酸尾可以提供方向。这是通过具有从RNA产生的引物的最初少数几个碱基实现的,因此容易通过RNAse H去除。
b)可以使用引物,包括独特的限制性切割位点。需要多个位点、一组独特的序列、重复合成和连接步骤。
c)引物的内部少数几个碱基可以被硫醇化,核酸外切酶用来适当地产生有尾分子。
【000168】重配序列的恢复依赖于鉴别具有降低的重复指数(RI)的克隆载体。然后通过扩增恢复重配编码序列。再克隆和表达产物。具有降低的RI的克隆载体的回收,可以通过如下几点来实现:
1)使用仅仅当构建物的复杂性降低时能稳定地维持的载体。
2)通过物理方法物理回收缩短的载体。在这种情况下,将使用标准质粒分离方法回收克隆载体,然后使用标准方法对克隆载体进行大小分级(例如具有低分子量截断值的琼脂糖凝胶或柱)。
3)当插入物大小降低时,可以选择含有间断基因的载体予以回收。
4)使用直接选择技术,其中使用表达载体且进行适当的选择。
【000169】来自相关生物体的编码序列可能会展示出高度同源性,但编码变化相当多的蛋白质产物。这些类型的序列在本发明中作为准重复序列尤其有用。然而,虽然下面阐明的实例证实了具有高度同一性的编码序列(准重复)的重配,该过程不限于几乎同一的重复物。
【000170】下述实例举例说明了本发明的一个方法。获得了来自三个不同种类的准重复编码序列。每一序列编码具有一组独特特性的蛋白。这些序列中的每一个在序列中的独特位置上的一个或多个碱基对上不同,被称作“A”、“B”和“C”。准重复序列被独立地或共同地扩增并连接到随机组装物中,这样可以在连接分子群体中得到全部连接分子的所有可能的排列和组合。准重复单元的数量通过装配条件来控制。构建物中准重复单元的平均数量被定义为重复指数(repetitive index,RI)。
【000171】一旦形成,构建物可以根据已出版的规程,在琼脂糖凝胶上进行大小分级,被插入到克隆载体中,转染到适当的宿主细胞中。然后这些细胞被繁殖以允许还原重配的发生。如果期望,还原重配过程的速度可以通过引入DNA损伤来刺激。RI的降低是通过“分子内”机制,通过重复序列之间的缺失形成来介导的,或者是通过“分子间”机制,通过重组事件来介导的,是无关紧要的。最终的结果是分子重配到所有可能的组合中。
【000172】在另一个方面,在重组或重配之前或过程中,本方面的多核苷酸或由此处描述的方法产生的多核苷酸可以经受能促进将突变引入原始多核苷酸中的试剂或方法。引入这样的突变将增加所得到的杂交多核苷酸和由其编码的多肽的多样性。可以促进诱变的试剂或方法包括,但不限于:(+)-CC-1065,或合成类似物如(+)-CC-1065-(N3-腺嘌呤)(Sun等人,(1992),Biochemistry 31(10):2822-9);能抑制DNA合成的N-乙酰或去乙酰4’-氟-氨基联苯加合物(例如参见van de Poll等人(1992),Carcinogenesis 13(5):751-8);或能抑制DNA合成的N-乙酰或去乙酰4-氨基联苯加合物(同样参见Van de Poll等人(1992),同上);三价铬,三价铬盐,能抑制DNA复制的多环芳烃(“PAH”)DNA加合物,如7-溴甲基-苯并蒽(“BMA”),三(2,3-二溴丙基)磷酸盐(“Tris-BP”),1,2-二溴-3-氯丙烷(“DBCP”),2-溴丙烯醛(2BA),苯并芘-7,8-,二氢二醇-9-10-环氧化物(“BPDE”),铂(II)卤素盐,N-羟基-2-氨基-3-甲基咪唑[4,5-f]-喹啉(“N-羟基-IO”),和N-羟基-2-氨基-1-甲基-6-苯基咪唑[4,5-f]-嘧啶(“N-羟基-PhIP”)。减缓或停止PCR扩增的尤其优选的方法包括UV光(+)-CC-1065-和(+)-CC-1065-(N3-腺嘌呤)。特别包括的方法是DNA加合物或含有来自多核苷酸或多核苷酸集合体的DNA加合物的多核苷酸,它们可以通过一种方法来释放或去除,该方法包括在进一步处理之前加热含有该多核苷酸的溶液。
【000173】GSSMTM-本发明也提供了使用含有简并N,N,G/T序列的密码子引物,将点突变引入多核苷酸,以产生一组后代多肽,其中在每一氨基酸位置表示了全部范围的单一氨基酸取代,该方法被称作基因位点饱和诱变(GSSMTM)。所用的寡核苷邻接地包括第一个同源序列,一个简并N,N,G/T序列,和可能的第二个同源序列。从使用这样的寡核苷酸得到的后代翻译产物包括沿着多肽的每一氨基酸位点的所有可能的氨基酸变化,这是因为N,N,G/T序列的简并性包括所有20个氨基酸的密码子。
【000174】在一个方面,用一个这样的简并寡核苷酸(包括一个简并N,N,G/T盒子)使亲代多核苷酸模板中的每一个原始密码子经受全部范围的密码子取代。在另一个方面,使用至少两个简并N,N,G/T盒子-或者在相同的寡核苷酸上或者不在相同的的寡核苷酸上,使亲代多核苷酸模板中的至少两个原始密码子经受全部范围的密码子取代。因此,一个寡核苷酸中可以包括不止一个N,N,G/T序列,以便在不止一个位点引入氨基酸突变。多数N,N,G/T序列可以是直接邻接的,或者是通过一个或多个其它核苷酸序列分离的。在另一个方面,可以单独或与含有N,N,G/T序列的密码子组合使用用于引入插入和删除的寡核苷酸,以便引入氨基酸插入、删除和/或取代的任何组合或排列。
【000175】在一个特定的例证中,使用含有连续N,N,G/T三联体,即简并(N,N,G/T)n序列的寡核苷酸,同时诱变两个或多个连续氨基酸位置是可能的。
【000176】在另一个方面,本方面提供了使用简并盒子,所述简并盒子的简并性低于N,N,G/T序列。例如,在一些情况下,使用仅包括一个N的简并三联体序列可能是令人期望的,其中所述N可以是三联体的第一第二或第三个位置。可以在三联体的剩余两个位置使用任何其它碱基,包括其任意组合和排列。另外,在一些情况下使用简并N,N,N三联体序列或N,N,G/C三联体序列可能是令人期望的。
【000177】然而,应该意识到,使用本发明所公开的简并三联体(如N,N,G/T或N,N,G/C)是有利的,原因有好几个。一方面,本发明提供了一种系统地且相当容易地产生完全范围的20个可能的氨基酸取代到多肽中每一个氨基酸位置的方法。因此,对于含有100个氨基酸的多肽,本发明提供了一种系统地且相当容易地产生2000个不同种类的途径(即每一位置的20个可能的氨基酸乘以100个氨基酸位置)。应该意识到,通过使用含有简并N,N,G/T或N,N,G/C三联体序列的寡核苷酸,提供了编码20个可能的氨基酸的32个个体序列。因此,在亲代多核苷酸序列使用一个这样的寡核苷酸进行饱和诱变的反应容器中,产生了编码20个不同多肽的32个不同的子代多核苷酸。相反,在定点诱变中使用非简并寡核苷酸导致每一反应容器仅仅只有一个子代多肽产物。
【000178】本发明也提供了使用非简并寡核苷酸,它们可以任选地与所公开的简并引物组合使用。应该意识到,在一些情况下,使用非简并寡核苷酸在工作多核苷酸中产生特异性点突变是有利的。这提供了一种方式,产生特异性沉默点突变,导致相应氨基酸变化的点突变,和产生终止密码子和相应的多肽片段表达的点突变。
【000179】因此,在一个方面,每一饱和诱变反应容器含有编码至少20个后代多肽分子的多核苷酸,这样在与亲代多核苷酸中诱变的密码子位置相应的特异性氨基酸位置上表达了所有20个氨基酸。从每一饱和诱变反应容器产生的32倍简并后代多肽可以进行克隆扩增(例如,使用表达载体克隆到合适的大肠杆菌宿主中),并且进行表达筛选。当通过筛选鉴别出个体后代多肽以显示有利的特性变化时(当与亲代多肽比较时),可以对其进行测序以鉴别其中所包括的相对有利的氨基酸取代。
【000180】应该意识到,一旦使用此处公开的饱和诱变诱变了亲代多肽中的每个氨基酸位置,可以在不止一个氨基酸位置鉴别出有利的氨基酸变化。可以产生一个或多个新颖后代分子,它们含有全部或部分这些有利的氨基酸取代的组合。例如,如果在多肽中的三个氨基酸位置中的每一个中鉴别出两个特异性有利的氨基酸变化,包括每一位置的三种可能性(与原始氨基酸没有变化,和两个有利变化中的每一个)和三个位置的排列。因此,总共有3×3×3或27种可能性,包括先前确定的7种可能性---6个单一点突变(即在三个位置中的每一个上有2个)和在任何位置没有变化。
【000181】仍然在另一个方面,位点饱和诱变可以连同筛选一起与改组、嵌合、重组和其它诱变过程一起使用。本发明提供了以重复方式使用任何诱变过程,包括饱和诱变。在一个例证中,任何诱变过程的重复使用是与筛选组合使用的。
【000182】因此,在一个非限定性例证中,本发明的多核苷酸和多肽可以通过饱和诱变与其它诱变过程组合使用得到,如两个或多个相关多核苷酸被引入适当的宿主细胞,从而通过重组和还原重配产生杂交多核苷酸的过程。
【000183】除了沿着基因的整个序列进行诱变之外,可以使用诱变来取代多核苷酸序列中的任意数量的碱基中的每一个碱基,其中将被诱变的碱基数量可以是大约15到大约100,000之间的每一个整数。因此,代替沿着分子的每一个位置诱变,可以对每一个或非连续数量(例如总数在大约15到大约100,000之间的子集)的碱基进行诱变。在一方面,分离的核苷酸被用来诱变沿着多核苷酸序列的每一位置或位置组。将被诱变的一组3个位置可以是一个密码子。一方面,使用含有异源盒子的诱变引物引入突变,其中异源盒子也被称作诱变盒子。例如,盒子可以具有大约1到大约500个碱基。在这样的异源盒子中的每一核苷酸位置可以是N,A,C,G,T,A/C,A/G,A/T,C/G,C/T,G/T,C/G/T,A/G/T,A/C/T,A/C/G,或E,其中E是非A,C,G或T的任何碱基。
【000184】在通常意义下,饱和诱变包括诱变欲被诱变的指定多核苷酸序列(例如,将被诱变序列的长度为大约15个到大约100,000个碱基)中的诱变盒子的完整集合(例如,每一盒子长度为大约1-500个碱基)。因此,一组突变(范围为大约1个到大约100个突变)被引入到将被诱变的每一盒子中。在应用一轮饱和诱变中,将被引入一个盒子中的突变的一个编组可以与将被引入第二个盒子的第二个编组突变不同或相同。这样的编组的例证是缺失、增加,特定密码子的编组,和特定核苷酸盒子的编组(该groupings)。
【000185】将被诱变的规定序列包括完整基因、途径、cDNA、完整的开放阅读框(ORF)、启动子、增强子、抑制子/反式激活蛋白、复制起点、内含子、操纵基因或任何多核苷酸官能基团。通常,用于这一目的的“规定序列”可以是任何多核苷酸,15个碱基多核苷酸序列,和长度在大约15个碱基到大约15,000个碱基之间的多核苷酸序列(本发明明确地指定为它们之间的每一整数)。考虑选择密码子编组,包括由简并诱变盒子编码的氨基酸类型。
【000186】在可以被引入诱变盒子的突变编组的一个特别优选的例证中,本发明特别地提供了在每一位置编码2,3,4,5,6,7,8,9,10,11,12,13,14,15,16,17,18,19和20个氨基酸的简并密码子取代(使用简并寡核苷酸),和由其编码的多肽文库。
【000187】本发明的一方面是一种分离核酸,该核酸含有A组核酸序列的一个序列,基本上与其同一的序列,与其互补的序列,或包括A组核酸序列的一个序列的至少10,15,20,25,30,35,40,50,75,100,150,200,300,400或500个连续碱基的片段。分离核酸可以包括DNA,包括cDNA、基因组DNA和合成DNA。DNA可以是双链或单链的,如果单链可以是编码链或非编码(反义)链。另外,分离核酸可以包括RNA。
【000188】正如下面更详细地所讨论的,本发明的分离核酸序列可以被用来制备B组氨基酸序列的多肽中的一个多肽,和与其基本上同一的序列,或包括B组氨基酸序列的多肽中的一个多肽的至少5,10,15,20,25,30,35,40,50,75,100或150个连续氨基酸的片段,和与其基本上同一的序列。
【000189】另外,本发明的核酸序列可以使用传统技术来诱变,如定点诱变或本技术领域普通技术人员所熟知的其它技术,以便将沉默变化引入A组核酸序列和与其基本上同一的序列的多核苷酸中。正如此处所用,“沉默变化”包括,例如,不改变由多核苷酸编码的氨基酸序列的变化。为了通过引入在宿主生物体中频繁发生的密码子或密码子对,来增加由含有编码多肽的载体的宿主细胞产生的多肽的水平,这样的变化可能是令人期望的。
【000190】本发明也涉及具有核苷酸变化的多核苷酸,所述核苷酸变化导致本发明的多肽中(例如B组氨基酸序列)的氨基酸取代、插入、缺失、融合和平截。这样的核苷酸变化可以使用如下技术来引入:定点诱变、随机化学诱变、核酸外切酶III缺失和其它重组DNA技术。可选择地,这样的核苷酸变化可以是天然存在的等位基因变异体,所述等位基因变异体是通过鉴别特定地与包括如下碱基的探针杂交的核酸序列分离的:A组核酸序列和与其基本上同一的序列(或与其互补的序列)的一个序列的至少10,15,20,25,30,35,40,50,75,100,150,200,300,400或500个连续碱基,杂交条件是如此处提供的高度严格、中度严格或低度严格条件。
固定化酶固相支持体
【000191】酶、其片段和编码这些酶和片段的核酸可以被固定到固相支持体上。在工业过程中使用这些酶通常是经济的和有效的。例如,在特定化学反应中使用的酶(或其活性片段)的聚生体或混合物,可以被附着在固相支持体上,并且浸入到工艺桶中。酶反应可以发生。然后,从桶中取出固相支持体和附着在其上的酶,用于重复使用。在本发明的一个方面,分离的核酸被固定到固相支持体上。在本发明的另一个方面,固相支持体选自凝胶、树脂、聚合物、陶瓷制品、玻璃、微电极及其任意组合。
【000192】例如,在本发明中有用的固相支持体包括凝胶。凝胶的一些实例包括琼脂糖凝胶、白明胶、戊二醛、脱乙酰壳多糖处理的戊二醛、白蛋白-戊二醛、脱乙酰壳多糖-黄原胶、toyopearl凝胶(聚合物凝胶)、藻酸盐、藻酸盐-多熔素、角叉菜胶、琼脂糖、乙醛酰琼脂糖、磁性琼脂糖、葡聚糖-琼脂糖、聚(氨甲酰磺酸盐)水凝胶、BSA-PEG水凝胶、磷酸化聚乙烯醇(PVA)、单氨基乙基-N-氨基乙基(MANA)、氨基或其任意组合。
【000193】在本发明中有用的另一个固相支持体是树脂或聚合物。树脂或聚合物的一些实例包括纤维素、丙烯酰胺、尼龙、人造丝、聚酯、离子交换树脂、AMBERLITETMXAD-7、AMBERLITETMXAD-8、AMBERLITETMIRA-94、AMBERLITETMIRC-50、聚乙烯、聚丙烯酸、聚甲基丙烯酸酯或其任意组合。在本发明中有用的另一种类型的固相支持体是陶瓷制品。一些实例包括非多孔性陶瓷、多孔性陶瓷、SiO2、Al2O3。在本发明中有用的另一种类型的固相支持体是玻璃。一些实例包括非多孔性玻璃、多孔玻璃、氨丙基玻璃或其任意组合。可以使用的另一种类型的固相支持体是微电极。一个实例是聚乙烯胺涂覆的磁铁。石墨颗粒可以被用作固相支持体。固相支持体的另一个实例是细胞,如红血球。
固定化方法
【000194】有许多用于将酶或其片段或核酸固定到固相支持体上的方法,这些方法对本技术领域的普通技术人员是已知的。这些方法的一些实例包括静电小滴产生、电化学方法、通过吸附、通过共价结合、通过交联、通过化学反应或过程、通过封装、通过截留、通过藻酸钙或通过聚(2-羟乙基甲基丙烯酸)。在如下文献中描述了类似方法:Methods in Enzymology,Immobilized Enzymes and Cells,C卷.1987.Academic Press.由S.P.Colowick和N.O.Kaplan编辑,第136卷;和Immobilization of Enzymes and Cells.1997.Human Press.由G.F.Bickerstaff编辑,Series:Methods in Biotechnology,由J.M.Walker编辑。
【000195】探针-A组核酸序列、与其基本上同一的序列、互补序列或含有前述序列中的一个序列的至少10,15,20,25,30,35,40,50,75,100,150,200,300,400或500个连续碱基的片段的分离核酸也可以被用作探针,来确定生物样品如土壤样品,是否含有包括本发明的核酸序列的生物体,或可以得到该核酸的生物体。在这些方法中,得到了潜在地具有分离核酸的生物体的生物样品,并且从这些样品得到了核酸。将核酸与探针在允许探针与其中存在的任何互补序列特定地杂交的条件下接触。
【000196】在需要的时候,允许探针与互补序列特定地杂交的条件可以如下确定,将探针与来自样品的互补序列以及不含有互补序列的控制序列接触,所述样品已知含有互补序列。可以改变杂交条件,如杂交缓冲液的盐浓度、杂交缓冲液的甲酰胺浓度或杂交温度,以确定允许探针与互补核酸特定地杂交的条件。此处叙述了严格杂交条件。
【000197】可以通过用可检测试剂探针来检测杂交,如放射性同位素标记、荧光染料或能催化可检测产物形成的酶。使用已标记的探针来检测样品中互补核酸的存在的许多方法对本技术领域的普通技术人员是熟知的。这些方法包括Southern印迹、Northern印迹、克隆杂交方法和斑点印迹。这些方法中的每一个方法的规程在如下文献中有所提供:Ausubel等人,(1997),Current Protocols in MolecularBiology,John Wiley&Sons,Inc.,和Sambrook等人(1989),Molecular Cloning:A Laboratory Manual第二版,Cold Spring Harbor Laboratory Press,此处将这些文献完整地引入作为参考。
【000198】在一个实例中,探针DNA是用特定结合对的一员(即配体)“标记”的,该对的另一员被结合到固体基质中,以便从靶物质来源容易地分离靶物质。例如,配体和特定结合对可以在每个方向选自如下:(1)抗原或半抗原和抗体或其特定结合片段;(2)生物素或亚氨基生物素和抗生物素蛋白或链霉抗生物素蛋白;(3)糖和对其特异的外源凝集素;(4)酶及其抑制剂;(5)脱辅基酶蛋白和辅因子;(6)互补同聚寡核苷酸;和(7)激素及其受体。在一个实例中,固相选自:(1)玻璃或聚合表面;(2)聚合玻珠的填充柱;和(3)磁性或顺磁性颗粒。
【000199】可以选择地,不止一个探针(其中至少一个探针能与核酸样品中存在的任何互补序列特定地杂交)可以被用于扩增反应中,以确定样品是否包括含有本发明的核酸序列的生物体(例如,从中分离核酸的生物体)。典型地,探针包括寡核苷酸。一方面,扩增反应包括PCR反应。PCR规程在Ausubel等人(1997)同上,和Sambrook等人(1989)同上中有所描述。另外,扩增可以包括连接酶链式反应,3SR,或链置换反应。(参见Barany(1991),PCR Methods and Application1:5-16;Fahy等人(1991),PCR Methods and Applications 1:25-33;和Walker等人(1992),Nucleic Acid Research 20:1691-1696,将这些文献完整地引用于此作为参考)。
【000200】也可以在染色体步查方法中使用来自如下序列的末端附近的序列所衍生的探针:A组核酸序列和与其基本上同一的序列中所示的序列,以鉴别含有与上面所述的核酸序列邻接的基因组序列的克隆。这样的方法允许分离编码来自宿主生物体的其它蛋白的基因。
【000201】如如下序列中所示的分离核酸序列可以被用作探针来鉴别和分离相关核酸:A组核酸序列,与其基本上同一的序列,与其互补的序列,或含有前述序列中的一个序列的至少10,15,20,25,30,35,40,50,75,100,150,200,300,400或500个连续碱基的片段。在一些方面,相关核酸可以是来自生物体的cDNA或基因组DNA,从中分离核酸的生物体除外。例如,其它生物体可以是相关生物体。在一些方法中,核酸样品与探针在允许探针与相关序列特定地杂交的条件下接触。然后使用上面说明的任意一种方法来检测探针与来自相关生物体的核酸的杂交。
【000202】在核酸杂交反应中,用于获得特定严格水平的条件可以有所变化,这依赖于被杂交的核酸的性质。例如,在选择杂交类型时可以考虑核酸的长度、核酸之间的互补性数量、核苷酸序列组成(例如G-C富集与A-T富集程度)和核酸类型(例如RNA与DNA)。可以通过在低于探针的溶解温度的变化温度下进行杂交来改变严格性。溶解温度Tm是50%的靶序列与完全互补的探针杂交的温度(在已知离子强度和温度下)。对于特定探针,对严格条件进行选择使其等于Tm或比Tm低大约5℃。使用如下公式计算探针的溶解温度:
【000203】对于长度在14到70个核苷酸之间的探针,计算溶解温度(Tm)公式为:Tm=81.5+16.6(log[Na+]+0.4(组分G+C)-(600/N),其中N是探针的长度。
【000204】如果杂交是在含有甲酰胺的溶液中进行的,计算溶解温度的方程为:Tm=81.5+16.6(log[Na+]+O.4(组分G+C)-(0.63%甲酰胺)-(600/N),其中N是探针的长度。
【000205】表达文库-将本发明的多核苷酸与表达载体和适当的宿主细胞组合使用来产生表达文库。该文库允许由本发明的多核苷酸编码的多肽的体内表达。在这样的表达文库已经产生后,在通过细胞分选进行筛选之前,可以增加额外的步骤,即对这样的文库进行“生物淘选”的步骤。“生物淘选”方法是这样的一个过程,即通过筛选克隆文库中的序列特异性鉴别具有特定生物活性的克隆,所述克隆文库是通过如下步骤产生的:(i)选择性地分离来源于至少一种微生物体的靶DNA,通过使用至少一个探针DNA,该探针DNA包括编码具有特定生物活性(例如腈水解酶活性)的多肽的DNA序列的至少一部分;和(ii)任选地用分离的靶DNA转化宿主,以产生克隆文库,用于筛选特定生物活性。
【000206】探针DNA,是用于从来自至少一种微生物体的DNA中选择性地分离相关靶DNA的,可以是相对于已知活性的酶的DNA的全长编码区序列或部分编码区序列。使用探针的混合物来探测原始DNA文库,所述探针包括编码具有特定酶活性的酶的DNA序列的至少一部分。这些探针或探针文库是单链的,被探测的微生物DNA已经被转化为单链形式。特别合适的探针是那些来源于编码如下酶的DNA的探针,所述酶具有与被筛选的特定酶活性类似的活性或相同的活性。
【000207】已经从选择性地从生物体分离的DNA制备了克隆多重性,这样的克隆被筛选特定酶活性,以鉴别具有特定酶特性的克隆。
【000208】可以在个体表达克隆上影响酶活性的筛选,或者最初在表达文库的混合物上影响,以确定该混合物是否具有一种或多种特定酶活性。如果所述混合物具有特定酶活性,那么可以对个体克隆进行再次筛选,以筛选这样的酶活性或更特异的活性。因此,例如,如果克隆混合物具有腈水解酶活性,那么可以对个体克隆进行恢复和筛选,以确定这些克隆中的哪一个克隆具有腈水解酶活性。
【000209】正如对于上面的一个方面的描述,本发明提供了一种过程,用于含有选择的DNA的克隆的酶活性筛选,所述选择的DNA来源于微生物体,该过程包括:筛选文库,以筛选特定酶活性,所述文库包括多个克隆,所述克隆已经通过从所选择DNA的微生物体的基因组DNA中回收而产生,所述选择的DNA是通过与至少一个DNA序列杂交选择的,所述至少一个DNA序列是编码具有特定活性的酶的DNA序列的全部或一部分;和用选择的DNA转化宿主,以产生被用来筛选特定酶活性的克隆。
【000210】在一个方面,来自微生物体的DNA文库被进行一个选择程序,以从中选择DNA,该DNA与一个或多个探针DNA序列杂交,所述探针DNA序列是编码具有特定酶活性的酶的DNA序列的全部或一部分;
(a)将来自DNA文库的的单链DNA群体与结合到配体上的DNA探针在严格杂交条件下接触,以产生探针和DNA文库的一个成员之间的双链体;
(b)将双链体与配体的固相特异性结合成员接触,以产生固相复合体;
(c)从DNA文库的非双链体成员分离固相复合体;
(d)对复合体进行变性,以释放DNA文库的成员;
(e)产生来自步骤(d)的成员的互补DNA链,以使该成员形成双链DNA;
(f)将双链DNA引入适当的宿主,以表达由成员DNA编码的多肽;和
(g)确定所表达的多肽是否表现出特定酶活性。
【000211】在其它方面,该过程包括预选择,以回收包括信号或分泌序列的DNA。用这种方式,通过如上面所描述的仅仅杂交包括信号或分泌序列的DNA,从基因组DNA群体选择是可能的。下面的段落描述了本方面的这一方面的规程,分泌信号序列通常情况下的性质和功能,这些序列应用于分析或选择过程的特定的例证性应用。
【000212】在上面的步骤(a)之后但在(b)之前,这一方面的特别方面进一步包括如下步骤:
(i)将(a)的单链DNA群体与结合到配体上的寡核苷酸探针在严格杂交条件下接触,以形成双链DNA双链体,所述结合到配体上的寡核苷酸探针与对于给定的蛋白种类独特的分泌信号序列互补;
(ii)将(i)的双链体与所述配体的固相特异性结合成员接触,以产生固相复合体;
(iii)从(a)的单链DNA群体分离固相复合体;
(iv)对复合体进行变性,以释放基因组群体的单链DNA成员;
(v)从固相结合探针分离单链DNA成员。
【000213】然后,已经被选择和分离以包括信号序列的DNA被进行上面所描述的选择程序,以从中选择和分离DNA,该DNA结合到一个或多个探针DNA序列上,所述探针DNA来源于编码具有特定酶活性的酶的DNA。该程序在美国专利号6,054,267中有所描述和例证,此处将其完整地引入作为参考。
【000214】体内生物淘选可以使用基于FACS(荧光激活细胞分选仪)的仪器进行。用含有稳定转录的RNA的元件的载体构建复合基因文库。例如,内含导致二级结构如发夹结构的序列,将有助于增强它们的稳定性,从而增加它们在细胞中的半衰期,所述二级结构被设计在RNA的转录区域的侧翼。在生物淘选过程中使用的探针分子,包括用报道分子标记的寡核苷酸,该报道分子只有在探针与靶分子结合时才发出荧光。使用几种转化方法中的一种,将这些探针引入来自文库的重组细胞。探针分子结合到转录的靶mRNA上,导致DNA/RNA异源双链体分子。探针与靶物质的结合,将产生荧光信号,该信号在筛选过程中通过FACS仪器进行检测和分类。
【000215】在一些方面,编码如下序列的多肽之一的核酸以适当的相与前导序列装配,所述前导序列能指导翻译多肽或其片段的分泌,所述如下序列包括B组氨基酸序列,与其基本上同一的序列,或包括其至少5,10,15,20,25,30,35,40,50,75,100或150个连续氨基酸的片段。任选地,核酸可以编码融合多肽,其中如下序列的多肽之一被融合到异源肽或多肽上,如期望特性例如增强的稳定性或简化的纯化过程,异源肽或多肽例如是N-端识别肽,所述如下序列包括B组氨基酸序列,与其基本上同一的序列,或包括其至少5,10,15,20,25,30,35,40,50,75,100或150个连续氨基酸的片段。
【000216】宿主细胞可以是本技术领域的普通技术人员所熟悉的任何宿主细胞,包括原核细胞、真核细胞、哺乳动物细胞、昆虫细胞或植物细胞。正如适当宿主的代表性实例,可能提及到:细菌细胞,如大肠杆菌(E.coli.)、链霉菌(Streptomyces)、枯草杆菌(Bacillus subtilis)、鼠伤寒沙门氏菌(Salmonellatyphimurium)和如下属中的各种种类:假单胞菌属(Pseudomonas)、链霉菌属(Streptomyces)和葡萄球菌属(Staphylococcus),真菌细胞如酵母,昆虫细胞如果蝇S2(Drosophila S2)和草地夜蛾Sf9(Spodoptera Sf9),动物细胞如中国仓鼠卵巢细胞(CHO)、COS或黑色素瘤(Bowes melanoma)和腺病毒。适当宿主的选择在本技术领域的普通技术人员的能力范围内。
【000217】在适当的时候,工程宿主细胞可以在传统的培养基中培养,该培养基被修饰以适合激活启动子,选择转化体或扩增本发明的基因。在合适的宿主菌株转化和宿主菌株生长到适当的细胞密度之后,可以用适当的方法(例如变温或化学诱导)诱导所选择的启动子,细胞被再次培养一段时间以允许它们产生期望的多肽或其片段。
【000218】细胞被特别地通过离心收获,通过物理或化学方法破坏,并且保留所得到的粗提取物用于进一步纯化。用于蛋白表达的微生物细胞可以通过任何传统方法来破坏,包括冻融循环、超声处理、机械破坏或使用细胞裂解试剂。这样的方法对本技术领域的普通技术人员是已知的。所表达的多肽或其片段可以通过如下方法从重组细胞培养物中回收和纯化:硫酸铵或乙醇沉淀、酸提取、阴离子或阳离子交换色谱法、磷酸纤维素色谱法、疏水相互作用色谱法、亲和色谱法、羟磷灰石色谱法和外源凝集素色谱法。在必要时,在完成多肽的构型中,可以使用蛋白重折叠步骤。如果期望,可以将高效液相色谱(HPLC)用于最终的纯化步骤。
【000219】各种哺乳动物细胞培养系统也可以被用于表达重组蛋白。哺乳动物表达系统的实例包括猴肾纤维原细胞的COS-7系(由Gluzman描述(1981),Cell 23:175),和其它能表达来自相容载体的蛋白的细胞系,如C127、3T3、CHO、海拉细胞(HeLa)和BHK细胞系。
【000220】本发明也涉及如下序列的多肽的变体:B组氨基酸序列,与其基本上同一的序列,或包括其至少5,10,15,20,25,30,35,40,50,75,100或150个连续氨基酸的片段。尤其是,这些变体可以通过如下方式在氨基酸序列上与B组氨基酸序列和与其基本上同一的序列不同,即通过一个或多个替代、增加、缺失、融合和平截,这些方式也可以以其任意组合存在。
【000221】这些变体可以是天然存在的或是体外产生的。尤其是,这样的变体可以使用基因工程技术来产生,如定点诱变、随机化学诱变、核酸外切酶III缺失方法和标准克隆技术。可以选择地,这样的变体、片段、类似物或衍生物可以使用化学合成或修饰方法来产生。
【000222】产生变体的其它方法对于本技术领域的普通技术人员是已知的。这些方法包括这样的方法,即其中从天然分离物得到的核酸序列被修饰,以产生编码多肽的核酸,所述核酸具有能增强它们在工业或实验室应用的特性。在这样的方法中,产生且表征了与来自天然分离物的序列具有一个或多个核苷酸差异的大量变体序列。典型地,这些核苷酸差异导致与由来自天然分离物的核酸编码的多肽的氨基酸变化。
易错PCR(Error Prone PCR)
【000233】例如,可以使用易错PCR产生变体。在易错PCR中,PCR是在DNA聚合酶的拷贝保真度较低的条件下进行的,这样可以沿着PCR产物的整个长度获得较高的点突变率。易错PCR描述在Leung等人(1989),Technique 1:11-15和Caldwell等人(1992),PCR Methods Applic.2:28-33,将其公开内容完整地引用于此作为参考。简单来说,在这些方法中,将被诱变的核酸与PCR引物和试剂(例如反应缓冲液、MgCl2、MnCl2、Taq聚合酶和适当浓度的dNTP)混和,以沿着PCR产物的整个长度获得较高的点突变率。例如,可以使用将被诱变的20微微毫摩尔核酸,30皮摩尔每一种PCR引物,含有如下成分的反应缓冲液:50mM KCl、10mM Tris HCl(pH 8.3)和0.01%明胶、7mM MgCl2、0.5mM MnCl2、5单位Taq聚合酶、0.2mM GTP、0.2mM dATP、1mM dCTP和1mM dTTP。PCR可以被进行30个循环:94℃1分钟,45℃1分钟,72℃1分钟。然而,应该意识到,这些参数在适当的时候可以改变。被诱变的核酸被克隆到适当的载体中,并且评价由被诱变的核酸编码的多肽的活性。
【000224】也可以使用寡核苷酸指导的诱变来产生变体,以便在相关的任何克隆DNA中产生位点特异性突变。寡核苷酸诱变描述在Reidhaar-Olson等人(1988),Science,241:53-57,将其公开内容完整地引用于此作为参考。简单来说,在这些方法中,合成了被引入克隆DNA的具有一个或多个突变的大量双链寡核苷酸,且被插入到将被诱变的克隆DNA中。回收含有诱变DNA的克隆,并且评价由它们所编码的多肽的活性。
装配PCR(Assembly PCR)
【000225】另一种产生变体的方法是装配PCR。装配PCR涉及从小DNA片段的混合物装配PCR产物。大量不同的PCR反应在相同的小瓶中平行发生,一个反应的产物引导另一反应的产物。装配PCR描述在美国专利号5,965,408中,将其公开内容完整地引用于此作为参考。
有性PCR诱变(Sexual PCR mutagenesis)
【000226】仍然,产生变体的另一个方法是有性PCR诱变。在有性PCR诱变中,由于基于序列同源性的DNA分子的随机断裂,随后是PCR反应中引物延伸的交换固定,强制同源重组在具有不同但高度相关的DNA序列的DNA分子之间体外发生。有性PCR诱变描述在Stemmer(1994),Proc.Natl.Acad.Sci.USA 91:10747-10751中,将其公开内容完整地引用于此作为参考。简单来说,在这样的方法中,用DNAse消化大量将被重组的核酸,以产生平均大小为大约50-200核苷酸的片段。所需平均大小的片段被纯化,且重悬浮在PCR混合物中。PCR在有助于核酸片段之间的重组的条件下进行。例如,通过将纯化片段以10-30ng/μl的浓度重悬浮在包括如下成分的溶液中来进行:每种dNTP为0.2mM、2.2mM MgCl2、50mM KCl、10mM Tris HCl、pH9.0、和0.1%Triton X-100。每100μl反应混合物中加入2.5单位Taq聚合酶,使用如下时间段进行PCR:94℃ 60秒、94℃ 30秒、50-55℃30秒、72℃30秒(30-45次)和72℃5分钟。然而,应该意识到,这些参数在适当的时候可以变化。在一些方面,寡核苷酸可以被包括在PCR反应中。在其它方面,在第一组PCR反应中可以使用DNA聚合酶I的克列诺(Klenow)片段,在随后的PCR反应组中可以使用Taq聚合酶。分离重组序列,评价由这些重组序列所编码的多肽的活性。
体内诱变(In vivo Mutagenesis)
【000227】也可以通过体内诱变产生变体。在一些情况下,通过在细菌菌株如大肠杆菌菌株中繁殖相关序列来产生相关序列中的随机突变,该菌株以一个或多个DNA修复途径进行突变。这样的“增变”菌株比野生型母体的菌株具有较高的随机突变率。在这些菌株中繁殖DNA,将最终在DNA中产生随机突变。适合在体内诱变中使用的增变菌株,描述在PCT出版物序列号WO 91/16427中,将其公开内容完整地引用于此作为参考。
盒式诱变(Cassette Mutagenesis)
【000228】也可以通过盒式诱变产生变体。在盒式诱变中,用与天然序列不同的合成寡核苷酸“盒子”取代双链DNA分子的一个小区域。寡核苷酸通常完全和/或部分地含有随机化的天然序列。
递归集合诱变(Recursive Ensemble Mutagenesis)
【000229】递归集合诱变也可以被用来产生变体。递归集合诱变是一种蛋白质工程(蛋白质诱变)算法,该算法是开发用来产生表型相关变体的不同群体的,所述群体的成员在氨基酸序列上不同。该方法使用回馈机制来控制组合盒式诱变的连续循环。递归集合诱变描述在Arkin等人(1992),Proc.Natl.Acad.Sci.USA,89:7811-7815中,将其公开内容完整地引用于此作为参考。
指数集合诱变(Exponential Ensemble Mutagenesis)
【000230】在一些情况下,使用指数集合诱变来产生变体。指数集合诱变是用来产生组合文库的一种方法,所述组合文库具有高百分比的独特和功能突变体,其中残基小组被平行地随机化,以便在每一改变的位置上鉴定导致功能蛋白质的氨基酸。指数集合诱变描述在Delegrave等人(1993),Biotechnology Research 11:1548-1552,将其公开内容完整地引用于此作为参考。
随机和定点诱变(Random and site-directed Mutagenesis)
【000231】随机和定点诱变描述在Arnold(1993),Current Opinions inBiotechnology 4:450-455,将其公开内容完整地引用于此作为参考。
改组方法(Shuffling Procedures)
【000232】在一些方面,使用改组方法来产生变体,其中编码不同多肽的大量核酸的部分被融合在一起,以产生编码嵌合多肽的嵌合核酸序列,正如美国专利号5,965,408和5,939,250中所描述的,将其公开内容完整地引用于此作为参考。
【000233】B组氨基酸序列的多肽的变体可以是这样的变体,其中B组氨基酸序列的多肽的一个或多个氨基酸残基被一个保守或非保守氨基酸残基(例如一个保守氨基酸残基)取代,这样取代的氨基酸残基可以由或者不由遗传密码编码。
【000234】保守取代是指多肽中的一个给定氨基酸被具有类似特性的另一个氨基酸取代。通常看作保守取代的是如下取代:一个脂族氨基酸如丙氨酸、缬氨酸、亮氨酸和异亮氨酸被另一个脂族氨基酸取代;丝氨酸被苏氨酸取代,或苏氨酸被丝氨酸取代;一个酸性残基如天冬氨酸和谷氨酸被另一个酸性残基取代;具有一个酰胺基因的残基,如天冬酰胺和谷氨酰胺,被另一个具有酰胺基团的残基取代;碱性残基如赖氨酸和精氨酸被另一个碱性残基置换;和一个芳族残基如苯丙氨酸和酪氨酸被另一个芳族残基取代。
【000235】其它变体是其中B族氨基酸序列的多肽的一个或多个氨基酸残基包括一个取代基团的变体。
【000236】仍然,其它变体是其中多肽与另一种化合物相关的变体,如增加多肽半衰期的化合物(例如聚乙二醇)。
【000237】其它变体是其中其它氨基酸被融合到多肽上的变体,如前导序列、分泌序列、前蛋白(proprotein)序列或有助于纯化、富集、或稳定多肽的序列。
【000238】在一些方面,片段、衍生物和类似物保留了与B族氨基酸序列和与其基本上同一的序列相同的生物功能或活性。在其它方面,片段、衍生物或类似物包括前蛋白,这样片段、衍生物或类似物可以通过裂解前蛋白部分以产生活性多肽来激活。
【000239】本发明的另一个方面是多肽或其片段,它们与B组氨基酸序列,与其基本上同一的序列,或包括其至少5,10,15,20,25,30,35,40,50,75,100或150个连续氨基酸的片段具有至少大约85%、至少大约90%、至少大约95%、或高于大约95%的同源性。同一性百分比可以使用上面所描述的任意一种程序来确定,这些程序对比被比较的多肽或片段,并且确定氨基酸同源性的程度或它们之间的相似性。应该意识到,氨基酸“同源性”包括如上面所描述的那些保守氨基酸取代。在本发明的一个方面,片段可以被用来产生抗体。这些抗体可以被用来固定可用于工业过程中的腈水解酶。编码本发明的腈水解酶的多核苷酸可以以类似方式使用。
【000240】另外,同源的多肽或片段可以通过生物化学富集或纯化方法来获得。潜在同源的多肽或片段的序列通过蛋白酶消化、凝胶电泳和/或微量测序来确定。使用此处描述的任意一种程序将预期同源的多肽或片段的序列与如下序列的多肽之一进行比较:B组氨基酸序列,与其基本上同一的序列,或包括其至少5,10,15,20,25,30,35,40,50,75,100或150个连续氨基酸的片段。
【000241】本发明的另一个方面是一种分析方法,用于鉴别B组氨基酸序列或与其基本上同一的序列的变体或片段,这些变体或片段保留了与B组氨基酸序列和与其基本上同一的序列的多肽的酶功能。例如,多肽的片段或变体,可以被用来催化生物化学反应,这表明所述片段或变体保留了与B组氨基酸序列中的多肽的酶活性。
【000242】用于确定变体的片段是否保留了B组氨基酸序列和与其基本上同一的序列的多肽的酶活性的分析方法包括如下步骤:将多肽片段或变体与底物分子在允许多肽片段或变体发挥作用的条件下接触;检测,或者是底物水平有所增加,或者是多肽和底物之间反应的特定反应产物水平有所增加。
【000243】B组氨基酸序列,与其基本上同一的序列,或包括其至少5,10,15,20,25,30,35,40,50,75,100或150个连续氨基酸的片段的多肽可以在多种应用中使用。例如,多肽或其片段可以被用来催化生物化学反应。根据本发明的一个方面,提供了一种方法,用于使用B组氨基酸和与其基本上同一的序列的多肽,或编码这样的多肽的多核苷酸来水解氨基腈。在这样的方法中,将含有卤代烷烃化合物的物质与B组氨基酸序列和与其基本上同一的序列的多肽之一在有助于该化合物水解的条件下接触。
【000244】抗体-B组氨基酸序列,与其基本上同一的序列,或包括其至少5,10,15,20,25,30,35,40,50,75,100或150个连续氨基酸的片段的多肽,也可以被用来产生与酶多肽或片段特定地结合的抗体。所得到的抗体也可以被用于免疫亲和色谱法,来分离或纯化多肽或确定生物样品中是否存在多肽。在这样的方法中,将蛋白制剂,如提取物或生物样品,与能与如下序列的多肽之一特定地结合的抗体接触:B组氨基酸,与其基本上同一的序列,或前述序列的片段。
【000245】在免疫亲和方法中,抗体被结合到固相支持体上,如小珠或柱基质。将蛋白制剂在一定的条件下与抗体接触,在这样的条件下抗体与B组氨基酸序列、与其基本上同一的序列、或其片段的多肽之一特定地结合。在以去除非特定地结合的蛋白质的洗涤之后,洗脱特定地结合的多肽。
【000246】在生物样品中蛋白与抗体结合的能力可以使用本技术领域的技术人员所熟知的多种方法中的任意一种来确定。例如,可以通过用可检测的标记物标记抗体来确定结合,如荧光试剂、酶标记物、或放射性同位素。另外,抗体与样品的结合可以使用其上具有这样的可检测标记的二次抗体来检测。特定的测定方法包括ELISA测定法、夹心测定法、放射免疫测定法和蛋白质印迹测定法。
【000247】本发明的抗体可以被结合到固相支持体上,并且可以被用来固定本发明的腈水解酶。正如上面所描述的,在工业化学方法中可以使用这样固定的腈水解酶,用于将腈转化为大范围的有用的产品和中间产物。
【000248】通过将多肽直接注射到动物或通过将多肽施用给动物,可以获得针对如下序列的多肽产生的多克隆抗体:B组氨基酸序列,与其基本上同一的序列,或包括其至少5,10,15,20,25,30,35,40,50,75,100或150个连续氨基酸的片段。然后,如此获得的抗体将结合多肽本身。以这种方式,即使仅仅编码多肽片段的序列也可以被用来产生能结合到整个天然多肽上的抗体。然后,这样的抗体被用来从表达该多肽的细胞中分离多肽。
【000249】为了制备单克隆抗体,可以使用能通过连续细胞系培养物产生抗体的任何技术。实例包括杂交瘤技术(Kohler和Milstein(1975),Nature,256:495-497,将其公开内容完整地引用于此作为参考),三系杂交瘤(trioma)技术、人B-细胞杂交瘤技术(Kozbor等人(1983),Immunology Today 4:72,将其公开内容完整地引用于此作为参考),和EBV-杂交瘤技术(Cole等人(1985),在MonoclonalAntibodies and Cancer Therapy,Alan R.Liss,Inc.,第77-96页中,将其公开内容完整地引用于此作为参考)。
【000250】所描述的用于产生单链抗体的技术(美国专利号4,946,778,将其公开内容完整地引用于此作为参考)适合用来产生针对多肽的单链抗体,例如B组氨基酸序列或其片段的多肽。另外,转基因大鼠可以被用来表达针对这些多肽或片段的人源化抗体。
【000251】所产生的针对如下序列的多肽的抗体可以被用来从其它生物体和样品筛选类似多肽:B组氨基酸序列,与其基本上同一的序列,或包括其至少5,10,15,20,25,30,35,40,50,75,100或150个连续氨基酸的片段。在这样的技术中,将来自生物体的多肽与抗体接触,并且检测那些与抗体特定地结合的多肽。上面所描述的任何技术可以被用来检测抗体结合。一个这样的筛选方法描述在“Methods for Measuring Cellulase Activities”,Methods in Enzymology,160:87-116,将其公开内容完整地引用于此作为参考。
使用包括核酸的全细胞
【000252】本发明提供了使用已经用编码一个或多个本发明的腈水解酶的核酸(或其活性片段)转化的全细胞。本发明也提供了在底物上进行腈水解酶的反应中使用这样的全细胞。因此,本发明提供了使用包括此处公开的至少一个核酸或多肽(SEQ ID NOS:1-386)的全细胞来水解羟腈或氨基腈键合的方法。例如,用编码腈水解酶的核酸稳定地转染(本发明也包括瞬时转染或转化全细胞)的全细胞是本发明的一个方面。这样的细胞是有用的,作为反应混合物中的试剂来作用于底物,并且表现出腈水解酶活性。
序列分析软件
【000253】两个或多个序列之间的同一性或同源性百分比通常是使用序列分析软件(例如位于University of Wisconsin Biotechnology Center,Madison,WI的遗传学计算机小组的序列分析软件包)来测量的。这样的软件通过将同一性或同源性百分比分配给各种缺失、替代和其它修饰来匹配相似序列。在两个或多个核酸或多肽序列的情形中,术语“同一性百分比”指,在指定区域或比较“窗口”,按照最大对应比对之后进行比较时,相同的核苷酸或氨基酸残基的百分比。在一些算法中,保守氨基酸取代可以被认为是“同一的”,在密码子的摆动位点上的变化可以被认为是“同一的”。
【000254】“比对”指将两个或多个序列排列起来以获得最大对应的过程,目的是为了评价同一性或同源性程度,正如相关比对算法的环境中所定义的。
【000255】对于序列比较,通常一个序列作为参考序列发挥作用,将试验序列与其进行比较。当使用序列比较算法时,试验序列和参考序列被输入到计算机中,如果必要,指定子序列坐标,并且为特定算法指定序列算法程序参数。可以使用缺省程序参数,或可以指定替换性参数。然后,基于程序参数,序列比较算法计算试验序列相对于参考序列的同一性百分比或同源性百分比。
【000256】正如此处所用,“比较窗口”是核酸或氨基酸序列中连续位置的一个片段,包括20个到600个,通常大约50个到大约200个,更通常地大约100个到大约150个核苷酸或残基,在两个序列被最优化地比对后,该片段可以与具有相同或不同数量的连续位置的参考序列进行比较。进行序列比较算法的方法在本技术领域是已知的。可以进行序列的最适比对,例如通过Smith and Waterman(1981),Adv.Appl.Math.2:482的局部同源性算法,通过Needleman and Wunsch(1970),J.Mol.Biol 48:443的同源性比对算法,通过Pearson and Lipman(1988),Proc.Natl.Acad.Sci.USA 85:2444-24448的相似性搜索方法,通过这些算法的计算机化实施,或通过人工比对和视觉观察。确定同源性或同一性的其它算法包括,例如BLAST程序(基本局部比对搜索工具,National Center for Biological Information),BESTFIT,FASTA和TFASTA(位于Madison,WI的遗传学计算机小组的威斯康星遗传学软件包(Wisconsin Genetics Software Package)),ALIGN,AMAS(多重比对序列分析,Analysis of Multiply Aligned Sequences),AMPS(多重蛋白序列分析,Alignment of Multiple Protein Sequence),ASSET(比对片段统计学评估工具,Aligned Segment Statistical Evaluation Tool),BANDS,BESTSCOR,BIOSCAN(生物学序列比较分析节点,Biological Sequence Comparative Analysis Node),BLIMPS(Blocks IMProved Searcher),Intervals and Points(间隔和点),BMB,CLUSTALV,CLUSTAL W,CONSENSUS,LCONSENSUS,WCONSENSUS,Smith-Waterman算法,DARWIN,Las Vegas算法,FNAT(强制核苷酸比对工具,Forced NucleotideAlignment Tool),Framealign(框架比对),Framesearch(框架搜索),DYNAMIC,FILTER,FSAP(Fristensky序列分析软件包,Fristensky Sequnence AnalysisPackage),GAP(全球比对程序,Global Alignment Program),GENAL,GIBBS,GenQuest,ISSC(敏感序列比较,Sensitive Sequence Comparison),LALIGN(局部序列比对,Local Sequence Alignment),LCP(局部含量程序,Local ContentProgram),MACAW(多重比对构造和分析工作平台,Multiple alignment Constructionand Analysis Workbench),MAP(多重比对程序,Multiple Alignment Program),MBLKP,MBLKN,PIMA(模式诱导多序列比对,Pattem-Induced Multi-sequenceAligmnent),SAGA(借助遗传学算法的序列比对)Sequence Alignment by GeneticAlgorithm)和WHAT-IF。这些比对算法也可以被用来筛选基因组数据库,以鉴别具有基本上同一的序列的多核苷酸序列。许多基因组数据库是可以获得的,例如作为人类基因组测序计划(Human Genome Sequencing Project)的一部分,可以获得人基因组的重要部分(J.Roach,http://weber.u.Washington.edu/~roach/human_genome_progress 2.html)(Gibbs,1995)。至少二十一个其它基因组已经被测序,例如包括生殖器支原体(Mgenitalium)(Fraser等人,1995),甲烷球菌(M.jannaschii)(Bult等人,1996),流感嗜血菌(H.influenzae)(Fleischmann等人,1995),大肠杆菌(E.coli)(Blattner等人,1997),和酿酒酵母(S.cerevisiae)(Mewes等人,1997),和果蝇(D.melanogaster)(Adams等人,2000)。对生物体模型的基因组进行测序也已经取得了重大进步,如小鼠、线虫(C.elegans)和拟南芥(Arabadopsis sp)。几个包括基因组信息的数据库由不同的组织保存,其中注释了一些功能信息,这些数据库可以通过因特网得到,例如
http://wwwtigr.org/tdb;
http://www.genetics.wisc.edu;http://genome-www.stanford.edu/~ball;
http://hiv-web.lanl.gov;http://www.ncbi.nlm.nih.gov;
http://www.ebi.ac.uk;
http://Pasteur.fr/other/biology;和http://www.genome.wi.mit.edu。
【000257】有用的算法的实例是BLAST和BLAST2.0算法,分别描述在Altschul等人(1997),Nuc.Acids Res.25:3389-3402,和Altschul等人(1990),J.Mol.Biol.215:403-410中。进行BLAST分析的软件可以通过National Center forBiotechnology Information(http://www.ncbi.nlm.nih.gov/)公开获得。该算法包括首先通过识别查询序列中短字符的长度W来识别高得分序列对(HSPs),当与数据库序列中相同长度的字符比对时,这或者匹配或者满足一些正值阈值得分T。T指相邻字符得分阈值(Altschul等人,同上)。这些初始相邻字符命中作为寻找含有它们的较长HSPs的初始搜索的种子。字符命中沿着每一序列在两个方向延伸,一直延伸到累积比对得分能被增加。使用参数M(一对匹配残基的奖赏得分;通常大于零)计算累积得分。对于氨基酸序列,用得分矩阵来计算累积得分。当出现如下情况时,中断字符命中在每一方向的延伸:累积比对得分从其所得到的最大值下降了数量X;由于一个或多个负得分残基比对的累积,累积得分等于或低于零;或者已经到达任一序列的末端。BLAST算法参数W、T和X确定比对的灵敏度和速度。对于核苷酸序列,BLASTN程序使用缺省值,字符长度(W)为11,期望值(E)为10,M=5,N=-4,并且比较双链。对于氨基酸序列,LASTP程序使用的缺省值为:字符长度(W)为3,期望值(E)为10和BLOSUM62得分矩阵(参见Henikoff和Henikoff(1989),Proc.Natl.Acad.Sci.USA 89:10915)。
【000258】BLAST算法也可以进行两个序列之间相似性的统计分析(例如参见,Karlin和Altschul(1993),Proc.Natl.Acad.Sci.USA 90:5873)。由BLAST算法提供的一个相似性测量是最小总和概率(P(N)),这提供了两个核苷酸或氨基酸序列之间的匹配偶然发生的概率指示。例如,如果在试验核酸与参考核酸的比较中,最小总和概率低于大约0.2,低于大约0.01,或低于大约0.001,核酸被认为与参考序列是相似的。
【000259】在一个方面,使用基本局部比对搜索工具(“BLAST”)来评价蛋白和核酸序列同源性。尤其是,五个特定的BLAST程序被用来完成如下任务:
(1)BLASTP和BLAST3将氨基酸查询序列与蛋白质序列数据库进行比较;
(2)BLASTN将核苷酸查询序列与核苷酸序列数据库进行比较;
(3)BLASTX将查询核苷酸序列(两个链)的六框架概念翻译产物与蛋白质序列数据库进行比较;
(4)TBLASTN将查询蛋白序列与在所有六个阅读框(两个链)中翻译的核苷酸序列数据库进行比较;
(5)TBLASTX将核苷酸查询序列的六框翻译与核苷酸序列数据库的六框翻译进行比较。
【000260】BLAST程序通过识别相似片段来识别同源序列,此处被称作查询氨基酸或核酸序列和试验序列之间对“高得分片段对”,所述试验序列可以从蛋白质或核酸序列数据库得到。高打分片段对通过打分矩阵方式来鉴别(即比对),许多这样的方式在本技术领域是已知的。在一个实例中,所用的打分矩阵是BLOSUM62矩阵(Gonnet等人(1992),Science 256:1443-1445;Henikoff和Henikoff(1993),Proteins 17:49-61)。在另一个实例中,也可以使用PAM或PAM250矩阵(例如参见,Schwartz和Dayhoff(1978),Matrices for Detecting Distance Relationships:Atlasof Protein Sequence and Structure,Washington:National Biomedical ResearchFoundaion)。BLAST程序可以通过美国国家医药图书馆(U.S.Naional Library ofMedicine)获得,例如通过
www.ncbi.nlm.nih.gov。
【000261】可以对上述算法所用的参数进行调整,依赖于序列长度和所考虑的同源性程度。在一些方面,在没有用户说明书的情况下,这些参数可以是算法所用的缺省参数。
【000262】在一个特定的方面,本发明提供了一种方法,用于修饰小分子,包括将此处描述的多核苷酸编码的多肽或其酶活性片段与小分子接触,以产生修饰的小分子。对修饰的小分子的文库进行测试,以确定表现出期望活性的文库中是否存在修饰的小分子。产生具有期望活性的修饰小分子的特定生物催化反应是通过如下步骤来鉴别的:系统地除去用于产生文库的一部分的每一生物催化反应;然后测试文库部分中产生的小分子中存在还是不存在具有期望活性的修饰小分子。任选地重复产生具有期望活性的修饰小分子的特定生物催化反应。用一组生物催化剂完成生物催化反应,所述一组生物催化剂与小分子结构中发现的不同结构部分发生反应,每一生物催化剂对于一个结构部分或一组相关结构部分是特异的;和每一生物催化剂与含有不同结构部分的许多不同的小分子发生反应。
【000263】腈水解酶用途的一些方面是:
α-羟基酸-腈水解酶通过水解羟腈产生α-羟基酸。这种情况的一个实例是产生扁桃酸及其衍生物。这种类型的一个重要应用包括从扁桃腈以高产量和高对映选择性商业化地产生(R)-扁桃酸。已经发现扁桃酸及其衍生物作为产生许多手性药物和农用产品的中间产物和拆解试剂具有广泛的应用。先前将鲜为人知的腈水解酶应用于使用类似底物的方法中的尝试,已经为非常低的活性、生产率和选择性所困扰。
苯基乳酸衍生物
【000264】另一个应用是以高产量和高对映体选择性产生(S)-苯基乳酸衍生物。已经发现苯基乳酸在产生许多手性药物和农用产品中具有广泛应用。
β-羟基酸
【000265】出于重要的商业考虑,提供腈水解酶,产生4-氰基-3-羟基丁酸的任一对映异构体,其(R)-对映异构体是合成药物LIPITORTM(立普妥,斯达汀或阿伐他汀)中的一个关键性中间产物。
【000266】下述腈水解酶是用于将羟基戊二酰基腈转化为(R)-3-羟基-4-氰基-丁酸中有用的腈水解酶的多个实例:SEQ ID NOS:205,206,SEQ ID NOS:207,208,SEQ ID NOS:195,196,SEQ ID NOS:43,44,SEQ ID NOS:321,322,和SEQ IDNOS:237,238。上述示意性实例表明“选择的腈水解酶”可以被用来将羟基戊二酰基腈转化为(S)-3-羟基-4-氰基-丁酸:SEQ ID NOS:107,108,SEQ ID NOS:109,110,SEQ ID NOS:111,112,SEQ ID NOS:127,128,SEQ ID NOS:129,130,SEQ ID NOS:133,134,SEQ ID NOS:113,114,SEQ ID NOS:145,146,SEQ IDNOS:101,102,SEQ ID NOS:179,180,SEQ ID NOS:201,202,SEQ ID NOS:159,160,SEQ ID NOS:177,178,SEQ ID NOS:181,182,SEQ ID NOS:183,184,SEQ ID NOS:185,186,SEQ ID NOS:57,58,SEQ ID NOS:197,198,SEQ ID NOS:59,60,SEQ ID NOS:67,68和SEQ ID NOS:359,360。
【000267】参考下面实施例将对本发明做进一步的描述;然而,应该理解的是,本发明不限于这些实施例。由于本发明的公开内容描述了实施本发明的当前的最好方式,许多修改和变化将展现给本技术领域的普通技术人员,而不背离本发明的范围和精神。所有在与权利要求的等同之下的意思和范围内出现的变化、修改和变更被认为在权利要求的范围内。
实施例
实施例1:噬菌粒感染
【000268】对于被用来筛选腈水解酶的每一文库,感染物是按照如下所述准备的:
将5ml SEL700细胞的OD600nm=1重悬浮液和1ml将被筛选的噬菌粒文库混和。将该组合物在37℃的水浴中温育45分钟。
【000269】使用该感染物,在10mM MgSO4中进行连续稀释,使用10μl等分试样的感染物。
文库滴度 进行稀释
~105cfu/ml 10-1稀释
~106cfu/ml 10-1,10-2稀释
~107cfu/ml 10-1,10-2,10-3稀释
【000270】将60μl每一下述稀释物放置到一个小的LB-Kan50平板上:
文库滴度 进行稀释
~105cfu/ml 未稀释的感染物,10-1稀释
~106cfu/ml 10-1,10-2稀释
~107cfu/ml 10-2,10-3稀释
【000271】在台式离心机中,在4℃下,4.6k rpm将感染物中的细胞离心10分钟,以形成沉淀。从所得到的沉淀中轻轻倒出上清液。将细胞重悬浮在剩余液体中。将所有重悬浮的细胞放置到一个单个的大LB-Kan50平板上。将所有平板在30℃培养过夜。
实施例2:选择性筛选
【000272】用~4毫升10mM MgSO4重悬浮每一感染平板上的细胞。将重悬浮液放置到试管中。用~3毫升10mM MgSO4重悬浮每一平板上剩余的细胞,并且与来自同一平板上的第一重悬浮液组合。用10mM MgSO4使每一试管的体积达到12ml,将试管进行剧烈涡流摇动。在台式离心机上,在4℃下,4.6k将试管离心10分钟,以形成沉淀。从每一重悬浮液中,轻轻倒出上清液。用10ml 10mM MgSO4重悬浮每一试管中经过洗涤的细胞。将每一文库的重悬浮液在4℃保存,直到准备建立选择性培养物。
【000273】对于每一重悬浮液,使用如下过程建立选择性培养物:
1)制备腈水解酶选择性培养基,使用包括0.2%葡萄糖、不含氮和50μg/ml卡那霉素(仅仅对于pBK噬菌粒文库而言;对于pBS文库,使用氨苄青霉素)的1XM9培养基。
2)将5ml培养基等分到50ml螺旋口圆锥形试管中。
3)将25μl所保存的重悬浮液加入到试管中。
4)加入5μ1脂肪腈,以使终浓度到达8.8mM。可以使用其它腈底物来代替脂肪腈。
5)将所得到的组合物在30℃培养。
对于每一腈底物重复步骤1-5。
实施例3:来自选择性培养物的阳性腈水解酶克隆的分离
【000274】将正在生长的十(10)μl选择性培养物,划线接种到一个小的LB-Kan50平板上,并且允许在30℃生长两个晚上。挑取五个分离的菌落形成单位(cfu),并使其中的每一个在30℃下、在2ml腈水解酶选择性培养基中生长。监控每一培养物(其中,生长表明阳性菌落形成单位被挑取),当监控表明它处于生长的稳定期时将其拿开。用一(1)ml培养物制备噬菌粒制剂,并且用40μl洗脱缓冲液洗脱。用Pst I/Xho I或Sac I/Kpn I限制酶剪接五到八(5-80)μl DNA,以便从载体中去除插入物。完成限制性片段长度多态性(RFLP)测定,以鉴别插入物的大小。对插入物进行测序。
实施例4:腈水解酶的筛选和表征
【000275】筛选针对靶底物的本发明的腈水解酶。在初次筛选中显示出水解活性的那些酶中,选择其对映体选择性高于20%对映体过量(ee)的酶,用于进一步表征。对那些酶的选择基于:1)具有针对相关底物之一的活性和2)表现出大于35%ee(对映体过量)。这一筛选过程的结果如上面表1所示。用于筛选的产物是:D-苯基甘氨酸,L-苯基乳酸,(R)2-氯扁桃酸,(S)-环己基扁桃酸,L-2-甲基苯基甘氨酸,(S)-2-氨基-6-羟基己酸,和4-甲基-L-亮氨酸。
针对靶底物D-苯基甘氨酸的腈水解酶的筛选
苯基甘氨酸腈 D-苯基甘氨酸
【000276】进行苯基甘氨酸腈的水解。这些酶中的一些酶显示出高于20%的ee值,它们被选择用于初步表征。
【000277】基于初步表征实验,鉴别关于苯基甘氨酸腈的许多推断命中,对这些酶积累了大量数据。该数据显示出许多一般的特性:大部分酶的活性最适pH为7,通常情况下,在较低pH值下,对映选择性有所提高。这些酶被发现在较高温度下更活跃,尤其是38℃,尽管这一温度通常导致较低的对映体选择性。在反应中,使用水混溶共溶剂显示出是一个实用的选择。在酶反应中,含有10-25%甲醇(v/v)基本上不影响酶活性,而且在许多情况下,导致对映选择性的增加。使用两相系统也已经显示出一些成功的征兆,通过加入高达70%(v/v)的己烷维持酶的活性水平,在一些情况下加入甲苯。然而,在两相系统中使用乙酸乙酯导致较低的活性。
【000278】对于已经鉴别出对于苯基甘氨酸腈有活性的酶,好几种酶的对映选择性显示出维持了高于35%ee的成功标准。初步表征数据表明一些酶对D-苯基甘氨酸显示出高对映选择性,相应地至产物的转化率为40-60%。进一步的研究提示,这些酶中的一些酶的活性速率比底物的外消旋速率更快。降低酶的浓度导致改进的对映选择性;因此,可以发现,通过控制化学外消旋作用和酶活性的相对速率来获得一些益处。
针对靶底物(R)-2-氯扁桃酸的腈水解酶的筛选
2-氯扁桃腈 (R)-2-氯扁桃酸
【000279】已经鉴别出这些酶对于2-氯扁桃腈显示出活性。对于2-氯扁桃腈和苯基甘氨酸腈具有活性的酶之间,存在高度重叠。这些酶中的许多酶也形成一个不同的序列家族。
【000280】对于活性酶,较高的温度和中性pH显示出导致最高活性。对于绝大多数的腈水解酶,在较高的温度下,对映选择性也有所增加,尤其是在38℃。这些酶在存在高达25%甲醇或10%异丙醇的情况下保持了它们的活性;在这些情况中的许多情况下,对映选择性也有所提高。在两相系统中的活性,在很大程度上,是可以与含水条件相比较的,尤其是将己烷作为非水相时;在不同腈水解酶之间,观察到了对于甲苯的耐受性的变化。
表2.从2-氯扁桃腈的对映选择性水解的表征实验确定的最适条件的总结
SEQ ID NOS: | 最适pH | 最适温度℃ | 溶剂耐受性 |
385,386 | 7 | 38 | 25%MeOH |
169,170 | 5 | 38 | 25%MeOH,10%IPA |
185,186 | 7 | 38 | 25%MeOH,10%IPA |
47,48 | 7 | 38 | 10%IPA |
197,198 | 6 | 55 | 25%MeOH,10%IPA |
187,188 | 7 | 38 | 10%MeOH,40%IPA |
217,218 | 7 | 38 | 25%MeOH,10%IPA, |
70%己烷,40%甲苯 | |||
55,56 | 7 | 38 | 10%MeOH,IPA,70%己烷 |
167,168 | 9 | 38 | 10%MeOH,IPA,70%己烷 |
15,16 | 7 | 38 | 25%MeOH,10%IPA,70%己烷,40%甲苯 |
针对靶底物(S)-苯基乳酸的腈水解酶的筛选:
苯乙醛羟腈 (S)-苯基乳酸
【000281】所实验的许多腈水解酶对于苯乙醛羟腈表现出活性。这些酶中的许多酶是两个相关序列家族的一部分,与对于苯基甘氨酸腈和氯扁桃腈表现出活性的那些酶不同。
【000282】酶的最适pH通常高于7(即pH8或9),具有在这些水平将表现出的较高的对映选择性。许多酶在较高温度显示出较好的活性,尤其在38℃。温度对于酶的对映选择性的影响可以变化;在许多情况下,这一特性在较高温度下稍微低一些。当酶对于加入共溶剂具有耐受力时,尤其是10%(v/v)甲醇,加入这些共溶剂没有得到活性或对映选择性的改善。使用两相系统再次显示出是可行的。
【000283】表3.从苯乙醛羟腈的对映选择性水解的表征实验确定的最适条件的总结
SEQ ID NOS: | 最适pH | 最适温度℃ | 溶剂耐受性 |
103,104 | 7 | 55 | 10%MeOH,IPA |
99,100 | 8 | 38 | 10%MeOH,70%己烷,甲苯 |
183,184 | 9 | 38 | 10%MeOH,IPA,70%甲苯,己烷 |
173,174 | 5 | 38-55 | 25%MeOH,IPA,70%己烷,甲苯 |
213,214 | 7 | 38 | 10%MeOH,25%IPA, |
70%己烷,甲苯 | |||
61,62 | 7 | 38 | 10%MeOH,70%己烷,甲苯 |
205,206 | 8 | 38-55 | 10%MeOH,IPA,40%己烷,甲苯 |
207,208 | 8 | 38 | 10%MeOH,70%己烷 |
309,210 | 8 | 38 | 10%MeOH,40%己烷,甲苯 |
195,196 | 8 | 38 | 10%MeOH,40%己烷,甲苯 |
43,44 | 9 | 38 | 10%MeOH,40%己烷 |
161,162 | 9 | 38 | 25%MeOH,IPA,10%己烷,甲苯 |
175,176 | 6 | 38-55 | 10%MeOH,IPA,40%己烷 |
293,294 | 6 | 38 | 10%MeOH,IPA,40%己烷 |
针对靶底物L-2-甲基苯基甘氨酸的腈水解酶的筛选
2-甲基苯基甘氨酸腈 L-2-甲基苯基甘氨酸
【000284】腈水解酶已经显示出对该底物的活性,优先产生了D-2-甲基苯基甘氨酸,而不是所需的L-2-甲基苯基甘氨酸。
相对于靶底物L-羟基正亮氨酸((S)-2-氨基-6-羟基己酸)的腈水解酶的筛选
5-羟基戊醛氨基腈 L-羟基正亮氨酸
【000285】已经分离了许多对2-氨基-6-羟基己烷腈显示出活性的腈水解酶。所有这些酶对于产物的L-异构体显示出对映选择性。
【000286】这些酶在较高的pH都显示出较高的对映选择性,并且与所实验的其它腈水解酶相比,对加入溶剂似乎更敏感。尽管活性是在存在有机溶剂的情况下检测的,但通常低于含水对照组的活性。再一次地,酶的活性受到酸性产物和醛起始物质的负面影响。
表4.从2-氨基-6-羟基己烷腈的对映选择性水解的表征实验所确定的最适条件的总结
SEQ ID NOS: | 最适pH | 最适温度℃ | 溶剂耐受性 |
217,218 | 9 | 38 | 10%MeOH |
55,56 | 9 | 38 | 无 |
187,188 | 9 | 38 | 10%MeOH |
167,168 | 9 | 38 | 无 |
221,222 | 9 | 38 |
【000287】在所确认的命中酶中,观察到的对于2-氨基-6-羟基己烷腈的水解活性范围
相对于靶底物4-甲基-D-亮氨酸和4-甲基-L-亮氨酸的腈水解酶的筛选
3,3-二甲基正丁醛氨基腈 4-甲基-L-亮氨酸
4-methyl-D-leucine
4-甲基-D-亮氨酸
【000288】用好几种腈水解酶进行2-氨基-4,4-二甲基戊烷腈的水解。在这些腈水解酶中,一些显示出将腈水解为相应酸的L-异构体,选择这些腈水解酶用于进一步表征。
表5.从2-氨基-4,4-二甲基戊烷腈的对映选择性水解的表征实验确定的最适条件的总结
SEQ ID NOS: | 最适pH | 最适温度℃ | 溶剂耐受性 |
103,104 | 7 | 23 | 25%MeOH,10%IPA |
59,60 | 8 | 23 | 25%MeOH |
221,222 | 6 | 38 | 25%MeOH,10%IPA |
相对于靶底物(S)-环己基扁桃酸的腈水解酶的筛选
环己基扁桃腈 (S)-环己基扁桃酸
相对于靶底物扁桃腈的腈水解酶的筛选
扁桃腈 (R)-扁桃腈
【000289】在扁桃腈上也筛选腈水解酶收集物。腈水解酶活跃地水解苯基甘氨酸腈和氯扁桃腈。
确定对映选择性的酶促测定法
【000290】在确定手性α-羟基酸和α-氨基酸的光谱系统的设计中,开发并且使用了允许检测产物形成和对映选择性的基于酶的测定方法。
【000291】图6和7描述了基于乳酸脱氢酶(L-LDH和D-LDH)和氨基酸氧化酶(L-AA氧化酶(L-AA Oxid)和D-AA氧化酶(D-AA Oxid))检测α-羟基酸和α-氨基酸的光谱系统。选择这些酶的原因是,据报道它们具有相当广泛的底物范围,仍然保留了接近绝对对映特异性。
【000292】该系统的全部可行性已经被建立(表12)。既不是亲本羟基腈,也不是氨基腈被次生或检测酶代谢,因此起始材料没有受到干扰。未经过热处理的细胞裂解产物导致LDH系统的背景活性;然而,加热灭活消除了背景活性。细胞裂解产物似乎不干扰AA氧化酶测定。一个考虑因素是AA氧化酶的灭活,所述AA氧化酶通过残余的氰化物使用了FMN辅因子。然而,对照组研究表明,在2mM PGN(其能释放高达2mM HCN)灭活不是一个问题。该测定法适合于384孔(或可能更大密度)微量滴定平板。
表6.针对酸性产物的手性检测的次生酶鉴别总结
底物 | 从商业途径获得的具有底物活性的酶 |
羟基酸产物: | |
L-乳酸 | 是 |
D-乳酸 | 是 |
L-苯基乳酸 | 是 |
D-苯基乳酸 | 是 |
S-环己基扁桃酸1 | 不适用 |
R-环己基扁桃酸1 | 不适用 |
氨基酸产物: | |
4-甲基-L-亮氨酸 | 是 |
4-甲基-L/D-亮氨酸 | 是(D-未知) |
D-苯基丙氨酸 | 是 |
R-苯基甘氨酸 | 是 |
L-高苯基乳酸 | 是 |
D-高苯基乳酸 | 是 |
L-高苯基丙氨酸 | 是 |
D-高苯基丙氨酸 | 是 |
(S)-2-氨基-6-羟基己酸 | 是 |
(R/S)-2-氨基-6-羟基己酸 | 是(D-未知) |
L-甲基苯基甘氨酸 | 1.不适用 |
D-甲基苯基甘氨酸 | 不适用 |
该测定法不适用于环己基扁桃酸和2-甲基苯基甘氨酸,原因在于叔醇不受这一特定氧化作用的影响。
实施例5:标准测定条件
【000293】制备下述溶液:
●底物储备溶液:溶解在0.1M磷酸盐缓冲液(pH7)中的50mM的氨基腈底物,或溶解在0.1M醋酸钠缓冲液(pH5)中的50mM的羟腈底物
●酶储备溶液:将3.33ml的0.1M磷酸盐缓冲液(pH7)加到每一小瓶中,其中含有20mg的冻干细胞裂解产物(终浓度为6mg蛋白质/ml)
【000294】程序:
●将100μl的50mM底物溶液加到96孔平板的适当数量的孔中
●将80μl缓冲液加到每一孔中
●将20μl酶溶液加到每一孔中
●通过用20μl缓冲液代替酶溶液建立空白对照组
●在许多实验中也包括阴性对照组,该对照组由在180μl缓冲液中的20μl酶溶液组成。该对照组一旦已经建立,细胞裂解产物不干扰产物的检测,就不包括这些对照组。
【000295】反应取样:
●通过从每一孔中取出等分试样(15-50μl),并且如下稀释样品,从而对反应进行取样:
●用于非手性HPLC分析的样品:
●苯基甘氨酸,2-氯扁桃酸和苯基乳酸:最初,用水稀释样品2倍,进一步用甲醇或乙腈稀释二倍(最终稀释:4倍)。发现对这些样品的8倍稀释导致改进的色谱分离
●(S)-2-氨基-6-羟基己酸,4-甲基亮氨酸,t-亮氨酸,2-甲基苯基甘氨酸和环己基扁桃酸:用甲醇或乙腈1∶1稀释样品。溶剂的选择基于HPLC分析方法中所用的溶剂。
【000296】·手性HPLC分析的样品:
●苯基甘氨酸,2-氯扁桃酸和苯基乳酸:对于非手性分析,正如上面所描述的那样;手性分析的样品最初稀释两倍,在该项目的随后阶段,以4倍稀释。
●(S)-2-氨基-6-羟基己酸,4-甲基亮氨酸,t-亮氨酸,2-甲基苯基甘氨酸:用甲醇或乙腈以1∶1稀释样品。
【000297】·对于每一实验,HPLC运行中包括产物的标准曲线。该曲线被描绘在X-Y轴上,根据这些曲线的斜率计算样品中产物的浓度。
【000298】·对于初步表征实验,如此取出样品,以致酶的活性处于线性相位;如此进行的目的是,可以确定参数对于反应速率影响的差异,而不是完全转化。取样时间表示在本文所包括的表中。
【000299】·使用表20和21中叙述的方法,通过HPLC分析样品。
实施例6:确定pH对于酶活性和对映选择性的影响
【000300】通过在不同的缓冲液范围内进行标准测定,研究pH对于酶活性和对映选择性的影响:
0.1M 柠檬酸磷酸pH5
0.1M 柠檬酸磷酸pH6
0.1M 磷酸钠pH7
0.1M Tris-HCl pH8
M Tris-HCl pH9
【000301】通过非手性和手性HPLC方法分析样品,其结果的实例表示在此处的表5,8和11中。
实施例7:确定温度对于酶活性和对映选择性的影响
【000302】温度对于活性和对映选择性的影响是通过在室温,38℃和55℃进行标准测定法来研究的。样品是通过非手性和手性HPLC方法来分析的,所得结果的实例表示在此处的表5,8和11中。
实施例8:确定溶剂对于酶活性和对映选择性的影响
【000303】作为两相系统,为了研究水混溶性和非水混溶性溶剂对于酶的影响,酶反应是在存在共溶剂的情况下进行的。在存在共溶剂的情况下,反应在标准条件下进行,用甲醇或异丙醇代替缓冲液。反应中的溶剂终浓度为0,10,25和40%(v/v)。
【000304】两相反应也在标准条件下进行,使用形成非水相的非水混溶性有机溶剂层。溶剂以如下水平被加入:水相的0%,10%,40%和70%(v/v)。来自这些反应的样品在真空下通过离心蒸发,并且再次溶解在甲醇或乙腈和水的50∶50混合物中。样品通过非手性和手性HPLC方法来分析。
实施例9:确定工艺成分对酶活性和对映选择性的影响
活性
【000305】通过在酶反应中加入单一成分,来建立工艺成分对于酶活性的影响。这些成分包括腈合成的起始材料、醛、氰化物和氨,以及三乙胺,三乙胺以催化量被加入到腈合成反应中。反应物的浓度用能够想到的可能的工艺条件进行选择,并且要适合酶测定法中所用的反应物水平。在一些情况下,醛和产物的溶解性相对较低;在这些情况下,将最高水平的溶解性加入到反应中,作为最高水平,该水平的10%作为较低值。
【000306】在标准条件下进行酶反应,加入一种或多种如下成分:苯甲醛、苯基甘氨酸、苯基乙醛、苯基乳酸、2-氯苯甲醛、2-氯扁桃酸、5-羟基戊醛、(S)-2-氨基-6-羟基己酸、4-甲基亮氨酸、KCN、三乙胺、NH4Cl。在标准条件下进行对照组反应,不加入添加剂。通过非手性HPLC分析样品。
稳定性
【000307】酶对工艺条件的稳定性,是通过在存在单一反应成分的情况下将酶温育预定时间期间来监控,这在标准条件下测定酶活性之前进行。在这些实验中,将酶以1.2mg蛋白质/ml的浓度,在存在下述反应成分的每一种的情况下进行温育,所述下述反应成分:甲醇、苯甲醛、苯基甘氨酸、苯乙醛、苯基乳酸、2-氯苯甲醛、2-氯扁桃酸、5-羟基戊醛、(S)-2-氨基-6-羟基己酸、KCN、NH4Cl。
测定条件:
【000308】在特定添加剂中温育0、2、6和24小时,取出50μl酶溶液,加入50μl的50mM底物储备溶液,在标准条件下测定酶活性。在加入底物后,在下述时间对反应进行取样:苯基甘氨酸腈:10分钟;苯乙醛羟腈:1小时;2-氯扁桃腈:2小时。通过仅仅在缓冲液中温育酶,进行对照组反应。使用非手性HPLC方法分析酶。
实施例10:推断命中酶的确认
当获得较高的转化时,在初步表征实验之后,在所确定的最适条件下,测定已经鉴别为推断命中的酶,以便评价它们的性能,尤其是对映选择性。用25mM底物对酶进行测定,pH和温度条件如本文中所包括的表中所记录的。除非另外说明,对于每一种酶所用的标准浓度为0.6mg/ml蛋白质。
实施例11:酶反应层析谱的选择实例
在这一部分中,所示为每一底物和产物组合的层析谱代表性实例,同时讨论了这些方法所遇到的一些挑战,以及如何解决这些挑战。
D-苯基甘氨酸
【000311】参见图8A-8E,非手性分析显示了在2.6分钟和3.2分钟洗脱的底物峰。这两个峰出现在所有含有较高浓度腈的样品中;第二个峰被认为是与腈相关的一个产物;该峰随着时间而降低,一旦已经发生了到产物的完全转化,该峰将不再出现。在图8A中所示的层析谱为空白对照组,仅仅含有腈和缓冲液;正如上面第1部分所解释的,用水和溶剂稀释所有样品。对于下面所讨论的所有样品,重复这一步骤。图8B中的层析谱中所示为酶反应样品,产物是在0.4分钟之时洗脱的。
【000312】在这些层析谱中,要注意的是在0.3分钟洗脱的小溶剂前峰。该峰进一步的表示在图8C所示的层析谱中给出,其中进行的是包括在缓冲液中的细胞裂解产物的阴性对照组。在0.4分钟,一个极小峰与产物进行共洗脱。在该方案的初始阶段,该峰被认为是有问题的,尽管对每一实验进行了适当的对照组,以便维持精确性。在这些实验中,将从细胞裂解产物得到的峰值面积从酶反应中的产物峰值面积中减去,尽管细胞裂解产物得到的峰值面积相对较小。通过进一步稀释样品以及在HPLC中使用较小注射体积,可以进一步改进该分析。在完成这些实验之后,该峰的干扰显示出达到了最小限度,正如图6C中举例说明的层析谱所示。
【000313】苯基甘氨酸的手性分析表示在图6D中的层析谱中,L-对映异构体在6分钟洗脱,D-对映异构体在11分钟洗脱。获得了这两个异构体之间的良好拆分。然而,所用的柱是非常敏感的,该柱的特征显示出随着时间有所变化,从而导致酸的洗脱时间的变化。尽管这通过使用适当的对照组和标准很容易检测,但在用D-对映异构体共洗脱腈峰中存在更大的问题(层析谱显示在图6E中)。该共洗脱的原因尚不清楚了;然而,通过使用适当的标准可以容易地检测出;此外,酸的UV光谱非常有特色,这使得使用该工具可以有效地检测共洗脱。该问题也可以通过调节流动相中的甲醇含量很容易地予以解决。
(R)-2-氯扁桃酸
【000314】氯扁桃酸和氯扁桃腈的HPLC分析,提供了许多与苯基甘氨酸样品相关的挑战。从图7A中所示的层析谱,该图仅在缓冲液中含有氯扁桃腈,显然在相同时间洗脱的峰作为图7B中所示层析谱中的产物,该峰表示扁桃酸标准。发现细胞溶解产物对该峰的贡献很小;对该峰的最大贡献似乎来自氯扁桃腈,或者来自分解产物或者来自腈制剂中的污染物。使用适当的对照组,峰值面积在整个实验过程中保持不变,发现将峰值面积从产物的峰值面积中减去得到了足够的精确度。已经进行了许多尝试来改变HPLC条件,以便在随后的时间洗脱产物峰;然而,这些尝试没有成功。图7C所示的层析谱,举例说明了产物的出现和底物峰的降低。
【000315】氯扁桃酸的手性分析几乎是没有问题的。在与(S)-对映异构体相同的时间洗脱小峰,表现出一些影响(在图7D所示的层析谱中,在2.4分钟处的峰)。然而,一旦已经建立,该峰在所有样品中以相同水平出现,包括空白对照组,并且该峰具有的UV光谱不同于氯扁桃酸峰的UV光谱,这被认为不是一个问题。因此,在每一样品中,从2.4分钟洗脱的峰值中减去该峰值。(R)-对映异构体在3分钟洗脱。
(S)-苯基乳酸
【000316】对苯基乳酸的分析,最初也为如关于苯基甘氨酸和2-氯扁桃酸的讨论中所述的相同问题所困扰。然而,在这种情况下,调节非手性HPLC方法中的溶剂浓度导致酸的保留时间有所改变,这样它不再与细胞溶解产物峰共洗脱。在这之后,或者非手性或者手性方法没有遇到问题。产物(1.9分钟)和羟腈底物(3.7分钟)的代表性非手性层析谱表示在图8A中,而酸的手性分析表示在图8B中,L-对映异构体在2分钟洗脱,相对的对映异构体在6分钟洗脱。
L-2-甲基苯基甘氨酸
【000317】甲基苯基甘氨酸的分析是不成问题的,尽管非手性方法不提供细胞溶解物峰和产物峰之间的基线分离,正如图9A中举例说明的层析谱中所示。在该项目的最后阶段,提供了该方法的氨基酸标准,从而缩短了该方法开发的时间。在图9A所示的层析谱中,氨基酸在0.7分钟洗脱,氨基腈在5.0分钟洗脱。达到了这两个初始峰之间的充分分离,以允许计算到产物的大约转化。
【000318】该化合物的手性分析提供了这两个对映异构体之间的良好分离,正如图9B中举例说明的层析谱中所示。L-对映异构体在5分钟洗脱,D-对映异构体在8分钟洗脱。
L-叔-亮氨酸
【000319】对于叔-亮氨酸的非手性分析,细胞溶解产物出现了该项目的产物组中最严重的问题。这是由于该氨基酸的低光谱特性所增加的问题,从而导致从细胞溶解产物中区分产物峰出现困难。通过图10A中所示的手性分析获得了单一产物对映异构体的良好分离。在初步筛选过程中,在某种样品中,在与L-氨基酸标准相同的时间洗脱小峰(参见图10B),并且该小峰被认为是氨基酸。然而,对该方法进一步的开发以及使用适当的对照组确认该峰实际上是细胞溶解产物峰。
【000320】两个t-亮氨酸峰值之间洗脱的氨基腈,正如图10C中所示,该层析谱也在4.8分钟显示了细胞溶解产物峰。腈的UV光谱与氨基酸的UV光谱不同,这样可以更容易地从酸峰中区分出腈。
L-羟基正亮氨酸((S)-2-氨基-6-羟基己酸)
【000321】(S)-2-氨基-6-羟基己酸的手性分析是一致且可靠的。相反,非手性方法出现了许多问题,主要是腈和酸峰之间没有分离。越接近该项目的后半部分,开发了一种方法,并且成功地用于确认活性。在此之前,使用手性方法进行了许多分析;绘制了产物的标准曲线,以便量化该反应。图11A所示为(S)-2-氨基-6-羟基己酸的代表性层析谱,(S)-2-氨基-6-羟基己酸在6分钟洗脱。该方法没有对氨基腈进行检测。
【000322】单一(S)-2-氨基-6-羟基己酸对映异构体的分离如图11B所示。首先在2分钟洗脱L-对映异构体,随后在3分钟洗脱D-对映异构体。在图11C中,表示了酶样品;需要轻微关注的唯一区域是洗脱L-对映异构体之前的阴性峰。然而,它似乎对于该对映异构体的洗脱没有显著性干扰;该方法的开发没有消除该负峰。
4-甲基-D-亮氨酸和4-甲基-L-亮氨酸
【000323】对于4-甲基亮氨酸的检测,手性HPLC方法再次证明更加可靠。该方法对于化合物的低活性和低敏感性的组合,导致使用非手性HPLC检测存在困难。图12A所示为氨基酸的2.5mM标准,峰高度为大约40mAU;这显著低于对于芳族化合物的检测值。图12B中的层析谱显示了酶样品,其中转化是使用手性HPLC方法检测的;尽管不清楚,但仍然显示出4-甲基亮氨酸峰在2.7分钟洗脱,峰高和峰面积都非常低。该峰没有出现在通过手性HPLC分析是负值的样品中。
【000324】4-甲基-L-亮氨酸和4-甲基-D-亮氨酸的手性分析没有表现出任何问题。L-对映异构体在5分钟洗脱,D-对映异构体在7分钟洗脱,尽管如(i)部分中对于苯基甘氨酸所做的描述,作为对柱的敏感性的结果,没有发生一些峰移动。在图14C-14D中所示的层析谱中,给出了这些氨基酸的分离;第一个样品表示产生了两种对映异构体的酶,在第二种样品中,酶优先水解L-对映异构体,形成了少量的D-氨基酸。
(S)-环己基扁桃酸
【000325】所示为环己基扁桃酸(图13A)的标准色谱图和相应的腈(图13B)。酸在1.3分钟洗脱,而在2.5分钟观察到了羟腈。在2.1分钟洗脱的峰被认为是环己基苯基甲酮,正如在该点上通过洗脱酮标准所示。
实施例12:生物催化的酶文库方法:开发用于对映选择性产生羧酸衍生物的腈水解酶平台
【000326】生物催化过程可以在转化中提供独特的优势,这正在挑战通过传统化学方法实现转化(Wong,C.-H.;Whitesides,G.M.Enzymes in Synthetic OrganicChemistry;Pergamon,New York,1994;Drauz,K.;Waldmann,H.,Roberts,S.M.编著.Enzyme Catalysis in Organic Synthesis;VCH:Weinheim,Germany,第二版,2002)。腈水解酶(EC3.5.5.1)促进了有机腈直接缓和水解转化为相应的羧酸(Kobayashi,M.;Shimizu,S.FEMS Microbiol.Lett.1994,120,217;Bunch,A.W.In Biotechnology;Rehm,H.-J.;Reed,G.;Puhler,A.;Stadler,P.,编著;Wiley-VCH:Weinheim,Germany,8a卷,第六章,277-324页;Wieser,M.;Nagasawa,T.In stereoselective Biocatalysis;Patel,R.N.,编著;Marcel Dekker:New York,2000,第17章,461-486章。)到目前为止已经表征和报道了不到15种微生物来源的腈水解酶。(Harper,D.B.Int.J.Biochem.1985,17,677;Levy-Schil,S.;Soubrier,F.;Crutz-Le Coq,A.M.;Faucher,D.;Crouzet,J.;Petre,D.Gene 1995,161,15;Yu,F.1999,美国专利5872000;Ress-Loschke,M.;Friedrich,T.;Hauer,B.;Mattes,R.;Engels,D.PCT申请WO00/23577,2000年4月。)。先前已经开发了好几种腈水解酶用于制备单一对映异构体羧酸,但是在开发作为可行的合成工具的腈水解酶中所取得的进步是很小的。该申请描述了大量和不同种类的腈水解酶的发明,此处举例说明了该腈水解酶文库的用途,用于鉴别能催化有效的对映选择性、产生有价值的羟基羧酸衍生物的酶。
【000327】在获得最多样化范围的在自然界能发现的酶的尝试中,通过直接从环境样品中提取DNA,我们创立了大的基因组文库,其中环境样品是从业已从变化的全球生境中收集而来。(对于这些方法的描述,参见:Short,J.M.Nature Biotech.1997,15,1322;Handelsman,J.;Rondon,M.J.;Brady,S.F.;Clardy,J.;Goodman,R.M.Chem.Biol.1998,5,R245;Henne,A.;Daniel,R.;Schmitz,R.A.;Gottschalk,G.Appl.Environ.Microbiol.1999,65,3901.)。我们已经建立了多种通过筛选未培养的DNA的混和群体而用于鉴别新颖活性的方法。(Robertson,D.E.;Mathur,E.J.;Swanson,R.V.;Marrs,B.L.;Short,J.M.SIM News 1996,46,3;Short,J.M.美国专利5,958,672,1999;Short J.M.美国专利6,030,779,2000.)通过该方法,接近200种新颖腈水解酶已经被发现和表征。(对于该研究的简单描述,参见下面的材料和方法部分(Materials and Methods section))所有腈水解酶被认为在序列水平是独特的,并且显示出拥有保守性催化三联体Glu-Lys-Cys,这是该酶类别所特有的。(Pace,H.;Brenner,C.Genome Biology 2001,2,0001.1-0001.9。)在我们的文库中的每一腈水解酶被过量表达,保存为冻干的细胞溶解产物,以便有助于加速该文库对特定生物催化功能的评价。
【000328】最初研究的重点在于腈水解酶的效力,该腈水解酶用于产生通过羟腈1的水解形成的α-羟基酸2。羟腈被很好地证明,容易在碱性条件下通过HCN的可逆损失发生外消旋作用。(Inagaki,M.;Hiratake,J.;Nishioka,T.;Oda,J.;J.Org.Chem 1992,57,5643.(b)van Eikeren,P.美国专利5,241,087,1993。)因此,动态动力学拆分过程是可能的,从而酶选择性地仅水解对映异构体1,以100%理论产量和高水平地对映异构体纯度提供了2。
【000329】该类型的一个重要应用涉及商业性从扁桃腈产生(R)-扁桃酸。(Ress-Loschke,M.;Friedrich,T.;Hauer,B.;Mattes,R.;Engels,D.PCT申请WO00/23577,2000年4月;Yamamoto,K.;Oishi,K.;Fujimatsu,I.;Komatsu,K.Appl.Environ.Microbiol.1991,57,3028;Endo,T.;Tamura,K.美国专利5,296,373,1994年3月.)作为中间产物和拆解剂,扁桃酸和衍生物在用于产生许多药物和农业产品中存在广泛的用途。(Coppola,G.M.;Schuster,H.F.Chiral α-Hydroxy Acids inEnantioselective Syntheis(对映选择性合成中的手性α-羟基酸);Wiley-VCH:Weinheim,Germany:1997.)然而,来自培养的生物体的鲜为人知的腈水解酶还没有被发现用于类似底物的有效和选择性水解。
【000330】以在扁桃腈(3a,Ar=苯基)水解为扁桃酸中的活性和对映选择性对腈水解酶文库进行筛选。
初步结果显示,27种酶提供了>90%ee的扁桃酸。一种酶,SEQ ID NO:385,386,被进行了更详细的研究,并且被发现对于水解扁桃腈非常活跃。在使用处于10%MeOH(v/v)0.1M磷酸缓冲液中的25mM 3a和0.12mg/mL酶,在37℃和pH8的标准条件下,在10分钟内,定量地形成了(R)-扁桃酸,具有98%ee。为了确认合成效用,使用1.0g 3a(50mM)和9mg腈水解酶(0.06mg/mL腈水解酶I)进行反应;3小时后,以高产量(0.93g,86%)分离了(R)-扁桃酸,又一次地,具有98%ee。
(a)反应是在标准条件下进行的(参见正文)。到4的完全转化的反应时间为1-3小时。在pH9和5mM底物浓度下,进行项目8-9。(b)在5分钟转化时间点测量比活性,并且被表示为μmol mg-1 min-1。(c)TOF=转换频率,mol产物/mol催化剂/秒。(d)通过手性HPLC分析确定对映选择性。分离羟基酸,在所有情况下绝对构型被确定为(R)。
【000331】接着研究SEQ ID NO:385,386的底物范围。正如在表13中所示,可以通过该方法制备广泛范围的扁桃酸衍生物,以及芳族和杂芳族类似物(4)。SEQ IDNO:385,386在扁桃腈衍生物的邻位-、间位-、对位-位置可以有芳香环取代基,并且产生了具有高选择性的类型4产物。在活性位点也容纳了其它大的芳香基团如1-萘基和2-萘基,再次提供了具有高选择性的酸4(表13,项目8-9)。最后,使用该过程容易地制备了扁桃酸的3-吡啶基和3-噻吩基类似物(表13,项目10-11)。这是最先报导的腈水解酶的论证,提供了一系列类型4的扁桃酸衍生物和杂芳族类似物。特别值得注目的是,在更空间位阻的邻位取代和1-萘基衍生物上的高活性。
【000332】我们接着研究了通过相应羟腈5的水解制备芳基乳酸衍生物6。苯基乳酸和衍生物作为制备许多生物活性化合物的通用结构单元。(Coppola,G.M.;Schuster,H.F.Chiral α-Hydroxy Acids in Enantioselective Synthesis(对映选择性合成中的手性α-羟基酸);Wiley-VCH:Weinheim,Germany:1997.)一旦针对亲本羟腈5a(Ar=苯基)筛选了我们的腈水解酶文库,我们发现了几种酶,提供了具有高对映体过量的6a。对一种酶,SEQ ID NO:103,104,进行了进一步的表征。在最优化之后,SEQ ID NO:103,104,显示出提供了在6小时内具有完全转化(50mM)和非常高的对映选择性(98%ee)的(S)-苯基乳酸(6a)。先前报道的5到6的生物催化转化的最高对映选择性是75%ee,是通过使用假单胞菌菌株的全细胞转化获得的。(Hashimoto,Y;Kobayashi,E.;Endo,T.;Nishiyama,M.;Horinouchi,S.Biosci.Biotech.Biochem.1996,60,1279.)
表7.腈水解酶II催化产生芳基乳酸衍生物和类似物6a
项目 | 6中的Ar | 比活性b | TOFc | %eed |
1 | C6H5 | 25 | 16 | 99 |
2 | 2-Me-C6H5 | 160 | 100 | 95 |
3 | 2-Br-C6H5 | 121 | 76 | 95 |
4 | 2-F-C6H5 | 155 | 97 | 91 |
5 | 3-Me-C6H5 | 21 | 13 | 95 |
6 | 3-F-C6H5 | 22 | 14 | 99 |
7 | 1-萘基 | 64 | 40 | 96 |
8 | 2-吡啶基 | 10.5 | 6.6 | 99 |
9 | 3-吡啶基 | 11.6 | 7.2 | 97 |
10 | 2-噻吩基 | 3.4 | 2.1 | 96 |
11 | 3-噻吩基 | 2.3 | 1.4 | 97 |
(a)除了使用了0.016mg/mL的腈水解酶之外,反应条件如表13所示。在6小时内观察到了到6的完全转化。(b)-(d)参见表13。苯基乳酸的绝对构型被确定为(S),项目2-11基于相同手性HPLC峰洗脱顺序而指定。
【000333】邻位和间位取代基显示出可以被腈水解酶II很好的耐受,相对于亲本底物5a,邻位取代的衍生物可以惊人地以较高比率被转化。新颖杂芳族衍生物,如2-吡啶基、3-吡啶基、2-噻吩基和3-噻吩基乳酸,以高转化和对映选择性被制备(项目8-11)。未预料到地,对位取代基大大地降低了这些反应的速率,在这些条件下完全转化需要两周时间。
【00334】我们检验的最终转化是容易获得的前手性底物3-羟基戊二酰基腈(7)的去对称化(Johnson,F.;Panella,J.P.;Carlson,A.A.J.Org.Chem.1962,27,2241),以提供羟基酸(R)-8,一旦被酯化为(R)-9,是在生产降低胆固醇的药物LIPITORTM中使用的中间产物。先前报道在该过程中使用酶的尝试没有成功,是以低选择性(最高:22%ee)和不期望的(S)-构型产生8的。(Crosby,J.A.;Parratt,J.S.;Turner,N.J.Tetrahedron:Asymmetry 1992,3,1547;Beard,T.;Cohen,M.A.;Parratt,J.S.;Turner,N.J.Tetrahedron:Asymmetry 1993,4,1085;Kakeya,H.;Sakai,N.;Sano,A.;Yokoyama,M.;Sugai,T.;Ohta,H.Chem.Lett.1991,1823.)
【000335】筛选腈水解酶文库,发现并且分离了独特的酶,该酶提供了所要求的具有高转化率(>95%)和>90%ee的产物(R)-8。使用(R)-特异性腈水解酶中的一种,在1.0g等级上进行该过程(240mM 7,30mg酶,22℃,pH7),在22小时后,以98%的产率和95%ee分离出(R)-8。有趣的是,相同的筛选程序也鉴别了以90-98%ee提供另外一种的对映异构体(S)-8的腈水解酶。因此,生物多样性的广泛筛选发现了以高对映选择性提供中间产物8的任一对映异构体的酶。我们发明了能提供(R)-8的第一种酶,强调了能使用大的和多样化的腈水解酶文库的这一优点。
【000336】通过探究我们的环境基因组文库,该文库是从未培养的DNA产生的,我们已经发明了大量新颖腈水解酶。该研究已经揭示了能提供扁桃酸和芳基乳酸衍生物的特异性腈水解酶,以及以高产率和对映体过量提供4-氰基-3-羟基丁酸的任一种对映异构体的腈水解酶。
程序和分析数据:
【000337】羟基戊二酰基腈是从TCI America购买的,并且是刚刚购买到就使用的。用于制备芳基乳酸标准的氨基酸是从PepTech(Cambridge,MA)购买的。(R)-3-羟基-4-氰基丁酸是从Gateway Chemical Technology(St.Louis,MO)得到的。(R)-和(S)-扁桃酸以及(R)-和(S)-苯基乳酸标准是从Sigma Aldrich购买的。所有其它的试剂是从Sigma Aldrich购买的,并且在使用时没有经过进一步的纯化。从Aldrich购买的硅胶(Silica Gel),70-230mesh(目),60被用于色谱纯化。所有1H NMRs和13C NMRs是在Bruker model AM-500仪器上运行的,设置为室温,对1H和13C分别为500MHz和125MHz。质谱分析和单位质量分辨率是通过流动注射分析(FIA)实现的,使用了Perkin-Elmer Sciex API-4000 TURBOIONTM SprayLC/MS/MS系统。LC流量是通过Schimadzu LC-10Advp泵提供的,具有0.05%醋酸和MeOH。注射是通过Valco注射阀实现的。HPLC分析是在具有Astec’sChirobiotic R柱(100×4.6mm,目录编号(cat no.)13022或150×4.6mm,目录编号(catno.)13023)或Daicel’s Chiralcel OD柱(50X4.6mm,目录编号(cat no.)14022)的Agilent 1100 HPLC上进行的,DAD探测器设置为210、220、230和250nm。对于比旋光,使用Perkin Model 341 Polarimeter,设置为589nm,钠灯,癌室温下,使用100mm路径长度的池。比旋光的浓度以克/100mL溶剂表示。微生物技术根据已公开的规程执行。(Sambrook,J.Fritsch,EF,Maniatis,T.(1989)Molecular Cloning:A Laboratory Manual(第二版),Cold Spring Harbor LaboratoryPress,Plainview NY.)分离了羟基乙酸产物,通过与文献中除了(-)-3-吡啶基羟基乙酸之外的构型已定化合物的旋光数据的比较,确定了所有情况下的绝对构型为(R),根据我们的理解该化合物(-)-3-吡啶基羟基乙酸不是单一对映异构体。(对于扁桃酸、2-氯扁桃酸、2-甲基扁桃酸、3-氯扁桃酸、3-溴扁桃酸和4-氟扁桃酸参见Hoover,J.R.E.;Dunn,G.L.;Jakas,D.R.;Lam,L.L.;Taggart,J.J.;Guarini,J.R.;Philips,L.J.Med.Chem.1974,17(1),34-41;对于2-溴扁桃酸参见Collet,A.;Jacques,J.;Bull.Soc.Chem.Fr.1973,12,3330-3331;对于1-和2-萘基羟基乙酸参见Takahashi,I;Y.Aoyagi,I.Nakamura,Kitagawa,A.,Matsumoto,K.,Kitajima,H.Isa,K.Odashima,K.Koga,K.Heterocycles 1999,51(6),1371-88;对于3-噻吩基羟基乙酸参见Gronowitz,S.Ark.Kemi,1957,11,519-525.)
【000338】对于芳基乳酸产物,通过与文献中的旋光数据的比较,建立了苯基乳酸的绝对构型为(S),对于所有其它苯基乳酸产物,使用手性HPLC基于洗脱次序预测绝对构型。通过衍生为(R)-(-)-甲基(3-O-[苯甲酰]-4-氰基)-丁酸酯,和与文献中的构型已定的化合物的旋光数据的比较,建立3-羟基-4-氰基-丁酸的绝对构型。(3.Beard,T.Cohen,M.A.Parratt,J.S.Turner,N.J.Tetrahedron:Asymm.4(6),1993,1085-1104.)
腈水解酶发现和表征方法:
1.腈水解酶选择
【000339】在腈底物上针对腈水解酶选择进行大肠杆菌(Escherichia coli)筛选宿主菌株,SEL700的最优化。用卡那霉素抗性环境DNA文库对在10mM MgSO4中的Abs600nm=1,SEL700筛选宿主的重悬浮液进行感染,在37℃感染45分钟,这样可以获得文库的完全筛选覆盖度。被感染的细胞,现在指示为卡那霉素抗性,被平板于卡那霉素LB平板上,允许在30℃生长过夜。最小滴定量平板也被用来确定感染效率。第二天早晨用10mM MgSO4集中、洗涤和重悬浮细胞。将转化的克隆接种在M9培养基中(不含氮),含有10mM腈水解酶底物。M9培养基包括1XM9盐(省略NH4Cl)、0.1mM CaCl2、1mM MgSO4、0.2%葡萄糖和大约10mM腈选择底物。然后于30℃温育,以200rpm振荡选择培养物,时间直达五周。通过生长,鉴别阳性腈水解酶培养物,这是由于阳性克隆具有水解腈底物的能力。通过对生长的选择培养物划线接种,鉴别阳性克隆,随后在相同的合成培养基中,再次培养分离的群落。然后分离来自任何表现出再生长的阳性传代培养物的DNA,并且对其测序,以确认发现了腈水解酶基因,以及确定该基因的独特性质。
2.腈水解酶的生物淘选
【000340】传统过滤提升杂交筛选规程(tranditional filter lift hybridization screeningprotocols)限于具有大约106-107个成员的文库。尝试筛选一个文库,将需要大约5000个过滤提升。因此,已经开发了溶液相和其它生物淘选格式,用于基于超高通量顺序的筛选,允许快速筛选含有高达108个成员的环境文库。在溶液格式中,在促进杂交的条件下,将来自大量数目的文库克隆的DNA与相关标记的分子混和。然后,从溶液中,去除标记克隆和杂交DNA,并且在一定严格水平下洗涤,以去除与探针不具有序列同一性的克隆。然后,洗脱且回收杂交DNA。对相关克隆进行测序和克隆,以提供相关酶活性。对于相关序列,该方法已经被证明每一轮获得了高达1000倍的富集。
3.高通量腈水解酶活性测定
【000341】用在0.25mL测定溶液中的25mM(~3mg/mL)底物、0.1mg/mL腈水解酶进行活性测定。测定溶液包括0.1M磷酸钠缓冲液中的0-10%(v/v)MeOH,pH7-9,温度为37℃或22℃。除非另外说明,在5分钟转化时间点,测量比活性,单位表示为μmol mg-1 min-1。通过将酶产物浓度与外消旋酸性产物的标准曲线比较,用高通量HPLC分析确定对映体过量和转化率。产物的分析条件列于下述表中。
分析方法:
酸性产物 | 柱 | 液相色谱方法 | 对映异构体保留时间(分钟) | |
1.1 | 扁桃酸 | Chirabiotic R100X4.6mm | 20%[0.5%AcOH],80%CH3CN1ml/分钟 | 2.4(S);2.9(R) |
1.2 | 2-Cl-扁桃酸 | Chirabiotic R | 20%[0.5%AcOH], | 2.3(S); |
100×4.6mm | 80%CH3CN1ml/分钟 | 2.9(R) | ||
1.3 | 2-溴-扁桃酸 | Chirabiotic R100×4.6mm | 20%[0.5%AcOH],80%CH3CN1ml/分钟 | 2.8;4.0 |
1.4 | 2-CH3-扁桃酸 | Chirabiotic R100×4.6mm | 20%[0.5%AcOH],80%CH3CN1ml/分钟 | 3.1;3.8 |
1.5 | 3-Cl-扁桃酸 | Chirabiotic R100×4.6mm | 10%[0.5%AcOH],90%CH3CN1ml/分钟 | 3.1;3.8 |
1.6 | 3-溴-扁桃酸 | Chirabiotic R100×4.6mm | 10%[0.5%AcOH],90%CH3CN1ml/分钟 | 3.3;3.9 |
1.7 | 4-氟-扁桃酸 | Chirabiotic R100×4.6mm | 20%[0.5%AcOH],80%CH3CN1ml/分钟 | 3.7;4.8 |
1.8 | 1-萘基羟基乙酸 | Chirabiotic R100×4.6mm | 4%[0.5%AcOH],96%CH3CN1ml/分钟 | 3.1;3.7 |
1.9 | 2-萘基羟基乙酸 | Chirabiotic R100×4.6mm | 4%[0.5%AcOH],96%CH3CN1ml/分钟 | 3.7;4.7 |
1.10 | 3-吡啶基羟基乙酸 | Chirabiotic R100×4.6mm | 5%[0.5%AcOH],65%H2O,30%CH3CN2ml/分钟 | 4.4;5.5 |
1.11 | 3-噻吩基羟基乙酸 | Chirabiotic R100×4.6mm | 20%[0.5%AcOH],80%CH3CN2ml/分钟 | 1.4;2.5 |
2.1 | 苯基乳酸 | Chirabiotic R150×4.6mm | 20%[0.5%AcOH],80%CH3CN1ml/分钟 | 2.8(S);4.0(R) |
2.2 | 2-甲基苯基乳酸 | Chirabiotic R150×4.6mm | 20%[0.5%AcOH],80%CH3CN1ml/分钟 | 2.5;2.8 |
2.3 | 2-溴苯基乳酸 | Chirabiotic R150×4.6mm | 20%[0.5%AcOH],80%CH3CN1ml/分钟 | 2.8;3.2 |
2.4 | 2-氟苯基乳酸 | Chirabiotic R150×4.6mm | 20%[0.5%AcOH],80%CH3CN1ml/分钟 | 2.6;2.9 |
2.5 | 3-甲基苯基乳酸 | Chirabiotic R150×4.6mm | 20%[0.5%AcOH],80%CH3CN1ml/分钟 | 2.4;3.2 |
2.6 | 3-氟苯基乳酸 | Chirabiotic R150×4.6mm | 20%[0.5%AcOH],80%CH3CN1ml/分钟 | 2.8;3.6 |
2.7 | 1-萘基乳酸 | Chirabiotic R150×4.6mm | 20%[0.5%AcOH],80%CH3CN1ml/分钟 | 2.7;3.1 |
2.8 | 2-吡啶基乳酸 | Chirabiotic R150×4.6mm | 20%[0.5%AcOH],80%CH3CN1ml/分钟 | 2.5;2.9 |
2.9 | 3-吡啶基乳酸 | Chirabiotic R150×4.6mm | 20%[0.5%AcOH],80%CH3CN1ml/分钟 | 2.9;3.6 |
2.10 | 2-噻吩基乳酸 | Chirabiotic R150×4.6mm | 20%[0.5%AcOH],80%CH3CN1ml/分钟 | 3.6;4.6 |
2.11 | 3-噻吩基乳酸 | Chirabiotic R150×4.6mm | 20%[0.5%AcOH],80%CH3CN1ml/分钟 | 3.5;4.6 |
甲基(3-O[苯甲酰]-4-氰基)-丁酸酯 | Daicel OD50×4.6mm | 5%异丙醇,95%己烷1ml/分钟 | 4.5(R);5.4(S) |
羟腈(底物)合成:
【000342】扁桃腈合成方法A:将丙酮合氰化氢(685μl,7.5mmol),醛(5mmol)和催化性二异丙基乙胺(DIEA)(13μl,0.075mmol)在0℃混和。将反应物在冰上搅拌45分钟。为了推进向产物的平衡,在真空中去除丙酮。随后,用H2SO4(3μL)酸化粗反应物,并且于-20℃保存。用薄层层析(TLC)监控反应进展(3∶1己烷/乙酸乙酯(EtOAc))。
【000343】扁桃酸合成方法B:将醛(5mmol)和乙酸(315μL,5.5mmol)加入到溶解在MeOH(1mL)中的KCN(358mg,5.5mmol)溶液中,温度0℃。在冰上搅拌1小时后,在真空中去除MeOH,用EtOAc和H2O分配粗反应物。保留有机组分,真空浓缩。用TLC分析监控反应进展(3∶1己烷/EtOAc)。
【000344】芳基乙醛羟腈:,在氮(克)氛围下,将芳基乳酸(50mmol)溶解在放置在双颈500ml圆底烧瓶中的50ml无水四氢呋喃(THF)中。等到该溶液冷却到0℃,在剧烈混和下,缓慢地加入105mmol叔己基氯硼烷-二甲基硫醚(thexylchloroborane-dimethyl sulfide)(2.55M,在二氯甲烷中)。允许反应继续到过夜。加入过量乙酸(10ml)猝灭反应,随后通过加入10ml水以酸化反应物。在室温下搅拌1小时后,真空中去除溶剂,将残余物溶解在100ml水中,用200mlEtOAc提取。在硫酸钠上干燥EtOAc层,过滤,然后在真空中浓缩。随后,将60mmol KCN加入到残余物中,接着加入100ml甲醇。然后将溶液冷却到0℃,加入乙酸(60mmol)。在所有KCN溶解后,将反应物搅拌1-2小时。在真空中去除溶剂,将残余物溶解在100ml水和200ml EtOAc中。用EtOAc将水层再次提取一次。混和EtOAc提取物,用饱和盐水洗涤,并且在硫酸钠上干燥,过滤,然后在真空中干燥,以得到粗羟腈产物。在必要的时候,通过二氧化硅-凝胶柱(己烷/EtOAc)纯化羟腈。
【000345】2-氯扁桃腈:1HNMR(CDCl3,500MHz)δ7.69(m,1H),7.41(m,1H),7.36(m,2H),5.84(s,1H),3.07(br,1H)。13C NMR(CDCl3,125MHz)δ132.89,132.73,131.22,130.19,128.48,127.84,118.24,60.87。对于[C8H6ClNO]167.01,用质谱计算得到实测值167.9(LC-MS+)。
【000346】2-溴扁桃腈:1H NMR(CDCl3,500MHz)δ7.72(d,1H,J=6.58),7.62(d,1H,J=8.35),7.43(t,1H,J=8.42),7.30(t,1H,J=7.00),5.85(s,1H)。13C NMR(CDCl3,125MHz)δ134.550,133.584,131.564,128.819,128.535,122.565,118.153,63.379。
【000347】2-甲基扁桃腈:1H NMR(CDCl3,500MHz)δ:7.60(d,1H,J=7.4),7.23-7.35(m,3H),5.66(s,1H),2.44(s,3H)。13C NMR(CDCl3,298K,125MHz)δ:136.425,133.415,131.450,130.147,127.204,126.894,118.952,18.916。对于[C9H9NO]147.07,用质谱计算得到147.2(ESI+)。
【000348】3-氯扁桃腈:1H NMR(CDCl3,500MHz)δ7.55(s,1H),7.43-7.37(m,3H),5.54(s,1H)。13C NMR(CDCl3,125MHz)δ137.183,135.480,130.718,130.303,127.047,124.891,118.395,63.156。对于[C8H6ClNO]167.01,用质谱(MS)计算得到167.9(LC-MS+)。
【000349】3-溴扁桃腈:1HNMR(CDCl3,500MHz)δ7.69(s,1H),7.56(d,J=6.2Hz,1H),7.45(d,J=5.5Hz,1H),7.32(t,J=6.4.Hz,1H),5.53(s,1H)。13C NMR(CDCl3,125MHz)δ137.376,133.201,130.934,129.208,125.359,123.380,118.458,63.006。对于[C8H6BrNO]212.0,用质谱计算得到211.9(LC-MS+)。
【000350】4-氟扁桃腈:1H NMR(CDCl3,500MHz)δ5.54(s,1H),7.13(m,2H),7.51-7.53(m,2H)。13C NMR(CDCl3,125MHz)δ63.02,116.44,118.97,128.90,131.54,132.51,162.575。
【000351】4-氯扁桃腈:1H NMR(CDCl3,500MHz)δ7.47(d,J=7.0Hz,2H),7.42(d,J=7.0Hz,2H),5.53(s,1H)。13C NMR(CDCl3,125MHz)δ136.209,133.845,129.647,128.232,118.630,63.154。对于[C8H6CINO]167.01,用质谱计算得到167.9(LC-MS+)。
【000352】1-萘基羟腈:1H NMR(CDCl3,500MHz)δ8.14(d,1H,J=8.5),7.92(t,2H,J=6.1),7.82(d,1H,J=5.7),7.62(t,1H,J=6.1),7.56(t,1H,J=6.1),7.50(t,1H,J=6.1),6.18(s,1H);13C NMR(CDCl3,125MHz)δ137.0,135.7,134.2,131.1,129.2,127.5,126.7,125.8,125.3,123.1,119.0,62.4;对于[C12H9O]183.21,用质谱计算得到183.2(ESI+)。
【000353】2-萘基羟腈:1H NMR(CDCl3,500MHz)δ8.03(s,1H),7.92(d,1H,J=8.6),7.87-7.91(m,2H),7.61(dd,1H,J=6.7,1.2),7.55-7.60(m,2H),5.72(s,1H);13C NMR(CDCl3,125MHz)δ134.9,133.9,132.7,129.6,128.6,128.0,127.4,127.2,126.4,123.9,118.9,64.1;对于[C12H9O]183.21,用质谱计算得到183.2(ESI+,电喷雾电离质谱+)。
【000354】3-吡啶基羟腈:1H NMR(CDCl3,500MHz)δ:8.62(d,1H,J=1.8),8.57(d,1H,J=5.1),7.94(d,1H,J=8.1),7.41(dd,1H,J=8.1,5.1),5.64(s,1H);13C NMR(CDCl3,125MHz)δ149.921,147.355,135.412,133.044,124.443,118.980,61.085。对于[C7H6N2O]134.05,用质谱计算得到135.2(ESI+)。
【000355】3-噻吩基羟腈:1H NMR(CDCl3,500MHz)δ7.45(d,J=2.2Hz 1H),7.56(dd,J=6.2Hz,1H),7.45(d,J=5.5Hz,1H),7.32(t,J=6.4.Hz,1H),5.53(s,1H)。13C NMR(CDCl3,125MHz)δ137.376,133.201,130.934,129.208,125.359,123.380,118.458,63.006。对于[C6H5NOS]139.01,用质谱计算得到139.9(LC-MS+)。
【000356】苯基乙醛羟腈:1H NMR(CDCl3,500MHz)δ7.34(m,5H),4.64(t,J=6.75Hz,1H),3.11(d,J=6.75Hz,2H),2.75(br,1H):13C NMR(CDCl3,125MHz)δ133.96,129.91,129.16,128.08,119.47,62.33,41.55。
【000357】2-甲基苯基乙醛羟腈:1HNMR(CDCl3,500MHz)δ7.11(m,4H),4.61(t,J=6.62Hz,1H),3.12(d,J=6.62Hz,2H),2.14(s,3H):13C NMR(CDCl3,125MHz)δ136.94,136.47,132.57,130.48,127.61,125.75,120.11,62.95,44.73对于[C10H11NO]:161.08,用质谱计算得到162.2(M+Na,ESI+)
【000358】2-溴苯基乙醛羟腈:1H NMR(CDCl3,500MHz)δ7.20(m,4H),4.78(t,J=6.5Hz,1H),3.26(d,J=6.5Hz,2H)。13C NMR(CDCl3,100MHz)δ133.93,132.82,131.72,129.21,128.12,124.86,119.41,63.02,44.89。
【000359】2-氟苯基乙醛羟腈:1H NMR(CDCl3,500MHz)δ7.2(m,2H),7.02(m,2H),4.50(dd,J=4.62Hz,J=7.88Hz,1H),3.23(dd,J=4.62Hz,1J=14.12Hz,1H),2.97(dd,7.88Hz,14.12Hz,1H)。13C NMR(CDCl3,125MHz)δ132.18,131.52,129.66,129.03,128.07,124.05,115.8,63.02,44.79。对于[C9H8FNO]165.06,用质谱计算得到164.2(ESI+)。
【000360】3-甲基苯基乙醛羟腈:1H NMR(CDCl3,500MHz)δ7.18(m,1H),7.02(m,3H),4.54(dd,J=4.62Hz,J=8Hz,1H),3.06(dd,J=4.62Hz,J=14.38Hz,1H),2.83(dd,J=8Hz,J=14.38Hz,1H),2.36(s,3H)。13C NMR(CDCl3,125MHz)δ176.25,138.18,136.0,130.97,128.93,127.68,126.58,76.42,34.29,37.69。对于[C10H12O3]180.08,用质谱计算得到180.0(ESI+)。
【000361】3-氟苯基乙醛羟腈:1H NMR(CDCl3,500MHz)δ7.18(m,2H),6.95(m,2H),4.44(dd,1H),3.11(dd,1H)。13C NMR(CDCl3,125MHz)δ130.40,125.53,124.85,116.92,114.87,114.50,119.77,61.97,41.27。
【000362】1-萘基乙醛羟腈:1H NMR(CDCl3,500MHz)δ8.07(m,1H),7.86(m,1H),7.74(m,1H),7.41(m,4H),4.20(t,J=7Hz,1H),3.33(d,J=6.8Hz,2H)13C NMR(CDCl3,125MHz)δ177.7,140.31,129.74,129.24,128.92,128.26,127.84,125.63,124.53,124.05,123.42,70.58,38.0对于[C13H11NO]197.08,用质谱计算得到197.1(ESI+)。
【000363】2-吡啶基乙醛羟腈:1H NMR(CDCl3,500MHz)δ8.50(m,1H),7.85(m,1H),7.48(m,1H),7.34(m,1H),4.42(m,1H),3.19(dd,J=3.5Hz,J=13.7Hz,2H)。13C NMR(CDCl3,125MHz)δ157.44,145.69,140.24,126.96,126.16,122.99,60.30,42.60。对于[C8H8N2O]148.06,用质谱计算得到149.1(ESI+)。
【000364】3-吡啶基乙醛羟腈:1H NMR(CDCl3,500MHz)δ8.62(d,1H,J=1.8),8.57(d,1H,J=5.1),7.94(d,1H,J=8.1),7.41(dd,1H,J=8.1,5.1),5.64(s,1H)。13C NMR(CDCl3,125MHz)δ:149.921,147.355,135.412,133.044,124.443,118.980,61.085。对于[C7H6N2O]134.05,用精确质量(exact mass)计算得到135.2(ESI+)。
【000365】2-噻吩基乙醛羟腈:1HNMR(CDCl3,500MHz)δ7.1(m,1H),6.9(m,1H),6.8(m,1H),4.11(t,J=7.0Hz,1H),2.86(d,J=7.0Hz,2H)。13C NMR(CDCl3,125MHz)δ127.68,127.41,125.58,124.60,118.70,63.25,44.84。
【000366】3-噻吩基乙醛羟腈:1H NMR(CDCl3,500MHz)δ7.09(m,3H),4.60(t,J=6.25Hz,1H),3.12(d,J=6.25Hz,2H)。13C NMR(CDCl3,125MHz)δ129.05,127.16,125.27,122.65,119.87,61.58,44.90。
【000367】从相应的羟腈制备外消旋扁桃酸标准:(Stoughton,R.W.J.Am.Chem.Soc.1941,63,2376)将2-溴扁桃腈(230mg,1.08mmol)溶解在浓盐酸(1mL)中,在室温搅拌18小时,然后于70℃搅拌24小时。在冷却后,用乙醚(4×2mL)提取反应混合物。将有机提取物混和,在MgSO4上干燥,过滤,在真空中浓缩。分离出无色粉末2-溴扁桃酸(180mg,0.78mmol,70%产率)。
【000368】从相应的氨基酸制备外消旋扁桃酸标准:在室温下,在氮气(g)氛围下,将苯丙氨酸(10mmol,1.65g)溶解在30mL 2N H2SO4中。在3-4小时的时间内,将亚硝酸钠(1.4g,在3ml水溶液中,2当量(eq))缓慢地加入到反应混合物中,在室温下,在氮气(g)氛围下,用力搅拌。将反应混合物搅拌过夜,然后将苯基乳酸提取到乙醚(3×30ml)中。在MgSO4上将混和的醚提取物干燥,然后过滤并且在真空中浓缩。(Kenji,I.;Susumu,A.;Masaru,M.;Yasuyoshi,U.;Koki,Y.;Koichi,K.专利号WO0155074,公开日期:2001-08-02)。
酶促制备α-羟基酸的一般方法:
【000369】(R)-(-)-扁桃酸 在pH8,包括10%v/v甲醇的150mL磷酸钠(100mM)缓冲液中的扁桃腈(1.005g,7.56mmol)溶液中,于37℃加入9mg腈水解酶1(以腈水解酶含量进行标准化),所述溶液已经被氮气(g)搅动。反应在氮气(g)氛围下,在旋转的平板摇床上进行。通过取出等分试样,用于HPLC分析来监控反应进程。在温育3小时后,用1N盐酸将反应混合物酸化到pH2,用乙醚(4×50ml)提取。在真空中浓缩有机组分,然后将残余物吸收在10%碳酸氢钠溶液中。然后用乙醚(3×50ml)洗涤含水溶液,接着用1N盐酸将该溶液酸化到pH2,用乙醚(3×50ml)提取。混和有机组分,用盐水洗涤,在MgSO4上干燥,过滤,然后在真空中浓缩。分离出无色粉末(R)-(-)-扁桃酸,产率86%(933mg,6.22mmol)。1H NMR(DMSO-d6,500MHz)δ12.6(br,s,1H),7.41(m,2H),7.34(m,2H),7.28(m,1H),5.015(s,1H)。13C NMRDMSO-d6,125MHz)δ174.083,140.216,128.113,127.628,126.628,72.359。对于[C8H8O3]150.07,用质谱计算得到实测值150.9(ESI+);ee=98%[HPLC]。[α]20 598=-134.6(c=0.5,甲醇)。
【000370】(-)-2-氯扁桃酸 1H NMR(DMSO-d6,500MHz)δ7.75(m,1H),7.44(m,1H),7.34(m,2H),5.34(s,1H)。13C NMR(DMSO,298K,125MHz)δ173.070,137.985,132.105,129.399,129.158,128.705,127.235。对于[C8H7ClO3]186.0,用质谱计算得到185.0(LC-MS-)。ee=96%[HPLC]。92%产率。[α]20 598=-137.6(c=0.5,乙醇)。
【000371】(-)-2-溴扁桃酸 1H NMR(DMSO-d6,500MHz)δ7.60(d,J=7.93,1H),7.48(m,1H),7.40(m,1H),7.25(m,1H),5.30(s,1H)。13C NMR DMSO-d6,125MHz)δ172.994,139.61,132.355,129.652,128.753,127.752,122.681,71.644。对于[C8H7BrO3]230.0,用质谱计算得到230.9。ee=96%[HPLC]。92%产率。[α]20 598=-116.4(c=0.5,乙醇)。
【000372】(-)-2-甲基扁桃酸 1HNMR(DMSO-d6,500MHz)δ11.78(bs,1H),7.38(m,1H),7.16-7.38(m,3H),5.18(s,1H),2.35(s,3H)。13C NMR DMSO-d6,125MHz)δ174.229,138.623,135.649,130.129,127.491,126.990,125.698,125.698,69.733,18.899。对于[C9H10O3]166.1,用质谱计算得到165.2。ee=91%[HPLC]。86%产率。[α]20 598=-164.4(c=0.5,乙醇)。
【000373】(-)-3-氯扁桃酸 1H NMR(DMSO-d6,500MHz)δ7.46(s,1H),7.36(m,3H),5.07(s,1H)。13C NMR(DMSO,298K,125MHz)δ173.554,142.685,132.813,130.069,127.568,126.355,125.289,71.659。对于[C8H7ClO3]186.0,用质谱计算得到185.34(MALDI TOF-)。ee=98%[HPLC]。70%产率。[α]20 598=-120.48(c=0.5,乙醇)。
【000374】(-)-3-溴扁桃酸 1H NMR(DMSO-d6,500MHz)δ7.60(s,1H),7.49(m,1H),7.42(m,1H),7.31(m,1H),5.06(s,1H)。13C NMR(DMSO,298K,125MHz)δ173.551,142.917,130.468,130.379,129.237,125.687,121.404,71.605。对于[C8H7BrO3]229.98,用质谱计算得到229.1(LC-MS)。ee=98%[HPLC]。82%产率。[α]20 598=-84.8(c=0.5,甲醇)。
【000375】(-)-4-氟扁桃酸 1HNMR(DMSO,298K,500MHz)δ12.65(s,1H),7.44(m,2H),7.17(m,2H),5.91(s,1H),5.03(s,1H)。13C NMR(DMSO,298K,125MHz)δ173.93,162.57,136.47,128.61,128.55,114.96,114.80,71.61。对于[C8H7FO3]170.0,用质谱计算得到168.8。ee=99%[HPLC]。81%产率。[α]20 598=-152.8(c=0.5,甲醇)。
【000376】(-)-1-萘基羟基乙酸 1H NMR(DMSO-d6,500MHz)δ8.28-8.26(m,1H),7.87-7.93(m,2H),7.47-7.58(m,4H),5.66(s,1H)。13C NMR DMSO-d6,125MHz)δ174.288,136.284,133.423,130.654,128.353,128.192,125.926,125.694,125.613,125.266,124.558,70.940。对于[C12H10O3]:202.21,用质谱计算得到201.37(MALDITOF-,基体辅助激光解吸电离飞行时间质谱)。ee=95%[HPLC]。90%产率。[α]20 598=-115.4(c=0.5,乙醇)。
【000377】(-)-2-萘基羟基乙酸 1H NMR(DMSO-d6,500MHz)δ12.6(bm,1H),7.88-7.93(m,4H),7.48-7.56(m,3H),5.20(s,1H)。13C NMR(DMSO-d6,125MHz)δ174.005,137.760,132.644,132.498,127.811,127.658,127.506,127.209,125.993,125.334,124.761,72.472。对于[C12H10O3]202.21,用质谱计算得到201.37(MALDITOF)。ee=98%[HPLC]。68%产率。[α]20 598=-115.4(c=0.5,乙醇)。
【000378】(-)-3-吡啶基羟基乙酸 该反应在100mM甲酸铵缓冲液中进行,pH8。为了分离产物,通过10,000MWCO(分子量截留值)膜过滤反应混合物,以去除酶,然后在真空中浓缩。1H NMR(DMSO-d6,500MHz)δ8.56(s,1H),8.36(d,J=4.57Hz,1H),8.25(s,1H),7.71(m,1H),7.25(dd,J=4.98,4.80Hz,1H),5.45(s,1H)。13C NMR DMSO-d6,125MHz)δ165.911,147.862,147.251,139.118,133.381,122.746,71.508。对于[C7H7NO3]153.04,用质谱计算得到154.0(MALDITOF)。ee=92%[HPLC]。84%产率。[α]20 598=-65.2(c=0.5,水)。
【000379】(-)-3-噻吩基羟基乙酸 1H NMR(DMSO-d6,500MHz)δ7.48(m,1H),7.45(d,J=2.81,1H),7.10(m,1H),5.09(s,1H),3.33(s,1H)。13C NMR(DMSO,298K,125MHz)δ173.704,141.109,126.446,126.042,122.247,68.915。对于[C6H6O3S]158.00,用质谱计算得到157.224(MALDI TOF)。ee=95%[HPLC]。70%产率。[α]20 598=-123.28(c=0.5,甲醇)。
【000380】(S)-(-)-苯基乳酸 1H NMR(DMSO-d6,500MHz)δ7.28(m,5H),4.17(dd,J=4.5Hz,J=8.3Hz,1H),2.98(dd,J=4.5Hz,J=13.7Hz,1H),2.79(dd,J=8.3Hz,J=13.7Hz,1H)。13C NMR(DMSO,298K,125MHz)δ178.16,133.4,129.27,128.6,127.3,70.45,44.12。ee=97%[HPLC],84%产率。[α]20 598=-17.8(c=0.5,甲醇)。
【000381】(-)-2-甲基苯基乳酸 1H NMR(DMSO-d6,500MHz)δ7.16(m,4H),4.47(dd,J=3.9Hz,J=8.8Hz,1H),3.25(dd,J=3.9Hz,J=14.3Hz,1H),2.94(dd,J=8.8Hz,J=14.3Hz),2.35(s,3H)。13C NMR(DMSO,298K,125MHz)δ178.61,137.08,134.74,130.80,130.25,127.44,126.34,70.93,37.67,19.79。对于[C10H12O3]180.08,用质谱计算得到180.0(ESI+)。86%产率。ee=95%[HPLC]。[α]20 598=-13.2(c=0.5,甲醇)。
【000382】(-)-2-溴苯基乳酸 1H NMR(DMSO-d6,500MHz)δ7.28(m,4H),4.60(dd,J=4.0Hz,J=9.1Hz,1H),3.45(dd,J=4.0Hz,J=14.1Hz,1H),3.04(dd,J=8.0Hz,J=14.1Hz,1H)。13C NMR(DMSO,298K,125MHz)δ178.70,136.05,133.21,132.10,128.99,127.72,125.0,70.04,40.76。对于[C9H9BrO3]243.9,用质谱计算得到243.3(ESI+)。91%产率。ee=93%[HPLC],[α]20 598=-17.6(c=0.5,甲醇)。
【000383】(-)-2-氟苯基乳酸 1H NMR(DMSO-d6,500MHz)δ7.10(m,4H),4.64(t,J=6.8Hz,1H),3.11(d,J=6.8Hz,2H)。13C NMR(DMSO,298K,125MHz)δ132.18,131.52,129.66,129.03,128.07,124.05,115.8,63.02,44.79。对于[C9H8FNO]:165.06,用质谱计算得到164.2(ESI+)。91%产率。ee=88%[HPLC]。[α]20 598=-14.0(c=0.5,甲醇)。
【000384】(-)-3-甲基苯基乳酸 1H NMR(DMSO-d6,500MHz)δ7.18(m,1H),7.02(m,3H),4.54(dd,J=4.6Hz,J=8.0Hz,1H),3.06(dd,J=4.54Hz,J=14.4Hz,1H),2.83(dd,J=8.0Hz,J=14.4Hz,1H),2.36(s,3H)。13C NMR(DMSO,298K,125MHz)δ175.88,163.80,130.33,130.09,125.7,116.68,113.75,71.31,34.28。对于[C10H11NO]161.08,用质谱计算得到162.2(ESI+)。80%产率。ee=98%[HPLC]。[α]20 598=-2.4(c=0.5,甲醇)。
【000385】(-)-3-氟苯基乳酸 1HNMR(DMSO-d6,500MHz)δ7.2(m,1H),6.9(m,3H),4.56(dd,4.5Hz,J=7.9Hz,1H),3.09(dd,J=4.5Hz,J=14.1Hz,1H),2.86(dd,J=7.9Hz,J=14.1Hz,1H)。13C NMR(DMSO,298K,125MHz)δ175.88,163.80,130.33,130.09,125.7,116.68,113.75,71.31,34.28。对于[C9H9O3F]184.05,用质谱计算得到184.1(ESI+)。82%产率。ee=97%[HPLC]。[α]20 598=-5.2(c=0.5,甲醇)。
【000386】(-)-1-奈基乳酸 1H NMR(DMSO-d6,500MHz)δ8.57(m,1H),8.21(m,1H),8.08(m,1H),7.61(m,4H),4.64(dd,3.5Hz,8.5Hz,1H),3.84(dd,J=3.5Hz,J=14.5Hz,1H),3.38(dd,J=8.5Hz,J=14.5Hz,1H)。13C NMR(DMSO,298K,125MHz)δ177.7,140.31,129.74,129.24,128.92,128.26,127.84,125.63,124.53,124.05,123.42,70.58,38.0。对于[C13H11NO]197.08,用质谱计算得到197.1(ESI+)。87%产率。ee=94%[HPLC]。[α]20 598=-16.2(c=0.5,甲醇)。
【000387】(-)-2-吡啶基乳酸 1H NMR(DMSO-d6,500MHz)δ8.49(m,1H),7.62(m,1H),7.21(m,2H),4.50(t,J=5.0Hz,1H),3.01(d,J=5.0Hz,2H)。13C NMR(DMSO,298K,125MHz)δ178.8,159.79,148.84,136.89,124.35,121.75,71.14,44.09。对于[C8H9NO3]:167.06,用质谱计算得到167.0(ESI+)。62%产率。ee=94%[HPLC]。[α]20 598=-3.6(c=0.5,甲醇)。
【000388】(-)-3-吡啶基乳酸 1H NMR(DMSO-d6,500MHz)δ8.43(m,2H),7.62(m,1H),7.28(m,1H),4.57(t,5.37Hz,1H),2.85(d,5.37Hz,2H)。13C NMR(DMSO,298K,125MHz)δ176.6,150.03,147.12,136.41,129.45,123.26,61.56,31.46。对于[C8H9NO3]167.06,用质谱计算得到167.0(ESI+)。59%产率。ee=94%[HPLC]。[α]20 598=-4.0(c=0.5,甲醇)。
【000389】(-)-2-噻吩基乳酸 1H NMR(DMSO-d6,500MHz)δ7.18(m,1H),6.94(m,1H),6.90(m,1H),4.49(dd,J=4.1Hz,J=6.25Hz,1H),3.36(dd,J=4.1Hz,J=15.0Hz,1H),3.26(dd,J=6.25Hz,J=15.0Hz,1H)。13C NMR(DMSO,298K,125MHz)δ127.68,127.41,125.58,124.60,118.70,63.25,44.84。对于[C7H7NOS]153.02,用质谱计算得到153.0(ESI+)。85%产率。ee=95%[HPLC]。[α]20 598=-13.0(c=0.5,甲醇)。
【000390】(-)-3-噻吩基乳酸 1H NMR(DMSO-d6,500MHz)δ7.30(m,1H),7.13(m,1H),7.01(m,1H),4.50(dd,J=4.25Hz,J=6.5Hz,1H),3.21(dd,J=4.25Hz,J=15.OHz,1H),3.10(dd,J=6.5Hz,J=15.0Hz,1H)。13C NMR(DMSO,298K,125MHz)δ127.50,136.09,128.83,126.24,123.32,70.65,34.84。对于[C7H8O3S]172.02,用质谱计算得到172.1(ESI+)。81%产率。ee=96%[HPLC]。[α]20 598=-18.8(c=0.5,甲醇)。
3-羟基戊二酰基腈的酶促水解:
【000391】在室温下,将3-羟基戊二酰基腈(1.0g,9.0mmol,240mM)悬浮在氮气(g)搅动的磷酸钠缓冲液(37.5mL,pH7,100mM)中。加入细胞溶解产物(30mg,以腈水解酶含量进行标准化),使浓度达到0.8mg/ml酶,在100rpm和室温下震荡该反应物。通过TLC(1∶1 EtOAc∶己烷,Rf=0.32,腈;Rf=0.0,酸)监控反应进程。22小时后,用1M HCl酸化反应物。用乙醚连续提取反应混合物。分离出黄色油状酸性产物(1.15g,98%产率)。1H NMR(DMSO,298K,500MHz)δ12.32(s,1H),5.52(s,1H),4.10(m,1H),2.70(dd,1H,J=16.8,4.1Hz),2.61(dd,1H,J=16.9,6.3Hz),2.44(dd,1H,J=15.4,5.3Hz),2.37(dd,1H,J=15.6,7.8Hz)。13C NMR(DMSO,298K,125MHz)δ171.9,118.7,63.4,41.2,25.2。对于[C5H7NO3]:129.0,用质谱计算得到130.0[M+H+],(ESI+)。
制备(R)-(-)-甲基(3-O-[苯甲酰]-4-氰基)-丁酸酯
【000392】在室温下,将苯甲酰氯(0.068ml,0752mmol)加入到溶解在嘧啶(2.0ml)中的(R)-甲基-(3-羟基-4-氰基)-丁酸酯(71.7mg,0.501mmol)的搅拌溶液中。19小时后,加入另外的0.5当量苯甲酰氯(0.023ml,0.251mmol)。在23小时完成反应,这正如通过TLC所确定的。加入1ml水(H2O),用醚提取(3×10ml)。用盐水洗涤(2×10ml)。用MgSO4干燥混和的含水提取物。过滤掉干燥剂,通过旋转蒸发去除溶剂。通过柱层析(己烷∶乙酸乙酯[2∶1])纯化。组分的旋转蒸发产生了黄色油状产物(46mg,0.186mmol,37%)。1H NMR(DMSO,298K,500MHz)δ7.96(d,2H,J=7.8),7.70(t,1H,J=7.25),7.56(t,2H,J=7.8),5.55(m,1H),3.59(s,3H),3.13(m,2H),2.90(m,2H)。13C NMR(DMSO,298K,125MHz)δ169.6,164.5,133.8,129.3,128.9,128.5,117.3,66.0,51.8,37.5,22.2。对于[C13H13NO4]:247.25,用质谱计算得到270.3。[M+Na+]ee=95%[HPLC]。[α]20 598=-32.4(c=0.5,CHCl3)。
(R)-乙基-(3-羟基-4氰基)-丁酸酯的合成
【000393】制备溶解在无水乙醇(1.94mL)中的(R)-3-羟基-4-氰基-丁酸的0.2M溶液。将乙醇溶液滴加到过筛的1.0ml的无水1M盐酸醚溶液和无水乙醇的50∶50(v/v)混合物中。在室温和氮气(g)氛围下,将反应物搅拌过夜。通过TLC监控反应,(1∶1 EtOAc∶己烷,Rf=0.45,酯;Rf=0.0,酸,用p-茴香醛染色)。30小时后,通过旋转蒸发去除溶剂。将粗产物放入到25mL醚中,洗涤,是用5mL饱和碳酸氢盐和然后用5mL盐水进行洗涤。在MgSO4上干燥有机提取物,过滤,然后在真空中浓缩,产生了清澈油状产物。1H NMR(DMSO,298K,500 MHz)δ5.60(d,1H,J=5.58Hz),4.12(m,1H),4.07(q,2H,J=7.1),2.66(m,2H),2.47(m,2H),1.87(t,3H,J=7.0)。13C NMR(DMSO,298K,125MHz)δ170.21,118.60,63.40,59.98,41.10,25.14,14.02。对于[C7H11NO3]:157.1,用质谱计算得到158.2[M+H+]。
实施例13:用于对映选择性产生(R)-2-氯扁桃酸的腈水解酶的最优化
【000394】氯扁桃酸具有如下结构:
【000395】鉴别从(R,S)-2-氯扁桃腈选择性地产生(R)-2-氯扁桃酸的腈水解酶。鉴别出腈水解酶,其有助于提高这些酶的对映选择性,并且确定了工艺条件对酶的影响。完成对酶促腈水解的反应条件的检验,以便提高产物的对映体过量。另外,进行工艺条件对酶的影响的进一步研究。
2-氯扁桃腈 (R)-2-氯扁桃酸
【000396】在这一方面,对映选择性产生(R)-2-氯扁桃酸是目的。选择一种酶,SEQID NOS:385,386,用于进一步确认其针对2-氯扁桃腈的对映选择性。SEQ IDNOS:385,386显示出对于工艺成分是稳定的,具有8小时的半衰期。酶受到2-氯苯甲醛和羟腈底物中的一种污染物,2-氯苯甲酸的抑制。酶反应按比例逐步增加,直到底物浓度为45mM 2-氯扁桃腈。获得了大于90%的转化,以及97%的ee。改进了手性HPLC方法,以去除底物中存在的污染峰。使用该方法在确定对映选择性中获得了改进的准确度。
【000397】筛选针对2-氯扁桃腈的腈水解酶,具有31种腈水解酶在该底物上表现出活性。9种酶显示出高对映选择性。对这些酶中的5种进行最优化,其中一种被鉴别为下一阶段开发的候选者。
【000398】在改进所选择的用于(R)-2-氯扁桃酸的酶的对映选择性的尝试中,对大量因素进行了研究,已知这些因素对这一特性以及酶的活性有所影响。这些因素包括pH,温度,缓冲强度和反应中所加入的溶剂。最初,选择了5种酶用于这些研究,选择是基于这些酶所获得的高对映选择性。这些酶是:SEQ ID NOS:385,386,SEQ ID NOS:197,198,SEQ ID NOS:217,218,SEQ ID NOS:55,56和SEQ IDNOS:167,168。
pH的影响
【000399】在一系列pH值范围内进行酶促反应,从pH5到pH9。对于所有酶,观察到随着pH值的增加,活性和对映选择性有所增加。除了SEQ ID NOS:385,386,pH9(0.1M Tris-HCl缓冲液)被确定为活性和对映选择性的最适pH。SEQ IDNOS:385,386的最适pH为pH8(0.1M磷酸钠缓冲液)。
温度的影响
【000400】这些酶表现出类似的温度曲线,在37℃和45℃测量出最高活性。尽管后面的温度导致较高的转化,但大部分酶的对映选择性显示出明显的偏爱较低温度,当温度升高至高于37℃时,ee值降低了10-20%。在SEQ ID NOS:385,386的情况下,对映选择性的娇气的最适温度明显在37℃。因此,这一温度被确定为通过这些酶水解2-氯扁桃腈的最适温度。
酶浓度的影响
【000401】在同时进行的苯乙醛羟腈对映选择性水解为L-苯基乳酸的研究过程中,发现酶在反应过程中的浓度对反应的对映选择性有重大影响。这表明酶水解速度快于反应中剩余羟腈的外消旋作用速度。基于这一点,研究了酶浓度对于酶针对(R)-2-氯扁桃腈的对映选择性的影响。用标准酶浓度(0.6mg蛋白/ml),标准浓度的一半和标准浓度的四分之一进行了酶促反应。
【000402】下述表格表明反应所达到的最高转化率,用相应的ee表示。除了SEQID NOS:385,386,似乎观察到,如果存在的话,对映选择性有所增加也非常小。因此,似乎剩余氯扁桃腈的外消旋速率不是获得较高对映选择性的限制因素。
其它阳性酶的研究
【000403】除了上述表中的酶,对大量其它腈水解酶针对2-氯扁桃腈的对映选择性进行了筛选。一些酶是最新发现的酶。一些酶进行了再次研究了,是在已经被发现对这些酶最适的条件下(pH8和37℃)进行。该筛选的结果如下述表中所示。
共溶剂浓度的影响
【000404】在酶促反应中,加入作为共溶剂的甲醇,显示出增加了ee值。为了确定能被加入到反应物中的最低水平的甲醇,在变化的甲醇浓度下进行了酶反应,变化范围为0-20%(v/v)。各种甲醇浓度之间,很明显,对映选择性没有显著性差异。然而,这些反应中的ee值是97-98%,而没有加入甲醇的对照组反应的ee值是95-96%。尽管在ee值上的差异较小,在研究过程中,甲醇的影响在不止一组实验中都有显示,因此认为该影响是显著的。
反应成分对SEQ ID NOS:385,386的活性的影响
【000405】对酶的过程最优化的研究中的一个关键部分,涉及确定酶促反应中可能存在的任何化合物的影响。对于SEQ ID NOS:385,386,这些成分被建立为起始材料,和羟腈的平衡产物,2-氯苯甲醛;产物,2-氯扁桃酸和底物中检测到的污染物,2-氯苯甲酸。发现,反应中加入氰化物对酶活性没有影响。也发现酶可以容忍微量三乙胺的存在。
【000406】各种反应成分对于SEQ ID NOS:385,386的活性的影响,是通过在酶反应中加入各种水平的可能抑制剂来评价的。从这些实验中,显示出醛及其氧化产物,2-氯苯甲酸都对酶活性有所损害。在反应中加入5mM 2-氯苯甲醛或5mM 2-氯苯甲酸,对SEQ ID NOS:385,386的活性分别有大约70%和40%的损失。
2-氯扁桃腈的按比例放大的水解
【000407】为了确认在用于产生(R)-2-氯扁桃酸中、由SEQ ID NOS:385,386所获得的转化和对映选择性,进行了大规模反应,并从含水混合物中分离了产物。反应在20mL反应体积中进行,底物浓度为45mM 2-氯扁桃腈。实现了羟腈的完全转化,所形成产物为30mM。产物的ee值是97%,酶的比活性是0.13mmol产物/mg腈水解酶/小时。
【000408】从该实验以及所进行的其它实验,显而易见,产物的形成不能说明底物的完全损失。在所有实验中,进行了含有腈的对照组样品,以便确定羟腈的分解程度。总的来说,在37℃,在4小时的时间段内,似乎损失了大约50%的底物。可以预期到,分解将一直到其平衡产物,氰化物和2-氯苯甲醛,该化合物将能进行进一步氧化。以90mM 2-氯扁桃腈的底物浓度,也进行了较大规模的反应。然而,在反应中没有检测到产物。应该预期到,在较高底物浓度下,平衡产物的浓度,2-氯苯甲醛和污染物,2-苯甲酸将以较高量存在。基于上述结果,很可能在这样的条件下,酶将被完全抑制。
两相条件下的反应
【000409】使用两相系统,将有助于在酶反应步骤后回收产物。这些系统也被用于去除对酶具有抑制作用的产物或副产物。在使用多种溶剂的两相条件下,腈水解酶显示出具有活性。在上述较高底物浓度下所得的低转化率之后,进一步使用命中酶,SEQ ID NOS:385,386,进行了两相系统的研究。重要的是,确定任意抑制因子是否能被溶剂相去除,以及使用两相系统是否能获得任何过程优势。
【000410】用己烷作为有机相,获得了有希望的结果。因此,进一步的研究涉及以两个不同水平使用该溶剂:水相体积的100%和70%,随着底物浓度的增加,高达90mM。将底物溶解在有机相中。己烷水平似乎对产物形成的水平没有影响,尤其是在2-氯扁桃腈浓度较高下。
【000411】再一次,在两相系统中,观察到了高转化,在5小时后观察到了76%的产物产量。产物形成率似乎比相应的单相系统中略微低一些,在相应的单相系统中反应是在1小时内完成的。在两相系统中,观察到了较低的对映选择性。导致这些结果的一些可能性是:(i)传质速率低于酶活性的速率或(ii)非极性溶剂直接影响酶。
【000412】在较高的底物浓度下,观察到了非常低的转化,从90mM 2-氯扁桃腈形成了7mM 2-氯扁桃酸。虽然这一转化水平较低,但仍然高于使用了相同底物浓度的单相系统中观察到的水平。这些结果暗示,在非极性有机溶剂中保留了一些抑制性的2-氯苯甲醛或2-氯苯甲酸。
标准测定条件:
【000413】制备下述溶液:
-底物储备溶液:50mM羟腈底物,在0.1M磷酸盐缓冲液中(pH8)。
-酶储备溶液:将3.33mL 0.1M磷酸盐缓冲液(pH8)加到每一小瓶中,其中含有20mg冻干细胞溶解产物(终浓度6mg蛋白质/ml)
【000414】反应体积在不同实验之间有所变化,这依赖于所采用的时间点的数量。除非另外说明,所有反应包括25mM 2-氯扁桃腈和10%(v/v)的酶储备溶液(终浓度6mg蛋白/ml)。除非另外说明,反应在37℃进行。对每一实验,进行了对照组,来监控腈降解。这些包括溶解在0.1M磷酸盐缓冲液(pH8)中的25mM 2-氯扁桃腈。
【000415】反应取样:通过从每一反应中取出等分试样,并且按照因数8稀释这些样品,对反应进行取样。通过手性和非手性HPLC方法,对双重复样品进行分析。除非如上述图中所示,反应在0.5,1,1.5,2,3和4小时进行取样。
HPLC方法
使用10mM磷酸钠缓冲液(pH2.5)的流动相,在SYNERGI-EPTM柱(4μm;50×2mm)上进行非手性HPLC方法。在3.5分钟引入甲醇梯度,并且在1.5分钟内增加到50%,随后将甲醇减少到0%。2-氯扁桃酸和2-氯扁桃腈的洗脱时间是2.5和6.1分钟,腈在5.9分钟出现了另一个峰。
【000416】如上所述,在研究过程中,对手性HPLC方法进行最优化,以改善2-氯苯甲酸和(S)-2-氯扁桃酸之间的分离。在研究的后半部分,使用优化的方法,并且在CHIROBIOTIC-RTM柱上进行。流动相是80%乙腈∶20%的0.5%(v/v)乙酸。(S)-2-氯扁桃酸和(R)-2-氯扁桃酸的洗脱时间分别是2.4和3.5分钟。2-氯苯甲酸的峰在1.9分钟洗脱。对于每一实验,在HPLC运行中包括产物的标准曲线。产物在样品中的浓度,是用这些曲线的斜率计算的。
pH的影响
【000417】通过在不同缓冲液范围内进行标准测定研究pH对酶活性和对映选择性的影响:0.1M柠檬酸磷酸盐,pH5;0.1M柠檬酸磷酸盐,pH6;0.1M磷酸钠,pH6;0.1M磷酸钠,pH7;0.1M磷酸钠,pH8;0.1M Tris-HCl,pH8;和0.1MTris-HCl,pH9。除了SEQ ID NO:385,386使用标准浓度的一半之外(5%v/v酶储备溶液),对所有酶使用标准酶浓度。
温度的影响
【000418】温度对于活性和对映选择性的影响,是通过在一系列不同的温度范围内进行标准测定分析来研究的:室温,37℃,45℃,50℃和60℃。除了SEQ IDNO:385,386使用标准浓度的一半之外(5%v/v酶储备溶液),对所有酶使用标准酶浓度。
酶浓度的影响
【000419】在标准条件下进行反应,使用变化的酶浓度:1%,5%和10%(v/v)的酶储备溶液。用适当的缓冲液将反应体积标准化。
溶剂的加入
【000420】在存在作为共溶剂的甲醇的情况下,进行酶反应。以下述水平将甲醇加入到标准反应混合物中:0,5,10,15和20%(v/v)。
【000421】也对具有己烷的两相系统进行了研究。水相包括处于0.1M磷酸盐缓冲液(pH8)中的10%(v/v)的酶储备溶液。在加入到反应物中之前,将羟腈溶解到己烷中。使用了两个水平的有机相:与水相体积等值,和水相体积的0.7。此外,对一系列腈浓度进行了研究:25,45和90mM。这些反应是在室温下进行的。
【000422】这些反应的样品来自水相和有机相。在真空下离心蒸发己烷,并且再次溶解在甲醇和水的50∶50混合物中,这样样品与含水样品处于相同的稀度。对样品的分析,是通过非手性和手性HPLC进行的。
工艺成分的影响
【000423】(i)活性:通过将单一成分2-氯苯甲醛,2-氯苯甲酸或2-氯扁桃酸加入到酶反应中,建立工艺成分对于酶活性的影响。酶反应在标准条件下进行,在存在2种可能的抑制剂之一的情况下,如下所述:5,10,20和25mM 2-氯苯甲醛;1.5和5mM 2-氯苯甲酸;和10,20,40和80mM 2-氯扁桃酸。对照组反应在标准条件下进行,不加入添加剂。在每一取样时间,将样品以十分之一的水平稀释。使用含有反应成分但不含有酶的对照组样品,并且稀释到相同水平。通过非手性HPLC分析样品。
【000424】(ii)稳定性:在标准条件下鉴定酶活性之前,在存在反应成分,2-氯苯甲醛和2-氯扁桃酸的情况下,通过将酶温育预定时间期间,来监控酶对工艺条件的稳定性。在这些实验中,在存在下述反应成分的每一种的情况下,以3mg蛋白/ml的浓度温育酶,所述下述反应成分为:5,10,20和25mM 2-氯苯甲醛;和10,20,40和80mM 2-氯扁桃酸。通过仅仅在缓冲液中温育酶进行对照组反应。
【000425】检测条件:在特定添加剂中,在0,4,8和24小时的温育时间下,移出20μl酶溶液,并且加入60μl的41.6mM底物储备溶液和20μl缓冲液。从而在标准条件下检测酶活性。在加入底物后90分钟,对反应取样,用非手性HPLC方法进行分析。
酶促反应的按比例放大
【000426】在两个不同的浓度下进行酶反应:45mM和90mM底物。在标准条件下进行反应,即pH8(0.1 M磷酸钠缓冲液),37℃和10%(v/v)的酶储备溶液。在加入缓冲液之前,将底物溶解在10%(v/v)甲醇中。最终的反应体积是20ml,用磁力搅拌进行反应。
实施例14:用于对映选择性产生L-2-氨基-6,6-二甲氧基己酸的腈水解酶的最优化
5,5-二甲氧基戊醛 5,5-二甲氧基戊醛 L-2-氨基-6,6-二甲氧基己酸
氨基腈
【000427】四种分离的酶显示出将2-氨基-6-羟基己烷腈水解为(S)-2-氨基-6-羟基己酸,其具有朝向L-对映异构体的选择性。鉴别出一种新颖靶物质,该物质与(S)-2-氨基-6-羟基己酸具有类似结构。针对该靶物质,5,5-二甲氧基戊醛氨基腈,筛选了一组分离的腈水解酶。对阳性酶在该底物上进行表征。使用实验室进化技术,优化这些酶针对特定靶物质的改进的对映选择性。使用初步筛选鉴别推定的高表达突变型,这是使用HPLC确认的。
【000428】酶的优化:GSSMTM和GeneReassemblyTM可以在选择出的腈水解酶上进行,以便提高用于产生L-2-氨基-6,6-二甲氧基己酸的酶的对映选择性和活性。鉴别出四种能够对映选择性地将2-氨基-6-羟基己烷腈水解为L-(S)-2-氨基-6-羟基己酸的酶。然而,在新颖靶分子L-2-氨基-6,6-二甲氧基己酸中,存在微小的结构差异。为了确定该差异对酶的活性和对映选择性是否有影响,针对该新颖靶物质筛选具有完全谱的腈水解酶。
【000429】选择对于产生L-2-氨基-6,6-二甲氧基己酸表现出活性和对映选择性的最高组合的酶用于GSSMTM。在靶酶突变后,使用高通量筛选技术,在5,5-二甲氧基戊醛氨基腈上,筛选所得到的突变型。在通过HPLC分析确认高表达突变型之后,单一高表达突变型将被结合,以便进一步提高突变酶的性质。
【000430】与GSSMTM平行,GeneReassemblyTM也可以在亲本酶的组合上进行,可以对其中至少一种,选择针对L-2-氨基-6,6-二甲氧基己酸的活性和对映选择性。至少两种具有高度同源性的其它腈水解酶,可以与前述酶进行重装配;对这些酶进行选择,以便对重装配序列提供多样性。
【000431】开发用于对映选择性的高通量检测方法是该演化努力取得成功的关键。这样的检测方法是基于新颖酶的对映选择性的检测方法,该检测方法允许在比传统使用的HPLC方法显著更短的时间期间内筛选>30,000种突变型。
【000432】一方面,用一种非随机方法来产生变体,所述非随机方法被称作合成连接重装配(synthetic ligation reassembly),该方法与随机改组有关,除了核酸结构单元不是被改组或连接或随机嵌合,而是被非随机地装配,可以被用于产生变异体。该方法不需要在被改组的核酸之间存在高度同源性。可以用连接重装配方法非随机地产生具有至少10100或至少101000不同嵌合体的后代分子文库(或集合)。连接重装配方法提供了一种非随机方法,用于产生一组最终的嵌合核酸,该核酸具有通过设计所选择的总装配次序,该方法包括通过设计产生多个特异性核酸结构单元,这些核酸结构单元具有有用的相互相容可连接末端,这样在装配这些核酸结构单元时,可以实现设计的总装配次序。
【000433】将被装配的核酸结构单元的相互相容可连接末端,如果它们能使得结构单元以预定次序被结合,就被认为对这一类型的有序装配是“有用的(serviceable)”。因此,一方面,其中核酸结构单元可以被结合的总装配次序是通过可连接末端的设计指定的,如果使用不止一个装配步骤,那么其中核酸结构单元可以被结合的总装配次序也可以通过装配步骤的顺序次序来指定。在本发明的一个方面,用酶,如连接酶(例如,T4 DNA连接酶)来处理退火的结构片段,以获得结构片段的共价连接。
【000434】在另一个方面,在对一组祖先核酸模板的序列分析上,获得核酸结构单元的设计,该祖先核酸模板被作为产生一组后代最终嵌合核酸分子的基础。这些祖先核酸模板因此作为序列信息来源,有助于被诱变,即嵌合,重组或改组的的核酸结构单元的设计。
【000435】在一个例证中,本发明提供了嵌合相关基因家族和它们的相关产物的编码家族。在一个特别的例证中,被编码的产物是腈水解酶。编码本发明的腈水解酶的核酸可以根据此处描述的方法被诱变。
【000436】因此,根据本发明的一个方面,编码腈水解酶的大量祖先核酸模板的序列被比对,以便选择一个或多个分界点,这些分界点可以位于一个同源性区域中。分界点可以被用来描述将要产生的核酸结构单元的边界。因此,在祖先分子中鉴别和选择的分界点,作为后代分子装配中的潜在的嵌合点。
【000437】典型地,有用的分界点是由至少两个祖先模板共有的同源性区域(包括至少一个同源的核苷酸碱基),但分界点可以是由至少一半的祖先模板,至少三分之二的祖先模板,至少四方之三的祖先模板,和优选地至少所有祖先模板共有的同源性区域。仍然,甚至更优选地,有用的分界点是由所有祖先模板共有的同源性区域。
【000438】一方面,连接重装配过程被彻底地进行,以便产生详尽的文库。换句话说,核酸结构单元的所有可能的有序组合被表示在最终嵌合的核酸分子的集合中。同时,设计(或非随机的,非无规则的)每一组合中的装配次序(即在每一最终嵌合核酸的5’到3’序列上的每一结构单元的装配次序)。由于该方法的非随机特性,大大降低了不受欢迎的副产物的可能性。
【000439】在另一个方面,该方法提供了,连接重装配过程被系统地进行,例如以便产生系统地区室化文库,具有可以被系统地筛选的区室,例如,一个接着一个。每一区室(或部分)拥有具有已知特性的嵌合体或重组体。换句话说,本发明提供了,通过选择性和明智地使用特定核酸结构单元,结合选择性和明智地使用连续步骤的装配反应,可以实现实验设计,其中在几个反应容器中的每一个中产生后代产物的特定集合。这允许进行系统的检验和筛选程序。因此,这允许潜在的非常大量的后代分子以较小群体被系统地检验。
【000440】由于其以高度灵活但仍然彻底且系统的方式进行嵌合作用的能力,尤其当祖先分子之间存在低水平同源性时,此处描述的本发明提供了产生包括大量后代分子的文库(或集合)。由于连接重装配方法的非随机性质,所产生的后代分子优选地包括具有通过设计所选择的总装配次序的最终嵌合核酸分子的文库。在一个特定的方面,这样产生的文库包括大于103到大于101000不同后代分子种类。
【000441】在另一个例证中,其中产生结构单元的步骤的合成性质,允许设计和引入核苷酸(例如一个或多个核苷酸,例如可以是密码子或内含子或调节序列),所述核苷酸随后被任选地在体外过程中被去除(例如通过诱变)或在体内过程中被去除(例如通过使用宿主生物的基因剪接能力)。应该预期到,在许多情况下,引入这些核苷酸也是期望的,除了产生有用的分界点的潜在益处外,还有许多其它原因。
【000442】本发明的合成连接重装配方法使用了多个核酸结构单元,这些结构单元中的每一个优选地具有两个可连接末端。在每一核酸结构单元上的两个可连接末端,可以是钝末端(即每一个末端具有零个核苷酸的突出端),或优选地一个钝末端和一个突出端,或更优选地仍然是两个突出端。在一个双链核酸上,有用的突出端可以是3’突出端,或5’突出端。核酸结构单元可以具有一个3’突出端,一个5’突出端,两个3’突出端,或两个5’突出端。其中核酸结构单元被装配形成最终嵌合核酸分子的总次序,是通过有目的的实验设计确定的(例如基于5’和3’突出端的序列,设计结构单元核酸之间的粘末端),并且不是无规则的。
【000443】根据一个优选的方面,核酸结构单元是如下产生的:化学合成两个单链核酸(也被称作单链寡核苷酸),并且在杂交条件下将其接触以允许它们退火形成双链核酸结构单元。双链核酸结构单元的大小是可以变化的。这些结构单元的大小可以小或大。结构单元的优选大小范围,是从1个碱基对(不包括任何突出端)到100,000个碱基对(不包括任何突出端)。也提供了其它优选的大小范围,下限为1bp到10,000bp(包括在此之间的每一个整数数值),上限为2bp到100,100bp(包括在此之间的每一个整数数值)。
【000444】根据一个方面,首先通过产生两个单链核酸,然后允许它们退火以形成双链核酸结构单元,从而产生双链核酸结构单元。在双链核酸结构单元的两个链中,除了在任何形成突出端的核苷酸上,在每一个核苷酸可以是互补的;因此不含有错配,除了任何突出端。根据另一个方面,双链核酸结构单元的两个链在比除了任何形成突出端的核苷酸的每一个核苷酸少的情形下,是互补的。因此,根据这一方面,双链核酸结构单元可以被用来引入密码子简并。优选地,使用此处描述的位点饱和诱变引入密码子简并,使用一个或多个N,N,GIT盒子或可选择地使用一个或多个N,N,N盒子。
实施例15:评价腈水解酶活性和对映选择性的分析方法
【000445】描述了一种测定方法,该方法可以适合高通量自动装置,以便增加对于腈水解酶的发现和演化尝试的筛选通量。理想的测定方法是,即允许量化产物形成或底物转化,以及对映体过量的测定方法。开发了适合高通量筛选的两种非手性和两种手性比色测定法。
所开发的非手性比色测定:
【000446】用于残余底物的OPA测定。邻苯二醛测定(OPA测定)适用于α-氨基或α-羟基腈底物。全细胞的裂解是不必要的。针对2-氯扁桃腈和苯基乙醛羟腈,这些结果通过HPLC得到了验证。测定工作最好使用芳族腈。脂族化合物表现出线性标准曲线,荧光性被降低,从而降低了测定的效率。
【000447】所形成的羟基酸的量化和ee确定的LDH测定
LDH测定适用于苯基乳酸,但不适用于2-氯扁桃酸。使用刃天青检测系统增加灵敏度,并且降低背景。在进行测定之前,全细胞的背景荧光或者通过离心或者通过热灭活来克服。
【000448】所形成的氨基酸的量化和ee确定的AAO测定
AAO测定适用于苯基丙氨酸,和(S)-2-氨基-6-羟基己酸。使用Amplex Red检测系统增加灵敏度。显示出细胞裂解是不必要的。细胞在确定成分培养基中生长,以便防止背景荧光。
OPA测定
【000449】使用基于腈水解酶测定的邻苯二醛(OPA)荧光来量化剩余的α-羟基腈底物的量。α-羟基腈在pH控制的分解中形成相应的醛和氰化物,OPA与其中释放的氰化物反应,产生发荧光的、可以计量的产物。OPA与α-羟基腈在pH控制的、形成相应的醛和氰化物的降解中所释放的氰化物反应,产生发荧光的1-氰基-2-R-苯并异吲哚。
【000450】建立针对下述底物的标准曲线:2-氯扁桃腈(CMN,0.998),环己基扁桃腈(CHMN,0.99),乙酰苯基氨基腈(APA,0.99),和苯乙醛羟腈(PAC,0.97),(图5),(圆括号中为R2值)。也建立了苯基甘氨酸的标准曲线(PGN,0.93)。三种所实验的底物,二甲基正丁醛氨基腈(DMB)(2-氨基-4,4-二甲基戊烷腈),羟基新戊醛氨基腈(Hydroxypivaldehyde aminonitrile,HPA),和新戊醛氨基腈(pivaldehyde aminonitrile,PAH),在最初的测定条件下,给出非常低的荧光读数和不可靠的结果。然而,对于这些化合物,其中许多参数被调整,但是通过这些操作没有增加这些化合物的荧光信号强度。
【000451】在增加这三种化合物的荧光信号的尝试中,用萘二羧醛(NDA)代替OPA。构建了使用OPA或者NDA的PAH,HPA和DMB的标准曲线。为了确定灵敏度和背景荧光,加入对底物中的每一种具有疑似催化活性的冻干腈水解酶溶解产物(SEQ ID NOS:189,190)。在四种化合物中的三种化合物中检测水解。NDA急剧地增强了信号,通常以一个数量级增加,尽管这一降低的线性大概是由于信号的饱和。
【000452】NDA被确定作为脂族化合物的可替代的检测试剂。然而,对于该测定,期望的是对于所有底物使用相同的检测系统,原因在于这将有助于自动化评价多个腈水解酶底物。对于分析PAC,CMN,CHMN,APA,MN和PGN,当前基于OPA的测定是有效的。尽管已经开发了针对脂族化合物PAH,HPA和DMB的标准曲线。
全细胞最优化
【000453】评价将冻干腈水解酶溶解产物加入到测定成分中的影响,或者是未处理的,或者是热灭活的。在任一种情况下,没有观察到干涉性背景荧光。接着评价OPA测定,并且以全细胞格式最优化针对腈水解酶活性检测的OPA测定。评价表达腈水解酶的全细胞和原位溶解细胞。评价冻干细胞溶解产物,同时它们各自的全细胞克隆作为对照组。对于这一最优化研究,选择扁桃腈(MN)作为模型底物。
【000454】与表达SEQ ID NOS:187,188的全细胞,和表达SEQ ID NOS:187,188的原位溶解细胞一起,评价SEQ ID NOS:187,188的冻干细胞溶解产物。加入全细胞不会影响荧光,也不会导致荧光猝灭。加入三种细胞溶解溶液中的任一种可以提高全细胞系统中扁桃腈的渗透性(以及因此的转化)。评价三种细胞溶解溶液:B-PER(Pierce),BugBuster(Novagen)和CelLytic B-II(Sigma),发现对OPA测定没有有害影响。加入产物α-羟基酸或α-氨基酸不会影响OPA测定的检测。
【000455】对测定的需要几个液体转移步骤的原始格式进行修改,修改到一个平板过程中,其中细胞生长、腈水解和OPA测定反应在同一微量滴定平板中发生。使用该单一孔格式试验扁桃腈。在这种情况下,评价大肠杆菌基因位点饱和诱变(GSSMTM)细胞宿主。对三种克隆进行试验:SEQ ID NOS:101,102,SEQ IDNOS:187,188,和一个用作对照组的空白载体。在四个时间点评价水解,在10和20mM下,也使用0mM对照组。在一个早期的实验中,针对苯乙醛羟腈底物(对于该化合物,该酶没有表现出活性)评价克隆SEQ ID NOS:187,188,没有观察到活性。
【000456】OPA测定被发现可以检测α-羟基和α-氨基腈底物的存在。用该测定可以容易地检测芳族化合物,而对于脂族化合物在检测上有一些挑战。当使用冻干细胞溶解产物、原位溶解全细胞或未溶解全细胞时,没有明显的背景问题。该测定适用于单平板分析,其中细胞生长、用底物温育、和测定是在同一平板上进行:不需要液体转移,易于实现自动化。尽管所有被试验的腈产生了线性响应,但脂族化合物显示出低荧光响应。
手性LDH测定(手性乳酸脱氢酶测定)
【000457】开发了基于乳酸脱氢酶(L-LDH)的光谱系统,用于分析通过腈水解酶催化水解羟腈所产生的手性α-羟基酸。羟基腈底物没有被次级或检测酶代谢,从而没有干扰起始材料。未经过热处理的细胞溶解产物导致LDH系统的背景活性;然而,细胞溶解产物的热灭活或造粒消除了背景活性。(参见图4)
【000458】评价此处公开的针对腈水解酶的可以商购的D-和L-乳酸脱氢酶的活性和对映体异构特异性。鉴别出LDH,其既适合于D-苯基乳酸分析,也适合于L-苯基乳酸分析。没有发现适合于2-氯扁桃酸分析的酶。所选择的LDH酶表现出实质上绝对的立体选择性。建立使用冻干细胞溶解产物检测从PAC产生的D-和L-LDH的测定的可行性。
【000459】最初,评价了三种比色染料,所有这些是四唑盐:NBT(3,3’-二甲氧基-4,4’-亚联苯基)双[2,(4-硝基苯基)-5-苯基-2H]-,氯化物)MTT(3-(4,5-二甲基噻唑-2-基)-2,5-二苯基溴化四唑)INT(2-(4-碘苯基)-3-(4-硝基苯基)-5-2H-氯化四唑)。这些检测系统的产物不溶解性在分析上具有挑战。为了说明这一点,评价了另一种已报导的具有可溶性产物的四唑盐,XTT(2,3-双-(2-甲氧基-4-硝基-5-磺苯基)-2H-四唑-5-羧基腈)。尽管XTT产生了一种可溶性大红色的产物,但底物是不可溶的,从而导致相同的分析挑战。作为四唑染料族的一种可选择性,评价了双重性的比色/荧光染料刃天青。刃天青的氧化产生了resourfin。底物和产物都是可溶的,颜色变化可以比色或使用荧光测定法量化,从而增加了准确性。由于刃天青的灵敏度,可以量化0.05mM乳酸。当使用与底物相同范围的染料时,获得了最适结果,例如0.5mM刃天青可以量化0.05到0.5范围内的乳酸(及其类似物),尽管最好的线性处于该数值范围的较下端。resourfin在28小时内是稳定的,并且具有线性荧光响应。
【000460】在存在LDH测定成分的情况下,冻干酶显示出背景荧光/吸收。为了说明这一问题,将溶解产物煮沸10分钟,然后离心。这导致背景信号降低了90%。有趣的是,单独离心(5分钟@14.1rcf)或离心后煮沸(5分钟@100℃)可以将荧光降低到背景水平。由于煮沸将增加蒸发(8μl孔大小),并且潜在地挥发(volatize)腈底物,所以在高通量格式中,如1536孔平板中,旋转相对于煮沸是优选的。从生长培养基(LB和TB和M9)或细胞溶解溶液(B-PER,CelLytic和BugBuster)中没有记录到背景信号。
手性AAO测定
【000461】开发了基于氨基酸氧化酶(AAO)的光谱系统,用于分析腈水解酶催化水解氨基腈所产生的手性α-氨基酸。
测定开发和确认
【000462】最初的测定确认,使用了如上面所描述的2,2’-连氮基-双-{3-乙基苯并噻吡咯啉-6-磺酸(ABTS)检测系统。然而,由于颜色不稳定,进一步的研究使用了苯酚氨基安替比林(PAAP)检测系统,是在λmax510nm分析的。对4-甲基-亮氨酸、苯丙氨酸、(S)-2-氨基-6-羟基己酸和叔亮氨酸的每一对映异构体发现了具有适当活性的酶。该测定不适合于甲基苯基甘氨酸,与苯基甘氨酸不能很好地工作。
【000463】对于苯丙氨酸产生了从0-15mM的标准曲线。当浓度保持在低于1mM时,该曲线更加线性。只要保持黑暗,颜色在好几天内都能保持稳定。三种细胞溶解溶液Bug Buster(BB),细菌蛋白提取试剂(Bacterial Protein Extracting Reagent,BPER),和细胞溶解试剂(Cell Lytic Reagent,CLR)被加入到标准曲线中,没有显示出对显色的影响。加入细胞溶解产物(cell lysate,cl)没有表现出形成背景颜色。加入苯基乙醛氨基腈硫酸盐(PAS),起始材料也没有显示出对颜色形成的影响。
【000464】AAO系统在直至高达1mM的底物下表现出更强的线性。调节AAO酶和酸性底物的浓度,以试图去除L-AAO和D-AAO曲线在接近图表中间时的交叉。预先混和PAAP,HRP和AAO证明是有效的,并且没有引起所观察的活性的变化,这确定了测定成分可以被加入到混和格式的测定中。
【000465】对全细胞的AAO测定观察到了高水平的背景,这归因于TB和LB生长培养基中存在的L-氨基酸。在M9培养基中洗涤和重悬浮细胞消除了背景。对于所有未来的实验,细胞在具有0.2%葡萄糖的M9培养基中培养。与未溶解细胞相比,溶解细胞仅仅显示出略微好的响应。因此,细胞溶解是不必要的。SEQ IDNOS:187,188在基于HPLC分析的初步筛选中表现出对HPA的活性。
【00466】研究了荧光检测系统的用途,该系统允许以超高通量方式执行测定,如1536孔或千兆矩阵格式。最适合我们的系统的荧光试剂是来自Molecular Probes的Amplex Red,该试剂产生了高度荧光的试卤灵(resorufin)(λex545nm;λem590nm)。建立苯丙氨酸和(S)-氨基-6-羟基己酸的标准曲线(0-100μM)。
【000467】在测定自动化的准备中,通过荧光激活细胞分选(FACS)将表达腈水解酶的细胞加入到含有M90.2%葡萄糖,0.25mM IPTG培养基的微量滴定平板中。评价三种表达亚克隆的腈水解酶,和空白载体对照组:SEQ ID NOS:101,102,SEQID NOS:187,188,SEQ ID NOS:29,30,和空白载体。细胞分选之后,证明细胞生存能力是不一致的。因此目前评价了菌落挑取,作为一种可选的方法,将细胞加入到微量滴定平板中。未覆盖的1536孔微量滴定平板的蒸发损失在自动培养箱中每天大约是30%(培养条件:85%相对湿度(RH),温度为37℃)。在95%RH培养箱中培养,将每天的蒸发损失降低到了1%。
【000468】使用HPA腈,建立三种亚克隆在存在高达3.5mM腈的情况下的生长能力。生长速度仅仅略微受到了影响(低于30%)。在存在HPA的情况下,生长的亚克隆显示出表达能催化羟基正亮氨酸(hydroxy norleucine,HNL)形成的腈水解酶,正如使用Amplex Red检测系统所建立的。当酶是S-选择性时,仅仅评价了S。在10分钟时间间隔下,读取反应平板,在40分钟显示出最好的线性。尽管当细胞在pH7生长时,高于5mM的HPA极大地抑制了细胞生长,但对于细胞在pH8的生长,高于0.1mM HPA抑制了生长。
【000469】为了使用HPLC验证AAO结果,使用高浓度HPA,高达40mM(由于对于(S)-2-氨基-6-羟基己酸的HPLC检测挑战),和冻干细胞溶解产物SEQ IDNOS:187,188进行反应。
比较对于HNL的AAO和HPLC数据
%ee | %转化率 | |||
[HNL] | ||||
mM40302010 | AAO89%89%86%78% | HPLC100%97%97%98% | AAO17%29%21%13% | HPLC18%36%34%35% |
【000470】为了确定与基于HPLC筛选所用的20mM底物范围相比,以较低浓度进行筛选是否会引起结果的偏差,使用三个浓度范围用SEQ ID NOS:187,188进行实验。每一实验被进行三个重复,以便去除任何非系统误差。
【000471】AAO测定可以在384或1536孔格式上进行,细胞被分拣到M9 0.2%葡萄糖,0.25mM IPTG培养基中。在存在腈的情况下培养细胞(在这种情况下是HPA),或者允许细胞达到一定浓度,然后加入腈。尽管细胞溶解试剂不干扰测定,当测定HPA时,发现加入溶解试剂是不必要的。或者在之前加入腈,或者在之后加入腈,母板将必须被拆分为子板,然后分别对L-和D-对映异构体含量进行测定。用AAO/Amplex Red试剂调节温育时间,以便在不同时间读取D-和L-平板。
实施例16:鉴别、开发和产生强劲、新颖酶
靶向一系列高数值对映选择性生物过程
【000472】本发明提供了通过定向进化(directed evolution)开发腈水解酶,该腈水解酶为下述化学靶物质的工艺制造提供了极大的技术和商业优势:
L-2-氨基-6,6-二甲氧基己酸
5,5-二甲氧基戊醛 5,5-二甲氧基戊醛氨基腈 L-2-氨基-6,6-二甲氧基己酸
【000473】腈水解酶显示出将2-氨基-6-羟基己烷腈水解为(S)-2-氨基-6-羟基己酸,具有针对L-对映异构体的选择性。针对靶物质5,5-二甲氧基戊醛氨基腈,筛选一系列腈水解酶。在该底物上,对阳性酶进行表征。使用初步筛选来鉴别推定的高表达突变型,然后使用HPLC来确认。
【000474】在所选择的腈水解酶上进行GSSMTM和GeneReassemblyTM,以便提高用于产生L-2-氨基-6,6-二甲氧基己酸的酶的对映选择性和活性。鉴别将2-氨基-6-羟基己烷腈对映选择性水解为L-(S)-2-氨基-6-羟基己酸的腈水解酶。然而,新颖靶分子L-2-氨基-6,6-二甲氧基己酸表现出微小的结构差异。为了确定该差异是否影响酶的活性和对映选择性,针对该新颖靶物质筛选腈水解酶的完整谱。
【000475】首先,通过更详细的表征产生L-2-氨基-6,6-二甲氧基己酸的命中酶,进行用于GSSM的正确靶基因的鉴别。该尝试涉及更广泛地研究pH和温度对于活性和对映选择性的影响,以及更深入地分析酶对工艺条件的稳定性。在最初筛选之前,完成烷基氨基腈的单一对映异构体的合成;在理解该因素和酶的对映选择性之间的关系的尝试中,研究腈的外消旋作用。
选择用于产生L-2-氨基-6,6-二甲氧基己酸的表现出活性和对映选择性的高度组合的酶用于GSSM。在靶酶突变后,使用高通量筛选技术,针对5,5-二甲氧基戊醛氨基腈筛选所得到的突变体。在通过HPLC分析确认高表达突变型之后,达到了决策点,以便评价GSSM针对靶物质的结果。
【000476】与GSSMTM平行,GeneReassemblyTM也可以在亲本酶的组合上进行,其中至少一种是选择针对L-2-氨基-6,6-二甲氧基己酸的活性和对映选择性。至少两种其它腈水解酶可以与前述酶进行重装配;对这些酶进行选择,以便对重装配序列提供多样性。
【000477】本发明提供了开发针对原始底物氨基腈的外消旋条件。此外,本发明提供了通过动态动力学拆分来鉴别能将这些氨基腈转化为靶α-氨基酸的酶。本发明也提供了筛选和开发用于产生(R)-2-氨基-6,6-二甲氧基己酸(ε-醛基赖氨酸)的腈水解酶催化的动力学拆分方法。(S)-2-氨基-6-羟基己酸将被用作模型底物,用于开发动力学拆分。靶α-氨基酸如下所示:
(i)D-4-氟苯基甘氨酸
4-氟苯甲醛 4-氟苯基甘氨酸腈(FPGN) D-4-氟苯基甘氨酸
(ii)L-2-氨基-6,6-二甲氧基己酸(ε-醛基赖氨酸)
5,5-二甲氧基戊醛 5,5-二甲氧基戊醛 L-2-氨基-6,6-二甲氧基
氨基腈(DMPAN) 己酸
【000478】开发了用于腈水解酶催化产生D-4-氟苯基甘氨酸和2-氨基-4,4-二甲基戊烷腈(ε-醛基赖氨酸)的氨基腈底物的外消旋作用的条件。两种模型底物,最初使用苯基甘氨酸腈和戊醛氨基腈,在不存在酶的情况下研究外消旋作用。进行了在多种可能的外消旋作用条件下同时确定一种或多种可利用的腈水解酶的性能。此外,针对用于产生(S)-2-氨基-6-羟基己酸的羟基戊醛氨基腈筛选腈水解酶,并且最优化有希望的酶。一旦建立了外消旋作用条件,筛选腈水解酶的活性。进行对产物的动力学拆分的进一步最优化。
【000479】鉴别用于将α-氨基腈水解为α-氨基酸的大量对映选择性腈水解酶。尽管这些酶显示出优先选择某些氨基腈的所需对映异构体,在进一步筛选、开发和比较候选腈水解酶中的一个限制因素,是氨基腈底物在反应条件下的外消旋作用的速率。
芳族氨基腈外消旋作用
【000480】第一个步骤是使用模型底物,苯基甘氨酸腈,建立芳族氨基腈外消旋发生的条件。外消旋作用策略包括,但不限于下述列表所示。根据它们商业适用性,粗略地区分选项的优先次序。
(1)控制反应的pH。既然显示出外消旋在高pH时比较快,该方法需要发现和最优化在pH>10时具有活性和选择性的腈水解酶。
(2)加入已知化学外消旋试剂,如醛、酮、弱碱、树脂、金属离子、路易斯酸等等,这些试剂在较低pH下可以增加外消旋作用。
(3)合成N-酰化氨基腈衍生物,例如N-乙酰基苯基甘氨酸腈,该化合物更容易被外消旋。在N-乙酰基苯基甘氨酸腈的情况中,去除乙酰基的选择性D-酰基转移酶将能增加腈水解酶产物的光学纯度。
(4)使用两相系统,其中碱催化的外消旋发生在疏水有机相中,酶促水解发生在水相中。
(5)使用2-酶系统,该系统包括腈水解酶和氨基腈消旋酶。目前可以通过商业途径获得一种氨基酸消旋酶,该氨基酸消旋酶将被用来试验针对苯基和氟苯基甘氨酸腈的活性。搜索基因文库,以搜索与已知氨基酸酰胺消旋酶、乙内酰脲消旋酶或可以被鉴别的任何其它消旋酶表现出同源性的基因。
【000481】一旦已经建立外消旋作用的条件,就可以为开发靶芳族底物,4-氟苯基甘氨酸腈(FPGN)的外消旋条件提供基础。FPGN被认为稳定性低于模型底物;因此,可以更快地发生外消旋作用,但降解反应同样也比较快。评价样品酶忍受和/或在该条件下良好地发挥作用的能力。最终最优化筛选方法包括靶底物、样品腈水解酶和底物外消旋条件。
【000482】所进行的研究已经显示出,苯基甘氨酸腈在pH10.8易于发生外消旋。然而,似乎现有酶中的任一种酶没有显示出能忍受这样苛刻的pH条件。筛选来自高度碱性环境的样品,筛选是否存在能忍受这样的条件的腈水解酶。一旦发现,对这些酶进行测序和亚克隆,并且这些酶被以冻干细胞溶解产物形式产生,以备筛选使用。
脂族氨基腈外消旋作用
【000483】一种模型脂族氨基腈,戊醛氨基腈,被以其外消旋形式合成。然而,使用下述方法,制备旋光富集的样品:(i)制备性手性HPLC;(ii)非对映体盐拆分;(iii)非对映体衍生或柱色谱;(iv)从L-N-BOC正亮氨酸合成。用HPLC分析检测这些化合物。
HPLC测定
【000484】使用HPLC测定来检测(S)-2-氨基-6-羟基己酸。使用包括柱衍生之前的测定。
筛选/表征:
【000485】针对2-氨基-6-羟基己烷腈筛选腈水解酶。对于在大于25mM的底物下能很好地发挥作用的酶,进行按比例放大的反应。研究其它酶的底物/产物耐受性和稳定性曲线。
【000486】筛选腈水解酶,并且对命中酶进行表征,重点在于反应条件下的最适pH和最适温度,对映选择性和稳定性。
酶进化
【000487】选择表现出期望特性的靶酶用于GSSMTM。在靶酶突变后,使用高通量筛选技术在底物上筛选所得到的突变体。一旦已经通过HPLC分析确认了高表达突变型,组合能增加性能的单一突变,并且评价可能的加性或协同效应。
【000488】此外,可以在先导酶(lead enzymes)的组合上进行GeneReassemblyTM,选择期望特性,包括在反应中的活性、对映选择性和稳定性。
实施例17:用于对映选择性产生(S)-苯基乳酸的腈水解酶的最优化
【000489】鉴别用于对映选择性水解5种不同的腈底物的腈水解酶。分离这些腈水解酶,并且针对所选择的靶物质进行最优化。最优化包括过程最优化和定向进化或定向演化(directed evolution)。尤其是,表征和最优化对于产生(S)-苯基乳酸特异的酶。这样做的目的主要在于提高酶的活性,同时保持高对映选择性。也进行了工艺条件对酶的影响的研究。
苯乙醛 苯乙醛羟腈 (S)-苯基乳酸
【000490】用于从潜在的定向演化尝试中筛选突变体的高通量测定的开发取得了成功。开发了适合于高通量筛选的两种非手性和两种手性比色测定,并且用于腈水解酶定向进化。
【000491】鉴别作为用于产生(S)-苯基乳酸的高对映选择性腈水解酶的SEQ IDNOS:103,104。SEQ ID NOS:103,104的表征显示出最适反应pH和温度,分别是pH8和37℃;分别高达5mM和30mM水平的反应起始材料,苯乙醛,和产物,苯基乳酸显示出对酶活性没有影响。按比例放大的酶促反应具有95%的对映体过量(ee)。
实施例18:编码腈水解酶的核酸的定向进化
【000492】对nitB基因进行基因位点饱和诱变TM或GSSMTM,以产生覆盖整个酶的单一氨基酸取代突变体的文库。定向进化中所用的“亲本”nitB基因的序列是SEQ ID NOS:103,104。通过执行GSSMTM产生nitB突变体文库。然后筛选nitB突变体文库,筛选具有增加的全细胞羟甲基硫代丁基腈(hydroxymethylthiobutryonitrile,HMTBN,是一种腈水解酶底物)活性的克隆。
腈水解酶反应在该底物上的产物是羟甲基硫代丁酸(hydroxymethylthiobutyric acid,HMTBA)。
【000493】在35℃下,用100mM HMTBN和100mM K3PO4,pH7进行测定,大约30-40%转化率。使用两种方法定量表示HMTBN转化,一个方法是由HPLC分析所产生的关于HMTBS的直接测量值,另一个方法使用使用荧光氰化物测定的针对残余HMTBN的间接检测,该测定先前已经进行了描述。
【000494】对推定的nitB高表达突变体进行二次测定,以确认所增加的活性。在二次测定中,在摇瓶中在表达培养基中,诱导高表达突变型和野生型对照组。然后用100mM K3PO4,pH7洗涤摇瓶培养基,并且重悬浮到相同的光学密度,在660nm下。然后用标准化细胞重悬浮液,在与初始测定中所用的条件相同的条件下,进行动力学测定。对已经确认的具有增加的HMTBN活性的推定高表达突变型进行测序,并且在转化回相同表达菌株后,试验增加的活性,以确保活性的增加不是由于宿主突变。
【000495】所确认的nitB GSSMTM高表达突变型是nitB G46P,其在氨基酸46上具有甘氨酸(GGT)到脯氨酸(CCG)的取代。在25℃和35℃下,该突变体的全细胞HMTBN活性比野生型NitB均大约高50%。一旦鉴别了有利的G46P突变,再次使用nitB G46P模板,用GSSMTM来产生双重突变体的集合。这些突变体都含有G46P突变,以及在一个随机位点具有额外的单一氨基酸取代。测定双重突变体的HMTBN活性,其高于nitB G46P的HMTBN活性。产生双重、三重和四重突变体,以便加速突变过程,更快地鉴别有利的突变。在鉴别和分离了最初几个有利的突变之后,将它们组合以产生双重突变体,其中最好的是DMl8。DMl8被用作模板来产生三重突变体。最有活性的三重突变体是TM3,其被用作模板来产生四重突变体。最有活性的四重突变体是QM2。该表概括了这些突变。
突变体 | 突变1 | 突变2 | 突变3 | 突变4 |
DMl8 | R(gcg)29C(tgt) | Y(tac)207M(atg) | ||
TM3 | R(gcg)29C(tgt) | Y(tac)207M(atg) | L(ctt)170T(act) | |
QM2 | R(gcg)29C(tgt) | Y(tac)207M(atg) | L(ctt)170T(act) | A(gcg)197 N9(aat) |
【000496】首先通过研究这些突变体的全细胞HMTBN活性表征这些突变体。在100mM HMTBN,QM2的HMTBS生产率比亲本基因的HMTBS生产率高1.2倍。然而,在200mM HMTBN,QM2的HMTBS生产率是亲本基因的3.6倍。当HMTBN浓度从100mM增加到300mM时,这些突变体的生产率有相应增加。对于转化率,在270分钟后,TM3完全地转化了底物,在该时间之后,DMl8和SM均显示出高于75%的转化率。为了进一步着手于HMTBN浓度影响NitB的活性/生产率的问题,在400mM和528mM HMTBN下,测定了好几种突变体。NitB在这些底物浓度下实质上没有活性,然而突变体在这些浓度下保留了显著的活性。尤其是,这些高浓度下的活性与它们在200mM底物下的活性实质上相同。因此,这些突变体可以在广泛的底物浓度范围内使用,并且可以在使用上提供比NitB亲本基因更好的灵活性。
【000497】突变体显示出比亲本基因具有更高的表达水平,而且,正如SDS-PAGE分析中所看到的,似乎QM2和TM3突变体比野生型含有更大比例的可溶性酶。至于稳定性,所有酶在25℃和35℃显示出实质上相同的稳定性模式。
【000498】最后,对突变体进行密码子最优化。该方法的目的在于最优化密码子,从而增加特定宿主细胞中的表达水平。这反过来又增加酶的每个细胞的活性。与对照组相比,这导致密码子最优化的突变体中全细胞活性的增加。活性的增加量大约是该活性的两倍。使用大肠杆菌表达系统。
实施例19:从腈水解酶催化剂的反应所产生的化合物的选择实施例
【000499】图15中所列出的化合物是所选择的、可以使用本发明的酶和/或方法从腈水解酶催化的反应中产生的化合物。
【000500】此外,下面是可以通过腈水解酶斯特雷克尔格式(Strecker format)产生的潜在的化合物。可以使用本发明的腈水解酶从它们各自的醛或酮产生多于100种氨基酸和许多新颖药物。例如,可以使用本发明的腈水解酶合成的具有很大市场的药物包括:同聚苯丙氨酸(homophenylalanine),VASOTECTM,VASOTERICTM,TECZEMTM,PRINIVILTM,PRINZIDETM,ZESTRILTM,ZESTORETICTM,RAMACETM,TARKATM,MAVIKTM,TRANDOAPRILTM,TRANDOLAPRILATTM,ALTACETM,ODRIKTM,UNIRETICTM,LOTENSINTM,LOTRELTM,CAPOTENTM,MONOPRILTM,TANATRILTM,ACECOLTM,LONGESTM,SPIRAPRILTM,QUINAPRILTM和CILAZAPRILTM。其它手性药物包括DEMSERTM(α-甲基-L-酪氨酸),ALDOCHLORTM,LEVOTHROIDTM,SYNTHROIDTM,CYTOMELTM,THYOLARTM,HYCODANTM,CUPRIMINETM,DEPENTM,PRIMAXINTM,MIGRANOLTM,D.H.E.-45,DIOVANTM,CEFOBIDTM,L-DOPA,D-DOPA,D-α-甲基-DOPA,L-α-甲基-DOPA,L-γ-羟基谷氨酸,D-γ-羟基谷氨酸,3-(2-萘基)-L-丙氨酸,D-高丝氨酸和L-高丝氨酸。
【000501】此外,本发明的腈水解酶可以用于合成下述氨基酸。这些氨基酸中的许多氨基酸具有药物应用。D-苯基甘氨酸,L-苯基甘氨酸,D-羟基苯基甘氨酸,L-羟基苯基甘氨酸,L-叔亮氨酸,D-叔亮氨酸,D-异亮氨酸,L-异亮氨酸,D-正亮氨酸,L-正亮氨酸,D-正缬氨酸,L-正缬氨酸,D-2-噻吩基甘氨酸,L-2-噻吩基甘氨基,L-2-氨基丁酸酯,D-2-氨基丁酸酯,D-环亮氨酸,L-环亮氨酸,D-2-甲基苯基甘氨酸,L-2-甲基苯基甘氨酸,L-噻吩基丙氨酸和D-噻吩基丙氨酸。
【000502】本发明的腈水解酶的酶能用于合成下述天然氨基酸:甘氨酸,L-丙氨酸,L-缬氨酸,L-亮氨酸,L-异亮氨酸,L-苯基丙氨酸,L-酪氨酸,L-色氨酸,L-半胱氨酸,L-甲硫氨酸,L-丝氨酸,D-丝氨酸,L-苏氨酸,L-赖氨酸,L-精氨酸,L-组氨酸,L-天冬氨酸,L-谷氨酸,L-天冬酰胺,L-谷氨酰胺和L-脯氨酸。下面是可以使用本发明的腈水解酶产生的非天然氨基酸的实例。D-丙氨酸,D-缬氨酸,D-亮氨酸,D-异亮氨酸,D-苯基丙氨酸,D-酪氨酸,D-色氨酸,D-半胱氨酸,D-甲硫氨酸,D-苏氨酸,D-赖氨酸,D-精氨酸,D-组氨酸,D-天冬氨酸,D-谷氨酸,D-天冬酰胺,D-谷氨酰胺和D-脯氨酸。
【000503】进一步,本发明的腈水解酶可以在非斯特雷克尔化学反应(non-Strecherchemical reactions)中使用,包括更多手性药物的合成,如TAXOTERETM,以及含有3-羟基-戊二腈($5.5B的市场);LIPITORTM,BAYCOLTM和LESCOLTM的手性药物。不是药物的手性产物靶物质包括PANTENOLTM,L-膦丝菌素(L-phosphinothricin),D-膦丝菌素(D-phosphinothricin),D-氟苯基丙氨酸和L-氟苯基丙氨酸。最后,腈水解酶可以被用来产生缺乏手性中心的非天然氨基酸化合物,如肌氨酸,亚氨基二乙酸,乙二胺四乙酸(EDTA),α-氨基丁酸和β-丙氨酸。
图16是本发明的腈水解酶和/或本发明的方法所产生的底物和产物的实例。所示为底物和产物的化学结构。此处所示的化学反应是本发明的腈水解酶活性的非限定性实例。
实施例20:使用SEQ ID NO:210的变体的多肽的例证性制备
【000504】变体,腈水解酶1506-83-H7A,是在残基190处用His取代Ala的SEQID NO:210。在密码子水平,所发生的突变是GCT到CAT。该变体在3-羟基戊二酰基腈(HGN)转化为(R)-4-氰基-3-羟基丁酸中表现出改进的对映选择性。
【000505】该变体已经被证明,在室温下,在100mM pH7磷酸钠缓冲液中进行该转化。该突变体可以在其它缓冲液系统和温度下进行,以及具有提供额外的改变特性的潜力。例证性特性包括,但不限于,改变的反应速率,%ee和稳定性。尤其是,改变的特性可以是较高的反应速率,较高的%ee和较高的稳定性。改变的特性可以是高于野生型至少25%、30%、35%、40%、45%、50%、55%、60%、65%、70%、75%、80%、85%、90%或95%。
【000506】该变体显示出通过以10mM到3M底物(HGN)的高对映体过量产生产物来进行转换。较高或较低底物浓度也是可能的。已经达到了大于或等于95%的对映体过量。然而,对映体过量可以是高于野生型的至少25%、30%、35%、40%、45%、50%、55%、60%、65%、70%、75%、80%、85%或90%。
【000507】本发明的SEQ ID NOS:的变体可以被克隆到表达载体中。例如,核酸序列SEQ ID NO:195,205,207,209或237的变体,和编码氨基酸序列SEQ ID NO:210的变体的核苷酸可以被克隆到例证性载体中,包括但不限于,pSE420(大肠杆菌表达载体)和pMYC(假单胞菌属表达载体)。
实施例21:使用本发明的变体的制备:
【000508】在室温下,~22℃,将3-羟基戊二酰基腈(1g,9mmol)滴加到在2.12mL的100mM pH7磷酸钠缓冲液中的腈水解酶细胞溶解产物的搅拌溶液中(标准化为150mg蛋白含量)。在室温下,用磁力搅拌棒将该3M溶液搅拌24小时。通过TLC(薄层色谱)和GC(气相色谱)监控反应进程。反应应该在24小时内完成。
【000509】此处预期的其它变体包括,但不限于下述:N111S;Al90H,S,Y或T;F191L,V,M,D,G,E,Y或T;M199E,或L;D222L;A55K,G或Q;I60E,或其任意组合。
实施例22:对映选择性转化的筛选测定分析
【000510】公开了一种新颖方法,用于筛选对映选择性转化,例如将前手性底物对映选择性地转化为手性底物,该方法可以提供监控所得到的产物的对映体过量(%ee)的能力。该方法也适用于确定非对映体过量(%de)。
【000511】例如,通过标记一个分子中的两个前手性或互变性(enantiotopic)部分之一,例如通过使用重同位素或轻同位素,可以通过质谱(MS)建立由选择性催化剂例如酶对两个部分中的一个进行修饰。
【000512】通过在15N-(R)-HGN(R)(如图17所示)或15N-(S)-HGN上进行例证性腈水解酶反应,可以通过分析被形成的两个可能标记产物与未标记酸性产物的每一种的量,确定酶的对映选择性。
【000513】筛选实验可以在任一个方向进行。筛选实验可以被用于15N-(R)-和(S)-HGN部分。事实上,为了确保标记物不招致任何人为假象变化,开始时应该对两个都进行研究。
【000514】为了使所观察的腈水解酶转化所得到的对映体过量相等,可以使用下述例证性公式:
%ee={[130]-[129]}/{[130]+[129]},其中轻酸(129)和重酸(130)的每一浓度通过质谱仪上的峰面积与标准曲线的相关性确定,或者通过直接比较129和130质谱峰中的每一个的面积来确定。用于确定两个对映异构体中的每一个的相对量(标记的和未标记的)的实际质量单位(actual mass units)依赖于质谱仪是如何调整的。
【000515】在一些情况下,通过质谱观察到的%ee可能与另一个替代性分析技术如液相色谱观察到的不同,具有一个因数的不同,这是由于天然同位素丰度所产生的背景或污染峰。然而,这不影响筛选过程的最终结果。量化重酸和轻酸的例证性标准曲线如图14A和B所示。
【000516】下述反应是可能的合成路线,例如用于制备15N(R)-HGN,使用了可商购的起始材料和本技术领域已知的化学技术。
【000517】通过以任一种阳性模式、阴性模式,使用MS,并且或者从亲本质量或者从任何碎裂质量的分析中,可以建立两种可能的立体结果中的每个结果的量。
实施例23:本发明的例证性酶的稳定性和活性
酶稳定性:
将野生型酶(SEQ ID NOS:209和210)与SEQ ID NOS:209和210的突变体A190H比较。在实验中,在两种不同的底物:己二腈和羟基戊二酰基腈上,在4℃和21℃,在水中以10mg/ml将每一种酶温育1、25、50、75和150小时。在所有情况下,发现两种酶可以将活性保持150小时。正如通过Nitroprusside Bertholit测定所评价的那样,野生型酶在己二腈上显示出较好的活性,而突变(A190H)的酶在羟基戊二酰基腈上显示出较好的活性(例如参见,Fawcett,J.K.&Scott,J.(1960);J.Clin.Path.;第13卷,第156页)。
SEQ ID NOS:209和210的GSSMTM变体 | 100mM羟基戊二酰基腈ee% | 2.5mM羟基戊二酰基腈ee% | 完成的时间(小时) |
A55G | 96.5±0.4 | 未测定 | >160 |
A55K | 94.7±0.2 | 未测定 | >160 |
160E | 96.5±0.5 | 未测定 | >160 |
N111S | 95.8±0.5 | 96.1±0.9 | >160 |
A190T | 96.5±0.2 | 96.6±0.4 | 40 |
A190S | 96.8±0.2 | 95.5±0.7 | 40 |
A190H | 97.9±0.1 | 98.1±0.1 | 15 |
F191L | 97.9±0.1 | 未测定 | >160 |
F191T | 97.9±0.1 | 未测定 | >160 |
F191M | 97.9±0.1 | 未测定 | >160 |
F191V | 97.9±0.1 | 未测定 | >160 |
M199E | 97.9±0.1 | 未测定 | 160 |
M199L | 97.9±0.1 | 95.4±0.1 | >160 |
野生型SEQ IDNOS:209和210 | 94.5±0.1 | 87.8±0.2 | 24 |
GSSM突变体具有增强的对映选择性
在全细胞格式中,用从大肠杆菌表达的腈水解酶进行100mM反应,用36小时完成该反应。用腈水解酶作为冻干澄清细胞溶解产物,进行2.25M反应。所有报道的%ee数据是三次测量值的平均值,并给出平均数标准差。反应完成的时间是通过TLC大概估计的。
特异性:
腈水解酶活性测定,100mM HGN:
对推定腈水解酶高表达突变型进行三重测定。将每一转化体,于37℃和220rpm下,在5mL LB(100μg/mL氨苄青霉素)中培养18小时。将过夜培养物进行2倍稀释,用0.1mM IPTG于37℃和220rpm下诱导水解酶表达6小时。通过离心收获细胞,在100mM pH7磷酸钠缓冲液中洗涤,然后重悬浮在1mL的处于100mM pH7磷酸钠缓冲液中的100mM HGN中。轻轻地搅拌,在22℃允许反应至少继续进行36小时。反应进程通过TLC(1∶1 EtOAc∶己烷,Rf=0.5,腈;Rf=0.0,酸)监控。通过离心,去除细胞和其它碎片,在冻干之前用1体积甲醇处理。将冻干物质重新悬浮在甲醇中,用四甲基硅烷(TMS)-重氮甲烷处理(10当量,溶解在己烷中的2M溶液),直到气体形成停止,黄色持续,以便制备用于GC分析的甲基酯。然后评价所选择的产生具有95%ee或更高ee的(R)-(-)-3-羟基-4-氰基丁酸的腈水解酶变体在2.25M HGN上的性能。
在2.25M 3-HGN下的腈水解酶活性测定
将3-HGN(0.2g,1.8mmol,3M)于22℃悬浮在磷酸钠缓冲液(0.6mL,pH7,100mM)中。加入细胞溶解产物(6mg,标准化为腈水解酶含量),使浓度达到11mg/ml酶,振荡反应物(100rpm,22℃)。反应进程通过TLC(1∶1乙酸乙酯∶己烷,Rf=0.32,腈;Rf=0.0,酸)监控。在冻干之前,用1份甲醇处理反应混合物。将冻干物质重悬浮在甲醇中,用10当量TMS-重氮甲烷处理(10当量,溶解在己烷中的2M溶液),以便制备甲基酯,并且通过GC分析。
用于筛选高数量样品的新颖高流通量LC/MS方法的描述:
超高流通量(Ultra Hieh-throughput)初步手性活性筛选:
通过自动化菌落分选仪将GSSM文库的不同成员分配到含有40μL(Luria-Bertani)LB培养基(100μg/mL氨苄青霉素)的384孔平板中,然后于37℃,85%湿度下,将其温育。用0.1mM IPTG于37将腈水解酶表达诱导24小时。复制每一平板,制备20%甘油原种,归档保存在-80℃下。将10mM 15N-(R)-1底物加入到每一个384孔平板中。于37℃、85%湿度下,将平板温育3天。通过离心去除细胞和其它碎片,在进行质谱分析之前,将上清液稀释17,576倍。
LC/MS离子喷雾以下述方式适用于高流通量(through-put)分析。通过使用CTCPAL自动取样器(Leap Technologies,Carrboro,N.C.),从384孔平板流动注射样品实现了高流通量筛选。使用71%乙腈、29%水等度混合物,带有0.1%的甲酸,通过LC-10Advp泵(Shimadzu,Kyoto,Japan)以2.2mL/分钟通过LC-18滤筒(Supelco,Bellefonte,PA)予以提供。样品适用于API 4000 TurboIon喷雾三联四级质谱仪(Applied Biosystems,Foster City,CA)。以负离子模式,对分析物进行离子喷雾和多重反应监控(MRM),每一分析需要60秒。
用野生型酶(SEQ ID NOS:209和210)转化的大肠杆菌被用作阳性活性对照组,用空载体(empty vector)转化的大肠杆菌被用作阴性活性对照组。或者用15N-(R)-1或者用15N-(S)-1通过质谱确定的野生型(WT)酶阳性对照组的%ee是相同的,从而证明了不存在显著的同位素影响。
温度(℃) | pH | 磷酸钠缓冲液浓度(mM) | %ee | 标准偏差 |
4 | 7 | 100 | 98.7 | 0.1% |
19 | 7 | 100 | 98.7 | 0.1% |
21 | 7 | 100 | 98.6 | 0.1% |
37 | 7 | 100 | 98.4 | 0.1% |
21 | 7 | 100 | 98.6 | 0.1% |
21 | 6 | 100 | 98.6 | 0.1% |
21 | 8 | 100 | 98.6 | 0.1% |
21 | 7 | 100 | 98.5 | 0.1% |
21 | 7 | 50 | 98.6 | 0.1% |
21 | 7 | 25 | 98.7 | 0.1% |
反应参数对具有A190H突变的SEQ ID NOS:209和210的作用
反应使用150mg/ml蛋白(~49mg/ml酶),在3M HGN浓度下进行。%ee是通过三份重复运行的GC分析确定的。
【000518】尽管本发明已经参考某些优选的方面进行了详细的描述,但应该理解到,修改和变化也在所描述和权利要求所要求的范围和精神内。
序列表
<110>戴弗萨公司
<120>腈水解酶、编码腈水解酶的核酸,以及制备和使用它们的方法
<130>CPUSZ42557
<140>PCT/US03/15712
<141>2003-05-15
<150>US 10/241,742
<151>2002-09-09
<150>US 10/146,772
<151>2002-05-15
<150>US 60/309,006
<151>2001-07-30
<150>US 60/351,336
<151>2002-01-22
<150>US 60/300,189
<151>2001-06-21
<150>US 09/751,299
<151>2000-12-28
<150>US 60/254,414
<151>2000-12-07
<150>US 60/173,609
<151>1999-12-29
<160>386
<170>FastSEQ for Windows Version 4.0
<210>1
<211>939
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>1
atggaaaagt atattaaagt cgccgcaatt cagatagcta caaaaatagc agattcaccc 60
gtgaatatag aaaattgcga acgtttggca ttatcggcgg tcaatgaggg tgcgcgttgg 120
attgctttgc cggagttctt caatacgggc gttagttgga acaaaaaaat tgccttggct 180
attcagacgc ctgacggcaa ggctgcgatg ttcttacgcg acttttctgc aagacatcat 240
gtattgatag gaggctcatt tctgtgcagg ttgccggatg gcagtgtgcg caaccgctat 300
atgtgttatg ccaacggcgc tctcgtgggc aaacatgaca aagacctacc cacgatgtgg 360
gaaaatgctt tttatgaagg tggggattcc agcgatattg gggtgctggg aacatttgaa 420
aatacgcgcg ttggtgcagc cgtctgttgg gagttcatgc ggacgatgac tgcccggcgt 480
cttcgcaatc aggtggatgt catcatgggt ggttcctgct ggtggagcat accgaccaat 540
ttccccggtt ttgtgcaaaa gctgtgggaa cctggaaata gccgcaacgc gcttgctgcc 600
atacaggata atgcgcgtct cattggcgtg ccggttgttc atgccgctca ttgcggtgaa 660
attgagtgtc cgatgccagg attgccgata ggttacaggg ggttctttga gggtaacgcg 720
gccattgtga atgcagaagg tcaggtgctt gcgcatcggg gtgctggcga gggcgaagga 780
attgtttgcg cggagatttt accggtagcc aaatcaaaca ggtcggaaat tcccaatcgt 840
tactggttgc gctgcagagg ctttctacct atttttgcct ggcatcagca acgttggttg 900
ggaaggcatt ggtatttgcg caatgtgcgc aggacttaa 939
<210>2
<211>312
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>2
Met Glu Lys Tyr Ile Lys Val Ala Ala Ile Gln Ile Ala Thr Lys Ile
1 5 10 15
Ala Asp Ser Pro Val Asn Ile Glu Asn Cys Glu Arg Leu Ala Leu Ser
20 25 30
Ala Val Asn Glu Gly Ala Arg Trp Ile Ala Leu Pro Glu Phe Phe Asn
35 40 45
Thr Gly Val Ser Trp Asn Lys Lys Ile Ala Leu Ala Ile Gln Thr Pro
50 55 60
Asp Gly Lys Ala Ala Met Phe Leu Arg Asp Phe Ser Ala Arg His His
65 70 75 80
Val Leu Ile Gly Gly Ser Phe Leu Cys Arg Leu Pro Asp Gly Ser Val
85 90 95
Arg Asn Arg Tyr Met Cys Tyr Ala Asn Gly Ala Leu Val Gly Lys His
100 105 110
Asp Lys Asp Leu Pro Thr Met Trp Glu Asn Ala Phe Tyr Glu Gly Gly
115 120 125
Asp Ser Ser Asp Ile Gly Val Leu Gly Thr Phe Glu Asn Thr Arg Val
130 135 140
Gly Ala Ala Val Cys Trp Glu Phe Met Arg Thr Met Thr Ala Arg Arg
145 150 155 160
Leu Arg Asn Gln Val Asp Val Ile Met Gly Gly Ser Cys Trp Trp Ser
165 170 175
Ile Pro Thr Asn Phe Pro Gly Phe Val Gln Lys Leu Trp Glu Pro Gly
180 185 190
Asn Ser Arg Asn Ala Leu Ala Ala Ile Gln Asp Asn Ala Arg Leu Ile
195 200 205
Gly Val Pro Val Val His Ala Ala His Cys Gly Glu Ile Glu Cys Pro
210 215 220
Met Pro Gly Leu Pro Ile Gly Tyr Arg Gly Phe Phe Glu Gly Asn Ala
225 230 235 240
Ala Ile Val Asn Ala Glu Gly Gln Val Leu Ala His Arg Gly Ala Gly
245 250 255
Glu Gly Glu Gly Ile Val Cys Ala Glu Ile Leu Pro Val Ala Lys Ser
260 265 270
Asn Arg Ser Glu Ile Pro Asn Arg Tyr Trp Leu Arg Cys Arg Gly Phe
275 280 285
Leu Pro Ile Phe Ala Trp His Gln Gln Arg Trp Leu Gly Arg His Trp
290 295 300
Tyr Leu Arg Asn Val Arg Arg Thr
305 310
<210>3
<21l>981
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>3
atggggaatt cgttcaagat cgcggtggta caagcctgtc cggtctttct ggatcgtggc 60
gcgacagtcg ccaaggcatg ccgcctgatc gcggaggcag ccgcggcggg cgccagcctg 120
gtggtctttc cggaggcgtt tgtgcccgga tacccactgt gggtctggtt cattccggca 180
gggcattcgc aaccactgcg ggagttatac gccgaactgg tggggaacgc cgtggcggta 240
ccgggcgatg ccaccgatcg gctttgcgcg gcagccagag aagccggcgt ggtagtggcg 300
atcggcatca atgaagtgaa cagcgaagcc agcggcacga cgatttacaa tacgctgctg 360
tacatcggag cggacggcgc gattctgggc aaacaccgca aagtaatgcc gacgggcgga 420
gagcgcctgg tctgggcgct tggcgatggg agcgacctgg aggtctacga cctgcctttc 480
ggccgattgg gtggcctgtt gtgctgggag aactacatgc ccctggcccg gtacgcgatg 540
tcggcatggg gaaccgagat ctacgtggct ccaacttggg atcgcggaga accgtggctg 600
tccacaatgc ggcatatcgc gaaagaaggg cgatgctacg tagtgggatg ctgcagttgc 660
atgaaaattg acgatgtacc cgaccggctg gcgttcaaag ggaagtatct gtcgacggcc 720
gagggctggc tcaaccccgg cgatagcgta atcgtcgatc cggacggcaa gctgatcgcg 780
ggcccggcaa gcgagcagga gacgattctg tatgccgatg ccgaccggtc taagatcacc 840
gggcccaggt ggcagttgga tgtggccggc cactacgcgc ggccggatat cttcgaactg 900
atcgtgcacc gcgaacctaa gcgatttttg acgatagctc cgcggacgaa ggaggagcgg 960
gagcctgggc cggaggcctg a 981
<210>4
<21l>326
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>4
Met Gly Asn Ser Phe Lys Ile Ala Val Val Gln Ala Cys Pro Val Phe
1 5 10 15
Leu Asp Arg Gly Ala Thr Val Ala Lys Ala Cys Arg Leu Ile Ala Glu
20 25 30
Ala Ala Ala Ala Gly Ala Ser Leu Val Val Phe Pro Glu Ala Phe Val
35 40 45
Pro Gly Tyr Pro Leu Trp Val Trp Phe Ile Pro Ala Gly His Ser Gln
50 55 60
Pro Leu Arg Glu Leu Tyr Ala Glu Leu Val Gly Asn Ala Val Ala Val
65 70 75 80
Pro Gly Asp Ala Thr Asp Arg Leu Cys Ala Ala Ala Arg Glu Ala Gly
85 90 95
Val Val Val Ala Ile Gly Ile Asn Glu Val Asn Ser Glu Ala Ser Gly
100 105 110
Thr Thr Ile Tyr Asn Thr Leu Leu Tyr Ile Gly Ala Asp Gly Ala Ile
115 120 125
Leu Gly Lys His Arg Lys Val Met Pro Thr Gly Gly Glu Arg Leu Val
130 135 140
Trp Ala Leu Gly Asp Gly Ser Asp Leu Glu Val Tyr Asp Leu Pro Phe
145 150 155 160
Gly Arg Leu Gly Gly Leu Leu Cys Trp Glu Asn Tyr Met Pro Leu Ala
165 170 175
Arg Tyr Ala Met Ser Ala Trp Gly Thr Glu Ile Tyr Val Ala Pro Thr
180 185 190
Trp Asp Arg Gly Glu Pro Trp Leu Ser Thr Met Arg His Ile Ala Lys
195 200 205
Glu Gly Arg Cys Tyr Val Val Gly Cys Cys Ser Cys Met Lys Ile Asp
210 215 220
Asp Val Pro Asp Arg Leu Ala Phe Lys Gly Lys Tyr Leu Ser Thr Ala
225 230 235 240
Glu Gly Trp Leu Asn Pro Gly Asp Ser Val Ile Val Asp Pro Asp Gly
245 250 255
Lys Leu Ile Ala Gly Pro Ala Ser Glu Gln Glu Thr Ile Leu Tyr Ala
260 265 270
Asp Ala Asp Arg Ser Lys Ile Thr Gly Pro Arg Trp Gln Leu Asp Val
275 280 285
Ala Gly His Tyr Ala Arg Pro Asp Ile Phe Glu Leu Ile Val His Arg
290 295 300
Glu Pro Lys Arg Phe Leu Thr Ile Ala Pro Arg Thr Lys Glu Glu Arg
305 310 315 320
Glu Pro Gly Pro Glu Ala
325
<210>5
<211>1005
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>5
atgggtacca agcacccggc cttcaaggcc gcagtggtcc aggccgcgcc ggaatggctc 60
gatctcgacc gcaccgtcga caagaccatc gcgctgatcg aggaggccgc cggcgccggc 120
gcgaagctca ttgcgttccc ggaaacctgg attcccggct atccgtggca catctgggtc 180
ggcacgccgg cgtgggcgat cagccgcggc ttcgtgcagc gctacttcga caattcactg 240
gcctacgaca gcccgcaggc ccagcgcatc gcggacgccg cgaagaagaa caagatcacc 300
gtggtgctcg gcctgtcgga gcgcgagggt ggcagccttt atatctcgca gtggctgatt 360
gggccggacg gcgagaccat tgccaagcgc cgcaaactgc gccccaccca cgtcgagcgc 420
accgtgttcg gcgatggcga cggcagccac atcgcggtgc acgagcgtgc tgacatcggc 480
cgcctcggcg cgctgtgctg ctgggagcac atccagccgc tgaccaaata cgccatgtat 540
gcccagaacg agcaggtgca cgtcgccgcc tggccgagct tctcgatgta cgagccgttc 600
gcccacgcgc tcggctggga agtcaacaat gcggcgagca agatctacgc cgtcgaaggc 660
tcgtgtttcg tgctcggcgc atgcgcggtg atctcgcagg cgatggtcga cgaaatgtgc 720
gacaccgagg acaagcgggc gctggtccat gccggcggcg gccacgcggt gatcttcggg 780
ccggacggca gatcgctggc ggacaagatt ccggagaccc aggaaggcct gctctatgcc 840
gacatcgacc tcggcgcaat tggcgtggcc aagaacgcgg ccgatccggc ggggcactac 900
tcgcgcccgg acgtgacgcg gctcctgttc aacaacaagc cggcgcgccg ggtcgagtat 960
ttctcgctgc cggtcgacgc ggtcgagacg ccgccgcagc cctga 1005
<210>6
<211>334
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>6
Met Gly Thr Lys His Pro Ala Phe Lys Ala Ala Val Val Gln Ala Ala
l 5 10 15
Pro Glu Trp Leu Asp Leu Asp Arg Thr Val Asp Lys Thr Ile Ala Leu
20 25 30
Ile Glu Glu Ala Ala Gly Ala Gly Ala Lys Leu Ile Ala Phe Pro Glu
35 40 45
Thr Trp Ile Pro Gly Tyr Pro Trp His Ile Trp Val Gly Thr Pro Ala
50 55 60
Trp Ala Ile Ser Arg Gly Phe Val Gln Arg Tyr Phe Asp Asn Ser Leu
65 70 75 80
Ala Tyr Asp Ser Pro Gln Ala Gln Arg Ile Ala Asp Ala Ala Lys Lys
85 90 95
Asn Lys Ile Thr Val Val Leu Gly Leu Ser Glu Arg Glu Gly Gly Ser
100 105 110
Leu Tyr Ile Ser Gln Trp Leu Ile Gly Pro Asp Gly Glu Thr Ile Ala
115 120 125
Lys Arg Arg Lys Leu Arg Pro Thr His Val Glu Arg Thr Val Phe Gly
130 135 140
Asp Gly Asp Gly Ser His Ile Ala Val His Glu Arg Ala Asp Ile Gly
145 150 155 160
Arg Leu Gly Ala Leu Cys Cys Trp Glu His Ile Gln Pro Leu Thr Lys
165 170 175
Tyr Ala Met Tyr Ala Gln Asn Glu Gln Val His Val Ala Ala Trp Pro
180 185 190
Ser Phe Ser Met Tyr Glu Pro Phe Ala His Ala Leu Gly Trp Glu Val
195 200 205
Asn Asn Ala Ala Ser Lys Ile Tyr Ala Val Glu Gly Ser Cys Phe Val
210 215 220
Leu Gly Ala Cys Ala Val Ile Ser Gln Ala Met Val Asp Glu Met Cys
225 230 235 240
Asp Thr Glu Asp Lys Arg Ala Leu Val His Ala Gly Gly Gly His Ala
245 250 255
Val Ile Phe Gly Pro Asp Gly Arg Ser Leu Ala Asp Lys Ile Pro Glu
260 265 270
Thr Gln Glu Gly Leu Leu Tyr Ala Asp Ile Asp Leu Gly Ala Ile Gly
275 280 285
Val Ala Lys Asn Ala Ala Asp Pro Ala Gly His Tyr Ser Arg Pro Asp
290 295 300
Val Thr Arg Leu Leu Phe Asn Asn Lys Pro Ala Arg Arg Val Glu Tyr
305 310 315 320
Phe Ser Leu Pro Val Asp Ala Val Glu Thr Pro Pro Gln Pro
325 330
<210>7
<211>999
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>7
atgccgagtg attatcatgc tccattcaaa gtagcagttg tccaggcaac tcccgtcttt 60
ctcgatcgca gcgcgacgat tgagaaggca tgtgagctaa ttgcctgtgc tggacgtgag 120
ggcgcacgtc tgatcgtgtt tcctgaagcg ttcattccca cctatcccga ttgggtctgg 180
accattccac ctggggagat gcggctgctt ggcgaactct acacagagtt gcttgccaat 240
gcggtcacga tccccagtaa tgcaacggat aggctctgcc aggctgcgaa acgagctgct 300
gcgtatgtgg tcatgggaat gaacgaacgc aatatcgagg cgagtggaag gagtctctat 360
aacaccctgt tatacatcga tgctcagggc cagatcatgg gcaaacaccg caagttgata 420
cccacagccg gtgagcggct catatgggcg caaggagatg ggagtacatt ccaggtctac 480
gatactcctc tgggcaaact gggagggctc atctgctggg aaaactacat gcctctggct 540
cgctatgcga tgtatgcctg gggcacgcag atttatgtcg ccccgacatg ggatcgtggc 600
aacctctggc tctctactct gcggcatatc gctaaggagg gaggcgtcta tgttcttggt 660
tgtagtatgg tcatgcgcaa gaatgacatt cccgatcact ttgctttcaa agagcagttt 720
tatgctactg tggacgaatg gatcaacgtt ggtgacagcg ccattgtcca tcccgagggg 780
aactttcttg cgggaccggt gcgccacaaa gaagagattc tctatgcaga acttgatcca 840
cgccaatcgt gcggtccggg atggatgctc gatgtggctg ggcactatgc acgccctgat 900
gtgtttgaat tgattgtcca cacagagatg cgacccatga tgaagcaaga agaggtagga 960
ggagaaaata catctgaggg aggtgtacga ttcttgtaa 999
<210>8
<211>332
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>8
Met Pro Ser Asp Tyr His Ala Pro Phe Lys Val Ala Val Val Gln Ala
1 5 10 15
Thr Pro Val Phe Leu Asp Arg Ser Ala Thr Ile Glu Lys Ala Cys Glu
20 25 30
Leu Ile Ala Cys Ala Gly Arg Glu Gly Ala Arg Leu Ile Val Phe Pro
35 40 45
Glu Ala Phe Ile Pro Thr Tyr Pro Asp Trp Val Trp Thr Ile Pro Pro
50 55 60
Gly Glu Met Arg Leu Leu Gly Glu Leu Tyr Thr Glu Leu Leu Ala Asn
65 70 75 80
Ala Val Thr Ile Pro Ser Asn Ala Thr Asp Arg Leu Cys Gln Ala Ala
85 90 95
Lys Arg Ala Ala Ala Tyr Val Val Met Gly Met Asn Glu Arg Asn Ile
100 105 110
Glu Ala Ser Gly Arg Ser Leu Tyr Asn Thr Leu Leu Tyr Ile Asp Ala
115 120 125
Gln Gly Gln Ile Met Gly Lys His Arg Lys Leu Ile Pro Thr Ala Gly
130 135 140
Glu Arg Leu Ile Trp Ala Gln Gly Asp Gly Ser Thr Phe Gln Val Tyr
145 150 155 160
Asp Thr Pro Leu Gly Lys Leu Gly Gly Leu Ile Cys Trp Glu Asn Tyr
165 170 175
Met Pro Leu Ala Arg Tyr Ala Met Tyr Ala Trp Gly Thr Gln Ile Tyr
180 185 190
Val Ala Pro Thr Trp Asp Arg Gly Asn Leu Trp Leu Ser Thr Leu Arg
195 200 205
His Ile Ala Lys Glu Gly Gly Val Tyr Val Leu Gly Cys Ser Met Val
210 215 220
Met Arg Lys Asn Asp Ile Pro Asp His Phe Ala Phe Lys Glu Gln Phe
225 230 235 240
Tyr Ala Thr Val Asp Glu Trp Ile Asn Val Gly Asp Ser Ala Ile Val
245 250 255
His Pro Glu Gly Asn Phe Leu Ala Gly Pro Val Arg His Lys Glu Glu
260 265 270
Ile Leu Tyr Ala Glu Leu Asp Pro Arg Gln Ser Cys Gly Pro Gly Trp
275 280 285
Met Leu Asp Val Ala Gly His Tyr Ala Arg Pro Asp Val Phe Glu Leu
290 295 300
Ile Val His Thr Glu Met Arg Pro Met Met Lys Gln Glu Glu Val Gly
305 310 315 320
Gly Glu Asn Thr Ser Glu Gly Gly Val Arg Phe Leu
325 330
<210>9
<211>945
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>9
atggcggctc acaagatcgc ggtggttcag gcgcccagcg ttctcctcga tcgcgagggc 60
tcggtcgcgc gcgcggtcac gctgctcgac gaggcggcgg cggccggcgc ccgcctggtc 120
gtgtttccgg aggcctacat ccccggctac ccggactgga tctggcgcct gcgcccctac 180
ccggacgtca agctggccgc cgagctgcac gaacggctgc tcgccaacgc ggtggatctc 240
tccaccgacg tgctggcgcc ggtgctggcg gcggcggcgc gtcacgggct caccgtggtc 300
atgtgcgtgc aggagcgcga cgccggattc agccgcgcca cactttacaa caccgcgctg 360
gtcatcgacg ccgccggcaa gatcgcgaac cggcaccgca agctcatgcc caccaacccc 420
gagcgaatgg tgtggggatt cggtgacgcc tcggggctgc gggtggtgag cacgcccgtc 480
gggcgggtgg gcacgctcct gtgctgggag agctacatgc ccctggcgcg ctgcgcgctc 540
tacgccgagg gggtcgagat ctacgtgacc ccgacctggg actacggcga aggctggcgc 600
gccagcatgc agcacatcgc ccgcgagggg cgctgctggg tggtgaccgc ttgcatgtgc 660
gtgcaggcgc gcgacgtgcc ggccgacttc cccgggcgcg cccagctcta ccccgacgag 720
gaggagtggt tgaaccccgg cgattcgctg gtggtcgacc ccggcggcaa gatcgtggcc 780
ggtccgatgt cgcgcgagaa ggggatcttg tacgcggaga tcgatccgga tcgcgtggcg 840
ggggcgcacc gctcgttcga cgtcgtgggc cactactcgc gtcccgacgt gttccggctg 900
gaggtcgatc ggacaccggc ggcgccggtg agcttcaaaa aatga 945
<210>10
<211>314
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>10
Met Ala Ala His Lys Ile Ala Val Val Gln Ala Pro Ser Val Leu Leu
l 5 10 15
Asp Arg Glu Gly Ser Val Ala Arg Ala Val Thr Leu Leu Asp Glu Ala
20 25 30
Ala Ala Ala Gly Ala Arg Leu Val Val Phe Pro Glu Ala Tyr Ile Pro
35 40 45
Gly Tyr Pro Asp Trp Ile Trp Arg Leu Arg Pro Tyr Pro Asp Val Lys
50 55 60
Leu Ala Ala Glu Leu His Glu Arg Leu Leu Ala Asn Ala Val Asp Leu
65 70 75 80
Ser Thr Asp Val Leu Ala Pro Val Leu Ala Ala Ala Ala Arg His Gly
85 90 95
Leu Thr Val Val Met Cys Val Gln Glu Arg Asp Ala Gly Phe Ser Arg
100 105 110
Ala Thr Leu Tyr Asn Thr Ala Leu Val Ile Asp Ala Ala Gly Lys Ile
115 120 125
Ala Asn Arg His Arg Lys Leu Met Pro Thr Asn Pro Glu Arg Met Val
130 135 140
Trp Gly Phe Gly Asp Ala Ser Gly Leu Arg Val Val Ser Thr Pro Val
145 150 155 160
Gly Arg Val Gly Thr Leu Leu Cys Trp Glu Ser Tyr Met Pro Leu Ala
165 170 175
Arg Cys Ala Leu Tyr Ala Glu Gly Val Glu Ile Tyr Val Thr Pro Thr
180 185 190
Trp Asp Tyr Gly Glu Gly Trp Arg Ala Ser Met Gln His Ile Ala Arg
195 200 205
Glu Gly Arg Cys Trp Val Val Thr Ala Cys Met Cys Val Gln Ala Arg
210 215 220
Asp Val Pro Ala Asp Phe Pro Gly Arg Ala Gln Leu Tyr Pro Asp Glu
225 230 235 240
Glu Glu Trp Leu Asn Pro Gly Asp Ser Leu Val Val Asp Pro Gly Gly
245 250 255
Lys Ile Val Ala Gly Pro Met Ser Arg Glu Lys Gly Ile Leu Tyr Ala
260 265 270
Glu Ile Asp Pro Asp Arg Val Ala Gly Ala His Arg Ser Phe Asp Val
275 280 285
Val Gly His Tyr Ser Arg Pro Asp Val Phe Arg Leu Glu Val Asp Arg
290 295 300
Thr Pro Ala Ala Pro Val Ser Phe Lys Lys
305 310
<210>11
<211>966
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>11
atgactggat cttatcctaa agacacactg atcgttgggc tagctcaaat cgctcctgtc 60
tggctggatc gggcggggac actgtcaaag atactggctc aagtccatgc ggcaaatcaa 120
gcgggttgtc atctcgtagc atttggcgaa gggctgcttc ctggatatcc gttttggatt 180
gagcgaacaa atggcgcgct gttcaactcg actgtacaaa aggaaatcca cgcgcattat 240
atggatcagg cggtgcagat cgaagccggt catctcgatc cgctttgtgc aacagccaag 300
aaatttggaa tcaccgttgt actcggatgc atcgaacgcc cactcgatcg gggcggtcac 360
agcttgtatg caagtctggt atatattgat tccgagggca gcattcaatc cgtgcatcgc 420
aaactaatgc caacctacga agaacgactt acctggtcgt caggcgatgg gcacggttta 480
cgagtgcata ccttaggtgc gtttacggtg ggtggtctca actgttggga aaattggatg 540
cccttggcgc gcgcagcgat gtatggtcag ggtgaagatt tacatgttgc gatctggcca 600
ggcggttctc atctcacgca ggatattacc cgctttattg cgctcgaatc acgttcgtac 660
gtattatctg tctccggtct gatgcgcgca accgattttc caaaagatac tccccatctt 720
gcctccatcc tagctaaagg tgaagagatt cttgcgaatg gtggttcttg tattgcaggt 780
cctgacggca agtgggtcgt tgggcctctt gtaggagaag agaagttaat tgtcgcaacc 840
attgatcact gccgcgtgcg cgaagaacgt cagaatttcg atccttccgg gcattacagc 900
cggcccgatg tactgcaatt aaaaatcaac agggaacgcc agagcacaat ttcatttagc 960
gagtag 966
<210>12
<211>321
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>12
Met Thr Gly Ser Tyr Pro Lys Asp Thr Leu Ile Val Gly Leu Ala Gln
1 5 10 15
Ile Ala Pro Val Trp Leu Asp Arg Ala Gly Thr Leu Ser Lys Ile Leu
20 25 30
Ala Gln Val His Ala Ala Asn Gln Ala Gly Cys His Leu Val Ala Phe
35 40 45
Gly Glu Gly Leu Leu Pro Gly Tyr Pro Phe Trp Ile Glu Arg Thr Asn
50 55 60
Gly Ala Leu Phe Asn Ser Thr Val Gln Lys Glu Ile His Ala His Tyr
65 70 75 80
Met Asp Gln Ala Val Gln Ile Glu Ala Gly His Leu Asp Pro Leu Cys
85 90 95
Ala Thr Ala Lys Lys Phe Gly Ile Thr Val Val Leu Gly Cys Ile Glu
100 105 110
Arg Pro Leu Asp Arg Gly Gly His Ser Leu Tyr Ala Ser Leu Val Tyr
115 120 125
Ile Asp Ser Glu Gly Ser Ile Gln Ser Val His Arg Lys Leu Met Pro
130 135 140
Thr Tyr Glu Glu Arg Leu Thr Trp Ser Ser Gly Asp Gly His Gly Leu
145 150 155 160
Arg Val His Thr Leu Gly Ala Phe Thr Val Gly Gly Leu Asn Cys Trp
165 170 175
Glu Asn Trp Met Pro Leu Ala Arg Ala Ala Met Tyr Gly Gln Gly Glu
180 185 190
Asp Leu His Val Ala Ile Trp Pro Gly Gly Ser His Leu Thr Gln Asp
195 200 205
Ile Thr Arg Phe Ile Ala Leu Glu Ser Arg Ser Tyr Val Leu Ser Val
210 215 220
Ser Gly Leu Met Arg Ala Thr Asp Phe Pro Lys Asp Thr Pro His Leu
225 230 235 240
Ala Ser Ile Leu Ala Lys Gly Glu Glu Ile Leu Ala Asn Gly Gly Ser
245 250 255
Cys Ile Ala Gly Pro Asp Gly Lys Trp Val Val Gly Pro Leu Val Gly
260 265 270
Glu Glu Lys Leu Ile Val Ala Thr Ile Asp His Cys Arg Val Arg Glu
275 280 285
Glu Arg Gln Asn Phe Asp Pro Ser Gly His Tyr Ser Arg Pro Asp Val
290 295 300
Leu Gln Leu Lys Ile Asn Arg Glu Arg Gln Ser Thr Ile Ser Phe Ser
305 310 315 320
Glu
<210>13
<211>1014
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>13
atgggcattc aacatccgaa atataaggtt gcggtggtgc aggcggcgcc ggcctggctc 60
gatctcgatg cgtcgatcgc caaatcgatc gcgttgatcg aggaggcggc tgccaatggc 120
gccaagctga tcgccttccc ggaggcgttc atccctggct atccctggta tatctggctg 180
gactcgccgg cctgggcgat cggccgcggt tttgtgcagc gctatttcga caactcgctg 240
gcctatgaca gcccgcaggc cgagaagctg cggctggcgg tgaagaaggc cggcctcacc 300
gccgtgatcg gcctctccga gcgcgagggc ggcagccttt atctcgcgca atggctgatc 360
gggcccgatg gcgagaccat cgcaaagcgc cgcaagctgc ggccgaccca tgccgagcgc 420
accgtctatg gcgaaggcga tggcagcgat ctcgcggtgc atgaccgccc cggcatcggc 480
cggctcggcg cgctgtgctg ctgggagcat ctgcagccgc tgtcgaaata cgcgatgtat 540
gcccagaacg agcaggttca tgtcgcggcc tggccgagct tctcgctcta cgacccgttc 600
gcgccggcgc tcggctggga ggtcaacaat gcggcctcac gcgtctatgc ggtggaaggc 660
tcgtgcttcg tgctggcgcc ctgcgcgacg gtgtcgaagg cgatgatcga cgagctctgc 720
gaccgcgacg acaagcacgg gctgctgcat gtcggcgggg gacacgccgc gatctatggg 780
ccggacggct cttcgattgc ggagaaattg ccgccggagc aggagggcct gctctatgcc 840
gacatcgatc tcggcgccat cgggattgcc aagaacgccg ccgatccggc cggacattac 900
tcgcggcccg acgtgacgcg gctgttgctc aacaagaagc cgtcgaagcg tgtcgagcat 960
ttttcgctgc cggtcgacaa tgtcgagccg gagatcgacg ccgccgccag ctga 1014
<210>14
<211>337
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>14
Met Gly Ile Gln His Pro Lys Tyr Lys Val Ala Val Val Gln Ala Ala
1 5 l0 15
Pro Ala Trp Leu Asp Leu Asp Ala Ser Ile Ala Lys Ser Ile Ala Leu
20 25 30
Ile Glu Glu Ala Ala Ala Asn Gly Ala Lys Leu Ile Ala Phe Pro Glu
35 40 45
Ala Phe Ile Pro Gly Tyr Pro Trp Tyr Ile Trp Leu Asp Ser Pro Ala
50 55 60
Trp Ala Ile Gly Arg Gly Phe Val Gln Arg Tyr Phe Asp Asn Ser Leu
65 70 75 80
Ala Tyr Asp Ser Pro Gln Ala Glu Lys Leu Arg Leu Ala Val Lys Lys
85 90 95
Ala Gly Leu Thr Ala Val Ile Gly Leu Ser Glu Arg Glu Gly Gly Ser
100 105 110
Leu Tyr Leu Ala Gln Trp Leu Ile Gly Pro Asp Gly Glu Thr Ile Ala
115 120 125
Lys Arg Arg Lys Leu Arg Pro Thr His Ala Glu Arg Thr Val Tyr Gly
130 135 140
Glu Gly Asp Gly Ser Asp Leu Ala Val His Asp Arg Pro Gly Ile Gly
145 150 155 160
Arg Leu Gly Ala Leu Cys Cys Trp Glu His Leu Gln Pro Leu Ser Lys
165 170 175
Tyr Ala Met Tyr Ala Gln Asn Glu Gln Val His Val Ala Ala Trp Pro
180 185 190
Ser Phe Ser Leu Tyr Asp Pro Phe Ala Pro Ala Leu Gly Trp Glu Val
195 200 205
Asn Asn Ala Ala Ser Arg Val Tyr Ala Val Glu Gly Ser Cys Phe Val
210 215 220
Leu Ala Pro Cys Ala Thr Val Ser Lys Ala Met Ile Asp Glu Leu Cys
225 230 235 240
Asp Arg Asp Asp Lys His Gly Leu Leu His Val Gly Gly Gly His Ala
245 250 255
Ala Ile Tyr Gly Pro Asp Gly Ser Ser Ile Ala Glu Lys Leu Pro Pro
260 265 270
Glu Gln Glu Gly Leu Leu Tyr Ala Asp Ile Asp Leu Gly Ala Ile Gly
275 280 285
Ile Ala Lys Asn Ala Ala Asp Pro Ala Gly His Tyr Ser Arg Pro Asp
290 295 300
Val Thr Arg Leu Leu Leu Asn Lys Lys Pro Ser Lys Arg Val Glu His
305 310 315 320
Phe Ser Leu Pro Val Asp Asn Val Glu Pro Glu Ile Asp Ala Ala Ala
325 330 335
Ser
<210>15
<211>1047
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>15
atgccaacat caaaacaatt tagagtcgct gcagttcaag ccgccccggt atttcttgac 60
ctggagggcg caataagcaa aggcatctcc ctcattgagg aggccgcttc caatggagcc 120
aagctcattg ccttcccgga aacgtggatt cccggctacc cctggtggat ctggctggac 180
tcacccgctt ggggcatgcg ctttgtccag cgctattttg acaactcgct catgctgggt 240
agtgagcaag ccaagcgcat gaaccaggct gccgccaata acaagattta cgtggtgatg 300
ggttatagcg aacgcagtgg cggcagcctc tacatgggcc aatccattat caacgacaag 360
ggtgaaacga tttttacccg ccgcaaactc aagccaactc atgtcgagcg taccgtgttt 420
ggggagggag acggcagcca tctttgcgta atggataccg agattggccg cgtcggcgcg 480
atgtgctgtt gggaacattt gcagccgctc agcaaatatg caatgtattc tcaggatgaa 540
caaattcaca ttgcctcctg gccgagcttt tcgttatatc ggggggcagc ctatgcactc 600
ggccccgaac tgaacaacgc cgccagccaa atgtatgcag ccgaaggcca gtgctttgtc 660
cttgcccctt gcgccaccgt ctcaaaggag atgatcgaaa tgctgataga tgatcccagg 720
aaagagccgc ttctgctgga aggtggcggg ttcaccatga tttacggccc cgatgggcga 780
cctctggcta aaccgttgcc tgaaaacgag gaagggctgc tatatgccga tattgacctg 840
ggcatgattt caatggccaa ggctgccgcc gacccggcag gtcactacgc acgcccggat 900
gtcactcgcc tactattcaa ttccgcgccc gccaatcgcg tcgagtatat caacccagcg 960
tcaggcccaa ccgaatcctt aaaagatatg ggaaagatgc aaatggaggc cgaacagcaa 1020
aaggcggccc tgcgagagat gatctaa 1047
<210>16
<211>348
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>16
Met Pro Thr Ser Lys Gln Phe Arg Val Ala Ala Val Gln Ala Ala Pro
1 5 10 15
Val Phe Leu Asp Leu Glu Gly Ala Ile Ser Lys Gly Ile Ser Leu Ile
20 25 30
Glu Glu Ala Ala Ser Asn Gly Ala Lys Leu Ile Ala Phe Pro Glu Thr
35 40 45
Trp Ile Pro Gly Tyr Pro Trp Trp Ile Trp Leu Asp Ser Pro Ala Trp
50 55 60
Gly Met Arg Phe Val Gln Arg Tyr Phe Asp Asn Ser Leu Met Leu Gly
65 70 75 80
Ser Glu Gln Ala Lys Arg Met Asn Gln Ala Ala Ala Asn Asn Lys Ile
85 90 95
Tyr Val Val Met Gly Tyr Ser Glu Arg Ser Gly Gly Ser Leu Tyr Met
100 105 110
Gly Gln Ser Ile Ile Asn Asp Lys Gly Glu Thr Ile Phe Thr Arg Arg
115 120 125
Lys Leu Lys Pro Thr His Val Glu Arg Thr Val Phe Gly Glu Gly Asp
130 135 140
Gly Ser His Leu Cys Val Met Asp Thr Glu Ile Gly Arg Val Gly Ala
145 150 155 160
Met Cys Cys Trp Glu His Leu Gln Pro Leu Ser Lys Tyr Ala Met Tyr
165 170 175
Ser Gln Asp Glu Gln Ile His Ile Ala Ser Trp Pro Ser Phe Set Leu
180 185 190
Tyr Arg Gly Ala Ala Tyr Ala Leu Gly Pro Glu Leu Asn Asn Ala Ala
195 200 205
Ser Gln Met Tyr Ala Ala Glu Gly Gln Cys Phe Val Leu Ala Pro Cys
210 215 220
Ala Thr Val Ser Lys Glu Met Ile Glu Met Leu Ile Asp Asp Pro Arg
225 230 235 240
Lys Glu Pro Leu Leu Leu Glu Gly Gly Gly Phe Thr Met Ile Tyr Gly
245 250 255
Pro Asp Gly Arg Pro Leu Ala Lys Pro Leu Pro Glu Ash Glu Glu Gly
260 265 270
Leu Leu Tyr Ala Asp Ile Asp Leu Gly Met Ile Ser Met Ala Lys Ala
275 280 285
Ala Ala Asp Pro Ala Gly His Tyr Ala Arg Pro Asp Val Thr Arg Leu
290 295 300
Leu Phe Asn Ser Ala Pro Ala Asn Arg Val Glu Tyr Ile Asn Pro Ala
305 310 315 320
Ser Gly Pro Thr Glu Ser Leu Lys Asp Met Gly Lys Met Gln Met Glu
325 330 335
Ala Glu Gln Gln Lys Ala Ala Leu Arg Glu Met Ile
340 345
<210>17
<211>993
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>17
atgagagttg ttaaagccgc agctgtccaa ctgagtcccg tcctctatag ccgcgaggga 60
acggtcgaga aggtcgtgcg gaagatccat gaacttgccg aagagggagt cgagttcgtc 120
acctttcctg agaccgtggt gccttattac ccgtactttt cgttcgttca gacgcccttg 180
cagcaaatct tcggaacaga gtatctgagg ctgctcgacc aggcagtcac cgtgccatcc 240
gccgccaccg acgcgatcgg cgaggctgcc aggttcgctg gagttgttgt ctcgatcggc 300
gtcaacgagc gagacggggg aactctgtac aacactcagc ttctcttcga tgccgacgga 360
agcttaattc agcggcgccg caagatcacg cccacccatt acgagcgcat gatctggggc 420
cagggtgacg gctcaggtct gcgggccgtt gatagcaagg ccggccgcat tggtcagctg 480
gcatgctggg agcacaacaa tccactggcg cgctacgcgc tgatagccga cggcgagcag 540
atccattcgg ccatgtatcc gggctccatg ttcggcgact cgtttgccaa aaagaccgaa 600
atcaatatcc ggcagcatgc gctggagtct gcgtgcttcg tcgtgaacgc aacggcctgg 660
ctggacggcg atcaacaggc gcaaatcatg aaggacaccg gctgcagcat cggcccgatc 720
tccggcggtt gcttcaccac tatcgtggcg ccggacggtt ccctgatcgg cgagcccctc 780
cgctcgggtg agggcgtggt catcgccgac ctcgacttca cgttaatcga caggcgtaag 840
caggtgatgg actcgcgagg ccactacagc cggccggagt tgctcagcct cttaatagac 900
cgcaccccta ccgcgcactt tcacgaacgc gcttcgcacc ccacgacagg agctgagcaa 960
ggctccgagg atgtgttcga ggctaacatt taa 993
<210>18
<211>330
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>18
Met Arg Val Val Lys Ala Ala Ala Val Gln Leu Ser Pro Val Leu Tyr
1 5 10 15
Ser Arg Glu Gly Thr Val Glu Lys Val Val Arg Lys Ile His Glu Leu
20 25 30
Ala Glu Glu Gly Val Glu Phe Val Thr Phe Pro Glu Thr Val Val Pro
35 40 45
Tyr Tyr Pro Tyr Phe Ser Phe Val Gln Thr Pro Leu Gln Gln Ile Phe
50 55 60
Gly Thr Glu Tyr Leu Arg Leu Leu Asp Gln Ala Val Thr Val Pro Ser
65 70 75 80
Ala Ala Thr Asp Ala Ile Gly Glu Ala Ala Arg Phe Ala Gly Val Val
85 90 95
Val Ser Ile Gly Val Asn Glu Arg Asp Gly Gly Thr Leu Tyr Asn Thr
100 105 110
Gln Leu Leu Phe Asp Ala Asp Gly Ser Leu Ile Gln Arg Arg Arg Lys
115 120 125
Ile Thr Pro Thr His Tyr Glu Arg Met Ile Trp Gly Gln Gly Asp Gly
130 135 140
Ser Gly Leu Arg Ala Val Asp Ser Lys Ala Gly Arg Ile Gly Gln Leu
145 150 155 160
Ala Cys Trp Glu His Asn Asn Pro Leu Ala Arg Tyr Ala Leu Ile Ala
165 170 175
Asp Gly Glu Gln Ile His Ser Ala Met Tyr Pro Gly Ser Met Phe Gly
180 185 190
Asp Ser Phe Ala Lys Lys Thr Glu Ile Asn Ile Arg Gln His Ala Leu
195 200 205
Glu Ser Ala Cys Phe Val Val Asn Ala Thr Ala Trp Leu Asp Gly Asp
210 215 220
Gln Gln Ala Gln Ile Met Lys Asp Thr Gly Cys Ser Ile Gly Pro Ile
225 230 235 240
Ser Gly Gly Cys Phe Thr Thr Ile Val Ala Pro Asp Gly Ser Leu Ile
245 250 255
Gly Glu Pro Leu Arg Ser Gly Glu Gly Val Val Ile Ala Asp Leu Asp
260 265 270
Phe Thr Leu Ile Asp Arg Arg Lys Gln Val Met Asp Ser Arg Gly His
275 280 285
Tyr Ser Arg Pro Glu Leu Leu Ser Leu Leu Ile Asp Arg Thr Pro Thr
290 295 300
Ala His Phe His Glu Arg Ala Ser His Pro Thr Thr Gly Ala Glu Gln
305 310 315 320
Gly Ser Glu Asp Val Phe Glu Ala Asn Ile
325 330
<210>19
<211>1050
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>19
atggaaagca acttccttgc cgcagcagtg caagcagaac cggtttactt caatgctttt 60
cagacggccg aaaaggccgc gtcattgatt gacgatgccg gtcggcaggg ggctcgctta 120
gtgacatttc ccgaaacgtg gctgcccggt tacccgtact ggatctggct tggtgccccc 180
gcctggggaa tgcatcattt catcctaaag taccatcaaa actcgccggt tgcaggagga 240
ccagaggaac agatcctttg tcaggcggcc cgccgcaacg ggatttttgt cgtcatggga 300
ctcagcgaga aaatcggggc aagcctctac atggcgcagt ggttcatcag tccagacggc 360
aaagtggtcg ctcgccgacg caaattgaag cctactcacg tcgaacgttc ggtcttcggg 420
gaaggggatg gttccgacat tgtcgttctt gatacacccc ttggaaaggt cgggggcctt 480
tgctgctggg agcacatgca gccactttcg aagtacgcca tgtactcgca aggcgagcag 540
atccatgctg cttcttggcc gagtgttagc gtctatcgcg ataaaattta cgttctgggg 600
ccggagctga acggtgccgc caatcagatg tatgcggcag aaggtcagtg tttcgtcctg 660
gcatcctggg caacggtttc acaagcggct atcgatcttt tttgcgacac gcccgacaag 720
gccgcgctca tgaaaattgg tggtggtttt tcccagatct atgggccaga cgggtgcccc 780
ctggcgaagc cgttgccgga ggacgtcgaa ggattggtga ccgctgagat tgacttcaat 840
gccatcacgc gcgtgaaagc agcggcggac cccgtagggc actatagccg gcccgatgta 900
ttccgcctgt tgttcaatcg tacgcgccaa gaacgcgtgg tttctgtcaa cacgtttgtg 960
ccaggtgtca cccagcgaac cgccaagaat gggtcggcgg acgaattggt cggtcacccg 1020
gagaacgctg tcgcccgggc tgcagagtaa 1050
<210>20
<211>349
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>20
Met Glu Ser Asn Phe Leu Ala Ala Ala Val Gln Ala Glu Pro Val Tyr
1 5 10 15
Phe Asn Ala Phe Gln Thr Ala Glu Lys Ala Ala Ser Leu Ile Asp Asp
20 25 30
Ala Gly Arg Gln Gly Ala Arg Leu Val Thr Phe Pro Glu Thr Trp Leu
35 40 45
Pro Gly Tyr Pro Tyr Trp Ile Trp Leu Gly Ala Pro Ala Trp Gly Met
50 55 60
His His Phe Ile Leu Lys Tyr His Gln Asn Ser Pro Val Ala Gly Gly
65 70 75 80
Pro Glu Glu Gln Ile Leu Cys Gln Ala Ala Arg Arg Ash Gly Ile Phe
85 90 95
Val Val Met Gly Leu Ser Glu Lys Ile Gly Ala Ser Leu Tyr Met Ala
100 105 110
Gln Trp Phe Ile Ser Pro Asp Gly Lys Val Val Ala Arg Arg Arg Lys
115 120 125
Leu Lys Pro Thr His Val Glu Arg Ser Val Phe Gly Glu Gly Asp Gly
130 135 140
Ser Asp Ile Val Val Leu Asp Thr Pro Leu Gly Lys Val Gly Gly Leu
145 150 155 160
Cys Cys Trp Glu His Met Gln Pro Leu Ser Lys Tyr Ala Met Tyr Ser
165 170 175
Gln Gly Glu Gln Ile His Ala Ala Ser Trp Pro Ser Val Ser Val Tyr
180 185 190
Arg Asp Lys Ile Tyr Val Leu Gly Pro Glu Leu Asn Gly Ala Ala Asn
195 200 205
Gln Met Tyr Ala Ala Glu Gly Gln Cys Phe Val Leu Ala Ser Trp Ala
210 215 220
Thr Val Ser Gln Ala Ala Ile Asp Leu Phe Cys Asp Thr Pro Asp Lys
225 230 235 240
Ala Ala Leu Met Lys Ile Gly Gly Gly Phe Ser Gln Ile Tyr Gly Pro
245 250 255
Asp Gly Cys Pro Leu Ala Lys Pro Leu Pro Glu Asp Val Glu Gly Leu
260 265 270
Val Thr Ala Glu Ile Asp Phe Asn Ala Ile Thr Arg Val Lys Ala Ala
275 280 285
Ala Asp Pro Val Gly His Tyr Ser Arg Pro Asp Val Phe Arg Leu Leu
290 295 300
Phe Asn Arg Thr Arg Gln Glu Arg Val Val Ser Val Asn Thr Phe Val
305 310 315 320
Pro Gly Val Thr Gln Arg Thr Ala Lys Asn Gly Ser Ala Asp Glu Leu
325 330 335
Val Gly His Pro Glu Asn Ala Val Ala Arg Ala Ala Glu
340 345
<210>21
<211>1065
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>21
atggcactag aacatccgaa gtacgtggcg gccgtggttc aggccgcgcc cgaattcctg 60
aatctagaca gagggatcga aaagacgatc gcattgatcg acgaagcggg acagaaaggg 120
gcggccctga ttgcatttcc ggaaacctgg ctgccgggct atccgtttca tgtctggctc 180
ggtcctcccg catgggcgct tggctcagga ttcgtccagc gctatttcga caactcgatg 240
acgtacgata gtcctcaggc cgctgcactg agggacgctg ccgcgcgcaa cgggatcacg 300
gtggtattgg gcttgtcgga gcgatgcggc ggcagcctct atatcgcgca atggatcatc 360
ggcccggatg gcgcgacggt cgccacgcgc cgcaaattgc ggccgactca tatcgagcgc 420
accgttttcg gcgatggcga cggcagcgat ctggcagtac acgatctcaa catcggccgc 480
cttggcgcac tgtgctgctg ggagcacatt cagccgctga ccaagtacgc gatgtatgcg 540
cagcacgaac aggtgcacgt cgcggcctgg ccgagcttct ccatgtatga attcgcgccc 600
gcgctcggtc acgaggtgaa caacgcagtc agccgcgtct atgccgttga gggatcgtgc 660
ttcgtgctcg cgccgtgcgc ggtcatcagc gagcaaatgg tcgacatgtt gtgcgacacg 720
gcagacaagc gcgcgatgat acgtgccggc ggcgggcacg cagtggcgtt cgggccggac 780
ggcgaagctc tggtcgagaa actgccggaa aatgaggaag gcttgctgct ggtcgatatc 840
gatctcggtc gcatctcgct tgcgaaggct gcggccgacc ccgtcggtca ctacgcgcgc 900
cccgatgtct tgcggctctg gttcgacaag caaccgcggc ggtgcgtcga acatgccggc 960
gagaacgacg cgtcgcgcag gtcgcacggg tcgtccgggt cacaatcgcc ggcgcaggat 1020
gggccggcga acgacatggt agaccgtcag gaaaacgtcg attga 1065
<210>22
<211>354
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>22
Met Ala Leu Glu His Pro Lys Tyr Val Ala Ala Val Val Gln Ala Ala
1 5 10 15
Pro Glu Phe Leu Asn Leu Asp Arg Gly Ile Glu Lys Thr Ile Ala Leu
20 25 30
Ile Asp Glu Ala Gly Gln Lys Gly Ala Ala Leu Ile Ala Phe Pro Glu
35 40 45
Thr Trp Leu Pro Gly Tyr Pro Phe His Val Trp Leu Gly Pro Pro Ala
50 55 60
Trp Ala Leu Gly Ser Gly Phe Val Gln Arg Tyr Phe Asp Asn Ser Met
65 70 75 80
Thr Tyr Asp Ser Pro Gln Ala Ala Ala Leu Arg Asp Ala Ala Ala Arg
85 90 95
Asn Gly Ile Thr Val Val Leu Gly Leu Ser Glu Arg Cys Gly Gly Ser
100 105 110
Leu Tyr Ile Ala Gln Trp Ile Ile Gly Pro Asp Gly Ala Thr Val Ala
115 120 125
Thr Arg Arg Lys Leu Arg Pro Thr His Ile Glu Arg Thr Val Phe Gly
130 135 140
Asp Gly Asp Gly Ser Asp Leu Ala Val His Asp Leu Asn Ile Gly Arg
145 150 155 160
Leu Gly Ala Leu Cys Cys Trp Glu His Ile Gln Pro Leu Thr Lys Tyr
165 170 175
Ala Met Tyr Ala Gln His Glu Gln Val His Val Ala Ala Trp Pro Ser
180 185 190
Phe Ser Met Tyr Glu Phe Ala Pro Ala Leu Gly His Glu Val Asn Asn
195 200 205
Ala Val Ser Arg Val Tyr Ala Val Glu Gly Ser Cys Phe Val Leu Ala
210 215 220
Pro Cys Ala Val Ile Ser Glu Gln Met Val Asp Met Leu Cys Asp Thr
225 230 235 240
Ala Asp Lys Arg Ala Met Ile Arg Ala Gly Gly Gly His Ala Val Ala
245 250 255
Phe Gly Pro Asp Gly Glu Ala Leu Val Glu Lys Leu Pro Glu Asn Glu
260 265 270
Glu Gly Leu Leu Leu Val Asp Ile Asp Leu Gly Arg Ile Ser Leu Ala
275 280 285
Lys Ala Ala Ala Asp Pro Val Gly His Tyr Ala Arg Pro Asp Val Leu
290 295 300
Arg Leu Trp Phe Asp Lys Gln Pro Arg Arg Cys Val Glu His Ala Gly
305 310 315 320
Glu Asn Asp Ala Ser Arg Arg Ser His Gly Ser Ser Gly Ser Gln Ser
325 330 335
Pro Ala Gln Asp Gly Pro Ala Asn Asp Met Val Asp Arg Gln Glu Asn
340 345 350
Val Asp
<210>23
<211>1005
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>23
atgaccgtta tcaaagcagc cgccattcaa atcagccccg tgctctacag ccgggcgggg 60
acagtcgaga aagttgttag gaaggttaga gagctcgggg ccaaaggtgt ccgattcgct 120
acctttcccg aaaccatcat accgtactac ccgtacttct cgttcgttca gtcggcgttc 180
gacatgaagc ttgggagtga acatcagcgg ctgctcgacg aatcagtcac aattccttcg 240
tccgagacgg acgcgatcgc ccaggccgcc aaggaagcgg gcatggtggt gtccgtcggg 300
gtcaatgagc gcgatgggcg atccatctac aacactcaac ttctgttcga cgctgatggc 360
acgctcattc agcgtaggcg aaagatcacc ccgacctatc acgagcgcat gatttggggt 420
caaggcgatg gatccggcct acgcgcggtc gatagcgccg tgggccggat cggccagctt 480
gcctgctggg agcactacct tcccctggcg cggtacgccc tcatcgcgga cggagagcaa 540
atccactcgg caatgtatcc aggctcgttc gctggtccgc tatttgccga gcagatagag 600
gttagtatcc gccagcacgc gcttgagtca gcctgcttcg tcgtcaacgc gaccggatgg 660
cttagcgccg agcagcaagc tcaaatagtg aaggataccg gatgcgtcgt tggaccaatc 720
tccggtggct gctttacggc gattgttgat ccggagggtc ggatcatggg ggcgccactc 780
aaggcaggtg agggggaggt catcgcagat ctcgattttg cgcagattga tttccgcaag 840
cgtgtgatgg atacgcgagg gcactacagc cgccccgaac ttctaagcct cacgatcgac 900
cgcagtcagc accatcacat gactgagcga ggcgccgatc accgtgtaga ccacgcaaag 960
ccaacggtca ccgcagagca gtcggccgtc gagccggcgg aatga 1005
<210>24
<211>334
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>24
Met Thr Val Ile Lys Ala Ala Ala Ile Gln Ile Ser Pro Val Leu Tyr
l 5 10 15
Ser Arg Ala Gly Thr Val Glu Lys Val Val Arg Lys Val Arg Glu Leu
20 25 30
Gly Ala Lys Gly Val Arg Phe Ala Thr Phe Pro Glu Thr Ile Ile Pro
35 40 45
Tyr Tyr Pro Tyr Phe Ser Phe Val Gln Ser Ala Phe Asp Met Lys Leu
50 55 60
Gly Ser Glu His Gln Arg Leu Leu Asp Glu Ser Val Thr Ile Pro Ser
65 70 75 80
Ser Glu Thr Asp Ala Ile Ala Gln Ala Ala Lys Glu Ala Gly Met Val
85 90 95
Val Ser Val Gly Val Asn Glu Arg Asp Gly Arg Ser Ile Tyr Asn Thr
100 105 110
Gln Leu Leu Phe Asp Ala Asp Gly Thr Leu Ile Gln Arg Arg Arg Lys
115 120 125
Ile Thr Pro Thr Tyr His Glu Arg Met Ile Trp Gly Gln Gly Asp Gly
130 135 140
Ser Gly Leu Arg Ala Val Asp Ser Ala Val Gly Arg Ile Gly Gln Leu
145 150 155 160
Ala Cys Trp Glu His Tyr Leu Pro Leu Ala Arg Tyr Ala Leu Ile Ala
165 170 175
Asp Gly Glu Gln Ile His Ser Ala Met Tyr Pro Gly Ser Phe Ala Gly
180 185 190
Pro Leu Phe Ala Glu Gln Ile Glu Val Ser Ile Arg Gln His Ala Leu
195 200 205
Glu Ser Ala Cys Phe Val Val Asn Ala Thr Gly Trp Leu Ser Ala Glu
210 215 220
Gln Gln Ala Gln Ile Val Lys Asp Thr Gly Cys Val Val Gly Pro Ile
225 230 235 240
Ser Gly Gly Cys Phe Thr Ala Ile Val Asp Pro Glu Gly Arg Ile Met
245 250 255
Gly Ala Pro Leu Lys Ala Gly Glu Gly Glu Val Ile Ala Asp Leu Asp
260 265 270
Phe Ala Gln Ile Asp Phe Arg Lys Arg Val Met Asp Thr Arg Gly His
275 280 285
Tyr Ser Arg Pro Glu Leu Leu Ser Leu Thr Ile Asp Arg Ser Gln His
290 295 300
His His Met Thr Glu Arg Gly Ala Asp His Arg Val Asp His Ala Lys
305 310 315 320
Pro Thr Val Thr Ala Glu Gln Ser Ala Val Glu Pro Ala Glu
325 330
<210>25
<211>939
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>25
gtgtcatcaa ctatcaaagt cgccattatt caggccgctc ctgcttacta cgacctgcag 60
gcgtcgctgg caaaggccgc cagtctgatc cgcgaggcgg cacgcggcgg cgcgcaattc 120
gtcgcgttcg gggagacatg gctgccgggc tatccgatgt ggctggattg gtgtcctggc 180
gcgatcatct gggataaccc cgccaccaaa accgtcttcg cgcgcctcca tgaaaacagc 240
gtcgccgttc ccggcaggga aacggcattt ctcgccgacc ttgcgatgtc gttaagcatc 300
gtattatgca tcggcgtcaa tgagaaggtc atgaatgggc cgggacacgg cacgctctac 360
aacacgctcc tgacgtttga tgcaacgggt gaaatcatca atcatcatcg caagttgatg 420
ccaacctatg gcgagagatt ggtatggggg ccgggcgacg cagttggcgt gcaagcggtt 480
gatagtacgg tcgggcgcat cggcgggctg atctgttggg agcactggat gccgctgcca 540
cgccaactca tgcacaacag cggcgagcag attcacgtct gcgcatggcc gggcgtgcac 600
gaaatgcacc agatcgcgag ccgtcattat gcattcgagg gccgctgctt tgtgctggcc 660
gccggattga tcatgcccgc gttcgacctg cccagcgaac tcgaatttcc gcccgaactg 720
gccgacaagc gcgactatct cctaatgaac ggcggcagcg ccatcatcaa gcccaatggc 780
aaatatctcg ccgggccggt ttatgacgaa gagactattc tctgcgccga ccttgacctg 840
actgagaaca tcaaggagca gatgacgctg gacgtgacag ggcattatgc gcgagcggaa 900
ctgtttgact tgaatgtggt gcggcggcgg aatgcgtag 939
<210>26
<211>312
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>26
Val Ser Ser Thr Ile Lys Val Ala Ile Ile Gln Ala Ala Pro Ala Tyr
1 5 10 15
Tyr Asp Leu Gln Ala Ser Leu Ala Lys Ala Ala Ser Leu Ile Arg Glu
20 25 30
Ala Ala Arg Gly Gly Ala Gln Phe Val Ala Phe Gly Glu Thr Trp Leu
35 40 45
Pro Gly Tyr Pro Met Trp Leu Asp Trp Cys Pro Gly Ala Ile Ile Trp
50 55 60
Asp Asn Pro Ala Thr Lys Thr Val Phe Ala Arg Leu His Glu Asn Ser
65 70 75 80
Val Ala Val Pro Gly Arg Glu Thr Ala Phe Leu Ala Asp Leu Ala Met
85 90 95
Ser Leu Ser Ile Val Leu Cys Ile Gly Val Asn Glu Lys Val Met Asn
100 105 110
Gly Pro Gly His Gly Thr Leu Tyr Asn Thr Leu Leu Thr Phe Asp Ala
115 120 125
Thr Gly Glu Ile Ile Asn His His Arg Lys Leu Met Pro Thr Tyr Gly
130 135 140
Glu Arg Leu Val Trp Gly Pro Gly Asp Ala Val Gly Val Gln Ala Val
145 150 155 160
Asp Ser Thr Val Gly Arg Ile Gly Gly Leu Ile Cys Trp Glu His Trp
165 170 175
Met Pro Leu Pro Arg Gln Leu Met His Asn Ser Gly Glu Gln Ile His
180 185 190
Val Cys Ala Trp Pro Gly Val His Glu Met His Gln Ile Ala Ser Arg
195 200 205
His Tyr Ala Phe Glu Gly Arg Cys Phe Val Leu Ala Ala Gly Leu Ile
210 215 220
Met Pro Ala Phe Asp Leu Pro Ser Glu Leu Glu Phe Pro Pro Glu Leu
225 230 235 240
Ala Asp Lys Arg Asp Tyr Leu Leu Met Asn Gly Gly Ser Ala Ile Ile
245 250 255
Lys Pro Asn Gly Lys Tyr Leu Ala Gly Pro Val Tyr Asp Glu Glu Thr
260 265 270
Ile Leu Cys Ala Asp Leu Asp Leu Thr Glu Asn Ile Lys Glu Gln Met
275 280 285
Thr Leu Asp Val Thr Gly His Tyr Ala Arg Ala Glu Leu Phe Asp Leu
290 295 300
Asn Val Val Arg Arg Arg Asn Ala
305 310
<210>27
<211>1056
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>27
atgccaaccc ccagcgatca tttcaaaatc gccgctgttc aggcctcgcc cgtgtttctg 60
gaccgggagg ccactgtgga aaaggcctgc cggttgatcg ccgaagccgc aaagcagggc 120
gcccgcctca tcgtctttcc ggaatctttc atcccgacct acccggattg ggtgtgggcc 180
gttcccccgg gaagggaaag aatcctgaac cagctgtatt ctgaattcct ggccaatgcc 240
gtcgatgttc ccggcgcggc gaccgaacaa cttgcccagg ctgcacgaat ggccggcgcc 300
tatgtgatta tgggcgtcac cgaaagagac acctcggcca gcggggccag cctctacaac 360
accctgctct acttcagccc cgaaggcatc ctaatgggca aacaccggaa gctggttccc 420
acggggggcg aacggctggt ctgggcctac ggagacggca gcacgctgga ggtctacgac 480
actccgctgg gaaagatcgg cgggctgatc tgctgggaga actacatgcc cctggcccgg 540
tacacgatgt acgcctgggg cacccagatt tacatcgccg ccacctggga ccgcggggaa 600
ccgtggctct ccaccctgcg gcatatcgcc aaggaaggaa gggtctacgt catcgggtgc 660
tgcatcgccc tgcgccaggg ggatatcccg gaccggttcg agtacaaggg aaaattttat 720
tccgggtccc gggagtggat caatgagggc gacagcgcca tcgtgaaccc ggacggggaa 780
ttcatcgccg ggccggtgcg gatgaaggag gagatcctgt atgccgagat agacccccgg 840
cagatgcggg gccccaagtg gatgctcgat gtggccggtc attacgcccg gccggatatc 900
ttcgagctca tcgtccaccg gaatccccac ccgatgatca aaatcgccga agacaggggc 960
acggggatcg cctcaagttt gattcgcccc cgccctaacc ttcccccatc aagggggagg 1020
aaatcggcaa gaagcaaacg caagcccaaa aaatga 1056
<210>28
<211>351
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>28
Met Pro Thr Pro Ser Asp His Phe Lys Ile Ala Ala Val Gln Ala Ser
1 5 10 15
Pro Val Phe Leu Asp Arg Glu Ala Thr Val Glu Lys Ala Cys Arg Leu
20 25 30
Ile Ala Glu Ala Ala Lys Gln Gly Ala Arg Leu Ile Val Phe Pro Glu
35 40 45
Ser Phe Ile Pro Thr Tyr Pro Asp Trp Val Trp Ala Val Pro Pro Gly
50 55 60
Arg Glu Arg Ile Leu Asn Gln Leu Tyr Ser Glu Phe Leu Ala Asn Ala
65 70 75 80
Val Asp Val Pro Gly Ala Ala Thr Glu Gln Leu Ala Gln Ala Ala Arg
85 90 95
Met Ala Gly Ala Tyr Val Ile Met Gly Val Thr Glu Arg Asp Thr Ser
100 105 110
Ala Ser Gly Ala Ser Leu Tyr Asn Thr Leu Leu Tyr Phe Ser Pro Glu
115 120 125
Gly Ile Leu Met Gly Lys His Arg Lys Leu Val Pro Thr Gly Gly Glu
130 135 140
Arg Leu Val Trp Ala Tyr Gly Asp Gly Ser Thr Leu Glu Val Tyr Asp
145 150 155 160
Thr Pro Leu Gly Lys Ile Gly Gly Leu Ile Cys Trp Glu Asn Tyr Met
165 170 175
Pro Leu Ala Arg Tyr Thr Met Tyr Ala Trp Gly Thr Gln Ile Tyr Ile
180 185 190
Ala Ala Thr Trp Asp Arg Gly Glu Pro Trp Leu Ser Thr Leu Arg His
195 200 205
Ile Ala Lys Glu Gly Arg Val Tyr Val Ile Gly Cys Cys Ile Ala Leu
210 215 220
Arg Gln Gly Asp Ile Pro Asp Arg Phe Glu Tyr Lys Gly Lys Phe Tyr
225 230 235 240
Ser Gly Ser Arg Glu Trp Ile Asn Glu Gly Asp Ser Ala Ile Val Asn
245 250 255
Pro Asp Gly Glu Phe Ile Ala Gly Pro Val Arg Met Lys Glu Glu Ile
260 265 270
Leu Tyr Ala Glu Ile Asp Pro Arg Gln Met Arg Gly Pro Lys Trp Met
275 280 285
Leu Asp Val Ala Gly His Tyr Ala Arg Pro Asp Ile Phe Glu Leu Ile
290 295 300
Val His Arg Asn Pro His Pro Met Ile Lys Ile Ala Glu Asp Arg Gly
305 310 315 320
Thr Gly Ile Ala Ser Ser Leu Ile Arg Pro Arg Pro Asn Leu Pro Pro
325 330 335
Ser Arg Gly Arg Lys Ser Ala Arg Ser Lys Arg Lys Pro Lys Lys
340 345 350
<210>29
<211>1017
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>29
atgggcatcg aacatacgaa atacaaggtg gcggtggtgc aggcggcgcc ggcctggctc 60
gacctcgagg cctcgatcgg caagtccatc ggcctaatca aggaggccgc ggacaagggc 120
gccaagctga tcgcctttcc ggaggccttc atccccggtt acccctggta tatctggatg 180
gactcgccgg cctgggcgat cggccgcggc ttcgtccagc gctatttcga caattcgctc 240
tcctacgaca gtccccaggc cgagcggctg cgtgatgccg tgcgccaggc caagctcacc 300
gccgtgatcg gcctgtccga acgcgacggc ggcagccttt acctggcgca atggttgatc 360
gggcccgacg gcgaaaccat tgccaagcgc cgcaagctgc ggccgaccca tgccgagcgc 420
accgtctatg gcgaaggcga cggcagcgat ctggccgtac atgcccggcc cgacatcggt 480
cgcttgggcg cgctgtgctg ctgggagcat cttcagccgt tgtcgaagta cgcaatgtac 540
gcccagaacg agcaggtcca cgtcgctgcc tggccgagct tctcgctcta cgatcccttc 600
gccccggcgc tcggcgccga ggtcaacaac gctgcctcgc gcgtctatgc ggtggagggc 660
tcctgcttcg tgctcgcgcc ttgcgcgacg gtgtcgcagg ccatgatcga cgaactctgc 720
gatcggcccg ataagcatgc gctgctgcat gccggcggag gctttgccgc gatctacggc 780
cccgacggca gccagatcgg cgagaagctg gcgccggatc aggagggtct gctgatcgcc 840
gagattgatc tgggcgccat cggtgttgcc aagaacgcgg cagatcccgc cggtcattat 900
tcacggccgg atgtgacgcg gttgctgctc aacaagaagc ggtaccagcg cgtcgagcaa 960
tttgccttgc ccgccgacat ggtcgagccc gcggacatag gcgcggcggc gagctga 1017
<210>30
<211>338
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>30
Met Gly Ile Glu His Thr Lys Tyr Lys Val Ala Val Val Gln Ala Ala
1 5 10 15
Pro Ala Trp Leu Asp Leu Glu Ala Ser Ile Gly Lys Ser Ile Gly Leu
20 25 30
Ile Lys Glu Ala Ala Asp Lys Gly Ala Lys Leu Ile Ala Phe Pro Glu
35 40 45
Ala Phe Ile Pro Gly Tyr Pro Trp Tyr Ile Trp Met Asp Ser Pro Ala
50 55 60
Trp Ala Ile Gly Arg Gly Phe Val Gln Arg Tyr Phe Asp Asn Ser Leu
65 70 75 80
Ser Tyr Asp Ser Pro Gln Ala Glu Arg Leu Arg Asp Ala Val Arg Gln
85 90 95
Ala Lys Leu Thr Ala Val Ile Gly Leu Ser Glu Arg Asp Gly Gly Ser
100 105 110
Leu Tyr Leu Ala Gln Trp Leu Ile Gly Pro Asp Gly Glu Thr Ile Ala
115 120 125
Lys Arg Arg Lys Leu Arg Pro Thr His Ala Glu Arg Thr Val Tyr Gly
130 135 140
Glu Gly Asp Gly Ser Asp Leu Ala Val His Ala Arg Pro Asp Ile Gly
145 150 155 160
Arg Leu Gly Ala Leu Cys Cys Trp Glu His Leu Gln Pro Leu Ser Lys
165 170 175
Tyr Ala Met Tyr Ala Gln Asn Glu Gln Val His Val Ala Ala Trp Pro
180 185 190
Ser Phe Ser Leu Tyr Asp Pro Phe Ala Pro Ala Leu Gly Ala Glu Val
195 200 205
Asn Asn Ala Ala Ser Arg Val Tyr Ala Val Glu Gly Ser Cys Phe Val
210 215 220
Leu Ala Pro Cys Ala Thr Val Ser Gln Ala Met Ile Asp Glu Leu Cys
225 230 235 240
Asp Arg Pro Asp Lys His Ala Leu Leu His Ala Gly Gly Gly Phe Ala
245 250 255
Ala Ile Tyr Gly Pro Asp Gly Ser Gln Ile Gly Glu Lys Leu Ala Pro
260 265 270
Asp Gln Glu Gly Leu Leu Ile Ala Glu Ile Asp Leu Gly Ala Ile Gly
275 280 285
Val Ala Lys Asn Ala Ala Asp Pro Ala Gly His Tyr Ser Arg Pro Asp
290 295 300
Val Thr Arg Leu Leu Leu Asn Lys Lys Arg Tyr Gln Arg Val Glu Gln
305 310 315 320
Phe Ala Leu Pro Ala Asp Met Val Glu Pro Ala Asp Ile Gly Ala Ala
325 330 335
Ala Ser
<210>31
<211>933
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>31
atgaccagaa tagccattat tcagcgaccg cccgtgctgc tcgatcgaag cgccaccatt 60
gcccgggccg tgcaatcggt cgccgaagcg gcagcgcaag gcgcgaccct gattgtcttg 120
cccgaatcgt acatccctgg ctatccctca tggatctggc ggctcgcgcc tggcaaagac 180
ggcgcgatcg tgggccagtt gcatgcgcgc ttgctggcca atgcggtcga cctgagcagc 240
actgacctcg atgcgcttct tgaagcggcc cgtcagcacg gcgtgaccat tgtttgcggc 300
atgaacgagt gcgaacggcg tcgcggcggc ggcaccttgt acaacacggt ggtcgtgatc 360
ggaccggacg gcgtcatgct caaccggcat cgcaaattga tgccgaccaa tcccgagcgc 420
atggtgcatg gctttggcga tgcatccgga ctgaaagcag ttgatacgcc tgccggccgg 480
ctgggcacgc tgatctgctg ggagagctac atgccgctgg cacgctatgc cctgtacgag 540
caaggcatcg agatctacat cgcaccaact tatgacagtg gtgacggctg gatcagcacc 600
atgcgccaca ttgcactcga agggcgctgc tgggtgattg gcagcggcac ggtcctgaaa 660
ggcagtgata ttccggacga tttcccggaa cgggcacgcc tgttccctga tccggatgag 720
tggatcaacg atggtgattc ggtagttatc gatccgcagg gaaagatcgt tgccggtccg 780
atgcgtaggg aagcaggcat tctatacgcc gatatcgacg tcgcgcgcgt agcaccatca 840
cgccgcacgc tggatgtcgc ggggcattac gcgcgtccgg acgtcttcga gcttcgggta 900
caccaggcac cgggggcacg agtaagtaat tga 933
<210>32
<211>310
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>32
Met Thr Arg Ile Ala Ile Ile Gln Arg Pro Pro Val Leu Leu Asp Arg
1 5 10 15
Ser Ala Thr Ile Ala Arg Ala Val Gln Ser Val Ala Glu Ala Ala Ala
20 25 30
Gln Gly Ala Thr Leu Ile Val Leu Pro Glu Ser Tyr Ile Pro Gly Tyr
35 40 45
Pro Ser Trp Ile Trp Arg Leu Ala Pro Gly Lys Asp Gly Ala Ile Val
50 55 60
Gly Gln Leu His Ala Arg Leu Leu Ala Asn Ala Val Asp Leu Ser Ser
65 70 75 80
Thr Asp Leu Asp Ala Leu Leu Glu Ala Ala Arg Gln His Gly Val Thr
85 90 95
Ile Val Cys Gly Met Asn Glu Cys Glu Arg Arg Arg Gly Gly Gly Thr
100 105 110
Leu Tyr Asn Thr Val Val Val Ile Gly Pro Asp Gly Val Met Leu Asn
115 120 125
Arg His Arg Lys Leu Met Pro Thr Asn Pro Glu Arg Met Val His Gly
130 135 140
Phe Gly Asp Ala Ser Gly Leu Lys Ala Val Asp Thr Pro Ala Gly Arg
145 150 155 160
Leu Gly Thr Leu Ile Cys Trp Glu Ser Tyr Met Pro Leu Ala Arg Tyr
165 170 175
Ala Leu Tyr Glu Gln Gly Ile Glu Ile Tyr Ile Ala Pro Thr Tyr Asp
180 185 190
Ser Gly Asp Gly Trp Ile Ser Thr Met Arg His Ile Ala Leu Glu Gly
195 200 205
Arg Cys Trp Val Ile Gly Ser Gly Thr Val Leu Lys Gly Ser Asp Ile
210 215 220
Pro Asp Asp Phe Pro Glu Arg Ala Arg Leu Phe Pro Asp Pro Asp Glu
225 230 235 240
Trp Ile Asn Asp Gly Asp Ser Val Val Ile Asp Pro Gln Gly Lys Ile
245 250 255
Val Ala Gly Pro Met Arg Arg Glu Ala Gly Ile Leu Tyr Ala Asp Ile
260 265 270
Asp Val Ala Arg Val Ala Pro Ser Arg Arg Thr Leu Asp Val Ala Gly
275 280 285
His Tyr Ala Arg Pro Asp Val Phe Glu Leu Arg Val His Gln Ala Pro
290 295 300
Gly Ala Arg Val Ser Asn
305 310
<210>33
<211>1026
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>33
atgttaagtc ccgtgacgca gtatcgcgcc gccgcggtgc aggcggcgcc atcttttctc 60
gatctcgacc gcaccgtcga gaagacgatc gcgatcatcg agcaggcggc cgagcaggat 120
gtgcgcctga tcgcgtttcc ggaaacctgg attcccggct atccgctctg gatctggctc 180
ggctcgccgg cctggggcat gcgcttcgtg cagcgctatt tcgagaactc gctggtgcgc 240
ggcagcaaac agtggaacgc gatcgccgat gcggcgcggc gccaccgcat gaccgtcgtc 300
gtcggcttca gcgagcgcgc gggaggcagc ctctacatgg gccaggcgat cttcggcccc 360
gaaggcgagc tcatcgcggc gcgccggaag ctcaagccga cacacgccga gcgaacggtg 420
ttcggcgagg gcgacggcag ccacttggcc gtttacgaga cgggcgttgg tcgcatcggc 480
gccctctgct gctgggagca catccagccg ctctcgaaat acgcgatgta tgcggccaac 540
gaacaggtgc atgtggcctc gtggccgtgc ttcagccttt atcgcggcat ggcctatgcg 600
ctcgggccgg aggtgaacac cgccgcgagc caggtctacg cggtcgaggg cggctgctac 660
gtgctggcct cctgtctcgt cgtgacaccc gagatcctga aggtgctgat cgacacgccc 720
gacaaggagc cgttgctgct cgccggcggg gggttctcga tgatcttcgg ccccgacggc 780
cgcgcgctcg cccagccgct gccggagacc gaagaggggc tcgtcacggc cgagatcgat 840
ctcggcgcga tcgcgctcgc caaggccgcg gccgatcccg ccggccatta cgcgcggccc 900
gacgtgacgc ggttgttgct gaacccgcgc cccgcggcgc gcgtcgaagc gctgggtccg 960
cgcttcgagg tcgtgcagag cgagcaggcc gagccgccca cgcaaccggc cgaagcggcg 1020
gattga 1026
<210>34
<211>341
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>34
Met Leu Ser Pro Val Thr Gln Tyr Arg Ala Ala Ala Val Gln Ala Ala
1 5 10 15
Pro Ser Phe Leu Asp Leu Asp Arg Thr Val Glu Lys Thr Ile Ala Ile
20 25 30
Ile Glu Gln Ala Ala Glu Gln Asp Val Arg Leu Ile Ala Phe Pro Glu
35 40 45
Thr Trp Ile Pro Gly Tyr Pro Leu Trp Ile Trp Leu Gly Ser Pro Ala
50 55 60
Trp Gly Met Arg Phe Val Gln Arg Tyr Phe Glu Asn Ser Leu Val Arg
65 70 75 80
Gly Ser Lys Gln Trp Asn Ala Ile Ala Asp Ala Ala Arg Arg His Arg
85 90 95
Met Thr Val Val Val Gly Phe Ser Glu Arg Ala Gly Gly Ser Leu Tyr
100 105 110
Met Gly Gln Ala Ile Phe Gly Pro Glu Gly Glu Leu Ile Ala Ala Arg
115 120 125
Arg Lys Leu Lys Pro Thr His Ala Glu Arg Thr Val Phe Gly Glu Gly
130 135 140
Asp Gly Ser His Leu Ala Val Tyr Glu Thr Gly Val Gly Arg Ile Gly
145 150 155 160
Ala Leu Cys Cys Trp Glu His Ile Gln Pro Leu Ser Lys Tyr Ala Met
165 170 175
Tyr Ala Ala Asn Glu Gln Val His Val Ala Ser Trp Pro Cys Phe Ser
180 185 190
Leu Tyr Arg Gly Met Ala Tyr Ala Leu Gly Pro Glu Val Asn Thr Ala
195 200 205
Ala Ser Gln Val Tyr Ala Val Glu Gly Gly Cys Tyr Val Leu Ala Ser
210 215 220
Cys Leu Val Val Thr Pro Glu Ile Leu Lys Val Leu Ile Asp Thr Pro
225 230 235 240
Asp Lys Glu Pro Leu Leu Leu Ala Gly Gly Gly Phe Ser Met Ile Phe
245 250 255
Gly Pro Asp Gly Arg Ala Leu Ala Gln Pro Leu Pro Glu Thr Glu Glu
260 265 270
Gly Leu Val Thr Ala Glu Ile Asp Leu Gly Ala Ile Ala Leu Ala Lys
275 280 285
Ala Ala Ala Asp Pro Ala Gly His Tyr Ala Arg Pro Asp Val Thr Arg
290 295 300
Leu Leu Leu Asn Pro Arg Pro Ala Ala Arg Val Glu Ala Leu Gly Pro
305 310 315 320
Arg Phe Glu Val Val Gln Ser Glu Gln Ala Glu Pro Pro Thr Gln Pro
325 330 335
Ala Glu Ala Ala Asp
340
<210>35
<211>942
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>35
atgacagctc taaaaattgc tgctgttcaa atgtgcgccg aattgggcgc tacagatcga 60
aacctgagtg cagctggatc attcgtgcgc gacgcatttc gcgaaggtgc ccagtgggta 120
atcctcccag agttttttac ctcggcaatg gcattcgcac cttcgatggc gcaagcttgg 180
ttgccactgg aaggaaaggc gctagcgatg atgcgcagcc ttgcgcgtca attcgatggg 240
gttgttggag gctcatatgt tgccagagag gggaacgact gcgtaaatgc ctttcttctc 300
gtctttccgg atggaagcta ctaccggcat gacaaagata ttccaacaat gtgggagaac 360
tgttactaca tcggcggcgt cgacgatggg gtgctggaaa caccaattgg tgcggtggga 420
gttgcactgt gttgggagtt catccgaaca caaaccgccc gaagactgaa ggatcgcgtt 480
caattagtgg ttggcggtac ttgctggtgg gattttccga tgcctgtacc tgaacgatat 540
ctgaggctga ccaggcatat ctccaggaac tttgagcgcg atgctccggc gcggttggcc 600
agtatgttgg gtgtgcctgt tgtacacgct tcccatgctg gggattttac tgctgtcacc 660
ccaggcaatg aaacgaagaa ttaccgatcc aactatctgg gagagaccca gatcgtcgat 720
gccaatggaa atgtgttgaa gcgaatgaca gtggctgatg gtgagggtta cgtcattgct 780
gacgttcaat tgggggccat atcaaccggt cgaacttcga tccccgacac cttctggacc 840
tgcaagctaa cgccaggggc acaacaggct tgggatgaac aaaatgcttt tgggtgtggc 900
tactatgaga acgtcacacg caaacaccta atcggtcgat ga 942
<210>36
<211>313
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>36
Met Thr Ala Leu Lys Ile Ala Ala Val Gln Met Cys Ala Glu Leu Gly
1 5 10 15
Ala Thr Asp Arg Asn Leu Ser Ala Ala Gly Ser Phe Val Arg Asp Ala
20 25 30
Phe Arg Glu Gly Ala Gln Trp Val Ile Leu Pro Glu Phe Phe Thr Ser
35 40 45
Ala Met Ala Phe Ala Pro Ser Met Ala Gln Ala Trp Leu Pro Leu Glu
50 55 60
Gly Lys Ala Leu Ala Met Met Arg Ser Leu Ala Arg Gln Phe Asp Gly
65 70 75 80
Val Val Gly Gly Ser Tyr Val Ala Arg Glu Gly Asn Asp Cys Val Asn
85 90 95
Ala Phe Leu Leu Val Phe Pro Asp Gly Ser Tyr Tyr Arg His Asp Lys
100 105 110
Asp Ile Pro Thr Met Trp Glu Asn Cys Tyr Tyr Ile Gly Gly Val Asp
115 120 125
Asp Gly Val Leu Glu Thr Pro Ile Gly Ala Val Gly Val Ala Leu Cys
130 135 140
Trp Glu Phe Ile Arg Thr Gln Thr Ala Arg Arg Leu Lys Asp Arg Val
145 150 155 160
Gln Leu Val Val Gly Gly Thr Cys Trp Trp Asp Phe Pro Met Pro Val
165 170 175
Pro Glu Arg Tyr Leu Arg Leu Thr Arg His Ile Ser Arg Asn Phe Glu
180 185 190
Arg Asp Ala Pro Ala Arg Leu Ala Ser Met Leu Gly Val Pro Val Val
195 200 205
His Ala Ser His Ala Gly Asp Phe Thr Ala Val Thr Pro Gly Asn Glu
210 215 220
Thr Lys Asn Tyr Arg Ser Asn Tyr Leu Gly Glu Thr Gln Ile Val Asp
225 230 235 240
Ala Asn Gly Asn Val Leu Lys Arg Met Thr Val Ala Asp Gly Glu Gly
245 250 255
Tyr Val Ile Ala Asp Val Gln Leu Gly Ala Ile Ser Thr Gly Arg Thr
260 265 270
Ser Ile Pro Asp Thr Phe Trp Thr Cys Lys Leu Thr Pro Gly Ala Gln
275 280 285
Gln Ala Trp Asp Glu Gln Asn Ala Phe Gly Cys Gly Tyr Tyr Glu Asn
290 295 300
Val Thr Arg Lys His Leu Ile Gly Arg
305 310
<210>37
<211>993
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>37
atggattcag atatgacgga tacttttaag gcagccatta ttcagcacgc gcctgttttt 60
ttaaatctcg aagagagtct ggacaaagcc ggcagcctta tagaaaaggc tgccgatcaa 120
ggcgcgaaag tgatcgcctt tcctgaaaca tggctgcccg gttatcccgt atggctcgac 180
tactctccaa aagcgggtct gtgggactat cagcctgcaa aatctctcta tcgtctgcta 240
gtcgataatt cagtcacctt acccggcaaa cacctcgatc aactcctctc catagcgcaa 300
aagaccggcg catatgttgt aatgggggca cacgaacgag tgggtggaac actctataac 360
acgacgatct atgttgggat tgatgggaag gagtacaaac ttcatagaaa gctggtgccg 420
acctataccg aaagattgat ctgggggcgg ggagacggca gcacattgag tgtgttgatg 480
acggattatg gcgttcttgg aggattgatc tgctgggagc actggatgcc tctggcaaga 540
gccgcaatgc atgccagata tgaaaccctt catgtggcgc aatggccggc tgtaaaagat 600
atccatcaga tagcaagcag acattatgct tttgaaggcc ggtgtttcgt gctcgcggca 660
ggctctgttc tgactcgaag agatataata gaaggattca actcactggc tcgcgccgat 720
agtgatgcat tggaacttct gaaagctatt tcgggagaag atagtgatct tattttgaat 780
gggggaagcg cgataattgc gccgaatgga gagtatcttg cgggcccggt ctttaatgaa 840
ccctccatta tttatgctga aattgatcct gcactgataa gtgagggcca tcttacactg 900
gatacaagcg gacactactc gcgccctgac atttttcgtc tggagataaa cgatcaacct 960
caacatgatg taactttcag atcggggcat tag 993
<210>38
<211>330
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>38
Met Asp Ser Asp Met Thr Asp Thr Phe Lys Ala Ala Ile Ile Gln His
1 5 10 15
Ala Pro Val Phe Leu Asn Leu Glu Glu Ser Leu Asp Lys Ala Gly Ser
20 25 30
Leu Ile Glu Lys Ala Ala Asp Gln Gly Ala Lys Val Ile Ala Phe Pro
35 40 45
Glu Thr Trp Leu Pro Gly Tyr Pro Val Trp Leu Asp Tyr Ser Pro Lys
50 55 60
Ala Gly Leu Trp Asp Tyr Gln Pro Ala Lys Ser Leu Tyr Arg Leu Leu
65 70 75 80
Val Asp Asn Ser Val Thr Leu Pro Gly Lys His Leu Asp Gln Leu Leu
85 90 95
Ser Ile Ala Gln Lys Thr Gly Ala Tyr Val Val Met Gly Ala His Glu
100 105 110
Arg Val Gly Gly Thr Leu Tyr Asn Thr Thr Ile Tyr Val Gly Ile Asp
115 120 125
Gly Lys Glu Tyr Lys Leu His Arg Lys Leu Val Pro Thr Tyr Thr Glu
130 135 140
Arg Leu Ile Trp Gly Arg Gly Asp Gly Ser Thr Leu Ser Val Leu Met
145 150 155 160
Thr Asp Tyr Gly Val Leu Gly Gly Leu Ile Cys Trp Glu His Trp Met
165 170 175
Pro Leu Ala Arg Ala Ala Met His Ala Arg Tyr Glu Thr Leu His Val
180 185 190
Ala Gln Trp Pro Ala Val Lys Asp Ile His Gln Ile Ala Ser Arg His
195 200 205
Tyr Ala Phe Glu Gly Arg Cys Phe Val Leu Ala Ala Gly Ser Val Leu
210 215 220
Thr Arg Arg Asp Ile Ile Glu Gly Phe Asn Ser Leu Ala Arg Ala Asp
225 230 235 240
Ser Asp Ala Leu Glu Leu Leu Lys Ala Ile Ser Gly Glu Asp Ser Asp
245 250 255
Leu Ile Leu Asn Gly Gly Ser Ala Ile Ile Ala Pro Asn Gly Glu Tyr
260 265 270
Leu Ala Gly Pro Val Phe Asn Glu Pro Ser Ile Ile Tyr Ala Glu Ile
275 280 285
Asp Pro Ala Leu Ile Ser Glu Gly His Leu Thr Leu Asp Thr Ser Gly
290 295 300
His Tyr Ser Arg Pro Asp Ile Phe Arg Leu Glu Ile Asn Asp Gln Pro
305 310 315 320
Gln His Asp Val Thr Phe Arg Ser Gly His
325 330
<210>39
<211>1008
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>39
atgaaaaata tcaaaaactc agaaaaaagc agcacagtaa gagtcgctgc ggtacaaatc 60
agtccggtgt tgtacaaccg cgaagctacc gttcaaaaag tagtcaacaa aatccttgaa 120
ctaggaaaac aaggggtaca attcgccact tttccggaaa cgatagtgcc ttattatcct 180
tatttctctt ttattcaggc gccttatgcc atgggcaaag aacacctgcg cttgcttgaa 240
caatcagtta ctgttccgtc agccgcgacc gatgccataa gtgaggcggc aaaggaagcc 300
aatatggtag tgtctattgg tgtcaatgaa cgagacggtg gtaccattta caatacgcaa 360
ctcctttttg atgctgacgg aacattaatt cagcgcagac gtaaacttac accaacgtat 420
catgaaagaa tgatttgggg acaaggtgac gcttcaggtc ttcgtgccac agacagcgct 480
gttgggcgta tcgggcagtt ggcttgttgg gaacattaca atccattgtt ccgttatgct 540
ttgattgctg atggagaaca aatccattct gccatgtatc ccggatcatt tttaggtgcg 600
ttgcacggtg aacaaaccga aatcaatgta cgccaacacg ctttagaatc ggccagcttc 660
gtcgtagtgg ctaccggttg gttggatgcc gatcaacaag cacaaattgc gaaagacacc 720
ggtggaccaa tcggaccaat ttcgggaggt tgttttacag ccgttatagg ccctgacgga 780
caactaatcg gggaagccct tacatcaggt gaaggggaag tgattgccga tattgatttg 840
gcacaaattg atgcccgcaa aagattaatg gatgccagtg gtcactacaa ccgtcctgaa 900
ttgttgagct tgcatatcga tcacactccg actgctccta tgcatgaaag agtagtttac 960
actgagccgg gattagcaaa aagacaaaat gaaaattcat caaattaa 1008
<210>40
<211>335
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>40
Met Lys Asn Ile Lys Asn Ser Glu Lys Ser Ser Thr Val Arg Val Ala
1 5 10 15
Ala Val Gln Ile Ser Pro Val Leu Tyr Asn Arg Glu Ala Thr Val Gln
20 25 30
Lys Val Val Asn Lys Ile Leu Glu Leu Gly Lys Gln Gly Val Gln Phe
35 40 45
Ala Thr Phe Pro Glu Thr Ile Val Pro Tyr Tyr Pro Tyr Phe Ser Phe
50 55 60
Ile Gln Ala Pro Tyr Ala Met Gly Lys Glu His Leu Arg Leu Leu Glu
65 70 75 80
Gln Ser Val Thr Val Pro Ser Ala Ala Thr Asp Ala Ile Ser Glu Ala
85 90 95
Ala Lys Glu Ala Asn Met Val Val Ser Ile Gly Val Asn Glu Arg Asp
100 105 110
Gly Gly Thr Ile Tyr Asn Thr Gln Leu Leu Phe Asp Ala Asp Gly Thr
115 120 125
Leu Ile Gln Arg Arg Arg Lys Leu Thr Pro Thr Tyr His Glu Arg Met
130 135 140
Ile Trp Gly Gln Gly Asp Ala Ser Gly Leu Arg Ala Thr Asp Ser Ala
145 150 155 160
Val Gly Arg Ile Gly Gln Leu Ala Cys Trp Glu His Tyr Asn Pro Leu
165 170 175
Phe Arg Tyr Ala Leu Ile Ala Asp Gly Glu Gln Ile His Ser Ala Met
180 185 190
Tyr Pro Gly Ser Phe Leu Gly Ala Leu His Gly Glu Gln Thr Glu Ile
195 200 205
Asn Val Arg Gln His Ala Leu Glu Ser Ala Ser Phe Val Val Val Ala
210 215 220
Thr Gly Trp Leu Asp Ala Asp Gln Gln Ala Gln Ile Ala Lys Asp Thr
225 230 235 240
Gly Gly Pro Ile Gly Pro Ile Ser Gly Gly Cys Phe Thr Ala Val Ile
245 250 255
Gly Pro Asp Gly Gln Leu Ile Gly Glu Ala Leu Thr Ser Gly Glu Gly
260 265 270
Glu Val Ile Ala Asp Ile Asp Leu Ala Gln Ile Asp Ala Arg Lys Arg
275 280 285
Leu Met Asp Ala Ser Gly His Tyr Asn Arg Pro Glu Leu Leu Ser Leu
290 295 300
His Ile Asp His Thr Pro Thr Ala Pro Met His Glu Arg Val Val Tyr
305 310 315 320
Thr Glu Pro Gly Leu Ala Lys Arg Gln Asn Glu Asn Ser Ser Asn
325 330 335
<210>41
<211>966
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>41
atgcccaatg agaataacat cgctacattc aaagttgccg cagtccaggc cacaccggtg 60
tttcttgatc gtgaagcaac catcgacaaa gcttgcgggt tgattgccac tgccggcaat 120
gaaggagcgc gcctgattgt gtttccagaa gcgttcatcc caacctatcc tgaatgggtt 180
tggggtattc cttccggtga gcaaggttta ctcaatgaac tctatgcaga gctgctcacc 240
aatgcggtca ctattcccag tgacgcgact gacaggctgt gcgaggccgc gcagcttgcg 300
aatgcctacg tagtgatggg aatgagcgaa cggaatgtcg aggcgagtgg cgcaagccta 360
tataatacgc tgttgtacat caatgcgcag ggggagattt tagggaaaca tcgaaagctg 420
gtgccaacgg gcggcgaacg cctggtatgg gcgcagggtg atggcagcac gctgcaggtc 480
tacgatacgc cattgggaaa actcggtggt ctcatttgct gggaaaatta tatgccgctg 540
gcacgctatg ctatgtatgc ctgggggaca caaatctatg tcgcggcaac gtgggatcga 600
ggccaaccct ggctttctac attacggcat atcgccaaag aaggcagggt atacgtgatt 660
ggttgctgta tcgcgatgcg aaaagacgat attccggata gttactccat gaagcagaaa 720
taccatgctg aaatggatga atggattaat gttggcgaca gtgtgattgt caatcccgaa 780
ggacacttta tcgcagggcc tgtgcgcaag caagaagaaa ttctctacgc ggagatcgat 840
ccacgtatgg tgcaaggccc gaagtggatg ctcgatgtgg cggggcatta tgcgagacca 900
gatgtgttcc agttgacggt gcatacgaat gtgagagaga tgatgcgggt ggaagatgat 960
tcataa 966
<210>42
<211>321
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>42
Met Pro Asn Glu Asn Asn Ile Ala Thr Phe Lys Val Ala Ala Val Gln
1 5 10 15
Ala Thr Pro Val Phe Leu Asp Arg Glu Ala Thr Ile Asp Lys Ala Cys
20 25 30
Gly Leu Ile Ala Thr Ala Gly Asn Glu Gly Ala Arg Leu Ile Val Phe
35 40 45
Pro Glu Ala Phe Ile Pro Thr Tyr Pro Glu Trp Val Trp Gly Ile Pro
50 55 60
Ser Gly Glu Gln Gly Leu Leu Asn Glu Leu Tyr Ala Glu Leu Leu Thr
65 70 75 80
Asn Ala Val Thr Ile Pro Ser Asp Ala Thr Asp Arg Leu Cys Glu Ala
85 90 95
Ala Gln Leu Ala Asn Ala Tyr Val Val Met Gly Met Ser Glu Arg Asn
100 105 110
Val Glu Ala Ser Gly Ala Ser Leu Tyr Asn Thr Leu Leu Tyr Ile Asn
115 120 125
Ala Gln Gly Glu Ile Leu Gly Lys His Arg Lys Leu Val Pro Thr Gly
130 135 140
Gly Glu Arg Leu Val Trp Ala Gln Gly Asp Gly Ser Thr Leu Gln Val
145 150 155 160
Tyr Asp Thr Pro Leu Gly Lys Leu Gly Gly Leu Ile Cys Trp Glu Asn
165 170 175
Tyr Met Pro Leu Ala Arg Tyr Ala Met Tyr Ala Trp Gly Thr Gln Ile
180 185 190
Tyr Val Ala Ala Thr Trp Asp Arg Gly Gln Pro Trp Leu Ser Thr Leu
195 200 205
Arg His Ile Ala Lys Glu Gly Arg Val Tyr Val Ile Gly Cys Cys Ile
210 215 220
Ala Met Arg Lys Asp Asp Ile Pro Asp Ser Tyr Ser Met Lys Gln Lys
225 230 235 240
Tyr His Ala Glu Met Asp Glu Trp Ile Asn Val Gly Asp Ser Val Ile
245 250 255
Val Asn Pro Glu Gly His Phe Ile Ala Gly Pro Val Arg Lys Gln Glu
260 265 270
Glu Ile Leu Tyr Ala Glu Ile Asp Pro Arg Met Val Gln Gly Pro Lys
275 280 285
Trp Met Leu Asp Val Ala Gly His Tyr Ala Arg Pro Asp Val Phe Gln
290 295 300
Leu Thr Val His Thr Asn Val Arg Glu Met Met Arg Val Glu Asp Asp
305 310 315 320
Ser
<210>43
<211>993
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>43
atgagagtcg ttaaagccgc ggcggtccaa ctgaaacctg tcctctatag ccgagaggga 60
actgtcgaaa acgtcgtccg taaaatccac gagcttggac agcaaggggt acagttcgcg 120
acgtttccag agactgtggt gccttactac ccgtactttt cgatcgtgca gtccggctat 180
caaattcttg gcggtggtga gttcctgaag ctgcttgatc agtcagtaac cgtgccatct 240
ctcgctacgg aagcgatcgg cgaggcttgc aggcaggcgg gcgtcgttgt ctccatcggc 300
gtcaacgagc gtgatggagg aactctatac aacacgcaac ttctctttga tgccgacgga 360
acattgattc aaagacgacg caagatcaca cccacccatt acgagcgcat ggtctggggc 420
cagggcgatg gctcaggttt acgggccatt gacagcaagg tcgcgcgcat tggtcaactg 480
gcgtgttttg agcactacaa ccctctcgca cgttacgcga tgatggccga tggcgagcag 540
atccattctg cgatgttccc cggctccatg ttcggcgata atttttcaga gaaggtggaa 600
atcaacataa ggcagcatgc aatggagtct gggtgctttg tcgtttgcgc tactgcctgg 660
ttggatgctg accagcaggc tcaaatcatg aaagacacgg gatgtgagat cggaccgatc 720
tcaggaggtt gcttcacagc gatcgcggca ccagatggaa gccttatagg tgaacccatc 780
cgctcaggtg aaggcgtttg tattgccgac ctcgatttca aacttatcga caagcggaag 840
cacgtagtag acacacgcgg ccattatagc cggccagaat tgctcagcct cctgattgat 900
cggacgccga cggcccacat acacgaaagg accgagcaac cgagggcggc catcgagaaa 960
gagtcgcagg atgttttcac cgctgttgct taa 993
<210>44
<211>330
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>44
Met Arg Val Val Lys Ala Ala Ala Val Gln Leu Lys Pro Val Leu Tyr
1 5 10 15
Ser Arg Glu Gly Thr Val Glu Asn Val Val Arg Lys Ile His Glu Leu
20 25 30
Gly Gln Gln Gly Val Gln Phe Ala Thr Phe Pro Glu Thr Val Val Pro
35 40 45
Tyr Tyr Pro Tyr Phe Ser Ile Val Gln Ser Gly Tyr Gln Ile Leu Gly
50 55 60
Gly Gly Glu Phe Leu Lys Leu Leu Asp Gln Ser Val Thr Val Pro Ser
65 70 75 80
Leu Ala Thr Glu Ala Ile Gly Glu Ala Cys Arg Gln Ala Gly Val Val
85 90 95
Val Ser Ile Gly Val Asn Glu Arg Asp Gly Gly Thr Leu Tyr Asn Thr
100 105 110
Gln Leu Leu Phe Asp Ala Asp Gly Thr Leu Ile Gln Arg Arg Arg Lys
115 120 125
Ile Thr Pro Thr His Tyr Glu Arg Met Val Trp Gly Gln Gly Asp Gly
130 135 140
Ser Gly Leu Arg Ala Ile Asp Ser Lys Val Ala Arg Ile Gly Gln Leu
145 150 155 160
Ala Cys Phe Glu His Tyr Asn Pro Leu Ala Arg Tyr Ala Met Met Ala
165 170 175
Asp Gly Glu Gln Ile His Ser Ala Met Phe Pro Gly Ser Met Phe Gly
180 185 190
Asp Asn Phe Ser Glu Lys Val Glu Ile Asn Ile Arg Gln His Ala Met
195 200 205
Glu Ser Gly Cys Phe Val Val Cys Ala Thr Ala Trp Leu Asp Ala Asp
210 215 220
Gln Gln Ala Gln Ile Met Lys Asp Thr Gly Cys Glu Ile Gly Pro Ile
225 230 235 240
Ser Gly Gly Cys Phe Thr Ala Ile Ala Ala Pro Asp Gly Ser Leu Ile
245 250 255
Gly Glu Pro Ile Arg Ser Gly Glu Gly Val Cys Ile Ala Asp Leu Asp
260 265 270
Phe Lys Leu Ile Asp Lys Arg Lys His Val Val Asp Thr Arg Gly His
275 280 285
Tyr Ser Arg Pro Glu Leu Leu Ser Leu Leu Ile Asp Arg Thr Pro Thr
290 295 300
Ala His Ile His Glu Arg Thr Glu Gln Pro Arg Ala Ala Ile Glu Lys
305 310 315 320
Glu Ser Gln Asp Val Phe Thr Ala Val Ala
325 330
<210>45
<211>996
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>45
gtgaaaccgc cgacgtcatt ccgcgtggcc gccgttcagg cctgtcccgt ctacctcgat 60
cgcgacctga cgatcggcaa ggcagaaggg ttgatcgccg aggcggctgg aaacggcgcg 120
aacctcatcg tgttccccga agcgttcgtg cctggctatc cggtgtgggt gtggttcatc 180
ccgcccggtc gcacggcgga tcttcgcgaa gcgtacagcg tcctccacgc caactcgatc 240
gcggtcccca gccagtcgac cgagcgtctg tgcgcggcgg cgcggcgcgc tggcgtcgcc 300
gtggcgattg gtgtcaacga aagaaacagc gaggccagcg gcggcagcct cttcaacacg 360
ctgctgtaca tcggaccgga cggcacgctg ctcggtaaac accgaaagct ggtgccaaca 420
ggcggagagc gtcttgtctg ggccagcggc gacggcagcg accttgccgt gttcacactg 480
cctttcgcgc gagtcggcgg actgatctgc tgggagaact acatgccgct cgcccgctac 540
gcgctggcgg cctggggtgc gcaaatccac gtggcgccga cctgggaccg cggcgagccg 600
tggctctcaa cactgcgtca tgtcgcgaag gaaggtagag ccgtgacgat cggctgctgt 660
caggccgtcc gcaaggaaga cattccggac gggctggcat tcaagtcccg atacctggcc 720
gacgtgggcg cctgggtcaa cccaggcggg agcgtcatcg tcgatcccga cggaaaaatt 780
cttgccggac ctgcgaacga aaccgaaggc atcttgtacg ctgacatcag ggccgatcag 840
ctcgtcgggc cgagatggca actcgacatt gccggacact acgcgcggcc ggacgtcttc 900
gagctgatcg tgcatcggcg ttcgacgccg atgattcgcg aggtctcggc gcctcgtcgt 960
cgcgcaagaa cgggaaagcg accgcgacgc cgctga 996
<210>46
<211>331
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>46
Val Lys Pro Pro Thr Ser Phe Arg Val Ala Ala Val Gln Ala Cys Pro
1 5 10 15
Val Tyr Leu Asp Arg Asp Leu Thr Ile Gly Lys Ala Glu Gly Leu Ile
20 25 30
Ala Glu Ala Ala Gly Asn Gly Ala Asn Leu Ile Val Phe Pro Glu Ala
35 40 45
Phe Val Pro Gly Tyr Pro Val Trp Val Trp Phe Ile Pro Pro Gly Arg
50 55 60
Thr Ala Asp Leu Arg Glu Ala Tyr Ser Val Leu His Ala Asn Ser Ile
65 70 75 80
Ala Val Pro Ser Gln Ser Thr Glu Arg Leu Cys Ala Ala Ala Arg Arg
85 90 95
Ala Gly Val Ala Val Ala Ile Gly Val Asn Glu Arg Asn Ser Glu Ala
100 105 110
Ser Gly Gly Ser Leu Phe Asn Thr Leu Leu Tyr Ile Gly Pro Asp Gly
115 120 125
Thr Leu Leu Gly Lys His Arg Lys Leu Val Pro Thr Gly Gly Glu Arg
130 135 140
Leu Val Trp Ala Ser Gly Asp Gly Ser Asp Leu Ala Val Phe Thr Leu
145 150 155 160
Pro Phe Ala Arg Val Gly Gly Leu Ile Cys Trp Glu Asn Tyr Met Pro
165 170 175
Leu Ala Arg Tyr Ala Leu Ala Ala Trp Gly Ala Gln Ile His Val Ala
180 185 190
Pro Thr Trp Asp Arg Gly Glu Pro Trp Leu Ser Thr Leu Arg His Val
195 200 205
Ala Lys Glu Gly Arg Ala Val Thr Ile Gly Cys Cys Gln Ala Val Arg
210 215 220
Lys Glu Asp Ile Pro Asp Gly Leu Ala Phe Lys Ser Arg Tyr Leu Ala
225 230 235 240
Asp Val Gly Ala Trp Val Asn Pro Gly Gly Ser Val Ile Val Asp Pro
245 250 255
Asp Gly Lys Ile Leu Ala Gly Pro Ala Asn Glu Thr Glu Gly Ile Leu
260 265 270
Tyr Ala Asp Ile Arg Ala Asp Gln Leu Val Gly Pro Arg Trp Gln Leu
275 280 285
Asp Ile Ala Gly His Tyr Ala Arg Pro Asp Val Phe Glu Leu Ile Val
290 295 300
His Arg Arg Ser Thr Pro Met Ile Arg Glu Val Ser Ala Pro Arg Arg
305 310 315 320
Arg Ala Arg Thr Gly Lys Arg Pro Arg Arg Arg
325 330
<210>47
<211>1014
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>47
gtgaaagaag caatcaaagt agcctgtgtg caagcagctc cagtctttct cgacctggac 60
gccacagtgg acaagaccgt cgccctgatt gaggaggcag cccgtaacgg cgcacgccta 120
atcgcctttc cagagacctg gattccaggc tacccatggt tcctttggct ggactcacca 180
gcctggggga tgcaattcgt gcgccgatac cacgagaact cactggtcct cgacagccct 240
caggccaagc gcatcagtga ggccgcccag cgcgccggta tatacgtcgc gctagggtac 300
agcgaacgcg tgagcggaac cctctacatg gggcagtggc tcattgacga taagggcgaa 360
acagctgggc tgcgccgaaa gctgaaacca acccatgtag agcgaaccct cttcggtgaa 420
ggcgacggat catccctttc cactttcgac acaccgttgg gggtgctggg cggactctgc 480
tgttgggaac acttacaacc tctttcgaaa tatgcgctct acgcacagaa cgaggaaata 540
cacttcgccg cctggcctag cttcagcatc taccgtcaag cgacagaagt ccttggacca 600
gaagtaaatg tcgcagcttc tcggatctac gccgtggaag ggcagtgttt tgttctcgct 660
tcctgcgcgc tcgtctcgcc agagatgatc gaaatgctct gcactgacga aagcaagcac 720
agccttcttc aggccggcgg cgggtactcc cgcattatcg gtcccgatgg cagcgaccta 780
gcgcgcccct tgggcgaaaa cgaggaaggt attctctatg ccactctgga ccctgccgct 840
cgaatctatg caaagaccgc agctgatcca gccgggcact actccagacc agacgtcact 900
cggctgctga tcaatcgcag tgccaatcag ccagtcgtag aggttggaag ggaaatacct 960
gcatcggccc aaggctttga agttgaggcg gcccccgggt acgaaggcga ttga 1014
<210>48
<211>337
<212>PRI
<213>未知
<220>
<223>从环境样品获得
<400>48
Val Lys Glu Ala Ile Lys Val Ala Cys Val Gln Ala Ala Pro Val Phe
1 5 10 15
Leu Asp Leu Asp Ala Thr Val Asp Lys Thr Val Ala Leu Ile Glu Glu
20 25 30
Ala Ala Arg Asn Gly Ala Arg Leu Ile Ala Phe Pro Glu Thr Trp Ile
35 40 45
Pro Gly Tyr Pro Trp Phe Leu Trp Leu Asp Ser Pro Ala Trp Gly Met
50 55 60
Gln Phe Val Arg Arg Tyr His Glu Asn Ser Leu Val Leu Asp Ser Pro
65 70 75 80
Gln Ala Lys Arg Ile Ser Glu Ala Ala Gln Arg Ala Gly Ile Tyr Val
85 90 95
Ala Leu Gly Tyr Ser Glu Arg Val Ser Gly Thr Leu Tyr Met Gly Gln
100 105 110
Trp Leu Ile Asp Asp Lys Gly Glu Thr Ala Gly Leu Arg Arg Lys Leu
115 120 125
Lys Pro Thr His Val Glu Arg Thr Leu Phe Gly Glu Gly Asp Gly Ser
130 135 140
Ser Leu Ser Thr Phe Asp Thr Pro Leu Gly Val Leu Gly Gly Leu Cys
145 150 155 160
Cys Trp Glu His Leu Gln Pro Leu Ser Lys Tyr Ala Leu Tyr Ala Gln
165 170 175
Asn Glu Glu Ile His Phe Ala Ala Trp Pro Ser Phe Ser Ile Tyr Arg
180 185 190
Gln Ala Thr Glu Val Leu Gly Pro Glu Val Asn Val Ala Ala Ser Arg
195 200 205
Ile Tyr Ala Val Glu Gly Gln Cys Phe Val Leu Ala Ser Cys Ala Leu
210 215 220
Val Ser Pro Glu Met Ile Glu Met Leu Cys Thr Asp Glu Ser Lys His
225 230 235 240
Ser Leu Leu Gln Ala Gly Gly Gly Tyr Ser Arg Ile Ile Gly Pro Asp
245 250 255
Gly Ser Asp Leu Ala Arg Pro Leu Gly Glu Asn Glu Glu Gly Ile Leu
260 265 270
Tyr Ala Thr Leu Asp Pro Ala Ala Arg Ile Tyr Ala Lys Thr Ala Ala
275 280 285
Asp Pro Ala Gly His Tyr Ser Arg Pro Asp Val Thr Arg Leu Leu Ile
290 295 300
Asn Arg Ser Ala Asn Gln Pro Val Val Glu Val Gly Arg Glu Ile Pro
305 310 315 320
Ala Ser Ala Gln Gly Phe Glu Val Glu Ala Ala Pro Gly Tyr Glu Gly
325 330 335
Asp
<210>49
<211>1038
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>49
atgaacaaag tcgtggctgc tgccgttcag tgcagcccgg tgctttactc ttgcgccgga 60
actgtaaata aaatttgcga gtggattgca gatttgggca aacaaggggt tgagctggcg 120
gtgttcgcgg aaaccctggt gccttactac ccgtattttt cttttatcca ggctccttgt 180
gcgatgggcg cgcaacattt gttgttgatg caagaatcag tagaggttcc ttccatctac 240
acgcaacaaa ttgccgctgc agcaaaagca gcgaagatgg tggtgtcagt tggtattaac 300
gaacgcgacg gcggttctat ttataacgcg caattattat ttgatgcggg cggtcagctt 360
gttcagcacc gccgaaaaat tacgccgaca tttcatgagc gcatggtgtg ggggcagggc 420
gatggctccg gtttgtgcgc agtggatacg gcagttggtc gtgttggttc gctcgcttgc 480
tgggaacatt acaacccact cgcgcgttac gcattgatgg cagatcgcga acaaattcac 540
gtgagtatgt ttcccggttc tttggtcggc gaaatttttg ccgagcaaat tgaagcaact 600
attcgtcacc acgcattgga gtccggttgc tttgtggtaa atgcgacggg ctggttaacg 660
ccggaacagc aagctcaaat cgtaaaagat actggtggtc ctatcgctgc cattagcggt 720
ggttgtttca ccgccattgt ttcaccggaa ggaaaattgc tcggcacgcc attgcgcagt 780
gattccgggg agggtgcctg tatcgccgaa ctggatttta atctcatcaa taagcgtaag 840
cgcatgatgg attctgtcgg ccattacagt cgtcctgaat tgctcagttt gctgattgat 900
aaaacaccga caagtcatac acatccgctt aaaaaacctt tggctcccag tgaaaaaaat 960
acgccagagg atatcgccac tggtttaaca ctggtcactc ccgtttcaaa tgcaaacctt 1020
ttcagcgcaa gcaactag 1038
<210>50
<211>345
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>50
Met Asn Lys Val Val Ala Ala Ala Val Gln Cys Ser Pro Val Leu Tyr
1 5 10 15
Ser Cys Ala Gly Thr Val Asn Lys Ile Cys Glu Trp Ile Ala Asp Leu
20 25 30
Gly Lys Gln Gly Val Glu Leu Ala Val Phe Ala Glu Thr Leu Val Pro
35 40 45
Tyr Tyr Pro Tyr Phe Ser Phe Ile Gln Ala Pro Cys Ala Met Gly Ala
50 55 60
Gln His Leu Leu Leu Met Gln Glu Ser Val Glu Val Pro Ser Ile Tyr
65 70 75 80
Thr Gln Gln Ile Ala Ala Ala Ala Lys Ala Ala Lys Met Val Val Ser
85 90 95
Val Gly Ile Asn Glu Arg Asp Gly Gly Ser Ile Tyr Asn Ala Gln Leu
100 105 110
Leu Phe Asp Ala Gly Gly Gln Leu Val Gln His Arg Arg Lys Ile Thr
115 120 125
Pro Thr Phe His Glu Arg Met Val Trp Gly Gln Gly Asp Gly Ser Gly
130 135 140
Leu Cys Ala Val Asp Thr Ala Val Gly Arg Val Gly Ser Leu Ala Cys
145 150 155 160
Trp Glu His Tyr Asn Pro Leu Ala Arg Tyr Ala Leu Met Ala Asp Arg
165 170 175
Glu Gln Ile His Val Ser Met Phe Pro Gly Ser Leu Val Gly Glu Ile
180 185 190
Phe Ala Glu Gln Ile Glu Ala Thr Ile Arg His His Ala Leu Glu Ser
195 200 205
Gly Cys Phe Val Val Asn Ala Thr Gly Trp Leu Thr Pro Glu Gln Gln
210 215 220
Ala Gln Ile Val Lys Asp Thr Gly Gly Pro Ile Ala Ala Ile Ser Gly
225 230 235 240
Gly Cys Phe Thr Ala Ile Val Ser Pro Glu Gly Lys Leu Leu Gly Thr
245 250 255
Pro Leu Arg Ser Asp Ser Gly Glu Gly Ala Cys Ile Ala Glu Leu Asp
260 265 270
Phe Asn Leu Ile Asn Lys Arg Lys Arg Met Met Asp Ser Val Gly His
275 280 285
Tyr Ser Arg Pro Glu Leu Leu Ser Leu Leu Ile Asp Lys Thr Pro Thr
290 295 300
Ser His Thr His Pro Leu Lys Lys Pro Leu Ala Pro Ser Glu Lys Asn
305 310 315 320
Thr Pro Glu Asp Ile Ala Thr Gly Leu Thr Leu Val Thr Pro Val Ser
325 330 335
Asn Ala Asn Leu Phe Ser Ala Ser Asn
340 345
<210>51
<211>897
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>51
gtgaacgtcc gcgtcgcggt ggtgcaggcc acgccggccg tgctcgacgg gccggcgtcg 60
gtgcggaagg cctgccgcct gatcggcgag gccgcggccg gcggcgcccg cctgatcgcc 120
ctgcccgagg gcttcgtgcc catcatgccg cgctcctgct gggggcacca cttcgcgctg 180
atcgcctcgc cgaagtcggc ggccctgcac cggcgcatct gggagaacgc cgtcgacgtc 240
ggcggcccgc tggcccgcga gctcggcgac gccgcgcgcc gcgcggacgc ctgggtggcc 300
atcggggtga acgagcgcga cgcccgccgg ccgggcacgc tctggaacac gctgctctgg 360
ttcgcgcccg acgggagcct ggcccggcgc caccgcaagc tcgtgcccac catgcacgag 420
cgcacgttct gggggcaggg cgcgggcgac gacctcgagg cgctggccgc ggacttcggc 480
cgcctgggcg gcctgatctg ctgggagaac ttcatgcccg ccgcgcgccg gcgcctgcac 540
cgggacgggg tcgacttcta cctggccccc acggcggacg accgggacat ctgggtcgcc 600
gcgatgcgca cgttcgcctt cgaggccggc gccttcgtcc tctcgccggt gcagtacctg 660
cggaccgccg acttcccgga ggacttcccg ctgcgcgagg agctcgccga ctgccccgag 720
gtccagttca ccggggggag cgtgatctgc gacccgtggg gcaacctcct ggcggggccg 780
gtccacgggg gcgaggagat cctctacgcc gactgcgatc tcgacctcgt cctcgaggcc 840
cgacgggtgc tcgacacggc cggccactac gaccgcccgg acctcgcctc ggcctga 897
<210>52
<211>298
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>52
Val Asn Val Arg Val Ala Val Val Gln Ala Thr Pro Ala Val Leu Asp
1 5 10 15
Gly Pro Ala Ser Val Arg Lys Ala Cys Arg Leu Ile Gly Glu Ala Ala
20 25 30
Ala Gly Gly Ala Arg Leu Ile Ala Leu Pro Glu Gly Phe Val Pro Ile
35 40 45
Met Pro Arg Ser Cys Trp Gly His His Phe Ala Leu Ile Ala Ser Pro
50 55 60
Lys Ser Ala Ala Leu His Arg Arg Ile Trp Glu Asn Ala Val Asp Val
65 70 75 80
Gly Gly Pro Leu Ala Arg Glu Leu Gly Asp Ala Ala Arg Arg Ala Asp
85 90 95
Ala Trp Val Ala Ile Gly Val Asn Glu Arg Asp Ala Arg Arg Pro Gly
100 105 110
Thr Leu Trp Asn Thr Leu Leu Trp Phe Ala Pro Asp Gly Ser Leu Ala
115 120 125
Arg Arg His Arg Lys Leu Val Pro Thr Met His Glu Arg Thr Phe Trp
130 135 140
Gly Gln Gly Ala Gly Asp Asp Leu Glu Ala Leu Ala Ala Asp Phe Gly
145 150 155 160
Arg Leu Gly Gly Leu Ile Cys Trp Glu Asn Phe Met Pro Ala Ala Arg
165 170 175
Arg Arg Leu His Arg Asp Gly Val Asp Phe Tyr Leu Ala Pro Thr Ala
180 185 190
Asp Asp Arg Asp Ile Trp Val Ala Ala Met Arg Thr Phe Ala Phe Glu
195 200 205
Ala Gly Ala Phe Val Leu Ser Pro Val Gln Tyr Leu Arg Thr Ala Asp
210 215 220
Phe Pro Glu Asp Phe Pro Leu Arg Glu Glu Leu Ala Asp Cys Pro Glu
225 230 235 240
Val Gln Phe Thr Gly Gly Ser Val Ile Cys Asp Pro Trp Gly Asn Leu
245 250 255
Leu Ala Gly Pro Val His Gly Gly Glu Glu Ile Leu Tyr Ala Asp Cys
260 265 270
Asp Leu Asp Leu Val Leu Glu Ala Arg Arg Val Leu Asp Thr Ala Gly
275 280 285
His Tyr Asp Arg Pro Asp Leu Ala Ser Ala
290 295
<210>53
<211>954
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>53
atggtcaatt caaagagtca gattaaaatc gcggcggtgc aggctgcccc ggtctttctg 60
gaacgggagg cgacgattga caaagcttgc cggttgattg cggaggcagg cgagcagggc 120
gcgaatctgg tggtcttccc tgagtcattc gtcccggctt atcccgattg ggtctgggcc 180
gttccggcag gtgaaacaac gctcctgaac acgctctatg ccgaactgct ggccaatgcc 240
gttgaaattc cgggtccggc gacagagcgg ctgagccagg cagccaacct ggccggggtt 300
tatgtcgcga ttggcttgac cgagcggaac atcgaggcca gtggggcgag cctgtacaat 360
actttgctct ttctcgactc agccggcggc atgttaggca agcatcgcaa actgatcccc 420
accggcggcg agcgcctggt ctgggctcag ggtgatggca gcactctggc ggtgtacgag 480
actaggtttg gaaaaatggg agggttgatt tgctgggaga attacatgcc cctggcccgt 540
tatgccttgt atgcctgggg gacgcagatt tacatcgcgg ccacctggga tcgaggcgag 600
ccgtggctgt caacgctgcg gcatatcgcc gcggaaggcc gggttgttgt cgtcggctgt 660
ggcatggccc tgcgcaaagc cgacctgccc gaccgctttg aactcaagca gcgattttac 720
cagaacgccg atgagtggat caatgtcggc gacagcgcga ttgttaaccc tgatggtgaa 780
ttcatcgccg ggccgctgcg cgagcaggaa ggcatcctct atgctgagat tgatctggcc 840
cagatgcgcg gccccaaatg gatgctcgac gtggccggcc attacgctcg cccggatgtg 900
tttgaactca tcgttcatcg ggaggcgcgg cccatgattg cgctaatttc atga 954
<210>54
<211>317
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>54
Met Val Asn Ser Lys Ser Gln Ile Lys Ile Ala Ala Val Gln Ala Ala
1 5 10 15
Pro Val Phe Leu Glu Arg Glu Ala Thr Ile Asp Lys Ala Cys Arg Leu
20 25 30
Ile Ala Glu Ala Gly Glu Gln Gly Ala Asn Leu Val Val Phe Pro Glu
35 40 45
Ser Phe Val Pro Ala Tyr Pro Asp Trp Val Trp Ala Val Pro Ala Gly
50 55 60
Glu Thr Thr Leu Leu Ash Thr Leu Tyr Ala Glu Leu Leu Ala Asn Ala
65 70 75 80
Val Glu Ile Pro Gly Pro Ala Thr Glu Arg Leu Ser Gln Ala Ala Asn
85 90 95
Leu Ala Gly Val Tyr Val Ala Ile Gly Leu Thr Glu Arg Asn Ile Glu
100 105 110
Ala Ser Gly Ala Ser Leu Tyr Asn Thr Leu Leu Phe Leu Asp Ser Ala
115 120 125
Gly Gly Met Leu Gly Lys His Arg Lys Leu Ile Pro Thr Gly Gly Glu
130 135 140
Arg Leu Val Trp Ala Gln Gly Asp Gly Ser Thr Leu Ala Val Tyr Glu
145 150 155 160
Thr Arg Phe Gly Lys Met Gly Gly Leu Ile Cys Trp Glu Asn Tyr Met
165 170 175
Pro Leu Ala Arg Tyr Ala Leu Tyr Ala Trp Gly Thr Gln Ile Tyr Ile
180 185 190
Ala Ala Thr Trp Asp Arg Gly Glu Pro Trp Leu Ser Thr Leu Arg His
195 200 205
Ile Ala Ala Glu Gly Arg Val Val Val Val Gly Cys Gly Met Ala Leu
210 215 220
Arg Lys Ala Asp Leu Pro Asp Arg Phe Glu Leu Lys Gln Arg Phe Tyr
225 230 235 240
Gln Asn Ala Asp Glu Trp Ile Asn Val Gly Asp Ser Ala Ile Val Asn
245 250 255
Pro Asp Gly Glu Phe Ile Ala Gly Pro Leu Arg Glu Gln Glu Gly Ile
260 265 270
Leu Tyr Ala Glu Ile Asp Leu Ala Gln Met Arg Gly Pro Lys Trp Met
275 280 285
Leu Asp Val Ala Gly His Tyr Ala Arg Pro Asp Val Phe Glu Leu Ile
290 295 300
Val His Arg Glu Ala Arg Pro Met Ile Ala Leu Ile Ser
305 310 315
<210>55
<211>1017
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>55
atgggtatcg aacatccgaa gtacaaggtc gccgtggtgc aggcagctcc cgcctggctc 60
gatctcgacg cgtcgatcga caagtcgatc gcgctgatcg aggaggcggc ccaaaaaggc 120
gccaagctga tcgcattccc cgaggccttc attcccggct atccctggca catctggatg 180
gactcgccgg cctgggcgat cggccgcggc tttgtgcagc gctattttga caattcgctc 240
gcctatgaca gcccgcaggc cgagaagctg cgcgcggcgg tgcgcaaggc aaagctcacc 300
gccgtgctcg ggctgtccga gcgcgacggc ggcagtctct atctggcgca atggttgatc 360
gggcccgatg gcgagaccat cgcaaaacgc cgcaagctgc ggccgacaca tgccgagcgc 420
acggtgtacg gcgagggcga cggcagcgat ctcgcagtcc acaaccgtcc cgatatcggc 480
cgcctcggcg cgctctgctg ctgggagcat ttgcagccac tgtcgaaata cgcgatgtac 540
gcgcagaacg agcaggtgca tgtcgcggcc tggccgagct tttcgctcta cgatcccttt 600
gcggtggcgc tcggcgccga ggtgaacaac gcggcctcgc gcgtctatgc agtcgaaggc 660
tcctgcttcg tgctggcgcc atgcgccacc gtctcgcagg ccatgatcga cgagctctgc 720
gaccgaccgg acaagcatac gctgctgcat gtcggcggcg gttttgccgc gatctatggt 780
cctgacggca gccagatcgg cgacaagctc gcgcccgacc aggaagggct gttgatcgcg 840
gagatcgacc ttggggccat tggcgtcgcc aagaacgcgg ccgatcccgc cgggcattat 900
tcgcggcccg acgtgacgcg gctcctgctc aacaagaaac cgtacaagcg cgtcgagcag 960
ttctcgccac cggccgaggc ggtcgagccc acagatatcg cagcggcggc aagctga 1017
<210>56
<211>338
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>56
Met Gly Ile Glu His Pro Lys Tyr Lys Val Ala Val Val Gln Ala Ala
1 5 10 15
Pro Ala Trp Leu Asp Leu Asp Ala Ser Ile Asp Lys Ser Ile Ala Leu
20 25 30
Ile Glu Glu Ala Ala Gln Lys Gly Ala Lys Leu Ile Ala Phe Pro Glu
35 40 45
Ala Phe Ile Pro Gly Tyr Pro Trp His Ile Trp Met Asp Ser Pro Ala
50 55 60
Trp Ala Ile Gly Arg Gly Phe Val Gln Arg Tyr Phe Asp Asn Ser Leu
65 70 75 80
Ala Tyr Asp Ser Pro Gln Ala Glu Lys Leu Arg Ala Ala Val Arg Lys
85 90 95
Ala Lys Leu Thr Ala Val Leu Gly Leu Ser Glu Arg Asp Gly Gly Ser
100 105 110
Leu Tyr Leu Ala Gln Trp Leu Ile Gly Pro Asp Gly Glu Thr Ile Ala
115 120 125
Lys Arg Arg Lys Leu Arg Pro Thr His Ala Glu Arg Thr Val Tyr Gly
130 135 140
Glu Gly Asp Gly Ser Asp Leu Ala Val His Asn Arg Pro Asp Ile Gly
145 150 155 160
Arg Leu Gly Ala Leu Cys Cys Trp Glu His Leu Gln Pro Leu Ser Lys
165 170 175
Tyr Ala Met Tyr Ala Gln Asn Glu Gln Val His Val Ala Ala Trp Pro
180 185 190
Ser Phe Ser Leu Tyr Asp Pro Phe Ala Val Ala Leu Gly Ala Glu Val
195 200 205
Asn Asn Ala Ala Ser Arg Val Tyr Ala Val Glu Gly Ser Cys Phe Val
210 215 220
Leu Ala Pro Cys Ala Thr Val Ser Gln Ala Met Ile Asp Glu Leu Cys
225 230 235 240
Asp Arg Pro Asp Lys His Thr Leu Leu His Val Gly Gly Gly Phe Ala
245 250 255
Ala Ile Tyr Gly Pro Asp Gly Ser Gln Ile Gly Asp Lys Leu Ala Pro
260 265 270
Asp Gln Glu Gly Leu Leu Ile Ala Glu Ile Asp Leu Gly Ala Ile Gly
275 280 285
Val Ala Lys Asn Ala Ala Asp Pro Ala Gly His Tyr Ser Arg Pro Asp
290 295 300
Val Thr Arg Leu Leu Leu Asn Lys Lys Pro Tyr Lys Arg Val Glu Gln
305 310 315 320
Phe Ser Pro Pro Ala Glu Ala Val Glu Pro Thr Asp Ile Ala Ala Ala
325 330 335
Ala Ser
<210>57
<211>1014
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>57
gtgaaagaag caatcaaagt agcctgtgtg caagcagctc cagtctttct cgacctggac 60
gccacagtgg acaagaccgt cgccctgatt gaggaggcag cccgtaacgg cgcacgccta 120
atcgcctttc cagagacctg gattccaggc tacccatggt tcctttggct ggactcacca 180
gcctggggga tgcaattcgt gcgccgatac cacgagaact cactggtcct cgacagccct 240
caggccaagc gcatcagtga ggccgcccag cgcgccggta tatacgtcgc gctagggtac 300
agcgaacgcg tgagcggaac cctctacatg gggcagtggc tcattgacga taagggcgaa 360
acagctgggc tgcgccgaaa gctgaaacca acccatgtag agcgaaccct cttcggtgaa 420
ggcgacggat catccctttc cactttcgac acaccgttgg gggtgctggg cggactctgc 480
tgttgggaac acttacaacc tctttcgaaa tatgcgctct acgcacagaa cgaggaaata 540
cacttcgccg cctggcctag cttcagcatc taccgtcaag cgacagaagt ccttggacca 600
gaagtaaatg tcgcagcttc tcggatctac gccgtggaag ggcagtgttt tgttctcgct 660
tcctgcgcgc tcgtctcgcc agagatgatc gaaatgctct gcactgacga aagcaagcac 720
agccttcttc aggccggcgg cgggtactcc cgcattatcg gtcccgatgg cagcgaccta 780
gcgcgcccct tgggcgaaaa cgaggaaggt attctctatg ccactctgga ccctgccgct 840
cgaatctatg caaagaccgc agctgatcca gccgggcact actccagacc agacgtcact 900
cggctgctga tcaatcgcag tgccaatcag ccagtcgtag aggttggacg ggaaatacct 960
gcatcggccc aaggctttga agttgaggcg gcccccgggt acggaggcga ttga 1014
<210>58
<211>337
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>58
Val Lys Glu Ala Ile Lys Val Ala Cys Val Gln Ala Ala Pro Val Phe
1 5 10 15
Leu Asp Leu Asp Ala Thr Val Asp Lys Thr Val Ala Leu Ile Glu Glu
20 25 30
Ala Ala Arg Asn Gly Ala Arg Leu Ile Ala Phe Pro Glu Thr Trp Ile
35 40 45
Pro Gly Tyr Pro Trp Phe Leu Trp Leu Asp Ser Pro Ala Trp Gly Met
50 55 60
Gln Phe Val Arg Arg Tyr His Glu Asn Ser Leu Val Leu Asp Ser Pro
65 70 75 80
Gln Ala Lys Arg Ile Ser Glu Ala Ala Gln Arg Ala Gly Ile Tyr Val
85 90 95
Ala Leu Gly Tyr Ser Glu Arg Val Ser Gly Thr Leu Tyr Met Gly Gln
100 105 110
Trp Leu Ile Asp Asp Lys Gly Glu Thr Ala Gly Leu Arg Arg Lys Leu
115 120 125
Lys Pro Thr His Val Glu Arg Thr Leu Phe Gly Glu Gly Asp Gly Ser
130 135 140
Ser Leu Ser Thr Phe Asp Thr Pro Leu Gly Val Leu Gly Gly Leu Cys
145 150 155 160
Cys Trp Glu His Leu Gln Pro Leu Ser Lys Tyr Ala Leu Tyr Ala Gln
165 170 175
Asn Glu Glu Ile His Phe Ala Ala Trp Pro Ser Phe Ser Ile Tyr Arg
180 185 190
Gln Ala Thr Glu Val Leu Gly Pro Glu Val Asn Val Ala Ala Ser Arg
195 200 205
Ile Tyr Ala Val Glu Gly Gln Cys Phe Val Leu Ala Ser Cys Ala Leu
210 215 220
Val Ser Pro Glu Met Ile Glu Met Leu Cys Thr Asp Glu Ser Lys His
225 230 235 240
Ser Leu Leu Gln Ala Gly Gly Gly Tyr Ser Arg Ile Ile Gly Pro Asp
245 250 255
Gly Ser Asp Leu Ala Arg Pro Leu Gly Glu Asn Glu Glu Gly Ile Leu
260 265 270
Tyr Ala Thr Leu Asp Pro Ala Ala Arg Ile Tyr Ala Lys Thr Ala Ala
275 280 285
Asp Pro Ala Gly His Tyr Ser Arg Pro Asp Val Thr Arg Leu Leu Ile
290 295 300
Asn Arg Ser Ala Asn Gln Pro Val Val Glu Val Gly Arg Glu Ile Pro
305 310 315 320
Ala Ser Ala Gln Gly Phe Glu Val Glu Ala Ala Pro Gly Tyr Gly Gly
325 330 335
Asp
<210>59
<211>987
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>59
atgcgagata ggaatttcaa actggcggcc attcaggcgg agccggtttt ctttaatcgc 60
cgggcctcga cggaaaaggc ctgcagattg atcaaagaag cgggcgcgat gggcgccgat 120
atcgcgggat tcagcgagac ctggcttccc gggtatccct tttttatctg gggcgcaagc 180
gccgatccat ccctgctctg gaaggcttct gcggaatacc tggccaatgc cgttcaaata 240
cccggtcccg agacggatca attatgcgag gcggcgaaaa aggccggcat cgatgtggcg 300
atcggagtgg ttgaactcga cgagtttacg aagggaacgg cttactgcac gctgctcttc 360
atcggcaaag aagggaagat cctgggaaag caccgcaaac tcaagccgac gcaccgggag 420
cgcacggtat ggggagaggg cgatgcgacg ggactcagtg tccatgagcg tccttacggg 480
cggatcagcg gcctgaactg ctgggagcat aatatggtcc tgcccggcta tgtcctgatg 540
tctcagggca cgcacattca tatcgcggcc tggccgggtt cggaagggaa agcacctccc 600
gcgccgtctc cgatgtggga gcgccagctt ctgctctccc gcgctttcgc ttcgcaatcc 660
gccgcatacg tgattctggt cggaggactc ctgaacccgc agaatattcc ggcgccctac 720
gatgaacttg ccgtcaagta ccggggagac agtttcatca tcgatccgcg cggggagatc 780
atcgccgggc cggccaaggg ggaaaccatt ctcatcgccg aaggctcgat ggaacaggtc 840
ctcgcggcaa agtccgcctt cgatgtcgcg ggacattatt cccgccccga cgtctttcaa 900
ctctgcgtca accgcaaacc gtaccggcgt gtaagggaaa cttcggagca ggaccaaccc 960
gcttctgaaa gagaatcgga atcgtaa 987
<210>60
<211>328
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>60
Met Arg Asp Arg Asn Phe Lys Leu Ala Ala Ile Gln Ala Glu Pro Val
1 5 10 15
Phe Phe Asn Arg Arg Ala Ser Thr Glu Lys Ala Cys Arg Leu Ile Lys
20 25 30
Glu Ala Gly Ala Met Gly Ala Asp Ile Ala Gly Phe Ser Glu Thr Trp
35 40 45
Leu Pro Gly Tyr Pro Phe Phe Ile Trp Gly Ala Ser Ala Asp Pro Ser
50 55 60
Leu Leu Trp Lys Ala Ser Ala Glu Tyr Leu Ala Asn Ala Val Gln Ile
65 70 75 80
Pro Gly Pro Glu Thr Asp Gln Leu Cys Glu Ala Ala Lys Lys Ala Gly
85 90 95
Ile Asp Val Ala Ile Gly Val Val Glu Leu Asp Glu Phe Thr Lys Gly
100 105 110
Thr Ala Tyr Cys Thr Leu Leu Phe Ile Gly Lys Glu Gly Lys Ile Leu
115 120 125
Gly Lys His Arg Lys Leu Lys Pro Thr His Arg Glu Arg Thr Val Trp
130 135 140
Gly Glu Gly Asp Ala Thr Gly Leu Ser Val His Glu Arg Pro Tyr Gly
145 150 155 160
Arg Ile Ser Gly Leu Asn Cys Trp Glu His Asn Met Val Leu Pro Gly
165 170 175
Tyr Val Leu Met Ser Gln Gly Thr His Ile His Ile Ala Ala Trp Pro
180 185 190
Gly Ser Glu Gly Lys Ala Pro Pro Ala Pro Ser Pro Met Trp Glu Arg
195 200 205
Gln Leu Leu Leu Ser Arg Ala Phe Ala Ser Gln Ser Ala Ala Tyr Val
210 215 220
Ile Leu Val Gly Gly Leu Leu Asn Pro Gln Asn Ile Pro Ala Pro Tyr
225 230 235 240
Asp Glu Leu Ala Val Lys Tyr Arg Gly Asp Ser Phe Ile Ile Asp Pro
245 250 255
Arg Gly Glu Ile Ile Ala Gly Pro Ala Lys Gly Glu Thr Ile Leu Ile
260 265 270
Ala Glu Gly Ser Met Glu Gln Val Leu Ala Ala Lys Ser Ala Phe Asp
275 280 285
Val Ala Gly His Tyr Ser Arg Pro Asp Val Phe Gln Leu Cys Val Asn
290 295 300
Arg Lys Pro Tyr Arg Arg Val Arg Glu Thr Ser Glu Gln Asp Gln Pro
305 310 315 320
Ala Ser Glu Arg Glu Ser Glu Ser
325
<210>61
<211>966
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>61
atgactcgat cttacccgaa tgacacactc acggttgggc ttgcgcaaat tgctccagtc 60
tggttggatc gtacagggac aatttcaaag atattagctc aagtccatgc ggcaaatgaa 120
gcgggctgtc atcttgtcgc gtttggcgaa ggtctcctcc ccggatatcc gttttggatt 180
gagcgaacaa atggtgcagt cttcaactcg cccacgcaga aagaaattca cgcgcattat 240
ctggatcagg ctgtccagat cgaagcaggt catcttgagg cgctttgcga agcagccaag 300
gaatatgaga tcgcaattgt cctgggatgc attgaacgtc cgcaagatcg tggagggcac 360
agtctgtatg caagccttgt atatattgat tcagacggca tcatccaatc tgtgcatcga 420
aagttaatgc caacatatga agaacggctc acctggtcgc caggtgacgg acatggatta 480
cgggtgcaca aattaggtgc ctttacggtt ggcggcctca actgttggga aaactggatg 540
cctttggcac gcgcggccat gtatggtcaa ggcgaggatt tgcatattgc catttggccc 600
ggcggctccc acaatacgca agacattaca cgctttattg cactagaatc gcgttcctat 660
gttttatctg tgtcaggttt aatgcgctca ggcgattttc caaaagagac cccacatctt 720
gcatccatcc tggctaaagg tgaggatatt cttgccaacg gtggttcatg tatcgccggt 780
cctgacggca aatggatcgt tgagccgctt gtaggagaag agaagttaat tgttgcaacg 840
attgatcatt gtcgtgtgcg cgaagagcgt caaaattttg atccttcagg acattacagc 900
aggccagatg tattgcaact gaaaataaac aggcaacgcc agagtacaat ctcgtttgga 960
gagtaa 966
<210>62
<211>321
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>62
Met Thr Arg Ser Tyr Pro Asn Asp Thr Leu Thr Val Gly Leu Ala Gln
1 5 10 15
Ile Ala Pro Val Trp Leu Asp Arg Thr Gly Thr Ile Ser Lys Ile Leu
20 25 30
Ala Gln Val His Ala Ala Asn Glu Ala Gly Cys His Leu Val Ala Phe
35 40 45
Gly Glu Gly Leu Leu Pro Gly Tyr Pro Phe Trp Ile Glu Arg Thr Asn
50 55 60
Gly Ala Val Phe Asn Ser Pro Thr Gln Lys Glu Ile His Ala His Tyr
65 70 75 80
Leu Asp Gln Ala Val Gln Ile Glu Ala Gly His Leu Glu Ala Leu Cys
85 90 95
Glu Ala Ala Lys Glu Tyr Glu Ile Ala Ile Val Leu Gly Cys Ile Glu
100 105 110
Arg Pro Gln Asp Arg Gly Gly His Ser Leu Tyr Ala Ser Leu Val Tyr
115 120 125
Ile Asp Ser Asp Gly Ile Ile Gln Ser Val His Arg Lys Leu Met Pro
130 135 140
Thr Tyr Glu Glu Arg Leu Thr Trp Ser Pro Gly Asp Gly His Gly Leu
145 150 155 160
Arg Val His Lys Leu Gly Ala Phe Thr Val Gly Gly Leu Asn Cys Trp
165 170 175
Glu Asn Trp Met Pro Leu Ala Arg Ala Ala Met Tyr Gly Gln Gly Glu
180 185 190
Asp Leu His Ile Ala Ile Trp Pro Gly Gly Ser His Asn Thr Gln Asp
195 200 205
Ile Thr Arg Phe Ile Ala Leu Glu Ser Arg Ser Tyr Val Leu Ser Val
210 215 220
Ser Gly Leu Met Arg Ser Gly Asp Phe Pro Lys Glu Thr Pro His Leu
225 230 235 240
Ala Ser Ile Leu Ala Lys Gly Glu Asp Ile Leu Ala Asn Gly Gly Ser
245 250 255
Cys Ile Ala Gly Pro Asp Gly Lys Trp Ile Val Glu Pro Leu Val Gly
260 265 270
Glu Glu Lys Leu Ile Val Ala Thr Ile Asp His Cys Arg Val Arg Glu
275 280 285
Glu Arg Gln Asn Phe Asp Pro Ser Gly His Tyr Ser Arg Pro Asp Val
290 295 300
Leu Gln Leu Lys Ile Asn Arg Gln Arg Gln Ser Thr Ile Ser Phe Gly
305 310 315 320
Glu
<210>63
<211>978
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>63
atgcaagata gagtaccgat tgtacgagct gcggctatcc aggctgaacc catagtcctt 60
gattgtgacg cgaccgtgga aaaagcctgc cgattgatcg gtgaagcagc agaaaatggt 120
gcaaacctga tcgtgtttcc cgaagccttc attcccgttt atcccaatgc ggcgatctgg 180
ggtcgaggtc tggccacttt tggcggacag cgccagaaat acgtatggac gcgactatgg 240
aacaattcgg tggaaatccc tggtccggcc accgacaggc tggcaaaggc agcacacgag 300
gctcgagcca ccgttgtcat gggattgaat gagcgcgcgg tcgataacaa cacgctttac 360
aacaccctgc tatttattgg gccagacggt cgcttgctgg gcaagcaccg taagctcatg 420
cccaccaatc acgaacggat gatctggggt atgggagatg ggagcaccct gcgggttttt 480
gatacaccct gtggaaaagt aggcggtctc atctgctggg aaaactacat gcctctggcg 540
cgttatgcac tctatggaca gggcgaacaa atccatgtcg cgccgactgc gcacgatggt 600
gagatcactc tggtcaatgc acgcaatacc gcctatgagg gacgcttatt cgtcatctcc 660
gtgtgcatga tccttcgcaa gtccagcttt ccccatgatt ttgagctggg cgaggaattg 720
gcggaggcag atgacttcat aaaatcaggc ggcagcgcga tcgttgggcc agatggcgag 780
gtgctggcgg gtccattgtg gaatgaagag aatatactgt atgccgatct tgacttgaat 840
cgaattgtgg atgagagacg agtatttgat gtgacgggcc attattcacg tccagatgtt 900
ctacgactgc actttaatgc ttcccctcag aaaactattg aaagatatga gcaacctctc 960
gatccgtctg agggttaa 978
<210>64
<211>325
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>64
Met Gln Asp Arg Val Pro Ile Val Arg Ala Ala Ala Ile Gln Ala Glu
1 5 10 15
Pro Ile Val Leu Asp Cys Asp Ala Thr Val Glu Lys Ala Cys Arg Leu
20 25 30
Ile Gly Glu Ala Ala Glu Asn Gly Ala Asn Leu Ile Val Phe Pro Glu
35 40 45
Ala Phe Ile Pro Val Tyr Pro Asn Ala Ala Ile Trp Gly Arg Gly Leu
50 55 60
Ala Thr Phe Gly Gly Gln Arg Gln Lys Tyr Val Trp Thr Arg Leu Trp
65 70 75 80
Asn Asn Ser Val Glu Ile Pro Gly Pro Ala Thr Asp Arg Leu Ala Lys
85 90 95
Ala Ala His Glu Ala Arg Ala Thr Val Val Met Gly Leu Asn Glu Arg
100 105 110
Ala Val Asp Asn Asn Thr Leu Tyr Asn Thr Leu Leu Phe Ile Gly Pro
115 120 125
Asp Gly Arg Leu Leu Gly Lys His Arg Lys Leu Met Pro Thr Asn His
130 135 140
Glu Arg Met Ile Trp Gly Met Gly Asp Gly Ser Thr Leu Arg Val Phe
145 150 155 160
Asp Thr Pro Cys Gly Lys Val Gly Gly Leu Ile Cys Trp Glu Asn Tyr
165 170 175
Met Pro Leu Ala Arg Tyr Ala Leu Tyr Gly Gln Gly Glu Gln Ile His
180 185 190
Val Ala Pro Thr Ala His Asp Gly Glu Ile Thr Leu Val Asn Ala Arg
195 200 205
Asn Thr Ala Tyr Glu Gly Arg Leu Phe Val Ile Ser Val Cys Met Ile
210 215 220
Leu Arg Lys Ser Ser Phe Pro His Asp Phe Glu Leu Gly Glu Glu Leu
225 230 235 240
Ala Glu Ala Asp Asp Phe Ile Lys Ser Gly Gly Ser Ala Ile Val Gly
245 250 255
Pro Asp Gly Glu Val Leu Ala Gly Pro Leu Trp Asn Glu Glu Asn Ile
260 265 270
Leu Tyr Ala Asp Leu Asp Leu Asn Arg Ile Val Asp Glu Arg Arg Val
275 280 285
Phe Asp Val Thr Gly His Tyr Ser Arg Pro Asp Val Leu Arg Leu His
290 295 300
Phe Asn Ala Ser Pro Gln Lys Thr Ile Glu Arg Tyr Glu Gln Pro Leu
305 310 315 320
Asp Pro Ser Glu Gly
325
<210>65
<211>1002
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>65
atgccgaccc ccacttcaaa attcaaaatc ggcgccgtgc aggcatcgcc ggtttttctg 60
gaccgggaag ccactgcgca aaaagcctgc aaattgattg ccgaagcggg agggcagggc 120
gcgcggctga tcgttttccc ggagtctttc attcccacct atcctgattg ggtctgggcg 180
gtcccgccgg gaagaggaaa agtgttaagc gaactttacg ccgagctgct ggccaatgcc 240
gtggaagtcc ccgggccggt caccgatcag ctgggtgaag cagcccaaaa aacgggcgcc 300
tatgtcgtca tgggcgtcac ggaaaaggac accgacgcaa gcggcgcgag cctttacaac 360
acgctcctct atttcaaccc cgcgggggac ctcctgggaa aacaccggaa gcttgttcct 420
accggcgggg agcggctggt ctgggcgcag ggcgacggca gcaccctgga agtgtacgac 480
actcccctgg gaaaaatcgg aggcctcatc tgctgggaaa actacatgcc cctcgcccgg 540
tacacgatgt atgcctgggg gacccagatt tatatcgcgg ccacatggga ccagggggag 600
acgtggcttg ccaccctgcg gcatatcgct aaggaaggac gggtgtacgt catcggctgc 660
tgcatcgcgc tgcggcggga cgacatcccc gaccggctgg aatacaagaa gaagttctac 720
tcggggtcgc gggaatggat caatatgggg gacagcgcca tcgtgaaccc ggaaggcgaa 780
ttcattgccg gccccgtgcg gatgaaggag gagatcctgt atgccgaggt ggaccccctc 840
ctgatggcgg gatcgaaatg gatgctcgac gtcgcggggc attacgcgcg ccccgacgtc 900
tttgaactca tcgtccaccg ccagccccac ccgatgatcc gggtaatcga gaaagaggga 960
ggggccggaa gaaccgggga cgagaagaag gaaaatgagt ga 1002
<210>66
<211>333
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>66
Met Pro Thr Pro Thr Ser Lys Phe Lys Ile Gly Ala Val Gln Ala Ser
1 5 10 15
Pro Val Phe Leu Asp Arg Glu Ala Thr Ala Gln Lys Ala Cys Lys Leu
20 25 30
Ile Ala Glu Ala Gly Gly Gln Gly Ala Arg Leu Ile Val Phe Pro Glu
35 40 45
Ser Phe Ile Pro Thr Tyr Pro Asp Trp Val Trp Ala Val Pro Pro Gly
50 55 60
Arg Gly Lys Val Leu Ser Glu Leu Tyr Ala Glu Leu Leu Ala Asn Ala
65 70 75 80
Val Glu Val Pro Gly Pro Val Thr Asp Gln Leu Gly Glu Ala Ala Gln
85 90 95
Lys Thr Gly Ala Tyr Val Val Met Gly Val Thr Glu Lys Asp Thr Asp
100 105 110
Ala Ser Gly Ala Ser Leu Tyr Asn Thr Leu Leu Tyr Phe Asn Pro Ala
115 120 125
Gly Asp Leu Leu Gly Lys His Arg Lys Leu Val Pro Thr Gly Gly Glu
130 135 140
Arg Leu Val Trp Ala Gln Gly Asp Gly Ser Thr Leu Glu Val Tyr Asp
145 150 155 160
Thr Pro Leu Gly Lys Ile Gly Gly Leu Ile Cys Trp Glu Asn Tyr Met
165 170 175
Pro Leu Ala Arg Tyr Thr Met Tyr Ala Trp Gly Thr Gln Ile Tyr Ile
180 185 190
Ala Ala Thr Trp Asp Gln Gly Glu Thr Trp Leu Ala Thr Leu Arg His
195 200 205
Ile Ala Lys Glu Gly Arg Val Tyr Val Ile Gly Cys Cys Ile Ala Leu
210 215 220
Arg Arg Asp Asp Ile Pro Asp Arg Leu Glu Tyr Lys Lys Lys Phe Tyr
225 230 235 240
Ser Gly Ser Arg Glu Trp Ile Asn Met Gly Asp Ser Ala Ile Val Asn
245 250 255
Pro Glu Gly Glu Phe Ile Ala Gly Pro Val Arg Met Lys Glu Glu Ile
260 265 270
Leu Tyr Ala Glu Val Asp Pro Leu Leu Met Ala Gly Ser Lys Trp Met
275 280 285
Leu Asp Val Ala Gly His Tyr Ala Arg Pro Asp Val Phe Glu Leu Ile
290 295 300
Val His Arg Gln Pro His Pro Met Ile Arg Val Ile Glu Lys Glu Gly
305 310 315 320
Gly Ala Gly Arg Thr Gly Asp Glu Lys Lys Glu Asn Glu
325 330
<210>67
<211>936
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>67
atgccccgtg tggcggtggt ccagcgcccg ccggtgtttc tcgaccgcgc cgcgaccctc 60
gagaacgccg tggcttcgct cgccgaggcc gcgtcgaacg gggctcgcct cgcggtcttt 120
ccggaagccc tggttcccgg ctatccggcg tggatgtggc ggctgcggcc cgggcccgac 180
atggcgctca ccgagcggat tcacgcgcgc ttgcgggcga actcggtgag cctcgccgcc 240
gacgagctcg cgccgctgcg cgaggcggcc cggcgccacg agctcaccgt agtgtgcggc 300
ctgcacgagc gcgacgaggc gctcggcggc ggcacgctct ataacaccgt cgtcacgatc 360
ggcgccgacg gcgcggtgct caaccgccac cggaagctga tgcccaccaa ccccgagcgc 420
atggtctggg gctgcggcga tgccagcggg ctcaggacgg tccccaccca gtgcgggcgc 480
gtcggcgccc tgatctgctg ggaaagctac atgccgcttg cacgctacgc gctgtacgcc 540
cagggaatcg acctctacgt cacgccgacc tacgacagcg gcgagcgggc ggttgcgacc 600
atgcagcaca ttgcccgcga aggcggctgc tgggtggtga gctgcggctc ggcgtttcag 660
gcgcgcgacg tcccggacgc gtttccgggg aagagcgagc ttttccgcga caacgacgag 720
tggatcaacc cgggcgactc ggtcgtggtc gcgccgggcg gcaaggtcgt cgccgggccg 780
ctgcacaaag aacgcgcgat cctgtacgcc gagatcgacc tcgagcgggt cggcgtggcg 840
cgccgcagcc tggacgtggt cggccattat gcgcggcccg acctcttcga cctgcacgtg 900
aacgcccgcc cgcaaagcgt ggttgaattg cgctga 936
<210>68
<211>311
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>68
Met Pro Arg Val Ala Val Val Gln Arg Pro Pro Val Phe Leu Asp Arg
1 5 10 15
Ala Ala Thr Leu Glu Asn Ala Val Ala Ser Leu Ala Glu Ala Ala Ser
20 25 30
Asn Gly Ala Arg Leu Ala Val Phe Pro Glu Ala Leu Val Pro Gly Tyr
35 40 45
Pro Ala Trp Met Trp Arg Leu Arg Pro Gly Pro Asp Met Ala Leu Thr
50 55 60
Glu Arg Ile His Ala Arg Leu Arg Ala Asn Ser Val Ser Leu Ala Ala
65 70 75 80
Asp Glu Leu Ala Pro Leu Arg Glu Ala Ala Arg Arg His Glu Leu Thr
85 90 95
Val Val Cys Gly Leu His Glu Arg Asp Glu Ala Leu Gly Gly Gly Thr
100 105 110
Leu Tyr Asn Thr Val Val Thr Ile Gly Ala Asp Gly Ala Val Leu Asn
115 120 125
Arg His Arg Lys Leu Met Pro Thr Asn Pro Glu Arg Met Val Trp Gly
130 135 140
Cys Gly Asp Ala Ser Gly Leu Arg Thr Val Pro Thr Gln Cys Gly Arg
145 150 155 160
Val Gly Ala Leu Ile Cys Trp Glu Ser Tyr Met Pro Leu Ala Arg Tyr
165 170 175
Ala Leu Tyr Ala Gln Gly Ile Asp Leu Tyr Val Thr Pro Thr Tyr Asp
180 185 190
Ser Gly Glu Arg Ala Val Ala Thr Met Gln His Ile Ala Arg Glu Gly
195 200 205
Gly Cys Trp Val Val Ser Cys Gly Ser Ala Phe Gln Ala Arg Asp Val
210 215 220
Pro Asp Ala Phe Pro Gly Lys Ser Glu Leu Phe Arg Asp Asn Asp Glu
225 230 235 240
Trp Ile Asn Pro Gly Asp Ser Val Val Val Ala Pro Gly Gly Lys Val
245 250 255
Val Ala Gly Pro Leu His Lys Glu Arg Ala Ile Leu Tyr Ala Glu Ile
260 265 270
Asp Leu Glu Arg Val Gly Val Ala Arg Arg Ser Leu Asp Val Val Gly
275 280 285
His Tyr Ala Arg Pro Asp Leu Phe Asp Leu His Val Asn Ala Arg Pro
290 295 300
Gln Ser Val Val Glu Leu Arg
305 310
<210>69
<211>939
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>69
gtgaccgagt ttcggacggt gcgggtcgca gcggtgcagg cgacgccggt gaccctcgac 60
gccgatgcct cggtcgagaa ggcgatcggg ctgatcggcg aggcggtggc cggtggagcg 120
cagctcgtcg tgctgcccga ggccttcgtg tcgctctacc cgtcgaacgc gtgggcgcga 180
gcggccgccg gattcggcgg cttcgacgag ctctgggagc ggatgtgggc cagctcgctc 240
gacgtcccgg gcccgctggt cgaccggctg gtcgatgcgt gccgcaggca tgacgtggta 300
tgcgtgatcg gcgtgaacga gcgcgaaagc gaaaggccgg ggtcgcttta caacacgatg 360
ctgaccctcg gcccgtcggg cctcctgcac cggcaccgca agctcatgcc gacgcaccac 420
gagcggctgt tccatgggat cggcgacggt caagacctcg gcgttgtgga gaccgacgcg 480
ggacggatcg ggggactgat ctgctgggag aaccgaatgc cgctcgcgcg ctacgcggtc 540
taccagggtg gaccgcagat ctgggtcgcg ccgacggccg atgactccga cggctggctc 600
gcgagcatgc gccacatcgc gatcgagtcg ggcgcgttcg tcgtgtcggt gccgcagttc 660
atcccggcgt ccgcgttccc cgacgatttc cccgtcgagc taccgccggg caaggaggtg 720
ttcggccgcg gcggtgcggc gatcgtcgag ccgacctggg gcgaggtaat cgccgggccg 780
ctctacgatc gggaggggat cgtgttcgcc gactgtgacc tgcgacgcgg cttgcatgcc 840
aagcgctggt tcgactccgt cggccattac agccgcgcgg aggtgctcga tggcggcgtc 900
gagcgcgtcc cggcgccggt ggacggcgaa tcgccgtga 939
<210>70
<211>312
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>70
Val Thr Glu Phe Arg Thr Val Arg Val Ala Ala Val Gln Ala Thr Pro
1 5 10 15
Val Thr Leu Asp Ala Asp Ala Ser Val Glu Lys Ala Ile Gly Leu Ile
20 25 30
Gly Glu Ala Val Ala Gly Gly Ala Gln Leu Val Val Leu Pro Glu Ala
35 40 45
Phe Val Ser Leu Tyr Pro Ser Asn Ala Trp Ala Arg Ala Ala Ala Gly
50 55 60
Phe Gly Gly Phe Asp Glu Leu Trp Glu Arg Met Trp Ala Ser Ser Leu
65 70 75 80
Asp Val Pro Gly Pro Leu Val Asp Arg Leu Val Asp Ala Cys Arg Arg
85 90 95
His Asp Val Val Cys Val Ile Gly Val Asn Glu Arg Glu Ser Glu Arg
100 105 110
Pro Gly Ser Leu Tyr Asn Thr Met Leu Thr Leu Gly Pro Ser Gly Leu
115 120 125
Leu His Arg His Arg Lys Leu Met Pro Thr His His Glu Arg Leu Phe
130 135 140
His Gly Ile Gly Asp Gly Gln Asp Leu Gly Val Val Glu Thr Asp Ala
145 150 155 160
Gly Arg Ile Gly Gly Leu Ile Cys Trp Glu Asn Arg Met Pro Leu Ala
165 170 175
Arg Tyr Ala Val Tyr Gln Gly Gly Pro Gln Ile Trp Val Ala Pro Thr
180 185 190
Ala Asp Asp Ser Asp Gly Trp Leu Ala Ser Met Arg His Ile Ala Ile
195 200 205
Glu Ser Gly Ala Phe Val Val Ser Val Pro Gln Phe Ile Pro Ala Ser
210 215 220
Ala Phe Pro Asp Asp Phe Pro Val Glu Leu Pro Pro Gly Lys Glu Val
225 230 235 240
Phe Gly Arg Gly Gly Ala Ala Ile Val Glu Pro Thr Trp Gly Glu Val
245 250 255
Ile Ala Gly Pro Leu Tyr Asp Arg Glu Gly Ile Val Phe Ala Asp Cys
260 265 270
Asp Leu Arg Arg Gly Leu His Ala Lys Arg Trp Phe Asp Ser Val Gly
275 280 285
His Tyr Ser Arg Ala Glu Val Leu Asp Gly Gly Val Glu Arg Val Pro
290 295 300
Ala Pro Val Asp Gly Glu Ser Pro
305 310
<210>71
<211>966
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>71
atgccaaacg agaacaccaa cgccacattc aaagttgccg ctgtgcaggc ttcgcctgtg 60
tttcttgatc gtgccgcaac aatcgacaag gcttgcgatt tgatcgccgc tgctggcggt 120
gaaggggcac gcttgattgt ctttccagaa gcattcatcc cgtcttatcc tgattgggta 180
tgggcaattc cttcgggtga agagggcgta ctcaatgagt tgtacgcaga tctgctatcc 240
aactcggtca cgattcccag tgactcgacg gacaaactgt gcagagcagc caggcttgct 300
aatgcctacg tggtgatggg tatgagcgaa cgcaatgctg aggcaagcgg cgcgagcatg 360
tataacacgc tattgtatat tgatgcacag ggggagattc tgggcaagca tcggaagttg 420
gtgccaacgg gcggcgagcg gctagtctgg gcgcagggcg atggcagtac actgcaggtc 480
tatgatactc ccttagggaa actcggtggc ttaatttgct gggagaatta tatgccactg 540
gcccgctata ccatgtatgc ctggggcaca caaatctatg tcgcggcaac gtgggatcgg 600
ggtcagccct ggctctctac tttacgccac attgccaaag aaggcagggt gtatgtgatt 660
ggttgttgta tcgcgatgcg taaagacgat atcccagacc attatacaat gaaacagaag 720
ttttactcag atgcagatga gtggattaat attggcgata gtgcgattgt taatcccgaa 780
gggcaattta tcgctggacc ggtgcgcaag caggaagaga ttctctatgc ggagattgat 840
ccgcgcatgg tccaagggcc gaagtggatg ctcgacgtgg cgggacatta tgccaggccg 900
gatgtgttcg aactgattgt ccacacggat attcgaagga tgatcaaatc ggaaaagaat 960
tcataa 966
<210>72
<211>321
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>72
Met Pro Asn Glu Asn Thr Asn Ala Thr Phe Lys Val Ala Ala Val Gln
1 5 10 15
Ala Ser Pro Val Phe Leu Asp Arg Ala Ala Thr Ile Asp Lys Ala Cys
20 25 30
Asp Leu Ile Ala Ala Ala Gly Gly Glu Gly Ala Arg Leu Ile Val Phe
35 40 45
Pro Glu Ala Phe Ile Pro Ser Tyr Pro Asp Trp Val Trp Ala Ile Pro
50 55 60
Ser Gly Glu Glu Gly Val Leu Asn Glu Leu Tyr Ala Asp Leu Leu Ser
65 70 75 80
Asn Ser Val Thr Ile Pro Ser Asp Ser Thr Asp Lys Leu Cys Arg Ala
85 90 95
Ala Arg Leu Ala Asn Ala Tyr Val Val Met Gly Met Ser Glu Arg Asn
100 105 110
Ala Glu Ala Ser Gly Ala Ser Met Tyr Asn Thr Leu Leu Tyr Ile Asp
115 120 125
Ala Gln Gly Glu Ile Leu Gly Lys His Arg Lys Leu Val Pro Thr Gly
130 135 140
Gly Glu Arg Leu Val Trp Ala Gln Gly Asp Gly Ser Thr Leu Gln Val
145 150 155 160
Tyr Asp Thr Pro Leu Gly Lys Leu Gly Gly Leu Ile Cys Trp Glu Asn
165 170 175
Tyr Met Pro Leu Ala Arg Tyr Thr Met Tyr Ala Trp Gly Thr Gln Ile
180 185 190
Tyr Val Ala Ala Thr Trp Asp Arg Gly Gln Pro Trp Leu Ser Thr Leu
195 200 205
Arg His Ile Ala Lys Glu Gly Arg Val Tyr Val Ile Gly Cys Cys Ile
210 215 220
Ala Met Arg Lys Asp Asp Ile Pro Asp His Tyr Thr Met Lys Gln Lys
225 230 235 240
Phe Tyr Ser Asp Ala Asp Glu Trp Ile Asn Ile Gly Asp Ser Ala Ile
245 250 255
Val Asn Pro Glu Gly Gln Phe Ile Ala Gly Pro Val Arg Lys Gln Glu
260 265 270
Glu Ile Leu Tyr Ala Glu Ile Asp Pro Arg Met Val Gln Gly Pro Lys
275 280 285
Trp Met Leu Asp Val Ala Gly His Tyr Ala Arg Pro Asp Val Phe Glu
290 295 300
Leu Ile Val His Thr Asp Ile Arg Arg Met Ile Lys Ser Glu Lys Asn
305 310 315 320
Ser
<210>73
<211>1035
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>73
atgacagcaa tagactcaac gtttaaagtc gccgccgttc aggctgcgcc ggtcttcctc 60
aatcgcgacg caaccgtgga gaaggcgtgc cggctgatca agtccgcggc agagggaggc 120
gcgcgtctga tcgttttccc ggaagcgttc ataccggcct acccggactg ggtgtggacg 180
gtccctgccg gtgagcaagg cctgctcaac gacctctacg gccaactcgt cgaccagtcc 240
gtgacgattc ccagcgacat caccaccgag ttatgtaacg cggcacgggc agcaaacgcc 300
tatgtcgtga ttggtgtcaa cgagcgcaac gcggaggcaa gcaatggaag cctctacaac 360
tcgctcctct acatcgacgc aaacggcaaa attctcggta agcaccgcaa gctcgttccc 420
acaggcggag aacggctcgt gtgggcgcag ggcgatggca gcacgctcga agcctacgac 480
acggagctgg gcaaactcgg cggtctcatt tgctgggaga actatatgcc gctggcacgc 540
tacgcgatgt acgcatgggg agtgcagctc tatgtcgccg cgacctggga ccgtggcggc 600
ccctggactg ccacgctgcg tcatgtcgcc aaggaaggtc agatgtacgt catcgggtgc 660
tgccaggccc tgcacaagga tgacctgccg gagctagacg ggctgaagga gaagtactac 720
gccaacgcac gagagtggat caatgttggc gacagcgcta ttgtcggccc ggacggacaa 780
ttccttgtcg agcccgtccg aatgcgggaa gacatcctct acgccgaggt ggacactcgc 840
aacttccgcg gcccgaagtg gatgttcgac gcggctggac actacgcgcg tcccgacatt 900
ttccaactca cagtgaaccg cgagcagcgg ccgatggtcc gcgtcgtcgg tgacagcagt 960
gaccagaagg agcggccgct cccggacgac ggacggctct ggtacgccta cagcaccaat 1020
cagcaccacg actga 1035
<210>74
<211>344
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>74
Met Thr Ala Ile Asp Ser Thr Phe Lys Val Ala Ala Val Gln Ala Ala
1 5 10 15
Pro Val Phe Leu Asn Arg Asp Ala Thr Val Glu Lys Ala Cys Arg Leu
20 25 30
Ile Lys Ser Ala Ala Glu Gly Gly Ala Arg Leu Ile Val Phe Pro Glu
35 40 45
Ala Phe Ile Pro Ala Tyr Pro Asp Trp Val Trp Thr Val Pro Ala Gly
50 55 60
Glu Gln Gly Leu Leu Asn Asp Leu Tyr Gly Gln Leu Val Asp Gln Ser
65 70 75 80
Val Thr Ile Pro Ser Asp Ile Thr Thr Glu Leu Cys Asn Ala Ala Arg
85 90 95
Ala Ala Asn Ala Tyr Val Val Ile Gly Val Asn Glu Arg Asn Ala Glu
100 105 110
Ala Ser Asn Gly Ser Leu Tyr Asn Ser Leu Leu Tyr Ile Asp Ala Asn
115 120 125
Gly Lys Ile Leu Gly Lys His Arg Lys Leu Val Pro Thr Gly Gly Glu
130 135 140
Arg Leu Val Trp Ala Gln Gly Asp Gly Ser Thr Leu Glu Ala Tyr Asp
145 150 155 160
Thr Glu Leu Gly Lys Leu Gly Gly Leu Ile Cys Trp Glu Asn Tyr Met
165 170 175
Pro Leu Ala Arg Tyr Ala Met Tyr Ala Trp Gly Val Gln Leu Tyr Val
180 185 190
Ala Ala Thr Trp Asp Arg Gly Gly Pro Trp Thr Ala Thr Leu Arg His
195 200 205
Val Ala Lys Glu Gly Gln Met Tyr Val Ile Gly Cys Cys Gln Ala Leu
210 215 220
His Lys Asp Asp Leu Pro Glu Leu Asp Gly Leu Lys Glu Lys Tyr Tyr
225 230 235 240
Ala Asn Ala Arg Glu Trp Ile Asn Val Gly Asp Ser Ala Ile Val Gly
245 250 255
Pro Asp Gly Gln Phe Leu Val Glu Pro Val Arg Met Arg Glu Asp Ile
260 265 270
Leu Tyr Ala Glu Val Asp Thr Arg Asn Phe Arg Gly Pro Lys Trp Met
275 280 285
Phe Asp Ala Ala Gly His Tyr Ala Arg Pro Asp Ile Phe Gln Leu Thr
290 295 300
Val Asn Arg Glu Gln Arg Pro Met Val Arg Val Val Gly Asp Ser Ser
305 310 315 320
Asp Gln Lys Glu Arg Pro Leu Pro Asp Asp Gly Arg Leu Trp Tyr Ala
325 330 335
Tyr Ser Thr Asn Gln His His Asp
340
<210>75
<211>1125
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>75
atgagcacca ttgttaaagc cgctgcggtt caaatcagcc cagtcctcta cagccgcgag 60
gggacagtcg caaaagttgt gcggaagatc cacgaacttg gccaaaaggg ggtgcggttc 120
gccacgttcc cggagaccgt ggttccctac tatccatatt tttccgccgt ccagaccccc 180
attcaactat tgtccggaac cgagtacctg aagttgctcg accaaggcgt gaccgtgccg 240
tccacgacta ccgacgcaat cggggaggct gcccggaacg ccggcatggt tgtatctatc 300
ggcgtgaatg agcgtgacgg cgggaccctg tacaacgcgc agttgctctt cgatgcggat 360
gggaccttga ttcagcgtcg ccgcaagatc actcctacgc attacgagcg catgatctgg 420
ggccagggag atggttcggg tttgcgggcc gtcaagagcc aggttggtcg tattggccaa 480
cttgcatgct ttgagcacaa caacccactg gcgcgttacg cgatgatggc cgatggcgag 540
caaatccatt cggccatgta tccaggttcc gcgttcggcg aggggttcgc ggaaaagatg 600
gaaatcaata tccgccagca tgcgttggag tccgggtgct tcgttgtgaa tgcaacggcc 660
tggcttgacg ccagccagca ggcacaaatc atgaatgaca cgggttgcca aatcggtccg 720
atctcgggcg gttgctttac cacgatcgta acacccgacg gcacgtttct gggcgaacct 780
ctccggtcgg gtgagggcga ggtcatcgcc gatctcgatt tcaagctgat cgacaaacgc 840
aagatgttga tggactcgcg cggccactac agtcgcccgg aattgctcag tctgctgatc 900
gaccgcaccc ccaccgcgca cattcatgag cgaggtgcgc cgcagacgtc aggcgctgtg 960
caagaggcga cgaaagtggg ttcacacgcg ccgctcctgc gtgacggaca atgggatcag 1020
ctcaatgcgg gagcgggccg acatacaggg aatggagaag cacagataga aatcatggcc 1080
gcggcccact cgggcacccg tggaattgaa gcgaagggag cctaa 1125
<210>76
<211>374
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>76
Met Ser Thr Ile Val Lys Ala Ala Ala Val Gln Ile Ser Pro Val Leu
1 5 10 15
Tyr Ser Arg Glu Gly Thr Val Ala Lys Val Val Arg Lys Ile His Glu
20 25 30
Leu Gly Gln Lys Gly Val Arg Phe Ala Thr Phe Pro Glu Thr Val Val
35 40 45
Pro Tyr Tyr Pro Tyr Phe Ser Ala Val Gln Thr Pro Ile Gln Leu Leu
50 55 60
Ser Gly Thr Glu Tyr Leu Lys Leu Leu Asp Gln Gly Val Thr Val Pro
65 70 75 80
Ser Thr Thr Thr Asp Ala Ile Gly Glu Ala Ala Arg Asn Ala Gly Met
85 90 95
Val Val Ser Ile Gly Val Asn Glu Arg Asp Gly Gly Thr Leu Tyr Asn
100 105 110
Ala Gln Leu Leu Phe Asp Ala Asp Gly Thr Leu Ile Gln Arg Arg Arg
115 120 125
Lys Ile Thr Pro Thr His Tyr Glu Arg Met Ile Trp Gly Gln Gly Asp
130 135 140
Gly Ser Gly Leu Arg Ala Val Lys Ser Gln Val Gly Arg Ile Gly Gln
145 150 155 160
Leu Ala Cys Phe Glu His Asn Asn Pro Leu Ala Arg Tyr Ala Met Met
165 170 175
Ala Asp Gly Glu Gln Ile His Ser Ala Met Tyr Pro Gly Ser Ala Phe
180 185 190
Gly Glu Gly Phe Ala Glu Lys Met Glu Ile Asn Ile Arg Gln His Ala
195 200 205
Leu Glu Ser Gly Cys Phe Val Val Asn Ala Thr Ala Trp Leu Asp Ala
210 215 220
Ser Gln Gln Ala Gln Ile Met Asn Asp Thr Gly Cys Gln Ile Gly Pro
225 230 235 240
Ile Ser Gly Gly Cys Phe Thr Thr Ile Val Thr Pro Asp Gly Thr Phe
245 250 255
Leu Gly Glu Pro Leu Arg Ser Gly Glu Gly Glu Val Ile Ala Asp Leu
260 265 270
Asp Phe Lys Leu Ile Asp Lys Arg Lys Met Leu Met Asp Ser Arg Gly
275 280 285
His Tyr Ser Arg Pro Glu Leu Leu Ser Leu Leu Ile Asp Arg Thr Pro
290 295 300
Thr Ala His Ile His Glu Arg Gly Ala Pro Gln Thr Ser Gly Ala Val
305 310 315 320
Gln Glu Ala Thr Lys Val Gly Ser His Ala Pro Leu Leu Arg Asp Gly
325 330 335
Gln Trp Asp Gln Leu Asn Ala Gly Ala Gly Arg His Thr Gly Asn Gly
340 345 350
Glu Ala Gln Ile Glu Ile Met Ala Ala Ala His Ser Gly Thr Arg Gly
355 360 365
Ile Glu Ala Lys Gly Ala
370
<210>77
<211>1056
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>77
atgccaaccc ccagcgatca tttcaaaatc gccgctgttc aggcctcgcc cgtgtttctg 60
gaccgggagg ccactgtgga aaaggcctgc cggttgatcg ccgaagccgc aaagcagggc 120
gtccgcctca tcgtctttcc ggaatcgttc atcccgacct acccggactg ggtatgggcc 180
gttcccccgg gaagggaaag aatcctgaac cagctgtatt ctgaattcct ggccaatgcc 240
gtcgatgttc ccggcgcggc gaccgaacaa cttgcccagg ctgcacgaat ggccggcgcc 300
tatgtgatta tgggcgtcac cgaaagagac acctcggcca gcggggccag cctctacaac 360
accctgctct acttcagccc cgaaggcatc ctaatgggca aacaccggaa gctggttccc 420
acggggggcg aacggctggt ctgggcctac ggagacggca gcacgctgga ggtctacgac 480
actccgctgg gaaagatcgg cgggctgatc tgctgggaga actacatgcc cctggcccgg 540
tacacgatgt acgcctgggg cacccagatt tacatcgccg ccacctggga ccgcggggaa 600
ccgtggctct ccaccctgcg gcatatcgca aaggaaggaa gggtctacgt catcgggtgc 660
tgcatcgccc tgcgccaggg ggatatcccg gaccggttcg agtacaaggg aaaattttat 720
tccgggtccc gggagtggat caatgagggc gacagcgcca tcgtgaaccc ggacggggaa 780
ttcatcgccg ggccggtgcg gacgaaggag gagatcctgt atgccgagat agacccccgg 840
cagatgcggg gccccaagtg gatgctcgat gtggccggtc attacgcccg gccggatatc 900
ttcgagctca tcgtccaccg gaatccccac ccgatgatca aaatcgccga agacaggggc 960
acggggatcg cctcaagttt gattcgcccc cgccctaacc ttcccccatc aagggggagg 1020
aaatcggcaa gaagcaaacg caagcccaaa aaatga 1056
<210>78
<211>351
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>78
Met Pro Thr Pro Ser Asp His Phe Lys Ile Ala Ala Val Gln Ala Ser
1 5 10 15
Pro Val Phe Leu Asp Arg Glu Ala Thr Val Glu Lys Ala Cys Arg Leu
20 25 30
Ile Ala Glu Ala Ala Lys Gln Gly Val Arg Leu Ile Val Phe Pro Glu
35 40 45
Ser Phe Ile Pro Thr Tyr Pro Asp Trp Val Trp Ala Val Pro Pro Gly
50 55 60
Arg Glu Arg Ile Leu Asn Gln Leu Tyr Ser Glu Phe Leu Ala Asn Ala
65 70 75 80
Val Asp Val Pro Gly Ala Ala Thr Glu Gln Leu Ala Gln Ala Ala Arg
85 90 95
Met Ala Gly Ala Tyr Val Ile Met Gly Val Thr Glu Arg Asp Thr Ser
100 105 110
Ala Ser Gly Ala Ser Leu Tyr Asn Thr Leu Leu Tyr Phe Ser Pro Glu
115 120 125
Gly Ile Leu Met Gly Lys His Arg Lys Leu Val Pro Thr Gly Gly Glu
130 135 140
Arg Leu Val Trp Ala Tyr Gly Asp Gly Ser Thr Leu Glu Val Tyr Asp
145 150 155 160
Thr Pro Leu Gly Lys Ile Gly Gly Leu Ile Cys Trp Glu Asn Tyr Met
165 170 175
Pro Leu Ala Arg Tyr Thr Met Tyr Ala Trp Gly Thr Gln Ile Tyr Ile
180 185 190
Ala Ala Thr Trp Asp Arg Gly Glu Pro Trp Leu Ser Thr Leu Arg His
195 200 205
Ile Ala Lys Glu Gly Arg Val Tyr Val Ile Gly Cys Cys Ile Ala Leu
210 215 220
Arg Gln Gly Asp Ile Pro Asp Arg Phe Glu Tyr Lys Gly Lys Phe Tyr
225 230 235 240
Ser Gly Ser Arg Glu Trp Ile Asn Glu Gly Asp Ser Ala Ile Val Asn
245 250 255
Pro Asp Gly Glu Phe Ile Ala Gly Pro Val Arg Thr Lys Glu Glu Ile
260 265 270
Leu Tyr Ala Glu Ile Asp Pro Arg Gln Met Arg Gly Pro Lys Trp Met
275 280 285
Leu Asp Val Ala Gly His Tyr Ala Arg Pro Asp Ile Phe Glu Leu Ile
290 295 300
Val His Arg Asn Pro His Pro Met Ile Lys Ile Ala Glu Asp Arg Gly
305 310 315 320
Thr Gly Ile Ala Ser Ser Leu Ile Arg Pro Arg Pro Asn Leu Pro Pro
325 330 335
Ser Arg Gly Arg Lys Ser Ala Arg Ser Lys Arg Lys Pro Lys Lys
340 345 350
<210>79
<211>990
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>79
atgacgaaga aaagcggccg cgattcgttt cgggtcgctg cggtccaggc ctcgtccgtc 60
tacctggatc gggaacggag catcgagaaa gcgtgccggc tgatcgacga cgcgggacga 120
aacgacgccg acctcgtcgt gttccccgaa gccttcgtgc ccggataccc actgtgggtg 180
tggctcgttc cgccggggcg caccgcagac ttgcgctccg cttatgcgac gctccacgcc 240
aacgcgatca gcattccgga cgactccacc gatcggctgt gcgccgccgc aaaagacgcc 300
ggcgtcgccg tcgcgatcgg cgtcaacgaa cgcaacaccg aagcgagcgg catgagcctg 360
ttcaacacgc tgctctatat cggagcggac ggccggattc tcggaaaaca ccggaagctg 420
gtaccgaccg gcggcgaacg gctcgtctgg gcatctggcg acggcagcga cctcgaggtc 480
tactcgctgc cgttcggtcg cgtaagcgga ctgatctgct gggagcacta catgccgctc 540
gcccggtatg cgctcgccgc gtggggcgaa caggtgcacg tcgctccaac ctgggatcgt 600
ggcgagccgt ggctgtccac gctaaggcac atcgcgaagg aaggccgcgt tctcgtcgtc 660
ggctgctgtc aagccgtgcg caaggacgac atccctgaca cgctcgcgtt caagtccaaa 720
tacctcgcag acgtggacgg ctggatcaac ccaggtggca gcgtcatcat caatcctgac 780
ggcaaggtcg tcgcgggacc ggcgatggaa accgaaactg tactgtacgc ggaccttcgc 840
accgagcagc tcgtcggacc gcgctggcag ctcgacgtcg gcggacatta cgctcgtccg 900
gacgtcttcg agctcgtcgt ccatcggcat ccgaagccgt tgattcggac agcgaccggt 960
gtcaggcgcc gcaagcgtgc acgtcgctaa 990
<210>80
<211>329
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>80
Met Thr Lys Lys Ser Gly Arg Asp Ser Phe Arg Val Ala Ala Val Gln
1 5 10 15
Ala Ser Ser Val Tyr Leu Asp Arg Glu Arg Ser Ile Glu Lys Ala Cys
20 25 30
Arg Leu Ile Asp Asp Ala Gly Arg Asn Asp Ala Asp Leu Val Val Phe
35 40 45
Pro Glu Ala Phe Val Pro Gly Tyr Pro Leu Trp Val Trp Leu Val Pro
50 55 60
Pro Gly Arg Thr Ala Asp Leu Arg Ser Ala Tyr Ala Thr Leu His Ala
65 70 75 80
Asn Ala Ile Ser Ile Pro Asp Asp Ser Thr Asp Arg Leu Cys Ala Ala
85 90 95
Ala Lys Asp Ala Gly Val Ala Val Ala Ile Gly Val Asn Glu Arg Asn
100 105 110
Thr Glu Ala Ser Gly Met Ser Leu Phe Asn Thr Leu Leu Tyr Ile Gly
115 120 125
Ala Asp Gly Arg Ile Leu Gly Lys His Arg Lys Leu Val Pro Thr Gly
130 135 140
Gly Glu Arg Leu Val Trp Ala Ser Gly Asp Gly Ser Asp Leu Glu Val
145 150 155 160
Tyr Ser Leu Pro Phe Gly Arg Val Ser Gly Leu Ile Cys Trp Glu His
165 170 175
Tyr Met Pro Leu Ala Arg Tyr Ala Leu Ala Ala Trp Gly Glu Gln Val
180 185 190
His Val Ala Pro Thr Trp Asp Arg Gly Glu Pro Trp Leu Ser Thr Leu
195 200 205
Arg His Ile Ala Lys Glu Gly Arg Val Leu Val Val Gly Cys Cys Gln
210 215 220
Ala Val Arg Lys Asp Asp Ile Pro Asp Thr Leu Ala Phe Lys Ser Lys
225 230 235 240
Tyr Leu Ala Asp Val Asp Gly Trp Ile Asn Pro Gly Gly Ser Val Ile
245 250 255
Ile Asn Pro Asp Gly Lys Val Val Ala Gly Pro Ala Met Glu Thr Glu
260 265 270
Thr Val Leu Tyr Ala Asp Leu Arg Thr Glu Gln Leu Val Gly Pro Arg
275 280 285
Trp Gln Leu Asp Val Gly Gly His Tyr Ala Arg Pro Asp Val Phe Glu
290 295 300
Leu Val Val His Arg His Pro Lys Pro Leu Ile Arg Thr Ala Thr Gly
305 310 315 320
Val Arg Arg Arg Lys Arg Ala Arg Arg
325
<210>81
<211>993
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>81
atgaaagtcg tcaaagccgc cgctgtccag ttcagcccgg tgctctatag ccgcgaagcg 60
accgtcgcca aggtcgtccg gaaaatccac gagctcggtc agaaaggcgt gcagttcgcc 120
acctttcctg aaacggtcgt gccttattac ccttacttcg cggccgtcca gacgggcatc 180
gagctcttgt cgggcaccga acatctgcgc ctgctcgaac aggccgtgac tgtgccctcc 240
gctgcgaccg atgcaatcgg cgaagccgcg cgacaggccg gcatggtcgt gtccatcggc 300
gtcaatgagc gtgacggcgg cacgctttac aacacgcaac tgctcttcga tgccgacggt 360
acgctgatcc agcgccgccg caagatcacg ccgacccatt tcgaacgcat gatctggggg 420
cagggagatg gctcgggctt gcgtgcagtc gacagcgcag tcggccgcat cggccagctc 480
gcatgcttcg agcacaacaa cccgcttgca cgttacgcaa tgatcgccga cggcgagcag 540
atccattcag cgatgtaccc tggctcggcc tttggcgagg gcttcgccca gcgtatggag 600
atcaacatcc gccagcatgc gctcgagtcc gccgctttcg tcgtcaacgc aacggcgtgg 660
cttgacgccg accagcaggc gcaaatcatg aaggacaccg gttgtggaat cggtccgatc 720
tcgggcggct gcttcaccac gatcgtttct cctgacggta tgctgatggc cgatccgctt 780
cgctcgggcg aaggcgaagt gattgtcgat ctcgacttca cgcagatcga ccgccgcaag 840
atgctgatgg actcggccgg ccactacaac cgccctgaac tgctgagtct gatgatcgac 900
cgtacgccgg ctgcgcatgt tcacgaacgc gcttcgcgcc cgatgaccgt cgacgaccag 960
agttccggcg atctgcgcac ccaggttgca tga 993
<210>82
<211>330
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>82
Met Lys Val Val Lys Ala Ala Ala Val Gln Phe Ser Pro Val Leu Tyr
1 5 10 15
Ser Arg Glu Ala Thr Val Ala Lys Val Val Arg Lys Ile His Glu Leu
20 25 30
Gly Gln Lys Gly Val Gln Phe Ala Thr Phe Pro Glu Thr Val Val Pro
35 40 45
Tyr Tyr Pro Tyr Phe Ala Ala Val Gln Thr Gly Ile Glu Leu Leu Ser
50 55 60
Gly Thr Glu His Leu Arg Leu Leu Glu Gln Ala Val Thr Val Pro Ser
65 70 75 80
Ala Ala Thr Asp Ala Ile Gly Glu Ala Ala Arg Gln Ala Gly Met Val
85 90 95
Val Ser Ile Gly Val Asn Glu Arg Asp Gly Gly Thr Leu Tyr Asn Thr
100 105 110
Gln Leu Leu Phe Asp Ala Asp Gly Thr Leu Ile Gln Arg Arg Arg Lys
115 120 125
Ile Thr Pro Thr His Phe Glu Arg Met Ile Trp Gly Gln Gly Asp Gly
130 135 140
Ser Gly Leu Arg Ala Val Asp Ser Ala Val Gly Arg Ile Gly Gln Leu
145 150 155 160
Ala Cys Phe Glu His Asn Asn Pro Leu Ala Arg Tyr Ala Met Ile Ala
165 170 175
Asp Gly Glu Gln Ile His Ser Ala Met Tyr Pro Gly Ser Ala Phe Gly
180 185 190
Glu Gly Phe Ala Gln Arg Met Glu Ile Asn Ile Arg Gln His Ala Leu
195 200 205
Glu Ser Ala Ala Phe Val Val Asn Ala Thr Ala Trp Leu Asp Ala Asp
210 215 220
Gln Gln Ala Gln Ile Met Lys Asp Thr Gly Cys Gly Ile Gly Pro Ile
225 230 235 240
Ser Gly Gly Cys Phe Thr Thr Ile Val Ser Pro Asp Gly Met Leu Met
245 250 255
Ala Asp Pro Leu Arg Ser Gly Glu Gly Glu Val Ile Val Asp Leu Asp
260 265 270
Phe Thr Gln Ile Asp Arg Arg Lys Met Leu Met Asp Ser Ala Gly His
275 280 285
Tyr Asn Arg Pro Glu Leu Leu Ser Leu Met Ile Asp Arg Thr Pro Ala
290 295 300
Ala His Val His Glu Arg Ala Ser Arg Pro Met Thr Val Asp Asp Gln
305 310 315 320
Ser Ser Gly Asp Leu Arg Thr Gln Val Ala
325 330
<210>83
<211>1071
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>83
atgatgagtt cagcccgtgt aataaaactc gccgcagcac agctttcacc tgtgctgccg 60
ggggagtcca caaatagccg cgacggcacc attgccaaag tcgtcgcggc gattgcggag 120
gctgcgcgcg ccggcgcgca gctgatcgtg tttcccgaaa cggtggtgcc gtattacccg 180
tatttctcgt tcattacgcc ggcggtgacg atgggggcgg agcatttgcg cttgtacgat 240
cagtctgtcg tggtgccgag cgccgccact gatactgttg ccgccgctgc aaaaaaacac 300
agcatggtgg tcgtgctcgg tattaacgaa cgcgatcacg gcacgctcta caacgcgcaa 360
ttaattttcg atgcgagcgg cgaattatta ttaaaacgcc gaaaaattac cccgacctat 420
cacgagcgca tggtgtgggg tcagggcgac ggcagcggtt tgaaaaccgt cgacaccgcg 480
atcggccgtg tcggtgcgct cgcctgctgg gaacattaca acccattggc gcgttacagc 540
ctgatggccc agcacgaaga aattcattgc agtcaatttc cggggtcatt ggtcgggcca 600
attttcgccg agcaaatgga agtgacaatg cgccaccacg cgctcgaatc cggttgcttc 660
gtcgttaatg caacggcgtg gttatcggaa gcgcaaattc aatcgatcag cagcgatccc 720
gcgatgcaaa aagcactgcg cggcggttgc tacaccgcaa ttatttcgcc cgaaggcaaa 780
catctgtgcg agccgctacg cgaaggtgaa ggtttgattt ttgccgaagc cgatatggcg 840
ctcattacca aacgcaaacg catgatggat tcggttggtc attacgcgcg acccgaattg 900
ctgtcgctgt taatcgacca tcgcgccacc acaccattgc atagcgtcac cgcgagtgat 960
gccgccgccg taaaaaatac tcggagttcc gctcatgaat cagccgatag tgaaaccatc 1020
cgcgagtcag ttaataacgg aactccaatc gcacggcttg cgcctagttg a 1071
<210>84
<211>356
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>84
Met Met Ser Ser Ala Arg Val Ile Lys Leu Ala Ala Ala Gln Leu Ser
1 5 10 15
Pro Val Leu Pro Gly Glu Ser Thr Asn Ser Arg Asp Gly Thr Ile Ala
20 25 30
Lys Val Val Ala Ala Ile Ala Glu Ala Ala Arg Ala Gly Ala Gln Leu
35 40 45
Ile Val Phe Pro Glu Thr Val Val Pro Tyr Tyr Pro Tyr Phe Ser Phe
50 55 60
Ile Thr Pro Ala Val Thr Met Gly Ala Glu His Leu Arg Leu Tyr Asp
65 70 75 80
Gln Ser Val Val Val Pro Ser Ala Ala Thr Asp Thr Val Ala Ala Ala
85 90 95
Ala Lys Lys His Ser Met Val Val Val Leu Gly Ile Asn Glu Arg Asp
100 105 110
His Gly Thr Leu Tyr Asn Ala Gln Leu Ile Phe Asp Ala Ser Gly Glu
115 120 125
Leu Leu Leu Lys Arg Arg Lys Ile Thr Pro Thr Tyr His Glu Arg Met
130 135 140
Val Trp Gly Gln Gly Asp Gly Ser Gly Leu Lys Thr Val Asp Thr Ala
145 150 155 160
Ile Gly Arg Val Gly Ala Leu Ala Cys Trp Glu His Tyr Asn Pro Leu
165 170 175
Ala Arg Tyr Ser Leu Met Ala Gln His Glu Glu Ile His Cys Ser Gln
180 185 190
Phe Pro Gly Ser Leu Val Gly Pro Ile Phe Ala Glu Gln Met Glu Val
195 200 205
Thr Met Arg His His Ala Leu Glu Ser Gly Cys Phe Val Val Asn Ala
210 215 220
Thr Ala Trp Leu Ser Glu Ala Gln Ile Gln Ser Ile Ser Ser Asp Pro
225 230 235 240
Ala Met Gln Lys Ala Leu Arg Gly Gly Cys Tyr Thr Ala Ile Ile Ser
245 250 255
Pro Glu Gly Lys His Leu Cys Glu Pro Leu Arg Glu Gly Glu Gly Leu
260 265 270
Ile Phe Ala Glu Ala Asp Met Ala Leu Ile Thr Lys Arg Lys Arg Met
275 280 285
Met Asp Ser Val Gly His Tyr Ala Arg Pro Glu Leu Leu Ser Leu Leu
290 295 300
Ile Asp His Arg Ala Thr Thr Pro Leu His Ser Val Thr Ala Ser Asp
305 310 315 320
Ala Ala Ala Val Lys Asn Thr Arg Ser Ser Ala His Glu Ser Ala Asp
325 330 335
Ser Glu Thr Ile Arg Glu Ser Val Asn Asn Gly Thr Pro Ile Ala Arg
340 345 350
Leu Ala Pro Ser
355
<210>85
<211>1014
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>85
atgggtctgg ttcatcagaa atacaaggtt gcggtggttc aggcggcgcc ggtctttctc 60
gacctcgatg cgacggtgga caagacgatc gccctgatcg agcaggccgc agcacagggc 120
gcgaagctga tcgcgtttcc cgagaccttc attcccggat atccgtggca gatctggctt 180
ggggcgcccg cctgggcgat cggccgtggc ttcgtgcagc gctatttcga taactcgttg 240
tcatttgaca gcccgcaggc cgaaaaaatt cgcaaggccg tcaagcgcgc caagctgacc 300
gcggtgatcg gcgtctccga acgcgacggc ggcagcctct atatcggcca atggctgatc 360
ggtcccgacg gcgagaccat tgcgaagcgc cgcaagctgc ggccgaccca tgccgaacgc 420
accgtgttcg gcgagggcga cggcagcgac ctcgccgtcc atgatcgcgc cgacgtggga 480
cggctcggtg caatgtgctg ctgggagcat ctgcagccgc tgtcgaaata cgcgatgtac 540
gcccagaacg agcaggttca cgtcggcgcc tggccgagct tctcattgta cgacccattc 600
gcccatgcgc ttggctggga agtaaacaac gcggcgagca aggtttatgc tgtcgagggc 660
tcatgtttct tcctcggccc gtgcgcggtg gtctcgcagg ccatgatcga cgagctctgc 720
gattcccccg aaaagcacgc cttcctgcac gctggcggcg gccacgcggt aatctatggg 780
ccggacggga gttcgcttgc cgacaaactt ccacccgatc aggagggcat tctgtatgcc 840
gatatcgatc tcggcatgat cggcgtggca aagaacgccg ccgaccccgc aggacactat 900
tccaggccgg acgtcacgcg gctgctgctc aacacttccc gcgccaatcg cgtcgagcat 960
ttttcattgc cgatcgatgc cgaggtcatg agcgaaatca gacttcaggc ctga 1014
<210>86
<211>337
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>86
Met Gly Leu Val His Gln Lys Tyr Lys Val Ala Val Val Gln Ala Ala
1 5 10 15
Pro Val Phe Leu Asp Leu Asp Ala Thr Val Asp Lys Thr Ile Ala Leu
20 25 30
Ile Glu Gln Ala Ala Ala Gln Gly Ala Lys Leu Ile Ala Phe Pro Glu
35 40 45
Thr Phe Ile Pro Gly Tyr Pro Trp Gln Ile Trp Leu Gly Ala Pro Ala
50 55 60
Trp Ala Ile Gly Arg Gly Phe Val Gln Arg Tyr Phe Asp Asn Ser Leu
65 70 75 80
Ser Phe Asp Ser Pro Gln Ala Glu Lys Ile Arg Lys Ala Val Lys Arg
85 90 95
Ala Lys Leu Thr Ala Val Ile Gly Val Ser Glu Arg Asp Gly Gly Ser
100 105 110
Leu Tyr Ile Gly Gln Trp Leu Ile Gly Pro Asp Gly Glu Thr Ile Ala
115 120 125
Lys Arg Arg Lys Leu Arg Pro Thr His Ala Glu Arg Thr Val Phe Gly
130 135 140
Glu Gly Asp Gly Ser Asp Leu Ala Val His Asp Arg Ala Asp Val Gly
145 150 155 160
Arg Leu Gly Ala Met Cys Cys Trp Glu His Leu Gln Pro Leu Ser Lys
165 170 175
Tyr Ala Met Tyr Ala Gln Asn Glu Gln Val His Val Gly Ala Trp Pro
180 185 190
Ser Phe Ser Leu Tyr Asp Pro Phe Ala His Ala Leu Gly Trp Glu Val
195 200 205
Asn Asn Ala Ala Ser Lys Val Tyr Ala Val Glu Gly Ser Cys Phe Phe
210 215 220
Leu Gly Pro Cys Ala Val Val Ser Gln Ala Met Ile Asp Glu Leu Cys
225 230 235 240
Asp Ser Pro Glu Lys His Ala Phe Leu His Ala Gly Gly Gly His Ala
245 250 255
Val Ile Tyr Gly Pro Asp Gly Ser Ser Leu Ala Asp Lys Leu Pro Pro
260 265 270
Asp Gln Glu Gly Ile Leu Tyr Ala Asp Ile Asp Leu Gly Met Ile Gly
275 280 285
Val Ala Lys Asn Ala Ala Asp Pro Ala Gly His Tyr Ser Arg Pro Asp
290 295 300
Val Thr Arg Leu Leu Leu Asn Thr Ser Arg Ala Asn Arg Val Glu His
305 310 315 320
Phe Ser Leu Pro Ile Asp Ala Glu Val Met Ser Glu Ile Arg Leu Gln
325 330 335
Ala
<210>87
<211>1062
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>87
atggcggaat cgaagctgaa ggtcgccgca attcaagttg cgcccgtgtt catggatcgc 60
gatgccacga tcgcccgcgc ctgcgagcgg atcgccgaag ccgcccgcgc cggcgcggag 120
ttggtggtct ttcccgaggc attcgtgccc gggtatcccg actggatctg ggtggcgcgg 180
ccaagccaac gcaaactgct caatgatctt tacgcgcacc tcgtctcgca gtcggtcgac 240
gtgccgtcgg cctccgtgga tcgtttgcgc gacgcggctc gcgacggcgg ggtcacggtg 300
gtgatcggcg tcaacgagcg caacaccgaa gcgagcggcg cgagcctcta caacaccgcg 360
ctcgtgatcg gtccactggg gcagctgatc ggccgccacc gcaagcttgt gccgaccggg 420
ccggagcgca tggtgtgggc gcagggcgac ggcagcacgc tcgacgtcta cgacacaccc 480
gtcggcaagc tttcgacgtt gatctgctgg gagaactaca tgccgctcgc gcgctacgcc 540
atggcggcgt ggggcgcgcg catccacgtc gccggcacgt gggaccgcgg cgagccgtgg 600
atctcgacca tgcgtcatgt ggcgacggag ggccgcgtat tcgtgattag ctgttgcatg 660
gcgctgcgca aacgagacat tcccgccgag ctcgagttcg cgatgctcta tcccgacggg 720
cgcgaatgga tcaacgccgg tgattcgctg gtcgtgaatc ccgctggcca gatcatcgct 780
gggccgttgc acgagcagga aggaatcctc tacgccgagc tcgagcgcaa tcagatgacc 840
ggtccgcgtt ggatgttcga cgccgccggc cattacgcgc gaccggacgt cttccaactc 900
acggtaaacc gctccccgcg cccgatgctg cgggaggcgg gggcaaagac gagtgaggca 960
aacacgagag atgccgtacc catggacagc acgccctcga gatcgcggcc ccgcgcggtg 1020
gcgcgaaagg ccgcacgcac cggtcgctcc aagcggcggt ga 1062
<210>88
<211>353
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>88
Met Ala Glu Ser Lys Leu Lys Val Ala Ala Ile Gln Val Ala Pro Val
1 5 10 15
Phe Met Asp Arg Asp Ala Thr Ile Ala Arg Ala Cys Glu Arg Ile Ala
20 25 30
Glu Ala Ala Arg Ala Gly Ala Glu Leu Val Val Phe Pro Glu Ala Phe
35 40 45
Val Pro Gly Tyr Pro Asp Trp Ile Trp Val Ala Arg Pro Ser Gln Arg
50 55 60
Lys Leu Leu Asn Asp Leu Tyr Ala His Leu Val Ser Gln Ser Val Asp
65 70 75 80
Val Pro Ser Ala Ser Val Asp Arg Leu Arg Asp Ala Ala Arg Asp Gly
85 90 95
Gly Val Thr Val Val Ile Gly Val Asn Glu Arg Asn Thr Glu Ala Ser
100 105 110
Gly Ala Ser Leu Tyr Asn Thr Ala Leu Val Ile Gly Pro Leu Gly Gln
115 120 125
Leu Ile Gly Arg His Arg Lys Leu Val Pro Thr Gly Pro Glu Arg Met
130 135 140
Val Trp Ala Gln Gly Asp Gly Ser Thr Leu Asp Val Tyr Asp Thr Pro
145 150 155 160
Val Gly Lys Leu Ser Thr Leu Ile Cys Trp Glu Asn Tyr Met Pro Leu
165 170 175
Ala Arg Tyr Ala Met Ala Ala Trp Gly Ala Arg Ile His Val Ala Gly
180 185 190
Thr Trp Asp Arg Gly Glu Pro Trp Ile Ser Thr Met Arg His Val Ala
195 200 205
Thr Glu Gly Arg Val Phe Val Ile Ser Cys Cys Met Ala Leu Arg Lys
210 215 220
Arg Asp Ile Pro Ala Glu Leu Glu Phe Ala Met Leu Tyr Pro Asp Gly
225 230 235 240
Arg Glu Trp Ile Asn Ala Gly Asp Ser Leu Val Val Asn Pro Ala GIy
245 250 255
Gln Ile Ile Ala Gly Pro Leu His Glu Gln Glu Gly Ile Leu Tyr Ala
260 265 270
Glu Leu Glu Arg Asn Gln Met Thr Gly Pro Arg Trp Met Phe Asp Ala
275 280 285
Ala Gly His Tyr Ala Arg Pro Asp Val Phe Gln Leu Thr Val Asn Arg
290 295 300
Ser Pro Arg Pro Met Leu Arg Glu Ala Gly Ala Lys Thr Ser Glu Ala
305 310 315 320
Asn Thr Arg Asp Ala Val Pro Met Asp Ser Thr Pro Ser Arg Ser Arg
325 330 335
Pro Arg Ala Val Ala Arg Lys Ala Ala Arg Thr Gly Arg Ser Lys Arg
340 345 350
Arg
<210>89
<211>918
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>89
atgaatacca aagaagtaaa ggtcgcagcc gctcaatttg ccccacattt tctgaatttg 60
agcaaaacgg tggaaaaaac ctgcaacttg atttccgaag caggcaaaaa tggagcaaag 120
ctcattgtat ttccggaagc cttcctctct ggttatcccg attgggtctg gttaattccc 180
aatggaaatt caacaatgct ggatgattta tatcaggaat tggttgagaa cgctgtaaca 240
atccctgatt caacaacaca gaaactctgt caggcagcaa aagatgccgg ggtatatgtc 300
gcagtcggta tccatgaaag aaatgcagaa gcaagtggct tcacactttt caataccctt 360
ctatacatta atgatcaagg cagcatcatt ggaaaacacc gaaaactgat cccaacaggg 420
ggcgaacgcc tggtctgggg gcagggtaat ggggatacgc ttgctgcatt cgatacacac 480
tttggcaaat tgggaggatt gctttgctgg gaaaactaca tgcccctggc tcggcaagct 540
atgtacgcag ttgggactga agtttatgtt gccccaacct gggactccag tgagaattgg 600
ttgctgagta tgcgccatat agccagagag ggcggcatgt ttgtgatcaa tgtttgccag 660
gctgtccgaa aagacgatat tcctgaccgc tatgcattca agcaactcta ttctggtaat 720
tcagaatgga tcaatagcgg caacagttgc atcatcaatc cgcgcggtga aatcattgcc 780
ggaccatcct caaacaggca agaaatactc tacgcagatt tagatctgag tttgattaca 840
aaatctaaac gcatgttcga tgttaccggg cattatgccc ggccggatgt gtttagatat 900
gaaatcaaaa aaagctag 918
<210>90
<211>305
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>90
Met Asn Thr Lys Glu Val Lys Val Ala Ala Ala Gln Phe Ala Pro His
1 5 10 15
Phe Leu Asn Leu Ser Lys Thr Val Glu Lys Thr Cys Asn Leu Ile Ser
20 25 30
Glu Ala Gly Lys Asn Gly Ala Lys Leu Ile Val Phe Pro Glu Ala Phe
35 40 45
Leu Ser Gly Tyr Pro Asp Trp Val Trp Leu Ile Pro Asn Gly Asn Ser
50 55 60
Thr Met Leu Asp Asp Leu Tyr Gln Glu Leu Val Glu Asn Ala Val Thr
65 70 75 80
Ile Pro Asp Ser Thr Thr Gln Lys Leu Cys Gln Ala Ala Lys Asp Ala
85 90 95
Gly Val Tyr Val Ala Val Gly Ile His Glu Arg Asn Ala Glu Ala Ser
100 105 110
Gly Phe Thr Leu Phe Asn Thr Leu Leu Tyr Ile Asn Asp Gln Gly Ser
115 120 125
Ile Ile Gly Lys His Arg Lys Leu Ile Pro Thr Gly Gly Glu Arg Leu
130 135 140
Val Trp Gly Gln Gly Asn Gly Asp Thr Leu Ala Ala Phe Asp Thr His
145 150 155 160
Phe Gly Lys Leu Gly Gly Leu Leu Cys Trp Glu Asn Tyr Met Pro Leu
165 170 175
Ala Arg Gln Ala Met Tyr Ala Val Gly Thr Glu Val Tyr Val Ala Pro
180 185 190
Thr Trp Asp Ser Ser Glu Asn Trp Leu Leu Ser Met Arg His Ile Ala
195 200 205
Arg Glu Gly Gly Met Phe Val Ile Asn Val Cys Gln Ala Val Arg Lys
210 215 220
Asp Asp Ile Pro Asp Arg Tyr Ala Phe Lys Gln Leu Tyr Ser Gly Asn
225 230 235 240
Ser Glu Trp Ile Asn Ser Gly Asn Ser Cys Ile Ile Asn Pro Arg Gly
245 250 255
Glu Ile Ile Ala Gly Pro Ser Ser Asn Arg Gln Glu Ile Leu Tyr Ala
260 265 270
Asp Leu Asp Leu Ser Leu Ile Thr Lys Ser Lys Arg Met Phe Asp Val
275 280 285
Thr Gly His Tyr Ala Arg Pro Asp Val Phe Arg Tyr Glu Ile Lys Lys
290 295 300
Ser
305
<210>91
<211>939
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>91
atgaccaaaa tcgctgtcat tcaagaacct ccggtctatc tgaatctgag taaatcgatg 60
gacagagcgg tcgacttgat tgccaatgct gcaagcaagg ggtgtgagtt gattgtgttt 120
cccgaagcct ggcttgcagg ttaccccacc ttcgtctggc gtcttgcgcc gggcagcgga 180
atgggaaaaa ctgatgagct ttacgcgcgt ttgctcgcca actcggtcga ccgtagcaaa 240
gaggggctta gaccattgca ggaggccgca aaggagcatg gcgttgtcat tgtgctgggt 300
tatcaagagg tggatggcgc gggaagcagc agcacgatct tcaacagctg tgcgattatt 360
gatgcggacg ggcgactggc caacaatcat cgcaagttga tgcccaccaa tccggagagg 420
atggtttggg gttttggcga cggttcaggc ctgaacgtcg ttgacaccgc ggtgggcagg 480
atcggcacgc tgatttgctg ggaaaactac atgccgttag cgcgctacgc gctgtatgtc 540
caaaacatcg aaatctatgt tgccccgact tgggacagtg gtgccatgtg gcaggcgacc 600
ctgcagcata tcgcgcgcga aggtggctgc tgggtcatcg gatgtgcaac gtcgctggaa 660
gcctctgaca tcccggacga cgttccccat cgggatgagc tattcccgaa caaagacgaa 720
tgggtaaacc ctggcgatgc ggtggtttat aagccatttg gcggcattgt ggccggcccc 780
atgcatcagg aaaaggggct tctcatcgca gagttggacg tcgccgctgt tcagtcgtca 840
cgtcggaagt tcgatgcgag cgggcactac gctcgccccg atgtcttcaa actgcatgtg 900
aatcgcaccg cgatgcggcc agttgatttc acgaattag 939
<210>92
<211>312
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>92
Met Thr Lys Ile Ala Val Ile Gln Glu Pro Pro Val Tyr Leu Asn Leu
1 5 10 15
Ser Lys Ser Met Asp Arg Ala Val Asp Leu Ile Ala Asn Ala Ala Ser
20 25 30
Lys Gly Cys Glu Leu Ile Val Phe Pro Glu Ala Trp Leu Ala Gly Tyr
35 40 45
Pro Thr Phe Val Trp Arg Leu Ala Pro Gly Ser Gly Met Gly Lys Thr
50 55 60
Asp Glu Leu Tyr Ala Arg Leu Leu Ala Asn Ser Val Asp Arg Ser Lys
65 70 75 80
Glu Gly Leu Arg Pro Leu Gln Glu Ala Ala Lys Glu His Gly Val Val
85 90 95
Ile Val Leu Gly Tyr Gln Glu Val Asp Gly Ala Gly Ser Ser Ser Thr
100 105 110
Ile Phe Asn Ser Cys Ala Ile Ile Asp Ala Asp Gly Arg Leu Ala Asn
115 120 125
Asn His Arg Lys Leu Met Pro Thr Asn Pro Glu Arg Met Val Trp Gly
130 135 140
Phe Gly Asp Gly Ser Gly Leu Asn Val Val Asp Thr Ala Val Gly Arg
145 150 155 160
Ile Gly Thr Leu Ile Cys Trp Glu Asn Tyr Met Pro Leu Ala Arg Tyr
165 170 175
Ala Leu Tyr Val Gln Asn Ile Glu Ile Tyr Val Ala Pro Thr Trp Asp
180 185 190
Ser Gly Ala Met Trp Gln Ala Thr Leu Gln His Ile Ala Arg Glu Gly
195 200 205
Gly Cys Trp Val Ile Gly Cys Ala Thr Ser Leu Glu Ala Ser Asp Ile
210 215 220
Pro Asp Asp Val Pro His Arg Asp Glu Leu Phe Pro Asn Lys Asp Glu
225 230 235 240
Trp Val Asn Pro Gly Asp Ala Val Val Tyr Lys Pro Phe Gly Gly Ile
245 250 255
Val Ala Gly Pro Met His Gln Glu Lys Gly Leu Leu Ile Ala Glu Leu
260 265 270
Asp Val Ala Ala Val Gln Ser Ser Arg Arg Lys Phe Asp Ala Ser Gly
275 280 285
His Tyr Ala Arg Pro Asp Val Phe Lys Leu His Val Asn Arg Thr Ala
290 295 300
Met Arg Pro Val Asp Phe Thr Asn
305 310
<210>93
<211>978
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>93
atgcccatca tcaaagccgc tgccgtgcaa atcagcccgg tgctttacag tcgcgaaggc 60
accgtggaca aggtctgtca acagatcatc gacctcggtc ggcaaggcgt gcagttcgcc 120
gtctttccgg aaacggtggt gccttactac ccgtactttt cgtttgtgca gccggccttt 180
gccatgggcg cacagcacct caagttgctg gatcaatcgg tgacagtgcc gtcggccgcc 240
accttggcca tcggtgaagc ttgcaagcaa gcagggatag tggtgtccat cggcgtcaac 300
gaacgcgatg gcggtacgat ctacaacgcg caattactct tcgatgccga cggcagcctg 360
attcagcatc gccgcaaaat caccccgacc tatcacgaac gcatggtctg ggggcaaggc 420
gatggttccg gcctgcgcgc catcgacagt gcagtggggc gcattggctc cctggcctgt 480
tgggagcatt acaacccgct ggctcgttat gccttgatgg ccgatggcga gcagatccac 540
gccgcgatgt ttcccggctc gctggtgggc gacatttttg ccgagcagat cgaagtcacc 600
atccgccatc acgccttgga gtccggctgt ttcgtggtca acgccaccgc ctggctggac 660
gccgatcagc agggccaaat catgcaagac accggttgca gcctcggccc gatctcgggt 720
ggctgcttca ccgccatcgt ttcccctgaa ggcaagttgc tcggtgagcc gctgcgttcc 780
ggcgaagggg tggtgatcgc cgatctcgat ctggcactga tcgataagcg taaacggatg 840
atggattcgg tcgggcatta cagtcgcccg gaactgctca gcctgttgat cgaccgcacg 900
cccacagcgc atgtgcatga acgcagcgcg cacctggtgg ctgtcgctac cgaggagttc 960
gatcatgcaa accaatga 978
<210>94
<211>325
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>94
Met Pro Ile Ile Lys Ala Ala Ala Val Gln Ile Ser Pro Val Leu Tyr
1 5 10 15
Ser Arg Glu Gly Thr Val Asp Lys Val Cys Gln Gln Ile Ile Asp Leu
20 25 30
Gly Arg Gln Gly Val Gln Phe Ala Val Phe Pro Glu Thr Val Val Pro
35 40 45
Tyr Tyr Pro Tyr Phe Ser Phe Val Gln Pro Ala Phe Ala Met Gly Ala
50 55 60
Gln His Leu Lys Leu Leu Asp Gln Ser Val Thr Val Pro Ser Ala Ala
65 70 75 80
Thr Leu Ala Ile Gly Glu Ala Cys Lys Gln Ala Gly Ile Val Val Ser
85 90 95
Ile Gly Val Asn Glu Arg Asp Gly Gly Thr Ile Tyr Asn Ala Gln Leu
100 105 110
Leu Phe Asp Ala Asp Gly Ser Leu Ile Gln His Arg Arg Lys Ile Thr
115 120 125
Pro Thr Tyr His Glu Arg Met Val Trp Gly Gln Gly Asp Gly Ser Gly
130 135 140
Leu Arg Ala Ile Asp Ser Ala Val Gly Arg Ile Gly Ser Leu Ala Cys
145 150 155 160
Trp Glu His Tyr Asn Pro Leu Ala Arg Tyr Ala Leu Met Ala Asp Gly
165 170 175
Glu Gln Ile His Ala Ala Met Phe Pro Gly Ser Leu Val Gly Asp Ile
180 185 190
Phe Ala Glu Gln Ile Glu Val Thr Ile Arg His His Ala Leu Glu Ser
195 200 205
Gly Cys Phe Val Val Asn Ala Thr Ala Trp Leu Asp Ala Asp Gln Gln
210 215 220
Gly Gln Ile Met Gln Asp Thr Gly Cys Ser Leu Gly Pro Ile Ser Gly
225 230 235 240
Gly Cys Phe Thr Ala Ile Val Ser Pro Glu Gly Lys Leu Leu Gly Glu
245 250 255
Pro Leu Arg Ser Gly Glu Gly Val Val Ile Ala Asp Leu Asp Leu Ala
260 265 270
Leu Ile Asp Lys Arg Lys Arg Met Met Asp Ser Val Gly His Tyr Ser
275 280 285
Arg Pro Glu Leu Leu Ser Leu Leu Ile Asp Arg Thr Pro Thr Ala His
290 295 300
Val His Glu Arg Ser Ala His Leu Val Ala Val Ala Thr Glu Glu Phe
305 310 315 320
Asp His Ala Asn Gln
325
<210>95
<211>966
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>95
atgtccaacg agaataacat tgctacattc aaagttgccg cagtccaggc cacacctgtg 60
tttcttgatc gtgaagcaac catcgacaaa gcttgcgcgt tgattgccac tgctggcagt 120
gaaggagcgc gcctgattgt gtttccagaa gcattcatcc caacttatcc tgaatgggta 180
tggggtattc cctccggtga gcaaggttta ctcaacgaac tctatgcaga gttgctcacc 240
aatgcggtca ccattcccag cgatgcgact gacaggctgt gcgaggctgc gcagcttgcg 300
aatgcctacg tagtgatggg catgagcgaa cggaacgtcg aggcgagtgg cgcaagcctg 360
tataatacgc tgttgtacat aaatgcgcag ggggagattt tagggaaaca tcgaaagctg 420
gtgccaacgg gcggcgaacg cctggtatgg gcgcagggtg atggcagtac gctgcaggtc 480
tacgatactc cattgggaaa actcggtggc ttaatttgct gggaaaatta tatgccgctg 540
gcacggtatg ctatgtatgc ctggggaaca caaatctatg tcgcggcaac gtgggatcgc 600
ggtcaaccct ggctttctac attaaggcat atcgccaaag aaggcagggt atacgtgatt 660
ggttgctgta tcgcgatgcg taaagacgat attccagatc gttacaccat gaagcaaaaa 720
tattatgctg aaatggatga atggatgaat gttggtgaca gtgtgattgt caatcccgag 780
gggcacttta ttgccgggcc tgtgcgcaag caggaagaaa ttctctacgc ggagattgat 840
cctcgcatgg tgcaaggccc gaagtggatg ctcgatgtgg cagggcatta tgcgagaccg 900
gatgtgttcc agttgacggt gcatacggat gtgaggcgga tgatgcgggt ggaagatgat 960
tcataa 966
<210>96
<211>321
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>96
Met Ser Asn Glu Asn Asn Ile Ala Thr Phe Lys Val Ala Ala Val Gln
1 5 10 15
Ala Thr Pro Val Phe Leu Asp Arg Glu Ala Thr Ile Asp Lys Ala Cys
20 25 30
Ala Leu Ile Ala Thr Ala Gly Ser Glu Gly Ala Arg Leu Ile Val Phe
35 40 45
Pro Glu Ala Phe Ile Pro Thr Tyr Pro Glu Trp Val Trp Gly Ile Pro
50 55 60
Ser Gly Glu Gln Gly Leu Leu Asn Glu Leu Tyr Ala Glu Leu Leu Thr
65 70 75 80
Asn Ala Val Thr Ile Pro Ser Asp Ala Thr Asp Arg Leu Cys Glu Ala
85 90 95
Ala Gln Leu Ala Asn Ala Tyr Val Val Met Gly Met Ser Glu Arg Asn
100 105 110
Val Glu Ala Ser Gly Ala Ser Leu Tyr Asn Thr Leu Leu Tyr Ile Asn
115 120 125
Ala Gln Gly Glu Ile Leu Gly Lys His Arg Lys Leu Val Pro Thr Gly
130 135 140
Gly Glu Arg Leu Val Trp Ala Gln Gly Asp Gly Ser Thr Leu Gln Val
145 150 155 160
Tyr Asp Thr Pro Leu Gly Lys Leu Gly Gly Leu Ile Cys Trp Glu Asn
165 170 175
Tyr Met Pro Leu Ala Arg Tyr Ala Met Tyr Ala Trp Gly Thr Gln Ile
180 185 190
Tyr Val Ala Ala Thr Trp Asp Arg Gly Gln Pro Trp Leu Ser Thr Leu
195 200 205
Arg His Ile Ala Lys Glu Gly Arg Val Tyr Val Ile Gly Cys Cys Ile
210 215 220
Ala Met Arg Lys Asp Asp Ile Pro Asp Arg Tyr Thr Met Lys Gln Lys
225 230 235 240
Tyr Tyr Ala Glu Met Asp Glu Trp Met Asn Val Gly Asp Ser Val Ile
245 250 255
Val Asn Pro Glu Gly His Phe Ile Ala Gly Pro Val Arg Lys Gln Glu
260 265 270
Glu Ile Leu Tyr Ala Glu Ile Asp Pro Arg Met Val Gln Gly Pro Lys
275 280 285
Trp Met Leu Asp Val Ala Gly His Tyr Ala Arg Pro Asp Val Phe Gln
290 295 300
Leu Thr Val His Thr Asp Val Arg Arg Met Met Arg Val Glu Asp Asp
305 310 315 320
Ser
<210>97
<211>1017
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>97
atgggcatcg aacatccgaa atacaaggtc gccgtggtgc aagctgcgcc cgcctggctc 60
gacctcgacg cgtcgatcga caagacgatc gggctgatcg aggaggcggc gaagaaaggc 120
gccaagctga tcgctttccc cgaagccttc attcccggct acccttggca catctggctc 180
gactcacccg cctgggcgat cggccgcggt ttcgtgcagc gctatttcga caattcgctc 240
gcctacgaca gcccacaggc ggaaaggctg cgacaggccg tgcggaaggc caagctcacc 300
gccgtgatcg gcctgtccga gcgcgacggc ggcagcctct atctcgcgca gtggctgatc 360
gggcccgacg gtgagaccat cgcaaagcgc cgcaagctgc ggccgaccca tgccgagcgc 420
accgtctatg gcgaaggcga cggcagcgat ctcgccgtcc atgagcgggc cgacatcggc 480
cggctcggcg cgctgtgctg ctgggagcat ctgcagccgc tgtcgaaatt cgccatgtac 540
gcccagaacg agcaggtaca tgtcgcggcc tggccgagct tctcgctcta cgatcccttc 600
gcgcctgcgc tgggcgcgga ggtgaacaac gccgcctccc gcatctatgc ggtggaaggc 660
tcctgcttcg tgctcgcacc gtgcgcgacg gtctcgcagg ccatgatcga cgagctctgc 720
gatcggccgg acaagcacgc gctgctgcat gccggcggcg gcttcgccgc gatctacggg 780
cccgacggca gccagatcgg cgacaagctg ccgcccgagc aggagggcct gctgatcgcc 840
gagatcgatc tgggcgcgat cggcgtcgcc aagaacgcgg ccgatcccgc cgggcattat 900
tcgcggcccg acgtcacgcg gctcctgctc aacaggaagc cgaacaagcg cgtggagcag 960
ttcgcgctgc ccgtcgacac ggtcgagccc gtcgacgtcg cggcggcagc aagctga 1017
<210>98
<211>338
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>98
Met Gly Ile Glu His Pro Lys Tyr Lys Val Ala Val Val Gln Ala Ala
1 5 10 15
Pro Ala Trp Leu Asp Leu Asp Ala Ser Ile Asp Lys Thr Ile Gly Leu
20 25 30
Ile Glu Glu Ala Ala Lys Lys Gly Ala Lys Leu Ile Ala Phe Pro Glu
35 40 45
Ala Phe Ile Pro Gly Tyr Pro Trp His Ile Trp Leu Asp Ser Pro Ala
50 55 60
Trp Ala Ile Gly Arg Gly Phe Val Gln Arg Tyr Phe Asp Asn Ser Leu
65 70 75 80
Ala Tyr Asp Ser Pro Gln Ala Glu Arg Leu Arg Gln Ala Val Arg Lys
85 90 95
Ala Lys Leu Thr Ala Val Ile Gly Leu Ser Glu Arg Asp Gly Gly Ser
100 105 110
Leu Tyr Leu Ala Gln Trp Leu Ile Gly Pro Asp Gly Glu Thr Ile Ala
115 120 125
Lys Arg Arg Lys Leu Arg Pro Thr His Ala Glu Arg Thr Val Tyr Gly
130 135 140
Glu Gly Asp Gly Ser Asp Leu Ala Val His Glu Arg Ala Asp Ile Gly
145 150 155 160
Arg Leu Gly Ala Leu Cys Cys Trp Glu His Leu Gln Pro Leu Ser Lys
165 170 175
Phe Ala Met Tyr Ala Gln Asn Glu Gln Val His Val Ala Ala Trp Pro
180 185 190
Ser Phe Ser Leu Tyr Asp Pro Phe Ala Pro Ala Leu Gly Ala Glu Val
195 200 205
Asn Asn Ala Ala Ser Arg Ile Tyr Ala Val Glu Gly Ser Cys Phe Val
210 215 220
Leu Ala Pro Cys Ala Thr Val Ser Gln Ala Met Ile Asp Glu Leu Cys
225 230 235 240
Asp Arg Pro Asp Lys His Ala Leu Leu His Ala Gly Gly Gly Phe Ala
245 250 255
Ala Ile Tyr Gly Pro Asp Gly Ser Gln Ile Gly Asp Lys Leu Pro Pro
260 265 270
Glu Gln Glu Gly Leu Leu Ile Ala Glu Ile Asp Leu Gly Ala Ile Gly
275 280 285
Val Ala Lys Asn Ala Ala Asp Pro Ala Gly His Tyr Ser Arg Pro Asp
290 295 300
Val Thr Arg Leu Leu Leu Asn Arg Lys Pro Asn Lys Arg Val Glu Gln
305 310 315 320
Phe Ala Leu Pro Val Asp Thr Val Glu Pro Val Asp Val Ala Ala Ala
325 330 335
Ala Ser
<210>99
<211>1014
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>99
atgcctgaca agagaatcgt ccgcgccgcc gcggtccaga tagcaccgga cctcgaacgg 60
cccggtggca cgctcgagaa ggtcctcgag acgatcgacg acgccgcacg ccagggcgtg 120
cagctcatcg tcttccccga gaccttcctg ccctactacc cgtacttttc gttcgtgcgg 180
gcgccggtgg catcgggtgc agagcacatg cggctctatg acgaagcggt ggtcgtgccc 240
gggccggtga cgcatgcggt ggccgagcgg gcacggcggc acggcatggt cgtcgtgctc 300
ggcgtgaacg agcgcgatca cggcagctta tacaacgcac aactgatctt cgataccgac 360
ggcgagctgc tgctcaagcg ccgcaagatc acgccgacgt ttcacgaacg gatgatctgg 420
ggcatgggcg acgcagccgg cctgaaggta gcggaaacgc gtatcggccg ggtgggtgca 480
ctcgcttgct gggaacacta caacccgctt gcacgttatg cactgatgac ccagcacgaa 540
gagattcatt gcagccagtt tcccggctcg ctggtcggac ccatcttcgg tgaacagatc 600
gaagtgacca tccggcatca cgcactggaa tccggctgct tcgtgatcaa ttccaccggc 660
tggctgaccg agccgcagat cgagtcgatc acgaaagatc cgggcctgca gaaggcgctt 720
cgcggcggct gcaacacggc gatcatctcg cccgaaggcc agcatctcgc cccgccgctg 780
cgtgagggcg agggcatggt catcgctgac ctggacatgt cgctgatcac caaacgcaaa 840
cgcatgatgg attctgtcgg ccactacgcg cggcccgaac tgctgagcct cgccatcaac 900
gaccggccgg cggtcacgtc ggcacccatg aacagcttct catcttcaac cgggggattg 960
caccttgaac gcgaacgaga ccttgtcggc cgtgagccgg caattgatga ctga 1014
<210>100
<211>337
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>100
Met Pro Asp Lys Arg Ile Val Arg Ala Ala Ala Val Gln Ile Ala Pro
1 5 10 15
Asp Leu Glu Arg Pro Gly Gly Thr Leu Glu Lys Val Leu Glu Thr Ile
20 25 30
Asp Asp Ala Ala Arg Gln Gly Val Gln Leu Ile Val Phe Pro Glu Thr
35 40 45
Phe Leu Pro Tyr Tyr Pro Tyr Phe Ser Phe Val Arg Ala Pro Val Ala
50 55 60
Ser Gly Ala Glu His Met Arg Leu Tyr Asp Glu Ala Val Val Val Pro
65 70 75 80
Gly Pro Val Thr His Ala Val Ala Glu Arg Ala Arg Arg His Gly Met
85 90 95
Val Val Val Leu Gly Val Asn Glu Arg Asp His Gly Ser Leu Tyr Asn
100 105 110
Ala Gln Leu Ile Phe Asp Thr Asp Gly Glu Leu Leu Leu Lys Arg Arg
115 120 125
Lys Ile Thr Pro Thr Phe His Glu Arg Met Ile Trp Gly Met Gly Asp
130 135 140
Ala Ala Gly Leu Lys Val Ala Glu Thr Arg Ile Gly Arg Val Gly Ala
145 150 155 160
Leu Ala Cys Trp Glu His Tyr Asn Pro Leu Ala Arg Tyr Ala Leu Met
165 170 175
Thr Gln His Glu Glu Ile His Cys Ser Gln Phe Pro Gly Ser Leu Val
180 185 190
Gly Pro Ile Phe Gly Glu Gln Ile Glu Val Thr Ile Arg His His Ala
195 200 205
Leu Glu Ser Gly Cys Phe Val Ile Asn Ser Thr Gly Trp Leu Thr Glu
210 215 220
Pro Gln Ile Glu Ser Ile Thr Lys Asp Pro Gly Leu Gln Lys Ala Leu
225 230 235 240
Arg Gly Gly Cys Asn Thr Ala Ile Ile Ser Pro Glu Gly Gln His Leu
245 250 255
Ala Pro Pro Leu Arg Glu Gly Glu Gly Met Val Ile Ala Asp Leu Asp
260 265 270
Met Ser Leu Ile Thr Lys Arg Lys Arg Met Met Asp Ser Val Gly His
275 280 285
Tyr Ala Arg Pro Glu Leu Leu Ser Leu Ala Ile Asn Asp Arg Pro Ala
290 295 300
Val Thr Ser Ala Pro Met Asn Ser Phe Ser Ser Ser Thr Gly Gly Leu
305 310 315 320
His Leu Glu Arg Glu Arg Asp Leu Val Gly Arg Glu Pro Ala Ile Asp
325 330 335
Asp
<210>101
<211>1065
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>101
atggcgacag tccatccgaa atttaaagta gccgccgtcc aggcggcccc ggcctttctc 60
gacctcgacg cgtcggtgga aaaagcggtg cgcctgattg atgaagccgg cgccgctggt 120
gcccggctca tcgcgtttcc agagactttt atccccggtt atccgtggtg gatctggctc 180
ggtgctccgg cctgggcgat catgcgcggc ttcgtctccc gctatttcga caactcgctg 240
cagtacggca ccccggaagc cgaccggctg cgggcagccg ccaaacgcaa caaaatgttc 300
gtcgcgctcg gactgtcaga gcgcgacggc ggcagtctct acatcgccca atggattatc 360
ggacccgacg gcgagacggt cgcaacgcgc cgcaagctca agcctactca cgccgagcgg 420
acggtgttcg gcgaaggcga tggctcgcac cttgcggtcc acgaacttga tatcgggcgg 480
gtcggtgcgc tgtgctgttg ggagcacctg cagccactgt cgaagtacgc gatgtatgcg 540
cagaacgagc aagttcatat cgcggcgtgg ccgagctttt cgctttacga tccgttcgcg 600
catgcgcttg gcgccgaggt caacaacgcg gcgagcaaga tctacgcggt cgaaggctca 660
tgctttgtga ttgcgccatg cgcgaccgtt tcccaggcga tgatcgacga attgtgtgac 720
tcgcccgaga agcatcagtt cctgcacgtc ggcggcggtt tcgccgtgat ctatggtccc 780
gacggcgcgc cactcgccaa gccactggcg cccgatcagg agggtctcct ttacgcggat 840
atcgacctcg gcatgatttc ggtcgcgaaa gcggcggccg atccggctgg acattacgcg 900
cgcccggacg tgacccgtct gttgttcaac aatcgtcctg ggaaccgggt ggagacactc 960
gcgctgccgg tcgaccagga ggcagaggcg ggagcaggcg gcaaacctgc gcccaagtca 1020
ccgagtgtcg ctgcgttcac actgacgcag gcggcagccg agtag 1065
<210>102
<211>354
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>102
Met Ala Thr Val His Pro Lys Phe Lys Val Ala Ala Val Gln Ala Ala
1 5 10 15
Pro Ala Phe Leu Asp Leu Asp Ala Ser Val Glu Lys Ala Val Arg Leu
20 25 30
Ile Asp Glu Ala Gly Ala Ala Gly Ala Arg Leu Ile Ala Phe Pro Glu
35 40 45
Thr Phe Ile Pro Gly Tyr Pro Trp Trp Ile Trp Leu Gly Ala Pro Ala
50 55 60
Trp Ala Ile Met Arg Gly Phe Val Ser Arg Tyr Phe Asp Asn Ser Leu
65 70 75 80
Gln Tyr Gly Thr Pro Glu Ala Asp Arg Leu Arg Ala Ala Ala Lys Arg
85 90 95
Asn Lys Met Phe Val Ala Leu Gly Leu Ser Glu Arg Asp Gly Gly Ser
100 105 110
Leu Tyr Ile Ala Gln Trp Ile Ile Gly Pro Asp Gly Glu Thr Val Ala
115 120 125
Thr Arg Arg Lys Leu Lys Pro Thr His Ala Glu Arg Thr Val Phe Gly
130 135 140
Glu Gly Asp Gly Ser His Leu Ala Val His Glu Leu Asp Ile Gly Arg
145 150 155 160
Val Gly Ala Leu Cys Cys Trp Glu His Leu Gln Pro Leu Ser Lys Tyr
165 170 175
Ala Met Tyr Ala Gln Asn Glu Gln Val His Ile Ala Ala Trp Pro Ser
180 185 190
Phe Ser Leu Tyr Asp Pro Phe Ala His Ala Leu Gly Ala Glu Val Asn
195 200 205
Asn Ala Ala Ser Lys Ile Tyr Ala Val Glu Gly Ser Cys Phe Val Ile
210 215 220
Ala Pro Cys Ala Thr Val Ser Gln Ala Met Ile Asp Glu Leu Cys Asp
225 230 235 240
Ser Pro Glu Lys His Gln Phe Leu His Val Gly Gly Gly Phe Ala Val
245 250 255
Ile Tyr Gly Pro Asp Gly Ala Pro Leu Ala Lys Pro Leu Ala Pro Asp
260 265 270
Gln Glu Gly Leu Leu Tyr Ala Asp Ile Asp Leu Gly Met Ile Ser Val
275 280 285
Ala Lys Ala Ala Ala Asp Pro Ala Gly His Tyr Ala Arg Pro Asp Val
290 295 300
Thr Arg Leu Leu Phe Asn Asn Arg Pro Gly Asn Arg Val Glu Thr Leu
305 310 315 320
Ala Leu Pro Val Asp Gln Glu Ala Glu Ala Gly Ala Gly Gly Lys Pro
325 330 335
Ala Pro Lys Ser Pro Ser Val Ala Ala Phe Thr Leu Thr Gln Ala Ala
340 345 350
Ala Glu
<210>103
<211>945
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>103
atgggcgagt tcggcgaggt gacgctgggg gtggcgcagg cggcgccggt gtacttogac 60
cgggaggcgt cgacggagaa ggctcgcggc ctgatccggg aggcggggga gaagggcgtc 120
gacctgttgg cgttcgggga gacgtggctg acggggtacc cgtactggaa ggatgcgccg 180
tggtctcggg agtacaacga cctgcgcgcg cggtacgtgg cgaatggcgt gatgataccg 240
gggccggaga cggacgcgct atgccaggca gcggcggaag cgggggtgga cgtggcaatc 300
ggcgtggtgg agctggagcc ggggagcctt tcgagcgtgt attgcacgtt gctgttcatc 360
tcgcgcgagg gcgagatcct ggggcggcac cggaagctga agccgacgga ttcggaacgg 420
cggtactggt cagagggtga tgcgacgggg ctgcgggtgt acgagcggcc atatggccgg 480
ttgagcggat tgaactgctg ggaacacctt atgatgttgc cggggtacgc gctggcggca 540
caggggacgc agtttcatgt ggcagcgtgg ccgaacatgg cgagctcggc gagcgagctg 600
ctgtcgcggg cgtatgcgta ccaggccgga tgctacgtgt tgtgcgcggg cgggctcggg 660
cctgcgccgg gagagctacc ggacggcatc gcggcggagt cgctggacca cctgacgggc 720
gagagctgca tcatcgaccc gtggggaaaa gtgatcgcgg ggccggtgtc gtgcgaggag 780
acgctgatta cggcgcgggt atcgaccgcg tcaatctacc ggcgcaagtc gctgacggac 840
gtgggtggcc actactcgcg accggacgtg ttccggttcg aggtggatag gtcggagcgc 900
ccgcgagtgg tgtttcggga tggggatgtg gacgaccggg ggtaa 945
<210>104
<211>314
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>104
Met Gly Glu Phe Gly Glu Val Thr Leu Gly Val Ala Gln Ala Ala Pro
1 5 10 15
Val Tyr Phe Asp Arg Glu Ala Ser Thr Glu Lys Ala Arg Gly Leu Ile
20 25 30
Arg Glu Ala Gly Glu Lys Gly Val Asp Leu Leu Ala Phe Gly Glu Thr
35 40 45
Trp Leu Thr Gly Tyr Pro Tyr Trp Lys Asp Ala Pro Trp Ser Arg Glu
50 55 60
Tyr Asn Asp Leu Arg Ala Arg Tyr Val Ala Asn Gly Val Met Ile Pro
65 70 75 80
Gly Pro Glu Thr Asp Ala Leu Cys Gln Ala Ala Ala Glu Ala Gly Val
85 90 95
Asp Val Ala Ile Gly Val Val Glu Leu Glu Pro Gly Ser Leu Ser Ser
100 105 110
Val Tyr Cys Thr Leu Leu Phe Ile Ser Arg Glu Gly Glu Ile Leu Gly
115 120 125
Arg His Arg Lys Leu Lys Pro Thr Asp Ser Glu Arg Arg Tyr Trp Ser
130 135 140
Glu Gly Asp Ala Thr Gly Leu Arg Val Tyr Glu Arg Pro Tyr Gly Arg
145 150 155 160
Leu Ser Gly Leu Asn Cys Trp Glu His Leu Met Met Leu Pro Gly Tyr
165 170 175
Ala Leu Ala Ala Gln Gly Thr Gln Phe His Val Ala Ala Trp Pro Asn
180 185 190
Met Ala Ser Ser Ala Ser Glu Leu Leu Ser Arg Ala Tyr Ala Tyr Gln
195 200 205
Ala Gly Cys Tyr Val Leu Cys Ala Gly Gly Leu Gly Pro Ala Pro Gly
210 215 220
Glu Leu Pro Asp Gly Ile Ala Ala Glu Ser Leu Asp His Leu Thr Gly
225 230 235 240
Glu Ser Cys Ile Ile Asp Pro Trp Gly Lys Val Ile Ala Gly Pro Val
245 250 255
Ser Cys Glu Glu Thr Leu Ile Thr Ala Arg Val Ser Thr Ala Ser Ile
260 265 270
Tyr Arg Arg Lys Ser Leu Thr Asp Val Gly Gly His Tyr Ser Arg Pro
275 280 285
Asp Val Phe Arg Phe Glu Val Asp Arg Ser Glu Arg Pro Arg Val Val
290 295 300
Phe Arg Asp Gly Asp Val Asp Asp Arg Gly
305 310
<210>105
<211>975
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>105
atgaccattg tcaaagccgc tgccgtccag attgcccccg ttctctacag ccgtgaaggc 60
actgtagaaa aggtcgttaa caagattcgc gaactcggcg agaagggcgt gcagttcgcc 120
gttttccctg aaaccgtcgt accgtactac ccgtactttt cctttgtgca gagccctttc 180
aaaatgggtt ccgagcacta caaattgctc gaccaggccg ttgtcgtgcc gtcggcgacc 240
accgatgcca tcggcaaagc ggccaaggaa gccaacatgg tggtgtccat cggcgtcaac 300
gaacgcgatg gcagcaccct ctacaacacg cagttgctgt ttgatgccga cggcactttg 360
attcaggccc gtcgcaagat ttcaccgacc taccacgaac gcatgatctg gggcatgggc 420
gacggttccg gcctgcgcgc caccgacagc gcggtcgggc gcatcggaca attggcctgc 480
tgggaacatt acaatccgct ggcgcgttac gccttgatcg aagacggcga acagatccac 540
gcctcgatgt acccgggctc gttcgcaggt cctttattca ctcgccagat ggaagtcagc 600
atccgcatgc atgccctgga atcggcgtgc ttcgtggtca actcgaccgc gtggttgtac 660
ccggaacagc aagcccagat catggccgac accggttgcg agatcgggcc gatctccggc 720
ggctgctaca ccgcgatcat cgacccacag ggtgaagtcg tcggcgcact gaccgaaggc 780
gagggcgaag tgattgccga catcgatctg ttccagatcg aaatccgtaa acgtcagatg 840
gacggccgtg gtcactacag ccgtccggaa atcctgagcc tgaacatcga ccgtacgccg 900
catcgccatg ttcacgaacg caacgaccag cagaaaccgg gtgtgatcga cactgctgaa 960
gaaaccgggc gttga 975
<210>106
<211>324
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>106
Met Thr Ile Val Lys Ala Ala Ala Val Gln Ile Ala Pro Val Leu Tyr
1 5 10 15
Ser Arg Glu Gly Thr Val Glu Lys Val Val Asn Lys Ile Arg Glu Leu
20 25 30
Gly Glu Lys Gly Val Gln Phe Ala Val Phe Pro Glu Thr Val Val Pro
35 40 45
Tyr Tyr Pro Tyr Phe Ser Phe Val Gln Ser Pro Phe Lys Met Gly Ser
50 55 60
Glu His Tyr Lys Leu Leu Asp Gln Ala Val Val Val Pro Ser Ala Thr
65 70 75 80
Thr Asp Ala Ile Gly Lys Ala Ala Lys Glu Ala Asn Met Val Val Ser
85 90 95
Ile Gly Val Asn Glu Arg Asp Gly Ser Thr Leu Tyr Asn Thr Gln Leu
100 105 110
Leu Phe Asp Ala Asp Gly Thr Leu Ile Gln Ala Arg Arg Lys Ile Ser
115 120 125
Pro Thr Tyr His Glu Arg Met Ile Trp Gly Met Gly Asp Gly Ser Gly
130 135 140
Leu Arg Ala Thr Asp Ser Ala Val Gly Arg Ile Gly Gln Leu Ala Cys
145 150 155 160
Trp Glu His Tyr Asn Pro Leu Ala Arg Tyr Ala Leu Ile Glu Asp Gly
165 170 175
Glu Gln Ile His Ala Ser Met Tyr Pro Gly Ser Phe Ala Gly Pro Leu
180 185 190
Phe Thr Arg Gln Met Glu Val Ser Ile Arg Met His Ala Leu Glu Ser
195 200 205
Ala Cys Phe Val Val Asn Ser Thr Ala Trp Leu Tyr Pro Glu Gln Gln
210 215 220
Ala Gln Ile Met Ala Asp Thr Gly Cys Glu Ile Gly Pro Ile Ser Gly
225 230 235 240
Gly Cys Tyr Thr Ala Ile Ile Asp Pro Gln Gly Glu Val Val Gly Ala
245 250 255
Leu Thr Glu Gly Glu Gly Glu Val Ile Ala Asp Ile Asp Leu Phe Gln
260 265 270
Ile Glu Ile Arg Lys Arg Gln Met Asp Gly Arg Gly His Tyr Ser Arg
275 280 285
Pro Glu Ile Leu Ser Leu Asn Ile Asp Arg Thr Pro His Arg His Val
290 295 300
His Glu Arg Asn Asp Gln Gln Lys Pro Gly Val Ile Asp Thr Ala Glu
305 310 315 320
Glu Thr Gly Arg
<210>107
<211>981
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>107
atggccatca ttcgcgcagc agccgtacag atcagcccgg ttctttacag ccgcgaaggc 60
accgtggaca aggtctgcca gcagatcatc acccttggca aacagggtgt gcagttcgcc 120
gtgttcccgg aaacggtggt gccgtactac ccctattttt cctttgtgca gccggcgttc 180
gccatgggtg cgcaacacct caaattgcta gatcaatctg taaccgtgcc atcggccgcc 240
accctggcga ttggcgaagc gtgcaagcaa gcaggaatgg tcgtttccat cggagtcaat 300
gaacgcgatg gcggtacgat ttacaacgcg caattactct tcgatgctga cggcacgctg 360
attcagcatc ggcgcaaaat caccccgacc taccacgagc gcatggtctg ggggcagggc 420
gatggttccg gtctgcgcgc catcgacagc gcggtcgggc gcatcggctc cctggcatgc 480
tgggaacatt acaacccgct ggcccgttac gccttgatgg cagacggcga acagatccac 540
gccgcgatgt ttcccggttc cctggtgggt gacatcttcg ccgagcagat cgaggtcacc 600
atccgccatc acgcattgga gtcaggatgc ttcgtggtca atgcaacagc ctggctggat 660
gcggatcagc agggccaaat aatgcaggac acaggttgcg gccttggtcc catctcgggc 720
ggctgcttca ccgcgatcgt atcgccggaa gggaagctac ttggagagcc gcttcgctcc 780
ggggaaggcg tagtgattgc cgacctcgat acggccttga tcgacaagcg caaacggatg 840
atggattcag taggtcatta cagtcgtccc gagctgctca gcctattgat cgatcgatcg 900
ccgactgcgc atgttcatga acgcgccggc tttgtttcga gcaacgccgg tttgcaggag 960
gtcgcccatg cagaccaatg a 981
<210>108
<211>326
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>108
Met Ala Ile Ile Arg Ala Ala Ala Val Gln Ile Ser Pro Val Leu Tyr
1 5 10 15
Ser Arg Glu Gly Thr Val Asp Lys Val Cys Gln Gln Ile Ile Thr Leu
20 25 30
Gly Lys Gln Gly Val Gln Phe Ala Val Phe Pro Glu Thr Val Val Pro
35 40 45
Tyr Tyr Pro Tyr Phe Ser Phe Val Gln Pro Ala Phe Ala Met Gly Ala
50 55 60
Gln His Leu Lys Leu Leu Asp Gln Ser Val Thr Val Pro Ser Ala Ala
65 70 75 80
Thr Leu Ala Ile Gly Glu Ala Cys Lys Gln Ala Gly Met Val Val Ser
85 90 95
Ile Gly Val Asn Glu Arg Asp Gly Gly Thr Ile Tyr Asn Ala Gln Leu
100 105 110
Leu Phe Asp Ala Asp Gly Thr Leu Ile Gln His Arg Arg Lys Ile Thr
115 120 125
Pro Thr Tyr His Glu Arg Met Val Trp Gly Gln Gly Asp Gly Ser Gly
130 135 140
Leu Arg Ala Ile Asp Ser Ala Val Gly Arg Ile Gly Ser Leu Ala Cys
145 150 155 160
Trp Glu His Tyr Asn Pro Leu Ala Arg Tyr Ala Leu Met Ala Asp Gly
165 170 175
Glu Gln Ile His Ala Ala Met Phe Pro Gly Ser Leu Val Gly Asp Ile
180 185 190
Phe Ala Glu Gln Ile Glu Val Thr Ile Arg His His Ala Leu Glu Ser
195 200 205
Gly Cys Phe Val Val Asn Ala Thr Ala Trp Leu Asp Ala Asp Gln Gln
210 215 220
Gly Gln Ile Met Gln Asp Thr Gly Cys Gly Leu Gly Pro Ile Ser Gly
225 230 235 240
Gly Cys Phe Thr Ala Ile Val Ser Pro Glu Gly Lys Leu Leu Gly Glu
245 250 255
Pro Leu Arg Ser Gly Glu Gly Val Val Ile Ala Asp Leu Asp Thr Ala
260 265 270
Leu Ile Asp Lys Arg Lys Arg Met Met Asp Ser Val Gly His Tyr Ser
275 280 285
Arg Pro Glu Leu Leu Ser Leu Leu Ile Asp Arg Ser Pro Thr Ala His
290 295 300
Val His Glu Arg Ala Gly Phe Val Ser Ser Asn Ala Gly Leu Gln Glu
305 310 315 320
Val Ala His Ala Asp Gln
325
<210>109
<211>1092
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>109
atggccatca ttcgcgcagc agccgtacag atcagcccgg ttctttacag ccgcgaaggc 60
accgtggaca gggtctgcca gcagatcatc acccttggca aacaaggtgt gcagttcgcc 120
gtgttcccgg aaacggtggt gccgtactac ccctattttt cctttgtgca gccggcattt 180
gcgatgggtg cacaacacct caaattgctc gatcaatctg taaccgtgcc atcggccgcc 240
accctggcga ttggcgaagc gtgcaagcaa gcaggaatgg tcgtttccat cggcgtcaat 300
gaacgcgatg gcggtacgat ttacaacgcg caattactct tcgatgctga cggcactctg 360
attcagcatc ggcgcaaaat caccccgacc taccacgagc gcatggtctg ggggcagggc 420
gatggttccg gtctgcgcgc catcgacagc gcggtcgggc gcatcggctc cctggcatgc 480
tgggaacatt acaacccgct ggcccgttac gccttgatgg cagacggcga acagatccac 540
gccgcgatgt ttcccggttc cctggtgggt gacatcttcg ccgagcagat cgaggtcacc 600
atccgccatc acgcattgga atcaggatgc ttcgtggtca atgcaacagc ttggctggat 660
gcggatcagc agggccaaat aatgcaggac acaggttgcg gccttggtcc catctcgggc 720
ggctgcttca ccgcgatcgt atcgccggaa gggaagctac ttggagagcc gcttcgctca 780
ggggaaggcg tagtgattgc cgacctcgat atggccttga tcgacaagcg caaacggatg 840
atggattcag taggtcatta cagtcgtccc gagctgctca gcctattgat cgatcgatcg 900
ccgactgcgc attttcatga acgcgccggg ctttgttccg agcgacgccg gtttgcagga 960
ggtcgcgcat gcagaccaat gaattgctcg ctgacctgca aatccaaggc ctgcgttggc 1020
cggccgcgca aatggcttgt cgcgccaagg cggcgccggt ccttcagacc acaaggcgct 1080
gagcctaggt aa 1092
<210>110
<211>363
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>110
Met Ala Ile Ile Arg Ala Ala Ala Val Gln Ile Ser Pro Val Leu Tyr
1 5 10 15
Ser Arg Glu Gly Thr Val Asp Arg Val Cys Gln Gln Ile Ile Thr Leu
20 25 30
Gly Lys Gln Gly Val Gln Phe Ala Val Phe Pro Glu Thr Val Val Pro
35 40 45
Tyr Tyr Pro Tyr Phe Ser Phe Val Gln Pro Ala Phe Ala Met Gly Ala
50 55 60
Gln His Leu Lys Leu Leu Asp Gln Ser Val Thr Val Pro Ser Ala Ala
65 70 75 80
Thr Leu Ala Ile Gly Glu Ala Cys Lys Gln Ala Gly Met Val Val Ser
85 90 95
Ile Gly Val Asn Glu Arg Asp Gly Gly Thr Ile Tyr Asn Ala Gln Leu
100 105 110
Leu Phe Asp Ala Asp Gly Thr Leu Ile Gln His Arg Arg Lys Ile Thr
115 120 125
Pro Thr Tyr His Glu Arg Met Val Trp Gly Gln Gly Asp Gly Ser Gly
130 135 140
Leu Arg Ala Ile Asp Ser Ala Val Gly Arg Ile Gly Ser Leu Ala Cys
145 150 155 160
Trp Glu His Tyr Asn Pro Leu Ala Arg Tyr Ala Leu Met Ala Asp Gly
165 170 175
Glu Gln Ile His Ala Ala Met Phe Pro Gly Ser Leu Val Gly Asp Ile
180 185 190
Phe Ala Glu Gln Ile Glu Val Thr Ile Arg His His Ala Leu Glu Ser
195 200 205
Gly Cys Phe Val Val Asn Ala Thr Ala Trp Leu Asp Ala Asp Gln Gln
210 215 220
Gly Gln Ile Met Gln Asp Thr Gly Cys Gly Leu Gly Pro Ile Ser Gly
225 230 235 240
Gly Cys Phe Thr Ala Ile Val Ser Pro Glu Gly Lys Leu Leu Gly Glu
245 250 255
Pro Leu Arg Ser Gly Glu Gly Val Val Ile Ala Asp Leu Asp Met Ala
260 265 270
Leu Ile Asp Lys Arg Lys Arg Met Met Asp Ser Val Gly His Tyr Ser
275 280 285
Arg Pro Glu Leu Leu Ser Leu Leu Ile Asp Arg Ser Pro Thr Ala His
290 295 300
Phe His Glu Arg Ala Gly Leu Cys Ser Glu Arg Arg Arg Phe Ala Gly
305 310 315 320
Gly Arg Ala Cys Arg Pro Met Asn Cys Ser Leu Thr Cys Lys Ser Lys
325 330 335
Ala Cys Val Gly Arg Pro Arg Lys Trp Leu Val Ala Pro Arg Arg Arg
340 345 350
Arg Ser Phe Arg Pro Gln Gly Ala Glu Pro Arg
355 360
<210>111
<211>990
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>111
atgcccaaaa cagtacgtgc cgcagcagtc cagatcgcgc ccgacctgac gtcacgcgcc 60
ggcaccgtcg agcgggtcct caatgcaatc gccgaagctg ctgacaaagg cgccgagctg 120
atcgtatttc ccgagacctt cgtgccctgg tatccctatt tcagtttcgt tctgccacct 180
gtccagcaag gccctgagca tcttcgtctt tatgaggaag cagtcacggt accatcagca 240
gaaacacggg ccgtcgcgga cgccgcgcgc aaacgcaatg cggttatcgt ccttggcgtc 300
aatgagcgcg accacggctc gctctataac actcagctga tcttcgacgc ggatggcagc 360
ctgaaactca agcgtcgcaa gatcacgccg acctatcacg aacggatgat ctggggccaa 420
ggcgatggcg ccggcctgaa ggttgtcgac actgccgtcg gtcgcgtggg tgccctggca 480
tgctgggagc attacaatcc tctggcccgc tatactttga tggcccagca tgaggaaatt 540
cacgcctctc atttcccggg ctcactggtc ggcccgatat tcggcgagca aatcgaagtc 600
accatgcgcc accacgcgtt ggaatcgggc tgtttcgtgg tcaatgccac cggctggctg 660
agcgaggagc agatcgcatc tattcatccg gaccccgcct tgcaaaaggg cctgcgcgat 720
ggctgcatga cctgcatcat cacgccggaa ggacgccatg tcgtaccgcc gctgacctcg 780
ggcgaaggca tcctgatcgg cgatctggac atgcggctca ttaccaagcg caagcggatg 840
atggattcgg tcggacacta tgctcggcct gaactgctgc accttgtcca tgacacgacg 900
cccgcacgcg cacgcgagca ggtcggcctt tcaggcgatt ttcccgatgc ggagcaagac 960
aagctatttg aggaggttca taatgcgtga 990
<210>112
<211>329
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>112
Met Pro Lys Thr Val Arg Ala Ala Ala Val Gln Ile Ala Pro Asp Leu
1 5 10 15
Thr Ser Arg Ala Gly Thr Val Glu Arg Val Leu Asn Ala Ile Ala Glu
20 25 30
Ala Ala Asp Lys Gly Ala Glu Leu Ile Val Phe Pro Glu Thr Phe Val
35 40 45
Pro Trp Tyr Pro Tyr Phe Ser Phe Val Leu Pro Pro Val Gln Gln Gly
50 55 60
Pro Glu His Leu Arg Leu Tyr Glu Glu Ala Val Thr Val Pro Ser Ala
65 70 75 80
Glu Thr Arg Ala Val Ala Asp Ala Ala Arg Lys Arg Asn Ala Val Ile
85 90 95
Val Leu Gly Val Asn Glu Arg Asp His Gly Ser Leu Tyr Asn Thr Gln
100 105 110
Leu Ile Phe Asp Ala Asp Gly Ser Leu Lys Leu Lys Arg Arg Lys Ile
115 120 125
Thr Pro Thr Tyr His Glu Arg Met Ile Trp Gly Gln Gly Asp Gly Ala
130 135 140
Gly Leu Lys Val Val Asp Thr Ala Val Gly Arg Val Gly Ala Leu Ala
145 150 155 160
Cys Trp Glu His Tyr Asn Pro Leu Ala Arg Tyr Thr Leu Met Ala Gln
165 170 175
His Glu Glu Ile His Ala Ser His Phe Pro Gly Ser Leu Val Gly Pro
180 185 190
Ile Phe Gly Glu Gln Ile Glu Val Thr Met Arg His His Ala Leu Glu
195 200 205
Ser Gly Cys Phe Val Val Asn Ala Thr Gly Trp Leu Ser Glu Glu Gln
210 215 220
Ile Ala Ser Ile His Pro Asp Pro Ala Leu Gln Lys Gly Leu Arg Asp
225 230 235 240
Gly Cys Met Thr Cys Ile Ile Thr Pro Glu Gly Arg His Val Val Pro
245 250 255
Pro Leu Thr Ser Gly Glu Gly Ile Leu Ile Gly Asp Leu Asp Met Arg
260 265 270
Leu Ile Thr Lys Arg Lys Arg Met Met Asp Ser Val Gly His Tyr Ala
275 280 285
Arg Pro Glu Leu Leu His Leu Val His Asp Thr Thr Pro Ala Arg Ala
290 295 300
Arg Glu Gln Val Gly Leu Ser Gly Asp Phe Pro Asp Ala Glu Gln Asp
305 310 315 320
Lys Leu Phe Glu Glu Val His Asn Ala
325
<210>113
<211>993
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>113
atgacgaagg aacgcgccgc gcgcagcctg cgcgcagctg ccatacagct tgaagccgaa 60
gtcggcgaca tcgccgccaa tctcgcacgc atcgaggcga tggtcgagga ggctgcgggc 120
aagggcgccg aactgatcgc cattccggag ttctgcacct cccgcatgcc cttcgatgca 180
cgcgtgcacg acgccgtgct gccgccggac aacttcgtgg tcgatgcctt tcgccgcatg 240
gcagcgacgc acaactgccg gctcggcggc tccatgctca ttgccgacgg tggcgagatc 300
tacaaccgct accacttcgt cgaacccgac ggcagcgtgc atctgcacga caaggatctg 360
ccgacgatgt gggagaacgc cttctacacc ggcggctccg acgacggcgt cttcgacacc 420
ggcatcggcg gcgtcggcgc cgcggtgtgc tgggaactgg tacgcaccgg caccgtgcga 480
cgcatgctcg gtcgcgtcga cgtcgccatg accggcacgc attggtggac gatgccgcac 540
aactggggca gcgccgtcgc gcgcacgctg gccgcgatga cgcagtacaa ccgctacatg 600
tccgagaatg cacccaccga attcgcccgc cgcctgggtg tgccggtgct gcaggcctcg 660
cactgcggaa gcttccgcac cggtttcttg ctgetgccag gcagcgggcg tgcactgccc 720
tatgacaccg agtacgtcgg cgccacacag atcgtcgatg ccgatggcca catcctcgcc 780
caccgtcgca cgcaggaagg ccccggtgtc gtcgtcgccg acatcacgct cggtgcccgc 840
acgcccgagc tgccactgga agaccgcttc tggattcccg agctgccgct cttcctcaag 900
gcctactggc accaccagaa cctgtgcggc aagtcctact accgtcgcgt cggccgcgat 960
gccggcctgg cggcggcgga gcgttcggca tga 993
<210>114
<211>330
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>114
Met Thr Lys Glu Arg Ala Ala Arg Ser Leu Arg Ala Ala Ala Ile Gln
1 5 10 15
Leu Glu Ala Glu Val Gly Asp Ile Ala Ala Asn Leu Ala Arg Ile Glu
20 25 30
Ala Met Val Glu Glu Ala Ala Gly Lys Gly Ala Glu Leu Ile Ala Ile
35 40 45
Pro Glu Phe Cys Thr Ser Arg Met Pro Phe Asp Ala Arg Val His Asp
50 55 60
Ala Val Leu Pro Pro Asp Asn Phe Val Val Asp Ala Phe Arg Arg Met
65 70 75 80
Ala Ala Thr His Asn Cys Arg Leu Gly Gly Ser Met Leu Ile Ala Asp
85 90 95
Gly Gly Glu Ile Tyr Asn Arg Tyr His Phe Val Glu Pro Asp Gly Ser
100 105 110
Val His Leu His Asp Lys Asp Leu Pro Thr Met Trp Glu Asn Ala Phe
115 120 125
Tyr Thr Gly Gly Ser Asp Asp Gly Val Phe Asp Thr Gly Ile Gly Gly
130 135 140
Val Gly Ala Ala Val Cys Trp Glu Leu Val Arg Thr Gly Thr Val Arg
145 150 155 160
Arg Met Leu Gly Arg Val Asp Val Ala Met Thr Gly Thr His Trp Trp
165 170 175
Thr Met Pro His Asn Trp Gly Ser Ala Val Ala Arg Thr Leu Ala Ala
180 185 190
Met Thr Gln Tyr Asn Arg Tyr Met Ser Glu Asn Ala Pro Thr Glu Phe
195 200 205
Ala Arg Arg Leu Gly Val Pro Val Leu Gln Ala Ser His Cys Gly Ser
210 215 220
Phe Arg Thr Gly Phe Leu Leu Leu Pro Gly Ser Gly Arg Ala Leu Pro
225 230 235 240
Tyr Asp Thr Glu Tyr Val Gly Ala Thr Gln Ile Val Asp Ala Asp Gly
245 250 255
His Ile Leu Ala His Arg Arg Thr Gln Glu Gly Pro Gly Val Val Val
260 265 270
Ala Asp Ile Thr Leu Gly Ala Arg Thr Pro Glu Leu Pro Leu Glu Asp
275 280 285
Arg Phe Trp Ile Pro Glu Leu Pro Leu Phe Leu Lys Ala Tyr Trp His
290 295 300
His Gln Asn Leu Cys Gly Lys Ser Tyr Tyr Arg Arg Val Gly Arg Asp
305 310 315 320
Ala Gly Leu Ala Ala Ala Glu Arg Ser Ala
325 330
<210>115
<211>993
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>115
atgaaccaaa tcattaaagc ggcggcagtt caatgtagcc ctgtgttgta tagccaagcg 60
ggtacagtca agaaaatctg tgacacgatt ttggagttgg ggcagcaagg tgtgcaattt 120
gccgtatttc ctgaaactgt tgtgccttat tacccttatt tttcttttgt gcaaccaccg 180
tttgccatgg gtaaagaaca tttaaagcta ttgcatgaat cggttgtcgt gccatcggca 240
gcaacaactt taattggaca ggcatgcaaa gaagcgaaca tggtggtttc tattggtatt 300
aatgagcgtg caggcggcac gatttataac gctcaattgt tgtttgatgc ggatggttcg 360
attattcagc atcgccgtaa aattacccca acgtatcatg aacgtatggt gtgggggcaa 420
ggcgatggca gtggtttacg tgcgatagat tctgctgtag gacgtattgg gtcgctggca 480
tgttgggagc attacaaccc tttggctcgg tttgctttga tggcggatgg tgagcaaatt 540
catgcggcga tgtttccggg atcactcgtg gggcagattt ttgcagatca gatcagtgcc 600
accattcagc accatgcttt agagtcgggc tgttttgtgg tgaatgccac agcatggctt 660
gacccagagc aacaacaaca aattatgcaa gatacaggct gtgaactcgg tccaatttcg 720
gggggatgtt ttacggccat cgtttctcca gaaggcaaat ttttgtctga accgatcaca 780
caaggcgaag gttatgtgat tgccgattta gacttttcct taatcgaaaa acgtaaacgg 840
atgatggatt ctgttgggca ttatagtcgt ccagaattac tcagtttgtt gattgatcgt 900
cgtcctacct cagttttgca tgagttaaaa ctagagaatc catcgaataa cagcatcgaa 960
aaagtgtctg aatttgccga ggtacacgca tag 993
<210>116
<211>330
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>116
Met Asn Gln Ile Ile Lys Ala Ala Ala Val Gln Cys Ser Pro Val Leu
1 5 10 15
Tyr Ser Gln Ala Gly Thr Val Lys Lys Ile Cys Asp Thr Ile Leu Glu
20 25 30
Leu Gly Gln Gln Gly Val Gln Phe Ala Val Phe Pro Glu Thr Val Val
35 40 45
Pro Tyr Tyr Pro Tyr Phe Ser Phe Val Gln Pro Pro Phe Ala Met Gly
50 55 60
Lys Glu His Leu Lys Leu Leu His Glu Ser Val Val Val Pro Ser Ala
65 70 75 80
Ala Thr Thr Leu Ile Gly Gln Ala Cys Lys Glu Ala Asn Met Val Val
85 90 95
Ser Ile Gly Ile Asn Glu Arg Ala Gly Gly Thr Ile Tyr Asn Ala Gln
100 105 110
Leu Leu Phe Asp Ala Asp Gly Ser Ile Ile Gln His Arg Arg Lys Ile
115 120 125
Thr Pro Thr Tyr His Glu Arg Met Val Trp Gly Gln Gly Asp Gly Ser
130 135 140
Gly Leu Arg Ala Ile Asp Ser Ala Val Gly Arg Ile Gly Ser Leu Ala
145 150 155 160
Cys Trp Glu His Tyr Asn Pro Leu Ala Arg Phe Ala Leu Met Ala Asp
165 170 175
Gly Glu Gln Ile His Ala Ala Met Phe Pro Gly Ser Leu Val Gly Gln
180 185 190
Ile Phe Ala Asp Gln Ile Ser Ala Thr Ile Gln His His Ala Leu Glu
195 200 205
Ser Gly Cys Phe Val Val Asn Ala Thr Ala Trp Leu Asp Pro Glu Gln
210 215 220
Gln Gln Gln Ile Met Gln Asp Thr Gly Cys Glu Leu Gly Pro Ile Ser
225 230 235 240
Gly Gly Cys Phe Thr Ala Ile Val Ser Pro Glu Gly Lys Phe Leu Ser
245 250 255
Glu Pro Ile Thr Gln Gly Glu Gly Tyr Val Ile Ala Asp Leu Asp Phe
260 265 270
Ser Leu Ile Glu Lys Arg Lys Arg Met Met Asp Ser Val Gly His Tyr
275 280 285
Ser Arg Pro Glu Leu Leu Ser Leu Leu Ile Asp Arg Arg Pro Thr Ser
290 295 300
Val Leu His Glu Leu Lys Leu Glu Asn Pro Ser Asn Asn Ser Ile Glu
305 310 315 320
Lys Val Ser Glu Phe Ala Glu Val His Ala
325 330
<210>117
<211>957
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>117
atgactcaat ccaggataat tcgtgctgcg gcagcgcaga tcgctccgga tttgcaggtt 60
ccaggtaaca cgatcgacaa agtttgccgc accatcagcg aggcggccgc aaaaggcgta 120
cagattattg ttttccctga aaccttggtg ccttattacc cttacttctc ttacatttca 180
ccgcccattc aacagggcaa agaacatttg cggctgtatg accatgcagt ggttgtgccc 240
ggctcggaaa ccgaggcaat ttcagctctt gccgcccaac acaatatggt ggtggttttg 300
ggtgtgaacg agcgcgatca cggcacactt tacaacgcac aaattatttt caacagcgac 360
ggaaagattc tgttgaagcg ccgaaaaatt acaccaactt atcacgagcg gatggtgtgg 420
gggcagggtg acgcttcagg cttgaaggtg gttgattccg cagtgggccg tgtgggtgca 480
ttggcctgtt gggaacacta caaccccttg gctcgctatt gtttgatggc ccagcacgaa 540
gaaattcact gtgcgcagtt tcccggttca ttggtggggc aagtttttgc cgaccaaatg 600
gaagtgacca ttcgtcacca cgcacttgag tcgggctgtt ttgtcatcaa cagcaccgct 660
tggctttctg aagaacaggt tcaaagtatt tcatccgaca gcgcattgca gaaagggctt 720
agaggcggtt gtttcacggc cattgtcagc cctgagggaa agctgttggc tgagccgctc 780
accgagggtg agggcatggt gatcgccgac ctcgacatgg cgttggttac gaaacgcaaa 840
cgcatgatgg attcagtggg ccattatgcg cgccccgagt tgttgagttt gctggttcgg 900
gatgaggctt caagccccat gaaaaaaatt cagggagttc aacatgctga gtactga 957
<210>118
<211>318
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>118
Met Thr Gln Ser Arg Ile Ile Arg Ala Ala Ala Ala Gln Ile Ala Pro
1 5 10 15
Asp Leu Gln Val Pro Gly Asn Thr Ile Asp Lys Val Cys Arg Thr Ile
20 25 30
Ser Glu Ala Ala Ala Lys Gly Val Gln Ile Ile Val Phe Pro Glu Thr
35 40 45
Leu Val Pro Tyr Tyr Pro Tyr Phe Ser Tyr Ile Ser Pro Pro Ile Gln
50 55 60
Gln Gly Lys Glu His Leu Arg Leu Tyr Asp His Ala Val Val Val Pro
65 70 75 80
Gly Ser Glu Thr Glu Ala Ile Ser Ala Leu Ala Ala Gln His Asn Met
85 90 95
Val Val Val Leu Gly Val Asn Glu Arg Asp His Gly Thr Leu Tyr Asn
100 105 110
Ala Gln Ile Ile Phe Asn Ser Asp Gly Lys Ile Leu Leu Lys Arg Arg
115 120 125
Lys Ile Thr Pro Thr Tyr His Glu Arg Met Val Trp Gly Gln Gly Asp
130 135 140
Ala Ser Gly Leu Lys Val Val Asp Ser Ala Val Gly Arg Val Gly Ala
145 150 155 160
Leu Ala Cys Trp Glu His Tyr Asn Pro Leu Ala Arg Tyr Cys Leu Met
165 170 175
Ala Gln His Glu Glu Ile His Cys Ala Gln Phe Pro Gly Ser Leu Val
180 185 190
Gly Gln Val Phe Ala Asp Gln Met Glu Val Thr Ile Arg His His Ala
195 200 205
Leu Glu Ser Gly Cys Phe Val Ile Asn Ser Thr Ala Trp Leu Ser Glu
210 215 220
Glu Gln Val Gln Ser Ile Ser Ser Asp Ser Ala Leu Gln Lys Gly Leu
225 230 235 240
Arg Gly Gly Cys Phe Thr Ala Ile Val Ser Pro Glu Gly Lys Leu Leu
245 250 255
Ala Glu Pro Leu Thr Glu Gly Glu Gly Met Val Ile Ala Asp Leu Asp
260 265 270
Met Ala Leu Val Thr Lys Arg Lys Arg Met Met Asp Ser Val Gly His
275 280 285
Tyr Ala Arg Pro Glu Leu Leu Ser Leu Leu Val Arg Asp Glu Ala Ser
290 295 300
Ser Pro Met Lys Lys Ile Gln Gly Val Gln His Ala Glu Tyr
305 310 315
<210>119
<211>984
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>119
atggatacac tcaaagttgg attggttcag atggccccca tctggttgaa ccgggataaa 60
accctgatca aagttgagga atacatgcag aaagcaggca aacagggctg caacctggta 120
gcttttggtg aagcgctggt tcccggctac cccttctggg tggaacgcac agagggcgcc 180
agattcaatt ccaaagtcca gaaagaactc tttgcacatt accttgatca ggcggtgcag 240
atcgaagccg gccaccttga tcctctccag gcattagccc aacaatacaa gatggctgtg 300
tacgtgggga cgattgaacg cccgcctgag cggagcggcc acagcctgta ctgctcccta 360
atatttatag acccagaagg cgagatcggc tcggttcacc gcaagttgat gcccacccat 420
gaggaacgcc tggtctggtc aactggcgat gggcacggcc tgcgaacaca ttctctgggc 480
gcctttaccg ttggcggact caactgctgg gaaaactgga tgccgctctc ccgcacagct 540
ctttatgcca tgggagagga tcttcatgtt gctgcctggc ccgggagtca gcgcaatact 600
tatgatataa ccaaattcat tgccaaggaa tctcgctctt atgtgatctc cgtatccggg 660
atgatgaaaa aagaaaatat cctctctgaa attccccaca gccaattgat gctggaaaat 720
agcgaggata ttatggctga tggcggatcc tgtctggctg gaccagatgg agaatggatc 780
atcgagccca tcgtcggaga ggaaaccctg gtaactgctg aactatcaca tcagcgggtc 840
agagaagaaa gacagaattt cgacccaaca ggtcactaca gtcggcctga tgtgacccgc 900
ctggtagtcg accgcaggcg ccagcagatc ctggagatca ccccggacga aaaaggaaga 960
tcggatgaaa atcaatccct ttaa 984
<210>120
<211>327
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>120
Met Asp Thr Leu Lys Val Gly Leu Val Gln Met Ala Pro Ile Trp Leu
1 5 10 15
Asn Arg Asp Lys Thr Leu Ile Lys Val Glu Glu Tyr Met Gln Lys Ala
20 25 30
Gly Lys Gln Gly Cys Asn Leu Val Ala Phe Gly Glu Ala Leu Val Pro
35 40 45
Gly Tyr Pro Phe Trp Val Glu Arg Thr Glu Gly Ala Arg Phe Asn Ser
50 55 60
Lys Val Gln Lys Glu Leu Phe Ala His Tyr Leu Asp Gln Ala Val Gln
65 70 75 80
Ile Glu Ala Gly His Leu Asp Pro Leu Gln Ala Leu Ala Gln Gln Tyr
85 90 95
Lys Met Ala Val Tyr Val Gly Thr Ile Glu Arg Pro Pro Glu Arg Ser
100 105 110
Gly His Ser Leu Tyr Cys Ser Leu Ile Phe Ile Asp Pro Glu Gly Glu
115 120 125
Ile Gly Ser Val His Arg Lys Leu Met Pro Thr His Glu Glu Arg Leu
130 135 140
Val Trp Ser Thr Gly Asp Gly His Gly Leu Arg Thr His Ser Leu Gly
145 150 155 160
Ala Phe Thr Val Gly Gly Leu Asn Cys Trp Glu Asn Trp Met Pro Leu
165 170 175
Ser Arg Thr Ala Leu Tyr Ala Met Gly Glu Asp Leu His Val Ala Ala
180 185 190
Trp Pro Gly Ser Gln Arg Asn Thr Tyr Asp Ile Thr Lys Phe Ile Ala
195 200 205
Lys Glu Ser Arg Ser Tyr Val Ile Ser Val Ser Gly Met Met Lys Lys
210 215 220
Glu Asn Ile Leu Ser Glu Ile Pro His Ser Gln Leu Met Leu Glu Asn
225 230 235 240
Ser Glu Asp Ile Met Ala Asp Gly Gly Ser Cys Leu Ala Gly Pro Asp
245 250 255
Gly Glu Trp Ile Ile Glu Pro Ile Val Gly Glu Glu Thr Leu Val Thr
260 265 270
Ala Glu Leu Ser His Gln Arg Val Arg Glu Glu Arg Gln Asn Phe Asp
275 280 285
Pro Thr Gly His Tyr Ser Arg Pro Asp Val Thr Arg Leu Val Val Asp
290 295 300
Arg Arg Arg Gln Gln Ile Leu Glu Ile Thr Pro Asp Glu Lys Gly Arg
305 310 315 320
Ser Asp Glu Asn Gln Ser Leu
325
<210>121
<211>1158
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>121
atgagcaaaa aagttctagg cggcagagaa aaagtaaaag ttgcagtagt tcaggctgcg 60
cccgttttca tggacaagga gaagacgatt gaaaaggctt gcaagctaat aaaagaagcg 120
gggagaaatc gagccgagct catagcgttc tcagagtcat tcatccccgt ctatcctgca 180
tactataccg tcggctatga aaccccttct caagaatgga gagattacgt gattgcgcta 240
caggataact ccgtgctgat tccgagcgag gataccgagg tactcggaca ggctgcaaag 300
gaggcagggg cttatgcagt aataggatgc agcgagatgg acgaccgtcc gggaagccga 360
acagtttaca acacgctcct cttcatcggc aaagacggca aggtcatggg aaggcataga 420
aaactcaaac ccacgttcac ggagagaata tactggggag agggagatgc tggagacata 480
aaggtttttg ataccgagat cggcaggatc ggaggcctcg tatgctggga gaaccatatg 540
actctagtca gggccgcgat gatacacagg ggagaggagt ttcatatcgc ggtctggccg 600
ggaaactgga agggtgcgga aaacaagctt ctccaagcag ataatagccc aggaggcgcc 660
ctctgcaacc ttcaatctct cattaaagta cacgcctttg aggccggggc gtttgtgctg 720
agcgcttgcg gctttttgac gccagaggat ttcccggaaa ggtggcatta tataagggat 780
ggtaaccata ttaactgcga ctgggcactg ggcggaagct caatcgtcaa tcccgccggc 840
cgttatctcg tcgagcctaa ctttgagaag gatgcaatcc tctatgcgga ttgttatgca 900
aaccagataa aagcagtaaa agcggttttt gattcccttg gccactattc ccgctgggat 960
attgcccaac tggcgataag gcaggaagcc tggaatccag aggtttcttt gatcgattcc 1020
tcttcgactg aagttgagct tccggcagac gagcttcgaa ggatttcgga gaagtttgaa 1080
gtaactgcgg ataagttgga atctttgctt gaggaaattg gaaagattaa aaagcccagg 1140
aaacaagccg gttcctaa 1158
<210>122
<211>385
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>122
Met Ser Lys Lys Val Leu Gly Gly Arg Glu Lys Val Lys Val Ala Val
1 5 10 15
Val Gln Ala Ala Pro Val Phe Met Asp Lys Glu Lys Thr Ile Glu Lys
20 25 30
Ala Cys Lys Leu Ile Lys Glu Ala Gly Arg Asn Arg Ala Glu Leu Ile
35 40 45
Ala Phe Ser Glu Ser Phe Ile Pro Val Tyr Pro Ala Tyr Tyr Thr Val
50 55 60
Gly Tyr Glu Thr Pro Ser Gln Glu Trp Arg Asp Tyr Val Ile Ala Leu
65 70 75 80
Gln Asp Asn Ser Val Leu Ile Pro Ser Glu Asp Thr Glu Val Leu Gly
85 90 95
Gln Ala Ala Lys Glu Ala Gly Ala Tyr Ala Val Ile Gly Cys Ser Glu
100 105 110
Met Asp Asp Arg Pro Gly Ser Arg Thr Val Tyr Asn Thr Leu Leu Phe
115 120 125
Ile Gly Lys Asp Gly Lys Val Met Gly Arg His Arg Lys Leu Lys Pro
130 135 140
Thr Phe Thr Glu Arg Ile Tyr Trp Gly Glu Gly Asp Ala Gly Asp Ile
145 150 155 160
Lys Val Phe Asp Thr Glu Ile Gly Arg Ile Gly Gly Leu Val Cys Trp
165 170 175
Glu Asn His Met Thr Leu Val Arg Ala Ala Met Ile His Arg Gly Glu
180 185 190
Glu Phe His Ile Ala Val Trp Pro Gly Asn Trp Lys Gly Ala Glu Asn
195 200 205
Lys Leu Leu Gln Ala Asp Asn Ser Pro Gly Gly Ala Leu Cys Asn Leu
210 215 220
Gln Ser Leu Ile Lys Val His Ala Phe Glu Ala Gly Ala Phe Val Leu
225 230 235 240
Ser Ala Cys Gly Phe Leu Thr Pro Glu Asp Phe Pro Glu Arg Trp His
245 250 255
Tyr Ile Arg Asp Gly Asn His Ile Asn Cys Asp Trp Ala Leu Gly Gly
260 265 270
Ser Ser Ile Val Asn Pro Ala Gly Arg Tyr Leu Val Glu Pro Asn Phe
275 280 285
Glu Lys Asp Ala Ile Leu Tyr Ala Asp Cys Tyr Ala Asn Gln Ile Lys
290 295 300
Ala Val Lys Ala Val Phe Asp Ser Leu Gly His Tyr Ser Arg Trp Asp
305 310 315 320
Ile Ala Gln Leu Ala Ile Arg Gln Glu Ala Trp Asn Pro Glu Val Ser
325 330 335
Leu Ile Asp Ser Ser Ser Thr Glu Val Glu Leu Pro Ala Asp Glu Leu
340 345 350
Arg Arg Ile Ser Glu Lys Phe Glu Val Thr Ala Asp Lys Leu Glu Ser
355 360 365
Leu Leu Glu Glu Ile Gly Lys Ile Lys Lys Pro Arg Lys Gln Ala Gly
370 375 380
Ser
385
<210>123
<211>990
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>123
atgtcaactt tcaagatcgc taccgtgcag agtgcaccag tatttatgga ccgcgaagct 60
accattgaca agacttgcga gctgatcgcc gaagcagcac aagatgacga cgttcgccta 120
gtggtcttcc ccgaggcctt tatccccacc tatccggact gggtatggcg tatccctccc 180
ggacagcacc agatgcttgc cgacctgtac ggggagttgc tcgagcagtc ggtgacgata 240
cccagtctgg ctaccgagcg gctctgtcag gctgcaaaga aagcgggcgt ttatgtagct 300
gtgggcctta acgaacgcaa tacagaggcc agcaacgcta ccctgtacaa caccctgctc 360
tacattgacg ccgagggcaa cttgctaggt aagcaccgaa agctggtacc gaccgctccc 420
gaacgcatgg tctgggcaca gggagatggc agtacccttg aggtctacga gacctccttc 480
ggaaaactca gcggactaat ctgttgggag aactacatgc ctctcgctcg ttatgccctg 540
tatgcctggg gagtacagct ctatttggct cctacttggg atcgaggcga gccctggctt 600
tccactctgc ggcacattgc caaggaagga cgagtatacg tggtcggctg ctctatcgcc 660
ttacgtaagg aagacatccc cgaccgattc gaattcaagg cgaagtacta cgcagaggca 720
ggagagtgga taaacaaagg tgacagcgtc atcgtcggtc ccgatggcga gctcatcgcc 780
gggcctctac ataaggaaca ggggatactc tatgctgagc tggacacaag gcagatgcac 840
gcccccaagt ggaacctgga tgtagccgga cactacgcgc gcccggacgt gtttcggctg 900
accgtgagca aggatggcca tccgatgctc ggcgttgccc aagggcccaa gcatgagccg 960
caagataaga ccgaagtatt agagggctag 990
<210>124
<211>329
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>124
Met Ser Thr Phe Lys Ile Ala Thr Val Gln Ser Ala Pro Val Phe Met
1 5 10 15
Asp Arg Glu Ala Thr Ile Asp Lys Thr Cys Glu Leu Ile Ala Glu Ala
20 25 30
Ala Gln Asp Asp Asp Val Arg Leu Val Val Phe Pro Glu Ala Phe Ile
35 40 45
Pro Thr Tyr Pro Asp Trp Val Trp Arg Ile Pro Pro Gly Gln His Gln
50 55 60
Met Leu Ala Asp Leu Tyr Gly Glu Leu Leu Glu Gln Ser Val Thr Ile
65 70 75 80
Pro Ser Leu Ala Thr Glu Arg Leu Cys Gln Ala Ala Lys Lys Ala Gly
85 90 95
Val Tyr Val Ala Val Gly Leu Asn Glu Arg Asn Thr Glu Ala Ser Asn
100 105 110
Ala Thr Leu Tyr Asn Thr Leu Leu Tyr Ile Asp Ala Glu Gly Asn Leu
115 120 125
Leu Gly Lys His Arg Lys Leu Val Pro Thr Ala Pro Glu Arg Met Val
130 135 140
Trp Ala Gln Gly Asp Gly Ser Thr Leu Glu Val Tyr Glu Thr Ser Phe
145 150 155 160
Gly Lys Leu Ser Gly Leu Ile Cys Trp Glu Asn Tyr Met Pro Leu Ala
165 170 175
Arg Tyr Ala Leu Tyr Ala Trp Gly Val Gln Leu Tyr Leu Ala Pro Thr
180 185 190
Trp Asp Arg Gly Glu Pro Trp Leu Ser Thr Leu Arg His Ile Ala Lys
195 200 205
Glu Gly Arg Val Tyr Val Val Gly Cys Ser Ile Ala Leu Arg Lys Glu
210 215 220
Asp Ile Pro Asp Arg Phe Glu Phe Lys Ala Lys Tyr Tyr Ala Glu Ala
225 230 235 240
Gly Glu Trp Ile Asn Lys Gly Asp Ser Val Ile Val Gly Pro Asp Gly
245 250 255
Glu Leu Ile Ala Gly Pro Leu His Lys Glu Gln Gly Ile Leu Tyr Ala
260 265 270
Glu Leu Asp Thr Arg Gln Met His Ala Pro Lys Trp Asn Leu Asp Val
275 280 285
Ala Gly His Tyr Ala Arg Pro Asp Val Phe Arg Leu Thr Val Ser Lys
290 295 300
Asp Gly His Pro Met Leu Gly Val Ala Gln Gly Pro Lys His Glu Pro
305 310 315 320
Gln Asp Lys Thr Glu Val Leu Glu Gly
325
<210>125
<211>1050
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>125
atgacaactg taaaaaagac ggtacgcgca gcagcgatcc agatcgcacc tgacctcgac 60
agtgcaggcg gtacgctgga caaggtttgc acggccattc aaaaggcggc ggcacaaggc 120
gcggagctgg tggtttttcc cgaaaccttc ttgccctact atccttactt ttcattcgtg 180
cggccgccct tcgcatccgg cccggaacac ttgctgctat atgaacgcgc agtggcggtg 240
ccaggcccgg tgaccgatgc cgtctctgcc gtcgcgcgca gccacggcgt ggtggtggta 300
ctcggcgtca atgaacgcga ccatggcacg ctgtacaaca cccaactggt gttcgacgcg 360
aatggcgaac tggtgttgaa acgcagaaaa atcacgccga cttatcacga gcggatgatc 420
tggggtcaag gcgacggcag cggactcaaa gtagtgcaaa cggcggtcgg ccggctaggc 480
gcgctagcct gttgggaaca ctacaaccca ctggcccgtt atgcattgat ggcgcaacac 540
gaagaaatcc attgcgccca gtttcccggg tccatggtcg ggcaaatatt cgccgaccag 600
atggaagtga cgatacgcca tcacgctctc gagtcggctt gcttcgtggt gaatgccaca 660
ggctggctga ccgatgcgca aatcacatcg atcacgccgg accccgcgct acaaaaggca 720
ttacgtggcg gttgctgcac cgccatcgtc tcgccggaag gtgtgctcct ggcagagccg 780
ctacgcagcg gcgaaggcat ggtgatcgcc gatctcgata tggcactcat caccaaacgc 840
aaacggatga tggattcggt cggccactat gcgcggcccg aattgttaag cctgcttgtc 900
gacgaccggc gcaaggtacc ggtatccgcg ctatttgccg acagcaaccc tgccaacggg 960
cacacagttt tcaccccatc cgacatacca acccttggga gcgcacatca tgcaaacagt 1020
taccaaaccg aaccagcaac tgatcactga 1050
<210>126
<211>349
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>126
Met Thr Thr Val Lys Lys Thr Val Arg Ala Ala Ala Ile Gln Ile Ala
1 5 10 15
Pro Asp Leu Asp Ser Ala Gly Gly Thr Leu Asp Lys Val Cys Thr Ala
20 25 30
Ile Gln Lys Ala Ala Ala Gln Gly Ala Glu Leu Val Val Phe Pro Glu
35 40 45
Thr Phe Leu Pro Tyr Tyr Pro Tyr Phe Ser Phe Val Arg Pro Pro Phe
50 55 60
Ala Ser Gly Pro Glu His Leu Leu Leu Tyr Glu Arg Ala Val Ala Val
65 70 75 80
Pro Gly Pro Val Thr Asp Ala Val Ser Ala Val Ala Arg Ser His Gly
85 90 95
Val Val Val Val Leu Gly Val Asn Glu Arg Asp His Gly Thr Leu Tyr
100 105 110
Asn Thr Gln Leu Val Phe Asp Ala Asn Gly Glu Leu Val Leu Lys Arg
115 120 125
Arg Lys Ile Thr Pro Thr Tyr His Glu Arg Met Ile Trp Gly Gln Gly
130 135 140
Asp Gly Ser Gly Leu Lys Val Val Gln Thr Ala Val Gly Arg Leu Gly
145 150 155 160
Ala Leu Ala Cys Trp Glu His Tyr Asn Pro Leu Ala Arg Tyr Ala Leu
165 170 175
Met Ala Gln His Glu Glu Ile His Cys Ala Gln Phe Pro Gly Ser Met
180 185 190
Val Gly Gln Ile Phe Ala Asp Gln Met Glu Val Thr Ile Arg His His
195 200 205
Ala Leu Glu Ser Ala Cys Phe Val Val Asn Ala Thr Gly Trp Leu Thr
210 215 220
Asp Ala Gln Ile Thr Ser Ile Thr Pro Asp Pro Ala Leu Gln Lys Ala
225 230 235 240
Leu Arg Gly Gly Cys Cys Thr Ala Ile Val Ser Pro Glu Gly Val Leu
245 250 255
Leu Ala Glu Pro Leu Arg Ser Gly Glu Gly Met Val Ile Ala Asp Leu
260 265 270
Asp Met Ala Leu Ile Thr Lys Arg Lys Arg Met Met Asp Ser Val Gly
275 280 285
His Tyr Ala Arg Pro Glu Leu Leu Ser Leu Leu Val Asp Asp Arg Arg
290 295 300
Lys Val Pro Val Ser Ala Leu Phe Ala Asp Ser Asn Pro Ala Asn Gly
305 310 315 320
His Thr Val Phe Thr Pro Ser Asp Ile Pro Thr Leu Gly Ser Ala His
325 330 335
His Ala Asn Ser Tyr Gln Thr Glu Pro Ala Thr Asp His
340 345
<210>127
<211>1005
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>127
atgatagcac ggaagacaat aagggcggcg gcggtgcaga tagcgcctgt gatggaagat 60
cggaaggcga cgaccgacaa ggtgtgcgcc tacattcagg aagcaggcga gaatggagcc 120
gaaattgtgg tgtttcctga aaccttcatt cccaattatc cctatttctc ttttgtaaaa 180
cctcccgtgt tggcaggtaa ggatcacctt accttgtatg accaagcggt ggaaatccct 240
agccctacta ccgaccaagt ggggtctatg gccaaaaaat ggggaatcgt agtggtgttg 300
ggcgtgaacg aaagaagcca cggcactttg tacaatgccc aaattgtctt tgacgctact 360
ggtgatattg tattggtgag acgcaaaatc acccctacct atcatgaacg gatgatctgg 420
ggacagggag atggcagtgg attaaaagca gtagacacag ctgtgggaag agtgggcgct 480
ttggcgtgtt gggaacacta taatccactt gcgcgctacg cccttatggt agaccatgag 540
gaaattcatt gcagccaatt ccctggctct atggtcggcc ccattttcgg tgaccagata 600
gaagtgacga ttcgccacca tgcgttggaa tcgggttgtt ttgtcatcaa ttccacaggt 660
tggctgtttg aagagcaaat ccaagccatc accgatgatc cgaaactgca caaagcattg 720
aaagacggct gtatgaccgc cattatttct cccgaaggcg tgcatttgac caaaccctta 780
acagaaggcg aaggcatcat ctacgcctat ctggacatga aactcataga caagcggaaa 840
cggatgatgg actcggtagg acactatgca cgtccagagt tgctctcttt gcatatcaac 900
aatgcagagc aaaaaccagc cgtttacacc tctcctctta ccaaaacgga aaccaaagaa 960
gacgtaaaaa gctatgatcg caacaaagaa cagcttatcg tctga 1005
<210>128
<211>334
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>128
Met Ile Ala Arg Lys Thr Ile Arg Ala Ala Ala Val Gln Ile Ala Pro
1 5 10 15
Val Met Glu Asp Arg Lys Ala Thr Thr Asp Lys Val Cys Ala Tyr Ile
20 25 30
Gln Glu Ala Gly Glu Ash Gly Ala Glu Ile Val Val Phe Pro Glu Thr
35 40 45
Phe Ile Pro Asn Tyr Pro Tyr Phe Ser Phe Val Lys Pro Pro Val Leu
50 55 60
Ala Gly Lys Asp His Leu Thr Leu Tyr Asp Gln Ala Val Glu Ile Pro
65 70 75 80
Ser Pro Thr Thr Asp Gln Val Gly Ser Met Ala Lys Lys Trp Gly Ile
85 90 95
Val Val Val Leu Gly Val Asn Glu Arg Ser His Gly Thr Leu Tyr Asn
100 105 110
Ala Gln Ile Val Phe Asp Ala Thr Gly Asp Ile Val Leu Val Arg Arg
115 120 125
Lys Ile Thr Pro Thr Tyr His Glu Arg Met Ile Trp Gly Gln Gly Asp
130 135 140
Gly Ser Gly Leu Lys Ala Val Asp Thr Ala Val Gly Arg Val Gly Ala
145 150 155 160
Leu Ala Cys Trp Glu His Tyr Asn Pro Leu Ala Arg Tyr Ala Leu Met
165 170 175
Val Asp His Glu Glu Ile His Cys Ser Gln Phe Pro Gly Ser Met Val
180 185 190
Gly Pro Ile Phe Gly Asp Gln Ile Glu Val Thr Ile Arg His His Ala
195 200 205
Leu Glu Ser Gly Cys Phe Val Ile Asn Ser Thr Gly Trp Leu Phe Glu
210 215 220
Glu Gln Ile Gln Ala Ile Thr Asp Asp Pro Lys Leu His Lys Ala Leu
225 230 235 240
Lys Asp Gly Cys Met Thr Ala Ile Ile Ser Pro Glu Gly Val His Leu
245 250 255
Thr Lys Pro Leu Thr Glu Gly Glu Gly Ile Ile Tyr Ala Tyr Leu Asp
260 265 270
Met Lys Leu Ile Asp Lys Arg Lys Arg Met Met Asp Ser Val Gly His
275 280 285
Tyr Ala Arg Pro Glu Leu Leu Ser Leu His Ile Asn Asn Ala Glu Gln
290 295 300
Lys Pro Ala Val Tyr Thr Ser Pro Leu Thr Lys Thr Glu Thr Lys Glu
305 310 315 320
Asp Val Lys Ser Tyr Asp Arg Asn Lys Glu Gln Leu Ile Val
325 330
<210>129
<211>1011
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>129
atgtcagaaa agcgaattat taaagcggct gcagttcaaa tcacaccaga ttttgaatcg 60
catgatggaa ccgtaaagaa ggtttgtaat gtaattgatg aagcgggtgc taaaggtgta 120
cagatcattg tattccctga aacctttatt ccatattacc catatttttc tttcatcact 180
ccaccagtga ctgctggcgc ggagcatttg cggctctatg aaaaaagtgt cgtgatacct 240
ggtcccgtta ctcaagccat ttccgaacgt gcacgcatga ataatatggt tgttgtactt 300
ggtgtaaatg agcgtgataa cggcagtcta tataacaccc agattatttt tgatgctacc 360
ggtgagatgc ttctgaagag aagaaaaatc acacctacct atcatgagcg catgatttgg 420
gggcaaggag atgcttcagg cctgaaggtc gtcgatacgg ctattgggcg agtcggagca 480
ttggcatgct gggagcacta taaccctttg gctagataca gcctcatgac acagcatgaa 540
gaaattcact gtgctcaatt tccaggctcc atggttggtc agatcttcgc agatcaaatg 600
gatgtcacga ttcgtcatca tgccttggag tcaggttgct tcgtcatcaa ctccactggc 660
tggttaactg atgatcagat caaatctatc accgacgatc ccaaaatgca gaaagcttta 720
agaggtggtt gcaacacggc cattatttct ccagaaggga atcatttaac cgagcctttg 780
cgagaaggtg aaggcatggt gattgctgat cttgatatgg cactcatcac caaacgaaaa 840
agaatgatgg actcagttgg ccactacgcc agaccagaac tgttgagctt agcgatcaat 900
gatgctccgg ctactccttc attccagatg aacgaacatc gtcttaaatc agtgcaatta 960
cctatcgcag aggagcttaa aaatgacaac aagcttagca gtggacagta a 1011
<210>130
<211>336
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>130
Met Ser Glu Lys Arg Ile Ile Lys Ala Ala Ala Val Gln Ile Thr Pro
1 5 10 15
Asp Phe Glu Ser His Asp Gly Thr Val Lys Lys Val Cys Asn Val Ile
20 25 30
Asp Glu Ala Gly Ala Lys Gly Val Gln Ile Ile Val Phe Pro Glu Thr
35 40 45
Phe Ile Pro Tyr Tyr Pro Tyr Phe Ser Phe Ile Thr Pro Pro Val Thr
50 55 60
Ala Gly Ala Glu His Leu Arg Leu Tyr Glu Lys Ser Val Val Ile Pro
65 70 75 80
Gly Pro Val Thr Gln Ala Ile Ser Glu Arg Ala Arg Met Asn Asn Met
85 90 95
Val Val Val Leu Gly Val Asn Glu Arg Asp Asn Gly Ser Leu Tyr Asn
100 105 110
Thr Gln Ile Ile Phe Asp Ala Thr Gly Glu Met Leu Leu Lys Arg Arg
115 120 125
Lys Ile Thr Pro Thr Tyr His Glu Arg Met Ile Trp Gly Gln Gly Asp
130 135 140
Ala Ser Gly Leu Lys Val Val Asp Thr Ala Ile Gly Arg Val Gly Ala
145 150 155 160
Leu Ala Cys Trp Glu His Tyr Asn Pro Leu Ala Arg Tyr Ser Leu Met
165 170 175
Thr Gln His Glu Glu Ile His Cys Ala Gln Phe Pro Gly Ser Met Val
180 185 190
Gly Gln Ile Phe Ala Asp Gln Met Asp Val Thr Ile Arg His His Ala
195 200 205
Leu Glu Ser Gly Cys Phe Val Ile Asn Ser Thr Gly Trp Leu Thr Asp
210 215 220
Asp Gln Ile Lys Ser Ile Thr Asp Asp Pro Lys Met Gln Lys Ala Leu
225 230 235 240
Arg Gly Gly Cys Asn Thr Ala Ile Ile Ser Pro Glu Gly Asn His Leu
245 250 255
Thr Glu Pro Leu Arg Glu Gly Glu Gly Met Val Ile Ala Asp Leu Asp
260 265 270
Met Ala Leu Ile Thr Lys Arg Lys Arg Met Met Asp Ser Val Gly His
275 280 285
Tyr Ala Arg Pro Glu Leu Leu Ser Leu Ala Ile Asn Asp Ala Pro Ala
290 295 300
Thr Pro Ser Phe Gln Met Asn Glu His Arg Leu Lys Ser Val Gln Leu
305 310 315 320
Pro Ile Ala Glu Glu Leu Lys Asn Asp Asn Lys Leu Ser Ser Gly Gln
325 330 335
<210>131
<211>1011
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>131
atgtcagaaa agcgaattat taaagcggct gcagttcaaa tcacaccaga ttttgaatcg 60
catgatggaa ccgtaaagaa ggtttgtaat gtaattgatg aagcgggtgc taaaggtgta 120
cagatcattg tattccctga aacctttatt ccatattacc catatttttc tttcatcact 180
ccaccagtga ctgctggcgc ggagcatttg cggctctatg aaaaaagtgt cgtgatacct 240
ggtcccgtta ctcaagacat ttccgaacgt gcacgcatga ataatatggt tgttgtactt 300
ggtgtaaatg agcgtgataa cggcagtcta tataacaccc agattatttt tgatgctacc 360
ggtgagatgc ttctgaagag aagaaaaatc acacctacct atcatgagcg catgatttgg 420
gggcaaggag atgcttcagg cctgaaggtc gtcgatacgg ctattgggcg agtcggagca 480
ttggcatgct gggagcacta taaccctttg gctagataca gcctcatgac acagcatgaa 540
gaaattcact gtgctcaatt tccaggctcc atggttggtc agatcttcgc agatcaaatg 600
gatgtcacga ttcgtcatca tgccttggag tcaggttgct tcgtcatcaa ctccactggc 660
tggttaactg atgatcagat caaatctatc accgacgatc ccaaaatgca gaaagcttta 720
agaggtggtt gcaacacggc cattatttct ccagaaggga atcatttaac cgagcctttg 780
cgagaaggtg aaggcatggt gattgctgat cttgatatgg cactcatcac caaacgaaaa 840
agaatgatgg actcagttgg ccactacgcc agaccagaac tgttgagctt agcgatcaat 900
gatgctccgg ctactccttc attccagatg aacgaacatc gtcttaaatc agtgcaatta 960
cctatcgcag aggagcttaa aaatgacaac aagcttagca gtggacagta a 1011
<210>132
<211>336
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>132
Met Ser Glu Lys Arg Ile Ile Lys Ala Ala Ala Val Gln Ile Thr Pro
1 5 10 15
Asp Phe Glu Ser His Asp Gly Thr Val Lys Lys Val Cys Asn Val Ile
20 25 30
Asp Glu Ala Gly Ala Lys Gly Val Gln Ile Ile Val Phe Pro Glu Thr
35 40 45
Phe Ile Pro Tyr Tyr Pro Tyr Phe Ser Phe Ile Thr Pro Pro Val Thr
50 55 60
Ala Gly Ala Glu His Leu Arg Leu Tyr Glu Lys Ser Val Val Ile Pro
65 70 75 80
Gly Pro Val Thr Gln Asp Ile Ser Glu Arg Ala Arg Met Asn Asn Met
85 90 95
Val Val Val Leu Gly Val Asn Glu Arg Asp Asn Gly Ser Leu Tyr Asn
100 105 110
Thr Gln Ile Ile Phe Asp Ala Thr Gly Glu Met Leu Leu Lys Arg Arg
115 120 125
Lys Ile Thr Pro Thr Tyr His Glu Arg Met Ile Trp Gly Gln Gly Asp
130 135 140
Ala Ser Gly Leu Lys Val Val Asp Thr Ala Ile Gly Arg Val Gly Ala
145 150 155 160
Leu Ala Cys Trp Glu His Tyr Asn Pro Leu Ala Arg Tyr Ser Leu Met
165 170 175
Thr Gln His Glu Glu Ile His Cys Ala Gln Phe Pro Gly Ser Met Val
180 185 190
Gly Gln Ile Phe Ala Asp Gln Met Asp Val Thr Ile Arg His His Ala
195 200 205
Leu Glu Ser Gly Cys Phe Val Ile Asn Ser Thr Gly Trp Leu Thr Asp
210 215 220
Asp Gln Ile Lys Ser Ile Thr Asp Asp Pro Lys Met Gln Lys Ala Leu
225 230 235 240
Arg Gly Gly Cys Asn Thr Ala Ile Ile Ser Pro Glu Gly Asn His Leu
245 250 255
Thr Glu Pro Leu Arg Glu Gly Glu Gly Met Val Ile Ala Asp Leu Asp
260 265 270
Met Ala Leu Ile Thr Lys Arg Lys Arg Met Met Asp Ser Val Gly His
275 280 285
Tyr Ala Arg Pro Glu Leu Leu Ser Leu Ala Ile Asn Asp Ala Pro Ala
290 295 300
Thr Pro Ser Phe Gln Met Asn Glu His Arg Leu Lys Ser Val Gln Leu
305 310 315 320
Pro Ile Ala Glu Glu Leu Lys Asn Asp Asn Lys Leu Ser Ser Gly Gln
325 330 335
<210>133
<211>1026
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>133
atgtcgacca agcggatcgt acgcgccgct gccgttcagc tggcaccgga tctggagcgg 60
ccggagggca cactggagaa ggtttgcgcg gccatcgaca aggcggcggg ggacggtgtg 120
cagctcatcg tcttccccga gaccttcgta ccgtactacc cgtacttctc tttcgtgcgt 180
gcgccggtcg cgatgggtgc cgagcacatg cggttatacg agcgcgcggt agcggtgccc 240
ggtccagtaa cggccaccgt ggcggagcgg gcaaaagcgc acgcgatggt cgtcgtgctg 300
ggtgtaaacg agcgcgatca cggctcactg tataacgcgc aactgatctt cgacgagacc 360
ggccgtctcg tcctcaaacg ccgcaagatc actccgacct atcacgagcg catggtgtgg 420
gggcagggcg acggcagcgg ccttaaggtt gtagacaccg gtatcggcag gatcggagcc 480
ctcgcctgct gggagcacta caacccgctc gcgcgctatg cgctcatggc gcagcacgaa 540
gagattcatt gcgcgcagtt tccgggctcg atggtggggc cgatcttcgc ggatcagatc 600
gaggtcacga tccgccatca cgcgctggag tcgggctgct tcgtcgtcaa tgcgaccggc 660
tggctgacac ccgaacagat cgcgtcgatc acaccggacg cgggtctgca aaaggcaatc 720
agcgggggct gcaacaccgc gatcatctcg ccggagggcg tgcacctggc cccgccgttg 780
cgagaaggtg agggcatggt cgtggccgac ctcgacatgg cgctcatcac caaacgcaaa 840
cgcatgatgg attcggtggg tcactacgct cgcccggagt tgctcagcct gcgcatcgat 900
agccgcgccg cttcgccgat gtcgtcacaa atggaaatac ccgggagctt gcatgaaatc 960
accagccacg atgtccagcc agcaactgat gaccgagctc cagtcctccg gcttgaggtt 1020
ggctga 1026
<210>134
<211>341
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>134
Met Ser Thr Lys Arg Ile Val Arg Ala Ala Ala Val Gln Leu Ala Pro
1 5 10 15
Asp Leu Glu Arg Pro Glu Gly Thr Leu Glu Lys Val Cys Ala Ala Ile
20 25 30
Asp Lys Ala Ala Gly Asp Gly Val Gln Leu Ile Val Phe Pro Glu Thr
35 40 45
Phe Val Pro Tyr Tyr Pro Tyr Phe Ser Phe Val Arg Ala Pro Val Ala
50 55 60
Met Gly Ala Glu His Met Arg Leu Tyr Glu Arg Ala Val Ala Val Pro
65 70 75 80
Gly Pro Val Thr Ala Thr Val Ala Glu Arg Ala Lys Ala His Ala Met
85 90 95
Val Val Val Leu Gly Val Asn Glu Arg Asp His Gly Ser Leu Tyr Asn
100 105 110
Ala Gln Leu Ile Phe Asp Glu Thr Gly Arg Leu Val Leu Lys Arg Arg
115 120 125
Lys Ile Thr Pro Thr Tyr His Glu Arg Met Val Trp Gly Gln Gly Asp
130 135 140
Gly Ser Gly Leu Lys Val Val Asp Thr Gly Ile Gly Arg Ile Gly Ala
145 150 155 160
Leu Ala Cys Trp Glu His Tyr Asn Pro Leu Ala Arg Tyr Ala Leu Met
165 170 175
Ala Gln His Glu Glu Ile His Cys Ala Gln Phe Pro Gly Ser Met Val
180 185 190
Gly Pro Ile Phe Ala Asp Gln Ile Glu Val Thr Ile Arg His His Ala
195 200 205
Leu Glu Ser Gly Cys Phe Val Val Asn Ala Thr Gly Trp Leu Thr Pro
210 215 220
Glu Gln Ile Ala Ser Ile Thr Pro Asp Ala Gly Leu Gln Lys Ala Ile
225 230 235 240
Ser Gly Gly Cys Asn Thr Ala Ile Ile Ser Pro Glu Gly Val His Leu
245 250 255
Ala Pro Pro Leu Arg Glu Gly Glu Gly Met Val Val Ala Asp Leu Asp
260 265 270
Met Ala Leu Ile Thr Lys Arg Lys Arg Met Met Asp Ser Val Gly His
275 280 285
Tyr Ala Arg Pro Glu Leu Leu Ser Leu Arg Ile Asp Ser Arg Ala Ala
290 295 300
Ser Pro Met Ser Ser Gln Met Glu Ile Pro Gly Ser Leu His Glu Ile
305 310 315 320
Thr Ser His Asp Val Gln Pro Ala Thr Asp Asp Arg Ala Pro Val Leu
325 330 335
Arg Leu Glu Val Gly
340
<210>135
<211>1011
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>135
atgtcagaca agcgaatcat taaagcggct gcagttcaaa tcactcctga ctttgactca 60
gcagatggaa ccgttaagaa agtgtgcaag gtaatcgatg aagcaggtgc aaagggagtt 120
caaattattg tattcccgga aaccttcatc ccttactacc catacttttc attcattaca 180
cctccagtca ctgctggcgc tgagcattta aagctttatg agaaaagtgt cgtgatacct 240
ggcccggtta cccaagcgat tgccgagcga gccagggtta atcagatggt tgtcgtgctt 300
ggtgtcaacg agcgagataa cggtagcctc tacaacacac aattgatctt tgataccaac 360
ggcgaactgc tacttaaaag aagaaaaatc acccctacct accatgaacg tatgatctgg 420
gggcaaggtg atgcatcagg tctcaaagta gttgaaacag agatcgcccg agtaggtgcc 480
ttggcttgtt gggaacacta caacccactg gccagatatg cactcatgac acagcatgaa 540
gaaattcact gtgcgcaatt cccaggctct atggttggcc agatatttgc cgatcagatg 600
gatgtcacta tccgacatca cgccttagag tcaggctgct tcgtcatcaa cgccactggc 660
tggctcaccg acgcgcaaat ccaatcgatt actgatgacc caaaaatgca aaaagcatta 720
cgtggcggct gcaacacagc catcatctcc cccgaagggg tgcacttaac agagccacta 780
cgtgaaggag aaggcatggt gattgccaat cttgatatgg cactcatcac aaaacgaaaa 840
agaatgatgg attcggtagg ccattattca agaccagaat tattaagcct ggcaattaac 900
gacaaaccag caactacaac attttcaatg actgaggggc gtactcaaac agagccattt 960
cgaatcgcag aggagttgaa aaatgacgac aagcttagca ctggaaacta a 1011
<210>136
<211>336
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>136
Met Ser Asp Lys Arg Ile Ile Lys Ala Ala Ala Val Gln Ile Thr Pro
1 5 10 15
Asp Phe Asp Ser Ala Asp Gly Thr Val Lys Lys Val Cys Lys Val Ile
20 25 30
Asp Glu Ala Gly Ala Lys Gly Val Gln Ile Ile Val Phe Pro Glu Thr
35 40 45
Phe Ile Pro Tyr Tyr Pro Tyr Phe Ser Phe Ile Thr Pro Pro Val Thr
50 55 60
Ala Gly Ala Glu His Leu Lys Leu Tyr Glu Lys Ser Val Val Ile Pro
65 70 75 80
Gly Pro Val Thr Gln Ala Ile Ala Glu Arg Ala Arg Val Asn Gln Met
85 90 95
Val Val Val Leu Gly Val Asn Glu Arg Asp Asn Gly Ser Leu Tyr Asn
100 105 110
Thr Gln Leu Ile Phe Asp Thr Asn Gly Glu Leu Leu Leu Lys Arg Arg
115 120 125
Lys Ile Thr Pro Thr Tyr His Glu Arg Met Ile Trp Gly Gln Gly Asp
130 135 140
Ala Ser Gly Leu Lys Val Val Glu Thr Glu Ile Ala Arg Val Gly Ala
145 150 155 160
Leu Ala Cys Trp Glu His Tyr Asn Pro Leu Ala Arg Tyr Ala Leu Met
165 170 175
Thr Gln His Glu Glu Ile His Cys Ala Gln Phe Pro Gly Ser Met Val
180 185 190
Gly Gln Ile Phe Ala Asp Gln Met Asp Val Thr Ile Arg His His Ala
195 200 205
Leu Glu Ser Gly Cys Phe Val Ile Asn Ala Thr Gly Trp Leu Thr Asp
210 215 220
Ala Gln Ile Gln Ser Ile Thr Asp Asp Pro Lys Met Gln Lys Ala Leu
225 230 235 240
Arg Gly Gly Cys Asn Thr Ala Ile Ile Ser Pro Glu Gly Val His Leu
245 250 255
Thr Glu Pro Leu Arg Glu Gly Glu Gly Met Val Ile Ala Asn Leu Asp
260 265 270
Met Ala Leu Ile Thr Lys Arg Lys Arg Met Met Asp Ser Val Gly His
275 280 285
Tyr Ser Arg Pro Glu Leu Leu Ser Leu Ala Ile Asn Asp Lys Pro Ala
290 295 300
Thr Thr Thr Phe Ser Met Thr Glu Gly Arg Thr Gln Thr Glu Pro Phe
305 310 315 320
Arg Ile Ala Glu Glu Leu Lys Asn Asp Asp Lys Leu Ser Thr Gly Asn
325 330 335
<210>137
<211>978
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>137
atggctattg tcaaggccgc ggcggtgcag atcagtccgg tgctctacag tcgcgccggc 60
acagtggaca aggtcgtcgc gaagatccgc gagctgggcc gacgaggggt cgagttcgcc 120
gtcttccccg agaccgtcat tccctactat ccctacttct ctttcgtgca gcccccctac 180
acccaggcca ccgaacacct gcgcctgctc gaggaatcgg tgaccgtgcc ctccgccgaa 240
accgacgcga tcgccaaggc cgctcgcgag gcgggcatgg tcgtctccat cggcgtcaac 300
gagcgcgacg gcggaaccat ctacaacacc caactcctct tcgacgccga cggcactctc 360
atccagcgcc gccgcaagat cacccccacc tatcacgaac gcatggtctg ggggcaggga 420
gacggctcag gtctgcgcgc cgtcgacagt gcggtcggcc gcatcggcca gctcgcctgc 480
tgggagcact accagccact ggcccggtac gccctcatcg ctgacggcga gcagatccac 540
gccgcgatgt accccggcgc cttcggcggc gatctgttcg ccgagcagat cgaggtcaac 600
atccgccagc acgccctgga atccgccagc ttcgtcgtca acgccaccgc ctggctcgac 660
gccgatcagc aggcccagat cgccaaggac accggaggcc cggtcccggc cttctccggt 720
ggcttcttca ccgccatcgt cgaccccgaa ggccgtatca tcggcgaccc cctcaccagc 780
ggcgaaggcg aagtgatcgc cgacctcgat ctcgctctca tcaaccgccg caagcgcctc 840
atggacgcca gtggacacta ccagccgccc gaaattctta gcttcacatt gaccggtgca 900
ccggcgcctt atgtcaagag cgcggcgtgc cggggaaccc cgggtacgac cgtggccgag 960
gagggacggt ccgcttag 978
<210>138
<211>325
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>138
Met Ala Ile Val Lys Ala Ala Ala Val Gln Ile Ser Pro Val Leu Tyr
1 5 10 15
Ser Arg Ala Gly Thr Val Asp Lys Val Val Ala Lys Ile Arg Glu Leu
20 25 30
Gly Arg Arg Gly Val Glu Phe Ala Val Phe Pro Glu Thr Val Ile Pro
35 40 45
Tyr Tyr Pro Tyr Phe Ser Phe Val Gln Pro Pro Tyr Thr Gln Ala Thr
50 55 60
Glu His Leu Arg Leu Leu Glu Glu Ser Val Thr Val Pro Ser Ala Glu
65 70 75 80
Thr Asp Ala Ile Ala Lys Ala Ala Arg Glu Ala Gly Met Val Val Ser
85 90 95
Ile Gly Val Asn Glu Arg Asp Gly Gly Thr Ile Tyr Asn Thr Gln Leu
100 105 110
Leu Phe Asp Ala Asp Gly Thr Leu Ile Gln Arg Arg Arg Lys Ile Thr
115 120 125
Pro Thr Tyr His Glu Arg Met Val Trp Gly Gln Gly Asp Gly Ser Gly
130 135 140
Leu Arg Ala Val Asp Ser Ala Val Gly Arg Ile Gly Gln Leu Ala Cys
145 150 155 160
Trp Glu His Tyr Gln Pro Leu Ala Arg Tyr Ala Leu Ile Ala Asp Gly
165 170 175
Glu Gln Ile His Ala Ala Met Tyr Pro Gly Ala Phe Gly Gly Asp Leu
180 185 190
Phe Ala Glu Gln Ile Glu Val Asn Ile Arg Gln His Ala Leu Glu Ser
195 200 205
Ala Ser Phe Val Val Asn Ala Thr Ala Trp Leu Asp Ala Asp Gln Gln
210 215 220
Ala Gln Ile Ala Lys Asp Thr Gly Gly Pro Val Pro Ala Phe Ser Gly
225 230 235 240
Gly Phe Phe Thr Ala Ile Val Asp Pro Glu Gly Arg Ile Ile Gly Asp
245 250 255
Pro Leu Thr Ser Gly Glu Gly Glu Val Ile Ala Asp Leu Asp Leu Ala
260 265 270
Leu Ile Asn Arg Arg Lys Arg Leu Met Asp Ala Ser Gly His Tyr Gln
275 280 285
Pro Pro Glu Ile Leu Ser Phe Thr Leu Thr Gly Ala Pro Ala Pro Tyr
290 295 300
Val Lys Ser Ala Ala Cys Arg Gly Thr Pro Gly Thr Thr Val Ala Glu
305 310 315 320
Glu Gly Arg Ser Ala
325
<210>139
<211>999
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>139
atgaaaacaa cggttaccgt tgcctgcgtt caggccgccc ccgtatttat ggatttagaa 60
ggcaccgtag ataaaacaat caccctcatc tctgaagccg cacagaaagg cgcggagctc 120
atcgcttttc cggagacctg gatacccggt tacccgtggt tcttatggct gaactcgccc 180
gccacaaata tgcccctggt ttatcagtat catcagaact ctctggtgct ggacagtacc 240
caggcgaagc gaattgcgga tgcggcacgg cagaataaca tcactgtcgc tctgggcttc 300
agcgaacgcg atcatggaag cctctatatc gcacagtggc tgattggcag cgacggggag 360
accattggca tccggcgcaa gctcaaggcc acgcacgtgg agcgtacgct gttcggcgaa 420
agcgacggct cctccctgac cacctgggag acacctctgg gtaacgtcgg ggccctctgc 480
tgctgggagc acctgcagcc gctgtcccgc tatgcaatgt attcccagca tgaggagatc 540
cacatcgctg cctggcccag tttcagtctc tacaccagtg caacggccgc actgggtcct 600
gacgtcaata cggcggcttc acgcctctat gccgcggagg ggcagtgctt cgtgatagcc 660
ccgtgtgccg tggtttctga tgaaatgatt gatttactct gtcctgatga tgaccggaga 720
gcgttactca gtgccggagg gggacatgcc cgtatttacg gcccggacgg aagagaactc 780
gtcacccctc tcggggaaaa tgaggaagga ctgcttatcg ctgagctcga ctctgctgcg 840
attacctttg ccaaactggc ggcagacccg gttggccact attcccgtcc tgacgtgacc 900
cgcctccttt ttaatccttc agccaacaag actgtgatta aacgacattc gcctcctgag 960
ttaattgccg agcagactgc agaagaagag gaggagtag 999
<210>140
<211>332
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>140
Met Lys Thr Thr Val Thr Val Ala Cys Val Gln Ala Ala Pro Val Phe
1 5 10 15
Met Asp Leu Glu Gly Thr Val Asp Lys Thr Ile Thr Leu Ile Ser Glu
20 25 30
Ala Ala Gln Lys Gly Ala Glu Leu Ile Ala Phe Pro Glu Thr Trp Ile
35 40 45
Pro Gly Tyr Pro Trp Phe Leu Trp Leu Asn Ser Pro Ala Thr Asn Met
50 55 60
Pro Leu Val Tyr Gln Tyr His Gln Asn Ser Leu Val Leu Asp Ser Thr
65 70 75 80
Gln Ala Lys Arg Ile Ala Asp Ala Ala Arg Gln Asn Asn Ile Thr Val
85 90 95
Ala Leu Gly Phe Ser Glu Arg Asp His Gly Ser Leu Tyr Ile Ala Gln
100 105 110
Trp Leu Ile Gly Ser Asp Gly Glu Thr Ile Gly Ile Arg Arg Lys Leu
115 120 125
Lys Ala Thr His Val Glu Arg Thr Leu Phe Gly Glu Ser Asp Gly Ser
130 135 140
Ser Leu Thr Thr Trp Glu Thr Pro Leu Gly Asn Val Gly Ala Leu Cys
145 150 155 160
Cys Trp Glu His Leu Gln Pro Leu Ser Arg Tyr Ala Met Tyr Ser Gln
165 170 175
His Glu Glu Ile His Ile Ala Ala Trp Pro Ser Phe Ser Leu Tyr Thr
180 185 190
Ser Ala Thr Ala Ala Leu Gly Pro Asp Val Asn Thr Ala Ala Ser Arg
195 200 205
Leu Tyr Ala Ala Glu Gly Gln Cys Phe Val Ile Ala Pro Cys Ala Val
210 215 220
Val Ser Asp Glu Met Ile Asp Leu Leu Cys Pro Asp Asp Asp Arg Arg
225 230 235 240
Ala Leu Leu Ser Ala Gly Gly Gly His Ala Arg Ile Tyr Gly Pro Asp
245 250 255
Gly Arg Glu Leu Val Thr Pro Leu Gly Glu Asn Glu Glu Gly Leu Leu
260 265 270
Ile Ala Glu Leu Asp Ser Ala Ala Ile Thr Phe Ala Lys Leu Ala Ala
275 280 285
Asp Pro Val Gly His Tyr Ser Arg Pro Asp Val Thr Arg Leu Leu Phe
290 295 300
Asn Pro Ser Ala Asn Lys Thr Val Ile Lys Arg His Ser Pro Pro Glu
305 310 315 320
Leu Ile Ala Glu Gln Thr Ala Glu Glu Glu Glu Glu
325 330
<210>141
<211>1026
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>141
atggtgttca aggcagcgac tgttcatgca gctccggtat tcatggacaa ggaagcgtcg 60
atagataagg ctatcgacct catcaagaag gccggtcagg aagggattaa gcttctggtt 120
tttccggaaa cgtttattcc gggctatccg tattttatcg aatgctatcc gccgcttgcg 180
caggtggaag cgctcgccca gtacactgac gcttccgtgg agatcgacgg cccggaagtc 240
acccggcttc agcaggtagc caaggcggca ggcgttgcag tcgtcatggg catcagcgaa 300
cgaatggctg agacccgaac ctgcttcaac tcgcaggtgt tcattgacgt cgacggcacg 360
ctgctcggcg tgcatcgcaa gctgcagccg acttatgccg agcgcaaggt atgggcacag 420
ggcggtggtt atacgctgag gacctacaag agctcgcttg gcgtgctcgg cggtcttgcc 480
tgctgggagc acacgatgaa cctcgcgcgg caggccctga tcatgcagag cgagcagatc 540
catgcggctg catggcccgg actatcgacg atgcgaggtt tcgagcccgt ggccgatatc 600
cagatcgacg ccatgatgaa gactcacgcg cttaccgcac agtgctgggt gctttcggcc 660
ggcaatcccg tcgaccggac ctgcctcgac tggatggaaa agaacatcgg accgcaggat 720
tacgtcaccg agggcggcgg atggagcgcc gttatccatc cgttcaacag ctatctcggc 780
ggccctcaca cgggccttga ggaaaagctg gtcgtcggcg agatcaatct ggacgatctc 840
aagttcgtca aagtctggct cgacagcaaa gggcactatg ctcggccgga aatcctgaaa 900
cttggcgtca accaaaagca gatttggcct gatgaacatt tgctggcgcg gcaggatgtg 960
accgagttgc tggaggcgga tatcatcgaa taccccttgc aactgttgca agaccgcgcg 1020
caatag 1026
<210>142
<211>341
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>142
Met Val Phe Lys Ala Ala Thr Val His Ala Ala Pro Val Phe Met Asp
1 5 10 15
Lys Glu Ala Ser Ile Asp Lys Ala Ile Asp Leu Ile Lys Lys Ala Gly
20 25 30
Gln Glu Gly Ile Lys Leu Leu Val Phe Pro Glu Thr Phe Ile Pro Gly
35 40 45
Tyr Pro Tyr Phe Ile Glu Cys Tyr Pro Pro Leu Ala Gln Val Glu Ala
50 55 60
Leu Ala Gln Tyr Thr Asp Ala Ser Val Glu Ile Asp Gly Pro Glu Val
65 70 75 80
Thr Arg Leu Gln Gln Val Ala Lys Ala Ala Gly Val Ala Val Val Met
85 90 95
Gly Ile Ser Glu Arg Met Ala Glu Thr Arg Thr Cys Phe Asn Ser Gln
100 105 110
Val Phe Ile Asp Val Asp Gly Thr Leu Leu Gly Val His Arg Lys Leu
115 120 125
Gln Pro Thr Tyr Ala Glu Arg Lys Val Trp Ala Gln Gly Gly Gly Tyr
130 135 140
Thr Leu Arg Thr Tyr Lys Ser Ser Leu Gly Val Leu Gly Gly Leu Ala
145 150 155 160
Cys Trp Glu His Thr Met Asn Leu Ala Arg Gln Ala Leu Ile Met Gln
165 170 175
Ser Glu Gln Ile His Ala Ala Ala Trp Pro Gly Leu Ser Thr Met Arg
180 185 190
Gly Phe Glu Pro Val Ala Asp Ile Gln Ile Asp Ala Met Met Lys Thr
195 200 205
His Ala Leu Thr Ala Gln Cys Trp Val Leu Ser Ala Gly Asn Pro Val
210 215 220
Asp Arg Thr Cys Leu Asp Trp Met Glu Lys Asn Ile Gly Pro Gln Asp
225 230 235 240
Tyr Val Thr Glu Gly Gly Gly Trp Ser Ala Val Ile His Pro Phe Asn
245 250 255
Ser Tyr Leu Gly Gly Pro His Thr Gly Leu Glu Glu Lys Leu Val Val
260 265 270
Gly Glu Ile Asn Leu Asp Asp Leu Lys Phe Val Lys Val Trp Leu Asp
275 280 285
Ser Lys Gly His Tyr Ala Arg Pro Glu Ile Leu Lys Leu Gly Val Asn
290 295 300
Gln Lys Gln Ile Trp Pro Asp Glu His Leu Leu Ala Arg Gln Asp Val
305 310 315 320
Thr Glu Leu Leu Glu Ala Asp Ile Ile Glu Tyr Pro Leu Gln Leu Leu
325 330 335
Gln Asp Arg Ala Gln
340
<210>143
<211>1122
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>143
atgacgatca ttgcaggcgc ggttcatgcg gcgccggtat tcatggatgt cgatgccact 60
atcgacaagg catgcgaaat cattcgcaag gcaggcaaag acggaatcga gcttctcgtc 120
ttccctgagg ttttcgtacc cggctacccc tacttcatcg agtgctatcc gaccttgaac 180
caaaccgctg cgctggccgc ctatacggat gcctcgatcg aggttccagg cccggaagtc 240
cggcgcttgc aggtggccgc acatcaggcc ggcgtgatgg ttgtgatggg cgtgagcgag 300
cgtctgcgcg gatctcgcac ctgcttcaac agccaggtgt tcatcgaccg tgacggcacc 360
ttgctgggcg tgcaccgcaa actccagccg acctatgtcg agcgcatcgt ctggggccag 420
ggcggcggac acaccctcaa ggtattcgac agcacactgg gcaaggtggg cggactggcc 480
tgctgggagc acacgatgaa cctcgcgcgc catgcgttga tcgcccaggg tatccagatc 540
catgccgccg cctggcctgg gctttcgaca atggccgggt tcgaagcggt ggctgacgtc 600
cagatcgacg cgatgatgaa aactcatgcg ttgagcgcgc aatgctttgt cgtatcggcc 660
gcaaaccctg tggatcagac ctgcctggag tggatggaga aacacctcgg cccgcagcaa 720
ctcgttaccg ccggcggagg ctggtcggca atcgtccatc ctttctgtgg ttatatcgcc 780
gcccctcaca ccggtgccga ggagaaggtt ctggtaggcg aaatcaatct ggacgacctc 840
aagcaggtca aggtatgggt tgattccgca ggtcattatg cgcgcccgga agtcgtgcaa 900
ttgcgcgacg ccctggagag ccgtggcaat tatcgcgttg cgctgacccg cgacgccgac 960
accttcgtgc cgctggaaga ccgcgtgcgc tttgcgcgcc agcagaacgc cgacctcttc 1020
atctcgatcc acgccgacgc caacgccaac cacgatgcgc gcggggctgg cttcacttcg 1080
aaggttgaaa acctttccac gggcatttta ccaggcgatt ga 1122
<210>144
<211>373
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>144
Met Thr Ile Ile Ala Gly Ala Val His Ala Ala Pro Val Phe Met Asp
1 5 10 15
Val Asp Ala Thr Ile Asp Lys Ala Cys Glu Ile Ile Arg Lys Ala Gly
20 25 30
Lys Asp Gly Ile Glu Leu Leu Val Phe Pro Glu Val Phe Val Pro Gly
35 40 45
Tyr Pro Tyr Phe Ile Glu Cys Tyr Pro Thr Leu Asn Gln Thr Ala Ala
50 55 60
Leu Ala Ala Tyr Thr Asp Ala Ser Ile Glu Val Pro Gly Pro Glu Val
65 70 75 80
Arg Arg Leu Gln Val Ala Ala His Gln Ala Gly Val Met Val Val Met
85 90 95
Gly Val Ser Glu Arg Leu Arg Gly Ser Arg Thr Cys Phe Asn Ser Gln
100 105 110
Val Phe Ile Asp Arg Asp Gly Thr Leu Leu Gly Val His Arg Lys Leu
115 120 125
Gln Pro Thr Tyr Val Glu Arg Ile Val Trp Gly Gln Gly Gly Gly His
130 135 140
Thr Leu Lys Val Phe Asp Ser Thr Leu Gly Lys Val Gly Gly Leu Ala
145 150 155 160
Cys Trp Glu His Thr Met Asn Leu Ala Arg His Ala Leu Ile Ala Gln
165 170 175
Gly Ile Gln Ile His Ala Ala Ala Trp Pro Gly Leu Ser Thr Met Ala
180 185 190
Gly Phe Glu Ala Val Ala Asp Val Gln Ile Asp Ala Met Met Lys Thr
195 200 205
His Ala Leu Ser Ala Gln Cys Phe Val Val Ser Ala Ala Asn Pro Val
210 215 220
Asp Gln Thr Cys Leu Glu Trp Met Glu Lys His Leu Gly Pro Gln Gln
225 230 235 240
Leu Val Thr Ala Gly Gly Gly Trp Ser Ala Ile Val His Pro Phe Cys
245 250 255
Gly Tyr Ile Ala Ala Pro His Thr Gly Ala Glu Glu Lys Val Leu Val
260 265 270
Gly Glu Ile Asn Leu Asp Asp Leu Lys Gln Val Lys Val Trp Val Asp
275 280 285
Ser Ala Gly His Tyr Ala Arg Pro Glu Val Val Gln Leu Arg Asp Ala
290 295 300
Leu Glu Ser Arg Gly Asn Tyr Arg Val Ala Leu Thr Arg Asp Ala Asp
305 310 315 320
Thr Phe Val Pro Leu Glu Asp Arg Val Arg Phe Ala Arg Gln Gln Asn
325 330 335
Ala Asp Leu Phe Ile Ser Ile His Ala Asp Ala Asn Ala Asn His Asp
340 345 350
Ala Arg Gly Ala Gly Phe Thr Ser Lys Val Glu Asn Leu Ser Thr Gly
355 360 365
Ile Leu Pro Gly Asp
370
<210>145
<211>1014
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>145
atgggcatca cccacccgaa ctacaaggtc gcagtggtcc aggctgcgcc ggtctggttg 60
aacctcgagg caacggtcga gaagacaatc aggtatattg aagaggcggc caaggctgga 120
gcgaagctga tagcgtttcc ggaaacctgg attccgggct atccatggca catttggatc 180
ggaacgcccg catgggcaat cggtaagggc ttcgtccagc gctatttcga caactcgctc 240
agctatgaca gcccgctcgc gcggcagatc gctgacgccg cagcaaagag caagatcacg 300
gttgttctcg gcctctccga gcgcgacggt ggaagcctat acatcgcgca atggctgatc 360
ggaccagatg gcgagaccat cgcgaagcgg cgcaagctgc gtccgaccca cgtcgagcgc 420
acggtgttcg gtgacggtga cggcagccac atcgccgtgc atgaccgatc cgatctgggc 480
cggctcgggg cgttgtgctg ctgggagcac gtgcagccgt tgacgaaatt cgcgatgtac 540
gcgcagaacg agcaggttca cgtggcagca tggccgagct tctcgatgta cgaacccttt 600
gcgcatgcgc tgggttggga gacgaacaac gcggtcagca aggtctacgc ggtcgaggga 660
tcgtgcttcg tgctcgctcc ctgtgccgtt atttcgcaag cgatggtgga cgagatgtgc 720
gacactcccg acaagcgcga gcttgttcac gccggcggcg gccacgcggt gatttacggc 780
cctgacggaa gcccgctcgc agaaaagctc ggggaaaacg aagaggggct tctctacgcg 840
acggtcaatc ttgctgcgat cggggttgcc aagaatgccg cggatccggc cgggcactat 900
tcgcgtccgg acgttctaag gctgctattc aacaagagcc cggcccgaag agtggagcat 960
tttgcgctgc cgcacgagca gctcgagatc ggggcaggcc cgtctggcga ctga 1014
<210>146
<211>337
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>146
Met Gly Ile Thr His Pro Asn Tyr Lys Val Ala Val Val Gln Ala Ala
1 5 10 15
Pro Val Trp Leu Asn Leu Glu Ala Thr Val Glu Lys Thr Ile Arg Tyr
20 25 30
Ile Glu Glu Ala Ala Lys Ala Gly Ala Lys Leu Ile Ala Phe Pro Glu
35 40 45
Thr Trp Ile Pro Gly Tyr Pro Trp His Ile Trp Ile Gly Thr Pro Ala
50 55 60
Trp Ala Ile Gly Lys Gly Phe Val Gln Arg Tyr Phe Asp Asn Ser Leu
65 70 75 80
Ser Tyr Asp Ser Pro Leu Ala Arg Gln Ile Ala Asp Ala Ala Ala Lys
85 90 95
Ser Lys Ile Thr Val Val Leu Gly Leu Ser Glu Arg Asp Gly Gly Ser
100 105 110
Leu Tyr Ile Ala Gln Trp Leu Ile Gly Pro Asp Gly Glu Thr Ile Ala
115 120 125
Lys Arg Arg Lys Leu Arg Pro Thr His Val Glu Arg Thr Val Phe Gly
130 135 140
Asp Gly Asp Gly Ser His Ile Ala Val His Asp Arg Ser Asp Leu Gly
145 150 155 160
Arg Leu Gly Ala Leu Cys Cys Trp Glu His Val Gln Pro Leu Thr Lys
165 170 175
Phe Ala Met Tyr Ala Gln Asn Glu Gln Val His Val Ala Ala Trp Pro
180 185 190
Ser Phe Ser Met Tyr Glu Pro Phe Ala His Ala Leu Gly Trp Glu Thr
195 200 205
Asn Asn Ala Val Ser Lys Val Tyr Ala Val Glu Gly Ser Cys Phe Val
210 215 220
Leu Ala Pro Cys Ala Val Ile Ser Gln Ala Met Val Asp Glu Met Cys
225 230 235 240
Asp Thr Pro Asp Lys Arg Glu Leu Val His Ala Gly Gly Gly His Ala
245 250 255
Val Ile Tyr Gly Pro Asp Gly Ser Pro Leu Ala Glu Lys Leu Gly Glu
260 265 270
Asn Glu Glu Gly Leu Leu Tyr Ala Thr Val Asn Leu Ala Ala Ile Gly
275 280 285
Val Ala Lys Asn Ala Ala Asp Pro Ala Gly His Tyr Ser Arg Pro Asp
290 295 300
Val Leu Arg Leu Leu Phe Asn Lys Ser Pro Ala Arg Arg Val Glu His
305 310 315 320
Phe Ala Leu Pro His Glu Gln Leu Glu Ile Gly Ala Gly Pro Ser Gly
325 330 335
Asp
<210>147
<211>1098
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>147
atgacccagc acgagaccac tgcccggagg ctggcagctg tgcatgccgc gcctgtgttc 60
atggacaccg acgcgaccat cgacaaggtg atcggcttcg tcgaacaggc cggccgcgaa 120
ggcatcgaac tcctggtgtt ccccgagacc ttcgtgcctg gttaccccta ctggatcgag 180
tgctatccgc cgctgcagca ggtggccgcc aacgcgcagt acacggacgc ctccgtcgag 240
gtgcctggtc cggagatcaa gcgggtgcag gcggcctgtg cccgcgctgg cgtcgaagtc 300
gtcctcggcg tcagcgagcg actcaggggt accaggacat gcttcaactc ccaggtgttc 360
atcgacgccg acgggagcct gctcggcgtg caccgcaagc tgcagccgac gtacgtggag 420
cgcatcgtgt gggcccaggg cggaggcgcg accctgtcgg tgttcggctc ccgctccggc 480
cggatcggcg gtctggcctg ctgggagcac acgatgaacc tggctcgtca ggcactgctt 540
gagcaggagc agcagatcca cgcggcggcg tggcctgccc tgtcgacgat ggcggggttc 600
gagaccgtcg cggacgccca gatcgaggcc atgatgaaga cccatgcgct cacggcacag 660
gtgttcgtca tctgcgcgtc caacccggtc gacggcactt gcctggaatg gatgcgggac 720
aacctcggtg aacagaagtt cgtgaccgcc ggagggggct ggtccgcggt catccacccc 780
ttcaactcct tcctcggcgg gccgcatacc ggtttggagg agaagctcgt cagcgcgacg 840
atcgacttct ccgacatccg cttggtcaag gcctgggttg attcgaaggg gcactacgcg 900
cggcccgagg tcctgcgact cgcggtcgac cgcaagccac tgtggcacga cgagtgcgag 960
gtgccgggac aggcgcaggt acgcacccgc gctgcttctc tggcagtgca ggagcacccg 1020
gtggtgctgc ctcagggggc ggcgcggccc gctccgcaag actgggacac ctctgcggcg 1080
caggagctga cttcctga 1098
<210>148
<211>365
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>148
Met Thr Gln His Glu Thr Thr Ala Arg Arg Leu Ala Ala Val His Ala
1 5 10 15
Ala Pro Val Phe Met Asp Thr Asp Ala Thr Ile Asp Lys Val Ile Gly
20 25 30
Phe Val Glu Gln Ala Gly Arg Glu Gly Ile Glu Leu Leu Val Phe Pro
35 40 45
Glu Thr Phe Val Pro Gly Tyr Pro Tyr Trp Ile Glu Cys Tyr Pro Pro
50 55 60
Leu Gln Gln Val Ala Ala Asn Ala Gln Tyr Thr Asp Ala Ser Val Glu
65 70 75 80
Val Pro Gly Pro Glu Ile Lys Arg Val Gln Ala Ala Cys Ala Arg Ala
85 90 95
Gly Val Glu Val Val Leu Gly Val Ser Glu Arg Leu Arg Gly Thr Arg
100 105 110
Thr Cys Phe Asn Ser Gln Val Phe Ile Asp Ala Asp Gly Ser Leu Leu
115 120 125
Gly Val His Arg Lys Leu Gln Pro Thr Tyr Val Glu Arg Ile Val Trp
130 135 140
Ala Gln Gly Gly Gly Ala Thr Leu Ser Val Phe Gly Ser Arg Ser Gly
145 150 155 160
Arg Ile Gly Gly Leu Ala Cys Trp Glu His Thr Met Asn Leu Ala Arg
165 170 175
Gln Ala Leu Leu Glu Gln Glu Gln Gln Ile His Ala Ala Ala Trp Pro
180 185 190
Ala Leu Ser Thr Met Ala Gly Phe Glu Thr Val Ala Asp Ala Gln Ile
195 200 205
Glu Ala Met Met Lys Thr His Ala Leu Thr Ala Gln Val Phe Val Ile
210 215 220
Cys Ala Ser Asn Pro Val Asp Gly Thr Cys Leu Glu Trp Met Arg Asp
225 230 235 240
Asn Leu Gly Glu Gln Lys Phe Val Thr Ala Gly Gly Gly Trp Ser AIa
245 250 255
Val Ile His Pro Phe Asn Ser Phe Leu Gly Gly Pro His Thr Gly Leu
260 265 270
Glu Glu Lys Leu Val Ser Ala Thr Ile Asp Phe Ser Asp Ile Arg Leu
275 280 285
Val Lys Ala Trp Val Asp Ser Lys Gly His Tyr Ala Arg Pro Glu Val
290 295 300
Leu Arg Leu Ala Val Asp Arg Lys Pro Leu Trp His Asp Glu Cys Glu
305 310 315 320
Val Pro Gly Gln Ala Gln Val Arg Thr Arg Ala Ala Ser Leu Ala Val
325 330 335
Gln Glu His Pro Val Val Leu Pro Gln Gly Ala Ala Arg Pro Ala Pro
340 345 350
Gln Asp Trp Asp Thr Ser Ala Ala Gln Glu Leu Thr Ser
355 360 365
<210>149
<211>942
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>149
atgacgaagc ttgagaaggt ggtcgcggcg gcggtccagg cgacgccgga gttcctcgac 60
cgcgaggcga ccgtcgagaa ggccgtgcgg ctgatcaagg aagcggccgg ggagggcgcc 120
ggcctgatcg tgttccccga gacgttcatc ccgacgtacc cggactgggt ctggcgcgcg 180
ccggcctggg acggcccatc cgcggacctg tacgcaatgc tgctggagaa cgcggtggag 240
atccccgggc cggtgacgga gaccctgggg aaggcggcga agcaggccaa ggccttcgtg 300
tcgatgggcg tcaacgagcg cgagccgggc ggcgggacga tctacaacac gcaggtcacg 360
ttcggacccg acgggagcgt gctcggcaag caccgcaagc tgatgccgac cggcggcgag 420
cgcctggtgt gggggatggg cgacgggtcg atgctccagg tctatgacac gccgttcggc 480
cgcctgggcg ggctgatctg ctgggagaac tacatgccgc tcgcgcgcta ctcgatgtac 540
gccaagggcg tggacgtcta cgttgcgccg acgtgggaca acagcgacat gtgggtggcg 600
acgctccgcc acatcgccaa ggaggggcgg ctgtacgtga tcggcgtggc gccgctgctg 660
cgcgggtcgg acgtccccga cgacgtgccg gggaaggccg agctgtgggg cggcgatgac 720
gactggatgt cgcgcggctt ctccaccatc gtcgcgccgg gcggcgaggt gctggccggt 780
ccgctgacgg aggaggaagg catcctctac gcggagatcg acccggcgag agcccgttcg 840
tcacggcacc agttcgatcc ggtggggcac tactcgcgcc ccgacgtgtt tcggctcgtc 900
gtggacgagt cgcccaagcc ccagacgtcc ggcccgggct ag 942
<210>150
<211>313
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>150
Met Thr Lys Leu Glu Lys Val Val Ala Ala Ala Val Gln Ala Thr Pro
1 5 10 15
Glu Phe Leu Asp Arg Glu Ala Thr Val Glu Lys Ala Val Arg Leu Ile
20 25 30
Lys Glu Ala Ala Gly Glu Gly Ala Gly Leu Ile Val Phe Pro Glu Thr
35 40 45
Phe Ile Pro Thr Tyr Pro Asp Trp Val Trp Arg Ala Pro Ala Trp Asp
50 55 60
Gly Pro Ser Ala Asp Leu Tyr Ala Met Leu Leu Glu Asn Ala Val Glu
65 70 75 80
Ile Pro Gly Pro Val Thr Glu Thr Leu Gly Lys Ala Ala Lys Gln Ala
85 90 95
Lys Ala Phe Val Ser Met Gly Val Asn Glu Arg Glu Pro Gly Gly Gly
100 105 110
Thr Ile Tyr Asn Thr Gln Val Thr Phe Gly Pro Asp Gly Ser Val Leu
115 120 125
Gly Lys His Arg Lys Leu Met Pro Thr Gly Gly Glu Arg Leu Val Trp
130 135 140
Gly Met Gly Asp Gly Ser Met Leu Gln Val Tyr Asp Thr Pro Phe Gly
145 150 155 160
Arg Leu Gly Gly Leu Ile Cys Trp Glu Asn Tyr Met Pro Leu Ala Arg
165 170 175
Tyr Ser Met Tyr Ala Lys Gly Val Asp Val Tyr Val Ala Pro Thr Trp
180 185 190
Asp Asn Ser Asp Met Trp Val Ala Thr Leu Arg His Ile Ala Lys Glu
195 200 205
Gly Arg Leu Tyr Val Ile Gly Val Ala Pro Leu Leu Arg Gly Ser Asp
210 215 220
Val Pro Asp Asp Val Pro Gly Lys Ala Glu Leu Trp Gly Gly Asp Asp
225 230 235 240
Asp Trp Met Ser Arg Gly Phe Ser Thr Ile Val Ala Pro Gly Gly Glu
245 250 255
Val Leu Ala Gly Pro Leu Thr Glu Glu Glu Gly Ile Leu Tyr Ala Glu
260 265 270
Ile Asp Pro Ala Arg Ala Arg Ser Ser Arg His Gln Phe Asp Pro Val
275 280 285
Gly His Tyr Ser Arg Pro Asp Val Phe Arg Leu Val Val Asp Glu Ser
290 295 300
Pro Lys Pro Gln Thr Ser Gly Pro Gly
305 310
<210>151
<211>993
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>151
atgagagtcg ttaaagccgc cgcggtccaa ctgaaaccag tcctttatag ccgtgaggga 60
acagtcgata acgtcgtcaa gaagatccac gagctgggcc aacaaggagt gcagttcgca 120
acgttcccgg aaaccgtggt gccttactat ccgtactttt cgatcgtgca gtccggctat 180
caaatccttg ccggcggtga gttcctaaag ctgcttgatc agtcagtgac cgtgccatct 240
cttgccaccg aagcgatcgg cgaggcctgc aggcaagcgg gcgtcgttgt ctccatcggc 300
gtcaacgagc gtgacggggg aactctgtac aatacgcaac ttctctttga tgccgacggc 360
acgttgattc aaagacgacg caagatcacg cccacccatt acgagcgcat ggtctggggc 420
cagggcgatg gctcaggttt acgggcggtt gacagcaagg tcgcgcgcat tggtcaactg 480
gcttgttttg agcactacaa cccgcttgcg cgttacgcca tgatggccga tggcgagcaa 540
atccactctg cgatgttccc gggctccatg ttcggcgatg cgttttcaga gaaggtggaa 600
atcaacgtaa ggcagcatgc aatggagtct ggatgctttg tcgtctgcgc tacggcctgg 660
ctggatgccg accaacaggc acaaatcatg aaggacacag gctgcgagat cggtccgatc 720
tcgggcggtt gcttcaccgc tatcgtgaca cccgacggga cgctgatagg cgaacccatc 780
cactcgggcg aaggcgtttg tattgccgac ctcgatttca agctcatcga caagcggaag 840
cacgtggtgg acacgcgcgg ccactacagc cggccagaat tgctcagcct cctaattgat 900
cggactccca cggcacacat acacgaacgg aacgagcaac cgaagtcggc cgttgagcaa 960
gactcgcaga atgtattcac cgctattgct taa 993
<210>152
<211>330
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>152
Met Arg Val Val Lys Ala Ala Ala Val Gln Leu Lys Pro Val Leu Tyr
1 5 10 15
Ser Arg Glu Gly Thr Val Asp Asn Val Val Lys Lys Ile His Glu Leu
20 25 30
Gly Gln Gln Gly Val Gln Phe Ala Thr Phe Pro Glu Thr Val Val Pro
35 40 45
Tyr Tyr Pro Tyr Phe Ser Ile Val Gln Ser Gly Tyr Gln Ile Leu Ala
50 55 60
Gly Gly Glu Phe Leu Lys Leu Leu Asp Gln Ser Val Thr Val Pro Ser
65 70 75 80
Leu Ala Thr Glu Ala Ile Gly Glu Ala Cys Arg Gln Ala Gly Val Val
85 90 95
Val Ser Ile Gly Val Asn Glu Arg Asp Gly Gly Thr Leu Tyr Asn Thr
100 105 110
Gln Leu Leu Phe Asp Ala Asp Gly Thr Leu Ile Gln Arg Arg Arg Lys
115 120 125
Ile Thr Pro Thr His Tyr Glu Arg Met Val Trp Gly Gln Gly Asp Gly
130 135 140
Ser Gly Leu Arg Ala Val Asp Ser Lys Val Ala Arg Ile Gly Gln Leu
145 150 155 160
Ala Cys Phe Glu His Tyr Asn Pro Leu Ala Arg Tyr Ala Met Met Ala
165 170 175
Asp Gly Glu Gln Ile His Ser Ala Met Phe Pro Gly Ser Met Phe Gly
180 185 190
Asp Ala Phe Ser Glu Lys Val Glu Ile Asn Val Arg Gln His Ala Met
195 200 205
Glu Ser Gly Cys Phe Val Val Cys Ala Thr Ala Trp Leu Asp Ala Asp
210 215 220
Gln Gln Ala Gln Ile Met Lys Asp Thr Gly Cys Glu Ile Gly Pro Ile
225 230 235 240
Ser Gly Gly Cys Phe Thr Ala Ile Val Thr Pro Asp Gly Thr Leu Ile
245 250 255
Gly Glu Pro Ile His Ser Gly Glu Gly Val Cys Ile Ala Asp Leu Asp
260 265 270
Phe Lys Leu Ile Asp Lys Arg Lys His Val Val Asp Thr Arg Gly His
275 280 285
Tyr Ser Arg Pro Glu Leu Leu Ser Leu Leu Ile Asp Arg Thr Pro Thr
290 295 300
Ala His Ile His Glu Arg Asn Glu Gln Pro Lys Ser Ala Val Glu Gln
305 310 315 320
Asp Ser Gln Asn Val Phe Thr Ala Ile Ala
325 330
<210>153
<211>1074
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>153
atgccaaacg caagaaagat tgttggagcc gtggcccaag ttgcacagga attcttcgac 60
actgaagcga atctcggtaa agcgatagcg gcgattcaca atgctgcgaa gcaaggcgca 120
gatatcgtcg tcttcgccga atgctatttg ggccaatatc catattgggc gcaattttac 180
gacaactctg ccaagaacta ttccaaggtt tggacggccc tgtacgacgg tgcgatcact 240
gtgggtggcg atgaatgccg ggctattgct gctgcggcta gacagtccaa gattcatgtc 300
gtcatgggtt gcaatgagct atccgaccga gccggcggcg caacgttata caacagcctc 360
ttgtttttcg accgaaaggg cgagttgatc ggtcgacacc ggaaattgat gccgtcgatg 420
cacgagcggt tgatccatgg cacaggcgac ggaagagact tgaatgttta cgataccgat 480
atcggtatgt tgggtgggtt gatttgctgg gagcaccata tgtcgctctc gaagtatgcc 540
atggcgacta tgggtgaaga agttcatgtt gcaagctggc ctgggatgtg gcgcggagga 600
gacgcggcaa tcggtgagag gatggtcgaa gcggatcttg gggcgccgtt tgtttgtgac 660
gccgaatttg cgatccgaga atatgcggca gagacaggaa atttcgttct aagcgcgtct 720
ggatattttc cgaaggacaa tatatccgat gagtggcgcg aagcgattcc aaaccttcaa 780
gcgcagtggg ctgtgggcgg gagttctatc gtggcaccgg ggggctccta tctggtccca 840
ccactcatta atgaggagaa gatcctctgc gccgaactcg atttcaatct caggcgtctt 900
tggaaagcct ggatcgatcc gattggtcac tattcgcgtc ccgatgttta tagcctgcaa 960
ctgcataacg ttgctgggcg tgagtattcc tatcaggccg tagatttgaa gcgcacgcca 1020
aagccccaat cgctgtgggt agatgcgtcc gaggaagacg gtgcgctgaa ttga 1074
<210>154
<211>357
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>154
Met Pro Asn Ala Arg Lys Ile Val Gly Ala Val Ala Gln Val Ala Gln
1 5 10 15
Glu Phe Phe Asp Thr Glu Ala Asn Leu Gly Lys Ala Ile Ala Ala Ile
20 25 30
His Asn Ala Ala Lys Gln Gly Ala Asp Ile Val Val Phe Ala Glu Cys
35 40 45
Tyr Leu Gly Gln Tyr Pro Tyr Trp Ala Gln Phe Tyr Asp Asn Ser Ala
50 55 60
Lys Asn Tyr Ser Lys Val Trp Thr Ala Leu Tyr Asp Gly Ala Ile Thr
65 70 75 80
Val Gly Gly Asp Glu Cys Arg Ala Ile Ala Ala Ala Ala Arg Gln Ser
85 90 95
Lys Ile His Val Val Met Gly Cys Asn Glu Leu Ser Asp Arg Ala Gly
100 105 110
Gly Ala Thr Leu Tyr Asn Ser Leu Leu Phe Phe Asp Arg Lys Gly Glu
115 120 125
Leu Ile Gly Arg His Arg Lys Leu Met Pro Ser Met His Glu Arg Leu
130 135 140
Ile His Gly Thr Gly Asp Gly Arg Asp Leu Asn Val Tyr Asp Thr Asp
145 150 155 160
Ile Gly Met Leu Gly Gly Leu Ile Cys Trp Glu His His Met Ser Leu
165 170 175
Ser Lys Tyr Ala Met Ala Thr Met Gly Glu Glu Val His Val Ala Ser
180 185 190
Trp Pro Gly Met Trp Arg Gly Gly Asp Ala Ala Ile Gly Glu Arg Met
195 200 205
Val Glu Ala Asp Leu Gly Ala Pro Phe Val Cys Asp Ala Glu Phe Ala
210 215 220
Ile Arg Glu Tyr Ala Ala Glu Thr Gly Asn Phe Val Leu Ser Ala Ser
225 230 235 240
Gly Tyr Phe Pro Lys Asp Asn Ile Ser Asp Glu Trp Arg Glu Ala Ile
245 250 255
Pro Asn Leu Gln Ala Gln Trp Ala Val Gly Gly Ser Ser Ile Val Ala
260 265 270
Pro Gly Gly Ser Tyr Leu Val Pro Pro Leu Ile Asn Glu Glu Lys Ile
275 280 285
Leu Cys Ala Glu Leu Asp Phe Asn Leu Arg Arg Leu Trp Lys Ala Trp
290 295 300
Ile Asp Pro Ile Gly His Tyr Ser Arg Pro Asp Val Tyr Ser Leu Gln
305 310 315 320
Leu His Asn Val Ala Gly Arg Glu Tyr Ser Tyr Gln Ala Val Asp Leu
325 330 335
Lys Arg Thr Pro Lys Pro Gln Ser Leu Trp Val Asp Ala Ser Glu Glu
340 345 350
Asp Gly Ala Leu Asn
355
<210>155
<211>1041
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>155
atgggcatcg aacatccgaa atacaaggtg gccgtggtgc aggccgcgcc ggcctggctc 60
gatctcgacg gctcgatcaa gaaggcgatt gcgctgatcg aggaagcggc cgccaagggc 120
gctaagctga tcgctttccc cgaaaccttc attcccggct atccctggca catctggctg 180
gactcgccgg cctgggcgat cggccgcggc tttgtgcagc gctacttcga taactcgctg 240
gcctacgaca gcccgcaagc cgaaaagctg cgcgccgcgg tcaagaaggc caagctcact 300
gccgtgattg gcctgtcgga gcgcgacggc ggcagcctct atatagcgca atggctgatt 360
ggccctgatg gcgagaccat cgcaaaacgc agaaagctgc ggccaacgca cgcggaacgc 420
accgtttttg gcgagggtga cggcagcgac cttgccgtgc acgaccggcc cggaatcggg 480
cggctgggag cgctgtgctg ctgggagcac ctgcaaccgc tttcgaaata cgcgatgtat 540
gcgcagaacg aacaggtcca tgtcgcgtca tggccgagct tctcgctcta cgaccccttc 600
gcgccggcgc tcggcgccga ggtcaacaat gcggcttccc gcgtctacgc ggtcgagggc 660
tcgtgcttcg tgctggcgcc gtgcgccacg gtttcgcaag ccatgatcga cgagctgtgt 720
gaccggccgg acaagcatgc gctgttgcac gccggtggcg gacacgccgc gatttacggc 780
ccggacggca gctcgatcgc ggagaagctg ccgcaggacg cggagggcct gttgatcgcc 840
gagatcgatc tcggggcgat cggggttgcc aagaatgcag ccgacccggc cggtcattat 900
tcgcggccgg acgtgacgcg actcctgctg aacaagaacc ggatgcgaag ggtcgaggag 960
tttgcgctgc cggtcgatcc ggtcgcaacg accgaggagg agcaagtcgc gacgccgtcg 1020
aggcccagcc aggccgcgta a 1041
<210>156
<211>346
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>156
Met Gly Ile Glu His Pro Lys Tyr Lys Val Ala Val Val Gln Ala Ala
1 5 10 15
Pro Ala Trp Leu Asp Leu Asp Gly Ser Ile Lys Lys Ala Ile Ala Leu
20 25 30
Ile Glu Glu Ala Ala Ala Lys Gly Ala Lys Leu Ile Ala Phe Pro Glu
35 40 45
Thr Phe Ile Pro Gly Tyr Pro Trp His Ile Trp Leu Asp Ser Pro Ala
50 55 60
Trp Ala Ile Gly Arg Gly Phe Val Gln Arg Tyr Phe Asp Asn Ser Leu
65 70 75 80
Ala Tyr Asp Ser Pro Gln Ala Glu Lys Leu Arg Ala Ala Val Lys Lys
85 90 95
Ala Lys Leu Thr Ala Val Ile Gly Leu Ser Glu Arg Asp Gly Gly Ser
100 105 110
Leu Tyr Ile Ala Gln Trp Leu Ile Gly Pro Asp Gly Glu Thr Ile Ala
115 120 125
Lys Arg Arg Lys Leu Arg Pro Thr His Ala Glu Arg Thr Val Phe Gly
130 135 140
Glu Gly Asp Gly Ser Asp Leu Ala Val His Asp Arg Pro Gly Ile Gly
145 150 155 160
Arg Leu Gly Ala Leu Cys Cys Trp Glu His Leu Gln Pro Leu Ser Lys
165 170 175
Tyr Ala Met Tyr Ala Gln Asn Glu Gln Val His Val Ala Ser Trp Pro
180 185 190
Ser Phe Ser Leu Tyr Asp Pro Phe Ala Pro Ala Leu Gly Ala Glu Val
195 200 205
Asn Asn Ala Ala Ser Arg Val Tyr Ala Val Glu Gly Ser Cys Phe Val
210 215 220
Leu Ala Pro Cys Ala Thr Val Ser Gln Ala Met Ile Asp Glu Leu Cys
225 230 235 240
Asp Arg Pro Asp Lys His Ala Leu Leu His Ala Gly Gly Gly His Ala
245 250 255
Ala Ile Tyr Gly Pro Asp Gly Ser Ser Ile Ala Glu Lys Leu Pro Gln
260 265 270
Asp Ala Glu Gly Leu Leu Ile Ala Glu Ile Asp Leu Gly Ala Ile Gly
275 280 285
Val Ala Lys Asn Ala Ala Asp Pro Ala Gly His Tyr Ser Arg Pro Asp
290 295 300
Val Thr Arg Leu Leu Leu Asn Lys Asn Arg Met Arg Arg Val Glu Glu
305 310 315 320
Phe Ala Leu Pro Val Asp Pro Val Ala Thr Thr Glu Glu Glu Gln Val
325 330 335
Ala Thr Pro Ser Arg Pro Ser Gln Ala Ala
340 345
<210>157
<211>1011
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>157
atgagagtcg ttaaagctgc tgcggtccaa ctgagtcccg tgctgtatag ccgtgaggga 60
acagtagaaa aggtcgttcg gaagatccac gagcttggcg atcaaggagt cgagttcgcc 120
acgttcccgg agaccgtagt gccctactat ccgtacttct cggccgtcca gacgccgatt 180
cagaacatgc acggcccgga gcacctgaag ttgctcgagc aatcggtgac cgtcccgtcg 240
cccgccaccg acgcgatcgg cgacgcctgc cgccacgccg gcgtcgtcgt ctcgatcggc 300
gtcaacgaac gcgatggcgg cacgatctac aacacgcagc tcctgttcga cgccgacggc 360
accttgatcc agcgccggcg aaagatcacg ccgaccttct acgaacgaat ggtctgggga 420
cagggtgacg gttcggggct gcgcgccgtc gacagccgcg taggacgcat cggccagctc 480
gcctgtttcg agcactacaa cccgctggcg cgctacgcca tgatggccga cggcgagcag 540
attcactccg cgatgtaccc cggctccatc tttggagacg cattcgcgca gaaaatcgag 600
atcaacatcc gccagcacgc gctcgagtcc ggtgcgttcg tcgtcaacgc caccgcctgg 660
ctcgatgccg accagcaggc gcggatcatg aaggataccg gctgcaccat cgaaccgatc 720
tcgggcggtt gcttcaccgc catcgtcacc ccggacggga ccctgctggg cgaagcgata 780
cgttcggggg agggagtggt ggtcgccgat ctcgacttca cgctgatcga caggcgcaag 840
caagtgatgg actctcgtgg tcactacagt cggccggagt tgctcagcct tctgatcgac 900
cgcacaccca ccgcacacct acacgaacgc gaagcgcacc ccagagcaag tgaggactgg 960
caaggttccg agagtctgcg cgccatgcag gcctcggcac cgaaggtctg a 1011
<210>158
<211>336
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>158
Met Arg Val Val Lys Ala Ala Ala Val Gln Leu Ser Pro Val Leu Tyr
1 5 10 15
Ser Arg Glu Gly Thr Val Glu Lys Val Val Arg Lys Ile His Glu Leu
20 25 30
Gly Asp Gln Gly Val Glu Phe Ala Thr Phe Pro Glu Thr Val Val Pro
35 40 45
Tyr Tyr Pro Tyr Phe Ser Ala Val Gln Thr Pro Ile Gln Asn Met His
50 55 60
Gly Pro Glu His Leu Lys Leu Leu Glu Gln Ser Val Thr Val Pro Ser
65 70 75 80
Pro Ala Thr Asp Ala Ile Gly Asp Ala Cys Arg His Ala Gly Val Val
85 90 95
Val Ser Ile Gly Val Asn Glu Arg Asp Gly Gly Thr Ile Tyr Asn Thr
100 105 110
Gln Leu Leu Phe Asp Ala Asp Gly Thr Leu Ile Gln Arg Arg Arg Lys
115 120 125
Ile Thr Pro Thr Phe Tyr Glu Arg Met Val Trp Gly Gln Gly Asp Gly
130 135 140
Ser Gly Leu Arg Ala Val Asp Ser Arg Val Gly Arg Ile Gly Gln Leu
145 150 155 160
Ala Cys Phe Glu His Tyr Asn Pro Leu Ala Arg Tyr Ala Met Met Ala
165 170 175
Asp Gly Glu Gln Ile His Ser Ala Met Tyr Pro Gly Ser Ile Phe Gly
180 185 190
Asp Ala Phe Ala Gln Lys Ile Glu Ile Asn Ile Arg Gln His Ala Leu
195 200 205
Glu Ser Gly Ala Phe Val Val Asn Ala Thr Ala Trp Leu Asp Ala Asp
210 215 220
Gln Gln Ala Arg Ile Met Lys Asp Thr Gly Cys Thr Ile Glu Pro Ile
225 230 235 240
Ser Gly Gly Cys Phe Thr Ala Ile Val Thr Pro Asp Gly Thr Leu Leu
245 250 255
Gly Glu Ala Ile Arg Ser Gly Glu Gly Val Val Val Ala Asp Leu Asp
260 265 270
Phe Thr Leu Ile Asp Arg Arg Lys Gln Val Met Asp Ser Arg Gly His
275 280 285
Tyr Ser Arg Pro Glu Leu Leu Ser Leu Leu Ile Asp Arg Thr Pro Thr
290 295 300
Ala His Leu His Glu Arg Glu Ala His Pro Arg Ala Ser Glu Asp Trp
305 310 315 320
Gln Gly Ser Glu Ser Leu Arg Ala Met Gln Ala Ser Ala Pro Lys Val
325 330 335
<210>159
<211>930
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>159
atgtcatcaa ccgtgacggt tgccattatt caggcagcac ccgtgtatta tgacctgcct 60
gccacgctgg acaaagccgc caaactggtg gcggatgcgg cggcacaggg cgcaacgctg 120
attgtcttcg gcgagacatg gtttccgggg tatccggcat ggctggatta ctgccccaat 180
gtcgcgctgt ggaatcatcc cccgaccaag caggtatttg agcgcctgca tcgcaacagc 240
atcgctgtgc caagcaagga actcgatttt ctgggggcgc tggcacgcaa gcatcaggtg 300
gtgctggtgt tgagcattaa tgaacgtgtg gagcagggcg cggggcatgg cacgctgtat 360
aacacgctgc tcacgattga cgccgatggc acgctggcaa atcatcatcg caaactgatg 420
ccgacctata ccgagcgcat ggtgtggggc atgggcgacg gggtggggtt gcaagcggtg 480
gatactgccg tcgggcgcgt aggcggctta atctgctggg aacactggat gccgttggca 540
cgccagacca tgcacatcag cggcgaacag attcatattt ccgtcttccc aaccgtccat 600
gagatgcacc agattgccag ccgccagtat gcctttgaag ggcggacgtt tgtgctgacc 660
gttggcggca ttcttgcggc acaggacttg cccgccgaac tggaacgccc cgccgatttg 720
ccgcccacgc agcttgtcca gcgcggcggc agcgccatta tcgcgccgga tggtcgttat 780
ctggcgggtc cagtctataa tgaggaaacc atcctgaccg caacgctgga tttgggcgag 840
atcatccgcg agagcatgac gctggatgtc accggacatt atgcccgccc ggatgttttt 900
gacctgaccg tgaagcgcag ccgaccatga 930
<210>160
<211>309
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>160
Met Ser Ser Thr Val Thr Val Ala Ile Ile Gln Ala Ala Pro Val Tyr
1 5 10 15
Tyr Asp Leu Pro Ala Thr Leu Asp Lys Ala Ala Lys Leu Val Ala Asp
20 25 30
Ala Ala Ala Gln Gly Ala Thr Leu Ile Val Phe Gly Glu Thr Trp Phe
35 40 45
Pro Gly Tyr Pro Ala Trp Leu Asp Tyr Cys Pro Asn Val Ala Leu Trp
50 55 60
Asn His Pro Pro Thr Lys Gln Val Phe Glu Arg Leu His Arg Asn Ser
65 70 75 80
Ile Ala Val Pro Ser Lys Glu Leu Asp Phe Leu Gly Ala Leu Ala Arg
85 90 95
Lys His Gln Val Val Leu Val Leu Ser Ile Asn Glu Arg Val Glu Gln
100 105 110
Gly Ala Gly His Gly Thr Leu Tyr Asn Thr Leu Leu Thr Ile Asp Ala
115 120 125
Asp Gly Thr Leu Ala Asn His His Arg Lys Leu Met Pro Thr Tyr Thr
130 135 140
Glu Arg Met Val Trp Gly Met Gly Asp Gly Val Gly Leu Gln Ala Val
145 150 155 160
Asp Thr Ala Val Gly Arg Val Gly Gly Leu Ile Cys Trp Glu His Trp
165 170 175
Met Pro Leu Ala Arg Gln Thr Met His Ile Ser Gly Glu Gln Ile His
180 185 190
Ile Ser Val Phe Pro Thr Val His Glu Met His Gln Ile Ala Ser Arg
195 200 205
Gln Tyr Ala Phe Glu Gly Arg Thr Phe Val Leu Thr Val Gly Gly Ile
210 215 220
Leu Ala Ala Gln Asp Leu Pro Ala Glu Leu Glu Arg Pro Ala Asp Leu
225 230 235 240
Pro Pro Thr Gln Leu Val Gln Arg Gly Gly Ser Ala Ile Ile Ala Pro
245 250 255
Asp Gly Arg Tyr Leu Ala Gly Pro Val Tyr Asn Glu Glu Thr Ile Leu
260 265 270
Thr Ala Thr Leu Asp Leu Gly Glu Ile Ile Arg Glu Ser Met Thr Leu
275 280 285
Asp Val Thr Gly His Tyr Ala Arg Pro Asp Val Phe Asp Leu Thr Val
290 295 300
Lys Arg Ser Arg Pro
305
<210>161
<211>1008
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>161
atgaccacca tccgcgccgc cgccgtgcag tttagcccgg tgctgtactc gcgccaggcc 60
accgtcgaca agctgtgccg caccctgctg gaactgggcc gcgaaggggt gcagttcgcg 120
gtattcccgg aaaccgtggt gccgtactac ccatattttt ccttcgtgca gccaccgttc 180
gccatgggca aacaacacct gttgctgctc gagcaatccg tcactgtgcc ctctgacgtc 240
acccggcaga tcggtgaggc ctgccgggaa gcggggatcg tcgccagcat cggcgtcaac 300
gaacgcgacg gcggcactat ttataacgcg cagttgctgt tcgatgccga cggcagcctg 360
attcagcagc ggcgcaagat caccccgacc tatcacgaac gcatggtctg ggggcagggc 420
gatggttccg gcctgcgcgc cgtggacagt gcggtggggc gtatcggttc cctggcctgc 480
tgggaacatt acaaccccct ggcgcgctac gcgctgatgg ccgatggcga acagattcat 540
gtggcgatgt ttcccggctc cctggtcggc gacatctttg ccgagcagat cgaagtcacc 600
atccgccacc acgccctgga aagcggctgc ttcgtggtca acgccacggc ttggctggat 660
gccgaccagc agggccggat catgcaggac accggctgcg agttggggcc gatttccggc 720
ggctgtttta ccgcgatcat ttccccggag ggcaaggttc tcggcgagcc gctgcgcagc 780
ggcgaagggg tggtcattgc tgacctcgac ctggccctga tcgacaagcg caaacgcatg 840
atggattcgg tcggtcacta cagccgcccg gaactgctca gcctgcttat cgaccgcagc 900
ccgaccgccc acgtgcatga acttgccgcc gcgcttaatc ctgccaggga gtctgatcca 960
ctagtgtcga cctgcaggcg cgcgagctcc agcttttgtt ccctttag 1008
<210>162
<211>335
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>162
Met Thr Thr Ile Arg Ala Ala Ala Val Gln Phe Ser Pro Val Leu Tyr
1 5 10 15
Ser Arg Gln Ala Thr Val Asp Lys Leu Cys Arg Thr Leu Leu Glu Leu
20 25 30
Gly Arg Glu Gly Val Gln Phe Ala Val Phe Pro Glu Thr Val Val Pro
35 40 45
Tyr Tyr Pro Tyr Phe Ser Phe Val Gln Pro Pro Phe Ala Met Gly Lys
50 55 60
Gln His Leu Leu Leu Leu Glu Gln Ser Val Thr Val Pro Ser Asp Val
65 70 75 80
Thr Arg Gln Ile Gly Glu Ala Cys Arg Glu Ala Gly Ile Val Ala Ser
85 90 95
Ile Gly Val Asn Glu Arg Asp Gly Gly Thr Ile Tyr Asn Ala Gln Leu
100 105 110
Leu Phe Asp Ala Asp Gly Ser Leu Ile Gln Gln Arg Arg Lys Ile Thr
115 120 125
Pro Thr Tyr His Glu Arg Met Val Trp Gly Gln Gly Asp Gly Ser Gly
130 135 140
Leu Arg Ala Val Asp Ser Ala Val Gly Arg Ile Gly Ser Leu Ala Cys
145 150 155 160
Trp Glu His Tyr Asn Pro Leu Ala Arg Tyr Ala Leu Met Ala Asp Gly
165 170 175
Glu Gln Ile His Val Ala Met Phe Pro Gly Ser Leu Val Gly Asp Ile
180 185 190
Phe Ala Glu Gln Ile Glu Val Thr Ile Arg His His Ala Leu Glu Ser
195 200 205
Gly Cys Phe Val Val Asn Ala Thr Ala Trp Leu Asp Ala Asp Gln Gln
210 215 220
Gly Arg Ile Met Gln Asp Thr Gly Cys Glu Leu Gly Pro Ile Ser Gly
225 230 235 240
Gly Cys Phe Thr Ala Ile Ile Ser Pro Glu Gly Lys Val Leu Gly Glu
245 250 255
Pro Leu Arg Ser Gly Glu Gly Val Val Ile Ala Asp Leu Asp Leu Ala
260 265 270
Leu Ile Asp Lys Arg Lys Arg Met Met Asp Ser Val Gly His Tyr Ser
275 280 285
Arg Pro Glu Leu Leu Ser Leu Leu Ile Asp Arg Ser Pro Thr Ala His
290 295 300
Val His Glu Leu Ala Ala Ala Leu Asn Pro Ala Arg Glu Ser Asp Pro
305 310 315 320
Leu Val Ser Thr Cys Arg Arg Ala Ser Ser Ser Phe Cys Ser Leu
325 330 335
<210>163
<211>978
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>163
gtgaccatca tcaaagccgc cgcagtgcag atcagccccg tgctttacag ccgggaagcc 60
accgtcgaaa aggtcgttcg cgagacccgc gaactcggcc agaagggcgt gcagttcgca 120
acgtttccgg aaaccgtggt gccgtactac ccatacttct ccgccgtcca gacgggcatc 180
gaactgctgt ccggcaaaga gcacctgcga ctgctggagc aggccgtgac tgttccttcc 240
cccgccactg atgcgattgc ccaggcggca cgcgaggccg gcatggtggt gtcgatcggc 300
gtcaacgagc gtgacggcgg caccatctac aacacgcagc tgctctttga tgccgacggc 360
acgctggtgc agcgccgccg caagatcacg ccgacgcatt tcgagcgcat ggtgtggggc 420
cagggcgacg gttcgggcct gcgcgcagtg gataccaagg tcggccgcat tggccagctg 480
gcctgcttcg agcacaacaa cccgctcgcg cgctacgcaa tgatggccga tggcgagcag 540
atccattcct ccatgtaccc gggctccgcc ttcggcgacg gattcgcgca gcgcatggag 600
atcaacattc gccaacacgc cctggagtcg ggttgcttcg tggtgaatgc caccgcgtgg 660
ctcgacgccg accagcaggc gcagatcatg aaggacacgg gctgcgccat cgggccgatc 720
tctggcggct gcttcacgac catcgtcacg ccggacggca tgctgatcgg cgaacccctc 780
cgcgagggcg agggcgagat catcgccgac ctcgatttca ccctgatcga ccgccgcaag 840
ctgctgatgg actcggtcgg ccactacaac cgtccggagc tgctgagcct gctgatcgac 900
cgcacacccg cggcgaactt ccatgagcgc agtacgcatc cggccgtcga tgccgccagc 960
ggcctcgaaa tcctctaa 978
<210>164
<211>325
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>164
Val Thr Ile Ile Lys Ala Ala Ala Val Gln Ile Ser Pro Val Leu Tyr
1 5 10 15
Ser Arg Glu Ala Thr Val Glu Lys Val Val Arg Glu Thr Arg Glu Leu
20 25 30
Gly Gln Lys Gly Val Gln Phe Ala Thr Phe Pro Glu Thr Val Val Pro
35 40 45
Tyr Tyr Pro Tyr Phe Ser Ala Val Gln Thr Gly Ile Glu Leu Leu Ser
50 55 60
Gly Lys Glu His Leu Arg Leu Leu Glu Gln Ala Val Thr Val Pro Ser
65 70 75 80
Pro Ala Thr Asp Ala Ile Ala Gln Ala Ala Arg Glu Ala Gly Met Val
85 90 95
Val Ser Ile Gly Val Asn Glu Arg Asp Gly Gly Thr Ile Tyr Asn Thr
100 105 110
Gln Leu Leu Phe Asp Ala Asp Gly Thr Leu Val Gln Arg Arg Arg Lys
115 120 125
Ile Thr Pro Thr His Phe Glu Arg Met Val Trp Gly Gln Gly Asp Gly
130 135 140
Ser Gly Leu Arg Ala Val Asp Thr Lys Val Gly Arg Ile Gly Gln Leu
145 150 155 160
Ala Cys Phe Glu His Asn Asn Pro Leu Ala Arg Tyr Ala Met Met Ala
165 170 175
Asp Gly Glu Gln Ile His Ser Ser Met Tyr Pro Gly Ser Ala Phe Gly
180 185 190
Asp Gly Phe Ala Gln Arg Met Glu Ile Asn Ile Arg Gln His Ala Leu
195 200 205
Glu Ser Gly Cys Phe Val Val Asn Ala Thr Ala Trp Leu Asp Ala Asp
210 215 220
Gln Gln Ala Gln Ile Met Lys Asp Thr Gly Cys Ala Ile Gly Pro Ile
225 230 235 240
Ser Gly Gly Cys Phe Thr Thr Ile Val Thr Pro Asp Gly Met Leu Ile
245 250 255
Gly Glu Pro Leu Arg Glu Gly Glu Gly Glu Ile Ile Ala Asp Leu Asp
260 265 270
Phe Thr Leu Ile Asp Arg Arg Lys Leu Leu Met Asp Ser Val Gly His
275 280 285
Tyr Asn Arg Pro Glu Leu Leu Ser Leu Leu Ile Asp Arg Thr Pro Ala
290 295 300
Ala Asn Phe His Glu Arg Ser Thr His Pro Ala Val Asp Ala Ala Ser
305 310 315 320
Gly Leu Glu Ile Leu
325
<210>165
<211>1008
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>165
atggccaatt tcaaattcaa ggcggcggcg gtgcaggccg cgcccgcttt cctcgatctc 60
gaggctagca tcgccaagtc gatcgccctg atcgaacaag ccgccgccaa cggcgccaag 120
ctgatcgcct ttcccgaagt cttcattccc ggctacccct ggcacatctg gctcgacagt 180
cccgcctggg cgatcgggcg cggcttcgtc tcgcgctatt tcgagaactc gctggactac 240
aacagccccg aggccgagcg cctcaggctc gccgtcaaga aggcgggcct gacggcggtg 300
atcggcctct ccgagcgcga cggcggcagc ctctacatcg cgcaatggat catcggccct 360
gacggcgaga ccgttgcgaa acggcgtaag ctccggccga cccattgcga gcgcacggtc 420
tatggagaag gcgacggcag cgacctcgcg gttcacgacg tatctggcat cggccgtctc 480
ggcgcgctct gctgctggga gcatatccag ccgctgtcga aattcgcgat gtattcgcaa 540
aatgagcaag tgcacgtcgc gtcctggccg agcttctcgc tctacgaccc gttcgcgccg 600
gcgctgggcg ccgaggtcaa caacgcagcc tcgcggatct atgcggtcga aggctcatgc 660
ttcgtcattg cgccctgcgc gaccgtttcg cctgcaatga tcgaggaact gtgcgacgcg 720
ccaaacaaac atgcgcttct gcacgcgggc ggcggcttcg cgcgcatcta tgggccggac 780
ggcgcttcga tcgccgagac gctgccgcca gatcaggaag gcttgatcta cgccgacatc 840
gacctcaccg cgatcggcgt cgccaaggcc gccgccgatc ccgccggcca ttattcgcgc 900
cccgacgtca cgcgcctgct cttcaacaag aagcccgctc ggcgagtcga aacttttgct 960
ttgcccgtcg atgcgccggc gccggagacg cagaccgccg cgagctga 1008
<210>166
<211>335
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>166
Met Ala Asn Phe Lys Phe Lys Ala Ala Ala Val Gln Ala Ala Pro Ala
1 5 10 15
Phe Leu Asp Leu Glu Ala Ser Ile Ala Lys Ser Ile Ala Leu Ile Glu
20 25 30
Gln Ala Ala Ala Asn Gly Ala Lys Leu Ile Ala Phe Pro Glu Val Phe
35 40 45
Ile Pro Gly Tyr Pro Trp His Ile Trp Leu Asp Ser Pro Ala Trp Ala
50 55 60
Ile Gly Arg Gly Phe Val Ser Arg Tyr Phe Glu Asn Ser Leu Asp Tyr
65 70 75 80
Asn Ser Pro Glu Ala Glu Arg Leu Arg Leu Ala Val Lys Lys Ala Gly
85 90 95
Leu Thr Ala Val Ile Gly Leu Ser Glu Arg Asp Gly Gly Ser Leu Tyr
100 105 110
Ile Ala Gln Trp Ile Ile Gly Pro Asp Gly Glu Thr Val Ala Lys Arg
115 120 125
Arg Lys Leu Arg Pro Thr His Cys Glu Arg Thr Val Tyr Gly Glu Gly
130 135 140
Asp Gly Ser Asp Leu Ala Val His Asp Val Ser Gly Ile Gly Arg Leu
145 150 155 160
Gly Ala Leu Cys Cys Trp Glu His Ile Gln Pro Leu Ser Lys Phe Ala
165 170 175
Met Tyr Ser Gln Asn Glu Gln Val His Val Ala Ser Trp Pro Ser Phe
180 185 190
Ser Leu Tyr Asp Pro Phe Ala Pro Ala Leu Gly Ala Glu Val Asn Asn
195 200 205
Ala Ala Ser Arg Ile Tyr Ala Val Glu Gly Ser Cys Phe Val Ile Ala
210 215 220
Pro Cys Ala Thr Val Ser Pro Ala Met Ile Glu Glu Leu Cys Asp Ala
225 230 235 240
Pro Asn Lys His Ala Leu Leu His Ala Gly Gly Gly Phe Ala Arg Ile
245 250 255
Tyr Gly Pro Asp Gly Ala Ser Ile Ala Glu Thr Leu Pro Pro Asp Gln
260 265 270
Glu Gly Leu Ile Tyr Ala Asp Ile Asp Leu Thr Ala Ile Gly Val Ala
275 280 285
Lys Ala Ala Ala Asp Pro Ala Gly His Tyr Ser Arg Pro Asp Val Thr
290 295 300
Arg Leu Leu Phe Asn Lys Lys Pro Ala Arg Arg Val Glu Thr Phe Ala
305 310 315 320
Leu Pro Val Asp Ala Pro Ala Pro Glu Thr Gln Thr Ala Ala Ser
325 330 335
<210>167
<211>1017
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>167
atgggtattg aacatccgaa gtacagggtt gccgtggtgc aggccgcacc ggcctggctc 60
gatcttgacg cgtcgatcga caagtcgatc gcgctgatcg aggaggctgc ccagaaaggc 120
gccaagctga tcgcattccc cgaggccttc atccccggct acccctggca tatctggatg 180
gactcgcccg cctgggcgat tggccgcggt tttgtgcagc gctacttcga caattcgctg 240
gcctatgaca gcccgcaggc cgagaagctg cgcgcggccg tgcgcaaggc aaaactcacg 300
gccgtgatcg gcttgtcgga gcgtgacggc ggcagccttt atctcgcaca atggctgatc 360
ggccccgacg gcgagaccat cgcaaaacgg cgcaagctgc ggccgacaca tgccgagcgc 420
actgtgtacg gcgagggcga cggcagcgac cttgcggtcc acaatcgtcc ggacatcggc 480
aggctcggtg cgctctgctg ctgggagcat cttcagccac tgtcgaaata cgcgatgtac 540
gcgcagaacg agcaggtgca cgtcgcggcc tggccgagct tttcgctcta cgatcccttc 600
gccgtggcgc tcggcgccga ggtgaacaac gcggcctccc gcgtctatgc ggtcgaaggc 660
tcctgcttcg tgctggcgcc gtgcgcgaca gtctcgcaag ccatgatcga cgagctctgc 720
gatcggccgg acaagcacgc gctgctgcat gtcggcggcg gctttgccgc gatctacggg 780
cccgacggca gccagatcgg cgacaagctc gcccccgacc aggagggcct gttgatcgcc 840
gagatcgatc tcggcgccat aggtgtcgcc aagaacgccg cggatcccgc cgggcactat 900
tcgcggcccg acgtgacgcg gctgttgctc aacaagaaac cgtacaagcg cgtcgaacag 960
ttctcgccgc cgtcggaggc ggttgaaccc acggatatcg cggcggcggc aagctga 1017
<210>168
<211>338
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>168
Met Gly Ile Glu His Pro Lys Tyr Arg Val Ala Val Val Gln Ala Ala
1 5 10 15
Pro Ala Trp Leu Asp Leu Asp Ala Set Ile Asp Lys Ser Ile Ala Leu
20 25 30
Ile Glu Glu Ala Ala Gln Lys Gly Ala Lys Leu Ile Ala Phe Pro Glu
35 40 45
Ala Phe Ile Pro Gly Tyr Pro Trp His Ile Trp Met Asp Ser Pro Ala
50 55 60
Trp Ala Ile Gly Arg Gly Phe Val Gln Arg Tyr Phe Asp Asn Ser Leu
65 70 75 80
Ala Tyr Asp Ser Pro Gln Ala Glu Lys Leu Arg Ala Ala Val Arg Lys
85 90 95
Ala Lys Leu Thr Ala Val Ile Gly Leu Ser Glu Arg Asp Gly Gly Ser
100 105 110
Leu Tyr Leu Ala Gln Trp Leu Ile Gly Pro Asp Gly Glu Thr Ile Ala
115 120 125
Lys Arg Arg Lys Leu Arg Pro Thr His Ala Glu Arg Thr Val Tyr Gly
130 135 140
Glu Gly Asp Gly Ser Asp Leu Ala Val His Asn Arg Pro Asp Ile Gly
145 150 155 160
Arg Leu Gly Ala Leu Cys Cys Trp Glu His Leu Gln Pro Leu Ser Lys
165 170 175
Tyr Ala Met Tyr Ala Gln Asn Glu Gln Val His Val Ala Ala Trp Pro
180 185 190
Ser Phe Ser Leu Tyr Asp Pro Phe Ala Val Ala Leu Gly Ala Glu Val
195 200 205
Asn Asn Ala Ala Ser Arg Val Tyr Ala Val Glu Gly Ser Cys Phe Val
210 215 220
Leu Ala Pro Cys Ala Thr Val Ser Gln Ala Met Ile Asp Glu Leu Cys
225 230 235 240
Asp Arg Pro Asp Lys His Ala Leu Leu His Val Gly Gly Gly Phe Ala
245 250 255
Ala Ile Tyr Gly Pro Asp Gly Ser Gln Ile Gly Asp Lys Leu Ala Pro
260 265 270
Asp Gln Glu Gly Leu Leu Ile Ala Glu Ile Asp Leu Gly Ala Ile Gly
275 280 285
Val Ala Lys Asn Ala Ala Asp Pro Ala Gly His Tyr Ser Arg Pro Asp
290 295 300
Val Thr Arg Leu Leu Leu Asn Lys Lys Pro Tyr Lys Arg Val Glu Gln
305 310 315 320
Phe Ser Pro Pro Ser Glu Ala Val Glu Pro Thr Asp Ile Ala Ala Ala
325 330 335
Ala Ser
<210>169
<211>1077
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>169
atggccctca cccatccgaa attgaaagtc gccgccgtgc aggcagctcc cgcgttcctc 60
gatgtcgatg ccgcagtgga caaagcggtg cggctaatcg acgaagcggc agcaaacggc 120
tccagtctgg tggcattccc cgagacctgg atccccggct atccgttttg gatctggctt 180
ggctcgccgg cctgggcaat catgcgcggg tttgtgtctc gctatttcga taattcgctc 240
agctatgaca gccggcaggc agagcgcctg cgcgacgccg cgaagcgcca caaactgacc 300
gtcgtcatgg gcctgtccga gcgcgccggc ggtagccttt acatcgcgca gtggatcatt 360
ggtcccaatg gcgagaccgt cgcacagcgg cgcaagctca agcccaccca tgcggagcgc 420
accgtcttcg gcgagggtga cggcagccac ctggcggtac acaatcttcc aatcggacgg 480
ctcggtgcgc tgtgctgctg ggagcacctc cagccgctct ccaaatacgc gatgtacgcc 540
cagaacgaag agatccacgt ggcggcatgg ccgtccttct cgctctacga cccgtttgcg 600
cacgcgctcg gcgccgaagt caacaacgca gcgagccaga tctacgcggt tgaaggttcc 660
tgctttgtcg tcgcgccatg tgcggtgatc tcgcaggaaa tgatcgatct tatgtgcgat 720
acccccgaca agcatcagct tattcacgtc ggtggcggct tcaccgtgat ctatggcccg 780
gacggtgcgc gcatcggcga caagctcgcg ccagatcagg aaggcattgt ctatgccgac 840
atcgatctcg gcatgatccc gatcgcgaaa gctgccgccg atcctgccgg ccactatgcg 900
cgacccgacg ttacccgcct tctgttcaac aatcgtcccg ccaatcgggt ggaaaccctc 960
gtgctccccg ttgatcaggt ccgtgacatc gatgcacgtg tggaggccgc ggcacctcag 1020
gcgcgaccag caaccgggaa cgaggatccc gccgcaaagc ctatggccgc cgaatga 1077
<210>170
<211>358
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>170
Met Ala Leu Thr His Pro Lys Leu Lys Val Ala Ala Val Gln Ala Ala
1 5 10 15
Pro Ala Phe Leu Asp Val Asp Ala Ala Val Asp Lys Ala Val Arg Leu
20 25 30
Ile Asp Glu Ala Ala Ala Asn Gly Ser Ser Leu Val Ala Phe Pro Glu
35 40 45
Thr Trp Ile Pro Gly Tyr Pro Phe Trp Ile Trp Leu Gly Ser Pro Ala
50 55 60
Trp Ala Ile Met Arg Gly Phe Val Ser Arg Tyr Phe Asp Asn Ser Leu
65 70 75 80
Ser Tyr Asp Ser Arg Gln Ala Glu Arg Leu Arg Asp Ala Ala Lys Arg
85 90 95
His Lys Leu Thr Val Val Met Gly Leu Ser Glu Arg Ala Gly Gly Ser
100 105 110
Leu Tyr Ile Ala Gln Trp Ile Ile Gly Pro Asn Gly Glu Thr Val Ala
115 120 125
Gln Arg Arg Lys Leu Lys Pro Thr His Ala Glu Arg Thr Val Phe Gly
130 135 140
Glu Gly Asp Gly Ser His Leu Ala Val His Asn Leu Pro Ile Gly Arg
145 150 155 160
Leu Gly Ala Leu Cys Cys Trp Glu His Leu Gln Pro Leu Ser Lys Tyr
165 170 175
Ala Met Tyr Ala Gln Asn Glu Glu Ile His Val Ala Ala Trp Pro Ser
180 185 190
Phe Ser Leu Tyr Asp Pro Phe Ala His Ala Leu Gly Ala Glu Val Asn
195 200 205
Asn Ala Ala Ser Gln Ile Tyr Ala Val Glu Gly Ser Cys Phe Val Val
210 215 220
Ala Pro Cys Ala Val Ile Ser Gln Glu Met Ile Asp Leu Met Cys Asp
225 230 235 240
Thr Pro Asp Lys His Gln Leu Ile His Val Gly Gly Gly Phe Thr Val
245 250 255
Ile Tyr Gly Pro Asp Gly Ala Arg Ile Gly Asp Lys Leu Ala Pro Asp
260 265 270
Gln Glu Gly Ile Val Tyr Ala Asp Ile Asp Leu Gly Met Ile Pro Ile
275 280 285
Ala Lys Ala Ala Ala Asp Pro Ala Gly His Tyr Ala Arg Pro Asp Val
290 295 300
Thr Arg Leu Leu Phe Asn Asn Arg Pro Ala Asn Arg Val Glu Thr Leu
305 310 315 320
Val Leu Pro Val Asp Gln Val Arg Asp Ile Asp Ala Arg Val Glu Ala
325 330 335
Ala Ala Pro Gln Ala Arg Pro Ala Thr Gly Asn Glu Asp Pro Ala Ala
340 345 350
Lys Pro Met Ala Ala Glu
355
<210>171
<211>1011
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>171
atgactaagg gaacagtgaa agtcgcggcg gcgcaggtca cccccgtgtt catggatcgt 60
aaagccacga tcgtcaaggc ctgcgacacg atcgccgagg cgggcaagaa cggcgcgcgg 120
ctggtggtgt tcccggagac gtttgttcct ggctacccgg actgggtctg gacggcgacg 180
gctgggacgc atcgcgatat ccaccaggcg atgtacgcgg aactgctgga ccaggctgtc 240
tcgattccga gcccggcgac ggacgccctc tgccgtgctg caaagaaggc gggcgtctac 300
gtcgtcatcg gcgtcaatga gctgagtggg ccgggcggaa gcctgtacaa cacgctgatc 360
tacatcgatg acgaaggcga gatcatgggc cgccaccgca agctggtccc cacgatgggc 420
gagcgcctgg tctgggcacc cggcgacggc agcacgctgg aggcgtacga gacatcgatc 480
ggcaggctgg gcggactgat ctgctgggag aactacatgc cgctggcccg ctacgccatg 540
tacgcctggg gcgtgcagat ctacgtcgcg ccgacgtggg acagctcgga cgggtgggtt 600
ggcagcatgc agcacatcgc ccgcgaaggg cggacggcgg tgatcggctg ctgcatggcg 660
atccgtcgca gcgacatccc ggacaagtac gagttcaaga agctgtaccc gccgagcaag 720
agcaaagacg aagaatgggt gaacgatggc aacagcgtca tcgtcgcacc cggtggacga 780
atactcgccg ggccggtcgc caaagaggag acgatcctct acgccgatct ggacccggca 840
gccgagcgcg gttcaaagtt ctcgttagat gtggcagggc actacgcgcg gccggacgtc 900
ttccagctga cggtgaatcg cggtccggca gaactggtga atgtggccgg tgatatcgca 960
ccggcaacca acggcaaagt caaaacaccg gcgaaattac gccgcaagta a 1011
<210>172
<211>336
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>172
Met Thr Lys Gly Thr Val Lys Val Ala Ala Ala Gln Val Thr Pro Val
1 5 10 15
Phe Met Asp Arg Lys Ala Thr Ile Val Lys Ala Cys Asp Thr Ile Ala
20 25 30
Glu Ala Gly Lys Asn Gly Ala Arg Leu Val Val Phe Pro Glu Thr Phe
35 40 45
Val Pro Gly Tyr Pro Asp Trp Val Trp Thr Ala Thr Ala Gly Thr His
50 55 60
Arg Asp Ile His Gln Ala Met Tyr Ala Glu Leu Leu Asp Gln Ala Val
65 70 75 80
Ser Ile Pro Ser Pro Ala Thr Asp Ala Leu Cys Arg Ala Ala Lys Lys
85 90 95
Ala Gly Val Tyr Val Val Ile Gly Val Asn Glu Leu Ser Gly Pro Gly
100 105 110
Gly Ser Leu Tyr Asn Thr Leu Ile Tyr Ile Asp Asp Glu Gly Glu Ile
115 120 125
Met Gly Arg His Arg Lys Leu Val Pro Thr Met Gly Glu Arg Leu Val
130 135 140
Trp Ala Pro Gly Asp Gly Ser Thr Leu Glu Ala Tyr Glu Thr Ser Ile
145 150 155 160
Gly Arg Leu Gly Gly Leu Ile Cys Trp Glu Asn Tyr Met Pro Leu Ala
165 170 175
Arg Tyr Ala Met Tyr Ala Trp Gly Val Gln Ile Tyr Val Ala Pro Thr
180 185 190
Trp Asp Ser Ser Asp Gly Trp Val Gly Ser Met Gln His Ile Ala Arg
195 200 205
Glu Gly Arg Thr Ala Val Ile Gly Cys Cys Met Ala Ile Arg Arg Ser
210 215 220
Asp Ile Pro Asp Lys Tyr Glu Phe Lys Lys Leu Tyr Pro Pro Ser Lys
225 230 235 240
Ser Lys Asp Glu Glu Trp Val Asn Asp Gly Asn Ser Val Ile Val Ala
245 250 255
Pro Gly Gly Arg Ile Leu Ala Gly Pro Val Ala Lys Glu Glu Thr Ile
260 265 270
Leu Tyr Ala Asp Leu Asp Pro Ala Ala Glu Arg Gly Ser Lys Phe Ser
275 280 285
Leu Asp Val Ala Gly His Tyr Ala Arg Pro Asp Val Phe Gln Leu Thr
290 295 300
Val Asn Arg Gly Pro Ala Glu Leu Val Asn Val Ala Gly Asp Ile Ala
305 310 315 320
Pro Ala Thr Asn Gly Lys Val Lys Thr Pro Ala Lys Leu Arg Arg Lys
325 330 335
<210>173
<211>993
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>173
atgaaagttg ttaaagccgc cgcagtccaa ctgagtcccg tcctctatag ccgcgaggga 60
acggtcgaga gggtcgttcg gaagatccac gagcttggcc ggcagggggt acagttcgcc 120
accttcccgg agaccgtagt gccttactac ccgtactttt ccttcgtcca gacgccctta 180
cagattatag ccggacctga gcatctaaag ctgctcgacc aggcagtgac cgtgccgtcc 240
cctgccaccg acgctatcag cgaggctgcc aggcaggcgg gagttgtggt gtccataggc 300
gtcaacgagc gtgacggcgg aaccctgtac aacacgcagc tgctcttcga tgccgatggc 360
gccttgatcc agcgccgccg caagattacg cccactcatt tcgagcgcat gatctggggc 420
cagggcgacg ggtcgggcct gcgcgctgtc gacagcaagg tcggtcgcat tggccagctc 480
gcatgctggg agcacaacaa ccccctggcg cgctacgcga tgatagccga cggcgagcag 540
atccattcgg caatgtatcc gggctccatg ttcggcgacc cgtttgccca gaagacggaa 600
atcaatatcc ggcagcatgc attggagtct gcgtgcttcg tcgtgtgcgc cacggcctgg 660
ctggacgccg atcagcaggc gcaaatctgc aaggacactg gctgcgacat cggcccgatc 720
tccggcggtt gcttcaccgc gatcgtggcg cctgatggaa ccttgctggg cgagcccatc 780
cgctcgggcg aaggcatggt catcgtcgac ctcgacttca cgctcatcga caagcgcaag 840
caggtgatgg actcgcgcgg ccactacaac cggccggaat tgctcagtct cctgatcgac 900
cgcacaccca ctgcgcatgt tcacgaccgc gctgtgcgcc ccgagtcagc cgcggagcaa 960
cgttcggagg aacttctcgc tacggctgtc taa 993
<210>174
<211>330
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>174
Met Lys Val Val Lys Ala Ala Ala Val Gln Leu Ser Pro Val Leu Tyr
1 5 10 15
Ser Arg Glu Gly Thr Val Glu Arg Val Val Arg Lys Ile His Glu Leu
20 25 30
Gly Arg Gln Gly Val Gln Phe Ala Thr Phe Pro Glu Thr Val Val Pro
35 40 45
Tyr Tyr Pro Tyr Phe Ser Phe Val Gln Thr Pro Leu Gln Ile Ile Ala
50 55 60
Gly Pro Glu His Leu Lys Leu Leu Asp Gln Ala Val Thr Val Pro Ser
65 70 75 80
Pro Ala Thr Asp Ala Ile Ser Glu Ala Ala Arg Gln Ala Gly Val Val
85 90 95
Val Ser Ile Gly Val Asn Glu Arg Asp Gly Gly Thr Leu Tyr Asn Thr
100 105 110
Gln Leu Leu Phe Asp Ala Asp Gly Ala Leu Ile Gln Arg Arg Arg Lys
115 120 125
Ile Thr Pro Thr His Phe Glu Arg Met Ile Trp Gly Gln Gly Asp Gly
130 135 140
Ser Gly Leu Arg Ala Val Asp Ser Lys Val Gly Arg Ile Gly Gln Leu
145 150 155 160
Ala Cys Trp Glu His Asn Asn Pro Leu Ala Arg Tyr Ala Met Ile Ala
165 170 175
Asp Gly Glu Gln Ile His Ser Ala Met Tyr Pro Gly Ser Met Phe Gly
180 185 190
Asp Pro Phe Ala Gln Lys Thr Glu Ile Asn Ile Arg Gln His Ala Leu
195 200 205
Glu Ser Ala Cys Phe Val Val Cys Ala Thr Ala Trp Leu Asp Ala Asp
210 215 220
Gln Gln Ala Gln Ile Cys Lys Asp Thr Gly Cys Asp Ile Gly Pro Ile
225 230 235 240
Ser Gly Gly Cys Phe Thr Ala Ile Val Ala Pro Asp Gly Thr Leu Leu
245 250 255
Gly Glu Pro Ile Arg Ser Gly Glu Gly Met Val Ile Val Asp Leu Asp
260 265 270
Phe Thr Leu Ile Asp Lys Arg Lys Gln Val Met Asp Ser Arg Gly His
275 280 285
Tyr Asn Arg Pro Glu Leu Leu Ser Leu Leu Ile Asp Arg Thr Pro Thr
290 295 300
Ala His Val His Asp Arg Ala Val Arg Pro Glu Ser Ala Ala Glu Gln
305 310 315 320
Arg Ser Glu Glu Leu Leu Ala Thr Ala Val
325 330
<210>175
<211>945
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>175
atgaccacct acgggcagtt tagacttgcc gcgatcaatg ccgcgcctgt ctacttcgac 60
agggaagcat ccaccgaaaa agcttgccgg ctcattctcg aagcgggggc atcaggggcg 120
acgctggcgg cgtttggcga gacgtggctt cccggctatc cattccacat ctggagggaa 180
gttccgactg ctctccgagt ggaatacatc gctaatgccg tcgagattcc cagcccaacg 240
accgaccgat tgtgcgcggc ggctcgtcag gcgaacatcg atgttgtgat cggcgttgtc 300
gaactggatg cgcagacaca cgggacggtc tactgtacgc tcttgttcat tggcagcgat 360
ggctcaattc tgggacgtca tcgaaagatt aaaccgactt tcgtggagcg aaccgcatgg 420
ggggaaggtg acggcagcag cctgatcgtc tacgagcgcc cgtatggcaa gatcagtggt 480
ctgtgttgct gggaacacaa tatggttctg ccgggctacg cgctgatggc gcaggggacg 540
cagattcata tcgccgcatg gcccggctgg gaaagcactc gccatctgct cttatcaaga 600
gcattcgctt ctcaggcagc ggcgtatgtg attgatgtag gcgctatcgt caatcgtgac 660
gaccttcggg aagattacca ggctttgatt gctggaagct actggggcgg aagttgcatc 720
atcaacccag aaggcgaggt catcgctggt ccagcgaaat cggagaccat tctggttgca 780
gattgctcaa ccgagcagat ctttagctca aaagtgctct gtgatgtggg cgggcattat 840
tctcgcccgg atatttttca gctccatgtc aatcgaaagc catatcaacg tatcgtcgag 900
acgaacaacc cacaccccgc tccgattgag ttcgattacc gttga 945
<210>176
<211>314
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>176
Met Thr Thr Tyr Gly Gln Phe Arg Leu Ala Ala Ile Asn Ala Ala Pro
1 5 10 15
Val Tyr Phe Asp Arg Glu Ala Ser Thr Glu Lys Ala Cys Arg Leu Ile
20 25 30
Leu Glu Ala Gly Ala Ser Gly Ala Thr Leu Ala Ala Phe Gly Glu Thr
35 40 45
Trp Leu Pro Gly Tyr Pro Phe His Ile Trp Arg Glu Val Pro Thr Ala
50 55 60
Leu Arg Val Glu Tyr Ile Ala Asn Ala Val Glu Ile Pro Ser Pro Thr
65 70 75 80
Thr Asp Arg Leu Cys Ala Ala Ala Arg Gln Ala Asn Ile Asp Val Val
85 90 95
Ile Gly Val Val Glu Leu Asp Ala Gln Thr His Gly Thr Val Tyr Cys
100 105 110
Thr Leu Leu Phe Ile Gly Ser Asp Gly Ser Ile Leu Gly Arg His Arg
115 120 125
Lys Ile Lys Pro Thr Phe Val Glu Arg Thr Ala Trp Gly Glu Gly Asp
130 135 140
Gly Ser Ser Leu Ile Val Tyr Glu Arg Pro Tyr Gly Lys Ile Ser Gly
145 150 155 160
Leu Cys Cys Trp Glu His Asn Met Val Leu Pro Gly Tyr Ala Leu Met
165 170 175
Ala Gln Gly Thr Gln Ile His Ile Ala Ala Trp Pro Gly Trp Glu Ser
180 185 190
Thr Arg His Leu Leu Leu Ser Arg Ala Phe Ala Ser Gln Ala Ala Ala
195 200 205
Tyr Val Ile Asp Val Gly Ala Ile Val Asn Arg Asp Asp Leu Arg Glu
210 215 220
Asp Tyr Gln Ala Leu Ile Ala Gly Ser Tyr Trp Gly Gly Ser Cys Ile
225 230 235 240
Ile Asn Pro Glu Gly Glu Val Ile Ala Gly Pro Ala Lys Ser Glu Thr
245 250 255
Ile Leu Val Ala Asp Cys Ser Thr Glu Gln Ile Phe Ser Ser Lys Val
260 265 270
Leu Cys Asp Val Gly Gly His Tyr Ser Arg Pro Asp Ile Phe Gln Leu
275 280 285
His Val Asn Arg Lys Pro Tyr Gln Arg Ile Val Glu Thr Asn Asn Pro
290 295 300
His Pro Ala Pro Ile Glu Phe Asp Tyr Arg
305 310
<210>177
<211>948
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>177
atggactcac ttaccgttgg tctcgcgcaa atcgcgcccg tttggttgaa tcgggcgggg 60
acgttgtcaa agatgttgga acaggttcga gcggcgaaag aggcgggctg tcagcttgtt 120
gtgtttggtg aggcgttgct ccccggttat ccattttgga tcgaactgac gaacggcgca 180
gtcttcaatt cgccgatgca aaaggaaatc cacgcgcact acatggatca agctgtgcag 240
atcgaagcag ggcatcttga tccattgtgc ggcgcggcaa aagcgcacgg catcaccgtg 300
gtcgcgggca tcatcgagcg tccgttggat cgcggcggac atagtttata tgcgagtctg 360
gtgtatatcg atttgaacgg tgtcatccaa tcggtgcatc gcaaactgat gcccacctat 420
gaagaacgac tcacctggtc gcctggcgat ggtcatgggt tacgcgtgca tacactgggc 480
gcttttacgg ttggcaaact caattgttgg gaaaactgga tgccgctgcc gcgcgcggct 540
ctgtatgcgc aaggcgaaga tctgcacgtt gctgtctggc ccgggtccgt gcgcaacaca 600
caggatatta cgcgctttat cgcaatggag tcgcgatcgt ttgtcgtttc ggtttcgagt 660
ttgatgcgca agagtgactt cccacaagat acgcctcatc tctccgccat tcttgaatct 720
gcacccgatc cactcgccaa cggaggttcg tgtctggctg gacctgacgg taaatggatc 780
gttgaaccgg ttgcggatga agagaagttg atcgtcgcca ccattgacca tgcccgtgta 840
cgtgaagaac gccagaactt tgatccatcc ggtcattaca gccgaccaga tgtgacacaa 900
ttgagagtca accgccagcg acaaagcgtt atcgcttttg atgagtag 948
<210>178
<211>315
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>178
Met Asp Ser Leu Thr Val Gly Leu Ala Gln Ile Ala Pro Val Trp Leu
1 5 10 15
Asn Arg Ala Gly Thr Leu Ser Lys Met Leu Glu Gln Val Arg Ala Ala
20 25 30
Lys Glu Ala Gly Cys Gln Leu Val Val Phe Gly Glu Ala Leu Leu Pro
35 40 45
Gly Tyr Pro Phe Trp Ile Glu Leu Thr Asn Gly Ala Val Phe Asn Ser
50 55 60
Pro Met Gln Lys Glu Ile His Ala His Tyr Met Asp Gln Ala Val Gln
65 70 75 80
Ile Glu Ala Gly His Leu Asp Pro Leu Cys Gly Ala Ala Lys Ala His
85 90 95
Gly Ile Thr Val Val Ala Gly Ile Ile Glu Arg Pro Leu Asp Arg Gly
100 105 110
Gly His Ser Leu Tyr Ala Ser Leu Val Tyr Ile Asp Leu Asn Gly Val
115 120 125
Ile Gln Ser Val His Arg Lys Leu Met Pro Thr Tyr Glu Glu Arg Leu
130 135 140
Thr Trp Ser Pro Gly Asp Gly His Gly Leu Arg Val His Thr Leu Gly
145 150 155 160
Ala Phe Thr Val Gly Lys Leu Asn Cys Trp Glu Asn Trp Met Pro Leu
165 170 175
Pro Arg Ala Ala Leu Tyr Ala Gln Gly Glu Asp Leu His Val Ala Val
180 185 190
Trp Pro Gly Ser Val Arg Asn Thr Gln Asp Ile Thr Arg Phe Ile Ala
195 200 205
Met Glu Ser Arg Ser Phe Val Val Ser Val Ser Ser Leu Met Arg Lys
2l0 215 220
Ser Asp Phe Pro Gln Asp Thr Pro His Leu Ser Ala Ile Leu Glu Ser
225 230 235 240
Ala Pro Asp Pro Leu Ala Asn Gly Gly Ser Cys Leu Ala Gly Pro Asp
245 250 255
Gly Lys Trp Ile Val Glu Pro Val Ala Asp Glu Glu Lys Leu Ile Val
260 265 270
Ala Thr Ile Asp His Ala Arg Val Arg Glu Glu Arg Gln Asn Phe Asp
275 280 285
Pro Ser Gly His Tyr Ser Arg Pro Asp Val Thr Gln Leu Arg Val Asn
290 295 300
Arg Gln Arg Gln Ser Val Ile Ala Phe Asp Glu
305 310 315
<210>179
<211>915
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>179
atgaccacca aagcagccat cattcaggcg cgccccatat actacgatct ggcggcgtgt 60
gtcgataagg cgcttgccct catcaccgag gcggcggcac gcggcgcgaa catcgtcacg 120
ctcggcgaga cgtggctgcc gggctatccc gcgtggctgg atgtgtgcgt cgagatgggg 180
ctgtgggatc acgcgccgac caaagccgtc tttcagcggc tccatgccaa cagcgtcacc 240
atccccggcg cggagatcag ccagttctgc gacatcgccc gccgccttag catcgtgctg 300
gtgctcagcg tcaacgagcg cgtccgcaac accttgttca acaccctgct cacgattgac 360
gagcgcggcg acatccgcaa ccaccaccgc aagctgatgc cgacctacac tgagcgcatc 420
gtctgggggc agggcgacgg cgcgggctta caggcggtcg agacggcaac cgggcgcgtc 480
ggcgggctga tctgctggga acactggatg ccgctggcac ggcaggcgct gcacaacgcc 540
ggggagcaaa ttcacgtttc ggtcttcccg accgtcaacg acccgcgcca ccaagtcgcc 600
agccgccagt acgctttcga ggggcgctgc ttcgtgctga ccgccggcag catccagcgc 660
gccgacgacc taccgccgga actgaccgtc aaggcgggca tcgcgccgga tgatctggtg 720
cagggcggcg gcagcgccat catcgcgccg gacatgcgct acctcgccgg accctgcttc 780
gacgaggaaa ccatcctcta cgccgacctc gacctgagcg agacgatccg cgagagcatg 840
acgctggacg tgagcgggca ttactcgcgc cccgacgtgt tcaccttcga ggttaatcgg 900
cagcggaaaa tttag 915
<210>180
<211>304
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>180
Met Thr Thr Lys Ala Ala Ile Ile Gln Ala Arg Pro Ile Tyr Tyr Asp
1 5 10 15
Leu Ala Ala Cys Val Asp Lys Ala Leu Ala Leu Ile Thr Glu Ala Ala
20 25 30
Ala Arg Gly Ala Asn Ile Val Thr Leu Gly Glu Thr Trp Leu Pro Gly
35 40 45
Tyr Pro Ala Trp Leu Asp Val Cys Val Glu Met Gly Leu Trp Asp His
50 55 60
Ala Pro Thr Lys Ala Val Phe Gln Arg Leu His Ala Asn Ser Val Thr
65 70 75 80
Ile Pro Gly Ala Glu Ile Ser Gln Phe Cys Asp Ile Ala Arg Arg Leu
85 90 95
Ser Ile Val Leu Val Leu Ser Val Asn Glu Arg Val Arg Asn Thr Leu
100 105 110
Phe Asn Thr Leu Leu Thr Ile Asp Glu Arg Gly Asp Ile Arg Asn His
115 120 125
His Arg Lys Leu Met Pro Thr Tyr Thr Glu Arg Ile Val Trp Gly Gln
130 135 140
Gly Asp Gly Ala Gly Leu Gln Ala Val Glu Thr Ala Thr Gly Arg Val
145 150 155 160
Gly Gly Leu Ile Cys Trp Glu His Trp Met Pro Leu Ala Arg Gln Ala
165 170 175
Leu His Asn Ala Gly Glu Gln Ile His Val Ser Val Phe Pro Thr Val
180 185 190
Asn Asp Pro Arg His Gln Val Ala Ser Arg Gln Tyr Ala Phe Glu Gly
195 200 205
Arg Cys Phe Val Leu Thr Ala Gly Ser Ile Gln Arg Ala Asp Asp Leu
210 215 220
Pro Pro Glu Leu Thr Val Lys Ala Gly Ile Ala Pro Asp Asp Leu Val
225 230 235 240
Gln Gly Gly Gly Ser Ala Ile Ile Ala Pro Asp Met Arg Tyr Leu Ala
245 250 255
Gly Pro Cys Phe Asp Glu Glu Thr Ile Leu Tyr Ala Asp Leu Asp Leu
260 265 270
SerGlu Thr Ile Arg Glu Ser Met Thr Leu Asp Val Ser Gly His Tyr
275 280 285
Ser Arg Pro Asp Val Phe Thr Phe Glu Val Asn Arg Gln Arg Lys Ile
290 295 300
<210>181
<211>990
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>181
atgcccaaga ccgtacgcgc agccgcagtc cagatcgcgc ccgacctgac gtcacgcgct 60
ggcaccgtcg agcgggtcct gaatgcgatt gccgaagcct ctgacaaagg cgcggagctg 120
attgtatttc cagaaacctt tgtgccttgg tatccgtatt tcagcttcgt tctgccgcct 180
gtccagcagg gacccgagca cctgcggctt tatgaagaag cggtgacggt accatcagca 240
gaaacacggg ccgtcgccga cgccgcgcgc aaacgcaatg cggtcatcgt ccttggcgtc 300
aatgagcgcg accacggctc gctctacaac acccagctga tcttcgacgc ggatggcagc 360
ctgaaactca agcgccgcaa gatcacgccc acctatcacg agcggatgat ctggggacag 420
ggcgatggcg ccggtctaaa agtggtcgaa actgccatcg gccgcatggg cgcattggcg 480
tgctgggagc actacaaccc cctcgcccga tacgcgctga tggctcagca tgaggaaatt 540
cacgcctctc attttccggg ctcactggtc ggcccgatat tcggcgagca gatcgaagtc 600
acgatgcgcc accacgcgtt ggaatcgggc tgtttcgtgg tcaatgccac cggctggtta 660
agcgaggagc agatcgcgtc cattcaccca gatcccagcc tgcagaaggg tcttcgagat 720
ggctgcatga cctgcatcat aaccccggaa ggccgccacg tcgttcctcc tctgacatcg 780
ggtgaaggaa tcctgattgg cgacctggac atgcggctca tcaccaagcg caagcgaatg 840
atggattccg tcggacacta tgcacgtcct gagctgctgc accttgtcca tgacacgacg 900
cccgcacgcg cacgcgagca ggtgggcctt tcaggcgatt tttccgatgc agggcaagac 960
aagctatttg aggaggttca agatgcgtga 990
<210>182
<211>329
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>182
Met Pro Lys Thr Val Arg Ala Ala Ala Val Gln Ile Ala Pro Asp Leu
1 5 10 15
ThrSer Arg Ala Gly Thr Val Glu Arg Val Leu Asn Ala Ile Ala Glu
20 25 30
Ala Ser Asp Lys Gly Ala Glu Leu Ile Val Phe Pro Glu Thr Phe Val
35 40 45
Pro Trp Tyr Pro Tyr Phe Ser Phe Val Leu Pro Pro Val Gln Gln Gly
50 55 60
Pro Glu His Leu Arg Leu Tyr Glu Glu Ala Val Thr Val Pro Ser Ala
65 70 75 80
Glu Thr Arg Ala Val Ala Asp Ala Ala Arg Lys Arg Asn Ala Val Ile
85 90 95
Val Leu Gly Val Asn Glu Arg Asp His Gly Ser Leu Tyr Asn Thr Gln
100 105 110
Leu Ile Phe Asp Ala Asp Gly Ser Leu Lys Leu Lys Arg Arg Lys Ile
115 120 125
Thr Pro Thr Tyr His Glu Arg Met Ile Trp Gly Gln Gly Asp Gly Ala
130 135 140
Gly Leu Lys Val Val Glu Thr Ala Ile Gly Arg Met Gly Ala Leu Ala
145 150 155 160
Cys Trp Glu His Tyr Asn Pro Leu Ala Arg Tyr Ala Leu Met Ala Gln
165 170 175
His Glu Glu Ile His Ala Ser His Phe Pro Gly Ser Leu Val Gly Pro
180 185 190
Ile Phe Gly Glu Gln Ile Glu Val Thr Met Arg His His Ala Leu Glu
195 200 205
Ser Gly Cys Phe Val Val Asn Ala Thr Gly Trp Leu Ser Glu Glu Gln
210 215 220
Ile Ala Ser Ile His Pro Asp Pro Ser Leu Gln Lys Gly Leu Arg Asp
225 230 235 240
Gly Cys Met Thr Cys Ile Ile Thr Pro Glu Gly Arg His Val Val Pro
245 250 255
Pro Leu Thr Ser Gly Glu Gly Ile Leu Ile Gly Asp Leu Asp Met Arg
260 265 270
Leu Ile Thr Lys Arg Lys Arg Met Met Asp Ser Val Gly His Tyr Ala
275 280 285
Arg Pro Glu Leu Leu His Leu Val His Asp Thr Thr Pro Ala Arg Ala
290 295 300
Arg Glu Gln Val Gly Leu Ser Gly Asp Phe Ser Asp Ala Gly Gln Asp
305 310 315 320
Lys Leu Phe Glu Glu Val Gln Asp Ala
325
<210>183
<211>1002
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>183
atggctgagt cacgcattat tcgtgccgcc gccgcccaga ttgcgccaga tctccatgag 60
gccagcaaaa cgctggcccg ggtgctggac gcgatcgatc aggcagccgc acagggggca 120
gagatcatcg tctttcccga gacctttgtg ccttattacc cctacttctc gtttatcacg 180
cccgcgatga ccgccggagc ggcccatctg aaattgtatg accaggcggt ggtggtgccc 240
ggcccgatca cccatgcggt gggcgaacgc gcccgcctgc gcaacatcgt cgtggtgctg 300
ggggtgaatg aacgtgacca cggcacgctc tacaacaccc aactggtatt tgatgccagc 360
ggggaactgg tgctgaaacg ccgcaaaatc accccgacct atcacgaacg gatgatctgg 420
ggacagggag acggtgccgg attaaaggtg gtggactcgg cggttgggcg catcggggct 480
ttagcctgct gggagcacta caacccactg gcgcgctaca gcctgatgac tcagcacgag 540
gagatccatt gcagccagtt ccctggttca ctggtggggc cgatttttgc cgagcagatg 600
gacgtcacca ttcgccatca tgcactggag tccggttgct ttgtcatcaa tgccaccggc 660
tggctgaccg aggagcagat caacgagctg accagcgacc cggcgttaca aaaggggctg 720
cgtggtggct gcaacaccgc catcatctcg ccggaaggcc gccatctggt gccgccactg 780
accgaaggtg aggggatttt gattgccgat ctggacatgg ccctgatcac caaacgcaaa 840
cgcatgatgg attctgtcgg ccactatgcc cgaccggaat tactcagcct gcgcctcgat 900
gcgacgcctg cccgttatgt ggtggcgcgt gataatgagt ccgaaaccgg aggaggcaac 960
gatgcagaac gtaccgtcta cgcgccagca gctgatcact ga 1002
<210>184
<211>333
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>184
Met Ala Glu Ser Arg Ile Ile Arg Ala Ala Ala Ala Gln Ile Ala Pro
1 5 10 15
Asp Leu His Glu Ala Ser Lys Thr Leu Ala Arg Val Leu Asp Ala Ile
20 25 30
Asp Gln Ala Ala Ala Gln Gly Ala Glu Ile Ile Val Phe Pro Glu Thr
35 40 45
Phe Val Pro Tyr Tyr Pro Tyr Phe Ser Phe Ile Thr Pro Ala Met Thr
50 55 60
Ala Gly Ala Ala His Leu Lys Leu Tyr Asp Gln Ala Val Val Val Pro
65 70 75 80
Gly Pro Ile Thr His Ala Val Gly Glu Arg Ala Arg Leu Arg Asn Ile
85 90 95
Val Val Val Leu Gly Val Asn Glu Arg Asp His Gly Thr Leu Tyr Asn
100 105 110
Thr Gln Leu Val Phe Asp Ala Ser Gly Glu Leu Val Leu Lys Arg Arg
115 120 125
Lys Ile Thr Pro Thr Tyr His Glu Arg Met Ile Trp Gly Gln Gly Asp
130 135 140
Gly Ala Gly Leu Lys Val Val Asp Ser Ala Val Gly Arg Ile Gly Ala
145 150 155 160
Leu Ala Cys Trp Glu His Tyr Asn Pro Leu Ala Arg Tyr Ser Leu Met
165 170 175
Thr Gln His Glu Glu Ile His Cys Ser Gln Phe Pro Gly Ser Leu Val
180 185 190
Gly Pro Ile Phe Ala Glu Gln Met Asp Val Thr Ile Arg His His Ala
195 200 205
Leu Glu Ser Gly Cys Phe Val Ile Asn Ala Thr Gly Trp Leu Thr Glu
210 215 220
Glu Gln Ile Asn Glu Leu Thr Ser Asp Pro Ala Leu Gln Lys Gly Leu
225 230 235 240
Arg Gly Gly Cys Asn Thr Ala Ile Ile Ser Pro Glu Gly Arg His Leu
245 250 255
Val Pro Pro Leu Thr Glu Gly Glu Gly Ile Leu Ile Ala Asp Leu Asp
260 265 270
Met Ala Leu Ile Thr Lys Arg Lys Arg Met Met Asp Ser Val Gly His
275 280 285
Tyr Ala Arg Pro Glu Leu Leu Ser Leu Arg Leu Asp Ala Thr Pro Ala
290 295 300
Arg Tyr Val Val Ala Arg Asp Asn Glu Ser Glu Thr Gly Gly Gly Asn
305 310 315 320
Asp Ala Glu Arg Thr Val Tyr Ala Pro Ala Ala Asp His
325 330
<210>185
<211>1017
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>185
atgggcattg aacatccgaa atacaaggtc gcggtggtgc aggcggcccc cgcctggctc 60
gatctcgacg gctcggtcga taagtcgatc gcgctgatca aggaggcggc cgagaagggg 120
gcgaagctga tcgcctttcc cgaggccttc atccccggtt acccctggca tatctggatg 180
gactcgccgg cctgggcgat cggccgcggc ttcgtgcagc gttatttcga caattcgctg 240
tcctatgaca gtccgcaggc cgagcggctg cgcgatgcgg tgaagaaggc gaagctcacc 300
gccgtgttcg gactgtccga gcgcgacggc ggcagcctct acctcgcgca atggctgatc 360
gggcccgatg gcgagaccat cgccaagcgc cgcaagctgc ggccgaccca cgccgaacgt 420
accgtctatg gcgaaggcga cggcagcgat cttgccgtgc atgcgcgcgc cgacatcggc 480
cggatcggcg cgctctgctg ctgggagcat ctgcagccac tgtcgaaata cgcgatgtac 540
gcccagaacg aacaggtcca tgtcgcagcc tggcccagct tctcgctgta cgaccccttc 600
gcgccggcgt taggggccga ggtcaacaac gcggcctccc gcgtctatgc ggtggaaggc 660
tcctgcttcg tgctcgcgcc gtgcgcgacg gtgtcgcagg cgatgatcga cgagctctgc 720
gaccggcccg acaagaacgc gctgctgcac gtcggcggcg gctttgccgc gatctatggc 780
cccgacggca gccagatcgg cgacaagctg gcgccggacc aggaggggct gctgatcgcc 840
gagatcgacc ttggcgccat cggtgtcgcc aagaacgccg ccgatcccgc cgggcactat 900
tcgcgtcccg acgtgacgcg gttgctgctc aacaagaagc gataccagcg cgtcgagcag 960
ttcgcgctgc cggtcgacac cgtcgagccg gcggatatcg gcgcagcggc gagctga 1017
<210>186
<211>338
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>186
Met Gly Ile Glu His Pro Lys Tyr Lys Val Ala Val Val Gln Ala Ala
1 5 10 15
Pro Ala Trp Leu Asp Leu Asp Gly Ser Val Asp Lys Ser Ile Ala Leu
20 25 30
Ile Lys Glu Ala Ala Glu Lys Gly Ala Lys Leu Ile Ala Phe Pro Glu
35 40 45
Ala Phe Ile Pro Gly Tyr Pro Trp His Ile Trp Met Asp Ser Pro Ala
50 55 60
Trp Ala Ile Gly Arg Gly Phe Val Gln Arg Tyr Phe Asp Asn Ser Leu
65 70 75 80
Ser Tyr Asp Ser Pro Gln Ala Glu Arg Leu Arg Asp Ala Val Lys Lys
85 90 95
Ala Lys Leu Thr Ala Val Phe Gly Leu Ser Glu Arg Asp Gly Gly Ser
100 105 110
Leu Tyr Leu Ala Gln Trp Leu Ile Gly Pro Asp Gly Glu Thr Ile Ala
115 120 125
Lys Arg Arg Lys Leu Arg Pro Thr His Ala Glu Arg Thr Val Tyr Gly
130 135 140
Glu Gly Asp Gly Ser Asp Leu Ala Val His Ala Arg Ala Asp Ile Gly
145 150 155 160
Arg Ile Gly Ala Leu Cys Cys Trp Glu His Leu Gln Pro Leu Ser Lys
165 170 175
Tyr Ala Met Tyr Ala Gln Asn Glu Gln Val His Val Ala Ala Trp Pro
180 185 190
Ser Phe Ser Leu Tyr Asp Pro Phe Ala Pro Ala Leu Gly Ala Glu Val
195 200 205
Asn Asn Ala Ala Ser Arg Val Tyr Ala Val Glu Gly Ser Cys Phe Val
210 215 220
Leu Ala Pro Cys Ala Thr Val Ser Gln Ala Met Ile Asp Glu Leu Cys
225 230 235 240
Asp Arg Pro Asp Lys Asn Ala Leu Leu His Val Gly Gly Gly Phe Ala
245 250 255
Ala Ile Tyr Gly Pro Asp Gly Ser Gln Ile Gly Asp Lys Leu Ala Pro
260 265 270
Asp Gln Glu Gly Leu Leu Ile Ala Glu Ile Asp Leu Gly Ala Ile Gly
275 280 285
Val Ala Lys Asn Ala Ala Asp Pro Ala Gly His Tyr Ser Arg Pro Asp
290 295 300
Val Thr Arg Leu Leu Leu Asn Lys Lys Arg Tyr Gln Arg Val Glu Gln
305 310 315 320
Phe Ala Leu Pro Val Asp Thr Val Glu Pro Ala Asp Ile Gly Ala Ala
325 330 335
Ala Ser
<210>187
<211>1059
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>187
atgggcatca atcatcccaa gtacaaagtg gccgtcgtgc aagcggcacc tgtctggctc 60
gacctggatg gaacagtcga caagtgcatt cggctgatag gcgaggccgc tgagaaggga 120
tgcaagctca ttgcatttcc cgagacgttc atcccggggt acccctggca catctggatg 180
ggagctccgg cctggacgat cgggcgcgga ttcgtgcagc gatacttcga caattcgctt 240
gcgtacgaca gtccgcaggc aaacaagctt cgcgccgcgg tgaagcgcgc cggagtgacg 300
gcagttctcg gcttgtcgga gcgccgcgga ggctccctgt acatcgccca gtggctcatc 360
ggacctgatg gcgagaccat cgctcaacgg cgaaagctgc gccccaccca tgcggagcgc 420
accgtcttcg gcgagggcga tggcagcgat ttggcggtgc acagccgccc cgacatcggc 480
cgactgggtg ccctttgctg ctgggaacat ctccagcctt tgaccaagta cgcgatgtac 540
gcgcaagacg agcaagtgca cgtcgctgca tggccgagct tctcgatgta cgagcctttc 600
gcgcacgccc tggggtggga gacgaacaac gcggtgagca aggtgtacgc ggtcgaaggt 660
tcgtgctacg tcctggcccc ctgcgccatc atctctcagg cgatggtgga cgaactcgtc 720
gacagcgagg acaagaagcc gctggttcat gccggcgggg ggcatgcggt gatctatggt 780
cccgatggca ccctgcttac tcccaagctt gcagaagacg aggagggcct actgatcgcg 840
gagatcgatc tgggggcaat cggggtcgcc aagaacgcgg cagaccccgc cggccactac 900
tcgcggcccg atgtcacccg cctgctcttc aacaaccggc cggccaagcg cgtggagacg 960
atgctgctcc cggtcgacgc ggcagaagtc gtggagccgg cggacggagc gctcaatgcg 1020
tccgagggac gccagcgaca gttcaagctg cccgcctag 1059
<210>188
<211>352
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>188
Met Gly Ile Asn His Pro Lys Tyr Lys Val Ala Val Val Gln Ala Ala
1 5 10 15
Pro Val Trp Leu Asp Leu Asp Gly Thr Val Asp Lys Cys Ile Arg Leu
20 25 30
Ile Gly Glu Ala Ala Glu Lys Gly Cys Lys Leu Ile Ala Phe Pro Glu
35 40 45
Thr Phe Ile Pro Gly Tyr Pro Trp His Ile Trp Met Gly Ala Pro Ala
50 55 60
Trp Thr Ile Gly Arg Gly Phe Val Gln Arg Tyr Phe Asp Asn Ser Leu
65 70 75 80
Ala Tyr Asp Ser Pro Gln Ala Asn Lys Leu Arg Ala Ala Val Lys Arg
85 90 95
Ala Gly Val Thr Ala Val Leu Gly Leu Ser Glu Arg Arg Gly Gly Ser
100 105 110
Leu Tyr Ile Ala Gln Trp Leu Ile Gly Pro Asp Gly Glu Thr Ile Ala
115 120 125
Gln Arg Arg Lys Leu Arg Pro Thr His Ala Glu Arg Thr Val Phe Gly
130 135 140
Glu Gly Asp Gly Ser Asp Leu Ala Val His Ser Arg Pro Asp Ile Gly
145 150 155 160
Arg Leu Gly Ala Leu Cys Cys Trp Glu His Leu Gln Pro Leu Thr Lys
165 170 175
Tyr Ala Met Tyr Ala Gln Asp Glu Gln Val His Val Ala Ala Trp Pro
180 185 190
Ser Phe Ser Met Tyr Glu Pro Phe Ala His Ala Leu Gly Trp Glu Thr
195 200 205
Asn Asn Ala Val Ser Lys Val Tyr Ala Val Glu Gly Ser Cys Tyr Val
210 215 220
Leu Ala Pro Cys Ala Ile Ile Ser Gln Ala Met Val Asp Glu Leu Val
225 230 235 240
Asp Ser Glu Asp Lys Lys Pro Leu Val His Ala Gly Gly Gly His Ala
245 250 255
Val Ile Tyr Gly Pro Asp Gly Thr Leu Leu Thr Pro Lys Leu Ala Glu
260 265 270
Asp Glu Glu Gly Leu Leu Ile Ala Glu Ile Asp Leu Gly Ala Ile Gly
275 280 285
Val Ala Lys Asn Ala Ala Asp Pro Ala Gly His Tyr Ser Arg Pro Asp
290 295 300
Val Thr Arg Leu Leu Phe Asn Asn Arg Pro Ala Lys Arg Val Glu Thr
305 310 315 320
Met Leu Leu Pro Val Asp Ala Ala Glu Val Val Glu Pro Ala Asp Gly
325 330 335
Ala Leu Asn Ala Ser Glu Gly Arg Gln Arg Gln Phe Lys Leu Pro Ala
340 345 350
<210>189
<211>1005
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>189
atgcaagaca cgaaattcaa agttgcagtc gtccaggccg cgccggtatt catggatgcg 60
ccagcctccg tggccaaggc gatcggtttc atccaggagg cgggcgcagc cggggcgaag 120
ctgctggcgt tcccggaggt ctggattccg ggctaccctt ggtggctttg gctcgggacg 180
ccggcgtggg gaatgcagtt tgtgccgcgc tatcacgcca attcgctgcg tgctgatgga 240
cccgaaatcc tcgctctttg tgcggccgcc gccgaagcga agatcaacgt cgtgatgggc 300
ttctccgaaa tcgacggagg aacgctctac ctaagtcagg ttttcatcag cgatgcgggc 360
aagatcatct tcaagcgccg aaagctcaag ccgacccacg tcgaacgtac gctatttggt 420
gaaggagatg ggtctgattt ccgagtcgtc gacagcagcg tcgggcgcct cggagccctg 480
tgctgtgccg aacacattca gccgttgtcg aaatacgcca tgtacgcgat gaacgagcaa 540
attcatgtgg cgtcgtggcc atctttcacg ctctatcgcg gcaaagccta cgctttgggt 600
catgaggtga atcttgccgc cagccaaatc tacgcgctcg aaggaggttg cttcgtcttg 660
catgccacgg caattaccgg tcaggatatg ttcgacatgc tttgcgacac tccggaaagg 720
gcggatttgc tgaatgcgga gggagcaaag ccgggtggag gctattcgat gatttttggt 780
cccgatggtc agccgatgtg cgagcatctg ccgcaggaca aggaaggcat cctctatgcc 840
ggcgtagacc tgtcgatgat tgcgatcgcc aaagcggcct acgatcctac ggggcactac 900
gcccgcggtg atgtcgtccg tctcatggtc aaccgcagcc cccgtcgcac gagcgteagc 960
ttcagcgaag acgagaacgc ggcggtcact ttcaccgaga cctga 1005
<210>190
<211>334
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>190
Met Gln Asp Thr Lys Phe Lys Val Ala Val Val Gln Ala Ala Pro Val
1 5 10 15
Phe Met Asp Ala Pro Ala Ser Val Ala Lys Ala Ile Gly Phe Ile Gln
20 25 30
Glu Ala Gly Ala Ala Gly Ala Lys Leu Leu Ala Phe Pro Glu Val Trp
35 40 45
Ile Pro Gly Tyr Pro Trp Trp Leu Trp Leu Gly Thr Pro Ala Trp Gly
50 55 60
Met Gln Phe Val Pro Arg Tyr His Ala Asn Ser Leu Arg Ala Asp Gly
65 70 75 80
Pro Glu Ile Leu Ala Leu Cys Ala Ala Ala Ala Glu Ala Lys Ile Asn
85 90 95
Val Val Met Gly Phe Ser Glu Ile Asp Gly Gly Thr Leu Tyr Leu Ser
100 105 110
Gln Val Phe Ile Ser Asp Ala Gly Lys Ile Ile Phe Lys Arg Arg Lys
115 120 125
Leu Lys Pro Thr His Val Glu Arg Thr Leu Phe Gly Glu Gly Asp Gly
130 135 140
Ser Asp Phe Arg Val Val Asp Ser Ser Val Gly Arg Leu Gly Ala Leu
145 150 155 160
Cys Cys Ala Glu His Ile Gln Pro Leu Ser Lys Tyr Ala Met Tyr Ala
165 170 175
Met Asn Glu Gln Ile His Val Ala Ser Trp Pro Ser Phe Thr Leu Tyr
180 185 190
Arg Gly Lys Ala Tyr Ala Leu Gly His Glu Val Asn Leu Ala Ala Ser
195 200 205
Gln Ile Tyr Ala Leu Glu Gly Gly Cys Phe Val Leu His Ala Thr Ala
210 215 220
Ile Thr Gly Gln Asp Met Phe Asp Met Leu Cys Asp Thr Pro Glu Arg
225 230 235 240
Ala Asp Leu Leu Asn Ala Glu Gly Ala Lys Pro Gly Gly Gly Tyr Ser
245 250 255
Met Ile Phe Gly Pro Asp Gly Gln Pro Met Cys Glu His Leu Pro Gln
260 265 270
Asp Lys Glu Gly Ile Leu Tyr Ala Gly Val Asp Leu Ser Met Ile Ala
275 280 285
Ile Ala Lys Ala Ala Tyr Asp Pro Thr Gly His Tyr Ala Arg Gly Asp
290 295 300
Val Val Arg Leu Met Val Asn Arg Ser Pro Arg Arg Thr Ser Val Ser
305 310 315 320
Phe Ser Glu Asp Glu Asn Ala Ala Val Thr Phe Thr Glu Thr
325 330
<210>191
<211>945
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>191
atgaaaaagt tagctgtggt tcaacgtgcg tcaggatttt tagataagca gcagagcatc 60
gcgttggcgg tggaaagtat tcagtctgct gcgaataatg gcgcagagct tgttgttttt 120
acggaagcct ttattcctgg ttatcctgtc tggttatggc gtctgcgccc tggcaaagac 180
tgggggacaa cagacagtct ttatcaacgc ttaataagca acgcggttga tttaagctca 240
tcggatttgg atccgattta tgaagcggca aaacgtcatc acgtcacggt tgtatgcggc 300
attaatgaac gcgactccag cgtcagccga acaacgctat acaacactta catcacggtt 360
tgtcatgagg gcaatctcat caatgttcat cgaaaactga tgccgaccaa cccagagaga 420
atggtgtggg gctttggtga tgcgactgga ttaagggtag tagacactcc tgtcggaagg 480
attggctcac tcgtttgctg ggagaactac atgccgttgg cacgctatgc actttatgct 540
cagggcgtcg aaatttacat tgcgcctact tatgacagtg gctcggactg gactgaaagc 600
ttgcgccata tcgccagaga gggcagatgc tacgttgtcg gcagcggtaa cttgttgaga 660
gccagcgacc tgcctgatga ttttccagaa aaagaaaccc tctatcctga taaagacgag 720
tggattaacg gcggagactc taccgttatc gctcccggcg gtgaaacatt agttgctccg 780
ctgcatgcag aggaaggcat actgtattgc gatattgata ctgataaagt ggcggcggct 840
cggcgttctt tcgacgttgc aggccattac tctcgcccag acatatttac actcaacgta 900
aatcgagcgc cgcaaacatc tctgcgtatc agggaagccg agtaa 945
<210>192
<211>314
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>192
Met Lys Lys Leu Ala Val Val Gln Arg Ala Ser Gly Phe Leu Asp Lys
1 5 10 15
Gln Gln Ser Ile Ala Leu Ala Val Glu Ser Ile Gln Ser Ala Ala Asn
20 25 30
Asn Gly Ala Glu Leu Val Val Phe Thr Glu Ala Phe Ile Pro Gly Tyr
35 40 45
Pro Val Trp Leu Trp Arg Leu Arg Pro Gly Lys Asp Trp Gly Thr Thr
50 55 60
Asp Ser Leu Tyr Gln Arg Leu Ile Ser Asn Ala Val Asp Leu Ser Ser
65 70 75 80
Ser Asp Leu Asp Pro Ile Tyr Glu Ala Ala Lys Arg His His Val Thr
85 90 95
Val Val Cys Gly Ile Asn Glu Arg Asp Ser Ser Val Ser Arg Thr Thr
100 105 110
Leu Tyr Asn Thr Tyr Ile Thr Val Cys His Glu Gly Asn Leu Ile Asn
115 120 125
Val His Arg Lys Leu Met Pro Thr Asn Pro Glu Arg Met Val Trp Gly
130 135 140
Phe Gly Asp Ala Thr Gly Leu Arg Val Val Asp Thr Pro Val Gly Arg
145 150 155 160
Ile Gly Ser Leu Val Cys Trp Glu Asn Tyr Met Pro Leu Ala Arg Tyr
165 170 175
Ala Leu Tyr Ala Gln Gly Val Glu Ile Tyr Ile Ala Pro Thr Tyr Asp
180 185 190
Ser Gly Ser Asp Trp Thr Glu Ser Leu Arg His Ile Ala Arg Glu Gly
195 200 205
Arg Cys Tyr Val Val Gly Ser Gly Asn Leu Leu Arg Ala Ser Asp Leu
210 215 220
Pro Asp Asp Phe Pro Glu Lys Glu Thr Leu Tyr Pro Asp Lys Asp Glu
225 230 235 240
Trp Ile Asn Gly Gly Asp Ser Thr Val Ile Ala Pro Gly Gly Glu Thr
245 250 255
Leu Val Ala Pro Leu His Ala Glu Glu Gly Ile Leu Tyr Cys Asp Ile
260 265 270
Asp Thr Asp Lys Val Ala Ala Ala Arg Arg Ser Phe Asp Val Ala Gly
275 280 285
His Tyr Ser Arg Pro Asp Ile Phe Thr Leu Asn Val Asn Arg Ala Pro
290 295 300
Gln Thr Ser Leu Arg Ile Arg Glu Ala Glu
305 310
<210>193
<211>966
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>193
atgtcaaacg agaaccacaa ccaaacattc aaagttgccg cggtgcaggc cacacctgta 60
ttcctcgatc gtgaagcgac catcgacaaa gcttgtgagt tgattgctgc agccggcaat 120
gaaggagcgc ggctggttgt cttcccggag gcattcatcc catcctaccc agattgggta 180
tgggcaatcc caccgggcga agaaggcgtg ctcaatgagt tgtacgcgga actgctctcc 240
aattcggtca cgattcccag tgatgtgacg gatagactgt gccgagccgc gagacttgcc 300
aatgcctacg tagtgatggg gatgagcgaa cgcaatgccg aggccagtgg cgcaagcctg 360
tataacacgc tgttgtacat cgatgcgcag ggcgagattc tgggcaaaca tcgaaagctg 420
gtgcccacag gcggcgaacg gctggtgtgg gcgcagggcg atggcagcac gctgcaggtc 480
tacgatactc cactgggtaa actcggcggt ttgatttgct gggagaatta tatgccgctg 540
gcccgctaca ccatgtacgc atggggcaca caaatctatg ttgcggcgac atgggatcgc 600
gggcaaccct ggctctccac tttacggcat atcgccaaag aaggcagggt gtacgtgatc 660
ggctgctgta tcgtgatgcg caaagacgat atcccagatc gttacccgat gaagcagaag 720
ttttacgcgg aggccgatga gtggatcaac ataggggaca gcgcaatcgt caatcctgaa 780
gggcagttta gcgccgggcc ggtacgcaaa caggaagaga ttctctacgc ggaaattgat 840
ccgcgcatgg tgcaaggccc gaagtggatg ctcgacgtag cagggcacta cgcgaggccg 900
gacgtattcc agttgacggt gcatacggat gcgaggcaga tgatcaggtt ggaacacgat 960
gtttaa 966
<210>194
<211>321
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>194
Met Ser Asn Glu Asn His Asn Gln Thr Phe Lys Val Ala Ala Val Gln
1 5 10 15
Ala Thr Pro Val Phe Leu Asp Arg Glu Ala Thr Ile Asp Lys Ala Cys
20 25 30
Glu Leu Ile Ala Ala Ala Gly Asn Glu Gly Ala Arg Leu Val Val Phe
35 40 45
Pro Glu Ala Phe Ile Pro Ser Tyr Pro Asp Trp Val Trp Ala Ile Pro
50 55 60
Pro Gly Glu Glu Gly Val Leu Asn Glu Leu Tyr Ala Glu Leu Leu Ser
65 70 75 80
Asn Ser Val Thr Ile Pro Ser Asp Val Thr Asp Arg Leu Cys Arg Ala
85 90 95
Ala Arg Leu Ala Asn Ala Tyr Val Val Met Gly Met Ser Glu Arg Asn
100 105 110
Ala Glu Ala Ser Gly Ala Ser Leu Tyr Asn Thr Leu Leu Tyr Ile Asp
115 120 125
Ala Gln Gly Glu Ile Leu Gly Lys His Arg Lys Leu Val Pro Thr Gly
130 135 140
Gly Glu Arg Leu Val Trp Ala Gln Gly Asp Gly Ser Thr Leu Gln Val
145 150 155 160
Tyr Asp Thr Pro Leu Gly Lys Leu Gly Gly Leu Ile Cys Trp Glu Asn
165 170 175
Tyr Met Pro Leu Ala Arg Tyr Thr Met Tyr Ala Trp Gly Thr Gln Ile
180 185 190
Tyr Val Ala Ala Thr Trp Asp Arg Gly Gln Pro Trp Leu Ser Thr Leu
195 200 205
Arg His Ile Ala Lys Glu Gly Arg Val Tyr Val Ile Gly Cys Cys Ile
210 215 220
Val Met Arg Lys Asp Asp Ile Pro Asp Arg Tyr Pro Met Lys Gln Lys
225 230 235 240
Phe Tyr Ala Glu Ala Asp Glu Trp Ile Asn Ile Gly Asp Ser Ala Ile
245 250 255
Val Asn Pro Glu Gly Gln Phe Ser Ala Gly Pro Val Arg Lys Gln Glu
260 265 270
Glu Ile Leu Tyr Ala Glu Ile Asp Pro Arg Met Val Gln Gly Pro Lys
275 280 285
Trp Met Leu Asp Val Ala Gly His Tyr Ala Arg Pro Asp Val Phe Gln
290 295 300
Leu Thr Val His Thr Asp Ala Arg Gln Met Ile Arg Leu Glu His Asp
305 310 315 320
Val
<210>195
<211>993
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>195
atgaaagtcg tcaaagccgc cgcggtacag ttcagcccgg tgctctatag ccgtgaggca 60
accgtcgcca aggtcgtgca gaagatccac gaactcggcc agaaaggcgt gcagttcgcc 120
accttccccg aaacggtcgt gccttattac ccttactttg cggccgtcca gacgggcatc 180
gagcttctct cgggcaccga acatctgcgc ctgctcgaac aggctgtgac tgtcccgtcc 240
gctgctaccg acgcgatcgg cgaagccgcg cgaaaggccg gcatggtcgt gtccattggc 300
gtcaatgagc gcgatggcgg cacgctgtac aacgcacaac tgctcttcga tgccgacggt 360
acgctgatcc agcgccgccg caagatcacg ccgacgcatt tcgaacgcat gatctggggc 420
cagggagatg gctcgggctt gcgtgcagtc gacagcgccg tcggccgcgt cggccagctc 480
gcatgtttcg agcacaacaa cccgctcgcc cgctacgcaa tgatcgccga cggcgagcag 540
atccattcgg cgatgtaccc tggctcggcc tttggcgagg gcttcgccca gcgtatggaa 600
atcaacatcc gccagcatgc gctcgagtcc gccgctttcg tcgtcaacgc aacagcgtgg 660
ctggacgccg accagcaggc gcaaatcatg aaggacaccg gttgtggaat cggtccgatc 720
acgggcggct gcttcaccac gatcgtctct cctgacggca tgctgatggc cgagccgctt 780
cgctcgggtg aaggcgaagt gatcgtcgat ctcgacttca cgcagatcga ccgccgcaag 840
atgctgatgg actcggccgg ccactacaac cgccctgaac tgctgagtct gatgatcgac 900
cgtacgccga ccgcgcatgt tcacgaacgc gcttcgcacc cgatgatcgt caacgaccag 960
ggttccgacg atctgcgcac ccaggctgca tga 993
<210>196
<211>330
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>196
Met Lys Val Val Lys Ala Ala Ala Val Gln Phe Ser Pro Val Leu Tyr
1 5 10 15
Ser Arg Glu Ala Thr Val Ala Lys Val Val Gln Lys Ile His Glu Leu
20 25 30
Gly Gln Lys Gly Val Gln Phe Ala Thr Phe Pro Glu Thr Val Val Pro
35 40 45
Tyr Tyr Pro Tyr Phe Ala Ala Val Gln Thr Gly Ile Glu Leu Leu Ser
50 55 60
Gly Thr Glu His Leu Arg Leu Leu Glu Gln Ala Val Thr Val Pro Ser
65 70 75 80
Ala Ala Thr Asp Ala Ile Gly Glu Ala Ala Arg Lys Ala Gly Met Val
85 90 95
Val Ser Ile Gly Val Asn Glu Arg Asp Gly Gly Thr Leu Tyr Asn Ala
100 105 110
Gln Leu Leu Phe Asp Ala Asp Gly Thr Leu Ile Gln Arg Arg Arg Lys
115 120 125
Ile Thr Pro Thr His Phe Glu Arg Met Ile Trp Gly Gln Gly Asp Gly
130 135 140
Ser Gly Leu Arg Ala Val Asp Ser Ala Val Gly Arg Val Gly Gln Leu
145 150 155 160
Ala Cys Phe Glu His Asn Asn Pro Leu Ala Arg Tyr Ala Met Ile Ala
165 170 175
Asp Gly Glu Gln Ile His Ser Ala Met Tyr Pro Gly Ser Ala Phe Gly
180 185 190
Glu Gly Phe Ala Gln Arg Met Glu Ile Asn Ile Arg Gln His Ala Leu
195 200 205
Glu Ser Ala Ala Phe Val Val Asn Ala Thr Ala Trp Leu Asp Ala Asp
210 215 220
Gln Gln Ala Gln Ile Met Lys Asp Thr Gly Cys Gly Ile Gly Pro Ile
225 230 235 240
Thr Gly Gly Cys Phe Thr Thr Ile Val Ser Pro Asp Gly Met Leu Met
245 250 255
Ala Glu Pro Leu Arg Ser Gly Glu Gly Glu Val Ile Val Asp Leu Asp
260 265 270
Phe Thr Gln Ile Asp Arg Arg Lys Met Leu Met Asp Ser Ala Gly His
275 280 285
Tyr Asn Arg Pro Glu Leu Leu Ser Leu Met Ile Asp Arg Thr Pro Thr
290 295 300
Ala His Val His Glu Arg Ala Ser His Pro Met Ile Val Asn Asp Gln
305 310 315 320
Gly Ser Asp Asp Leu Arg Thr Gln Ala Ala
325 330
<210>197
<211>1017
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>197
atgggcatcg aacatccaaa ataccgcgtt gccgccgtgc aggctgcgcc ggcctggctc 60
gacctcgatc gctcgatcga caaggctatt gcactgatcg aggaggccgc cgcgaacggc 120
gccagattga tcgcattccc ggaggtcttc atccccggct acccctggca tatctggctc 180
gactcgccgg cctgggcgat cggccgcggc ttcgtgcagc gttatttcga caattcgctc 240
gcttatgata gtccgcaggc cgagcggctc cgcgcagcgg tccgcaaggc gcgcctgacc 300
gccgtgatcg gcctttcgga gcggagcggc ggcagcctct acatcgcgca atggctcgtt 360
ggccccgacg gcgagaccat cgcgaagcgc cgcaagctcc gtccgacgca tgccgagcgc 420
acggtctatg gcgagggcga cggcagcgat ctggcggtcc atgaccggcc cgatatcgga 480
cggctcggcg cgctgtgctg ctgggaacat ctgcaaccgt tgtcgaaata tgcgatgtat 540
gcccagaacg agcaggtcca tgtggcgtca tggccgagtt tttcgctcta cgatcccttt 600
gccccggcgc tcggcgcgga ggtgaacaat gcggcctccc gggtctatgc ggtcgaaggc 660
tcctgcttcg tgctggcgcc gtgcgcgacc gtctcgcagg ccatgatcga tgagctgtgc 720
gaccggcccg acaagcacgc gctgctccat gccggcggtg gctttgccgc gatctacggg 780
cccgacggca gttcgctggc cgaaaagctc gcgccggacc aggagggcct gctttacgcc 840
gacatcgatc tcggcgcgat cggcgtcgcg aagaacgccg ccgacccggc agggcattat 900
tcgcggcccg atgtcacgcg gctgctgctg aacaacaagc cctacaagcg cgtggagcat 960
tttgctttgc ccggcgatac cgtggcgcct gccgatgtgg atgcggcggc gagctga 1017
<210>198
<211>338
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>198
Met Gly Ile Glu His Pro Lys Tyr Arg Val Ala Ala Val Gln Ala Ala
1 5 10 15
Pro Ala Trp Leu Asp Leu Asp Arg Ser Ile Asp Lys Ala Ile Ala Leu
20 25 30
Ile Glu Glu Ala Ala Ala Asn Gly Ala Arg Leu Ile Ala Phe Pro Glu
35 40 45
Val Phe Ile Pro Gly Tyr Pro Trp His Ile Trp Leu Asp Ser Pro Ala
50 55 60
Trp Ala Ile Gly Arg Gly Phe Val Gln Arg Tyr Phe Asp Asn Ser Leu
65 70 75 80
Ala Tyr Asp Ser Pro Gln Ala Glu Arg Leu Arg Ala Ala Val Arg Lys
85 90 95
Ala Arg Leu Thr Ala Val Ile Gly Leu Ser Glu Arg Ser Gly Gly Ser
100 105 110
Leu Tyr Ile Ala Gln Trp Leu Val Gly Pro Asp Gly Glu Thr Ile Ala
115 120 125
Lys Arg Arg Lys Leu Arg Pro Thr His Ala Glu Arg Thr Val Tyr Gly
130 135 140
Glu Gly Asp Gly Ser Asp Leu Ala Val His Asp Arg Pro Asp Ile Gly
145 150 155 160
Arg Leu Gly Ala Leu Cys Cys Trp Glu His Leu Gln Pro Leu Ser Lys
165 170 175
Tyr Ala Met Tyr Ala Gln Asn Glu Gln Val His Val Ala Ser Trp Pro
180 185 190
Ser Phe Ser Leu Tyr Asp Pro Phe Ala Pro Ala Leu Gly Ala Glu Val
195 200 205
Asn Asn Ala Ala Ser Arg Val Tyr Ala Val Glu Gly Ser Cys Phe Val
210 215 220
Leu Ala Pro Cys Ala Thr Val Ser Gln Ala Met Ile Asp Glu Leu Cys
225 230 235 240
Asp Arg Pro Asp Lys His Ala Leu Leu His Ala Gly Gly Gly Phe Ala
245 250 255
Ala Ile Tyr Gly Pro Asp Gly Ser Ser Leu Ala Glu Lys Leu Ala Pro
260 265 270
Asp Gln Glu Gly Leu Leu Tyr Ala Asp Ile Asp Leu Gly Ala Ile Gly
275 280 285
Val Ala Lys Asn Ala Ala Asp Pro Ala Gly His Tyr Ser Arg Pro Asp
290 295 300
Val Thr Arg Leu Leu Leu Asn Asn Lys Pro Tyr Lys Arg Val Glu His
305 310 315 320
Phe Ala Leu Pro Gly Asp Thr Val Ala Pro Ala Asp Val Asp Ala Ala
325 330 335
Ala Ser
<210>199
<211>993
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>199
atgtcaaacg cactgaagcc gttcaaaatc gccgccgttc aggcgacgcc ggtttttctg 60
aaccgggaag ccacggcgga gaaggccgcg gctttgatcc gcgaagcggg aagcgccgga 120
gccaagctca tcgttttccc ggaatcgttt attccggcct atccggactg ggtctgggtg 180
gtcccctcgg ggagggatcg ccttctcagc ggcctctacg gggagatgct cgaaaacgcc 240
gtggaaatcc ccggcccggc cacggggcat atcggccggg cggcgaagga atcgggcgct 300
tatgtcgtca tgggcgtgac cgagcgggac acggaggcga gcggagccag tttgttcaac 360
accttgattt atttcggtcc gaccggggaa attttgggca aacaccggaa gctggttccc 420
accgggggcg aacggatcgt ctgggcccag ggggacggaa gcaccctgga ggtctacgat 480
acgcccctgg gaaaactggg cgggctgatc tgctgggaaa actacatgcc cctggcccgg 540
tacgccatgt acgcctgggg aacccagctt tacgtggccg ccacctggga ccgaggcgaa 600
ccctggcttt cgacgcttcg gcatatcgcc aaggaagggc gggtgtatgt catcgggtgc 660
tgcatcgcca tgcggaaagg ggatatcccg gatcggttcg aacacaaggg gctctacgcc 720
cccgaccggg actggatcaa ccccggcgac agcgcgatcg tcaaccccca gggggagatg 780
atcgccgggc ccgcttccaa taaggaagag atcctttatg cggaagtcga cccgcagatg 840
atgcgcgggc ccaaatggat gctcgatgtg gccggccatt acgcgcggcc cgatgtcttc 900
gagctcaccg tccgccggga accgcggccg atgatccgcg tggcgggagg cgcgggcggg 960
accgaaccca aagagaagaa gaccgccggc tga 993
<210>200
<211>330
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>200
Met Ser Asn Ala Leu Lys Pro Phe Lys Ile Ala Ala Val Gln Ala Thr
1 5 10 15
Pro Val Phe Leu Asn Arg Glu Ala Thr Ala Glu Lys Ala Ala Ala Leu
20 25 30
Ile Arg Glu Ala Gly Ser Ala Gly Ala Lys Leu Ile Val Phe Pro Glu
35 40 45
Ser Phe Ile Pro Ala Tyr Pro Asp Trp Val Trp Val Val Pro Ser Gly
50 55 60
Arg Asp Arg Leu Leu Ser Gly Leu Tyr Gly Glu Met Leu Glu Asn Ala
65 70 75 80
Val Glu Ile Pro Gly Pro Ala Thr Gly His Ile Gly Arg Ala Ala Lys
85 90 95
Glu Ser Gly Ala Tyr Val Val Met Gly Val Thr Glu Arg Asp Thr Glu
100 105 110
Ala Ser Gly Ala Ser Leu Phe Asn Thr Leu Ile Tyr Phe Gly Pro Thr
115 120 125
Gly Glu Ile Leu Gly Lys His Arg Lys Leu Val Pro Thr Gly Gly Glu
130 135 140
Arg Ile Val Trp Ala Gln Gly Asp Gly Ser Thr Leu Glu Val Tyr Asp
145 150 155 160
Thr Pro Leu Gly Lys Leu Gly Gly Leu Ile Cys Trp Glu Asn Tyr Met
165 170 175
Pro Leu Ala Arg Tyr Ala Met Tyr Ala Trp Gly Thr Gln Leu Tyr Val
180 185 190
Ala Ala Thr Trp Asp Arg Gly Glu Pro Trp Leu Ser Thr Leu Arg His
195 200 205
Ile Ala Lys Glu Gly Arg Val Tyr Val Ile Gly Cys Cys Ile Ala Met
210 215 220
Arg Lys Gly Asp Ile Pro Asp Arg Phe Glu His Lys Gly Leu Tyr Ala
225 230 235 240
Pro Asp Arg Asp Trp Ile Asn Pro Gly Asp Ser Ala Ile Val Asn Pro
245 250 255
Gln Gly Glu Met Ile Ala Gly Pro Ala Ser Asn Lys Glu Glu Ile Leu
260 265 270
Tyr Ala Glu Val Asp Pro Gln Met Met Arg Gly Pro Lys Trp Met Leu
275 280 285
Asp Val Ala Gly His Tyr Ala Arg Pro Asp Val Phe Glu Leu Thr Val
290 295 300
Arg Arg Glu Pro Arg Pro Met Ile Arg Val Ala Gly Gly Ala Gly Gly
305 310 315 320
Thr Glu Pro Lys Glu Lys Lys Thr Ala Gly
325 330
<210>201
<211>930
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>201
atgaccacga agagcatccg catcgcggcc gtacaggcgg ctcccgcgtt tctcgacctg 60
gctggtacgc tggaccggct cgaggcctgg gcccgcaagg ccgccgccac cggtgcccgc 120
gtcatcgcgt tccccgagac ctggctgccg ggctacccgg cgtggatcga ctcgtcgccg 180
gaggccgcga tctggggcca tcccggctcg cgcgacctgc accagcgcct gatggagaat 240
gccgtcgagg tcccgggccc cgcgaccgcg cgcatcgcga agctcgccgg cgagctcggc 300
gtgacgatcg tggtcggcgc gcacgagcgg gcggggaaca ccctctacaa cacggcgctg 360
acgttcgggc ccgagggcag gctgctcaat caccaccgga agctggtgcc gacctacagc 420
gaacggctgc tgtggggcta cggcgacggc gctggactgg tggcgccggc ggtggacggt 480
gtgaaggtcg gggcgctggt gtgctgggag cactggatgc cgctcacccg ccaggcgatg 540
cacgacgtcg gcgagcacgt gcacgtcgcc ctgtggcccg gcgtccacga gatgcaccag 600
gtggcctcgc ggcactatgc gttcgagggc cgctgtttcg tgatcgcggt cgggagcatc 660
ctgcgcgtgg accagatgcc gaagcagctg ccgccgctgg agaagtacgc gaagagcgcc 720
aaggggctga tgatcgcggg cggcagcgcc atcatcgcgc cgaacggccg ctacgtcgcg 780
gcgccggtgt acgacgagga gacgatcgtc accgccgact gcgacctcgg cgagatcccg 840
cgcgaggcgc agacgctcga tgtctcgggc cactacagcc ggccggacgt gttcagcttc 900
ggggtggtca gacaccggcc gcgtgcgtaa 930
<210>202
<211>309
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>202
Met Thr Thr Lys Ser Ile Arg Ile Ala Ala Val Gln Ala Ala Pro Ala
1 5 10 15
Phe Leu Asp Leu Ala Gly Thr Leu Asp Arg Leu Glu Ala Trp Ala Arg
20 25 30
Lys Ala Ala Ala Thr Gly Ala Arg Val Ile Ala Phe Pro Glu Thr Trp
35 40 45
Leu Pro Gly Tyr Pro Ala Trp Ile Asp Ser Ser Pro Glu Ala Ala Ile
50 55 60
Trp Gly His Pro Gly Ser Arg Asp Leu His Gln Arg Leu Met Glu Asn
65 70 75 80
Ala Val Glu Val Pro Gly Pro Ala Thr Ala Arg Ile Ala Lys Leu Ala
85 90 95
Gly Glu Leu Gly Val Thr Ile Val Val Gly Ala His Glu Arg Ala Gly
100 105 110
Asn Thr Leu Tyr Asn Thr Ala Leu Thr Phe Gly Pro Glu Gly Arg Leu
115 120 125
Leu Asn His His Arg Lys Leu Val Pro Thr Tyr Ser Glu Arg Leu Leu
130 135 140
Trp Gly Tyr Gly Asp Gly Ala Gly Leu Val Ala Pro Ala Val Asp Gly
145 150 155 160
Val Lys Val Gly Ala Leu Val Cys Trp Glu His Trp Met Pro Leu Thr
165 170 175
Arg Gln Ala Met His Asp Val Gly Glu His Val His Val Ala Leu Trp
180 185 190
Pro Gly Val His Glu Met His Gln Val Ala Ser Arg His Tyr Ala Phe
195 200 205
Glu Gly Arg Cys Phe Val Ile Ala Val Gly Ser Ile Leu Arg Val Asp
210 215 220
Gln Met Pro Lys Gln Leu Pro Pro Leu Glu Lys Tyr Ala Lys Ser Ala
225 230 235 240
Lys Gly Leu Met Ile Ala Gly Gly Ser Ala Ile Ile Ala Pro Asn Gly
245 250 255
Arg Tyr Val Ala Ala Pro Val Tyr Asp Glu Glu Thr Ile Val Thr Ala
260 265 270
Asp Cys Asp Leu Gly Glu Ile Pro Arg Glu Ala Gln Thr Leu Asp Val
275 280 285
Ser Gly His Tyr Ser Arg Pro Asp Val Phe Ser Phe Gly Val Val Arg
290 295 300
His Arg Pro Arg Ala
305
<210>203
<211>966
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>203
atgtcaagct tacccacatc cgcgttcacc gtcgccgccg cgcaagcgtc gccagtgttc 60
ctcgaccgcg acgcgacgct gcagaaggct tgcgggctga tcgccgacgc cgggcgcgcg 120
ggcgcgcgcc tgatcgtctt ccccgaagcc ttcatcccgg cctaccccga ttgggtgtgg 180
gcggtcccag ctggcgaaga ggggatgctg agcgagctct acgccgagct ggtcgcgaat 240
tcgctggcta ttccgagcga cgcgaccgat cggctatgtc gcgcggcgca ggccgcgcat 300
atcaatgtgg tcgtggggtt gagcgagcgc aatgtcgagg ccagcggcgc cagcctctac 360
aacacgctgc tgtatatcga cgcggcggga acgatcctgg gtaaacaccg caagcttgtg 420
ccgaccggcg gggagcgcct ggtctgggcg cagggcgacg gcagcacgct cgatgtgtac 480
gacaccgcgc tcggcaagct cggcggcctg atctgttggg aaaactacat gccgctggca 540
cgctacgcgc tgtacgcctg gggtgtgcaa atctatgtcg cggccacctg ggatcgcggc 600
gagccctggc tttctactct gcgacatatc gccaaggaag gccgtgtcta cgtgatcggc 660
tgtggcatgg cgctgcgcag agatgatatt cccgatcgct tcgctttcaa gcagcgcttc 720
tatgcccagg ccggcgaatg gatcaacgtc ggcgacagcg cgatcgtcaa cccgagcggc 780
gagtttattg ccggacctgt gcgcgaacgc gaggagattc tgtacgcgga ggtcgacccg 840
gagcagatga gcgggccaaa gtggatgctc gacgtggccg ggcactacgg gcggccggat 900
gtcttccggc tcagcgtcaa ccgggcgccg caccagatga tccagacgga gaaccgggag 960
acctga 966
<210>204
<211>321
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>204
Met Ser Ser Leu Pro Thr Ser Ala Phe Thr Val Ala Ala Ala Gln Ala
1 5 10 15
Ser Pro Val Phe Leu Asp Arg Asp Ala Thr Leu Gln Lys Ala Cys Gly
20 25 30
Leu Ile Ala Asp Ala Gly Arg Ala Gly Ala Arg Leu Ile Val Phe Pro
35 40 45
Glu Ala Phe Ile Pro Ala Tyr Pro Asp Trp Val Trp Ala Val Pro Ala
50 55 60
Gly Glu Glu Gly Met Leu Ser Glu Leu Tyr Ala Glu Leu Val Ala Asn
65 70 75 80
Ser Leu Ala Ile Pro Ser Asp Ala Thr Asp Arg Leu Cys Arg Ala Ala
85 90 95
Gln Ala Ala His Ile Asn Val Val Val Gly Leu Ser Glu Arg Asn Val
100 105 110
Glu Ala Ser Gly Ala Ser Leu Tyr Asn Thr Leu Leu Tyr Ile Asp Ala
115 120 125
Ala Gly Thr Ile Leu Gly Lys His Arg Lys Leu Val Pro Thr Gly Gly
130 135 140
Glu Arg Leu Val Trp Ala Gln Gly Asp Gly Ser Thr Leu Asp Val Tyr
145 150 155 160
Asp Thr Ala Leu Gly Lys Leu Gly Gly Leu Ile Cys Trp Glu Asn Tyr
165 170 175
Met Pro Leu Ala Arg Tyr Ala Leu Tyr Ala Trp Gly Val Gln Ile Tyr
180 185 190
Val Ala Ala Thr Trp Asp Arg Gly Glu Pro Trp Leu Ser Thr Leu Arg
195 200 205
His Ile Ala Lys Glu Gly Arg Val Tyr Val Ile Gly Cys Gly Met Ala
210 215 220
Leu Arg Arg Asp Asp Ile Pro Asp Arg Phe Ala Phe Lys Gln Arg Phe
225 230 235 240
Tyr Ala Gln Ala Gly Glu Trp Ile Asn Val Gly Asp Ser Ala Ile Val
245 250 255
Asn Pro Ser Gly Glu Phe Ile Ala Gly Pro Val Arg Glu Arg Glu Glu
260 265 270
Ile Leu Tyr Ala Glu Val Asp Pro Glu Gln Met Ser Gly Pro Lys Trp
275 280 285
Met Leu Asp Val Ala Gly His Tyr Gly Arg Pro Asp Val Phe Arg Leu
290 295 300
Ser Val Asn Arg Ala Pro His Gln Met Ile Gln Thr Glu Asn Arg Glu
305 310 315 320
Thr
<210>205
<211>969
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>205
atgaccaccg taaaagccgc cgcagtacag atcagccccg tgctctatag ccgggaggcc 60
accgtagaca aggtcgttcg caagatccgc gagctcggcc aaaagggagt gcagttcgcc 120
accttcccgg aaaccgtagt gccgtactac ccctacttcg ctgcagtcca gacaggcatc 180
gaactgttgt ccggcaagga acacatgcgc ctgctggagc aggccgttac cgtcccctcg 240
cccgccacgg atgcgattgc tcaggcggcg cgcgaagcca atatggtggt gtccatcggc 300
gtcaacgagc gcgacggcgg caccatctac aacacgcagc tgctcttcga tgccgacggc 360
acgctcgtgc agcgccgccg caagataacg ccaacgcact tcgagcgcat ggtctggggc 420
cagggcgatg gttcgggatt gcgcgccgcg gacaccaagg ttggccgcat cggccagttg 480
gcctgcttcg agcacaacaa cccgctcgcc cgttacgcca tgatggccga tggcgagcag 540
atccactccg ccatgtaccc gggctcggcc ttcggcgagg gcttcgcgca gcgcatggag 600
atcaacatcc gccagcatgc cctggagtct ggctgcttcg tggtgaatgc gaccgcctgg 660
ctcgatgccg accaacaggc gcagatcatg aaggacaccg gttgctcgat cggcccgatc 720
tccggcggct gcttcacgac catcgtcacg cctgagggca tgctgattgg cgagccgctc 780
cgcgagggcg aaggcgaaat catcgccgac ctcgatttct cgatgatcga tcgccgcaag 840
ctgctgatgg actcggtcgg tcactacaac cgtccggagc ttctgagcct cctgatcgat 900
cgcacgcctg ccgcgaactt ccatgaacgt accgcgagcc aggcgaacgc cggcgtcgaa 960
atcctctga 969
<210>206
<211>322
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>206
Met Thr Thr Val Lys Ala Ala Ala Val Gln Ile Ser Pro Val Leu Tyr
1 5 10 15
Ser Arg Glu Ala Thr Val Asp Lys Val Val Arg Lys Ile Arg Glu Leu
20 25 30
Gly Gln Lys Gly Val Gln Phe Ala Thr Phe Pro Glu Thr Val Val Pro
35 40 45
Tyr Tyr Pro Tyr Phe Ala Ala Val Gln Thr Gly Ile Glu Leu Leu Ser
50 55 60
Gly Lys Glu His Met Arg Leu Leu Glu Gln Ala Val Thr Val Pro Ser
65 70 75 80
Pro Ala Thr Asp Ala Ile Ala Gln Ala Ala Arg Glu Ala Asn Met Val
85 90 95
Val Ser Ile Gly Val Asn Glu Arg Asp Gly Gly Thr Ile Tyr Asn Thr
100 105 110
Gln Leu Leu Phe Asp Ala Asp Gly Thr Leu Val Gln Arg Arg Arg Lys
115 120 125
Ile Thr Pro Thr His Phe Glu Arg Met Val Trp Gly Gln Gly Asp Gly
130 135 140
Ser Gly Leu Arg Ala Ala Asp Thr Lys Val Gly Arg Ile Gly Gln Leu
145 150 155 160
Ala Cys Phe Glu His Asn Asn Pro Leu Ala Arg Tyr Ala Met Met Ala
165 170 175
Asp Gly Glu Gln Ile His Ser Ala Met Tyr Pro Gly Ser Ala Phe Gly
180 185 190
Glu Gly Phe Ala Gln Arg Met Glu Ile Asn Ile Arg Gln His Ala Leu
195 200 205
Glu Ser Gly Cys Phe Val Val Asn Ala Thr Ala Trp Leu Asp Ala Asp
210 215 220
Gln Gln Ala Gln Ile Met Lys Asp Thr Gly Cys Ser Ile Gly Pro Ile
225 230 235 240
Ser Gly Gly Cys Phe Thr Thr Ile Val Thr Pro Glu Gly Met Leu Ile
245 250 255
Gly Glu Pro Leu Arg Glu Gly Glu Gly Glu Ile Ile Ala Asp Leu Asp
260 265 270
Phe Ser Met Ile Asp Arg Arg Lys Leu Leu Met Asp Ser Val Gly His
275 280 285
Tyr Asn Arg Pro Glu Leu Leu Ser Leu Leu Ile Asp Arg Thr Pro Ala
290 295 300
Ala Asn Phe His Glu Arg Thr Ala Ser Gln Ala Asn Ala Gly Val Glu
305 310 315 320
Ile Leu
<210>207
<211>966
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>207
atgtcaaacg agaacaacaa cgctacattc aaagttgccg cagtacaggc tacacctgtt 60
tttctcgatc gtgaagcgac tctcgacaag gcttgcgatt tgatcgccgc cgccggaggt 120
gaaggggcac gattggttgt ctttccagaa gccttcatac cggcctatcc ggattgggta 180
tgggcaatcc caccgggtga agagggcgta cttaatgagt tgtacgcaga gctgctctcc 240
aactcggtca cgattcccag tgacgcgacg gacagactgt gccgggccgc gaggcttgct 300
aatgcttacg tggtgatggg gataagcgaa cgcaatgtcg aggcgagtgg agcaagcctg 360
tataacacgc tgttgtacat cgatgcgcag ggtgagattc taggcaaaca tcgaaagcta 420
gtgccaacgg gcggcgagcg gctggtgtgg gcgcagggcg atggcagcac actgcaggtc 480
tacgatactc cactgggaaa actcggcggt ttaatttgct gggagaatta tatgccgctg 540
gcccgctata ccatgtatgc ctggggcaca caaatctatg tcgccgctac gtgggatcgc 600
gggcaaccct ggctctccac tttgcggcat atcgccaaag aaggcagggt gtacgtgatt 660
ggttgttgta tcgcgatgcg caaagacgat atccctgatc gttacgcaat gaagcagaag 720
ttttacgcgg aggcagatga gtggatcaat ataggtgaca gcgcgattgt caatcctgaa 780
gggcaattta tcgcagggcc agtacgcaag caggaagaga ttctctacgc agagattgat 840
ccgcgcatgg tacaagggcc gaagtggatg ctcgacgtgg cggggcacta tgccaggccg 900
gatgtgttcc agttgacggt gcatacggat gtgcgacaga tgattcggat ggaacacgat 960
tcttaa 966
<210>208
<211>321
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>208
Met Ser Asn Glu Asn Asn Asn Ala Thr Phe Lys Val Ala Ala Val Gln
1 5 10 15
Ala Thr Pro Val Phe Leu Asp Arg Glu Ala Thr Leu Asp Lys Ala Cys
20 25 30
Asp Leu Ile Ala Ala Ala Gly Gly Glu Gly Ala Arg Leu Val Val Phe
35 40 45
Pro Glu Ala Phe Ile Pro Ala Tyr Pro Asp Trp Val Trp Ala Ile Pro
50 55 60
Pro Gly Glu Glu Gly Val Leu Asn Glu Leu Tyr Ala Glu Leu Leu Ser
65 70 75 80
Asn Ser Val Thr Ile Pro Ser Asp Ala Thr Asp Arg Leu Cys Arg Ala
85 90 95
Ala Arg Leu Ala Asn Ala Tyr Val Val Met Gly Ile Ser Glu Arg Asn
100 105 110
Val Glu Ala Ser Gly Ala Ser Leu Tyr Asn Thr Leu Leu Tyr Ile Asp
115 120 125
Ala Gln Gly Glu Ile Leu Gly Lys His Arg Lys Leu Val Pro Thr Gly
130 135 140
Gly Glu Arg Leu Val Trp Ala Gln Gly Asp Gly Ser Thr Leu Gln Val
145 150 155 160
Tyr Asp Thr Pro Leu Gly Lys Leu Gly Gly Leu Ile Cys Trp Glu Asn
165 170 175
Tyr Met Pro Leu Ala Arg Tyr Thr Met Tyr Ala Trp Gly Thr Gln Ile
180 185 190
Tyr Val Ala Ala Thr Trp Asp Arg Gly Gln Pro Trp Leu Ser Thr Leu
195 200 205
Arg His Ile Ala Lys Glu Gly Arg Val Tyr Val Ile Gly Cys Cys Ile
210 215 220
Ala Met Arg Lys Asp Asp Ile Pro Asp Arg Tyr Ala Met Lys Gln Lys
225 230 235 240
Phe Tyr Ala Glu Ala Asp Glu Trp Ile Asn Ile Gly Asp Ser Ala Ile
245 250 255
Val Asn Pro Glu Gly Gln Phe Ile Ala Gly Pro Val Arg Lys Gln Glu
260 265 270
Glu Ile Leu Tyr Ala Glu Ile Asp Pro Arg Met Val Gln Gly Pro Lys
275 280 285
Trp Met Leu Asp Val Ala Gly His Tyr Ala Arg Pro Asp Val Phe Gln
290 295 300
Leu Thr Val His Thr Asp Val Arg Gln Met Ile Arg Met Glu His Asp
305 310 315 320
Ser
<210>209
<211>993
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>209
atgaaagtcg tcaaagccgc cgctgtccag atcagtcccg ttctctacag ccgtgaggca 60
accgtcgaaa aggtcgtgaa gaaaattcac gagcttggcc aactgggcgt gcagttcgcc 120
acctttcccg agaccgtagt gccttactac ccgtactttt ccgccgtcca gacaggcatt 180
gagcttctgt ccggaactga gcatctgcgg ctgctcgatc aggccgtgac ggtaccgtct 240
cccgctaccg atgcgatcgg agaggcggcc cgcaaggcgg gcatggtggt gtccatcggc 300
gtgaatgaac gcgacggcgg caccttgtac aacacacagt tgctcttcga tgccgatggc 360
accttgatcc agcgccgccg caagatcacg cccacccact tcgaacggat gatctggggc 420
cagggggacg gctcgggcct gcgcgccgtc gacagcaagg ttggtcgcat tggtcagctt 480
gcctgcttcg agcacaacaa cccgctggcc cgctacgcgc tgattgccga cggcgagcag 540
atccattccg ccatgtatcc gggttctgct ttcggcgaag gctttgccca aaggatggaa 600
atcaatatcc gccagcatgc gctggagtct ggtgcctttg tcgtcaacgc aacggcctgg 660
ctggatgctg accagcaggc gcaaatcatc aaggacaccg gctgtgggat tggcccgatc 720
tcgggcggct gcttcaccac gatcgtggca cccgacggca tgctgatggc cgaacctctg 780
cgttcgggcg agggtgaggt catcgtggat ctcgacttca cgctgatcga ccgacgcaag 840
atgttgatgg actcggcggg ccactataac cgtccagaac tgctcagtct catgattgac 900
cgtaccgcga cggcgcatgt tcacgaacgc gctgcgcatc cggtgtcggg cgcggagcag 960
ggtccggagg atctgcgcac tccggccgcg tga 993
<210>210
<211>330
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>210
Met Lys Val Val Lys Ala Ala Ala Val Gln Ile Ser Pro Val Leu Tyr
1 5 10 15
Ser Arg Glu Ala Thr Val Glu Lys Val Val Lys Lys Ile His Glu Leu
20 25 30
Gly Gln Leu Gly Val Gln Phe Ala Thr Phe Pro Glu Thr Val Val Pro
35 40 45
Tyr Tyr Pro Tyr Phe Ser Ala Val Gln Thr Gly Ile Glu Leu Leu Ser
50 55 60
Gly Thr Glu His Leu Arg Leu Leu Asp Gln Ala Val Thr Val Pro Ser
65 70 75 80
Pro Ala Thr Asp Ala Ile Gly Glu Ala Ala Arg Lys Ala Gly Met Val
85 90 95
Val Ser Ile Gly Val Asn Glu Arg Asp Gly Gly Thr Leu Tyr Asn Thr
100 105 110
Gln Leu Leu Phe Asp Ala Asp Gly Thr Leu Ile Gln Arg Arg Arg Lys
115 120 125
Ile Thr Pro Thr His Phe Glu Arg Met Ile Trp Gly Gln Gly Asp Gly
130 135 140
Ser Gly Leu Arg Ala Val Asp Ser Lys Val Gly Arg Ile Gly Gln Leu
145 150 155 160
Ala Cys Phe Glu His Asn Asn Pro Leu Ala Arg Tyr Ala Leu Ile Ala
165 170 175
Asp Gly Glu Gln Ile His Ser Ala Met Tyr Pro Gly Ser Ala Phe Gly
180 185 190
Glu Gly Phe Ala Gln Arg Met Glu Ile Asn Ile Arg Gln His Ala Leu
195 200 205
Glu Ser Gly Ala Phe Val Val Asn Ala Thr Ala Trp Leu Asp Ala Asp
210 215 220
Gln Gln Ala Gln Ile Ile Lys Asp Thr Gly Cys Gly Ile Gly Pro Ile
225 230 235 240
Ser Gly Gly Cys Phe Thr Thr Ile Val Ala Pro Asp Gly Met Leu Met
245 250 255
Ala Glu Pro Leu Arg Ser Gly Glu Gly Glu Val Ile Val Asp Leu Asp
260 265 270
Phe Thr Leu Ile Asp Arg Arg Lys Met Leu Met Asp Ser Ala Gly His
275 280 285
Tyr Asn Arg Pro Glu Leu Leu Ser Leu Met Ile Asp Arg Thr Ala Thr
290 295 300
Ala His Val His Glu Arg Ala Ala His Pro Val Ser Gly Ala Glu Gln
305 310 315 320
Gly Pro Glu Asp Leu Arg Thr Pro Ala Ala
325 330
<210>211
<211>1062
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>211
atgggcgtcg cacatccgaa atacaaagtg gccgccgtgc aggcagcgcc cgcttttctc 60
gacctggacg cctcggtcga aaaggccgtc cgtttcatcg acgaagccgg cgccgccggc 120
gcccgcctca tcgcctttcc ggagacctgg atacccggtt acccctggtg gatctggcta 180
ggcgcgccgg cctgggctat catgcgcggc ttcgtctcgc gctatttcga caactcgctc 240
agctacgaca gcccgcaggc cgagaagctc cgcgccgccg ccaagcgcaa caagatggtg 300
gtggtgctcg gcctctccga gcgcgacggc ggcagccttt acatcgcgca atggatcatc 360
ggcccggacg gcgaaaccat cgccaagcgc cgcaagctca agccgaccca cgcggagcgg 420
accgtgttcg gcgaaggcga cggctcgcat cttgcggtgc acgagcttga tgttggccgg 480
ctcggcgcgc tgtgctgctg ggaacacctg cagccgctgt ccaaatacgc catgtatgcg 540
cagaacgaac aggtgcatgt cgcggcctgg ccgagctttt cgctttacga tccgttcgcg 600
cacgcgctcg gcgcggaagt gaacaatgcg gcgagcaaaa tctatgcggt cgagggctcg 660
tgtttcgtca tcgcgccgtg cgcgaccgtt tcgcaggcga tgatcgacga actctgcgat 720
acgccggaga agcatcagtt cctgcatgcc ggcggcggct ttgccgtgat ttacggcccc 780
gacggcgccc cgctcgcggc gccgctgccg cccgacaagg aaggcttgct ctacgccgac 840
atcgatctcg ggatgatttc ggttgccaaa gcggcagccg atccggccgg gcattatgca 900
cgccccgacg tcacccggct tctgttcaac aatcggcctg ggtatcgggt cgagaccatg 960
gcgttgccga tcgatgcgga gaccaaggcg gaagcaccgg ctaagccgga acccaaggca 1020
ccgaacgtgg cgccgttcgc gccggtgcaa gcggccgagt ga 1062
<210>212
<211>353
<212>PRT
<213>未知
<220>
<223>从环境样品获得
Met Gly Val Ala His Pro Lys Tyr Lys Val Ala Ala Val Gln Ala Ala
1 5 10 15
Pro Ala Phe Leu Asp Leu Asp Ala Ser Val Glu Lys Ala Val Arg Phe
20 25 30
Ile Asp Glu Ala Gly Ala Ala Gly Ala Arg Leu Ile Ala Phe Pro Glu
35 40 45
Thr Trp Ile Pro Gly Tyr Pro Trp Trp Ile Trp Leu Gly Ala Pro Ala
50 55 60
Trp Ala Ile Met Arg Gly Phe Val Ser Arg Tyr Phe Asp Asn Ser Leu
65 70 75 80
Ser Tyr Asp Ser Pro Gln Ala Glu Lys Leu Arg Ala Ala Ala Lys Arg
85 90 95
Asn Lys Met Val Val Val Leu Gly Leu Ser Glu Arg Asp Gly Gly Ser
100 105 110
Leu Tyr Ile Ala Gln Trp Ile Ile Gly Pro Asp Gly Glu Thr Ile Ala
115 120 125
Lys Arg Arg Lys Leu Lys Pro Thr His Ala Glu Arg Thr Val Phe Gly
130 135 140
Glu Gly Asp Gly Ser His Leu Ala Val His Glu Leu Asp Val Gly Arg
145 150 155 160
Leu Gly Ala Leu Cys Cys Trp Glu His Leu Gln Pro Leu Ser Lys Tyr
165 170 175
Ala Met Tyr Ala Gln Asn Glu Gln Val His Val Ala Ala Trp Pro Ser
180 185 190
Phe Ser Leu Tyr Asp Pro Phe Ala His Ala Leu Gly Ala Glu Val Asn
195 200 205
Asn Ala Ala Ser Lys Ile Tyr Ala Val Glu Gly Ser Cys Phe Val Ile
210 215 220
Ala Pro Cys Ala Thr Val Ser Gln Ala Met Ile Asp Glu Leu Cys Asp
225 230 235 240
Thr Pro Glu Lys His Gln Phe Leu His Ala Gly Gly Gly Phe Ala Val
245 250 255
Ile Tyr Gly Pro Asp Gly Ala Pro Leu Ala Ala Pro Leu Pro Pro Asp
260 265 270
Lys Glu Gly Leu Leu Tyr Ala Asp Ile Asp Leu Gly Met Ile Ser Val
275 280 285
Ala Lys Ala Ala Ala Asp Pro Ala Gly His Tyr Ala Arg Pro Asp Val
290 295 300
Thr Arg Leu Leu Phe Asn Asn Arg Pro Gly Tyr Arg Val Glu Thr Met
305 310 315 320
Ala Leu Pro Ile Asp Ala Glu Thr Lys Ala Glu Ala Pro Ala Lys Pro
325 330 335
Glu Pro Lys Ala Pro Asn Val Ala Pro Phe Ala Pro Val Gln Ala Ala
340 345 350
Glu
<210>213
<211>993
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>213
atgagagttg ttaaagccgc tgctgtccaa ctgagtcccg tcctctatag tcgcgaggga 60
acggtcgaaa aggtcgtgcg gaagatccat gaacttgccg aagagggagt cgagttcgcc 120
acctttcctg agaccgtggt gccttactac ccgtactttt ccttcgttca gacgcccttg 180
gagcaaatct tcggaacaga gtatctgagg ctgctcgacc aggcagtcac cgtgccatcc 240
cctgccaccg acgcgatcgg cgaggcagcc aggttcgctg gagttgttgt ctcgatcggc 300
gtcaacgagc gagacggggg aactctatac aacactcagc ttctcttcga tgccgacggc 360
aggataattc agcggcgccg caagatcacg cccacccatt acgagcgcat gatctggggc 420
cagggcgacg gctcaggtct gcgggccgtt gatagcaagg ccggccgtat tggtcagctg 480
gcatgctggg agcacaacaa tccactggcg cgctacgcgc tgatggccga cggcgagcag 540
atccattccg ccatgtatcc gggctccatg ttcggcgact cgtttgccca gaagaccgaa 600
atcaatatcc ggcagcatgc cctagagtct gggtgcttcg tcgtgaacgc aacggcctgg 660
ctggacggcg atcagcaggc gcatatcatg aaggacaccg gctgcagcat cggcccgatc 720
tccggcggtt gcttcactgc gatcgtcgca cccgatggta gcctgctggg cgaacccatc 780
cgttccggtg agggcgtggt catcgccgac ctcgacttca cgttgatcga caggcgtaag 840
caggtgatgg actcgcgagg ccattacagc cggccggagt tgctcagcct cttaatagac 900
cgcaccccta ccgcgcactt tcacgaacgc gcttcgcccc ccacgacaga agctgagcaa 960
ggctccgagg atgtgttcga ggctcgcatt taa 993
<210>214
<211>330
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>214
Met Arg Val Val Lys Ala Ala Ala Val Gln Leu Ser Pro Val Leu Tyr
1 5 10 15
Ser Arg Glu Gly Thr Val Glu Lys Val Val Arg Lys Ile His Glu Leu
20 25 30
Ala Glu Glu Gly Val Glu Phe Ala Thr Phe Pro Glu Thr Val Val Pro
35 40 45
Tyr Tyr Pro Tyr Phe Ser Phe Val Gln Thr Pro Leu Glu Gln Ile Phe
50 55 60
Gly Thr Glu Tyr Leu Arg Leu Leu Asp Gln Ala Val Thr Val Pro Ser
65 70 75 80
Pro Ala Thr Asp Ala Ile Gly Glu Ala Ala Arg Phe Ala Gly Val Val
85 90 95
Val Ser Ile Gly Val Asn Glu Arg Asp Gly Gly Thr Leu Tyr Asn Thr
100 105 110
Gln Leu Leu Phe Asp Ala Asp Gly Arg Ile Ile Gln Arg Arg Arg Lys
115 120 125
Ile Thr Pro Thr His Tyr Glu Arg Met Ile Trp Gly Gln Gly Asp Gly
130 135 140
Ser Gly Leu Arg Ala Val Asp Ser Lys Ala Gly Arg Ile Gly Gln Leu
145 150 155 160
Ala Cys Trp Glu His Asn Asn Pro Leu Ala Arg Tyr Ala Leu Met Ala
165 170 175
Asp Gly Glu Gln Ile His Ser Ala Met Tyr Pro Gly Ser Met Phe Gly
180 185 190
Asp Ser Phe Ala Gln Lys Thr Glu Ile Asn Ile Arg Gln His Ala Leu
195 200 205
Glu Ser Gly Cys Phe Val Val Asn Ala Thr Ala Trp Leu Asp Gly Asp
210 215 220
Gln Gln Ala His Ile Met Lys Asp Thr Gly Cys Ser Ile Gly Pro Ile
225 230 235 240
Ser Gly Gly Cys Phe Thr Ala Ile Val Ala Pro Asp Gly Ser Leu Leu
245 250 255
Gly Glu Pro Ile Arg Ser Gly Glu Gly Val Val Ile Ala Asp Leu Asp
260 265 270
Phe Thr Leu Ile Asp Arg Arg Lys Gln Val Met Asp Ser Arg Gly His
275 280 285
Tyr Ser Arg Pro Glu Leu Leu Ser Leu Leu Ile Asp Arg Thr Pro Thr
290 295 300
Ala His Phe His Glu Arg Ala Ser Pro Pro Thr Thr Glu Ala Glu Gln
305 310 315 320
Gly Ser Glu Asp Val Phe Glu Ala Arg Ile
325 330
<210>215
<211>1008
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>215
atgagtatta cccaccccaa atttaaagct gctgttgttc aagctgcccc agtatttcta 60
gacctagatg ggtctgttaa taaggcgatt aatctcattg atgaagctgc cgctgccgga 120
gccaagctca ttgccttccc tgaaaccttc attccaggct atccatggtg gatttggctg 180
ggatcgccgg cgtgggctct gggccggggg ttcgttcagc gttacttcga caattccctg 240
cagtacgaca gcccgcaggc ggatcgctta cgcgaggcgg cacgacgcaa cagcattacg 300
gtcgtgctgg gcttgtccga gcgtgatggc ggttctctct atatcgcaca gtggctgatc 360
ggcccggatg gcgaaaccat cgcgcagcgg cgcaagcttc gtcctactca tggggagcgc 420
acggtattcg gtgaagggga tggcagcgat ctggtggttc atcaaaccga actgggccgt 480
cttggcgcgc ttaactgctg ggagaacatc ctgtctctga acaaatatgt gatgtactcc 540
cagcatgaac aggtccatgt agcatcctgg cccagtttct cgacgtatga accgttcgcg 600
catgcgctcg gctatgaggt aaacaacgca attagccagg tctatgcggt ggaaggcggg 660
tgcttcgtgt tggccccgtg ctctaccatc tctgaagaaa tgattgccga actgtgcgat 720
acacccgata aattcgagct gacgcatgct ggtggcggcc acgcaatcat ctatggtccg 780
gacggtcgtg ctctgtgcga aaagctgccc gagaaccagg agggcctgct gtacgcggaa 840
atcgatctgg gggtgatttc tatggcaaaa agtgccatgg atcctgtcgg ccattactct 900
cgccccgatg tctaccgtgt gctgttcaat aagatcccgg caaagcgtat cgagcacttc 960
aatttgccgt tggatgagca agcaggggaa gagccaccag ctgattaa 1008
<210>216
<211>335
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>216
Met Ser Ile Thr His Pro Lys Phe Lys Ala Ala Val Val Gln Ala Ala
1 5 10 15
Pro Val Phe Leu Asp Leu Asp Gly Ser Val Asn Lys Ala Ile Asn Leu
20 25 30
Ile Asp Glu Ala Ala Ala Ala Gly Ala Lys Leu Ile Ala Phe Pro Glu
35 40 45
Thr Phe Ile Pro Gly Tyr Pro Trp Trp Ile Trp Leu Gly Ser Pro Ala
50 55 60
Trp Ala Leu Gly Arg Gly Phe Val Gln Arg Tyr Phe Asp Asn Ser Leu
65 70 75 80
Gln Tyr Asp Ser Pro Gln Ala Asp Arg Leu Arg Glu Ala Ala Arg Arg
85 90 95
Asn Ser Ile Thr Val Val Leu Gly Leu Ser Glu Arg Asp Gly Gly Ser
100 105 110
Leu Tyr Ile Ala Gln Trp Leu Ile Gly Pro Asp Gly Glu Thr Ile Ala
115 120 125
Gln Arg Arg Lys Leu Arg Pro Thr His Gly Glu Arg Thr Val Phe Gly
130 135 140
Glu Gly Asp Gly Ser Asp Leu Val Val His Gln Thr Glu Leu Gly Arg
145 150 155 160
Leu Gly Ala Leu Asn Cys Trp Glu Asn Ile Leu Ser Leu Asn Lys Tyr
165 170 175
Val Met Tyr Ser Gln His Glu Gln Val His Val Ala Ser Trp Pro Ser
180 185 190
Phe Ser Thr Tyr Glu Pro Phe Ala His Ala Leu Gly Tyr Glu Val Asn
195 200 205
Asn Ala Ile Ser Gln Val Tyr Ala Val Glu Gly Gly Cys Phe Val Leu
210 215 220
Ala Pro Cys Ser Thr Ile Ser Glu Glu Met Ile Ala Glu Leu Cys Asp
225 230 235 240
Thr Pro Asp Lys Phe Glu Leu Thr His Ala Gly Gly Gly His Ala Ile
245 250 255
Ile Tyr Gly Pro Asp Gly Arg Ala Leu Cys Glu Lys Leu Pro Glu Asn
260 265 270
Gln Glu Gly Leu Leu Tyr Ala Glu Ile Asp Leu Gly Val Ile Ser Met
275 280 285
Ala Lys Ser Ala Met Asp Pro Val Gly His Tyr Ser Arg Pro Asp Val
290 295 300
Tyr Arg Val Leu Phe Asn Lys Ile Pro Ala Lys Arg Ile Glu His Phe
305 310 315 320
Asn Leu Pro Leu Asp Glu Gln Ala Gly Glu Glu Pro Pro Ala Asp
325 330 335
<210>217
<211>1011
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>217
gtgggtatca gccatccgaa attcaaggct gcggtcgtac aggccgggcc tgccttcctc 60
gacctcgacg gcggcgtcga acgagccgtg tcgctcatcg gccaagcagc ggccgaaggg 120
gcacagctga ttgcctttcc tgaaacgtgg attcccggtt acccgtggca cacctggctt 180
ggcagcccgg cgtgggcaat ggaaaaaggc tttgtccaac gatatttcga caacgcgttg 240
cggcatggtt ctccgcaagc cgagcgaatc tccggggctg cggcggagca caagattatg 300
gtgtcgcttg ggtttgcgga acgcgatgga ggcacgcttt atatcgcgca gtggctcatc 360
ggacccgacg gccaaactat ctcacgacgg cggaagctta agccgactca cgtcgagcgc 420
actgtatttg gcgagggaga cggaagcgat ctctccgtgc atgatacggc gcttggacgt 480
atcggctcac tttgttgctg ggagcatttg caaccgttgt cgaaatacgc aatgtacgcc 540
cagaatgaac agattcacat tggcgcatgg cccagctttt cgctatacca gccatttgcg 600
aatgcgctga gtcccgaagt caatatcgca gtaagccgcg tgtacgccgt ggaaggccag 660
tgtttcttcc tcgcgccgtg cgcgacggtt tcggacgcca tgatcgaaac actgtgcgat 720
acgcccgaaa agcagggact gattcgggcg ggtggcgggc acgccgcgat cttcggccca 780
gatggaagtc tgctgacgcc tacggtagcg gatacttacg agggcctgct gtatgcagaa 840
ctcgacctcg gcgtcatttc gatcgccaag agtgcagcgg accccgccgg ccactattcg 900
cggccagatg tcacacgcct tctattgaat cagacgcctt cgaagcgcgt tcagaatatg 960
gtgttaccac tggagacggt cacggagccc gaaggcccgg ttcagcccta g 1011
<210>218
<211>336
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>218
Val Gly Ile Ser His Pro Lys Phe Lys Ala Ala Val Val Gln Ala Gly
1 5 10 15
Pro Ala Phe Leu Asp Leu Asp Gly Gly Val Glu Arg Ala Val Ser Leu
20 25 30
Ile Gly Gln Ala Ala Ala Glu Gly Ala Gln Leu Ile Ala Phe Pro Glu
35 40 45
Thr Trp Ile Pro Gly Tyr Pro Trp His Thr Trp Leu Gly Ser Pro Ala
50 55 60
Trp Ala Met Glu Lys Gly Phe Val Gln Arg Tyr Phe Asp Asn Ala Leu
65 70 75 80
Arg His Gly Ser Pro Gln Ala Glu Arg Ile Ser Gly Ala Ala Ala Glu
85 90 95
His Lys Ile Met Val Ser Leu Gly Phe Ala Glu Arg Asp Gly Gly Thr
100 105 110
Leu Tyr Ile Ala Gln Trp Leu Ile Gly Pro Asp Gly Gln Thr Ile Ser
115 120 125
Arg Arg Arg Lys Leu Lys Pro Thr His Val Glu Arg Thr Val Phe Gly
130 135 140
Glu Gly Asp Gly Ser Asp Leu Ser Val His Asp Thr Ala Leu Gly Arg
145 150 155 160
Ile Gly Ser Leu Cys Cys Trp Glu His Leu Gln Pro Leu Ser Lys Tyr
165 170 175
Ala Met Tyr Ala Gln Asn Glu Gln Ile His Ile Gly Ala Trp Pro Ser
180 185 190
Phe Ser Leu Tyr Gln Pro Phe Ala Asn Ala Leu Ser Pro Glu Val Asn
195 200 205
Ile Ala Val Ser Arg Val Tyr Ala Val Glu Gly Gln Cys Phe Phe Leu
210 215 220
Ala Pro Cys Ala Thr Val Ser Asp Ala Met Ile Glu Thr Leu Cys Asp
225 230 235 240
Thr Pro Glu Lys Gln Gly Leu Ile Arg Ala Gly Gly Gly His Ala Ala
245 250 255
Ile Phe Gly Pro Asp Gly Ser Leu Leu Thr Pro Thr Val Ala Asp Thr
260 265 270
Tyr Glu Gly Leu Leu Tyr Ala Glu Leu Asp Leu Gly Val Ile Ser Ile
275 280 285
Ala Lys Ser Ala Ala Asp Pro Ala Gly His Tyr Ser Arg Pro Asp Val
290 295 300
Thr Arg Leu Leu Leu Asn Gln Thr Pro Ser Lys Arg Val Gln Asn Met
305 310 315 320
Val Leu Pro Leu Glu Thr Val Thr Glu Pro Glu Gly Pro Val Gln Pro
325 330 335
<210>219
<211>996
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>219
atgaaaatcg tcaaagctgc cgccgttcaa attagtcccg tgctttatag ccgcgatgga 60
acggtggaaa aagttgtgcg gaagatccat gagctcggcc aacagggagt gcagttcgca 120
accttccccg agaccgtgat tccgtactat ccctattttt cctttctgca acccgcctac 180
cagatcgcgg ccgggcagga gcacctcaag ttgctcgacc aggcggtgac ggtgccgtcc 240
gctgccacgc acgcgatcgg ccaggcatgc aaacaggcag gagtggttgt ttcgatcggc 300
atcaatgagc gggacaatgg aacgatctac aacacccagc ttctcttcga ctccgacgga 360
accctgctgc agcggcgccg caagatctca cccacttacc acgaacggat gatctggggc 420
cagggcgacg gttccggcct gcgcgccgtg gatagcaagg ccggccggat tgggcagctc 480
gcctgttggg agcactacaa ccctttggcg cgctacgcga tgatggccga cggcgagcag 540
atccattccg ccatgtaccc tggctccatc ttcggcgatc tgtttgccga acagacccag 600
atcaacgtcc ggcagcacgc gttggagtcc ggctgtttcg tagtctgctc caccgcctgg 660
ttggatcccg atcaacaagc gcaaatcgtc aaggacacag gcggcgccat tggtgcgatc 720
tcgggcggct gtttcacggc gattgtggcc cccgatagca ctctgctcgg agagccgatc 780
cgctccggcg aaggcgtggt cattgcggat ctcgacttca cccggatcga caagcgtaaa 840
cagttgatgg attcccgagg tcattacagc cggccggaat tgctgagtct gctgattgat 900
cgcacttcca ccgcacatgt gcaaaaccgc gtgccggccg atccattcgg cgccggctgt 960
gttgcagaac caggagtaga aacatgccca cgatag 996
<210>220
<211>331
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>220
Met Lys Ile Val Lys Ala Ala Ala Val Gln Ile Ser Pro Val Leu Tyr
1 5 10 15
Ser Arg Asp Gly Thr Val Glu Lys Val Val Arg Lys Ile His Glu Leu
20 25 30
Gly Gln Gln Gly Val Gln Phe Ala Thr Phe Pro Glu Thr Val Ile Pro
35 40 45
Tyr Tyr Pro Tyr Phe Ser Phe Leu Gln Pro Ala Tyr Gln Ile Ala Ala
50 55 60
Gly Gln Glu His Leu Lys Leu Leu Asp Gln Ala Val Thr Val Pro Ser
65 70 75 80
Ala Ala Thr His Ala Ile Gly Gln Ala Cys Lys Gln Ala Gly Val Val
85 90 95
Val Ser Ile Gly Ile Asn Glu Arg Asp Asn Gly Thr Ile Tyr Asn Thr
100 105 110
Gln Leu Leu Phe Asp Ser Asp Gly Thr Leu Leu Gln Arg Arg Arg Lys
115 120 125
Ile Ser Pro Thr Tyr His Glu Arg Met Ile Trp Gly Gln Gly Asp Gly
130 135 140
Ser Gly Leu Arg Ala Val Asp Ser Lys Ala Gly Arg Ile Gly Gln Leu
145 150 155 160
Ala Cys Trp Glu His Tyr Asn Pro LeH Ala Arg Tyr Ala Met Met Ala
165 170 175
Asp Gly Glu Gln Ile His Ser Ala Met Tyr Pro Gly Ser Ile Phe Gly
180 185 190
Asp Leu Phe Ala Glu Gln Thr Gln Ile Asn Val Arg Gln His Ala Leu
195 200 205
Glu Ser Gly Cys Phe Val Val Cys Ser Thr Ala Trp Leu Asp Pro Asp
210 215 220
Gln Gln Ala Gln Ile Val Lys Asp Thr Gly Gly Ala Ile Gly Ala Ile
225 230 235 240
Ser Gly Gly Cys Phe Thr Ala Ile Val Ala Pro Asp Ser Thr Leu Leu
245 250 255
Gly Glu Pro Ile Arg Ser Gly Glu Gly Val Val Ile Ala Asp Leu Asp
260 265 270
Phe Thr Arg Ile Asp Lys Arg Lys Gln Leu Met Asp Ser Arg Gly His
275 280 285
Tyr Ser Arg Pro Glu Leu Leu Ser Leu Leu Ile Asp Arg Thr Ser Thr
290 295 300
Ala His Val Gln Asn Arg Val Pro Ala Asp Pro Phe Gly Ala Gly Cys
305 310 315 320
Val Ala Glu Pro Gly Val Glu Thr Cys Pro Arg
325 330
<210>221
<211>1146
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>221
atgttgacct acaaaggcgt gttcaaagca gcgacagttc aggcggagcc ggtgtggatg 60
gacgctgacg ccaccattac caaagccatc cggatcatcg aagaggcggc tgataacggg 120
gcgaagtttg tcgcattccc ggaggtgttc atccctggat acccttggtg gatctggctg 180
ggcactgcga tgtggggagc caaattcgtc gtgcccttcc acgaaaattg ccttgagctt 240
ggcgacaagc ggatgcagcg cattcaagct gcggcaaaac aaaatggcat tgcactcgtc 300
atgggttacg gtgaacgtga cggaggtagc cgctacatga gccaggtatt catcgacgat 360
agcggaaaaa ttgtggcgaa ccgccgcaag ctcaaaccga cgcacgaaga gcgaacgatt 420
ttcggcgagg gaaacggatc cgatttcatc acccatgact tcccatttgc gagagtcgga 480
ggcttcaact gctgggaaca ccttcagccc ctgagcaaat acatgatgta cagcctgcag 540
gaacaggtgc atgtcgcgtc gtggccggcc atgtgcacct atcagcctga cgtgccccag 600
ttaggtgcag gcgctaacga ggcggtcacc cgctcttacg cgatcgaagg cgcatgttat 660
gtgctgggag cgacactcgt tatcggcaag gcagcccatg atgcattctg cgacactgaa 720
gagcatcaca agctgcttgg catgggcgga gggtgggccc gaattttcgg cccggacggt 780
gagtatctcg cagaatcgct cgctcatgac gcggagggca tcctgtacgc tgacattgat 840
ctgagcaaga tcctgctcgc caaagcaaac acggacacag ttggccacta cgcgcgacct 900
gacgtcttgt cattgctcgt caacacacac aatcccgggc ctgtacgcta tcttgacgaa 960
gagggccggc aggtttcaac gtcaattcgc aggcacgaaa aactggaagg ccagagcctc 1020
gacctcgagg ttacgccagc gacaccagcg accctggaca tcgcgagcct ggtacaacag 1080
gctaagccat cgaccgttaa gagcgagagt aacgcgagca caaaacaacc ggatctcgcg 1140
gtctga 1146
<210>222
<211>381
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>222
Met Leu Thr Tyr Lys Gly Val Phe Lys Ala Ala Thr Val Gln Ala Glu
1 5 10 15
Pro Val Trp Met Asp Ala Asp Ala Thr Ile Thr Lys Ala Ile Arg Ile
20 25 30
Ile Glu Glu Ala Ala Asp Asn Gly Ala Lys Phe Val Ala Phe Pro Glu
35 40 45
Val Phe Ile Pro Gly Tyr Pro Trp Trp Ile Trp Leu Gly Thr Ala Met
50 55 60
Trp Gly Ala Lys Phe Val Val Pro Phe His Glu Asn Cys Leu Glu Leu
65 70 75 80
Gly Asp Lys Arg Met Gln Arg Ile Gln Ala Ala Ala Lys Gln Asn Gly
85 90 95
Ile Ala Leu Val Met Gly Tyr Gly Glu Arg Asp Gly Gly Ser Arg Tyr
100 105 110
Met Ser Gln Val Phe Ile Asp Asp Ser Gly Lys Ile Val Ala Asn Arg
115 120 125
Arg Lys Leu Lys Pro Thr His Glu Glu Arg Thr Ile Phe Gly Glu Gly
130 135 140
Asn Gly Ser Asp Phe Ile Thr His Asp Phe Pro Phe Ala Arg Val Gly
145 150 155 160
Gly Phe Asn Cys Trp Glu His Leu Gln Pro Leu Ser Lys Tyr Met Met
165 170 175
Tyr Ser Leu Gln Glu Gln Val His Val Ala Ser Trp Pro Ala Met Cys
180 185 190
Thr Tyr Gln Pro Asp Val Pro Gln Leu Gly Ala Gly Ala Asn Glu Ala
195 200 205
Val Thr Arg Ser Tyr Ala Ile Glu Gly Ala Cys Tyr Val Leu Gly Ala
210 215 220
Thr Leu Val Ile Gly Lys Ala Ala His Asp Ala Phe Cys Asp Thr Glu
225 230 235 240
Glu His His Lys Leu Leu Gly Met Gly Gly Gly Trp Ala Arg Ile Phe
245 250 255
Gly Pro Asp Gly Glu Tyr Leu Ala Glu Ser Leu Ala His Asp Ala Glu
260 265 270
Gly Ile Leu Tyr Ala Asp Ile Asp Leu Ser Lys Ile Leu Leu Ala Lys
275 280 285
Ala Asn Thr Asp Thr Val Gly His Tyr Ala Arg Pro Asp Val Leu Ser
290 295 300
Leu Leu Val Asn Thr His Asn Pro Gly Pro Val Arg Tyr Leu Asp Glu
305 310 315 320
Glu Gly Arg Gln Val Ser Thr Ser Ile Arg Arg His Glu Lys Leu Glu
325 330 335
Gly Gln Ser Leu Asp Leu Glu Val Thr Pro Ala Thr Pro Ala Thr Leu
340 345 350
Asp Ile Ala Ser Leu Val Gln Gln Ala Lys Pro Ser Thr Val Lys Ser
355 360 365
Glu Ser Asn Ala Ser Thr Lys Gln Pro Asp Leu Ala Val
370 375 380
<210>223
<211>996
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>223
atgagagttg ttaaagccag cgcagttcaa ctgaagcctg tactttatag ccgcgagggg 60
acggtcgaaa aggtcgtagc gaagattcat gagctagggc agcagggtgt gcagttcgcc 120
gcgttccctg agaccgtggt gccttattac ccgtactttt cgatcgtgca gtccggctat 180
caaatccttc gcggcggtga gttcgtcaag ctcctcgatc agtccgtgac ggtgccatct 240
tatagcactg aagccatcgg cgaagcctgc aggcaggagg agatggttgt ctccataggc 300
gtcaacgagc gtgatggcgg aacgatctac aacgcgcagt tactctttga ttccgacggc 360
acgttgatcc aaagacgacg caagatcacc cccacccatt acgaacgcat gatctggggc 420
cagggcgatg gctcaggtct gcgcgctgtt gacagcaatg tcgcacgcat tggtcaactg 480
gcatgctttg agcactacaa ccctcttgcg cggtacgcga tgatggccga tggcgagcag 540
atccattccg ccatgttccc cggttccatg ttcggcgatg gttttgcgga gaggacggaa 600
attgccgtaa ggcaacatgc gatggagtcc gggtgctttg tcgtttgcgc tacggcctgg 660
ctcgatcccg gccagcaggc tcagatcgcc aacgacaccg gtatcaccga catcggcccg 720
atctccgggg gttgcttcac tgcgatcatc gcacccgatg ggagcctgct gggccaacct 780
atccgctcgg gtgaaggcga agtcatcgtc gacctcgatt tcacgttaat tgacaagcgg 840
aaacatattg tcgactcgag aggacattac agccggccag aattgctgag cctgctgatc 900
gatcgcactc ccaccgcgca ccttcacgac cgcgctgtgc agcacaatgc cggatcggaa 960
ggagcgtcgg aacatcttcg cgaagacgcc gcctga 996
<210>224
<211>331
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>224
Met Arg Val Val Lys Ala Ser Ala Val Gln Leu Lys Pro Val Leu Tyr
1 5 10 15
Ser Arg Glu Gly Thr Val Glu Lys Val Val Ala Lys Ile His Glu Leu
20 25 30
Gly Gln Gln Gly Val Gln Phe Ala Ala Phe Pro Glu Thr Val Val Pro
35 40 45
Tyr Tyr Pro Tyr Phe Ser Ile Val Gln Ser Gly Tyr Gln Ile Leu Arg
50 55 60
Gly Gly Glu Phe Val Lys Leu Leu Asp Gln Ser Val Thr Val Pro Ser
65 70 75 80
Tyr Ser Thr Glu Ala Ile Gly Glu Ala Cys Arg Gln Glu Glu Met Val
85 90 95
Val Ser Ile Gly Val Asn Glu Arg Asp Gly Gly Thr Ile Tyr Asn Ala
100 105 110
Gln Leu Leu Phe Asp Ser Asp Gly Thr Leu Ile Gln Arg Arg Arg Lys
115 120 125
Ile Thr Pro Thr His Tyr Glu Arg Met Ile Trp Gly Gln Gly Asp Gly
130 135 140
Ser Gly Leu Arg Ala Val Asp Ser Asn Val Ala Arg Ile Gly Gln Leu
145 150 155 160
Ala Cys Phe Glu His Tyr Asn Pro Leu Ala Arg Tyr Ala Met Met Ala
165 170 175
Asp Gly Glu Gln Ile His Ser Ala Met Phe Pro Gly Ser Met Phe Gly
180 185 190
Asp Gly Phe Ala Glu Arg Thr Glu Ile Ala Val Arg Gln His Ala Met
195 200 205
Glu Ser Gly Cys Phe Val Val Cys Ala Thr Ala Trp Leu Asp Pro Gly
210 215 220
Gln Gln Ala Gln Ile Ala Asn Asp Thr Gly Ile Thr Asp Ile Gly Pro
225 230 235 240
Ile Ser Gly Gly Cys Phe Thr Ala Ile Ile Ala Pro Asp Gly Ser Leu
245 250 255
Leu Gly Gln Pro Ile Arg Ser Gly Glu Gly Glu Val Ile Val Asp Leu
260 265 270
Asp Phe Thr Leu Ile Asp Lys Arg Lys His Ile Val Asp Ser Arg Gly
275 280 285
His Tyr Ser Arg Pro Glu Leu Leu Ser Leu Leu Ile Asp Arg Thr Pro
290 295 300
Thr Ala His Leu His Asp Arg Ala Val Gln His Asn Ala Gly Ser Glu
305 310 315 320
Gly Ala Ser Glu His Leu Arg Glu Asp Ala Ala
325 330
<210>225
<211>951
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>225
atgaccaccg tcaaagccgc cgctgtccag atgagtcccg tgctttacag tcgcgacgat 60
accatcgaga aaatttgtcg acagatcatc gagcttggcc gacagggagt gcagttcgcc 120
acgtttccag agacagttat tccgtattat ccatatttcg ccttcgtgca gcggccttac 180
gaaatgtcgg cccaatatca tcagttgctt gatcaagcgg tgaccgtgcc ttccggttca 240
acgcacgcca ttggggccgc ctgtaaacaa gccggaattg ttgtctcaat cggcgtcaac 300
gagcgggagg gcggcacgct ctatggcaca cagttgctgt ttgatgcaga cggcctcttg 360
atccagcgtc gccgcaagat cactccgacc taccacgagc ggatgatttg gggacagggt 420
gacggttccg ggctgcgagc cgtagatagc gcggtcggtc gaatcggaca gttggcgtgt 480
tgggaacacc acaatccgct ggctcgttat gctttagcgg ccgatggcga acaaattcac 540
gcggcgatgt accctggctc gatcttgggt gaactatttg ccgagcagat tcaggtcaac 600
atccggcagc acgccatgga atctggttgc ttcgtcgtga acgccacggc ctggctaagc 660
gaggaacagc aagcccgaat catgaaggac accggatcat tcgatagccc aatcaccggt 720
ggttgcttta ccgccattgt cgcgcccaac gggcagataa tcggtgaacc gctgcgcatc 780
ggcgaaggcg tcgtgattgc cgatttggac ttcgctttga ttgatgagag gaagcggctg 840
atggactcac gcggcctcta tagccgccct gagttgctaa gcttgttaat cgacagaatg 900
cctacatccc atgtgcatga acgggttgag cgtagcatgg cgatggcatg a 951
<210>226
<211>316
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>226
Met Thr Thr Val Lys Ala Ala Ala Val Gln Met Ser Pro Val Leu Tyr
1 5 10 15
Ser Arg Asp Asp Thr Ile Glu Lys Ile Cys Arg Gln Ile Ile Glu Leu
20 25 30
Gly Arg Gln Gly Val Gln Phe Ala Thr Phe Pro Glu Thr Val Ile Pro
35 40 45
Tyr Tyr Pro Tyr Phe Ala Phe Val Gln Arg Pro Tyr Glu Met Ser Ala
50 55 60
Gln Tyr His Gln Leu Leu Asp Gln Ala Val Thr Val Pro Ser Gly Ser
65 70 75 80
Thr His Ala Ile Gly Ala Ala Cys Lys Gln Ala Gly Ile Val Val Ser
85 90 95
Ile Gly Val Asn Glu Arg Glu Gly Gly Thr Leu Tyr Gly Thr Gln Leu
100 105 110
Leu Phe Asp Ala Asp Gly Leu Leu Ile Gln Arg Arg Arg Lys Ile Thr
115 120 125
Pro Thr Tyr His Glu Arg Met Ile Trp Gly Gln Gly Asp Gly Ser Gly
130 135 140
Leu Arg Ala Val Asp Ser Ala Val Gly Arg Ile Gly Gln Leu Ala Cys
145 150 155 160
Trp Glu His His Asn Pro Leu Ala Arg Tyr Ala Leu Ala Ala Asp Gly
165 170 175
Glu Gln Ile His Ala Ala Met Tyr Pro Gly Ser Ile Leu Gly Glu Leu
180 185 190
Phe Ala Glu Gln Ile Gln Val Asn Ile Arg Gln His Ala Met Glu Ser
195 200 205
Gly Cys Phe Val Val Asn Ala Thr Ala Trp Leu Ser Glu Glu Gln Gln
210 215 220
Ala Arg Ile Met Lys Asp Thr Gly Ser Phe Asp Ser Pro Ile Thr Gly
225 230 235 240
Gly Cys Phe Thr Ala Ile Val Ala Pro Asn Gly Gln Ile Ile Gly Glu
245 250 255
Pro Leu Arg Ile Gly Glu Gly Val Val Ile Ala Asp Leu Asp Phe Ala
260 265 270
Leu Ile Asp Glu Arg Lys Arg Leu Met Asp Ser Arg Gly Leu Tyr Ser
275 280 285
Arg Pro Glu Leu Leu Ser Leu Leu Ile Asp Arg Met Pro Thr Ser His
290 295 300
Val His Glu Arg Val Glu Arg Ser Met Ala Met Ala
305 310 315
<210>227
<211>1035
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>227
atgtccgatc gacgcatcgt tcgcgcggca gccattcaga tcgccccgga tctcgagcgc 60
tgcagtgtaa cgctcgagaa ggtttgctcg gcgatcgacg aagccgcggg caaaggcgct 120
caactgagcg tctttcccga gaccttcgtc ccgtactacc cctatttctc gtttgtccgc 180
cctccggtcg catcgggggc cgatcacatg cggctatacg aggaagcggt ggtggtgccg 240
ggtcccgtga cgcaggctgt ttccgaacgt gcgcgcatgc atgggatggt ggtggtgctc 300
ggcgtcaacg agcgcgacca cgggagcctc tacaacaccc aactgatttt cgattgcgat 360
ggtcggctcg ctctgaaacg ccgcaagatc acgccgacgt ttcacgagcg catgatctgg 420
ggccagggcg acgccagtgg actgaaagtc gcgcgcacgg gtatcgggcg ggtcggcgcg 480
ctcgcgtgct gggagcatta caacccgctc gcgcgctacg cgctgatgac ccagcacgaa 540
gagatccatt gcagtcaatt tccaggctcg ctcgtcggtc cgatcttctc cgagcagatg 600
gacgtgacga tccgccatca cgccctcgaa tcggggtgct tcgtcgtgaa cgcaaccggt 660
tggctcacgg acgcgcagat cgcctcgatc accgatgacc cgaagcttca acgggcgctg 720
cgcggtggct gcaacacggc gatcgtttca ccggaaggcc agcatctggc ggagccgttg 780
cgcgaaggcg aggggatggt ggtcgcggac ctcgacatgt cgctcatcac caagcgcaag 840
cgaatgatgg attcggtcgg ccactatgcg cggccggaac tgttgagcct cgcgatcaac 900
gatcgtcccg cagccccttt cggccggatg tgcgctgccg aagcaatgcg gggagccgac 960
gacgtcgtca cagtaggagc atttcatgag cgccagcgag aacgtgtcgg cgaagagccg 1020
gcaattgatg actga 1035
<210>228
<211>344
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>228
Met Ser Asp Arg Arg Ile Val Arg Ala Ala Ala Ile Gln Ile Ala Pro
1 5 10 15
Asp Leu Glu Arg Cys Ser Val Thr Leu Glu Lys Val Cys Ser Ala Ile
20 25 30
Asp Glu Ala Ala Gly Lys Gly Ala Gln Leu Ser Val Phe Pro Glu Thr
35 40 45
Phe Val Pro Tyr Tyr Pro Tyr Phe Ser Phe Val Arg Pro Pro Val Ala
50 55 60
Ser Gly Ala Asp His Met Arg Leu Tyr Glu Glu Ala Val Val Val Pro
65 70 75 80
Gly Pro Val Thr Gln Ala Val Ser Glu Arg Ala Arg Met His Gly Met
85 90 95
Val Val Val Leu Gly Val Asn Glu Arg Asp His Gly Ser Leu Tyr Asn
100 105 110
Thr Gln Leu Ile Phe Asp Cys Asp Gly Arg Leu Ala Leu Lys Arg Arg
115 120 125
Lys Ile Thr Pro Thr Phe His Glu Arg Met Ile Trp Gly Gln Gly Asp
130 135 140
Ala Ser Gly Leu Lys Val Ala Arg Thr Gly Ile Gly Arg Val Gly Ala
145 150 155 160
Leu Ala Cys Trp Glu His Tyr Asn Pro Leu Ala Arg Tyr Ala Leu Met
165 170 175
Thr Gln His Glu Glu Ile His Cys Ser Gln Phe Pro Gly Ser Leu Val
180 185 190
Gly Pro Ile Phe Ser Glu Gln Met Asp Val Thr Ile Arg His His Ala
195 200 205
Leu Glu Ser Gly Cys Phe Val Val Asn Ala Thr Gly Trp Leu Thr Asp
210 215 220
Ala Gln Ile Ala Ser Ile Thr Asp Asp Pro Lys Leu Gln Arg Ala Leu
225 230 235 240
Arg Gly Gly Cys Asn Thr Ala Ile Val Ser Pro Glu Gly Gln His Leu
245 250 255
Ala Glu Pro Leu Arg Glu Gly Glu Gly Met Val Val Ala Asp Leu Asp
260 265 270
Met Ser Leu Ile Thr Lys Arg Lys Arg Met Met Asp Ser Val Gly His
275 280 285
Tyr Ala Arg Pro Glu Leu Leu Ser Leu Ala Ile Asn Asp Arg Pro Ala
290 295 300
Ala Pro Phe Gly Arg Met Cys Ala Ala Glu Ala Met Arg Gly Ala Asp
305 310 315 320
Asp Val Val Thr Val Gly Ala Phe His Glu Arg Gln Arg Glu Arg Val
325 330 335
Gly Glu Glu Pro Ala Ile Asp Asp
340
<210>229
<211>975
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>229
atgcctgaaa acaaagtcaa agttgccctc gttcagcatc cgcctgtttt tctgaatttg 60
ccgaaaacgt tggaaaaagt agaaggtttg gcgcgagagt gcgccgccaa tgaagcgaaa 120
atcgttgtct ttcctgaaac ctggctgacc ggctatccgg tctggcttga tgaagcgccg 180
agagccgcgc tgtgggatta tccgcccgcc aagcgtttgt atcaatacct cacggaaaat 240
tcattgcaga ttccgagcgc tgagtttgaa tctttgcgcg aaatcgctaa gaaaaattcc 300
ctttatttag tcgttggagt tcacgaacga agcggcggaa cgctctacaa tacgataatt 360
tatctcacgc ccgacggcag ttataaaact caccgcaaat tggttccaac ttacacggaa 420
agacttgtct ggggcgcagg cgacggaagc ggtctaaatg ttgtggaaac gccttacggg 480
attctcggag gtttgatttg ctgggaacat tggatgcctc tggctagggc ggcaatgcat 540
tcaaaaaatg aagcgattca cgtttgccaa tttcccacgg ttcacgagcg acatcaaatc 600
gccagccgtc attacgcctt cgaagggcag tgttttgtct tgacttccgg ttgcgcgatg 660
acgaaaacgg atgttttgga aggttttgaa tcgctcgaaa caaacgacca cgaagttttc 720
gggcttttgg attcgataga aaaggaagaa ctgatgcgtg gcggaagcgc gattattgcg 780
cccgatttga gctattcggt cgagccggtt tttgacgaaa aaacgattgt ttacggcgaa 840
ttaaatctcg atttaaccaa gcagggacat ctgtttttgg ataccgacgg acattattcg 900
cgtcccgatg ttttcgagtt gcgcgtcaac gataaagcga accgaaacgt ccgttttgca 960
tccgaaacag tatag 975
<210>230
<211>324
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>230
Met Pro Glu Asn Lys Val Lys Val Ala Leu Val Gln His Pro Pro Val
1 5 10 15
Phe Leu Asn Leu Pro Lys Thr Leu Glu Lys Val Glu Gly Leu Ala Arg
20 25 30
Glu Cys Ala Ala Asn Glu Ala Lys Ile Val Val Phe Pro Glu Thr Trp
35 40 45
Leu Thr Gly Tyr Pro Val Trp Leu Asp Glu Ala Pro Arg Ala Ala Leu
50 55 60
Trp Asp Tyr Pro Pro Ala Lys Arg Leu Tyr Gln Tyr Leu Thr Glu Asn
65 70 75 80
Ser Leu Gln Ile Pro Ser Ala Glu Phe Glu Ser Leu Arg Glu Ile Ala
85 90 95
Lys Lys Asn Ser Leu Tyr Leu Val Val Gly Val His Glu Arg Ser Gly
100 105 110
Gly Thr Leu Tyr Asn Thr Ile Ile Tyr Leu Thr Pro Asp Gly Ser Tyr
115 120 125
Lys Thr His Arg Lys Leu Val Pro Thr Tyr Thr Glu Arg Leu Val Trp
130 135 140
Gly Ala Gly Asp Gly Ser Gly Leu Asn Val Val Glu Thr Pro Tyr Gly
145 150 155 160
Ile Leu Gly Gly Leu Ile Cys Trp Glu His Trp Met Pro Leu Ala Arg
165 170 175
Ala Ala Met His Ser Lys Asn Glu Ala Ile His Val Cys Gln Phe Pro
180 185 190
Thr Val His Glu Arg His Gln Ile Ala Ser Arg His Tyr Ala Phe Glu
195 200 205
Gly Gln Cys Phe Val Leu Thr Ser Gly Cys Ala Met Thr Lys Thr Asp
210 215 220
Val Leu Glu Gly Phe Glu Ser Leu Glu Thr Asn Asp His Glu Val Phe
225 230 235 240
Gly Leu Leu Asp Ser Ile Glu Lys Glu Glu Leu Met Arg Gly Gly Ser
245 250 255
Ala Ile Ile Ala Pro Asp Leu Ser Tyr Ser Val Glu Pro Val Phe Asp
260 265 270
Glu Lys Thr Ile Val Tyr Gly Glu Leu Asn Leu Asp Leu Thr Lys Gln
275 280 285
Gly His Leu Phe Leu Asp Thr Asp Gly His Tyr Ser Arg Pro Asp Val
290 295 300
Phe Glu Leu Arg Val Asn Asp Lys Ala Asn Arg Asn Val Arg Phe Ala
305 310 315 320
Ser Glu Thr Val
<210>231
<211>1062
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>231
atgagtcaga tccatccaaa acttaaggtc gcggccgtgc aggctgcccc cgcctttctc 60
gatctcgatg catcgatcga aaagacaata cgctatgtcg acgaagcggc tgcggccggg 120
gcgaagttga ttgcgtttcc ggaaacctgg attcccggct acccatggtg gatctggctc 180
ggcgctccgg cctgggcgat catgcgtggc ttcgtctcgc gctatttcga caactcgctg 240
caatatggca gtccggaagc tgaacggctg cgggacgccg ccaggcgcaa caagatatac 300
atcgccctcg gcctgtccga gcgcgacggg ggcagtcttt atatcgcgca atggatcatc 360
gggcctggcg gcgaaacggt tgcacaacgc cgcaagctca agcccacgca cgccgagcgc 420
actgtattcg gcgaaggcga tggttcacat ctggccgtgc atgatctcga tattggaaga 480
ttgggcgcgc tttgttgctg ggaacatctg caaccgttgt cgaaatatgc aatgtacgcc 540
cagaacgagc aaattcacgt cgccgcctgg ccgagcttct cgctatacga tccctttgca 600
cacgcactcg gcgccgaggt caataacgct gcgagcaaga tctatgcggt cgagggatcg 660
tgcttcgtca ttgcgccgtg cgcaacggtt tcgcaggtga tgatcgatga gctctgcgat 720
acccccgaaa agcatcaatt ccttcacgtc ggcggcggct tcgccgtcat ttacggtccc 780
gacggctcgc cactggccaa acctcttccg ccagaccagg agggacttct ctatgccgac 840
atcgatctcg gcatgatctc ggtcgccaag gccgcagccg atcccgccgg acattatgca 900
cgtcccgatg taactcgcct gctgttcaac aatcgcccgg caaaccgcgt cgagaagctg 960
gcgttgccgg tcgatcagga ggccgaagtg gatagtccgc tgaaggctcc cgacgcatct 1020
cccaaagtga cggcgctcaa gccgtcgcag gctgcggagt ag 1062
<210>232
<211>353
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>232
Met Ser Gln Ile His Pro Lys Leu Lys Val Ala Ala Val Gln Ala Ala
1 5 10 15
Pro Ala Phe Leu Asp Leu Asp Ala Ser Ile Glu Lys Thr Ile Arg Tyr
20 25 30
Val Asp Glu Ala Ala Ala Ala Gly Ala Lys Leu Ile Ala Phe Pro Glu
35 40 45
Thr Trp Ile Pro Gly Tyr Pro Trp Trp Ile Trp Leu Gly Ala Pro Ala
50 55 60
Trp Ala Ile Met Arg Gly Phe Val Ser Arg Tyr Phe Asp Asn Ser Leu
65 70 75 80
Gln Tyr Gly Ser Pro Glu Ala Glu Arg Leu Arg Asp Ala Ala Arg Arg
85 90 95
Asn Lys Ile Tyr Ile Ala Leu Gly Leu Ser Glu Arg Asp Gly Gly Ser
100 105 110
Leu Tyr Ile Ala Gln Trp Ile Ile Gly Pro Gly Gly Glu Thr Val Ala
115 120 125
Gln Arg Arg Lys Leu Lys Pro Thr His Ala Glu Arg Thr Val Phe Gly
130 135 140
Glu Gly Asp Gly Ser His Leu Ala Val His Asp Leu Asp Ile Gly Arg
145 150 155 160
Leu Gly Ala Leu Cys Cys Trp Glu His Leu Gln Pro Leu Ser Lys Tyr
165 170 175
Ala Met Tyr Ala Gln Asn Glu Gln Ile His Val Ala Ala Trp Pro Ser
180 185 190
Phe Ser Leu Tyr Asp Pro Phe Ala His Ala Leu Gly Ala Glu Val Asn
195 200 205
Asn Ala Ala Ser Lys Ile Tyr Ala Val Glu Gly Ser Cys Phe Val Ile
210 215 220
Ala Pro Cys Ala Thr Val Ser Gln Val Met Ile Asp Glu Leu Cys Asp
225 230 235 240
Thr Pro Glu Lys His Gln Phe Leu His Val Gly Gly Gly Phe Ala Val
245 250 255
Ile Tyr Gly Pro Asp Gly Ser Pro Leu Ala Lys Pro Leu Pro Pro Asp
260 265 270
Gln Glu Gly Leu Leu Tyr Ala Asp Ile Asp Leu Gly Met Ile Ser Val
275 280 285
Ala Lys Ala Ala Ala Asp Pro Ala Gly His Tyr Ala Arg Pro Asp Val
290 295 300
Thr Arg Leu Leu Phe Asn Asn Arg Pro Ala Asn Arg Val Glu Lys Leu
305 310 315 320
Ala Leu Pro Val Asp Gln Glu Ala Glu Val Asp Ser Pro Leu Lys Ala
325 330 335
Pro Asp Ala Ser Pro Lys Val Thr Ala Leu Lys Pro Ser Gln Ala Ala
340 345 350
Glu
<210>233
<211>1002
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>233
gtgagcacca tcgtcaaagc cgcggccgtg cagatcagcc ccgtgctgta cagccgcgag 60
ggcaccgtcg agcgggtcgt gaagaagatc cgggagctgg gcgaaaaggg cgtccagttc 120
gccaccttcc ccgagaccgt catcccttac tacccgtact tttccttcgt tcagacgccc 180
ttgcagatcc tcgccggccc cgagcatctg aagctgctcg accagtcggt gaccgtgccg 240
tcccccgcca cggacgcgat cggccaggcc gcccggcagg caggaatggt ggtgtccatc 300
ggcgtcaacg agcgtgacgg cggcaccctg tacaacacgc agctgctctt cgacgcggac 360
ggcgcgctga tccagcgtcg ccgcaagatc aagcccaccc actacgagcg catgatctgg 420
ggcgagggcg acggctccgg cctgcgcgcc gtcgacagcc aggtcggtcg tatcggccag 480
ctggcctgct gggagcacaa caaccccctg gcgcgctacg ccatgatggc cgacggcgag 540
cagatccatt cggccatgta tccgggctcg atgttcggcg acccgttcgc ccagaagacg 600
gaaatcaaca tccggcagca tgcgctggaa tccggatgct tcgtcgtctg ctcgacggcc 660
tggttggacg ccgatcagca ggcgcaaatc atgcaggaca cgggctgcgc catcggcccg 720
atctcgggcg gctgcctcac ggcgatcgtg gcgcccgacg gcacgttcct gggcgaaccg 780
ctcacgtcgg gcgagggcga ggtcatcgcc gacctcgatt tcaagctgat cgacaagcgc 840
aagcagacga tggactcgcg cggccactac aaccgccccg aactgctcag cctgctgatc 900
gatcgaacgc cgacgtcgaa cgtccatgag cgcgccgcgc acccgaaggt cgaggcgtca 960
caaacggctg gcgacacgga gcggacccgc gaggtcctgt aa 1002
<210>234
<211>333
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>234
Val Ser Thr Ile Val Lys Ala Ala Ala Val Gln Ile Ser Pro Val Leu
1 5 10 15
Tyr Ser Arg Glu Gly Thr Val Glu Arg Val Val Lys Lys Ile Arg Glu
20 25 30
Leu Gly Glu Lys Gly Val Gln Phe Ala Thr Phe Pro Glu Thr Val Ile
35 40 45
Pro Tyr Tyr Pro Tyr Phe Ser Phe Val Gln Thr Pro Leu Gln Ile Leu
50 55 60
Ala Gly Pro Glu His Leu Lys Leu Leu Asp Gln Ser Val Thr Val Pro
65 70 75 80
Ser Pro Ala Thr Asp Ala Ile Gly Gln Ala Ala Arg Gln Ala Gly Met
85 90 95
Val Val Ser Ile Gly Val Asn Glu Arg Asp Gly Gly Thr Leu Tyr Asn
100 105 110
Thr Gln Leu Leu Phe Asp Ala Asp Gly Ala Leu Ile Gln Arg Arg Arg
115 120 125
Lys Ile Lys Pro Thr His Tyr Glu Arg Met Ile Trp Gly Glu Gly Asp
130 135 140
Gly Ser Gly Leu Arg Ala Val Asp Ser Gln Val Gly Arg Ile Gly Gln
145 150 155 160
Leu Ala Cys Trp Glu His Asn Asn Pro Leu Ala Arg Tyr Ala Met Met
165 170 175
Ala Asp Gly Glu Gln Ile His Ser Ala Met Tyr Pro Gly Ser Met Phe
180 185 190
Gly Asp Pro Phe Ala Gln Lys Thr Glu Ile Asn Ile Arg Gln His Ala
195 200 205
Leu Glu Ser Gly Cys Phe Val Val Cys Ser Thr Ala Trp Leu Asp Ala
210 215 220
Asp Gln Gln Ala Gln Ile Met Gln Asp Thr Gly Cys Ala Ile Gly Pro
225 230 235 240
Ile Ser Gly Gly Cys Leu Thr Ala Ile Val Ala Pro Asp Gly Thr Phe
245 250 255
Leu Gly Glu Pro Leu Thr Ser Gly Glu Gly Glu Val Ile Ala Asp Leu
260 265 270
Asp Phe Lys Leu Ile Asp Lys Arg Lys Gln Thr Met Asp Ser Arg Gly
275 280 285
His Tyr Asn Arg Pro Glu Leu Leu Ser Leu Leu Ile Asp Arg Thr Pro
290 295 300
Thr Ser Asn Val His Glu Arg Ala Ala His Pro Lys Val Glu Ala Ser
305 310 315 320
Gln Thr Ala Gly Asp Thr Glu Arg Thr Arg Glu Val Leu
325 330
<210>235
<211>993
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>235
atgaaaattg ttaaagcggc agcagttcag ctgagccccg tcctctatag ccgcgagggg 60
acggtcgaaa gagtagtgcg gaagattcat caacttggtc aacagggagt gcagtttgcc 120
accttcccgg aaacagtggt gccttactac ccgtattttt cgatcgtgca gtccgggtat 180
caaatccttc gcggcggtga gttcgtgaag ctgctcgatc agtcagtgac ggtgccatct 240
cttgccaccg aagcgatcgc cgaggcctgc aggcaggcgg gcgtcgttgt ctccatcggc 300
gtcaatgagc gtgacggcgg aactatatac aatgcgcagc ttctgtttga ttcggacggc 360
acattgattc agaggcgacg caagatcacg cccacccact acgagcgcat gatctggggc 420
cagggcgatg gctcgggtct gcgggctgtg gacagcaagg tggcacgtat tggtcaactg 480
gcgtgctttg agcattacaa ccccctcgca cgatacgcga tgatcgccga tggcgagcag 540
atccactctg caatgtttcc cggttccatg ttcggcgatg gtttcgcgga gaggaccgag 600
atcgcggtca ggcagcatgc gcaggagtcc ggatgctttg tagtttgtgc tacggcgtgg 660
ctggatgccg accagcaggc tcaaattgcc gcggacacag gcatcaccga cctgggaccg 720
atctccggcg gttgcttcac tgcgatcatt gcacctgatg ggagcctgct gggtcaacca 780
atccgctcgg gcgaaggtga cgtcattgtc gatctcgatt tcactctgat cgacaggcgg 840
aagcatgttg tggactcgag aggtcactac agccggccgg aattgctaag cctgctgatc 900
gaccgtactc ccacagcgca cgttcacgaa cgggccgcgc actctcactt ggccgccgag 960
caatgcttgg aggatcttaa cgcgcttgct taa 993
<210>236
<211>330
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>236
Met Lys Ile Val Lys Ala Ala Ala Val Gln Leu Ser Pro Val Leu Tyr
1 5 10 15
Ser Arg Glu Gly Thr Val Glu Arg Val Val Arg Lys Ile His Gln Leu
20 25 30
Gly Gln Gln Gly Val Gln Phe Ala Thr Phe Pro Glu Thr Val Val Pro
35 40 45
Tyr Tyr Pro Tyr Phe Ser Ile Val Gln Ser Gly Tyr Gln Ile Leu Arg
50 55 60
Gly Gly Glu Phe Val Lys Leu Leu Asp Gln Ser Val Thr Val Pro Ser
65 70 75 80
Leu Ala Thr Glu Ala Ile Ala Glu Ala Cys Arg Gln Ala Gly Val Val
85 90 95
Val Ser Ile Gly Val Asn Glu Arg Asp Gly Gly Thr Ile Tyr Asn Ala
100 105 110
Gln Leu Leu Phe Asp Ser Asp Gly Thr Leu Ile Gln Arg Arg Arg Lys
115 120 125
Ile Thr Pro Thr His Tyr Glu Arg Met Ile Trp Gly Gln Gly Asp Gly
130 135 140
Ser Gly Leu Arg Ala Val Asp Ser Lys Val Ala Arg Ile Gly Gln Leu
145 150 155 160
Ala Cys Phe Glu His Tyr Asn Pro Leu Ala Arg Tyr Ala Met Ile Ala
165 170 175
Asp Gly Glu Gln Ile His Ser Ala Met Phe Pro Gly Ser Met Phe Gly
180 185 190
Asp Gly Phe Ala Glu Arg Thr Glu Ile Ala Val Arg Gln His Ala Gln
195 200 205
Glu Ser Gly Cys Phe Val Val Cys Ala Thr Ala Trp Leu Asp Ala Asp
210 215 220
Gln Gln Ala Gln Ile Ala Ala Asp Thr Gly Ile Thr Asp Leu Gly Pro
225 230 235 240
Ile Ser Gly Gly Cys Phe Thr Ala Ile Ile Ala Pro Asp Gly Ser Leu
245 250 255
Leu Gly Gln Pro Ile Arg Ser Gly Glu Gly Asp Val Ile Val Asp Leu
260 265 270
Asp Phe Thr Leu Ile Asp Arg Arg Lys His Val Val Asp Ser Arg Gly
275 280 285
His Tyr Ser Arg Pro Glu Leu Leu Ser Leu Leu Ile Asp Arg Thr Pro
290 295 300
Thr Ala His Val His Glu Arg Ala Ala His Ser His Leu Ala Ala Glu
305 310 315 320
Gln Cys Leu Glu Asp Leu Asn Ala Leu Ala
325 330
<210>237
<211>993
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>237
atgaaagtcg tcaaagccgc cgcagtgcag atcagtcctg ttctctacag ccgagaggca 60
accgtcgcca aggtcgtgca gaagatccac gaactcggcc agaaaggcgt gcagttcgcc 120
acctttccag agaccgtagt gccttactac ccgtactttt ccgccgtcca gacaggcatc 180
gagcttctgt ccggcacgga gcatctccgg ctgctcgatc aggccgtgac ggtgccgtct 240
gccgctaccg acgcgatcgg agaggcagcc cggaaggcag gcatggtggt gtcgatcggc 300
gtcaatgaac gcgatggcgg caccttgtac aacacacagt tgctcttcga tgccgatggc 360
accttgatcc agcgccgccg caagatcacg ccgacccact tcgaacgcat gatctggggc 420
cagggggacg gttcgggcct gcgcgctgtc gacagcaagg tcggtcgcat tggccagttg 480
gcctgcttcg agcacaacaa cccgctggcc cgctacgcgt tgattgccga cggcgagcag 540
atccattccg ccatgtatcc gggttctgct tttggcgaag gatttgccca aaggatggaa 600
atcaatatcc gccagcatgc gctggagtcg ggtgcgttcg tcgtcaacgc aacggcctgg 660
ctggatgctg accagcaggc gcagatcatg aaggacaccg gctgcgggat tggcccgatc 720
tcgggcggtt gcttcaccac gatcgtgtca cccgacggca tgctgatggc cgaacccctg 780
cgctcgggcg agggtgaggt catcgtcgat ctcgacttca cgctgatcga ccgtcgcaag 840
atgttgatgg actcggcggg ccactataac cgcccggaac tgctcagtct catgatcgac 900
cgcaccccga ccgcgcacgt tcatgaacgc gctgcgcgtc cggtgtcggg cgttgagcag 960
aacccggagg aacttcgcat cccggccgcg tga 993
<210>238
<211>330
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>238
Met Lys Val Val Lys Ala Ala Ala Val Gln Ile Ser Pro Val Leu Tyr
1 5 10 15
Ser Arg Glu Ala Thr Val Ala Lys Val Val Gln Lys Ile His Glu Leu
20 25 30
Gly Gln Lys Gly Val Gln Phe Ala Thr Phe Pro Glu Thr Val Val Pro
35 40 45
Tyr Tyr Pro Tyr Phe Ser Ala Val Gln Thr Gly Ile Glu Leu Leu Ser
50 55 60
Gly Thr Glu His Leu Arg Leu Leu Asp Gln Ala Val Thr Val Pro Ser
65 70 75 80
Ala Ala Thr Asp Ala Ile Gly Glu Ala Ala Arg Lys Ala Gly Met Val
85 90 95
Val Ser Ile Gly Val Asn Glu Arg Asp Gly Gly Thr Leu Tyr Asn Thr
100 105 110
Gln Leu Leu Phe Asp Ala Asp Gly Thr Leu Ile Gln Arg Arg Arg Lys
115 120 125
Ile Thr Pro Thr His Phe Glu Arg Met Ile Trp Gly Gln Gly Asp Gly
130 135 140
Ser Gly Leu Arg Ala Val Asp Ser Lys Val Gly Arg Ile Gly Gln Leu
145 150 155 160
Ala Cys Phe Glu His Asn Asn Pro Leu Ala Arg Tyr Ala Leu Ile Ala
165 170 175
Asp Gly Glu Gln Ile His Ser Ala Met Tyr Pro Gly Ser Ala Phe Gly
180 185 190
Glu Gly Phe Ala Gln Arg Met Glu Ile Asn Ile Arg Gln His Ala Leu
195 200 205
Glu Ser Gly Ala Phe Val Val Asn Ala Thr Ala Trp Leu Asp Ala Asp
210 215 220
Gln Gln Ala Gln Ile Met Lys Asp Thr Gly Cys Gly Ile Gly Pro Ile
225 230 235 240
Ser Gly Gly Cys Phe Thr Thr Ile Val Ser Pro Asp Gly Met Leu Met
245 250 255
Ala Glu Pro Leu Arg Ser Gly Glu Gly Glu Val Ile Val Asp Leu Asp
260 265 270
Phe Thr Leu Ile Asp Arg Arg Lys Met Leu Met Asp Ser Ala Gly His
275 280 285
Tyr Asn Arg Pro Glu Leu Leu Ser Leu Met Ile Asp Arg Thr Pro Thr
290 295 300
Ala His Val His Glu Arg Ala Ala Arg Pro Val Ser Gly Val Glu Gln
305 310 315 320
Asn Pro Glu Glu Leu Arg Ile Pro Ala Ala
325 330
<210>239
<211>969
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>239
atgtccaatg agaataagta tgacacattt aaagttgctg cagtccaggc cacacctgtg 60
tttcttgatc gtgaagcaac catcgacaaa gcttgcgagt tgattgctac tgcaggtcgt 120
gaaggtgctc gcctgattgt ctttccagaa gcgttcatcc catcctatcc cgagtgggta 180
tggggtattc cctctggtga gcaaggttta ctcaacgaac tttattcaga gttgctcacc 240
aatgcggtta ccatacccag cgacgcgact gacagattgt gcgaggcggc aaagcttgcg 300
aatgcctatg tagtgatggg aatgagcgaa cggaatgtcg aagcgagtgg tgcaagcctg 360
tataacacga tgttgtatat agatgcacag ggggagattt tagggaaaca tcggaagctg 420
gtgccaacgg gtggtgaacg cctggtatgg gcgcaaggtg atggcagcac gctgcaggtc 480
tacgatactc cattgggaaa acttggtggg ttaatttgct gggaaaatta tatgccgctg 540
gcacgctata cgatgtatgc ctggggaaca caaatctatg ttgcagcaac gtgggattgc 600
ggccaaccct ggctctcaac gatacggcat attgctaaag aaggcagggt atacgtggtt 660
ggttgctgta tcgcgatgcg taaagatgat attccagatc gttactctat gaagcagaaa 720
tattatgctg aaatggatga atggataaat gttggggata gcgcgattgt caatcccgaa 780
ggacatttta ttgcagggcc tgtgcgcaag caagaagaaa ttctctatgc ggagattgat 840
ccacgtatga tgcaaggccc gaagtggatg cttgacgtgg cgggacatta tgcaagacca 900
gatgtgttcc agttgacggt gcatacggat gtgaggcaga tgatacgggt ggaagatgat 960
tctcaatga 969
<210>240
<211>322
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>240
Met Ser Asn Glu Asn Lys Tyr Asp Thr Phe Lys Val Ala Ala Val Gln
1 5 10 15
Ala Thr Pro Val Phe Leu Asp Arg Glu Ala Thr Ile Asp Lys Ala Cys
20 25 30
Glu Leu Ile Ala Thr Ala Gly Arg Glu Gly Ala Arg Leu Ile Val Phe
35 40 45
Pro Glu Ala Phe Ile Pro Ser Tyr Pro Glu Trp Val Trp Gly Ile Pro
50 55 60
Ser Gly Glu Gln Gly Leu Leu Asn Glu Leu Tyr Ser Glu Leu Leu Thr
65 70 75 80
Asn Ala Val Thr Ile Pro Ser Asp Ala Thr Asp Arg Leu Cys Glu Ala
85 90 95
Ala Lys Leu Ala Asn Ala Tyr Val Val Met Gly Met Ser Glu Arg Asn
100 105 110
Val Glu Ala Ser Gly Ala Ser Leu Tyr Asn Thr Met Leu Tyr Ile Asp
115 120 125
Ala Gln Gly Glu Ile Leu Gly Lys His Arg Lys Leu Val Pro Thr Gly
130 135 140
Gly Glu Arg Leu Val Trp Ala Gln Gly Asp Gly Ser Thr Leu Gln Val
145 150 155 160
Tyr Asp Thr Pro Leu Gly Lys Leu Gly Gly Leu Ile Cys Trp Glu Asn
165 170 175
Tyr Met Pro Leu Ala Arg Tyr Thr Met Tyr Ala Trp Gly Thr Gln Ile
180 185 190
Tyr Val Ala Ala Thr Trp Asp Cys Gly Gln Pro Trp Leu Ser Thr Ile
195 200 205
Arg His Ile Ala Lys Glu Gly Arg Val Tyr Val Val Gly Cys Cys Ile
210 215 220
Ala Met Arg Lys Asp Asp Ile Pro Asp Arg Tyr Ser Met Lys Gln Lys
225 230 235 240
Tyr Tyr Ala Glu Met Asp Glu Trp Ile Asn Val Gly Asp Ser Ala Ile
245 250 255
Val Asn Pro Glu Gly His Phe Ile Ala Gly Pro Val Arg Lys Gln Glu
260 265 270
Glu Ile Leu Tyr Ala Glu Ile Asp Pro Arg Met Met Gln Gly Pro Lys
275 280 285
Trp Met Leu Asp Val Ala Gly His Tyr Ala Arg Pro Asp Val Phe Gln
290 295 300
Leu Thr Val His Thr Asp Val Arg Gln Met Ile Arg Val Glu Asp Asp
305 310 315 320
Ser Gln
<210>241
<211>972
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>241
atgatatcca acgagcataa caatactcca ttcaaggttg ctgctgtgca ggctacacct 60
gtgtttcttg atcgcgaagc aacgatcgat aaagcgtgtg aactgatcgc tactgccggt 120
catgaaggcg ctcgtttgat tgtttttcca gaagcgttca tcccatccta tcccgagtgg 180
gtatggggaa ttccctctgg cgagcaaggt ttgctcaacg atctgtatgc agagttactc 240
accaattcag ttacgatacc cagcaacgca actgacaggc tttgtagagc cgcgaagctt 300
gctaatgcct acgtggtgat ggggatgagc gaacggaata tcgaagcgag cggcgcaagc 360
ctgtacaata cgatgttata tatagatgca cagggtgaga ttttgggcaa acatcgaaag 420
ctggtgccaa cgggcggaga acgcctggta tgggcacaag gagatggaag cacgctgcag 480
gtttacgata cacctctagg aaagcttggt ggtttaattt gctgggaaaa ttatatgccg 540
ctggcacgct acgctatgta tgcctgggga actcaaatct acgtcgcggc aacgtgggat 600
cgcggccaac cctggctctc aacgatacgg catatcgcta aagagggcag ggtatacgta 660
atcggttgct gtatcgcgat gcgtaaagac gatattccag ataggtactc catgaagcag 720
aagtattatg cggagatgga tgaatggatc aacgtaggtg acagcgcgat tgtcaatcct 780
gagggggact tcattgcggg gcctgtgagc aagcaggagg aaattctcta tgcggagatt 840
gatccgcgga tggtgcaagg tccgaagtgg atgctggatg tggcggggca ttacgcgagg 900
cctgatgtgt tcgagttgac ggtgcatacg gatgtgaggg agatgatgcg ggtggagcat 960
gattatcaat ga 972
<210>242
<211>323
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>242
Met Ile Ser Asn Glu His Asn Asn Thr Pro Phe Lys Val Ala Ala Val
1 5 10 15
Gln Ala Thr Pro Val Phe Leu Asp Arg Glu Ala Thr Ile Asp Lys Ala
20 25 30
Cys Glu Leu Ile Ala Thr Ala Gly His Glu Gly Ala Arg Leu Ile Val
35 40 45
Phe Pro Glu Ala Phe Ile Pro Ser Tyr Pro Glu Trp Val Trp Gly Ile
50 55 60
Pro Ser Gly Glu Gln Gly Leu Leu Asn Asp Leu Tyr Ala Glu Leu Leu
65 70 75 80
Thr Asn Ser Val Thr Ile Pro Ser Asn Ala Thr Asp Arg Leu Cys Arg
85 90 95
Ala Ala Lys Leu Ala Asn Ala Tyr Val Val Met Gly Met Ser Glu Arg
100 105 110
Asn Ile Glu Ala Ser Gly Ala Ser Leu Tyr Asn Thr Met Leu Tyr Ile
115 120 125
Asp Ala Gln Gly Glu Ile Leu Gly Lys His Arg Lys Leu Val Pro Thr
130 135 140
Gly Gly Glu Arg Leu Val Trp Ala Gln Gly Asp Gly Ser Thr Leu Gln
145 150 155 160
Val Tyr Asp Thr Pro Leu Gly Lys Leu Gly Gly Leu Ile Cys Trp Glu
165 170 175
Asn Tyr Met Pro Leu Ala Arg Tyr Ala Met Tyr Ala Trp Gly Thr Gln
180 185 190
Ile Tyr Val Ala Ala Thr Trp Asp Arg Gly Gln Pro Trp Leu Ser Thr
195 200 205
Ile Arg His Ile Ala Lys Glu Gly Arg Val Tyr Val Ile Gly Cys Cys
210 215 220
Ile Ala Met Arg Lys Asp Asp Ile Pro Asp Arg Tyr Ser Met Lys Gln
225 230 235 240
Lys Tyr Tyr Ala Glu Met Asp Glu Trp Ile Asn Val Gly Asp Ser Ala
245 250 255
Ile Val Asn Pro Glu Gly Asp Phe Ile Ala Gly Pro Val Ser Lys Gln
260 265 270
Glu Glu Ile Leu Tyr Ala Glu Ile Asp Pro Arg Met Val Gln Gly Pro
275 280 285
Lys Trp Met Leu Asp Val Ala Gly His Tyr Ala Arg Pro Asp Val Phe
290 295 300
Glu Leu Thr Val His Thr Asp Val Arg Glu Met Met Arg Val Glu His
305 310 315 320
Asp Tyr Gln
<210>243
<211>999
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>243
atgaagcaaa ctcgagtagc gatcattcag gcggaacctg tatatctcaa tttgcaagcg 60
agtgttgcga gggctatcga tcttgccgga cgcgccgcga agcaaggagc gcgtctgata 120
gtgtttggag agacctggtt gccgggttat cccgcgtggc tggattactg tcccggcatg 180
gcgttctggg atcaccggcc gacaaaagaa gtgtttgccc ggacccgcga gaacagtgtt 240
gtaattccgg gaaaggaaat cgaacagctc tgtaaaactg cggcggagct gggagttgta 300
atttcgatcg gtgtaaacga aaaaattctg gaaggccccg gaaacggcac gctctacaat 360
tctcttttgc tgattgatga atcaggaaaa ctggccggcc atcaccgcaa actggttccg 420
acttatacgg aacggatggt gtggggaatg ggtgatggag gtggaatgga agccatatcg 480
actgcagccg gcagggttgg cggattgatt tgctgggagc actggatgcc attgagccgg 540
caggtgctgc acatgtcggg tgaggaaatt catgtggcag tgtggcccac ggttcatgag 600
gtgcaccagc ttgcatcacg ccattatgca tttgaagggc gttgttttgt gctcgcagcc 660
ggattgttga tgaaggtccg ggatattcct ccggagctgg aattgccttc tcagatgtcg 720
cgtgaatccg aagactggct tctgcgcggc gggagcgccg tcattggtcc ggatggaaag 780
tacattgtgg agccgttgtt tgatcgagag gcgattctca cagccgatct tgaattagcc 840
gcatgcgatc gtgaaaaaat gacgctggac gtaacgggac attattcccg ccccgatctt 900
tttcacctgg aattcaggaa acagcaatcc ggccatattg cgggagcagg aacgatcagc 960
cggcaaaaat cagcgccgga ccgcgcggac gatcactaa 999
<210>244
<211>332
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>244
Met Lys Gln Thr Arg Val Ala Ile Ile Gln Ala Glu Pro Val Tyr Leu
1 5 10 15
Asn Leu Gln Ala Ser Val Ala Arg Ala Ile Asp Leu Ala Gly Arg Ala
20 25 30
Ala Lys Gln Gly Ala Arg Leu Ile Val Phe Gly Glu Thr Trp Leu Pro
35 40 45
Gly Tyr Pro Ala Trp Leu Asp Tyr Cys Pro Gly Met Ala Phe Trp Asp
50 55 60
His Arg Pro Thr Lys Glu Val Phe Ala Arg Thr Arg Glu Asn Ser Val
65 70 75 80
Val Ile Pro Gly Lys Glu Ile Glu Gln Leu Cys Lys Thr Ala Ala Glu
85 90 95
Leu Gly Val Val Ile Ser Ile Gly Val Asn Glu Lys Ile Leu Glu Gly
100 105 110
Pro Gly Asn Gly Thr Leu Tyr Asn Ser Leu Leu Leu Ile Asp Glu Ser
115 120 125
Gly Lys Leu Ala Gly His His Arg Lys Leu Val Pro Thr Tyr Thr Glu
130 135 140
Arg Met Val Trp Gly Met Gly Asp Gly Gly Gly Met Glu Ala Ile Ser
145 150 155 160
Thr Ala Ala Gly Arg Val Gly Gly Leu Ile Cys Trp Glu His Trp Met
165 170 175
Pro Leu Ser Arg Gln Val Leu His Met Ser Gly Glu Glu Ile His Val
180 185 190
Ala Val Trp Pro Thr Val His Glu Val His Gln Leu Ala Ser Arg His
195 200 205
Tyr Ala Phe Glu Gly Arg Cys Phe Val Leu Ala Ala Gly Leu Leu Met
210 215 220
Lys Val Arg Asp Ile Pro Pro Glu Leu Glu Leu Pro Ser Gln Met Ser
225 230 235 240
Arg Glu Ser Glu Asp Trp Leu Leu Arg Gly Gly Ser Ala Val Ile Gly
245 250 255
Pro Asp Gly Lys Tyr Ile Val Glu Pro Leu Phe Asp Arg Glu Ala Ile
260 265 270
Leu Thr Ala Asp Leu Glu Leu Ala Ala Cys Asp Arg Glu Lys Met Thr
275 280 285
Leu Asp Val Thr Gly His Tyr Ser Arg Pro Asp Leu Phe His Leu Glu
290 295 300
Phe Arg Lys Gln Gln Ser Gly His Ile Ala Gly Ala Gly Thr Ile Ser
305 310 315 320
Arg Gln Lys Ser Ala Pro Asp Arg Ala Asp Asp His
325 330
<210>245
<211>999
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>245
atgggtgaga atgccaactt caccgtcgca gctgtccagg caacgccggt cttcttagac 60
cgggatgcga cggtcgagaa ggcttgcgag ctcatcgctg aagccgggcg aaacggagcg 120
cgcctggccg tttttcccga ggcgtttgtg ccggcttacc cggactgggt ctgggctgtt 180
ccgccgggcg attcaaggct gctgcacgag ctctacggtg agctgatcca gaactctgtc 240
acgattccca gcgagtcgac ggagaaactc tgccgggccg cccgcggggc caaagtctgc 300
gtggcgatcg gcatcaacga gaggaatgcg gaggcaagcg ggggtagcct ctacaacagc 360
ctcctgtaca tcagcccgga cggccaggtc ctcgggaagc accgcaagct cgttcccacc 420
ggagcggagc ggcttgtctg ggcgcagggc gacggcagca ctatcgacgt gtttgagttg 480
cctttctgtc gtttgggtgg cctcatctgt tgggagaact acatgccgct ggcccgttat 540
gcgatgtacg cctggggcac gcaggtctac gtcgcggcaa cgtgggacca cggcgaacct 600
tggctctcaa ccttgaggca tatcgccagg gaggggcgtg catatgtcat tggcgtttgc 660
atgccgatgc gcatgagcga catcccggac cgatacgagt tcaagcgcaa gtactatggc 720
gggcgcgact ggatcaatac tggtgacagc gccatcgtgg gtccggacgg aaacttcatc 780
gccggccccc tgagcgagcg cgaagagatc ctgtacgccg atatagacct gaatcggctt 840
gcgaactcga agtggatgct ggacgtcgcc gggcactatg cacggccgga cgtcttccag 900
ttgaccgtta accgcgagcc gaacccgatg atctctgagg atgggcacaa gacggttccc 960
acgctaccga aacgtgcggg gaagagtagg acgagatga 999
<210>246
<211>332
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>246
Met Gly Glu Asn Ala Asn Phe Thr Val Ala Ala Val Gln Ala Thr Pro
1 5 10 15
Val Phe Leu Asp Arg Asp Ala Thr Val Glu Lys Ala Cys Glu Leu Ile
20 25 30
Ala Glu Ala Gly Arg Asn Gly Ala Arg Leu Ala Val Phe Pro Glu Ala
35 40 45
Phe Val Pro Ala Tyr Pro Asp Trp Val Trp Ala Val Pro Pro Gly Asp
50 55 60
Ser Arg Leu Leu His Glu Leu Tyr Gly Glu Leu Ile Gln Asn Ser Val
65 70 75 80
Thr Ile Pro Ser Glu Ser Thr Glu Lys Leu Cys Arg Ala Ala Arg Gly
85 90 95
Ala Lys Val Cys Val Ala Ile Gly Ile Asn Glu Arg Asn Ala Glu Ala
100 105 110
Ser Gly Gly Ser Leu Tyr Asn Ser Leu Leu Tyr Ile Ser Pro Asp Gly
115 120 125
Gln Val Leu Gly Lys His Arg Lys Leu Val Pro Thr Gly Ala Glu Arg
130 135 140
Leu Val Trp Ala Gln Gly Asp Gly Ser Thr Ile Asp Val Phe Glu Leu
145 150 155 160
Pro Phe Cys Arg Leu Gly Gly Leu Ile Cys Trp Glu Asn Tyr Met Pro
165 170 175
Leu Ala Arg Tyr Ala Met Tyr Ala Trp Gly Thr Gln Val Tyr Val Ala
180 185 190
Ala Thr Trp Asp His Gly Glu Pro Trp Leu Ser Thr Leu Arg His Ile
195 200 205
Ala Arg Glu Gly Arg Ala Tyr Val Ile Gly Val Cys Met Pro Met Arg
210 215 220
Met Ser Asp Ile Pro Asp Arg Tyr Glu Phe Lys Arg Lys Tyr Tyr Gly
225 230 235 240
Gly Arg Asp Trp Ile Asn Thr Gly Asp Ser Ala Ile Val Gly Pro Asp
245 250 255
Gly Asn Phe Ile Ala Gly Pro Leu Ser Glu Arg Glu Glu Ile Leu Tyr
260 265 270
Ala Asp Ile Asp Leu Asn Arg Leu Ala Asn Ser Lys Trp Met Leu Asp
275 280 285
Val Ala Gly His Tyr Ala Arg Pro Asp Val Phe Gln Leu Thr Val Asn
290 295 300
Arg Glu Pro Asn Pro Met Ile Ser Glu Asp Gly His Lys Thr Val Pro
305 310 315 320
Thr Leu Pro Lys Arg Ala Gly Lys Ser Arg Thr Arg
325 330
<210>247
<211>990
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>247
atgccgaccc ccaaggaaaa gtttagaatc gccgccgttc aggcttgccc cgtttttctg 60
gaccgggggg agacggtaaa gaaggcctgc cggctggcgg ccgaggccgg gggccagggc 120
gcccggctga tcgtgtttcc ggagtccttc atccccgctt acccggactg ggtgtgggcc 180
gttccgccgg ggagggagaa gctcctgaat gaaatgtacg ccgaattcct ggccggcgcg 240
gtggaagtcc cggggccggt gacggaggaa ttgggccggg cggcggaaag ggccggcgct 300
tacctggtca tgggggtcac cgagcgggac accgaggcca gcggggcaag cctgtacaac 360
accctcctct atttcggtcc ccagggaagc ctgctgggaa aacaccgcaa actggtgccc 420
acgggagggg aacggaccgt ctgggcccgg ggggacggca gcacgctgca ggtgtacgat 480
acccccctgg gaaagatcgg cggcctgatc tgctgggaga actacatgcc cctggcccgc 540
tacgccatgt atgcgtgggg cactcagatt tacctggccc ccacctggga ccggggggaa 600
ccctggcttt caaccctgcg gcacatcgcc aaggaagggc gggtttacgt ggtggggtgc 660
tgcatggcta tgcaaaaagg ggacatcccg gatcgcttcg aatacaagca aaaatactat 720
cccgcagccc gggagtggat caacacgggc gacagcgcca tcctgaaccc ggagggggaa 780
ttcatcgccg ggccggcggg aaagaaggaa gagatcctgt acgctgaaat agacccccgg 840
cagatggggg ggcccaagtg gatgctggac gtggccggcc attacgcccg gccggatgtc 900
ttcgaactga tcgttcaccg ggaggcccgg ccgatgatcc gggtgacgga agcgccgtct 960
ccgggagaga aagaaacggg tgaaggatag 990
<210>248
<211>329
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>248
Met Pro Thr Pro Lys Glu Lys Phe Arg Ile Ala Ala Val Gln Ala Cys
1 5 10 15
Pro Val Phe Leu Asp Arg Gly Glu Thr Val Lys Lys Ala Cys Arg Leu
20 25 30
Ala Ala Glu Ala Gly Gly Gln Gly Ala Arg Leu Ile Val Phe Pro Glu
35 40 45
Ser Phe Ile Pro Ala Tyr Pro Asp Trp Val Trp Ala Val Pro Pro Gly
50 55 60
Arg Glu Lys Leu Leu Asn Glu Met Tyr Ala Glu Phe Leu Ala Gly Ala
65 70 75 80
Val Glu Val Pro Gly Pro Val Thr Glu Glu Leu Gly Arg Ala Ala Glu
85 90 95
Arg Ala Gly Ala Tyr Leu Val Met Gly Val Thr Glu Arg Asp Thr Glu
100 105 110
Ala Ser Gly Ala Ser Leu Tyr Asn Thr Leu Leu Tyr Phe Gly Pro Gln
115 120 125
Gly Ser Leu Leu Gly Lys His Arg Lys Leu Val Pro Thr Gly Gly Glu
130 135 140
Arg Thr Val Trp Ala Arg Gly Asp Gly Ser Thr Leu Gln Val Tyr Asp
145 150 155 160
Thr Pro Leu Gly Lys Ile Gly Gly Leu Ile Cys Trp Glu Asn Tyr Met
165 170 175
Pro Leu Ala Arg Tyr Ala Met Tyr Ala Trp Gly Thr Gln Ile Tyr Leu
180 185 190
Ala Pro Thr Trp Asp Arg Gly Glu Pro Trp Leu Ser Thr Leu Arg His
195 200 205
Ile Ala Lys Glu Gly Arg Val Tyr Val Val Gly Cys Cys Met Ala Met
210 215 220
Gln Lys Gly Asp Ile Pro Asp Arg Phe Glu Tyr Lys Gln Lys Tyr Tyr
225 230 235 240
Pro Ala Ala Arg Glu Trp Ile Asn Thr Gly Asp Ser Ala Ile Leu Asn
245 250 255
Pro Glu Gly Glu Phe Ile Ala Gly Pro Ala Gly Lys Lys Glu Glu Ile
260 265 270
Leu Tyr Ala Glu Ile Asp Pro Arg Gln Met Gly Gly Pro Lys Trp Met
275 280 285
Leu Asp Val Ala Gly His Tyr Ala Arg Pro Asp Val Phe Glu Leu Ile
290 295 300
Val His Arg Glu Ala Arg Pro Met Ile Arg Val Thr Glu Ala Pro Ser
305 310 315 320
Pro Gly Glu Lys Glu Thr Gly Glu Gly
325
<210>249
<211>1017
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>249
atgggcattc aacatccgaa atatcgcgtc gcggtggtgc aggcggcacc ggcctggctc 60
gacctcgagg cgtcggtcag caagagcatc gcgctgatag aggaggccgc cgccaagggc 120
gccaagctga tcgcgttccc cgaggccttc atccccggct atccctggta catctggctg 180
gactcgccgg cctgggcgat cggccgcggc ttcgtgcagc gctatttcga caattcgctc 240
agctatgaca gcccgcaggc ggagcgcctg aggctcgcag tgaagaaggc cggcatgacc 300
gcagtgctcg gcctgtccga gcgcgacggc ggcagcctct atctcgcgca atggttgatc 360
ggacccgacg gcgagaccat cgcaaagcgg cgcaagctgc ggccgaccca tgccgagcgc 420
accgtctacg gcgagggcga cggcagcgac cttgcggtgc atgaccgccc cggcatcggc 480
cggctcggtg cgctgtgctg ctgggagcat ctgcagccgc tgtcgaaata cgcgatgtac 540
gcccagaacg agcaggtgca tgtcgcggcc tggccgagct tctcgctgta cgatccgttc 600
gcgccggcgc tcggctggga ggtcaacaat gcggcctcgc gcgtctatgc cgtcgagggc 660
tcctgcttcg tgctggcgcc ctgcgccacc gtctcgcagg cgatgatcga cgagctctgc 720
gaccgcgacg acaagcatgc gctgctgcat gttggcggcg gccatgccgc gatcttcggc 780
cccgacggca gcgcgatcgc ggacaagctt ccgtccgacc aggagggcct cctgttcgcc 840
gacatcgatc tcggcgcgat cgggatcgcg aagaatgccg ctgatccggc cgggcactat 900
tcgcgcccgg acgtgacgcg gctgctgctc aacaagaagc cctcgaagcg cgtcgagcac 960
ttcgcgctgc cgctcgacac gctcgcgggc gaggagatcg acgcggccgc aagctaa 1017
<210>250
<211>338
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>250
Met Gly Ile Gln His Pro Lys Tyr Arg Val Ala Val Val Gln Ala Ala
1 5 10 15
Pro Ala Trp Leu Asp Leu Glu Ala Ser Val Ser Lys Ser Ile Ala Leu
20 25 30
Ile Glu Glu Ala Ala Ala Lys Gly Ala Lys Leu Ile Ala Phe Pro Glu
35 40 45
Ala Phe Ile Pro Gly Tyr Pro Trp Tyr Ile Trp Leu Asp Ser Pro Ala
50 55 60
Trp Ala Ile Gly Arg Gly Phe Val Gln Arg Tyr Phe Asp Asn Ser Leu
65 70 75 80
Ser Tyr Asp Ser Pro Gln Ala Glu Arg Leu Arg Leu Ala Val Lys Lys
85 90 95
Ala Gly Met Thr Ala Val Leu Gly Leu Ser Glu Arg Asp Gly Gly Ser
100 105 110
Leu Tyr Leu Ala Gln Trp Leu Ile Gly Pro Asp Gly Glu Thr Ile Ala
115 120 125
Lys Arg Arg Lys Leu Arg Pro Thr His Ala Glu Arg Thr Val Tyr Gly
130 135 140
Glu Gly Asp Gly Ser Asp Leu Ala Val His Asp Arg Pro Gly Ile Gly
145 150 155 160
Arg Leu Gly Ala Leu Cys Cys Trp Glu His Leu Gln Pro Leu Ser Lys
165 170 175
Tyr Ala Met Tyr Ala Gln Asn Glu Gln Val His Val Ala Ala Trp Pro
180 185 190
Ser Phe Ser Leu Tyr Asp Pro Phe Ala Pro Ala Leu Gly Trp Glu Val
195 200 205
Asn Asn Ala Ala Ser Arg Val Tyr Ala Val Glu Gly Ser Cys Phe Val
210 215 220
Leu Ala Pro Cys Ala Thr Val Ser Gln Ala Met Ile Asp Glu Leu Cys
225 230 235 240
Asp Arg Asp Asp Lys His Ala Leu Leu His Val Gly Gly Gly His Ala
245 250 255
Ala Ile Phe Gly Pro Asp Gly Ser Ala Ile Ala Asp Lys Leu Pro Ser
260 265 270
Asp Gln Glu Gly Leu Leu Phe Ala Asp Ile Asp Leu Gly Ala Ile Gly
275 280 285
Ile Ala Lys Asn Ala Ala Asp Pro Ala Gly His Tyr Ser Arg Pro Asp
290 295 300
Val Thr Arg Leu Leu Leu Asn Lys Lys Pro Ser Lys Arg Val Glu His
305 310 315 320
Phe Ala Leu Pro Leu Asp Thr Leu Ala Gly Glu Glu Ile Asp Ala Ala
325 330 335
Ala Ser
<210>251
<211>978
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>251
gtgaccatcg tgagagctgc cgccgtgcag atcagtcccg tgctctacag ccgggaagcc 60
accgtagaaa aagtcgttcg caagatccgc gaactgggaa gaaacggcgt gcagttcgcc 120
accttcccgg aaaccctggt gccctactac ccgtacttcg cggccgtgca gacgggcatc 180
gaactgctgt ccggcaagga gcacctgcga ctgctggaac aatccgtaac ggttccctcg 240
cccgccaccg atgccattgc ccaggcggca cgcgaagccg gcatggtggt gtccatcggt 300
gtcaacgagc gcgacggagg caccatctac aacacgcagc tgctgtttga cgccgacggc 360
acgctggtac agcgccgccg caagatcacg ccgacgcatt tcgagcgcat ggtctggggc 420
cagggcgacg gctcgggcct gcgagccgtg gacaccaagg ccggccgcat cggtcagctc 480
gcctgcttcg agcacaacaa cccgctggcg cgctacgcca tgatcgccga cggtgagcag 540
atccattcgg ccatgtaccc gggctctgcc ttcggcgagg gcttcgcgca gcgcatggaa 600
atcaacatac gccagcacgc cctggagtct ggctgcttcg tggtgaatgc gaccgcgtgg 660
ctggatgccg accagcaggc gcagatcatg aaggatacgg gctgcggcat cggcccgatc 720
tccggcggct gcttcacgac catcgtcacg ccggacggca tgctgatcgg tgaacccctc 780
cgcgaaggcg aaggcgaagt catcgccgac ctcgatttca ccctgatcga ccggcgcaag 840
ctgctggtgg actcggtggg ccactacaac cgtccggagc tgctgagcct gctgatcgat 900
cgcacccctg cggcgaactt ccatgagcgc aatgcgcttc cgtccgtcaa caccgccagc 960
agcctcgaaa tcgtctga 978
<210>252
<211>325
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>252
Val Thr Ile Val Arg Ala Ala Ala Val Gln Ile Ser Pro Val Leu Tyr
1 5 10 15
Ser Arg Glu Ala Thr Val Glu Lys Val Val Arg Lys Ile Arg Glu Leu
20 25 30
Gly Arg Asn Gly Val Gln Phe Ala Thr Phe Pro Glu Thr Leu Val Pro
35 40 45
Tyr Tyr Pro Tyr Phe Ala Ala Val Gln Thr Gly Ile Glu Leu Leu Ser
50 55 60
Gly Lys Glu His Leu Arg Leu Leu Glu Gln Ser Val Thr Val Pro Ser
65 70 75 80
Pro Ala Thr Asp Ala Ile Ala Gln Ala Ala Arg Glu Ala Gly Met Val
85 90 95
Val Ser Ile Gly Val Asn Glu Arg Asp Gly Gly Thr Ile Tyr Asn Thr
100 105 110
Gln Leu Leu Phe Asp Ala Asp Gly Thr Leu Val Gln Arg Arg Arg Lys
115 120 125
Ile Thr Pro Thr His Phe Glu Arg Met Val Trp Gly Gln Gly Asp Gly
130 135 140
Ser Gly Leu Arg Ala Val Asp Thr Lys Ala Gly Arg Ile Gly Gln Leu
145 150 155 160
Ala Cys Phe Glu His Asn Asn Pro Leu Ala Arg Tyr Ala Met Ile Ala
165 170 175
Asp Gly Glu Gln Ile His Ser Ala Met Tyr Pro Gly Ser Ala Phe Gly
180 185 190
Glu Gly Phe Ala Gln Arg Met Glu Ile Asn Ile Arg Gln His Ala Leu
195 200 205
Glu Ser Gly Cys Phe Val Val Asn Ala Thr Ala Trp Leu Asp Ala Asp
210 215 220
Gln Gln Ala Gln Ile Met Lys Asp Thr Gly Cys Gly Ile Gly Pro Ile
225 230 235 240
Ser Gly Gly Cys Phe Thr Thr Ile Val Thr Pro Asp Gly Met Leu Ile
245 250 255
Gly Glu Pro Leu Arg Glu Gly Glu Gly Glu Val Ile Ala Asp Leu Asp
260 265 270
Phe Thr Leu Ile Asp Arg Arg Lys Leu Leu Val Asp Ser Val Gly His
275 280 285
Tyr Asn Arg Pro Glu Leu Leu Ser Leu Leu Ile Asp Arg Thr Pro Ala
290 295 300
Ala Asn Phe His Glu Arg Asn Ala Leu Pro Ser Val Asn Thr Ala Ser
305 310 315 320
Ser Leu Glu Ile Val
325
<210>253
<211>924
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>253
atgtcaaacg agaaccacaa ccaaacattc aaagttgccg cggtgcaggc cacacctgta 60
ttcctcgatc gtgaagcgac catcgacaaa gcttgcgagt tgattgctgc agccggcaat 120
gaaggggcga ggctggttgt cttcccggag gcattcatcc cgtcctatcc agattgggta 180
tgggcaatcc caccgggcga agaaggcgtg ctcaatgagt tgtacgcgga actgctctcc 240
aattcggtca cgattcccag tgacgtgacg gatagactgt gccgggccgc gaggcttgcc 300
aatgcctacg tagtgatggg gatgagcgaa tgcaatgccg aggccagtgg cgcaagcctg 360
tataacacgc tattgtacat cgatgcgaag ggtgaaatcc tgggtaaaca tcgaaagttg 420
gtgccaactg gcggcgagcg actggtgtgg gcacagggcg atggcagcac gctgcaggtc 480
tacgatactc cactgggtaa actcggcggt ttaatttgct gggagaatta tatgccgctg 540
gcccgctaca ccatgtacgc ctggggcaca caaatctata tcgcagcgac atgggatcgc 600
gggcaaccct ggctctccac cttgcggcat atcgccaaag aaggcagggt gtacgtgatc 660
ggctgttgta tcgcgatgcg caaagatgat atcccagagc gttacccaat gaagcagaag 720
ttttacgcgg aggccgatga gtggatcaat ataggcgaca gcgcgatcgt caatcctgaa 780
gggcagttta tcgcggggcc ggtacgcaaa caggaagaga ttctctacgc ggagattaat 840
ccgcgcatgg tgcaaggccc gaagtggatg ctcgacgtgg cagggcacta cgccaggccg 900
gacgtattcc agttgacagt gtaa 924
<210>254
<211>307
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>254
Met Ser Asn Glu Asn His Asn Gln Thr Phe Lys Val Ala Ala Val Gln
1 5 10 15
Ala Thr Pro Val Phe Leu Asp Arg Glu Ala Thr Ile Asp Lys Ala Cys
20 25 30
Glu Leu Ile Ala Ala Ala Gly Asn Glu Gly Ala Arg Leu Val Val Phe
35 40 45
Pro Glu Ala Phe Ile Pro Ser Tyr Pro Asp Trp Val Trp Ala Ile Pro
50 55 60
Pro Gly Glu Glu Gly Val Leu Asn Glu Leu Tyr Ala Glu Leu Leu Ser
65 70 75 80
Asn Ser Val Thr Ile Pro Ser Asp Val Thr Asp Arg Leu Cys Arg Ala
85 90 95
Ala Arg Leu Ala Asn Ala Tyr Val Val Met Gly Met Ser Glu Cys Asn
100 105 110
Ala Glu Ala Ser Gly Ala Ser Leu Tyr Asn Thr Leu Leu Tyr Ile Asp
115 120 125
Ala Lys Gly Glu Ile Leu Gly Lys His Arg Lys Leu Val Pro Thr Gly
130 135 140
Gly Glu Arg Leu Val Trp Ala Gln Gly Asp Gly Ser Thr Leu Gln Val
145 150 155 160
Tyr Asp Thr Pro Leu Gly Lys Leu Gly Gly Leu Ile Cys Trp Glu Asn
165 170 175
Tyr Met Pro Leu Ala Arg Tyr Thr Met Tyr Ala Trp Gly Thr Gln Ile
180 185 190
Tyr Ile Ala Ala Thr Trp Asp Arg Gly Gln Pro Trp Leu Ser Thr Leu
195 200 205
Arg His Ile Ala Lys Glu Gly Arg Val Tyr Val Ile Gly Cys Cys Ile
210 215 220
Ala Met Arg Lys Asp Asp Ile Pro Glu Arg Tyr Pro Met Lys Gln Lys
225 230 235 240
Phe Tyr Ala Glu Ala Asp Glu Trp Ile Asn Ile Gly Asp Ser Ala Ile
245 250 255
Val Asn Pro Glu Gly Gln Phe Ile Ala Gly Pro Val Arg Lys Gln Glu
260 265 270
Glu Ile Leu Tyr Ala Glu Ile Asn Pro Arg Met Val Gln Gly Pro Lys
275 280 285
Trp Met Leu Asp Val Ala Gly His Tyr Ala Arg Pro Asp Val Phe Gln
290 295 300
Leu Thr Val
305
<210>255
<211>1005
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>255
atgacaatgt ctaagaccaa atttagggtt gcagctgtgc aggcggcgcc ggttttcctt 60
gatcgggaag cgacgttgga taaagcttgt ggattgattg aggaggcggg ccgcaacggc 120
gccagcctcg tcgtcttccc tgagtcattc attccggcct accccgattg ggtttgggct 180
gtgccggcgg gcgaagaagc tttactcaat gaactgtacg cacaactgtt ggccaacgcc 240
gttgaaattc ccggcccggc cactcaacgt ttgagccagg cggctaaaaa ggctaaggtt 300
cacctggcta tgggcctgac cgaacgcaac agcgaggcca gcggcggcag cctttacaac 360
accttgctct atcttgaccc gcagggccac attctgggca agcatcgtaa gctggtgccc 420
accggcggtg agcggctggt ttgggcccag ggcgacggca gcactttgca agtttacgat 480
acgcctctgg gtaaactcag cggcctgatt tgctgggaaa attatatgcc gctggcgcgc 540
tacgcgctgt atgcctgggg tacgcaaatt tatattgcgg ccacctggga tcggggtgag 600
ccgtggcttt cgacgttgcg gcatattgcc aaagagggcc gggtgttggt catcggttgc 660
ggtatggcct tgcgcaaggc tgatattcct gatcattttg aattcaagca gcgcttttat 720
caaaacgccg ccgagtggat caacgggggc gacagcgcca ttgtcaaccc tgatggtgaa 780
tttattgctg gccccttaag cgagcaggaa ggcattttgt acgccgagat tgatccggcc 840
cagatgggcg ggccaaagtg gatgctcgac gtggccgggc attacgctcg cccggatgtg 900
tttgaactga cggtccatac cgccgcccga cccatgatca cctcgaaaaa ggatggccta 960
acacccgccg aggccgttac gcaagtaacg aaagcattat tgtaa 1005
<210>256
<211>334
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>256
Met Thr Met Ser Lys Thr Lys Phe Arg Val Ala Ala Val Gln Ala Ala
1 5 10 15
Pro Val Phe Leu Asp Arg Glu Ala Thr Leu Asp Lys Ala Cys Gly Leu
20 25 30
Ile Glu Glu Ala Gly Arg Asn Gly Ala Ser Leu Val Val Phe Pro Glu
35 40 45
Ser Phe Ile Pro Ala Tyr Pro Asp Trp Val Trp Ala Val Pro Ala Gly
50 55 60
Glu Glu Ala Leu Leu Asn Glu Leu Tyr Ala Gln Leu Leu Ala Asn Ala
65 70 75 80
Val Glu Ile Pro Gly Pro Ala Thr Gln Arg Leu Ser Gln Ala Ala Lys
85 90 95
Lys Ala Lys Val His Leu Ala Met Gly Leu Thr Glu Arg Asn Ser Glu
100 105 110
Ala Ser Gly Gly Ser Leu Tyr Asn Thr Leu Leu Tyr Leu Asp Pro Gln
115 120 125
Gly His Ile Leu Gly Lys His Arg Lys Leu Val Pro Thr Gly Gly Glu
130 135 140
Arg Leu Val Trp Ala Gln Gly Asp Gly Ser Thr Leu Gln Val Tyr Asp
145 150 155 160
Thr Pro Leu Gly Lys Leu Ser Gly Leu Ile Cys Trp Glu Asn Tyr Met
165 170 175
Pro Leu Ala Arg Tyr Ala Leu Tyr Ala Trp Gly Thr Gln Ile Tyr Ile
180 185 190
Ala Ala Thr Trp Asp Arg Gly Glu Pro Trp Leu Ser Thr Leu Arg His
195 200 205
Ile Ala Lys Glu Gly Arg Val Leu Val Ile Gly Cys Gly Met Ala Leu
210 215 220
Arg Lys Ala Asp Ile Pro Asp His Phe Glu Phe Lys Gln Arg Phe Tyr
225 230 235 240
Gln Asn Ala Ala Glu Trp Ile Asn Gly Gly Asp Ser Ala Ile Val Asn
245 250 255
Pro Asp Gly Glu Phe Ile Ala Gly Pro Leu Ser Glu Gln Glu Gly Ile
260 265 270
Leu Tyr Ala Glu Ile Asp Pro Ala Gln Met Gly Gly Pro Lys Trp Met
275 280 285
Leu Asp Val Ala Gly His Tyr Ala Arg Pro Asp Val Phe Glu Leu Thr
290 295 300
Val His Thr Ala Ala Arg Pro Met Ile Thr Ser Lys Lys Asp Gly Leu
305 310 315 320
Thr Pro Ala Glu Ala Val Thr Gln Val Thr Lys Ala Leu Leu
325 330
<210>257
<211>942
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>257
atgaacaaga tcgcgatcat tcagcgceca cccgtgctac tcgaccgcat cgccacgctg 60
gcggttgcgg tggagtcgat cgacgaagct gccgcagccg gtgcctcact gatcgttctt 120
ccagaaacct tcatccccgg ctacccgtcc tggatctggc gtctcgcgcc ggggagggat 180
ggtgcgctca ttgcccagtt gcatgcccga ctgctcgcca acgcggtcga tcttgcggct 240
ggagatctgg atgccctgtg tgaagttgcg cacggccacc gggtgaccgt ggtgtgcggc 300
ctcaacgaat gcgagcgcag tcgcggcggg ggcactctct acaacacggt cgtcgtgatc 360
gaccccgacg gcaagctgtg caatcgccac cgcaagctga tgccgaccaa cccggaacgc 420
atggtgcacg gtctgggtga tgcatcgggc ctgcgcgccg tcgacacccc ggtgggtcga 480
gtgggcgcac tcatctgctg ggaaaactat atgccgctgg cacgctacgc actttacgcc 540
gagggggtgg aagtctacgt ggcgcccacc tatgacagcg gcgatggctg gatcagtacg 600
atgcgtcata ttgcgcttga gggacgctgc tgggtgctgg gtagcggaac cgtactgcgt 660
ggcagcgacg tcccagaaga ctttccgtca cacctggacc tgtttcccga cgcggaggaa 720
tggatcaatc cgggcgactc ggtggtcgtc gatcctcagg gcaagatcgt cgcaggcccg 780
atgcgacgtg agacaggcat tctctacgca gaaatcgacg ccgaacgggt cgcgccttcg 840
cgccgcacgc tcgatgtcgc cggacactac gcccgcccgg atattttcga gctccatgtc 900
cgacgtacgc cggcgatgcc ggtccacgcc gttgatgcat ga 942
<210>258
<211>313
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>258
Met Asn Lys Ile Ala Ile Ile Gln Arg Pro Pro Val Leu Leu Asp Arg
1 5 10 15
Ile Ala Thr Leu Ala Val Ala Val Glu Ser Ile Asp Glu Ala Ala Ala
20 25 30
Ala Gly Ala Ser Leu Ile Val Leu Pro Glu Thr Phe Ile Pro Gly Tyr
35 40 45
Pro Ser Trp Ile Trp Arg Leu Ala Pro Gly Arg Asp Gly Ala Leu Ile
50 55 60
Ala Gln Leu His Ala Arg Leu Leu Ala Asn Ala Val Asp Leu Ala Ala
65 70 75 80
Gly Asp Leu Asp Ala Leu Cys Glu Val Ala His Gly His Arg Val Thr
85 90 95
Val Val Cys Gly Leu Asn Glu Cys Glu Arg Ser Arg Gly Gly Gly Thr
100 105 110
Leu Tyr Asn Thr Val Val Val Ile Asp Pro Asp Gly Lys Leu Cys Asn
115 120 125
Arg His Arg Lys Leu Met Pro Thr Asn Pro Glu Arg Met Val His Gly
130 135 140
Leu Gly Asp Ala Ser Gly Leu Arg Ala Val Asp Thr Pro Val Gly Arg
145 150 155 160
Val Gly Ala Leu Ile Cys Trp Glu Asn Tyr Met Pro Leu Ala Arg Tyr
165 170 175
Ala Leu Tyr Ala Glu Gly Val Glu Val Tyr Val Ala Pro Thr Tyr Asp
180 185 190
Ser Gly Asp Gly Trp Ile Ser Thr Met Arg His Ile Ala Leu Glu Gly
195 200 205
Arg Cys Trp Val Leu Gly Ser Gly Thr Val Leu Arg Gly Ser Asp Val
210 215 220
Pro Glu Asp Phe Pro Ser His Leu Asp Leu Phe Pro Asp Ala Glu Glu
225 230 235 240
Trp Ile Asn Pro Gly Asp Ser Val Val Val Asp Pro Gln Gly Lys Ile
245 250 255
Val Ala Gly Pro Met Arg Arg Glu Thr Gly Ile Leu Tyr Ala Glu Ile
260 265 270
Asp Ala Glu Arg Val Ala Pro Ser Arg Arg Thr Leu Asp Val Ala Gly
275 280 285
His Tyr Ala Arg Pro Asp Ile Phe Glu Leu His Val Arg Arg Thr Pro
290 295 300
Ala Met Pro Val His Ala Val Asp Ala
305 310
<210>259
<211>981
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>259
atggccatca tcaaagccgc cgccgtccag atcagtccgg tgctgtatag ccgcgaggga 60
accgtggaca aggtctgcca gcagatcatc gccctcggtc agcaaggcgt gcaatttgcg 120
gtctttccgg aaacggtggt gccgtactac ccctatttct ccttcgtcca gccgccgttc 180
gccatgggca aggaacacct gaaactgttg gaacaatcgg tgattgtgcc gtcggctgcc 240
accttggcga tcggcgaagc gtgcaaacaa gcgggaatgg tggtgtctat cggcgtcaat 300
gagcgcgatg gcggcacgat ctacaacgcc cagttgctgt ttgacgctga tggcagcttg 360
attcagcacc gtcgcaaaat aaccccgacg taccacgagc ggatgatctg gggtcaaggc 420
gacggctccg ggttgcgcgc catcgacagc gcggtcgggc gtattggctc gctggcctgc 480
tgggaacatt acaacccctt ggcccgctac gccctgatgg ccgacggcga gcagattcac 540
gcggctatgt ttcccggctc tctggtgggt gacatttttg ccgatcagat agaggtcact 600
attcgtcatc acgccttgga gtccggctgc ttcgtggtca actccaccgc gtggcttgat 660
gctgatcagc aaggccaaat catgcaggac accggttgca gcattggccc aatctcgggt 720
ggctgcttca cggccatcgt ttccccggaa ggcaaattac tcggcgaacc gctgcgttca 780
ggtgagggcg cagtcatcgc cgacctggac atggcattga tcgacaagcg caaacggatg 840
atggattccg tcggccatta cagccgccca gaactgctca gtttattgat cgaccgcacg 900
cccaccgctc atgtgcatga gcgcggcgcc catcaccttg ccgtagcctc tatcggggag 960
cttgaccatg caaaccaatg a 981
<210>260
<211>326
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>260
Met Ala Ile Ile Lys Ala Ala Ala Val Gln Ile Ser Pro Val Leu Tyr
1 5 10 15
Ser Arg Glu Gly Thr Val Asp Lys Val Cys Gln Gln Ile Ile Ala Leu
20 25 30
Gly Gln Gln Gly Val Gln Phe Ala Val Phe Pro Glu Thr Val Val Pro
35 40 45
Tyr Tyr Pro Tyr Phe Ser Phe Val Gln Pro Pro Phe Ala Met Gly Lys
50 55 60
Glu His Leu Lys Leu Leu Glu Gln Ser Val Ile Val Pro Ser Ala Ala
65 70 75 80
Thr Leu Ala Ile Gly Glu Ala Cys Lys Gln Ala Gly Met Val Val Ser
85 90 95
Ile Gly Val Asn Glu Arg Asp Gly Gly Thr Ile Tyr Asn Ala Gln Leu
100 105 110
Leu Phe Asp Ala Asp Gly Ser Leu Ile Gln His Arg Arg Lys Ile Thr
115 120 125
Pro Thr Tyr His Glu Arg Met Ile Trp Gly Gln Gly Asp Gly Ser Gly
130 135 140
Leu Arg Ala Ile Asp Ser Ala Val Gly Arg Ile Gly Ser Leu Ala Cys
145 150 155 160
Trp Glu His Tyr Asn Pro Leu Ala Arg Tyr Ala Leu Met Ala Asp Gly
165 170 175
Glu Gln Ile His Ala Ala Met Phe Pro Gly Ser Leu Val Gly Asp Ile
180 185 190
Phe Ala Asp Gln Ile Glu Val Thr Ile Arg His His Ala Leu Glu Ser
195 200 205
Gly Cys Phe Val Val Asn Ser Thr Ala Trp Leu Asp Ala Asp Gln Gln
210 215 220
Gly Gln Ile Met Gln Asp Thr Gly Cys Ser Ile Gly Pro Ile Ser Gly
225 230 235 240
Gly Cys Phe Thr Ala Ile Val Ser Pro Glu Gly Lys Leu Leu Gly Glu
245 250 255
Pro Leu Arg Ser Gly Glu Gly Ala Val Ile Ala Asp Leu Asp Met Ala
260 265 270
Leu Ile Asp Lys Arg Lys Arg Met Met Asp Ser Val Gly His Tyr Ser
275 280 285
Arg Pro Glu Leu Leu Ser Leu Leu Ile Asp Arg Thr Pro Thr Ala His
290 295 300
Val His Glu Arg Gly Ala His His Leu Ala Val Ala Ser Ile Gly Glu
305 310 315 320
Leu Asp His Ala Asn Gln
325
<210>261
<211>1014
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>261
atgggtctgg ttcatcagaa atacaaggta gcggtggttc aggcggcacc ggtttttctc 60
gatctcgatg caaccgtgga caagaccatt gcgctgatcg aggaagcttc cgcgcaaggc 120
gcaaaactgg tcgcgtttcc cgagaccttc attcccggat atccgtggca gatctggctc 180
ggcgcgccgg cctgggcgat cggccgcggc tttgtgcagc gctacttcga caattcgctg 240
ggcttcgaca gcccgcaggc ggaaaaaatc cgccaggccg tgaagcgcgc caagctgacc 300
gcggtgcttg ggctatccga acgcgacggc ggcagcctct atatcgcgca gtggctgatc 360
ggccctgacg gcgagaccat cgccaagcgc cggaaactgc gtccgaccca tgccgaacgc 420
accgtgttcg gcgaaggcga cggcagcgat cttgcggtcc acgatcgcgc cgacgtcggg 480
cgcttgggcg cactgtgctg ctgggagcac ctgcagccgc tatcgaaata cgcgatgtac 540
gcccagaacg aacaggtgca tgtcggcgcc tggccgagct tttcgctgta cgatccgttc 600
gcacatgcgc tcggccatga ggtcaacaac gccgccagca aggtctatgc ggtcgaaggc 660
tcctgcttct tcctgggtcc ctgcgctgtc gtctcgcagg cgatgatcga cgagctctgc 720
gactctcccg agaaacatgc cttcctgcat gtcggcggcg gccacgcggt gatctacggc 780
ccggacggca gttcgctggc cgagaaactc ccgcccgacc aggaaggcat tctgtacgcc 840
gatatcgatc tcggcatgat cggggtcgcg aagaacgccg ccgatccggc cgggcattat 900
tcgcggcccg acgtcacgcg gctgttgctc aacaccacgc gcgccaaccg cgtcgagcat 960
ttctcgcttc ccgtcgatgc cgaggtcatg agcgaaatca ggctgcaggc ctga 1014
<210>262
<211>337
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>262
Met Gly Leu Val His Gln Lys Tyr Lys Val Ala Val Val Gln Ala Ala
1 5 10 15
Pro Val Phe Leu Asp Leu Asp Ala Thr Val Asp Lys Thr Ile Ala Leu
20 25 30
Ile Glu Glu Ala Ser Ala Gln Gly Ala Lys Leu Val Ala Phe Pro Glu
35 40 45
Thr Phe Ile Pro Gly Tyr Pro Trp Gln Ile Trp Leu Gly Ala Pro Ala
50 55 60
Trp Ala Ile Gly Arg Gly Phe Val Gln Arg Tyr Phe Asp Asn Ser Leu
65 70 75 80
Gly Phe Asp Ser Pro Gln Ala Glu Lys Ile Arg Gln Ala Val Lys Arg
85 90 95
Ala Lys Leu Thr Ala Val Leu Gly Leu Ser Glu Arg Asp Gly Gly Ser
100 105 110
Leu Tyr Ile Ala Gln Trp Leu Ile Gly Pro Asp Gly Glu Thr Ile Ala
115 120 125
Lys Arg Arg Lys Leu Arg Pro Thr His Ala Glu Arg Thr Val Phe Gly
130 135 140
Glu Gly Asp Gly Ser Asp Leu Ala Val His Asp Arg Ala Asp Val Gly
145 150 155 160
Arg Leu Gly Ala Leu Cys Cys Trp Glu His Leu Gln Pro Leu Ser Lys
165 170 175
Tyr Ala Met Tyr Ala Gln Asn Glu Gln Val His Val Gly Ala Trp Pro
180 185 190
Ser Phe Ser Leu Tyr Asp Pro Phe Ala His Ala Leu Gly His Glu Val
195 200 205
Asn Asn Ala Ala Ser Lys Val Tyr Ala Val Glu Gly Ser Cys Phe Phe
210 215 220
Leu Gly Pro Cys Ala Val Val Ser Gln Ala Met Ile Asp Glu Leu Cys
225 230 235 240
Asp Ser Pro Glu Lys His Ala Phe Leu His Val Gly Gly Gly His Ala
245 250 255
Val Ile Tyr Gly Pro Asp Gly Ser Ser Leu Ala Glu Lys Leu Pro Pro
260 265 270
Asp Gln Glu Gly Ile Leu Tyr Ala Asp Ile Asp Leu Gly Met Ile Gly
275 280 285
Val Ala Lys Asn Ala Ala Asp Pro Ala Gly His Tyr Ser Arg Pro Asp
290 295 300
Val Thr Arg Leu Leu Leu Asn Thr Thr Arg Ala Asn Arg Val Glu His
305 310 315 320
Phe Ser Leu Pro Val Asp Ala Glu Val Met Ser Glu Ile Arg Leu Gln
325 330 335
Ala
<210>263
<211>1014
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>263
atgaaagtcg tcaaagcggc agcggttcag ttgagccctg tcctctacag ccgcgaggca 60
accgtcgcga aggtcgtgcg gaagatccac gagcttgggc agcagggcgt gcagttcgcc 120
accttcccgg aaaccgttgt gccgtactac ccgtatttct ccgcggtcca gacgccgatg 180
cagcttctgg ctggaaccga gcatctgaaa ttgctcgacc aggccgtgac ggtgccgtct 240
cccgcgaccg acgcgatcgg cgaggcagcc cggaaggcgg gcatggtggt gtccatcggc 300
gtcaacgagc gtgatggtgg aaccctgtac aacacccaat tgctcttcga cgccgatggc 360
accctgatcc agcgccgccg caagatcacg cccacccatt tcgagcgcat gatctggggc 420
cagggtgacg ggtcgggcct gcgtgccgtc gacagcaagg tcggccgcat tggccagctg 480
gcatgcttcg agcacaacaa tcctctggcg cgctacgcga tgatggccga cggcgagcag 540
atccattcgg ccatgtatcc gggttctgcc ttcggcgagg gctttgccca gaggatggaa 600
atcaatatcc gccagcacgc actggagtcc gggtgcttcg tcgtgaacgc gacggcctgg 660
ctggatgccg accagcaggc gcaaatcatg aaagacacgg gctgcgggat cggtccgatc 720
tcgggcggtt gcttcaccac gatcgtggca cccgacggca cgctgctggg ggaacctctg 780
cgctcgggcg agggcgaggt catcgccgat ctcgatttca cggagatcga ccggcgcaag 840
atgctgatgg actcggcagg ccactacaac cgtccggaac tgctcagtct gctgategac 900
cgcacgccga ccgcaaacgt gcacgaacgg atggcgcatc cccaagcgag cacgaagcag 960
ccgcgctccg gcgatctgcc cgctgcgctg gctggcgcgc aggagatcct gtga 1014
<210>264
<211>337
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>264
Met Lys Val Val Lys Ala Ala Ala Val Gln Leu Ser Pro Val Leu Tyr
1 5 10 15
Ser Arg Glu Ala Thr Val Ala Lys Val Val Arg Lys Ile His Glu Leu
20 25 30
Gly Gln Gln Gly Val Gln Phe Ala Thr Phe Pro Glu Thr Val Val Pro
35 40 45
Tyr Tyr Pro Tyr Phe Ser Ala Val Gln Thr Pro Met Gln Leu Leu Ala
50 55 60
Gly Thr Glu His Leu Lys Leu Leu Asp Gln Ala Val Thr Val Pro Ser
65 70 75 80
Pro Ala Thr Asp Ala Ile Gly Glu Ala Ala Arg Lys Ala Gly Met Val
85 90 95
Val Ser Ile Gly Val Asn Glu Arg Asp Gly Gly Thr Leu Tyr Asn Thr
100 105 110
Gln Leu Leu Phe Asp Ala Asp Gly Thr Leu Ile Gln Arg Arg Arg Lys
115 120 125
Ile Thr Pro Thr His Phe Glu Arg Met Ile Trp Gly Gln Gly Asp Gly
130 135 140
Ser Gly Leu Arg Ala Val Asp Ser Lys Val Gly Arg Ile Gly Gln Leu
145 150 155 160
Ala Cys Phe Glu His Asn Asn Pro Leu Ala Arg Tyr Ala Met Met Ala
165 170 175
Asp Gly Glu Gln Ile His Ser Ala Met Tyr Pro Gly Ser Ala Phe Gly
180 185 190
Glu Gly Phe Ala Gln Arg Met Glu Ile Asn Ile Arg Gln His Ala Leu
195 200 205
Glu Ser Gly Cys Phe Val Val Asn Ala Thr Ala Trp Leu Asp Ala Asp
210 215 220
Gln Gln Ala Gln Ile Met Lys Asp Thr Gly Cys Gly Ile Gly Pro Ile
225 230 235 240
Ser Gly Gly Cys Phe Thr Thr Ile Val Ala Pro Asp Gly Thr Leu Leu
245 250 255
Gly Glu Pro Leu Arg Ser Gly Glu Gly Glu Val Ile Ala Asp Leu Asp
260 265 270
Phe Thr Glu Ile Asp Arg Arg Lys Met Leu Met Asp Ser Ala Gly His
275 280 285
Tyr Asn Arg Pro Glu Leu Leu Ser Leu Leu Ile Asp Arg Thr Pro Thr
290 295 300
Ala Asn Val His Glu Arg Met Ala His Pro Gln Ala Ser Thr Lys Gln
305 310 315 320
Pro Arg Ser Gly Asp Leu Pro Ala Ala Leu Ala Gly Ala Gln Glu Ile
325 330 335
Leu
<210>265
<211>999
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>265
atgcttaatc taggcatagt ccagatgaac gcagagccgc tcaacgtgga aggcaacctg 60
ctcaaggcgg agcgctatgt cgcgaagtgc gccgcggacg gcgcccaact cgtggtgctg 120
ccggagatgt tcaacgtcgg cttccacctc ggcgagtccc tgatgatggt cgccgagccc 180
ctggacggca agaccgtgca gtggctgcaa cggcaggcgt ccacccataa catatatatc 240
accgggagct tatacgagcg ttacgacgag catttctaca acaccatggt catggtggga 300
tacgacggca gcgtgcagta ctaccgcaag cgcaatccta cctggtccga gtcggcggtg 360
tggcgccgca gcgaggtgcc aggccccggt atattcgata ccccgttcgg gcgcatcggg 420
ggcgtcatct gcttcgattc cttcgcgcgc gagacccacg agggcttcaa gcagagcggg 480
gtcgaggcgg tggtaatcat cgccctgtgg ggcgccaacc gtgcgcgggc attcttctgg 540
cgcccggacc tcctgctaag ccgggaaggg ctggtccgtt ggtcccggct ggcctcggag 600
gacgtccccc gaaatcacgc gaaagagctc ggggtcccgg tcgccttcgt caaccagagt 660
ggcaccatcc gcatgaccag ccccatccct ttccccgact ggccggtgca gagctccttc 720
tacgacttca tcggcaagtc ccacgtccgg gacgcatccg gagaggtgat cgcgagggtg 780
gacgaggggg agatcgactc ctgcctggtg gtcccggtag aggtcgagca ggcgcagagc 840
aggccggaga tcaggaagtc aaatatatcg cccggctacc tcggcaagga ttactatttc 900
gtggagccgc cgcttatctg caagctcttc caggtctggt tcctcagcgg cctggtgccc 960
accgaatacg aggcgcggcg tctgcgccac ctgttctga 999
<210>266
<211>332
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>266
Met Leu Asn Leu Gly Ile Val Gln Met Asn Ala Glu Pro Leu Asn Val
1 5 10 15
Glu Gly Asn Leu Leu Lys Ala Glu Arg Tyr Val Ala Lys Cys Ala Ala
20 25 30
Asp Gly Ala Gln Leu Val Val Leu Pro Glu Met Phe Asn Val Gly Phe
35 40 45
His Leu Gly Glu Ser Leu Met Met Val Ala Glu Pro Leu Asp Gly Lys
50 55 60
Thr Val Gln Trp Leu Gln Arg Gln Ala Ser Thr His Asn Ile Tyr Ile
65 70 75 80
Thr Gly Ser Leu Tyr Glu Arg Tyr Asp Glu His Phe Tyr Asn Thr Met
85 90 95
Val Met Val Gly Tyr Asp Gly Ser Val Gln Tyr Tyr Arg Lys Arg Asn
100 105 110
Pro Thr Trp Ser Glu Ser Ala Val Trp Arg Arg Ser Glu Val Pro Gly
115 120 125
Pro Gly Ile Phe Asp Thr Pro Phe Gly Arg Ile Gly Gly Val Ile Cys
130 135 140
Phe Asp Ser Phe Ala Arg Glu Thr His Glu Gly Phe Lys Gln Ser Gly
145 150 155 160
Val Glu Ala Val Val Ile Ile Ala Leu Trp Gly Ala Asn Arg Ala Arg
165 170 175
Ala Phe Phe Trp Arg Pro Asp Leu Leu Leu Ser Arg Glu Gly Leu Val
180 185 190
Arg Trp Ser Arg Leu Ala Ser Glu Asp Val Pro Arg Asn His Ala Lys
195 200 205
Glu Leu Gly Val Pro Val Ala Phe Val Asn Gln Ser Gly Thr Ile Arg
210 215 220
Met Thr Ser Pro Ile Pro Phe Pro Asp Trp Pro Val Gln Ser Ser Phe
225 230 235 240
Tyr Asp Phe Ile Gly Lys Ser His Val Arg Asp Ala Ser Gly Glu Val
245 250 255
Ile Ala Arg Val Asp Glu Gly Glu Ile Asp Ser Cys Leu Val Val Pro
260 265 270
Val Glu Val Glu Gln Ala Gln Ser Arg Pro Glu Ile Arg Lys Ser Asn
275 280 285
Ile Ser Pro Gly Tyr Leu Gly Lys Asp Tyr Tyr Phe Val Glu Pro Pro
290 295 300
Leu Ile Cys Lys Leu Phe Gln Val Trp Phe Leu Ser Gly Leu Val Pro
305 310 315 320
Thr Glu Tyr Glu Ala Arg Arg Leu Arg His Leu Phe
325 330
<210>267
<211>1038
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>267
atgggcatta cgcatccgaa gttcaaggcc gcggcggtac aggcggcgcc gggctttctc 60
gacagcgagg ccaccgtcga caagacgatc cgcctgatgc aggaagcggc ggaccacggc 120
gcctcgctga tcgtctttcc ggaagcctgg ctgcccggtt atccgtggtg gatctggctc 180
ggtccgcccg cctggggcat gcagttcgtg cagcgctact tcgacaattc gccgagcgtc 240
ggcgatgatc ttttccgccg gatcgagcgc gcggccgcca aggcgaagat cgaagtggtc 300
ctcggtctca gcgagcgcgc tgccggctcg ctgtacctcg cgcaggcgtt catctcctca 360
acgggcgaga cgcgcgcagt gcgccgcaag ttgcgaccaa cgcacgtcga gcgaaccgtt 420
ttcggcgagg gcgatggcag cgacttcaag gtgttcgaca ctccgctggg ccgcgtcggt 480
ggtctcttgt gctgggaaca cctgcaaccg ctgtcgcgct acgcgatgtt ctcgatgaac 540
gagcaggtgc acgccgccgc ctggccgacg ttcagcctct acacggattt tgcccatgcc 600
ctcggccacg aactgaatct cgcagccagc gctacttacg cggctgaagg gcagtgctac 660
gtgattgccg cctgtggcgt ggtcacgcag gagatgctgg atctgatgaa ggcgccgtgc 720
cccccggaat atctgcgggt cggcggcgga tacgccatga tctttgcgcc cgacggacgg 780
cgcattgcgg cggcgctgcc gccggaacaa gaagggctga tttacgccga catcgatctt 840
tcgatgatct ctctcgccaa ggcggctgcc gatcccaccg gtcactactc gcggccggat 900
gtcgtgcggc ttatgctgaa taccgaaccg atgcagcggg tcgaaaagct gcagccgccg 960
ctggactcag ccgcgcgccg tgagaatgaa ccggcacgcg agaccgcagc ggcgaccgag 1020
agccgccagc cccagtaa 1038
<210>268
<211>344
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>268
Met Gly Ile Thr His Pro Lys Phe Lys Ala Ala Ala Val Gln Ala Ala
1 5 10 15
Pro Gly Phe Leu Asp Ser Glu Ala Thr Val Asp Lys Thr Ile Arg Leu
20 25 30
Met Gln Glu Ala Ala Asp His Gly Ala Ser Leu Ile Val Phe Pro Glu
35 40 45
Ala Trp Leu Pro Gly Tyr Pro Trp Trp Ile Trp Leu Gly Pro Pro Ala
50 55 60
Trp Gly Met Gln Phe Val Gln Arg Tyr Phe Asp Asn Ser Pro Ser Val
65 70 75 80
Gly Asp Asp Leu Phe Arg Arg Ile Glu Arg Ala Ala Ala Lys Ala Lys
85 90 95
Ile Glu Val Val Leu Gly Leu Ser Glu Arg Ala Ala Gly Ser Leu Tyr
100 105 110
Leu Ala Gln Ala Phe Ile Ser Ser Thr Gly Glu Thr Arg Ala Val Arg
115 120 125
Arg Lys Leu Arg Pro Thr His Val Glu Arg Thr Val Phe Gly Glu Gly
130 135 140
Asp Gly Ser Asp Phe Lys Val Phe Asp Thr Pro Leu Gly Arg Val Gly
145 150 155 160
Gly Leu Leu Cys Trp Glu His Leu Gln Pro Leu Ser Arg Tyr Ala Met
165 170 175
Phe Ser Met Asn Glu Gln Val His Ala Ala Ala Trp Pro Thr Phe Ser
180 185 190
Leu Tyr Thr Asp Phe Ala His Ala Leu Gly His Glu Leu Asn Leu Ala
195 200 205
Ala Ser Ala Thr Tyr Ala Ala Glu Gly Gln Cys Tyr Val Ile Ala Ala
210 215 220
Cys Gly Val Val Thr Gln Glu Met Leu Asp Leu Met Lys Ala Pro Cys
225 230 235 240
Pro Pro Glu Tyr Leu Arg Val Gly Gly Gly Tyr Ala Met Ile Phe Ala
245 250 255
Pro Asp Gly Arg Arg Ile Ala Ala Ala Leu Pro Pro Glu Gln Glu Gly
260 265 270
Leu Ile Tyr Ala Asp Ile Asp Leu Ser Met Ile Ser Leu Ala Lys Ala
275 280 285
Ala Ala Asp Pro Thr Gly His Tyr Ser Arg Pro Asp Val Val Arg Leu
290 295 300
Met Leu Asn Thr Glu Pro Met Gln Arg Val Glu Lys Leu Gln Pro Pro
305 310 315 320
Leu Asp Ser Ala Ala Arg Arg Glu Asn Glu Pro Ala Arg Glu Thr Ala
325 330 335
Ala Ala Thr Glu Ser Arg Gln Pro
340
<210>269
<211>1014
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>269
atgtctgaaa cagccttcaa gatcgcggtc gtacaggcgg ctccggtttt tctcgacgca 60
aaggcgacgg tggacaaggc gatcggtctg atggccgaag ccggcgccaa gggcgccaag 120
ctgctcgcat tcccggaagt attcatcccc ggctaccctt ggtggctgtg gctgggcaca 180
ccggcatggg gcatgcagtt tgttgccaag tatcacgcga actcgcttcg tgcagacggg 240
cctgaattgg cagccctcgc ggcggcggcg gcgaagtccg atatcaatgc cgtcatcggc 300
ttctcggaga tcgacggcgg ttccctctac atcagccagg cgctcatcag cgacaagggc 360
gagataatgt tcaaacggcg caagctgaag ccgacgcacg tcgaacgcac gttgttcggc 420
gaaggggacg ggtccgactt ccaggtcgtg gacacgagcg tcggcaggct cggtgccttg 480
tgttgcgccg aacacataca gccgctgtcg aagtacgcga tgtactccat gcacgaacag 540
gtgcacgtcg cctcctggcc gtcatttact ttgtaccgcg gcacggcata tgccttgggc 600
cacgaggtca atctggccgc gagccagatt tatgcgctcg agggaggctg tttcgtcctt 660
catgcgagcg ccatcaccgg ccaggacatg tttgacgtgc tgtgcgacac tccggagagg 720
acgcaactgc tgaactccga cggcggcaag gtcggcggcg gctactcgat gatcttcggt 780
cccgatggcc agccccttgt tgggcatctg cctcaagaca ccgagggaat actctacgca 840
gatattgacc tggcgaacat ttccgttgcc aaagcggcct acgacccgtc cggacactat 900
gcgcgcggag acgtggtgcg cttgatggtc aatcgcaacc cgcggcatac gagtgttgcg 960
ttcggcgggg gcgccggcga ggcagcaacc tggacggaag caaaagcgga gtga 1014
<210>270
<211>337
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>270
Met Ser Glu Thr Ala Phe Lys Ile Ala Val Val Gln Ala Ala Pro Val
1 5 10 15
Phe Leu Asp Ala Lys Ala Thr Val Asp Lys Ala Ile Gly Leu Met Ala
20 25 30
Glu Ala Gly Ala Lys Gly Ala Lys Leu Leu Ala Phe Pro Glu Val Phe
35 40 45
Ile Pro Gly Tyr Pro Trp Trp Leu Trp Leu Gly Thr Pro Ala Trp Gly
50 55 60
Met Gln Phe Val Ala Lys Tyr His Ala Asn Ser Leu Arg Ala Asp Gly
65 70 75 80
Pro Glu Leu Ala Ala Leu Ala Ala Ala Ala Ala Lys Ser Asp Ile Asn
85 90 95
Ala Val Ile Gly Phe Ser Glu Ile Asp Gly Gly Ser Leu Tyr Ile Ser
100 105 110
Gln Ala Leu Ile Ser Asp Lys Gly Glu Ile Met Phe Lys Arg Arg Lys
115 120 125
Leu Lys Pro Thr His Val Glu Arg Thr Leu Phe Gly Glu Gly Asp Gly
130 135 140
Ser Asp Phe Gln Val Val Asp Thr Ser Val Gly Arg Leu Gly Ala Leu
145 150 155 160
Cys Cys Ala Glu His Ile Gln Pro Leu Ser Lys Tyr Ala Met Tyr Ser
165 170 175
Met His Glu Gln Val His Val Ala Ser Trp Pro Ser Phe Thr Leu Tyr
180 185 190
Arg Gly Thr Ala Tyr Ala Leu Gly His Glu Val Asn Leu Ala Ala Ser
195 200 205
Gln Ile Tyr Ala Leu Glu Gly Gly Cys Phe Val Leu His Ala Ser Ala
210 215 220
Ile Thr Gly Gln Asp Met Phe Asp Val Leu Cys Asp Thr Pro Glu Arg
225 230 235 240
Thr Gln Leu Leu Asn Ser Asp Gly Gly Lys Val Gly Gly Gly Tyr Ser
245 250 255
Met Ile Phe Gly Pro Asp Gly Gln Pro Leu Val Gly His Leu Pro Gln
260 265 270
Asp Thr Glu Gly Ile Leu Tyr Ala Asp Ile Asp Leu Ala Asn Ile Ser
275 280 285
Val Ala Lys Ala Ala Tyr Asp Pro Ser Gly His Tyr Ala Arg Gly Asp
290 295 300
Val Val Arg Leu Met Val Asn Arg Asn Pro Arg His Thr Ser Val Ala
305 310 315 320
Phe Gly Gly Gly Ala Gly Glu Ala Ala Thr Trp Thr Glu Ala Lys Ala
325 330 335
Glu
<210>271
<211>966
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>271
atgtccagcg agaataacaa cgctacattc aaagttgccg cagttcaggc cacacctgtg 60
tatcttgatc gtgaagcaac catcgacaag gcttgcgagt tgatcgctac tgctggcagc 120
gaaggagctc gcctgattat ctttccagaa gcgttcatcc caacctatcc tgagtgggta 180
tggggtattc cttctggtga gcaaggttta ctcaacgagc tctattcaga gttgctcacc 240
aattcggtca cgattcccag cgacgcgact gacagactgt gcgaggccgc gaagcttgct 300
aatgcctacg tggtgatggg aatgagtgaa cggaatgtcg aagcgagtgg tgcaagcctg 360
tataatacgc tcttgtacat agatgcgcag ggggagattt tagggaaaca tcgaaagttg 420
gtaccaacgg gcggtgagcg cctggtatgg gcgcaaggtg atggcagcac gctgcaggtc 480
tacgatactc cattgggaaa actcggtggt ttaatttgct gggaaaatta tatgccactg 540
gcacgctacg ctatgtatgc ctggggtaca caaatctatg tcgcagcaac gtgggatcgc 600
ggccaaccct ggctctcaac gttacggcat attgccaaag aaggcagggt atacgtaatt 660
ggttgctgta ttgcgatgcg taaagatgat attccagatc gttactccat gaagcagaag 720
tattacgctg aaatggagga atggattaat attggtgaca gcgcgattgt caatcccgaa 780
ggacacttta ttgcagggcc tgtgcgcaag caagaagaaa ttctttacgc ggagatcgat 840
ccacgcatgg tgcaaggccc gaagtggatg ctcgatgtgg ctgggcacta tgcgagacca 900
gatgtgttcc agttgacggt gcatacggat gtgaggcaga tgattcgggt ggaacatgat 960
tcataa 966
<210>272
<211>321
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>272
Met Ser Ser Glu Asn Asn Asn Ala Thr Phe Lys Val Ala Ala Val Gln
1 5 10 15
Ala Thr Pro Val Tyr Leu Asp Arg Glu Ala Thr Ile Asp Lys Ala Cys
20 25 30
Glu Leu Ile Ala Thr Ala Gly Ser Glu Gly Ala Arg Leu Ile Ile Phe
35 40 45
Pro Glu Ala Phe Ile Pro Thr Tyr Pro Glu Trp Val Trp Gly Ile Pro
50 55 60
Ser Gly Glu Gln Gly Leu Leu Asn Glu Leu Tyr Ser Glu Leu Leu Thr
65 70 75 80
Asn Ser Val Thr Ile Pro Ser Asp Ala Thr Asp Arg Leu Cys Glu Ala
85 90 95
Ala Lys Leu Ala Asn Ala Tyr Val Val Met Gly Met Ser Glu Arg Asn
100 105 110
Val Glu Ala Ser Gly Ala Ser Leu Tyr Asn Thr Leu Leu Tyr Ile Asp
115 120 125
Ala Gln Gly Glu Ile Leu Gly Lys His Arg Lys Leu Val Pro Thr Gly
130 135 140
Gly Glu Arg Leu Val Trp Ala Gln Gly Asp Gly Ser Thr Leu Gln Val
145 150 155 160
Tyr Asp Thr Pro Leu Gly Lys Leu Gly Gly Leu Ile Cys Trp Glu Asn
165 170 175
Tyr Met Pro Leu Ala Arg Tyr Ala Met Tyr Ala Trp Gly Thr Gln Ile
180 185 190
Tyr Val Ala Ala Thr Trp Asp Arg Gly Gln Pro Trp Leu Ser Thr Leu
195 200 205
Arg His Ile Ala Lys Glu Gly Arg Val Tyr Val Ile Gly Cys Cys Ile
210 215 220
Ala Met Arg Lys Asp Asp Ile Pro Asp Arg Tyr Ser Met Lys Gln Lys
225 230 235 240
Tyr Tyr Ala Glu Met Glu Glu Trp Ile Asn Ile Gly Asp Ser Ala Ile
245 250 255
Val Asn Pro Glu Gly His Phe Ile Ala Gly Pro Val Arg Lys Gln Glu
260 265 270
Glu Ile Leu Tyr Ala Glu Ile Asp Pro Arg Met Val Gln Gly Pro Lys
275 280 285
Trp Met Leu Asp Val Ala Gly His Tyr Ala Arg Pro Asp Val Phe Gln
290 295 300
Leu Thr Val His Thr Asp Val Arg Gln Met Ile Arg Val Glu His Asp
305 310 315 320
Ser
<210>273
<211>1023
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>273
atgaataact caccaccaac catccgcgct gctgccattc agctcagtcc agttctgttt 60
agtcgggatg ggactaccga gaaggtgttg caggcgatcg ctagtgctgc caaggaaggg 120
gcacaactgg ttgtttttcc agaaactttc atcccctact acccctattt ctcattcatt 180
caaccccctg tgttgatggg caaagagcac atgcggctct atgaggaagc cgtgacggtc 240
ccgggtccgg tgacagatgc ggtcagtcga gcagcccgtt cttacggcat ggtggtagtg 300
ctgggggtga atgagcgaga tggtggctca atttacaata cacagttgat tttcgatgct 360
gacggcacat tgttgctgaa gcgacgcaaa atcaccccta cctatcatga gcgcatggtc 420
tgggggcagg gagacggtgc tggattgaag gtattggata cagcagtcgg taaggtgggt 480
gcgctggcat gttgggaaca ttacaatccc ctggcacgat ttgcgctgat ggcacagcat 540
gagcagattc actgcgctca gttccccggt tctctggtgg gacaaatttt cactgatcag 600
attgaggtaa cgattcggca tcatgcgttg gaatcgggtt gttttgtggt gaatgctact 660
ggctggctct ctccagaaca ggtggcacaa atcaccacgg atgaaaagtt gcaacgggtg 720
ctgagtggcg ggtgtaatac cgccattatt ggacctgaag gcaatcatct ctgtcctccc 780
attaccgatg gtgagggcat agcgatcgcc gatctcgact tctcactaat caccaaacgc 840
aaacgcatga tggattgcgt cggtcactac tcccgccctg acttgttgaa gctgcaactc 900
aatgcaacgg catggtcggt gctggctggg gagcaggggg caggtgccag ggagcagggg 960
ctaggtgtgc cggatgccat gctgtctacg cctaagccag aatactcaac actggatcag 1020
tag 1023
<210>274
<211>340
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>274
Met Asn Asn Ser Pro Pro Thr Ile Arg Ala Ala Ala Ile Gln Leu Ser
1 5 10 15
Pro Val Leu Phe Ser Arg Asp Gly Thr Thr Glu Lys Val Leu Gln Ala
20 25 30
Ile Ala Ser Ala Ala Lys Glu Gly Ala Gln Leu Val Val Phe Pro Glu
35 40 45
Thr Phe Ile Pro Tyr Tyr Pro Tyr Phe Ser Phe Ile Gln Pro Pro Val
50 55 60
Leu Met Gly Lys Glu His Met Arg Leu Tyr Glu Glu Ala Val Thr Val
65 70 75 80
Pro Gly Pro Val Thr Asp Ala Val Ser Arg Ala Ala Arg Ser Tyr Gly
85 90 95
Met Val Val Val Leu Gly Val Asn Glu Arg Asp Gly Gly Ser Ile Tyr
100 105 110
Asn Thr Gln Leu Ile Phe Asp Ala Asp Gly Thr Leu Leu Leu Lys Arg
115 120 125
Arg Lys Ile Thr Pro Thr Tyr His Glu Arg Met Val Trp Gly Gln Gly
130 135 140
Asp Gly Ala Gly Leu Lys Val Leu Asp Thr Ala Val Gly Lys Val Gly
145 150 155 160
Ala Leu Ala Cys Trp Glu His Tyr Asn Pro Leu Ala Arg Phe Ala Leu
165 170 175
Met Ala Gln His Glu Gln Ile His Cys Ala Gln Phe Pro Gly Ser Leu
180 185 190
Val Gly Gln Ile Phe Thr Asp Gln Ile Glu Val Thr Ile Arg His His
195 200 205
Ala Leu Glu Ser Gly Cys Phe Val Val Asn Ala Thr Gly Trp Leu Ser
210 215 220
Pro Glu Gln Val Ala Gln Ile Thr Thr Asp Glu Lys Leu Gln Arg Val
225 230 235 240
Leu Ser Gly Gly Cys Asn Thr Ala Ile Ile Gly Pro Glu Gly Asn His
245 250 255
Leu Cys Pro Pro Ile Thr Asp Gly Glu Gly Ile Ala Ile Ala Asp Leu
260 265 270
Asp Phe Ser Leu Ile Thr Lys Arg Lys Arg Met Met Asp Cys Val Gly
275 280 285
His Tyr Ser Arg Pro Asp Leu Leu Lys Leu Gln Leu Asn Ala Thr Ala
290 295 300
Trp Ser Val Leu Ala Gly Glu Gln Gly Ala Gly Ala Arg Glu Gln Gly
305 310 315 320
Leu Gly Val Pro Asp Ala Met Leu Ser Thr Pro Lys Pro Glu Tyr Ser
325 330 335
Thr Leu Asp Gln
340
<210>275
<211>849
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>275
atggagacgg ctcacaaagc aaaggtcgat ttccttgtgc tgggtgagac gtggctctca 60
ggctacccgg cttggctgga ccactgcccc gatgttggcc ggtgggatta tgaaccgatg 120
aaaaaagtgt atttgagatt tcgacaaagt gctatttctg ttcctggcaa agaatttgat 180
ttccttactg gcctctgtaa aaaatattca caaacgcttg ccatcggtgt taatgagaaa 240
gtagatcatg gggtaggtaa tggtaccatt tataattcat ttctactgat tgattctgat 300
ggaacactgt tgaatcatca tcgcaagtta gttcccactt ttactgagaa attattatac 360
ggccatggag atggccatgg gctgaagtcg atggatactt cggtgggaag aatcggaggg 420
agcatttgtt gggaacattg gatgccacta tgcagacaag cacttcatga tgcaggtgag 480
caaatccatg ttgccctttg gccgactgtt catgacatcc atcaagtggc aagtagaagc 540
tatgcatttg aagggcgctg ctttgtattg gctgccgggc agatttttgc tgctaaagat 600
tttccaaagg aacttgtctt accagactat ctaaagcaaa atccggatca gctcattttg 660
aatgggggga gctgcgtgat cggccctgat gggaaatatt tgattgagcc cgtgtttgat 720
cgggaagaac tgattgtgtg tgaacttgac cttgacgaag cttataaaga aagaatgacg 780
atggacgttt caggtcacta ccaaagacga gacgttttca gttttgacgt gaaccaacat 840
cgacattga 849
<210>276
<211>310
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>276
Met Thr Lys Leu Lys Ile Ala Ile Gly Gln Phe Ser Ser Asn His Leu
1 5 10 15
Asp Leu Lys Cys Ser Leu Glu Lys Leu Glu Lys Ile Met Glu Thr Ala
20 25 30
His Lys Ala Lys Val Asp Phe Leu Val Leu Gly Glu Thr Trp Leu Ser
35 40 45
Gly Tyr Pro Ala Trp Leu Asp His Cys Pro Asp Val Gly Arg Trp Asp
50 55 60
Tyr Glu Pro Met Lys Lys Val Tyr Leu Arg Phe Arg Gln Ser Ala Ile
65 70 75 80
Ser Val Pro Gly Lys Glu Phe Asp Phe Leu Thr Gly Leu Cys Lys Lys
85 90 95
Tyr Ser Gln Thr Leu Ala Ile Gly Val Asn Glu Lys Val Asp His Gly
100 105 110
Val Gly Asn Gly Thr Ile Tyr Asn Ser Phe Leu Leu Ile Asp Ser Asp
115 120 125
Gly Thr Leu Leu Asn His His Arg Lys Leu Val Pro Thr Phe Thr Glu
130 135 140
Lys Leu Leu Tyr Gly His Gly Asp Gly His Gly Leu Lys Ser Met Asp
145 150 155 160
Thr Ser Val Gly Arg Ile Gly Gly Ser Ile Cys Trp Glu His Trp Met
165 170 175
Pro Leu Cys Arg Gln Ala Leu His Asp Ala Gly Glu Gln Ile His Val
180 185 190
Ala Leu Trp Pro Thr Val His Asp Ile His Gln Val Ala Ser Arg Ser
195 200 205
Tyr Ala Phe Glu Gly Arg Cys Phe Val Leu Ala Ala Gly Gln Ile Phe
210 215 220
Ala Ala Lys Asp Phe Pro Lys Glu Leu Val Leu Pro Asp Tyr Leu Lys
225 230 235 240
Gln Asn Pro Asp Gln Leu Ile Leu Asn Gly Gly Ser Cys Val Ile Gly
245 250 255
Pro Asp Gly Lys Tyr Leu Ile Glu Pro Val Phe Asp Arg Glu Glu Leu
260 265 270
Ile Val Cys Glu Leu Asp Leu Asp Glu Ala Tyr Lys Glu Arg Met Thr
275 280 285
Met Asp Val Ser Gly His Tyr Gln Arg Arg Asp Val Phe Ser Phe Asp
290 295 300
Val Asn Gln His Arg His
305 310
<210>277
<211>1056
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>277
atgccaaccc ccagcgatca tttcaaaatc gccgctgttc aggcctcgcc cgtgtttctg 60
gaccgggagg ccactgtgga aaaggcctgc cggttgatcg ccgaagccgc aaagcagggc 120
gcccgcctca tcgtctttcc ggaatctttc atcccgacct acccggactg ggtatgggcc 180
gttcccccgg gaagggaaag aatcctgaac cagctgtatt ctgaattcct ggccaatgcc 240
gtcgatgttc ccggcgcggc gaccgaacaa cttgcccagg ctgcacgaat ggccggcgcc 300
tatgtgatta tgggcgtcac cgaaagagac acctcggcca gcggggccag cctctacaac 360
accctgctct acttcagccc cgaaggcatc ctaatgggca aacaccggaa gctggttccc 420
acggggggcg aacggctggt ctgggcctac ggagacggca gcacgctgga ggtctacgac 480
actccgctgg gaaagatcgg cgggctgatc tgctgggaga actacatgcc cctggcccgg 540
tacacgatgt acgcctgggg cacccagatt tacatcgccg ccacctggga ccgcggggaa 600
ccgtggctct ccaccctgcg gcatatcgcc aaggaaggaa gggtctacgt catcgggtgc 660
tgcatcgccc tgcgccaggg ggatatcccg gaccggttcg agtacaaggg aaaattttat 720
tccgggtccc gggagtggat caatgagggc gacagcgcca tcgtgaaccc ggacggggaa 780
ttcatcgccg ggccggtgcg gatgaaggag gagatcctgt atgccgagat agacccccgg 840
cagatgcggg gccccaagtg gatgctcgat gtggccggtc attacgcccg gccggatatc 900
ttcgagctca tcgtccaccg gaatccccac ccgatgatca aaatcgccga agacaggggc 960
gcggggatcg cctcaagttt gattcgcccc cgccctaacc ttcccccatc aagggggagg 1020
aaatcggcaa gaagcaaacg caagcccaaa aaatga 1056
<210>278
<211>351
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>278
Met Pro Thr Pro Ser Asp His Phe Lys Ile Ala Ala Val Gln Ala Ser
1 5 10 15
Pro Val Phe Leu Asp Arg Glu Ala Thr Val Glu Lys Ala Cys Arg Leu
20 25 30
Ile Ala Glu Ala Ala Lys Gln Gly Ala Arg Leu Ile Val Phe Pro Glu
35 40 45
Ser Phe Ile Pro Thr Tyr Pro Asp Trp Val Trp Ala Val Pro Pro Gly
50 55 60
Arg Glu Arg Ile Leu Asn Gln Leu Tyr Ser Glu Phe Leu Ala Asn Ala
65 70 75 80
Val Asp Val Pro Gly Ala Ala Thr Glu Gln Leu Ala Gln Ala Ala Arg
85 90 95
Met Ala Gly Ala Tyr Val Ile Met Gly Val Thr Glu Arg Asp Thr Ser
100 105 110
Ala Ser Gly Ala Ser Leu Tyr Asn Thr Leu Leu Tyr Phe Ser Pro Glu
115 120 125
Gly Ile Leu Met Gly Lys His Arg Lys Leu Val Pro Thr Gly Gly Glu
130 135 140
Arg Leu Val Trp Ala Tyr Gly Asp Gly Ser Thr Leu Glu Val Tyr Asp
145 150 155 160
Thr Pro Leu Gly Lys Ile Gly Gly Leu Ile Cys Trp Glu Asn Tyr Met
165 170 175
Pro Leu Ala Arg Tyr Thr Met Tyr Ala Trp Gly Thr Gln Ile Tyr Ile
180 185 190
Ala Ala Thr Trp Asp Arg Gly Glu Pro Trp Leu Ser Thr Leu Arg His
195 200 205
Ile Ala Lys Glu Gly Arg Val Tyr Val Ile Gly Cys Cys Ile Ala Leu
210 215 220
Arg Gln Gly Asp Ile Pro Asp Arg Phe Glu Tyr Lys Gly Lys Phe Tyr
225 230 235 240
Ser Gly Ser Arg Glu Trp Ile Asn Glu Gly Asp Ser Ala Ile Val Asn
245 250 255
Pro Asp Gly Glu Phe Ile Ala Gly Pro Val Arg Met Lys Glu Glu Ile
260 265 270
Leu Tyr Ala Glu Ile Asp Pro Arg Gln Met Arg Gly Pro Lys Trp Met
275 280 285
Leu Asp Val Ala Gly His Tyr Ala Arg Pro Asp Ile Phe Glu Leu Ile
290 295 300
Val His Arg Asn Pro His Pro Met Ile Lys Ile Ala Glu Asp Arg Gly
305 310 315 320
Ala Gly Ile Ala Ser Ser Leu Ile Arg Pro Arg Pro Asn Leu Pro Pro
325 330 335
Ser Arg Gly Arg Lys Ser Ala Arg Ser Lys Arg Lys Pro Lys Lys
340 345 350
<210>279
<211>1014
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>279
atgggcatta cccatcccaa atacaaagtc gctgcagtcc aggccgcgcc ggtctggctg 60
gacctggatg ccacggtgga caagtgcatc cgcctgatcc aggaggccgc tgacaagggc 120
tgcaagctga tcgcgtttcc ggagacgttc attcccggct atccctggca catctggatg 180
ggtgcgccgg cctgggccat cggccggggc tttgtgcagc ggtatttcga caactccctg 240
tcctatgaca gcccgcaggc cgaaaagctg cgccaggccg tcaaggccgc aggcatcacc 300
gcgtcgctgg gactgtcgga gcgctcgggc ggcagtctct acatcgcgca gtggctcatt 360
ggccccgacg gcgaaacgat ctcgcagcgg cgtaagctgc ggcccacgca ttccgaacgc 420
accgtcttcg gcgacggcga tgggagcgac ctcaaggtgc acgacacgcc gctgggccgt 480
gtgggtgagc tggcgtgctg ggagaacatc ctgtcgctga acaagtacgc catgttctcg 540
cagcacgagc aggtgcacat tgcagcctgg cctagcttct ccacctacga gccctttgca 600
catgccctgg gctgggaagt gaacaacgca gtcagcaagg tgtacgcagt ggaaggcggg 660
tgtttcgtgg tggcaccctg cgccatcatt tccaaggaga tggtcgatga actgtgcgac 720
accccggaca agcacaccct gacccatgtg ggaggcggcc acgcggtgat ctacgggccg 780
gatggcgccc ccctggcgga caagctgcct gaagacgccg agggcctgct gatcgcggag 840
atcgacctcg ggatgattgg cgtggccaag aacgccatgg acccggtagg gcactactcg 900
cgccccgacg tgcaccgcct gcttctcaac cgcaagaagg cagcgcaggt ggagcacttt 960
gcgctgccgg tggacgccat cgattccgag ccccaagcca ccacggcgca ctga 1014
<210>280
<211>337
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>280
Met Gly Ile Thr His Pro Lys Tyr Lys Val Ala Ala Val Gln Ala Ala
1 5 10 15
Pro Val Trp Leu Asp Leu Asp Ala Thr Val Asp Lys Cys Ile Arg Leu
20 25 30
Ile Gln Glu Ala Ala Asp Lys Gly Cys Lys Leu Ile Ala Phe Pro Glu
35 40 45
Thr Phe Ile Pro Gly Tyr Pro Trp His Ile Trp Met Gly Ala Pro Ala
50 55 60
Trp Ala Ile Gly Arg Gly Phe Val Gln Arg Tyr Phe Asp Asn Ser Leu
65 70 75 80
Ser Tyr Asp Ser Pro Gln Ala Glu Lys Leu Arg Gln Ala Val Lys Ala
85 90 95
Ala Gly Ile Thr Ala Ser Leu Gly Leu Ser Glu Arg Ser Gly Gly Ser
100 105 110
Leu Tyr Ile Ala Gln Trp Leu Ile Gly Pro Asp Gly Glu Thr Ile Ser
115 120 125
Gln Arg Arg Lys Leu Arg Pro Thr His Ser Glu Arg Thr Val Phe Gly
130 135 140
Asp Gly Asp Gly Ser Asp Leu Lys Val His Asp Thr Pro Leu Gly Arg
145 150 155 160
Val Gly Glu Leu Ala Cys Trp Glu Asn Ile Leu Ser Leu Asn Lys Tyr
165 170 175
Ala Met Phe Ser Gln His Glu Gln Val His Ile Ala Ala Trp Pro Ser
180 185 190
Phe Ser Thr Tyr Glu Pro Phe Ala His Ala Leu Gly Trp Glu Val Asn
195 200 205
Asn Ala Val Ser Lys Val Tyr Ala Val Glu Gly Gly Cys Phe Val Val
210 215 220
Ala Pro Cys Ala Ile Ile Ser Lys Glu Met Val Asp Glu Leu Cys Asp
225 230 235 240
Thr Pro Asp Lys His Thr Leu Thr His Val Gly Gly Gly His Ala Val
245 250 255
Ile Tyr Gly Pro Asp Gly Ala Pro Leu Ala Asp Lys Leu Pro Glu Asp
260 265 270
Ala Glu Gly Leu Leu Ile Ala Glu Ile Asp Leu Gly Met Ile Gly Val
275 280 285
Ala Lys Asn Ala Met Asp Pro Val Gly His Tyr Ser Arg Pro Asp Val
290 295 300
His Arg Leu Leu Leu Asn Arg Lys Lys Ala Ala Gln Val Glu His Phe
305 310 315 320
Ala Leu Pro Val Asp Ala Ile Asp Ser Glu Pro Gln Ala Thr Thr Ala
325 330 335
His
<210>281
<211>936
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>281
atgtccaaga tcgccgtcgt ccaagagcct ccggtgctgc tcgatcgcgc cgccaccctc 60
gagcgcgccg tctcggccat cgagcgggcg gccgacgtgg gggcgcacct cgtcgtgttc 120
cccgagacgt acgtcccggg gtaccccgac tgggtctggc ggacgcgccc cgacgacttc 180
aagctcgcgg gcgcgctgca cgagcgcctc ctcgcgaacg cggtcgacct cgaaaaggac 240
cagctcgcgc cgctccgcga ggccgcgcgc cggcgaggcg tcaccatcgc gtgcggcgtg 300
aacgagcgcg aggggagcca cggccgcgcg accctctaca acaccgtcgt cgtcgtcgga 360
cccgacggcg cgatcctcaa ccgccaccgc aagctcgtcc ccacgaaccc cgagcgcatg 420
gtctggggcc cgggtgacgc gagcgggctg cgcgtcgtcg acacgccggc cggccgcgtg 480
ggggcgctca tctgctggga gaactacatg ccgctcgcgc gcttcgcgct ctacgcgcag 540
ggcgtcgagg tctacctcgc gccgacgtgg gatcacggcg acacttggct cgcctccatg 600
cggcacatcg cgcgcgaggc gcgcgcctgg gtcgtctcgg gggccatctg catgcaggcg 660
aaggacgtcc ccgccgactt cccgcagcgc gcggcgatct accccgacga ggaggagtgg 720
ctcaaccccg gcgatgccgt cgtcgtcgat cccaccggcg ccgtcgccgc cggcccgctg 780
caccgcgagc gcggcatcct ctacgccgag tgcgatcccg cgcgggcgtc gctcgcccgc 840
cgcacgctcg acgtctccgg gcactacgga cggcccgacg tctttcacct gcagatcgac 900
cgcacaccgc gcgtgccggc gtcgttccgg gactga 936
<210>282
<211>311
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>282
Met Ser Lys Ile Ala Val Val Gln Glu Pro Pro Val Leu Leu Asp Arg
1 5 10 15
Ala Ala Thr Leu Glu Arg Ala Val Ser Ala Ile Glu Arg Ala Ala Asp
20 25 30
Val Gly Ala His Leu Val Val Phe Pro Glu Thr Tyr Val Pro Gly Tyr
35 40 45
Pro Asp Trp Val Trp Arg Thr Arg Pro Asp Asp Phe Lys Leu Ala Gly
50 55 60
Ala Leu His Glu Arg Leu Leu Ala Asn Ala Val Asp Leu Glu Lys Asp
65 70 75 80
Gln Leu Ala Pro Leu Arg Glu Ala Ala Arg Arg Arg Gly Val Thr Ile
85 90 95
Ala Cys Gly Val Asn Glu Arg Glu Gly Ser His Gly Arg Ala Thr Leu
100 105 110
Tyr Asn Thr Val Val Val Val Gly Pro Asp Gly Ala Ile Leu Asn Arg
115 120 125
His Arg Lys Leu Val Pro Thr Asn Pro Glu Arg Met Val Trp Gly Pro
130 135 140
Gly Asp Ala Ser Gly Leu Arg Val Val Asp Thr Pro Ala Gly Arg Val
145 150 155 160
Gly Ala Leu Ile Cys Trp Glu Asn Tyr Met Pro Leu Ala Arg Phe Ala
165 170 175
Leu Tyr Ala Gln Gly Val Glu Val Tyr Leu Ala Pro Thr Trp Asp His
180 185 190
Gly Asp Thr Trp Leu Ala Ser Met Arg His Ile Ala Arg Glu Ala Arg
195 200 205
Ala Trp Val Val Ser Gly Ala Ile Cys Met Gln Ala Lys Asp Val Pro
210 215 220
Ala Asp Phe Pro Gln Arg Ala Ala Ile Tyr Pro Asp Glu Glu Glu Trp
225 230 235 240
Leu Asn Pro Gly Asp Ala Val Val Val Asp Pro Thr Gly Ala Val Ala
245 250 255
Ala Gly Pro Leu His Arg Glu Arg Gly Ile Leu Tyr Ala Glu Cys Asp
260 265 270
Pro Ala Arg Ala Ser Leu Ala Arg Arg Thr Leu Asp Val Ser Gly His
275 280 285
Tyr Gly Arg Pro Asp Val Phe His Leu Gln Ile Asp Arg Thr Pro Arg
290 295 300
Val Pro Ala Ser Phe Arg Asp
305 310
<210>283
<211>1017
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>283
atgggcatcg aacatccgaa gtacaaggtc gcggtggtgc aggcggcacc ggcctggctc 60
gatctcgacg cgtcgatcga caagaccatc gggctgatcg aggaggccgc ccaaaaaggc 120
gccaagctga ttgcattccc cgaggccttc atccccggtt acccctggca catctggatg 180
gactcgccgg cctgggcgat cggccgcggt ttcgtgcagc gctattttga caattcgctc 240
gcctatgaca gcccgcaggc cgagaaactg cgcgcggcgg ttcgcaaggc aaagctcacg 300
gccgtgatcg ggctgtccga gcgcgacggc ggcagtcttt acctcgcgca atggctgatc 360
gggcccgacg gtgagaccat cgccaagcgc cgcaagctgc ggccgacaca tgcggagcgc 420
acggtatacg gcgagggcga cggcagcgac ctcgcggttc acaaccgtcc ggacattggc 480
cgccttggcg cgctctgctg ctgggagcat ctccagccgc tgtcgaaata cgcgatgtac 540
gcgcagaacg agcaggtgca tgtcgcggcc tggccgagct tttcgctgta cgatcccttc 600
gcggtggcgc tcggcgccga agtgaacaac gcggcctcgc gcgtctatgc ggtcgaaggc 660
tcctgcttcg tgctggcgcc gtgcgcgacg gtctcgcagg ccatgatcga cgaactctgc 720
gaccggccgg acaagcacgc gctgttgcat gtcggcggcg gttttgccgc gatctacggt 780
cctgacggca gccagatcgg cgacaaactc gctcccgacc aggaagggtt gctgatcgcg 840
gagatcgatc tcggcgccat cggcgtcgcc aagaatgcgg cggatcccgc cgggcattat 900
tcgcggcctg acgtgacgcg actgttgctc aacaagaagc cgtacaagcg cgtcgagcag 960
ttttcgccgc cggccgaggc gctcgagccg acggatatcg cagcagcagc aagctaa 1017
<210>284
<211>338
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>284
Met Gly Ile Glu His Pro Lys Tyr Lys Val Ala Val Val Gln Ala Ala
1 5 10 15
Pro Ala Trp Leu Asp Leu Asp Ala Ser Ile Asp Lys Thr Ile Gly Leu
20 25 30
Ile Glu Glu Ala Ala Gln Lys Gly Ala Lys Leu Ile Ala Phe Pro Glu
35 40 45
Ala Phe Ile Pro Gly Tyr Pro Trp His Ile Trp Met Asp Ser Pro Ala
50 55 60
Trp Ala Ile Gly Arg Gly Phe Val Gln Arg Tyr Phe Asp Asn Ser Leu
65 70 75 80
Ala Tyr Asp Ser Pro Gln Ala Glu Lys Leu Arg Ala Ala Val Arg Lys
85 90 95
Ala Lys Leu Thr Ala Val Ile Gly Leu Ser Glu Arg Asp Gly Gly Ser
100 105 110
Leu Tyr Leu Ala Gln Trp Leu Ile Gly Pro Asp Gly Glu Thr Ile Ala
115 120 125
Lys Arg Arg Lys Leu Arg Pro Thr His Ala Glu Arg Thr Val Tyr Gly
130 135 140
Glu Gly Asp Gly Ser Asp Leu Ala Val His Asn Arg Pro Asp Ile Gly
145 150 155 160
Arg Leu Gly Ala Leu Cys Cys Trp Glu His Leu Gln Pro Leu Ser Lys
165 170 175
Tyr Ala Met Tyr Ala Gln Asn Glu Gln Val His Val Ala Ala Trp Pro
180 185 190
Ser Phe Ser Leu Tyr Asp Pro Phe Ala Val Ala Leu Gly Ala Glu Val
195 200 205
Asn Asn Ala Ala Ser Arg Val Tyr Ala Val Glu Gly Ser Cys Phe Val
210 215 220
Leu Ala Pro Cys Ala Thr Val Ser Gln Ala Met Ile Asp Glu Leu Cys
225 230 235 240
Asp Arg Pro Asp Lys His Ala Leu Leu His Val Gly Gly Gly Phe Ala
245 250 255
Ala Ile Tyr Gly Pro Asp Gly Ser Gln Ile Gly Asp Lys Leu Ala Pro
260 265 270
Asp Gln Glu Gly Leu Leu Ile Ala Glu Ile Asp Leu Gly Ala Ile Gly
275 280 285
Val Ala Lys Asn Ala Ala Asp Pro Ala Gly His Tyr Ser Arg Pro Asp
290 295 300
Val Thr Arg Leu Leu Leu Asn Lys Lys Pro Tyr Lys Arg Val Glu Gln
305 310 315 320
Phe Ser Pro Pro Ala Glu Ala Leu Glu Pro Thr Asp Ile Ala Ala Ala
325 330 335
Ala Ser
<210>285
<211>918
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>285
atgaatacta atctagtaaa ggtcgcggcg gctcaagttg ctccccattt tctcaatttg 60
agcaatacgg tggaaaaaac ctgcaactta atttctgaag caggcaaaaa tggagcaaag 120
ctaatcgtat ttccagaagc cttcatctct ggttatcccg attgggtctg gcttattccc 180
aatgcgaatt ctgcaatgct ggatgattta taccaggaat tggttgagaa cgcagtaacg 240
atccctgata caacaacaca taaactatgt caggctgcaa aagatgcagg ggtatatgtt 300
gcggtaggta tacatgagag aaattcagaa gcaagtggct tcacgctttt caatacgctt 360
ctatacatca acgatcaagg cgtaatcatc ggaaaacacc gaaaattaat ccctacaggg 420
ggcgaacggc tggtctgggg gcagggtaat ggggatacac tttctgcatt cgatacagac 480
ttcggcaaat taggaggatt gctttgttgg gaaaattata tgccactcgc gcgtcaagct 540
atgtattccg ttggaactga agtatatgta gccccaacct gggactccag cgagaattgg 600
ttgttaagca tgcgccatat tgcccgagag ggcggtatgt ttgtaattag tgtttgccag 660
gctctccgaa aagacgacat ccctgatcag tatgaattta agaaactcta tcctgataat 720
tcagaatgga tcaatagcgg taacagttgc atcatcaacc cgcgcggtga gattattgct 780
ggaccaatct caaaccaaca agaaatactt tatgcagatt tagacctgag tttaattgca 840
aaatctaaac gtatgttcga tgttactggg cattattccc ggccggatgt gtttagctat 900
gaaatcaaca aaagctag 918
<210>286
<211>305
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>286
Met Asn Thr Asn Leu Val Lys Val Ala Ala Ala Gln Val Ala Pro His
1 5 10 15
Phe Leu Asn Leu Ser Asn Thr Val Glu Lys Thr Cys Asn Leu Ile Ser
20 25 30
Glu Ala Gly Lys Asn Gly Ala Lys Leu Ile Val Phe Pro Glu Ala Phe
35 40 45
Ile Ser Gly Tyr Pro Asp Trp Val Trp Leu Ile Pro Asn Ala Asn Ser
50 55 60
Ala Met Leu Asp Asp Leu Tyr Gln Glu Leu Val Glu Asn Ala Val Thr
65 70 75 80
Ile Pro Asp Thr Thr Thr His Lys Leu Cys Gln Ala Ala Lys Asp Ala
85 90 95
Gly Val Tyr Val Ala Val Gly Ile His Glu Arg Asn Ser Glu Ala Ser
100 105 110
Gly Phe Thr Leu Phe Asn Thr Leu Leu Tyr Ile Asn Asp Gln Gly Val
115 120 125
Ile Ile Gly Lys His Arg Lys Leu Ile Pro Thr Gly Gly Glu Arg Leu
130 135 140
Val Trp Gly Gln Gly Asn Gly Asp Thr Leu Ser Ala Phe Asp Thr Asp
145 150 155 160
Phe Gly Lys Leu Gly Gly Leu Leu Cys Trp Glu Asn Tyr Met Pro Leu
165 170 175
Ala Arg Gln Ala Met Tyr Ser Val Gly Thr Glu Val Tyr Val Ala Pro
180 185 190
Thr Trp Asp Ser Ser Glu Asn Trp Leu Leu Ser Met Arg His Ile Ala
195 200 205
Arg Glu Gly Gly Met Phe Val Ile Ser Val Cys Gln Ala Leu Arg Lys
210 215 220
Asp Asp Ile Pro Asp Gln Tyr Glu Phe Lys Lys Leu Tyr Pro Asp Asn
225 230 235 240
Ser Glu Trp Ile Asn Ser Gly Asn Ser Cys Ile Ile Asn Pro Arg Gly
245 250 255
Glu Ile Ile Ala Gly Pro Ile Ser Asn Gln Gln Glu Ile Leu Tyr Ala
260 265 270
Asp Leu Asp Leu Ser Leu Ile Ala Lys Ser Lys Arg Met Phe Asp Val
275 280 285
Thr Gly His Tyr Ser Arg Pro Asp Val Phe Ser Tyr Glu Ile Asn Lys
290 295 300
Ser
305
<210>287
<211>936
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>287
gtgatcaagg tagcaatcgc ccaggtggca ccggtggttc tggacaaggc gcgcaccatt 60
gagaaagcgg taggaattat tcgcgctgcc gcgcaagagg gcattgagct cctggttttc 120
ccggagacgt ttatcccgac ctatccagcc tgggtatggc gcttgcgtcc gggtactgat 180
tacggcctga gcgaggaact gcacgcgctc ctgctggata attcggtaga tatggagagc 240
aaggacctgg agccattgca agctgttgct gcagagacca gcatgaccgt ggtaataggt 300
atgaacgagc gagacggccg attcagccgg ggtacaatct acaatgccct ggttgtgatc 360
ggtccaggtg gcacgatcct gaacaggcac cgcaagctta tgcccaccaa ccccgagcgt 420
atggtttggg gtatgggcga tgccagcggg ctgaaggtag tggaaatgtc ttacgggcgc 480
ctgggtgggc tgatttgctg ggagaatttc atgcctctcg cgcgctatgg cttgtatgcc 540
cagggtgtgg agatttacgt ggcgcccacc tatgaccagg gcgacggctg ggtcggcagc 600
atgcagcata tagcccggga gggtcgttgc tgggtactct cggccgggac ccttttgcgt 660
ggcagtgatt ttctgccgga ttttccgggc aagaccgagt tatatcccga tgaccaggag 720
tgggtgaatc cgggtggctc ggtgatcgtg gcaccggggg gagagattgt ggccggcccc 780
atgtatcgcg acgaaggtct gctggtctgc gagttggatg cgacgcttag tgtccgcggc 840
aagcgctcgc tggatgtggc cggccattac tcccggccgg atttgtttga actggaaata 900
gatggcgacc cgctggaacc catagagtgg gattga 936
<210>288
<211>311
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>288
Val Ile Lys Val Ala Ile Ala Gln Val Ala Pro Val Val Leu Asp Lys
1 5 10 15
Ala Arg Thr Ile Glu Lys Ala Val Gly Ile Ile Arg Ala Ala Ala Gln
20 25 30
Glu Gly Ile Glu Leu Leu Val Phe Pro Glu Thr Phe Ile Pro Thr Tyr
35 40 45
Pro Ala Trp Val Trp Arg Leu Arg Pro Gly Thr Asp Tyr Gly Leu Ser
50 55 60
Glu Glu Leu His Ala Leu Leu Leu Asp Asn Ser Val Asp Met Glu Ser
65 70 75 80
Lys Asp Leu Glu Pro Leu Gln Ala Val Ala Ala Glu Thr Ser Met Thr
85 90 95
Val Val Ile Gly Met Asn Glu Arg Asp Gly Arg Phe Ser Arg Gly Thr
100 105 110
Ile Tyr Asn Ala Leu Val Val Ile Gly Pro Gly Gly Thr Ile Leu Asn
115 120 125
Arg His Arg Lys Leu Met Pro Thr Asn Pro Glu Arg Met Val Trp Gly
130 135 140
Met Gly Asp Ala Ser Gly Leu Lys Val Val Glu Met Ser Tyr Gly Arg
145 150 155 160
Leu Gly Gly Leu Ile Cys Trp Glu Asn Phe Met Pro Leu Ala Arg Tyr
165 170 175
Gly Leu Tyr Ala Gln Gly Val Glu Ile Tyr Val Ala Pro Thr Tyr Asp
180 185 190
Gln Gly Asp Gly Trp Val Gly Ser Met Gln His Ile Ala Arg Glu Gly
195 200 205
Arg Cys Trp Val Leu Ser Ala Gly Thr Leu Leu Arg Gly Ser Asp Phe
210 215 220
Leu Pro Asp Phe Pro Gly Lys Thr Glu Leu Tyr Pro Asp Asp Gln Glu
225 230 235 240
Trp Val Asn Pro Gly Gly Ser Val Ile Val Ala Pro Gly Gly Glu Ile
245 250 255
Val Ala Gly Pro Met Tyr Arg Asp Glu Gly Leu Leu Val Cys Glu Leu
260 265 270
Asp Ala Thr Leu Ser Val Arg Gly Lys Arg Ser Leu Asp Val Ala Gly
275 280 285
His Tyr Ser Arg Pro Asp Leu Phe Glu Leu Glu Ile Asp Gly Asp Pro
290 295 300
Leu Glu Pro Ile Glu Trp Asp
305 310
<210>289
<211>921
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>289
atggtgacgg tggccgccgt acaggcaacg ccggtgttcc tcgaccgcga ggcgacctcg 60
gacaaggtct gcgccttggt caaggaggcg gccggccacg gggcagaact gatcgtcttc 120
cccgagtcct tcgtccctgc ctatccggac tgggtgtggc gcacccctgc ctggagtgac 180
accgagttcg tgaagcgctt ctacgcgaac gcggtgaccg tccccggcgc gaccctcgag 240
cgcatcggcg cagcggcggc ggaggcggag gcgtacgtcg tgatcggcgt gaccgagatc 300
gacggcggaa ctctctacaa cacccttctc tacctgggcc cggacggaca gctgttgcaa 360
cggcatcgca agctcatgcc caccggtggg gagcggaccg tgtggggaat gggagacggc 420
tctgagctcg acgtcgtgag cacgccgttc ggcgtcgtcg gtgggttgtt gtgctgggag 480
aactacatgc cgctcgcccg ggcggcgatc tacgcccagc actgtgacat ctacctggct 540
ccgacatggg acaacagcga cacgtgggta gccacgttgc gtcacatcgc caaggagggg 600
cggcagttcg tcatcggcgt cgccccgctg ctgcgcggct ccgacgtacc ggaggacctc 660
cgcggcacgc tctacgggct gtcggacgac tggatgtcgc gcggctacac caccatcgtc 720
gcaccaagcg gcgaggtgat cgccggcccg gtcctggagc gtgaggagat cctcttcgcg 780
gacctcgacc tggccgacgt gcaggagcag agaaggatgt tcgaccctgt cggccactac 840
tcacgacccg acgtcttcac gctccacgtc gacgcacgac cgaagagccc ggtcgtcttc 900
gagagggatg caccgacctg a 921
<210>290
<211>306
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>290
Met Val Thr Val Ala Ala Val Gln Ala Thr Pro Val Phe Leu Asp Arg
1 5 10 15
Glu Ala Thr Ser Asp Lys Val Cys Ala Leu Val Lys Glu Ala Ala Gly
20 25 30
His Gly Ala Glu Leu Ile Val Phe Pro Glu Ser Phe Val Pro Ala Tyr
35 40 45
Pro Asp Trp Val Trp Arg Thr Pro Ala Trp Ser Asp Thr Glu Phe Val
50 55 60
Lys Arg Phe Tyr Ala Asn Ala Val Thr Val Pro Gly Ala Thr Leu Glu
65 70 75 80
Arg Ile Gly Ala Ala Ala Ala Glu Ala Glu Ala Tyr Val Val Ile Gly
85 90 95
Val Thr Glu Ile Asp Gly Gly Thr Leu Tyr Asn Thr Leu Leu Tyr Leu
100 105 110
Gly Pro Asp Gly Gln Leu Leu Gln Arg His Arg Lys Leu Met Pro Thr
115 120 125
Gly Gly Glu Arg Thr Val Trp Gly Met Gly Asp Gly Ser Glu Leu Asp
130 135 140
Val Val Ser Thr Pro Phe Gly Val Val Gly Gly Leu Leu Cys Trp Glu
145 150 155 160
Asn Tyr Met Pro Leu Ala Arg Ala Ala Ile Tyr Ala Gln His Cys Asp
165 170 175
Ile Tyr Leu Ala Pro Thr Trp Asp Asn Ser Asp Thr Trp Val Ala Thr
180 185 190
Leu Arg His Ile Ala Lys Glu Gly Arg Gln Phe Val Ile Gly Val Ala
195 200 205
Pro Leu Leu Arg Gly Ser Asp Val Pro Glu Asp Leu Arg Gly Thr Leu
210 215 220
Tyr Gly Leu Ser Asp Asp Trp Met Ser Arg Gly Tyr Thr Thr Ile Val
225 230 235 240
Ala Pro Ser Gly Glu Val Ile Ala Gly Pro Val Leu Glu Arg Glu Glu
245 250 255
Ile Leu Phe Ala Asp Leu Asp Leu Ala Asp Val Gln Glu Gln Arg Arg
260 265 270
Met Phe Asp Pro Val Gly His Tyr Ser Arg Pro Asp Val Phe Thr Leu
275 280 285
His Val Asp Ala Arg Pro Lys Ser Pro Val Val Phe Glu Arg Asp Ala
290 295 300
Pro Thr
305
<210>291
<211>1002
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>291
atgatgaaaa caactgttac cgttgcctgc gttcaggccg cccccgtatt tatggattta 60
gaaggcacca tagataaaac gatcaccctc atctctgaag ccgcacagaa aggcgcggag 120
ctcatcgctt ttccggagac ctggataccc ggttacccgt ggttcttatg gctgaactcg 180
cccgcgacaa atatgcccct ggtttatcaa tatcatcaga actctctggt gctggacagt 240
gcccaggcga agcgaattgc ggatgctgca cagcagaata acatcactgt cgttctggga 300
ttcagcgagc gcgatcatgg aagcctctat atctcacagt ggctgattgg cagcgacggg 360
gaaactattg gcatccggcg caagctcaag gccacacacg tggagcgtac gctgttcggc 420
gaaagcgacg gctcctccct gaccacctgg gagacacctc tgggtaacgt cggggccctc 480
tgctgctggg agcacctgca gccgctgtct cgctatgcga tgtattccca gcatgaagag 540
atccatatcg ctgcgtggcc cagcttcagt ctttacacca gcgcaactgc cgcgctcgga 600
cctgacgtca acacggcggc ttcacgcctc tatgccgcgg aggggcagtg cttcgtgtta 660
gccccatgtg ccgtggtttc tgatgagatg attgatttac tctgtcctga tgatgaccgg 720
agagcgttac tcagtgccgg agggggacat gcccgtattt acggacctga tggaagagaa 780
ctcgtcaccc ctctcgggga aaatgaggaa ggactgctta tcgctgagct cgactctgct 840
gcgatcacct ttgccaaact ggcggcagat cccgtaggcc actattcccg ccctgacgtg 900
acccgccttc tttttaatcc ttcagccaac aagactgtta ttaaacggca ttcgcctcct 960
gagttaattg ccgaacaggc tccggaagaa gaggaggagt ag 1002
<210>292
<211>333
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>292
Met Met Lys Thr Thr Val Thr Val Ala Cys Val Gln Ala Ala Pro Val
1 5 10 15
Phe Met Asp Leu Glu Gly Thr Ile Asp Lys Thr Ile Thr Leu Ile Ser
20 25 30
Glu Ala Ala Gln Lys Gly Ala Glu Leu Ile Ala Phe Pro Glu Thr Trp
35 40 45
Ile Pro Gly Tyr Pro Trp Phe Leu Trp Leu Asn Ser Pro Ala Thr Asn
50 55 60
Met Pro Leu Val Tyr Gln Tyr His Gln Asn Ser Leu Val Leu Asp Ser
65 70 75 80
Ala Gln Ala Lys Arg Ile Ala Asp Ala Ala Gln Gln Asn Asn Ile Thr
85 90 95
Val Val Leu Gly Phe Ser Glu Arg Asp His Gly Ser Leu Tyr Ile Ser
100 105 110
Gln Trp Leu Ile Gly Ser Asp Gly Glu Thr Ile Gly Ile Arg Arg Lys
115 120 125
Leu Lys Ala Thr His Val Glu Arg Thr Leu Phe Gly Glu Ser Asp Gly
130 135 140
Ser Ser Leu Thr Thr Trp Glu Thr Pro Leu Gly Asn Val Gly Ala Leu
145 150 155 160
Cys Cys Trp Glu His Leu Gln Pro Leu Ser Arg Tyr Ala Met Tyr Ser
165 170 175
Gln His Glu Glu Ile His Ile Ala Ala Trp Pro Ser Phe Ser Leu Tyr
180 185 190
Thr Ser Ala Thr Ala Ala Leu Gly Pro Asp Val Asn Thr Ala Ala Ser
195 200 205
Arg Leu Tyr Ala Ala Glu Gly Gln Cys Phe Val Leu Ala Pro Cys Ala
210 215 220
Val Val Ser Asp Glu Met Ile Asp Leu Leu Cys Pro Asp Asp Asp Arg
225 230 235 240
Arg Ala Leu Leu Ser Ala Gly Gly Gly His Ala Arg Ile Tyr Gly Pro
245 250 255
Asp Gly Arg Glu Leu Val Thr Pro Leu Gly Glu Asn Glu Glu Gly Leu
260 265 270
Leu Ile Ala Glu Leu Asp Ser Ala Ala Ile Thr Phe Ala Lys Leu Ala
275 280 285
Ala Asp Pro Val Gly His Tyr Ser Arg Pro Asp Val Thr Arg Leu Leu
290 295 300
Phe Asn Pro Ser Ala Asn Lys Thr Val Ile Lys Arg His Ser Pro Pro
305 310 315 320
Glu Leu Ile Ala Glu Gln Ala Pro Glu Glu Glu Glu Glu
325 330
<210>293
<211>1008
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>293
atgaaaaata tcaaaaactc agaaaaaagc agcacagtaa gagtcgctgc ggtacaaatc 60
agtccggtgt tgtacaaccg cgaagctacc gttcaaaaag tagtcaacaa aatccttgaa 120
ctaggaaaac aaggggtaca attcgccact tttccggaaa cgatagtgcc ttattatcct 180
tatttctctt ttattcaggc gccttatgcc atgggcaaag aacacctgcg cttgcttgaa 240
caatcagtta ctgttccgtc agccgcgacc gatgccataa gtgaggcggc aaaggaagcc 300
aatatggtag tgtctattgg tgtcaatgaa cgagacggtg gtaccattta caatacgcaa 360
ctcctttttg atgctgacgg aacattaatt cagcgcagac gtaaacttac accaacgtat 420
catgaaagaa tgatttgggg acaaggtgac gcttcaggtc ttcgtgccac agacagcgct 480
gttgggcgta tcgggcagtt ggcttgttgg gaacattaca atccattgtt ccgttatgct 540
ttgattgctg atggagaaca aatccattct gccatgtatc ccggatcatt tttaggtgcg 600
ttgcacggtg aacaaaccga aatcaatgta cgccaacacg ctttagaatc ggccagcttc 660
gtcgtagtgg ctaccggttg gttggatgcc gatcaacaag cacaaattgc gaaagacacc 720
ggtggaccaa tcggaccaat ttcgggaggt tgttttacag ccgttatagg ccctgacgga 780
caactaatcg gggaagccct tacatcaggt gaaggggaag tgattgccga tattgatttg 840
gcacaaattg atgcccgcaa aagattaatg gatgccagtg gtcactacaa ccgtcctgaa 900
ttgttgagct tgcatatcga tcacactccg actgctccta tgcatgaaag agtagtttac 960
actgagccgg gattagcaaa aagacaaaat gaaaattcat caaattaa 1008
<210>294
<211>335
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>294
Met Lys Asn Ile Lys Asn Ser Glu Lys Ser Ser Thr Val Arg Val Ala
1 5 10 15
Ala Val Gln Ile Ser Pro Val Leu Tyr Asn Arg Glu Ala Thr Val Gln
20 25 30
Lys Val Val Asn Lys Ile Leu Glu Leu Gly Lys Gln Gly Val Gln Phe
35 40 45
Ala Thr Phe Pro Glu Thr Ile Val Pro Tyr Tyr Pro Tyr Phe Ser Phe
50 55 60
Ile Gln Ala Pro Tyr Ala Met Gly Lys Glu His Leu Arg Leu Leu Glu
65 70 75 80
Gln Ser Val Thr Val Pro Ser Ala Ala Thr Asp Ala Ile Ser Glu Ala
85 90 95
Ala Lys Glu Ala Asn Met Val Val Ser Ile Gly Val Asn Glu Arg Asp
100 105 110
Gly Gly Thr Ile Tyr Asn Thr Gln Leu Leu Phe Asp Ala Asp Gly Thr
115 120 125
Leu Ile Gln Arg Arg Arg Lys Leu Thr Pro Thr Tyr His Glu Arg Met
130 135 140
Ile Trp Gly Gln Gly Asp Ala Ser Gly Leu Arg Ala Thr Asp Ser Ala
145 150 155 160
Val Gly Arg Ile Gly Gln Leu Ala Cys Trp Glu His Tyr Asn Pro Leu
165 170 175
Phe Arg Tyr Ala Leu Ile Ala Asp Gly Glu Gln Ile His Ser Ala Met
180 185 190
Tyr Pro Gly Ser Phe Leu Gly Ala Leu His Gly Glu Gln Thr Glu Ile
195 200 205
Asn Val Arg Gln His Ala Leu Glu Ser Ala Ser Phe Val Val Val Ala
210 215 220
Thr Gly Trp Leu Asp Ala Asp Gln Gln Ala Gln Ile Ala Lys Asp Thr
225 230 235 240
Gly Gly Pro Ile Gly Pro Ile Ser Gly Gly Cys Phe Thr Ala Val Ile
245 250 255
Gly Pro Asp Gly Gln Leu Ile Gly Glu Ala Leu Thr Ser Gly Glu Gly
260 265 270
Glu Val Ile Ala Asp Ile Asp Leu Ala Gln Ile Asp Ala Arg Lys Arg
275 280 285
Leu Met Asp Ala Ser Gly His Tyr Asn Arg Pro Glu Leu Leu Ser Leu
290 295 300
His Ile Asp His Thr Pro Thr Ala Pro Met His Glu Arg Val Val Tyr
305 310 315 320
Thr Glu Pro Gly Leu Ala Lys Arg Gln Asn Glu Asn Ser Ser Asn
325 330 335
<210>295
<211>1134
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>295
atggccgcaa aagtacttgg aggacgtgac acagtaaaag tagcagttgt tcaaacacca 60
tcagtcttta tggacaagaa agcctgtctc gaacttgcct gcgataagat tatcgaagcc 120
ggcaaagagg gcgcggagct tgttgttttt cctgaaacct ggattccgac atatccttat 180
tggaccatgg gatgggatac catcgcccat ggcttccatg atgtgatggc ggacctgcag 240
gacaattccg tggtcgtcgg gagcgaagac accgacatat tgggcaaagc cgcccgggaa 300
gctggcgcct acgtcgtcat gggctgtaat gagctcgatg accgggtcgg cagcaggacc 360
ttgttcaact cgctggtcta tatggacaaa tatggcggcg tgctcggccg tcaccgtaaa 420
ttaatcccgt cctttatcga acgcatctgg tggggcaatg gggacagccg cgatctcaaa 480
gtttacgaca cggaaattgg gcgcatcggc ggtcaaatct gctgggaaaa tcacattgtg 540
aacatcaccg catggtacat tgcccaaggc gtggatattc atgtctcggt ttggccggga 600
atgtggaact gtggcgcaga agaaggagaa tccttcatat acgccggtca cgacatcaac 660
aaatgtgacc tcatccctgc tacacgcgga cgggcattta cgggtcagtg ttatgtcctc 720
tcagccaaca acctactgcg gatggaagac attcctgacg atttcccgtt ccgggatacc 780
atgaactatg gcggtccggg ccaggaggat tttgtcggat gggcttgcgg tggcagccat 840
attgttgcgc caacgtctga atttatggtg ccgccgacgt ttgatataga caccatcatc 900
tatgcagaac ttcaggcgaa atacatcaaa gtggtgaagt cggtcttcga ttccctcggc 960
cactacgcgc ggtgggacct cgtcaacttg accacgccgc caccgccgta tgaacctgaa 1020
accgacgcac cagctctcac ggccgatatc cgtgatcggg tcatcgagag tgtggctaaa 1080
gagttcaagc tcgaaccaga aaaagtggct gaagttgtgc gcaatgccgc ctag 1134
<210>296
<211>377
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>296
Met Ala Ala Lys Val Leu Gly Gly Arg Asp Thr Val Lys Val Ala Val
1 5 10 15
Val Gln Thr Pro Ser Val Phe Met Asp Lys Lys Ala Cys Leu Glu Leu
20 25 30
Ala Cys Asp Lys Ile Ile Glu Ala Gly Lys Glu Gly Ala Glu Leu Val
35 40 45
Val Phe Pro Glu Thr Trp Ile Pro Thr Tyr Pro Tyr Trp Thr Met Gly
50 55 60
Trp Asp Thr Ile Ala His Gly Phe His Asp Val Met Ala Asp Leu Gln
65 70 75 80
Asp Asn Ser Val Val Val Gly Ser Glu Asp Thr Asp Ile Leu Gly Lys
85 90 95
Ala Ala Arg Glu Ala Gly Ala Tyr Val Val Met Gly Cys Asn Glu Leu
100 105 110
Asp Asp Arg Val Gly Ser Arg Thr Leu Phe Asn Ser Leu Val Tyr Met
115 120 125
Asp Lys Tyr Gly Gly Val Leu Gly Arg His Arg Lys Leu Ile Pro Ser
130 135 140
Phe Ile Glu Arg Ile Trp Trp Gly Asn Gly Asp Ser Arg Asp Leu Lys
145 150 155 160
Val Tyr Asp Thr Glu Ile Gly Arg Ile Gly Gly Gln Ile Cys Trp Glu
165 170 175
Asn His Ile Val Asn Ile Thr Ala Trp Tyr Ile Ala Gln Gly Val Asp
180 185 190
Ile His Val Ser Val Trp Pro Gly Met Trp Asn Cys Gly Ala Glu Glu
195 200 205
Gly Glu Ser Phe Ile Tyr Ala Gly His Asp Ile Asn Lys Cys Asp Leu
210 215 220
Ile Pro Ala Thr Arg Gly Arg Ala Phe Thr Gly Gln Cys Tyr Val Leu
225 230 235 240
Ser Ala Asn Asn Leu Leu Arg Met Glu Asp Ile Pro Asp Asp Phe Pro
245 250 255
Phe Arg Asp Thr Met Asn Tyr Gly Gly Pro Gly Gln Glu Asp Phe Val
260 265 270
Gly Trp Ala Cys Gly Gly Ser His Ile Val Ala Pro Thr Ser Glu Phe
275 280 285
Met Val Pro Pro Thr Phe Asp Ile Asp Thr Ile Ile Tyr Ala Glu Leu
290 295 300
Gln Ala Lys Tyr Ile Lys Val Val Lys Ser Val Phe Asp Ser Leu Gly
305 310 315 320
His Tyr Ala Arg Trp Asp Leu Val Asn Leu Thr Thr Pro Pro Pro Pro
325 330 335
Tyr Glu Pro Glu Thr Asp Ala Pro Ala Leu Thr Ala Asp Ile Arg Asp
340 345 350
Arg Val Ile Glu Ser Val Ala Lys Glu Phe Lys Leu Glu Pro Glu Lys
355 360 365
Val Ala Glu Val Val Arg Asn Ala Ala
370 375
<210>297
<211>1059
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>297
atgacacgtt ttcgggacgt cacggtggcg gcggttcagg ccgcacccgt ctatttcgat 60
cgggaggcct ccacagataa ggcgtgccaa ttgattcacg aagcggcgaa gaaaggcgca 120
gccctcgcgg cgttcggcga aacgtggttg ccgggatatc cgttctttgc atgggggttc 180
gcgcacaacc ggagcctgtt ctggaatgcg gccgccgagt acatcgccaa tgctgtggag 240
attccgagtc caacgacgga ccgcctctgc gccgcagcga agatcgccgg gattgacgtg 300
gtaatcggcg tcgtagaact ggatggacga acgcgagcgt cggtttacag cacactgctg 360
ttcatcggga gagagggggc gatcctgggg cgccaccgca aattgaagcc aacccacatg 420
gagcgaacgg tgtggggtga aggggacgct cacgggctcc gcgttcacga gcgtccgtac 480
ggccgcctca gcgggctgaa ttgctgggaa cacaacatga tgctgcccgg ctatgtgctt 540
gccgcgcagg gcacgcagtt tcacgtcgcg acatggcctg ggaaagagag gctcacagtt 600
ccgccgaacg aggcggctta tacgcgccag cttctcctct ctcgcgccta tgcatcccag 660
gcgggcgcgt acgtgatcag cgtcgcgggg ctgctcggac ccgactcgat gccggagcgt 720
tatcgcgaac tgggacagtc ctatgagttg accggcgaca gcgtcatcat cgatccgcgc 780
ggcgaggtca tcgcggggcc cgcgaagggc gagaccatcc tgctcgcgca atgcagccag 840
gaagccctct tcgccgcaaa gtccgccatt gacgtcggcg gtcattactc gcgcccggat 900
atttttcagc tgcgtgtcaa cgatcagcta cagcatcagg tccggagact cgaggcgact 960
ctcacgcccc cagtcgccgt agttgtcggg cctgagggta gctcacatga gcaggagacg 1020
gccttcgggc catccagcct cctggccacg acaagctag 1059
<210>298
<211>352
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>298
Met Thr Arg Phe Arg Asp Val Thr Val Ala Ala Val Gln Ala Ala Pro
1 5 10 15
Val Tyr Phe Asp Arg Glu Ala Ser Thr Asp Lys Ala Cys Gln Leu Ile
20 25 30
His Glu Ala Ala Lys Lys Gly Ala Ala Leu Ala Ala Phe Gly Glu Thr
35 40 45
Trp Leu Pro Gly Tyr Pro Phe Phe Ala Trp Gly Phe Ala His Asn Arg
50 55 60
Ser Leu Phe Trp Asn Ala Ala Ala Glu Tyr Ile Ala Asn Ala Val Glu
65 70 75 80
Ile Pro Ser Pro Thr Thr Asp Arg Leu Cys Ala Ala Ala Lys Ile Ala
85 90 95
Gly Ile Asp Val Val Ile Gly Val Val Glu Leu Asp Gly Arg Thr Arg
100 105 110
Ala Ser Val Tyr Ser Thr Leu Leu Phe Ile Gly Arg Glu Gly Ala Ile
115 120 125
Leu Gly Arg His Arg Lys Leu Lys Pro Thr His Met Glu Arg Thr Val
130 135 140
Trp Gly Glu Gly Asp Ala His Gly Leu Arg Val His Glu Arg Pro Tyr
145 150 155 160
Gly Arg Leu Ser Gly Leu Asn Cys Trp Glu His Asn Met Met Leu Pro
165 170 175
Gly Tyr Val Leu Ala Ala Gln Gly Thr Gln Phe His Val Ala Thr Trp
180 185 190
Pro Gly Lys Glu Arg Leu Thr Val Pro Pro Asn Glu Ala Ala Tyr Thr
195 200 205
Arg Gln Leu Leu Leu Ser Arg Ala Tyr Ala Ser Gln Ala Gly Ala Tyr
210 215 220
Val Ile Ser Val Ala Gly Leu Leu Gly Pro Asp Ser Met Pro Glu Arg
225 230 235 240
Tyr Arg Glu Leu Gly Gln Ser Tyr Glu Leu Thr Gly Asp Ser Val Ile
245 250 255
Ile Asp Pro Arg Gly Glu Val Ile Ala Gly Pro Ala Lys Gly Glu Thr
260 265 270
Ile Leu Leu Ala Gln Cys Ser Gln Glu Ala Leu Phe Ala Ala Lys Ser
275 280 285
Ala Ile Asp Val Gly Gly His Tyr Ser Arg Pro Asp Ile Phe Gln Leu
290 295 300
Arg Val Asn Asp Gln Leu Gln His Gln Val Arg Arg Leu Glu Ala Thr
305 310 315 320
Leu Thr Pro Pro Val Ala Val Val Val Gly Pro Glu Gly Ser Ser His
325 330 335
Glu Gln Glu Thr Ala Phe Gly Pro Ser Ser Leu Leu Ala Thr Thr Ser
340 345 350
<210>299
<211>987
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>299
atgactgttg ttaaggccgc cgcagtgcag atcagcccgg tgctctacag ccgtgcagga 60
accgtcgaga aggtcgtgaa gaagatcgac gagctgggcc agaagggtgt cgagtttgcc 120
gtcttccctg aaaccgttgt cccctactac ccctacttct ccttcgtgca gcccccctac 180
aaactggcca cggagcacct gcgcctgctt gaggagtcgg tgaccgtgcc ctctgccgag 240
acggacgcca tcggcgacgc cgcccgcaag gccaacatgg tcgtctcgat cggtgtcaac 300
gaacgtgatg gcggcaccat ttacaacacc caactcctgt tcgacgccga cggaaccctg 360
atccagcgcc gccgcaagat cacgccgacc taccacgagc ggatgatctg gggacaggga 420
gacggatcag gcttgcgcgc ggtcgacagc gtcgtcggcc gcatcggcca gctcgcctgc 480
tgggagcact accagccgct ggcccgttac gctctcatcg ccgacggcga gcagatccac 540
gccgcgatgt accccggcgc cttcgggggc gatctgttcg cagagcagat cgaagtcaat 600
gtccgtcagc acgctctgga atcggccagt ttcgtcgtca gcgccaccgc ctggctcgac 660
gccgaccagc aggcccagat tgcgaaggac accggcggcc ccgtacaggc gatctccggc 720
ggcttcttca cagccatcat cgaccccgac ggccgcatca tcggcgaacc gatcacctcc 780
ggcgaaggcg aagtcatcgc tgacctcgac tttgcgctca tcgaccgccg caagcgcctg 840
atggacgcca gcggccacta cagccgcccc gaactgctca gcctgcagat cgaccggacg 900
ccggcacccg ccgtccacga tcgcaatcgc caggggtcct caagcgctcc ggcaactgaa 960
aagggccgct cagccgaggc caagtga 987
<210>300
<211>328
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>300
Met Thr Val Val Lys Ala Ala Ala Val Gln Ile Ser Pro Val Leu Tyr
1 5 10 15
Ser Arg Ala Gly Thr Val Glu Lys Val Val Lys Lys Ile Asp Glu Leu
20 25 30
Gly Gln Lys Gly Val Glu Phe Ala Val Phe Pro Glu Thr Val Val Pro
35 40 45
Tyr Tyr Pro Tyr Phe Ser Phe Val Gln Pro Pro Tyr Lys Leu Ala Thr
50 55 60
Glu His Leu Arg Leu Leu Glu Glu Ser Val Thr Val Pro Ser Ala Glu
65 70 75 80
Thr Asp Ala Ile Gly Asp Ala Ala Arg Lys Ala Asn Met Val Val Ser
85 90 95
Ile Gly Val Asn Glu Arg Asp Gly Gly Thr Ile Tyr Asn Thr Gln Leu
100 105 110
Leu Phe Asp Ala Asp Gly Thr Leu Ile Gln Arg Arg Arg Lys Ile Thr
115 120 125
Pro Thr Tyr His Glu Arg Met Ile Trp Gly Gln Gly Asp Gly Ser Gly
130 135 140
Leu Arg Ala Val Asp Ser Val Val Gly Arg Ile Gly Gln Leu Ala Cys
145 150 155 160
Trp Glu His Tyr Gln Pro Leu Ala Arg Tyr Ala Leu Ile Ala Asp Gly
165 170 175
Glu Gln Ile His Ala Ala Met Tyr Pro Gly Ala Phe Gly Gly Asp Leu
180 185 190
Phe Ala Glu Gln Ile Glu Val Asn Val Arg Gln His Ala Leu Glu Ser
195 200 205
Ala Ser Phe Val Val Ser Ala Thr Ala Trp Leu Asp Ala Asp Gln Gln
210 215 220
Ala Gln Ile Ala Lys Asp Thr Gly Gly Pro Val Gln Ala Ile Ser Gly
225 230 235 240
Gly Phe Phe Thr Ala Ile Ile Asp Pro Asp Gly Arg Ile Ile Gly Glu
245 250 255
Pro Ile Thr Ser Gly Glu Gly Glu Val Ile Ala Asp Leu Asp Phe Ala
260 265 270
Leu Ile Asp Arg Arg Lys Arg Leu Met Asp Ala Ser Gly His Tyr Ser
275 280 285
Arg Pro Glu Leu Leu Ser Leu Gln Ile Asp Arg Thr Pro Ala Pro Ala
290 295 300
Val His Asp Arg Asn Arg Gln Gly Ser Ser Ser Ala Pro Ala Thr Glu
305 310 315 320
Lys Gly Arg Ser Ala Glu Ala Lys
325
<210>301
<211>1032
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>301
atgaccacgc ccgcgccgtt caccgccgcc gtcgtccagg cgtcgcccgt gttcctcgac 60
cgcgatgcca cggtcgataa ggcgtgcgcc ctgatcaccg cggccggagc gcgcggcgcg 120
aagctcgtcg tgctgcccga gaccttcgtg cccgcatacc ccgcgtgggt ctggtacctg 180
ccgctcacgc gtcggcccga cgtcgcgaag ctctaccgcg cgctcgtcga gaacgcgatc 240
gacgtgccgg gtccggaaac cgagcgcctc gggcgcgcgg cgcgtgacgc gggcgcgtgg 300
gtggcgatcg gtgtcaacga gcgcaacgcg aaggcgagcc gcacgtcgct ctacaacacc 360
gtgctcctgt tcgacgacca gggcacgctg gtcgagtcgc ggcgcaagct gatgccgacc 420
ggcggcgagc gcctggtgtg gacgccgggc gagccggtgc cgttgcgggt tcacgacacg 480
ccgctcggcc gcgtcggcgc gctgatctgc tgggagaact acatgccgct cgcgcgcttc 540
gcgctctacg agcagggcgt ccagatctat ctcgcgccga cctgggacta cagcgaggcg 600
tggctcgcgt ccatgcggca cgtcgcgcgt gaaggtcgca cgtgggtgat cgggtgcagc 660
caggcggtgc ggcgtgacga gatcccggac cacctgccgt tcaaggacgc gatccccgag 720
agcctcgagt ggatcaacgc cgggaacagc atcgtcgtcg atccggacgg cgcggtggtc 780
gcgggaccgc tggcgcgggc gcacgacacg ctctacgtcg agatcgatcc gggccgcgcc 840
gccggctccc gatggatctt cgacgccgcc ggtcactatc accgtcccga cctgttccat 900
ttcgcgatgc gcgcgccggt ggcgccagag accgcgccgg ttgtgcggaa gcgcggccgg 960
cggctagcct cggccccggc gcggaagcca cggaaacggc cggcgcgtcc cacccgaagg 1020
aggactcgat ga 1032
<210>302
<211>335
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>302
Met Thr Thr Pro Ala Pro Phe Thr Ala Ala Val Val Gln Ala Ser Pro
1 5 10 15
Val Phe Leu Asp Arg Asp Ala Thr Val Asp Lys Ala Cys Ala Leu Ile
20 25 30
Thr Ala Ala Gly Ala Arg Gly Ala Lys Leu Val Val Leu Pro Glu Thr
35 40 45
Phe Val Pro Ala Tyr Pro Ala Trp Val Trp Tyr Leu Pro Leu Thr Arg
50 55 60
Arg Pro Asp Val Ala Lys Leu Tyr Arg Ala Leu Val Glu Asn Ala Ile
65 70 75 80
Asp Val Pro Gly Pro Glu Thr Glu Arg Leu Gly Arg Ala Ala Arg Asp
85 90 95
Ala Gly Ala Trp Val Ala Ile Gly Val Asn Glu Arg Asn Ala Lys Ala
100 105 110
Ser Arg Thr Ser Leu Tyr Asn Thr Val Leu Leu Phe Asp Asp Gln Gly
115 120 125
Thr Leu Val Glu Ser Arg Arg Lys Leu Met Pro Thr Gly Gly Glu Arg
130 135 140
Leu Val Trp Thr Pro Gly Glu Pro Val Pro Leu Arg Val His Asp Thr
145 150 155 160
Pro Leu Gly Arg Val Gly Ala Leu Ile Cys Trp Glu Asn Tyr Met Pro
165 170 175
Leu Ala Arg Phe Ala Leu Tyr Glu Gln Gly Val Gln Ile Tyr Leu Ala
180 185 190
Pro Thr Trp Asp Tyr Ser Glu Ala Trp Leu Ala Ser Met Arg His Val
195 200 205
Ala Arg Glu Gly Arg Thr Trp Val Ile Gly Cys Ser Gln Ala Val Arg
210 215 220
Arg Asp Glu Ile Pro Asp His Leu Pro Phe Lys Asp Ala Ile Pro Glu
225 230 235 240
Ser Leu Glu Trp Ile Asn Ala Gly Asn Ser Ile Val Val Asp Pro Asp
245 250 255
Gly Ala Val Val Ala Gly Pro Leu Ala Arg Ala His Asp Thr Leu Tyr
260 265 270
Val Glu Ile Asp Pro Gly Arg Ala Ala Gly Ser Arg Trp Ile Phe Asp
275 280 285
Ala Ala Gly His Tyr His Arg Pro Asp Leu Phe His Phe Ala Met Arg
290 295 300
Ala Pro Val Ala Pro Glu Thr Ala Pro Val Val Arg Lys Arg Gly Arg
305 310 315 320
Arg Leu Ala Ser Ala Pro Ala Arg Lys Pro Arg Lys Arg Pro Ala
325 330 335
<210>303
<211>1011
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>303
atgggcatcg ttcaccccaa gtacaaagtc gctgtcgttc aggctgcccc tgtatggcta 60
gacctggagg ccactgttga taaatgcatt cagttgattg aagaagcagc cagcaagggc 120
tgcaagctca tcgccttccc tgagaccttc attcccggat acccttggta tatctggatg 180
ggaacgcctg cctggactat tcagcgcggc tttgtacagc gctattttga taattctctg 240
tcttatgaca gtccgcaagc ggagaagcta aggcaggcag tcaaaaaggc tgagatcacg 300
gccgtactag gtctttctga gcgcagcggc ggtagcttgt acattgcaca atggaccatt 360
ggccctgacg gagaaaccat acacaaacgc agaaaagtgc gtccaacgca tggtgagcgt 420
acggtatttg gcgacggtga cggtagtgat cttgcggtgc acgatacccc cctggggcgg 480
ctgggcgcgc ttgcgtgctg ggagaacata ctgtcactga acaagtatgc gatgtattca 540
cagaatgagc aggtgcacgt agccgcttgg cctagcttct cggtctacga gcctttcgcc 600
catgcattgg gttgggaggt caataacgca gtcagcaagg tttacgcggt agaaggcggc 660
tgttttgtat tggcgccttg cgcagtggtc tccgaggaaa tgatcgaagc actgtgcgat 720
acacccgata agcaccaact ggctcatgcg ggtggagggc atgctgtcat ttacggacca 780
gatggcagtc ctctggcaga taagttaccc gaaggagagg aggggctatt aattgcagaa 840
attgatctcg gtctcatcag cttggcgaaa aatgccatgg acccggtggg gcattactct 900
cgacctgacg tacatcgctt gctattgaat cgcaatccag caaagcgggt tgaggaattt 960
tctctgccca ttgatttggc agagacaact ccgccaatat taggcacgta g 1011
<210>304
<211>336
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>304
Met Gly Ile Val His Pro Lys Tyr Lys Val Ala Val Val Gln Ala Ala
1 5 10 15
Pro Val Trp Leu Asp Leu Glu Ala Thr Val Asp Lys Cys Ile Gln Leu
20 25 30
Ile Glu Glu Ala Ala Ser Lys Gly Cys Lys Leu Ile Ala Phe Pro Glu
35 40 45
Thr Phe Ile Pro Gly Tyr Pro Trp Tyr Ile Trp Met Gly Thr Pro Ala
50 55 60
Trp Thr Ile Gln Arg Gly Phe Val Gln Arg Tyr Phe Asp Asn Ser Leu
65 70 75 80
Ser Tyr Asp Ser Pro Gln Ala Glu Lys Leu Arg Gln Ala Val Lys Lys
85 90 95
Ala Glu Ile Thr Ala Val Leu Gly Leu Ser Glu Arg Ser Gly Gly Ser
100 105 110
Leu Tyr Ile Ala Gln Trp Thr Ile Gly Pro Asp Gly Glu Thr Ile His
115 120 125
Lys Arg Arg Lys Val Arg Pro Thr His Gly Glu Arg Thr Val Phe Gly
130 135 140
Asp Gly Asp Gly Ser Asp Leu Ala Val His Asp Thr Pro Leu Gly Arg
145 150 155 160
Leu Gly Ala Leu Ala Cys Trp Glu Asn Ile Leu Ser Leu Asn Lys Tyr
165 170 175
Ala Met Tyr Ser Gln Asn Glu Gln Val His Val Ala Ala Trp Pro Ser
180 185 190
Phe Ser Val Tyr Glu Pro Phe Ala His Ala Leu Gly Trp Glu Val Asn
195 200 205
Asn Ala Val Ser Lys Val Tyr Ala Val Glu Gly Gly Cys Phe Val Leu
210 215 220
Ala Pro Cys Ala Val Val Ser Glu Glu Met Ile Glu Ala Leu Cys Asp
225 230 235 240
Thr Pro Asp Lys His Gln Leu Ala His Ala Gly Gly Gly His Ala Val
245 250 255
Ile Tyr Gly Pro Asp Gly Ser Pro Leu Ala Asp Lys Leu Pro Glu Gly
260 265 270
Glu Glu Gly Leu Leu Ile Ala Glu Ile Asp Leu Gly Leu Ile Ser Leu
275 280 285
Ala Lys Asn Ala Met Asp Pro Val Gly His Tyr Ser Arg Pro Asp Val
290 295 300
His Arg Leu Leu Leu Asn Arg Asn Pro Ala Lys Arg Val Glu Glu Phe
305 310 315 320
Ser Leu Pro Ile Asp Leu Ala Glu Thr Thr Pro Pro Ile Leu Gly Thr
325 330 335
<210>305
<211>1068
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>305
atgacgatgc aacatcccaa gtttcgcgct gctgccgtgc aggcggcacc ggtcttcctt 60
gaccttgatg cgtcgataga taaagccatc gacctgatcg cgcaagccgc caaaggcggc 120
gcgcaattga ttgcctttcc ggaaacctgg ctgcccggct accccttttt catctggctc 180
gattcgccgg cctggggcat gcaattcatc cagcgctacc atgacaattc cctggtctac 240
ggcacaccgc aggccgagcg catcgcgcag gctgcgaaaa agcatcgcat catggtcgtc 300
atggggcaca gcgagcggga tcatggaagt ctgtacatcg ctcagtggat catcggtgcc 360
gatggggaaa cggttgcgac acgtcgtaag ctcaaaccga ctcatgccga gcgcaccttg 420
tttggggaag gcgatggcag tgacctgagc gtgttcgata caccgctggg aagggttggc 480
gcactatgct gctgggagca cctccagccc ctgtcgaaat acgcgctata cgcacagaat 540
gagcaggtcc acattgcttc ctggccaagc ttttccctgt atcgcggggg cgcctacgcg 600
ctcggcgccg aagtgaacaa tgcggccagc cagatttatg ctgtcgaagg ccagtgtttt 660
gtgatcgcgc cgtgcggggt cgtcacgaaa gaaatgctgg acgtgctgtg caccgacgaa 720
atgaaaaagc agttgttggt tgaaggcggc gggttcgcgc gaatttacgc gcccgatgga 780
cagatgatgc acgcgccgct ggcggaaaac gaagagggcc tggtgtatgc cgatctcgac 840
ctgggcatga tctcgctggc caaagtagtc gccgacccgg ccgggcatta tgcgcggccc 900
gacgtaaccc ggttactgct ggacaagact ccgggagacc gcgtcatgct ggcgagccgt 960
cgcggcaagg aggtcagccg cgctggtaac gacgagccgc aagtgctggt ctcgcgcaac 1020
gacacgctga cctcgccgaa gccagcgtct tcgcgcaaag caagctga 1068
<210>306
<211>355
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>306
Met Thr Met Gln His Pro Lys Phe Arg Ala Ala Ala Val Gln Ala Ala
1 5 10 15
Pro Val Phe Leu Asp Leu Asp Ala Ser Ile Asp Lys Ala Ile Asp Leu
20 25 30
Ile Ala Gln Ala Ala Lys Gly Gly Ala Gln Leu Ile Ala Phe Pro Glu
35 40 45
Thr Trp Leu Pro Gly Tyr Pro Phe Phe Ile Trp Leu Asp Ser Pro Ala
50 55 60
Trp Gly Met Gln Phe Ile Gln Arg Tyr His Asp Asn Ser Leu Val Tyr
65 70 75 80
Gly Thr Pro Gln Ala Glu Arg Ile Ala Gln Ala Ala Lys Lys His Arg
85 90 95
Ile Met Val Val Met Gly His Ser Glu Arg Asp His Gly Ser Leu Tyr
100 105 110
Ile Ala Gln Trp Ile Ile Gly Ala Asp Gly Glu Thr Val Ala Thr Arg
115 120 125
Arg Lys Leu Lys Pro Thr His Ala Glu Arg Thr Leu Phe Gly Glu Gly
130 135 140
Asp Gly Ser Asp Leu Ser Val Phe Asp Thr Pro Leu Gly Arg Val Gly
145 150 155 160
Ala Leu Cys Cys Trp Glu His Leu Gln Pro Leu Ser Lys Tyr Ala Leu
165 170 175
Tyr Ala Gln Asn Glu Gln Val His Ile Ala Ser Trp Pro Ser Phe Ser
180 185 190
Leu Tyr Arg Gly Gly Ala Tyr Ala Leu Gly Ala Glu Val Asn Asn Ala
195 200 205
Ala Ser Gln Ile Tyr Ala Val Glu Gly Gln Cys Phe Val Ile Ala Pro
210 215 220
Cys Gly Val Val Thr Lys Glu Met Leu Asp Val Leu Cys Thr Asp Glu
225 230 235 240
Met Lys Lys Gln Leu Leu Val Glu Gly Gly Gly Phe Ala Arg Ile Tyr
245 250 255
Ala Pro Asp Gly Gln Met Met His Ala Pro Leu Ala Glu Asn Glu Glu
260 265 270
Gly Leu Val Tyr Ala Asp Leu Asp Leu Gly Met Ile Ser Leu Ala Lys
275 280 285
Val Val Ala Asp Pro Ala Gly His Tyr Ala Arg Pro Asp Val Thr Arg
290 295 300
Leu Leu Leu Asp Lys Thr Pro Gly Asp Arg Val Met Leu Ala Ser Arg
305 310 315 320
Arg Gly Lys Glu Val Ser Arg Ala Gly Asn Asp Glu Pro Gln Val Leu
325 330 335
Val Ser Arg Asn Asp Thr Leu Thr Ser Pro Lys Pro Ala Ser Ser Arg
340 345 350
Lys Ala Ser
355
<210>307
<211>942
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>307
atggctgaac cggaatcctt tatcgtcgct gctgtacagg ctacaccgat ctttcttgac 60
cgccaggcaa cccttgagaa agcgtgcgac ctgattgccg aagctggcag caatggcgca 120
aaactcgttc tttttcccga agcctttatc cccacctatc ctgattggat atgggcggtg 180
acaggctcac aatctgcgct gctcgacgaa ctttatgtgg aactactgga aaactccgtg 240
accatccccg acgcgaccac tgagcaactt tgtgaagcag cacgtaacgc cggtctctac 300
gtcgtcatgg gagtgaatga gcgcaacgcc gaggcgagca acgccacact ctataacacc 360
ctgctctata ttgacgatca gggcaaaatt ctcggcaagc atcgcaaatt ggtcccgacc 420
gccctggagc gaatcgtctg gggctatggc gatggcagca cgcttgacgc ctttgaaacg 480
ccgctgggca agattggcgg gctgatctgt tgggaaaatt acatgccact ggcgcgccaa 540
acactttatg cctggggggt gcaaatttac ttggccgcaa cgtgggatcg cggcgaagtt 600
tggcaggcga ccatgcgcca tattgccagg gaaggcggcg tctatgtagt cgcctcctgt 660
attccatttc acatcaaaga cattcctgac cacatgcctg aaatccgcaa tctctatgca 720
ccggggacag actggatcaa cgtcggccaa agctgcatca tcaaccccag cggcgactat 780
attgcaggcc ctgtcgagtg tcgcgaggag attctttatg ccgaggtaaa tctgcgccag 840
agtgcggcgg caaaacgtat gttggatgtg gcgggccatt atggacgccc tgatgtcttt 900
cacctcaccg tcaaccgcac gcccaatccg catattcgat aa 942
<210>308
<211>313
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>308
Met Ala Glu Pro Glu Ser Phe Ile Val Ala Ala Val Gln Ala Thr Pro
1 5 10 15
Ile Phe Leu Asp Arg Gln Ala Thr Leu Glu Lys Ala Cys Asp Leu Ile
20 25 30
Ala Glu Ala Gly Ser Asn Gly Ala Lys Leu Val Leu Phe Pro Glu Ala
35 40 45
Phe Ile Pro Thr Tyr Pro Asp Trp Ile Trp Ala Val Thr Gly Ser Gln
50 55 60
Ser Ala Leu Leu Asp Glu Leu Tyr Val Glu Leu Leu Glu Asn Ser Val
65 70 75 80
Thr Ile Pro Asp Ala Thr Thr Glu Gln Leu Cys Glu Ala Ala Arg Asn
85 90 95
Ala Gly Leu Tyr Val Val Met Gly Val Asn Glu Arg Asn Ala Glu Ala
100 105 110
Ser Asn Ala Thr Leu Tyr Asn Thr Leu Leu Tyr Ile Asp Asp Gln Gly
115 120 125
Lys Ile Leu Gly Lys His Arg Lys Leu Val Pro Thr Ala Leu Glu Arg
130 135 140
Ile Val Trp Gly Tyr Gly Asp Gly Ser Thr Leu Asp Ala Phe Glu Thr
145 150 155 160
Pro Leu Gly Lys Ile Gly Gly Leu Ile Cys Trp Glu Asn Tyr Met Pro
165 170 175
Leu Ala Arg Gln Thr Leu Tyr Ala Trp Gly Val Gln Ile Tyr Leu Ala
180 185 190
Ala Thr Trp Asp Arg Gly Glu Val Trp Gln Ala Thr Met Arg His Ile
195 200 205
Ala Arg Glu Gly Gly Val Tyr Val Val Ala Ser Cys Ile Pro Phe His
210 215 220
Ile Lys Asp Ile Pro Asp His Met Pro Glu Ile Arg Asn Leu Tyr Ala
225 230 235 240
Pro Gly Thr Asp Trp Ile Asn Val Gly Gln Ser Cys Ile Ile Asn Pro
245 250 255
Ser Gly Asp Tyr Ile Ala Gly Pro Val Glu Cys Arg Glu Glu Ile Leu
260 265 270
Tyr Ala Glu Val Asn Leu Arg Gln Ser Ala Ala Ala Lys Arg Met Leu
275 280 285
Asp Val Ala Gly His Tyr Gly Arg Pro Asp Val Phe His Leu Thr Val
290 295 300
Asn Arg Thr Pro Asn Pro His Ile Arg
305 310
<210>309
<211>951
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>309
ttgaccgact gtctcaaaat agccgccgcc caaatcactc cggtcttcct cgaccgcgtt 60
gcgaccacga agaaggtcgt cgaaaccatc gaaaaagcgg ccgccggcgg tgcgcggctg 120
gtcgcattcg gcgaagcgct gttgcccgcc tatccattgt ggctgacgcg caccgacgcc 180
gcgcggttca attccgacgt gcaaaaaaac ttgcacgcga tctatctcaa gcaatccgtc 240
tcgatagcag gcggtcacct atctccgatt tgcaaaatcg caagcgaacg caagattgcc 300
gtcatcctcg gcatcgccga gcgcgcgacc gaccggggcg accacaccat ttactgctcg 360
tgcgtgttca tcgatgccga cggccgaatc gcgtcggtcc atcgcaagct gatgccgaca 420
tacgaagaac gcctcagttg gggattcggc gacggtgcgg gactcgtcac gcatccggtc 480
gggccgttca cggtgggcgc gttgaactgc tgggaaaact ggatgcccct cgcgcgcacc 540
gcgctgtatg ccggcggaga agatttgcac gttgcgatct ggcccggcgg atcggtgctc 600
acggaagaca tcacgcgctt catcgcacgc gagtcgcgct cgttcgtcct gtccgtcagc 660
ggcatcattc gcgaaagcga catccccagc ggggtcccct atcgcgatga aatgtgtgcg 720
aaaggcgaaa ccatctacaa cggcggaagc tgcatcgccg gacccgacgg tcagtggatc 780
atcgcgcccg taaccgaccg tgaagagttg atcttcgcgg agatcgacca cgaacacgtc 840
cgccgcgagc ggcagaattt cgacccggcc gggcattacg cgcggcccga tgtgttgcaa 900
ataaccgttg atcgtcgacg acaaacagcg gcgaatttta ttgatgacta a 951
<210>310
<211>316
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>310
Leu Thr Asp Cys Leu Lys Ile Ala Ala Ala Gln Ile Thr Pro Val Phe
1 5 10 15
Leu Asp Arg Val Ala Thr Thr Lys Lys Val Val Glu Thr Ile Glu Lys
20 25 30
Ala Ala Ala Gly Gly Ala Arg Leu Val Ala Phe Gly Glu Ala Leu Leu
35 40 45
Pro Ala Tyr Pro Leu Trp Leu Thr Arg Thr Asp Ala Ala Arg Phe Asn
50 55 60
Ser Asp Val Gln Lys Asn Leu His Ala Ile Tyr Leu Lys Gln Ser Val
65 70 75 80
Ser Ile Ala Gly Gly His Leu Ser Pro Ile Cys Lys Ile Ala Ser Glu
85 90 95
Arg Lys Ile Ala Val Ile Leu Gly Ile Ala Glu Arg Ala Thr Asp Arg
100 105 110
Gly Asp His Thr Ile Tyr Cys Ser Cys Val Phe Ile Asp Ala Asp Gly
115 120 125
Arg Ile Ala Ser Val His Arg Lys Leu Met Pro Thr Tyr Glu Glu Arg
130 135 140
Leu Ser Trp Gly Phe Gly Asp Gly Ala Gly Leu Val Thr His Pro Val
145 150 155 160
Gly Pro Phe Thr Val Gly Ala Leu Asn Cys Trp Glu Asn Trp Met Pro
165 170 175
Leu Ala Arg Thr Ala Leu Tyr Ala Gly Gly Glu Asp Leu His Val Ala
180 185 190
Ile Trp Pro Gly Gly Ser Val Leu Thr Glu Asp Ile Thr Arg Phe Ile
195 200 205
Ala Arg Glu Ser Arg Ser Phe Val Leu Ser Val Ser Gly Ile Ile Arg
210 215 220
Glu Ser Asp Ile Pro Ser Gly Val Pro Tyr Arg Asp Glu Met Cys Ala
225 230 235 240
Lys Gly Glu Thr Ile Tyr Asn Gly Gly Ser Cys Ile Ala Gly Pro Asp
245 250 255
Gly Gln Trp Ile Ile Ala Pro Val Thr Asp Arg Glu Glu Leu Ile Phe
260 265 270
Ala Glu Ile Asp His Glu His Val Arg Arg Glu Arg Gln Asn Phe Asp
275 280 285
Pro Ala Gly His Tyr Ala Arg Pro Asp Val Leu Gln Ile Thr Val Asp
290 295 300
Arg Arg Arg Gln Thr Ala Ala Asn Phe Ile Asp Asp
305 310 315
<210>311
<211>1011
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>311
atgtcagaaa agcgaataat cagagcagct gcagttcaga tcacacctga atttgactca 60
gcagatggaa cagttaagaa ggtatgcaag gtaatcgatg aagcaggcgc aaagggtgta 120
caaattatag tattccccga aaccttcatt ccgtattacc catacttctc attcattacc 180
cccccagttt ctgctggcgc tgagcatttg aagctttatg aaaaaagtgt cgtgatacct 240
ggtcctgtca ctcaagcgat cgccgagaga gctagagtga atcaaatggt cgtcgtactc 300
ggtgtaaacg agagagataa cggtagcctc tataacactc aactgatctt cgataccaat 360
ggtgagttga tgttgaagag aagaaaaatc actcctacat atcatgagcg catgatctgg 420
ggacaaggtg atgcttcagg cttaaaagta gttgaaacga gcattgcccg ggtaggtgct 480
ctagcttgct gggaacatta caacccgctg gccagatatt ctctcatgac acagcatgaa 540
gaaattcact gcgcacaatt cccaggttct atggttggcc aaatatttgc cgaccaaatg 600
gatgtcacta tcagacatca cgcattggaa tctggctgtt tcgtcattaa tgccaccggc 660
tggctcacag acgagcaaat ccagtccatt acagatgacc caaaaatgca gaaagcatta 720
cgtggcggct gcaacacagc aatcatttct cccgaaggtg tgcacttaac agagccctta 780
cgtgaaggtg aaggcatttt gattgctgac ctggacatgt cactcatcac aaaacgaaaa 840
agaatgatgg attcagtagg tcattattca agacctgaac tattaagtct ggcgatcaat 900
gacaagccag caacaacaaa attttcaatg actgaggggt gtactcaaac tgagcaattt 960
cgaatcgcag aggagttgaa aaatgacgac aagcttagca ccggaaacta a 1011
<210>312
<211>336
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>312
Met Ser Glu Lys Arg Ile Ile Arg Ala Ala Ala Val Gln Ile Thr Pro
1 5 10 15
Glu Phe Asp Ser Ala Asp Gly Thr Val Lys Lys Val Cys Lys Val Ile
20 25 30
Asp Glu Ala Gly Ala Lys Gly Val Gln Ile Ile Val Phe Pro Glu Thr
35 40 45
Phe Ile Pro Tyr Tyr Pro Tyr Phe Ser Phe Ile Thr Pro Pro Val Ser
50 55 60
Ala Gly Ala Glu His Leu Lys Leu Tyr Glu Lys Ser Val Val Ile Pro
65 70 75 80
Gly Pro Val Thr Gln Ala Ile Ala Glu Arg Ala Arg Val Asn Gln Met
85 90 95
Val Val Val Leu Gly Val Asn Glu Arg Asp Asn Gly Ser Leu Tyr Asn
100 105 110
Thr Gln Leu Ile Phe Asp Thr Asn Gly Glu Leu Met Leu Lys Arg Arg
115 120 125
Lys Ile Thr Pro Thr Tyr His Glu Arg Met Ile Trp Gly Gln Gly Asp
130 135 140
Ala Ser Gly Leu Lys Val Val Glu Thr Ser Ile Ala Arg Val Gly Ala
145 150 155 160
Leu Ala Cys Trp Glu His Tyr Asn Pro Leu Ala Arg Tyr Ser Leu Met
165 170 175
Thr Gln His Glu Glu Ile His Cys Ala Gln Phe Pro Gly Ser Met Val
180 185 190
Gly Gln Ile Phe Ala Asp Gln Met Asp Val Thr Ile Arg His His Ala
195 200 205
Leu Glu Ser Gly Cys Phe Val Ile Asn Ala Thr Gly Trp Leu Thr Asp
210 215 220
Glu Gln Ile Gln Ser Ile Thr Asp Asp Pro Lys Met Gln Lys Ala Leu
225 230 235 240
Arg Gly Gly Cys Asn Thr Ala Ile Ile Ser Pro Glu Gly Val His Leu
245 250 255
Thr Glu Pro Leu Arg Glu Gly Glu Gly Ile Leu Ile Ala Asp Leu Asp
260 265 270
Met Ser Leu Ile Thr Lys Arg Lys Arg Met Met Asp Ser Val Gly His
275 280 285
Tyr Ser Arg Pro Glu Leu Leu Ser Leu Ala Ile Asn Asp Lys Pro Ala
290 295 300
Thr Thr Lys Phe Ser Met Thr Glu Gly Cys Thr Gln Thr Glu Gln Phe
305 310 315 320
Arg Ile Ala Glu Glu Leu Lys Asn Asp Asp Lys Leu Ser Thr Gly Asn
325 330 335
<210>313
<211>987
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>313
atggcggccg ttcaggctgc gccggttccg tttgatgctg aggcttcggt ggataaggcc 60
tgtcgcttaa ttcaagaagc tgcagccaaa ggcgcagata tagttgcttt cggtgaggca 120
tggctacccg gctaccccta ttttgcctgg ttaccccaag taacaccaga gtggtatagt 180
gcggctgccg attatcttgc cagctctgtt gatatccctg gcccgatcac cgataaactt 240
tgccaggctg cccgtcgtgc atcggttgaa ctcatcatgg gtgtggtaga acgcagcaag 300
tctcagggaa ccacctattg cacgcttctt tttattagca aggatggcga aataattggc 360
aagcaccgca agctgaagcc cacactcgcc gaacgaaccg tctggggtga aggcgatgcc 420
agcggactga gggttcacga tcggcctatt gccagaatca gtgggctctc ctgctgggag 480
aacaaaatga tgctgccagg ttacgcactg atggcgcagg gtacgcaggt gcatgtctcc 540
gcctggccag ggatcccaga ggattcaccc atggaggtgc ctgcacaccc ccgtcaaaag 600
ctgctttccc aagcctttgc actgcaaggc gggtgctatg ttatttctcc ctccattgtc 660
cttagggcag aagatgtgcc cgagaaacac gccgctctac tgatgggaga ccaagtgggc 720
ggtagctata tcattgaccc ctgcggaaaa gtgatcgccg aggccggtgc gggtgagact 780
atcctgattg ccaaaggcaa ccttgacctc gtaagggccg ccaaaatggc cagtgatgta 840
ggggggtctt attcacgccc ggatcttttg cagttgatga tcaataaccg accactcgaa 900
cagctgattg agttcagtgc tgaaggtgca ggtaggggga atctagtatc caactcaccc 960
gaggtgtcag aacaagaagg tgagtaa 987
<210>314
<211>328
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>314
Met Ala Ala Val Gln Ala Ala Pro Val Pro Phe Asp Ala Glu Ala Ser
1 5 10 15
Val Asp Lys Ala Cys Arg Leu Ile Gln Glu Ala Ala Ala Lys Gly Ala
20 25 30
Asp Ile Val Ala Phe Gly Glu Ala Trp Leu Pro Gly Tyr Pro Tyr Phe
35 40 45
Ala Trp Leu Pro Gln Val Thr Pro Glu Trp Tyr Ser Ala Ala Ala Asp
50 55 60
Tyr Leu Ala Ser Ser Val Asp Ile Pro Gly Pro Ile Thr Asp Lys Leu
65 70 75 80
Cys Gln Ala Ala Arg Arg Ala Ser Val Glu Leu Ile Met Gly Val Val
85 90 95
Glu Arg Ser Lys Ser Gln Gly Thr Thr Tyr Cys Thr Leu Leu Phe Ile
100 105 110
Ser Lys Asp Gly Glu Ile Ile Gly Lys His Arg Lys Leu Lys Pro Thr
115 120 125
Leu Ala Glu Arg Thr Val Trp Gly Glu Gly Asp Ala Ser Gly Leu Arg
130 135 140
Val His Asp Arg Pro Ile Ala Arg Ile Ser Gly Leu Ser Cys Trp Glu
145 150 155 160
Asn Lys Met Met Leu Pro Gly Tyr Ala Leu Met Ala Gln Gly Thr Gln
165 170 175
Val His Val Ser Ala Trp Pro Gly Ile Pro Glu Asp Ser Pro Met Glu
180 185 190
Val Pro Ala His Pro Arg Gln Lys Leu Leu Ser Gln Ala Phe Ala Leu
195 200 205
Gln Gly Gly Cys Tyr Val Ile Ser Pro Ser Ile Val Leu Arg Ala Glu
210 215 220
Asp Val Pro Glu Lys His Ala Ala Leu Leu Met Gly Asp Gln Val Gly
225 230 235 240
Gly Ser Tyr Ile Ile Asp Pro Cys Gly Lys Val Ile Ala Glu Ala Gly
245 250 255
Ala Gly Glu Thr Ile Leu Ile Ala Lys Gly Asn Leu Asp Leu Val Ara
260 265 270
Ala Ala Lys Met Ala Ser Asp Val Gly Gly Ser Tyr Ser Arg Pro Asp
275 280 285
Leu Leu Gln Leu Met Ile Asn Asn Arg Pro Leu Glu Gln Leu Ile Glu
290 295 300
Phe Ser Ala Glu Gly Ala Gly Arg Gly Asn Leu Val Ser Asn Ser Pro
305 310 315 320
Glu Val Ser Glu Gln Glu Gly Glu
325
<210>315
<211>960
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>315
atgacaatgc ttaagaccaa attccgggtt gctgcggtgc aggcagcgcc tgtattcctg 60
gatcgagagg ctacgctgga gaaagcctgc ggactgattg aggaggcggg tcgcaacggg 120
gccagcctgg tggtctttcc tgagtcgttc attccggcct atcccgattg ggtctgggct 180
gtgccggcgg gtgaagaagc tttacttaat gaactgtatg ctcaactgct ggccaacgcc 240
gttgaaattc ccagcccggc caccgaacgc ttgagccagg cagcgaaaaa ggctaaagtc 300
catgtggtta tgggcctgac cgaacgcaac agcgaggcca gcggcggcag cctctacaat 360
accttgctct atcttgaccc acagggccaa attctgggca aacatcgcaa gctggtgccc 420
accggcggcg agcggctggt ttgggcccag ggcgacggca gtaccctgca agtctacgag 480
acccccttgg gtaaactcag cggcttgatt tgctgggaaa attatatgcc gctggcccgc 540
tacgcgctct atgcctgggg tacgcaaatc tatatcgccg ccacctggga tcgaggcgag 600
ccgtggcttt cgacgctgcg gcatatcgcc aaagagggcc gggtgtttgt catcggctgt 660
ggcatggcct tgcgtaaggc tgatattccc gaccattttg aattcaagca gcgcttttat 720
caaaatgccg gcgagtggat caatggaggc gacagcgcca ttgtcaatcc tgagggtgaa 780
tttattgctg gacccctgag cgagcaggaa ggtattttgt acgccgagat tgatcctggc 840
cagatggccg gaccaaaatg gatgctcgat gtggccgggc actatgctcg cccggatgtc 900
tttgaactga ccgtccagac cgtagctcgg cccatgatta cctctactca gccgcgatag 960
<210>316
<211>319
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>316
Met Thr Met Leu Lys Thr Lys Phe Arg Val Ala Ala Val Gln Ala Ala
1 5 10 15
Pro Val Phe Leu Asp Arg Glu Ala Thr Leu Glu Lys Ala Cys Gly Leu
20 25 30
Ile Glu Glu Ala Gly Arg Asn Gly Ala Ser Leu Val Val Phe Pro Glu
35 40 45
Ser Phe Ile Pro Ala Tyr Pro Asp Trp Val Trp Ala Val Pro Ala Gly
50 55 60
Glu Glu Ala Leu Leu Asn Glu Leu Tyr Ala Gln Leu Leu Ala Asn Ala
65 70 75 80
Val Glu Ile Pro Ser Pro Ala Thr Glu Arg Leu Ser Gln Ala Ala Lys
85 90 95
Lys Ala Lys Val His Val Val Met Gly Leu Thr Glu Arg Asn Ser Glu
100 105 110
Ala Ser Gly Gly Ser Leu Tyr Asn Thr Leu Leu Tyr Leu Asp Pro Gln
115 120 125
Gly Gln Ile Leu Gly Lys His Arg Lys Leu Val Pro Thr Gly Gly Glu
130 135 140
Arg Leu Val Trp Ala Gln Gly Asp Gly Ser Thr Leu Gln Val Tyr Glu
145 150 155 160
Thr Pro Leu Gly Lys Leu Ser Gly Leu Ile Cys Trp Glu Asn Tyr Met
165 170 175
Pro Leu Ala Arg Tyr Ala Leu Tyr Ala Trp Gly Thr Gln Ile Tyr Ile
180 185 190
Ala Ala Thr Trp Asp Arg Gly Glu Pro Trp Leu Ser Thr Leu Arg His
195 200 205
Ile Ala Lys Glu Gly Arg Val Phe Val Ile Gly Cys Gly Met Ala Leu
210 215 220
Arg Lys Ala Asp Ile Pro Asp His Phe Glu Phe Lys Gln Arg Phe Tyr
225 230 235 240
Gln Asn Ala Gly Glu Trp Ile Asn Gly Gly Asp Ser Ala Ile Val Asn
245 250 255
Pro Glu Gly Glu Phe Ile Ala Gly Pro Leu Ser Glu Gln Glu Gly Ile
260 265 270
Leu Tyr Ala Glu Ile Asp Pro Gly Gln Met Ala Gly Pro Lys Trp Met
275 280 285
Leu Asp Val Ala Gly His Tyr Ala Arg Pro Asp Val Phe Glu Leu Thr
290 295 300
Val Gln Thr Val Ala Arg Pro Met Ile Thr Ser Thr Gln Pro Arg
305 310 315
<210>317
<211>993
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>317
atgaccattg taaaagccgc tgccgtccaa ataagccctg tcctttacag ccgcgaaggc 60
accgtggaca aggttgtcca gaagatcctc gaactcggca agcaaggcgt ccagttcgcc 120
actttcccgg agacggtggt cccctactac ccctacttct ccttcgtcca gtcgggctac 180
gccctcaagg tgggcaagga acatctgcgc ttgctcgaac agtcggtcac cgtgccatcg 240
gccaccacgc tcgccatcgg cgaagcctgc aagcaggcgg ggatggtggt gtccatcggc 300
gtcaacgaac gcgacggcag cacgatctac aacacgcagc tgctcttcga tgccgacggc 360
accttgattc agcgccgccg aaagatcagc ccgaccttcc atgaacgcat ggtctggggc 420
cagggcgacg gctccgggct gcgcgcggtc gacagcgcgg tcgggcgcat cggccagttg 480
gcgtgctggg agcactacaa cccgctggcc cgctacgcca tgatggccga cggcgagcag 540
atccactcgg cgatgtaccc cggttccttc gcaggcgacg ccttctccga acagatccag 600
gtcaacatcc gccagcacgc attggaagcc ggctgcttcg tcgtgaacgc caccgcgtgg 660
ctggacgccg atcagcaggc gcagatcatg caggacaccg gttgcgccat cggcccgatc 720
tccagtggtt gcttcaccgc catcgtttcg ccggacggcg tgttgctggg cgagcctctg 780
cggtccggtg agggcgaggt gattgccgat ctcgacttca cgctgatcga caagcgcaag 840
cagatgatgg attcacgcgg gcactatgcg cgcccggaat tgctcagcct gttgatcgac 900
cgcacggcga ccgcgcatgt gcatgagcgc agcgcgcatc cgaaggcgac tgccgagcag 960
gcggacggtc catccgccgt gaatgcgcag taa 993
<210>318
<211>330
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>318
Met Thr Ile Val Lys Ala Ala Ala Val Gln Ile Ser Pro Val Leu Tyr
1 5 10 15
Ser Arg Glu Gly Thr Val Asp Lys Val Val Gln Lys Ile Leu Glu Leu
20 25 30
Gly Lys Gln Gly Val Gln Phe Ala Thr Phe Pro Glu Thr Val Val Pro
35 40 45
Tyr Tyr Pro Tyr Phe Ser Phe Val Gln Ser Gly Tyr Ala Leu Lys Val
50 55 60
Gly Lys Glu His Leu Arg Leu Leu Glu Gln Ser Val Thr Val Pro Ser
65 70 75 80
Ala Thr Thr Leu Ala Ile Gly Glu Ala Cys Lys Gln Ala Gly Met Val
85 90 95
Val Ser Ile Gly Val Asn Glu Arg Asp Gly Ser Thr Ile Tyr Asn Thr
100 105 110
Gln Leu Leu Phe Asp Ala Asp Gly Thr Leu Ile Gln Arg Arg Arg Lys
115 120 125
Ile Ser Pro Thr Phe His Glu Arg Met Val Trp Gly Gln Gly Asp Gly
130 135 140
Ser Gly Leu Arg Ala Val Asp Ser Ala Val Gly Arg Ile Gly Gln Leu
145 150 155 160
Ala Cys Trp Glu His Tyr Asn Pro Leu Ala Arg Tyr Ala Met Met Ala
165 170 175
Asp Gly Glu Gln Ile His Ser Ala Met Tyr Pro Gly Ser Phe Ala Gly
180 185 190
Asp Ala Phe Ser Glu Gln Ile Gln Val Asn Ile Arg Gln His Ala Leu
195 200 205
Glu Ala Gly Cys Phe Val Val Asn Ala Thr Ala Trp Leu Asp Ala Asp
210 215 220
Gln Gln Ala Gln Ile Met Gln Asp Thr Gly Cys Ala Ile Gly Pro Ile
225 230 235 240
Ser Ser Gly Cys Phe Thr Ala Ile Val Ser Pro Asp Gly Val Leu Leu
245 250 255
Gly Glu Pro Leu Arg Ser Gly Glu Gly Glu Val Ile Ala Asp Leu Asp
260 265 270
Phe Thr Leu Ile Asp Lys Arg Lys Gln Met Met Asp Ser Arg Gly His
275 280 285
Tyr Ala Arg Pro Glu Leu Leu Ser Leu Leu Ile Asp Arg Thr Ala Thr
290 295 300
Ala His Val His Glu Arg Ser Ala His Pro Lys Ala Thr Ala Glu Gln
305 310 315 320
Ala Asp Gly Pro Ser Ala Val Asn Ala Gln
325 330
<210>319
<211>1017
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>319
atgacgaccc cgcgtatcgt ccgcgtcgcc gcggtgcaga tggcgcccga cctggaatcc 60
gcacacggca cggtagacaa ggtctgccgc gccattctgg aggccggcga aaaaggcgcg 120
cgcatggtgg tctttcccga gaccttcgtg ccctactacc cctacttctc cttcatccag 180
ccggcggtga cgatgggcgc cgcccacctc gagctctacg agcgcgcggt gacggtgccc 240
ggcccggtca ccctggccgt gggcgaagcg gcgcggcggg cgggcgcggt cgtcgtgctc 300
ggcgtcaatg aacgcgacca cggctcgctc tacaacaccc agctgatctt cgacgagacc 360
ggcgcgctgg tgctcaagcg ccgcaagctc accccgacct atcacgagcg catggtgtgg 420
ggccagggcg atggcagcgg tctcaaggtg gtggacaccg gcatcggccg catcggcgcg 480
ctggcctgct gggagcacta caacccgctc gcccgctaca ccctgatggc gcagcatgaa 540
gagatccacg ccgcgcagtt ccccggttcc atggtcggcc agatctttgc cgaccagatg 600
gcggtgacca tccgccacca cgcgctggaa tccggctgct tcgtggtgaa cgccaccggc 660
tggctcaccg atgcgcagat caccgccatc acccccgacc cggccatgca gcgcgcgctg 720
cgcggcggct gccacaccgc catcgtgtcg ccggaaggca gctacgtctg cgaaccgctc 780
accgagggcg aaggcatgct ggtggccgac ctcgacatgc ggctggtcac caagcgcaag 840
cggatgatgg attcggtcgg ccactacgcc cggccggagc tgctttcgct caacgccgat 900
cttgccccca agccggcgct gcacacacag ccggctgcgt ccctccctct ctcccttcag 960
gcaggagccg accatgtcga cgacgaccgc accgcgtccg caacagctga tctctga 1017
<210>320
<211>338
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>320
Met Thr Thr Pro Arg Ile Val Arg Val Ala Ala Val Gln Met Ala Pro
1 5 10 15
Asp Leu Glu Ser Ala His Gly Thr Val Asp Lys Val Cys Arg Ala Ile
20 25 30
Leu Glu Ala Gly Glu Lys Gly Ala Arg Met Val Val Phe Pro Glu Thr
35 40 45
Phe Val Pro Tyr Tyr Pro Tyr Phe Ser Phe Ile Gln Pro Ala Val Thr
50 55 60
Met Gly Ala Ala His Leu Glu Leu Tyr Glu Arg Ala Val Thr Val Pro
65 70 75 80
Gly Pro Val Thr Leu Ala Val Gly Glu Ala Ala Arg Arg Ala Gly Ala
85 90 95
Val Val Val Leu Gly Val Asn Glu Arg Asp His Gly Ser Leu Tyr Asn
100 105 110
Thr Gln Leu Ile Phe Asp Glu Thr Gly Ala Leu Val Leu Lys Arg Arg
115 120 125
Lys Leu Thr Pro Thr Tyr His Glu Arg Met Val Trp Gly Gln Gly Asp
130 135 140
Gly Ser Gly Leu Lys Val Val Asp Thr Gly Ile Gly Arg Ile Gly Ala
145 150 155 160
Leu Ala Cys Trp Glu His Tyr Asn Pro Leu Ala Arg Tyr Thr Leu Met
165 170 175
Ala Gln His Glu Glu Ile His Ala Ala Gln Phe Pro Gly Ser Met Val
180 185 190
Gly Gln Ile Phe Ala Asp Gln Met Ala Val Thr Ile Arg His His Ala
195 200 205
Leu Glu Ser Gly Cys Phe Val Val Asn Ala Thr Gly Trp Leu Thr Asp
210 215 220
Ala Gln Ile Thr Ala Ile Thr Pro Asp Pro Ala Met Gln Arg Ala Leu
225 230 235 240
Arg Gly Gly Cys His Thr Ala Ile Val Ser Pro Glu Gly Ser Tyr Val
245 250 255
Cys Glu Pro Leu Thr Glu Gly Glu Gly Met Leu Val Ala Asp Leu Asp
260 265 270
Met Arg Leu Val Thr Lys Arg Lys Arg Met Met Asp Ser Val Gly His
275 280 285
Tyr Ala Arg Pro Glu Leu Leu Ser Leu Asn Ala Asp Leu Ala Pro Lys
290 295 300
Pro Ala Leu His Thr Gln Pro Ala Ala Ser Leu Pro Leu Ser Leu Gln
305 310 315 320
Ala Gly Ala Asp His Val Asp Asp Asp Arg Thr Ala Ser Ala Thr Ala
325 330 335
Asp Leu
<210>321
<211>993
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>321
atgagagttg tcaaagccgc tgctgtccaa ctgagtcctg tcctccatag ccgcgacgga 60
acggtcgaaa aggtcgtgcg gaagatccat gaactcgccg aagagggagt cgagttcgcc 120
acctttcctg agaccgtggt gccttactac ccgtactttt ccttcgttca gacgcccttg 180
cagcaaatct tcggaactga gtatctgagg ctgctcgacc aggcagtcac cgtgccatcc 240
gctgccaccg acgcgatcgg cgaggctgcc aggtgggctg gacttgttgt ctcgatcggc 300
gtcaacgagc gagacggggg aactctctac aacactcagc ttctcttcga tgccgacggc 360
agcttaattc agcggcgtcg caagatcaca cccacccatt acgagcgcat gatctggggc 420
cagggcgacg gctcaggtct gcgggccgtt gatagcaagg ccggccgcat tggtcagctg 480
gcatgctggg aacacaacaa ccccctggcg cgctacgcgc tgatggccga cggcgagcag 540
atccattcgg ccatgtatcc gggctccatg ttcggcgact cgttttccca aaagaccgaa 600
atcaatatcc ggcagcatgc gctggaatct gcgtgcttcg tcgtgaacgc aacggcctgg 660
ctggacgccg atcagcaggc gcaaatcatg aaggacaccg gctgcggcat cggcccgatc 720
tccggcggtt gcttcactgc gatcgttgca cccgatggta gcctgctggg cgaacccatc 780
cgttccggtg agggcgtcgt cgtcgccaac ctcgacttca cgctgatcga caggcgtaag 840
caggtgatgg actcgcgagg ccactacagc cggccggagt tgctcagcct cttaatagac 900
cgcactccta ccgcgcacgt tcacgaacgc gctacgcacc ccacgacagg agctgagcaa 960
ggctccgagg atgtgttcga ggctcgcatt taa 993
<210>322
<211>330
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>322
Met Arg Val Val Lys Ala Ala Ala Val Gln Leu Ser Pro Val Leu His
1 5 10 15
Ser Arg Asp Gly Thr Val Glu Lys Val Val Arg Lys Ile His Glu Leu
20 25 30
Ala Glu Glu Gly Val Glu Phe Ala Thr Phe Pro Glu Thr Val Val Pro
35 40 45
Tyr Tyr Pro Tyr Phe Ser Phe Val Gln Thr Pro Leu Gln Gln Ile Phe
50 55 60
Gly Thr Glu Tyr Leu Arg Leu Leu Asp Gln Ala Val Thr Val Pro Ser
65 70 75 80
Ala Ala Thr Asp Ala Ile Gly Glu Ala Ala Arg Trp Ala Gly Leu Val
85 90 95
Val Ser Ile Gly Val Asn Glu Arg Asp Gly Gly Thr Leu Tyr Asn Thr
100 105 110
Gln Leu Leu Phe Asp Ala Asp Gly Ser Leu Ile Gln Arg Arg Arg Lys
115 120 125
Ile Thr Pro Thr His Tyr Glu Arg Met Ile Trp Gly Gln Gly Asp Gly
130 135 140
Ser Gly Leu Arg Ala Val Asp Ser Lys Ala Gly Arg Ile Gly Gln Leu
145 150 155 160
Ala Cys Trp Glu His Asn Asn Pro Leu Ala Arg Tyr Ala Leu Met Ala
165 170 175
Asp Gly Glu Gln Ile His Ser Ala Met Tyr Pro Gly Ser Met Phe Gly
180 185 190
Asp Ser Phe Ser Gln Lys Thr Glu Ile Asn Ile Arg Gln His Ala Leu
195 200 205
Glu Ser Ala Cys Phe Val Val Asn Ala Thr Ala Trp Leu Asp Ala Asp
210 215 220
Gln Gln Ala Gln Ile Met Lys Asp Thr Gly Cys Gly Ile Gly Pro Ile
225 230 235 240
Ser Gly Gly Cys Phe Thr Ala Ile Val Ala Pro Asp Gly Ser Leu Leu
245 250 255
Gly Glu Pro Ile Arg Ser Gly Glu Gly Val Val Val Ala Asn Leu Asp
260 265 270
Phe Thr Leu Ile Asp Arg Arg Lys Gln Val Met Asp Ser Arg Gly His
275 280 285
Tyr Ser Arg Pro Glu Leu Leu Ser Leu Leu Ile Asp Arg Thr Pro Thr
290 295 300
Ala His Val His Glu Arg Ala Thr His Pro Thr Thr Gly Ala Glu Gln
305 310 315 320
Gly Ser Glu Asp Val Phe Glu Ala Arg Ile
325 330
<210>323
<211>951
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>323
atgagcaacg tgaaagtcgc ggtcgtgcaa cgcgcgccgg tgttcggcaa ccgcgccgcc 60
accctcgatc gcgccgtcgc cgcgctggcc gaagcggcgc agcagggggc gaagctggtc 120
gtgatgcccg agcacttcat ccccggctac ccggcctgga tctggcgcct gcgtccgggc 180
accgacctgc gattgtgcga acagctgcac gcgatgctgc gcgccaacgc cgtgaggctg 240
gatgacggtg acctggcccc gttgaccgag gccgcgcagc ggcatgcgct caccgtggtc 300
tgcggcgtct gcgagatcga caccgaattc agtcgcggca ccctgtacaa caccgtggtc 360
gtgatcgggc ccgacggcac gctgctcaac cggcatcgca agctgatgcc caccaacccc 420
gagcgcatgg tctggggcat gggcgacgcc acggggctga aggtggtcga cacgccctgc 480
gggcgcatcg gcacgctgat ttgctgggag aactacatgc cattcgcacg cgccgcgctg 540
tacgcgcagg gggtcgaggt cctggttgca ccgacctacg acgaaggccc ggtatggctg 600
gcgtcgatgc agcacatcgc ccgcgaaggc ggctgctggg tggtgggcaa cggctgcgca 660
ttccagggcc gcgacatgcc ggacaccttg ccgggcaagg cccagctgtt tcccgaggcc 720
gacgcctggg tcaacgccgg ggactcggtc atcgtcgcgc caggcggccg gacagtggcg 780
ggtccgttgc acgaggcgtt cgggctgttc accgccgaga tcgacctctc ccgggtcgga 840
atggcccggc gcagcctgga tgtggccggg cactatggac ggcccgacat cttctgcctg 900
caggtcaacg cccgggcgca gccgccggtt gaggtgacgc accatggctg a 951
<210>324
<211>316
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>324
Met Ser Asn Val Lys Val Ala Val Val Gln Arg Ala Pro Val Phe Gly
1 5 10 15
Asn Arg Ala Ala Thr Leu Asp Arg Ala Val Ala Ala Leu Ala Glu Ala
20 25 30
Ala Gln Gln Gly Ala Lys Leu Val Val Met Pro Glu His Phe Ile Pro
35 40 45
Gly Tyr Pro Ala Trp Ile Trp Arg Leu Arg Pro Gly Thr Asp Leu Arg
50 55 60
Leu Cys Glu Gln Leu His Ala Met Leu Arg Ala Asn Ala Val Arg Leu
65 70 75 80
Asp Asp Gly Asp Leu Ala Pro Leu Thr Glu Ala Ala Gln Arg His Ala
85 90 95
Leu Thr Val Val Cys Gly Val Cys Glu Ile Asp Thr Glu Phe Ser Arg
100 105 110
Gly Thr Leu Tyr Asn Thr Val Val Val Ile Gly Pro Asp Gly Thr Leu
115 120 125
Leu Asn Arg His Arg Lys Leu Met Pro Thr Asn Pro Glu Arg Met Val
130 135 140
Trp Gly Met Gly Asp Ala Thr Gly Leu Lys Val Val Asp Thr Pro Cys
145 150 155 160
Gly Arg Ile Gly Thr Leu Ile Cys Trp Glu Asn Tyr Met Pro Phe Ala
165 170 175
Arg Ala Ala Leu Tyr Ala Gln Gly Val Glu Val Leu Val Ala Pro Thr
180 185 190
Tyr Asp Glu Gly Pro Val Trp Leu Ala Ser Met Gln His Ile Ala Arg
195 200 205
Glu Gly Gly Cys Trp Val Val Gly Asn Gly Cys Ala Phe Gln Gly Arg
210 215 220
Asp Met Pro Asp Thr Leu Pro Gly Lys Ala Gln Leu Phe Pro Glu Ala
225 230 235 240
Asp Ala Trp Val Asn Ala Gly Asp Ser Val Ile Val Ala Pro Gly Gly
245 250 255
Arg Thr Val Ala Gly Pro Leu His Glu Ala Phe Gly Leu Phe Thr Ala
260 265 270
Glu Ile Asp Leu Ser Arg Val Gly Met Ala Arg Arg Ser Leu Asp Val
275 280 285
Ala Gly His Tyr Gly Arg Pro Asp Ile Phe Cys Leu Gln Val Asn Ala
290 295 300
Arg Ala Gln Pro Pro Val Glu Val Thr His His Gly
305 310 315
<210>325
<211>1077
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>325
atgactgcaa aaaagattgt ccgcgccgcc gctgtccagc tcaatcccgt gctggacagc 60
gccgacggca cccttgtgaa agtgttgcag gcgattgccg acgccgcagc gcagggcgtg 120
caactgatcg tctttcccga gacggtggtg ccctactatc cttacttttc cttcgtcacg 180
cctgcggttt cgatgggcgc ggcgcatctg aaactgtacg aacaatcgcc cacggtgcca 240
ggtccactga ccgacgccgt cgccgcagcc gcgcgggcac atcagatggt ggttgtgctc 300
ggcgtcaacg agcgcgatca cggcacgctc tacaacacgc aactgatctt cgacgccgac 360
ggcacgctcc cactgaagcg tcgcaagatc acgccgacct atcacgagcg catggtctgg 420
ggcatgggcg atggctccgg cctgcgcacg gtgaagaccg aggtcggaac cgttggcgcg 480
ctggcctgct gggaacacta caacccgctg gcacgctacg cgctgatggc gcagcacgaa 540
gagatccatt gcagccagtt ccccggctcg ctggtcggcc cgatcttttc cgagcagatg 600
gaaatcacca tgcgtcatca cgcgctggaa tccggctgct tcgtcgtcaa cgccactgca 660
tggctgacgc ctgagcaggt gcgatcacag gcgccaacac cggcaatgga aaaagccttc 720
tccggtggtt gctacaccgc gatcatttcg ccggaaggaa aacatctggg cgaacctctt 780
cgcgacggcg aaggcatggt catcgccgat cttgattttg atctcatcac caagcgcaag 840
cgaatgatgg attcggttgg ccactacgca cggccggaat tgttgagcct gcagctcgac 900
aaccgatcaa ctgcaccgct gacaacgtcg ccggtggccg ccgcagcgcc gtcgcttgca 960
gagatggaag cacagcgcct gtcacgttat ctcgatgcca gctccggcag cgccgcacaa 1020
ggcatcgaag ccgcctacat caatgccctc agctcttttt caggaaaacc ctcatga 1077
<210>326
<211>358
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>326
Met Thr Ala Lys Lys Ile Val Arg Ala Ala Ala Val Gln Leu Asn Pro
1 5 10 15
Val Leu Asp Ser Ala Asp Gly Thr Leu Val Lys Val Leu Gln Ala Ile
20 25 30
Ala Asp Ala Ala Ala Gln Gly Val Gln Leu Ile Val Phe Pro Glu Thr
35 40 45
Val Val Pro Tyr Tyr Pro Tyr Phe Ser Phe Val Thr Pro Ala Val Ser
50 55 60
Met Gly Ala Ala His Leu Lys Leu Tyr Glu Gln Ser Pro Thr Val Pro
65 70 75 80
Gly Pro Leu Thr Asp Ala Val Ala Ala Ala Ala Arg Ala His Gln Met
85 90 95
Val Val Val Leu Gly Val Asn Glu Arg Asp His Gly Thr Leu Tyr Asn
100 105 110
Thr Gln Leu Ile Phe Asp Ala Asp Gly Thr Leu Pro Leu Lys Arg Arg
115 120 125
Lys Ile Thr Pro Thr Tyr His Glu Arg Met Val Trp Gly Met Gly Asp
130 135 140
Gly Ser Gly Leu Arg Thr Val Lys Thr Glu Val Gly Thr Val Gly Ala
145 150 155 160
Leu Ala Cys Trp Glu His Tyr Asn Pro Leu Ala Arg Tyr Ala Leu Met
165 170 175
Ala Gln His Glu Glu Ile His Cys Ser Gln Phe Pro Gly Ser Leu Val
180 185 190
Gly Pro Ile Phe Ser Glu Gln Met Glu Ile Thr Met Arg His His Ala
195 200 205
Leu Glu Ser Gly Cys Phe Val Val Asn Ala Thr Ala Trp Leu Thr Pro
210 215 220
Glu Gln Val Arg Ser Gln Ala Pro Thr Pro Ala Met Glu Lys Ala Phe
225 230 235 240
Ser Gly Gly Cys Tyr Thr Ala Ile Ile Ser Pro Glu Gly Lys His Leu
245 250 255
Gly Glu Pro Leu Arg Asp Gly Glu Gly Met Val Ile Ala Asp Leu Asp
260 265 270
Phe Asp Leu Ile Thr Lys Arg Lys Arg Met Met Asp Ser Val Gly His
275 280 285
Tyr Ala Arg Pro Glu Leu Leu Ser Leu Gln Leu Asp Asn Arg Ser Thr
290 295 300
Ala Pro Leu Thr Thr Ser Pro Val Ala Ala Ala Ala Pro Ser Leu Ala
305 310 315 320
Glu Met Glu Ala Gln Arg Leu Ser Arg Tyr Leu Asp Ala Ser Ser Gly
325 330 335
Ser Ala Ala Gln Gly Ile Glu Ala Ala Tyr Ile Asn Ala Leu Ser Ser
340 345 350
Phe Ser Gly Lys Pro Ser
355
<210>327
<211>975
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>327
atggttgatc aaatacttaa cgatcgttct gaattattaa cagttggttt ggcgcagatt 60
gctccaatct ggctgaaccg cgaaaaaaca cttgcgaaag tggttgagaa agtaaatcaa 120
gctgcaaagc aagattgtca tcttgttgct tttggtgaag ctttggtccc cggatatccc 180
ttttggattg aacttacaga tggcgcgcga ttcaactcca atgttcaaaa agaaattcac 240
gcgcactata tggatcaggc agtacagata gagaatggtc atttaaaagc gctttgtgaa 300
acatcggccg caaacaagat tgccgtgatc gttgggtgca ttgaacgcgc agccgatcgc 360
ggcgggcaca gcttatatgc ttcattggtt tttattaatc ctcaagggca aatcggatcg 420
gtgcatcgca aacttatgcc aacttatgaa gaacgtttaa cctggtcacc tggcgatgga 480
catggtttgc gaacgcatca actaggtgct ttcactgttg gcggcttgaa ttgttgggaa 540
aactggatgc cattgccacg cgccgcattg tacgcacaag gtgaagactt tcatgtcgca 600
atctggccgg gaagcattca caatacgcaa gatattacgc gctatattgc gaaggaatcc 660
agatcgtttg taatgtctac ttccgggttc atgcgtaaag aagactttcc atctgatacg 720
cctcacttag acaaaatact tgcaaactct ccgaatgtac ttgcaaatgg tggatcctgt 780
ctcgccggac cggacggtca gtggatcgtt gaaccatttg tgaatgagga aaaattagtt 840
gttgcgactg tgaaccacaa gcaggtccgt gaagaaagac aaaattttga tccggtcgga 900
cactattccc gtcctgatgt cacgcagttg attgtgaaca gacaaagaca atcaacaatc 960
aagctcaacg attaa 975
<210>328
<211>324
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>328
Met Val Asp Gln Ile Leu Asn Asp Arg Ser Glu Leu Leu Thr Val Gly
1 5 10 15
Leu Ala Gln Ile Ala Pro Ile Trp Leu Asn Arg Glu Lys Thr Leu Ala
20 25 30
Lys Val Val Glu Lys Val Asn Gln Ala Ala Lys Gln Asp Cys His Leu
35 40 45
Val Ala Phe Gly Glu Ala Leu Val Pro Gly Tyr Pro Phe Trp Ile Glu
50 55 60
Leu Thr Asp Gly Ala Arg Phe Asn Ser Asn Val Gln Lys Glu Ile His
65 70 75 80
Ala His Tyr Met Asp Gln Ala Val Gln Ile Glu Asn Gly His Leu Lys
85 90 95
Ala Leu Cys Glu Thr Ser Ala Ala Asn Lys Ile Ala Val Ile Val Gly
100 105 110
Cys Ile Glu Arg Ala Ala Asp Arg Gly Gly His Ser Leu Tyr Ala Ser
115 120 125
Leu Val Phe Ile Asn Pro Gln Gly Gln Ile Gly Ser Val His Arg Lys
130 135 140
Leu Met Pro Thr Tyr Glu Glu Arg Leu Thr Trp Ser Pro Gly Asp Gly
145 150 155 160
His Gly Leu Arg Thr His Gln Leu Gly Ala Phe Thr Val Gly Gly Leu
165 170 175
Asn Cys Trp Glu Asn Trp Met Pro Leu Pro Arg Ala Ala Leu Tyr Ala
180 185 190
Gln Gly Glu Asp Phe His Val Ala Ile Trp Pro Gly Ser Ile His Asn
195 200 205
Thr Gln Asp Ile Thr Arg Tyr Ile Ala Lys Glu Ser Arg Ser Phe Val
210 215 220
Met Ser Thr Ser Gly Phe Met Arg Lys Glu Asp Phe Pro Ser Asp Thr
225 230 235 240
Pro His Leu Asp Lys Ile Leu Ala Asn Ser Pro Asn Val Leu Ala Asn
245 250 255
Gly Gly Ser Cys Leu Ala Gly Pro Asp Gly Gln Trp Ile Val Glu Pro
260 265 270
Phe Val Asn Glu Glu Lys Leu Val Val Ala Thr Val Asn His Lys Gln
275 280 285
Val Arg Glu Glu Arg Gln Asn Phe Asp Pro Val Gly His Tyr Ser Arg
290 295 300
Pro Asp Val Thr Gln Leu Ile Val Asn Arg Gln Arg Gln Ser Thr Ile
305 310 315 320
Lys Leu Asn Asp
<210>329
<211>1023
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>329
gtgccggcca aagtacgcgc cgccgccgtg cagctcagcc ccgtgctctt cagccgtgaa 60
ggcacgacca gcaaggtctg cgacaagatt gccgaggcgg cggcgcaggg cgccgagctg 120
gtggtgtttc ccgaaaccgt agtgccgtat tacccgtatt tctcgttcat caaggctccg 180
gccgtgatcg gcgccgagca cttactcttg ctcgaacaag ccgtcacggt gcccgggccc 240
agcgtcgaag ccatcgcaga agccgctcgc aaggcgggcg cggtggtttc gatcggcgtc 300
aacgaacgcg atcacggcac gctgtacaac acccagctgt tgttcgacgc cgacggtcgg 360
ttggcgcaag cccgccgcaa gatcaccccc acgtatcacg agcggatgat ctgggggcag 420
ggcgatggct cgggcttggt ggcggtggat acgcgagtcg gcaggattgg ctccctggcg 480
tgctgggagc actacaaccc gctggctcgc tatgcgctga tggccgacca cgagcaaatt 540
cacgtggcca tgttccctgg ctcgctcgtg ggcgacatct tccgcgagca aatcgaggtc 600
acgattcggc accacgcgct cgagtcgggc tgcttcgtcg tcaacgcgac gggctacctg 660
agcgacgcgc aggtgacgca gatcgcgggc gacaccaagc tcgaccgcgc cctgcgcggc 720
ggttgcttca ccgccatcgt atcgcccgag ggcacgctgc tggcgccacc gctcaccgac 780
ggtgagggca tggtcattgc cgacctcgat ctgtcgctca tcgccaaacg caaacgcatg 840
atggacagcg tcggccatta cagccggccc gagctgctca gcgtgctgat cgaccgctcg 900
ccgcagccgc atttgcgcga gaaaactgcc tctttacccg aacctcggat gtcccatgaa 960
tcgctcgctc ccgatagtaa cggcctccgc gacgctgacg cgaaagcctc gaccctctcc 1020
tga 1023
<210>330
<211>340
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>330
Val Pro Ala Lys Val Arg Ala Ala Ala Val Gln Leu Ser Pro Val Leu
1 5 10 15
Phe Ser Arg Glu Gly Thr Thr Ser Lys Val Cys Asp Lys Ile Ala Glu
20 25 30
Ala Ala Ala Gln Gly Ala Glu Leu Val Val Phe Pro Glu Thr Val Val
35 40 45
Pro Tyr Tyr Pro Tyr Phe Ser Phe Ile Lys Ala Pro Ala Val Ile Gly
50 55 60
Ala Glu His Leu Leu Leu Leu Glu Gln Ala Val Thr Val Pro Gly Pro
65 70 75 80
Ser Val Glu Ala Ile Ala Glu Ala Ala Arg Lys Ala Gly Ala Val Val
85 90 95
Ser Ile Gly Val Asn Glu Arg Asp His Gly Thr Leu Tyr Asn Thr Gln
100 105 110
Leu Leu Phe Asp Ala Asp Gly Arg Leu Ala Gln Ala Arg Arg Lys Ile
115 120 125
Thr Pro Thr Tyr His Glu Arg Met Ile Trp Gly Gln Gly Asp Gly Ser
130 135 140
Gly Leu Val Ala Val Asp Thr Arg Val Gly Arg Ile Gly Ser Leu Ala
145 150 155 160
Cys Trp Glu His Tyr Asn Pro Leu Ala Arg Tyr Ala Leu Met Ala Asp
165 170 175
His Glu Gln Ile His Val Ala Met Phe Pro Gly Ser Leu Val Gly Asp
180 185 190
Ile Phe Arg Glu Gln Ile Glu Val Thr Ile Arg His His Ala Leu Glu
195 200 205
Ser Gly Cys Phe Val Val Asn Ala Thr Gly Tyr Leu Ser Asp Ala Gln
210 215 220
Val Thr Gln Ile Ala Gly Asp Thr Lys Leu Asp Arg Ala Leu Arg Gly
225 230 235 240
Gly Cys Phe Thr Ala Ile Val Ser Pro Glu Gly Thr Leu Leu Ala Pro
245 250 255
Pro Leu Thr Asp Gly Glu Gly Met Val Ile Ala Asp Leu Asp Leu Ser
260 265 270
Leu Ile Ala Lys Arg Lys Arg Met Met Asp Ser Val Gly His Tyr Ser
275 280 285
Arg Pro Glu Leu Leu Ser Val Leu Ile Asp Arg Ser Pro Gln Pro His
290 295 300
Leu Arg Glu Lys Thr Ala Ser Leu Pro Glu Pro Arg Met Ser His Glu
305 310 315 320
Ser Leu Ala Pro Asp Ser Asn Gly Leu Arg Asp Ala Asp Ala Lys Ala
325 330 335
Ser Thr Leu Ser
340
<210>331
<211>1041
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>331
atgtccggaa caggacctat cggaccggtc gtgaaggtcg ccgtcgtaca agccgcgcca 60
tgtctgctcg atcttgatgc gggcgtcgag aaggcgatcg ccttgatcga ccaagcgggc 120
aaggctggcg cgcggctcat caattttccc gaaatttggc tgccgggtta tccttggtgg 180
atctggctga atccacccgc catcaacatg cagtatgtcg cgccttacat gaacaactcc 240
attgtcgcag gcagcaagca tgaccacgca cttcgggccg ccgcgcgccg caacaatatt 300
cacgtcgtga tcggtgtctc cgagcgcgcc ggcggcagtc tgtacatggc tcagtggcac 360
tatgggcccg agggcgaggt gatctcgcgt cgtcgtaagc taaaacccac ccatgtcgaa 420
cgcagcgtct ttggcgaagg cgacggcagc gacatgatcg ttactcagac cgatttcggg 480
cgcgtcggcg cgctatgctg ttgggaacac ctgcaaccgc tgtcgaaata cgcgctcttt 540
tcccaggacg agcagattca ttgcgcggcg tggcccgctt tcagcctcta tgcaaaactc 600
tcgaaggcct tcagccccga agtcagcgtc aacgtgaacc aaatctacgc cgtagaaggg 660
caatgcttcg tcctgtcgtc gtgctcggtt atcgatcagg cgatctacga cacgttggtg 720
cagaacgaat tgcaccagaa gttcctcgag gtgggcgggg gctacagccg aatcttcggg 780
ccgaacggtg cggaattcgg tgagaatctc ccacccgata gggaaggcct ggtggttgcc 840
gatatcgatc tcggcctgat ctcgcactcc aagagtgccg ctgatccggc tggtcactac 900
gcgagggccg acgctctggc gctcatgcat aaccgtaatc cccgccgtcc ggttatcggt 960
ttcggcgagg cgacgcgcaa ggttgccgac gcactgccta aaggcgcgga acccgcggaa 1020
gcgctcgaag cggccgagtg a 1041
<210>332
<211>346
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>332
Met Ser Gly Thr Gly Pro Ile Gly Pro Val Val Lys Val Ala Val Val
1 5 10 15
Gln Ala Ala Pro Cys Leu Leu Asp Leu Asp Ala Gly Val Glu Lys Ala
20 25 30
Ile Ala Leu Ile Asp Gln Ala Gly Lys Ala Gly Ala Arg Leu Ile Asn
35 40 45
Phe Pro Glu Ile Trp Leu Pro Gly Tyr Pro Trp Trp Ile Trp Leu Asn
50 55 60
Pro Pro Ala Ile Asn Met Gln Tyr Val Ala Pro Tyr Met Asn Asn Ser
65 70 75 80
Ile Val Ala Gly Ser Lys His Asp His Ala Leu Arg Ala Ala Ala Arg
85 90 95
Arg Asn Asn Ile His Val Val Ile Gly Val Ser Glu Arg Ala Gly Gly
100 105 110
Ser Leu Tyr Met Ala Gln Trp His Tyr Gly Pro Glu Gly Glu Val Ile
115 120 125
Ser Arg Arg Arg Lys Leu Lys Pro Thr His Val Glu Arg Ser Val Phe
130 135 140
Gly Glu Gly Asp Gly Ser Asp Met Ile Val Thr Gln Thr Asp Phe Gly
145 150 155 160
Arg Val Gly Ala Leu Cys Cys Trp Glu His Leu Gln Pro Leu Ser Lys
165 170 175
Tyr Ala Leu Phe Ser Gln Asp Glu Gln Ile His Cys Ala Ala Trp Pro
180 185 190
Ala Phe Ser Leu Tyr Ala Lys Leu Ser Lys Ala Phe Ser Pro Glu Val
195 200 205
Ser Val Asn Val Asn Gln Ile Tyr Ala Val Glu Gly Gln Cys Phe Val
210 215 220
Leu Ser Ser Cys Ser Val Ile Asp Gln Ala Ile Tyr Asp Thr Leu Val
225 230 235 240
Gln Asn Glu Leu His Gln Lys Phe Leu Glu Val Gly Gly Gly Tyr Ser
245 250 255
Arg Ile Phe Gly Pro Asn Gly Ala Glu Phe Gly Glu Asn Leu Pro Pro
260 265 270
Asp Arg Glu Gly Leu Val Val Ala Asp Ile Asp Leu Gly Leu Ile Ser
275 280 285
His Ser Lys Ser Ala Ala Asp Pro Ala Gly His Tyr Ala Arg Ala Asp
290 295 300
Ala Leu Ala Leu Met His Asn Arg Asn Pro Arg Arg Pro Val Ile Gly
305 310 315 320
Phe Gly Glu Ala Thr Arg Lys Val Ala Asp Ala Leu Pro Lys Gly Ala
325 330 335
Glu Pro Ala Glu Ala Leu Glu Ala Ala Glu
340 345
<210>333
<211>1038
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>333
atggccattg aacatccccg ctatcgggtc gctgccgttc aggcggcgcc agagttcctt 60
aatctcgagg caaccgttga caagacgatt gcgcttatcg aggaggccgc ccgcggaggc 120
gcaagtctca ttgctttccc tgaaacttgg atacctggat atccatggtt tgcctggctt 180
ggtgcgccga tctggggcat gaaattcatc caggcgtacc atgacaactc gatggtgatc 240
gacggcgccc agtttgagcg tattgcgcaa gcagcatcgc gttgcaacat cactgttgtg 300
cttggcttta gcgagaagga cgcaggaagc ctgtatattg cccaggccat cctgagccct 360
gaagggaaga ccatcgccac gcgtcgcaag ctgaaaccca ctcatgtcga acgcgcgatc 420
ttcggcgaag gcgacggcag cgacctggca gttcacgaca ccaagctcgg cagggtgggc 480
gccctttgct gctgggagca tcttcagcca ctttccaaat acgcgatgta tgctcagaac 540
gagcaggtcc atatcgccgc ctggcccagt ttttcccttt acgtcgacgc ggcttacgcg 600
cttgggccag aggtgaacaa cgccgcgagt cggttgtatg cggtcgaggg ccagtgcttc 660
gtggttgcgc cgtgtgcaac ggtttcgcaa aagatgatcg atatgctttg cgagacaccc 720
gagcaacaag cgctcttgaa gccggggggt ggtcacgcgc aaatctacgg tcccgacgga 780
cgatcgctgg ccgatccgct gcctccggac gcggaggggc tgttgtatgc agacattgac 840
cttgcagcca tcaccctcgc gaaagcagct gcagatcctg ctggccatta ctctcgccct 900
gacgtgacac aactgctgct tgaccgcaat ccaaagcccc gtgtcgtgca tgctaagccc 960
ggccaaagcg ccaacaacag ctcacccggc atgcgggccg tcgagcatac cgagctcgaa 1020
gaaggtgaac aggcctga 1038
<210>334
<211>345
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>334
Met Ala Ile Glu His Pro Arg Tyr Arg Val Ala Ala Val Gln Ala Ala
1 5 10 15
Pro Glu Phe Leu Asn Leu Glu Ala Thr Val Asp Lys Thr Ile Ala Leu
20 25 30
Ile Glu Glu Ala Ala Arg Gly Gly Ala Ser Leu Ile Ala Phe Pro GIu
35 40 45
Thr Trp Ile Pro Gly Tyr Pro Trp Phe Ala Trp Leu Gly Ala Pro Ile
50 55 60
Trp Gly Met Lys Phe Ile Gln Ala Tyr His Asp Asn Ser Met Val Ile
65 70 75 80
Asp Gly Ala Gln Phe Glu Arg Ile Ala Gln Ala Ala Ser Arg Cys Asn
85 90 95
Ile Thr Val Val Leu Gly Phe Ser Glu Lys Asp Ala Gly Ser Leu Tyr
100 105 110
Ile Ala Gln Ala Ile Leu Ser Pro Glu Gly Lys Thr Ile Ala Thr Arg
115 120 125
Arg Lys Leu Lys Pro Thr His Val Glu Arg Ala Ile Phe Gly Glu GIy
130 135 140
Asp Gly Ser Asp Leu Ala Val His Asp Thr Lys Leu Gly Arg Val Gly
145 150 155 160
Ala Leu Cys Cys Trp Glu His Leu Gln Pro Leu Ser Lys Tyr Ala Met
165 170 175
Tyr Ala Gln Asn Glu Gln Val His Ile Ala Ala Trp Pro Ser Phe Ser
180 185 190
Leu Tyr Val Asp Ala Ala Tyr Ala Leu Gly Pro Glu Val Asn Asn Ala
195 200 205
Ala Ser Arg Leu Tyr Ala Val Glu Gly Gln Cys Phe Val Val Ala Pro
210 215 220
Cys Ala Thr Val Ser Gln Lys Met Ile Asp Met Leu Cys Glu Thr Pro
225 230 235 240
Glu Gln Gln Ala Leu Leu Lys Pro Gly Gly Gly His Ala Gln Ile Tyr
245 250 255
Gly Pro Asp Gly Arg Ser Leu Ala Asp Pro Leu Pro Pro Asp Ala Glu
260 265 270
Gly Leu Leu Tyr Ala Asp Ile Asp Leu Ala Ala Ile Thr Leu Ala Lys
275 280 285
Ala Ala Ala Asp Pro Ala Gly His Tyr Ser Arg Pro Asp Val Thr Gln
290 295 300
Leu Leu Leu Asp Arg Asn Pro Lys Pro Arg Val Val His Ala Lys Pro
305 310 315 320
Gly Gln Ser Ala Asn Asn Ser Ser Pro Gly Met Arg Ala Val Glu His
325 330 335
Thr Glu Leu Glu Glu Gly Glu Gln Ala
340 345
<210>335
<211>1053
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>335
atgactacgc gcgtgattaa agtcgctgcc gcgcagctgt ctccggtgct ggcaactgca 60
tcggagcaca gccgcgagga cacgattgcg aaagtgatcg acgccatcgc tgcggcgtcg 120
caacagggcg cgcaactgat tgtgtttccg gaaacagtcg ttccgtatta cccgtatttt 180
tcatttatca cgcccgcggt aacgatgggt gccgaacatt tgaaattgta cgaacaggca 240
gtgacggtac ccagcgcagc gacagatgct gtcgctgcgg cggcaaaaaa ttatggcatg 300
gtagtggtgc tcggaattaa tgaacgcgat cacggctcgc tgtacaacgc gcaattaatt 360
ttcgacgccg atggtgagct gctattaaag cgtcgaaaaa ttacgccgac ttatcacgaa 420
cgcatggtgt ggggacaagg cgacggcagc ggtttaaaag ttgtcgatac tgctgccggc 480
cgtgtcggcg cgctcgcgtg ctgggaacat tacaacccgc tcgcgcgtta cagcctgatg 540
gcacaacacg aagaaattca ttgcagtcaa tttccgggat cgttggtcgg ccctattttc 600
gccgaacaaa tggaaatcac catgcgtcat cacgcactcg aatccggctg ttttgtggtg 660
aatgcaacgg cctggcttag cgatacacaa atccaatcaa ttacccccga taaagccatg 720
cagaaagcac tgcgcggcgg ttgctacacc gcaatcatct cgcccgaagg caaacatctg 780
tgcccaccgc tgtatgacgg agaaggaata attgtggcgg aattggactt cgcgttaatc 840
accaaacgta aacgcatgat ggattccgtc ggccattacg cgcgaccaga actactttct 900
ttgctcctcg atgatcgcgt aactgcgccg ctcaaaaact tacagacgac gatggcctct 960
gccaaatccg ctgaagatgg ttttccttta tttgcagacg ttttatatcc agacagttct 1020
ttcattgaga cgtcgaaatt cgcggagtca tga 1053
<210>336
<211>350
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>336
Met Thr Thr Arg Val Ile Lys Val Ala Ala Ala Gln Leu Ser Pro Val
1 5 10 15
Leu Ala Thr Ala Ser Glu His Ser Arg Glu Asp Thr Ile Ala Lys Val
20 25 30
Ile Asp Ala Ile Ala Ala Ala Ser Gln Gln Gly Ala Gln Leu Ile Val
35 40 45
Phe Pro Glu Thr Val Val Pro Tyr Tyr Pro Tyr Phe Ser Phe Ile Thr
50 55 60
Pro Ala Val Thr Met Gly Ala Glu His Leu Lys Leu Tyr Glu Gln Ala
65 70 75 80
Val Thr Val Pro Ser Ala Ala Thr Asp Ala Val Ala Ala Ala Ala Lys
85 90 95
Asn Tyr Gly Met Val Val Val Leu Gly Ile Asn Glu Arg Asp His Gly
100 105 110
Ser Leu Tyr Asn Ala Gln Leu Ile Phe Asp Ala Asp Gly Glu Leu Leu
115 120 125
Leu Lys Arg Arg Lys Ile Thr Pro Thr Tyr His Glu Arg Met Val Trp
130 135 140
Gly Gln Gly Asp Gly Ser Gly Leu Lys Val Val Asp Thr Ala Ala Gly
145 150 155 160
Arg Val Gly Ala Leu Ala Cys Trp Glu His Tyr Asn Pro Leu Ala Arg
165 170 175
Tyr Ser Leu Met Ala Gln His Glu Glu Ile His Cys Ser Gln Phe Pro
180 185 190
Gly Ser Leu Val Gly Pro Ile Phe Ala Glu Gln Met Glu Ile Thr Met
195 200 205
Arg His His Ala Leu Glu Ser Gly Cys Phe Val Val Asn Ala Thr Ala
210 215 220
Trp Leu Ser Asp Thr Gln Ile Gln Ser Ile Thr Pro Asp Lys Ala Met
225 230 235 240
Gln Lys Ala Leu Arg Gly Gly Cys Tyr Thr Ala Ile Ile Ser Pro Glu
245 250 255
Gly Lys His Leu Cys Pro Pro Leu Tyr Asp Gly Glu Gly Ile Ile Val
260 265 270
Ala Glu Leu Asp Phe Ala Leu Ile Thr Lys Arg Lys Arg Met Met Asp
275 280 285
Ser Val Gly His Tyr Ala Arg Pro Glu Leu Leu Ser Leu Leu Leu Asp
290 295 300
Asp Arg Val Thr Ala Pro Leu Lys Asn Leu Gln Thr Thr Met Ala Ser
305 310 315 320
Ala Lys Ser Ala Glu Asp Gly Phe Pro Leu Phe Ala Asp Val Leu Tyr
325 330 335
Pro Asp Ser Ser Phe Ile Glu Thr Ser Lys Phe Ala Glu Ser
340 345 350
<210>337
<211>957
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>337
atgaagggag tgtgtgccat gtcgacgaaa gttgccatcg tccaggcgcc gccggtcctg 60
ctgcatcgcg acaggacgat tgcgaaggtg cgttcgtcga tcgaggatgc cgccaatgcg 120
ggcgcctcgc tgatcgtatt tcccgaagct tatgtacccg gctatccgag ttggatctgg 180
cgtctcaggc ccggaggcga catgggactg tcgtctgaga ttcacgcaag attgcgggaa 240
aatgccgttg atctcgcgaa cggaggcctg gcgcatgtcc agggggctgc agcaaaattc 300
ggcgcgactg tcgttatcgg catcaatgaa ctcgacagcg agttcagcgg aacgacattg 360
ttcaacaccg tggtggtcat cggccccgac ggaacgcgcc tcaacaggca tcgaaaatta 420
atgccgacca acccggagcg catggtgtgg ggcacgggcg atgcctcggg tctgcgtgtc 480
atcgatacgc cggcgggacg gctgggaacc atgatctgct gggagagcta catgccgctg 540
gcgcgctatg ctctctatgc gcaaggcatc gagatatacg tcgctcccac gtgggacgca 600
ggcgagagct ggattgctac gatgcgccac atcgccaagg aggccggctg ctgggtgatc 660
ggcacggcaa ccgtcatcca gggcagcgat gttccggacg attttcccga acgcgacaag 720
ctcttcaagc cggaggagtg gatcaacgac ggcgatgcgg tcgtggtcaa gcccatgggc 780
gcgattgctg ccggaccgca caatcgacag aaaagcatac tctacgccga catcgaccgg 840
gaggccgcgc ggcgagcccg ccggtcgctc gatgtctgtg gccactattc ccgcccagac 900
gttttctctt tctcggtcaa ccgaaagcca ttccgccctg ccgactttgt gggttga 957
<210>338
<211>313
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>338
Met Lys Gly Val Cys Ala Met Ser Thr Lys Val Ala Ile Val Gln Ala
1 5 10 15
Pro Pro Val Leu Leu His Arg Asp Gly Arg Leu Arg Arg Cys Val Arg
20 25 30
Arg Ser Arg Met Pro Pro Met Arg Ala Pro Arg Ser Tyr Phe Pro Ala
35 40 45
Tyr Val Pro Gly Tyr Pro Ser Trp Ile Trp Arg Leu Arg Pro Gly Gly
50 55 60
Asp Met Gly Leu Ser Ser Glu Ile Thr Gln Asp Cys Gly Lys Met Pro
65 70 75 80
Leu Ile Ser Arg Thr Glu Ala Trp Arg Met Ser Arg Gly Leu Gln Gln
85 90 95
Asn Ser Ala Arg Leu Ser Leu Ser Ala Ser Met Asn Ser Thr Ala Ser
100 105 110
Ser Ala Glu Arg His Cys Ser Thr Val Val Val Ile Gly Pro Asp Gly
115 120 125
Thr Arg Leu Asn Arg His Arg Lys Leu Met Pro Thr Asn Pro Glu His
130 135 140
Gly Val Gly His Gly Arg Cys Leu Gly Ser Ala Cys His Arg Tyr Ala
145 150 155 160
Gly Gly Thr Ala Gly Asn His Ile Cys Trp Glu Ser Tyr Met Pro Leu
165 170 175
Ala Arg Tyr Ala Leu Tyr Ala Gln Gly Ile Glu Ile Tyr Val Ala Pro
180 185 190
Thr Trp Asp Ala Gly Glu Ser Trp Ile Ala Thr Met Arg His Ile Ala
195 200 205
Lys Glu Ala Gly Cys Trp Val Ile Gly Thr Ala Thr Val Ile Gln Gly
210 215 220
Ser Asp Val Pro Asp Asp Phe Pro Glu Arg Asp Lys Leu Phe Lys Pro
225 230 235 240
Arg Ser Gly Ser Thr Thr Ala Met Arg Ser Trp Ser Ser Pro Trp Ala
245 250 255
Arg Leu Leu Pro Asp Arg Thr Ile Asp Arg Lys Ala Tyr Ser Thr Pro
260 265 270
Thr Ser Thr Gly Arg Pro Arg Gly Glu Pro Ala Gly Arg Ser Met Cys
275 280 285
Gly His Tyr Ser Arg Pro Asp Val Phe Ser Phe Ser Val Asn Arg Lys
290 295 300
Pro Phe Arg Pro Ala Asp Phe Val Gly
305 310
<210>339
<211>1020
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>339
atgtcaggat cattcaaagt tgcagcagtg caggcggctc cggccttcct caatctcgat 60
gcgggcatcg acaaggcggt ggcgctgatc gagcaagccg ccgcgcagga cgttcagctc 120
atcgcctttc ccgagacctg gttgcccgga tacccctggt ggatctggct cgatgcaccc 180
gccgttacca tgggctatgt cgttccctac aatctcaatt cactagaggc gggcagcccg 240
caggacaagc gtctggcaaa tgccgcgcgt gagaacaaca tccaggtggt gatgggtctg 300
tctgaacgcc atgacggcac gctctacatc gcgcagtggc actatggtga agatggcgag 360
gtgatctcgc gacggcgcaa gctcaagccg acccatgtcg aacgcacggt gttcggggaa 420
ggcgacggca gcgacatggt ggtcaaggac acaagtctgg gacgggtcgg cgctctgtgc 480
tgttgggaac acctgcagcc gctcaacaaa tacgcgatgt actcccagaa cgagcagatc 540
cacatcggtt cctggcccag cttcagcctc tacaagggcg gcgcctatgc gcttggggca 600
gacctcaaca cggcggcaag ccagatgtat gcggccgagg gccagtgctt tgttctggct 660
gcctgcgcca cggtcagtca ggacatgttc gacatgctct gcgacacgga aatgaagcag 720
cagttcctga ccaccggtgg cggattcgcc cgcattttcg gacctgacgg ctcgcccatg 780
ggcaatgtgc ttgaagaaca tgaagaaggg ctggtgatcg ccgaaatcga tctcaccatg 840
attgcgatcg ccaaggcggc ggctgacccg tgcgggcact attcccggcc cgatgtgttc 900
cgactgatgt tcaaccagaa gccaagcccg gtggtgatgc cattcgaaaa cgacgtggcc 960
cgcgagattg tcgaggccgc cgaggatcag gtcagcgcaa accggctcgc agcggaatga 1020
<210>340
<211>329
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>340
Met Ser Gly Ser Phe Lys Val Ala Ala Val Gln Ala Ala Pro Ala Phe
1 5 10 15
Leu Asn Leu Asp Ala Gly Ile Asp Gly Gly Gly Ala Asp Arg Ala Ser
20 25 30
Arg Arg Ala Gly Arg Ser Ala His Arg Leu Ser Arg Asp Leu Val Ala
35 40 45
Gly Tyr Pro Trp Trp Ile Trp Leu Asp Ala Pro Ala Val Thr Met Gly
50 55 60
Tyr Val Val Pro Tyr Asn Leu Asn Ser Arg Gly Gly Gln Pro Ala Gly
65 70 75 80
Gln Ala Ser Gly Lys Cys Arg Ala Glu Gln His Pro Gly Gly Asp Gly
85 90 95
Leu Ser Glu Arg His Asp Gly Thr Leu Tyr Ile Ala Gln Trp His Tyr
100 105 110
Gly Glu Asp Gly Glu Val Ile Ala Thr Ala Gln Ala Gln Ala Asp Pro
115 120 125
Cys Arg Thr His Gly Val Arg Gly Arg Arg Arg Gln Arg His Val Val
130 135 140
Lys Asp Thr Ser Leu Gly Arg Val Gly Ala Leu Cys Cys Trp Glu His
145 150 155 160
Leu Gln Pro Leu Asn Lys Thr Arg Cys Thr Pro Arg Thr Ser Arg Ser
165 170 175
Thr Ser Val Pro Gly Pro Ala Ser Ala Ser Thr Arg Ala Ala Leu Cys
180 185 190
Ala Trp Gly Arg Pro Gln His Gly Gly Lys Pro Asp Val Cys Gly Arg
195 200 205
Gly Pro Val Leu Cys Ser Leu Pro Ala Pro Arg Ser Val Arg Thr Cys
210 215 220
Ser Thr Cys Ser Ala Thr Arg Lys Ser Ser Ser Ser Thr Gly Gly Gly
225 230 235 240
Phe Ala Arg Ile Phe Gly Pro Asp Gly Ser Pro Met Gly Asn Val Leu
245 250 255
Glu Glu His Glu Arg Ala Gly Asp Arg Arg Asn Arg Ser His His Asp
260 265 270
Cys Asp Arg Gln Gly Gly Gly Pro Val Arg Ala Leu Phe Pro Ala Arg
275 280 285
Cys Val Pro Thr Asp Val Gln Pro Glu Ala Lys Pro Gly Gly Asp Ala
290 295 300
Ile Arg Lys Thr Trp Pro Ala Arg Leu Ser Arg Pro Pro Arg Ile Arg
305 310 315 320
Ser Ala Gln Thr Gly Ser Gln Arg Asn
325
<210>341
<211>1056
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>341
atggcactga ccaacccaaa atataaagtc gccgccgtcc aggccgcgcc agcgttcctc 60
gatctggatg cgtccgtcga aaaagcggtc cggctgatcg acgaggccgg cgccaagggc 120
gcgcgcctca ttgcgttccc ggaaacctgg atccccggct atccctggtg gatctggctc 180
ggcgcgccgg cctgggcgat catgaaaggt tttgtctcgg cctatttcga caattcgctc 240
acctatgaca gtccggccgc ggacaaattg cgccaggccg ccaagcgcaa cgatatcgtc 300
gtggtgctcg gcctgtcgga gcgcgacggc ggcagtctct atatcgcgca atggatcatc 360
ggcccggacg gcgaaactgt cgcccagcgc cgcaagctca agccgaccca tgtcgagcgt 420
tcggtgttcg gcgagggcga tggcagcgac cttgccgtgc atgagctcgc gatcgggcgc 480
gtcggcgcgc tgtgctgctg ggagcatctg caaccgctgt cgaaatacgc catgtatgcg 540
cagaacgagc aggttcacgt ggcggcctgg ccgagctttt cgctgtacga tccgttcgcc 600
catgccctcg gtgcggaggt caacaacgcc gccagcaaga tctacgcggt cgagggatcg 660
tgcttcgtcg tggcaccctg cgccaccgtt tcaaaggaaa tgatcgacct gttgtgcgac 720
acgccggaca agcacggtct gctgcacgcc ggcggcgggt ttgccgcgat ctacggccct 780
gacggctcgc cgatcggcga ccgcctggcg cccgaccagg aaggtctgat ctatgccgat 840
gtcgatctcg gcatgatctc ggtcgcgaag gccgccgccg atccggccgg acattatgcg 900
cggccggatg tcacaaggtt gctgctcaac aaacgcccgg gcaatcgtgt cgaggcgctg 960
gcacttccgg tggatcaggt tgcggcaggt gaggagatcc cctcgatatc gcgatcggcc 1020
agaggggttg ccgaactgcc aaacgcggcc gaatag 1056
<210>342
<211>342
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>342
Met Ala Leu Thr Asn Pro Lys Tyr Lys Val Ala Ala Val Gln Ala Ala
1 5 10 15
Pro Ala Phe Leu Asp Leu Asp Ala Pro Ser Lys Lys Arg Ser Gly Ser
20 25 30
Thr Arg Pro Ala Pro Arg Ala Arg Ala Ser Leu Arg Ser Arg Thr Trp
35 40 45
Ile Pro Gly Tyr Pro Trp Trp Ile Trp Leu Gly Ala Pro Ala Trp Ala
50 55 60
Ile Met Lys Gly Phe Val Ser Pro Ile Ser Thr Ile Arg Ser Pro Met
65 70 75 80
Thr Val Arg Pro Arg Thr Asn Cys Ala Arg Pro Pro Ser Ala Thr Tyr
85 90 95
Arg Arg Gly Ala Arg Pro Val Gly Ala Arg Arg Arg Gln Ser Leu Tyr
100 105 110
Arg Ala Met Asp His Arg Pro Thr Ala Lys Leu Ser Pro Ser Ala Ala
115 120 125
Ser Ser Ser Arg Pro Met Ser Ser Val Arg Cys Ser Ala Arg Ala Met
130 135 140
Ala Ala Thr Leu Pro Cys Met Ser Ser Arg Ser Gly Ala Ser Ala Arg
145 150 155 160
Cys Ala Ala Gly Ser Ile Cys Thr Ala Val Glu Ile Arg His Val Cys
165 170 175
Ala Glu Arg Ala Gly Ser Arg Gly Gly Leu Ala Glu Leu Phe Ala Thr
180 185 190
Ile Arg Ser Pro Met Pro Ser Val Arg Arg Ser Thr Thr Pro Pro Ala
195 200 205
Arg Ser Thr Arg Ser Arg Asp Arg Ala Ser Ser Trp His Pro Ala Pro
210 215 220
Pro Phe Gln Arg Lys Ser Thr Cys Cys Ala Thr Arg Arg Thr His Gly
225 230 235 240
Leu Leu His Ala Gly Gly Gly Phe Ala Ala Ile Tyr Gly Pro Asp Gly
245 250 255
Ser Pro Ile Gly Asp Arg Trp Arg Pro Thr Arg Lys Val Ser Met Pro
260 265 270
Met Ser Ile Ser Ala Ser Arg Ser Arg Arg Pro Pro Asp Pro Ala Gly
275 280 285
His Tyr Ala Arg Pro Asp Val Thr Arg Leu Leu Leu Asn Lys Arg Pro
290 295 300
Gly Asn Arg Val Arg Arg Trp His Phe Arg Trp Ile Arg Leu Arg Gln
305 310 315 320
Val Arg Arg Ser Pro Arg Tyr Arg Asp Arg Pro Glu Gly Cys Arg Thr
325 330 335
Ala Lys Arg Gly Arg Ile
340
<210>343
<211>942
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>343
atgaagtcaa aaattgcggt tattcagcga cctcctgtat tgttagacct tcaggcttca 60
atcgcccgtg ccattacttc tgttgaggaa gcggcaggca agggatctga gctccttgtt 120
tttcccgaga catttctgcc tggttatccg tcctggatct ggcgtctcaa gccgggcgga 180
gacatggtgc tgacatctga aatccacgca aaatatcgcg cgaactctgt tgatgttgag 240
cgcggggatc tggccccttt atgcgaagcg gcggcgaaac acggcgtcac aattgtcatg 300
gggctcagtg aaattgatgg gcgctacagc gggactacac tctttaatac agtggtgacc 360
attggcgcgg aaggagagct ccttaataga caccgcaagc tcatgccgac aaacccagag 420
cgtatggtct gggggcaagg ggatgcctct ggtctgcggg ttgtcgacac gcccgtgggc 480
cgcgtcggca cgctgatctg ctgggaaaac tacatgccgc tatcgcgcta tgcgctttat 540
tctcaaaaca ttgacatcta tgtggcgccg acctgggacg cgggcgagag ctggatcgcc 600
tccatgcagc atatcgccaa agaaggtggc tgctgggtga tcggcacggc cacggcgatg 660
gagggctctg atgtcccagc cgacttccct cagcgggagg tgcttttccc tgatagcagc 720
gaatggatca atgacggtga cgctgtagtg gttaaaccca tgggggcgat tgtcgcgggt 780
ccgcatcacc gggataagag tattctctat gctgagattg acgtcgaagt ggcacgcaat 840
gcgcggcgct cgctcgatgt ggcggggcat tactcccggc cggatatttt ttcctttggc 900
gtggatcgcc ggcctttgcc gccggttacg tttgaggatt ga 942
<210>344
<211>303
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>344
Met Lys Ser Lys Ile Ala Val Ile Gln Arg Pro Pro Val Leu Leu Asp
1 5 10 15
Leu Gln Ala Ser Ile Ala Arg Ala Ile Thr Leu Leu Arg Lys Arg Gln
20 25 30
Ala Arg Asp Leu Ser Ser Leu Phe Phe Pro Arg His Phe Cys Leu Val
35 40 45
Ile Arg Pro Gly Ser Gly Val Ser Ser Arg Ala Glu Thr Trp Cys His
50 55 60
Leu Lys Ser Thr Gln Asn Ile Ala Arg Thr Leu Asp Val Glu Arg Gly
65 70 75 80
Asp Leu Ala Pro Leu Cys Glu Ala Ala Ala Lys His Gly Val Thr Ile
85 90 95
Val Met Gly Gln Asn Trp Ala Leu Gln Arg Asp Tyr Thr Leu Tyr Ser
100 105 110
Gly Asp His Trp Arg Gly Arg Arg Leu Leu Asn Arg His Arg Lys Leu
115 120 125
Met Pro Thr Asn Pro Glu Arg Met Val Trp Gly Gln Gly Asp Ala Ser
130 135 140
Val Cys Gly Leu Ser Thr Arg Pro Trp Ala Ala Ser Ala Arg Ser Ala
145 150 155 160
Gly Lys Thr Thr Cys Arg Tyr Arg Tyr Ala Leu Tyr Ser Gln Asn Ile
165 170 175
Asp Ile Tyr Val Ala Pro Thr Trp Asp Ala Gly Glu Ser Trp Ile Ala
180 185 190
Ser Met Gln His Ile Ala Lys Glu Gly Gly Cys Trp Val Ile Gly Thr
195 200 205
Ala Thr Ala Met Glu Gly Ser Asp Ser Gln Pro Thr Ser Leu Ser Gly
210 215 220
Arg Cys Phe Ser Leu Ile Ala Ala Asn Gly Ser Met Thr Val Thr Leu
225 230 235 240
Trp Leu Asn Pro Trp Gly Arg Leu Ser Arg Val Arg Ile Thr Gly Ile
245 250 255
Arg Val Phe Ser Met Leu Arg Leu Thr Arg Ser Gly Thr Gln Cys Ala
260 265 270
Ala Leu Ala Arg Cys Gly Gly Ala Leu Leu Pro Ala Gly Tyr Phe Phe
275 280 285
Leu Trp Val Asp Arg Arg Pro Leu Pro Pro Val Thr Phe Glu Asp
290 295 300
<210>345
<211>1011
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>345
atgaaagcga tcaaggccgc tgccgtgcag gcagcaccgg tattcctgaa cctcgacgca 60
tcgatcacca aggcggaaac attcgtcgcc gaggccgccg cgaatggtgc caagctggtg 120
gcgtttccgg aaacctggct gccgggctat ccctggttca tctggctcgg tgcgcccgcc 180
gaaggcatgc agttcatccc gcgctatcac gaaaacagca tggagctccg ctcgcccgag 240
atgcgccgct tgcaggcgat cgcgcgcaag tatgaagtga cgctcgtcat gggctattcc 300
gagcgcgatg gtggcagccg ctacatgtcc caggtcatta ttggcgatca gggcgacatc 360
cttctcaatc gccgtaaatt gaagccaacc catgtcgagc ggacggtctt cggcgaaggc 420
gacggttcgg acctggtggt ggtcgaaacg gcattcggca ggctcggtgc gctcaattgc 480
tgggaacata tccagccgct cgtcaagatg tcgatgtatg cccagcatga ggaaatccat 540
gtcgcgggtt ggccgagctt ctgcgtctac cgcgatctcg cctatgccct gggaccggaa 600
gtcaacaatg ccgtcagtca ggtctatgcc gtggagggta gcgcctatgt tctggcaccc 660
tgtgcgatcg taagccagga gatgttcgac attctggccg acaagcctga aaaggccttt 720
ctcctcaatc cccgcacatc caagcccggc ggtggcttca cgcagatcta tgcgccggat 780
ggtcgaccgc tttgcgagcc gcttgccgac gatgtggaag gcatcctcta tgccgatctc 840
gatccggcaa cgatcgccgt cgcgaaggcg gccgccgatc ctgcggggca ctattcgcgg 900
ccggacgcac tctcgctggt gatcaatcgc gaaaagcgcg cggtgatggc tgaaatcaac 960
gcgccggcga cgccgacctt cacccccatc tccctggacg ctgcggagta g 1011
<210>346
<211>329
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>346
Met Lys Ala Ile Lys Ala Ala Ala Val Gln Ala Ala Pro Val Phe Leu
1 5 10 15
Asn Leu Asp Ala Ser Ile Thr Lys Ala Glu Thr Phe Val Ala Glu Ala
20 25 30
Ala Ala Asn Gly Ala Lys Leu Val Ala Phe Pro Glu Thr Trp Leu Pro
35 40 45
Gly Tyr Pro Trp Phe Ile Trp Leu Gly Ala Pro Ala Glu Gly Met Gln
50 55 60
Phe Ile Pro Arg Tyr His Glu Asn His Gly Ala Pro Leu Ala Arg Asp
65 70 75 80
Ala Pro Leu Ala Gly Asp Arg Ala Gln Val Ser Asp Ala Arg His Gly
85 90 95
Tyr Ser Glu Arg Asp Gly Gly Ser Arg Tyr Met Ser Gln Val Ile Ile
100 105 110
Gly Asp Gln Gly Asp Ile Leu Leu Asn Arg Arg Lys Leu Lys Pro Thr
115 120 125
His Val Glu Arg Thr Val Phe Gly Glu Gly Asp Gly Ser Asp Leu Val
130 135 140
Trp Ser Lys Arg His Ser Ala Gly Ser Val Arg Ser Ile Ala Gly Asn
145 150 155 160
Ile Ser Ser Arg Ser Ser Arg Cys Met Tyr Ala Gln His Glu Glu Ile
165 170 175
His Val Ala Gly Trp Pro Ser Phe Cys Val Tyr Arg Asp Leu Ala Tyr
180 185 190
Ala Trp Asp Arg Lys Ser Thr Met Pro Ser Val Arg Ser Met Pro Trp
195 200 205
Arg Val Ala Pro Met Phe Trp His Pro Val Arg Ser Ala Arg Arg Cys
210 215 220
Ser Thr Phe Trp Pro Thr Ser Leu Lys Arg Pro Phe Ser Ser Ile Pro
225 230 235 240
Ala His Pro Ser Pro Ala Val Ala Ser Arg Arg Ser Met Arg Arg Met
245 250 255
Val Asp Arg Phe Ala Ser Arg Leu Pro Thr Val Glu Gly Ile Leu Tyr
260 265 270
Ala Asp Leu Asp Pro Ala Thr Ile Ala Val Ala Lys Ala Ala Ala Asp
275 280 285
Pro Ala Gly Thr Ile Arg Gly Arg Thr His Ser Arg Trp Ser Ile Ala
290 295 300
Lys Ser Ala Arg Trp Leu Lys Ser Arg Ala Gly Asp Ala Asp Leu His
305 310 315 320
Pro His Leu Pro Gly Arg Cys Gly Val
325
<210>347
<211>909
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>347
atgaaagtcg cggcgattca agcggcgccg gtctacttgg accggcaagc cacgctggaa 60
aaggcgcttt ctctaatgga tgaagcggcc gcaaacggcg cccaagtctg cgccttccct 120
gagaccttcc tcgcgggcta tcccgtctgg atggacttga cggacggcgc caagtggaat 180
gacgacaaac aaaaagcggc ctacgcatgt tatgtagatg cggccgtcga agccgacgga 240
cctgagttgc aagccattgc caagaaatcg aaggcgctcg gcctgttcac ctatctgggc 300
atggtcgaac gcgcggcgtc ggccgggtca gtatattgtt ccctcgctgc cttcgatccc 360
gacaagggtc tcgtcagcct acaccgaaaa ctcatgccca cctacaccga gcgcctcgtg 420
tggagccaag gcgacggaca cggactccag gtgcatgaat tcgccggctt caaaatcggc 480
gcgctaaact gctgggaaaa ttggatgccc ctcgcccgtt acgcaatgta cgcccagggc 540
gaacagctcc acgtcgcgac ctggcccggg tccccctggc tcaccaagga catcacccgt 600
ttcatcgccc ttgagggacg catctacgtc atgtccgttg gcggcgtcct tagcgcaaac 660
gatatccccg actccttccc gcttaaaacc gacctcctca agatccgaga ccgctacctc 720
agcggcggca ccatgatagt cgccccagac ggcaccaccc tcgaaggccc cgccaaaaac 780
gaagagacca tactctacgc ggagctcgac ctcaacaccg tcctccaaga acgccaaaat 840
ttcgatcccg cgggacacta cgcgcgccca gacgtcttcc aattggaaat cgacaaaaat 900
cgacgatag 909
<210>348
<211>297
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>348
Met Lys Val Ala Ala Ile Gln Ala Ala Pro Val Tyr Leu Asp Arg Gln
1 5 10 15
Ala Thr Leu Glu Lys Ala Leu Ser Trp Met Lys Arg Pro Gln Thr Ala
20 25 30
Pro Lys Ser Ala Pro Ser Leu Arg Pro Ser Ser Arg Ala Ile Pro Trp
35 40 45
Met Asp Leu Thr Asp Gly Ala Lys Trp Asn Asp Asp Lys Gln Lys Ala
50 55 60
Ala Tyr Ala Cys Tyr Val Asp Ala Ala Val Glu Ala Asp Gly Pro Glu
65 70 75 80
Leu Gln Ala Ile Ala Lys Lys Ser Lys Ala Leu Gly Leu Phe Thr Ile
85 90 95
Trp Ala Trp Ser Asn Ala Arg Arg Arg Pro Gly Gln Tyr Ile Val Pro
100 105 110
Ser Leu Pro Ser Ile Pro Thr Arg Val Ser Ser Ala Tyr Thr Glu Asn
115 120 125
Ser Cys Pro Pro Thr Pro Ser Ala Ser Cys Gly Ala Lys Ala Thr Asp
130 135 140
Thr Asp Ser Arg Cys Met Asn Ser Pro Ala Ser Lys Ser Ala Arg Thr
145 150 155 160
Ala Gly Lys Ile Gly Cys Pro Ala Arg Tyr Ala Met Tyr Ala Gln Gly
165 170 175
Glu Gln Leu His Val Ala Thr Trp Pro Gly Ser Pro Trp Leu Thr Lys
180 185 190
Asp Ile Thr Arg Phe Ile Ala Leu Glu Gly Arg Ile Tyr Val Met Ser
195 200 205
Val Gly Gly Val Leu Ser Ala Asn Asp Ile Pro Asp Ser Phe Pro Leu
210 215 220
Lys Thr Asp Leu Leu Lys Ile Arg Asp Arg Tyr Leu Ser Gly Gly Thr
225 230 235 240
Asp Ser Arg Pro Arg Arg His His Pro Arg Arg Pro Arg Gln Lys Arg
245 250 255
Arg Asp His Thr Leu Arg Gly Leu Asp Leu Asn Thr Val Leu Gln Glu
260 265 270
Arg Gln Asn Phe Asp Pro Ala Gly His Tyr Ala Arg Pro Asp Val Ser
275 280 285
Asn Trp Lys Ser Thr Lys Ile Asp Asp
290 295
<210>349
<211>1002
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>349
atgacgtccc ctgttcaaac caagtacaaa gtcgcctgtg ttcaggcggc gcccgagttt 60
ctcgatctcg acaaaggcgt tgccaaagcg gtgcgcctga tcgaagaagc cgccacccaa 120
aaggcctcgc tgatcgcgtt tcccgaagtc tggctccccg gctatccgtg gtggatctgg 180
ctcgactcgc cggcctgggg cttgcagttc gtccagcgct acttcgaaaa cgctctggtc 240
gtcggcagcc cccaatggga gcgcttgtgc aaggccgccg ccgacaacaa tatccatgtt 300
gtgctcggat tctccgaacg ggacggcagc acgctgtacc tcgcacaggc catcatcgat 360
aacaccggga aggtgatcgc cacgcggcgc aaactcaagc caacacacgc cgaacgcacg 420
gttttcggcg aaggcgacgg gagccacatc gcggtgcatg aaaccacttt gggccgcatg 480
ggtgcactct gctgcgccga gcacatccag ccactgacca agtacgccat gtactcgcag 540
cacgagcaga ttcacattgc cgcatggccc agcttctcgg tctaccgcgg agcagcgttc 600
cagctgagcg ccgaagccaa caacgccgcg agccaggtct atgccctgga gggcagttgc 660
tacgtggtgg ccccttgcgc gacggtgtcc aaggagatgt tggacatgct ggctgattcg 720
ccgcaaaaga agcagctcct gctggaaggc ggtggctacg ccatgattta tgggcccgac 780
gccaagcccc tgtgcgagcc cattccagag acagaagaag gcattcttta cgcagatgtg 840
gacctgggct tcatcggtgt caccaaggca gcgtatgacc ccgccggtca ctattcacgc 900
cccgacgtgc tgcgcctttt gttcaatcgg aagcctgccc ctcgggttca cgatttcgat 960
cctgaataca cggccaccga gcagaagaca gacgcggcct ga 1002
<210>350
<211>333
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>350
Met Thr Ser Pro Val Gln Thr Lys Tyr Lys Val Ala Cys Val Gln Ala
1 5 10 15
Ala Pro Glu Phe Leu Asp Leu Asp Lys Gly Val Ala Lys Ala Val Arg
20 25 30
Leu Ile Glu Glu Ala Ala Thr Gln Lys Ala Ser Leu Ile Ala Phe Pro
35 40 45
Glu Val Trp Leu Pro Gly Tyr Pro Trp Trp Ile Trp Leu Asp Ser Pro
50 55 60
Ala Trp Gly Leu Gln Phe Val Gln Arg Tyr Phe Glu Asn Ala Leu Val
65 70 75 80
Val Gly Ser Pro Gln Trp Glu Arg Leu Cys Lys Ala Ala Ala Asp Asn
85 90 95
Asn Ile His Val Val Leu Gly Phe Ser Glu Arg Asp Gly Ser Thr Leu
100 105 110
Tyr Leu Ala Gln Ala Ile Ile Asp Asn Thr Gly Lys Val Ile Ala Thr
115 120 125
Arg Arg Lys Leu Lys Pro Thr His Ala Glu Arg Thr Val Phe Gly Glu
130 135 140
Gly Asp Gly Ser His Ile Ala Val His Glu Thr Thr Leu Gly Arg Met
145 150 155 160
Gly Ala Leu Cys Cys Ala Glu His Ile Gln Pro Leu Thr Lys Tyr Ala
165 170 175
Met Tyr Ser Gln His Glu Gln Ile His Ile Ala Ala Trp Pro Ser Phe
180 185 190
Ser Val Tyr Arg Gly Ala Ala Phe Gln Leu Ser Ala Glu Ala Asn Asn
195 200 205
Ala Ala Ser Gln Val Tyr Ala Leu Glu Gly Ser Cys Tyr Val Val Ala
210 215 220
Pro Cys Ala Thr Val Ser Lys Glu Met Leu Asp Met Leu Ala Asp Ser
225 230 235 240
Pro Gln Lys Lys Gln Leu Leu Leu Glu Gly Gly Gly Tyr Ala Met Ile
245 250 255
Tyr Gly Pro Asp Ala Lys Pro Leu Cys Glu Pro Ile Pro Glu Thr Glu
260 265 270
Glu Gly Ile Leu Tyr Ala Asp Val Asp Leu Gly Phe Ile Gly Val Thr
275 280 285
Lys Ala Ala Tyr Asp Pro Ala Gly His Tyr Ser Arg Pro Asp Val Leu
290 295 300
Arg Leu Leu Phe Asn Arg Lys Pro Ala Pro Arg Val His Asp Phe Asp
305 310 315 320
Pro Glu Tyr Thr Ala Thr Glu Gln Lys Thr Asp Ala Ala
325 330
<210>351
<211>936
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>351
atgattacag caggcatcgc agtcgccgct ccggtggtgt tggacaaaac aaaaaccatt 60
gagaaagccg ttggcattat tcacgaggca gcgggtaagg gtgtgaacct gcttgtgttt 120
cccgaggcat ttattccctc ctatcccgcc tggggttggc gcctgcgtcc cggtggagat 180
ttcgggttgt gcgaggagtt gcacgccctg ttgcttgata attcggtaaa tttgcaaggt 240
gatgacctgg accctgtccg gggcgctgca gccgagcatt caatgaccgt ggtgatggga 300
ttgaatgagc gcgaaggcca gttcggtcgg gctaccctgt ttaacgccat ggtatttatc 360
ggtccggacg gcagcatcct gaaccatcat cggaaactta tgccaaccaa tcatgagcgt 420
acgattcatg gcttcggcga tgcgcgggga ttgaaagtgg tggatacccc gtgcggtcgc 480
gtgggtggtc tgatttgctg ggagaatttc atgcccctgg ctcgctacgg cctgtatgcc 540
cagggcgtag aagtgtatgt tgcgcccacc tacgaccagg gtgatgggtg gataggatcc 600
atgcagcata ttgcccggga aggacggtgc tgggtgttat cggccggaac accgctacgc 660
ggcagtgatt ttcccgcgga catgccgggc aaggctcaac tgtttcccga tgacgatgaa 720
tgggtgaatc ccggtgggtc agtggttatc gcaccgggtg gggaattagt ggctggaccg 780
cttttccgtg aggagggcat ccttgtctgt gaattggatc cggcgaaaag tgctcatgcc 840
aagcggtcct ttgacgtggc cggtcattac gccaggccag atattttcga gttggaaata 900
gaccgtgatc cacaggatcc cgtcgagtgg gactga 936
<210>352
<211>301
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>352
Met Ile Thr Ala Gly Ile Ala Val Ala Ala Pro Val Val Leu Asp Lys
1 5 10 15
Thr Lys Thr Ile Glu Lys Ala Val Ala Leu Phe Thr Arg Gln Arg Val
20 25 30
Arg Val Thr Cys Leu Cys Phe Pro Arg His Leu Phe Pro Pro Ile Pro
35 40 45
Pro Gly Val Gly Ala Cys Val Pro Val Glu Ile Ser Gly Cys Ala Arg
50 55 60
Ser Cys Thr Pro Cys Cys Leu Ile Ile Arg Asn Leu Gln Gly Asp Asp
65 70 75 80
Leu Asp Pro Val Arg Gly Ala Ala Ala Glu His Ser Met Thr Val Val
85 90 95
Met Gly Glu Ala Arg Arg Pro Val Arg Ser Gly Tyr Pro Val Arg His
100 105 110
Gly Ile Tyr Arg Ser Gly Arg Gln His Pro Glu Pro Ser Ser Glu Thr
115 120 125
Tyr Ala Asn Gln Ser Ala Tyr Asp Ser Trp Leu Arg Arg Cys Ala Gly
130 135 140
Ile Glu Ser Gly Gly Tyr Pro Val Arg Ser Arg Gly Trp Ser Asp Leu
145 150 155 160
Leu Gly Glu Phe His Ala Pro Gly Ser Leu Gly Leu Tyr Ala Gln Gly
165 170 175
Val Glu Val Tyr Val Ala Pro Thr Tyr Asp Gln Gly Asp Gly Trp Ile
180 185 190
Gly Ser Ala Ala Tyr Cys Pro Gly Arg Thr Val Leu Gly Val Ile Gly
195 200 205
Arg Asn Thr Ala Thr Arg Gln Phe Ser Arg Thr Cys Arg Ala Arg Leu
210 215 220
Asn Cys Phe Pro Met Thr Met Asn Gly Ile Pro Val Gly Gln Trp Leu
225 230 235 240
Ala Pro Gly Gly Glu Leu Val Ala Gly Pro Leu Phe Arg Glu Glu Gly
245 250 255
Ile Leu Val Cys Glu Leu Asp Pro Ala Lys Ser Ala His Ala Lys Arg
260 265 270
Ser Phe Asp Val Ala Gly His Tyr Ala Arg Pro Asp Ile Phe Glu Leu
275 280 285
Glu Ile Asp Arg Asp Pro Gln Asp Pro Val Glu Trp Asp
290 295 300
<210>353
<211>1035
<212>DNA
<213>Psuedomonas putida ATCC 700801
<400>353
atgaatgctt caaaaacaca ttataaagtt gcagctgtgc aagccgctcc tgaattcctg 60
gatttggaca aaggggttga taaggctata cgcctcatta aagaagcagc tgataacgga 120
gcatcgctgg ttgcctttcc ggaagtttat cttccagggt atccgtggtg gatctggctt 180
ggctctcccg catggggaat gcagtttgtt cagcgctacg tagaaaactc gcttgattta 240
aaaagcgagc agttcgagcg actgtgcaaa gcggctgcta cttaccgtat tcacgttgta 300
atgggttaca gcgaacgctc gtttggcacc ctctacctcg gtcaagcaat tatcgatgac 360
agcgggaaag taattggtac gcgccgtaag ctcaagccca cccatgctga gcgtactgtt 420
tacggtgagg gtaacggcag tgacctcagg gttttcaatt cgcaacttgg aagggtaggt 480
gcactctgct gcgcagagca cgtacaacca ctctcgaagt ttgcgatgta cagccagcat 540
gagcagttgc acattgcctc ttggccgagc ttctcggtat atcgcggtgg cgcatatcaa 600
ctgagcgctg aggctaactg cgcggccacc caagtctatg ctcttgaagg ccagtgcttc 660
gtaatttcag catgcgcaat cgtatctaaa gacatgctaa acgttctcat cgacacccct 720
gacaagggta acctgttgca ggatggcggt ggcttcgcga tgatttatgg ccccgatggc 780
gcaccgctgt gtgaacccct gggcgaacat gaagaaggca tcctttatgc cgacgtcgat 840
ttgggcgcca tttccgtagc taaagcggca ctcgacccgg ttggccatta ctcgcggcca 900
gatgttttgc gtctgctttt caacgatcaa ccgacacctt gcgtggaggc gttcaatccg 960
gctcctgtcg gtactgatgc tccaggtaca gaccttcagg gtgatgagcc cgatgcgcaa 1020
ccgatatctg aataa 1035
<210>354
<211>312
<212>PRT
<213>Psuedomonas putida ATCC 700801
<400>354
Met Asn Ala Ser Lys Thr His Tyr Lys Val Ala Ala Val Gln Ala Ala
1 5 10 15
Pro Glu Phe Leu Asp Leu Asp Lys Gly Val Asp Lys Ala Ile Arg Leu
20 25 30
Ile Lys Glu Ala Ala Asp Asn Gly Ala Ser Leu Val Ala Phe Pro Glu
35 40 45
Val Tyr Ser Arg Val Ser Val Val Asp Leu Ala Trp Leu Ser Arg Met
50 55 60
Gly Asn Ala Val Cys Ser Ala Leu Arg Arg Lys Arg Leu Ile Lys Ala
65 70 75 80
Ser Ser Ser Ser Asp Cys Ala Lys Arg Leu Leu Leu Thr Val Phe Thr
85 90 95
Leu Trp Leu Gln Arg Thr Leu Val Trp His Pro Leu Pro Arg Ser Ser
100 105 110
Asn Tyr Arg Gln Arg Glu Ser Asn Trp Tyr Ala Pro Ala Gln Ala His
115 120 125
Pro Cys Ala Tyr Cys Leu Arg Gly Arg Gln Pro Gln Gly Phe Gln Phe
130 135 140
Ala Thr Trp Lys Gly Arg Cys Thr Leu Leu Arg Arg Ala Arg Thr Thr
145 150 155 160
Thr Leu Glu Val Cys Asp Val Gln Pro Ala Ala Val Ala His Cys Leu
165 170 175
Leu Ala Glu Leu Leu Gly Ile Ser Arg Trp Arg Ile Ser Thr Glu Arg
180 185 190
Gly Leu Arg Gly His Pro Ser Leu Cys Ser Arg Pro Val Leu Arg Asn
195 200 205
Phe Ser Met Arg Asn Val Ser Lys Asp Met Leu Asn Val Leu Ile Asp
210 215 220
Thr Pro Asp Lys Gly Asn Leu Leu Gln Asp Gly Gly Gly Phe Ala Met
225 230 235 240
Ile Tyr Gly Pro Asp Gly Ala Pro Leu Cys Glu Pro Leu Gly Glu His
245 250 255
Glu Glu Gly Ile Leu Tyr Arg Arg Arg Phe Gly Arg His Phe Arg Ser
260 265 270
Ser Gly Thr Arg Pro Gly Trp Pro Leu Leu Ala Ala Arg Cys Leu Arg
275 280 285
Leu Leu Phe Asn Asp Gln Pro Thr Pro Cys Val Glu Ala Phe Asn Pro
290 295 300
Ala Pro Val Gly Thr Asp Ala Pro
305 310
<210>355
<211>1014
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>355
atggccgtct ctaaagacgg tactgtttca ggaaagtcgc ctatccgatt gcatgtcgcc 60
gcgatacaga tggtcccaaa gctgggtgac gcgcaggcga acgtgaatca ggcagaagcc 120
cttattcgga aggctcttgg gctgggtgcg cgttggatcg tgttaccaga gatgtttacc 180
tccggtgcgg cgtttcatcc cgacatgctc aaagccattc agccattcga tggcgcccca 240
ctccagttgc tgaaagacct ttctcgcaag ggcaatgctg tcatcggcgg ctcgtttctc 300
gccaagcgtg ggcaacaagt attcaatacc ttcgttttgg tttctccgga cgggtcagtc 360
gtaacgcatg acaaggattc accgacctat tgggaaaatt gctattaccg gggcggtacc 420
gatgatggcg tgttgtctac gcccattggc ccggtcggct ccgtcctctg ttgggaattt 480
atccgctcaa gaaccgcgag acggctggcg aacaaggtca agatggtcgt gggaggctcc 540
tgttggtgga cgctccccga tgatgctgat ccagacagcc cgcgcagagc cgtgaacctc 600
aagatgctgc aagaagcgcc ggttcgcatg gcgcggatgc tgggtgttcc ggtaatacat 660
ggctcccacg cgggcagctt cgaaggattc ttcagtccgg aacttgcgga tgttccctat 720
aactcgacgt acctgggcga gacaatgatt gtcgacgcgg gtggccgggt acttgcccgt 780
agagcgcaag atgcaggcga aggcgtggta acggcagaag tggttttgcc cgacaagtcc 840
gtaccaagcg aacccatccc ggagactttc tggattccca aggaaatgcc ggatgattgg 900
aaagaagcct gggagcgttg gttcgatacc ggtgcggatt actacgagat ggtgaccgcg 960
ccctttatca agacgggtgt gataaacgag tacacaccgg aatatcttag gtag 1014
<210>356
<211>325
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>356
Met Ala Val Ser Lys Asp Gly Thr Val Ser Gly Lys Ser Pro Ile Arg
1 5 10 15
Leu His Val Ala Ala Ile Gln Met Val Gln Ser Trp Val Thr Arg Arg
20 25 30
Arg Thr Ile Arg Gln Lys Pro Leu Phe Gly Arg Leu Leu Gly Trp Val
35 40 45
Arg Val Gly Ser Cys Tyr Gln Arg Cys Leu Pro Pro Val Arg Arg Phe
50 55 60
Ile Pro Thr Cys Ser Lys Pro Phe Ser Phe Asp Gly Ala Pro Leu Gln
65 70 75 80
Leu Leu Lys Asp Leu Ser Arg Lys Gly Asn Ala Val Ile Gly Gly Ser
85 90 95
Phe Leu Gln Ala Trp Ala Thr Ser Ile Gln Tyr Leu Arg Phe Gly Phe
100 105 110
Ser Gly Arg Val Ser Arg Asn Ala Gln Gly Ser Pro Thr Tyr Trp Glu
115 120 125
Asn Cys Tyr Tyr Arg Gly Gly Thr Asp Asp Gly Val Leu Ser Thr Pro
130 135 140
Ile Gly Pro Ser Ala Pro Ser Ser Val Gly Asn Leu Ser Ala Gln Glu
145 150 155 160
Pro Arg Asp Gly Trp Arg Thr Arg Ser Arg Trp Val Gly Gly Ser Cys
165 170 175
Trp Trp Thr Leu Pro Asp Asp Ala Asp Pro Asp Ser Pro Arg Arg Ala
180 185 190
Val Asn Leu Arg Cys Cys Lys Lys Arg Arg Phe Ala Trp Arg Gly Cys
195 200 205
Trp Val Phe Arg Tyr Met Ala Pro Thr Arg Gln Leu Arg Arg Ile Leu
210 215 220
Gln Ser Gly Thr Cys Gly Cys Ser Leu Leu Asp Val Pro Gly Arg Asp
225 230 235 240
Asn Asp Val Asp Ala Gly Gly Arg Val Leu Ala Arg Arg Ala Gln Asp
245 250 255
Ala Gly Glu Gly Val Val Thr Ala Glu Gly Phe Ala Arg Gln Val Arg
260 265 270
Thr Lys Arg Thr His Pro Gly Asp Phe Leu Asp Ser Gln Gly Asn Ala
275 280 285
Gly Ile Gly Lys Lys Pro Gly Ser Val Gly Ser Ile Pro Val Arg Ile
290 295 300
Thr Thr Arg Trp Pro Arg Pro Leu Ser Arg Arg Val Thr Ser Thr His
305 310 315 320
Arg Asn Ile Leu Gly
325
<210>357
<211>951
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>357
atgacaaaac tagccatcgt acaaaaaccg ccagtctttc tggataagca aaaaaccatt 60
gagctggccg tcgccaacat tgaagaggcc gccgccaagg gtgccgatct cgtggtgttt 120
tctgaagctt tcattcccgg ctatcctgcc tggatctggc gtctacgccc cggcggtgac 180
tgggggcttt cagaagagtt gcaccagcgt ttgctgcgca atgccgtcaa tgtggactcc 240
gatgatctgg ctccgttgtt tgaggtcgcc cgcaagcacg aactcaccat cgtttgcggt 300
atcgaggagc gtgacaacaa actaagtcaa acaaccttat ataacaccgt catcacgatt 360
ggtcccgatg gatcgttact gaacaaacat cgcaagctta tgcccaccaa cccggagcga 420
atggtgtggg ggtttggtga cgcatccggt ttaaaagtcg ttgataccaa tgctggtcga 480
attggctcat taatgtgctg ggaaaattac atgccgctgg ctcgctatgc cctatatgca 540
caaggtgtcg agatctatat cgcaccgacc tacgacagcg gtgatggctg gataggcagc 600
atgcagcaca tcgcacgtga agggggctgt tgggtggtgg gatgtgggtg tctcatgaaa 660
ggcagtgata ttccagatga tttcccggag aaatccacgt tgtatccaga tgcagatgaa 720
tgggtgaacc cgggtgattc tgtagtgata gcacccggcg gtgaaattat ggccggccca 780
atgaacagag agtccggtat tttgtatcac gagctagaca gagaaaaagt cagcaacgct 840
aaacgagcat tcgatgttgc cgggcattat tcacgtcccg atatctttca gctgcatgta 900
aatacacagg agcagtcacc ctgcgtattc gaaaataatt ccataactta a 951
<210>358
<211>300
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>358
Met Thr Lys Leu Ala Ile Val Gln Lys Pro Pro Val Phe Leu Asp Lys
1 5 10 15
Gln Lys Thr Ile Glu Leu Ala Val Pro Thr Leu Lys Arg Pro Pro Pro
20 25 30
Arg Val Pro Ile Ser Trp Cys Phe Leu Lys Leu Ser Phe Pro Ala Ile
35 40 45
Leu Trp Ile Trp Arg Leu Arg Pro Gly Gly Asp Trp Gly Leu Ser Glu
50 55 60
Glu Leu His Gln Arg Leu Leu Arg Asn Ala Val Asn Val Asp Ser Asp
65 70 75 80
Asp Leu Ala Pro Leu Phe Glu Val Ala Arg Lys His Glu Leu Thr Ile
85 90 95
Val Cys Gly Arg Gly Ala Gln Gln Thr Lys Ser Asn Asn Leu Ile His
100 105 110
Arg His His Asp Trp Ser Arg Trp Ile Tyr Thr Asn Ile Ala Ser Leu
115 120 125
Cys Pro Pro Thr Arg Ser Glu Trp Cys Gly Gly Leu Val Thr His Pro
130 135 140
Leu Lys Val Val Asp Thr Asn Ala Gly Arg Ile Gly Ser Leu Met Cys
145 150 155 160
Trp Glu Asn Tyr Met Pro Leu Ala Arg Tyr Ala Leu Tyr Ala Gln Gly
165 170 175
Val Glu Ile Tyr Ile Ala Pro Thr Tyr Asp Ser Gly Asp Gly Trp Ile
180 185 190
Gly Ser Ala Ala His Arg Thr Arg Gly Leu Leu Gly Gly Gly Met Trp
195 200 205
Val Ser His Glu Arg Gln Tyr Ser Met Ile Ser Arg Arg Asn Pro Arg
210 215 220
Cys Ile Gln Met Gln Met Asn Gly Thr Arg Val Ile Leu Thr Arg Arg
225 230 235 240
Asn Tyr Gly Arg Pro Asn Glu Gln Arg Val Arg Tyr Phe Val Ser Arg
245 250 255
Ala Arg Gln Arg Lys Val Ser Asn Ala Lys Arg Ala Phe Asp Val Ala
260 265 270
Gly His Tyr Ser Arg Pro Asp Ile Phe Gln Leu His Val Ile His Arg
275 280 285
Ser Ser His Pro Ala Tyr Ser Lys Ile Ile Pro Leu
290 295 300
<210>359
<211>1029
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>359
atggcaaacg tcgttcgtgc tgcggccgta cagttgagcc ctgttttggg tagtcgcgag 60
ggtacggtag agaaggtagt tgctgcgatc cgtgacgccg cctcgcaggg cgcacagctg 120
tgcgttttcc cggagacggt tgttccctat tatccgtatt tctcgttcat tcggccgccc 180
gcggccatgg gcaaagacca catgcagctg tacgagcaag ctgtggtcgt gccttctccc 240
agcacgaacg cgattgccgc ggcggccaaa caacactcga tcgtcgtttc aatcggcgtc 300
aatgaacgcg atcacggtac gatatacaac acgcagttgt tgttcgatgc cgacgggaca 360
ctcgtgcaac ggcgtcgcaa gataaccccc acgttccacg agcgtatggt gtggggtcaa 420
ggtgatgggt cgggtttgcg ctgtgtcgac acacaaatcg ggcgcatcgg tagcctggct 480
tgttgggaac attacaatcc cttggcgcgc tacgcattga tggccgatca cgaagagatc 540
cacgtcgcca tgtttccggg ttcgatggtg ggtcagatct tcgccgatca aattcaggta 600
accattcgcc accacgcgct cgaaagcggc tgtttcgtcg tcaacgctac ggggtatctg 660
agcaaggaac aggtcgccca gttgtcacaa ggcacgtcgc tcgacgcggc gttgaccggt 720
ggttgttaca ccgcgattgt atcgcctgaa ggcgtcgtac tgggcgaacc gctcaccgac 780
ggcgaaggca tggtcgtggc ggatatggat ctcagcctca tcaccaaacg caaacgcatg 840
atggatagcg tcgggcacta cagtcgcccg gaattgctgt ctctgctgat caatcgaacg 900
ccaacccaca cggcggtcga cgtcgaattc aactccaatt ccgagtctca tcatgtcagc 960
aatacacgaa caccaaagcg cacaactggc ccacgttcga accttcaagt tgccgctgat 1020
cgcgagtaa 1029
<210>360
<211>335
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>360
Met Ala Asn Val Val Arg Ala Ala Ala Val Gln Leu Ser Pro Val Leu
1 5 10 15
Gly Ser Arg Glu Gly Thr Val Glu Lys Val Val Ala Ala Ile Arg Asp
20 25 30
Ala Ala Ser Gln Gly Ala Gln Leu Cys Val Phe Pro Glu Thr Val Val
35 40 45
Pro Tyr Ser Val Phe Leu Val His Ser Ala Ala Arg Gly His Gly Gln
50 55 60
Arg Pro His Ala Ala Val Arg Ala Ser Cys Gly Arg Ala Phe Ser Gln
65 70 75 80
His Glu Arg Asp Cys Arg Gly Gly Gln Thr Thr Leu Asp Arg Arg Phe
85 90 95
Asn Arg Arg Met Asn Ala Ile Thr Val Arg Tyr Thr Thr Arg Ser Cys
100 105 110
Cys Ser Met Pro Thr Gly His Ser Cys Asn Gly Arg Lys Ile Thr Pro
115 120 125
Thr Phe His Glu Arg Met Val Trp Gly Gln Gly Asp Gly Ser Gly Leu
130 135 140
Arg Cys Val Asp Thr Gln Ile Gly Arg Ile Gly Ser Leu Ala Cys Trp
145 150 155 160
Glu His Tyr Asn Pro Leu Ala Arg Tyr Ala Leu Met Ala Asp His Glu
165 170 175
Glu Ile His Val Ala Met Phe Pro Gly Ser Met Val Gly Gln Ile Phe
180 185 190
Ala Asp Gln Ile Gln Val Pro Phe Ala Thr Thr Arg Ser Lys Ala Ala
195 200 205
Val Ser Ser Ser Thr Leu Arg Gly Ile Ala Arg Asn Arg Ala Gln Leu
210 215 220
Ser Gln Gly Thr Ser Leu Asp Ala Ala Leu Thr Gly Gly Cys Tyr Thr
225 230 235 240
Ala Ile Val Ser Pro Glu Gly Val Val Leu Gly Glu Pro Leu Thr Asp
245 250 255
Gly Glu Gly Met Val Val Ala Asp Met Asp Leu Ser Leu Ile Pro Asn
260 265 270
Ala Asn Ala Trp Ile Ala Ser Gly Thr Thr Val Ala Arg Asn Cys Cys
275 280 285
Leu Cys Ser Ile Thr Pro Thr His Thr Ala Val Asp Val Glu Phe Asn
290 295 300
Ser Asn Ser Glu Ser His His Val Ser Asn Thr Arg His Gln Ser Ala
305 310 315 320
Gln Leu Ala His Val Arg Thr Phe Lys Leu Pro Leu Ile Ala Ser
325 330 335
<210>361
<211>951
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>361
atgctaaacg agaacaatgg cgctactttc aaggttgctg ccgtgcaggc ttcaccggta 60
tttcttgatc gcgctgctac aatcgacaag gcttgcgatt taattgctac ggccggacgc 120
gagggggctc gcctgatcgt ctttccagaa gcgttcgtcc cggcctatcc tgattgggta 180
tgggcgattc ccgcgggtga tgagggcatg ctcaatgagc tgtatgcaga attacttgcc 240
aatgctgtca ccattcccag cgatgcgacc gagaggttgt gtcgcgcggc gcggcttgct 300
aatgcttacg tggtgatggg gatgagcgaa cgcaatgccg aagcgagtgg cgccagcctg 360
tataatacgc tgttgtatat tgatgcacag ggacaaatcc tgggaaagca ccggaagctg 420
gttccaacgg gcggcgagcg cctggtatgg gcacagggag atggcagcac cctggaggtt 480
tacgatactc ctttgggaaa actcggtggc ttaatctgct gggagaatta tatgccgctg 540
gcacgctata ctatgtatgc ctggggcacg caaatctaca ttgcagcgac gtgggatcgc 600
gggcagccat ggctatccac tttgcgacac attgctaaag agggcagagt atatgtgatc 660
ggctgttgta ttgctatgcg caaagatgat atccccgacc attacgcgat gaaggagaag 720
tattacgcgg aagaagacga gtggatcaat attggcgata gcgcaatcgt caatccagaa 780
ggggtattta ttgccgggcc agtgcgtaag caagaagaaa tcctctacgc cgaggttgac 840
ccgcgaatga tgcaggggcc aaagtggatg ctcgacgtgg caggacatta cgcgcgcccg 900
gatgtattcc agttgacggt gcacacggag aggcggcaga tgatccacta g 951
<210>362
<211>302
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>362
Met Leu Asn Glu Asn Asn Gly Ala Thr Phe Lys Val Ala Ala Val Gln
1 5 10 15
Ala Ser Pro Val Phe Leu Asp Arg Ala Tyr Asn Arg Gln Gly Leu Arg
20 25 30
Phe Asn Cys Tyr Gly Arg Thr Arg Gly Gly Ser Pro Asp Arg Leu Ser
35 40 45
Arg Ser Val Arg Pro Gly Leu Ser Leu Gly Met Gly Asp Ser Arg Gly
50 55 60
Gly His Ala Gln Ala Val Cys Glu Leu Leu Ala Asn Ala Val Thr Ile
65 70 75 80
Pro Ser Asp Ala Thr Glu Arg Leu Cys Arg Ala Ala Arg Leu Ala Asn
85 90 95
Ala Tyr Val Val Met Gly Met Ser Glu Arg Asn Ala Glu Ala Ser Gly
100 105 110
Ala Ser Leu Tyr Asn Thr Leu Leu Tyr Cys Thr Gly Thr Asn Pro Gly
115 120 125
Lys Ala Pro Glu Ala Gly Ser Asn Gly Arg Arg Ala Pro Gly Met Gly
130 135 140
Thr Gly Arg Trp Gln His Pro Gly Gly Leu Arg Tyr Ser Phe Gly Lys
145 150 155 160
Thr Arg Trp Leu Asn Leu Leu Arg Ile Ile Cys Arg Trp His Ala Ile
165 170 175
Leu Cys Met Pro Gly Ala Arg Lys Ser Thr Leu Gln Arg Arg Gly Ile
180 185 190
Ala Gly Ser His Gly Tyr Pro Leu Cys Asp Thr Leu Leu Lys Arg Ala
195 200 205
Glu Tyr Met Ser Ala Val Val Leu Tyr Ala Gln Arg Tyr Pro Arg Pro
210 215 220
Leu Arg Asp Glu Gly Glu Val Leu Arg Gly Arg Arg Arg Val Asp Gln
225 230 235 240
Tyr Trp Arg Arg Asn Arg Gln Ser Arg Arg Gly Ile Tyr Cys Arg Ala
245 250 255
Ser Ala Ala Arg Arg Ile Leu Tyr Ala Glu Val Asp Pro Arg Met Met
260 265 270
Gln Gly Pro Lys Trp Met Leu Asp Val Ala Gly His Tyr Arg Ala Arg
275 280 285
Met Tyr Ser Ser Arg Cys Thr Arg Arg Gly Gly Arg Ser Thr
290 295 300
<210>363
<211>1053
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>363
atgccgaaaa agtcgaccgt ccgggtcgca gccgtccaga ttgcgccgga tctgacatcg 60
cgggaaaaga cggtggcacg cgtgatcgag gcgatcgccc aggcatccgc caaaggtgcg 120
gagcttgtgg tttttcccga gacctttgtg ccgtggtatc cttatttctc gttcgtgttg 180
cccccggtct tgtcgggcaa ggagcacctg cggctctacg aagaggcggt tgcggtgcca 240
agtgccgcca caagaagcgt agcggctgcc gctcgcgaac atggcatcgt cgtggcgctt 300
ggcgtcaacg agcgcgacta tggcacgctc tacaatacgc aactgctttt cgatgccgat 360
ggcagtctga tcctgaagcg gcgcaagatc accccgactt tccacgagcg gatgatctgg 420
ggccagggcg atgcctcagg cctgaaggtt gtcgacagcg ccattggccg catcggcgcg 480
ctggcctgct gggaacacta caatccgcta gcccgctatg cgctgatggc gcagcacgag 540
gaaatccaca ttgcgcagtt tcccggctcc atggtcgggc cgatctttgc cgatcagatg 600
gaggtgacga tccgccatca cgcgctggaa agcggctgct tcgtcgtcaa tgccacggga 660
tggctgacgg atgatcagat cgtctcgatc acaccggata ccggcctgca aaaagcgctg 720
cggggtggct gcatgacggc gatcatttcc cccgaaggca agcatctcgt gccgccgctc 780
accgaaggtg agggtatcct cgtcgccgat ctcgacatga gcctcattct caagcgcaag 840
cgcatgatgg attcggtcgg ccactatgcc cggcccgagt tgctgcacct cgtcatggac 900
gcccggccgg ctgcgccgat gagggaatcg tccatgccca ctgccttccc cggcgaaaca 960
ttgacaaccg acatgaccga tggagaacag gatgcgtctt tcgacggaaa cgctgatcaa 1020
cgaatt gcag tccttcggag cccggctggt tga 1053
<210>364
<211>335
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>364
Met Pro Lys Lys Ser Thr Val Arg Val Ala Ala Val Gln Ile Ala Pro
1 5 10 15
Asp Leu Thr Ser Arg Glu Lys Gly Gly Thr Arg Asp Arg Gly Asp Arg
20 25 30
Pro Gly Ile Arg Gln Arg Cys Gly Ala Cys Gly Phe Ser Arg Asp Leu
35 40 45
Cys Arg Gly Ile Leu Ile Ser Arg Ser Cys Cys Pro Arg Ser Cys Arg
50 55 60
Ala Arg Ser Thr Cys Gly Ser Thr Lys Arg Arg Leu Arg Cys Gln Val
65 70 75 80
Pro Pro Gln Glu Ala Arg Leu Pro Leu Ala Asn Met Ala Ser Ser Trp
85 90 95
Arg Leu Ala Ser Thr Ser Ala Thr Met Ala Arg Ser Thr Ile Arg Asn
100 105 110
Cys Phe Ser Met Pro Met Ala Val Ser Ser Gly Ala Arg Ser Pro Arg
115 120 125
Leu Ser Thr Ser Gly Ser Gly Ala Arg Ala Met Pro Gln Ala Glu Gly
130 135 140
Cys Arg Gln Arg His Trp Pro His Arg Arg Ala Gly Leu Leu Gly Thr
145 150 155 160
Leu Gln Ser Ala Ser Pro Tyr Ala Leu Met Ala Gln His Glu Glu Ile
165 170 175
His Ile Ala Gln Phe Pro Gly Ser Met Val Gly Pro Ile Phe Ala Asp
180 185 190
Gln Met Glu Val Thr Ile Arg His His Ala Leu Glu Ser Gly Cys Phe
195 200 205
Val Val Asn Ala Thr Gly Trp Arg Met Ile Arg Ser Ser Arg Ser His
210 215 220
Arg Ile Pro Ala Cys Lys Lys Arg Cys Gly Val Ala Ala Gly Asp His
225 230 235 240
Phe Pro Arg Arg Gln Ala Ser Arg Ala Ala Ala His Arg Arg Gly Tyr
245 250 255
Pro Arg Arg Arg Ser Thr Ala Ser Phe Ser Ser Ala Ser Ala Trp Ile
260 265 270
Arg Ser Ala Thr Met Pro Gly Pro Ser Cys Cys Thr Ser Ser Trp Thr
275 280 285
Pro Gly Arg Leu Arg Arg Gly Asn Arg Pro Cys Pro Leu Pro Ser Pro
290 295 300
Ala Lys Ile Asp Asn Arg His Asp Arg Trp Arg Thr Gly Cys Val Phe
305 310 315 320
Arg Arg Lys Arg Ser Thr Asn Cys Ser Leu Arg Ser Pro Ala Gly
325 330 335
<210>365
<211>975
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>365
atgacaaagc tagccatcgt tcaaaaaccc cccgtctttc tggataaaga aaaaaccata 60
gcgaagacgg ttgattccat aaaagaggcc gcgacacaaa atgccgactt ggtcatcttc 120
accgaagcct tcatcccggg ctaccccacc tggatatggc gacttaggcc aggcgctgat 180
tggggcctct cagaagagct gcacgagcag ttattgcgta acgcggtgag tatgggatcg 240
accgacctgg atccgcttta tgaagccgcc caacagcata acgtcactat tgtttgcggc 300
atcgtagaaa gagaccacca actcagccaa tcaaccctct acaacagcat ggtcgtcatt 360
gacacagacg gaacccttct caacaagcac cgcaaactca tgccaactaa tcccgaacgc 420
atggtgtggg gctttggcga cgcctcagga ctcaaagccg ttgcaacacc tgcaggccgc 480
atcagcacgt tgttgtgttg ggagaactac atgccattgg cccgatatgc tctgtatgca 540
caaggcgtgg aaatctatat cgcgccaact tacgacagtg gtgcgggttg gataggaagc 600
ttgcaacaca tagcacgcga aggtcgatgc tgggtcgtgg gctgtggcaa cctgattcag 660
gccagtgatc tgcctgaaga cttcccggac aaggacaacc tctacccgga cgcagaagag 720
tgggtgaacc cgggtgactc catagtcatt gcaccagacg gtgagattgt ggccggtcca 780
atgcacaaag agacaggaat tttgtactgc gagatagatc tggagaaagt cagaattgca 840
aaacgagcat tagacgtgac cgggcattat tcgcgaccgg acgttttcaa actgcatgtg 900
aatacccgac ctcaatcacc tgtggaattt gaaggtcagg agaccaacaa tccaacaaca 960
ggagaaagct catga 975
<210>366
<211>315
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>366
Met Thr Lys Leu Ala Ile Val Gln Lys Pro Pro Val Phe Leu Asp Lys
1 5 10 15
Glu Lys Thr Ile Ala Lys Thr Val Ile Pro Lys Arg Pro Arg His Lys
20 25 30
Met Pro Thr Trp Ser Ser Ser Pro Lys Pro Ser Ser Arg Ala Thr Thr
35 40 45
Trp Ile Trp Arg Leu Arg Pro Gly Ala Asp Trp Gly Leu Ser Glu Glu
50 55 60
Leu His Glu Gln Leu Leu Arg Arg Gly Glu Tyr Gly Ile Asp Arg Pro
65 70 75 80
Gly Ser Ala Leu Ser Arg Pro Thr Ala Arg His Tyr Cys Leu Arg His
85 90 95
Arg Arg Lys Arg Pro Pro Thr Gln Pro Ile Asn Pro Leu Gln Gln His
100 105 110
Gly Arg His His Arg Arg Asn Pro Ser Gln Gln Ala Pro Gln Thr His
115 120 125
Ala Asn Ser Arg Thr His Gly Val Gly Leu Trp Arg Ala Ser Gly Leu
130 135 140
Lys Ala Val Ala Thr Pro Ala Gly Arg Ile Ser Thr Leu Leu Cys Trp
145 150 155 160
Glu Asn Tyr Met Ile Gly Pro Ile Cys Ser Val Cys Thr Arg Arg Gly
165 170 175
Asn Leu Tyr Arg Ala Asn Leu Arg Gln Trp Cys Gly Leu Asp Arg Lys
180 185 190
Leu Ala Thr His Ser Thr Arg Arg Ser Met Leu Gly Arg Gly Leu Trp
195 200 205
Gln Pro Asp Ser Gly Ser Asp Leu Pro Glu Asp Phe Pro Asp Lys Asp
210 215 220
Asn Leu Tyr Pro Asp Ala Glu Glu Trp Val Asn Pro Gly Asp Ser Ile
225 230 235 240
Val Ile Ala Pro Asp Gly Glu Ile Val Ala Gly Pro Met His Lys Glu
245 250 255
Thr Gly Ile Leu Tyr Arg Asp Arg Ser Gly Glu Ser Gln Asn Cys Lys
260 265 270
Thr Ser Ile Arg Arg Asp Arg Ala Leu Phe Ala Thr Gly Arg Phe Gln
275 280 285
Thr Ala Cys Glu Tyr Pro Thr Ser Ile Thr Cys Gly Ile Arg Ser Gly
290 295 300
Asp Gln Gln Ser Asn Asn Arg Arg Lys Leu Met
305 310 315
<210>367
<211>981
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>367
atgactgccc actttgccga taccctgacc gttgccgtgg cccagatcgc gccggtgtgg 60
ctacagcgcg aagctaccct gcaaaagatg ctggtctggg tggagcgcgc cgcgcgcgaa 120
ggcgcgggcc tggtcgcctt cagtgagggc ctgctgcctg gctacccctt ctggattgaa 180
cacacggacg gcgcccgctt cgagtcgccg ttacagaagc gcctgtacgc ccactactgc 240
gatcagtcgg tccagatcaa tgctgggcac ctcgcaccac tctgcgctgc cgcggccaga 300
caccagattt gggtggtctg cggcgtcatc gagcgcgaca gcgcacgcgg actcagcgtg 360
tttgcatcaa tggtcaccat cgatgccgaa ggcgcgatcc gcagtgtgca ccgcaagctg 420
atgccgacct acgaagaacg cctggtgtgg tcgcccggcg acgcgcacgg actgcgctgc 480
catccgctcg gccagttccg cctcggcagc ctcaattgct gggagaactg gatgccgctg 540
gcgcgcgccg ccctgtacgc ccagggcgag tctttgcatg ttgcatcctg gcccggcagt 600
cgccgcaaca ccgagaccat tactcccttc atcgcccgcg aaggccgcag ttacgcgctg 660
tccgccagtt ccgtgctgca ccgggatgat ctgcccgact ccgttccggc gctgtcggtg 720
ctgcgcgact gcctgccgga cgtgatggcc gacggcggct cctgcgtcgc cggccccgac 780
ggacatttcc tgatcgagcc ggtcgtcggc cgggaagagc tgctgctcgc gcagatcgat 840
catgcccggg tacgcgagga acgtcagaac ttcgacccct tcggccacta ctcgcggccg 900
gaactcctgt cgctggtggt ggatacgcgc cgggcgagcg gagtgcagat agtgaatgct 960
gaccat ggct ttaagccctg a 981
<210>368
<211>317
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>368
Met Thr Ala His Phe Ala Asp Thr Leu Thr Val Ala Val Ala Gln Ile
1 5 10 15
Ala Pro Val Trp Leu Gln Arg Glu Tyr Pro Ala Lys Asp Ala Gly Leu
20 25 30
Gly Gly Ala Arg Arg Ala Arg Arg Arg Gly Pro Gly Arg Leu Gln Gly
35 40 45
Pro Ala Ala Trp Leu Pro Leu Leu Asp Thr His Gly Arg Arg Pro Leu
50 55 60
Arg Val Ala Val Thr Glu Ala Cys Thr Pro Thr Thr Ala Ile Ser Arg
65 70 75 80
Ser Arg Ser Met Leu Gly Thr Ser His His Ser Ala Leu Pro Ala Arg
85 90 95
His Gln Ile Trp Val Val Cys Gly Val Ile Glu Arg Asp Ser Ala Arg
100 105 110
Gly Leu Ser Val Phe Ala Gln Trp Ser Pro Ser Met Pro Lys Ala Arg
115 120 125
Ser Ala Val Cys Thr Ala Ser Cys Arg Pro Thr Lys Asn Ala Trp Cys
130 135 140
Gly Arg Pro Ala Thr Arg Thr Asp Cys Ala Ala Ile Arg Ser Ala Ser
145 150 155 160
Ser Ala Ser Ala Ala Gln Leu Leu Gly Glu Leu Asp Ala Ala Gly Ala
165 170 175
Arg Arg Pro Val Arg Pro Gly Arg Val Phe Ala Cys Cys Ile Leu Ala
180 185 190
Arg Gln Ser Pro Gln His Arg Asp His Tyr Ser Leu His Arg Pro Arg
195 200 205
Arg Pro Gln Leu Arg Ala Ser Ala Ser Ser Val Leu His Arg Asp Asp
210 215 220
Leu Pro Asp Ser Val Pro Ala Leu Ser Val Leu Arg Asp Cys Leu Arg
225 230 235 240
Thr Trp Pro Thr Ala Ala Pro Ala Ser Pro Ala Pro Thr Asp Ile Ser
245 250 255
Ser Ser Arg Ser Ser Pro Gly Arg Ala Ala Ala Arg Ala Asp Arg Ser
260 265 270
Cys Pro Gly Thr Arg Gly Thr Ser Glu Leu Arg Pro Leu Gly His Tyr
275 280 285
Ser Arg Pro Glu Leu Leu Ser Leu Val Val Asp Thr Arg Arg Ala Ser
290 295 300
Gly Val Gln Ile Val Asn Ala Asp His Gly Phe Lys Pro
305 310 315
<210>369
<211>1074
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>369
atgtccgaag agaacaccac caccgaatca agctcggacg cgatctggac cgttgcggcc 60
gtccaggcgg cgccggtgtt cttgaatcgg gatgcgacgg tgcgtaaggc cgtctcgctg 120
attgccgagg ccgcggcgca cggcgcgcgt ctcattgttt ttcccgaggc gttcattcca 180
tcgtacccgg attgggcgtg ggcggtcccg cccggacagg gcggaaccaa ctcgagactg 240
tatgccaaac tgctcgacaa ttcggttacg gtgcccagcc cagccaccga tgccctggcc 300
agggcggctc gcgacgcggg cgcctacgtc gtcatgggca taaacgagcg gaacacggcg 360
gcgagcggcg gaagtctcta caacagcttg ctctatattg gtccggacgg tcgcatcctg 420
ggcattcatc ggaagttggt gccgacctcg gcggagcggc tgatctgggc acaaggcgat 480
ggaagcacgc ttggcgtgtt cgatacgccg ggcggccggc ttggcggctt gatctgctgg 540
gaaaactata tgccgctggc ccgatattcg atgtacgcgc gcggcgtgca aatatatgtt 600
gcggcgacct gggacagggg cgagccatgg ctttcgacgc tgcgacacat agccaaggaa 660
ggccagacct acgtgatagg ctgttgcatc gccatgcgga cggccgacat cgacgacgcc 720
gagctcgtcg agaagtatta cgcggacgcc ggtgagtgga tcaatgaagg cgacagcgcg 780
attgtcgatc cgagcggaac aatcattgcc ggtccggccc atcagaccaa tgaaatcctc 840
tacgccgcga tcgatcgcca gaaggtgctg gaatcaaaat ggatgttgga cgtggccggg 900
cactacgcgc gtccggacgt gttttcattt ggcgttcgaa ccgatgccaa cccgataatg 960
accatgaacg aaccaagcgc gacggccgag ccgaggcaca atagcgcggg agccgagggt 1020
cgcgacggcc tacgcgggcg tcgtgaccct cgctcgagaa ttcggcagat gtag 1074
<210>370
<211>346
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>370
Met Ser Glu Glu Asn Thr Thr Thr Glu Ser Ser Ser Asp Ala Ile Trp
1 5 10 15
Thr Val Ala Ala Val Gln Ala Ala Gly Val Leu Glu Ser Gly Cys Asp
20 25 30
Gly Ala Gly Arg Leu Ala Asp Cys Arg Gly Arg Gly Ala Arg Ala Arg
35 40 45
Leu Ile Val Phe Pro Glu Ala Phe Ile Pro Ser Tyr Pro Asp Trp Ala
50 55 60
Trp Ala Val Pro Pro Gly Gln Ala Glu Pro Thr Arg Asp Cys Met Pro
65 70 75 80
Asn Cys Ser Thr Ile Arg Leu Arg Cys Pro Ala Gln Pro Pro Met Pro
85 90 95
Gly Gln Gly Gly Ser Arg Arg Gly Arg Leu Arg Arg His Gly His Lys
100 105 110
Arg Ala Glu His Gly Gly Ser Gly Gly Ser Leu Tyr Asn Ser Leu Leu
115 120 125
Tyr Ile Gly Pro Asp Gly Arg Ile Leu Gly Ile His Arg Lys Leu Cys
130 135 140
Arg Pro Arg Arg Ser Gly Ser Gly His Lys Ala Met Glu Ala Arg Leu
145 150 155 160
Ala Cys Ser Ile Arg Arg Ala Ala Gly Leu Ala Ala Ser Ala Gly Lys
165 170 175
Thr Ile Cys Arg Trp Pro Asp Ile Arg Cys Thr Arg Ala Val Gln Ile
180 185 190
Tyr Val Ala Ala Thr Trp Asp Arg Gly Glu Pro Trp Leu Ser Thr Leu
195 200 205
Arg His Ile Ala Lys Lys Ala Arg Pro Thr Ala Val Ala Ser Pro Cys
210 215 220
Gly Arg Pro Thr Ser Thr Thr Pro Ser Ser Ser Arg Ser Ile Thr Arg
225 230 235 240
Thr Pro Val Ser Gly Ser Met Lys Ala Thr Ala Arg Leu Ser Ile Arg
245 250 255
Ala Glu Ile Ile Ala Gly Pro Ala His Gln Thr Asn Glu Ile Leu Tyr
260 265 270
Ala Ala Ile Asp Arg Gln Lys Val Leu Glu Lys Met Asp Val Gly Arg
275 280 285
Gly Arg Ala Leu Arg Ala Ser Gly Arg Val Phe Ile Trp Arg Ser Asn
290 295 300
Arg Cys Gln Pro Asp Asn Asp His Glu Arg Thr Lys Arg Asp Gly Arg
305 310 315 320
Ala Glu Ala Gln Arg Gly Ser Arg Val Ala Thr Ala Tyr Ala Gly Val
325 330 335
Val Thr Leu Ala Arg Glu Phe Gly Arg Cys
340 345
<210>371
<211>1014
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>371
atgggtatcg aacatccgaa gtacaaggtt gcggtggtac aggcggcgcc ggcctggctc 60
gatctcgacg catcgatcgc caagaccatc gcgctgatcg aggaggcggc cgccaagggc 120
gccaagctga tcgcattccc tgaggccttc attcccggct atccttggca tatctggatg 180
gactcgccgg cctgggcgat cgggcgcgga tttgtgcagc gctatttcga caattcgctt 240
tcctacgaca gcccgcaggc cgaacggctg cggctcgcgg tgaagaaggc cggcatcacc 300
gccgtgctcg gcctgtccga gcgggaaggc ggcagccttt atctcgcgca atggctgatc 360
ggtcccgacg gcgagaccat cgccaagcgg cgcaagctgc ggccgacgca tgccgagcgc 420
accgtctacg gcgaaggcga cggcagtgac cttgcggtcc atgaccgcgc tgacattggc 480
cgtctcggcg cgctgtgctg ctgggaacat ctgcagccgc tgtcgaaata cgccatgtat 540
gcccagaacg agcaggtgca tgtcgcggcc tggccgagtt tttcgctgta cgacccgttc 600
gcgccggcgc tgggctggga ggtcaacaac gcggcatccc gcgtctatgc ggtcgaaggc 660
tcctgcttcg tgctggcgcc gtgtgccacc gtctcgcagg cgatggtgga cgaactctgc 720
gaccgcgacg acaagcatgc gctgctgcat gtcggcggcg gccacgccgc gatctacgga 780
ccggacggca gctcgatggc gaacaagctc gatcccgagc aggagggcct gctgttcgcc 840
gacatcgatc tcggggcgat cggggtggca aagaacgccg ccgatccggc cgggcactat 900
tcgcggccgg atgtgacccg tctgctcttg aacagaaaac cctcaaagcg cgtcgagcac 960
tttgcgctgc cgctcgacca tctcgcggac gagggcgttg ctccggtgac ctga 1014
<210>372
<211>327
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>372
Met Gly Ile Glu His Pro Lys Tyr Lys Val Ala Val Val Gln Ala Ala
1 5 10 15
Pro Ala Trp Leu Asp Leu Asp Ala Arg Ser Pro Arg Pro Ser Arg Ser
20 25 30
Arg Arg Arg Pro Pro Arg Ala Pro Ser Ser His Ser Leu Ala Phe Ile
35 40 45
Pro Gly Tyr Pro Trp His Ile Trp Met Asp Ser Pro Ala Trp Ala Ile
50 55 60
Gly Arg Gly Phe Val Gln Ala Ile Ser Thr Ile Arg Phe Pro Thr Thr
65 70 75 80
Ala Arg Arg Pro Asn Gly Cys Gly Ser Arg Arg Arg Pro His His Arg
85 90 95
Arg Ala Arg Pro Val Arg Ala Gly Arg Arg Gln Pro Leu Ser Arg Ala
100 105 110
Met Ala Asp Arg Ser Thr Ala Arg Pro Ser Pro Ser Gly Ala Ser Cys
115 120 125
Gly Arg Arg Met Pro Ser Ala Pro Ser Thr Ala Lys Ala Thr Ala Val
130 135 140
Thr Leu Arg Ser Met Thr Ala Leu Thr Leu Ala Val Ser Ala Arg Cys
145 150 155 160
Ala Ala Gly Asn Leu Gln Pro Leu Ser Lys Tyr Ala Met Tyr Ala Gln
165 170 175
Asn Glu Gln Val His Val Ala Ala Trp Pro Ser Phe Ala Val Arg Pro
180 185 190
Val Arg Ala Gly Ala Gly Leu Gly Gly Gln Gln Arg Gly Ile Pro Arg
195 200 205
Leu Cys Gly Arg Gly Ser Cys Phe Val Leu Ala Pro Cys Ala Thr Val
210 215 220
Ser Gln Ala Met Val Asp Glu Leu Cys Asp Arg Asp Gln Ala Cys Ala
225 230 235 240
Ala Ala Cys Arg Arg Arg Pro Arg Arg Asp Leu Arg Thr Gly Arg Gln
245 250 255
Leu Asp Gly Thr Ser Ser Ile Pro Ser Arg Arg Ala Cys Cys Ser Pro
260 265 270
Thr Ser Ile Ser Gly Arg Ser Gly Trp Gln Arg Thr Pro Pro Ile Arg
275 280 285
Pro Gly Thr Ile Arg Gly Arg Met Pro Val Cys Ser Thr Glu Asn Pro
290 295 300
Gln Ala Arg Arg Ala Leu Cys Ala Ala Ala Arg Pro Ser Arg Gly Arg
305 310 315 320
Gly Arg Cys Ser Gly Asp Leu
325
<210>373
<211>1056
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>373
atgacgaatt ttctggacgt gacggtggca gcggttcagg cggctcccgt ctatttcgat 60
cgggaggcat cgacggagaa agcgtgccgg ttgatccacg aagcggcagg gctcggcgcg 120
acgctcgcag cgttcggcga aacctggttg ccagggtatc cgttcttcgt ctgggggttc 180
gcgcacaacc ggagcctgtt ctggcaggcc gccgccgagt acatcgccaa tgcggtggag 240
attccgagtc ccacaacgga ccgtctctgt gcggcggcga aggctgccgg ggtcgacgtc 300
gtcattggcg tcgttgaact ggatgaacga acacgagctt cggtttacag tacgctgctt 360
ttcatcggtc gcgacgggac gatcctgggc cgccaccgca agctgaagcc aacacacatg 420
gagcggacga tctggggcga aggggacgca tatggactcc gcgtctacga acgttcgtac 480
gggcggctga gcggcctgaa ttgctgggaa cacaatatga tgctgcccgg ctacgtgctt 540
gccgcacagg gcacgcagtt tcacgtcgcc gcatggcccg gaaaggagag gctcaccgtc 600
ccgccgaacg aagcggctta tacgcgccag cttctcctct ctcgcgcgta tgcatcccag 660
gccggcgcgt acgtgatcag cgtcgccggg ctgctcgcac cagactccat gcccgagcgt 720
tatcgcgagt tagggcggtc atatgagttg accggcgaca gcgtcatcgt cgacccgcgc 780
ggcgaggtca ttgccgggcc tgcaaaaggc gagaccatcc tgctcgcgca gtgcagtcag 840
gaagctctcc tcgcggccaa gtccgccatc gacctcggcg gccattactc acgcccggat 900
atctttcagc tgcgtgtcaa cgatcaactg cagcatcggg tccggagagt tgagccacac 960
ttcacggcgg cgatcggaca tatcggagcc gagcgccgat cccaggagga tggtactggt 1020
cccttcgacc tggcggaatc tctcacgaac tcctag 1056
<210>374
<211>351
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>374
Met Thr Asn Phe Leu Asp Val Thr Val Ala Ala Val Gln Ala Ala Pro
1 5 10 15
Val Tyr Phe Asp Arg Glu Ala Ser Thr Glu Lys Ala Cys Arg Leu Ile
20 25 30
His Glu Ala Ala Gly Leu Gly Ala Thr Leu Ala Ala Phe Gly Glu Thr
35 40 45
Trp Leu Pro Gly Tyr Pro Phe Phe Val Trp Gly Phe Ala His Asn Arg
50 55 60
Ser Leu Phe Trp Gln Ala Ala Ala Glu Tyr Ile Ala Asn Ala Val Glu
65 70 75 80
Ile Pro Ser Pro Thr Thr Asp Arg Leu Cys Ala Ala Ala Lys Ala Ala
85 90 95
Gly Val Asp Val Val Ile Gly Val Val Glu Leu Asp Glu Arg Thr Arg
100 105 110
Ala Ser Val Tyr Ser Thr Leu Leu Phe Ile Gly Arg Asp Gly Thr Ile
115 120 125
Leu Gly Arg His Arg Lys Leu Lys Pro Thr His Met Glu Arg Thr Ile
130 135 140
Trp Gly Glu Gly Asp Ala Tyr Gly Leu Arg Val Tyr Glu Arg Ser Tyr
145 150 155 160
Gly Arg Leu Ser Gly Leu Asn Cys Trp Glu His Asn Met Met Leu Pro
165 170 175
Gly Tyr Val Leu Ala Ala Gln Gly Thr Gln Phe His Val Ala Ala Trp
180 185 190
Pro Gly Lys Glu Arg Leu Thr Val Pro Pro Asn Glu Ala Ala Tyr Thr
195 200 205
Arg Gln Leu Leu Leu Ser Arg Ala Tyr Ala Ser Gln Ala Gly Ala Tyr
210 215 220
Val Ile Ser Val Ala Gly Leu Leu Ala Pro Asp Ser Met Pro Glu Arg
225 230 235 240
Tyr Arg Glu Leu Gly Arg Ser Tyr Glu Leu Thr Gly Asp Ser Val Ile
245 250 255
Val Asp Pro Arg Gly Glu Val Ile Ala Gly Pro Ala Lys Gly Glu Thr
260 265 270
Ile Leu Leu Ala Gln Cys Ser Gln Glu Ala Leu Leu Ala Ala Lys Ser
275 280 285
Ala Ile Asp Leu Gly Gly His Tyr Ser Arg Pro Asp Ile Phe Gln Leu
290 295 300
Arg Val Asn Asp Gln Leu Gln His Arg Val Arg Arg Val Glu Pro His
305 310 315 320
Phe Thr Ala Ala Ile Gly His Ile Gly Ala Glu Arg Arg Ser Gln Glu
325 330 335
Asp Gly Thr Gly Pro Phe Asp Leu Ala Glu Ser Leu Thr Asn Ser
340 345 350
<210>375
<211>939
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>375
atgaccaaaa tcgctgtcat tcaagaaccc ccggtctatc tgaatctgag caaatcgatg 60
gaccgagcgg tcgacttgat tgccgatgcc gcgagccagg ggtgtcagtt gattgtgttt 120
cccgaagcct ggcttgcagg ttaccccacc ttcgtctggc gtcttgcgcc gggcagcgga 180
atgggaaaaa ccgatgagct ttacgcgcgt ttgctcgcca actcggtcga ccgcagcaaa 240
gaggggcttc ggcctttgca ggaggctgca aaagagcatg gcgttgtcat tgttctgggt 300
tatcaagagg tggacggctc gggcagcagc agcacaatct tcaacagctg cgcgattatt 360
gatgccgacg ggcgactggc caacaaccat cgcaagttga tgcccaccaa tgcggagcgg 420
atggtgtggg ggtttggcga cggttcgggc ctgaacgttg ttgacaccgc ggtgggcagg 480
atcggcacgc tgatttgctg ggaaaactac atgcccttgg cgcgctacgc gctgtatgcc 540
caaaacatcg aaatctatgt ggcgccgacc tgggacagcg gtgccatgtg gcaagccacc 600
ctgcaacata tcgcacgtga aggtggctgc tgggtcatcg gatgtgcaac ctcgctgcaa 660
gcctctgaca tcccggacga ccttccccat cgggatgagt tattcccgaa caaagacgaa 720
tgggtgaacc ctggcgatgc ggtggtttac aaaccttttg gcggccttgt ggccggcccc 780
atgcatcagg aaaaggggct tctcatcgca gagttggacg tcgccgctgt tcaggtctca 840
cgtcggaagt tcgatgcgac cgggcattac gctcgccccg atgtcttcca actgcacgtg 900
aatcgcagcg cgatgcggcc ggttgagttc acgaattag 939
<210>376
<211>312
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>376
Met Thr Lys Ile Ala Val Ile Gln Glu Pro Pro Val Tyr Leu Asn Leu
1 5 10 15
Ser Lys Ser Met Asp Arg Ala Val Asp Leu Ile Ala Asp Ala Ala Ser
20 25 30
Gln Gly Cys Gln Leu Ile Val Phe Pro Glu Ala Trp Leu Ala Gly Tyr
35 40 45
Pro Thr Phe Val Trp Arg Leu Ala Pro Gly Ser Gly Met Gly Lys Thr
50 55 60
Asp Glu Leu Tyr Ala Arg Leu Leu Ala Asn Ser Val Asp Arg Ser Lys
65 70 75 80
Glu Gly Leu Arg Pro Leu Gln Glu Ala Ala Lys Glu His Gly Val Val
85 90 95
Ile Val Leu Gly Tyr Gln Glu Val Asp Gly Ser Gly Ser Ser Ser Thr
100 105 110
Ile Phe Asn Ser Cys Ala Ile Ile Asp Ala Asp Gly Arg Leu Ala Asn
115 120 125
Asn His Arg Lys Leu Met Pro Thr Asn Ala Glu Arg Met Val Trp Gly
130 135 140
Phe Gly Asp Gly Ser Gly Leu Asn Val Val Asp Thr Ala Val Gly Arg
145 150 155 160
Ile Gly Thr Leu Ile Cys Trp Glu Asn Tyr Met Pro Leu Ala Arg Tyr
165 170 175
Ala Leu Tyr Ala Gln Asn Ile Glu Ile Tyr Val Ala Pro Thr Trp Asp
180 185 190
Ser Gly Ala Met Trp Gln Ala Thr Leu Gln His Ile Ala Arg Glu Gly
195 200 205
Gly Cys Trp Val Ile Gly Cys Ala Thr Ser Leu Gln Ala Ser Asp Ile
210 215 220
Pro Asp Asp Leu Pro His Arg Asp Glu Leu Phe Pro Asn Lys Asp Glu
225 230 235 240
Trp Val Asn Pro Gly Asp Ala Val Val Tyr Lys Pro Phe Gly Gly Leu
245 250 255
Val Ala Gly Pro Met His Gln Glu Lys Gly Leu Leu Ile Ala Glu Leu
260 265 270
Asp Val Ala Ala Val Gln Val Ser Arg Arg Lys Phe Asp Ala Thr Gly
275 280 285
His Tyr Ala Arg Pro Asp Val Phe Gln Leu His Val Asn Arg Ser Ala
290 295 300
Met Arg Pro Val Glu Phe Thr Asn
305 310
<210>377
<211>1050
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>377
atgtccagca ccacccatcc ccgcctgcgc gtcgccgcgg tgcaagccgc ccccgtcttc 60
ctcgacctcg acggcaccat cgacaagacc atcgacctga tggcccgggc ggccggccag 120
ggcgtgcagc tgatcgcctt tcccgagacc tgggtgcccg gctatccgtg gtggatctgg 180
ctcgattcgc cggcctgggg catgcagttc gtgcagcgct accacgacaa cgccctggtc 240
gtcggctcgc ccgagttcga ccgcattcgc gaggccgcgc gcaagcaccg catctgggtc 300
tcgctcggct acagcgagaa ggccgccggc agcctctaca tcgcccaggc gctgatcgac 360
gaccagggca acacgctgca gactcggcgc aagctcaagc cgacgcacgt ggagcgcacc 420
gtgttcggcg agggcgacgg atcggacctg agcgtggtcg agacggctat cggcaacatc 480
ggctcgctgt cgtgctggga gcacctgcag ccgctcagca agtacgcgat gtactcgcag 540
aacgagcaga tccattgcgg cgcctggccc agcttctcgc tctaccgcgg cggcgcctac 600
gcgctcggcg ccgaagtgaa caacgccgcc agccaggtgt acgcggccga gggccagtgc 660
ttcgtgatcg cgccctgcgc cacggtctcg aaggcgatgc acgaactgct gtgcaccgac 720
cctggcaagc agcagatgct gctggtcggc ggcggcttcg cgcgcatcta cggacccgac 780
ggatcgccgc tcggcaagaa cctggcagag gacgaggaag ggctggtggt ggccgacatc 840
gacctcggca tgatctccct ggccaaggcg gccggcgacc cggccggcca ctattcgcgg 900
cccgacgtga cgcagttgct gttcaacagg aagcggcgcg agccggtggt gctgcaaggc 960
ccggccgagc ccgagaaggc ggtcgccgag ccggtgtcca cgccgagcga agcggcggcg 1020
gccgccgccc gccagccggt cgtggcctga 1050
<210>378
<211>349
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>378
Met Ser Ser Thr Thr His Pro Arg Leu Arg Val Ala Ala Val Gln Ala
1 5 10 15
Ala Pro Val Phe Leu Asp Leu Asp Gly Thr Ile Asp Lys Thr Ile Asp
20 25 30
Leu Met Ala Arg Ala Ala Gly Gln Gly Val Gln Leu Ile Ala Phe Pro
35 40 45
Glu Thr Trp Val Pro Gly Tyr Pro Trp Trp Ile Trp Leu Asp Ser Pro
50 55 60
Ala Trp Gly Met Gln Phe Val Gln Arg Tyr His Asp Asn Ala Leu Val
65 70 75 80
Val Gly Ser Pro Glu Phe Asp Arg Ile Arg Glu Ala Ala Arg Lys His
85 90 95
Arg Ile Trp Val Ser Leu Gly Tyr Ser Glu Lys Ala Ala Gly Ser Leu
100 105 110
Tyr Ile Ala Gln Ala Leu Ile Asp Asp Gln Gly Asn Thr Leu Gln Thr
115 120 125
Arg Arg Lys Leu Lys Pro Thr His Val Glu Arg Thr Val Phe Gly Glu
130 135 140
Gly Asp Gly Ser Asp Leu Ser Val Val Glu Thr Ala Ile Gly Asn Ile
145 150 155 160
Gly Ser Leu Ser Cys Trp Glu His Leu Gln Pro Leu Ser Lys Tyr Ala
165 170 175
Met Tyr Ser Gln Asn Glu Gln Ile His Cys Gly Ala Trp Pro Ser Phe
180 185 190
Ser Leu Tyr Arg Gly Gly Ala Tyr Ala Leu Gly Ala Glu Val Asn Asn
195 200 205
Ala Ala Ser Gln Val Tyr Ala Ala Glu Gly Gln Cys Phe Val Ile Ala
210 215 220
Pro Cys Ala Thr Val Ser Lys Ala Met His Glu Leu Leu Cys Thr Asp
225 230 235 240
Pro Gly Lys Gln Gln Met Leu Leu Val Gly Gly Gly Phe Ala Arg Ile
245 250 255
Tyr Gly Pro Asp Gly Ser Pro Leu Gly Lys Asn Leu Ala Glu Asp Glu
260 265 270
Glu Gly Leu Val Val Ala Asp Ile Asp Leu Gly Met Ile Ser Leu Ala
275 280 285
Lys Ala Ala Gly Asp Pro Ala Gly His Tyr Ser Arg Pro Asp Val Thr
290 295 300
Gln Leu Leu Phe Asn Arg Lys Arg Arg Glu Pro Val Val Leu Gln Gly
305 310 315 320
Pro Ala Glu Pro Glu Lys Ala Val Ala Glu Pro Val Ser Thr Pro Ser
325 330 335
Glu Ala Ala Ala Ala Ala Ala Arg Gln Pro Val Val Ala
340 345
<210>379
<211>936
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>379
atggctaaaa tagcgattgt acaaaaggcg tctgttacct tgaataaaca agaaactgtc 60
gccagtgctg taagagaggt agagctggca gcggcggaag gtgccgagct ggtagtgttt 120
actgaggcat ttattgctgg gtatccggcc tggatctggc gtttgcggcc aggtggtgac 180
tgggggctat ctgaagatct tcattcccgt ttgttgacaa gcgccgtaga cctgggtggt 240
gatgacctgg atccacttta tgccgcagct aaagaaaata acgtgacgat agtgtgcggt 300
attaatgaac gtgataaccg gctcagtaag gcaacgctat ataattctat cgttattatt 360
ggttccgatg gttcattgtt aaatcgacat cgtaagttga tgccgacgaa tccggagaga 420
atggtatggg gctttggtga tgcctctggt ctgaaggtcg ttgatacccc cgttggtcgt 480
gttggtacgc ttgtctgttg ggaaaactat atgcccttgg ccagatatgc gttgtattcg 540
cagggggtag aggtttatat tgcgccgacc tacgatagcg gtgatgactg gatttctaca 600
ttacagcata ttgccaggga gggtcgttgt tgggttgttg gctgtggcaa tctattgcgt 660
ggcagcgata taccggatga cttccctgag aagttggcgt tatacccaga tgaggatgag 720
tggataaatc ctggggattc cgttgtgatt gcacctgggg gtaaaatcat ggccgggcca 780
ttgcgccagg aggcggggat tgtctattgt gatattgcgt ctgaaagtgc cagtcaggca 840
aaacgtgcgc tggatgtggc tggacattat tcccggcctg atatctttga gttgcatgtg 900
aatacgaagg tgcagacccc ggttgtatat gattag 936
<210>380
<211>311
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>380
Met Ala Lys Ile Ala Ile Val Gln Lys Ala Ser Val Thr Leu Asn Lys
1 5 10 15
Gln Glu Thr Val Ala Ser Ala Val Arg Glu Val Glu Leu Ala Ala Ala
20 25 30
Glu Gly Ala Glu Leu Val Val Phe Thr Glu Ala Phe Ile Ala Gly Tyr
35 40 45
Pro Ala Trp Ile Trp Arg Leu Arg Pro Gly Gly Asp Trp Gly Leu Ser
50 55 60
Glu Asp Leu His Ser Arg Leu Leu Thr Ser Ala Val Asp Leu Gly Gly
65 70 75 80
Asp Asp Leu Asp Pro Leu Tyr Ala Ala Ala Lys Glu Asn Asn Val Thr
85 90 95
Ile Val Cys Gly Ile Asn Glu Arg Asp Asn Arg Leu Ser Lys Ala Thr
100 105 110
Leu Tyr Asn Ser Ile Val Ile Ile Gly Ser Asp Gly Ser Leu Leu Asn
115 120 125
Arg His Arg Lys Leu Met Pro Thr Asn Pro Glu Arg Met Val Trp Gly
130 135 140
Phe Gly Asp Ala Ser Gly Leu Lys Val Val Asp Thr Pro Val Gly Arg
145 150 155 160
Val Gly Thr Leu Val Cys Trp Glu Asn Tyr Met Pro Leu Ala Arg Tyr
165 170 175
Ala Leu Tyr Ser Gln Gly Val Glu Val Tyr Ile Ala Pro Thr Tyr Asp
180 185 190
Ser Gly Asp Asp Trp Ile Ser Thr Leu Gln His Ile Ala Arg Glu Gly
195 200 205
Arg Cys Trp Val Val Gly Cys Gly Asn Leu Leu Arg Gly Ser Asp Ile
210 215 220
Pro Asp Asp Phe Pro Glu Lys Leu Ala Leu Tyr Pro Asp Glu Asp Glu
225 230 235 240
Trp Ile Asn Pro Gly Asp Ser Val Val Ile Ala Pro Gly Gly Lys Ile
245 250 255
Met Ala Gly Pro Leu Arg Gln Glu Ala Gly Ile Val Tyr Cys Asp Ile
260 265 270
Ala Ser Glu Ser Ala Ser Gln Ala Lys Arg Ala Leu Asp Val Ala Gly
275 280 285
His Tyr Ser Arg Pro Asp Ile Phe Glu Leu His Val Asn Thr Lys Val
290 295 300
Gln Thr Pro Val Val Tyr Asp
305 310
<210>381
<211>945
<212>DNA
<213>Clostridium acetobutylicum ATCC 3625
<400>381
atgggcgaat tcggtgaagt taccctgggc gtggctcagg ctgctcccgt gtactttgac 60
cgcgaggcct cgaccgagaa agcctgtggc ctgatccgcg aggcgggcga aaagggtgta 120
gatcttctgg cgttcggtga aacgtggtta accggctatc catactggaa agatgcgcct 180
tggtctcgcg aatacaacga cttgcgtgca cgttatgttg cgaacggtgt gatgattcct 240
ggtccggaaa cggacgctct gtgccaagca gccgcggaag caggtgtgga tgtggcgatc 300
ggagtagtag aactggagcc gggctctctt tcctcggttt attgcactct gttatttata 360
agccgcgagg gcgagatcct gggtcgtcac cgcaaactga aaccgaccga tagcgaacgt 420
cgttactggt ctgaaggcga cgcgactggt ctgcgcgttt atgagcgccc atatggtcgg 480
cttagcggcc tgaattgctg ggagcacact atgatgctgc cggggtacgc cctggcggcg 540
cagggcaccc agttccatgt ggccgcttgg ccaaacatgg catcctcgaa ttctgaactt 600
ctgtctcgtg cctacgctat gcaggcgggc tgctacgttt tatgcgcggg tggcctgggc 660
ccggccccag gtgaactgcc ggatggtatc gcggcggaaa gtttagatca cttgactgga 720
gagtcatgta tcatcgaccc gtgggggaaa gtaattgctg gtccggtgtc ttgcgaggaa 780
acccttatca cggctcgcgt tagcaccgca tctatttatc gccgcaaaag tttgacggac 840
gtgggcggtc attatagccg cccggatgtt ttccgttttg aagtcgatcg ctctgagcgt 900
ccccgtgtcg tgtttcgcga tggtgacgtg gatgaccgag gttaa 945
<210>382
<211>314
<212>PRT
<213>Clostridium acetobutylicum ATCC 3625
<400>382
Met Gly Glu Phe Gly Glu Val Thr Leu Gly Val Ala Gln Ala Ala Pro
1 5 10 15
Val Tyr Phe Asp Arg Glu Ala Ser Thr Glu Lys Ala Cys Gly Leu Ile
20 25 30
Arg Glu Ala Gly Glu Lys Gly Val Asp Leu Leu Ala Phe Gly Glu Thr
35 40 45
Trp Leu Thr Gly Tyr Pro Tyr Trp Lys Asp Ala Pro Trp Ser Arg Glu
50 55 60
Tyr Asn Asp Leu Arg Ala Arg Tyr Val Ala Asn Gly Val Met Ile Pro
65 70 75 80
Gly Pro Glu Thr Asp Ala Leu Cys Gln Ala Ala Ala Glu Ala Gly Val
85 90 95
Asp Val Ala Ile Gly Val Val Glu Leu Glu Pro Gly Ser Leu Ser Ser
100 105 110
Val Tyr Cys Thr Leu Leu Phe Ile Ser Arg Glu Gly Glu Ile Leu Gly
115 120 125
Arg His Arg Lys Leu Lys Pro Thr Asp Ser Glu Arg Arg Tyr Trp Ser
130 135 140
Glu Gly Asp Ala Thr Gly Leu Arg Val Tyr Glu Arg Pro Tyr Gly Arg
145 150 155 160
Leu Ser Gly Leu Asn Cys Trp Glu His Thr Met Met Leu Pro Gly Tyr
165 170 175
Ala Leu Ala Ala Gln Gly Thr Gln Phe His Val Ala Ala Trp Pro Asn
180 185 190
Met Ala Ser Ser Asn Ser Glu Leu Leu Ser Arg Ala Tyr Ala Met Gln
195 200 205
Ala Gly Cys Tyr Val Leu Cys Ala Gly Gly Leu Gly Pro Ala Pro Gly
210 215 220
Glu Leu Pro Asp Gly Ile Ala Ala Glu Ser Leu Asp His Leu Thr Gly
225 230 235 240
Glu Ser Cys Ile Ile Asp Pro Trp Gly Lys Val Ile Ala Gly Pro Val
245 250 255
Ser Cys Glu Glu Thr Leu Ile Thr Ala Arg Val Ser Thr Ala Ser Ile
260 265 270
Tyr Arg Arg Lys Ser Leu Thr Asp Val Gly Gly His Tyr Ser Arg Pro
275 280 285
Asp Val Phe Arg Phe Glu Val Asp Arg Ser Glu Arg Pro Arg Val Val
290 295 300
Phe Arg Asp Gly Asp Val Asp Asp Arg Gly
305 310
<210>383
<211>1041
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>383
atgtcggagc ccatgacgaa gtatcgcggc gcggcggtgc aggccgcgcc ggtgttcctc 60
gatctcgacc gcacagtcga gaaagcgatc ggcctgatcg agcaggcggc caagcaggac 120
gtgcgcctga tcgcattccc agagacttgg attcccggct atcccttttg gatatggctg 180
ggcgcgccgg cttggggcat gcgcttcgtc cagcgctatt tcgagaattc gctcgtgcgc 240
ggcagcaagc agtggcaggc cctggcggat gcggcccgcc gccacggcat gcatgtcgtg 300
gccggctata gcgagcgcgc gggcggcagc ctctatatgg gccaggcgat cttcggcccc 360
gatggcgatc tgatcgccgc gcgccgcaag ctcaagccta cccatgcgga gcgcaccgtg 420
ttcggcgagg gagacggcag ccatctcgcg gtgcacgata ccgccatcgg gcgcctcggc 480
gcgctctgtt gctgggagca catccagcca ttgtcgaaat acgccatgta cgccgccgac 540
gaacaggtcc acgtcgcgtc gtggccgagc ttcagcctct atcgcggcat ggcctatgcg 600
ctcggaccgg aggtcaatac cgccgcaagc cagatctacg cggtcgaggg cggctgctac 660
gtgctggcgt cgtgcgcgac cgtttcgccg gagatgatca aggtattggt ggatacgccc 720
gacaaggaga tgttcctcaa ggccggcggc ggttttgcca tgattttcgg gcccgacggc 780
cgcgccctgg ccgagccgct cccggagacc gaagagggac tgctggtcgc cgatatcgac 840
ctcggcatga tcgcgttggc caaggcggcg gccgatccgg cgggccacta ttcacggccc 900
gacgtaacgc ggctgctgct ggatcgacgt ccggcccaac gcgtcgtcac gcttgatgcc 960
gcattcgaac cgcaaaacga ggacaagggc gacgcgcccg cgctgcgcgt ggtggcggaa 1020
agcgccgccg ccgcgcagta g 1041
<210>384
<211>346
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>384
Met Ser Glu Pro Met Thr Lys Tyr Arg Gly Ala Ala Val Gln Ala Ala
1 5 10 15
Pro Val Phe Leu Asp Leu Asp Arg Thr Val Glu Lys Ala Ile Gly Leu
20 25 30
Ile Glu Gln Ala Ala Lys Gln Asp Val Arg Leu Ile Ala Phe Pro Glu
35 40 45
Thr Trp Ile Pro Gly Tyr Pro Phe Trp Ile Trp Leu Gly Ala Pro Ala
50 55 60
Trp Gly Met Arg Phe Val Gln Arg Tyr Phe Glu Asn Ser Leu Val Arg
65 70 75 80
Gly Ser Lys Gln Trp Gln Ala Leu Ala Asp Ala Ala Arg Arg His Gly
85 90 95
Met His Val Val Ala Gly Tyr Ser Glu Arg Ala Gly Gly Ser Leu Tyr
100 105 110
Met Gly Gln Ala Ile Phe Gly Pro Asp Gly Asp Leu Ile Ala Ala Arg
115 120 125
Arg Lys Leu Lys Pro Thr His Ala Glu Arg Thr Val Phe Gly Glu Gly
130 135 140
Asp Gly Ser His Leu Ala Val His Asp Thr Ala Ile Gly Arg Leu Gly
145 150 155 160
Ala Leu Cys Cys Trp Glu His Ile Gln Pro Leu Ser Lys Tyr Ala Met
165 170 175
Tyr Ala Ala Asp Glu Gln Val His Val Ala Ser Trp Pro Ser Phe Ser
180 185 190
Leu Tyr Arg Gly Met Ala Tyr Ala Leu Gly Pro Glu Val Asn Thr Ala
195 200 205
Ala Ser Gln Ile Tyr Ala Val Glu Gly Gly Cys Tyr Val Leu Ala Ser
210 215 220
Cys Ala Thr Val Ser Pro Glu Met Ile Lys Val Leu Val Asp Thr Pro
225 230 235 240
Asp Lys Glu Met Phe Leu Lys Ala Gly Gly Gly Phe Ala Met Ile Phe
245 250 255
Gly Pro Asp Gly Arg Ala Leu Ala Glu Pro Leu Pro Glu Thr Glu Glu
260 265 270
Gly Leu Leu Val Ala Asp Ile Asp Leu Gly Met Ile Ala Leu Ala Lys
275 280 285
Ala Ala Ala Asp Pro Ala Gly His Tyr Ser Arg Pro Asp Val Thr Arg
290 295 300
Leu Leu Leu Asp Arg Arg Pro Ala Gln Arg Val Val Thr Leu Asp Ala
305 310 315 320
Ala Phe Glu Pro Gln Asn Glu Asp Lys Gly Asp Ala Pro Ala Leu Arg
325 330 335
Val Val Ala Glu Ser Ala Ala Ala Ala Gln
340 345
<210>385
<211>1014
<212>DNA
<213>未知
<220>
<223>从环境样品获得
<400>385
atgaaagaag ctatcaaggt cgcctgcgtg caagccgccc cgatctacat ggatttgaag 60
gcgacggtgg acaaaaccat tgagttgatg gaagaagcag cacgtaataa tgctcgtctg 120
atcgcctttc cggaaacttg gattccaggc tacccatggt ttctttggct tgactcacca 180
gcatgggcaa tgcaatttgt acgccaatac catgagaact cattggagtt ggatggccct 240
caagctaagc gcatttcaga tgcagccaag cggttgggaa tcatggtcac cctggggatg 300
agtgaacggg tcggtggcac cctttacatc agtcagtggt tcataggcga taatggtgac 360
accattgggg cccggcgaaa gttgaaacct acttttgttg aacgtacttt gttcggcgaa 420
ggggatggtt catcgctagc ggttttcgag acgtctgttg gaaggctggg tggcttatgc 480
tgttgggagc accttcaacc gctaacaaaa tacgctttgt atgcacaaaa tgaagagatt 540
cattgtgcgg cttggccgag ctttagcctt tatcctaatg cggcgaaagc cctggggcct 600
gatgtcaatg tagcggcctc tcgaatctat gccgttgaag ggcaatgctt cgtactagcg 660
tcgtgtgcgc tcgtttcaca atccatgatc gatatgcttt gtacagatga cgaaaagcat 720
gcgttgcttc tggctggtgg tggacactca cgtatcatag ggcctgatgg tggtgacttg 780
gtcgcgcctc ttgccgaaaa tgaagagggt attctctacg caaaccttga tcctggagta 840
cgcatccttg ctaaaatggc ggcagaccct gctggtcatt attcccgtcc cgacattact 900
cgcttgctaa tagatcgcag ccctaaatta ccggtagttg aaattgaagg tgatcttcgt 960
ccttacgctt tgggtaaagc gtctgagacg ggtgcgcaac tcgaagaaat ttga 1014
<210>386
<211>337
<212>PRT
<213>未知
<220>
<223>从环境样品获得
<400>386
Met Lys Glu Ala Ile Lys Val Ala Cys Val Gln Ala Ala Pro Ile Tyr
1 5 10 15
Met Asp Leu Lys Ala Thr Val Asp Lys Thr Ile Glu Leu Met Glu Glu
20 25 30
Ala Ala Arg Asn Asn Ala Arg Leu Ile Ala Phe Pro Glu Thr Trp Ile
35 40 45
Pro Gly Tyr Pro Trp Phe Leu Trp Leu Asp Ser Pro Ala Trp Ala Met
50 55 60
Gln Phe Val Arg Gln Tyr His Glu Asn Ser Leu Glu Leu Asp Gly Pro
65 70 75 80
Gln Ala Lys Arg Ile Ser Asp Ala Ala Lys Arg Leu Gly Ile Met Val
85 90 95
Thr Leu Gly Met Ser Glu Arg Val Gly Gly Thr Leu Tyr Ile Ser Gln
100 105 110
Trp Phe Ile Gly Asp Asn Gly Asp Thr Ile Gly Ala Arg Arg Lys Leu
115 120 125
Lys Pro Thr Phe Val Glu Arg Thr Leu Phe Gly Glu Gly Asp Gly Ser
130 135 140
Ser Leu Ala Val Phe Glu Thr Ser Val Gly Arg Leu Gly Gly Leu Cys
145 150 155 160
Cys Trp Glu His Leu Gln Pro Leu Thr Lys Tyr Ala Leu Tyr Ala Gln
165 170 175
Asn Glu Glu Ile His Cys Ala Ala Trp Pro Ser Phe Ser Leu Tyr Pro
180 185 190
Asn Ala Ala Lys Ala Leu Gly Pro Asp Val Asn Val Ala Ala Ser Arg
195 200 205
Ile Tyr Ala Val Glu Gly Gln Cys Phe Val Leu Ala Ser Cys Ala Leu
210 215 220
Val Ser Gln Ser Met Ile Asp Met Leu Cys Thr Asp Asp Glu Lys His
225 230 235 240
Ala Leu Leu Leu Ala Gly Gly Gly His Ser Arg Ile Ile Gly Pro Asp
245 250 255
Gly Gly Asp Leu Val Ala Pro Leu Ala Glu Asn Glu Glu Gly Ile Leu
260 265 270
Tyr Ala Asn Leu Asp Pro Gly Val Arg Ile Leu Ala Lys Met Ala Ala
275 280 285
Asp Pro Ala Gly His Tyr Ser Arg Pro Asp Ile Thr Arg Leu Leu Ile
290 295 300
Asp Arg Ser Pro Lys Leu Pro Val Val Glu Ile Glu Gly Asp Leu Arg
305 310 315 320
Pro Tyr Ala Leu Gly Lys Ala Ser Glu Thr Gly Ala Gln Leu Glu Glu
325 330 335
Ile
Claims (65)
1.一种分离的或重组的核酸,所述分离的或重组的核酸包括核苷酸,所述核苷酸具有一个序列,所述序列与下述序列具有至少50%的同一性,SEQ ID NO:195,205,207,209或237,SEQ ID NO:195,205,207,209或237的变体,具有一个或多个突变:位点163-165 AAA,AAG,GGT,GGC,GGA,GGG,CAA或CAG;位点178-180 GAA或GAG;位点331-333 TCT,TCC,TCA,TCG,AGT或AGC;位点568-570 CAT,CAC,TCT,TCC,TCA,TCG,AGT,AGC,ACT,ACC,ACA,TCA,TAT,TAC,ATG或ACG;位点571-573 TTA,TTG,CTT,CTC,CTA,CTG,GTT,GTC,GTA,GTG,ATG,ACT,ACC,ACA,GAT,GAC,GGT,GGC,GGA,GGG,GAA,GAG,TAT,TAC或ACG;位点595-597GAA,GAG,TTA,TTG,CTT,CTC,CTA或CTG;位点646-666 TTA,TTG,CTT,CTC,CTA或CTG;或其任意组合,其片段,其中所述核酸或片段编码具有腈水解酶活性的多肽,或它们的补体。
2.如权利要求1所述的分离的或重组的核酸,其中所述核酸包括核苷酸,所述核苷酸具有一个序列,所述序列与如下序列是基本上相同的,SEQ ID NO:195,205,207,209或237,SEQ ID NO:195,205,207,209或237的变体,在如下位点具有一个或多个突变:位点163-165 AAA,AAG,GGT,GGC,GGA,GGG,CAA或CAG;位点178-180 GAA或GAG;位点331-333 TCT,TCC,TCA,TCG,AGT或AGC;位点568-570 CAT,CAC,TCT,TCC,TCA,TCG,AGT,AGC,ACT,ACC,ACA,TCA,TAT,TAC,ATG或ACG;位点571-573 TTA,TTG,CTT,CTC,CTA,CTG,GTT,GTC,GTA,GTG,ATG,ACT,ACC,ACA,GAT,GAC,GGT,GGC,GGA,GGG,GAA,GAG,TAT,TAC或ACG;位点595-597 GAA,GAG,TTA,TTG,CTT,CTC,CTA或CTG;位点646-666 TTA,TTG,CTT,CTC,CTA或CTG;或其任意组合,或它们的补体。
3.一种分离的或重组的核酸,其中所述核酸包括核苷酸,所述核苷酸具有一个与如下序列同一的序列,SEQ ID NO:195,205,207,209或237,具有腈水解酶活性的片段,或它们的补体。
4.一种分离的或重组的核酸,其中所述核酸包括核苷酸,所述核苷酸具有一个与如下序列同一的序列,SEQ ID NO:195,205,207,209或237的变体,在如下位点具有一个或多个突变:位点163-165 AAA,AAG,GGT,GGC,GGA,GGG,CAA或CAG;位点178-180 GAA或GAG;位点331-333 TCT,TCC,TCA,TCG,AGT或AGC;位点568-570 CAT,CAC,TCT,TCC,TCA,TCG,AGT,AGC,ACT,ACC,ACA,TCA,TAT,TAC,ATG或ACG;位点571-573 TTA,TTG,CTT,CTC,CTA,CTG,GTT,GTC,GTA,GTG,ATG,ACT,ACC,ACA,GAT,GAC,GGT,GGC,GGA,GGG,GAA,GAG,TAT,TAC或ACG;位点595-597GAA,GAG,TTA,TTG,CTT,CTC,CTA或CTG;位点646-666 TTA,TTG,CTT,CTC,CTA或CTG;或其任意组合,具有腈水解酶活性的片段,或它们的补体。
5.一种分离的或重组的核酸,所述核酸可以与如下序列杂交,SEQ ID NO:195,205,207,209或237,SEQ ID NO:195,205,207,209或237的变体,在如下位点具有一个或更多突变:位点163-165AAA,AAG,GGT,GGC,GGA,GGG,CAA或CAG;位点178-180GAA或GAG;位点331-333TCT,TCC,TCA,TCG,AGT或AGC;位点568-570CAT,CAC,TCT,TCC,TCA,TCG,AGT,AGC,ACT,ACC,ACA,TCA,TAT,TAC,ATG或ACG;位点571-573TTA,TTG,CTT,CTC,CTA,CTG,GTT,GTC,GTA,GTG,ATG,ACT,ACC,ACA,GAT,GAC,GGT,GGC,GGA,GGG,GAA,GAG,TAT,TAC或ACG;位点595-597GAA,GAG,TTA,TTG,CTT,CTC,CTA或CTG;位点646-666TTA,TTG,CTT,CTC,CTA或CTG;或其任意组合,具有腈水解酶活性的片段,或它们的补体。
6.如权利要求5所述的分离的或重组的核酸,其中严格杂交条件包括至少50%甲酰胺,和大约37℃到大约42℃的温度。
7.一种核酸探针,所述探针包括从大约15个核苷酸到大约50个核苷酸,其中,至少15个连续核苷酸是与在如下核酸序列之中的核酸靶区域具有至少50%的互补性,SEQ ID NO:195,205,207,209或237,SEQ ID NO:195,205,207,209或237的变体,在如下位点具有一个或更多突变:位点163-165AAA,AAG,GGT,GGC,GGA,GGG,CAA或CAG;位点178-180GAA或GAG;位点331-333TCT,TCC,TCA,TCG,AGT或AGC;位点568-570CAT,CAC,TCT,TCC,TCA,TCG,AGT,AGC,ACT,ACC,ACA,TCA,TAT,TAC,ATG或ACG;位点571-573TTA,TTG,CTT,CTC,CTA,CTG,GTT,GTC,GTA,GTG,ATG,ACT,ACC,ACA,GAT,GAC,GGT,GGC,GGA,GGG,GAA,GAG,TAT,TAC或ACG;位点595-597GAA,GAG,TTA,TTG,CTT,CTC,CTA或CTG;位点646-666TTA,TTG,CTT,CTC,CTA或CTG;或其任意组合,或它们的补体。
8.一种核酸探针,所述探针包括SEQ ID NO:195,205,207,209或237,SEQ ID NO:195,205,207,209或237的变体的核酸序列内的核酸靶区域的至少15个连续核苷酸,其中所述SEQ ID NO:195,205,207,209或237的变体具有一个或更多突变:位点163-165AAA,AAG,GGT,GGC,GGA,GGG,CAA或CAG;位点178-180GAA或GAG;位点331-333TCT,TCC,TCA,TCG,AGT或AGC;位点568-570CAT,CAC,TCT,TCC,TCA,TCG,AGT,AGC,ACT,ACC,ACA,TCA,TAT,TAC,ATG或ACG;位点571-573TTA,TTG,CTT,CTC,CTA,CTG,GTT,GTC,GTA,GTG,ATG,ACT,ACC,ACA,GAT,GAC,GGT,GGC,GGA,GGG,GAA,GAG,TAT,TAC或ACG;位点595-597GAA,GAG,TTA,TTG,CTT,CTC,CTA或CTG;位点646-666TTA,TTG,CTT,CTC,CTA或CTG;或其任意组合,或者它们的补体。
9.一种能在宿主细胞内复制的核酸载体,其中所述载体包括权利要求1到6,12或13中的任一项所述的核酸。
10.一种宿主细胞,所述宿主细胞包括权利要求1到6,12或13中的任一项所述的核酸。
11.一种宿主生物体,包括权利要求10的宿主细胞。
12.一种分离的或重组的核酸,所述核酸编码包括氨基酸的多肽,所述氨基酸具有一个与SEQ ID NO:196,206,208,210或238,或SEQ ID NO:196,206,208,210或238的变体具有至少50%同一性的序列,其中SEQ ID NO:196,206,208,210或238的变体具有一个或多个突变:在残基55处,赖氨酸、甘氨酸或谷氨酰胺;在残基60处,谷氨酸;在残基111处,丝氨酸;在残基190处,丝氨酸、组氨酸、酪氨酸或苏氨酸;在残基191处,亮氨酸、缬氨酸、蛋氨酸、天冬氨酸、甘氨酸、谷氨酸、酪氨酸或苏氨酸;在残基199处,谷氨酸或亮氨酸;在残基222处,亮氨酸;或其任意组合,编码多肽的片段,其中多肽具有腈水解酶活性,或它的补体。
13.一种分离的或重组的核酸,所述核酸编码包括氨基酸的多肽,所述氨基酸具有SEQ ID NO:196,206,208,210或238,或SEQ ID NO:196,206,208,210或238的变体的序列,其中SEQ ID NO:196,206,208,210或238的变体具有一个或多个突变:在残基55处,赖氨酸、甘氨酸或谷氨酰胺;在残基60处,谷氨酸;在残基111处,丝氨酸;在残基190处,丝氨酸、组氨酸、酪氨酸或苏氨酸;在残基191处,亮氨酸、缬氨酸、蛋氨酸、天冬氨酸、甘氨酸、谷氨酸、酪氨酸或苏氨酸;在残基199处,谷氨酸或亮氨酸;在残基222处,亮氨酸;或其任意组合,编码多肽的具有腈水解酶活性的片段,或它的补体。
14.如权利要求1到6,12或13中的任一项所述的分离的或重组的核酸,其中所述核酸被固定到固相支持体上。
15.如权利要求14所述的分离的或重组的核酸,其中固相支持体选自凝胶、树脂、聚合物、陶瓷制品、玻璃、微电极和其任意组合。
16.一种分离的或重组的多肽,所述多肽包括具有如下序列的氨基酸,所述序列与SEQ ID NO:196,206,208,210或238,或SEQ ID NO:196,206,208,210或238的变体具有至少50%同一性,SEQ ID NO:196,206,208,210或238的变体具有一个或多个突变:在残基55处,赖氨酸、甘氨酸或谷氨酰胺;在残基60处,谷氨酸;在残基111处,丝氨酸;在残基190处,丝氨酸、组氨酸、酪氨酸或苏氨酸;在残基191处,亮氨酸、缬氨酸、蛋氨酸、天冬氨酸、甘氨酸、谷氨酸、酪氨酸或苏氨酸;在残基199处,谷氨酸或亮氨酸;在残基222处,亮氨酸;或其任意组合,或其片段,其中所述多肽具有腈水解酶活性。
17.一种分离的或重组的多肽,所述多肽包括具有SEQ ID NO:196,206,208,210或238,或SEQ ID NO:196,206,208,210或238的变体的氨基酸,其中SEQID NO:196,206,208,210或238的变体具有一个或多个突变:在残基55处,赖氨酸、甘氨酸或谷氨酰胺;在残基60处,谷氨酸;在残基111处,丝氨酸;在残基190处,丝氨酸、组氨酸、酪氨酸或苏氨酸;在残基191处,亮氨酸、缬氨酸、蛋氨酸、天冬氨酸、甘氨酸、谷氨酸、酪氨酸或苏氨酸;在残基199处,谷氨酸或亮氨酸;在残基222处,亮氨酸;或其任意组合,或其片段,其中所述多肽具有腈水解酶活性。
18.如权利要求16或权利要求17中的任一项所述的分离的或重组的多肽,其中所述片段在长度上至少是20个氨基酸,并且其中所述片段具有腈水解酶活性。
19.如权利要求16或权利要求17所述的多肽的肽模拟体,或其片段,具有腈水解酶活性。
20.如权利要求16或权利要求17中的多肽的密码子最优化的多肽,或其片段,具有腈水解酶活性,其中密码子使用被最优化,以适合特定生物体或细胞。
21.如权利要求16或权利要求17所述的多肽,或其片段,或其具有腈水解酶活性的肽模拟体,其中多肽、片段或肽模拟体被固定到固相支持体上。
22.如权利要求21所述的多肽,其中固相支持体选自凝胶、树脂、聚合物、陶瓷制品、玻璃、微电极和其任意组合。
23.一种纯化的抗体,所述抗体特异性地与权利要求16或权利要求17所述的多肽或其具有腈水解酶活性的片段结合。
24.如权利要求23所述的抗体的片段,其中所述片段与具有腈水解酶活性的多肽特异性地结合。
25.一种酶制剂,所述制剂包括权利要求16或17中任一项所述多肽的至少一种,其中所述制剂是液体或干燥的。
26.如权利要求25所述的酶制剂,其中所述制剂被固定到固相支持体上。
27.一种组合物,包括权利要求1到6,12或13所述核酸的至少一种,或包括权利要求16或权利要求17所述的至少一种多肽或其片段,或其具有腈水解酶活性的肽模拟体,或其任意组合。
28.一种方法,用于将腈水解为羧酸,所述方法包括将分子与权利要求16或权利要求17所述的至少一种多肽,或其片段,或其肽模拟体接触,具有腈水解酶活性,条件是适合腈水解酶活性。
29.一种方法,用于水解分子的羟腈部分或氨基腈部分,所述方法包括将所述分子与权利要求16或权利要求17中的任一项所述的至少一种多肽或其片段,或其肽模拟体接触,具有腈水解酶活性,条件是适合腈水解酶活性。
30.一种方法,用于产生手性α-羟基酸分子或手性氨基酸分子,所述方法包括将具有羟腈部分或氨基酸部分的分子与至少一种多肽,所述多肽具有权利要求16或权利要求17所述的氨基酸序列,或其片段,或其肽模拟体混和,具有对映选择性腈水解酶活性。
31.一种方法,用于制备一种组合物及其中间产物,所述方法包括:将该组合物或中间产物的前体与权利要求16或权利要求17中任一项所述的至少一种具有腈水解酶活性的多肽、或其片段或其肽模拟体接触,其中所述前体包括羟腈部分或氨基腈部分,水解前体中的羟腈部分或氨基腈部分,从而制备该组合物或其中间产物。
32.一种方法,用于制备(R)-乙基4-氰基-3-羟基丁酸,所述方法包括将羟基戊二酰基腈与至少一种多肽接触,所述至少一种多肽由具有SEQ ID NO:195,205,207,209或237,或SEQ ID NO:195,205,207,209或237的变体的一个序列的核酸编码,其中SEQ ID NO:195,205,207,209或237的变体在如下位点具有一个或多个突变:位点163-165AAA,AAG,GGT,GGC,GGA,GGG,CAA或CAG;位点178-180GAA或GAG;位点331-333TCT,TCC,TCA,TCG,AGT或AGC;位点568-570CAT,CAC,TCT,TCC,TCA,TCG,AGT,AGC,ACT,ACC,ACA,TCA,TAT,TAC,ATG或ACG;位点571-573TTA,TTG,CTT,CTC,CTA,CTG,GTT,GTC,GTA,GTG,ATG,ACT,ACC,ACA,GAT,GAC,GGT,GGC,GGA,GGG,GAA,GAG,TAT,TAC或ACG;位点595-597GAA,GAG,TTA,TTG,CTT,CTC,CTA或CTG;位点646-666 TTA,TTG,CTT,CTC,CTA或CTG;或其任意组合,或其编码具有腈水解酶活性的多肽的片段,选择性地产生了(R)-对映异构体,从而制备(R)-乙基4-氰基-3-羟基丁酸。
33.一种方法,用于制备(S)-乙基4-氰基-3-羟基丁酸,所述方法包括将羟基戊二酰基腈与至少一种多肽接触,所述至少一种多肽具有SEQ ID NO:196,206,208,210或238,或SEQ ID NO:196,206,208,210或238的变体中的任一个序列的氨基酸序列,其中SEQ ID NO:196,206,208,210或238的变体具有一个或多个突变:在残基55处,赖氨酸、甘氨酸或谷氨酰胺;在残基60处,谷氨酸;在残基111处,丝氨酸;在残基190处,丝氨酸、组氨酸、酪氨酸或苏氨酸;在残基191处,亮氨酸、缬氨酸、蛋氨酸、天冬氨酸、甘氨酸、谷氨酸、酪氨酸或苏氨酸;在残基199处,谷氨酸或亮氨酸;在残基222处,亮氨酸;或其任意组合,或其具有腈水解酶活性的片段或肽模拟体,选择性地产生了(S)-对映异构体,从而制备(S)-乙基4-氰基-3-羟基丁酸。
34.一种方法,用于制备(R)-扁桃酸,所述方法包括将扁桃腈与至少一种多肽混和,所述至少一种多肽具有氨基酸序列,所述氨基酸序列是SEQ ID NO:196,206,208,210或238,或SEQ ID NO:196,206,208,210或238的变体中的任一个所述序列,其中SEQ ID NO:196,206,208,210或238的变体具有一个或多个突变:在残基55处,赖氨酸、甘氨酸或谷氨酰胺;在残基60处,谷氨酸;在残基111处,丝氨酸;在残基190处,丝氨酸、组氨酸、酪氨酸或苏氨酸;在残基191处,亮氨酸、缬氨酸、蛋氨酸、天冬氨酸、甘氨酸、谷氨酸、酪氨酸或苏氨酸;在残基199处,谷氨酸或亮氨酸;在残基222处,亮氨酸;或其任意组合,或其具有腈水解酶活性的片段或肽模拟体。
35.一种方法,用于制备(S)-扁桃酸,所述方法包括将扁桃腈与至少一种多肽混和,所述至少一种多肽具有氨基酸序列,所述氨基酸序列是SEQ ID NO:196,206,208,210或238,或SEQ ID NO:196,206,208,210或238的变体中的任一个序列,具有一个或多个突变:在残基55处,赖氨酸、甘氨酸或谷氨酰胺;在残基60处,谷氨酸;在残基111处,丝氨酸;在残基190,丝氨酸、组氨酸、酪氨酸或苏氨酸;在残基191处,亮氨酸、缬氨酸、蛋氨酸、天冬氨酸、甘氨酸、谷氨酸、酪氨酸或苏氨酸;在残基199处,谷氨酸或亮氨酸;在残基222处,亮氨酸;或其任意组合,或其具有腈水解酶活性的片段或肽模拟体。
36.一种方法,所述方法用于制备(S)-苯基乳酸衍生物或(R)-苯基乳酸衍生物,所述方法包括将苯基乳氰腈与至少一种多肽混和,所述至少一种多肽选自SEQ IDNO:196,206,208,210或238,或SEQ IDNO:196,206,208,210或238的变体,具有一个或多个突变:在残基55处,赖氨酸、甘氨酸或谷氨酰胺;在残基60处,谷氨酸;在残基111处,丝氨酸;在残基190处,丝氨酸、组氨酸、酪氨酸或苏氨酸;在残基191处,亮氨酸、缬氨酸、蛋氨酸、天冬氨酸、甘氨酸、谷氨酸、酪氨酸或苏氨酸;在残基199处,谷氨酸或亮氨酸;在残基222处,亮氨酸;或其任意组合,或其具有腈水解酶活性的任意片段或肽模拟体,或其任意活性片段或肽模拟体,选择性地产生了(S)-对映异构体或(R)-对映异构体,从而产生(S)-苯基乳酸衍生物或(R)-苯基乳酸衍生物。
37.一种方法,用于制备权利要求16或权利要求17所述的多肽或其片段,所述方法包括:
(a)将编码多肽的核酸在允许通过宿主细胞产生多肽的条件下引入宿主细胞,和
(b)回收如此所产生的多肽。
38.一种方法,用于产生编码具有腈水解酶活性的多肽的核酸变体,其中所述变体具有相对于天然发生的生物活性而发生改变的生物活性,所述方法包括:
(a)通过如下步骤修饰权利要求1到6,12,或13中任一项所述的核酸
(i)用一个不同的核苷酸取代一个或多个核苷酸,其中所述核苷酸包括天然或非天然核苷酸,
(ii)删除一个或多个核苷酸,
(iii)添加一个或多个核苷酸,或
(iv)其任意组合。
39.一种方法,用于从两个或多个核酸产生一个多核苷酸,所述方法包括:
(a)鉴别两个或多个核酸之间的同一性区域和多样性区域,其中所述核酸中的至少一个包括权利要求1到6,12,或13中任一项所述的一个核酸;
(b)提供一组寡核苷酸,所述寡核苷酸在序列上与两个或多个核酸中的至少两个相应;和
(c)用聚合酶延伸寡核苷酸,从而产生多核苷酸。
40.一种筛选测定法,用于鉴别腈水解酶,所述测定法包括:
(a)提供多个核酸或多肽,包括权利要求1到6,12,或13中任一项所述的核酸中的至少一个核酸,或权利要求16或权利要求17所述的多肽的至少一个多肽,或其片段;
(b)从所述多个中,得到将被用来测试腈水解酶活性的多肽候选者;
(c)测试候选者的腈水解酶活性;和
(d)鉴别那些是腈水解酶的多肽候选者。
41.一个试剂盒,所述试剂盒包括(a)权利要求1到6,12,或13中任一项权利要求所述的核酸,或其编码具有腈水解酶活性的多肽的片段,或(b)权利要求16或权利要求17中任一项权利要求所述的多肽,或其片段,或其具有腈水解酶活性的肽模拟体,或其组合;和(c)缓冲剂。
42.一种方法,用于修饰分析,所述方法包括:
(a)将权利要求16或权利要求17中任一项权利要求所述的多肽,或其片段,或其具有腈水解酶活性的肽模拟体,与起始分子混和,以产生反应混合物;
(b)将起始分子与多肽反应,以产生修饰的分子。
43.一种方法,用于鉴别修饰的化合物,所述方法包括:
(a)将权利要求16或权利要求17中任一项权利要求所述的多肽,或其片段,或其具有腈水解酶活性的肽模拟体,与起始化合物混和,以产生反应混合物,随后产生修饰的起始化合物的文库;
(b)测试所产生的文库,以确定文库中是否存在表现出期望活性的修饰的起始化合物;
(c)鉴别表现出期望活性的修饰化合物。
44.一种计算机可读的介质,其上已经存储了至少一个选自如下的核苷酸序列:SEQ ID NO:1,3,5,7,9,11,13,15,17,19,21,23,25,27,29,31,35,37,39,41,43,45,47,49,51,53,55,57,59,61,63,65,67,69,71,73,75,77,79,81,83,85,87,89,91,93,95,97,99,101,103,105,107,109,111,113,115,117,119,121,123,125,127,129,131,133,135,137,139,141,143,145,147,149,151,153,155,157,159,161,163,165,167,169,171,173,175,177,179,181,183,185,187,189,191,193,195,197,199,201,203,205,207,209,211,213,215,217,219,221,223,225,227,229,231,233,235,237,239,241,243,245,247,249,251,253,255,257,259,261,263,265,267,269,271,273,275,277,279,281,283,285,287,289,291,293,295,297,299,301,303,305,307,309,311,313,315,317,319,321,323,325,327,329,331,333,335,337,339,341,343,345,347,349,351,353,355,357,359,361,363,365,367,369,371,373,375,377,379,381,383,385,或其变体,和/或至少一个选自如下序列的氨基酸序列:SEQ ID NO:2,4,6,8,10,12,14,16,18,20,22,24,26,28,30,32,34,36,38,40,42,44,46,48,50,52,54,56,58,60,62,64,66,68,70,72,74,76,78,80,82,84,86,88,90,92,94,96,98,100,102,104,106,108,110,112,114,116,118,120,122,124,126,128,130,132,134,136,138,140,142,144,146,148,150,152,154,156,158,160,162,164,166,168,170,172,174,176,178,180,182,184,186,188,190,192,194,196,198,200,202,204,206,208,210,212,214,216,218,220,222,224,226,228,230,232,234,236,238,240,242,244,246,248,250,252,254,256,258,260,262,264,266,268,270,272,274,276,278,280,282,284,286,288,290,292,294,296,298,300,302,304,306,308,310,312,314,316,318,320,322,324,326,328,330,332,334,336,338,340,342,344,346,348,350,352,354,356,358,360,362,364,366,368,370,372,374,376,378,380,382,384和386,及其变体。
45.一种计算机系统,所述系统包括一个处理器和一个数据存储设备,其中所述数据存储设备上已经存储了至少一个选自如下的核苷酸序列:SEQ ID NO:1,3,5,7,9,11,13,15,17,19,21,23,25,27,29,31,35,37,39,41,43,45,47,49,51,53,55,57,59,61,63,65,67,69,71,73,75,77,79,81,83,85,87,89,91,93,95,97,99,101,103,105,107,109,111,113,115,117,119,121,123,125,127,129,131,133,135,137,139,141,143,145,147,149,151,153,155,157,159,161,163,165,167,169,171,173,175,177,179,181,183,185,187,189,191,193,195,197,199,201,203,205,207,209,211,213,215,217,219,221,223,225,227,229,231,233,235,237,239,241,243,245,247,249,251,253,255,257,259,261,263,265,267,269,271,273,275,277,279,281,283,285,287,289,291,293,295,297,299,301,303,305,307,309,311,313,315,317,319,321,323,325,327,329,331,333,335,337,339,341,343,345,347,349,351,353,355,357,359,361,363,365,367,369,371,373,375,377,379,381,383,385,和其变体,和/或至少一个选自如下序列的氨基酸序列:SEQ ID NO:2,4,6,8,10,12,14,16,18,20,22,24,26,28,30,32,34,36,38,40,42,44,46,48,50,52,54,56,58,60,62,64,66,68,70,72,74,76,78,80,82,84,86,88,90,92,94,96,98,100,102,104,106,108,110,112,114,116,118,120,122,124,126,128,130,132,134,136,138,140,142,144,146,148,150,152,154,156,158,160,162,164,166,168,170,172,174,176,178,180,182,184,186,188,190,192,194,196,198,200,202,204,206,208,210,212,214,216,218,220,222,224,226,228,230,232,234,236,238,240,242,244,246,248,250,252,254,256,258,260,262,264,266,268,270,272,274,276,278,280,282,284,286,288,290,292,294,296,298,300,302,304,306,308,310,312,314,316,318,320,322,324,326,328,330,332,334,336,338,340,342,344,346,348,350,352,354,356,358,360,362,364,366,368,370,372,374,376,378,380,382,384,386及其变体。
46.一种方法,用于鉴别序列中的特征,所述方法包括:
(a)将序列输入到计算机中;
(b)在计算机上运行序列特征识别程序,以便在序列中识别特征;和
(c)识别序列中的特征,
其中所述序列包括SEQ ID NO:1-386,变体,或其任意组合。
47.一种测定方法,用于识别多肽的功能片段,所述方法包括:
(a)获得权利要求16或权利要求17所述的至少一个多肽的片段;
(b)将步骤(a)的至少一个片段与具有一个羟腈部分或氨基腈部分的底物接触,条件是适合腈水解酶活性;
(c)测量从步骤(b)中至少一个片段中的每一个所产生的反应产物的量;和
(d)识别能产生腈水解酶反应产物的至少一个片段;从而识别多肽的功能片段。
48.一种测定方法,用于识别多肽的功能变体,所述方法包括:
(a)获得权利要求16或权利要求17的至少一个多肽的至少一个变体;
(b)将步骤(a)的至少一个变体与具有一个羟腈部分或氨基腈部分的底物接触,条件是适合腈水解酶活性;
(c)测量步骤(b)的至少一个变体中的每一个所产生的反应产物的量;和
(d)识别能产生腈水解酶反应产物的至少一个变体;从而识别多肽的功能变体。
49.一种测定方法,用于筛选对映选择性转化,所述测定方法包括:
(a)标记分中子的两个前手性或对映性部分中的一个;
(b)通过选择性催化剂修饰两个部分中的至少一个,从而产生产物;和
(c)通过质谱分析法测定所得产物。
50.如权利要求49所述的测定方法,其中所述标记物是重同位素或轻同位素。
51.如权利要求49所述的测定方法,其中所述选择性催化剂是酶。
52.如权利要求49所述的测定方法,其中所述使用质谱学技术是通过阳性模式或阴性模式。
53.如权利要求49所述的测定方法,其中所述分析或者是亲本质谱或者是断裂质谱。
54.如权利要求49所述的测定方法,其中所述所述测定方法可以被用来监控或确定对映体过量百分比,或非对映体过量百分比。
55.一种具有腈水解酶活性的分离的或重组的多肽,所述多肽包括一个在SEQ ID NO:196,206,208,210或238中所示的序列,在如下残基处具有一个或多个突变:残基55赖氨酸,残基55甘氨酸,残基55谷氨酰胺,残基60谷氨酸,残基111丝氨酸,残基190,残基190丝氨酸,残基190组氨酸,残基190酪氨酸,残基190苏氨酸,残基191亮氨酸,残基191缬氨酸,残基191蛋氨酸,残基191天冬氨酸,残基191甘氨酸,残基191谷氨酸,残基191酪氨酸,残基191苏氨酸,残基199谷氨酸,残基199亮氨酸,残基222亮氨酸;和它们的任意组合。
56.一种分离的或重组的多肽,具有腈水解酶活性,所述多肽包括一个在SEQID NO:196,206,208,210或238中所示的序列,在残基190或等效残基位置具有一个突变,其中丙氨酸被结合氢的氨基酸或肽模拟体残基替换。
57.一种分离的或重组的多肽,具有腈水解酶活性,所述多肽包括一个在SEQID NO:196,206,208,210或238中所示的序列,在残基190或等效残基位置具有一个突变,其中丙氨酸被疏水氨基酸或肽模拟体残基替换。
58.一种分离的或重组的腈水解酶,所述腈水解酶在SEQ ID NO:196,206,208,210或238的如下位置具有一个或多个突变的等价物:在残基55处,赖氨酸、甘氨酸或谷氨酰胺;在残基60处,谷氨酸;在残基111处,丝氨酸;在残基190处,丝氨酸、组氨酸、酪氨酸或苏氨酸;在残基191处,亮氨酸、缬氨酸、蛋氨酸、天冬氨酸、甘氨酸、谷氨酸、酪氨酸或苏氨酸;在残基199处,谷氨酸或亮氨酸;在残基222处,亮氨酸。
59.一种扩增引物对,用于扩增编码具有腈水解酶活性的多肽的核酸,其中所述引物对能扩增包括权利要求1中所述的一个序列,或其子序列的核酸。
60.如权利要求59所述的扩增引物对,其中扩增引物序列对的一个成员包括寡核苷酸,所述寡核苷酸包括所述序列的至少大约10个到50个连续碱基,或所述序列的大约12、13、14、15、16、17、18、19、20、21、22、23、24、25、26、27、28、29、30或多个连续碱基。
61.一种编码腈水解酶的核酸,其中所述核酸是通过使用如权利要求59中所述的扩增引物对扩增多核苷酸产生的。
62.如权利要求61的编码腈水解酶的核酸,其中所述扩增是通过聚合酶链式反应(PCR)。
63.如权利要求62的编码腈水解酶的核酸,其中所述核酸是通过扩增基因文库产生的。
64.如权利要求63的编码腈水解酶的核酸,其中所述基因文库是环境文库。
65.一种分离的或重组的腈水解酶,所述腈水解酶是由如权利要求61中所述的编码腈水解酶的核酸编码的。
Priority Applications (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201210217394.7A CN102796750B (zh) | 2002-05-15 | 2003-05-15 | 腈水解酶、编码腈水解酶的核酸,以及制备和使用它们的方法 |
CN201510514378.8A CN105296512A (zh) | 2002-05-15 | 2003-05-15 | 腈水解酶、编码腈水解酶的核酸,以及制备和使用它们的方法 |
Applications Claiming Priority (4)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US10/146,772 US7521216B2 (en) | 1999-12-29 | 2002-05-15 | Nitrilases and methods for making and using them |
US10/146,772 | 2002-05-15 | ||
US10/241,742 US20040002147A1 (en) | 1999-12-29 | 2002-09-09 | Nitrilases |
US10/241,742 | 2002-09-09 |
Related Child Applications (2)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201510514378.8A Division CN105296512A (zh) | 2002-05-15 | 2003-05-15 | 腈水解酶、编码腈水解酶的核酸,以及制备和使用它们的方法 |
CN201210217394.7A Division CN102796750B (zh) | 2002-05-15 | 2003-05-15 | 腈水解酶、编码腈水解酶的核酸,以及制备和使用它们的方法 |
Publications (2)
Publication Number | Publication Date |
---|---|
CN1849391A true CN1849391A (zh) | 2006-10-18 |
CN1849391B CN1849391B (zh) | 2014-06-11 |
Family
ID=29552719
Family Applications (3)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN03816682.8A Expired - Fee Related CN1849391B (zh) | 2002-05-15 | 2003-05-15 | 腈水解酶、编码腈水解酶的核酸,以及制备和使用它们的方法 |
CN201210217394.7A Expired - Fee Related CN102796750B (zh) | 2002-05-15 | 2003-05-15 | 腈水解酶、编码腈水解酶的核酸,以及制备和使用它们的方法 |
CN201510514378.8A Pending CN105296512A (zh) | 2002-05-15 | 2003-05-15 | 腈水解酶、编码腈水解酶的核酸,以及制备和使用它们的方法 |
Family Applications After (2)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201210217394.7A Expired - Fee Related CN102796750B (zh) | 2002-05-15 | 2003-05-15 | 腈水解酶、编码腈水解酶的核酸,以及制备和使用它们的方法 |
CN201510514378.8A Pending CN105296512A (zh) | 2002-05-15 | 2003-05-15 | 腈水解酶、编码腈水解酶的核酸,以及制备和使用它们的方法 |
Country Status (8)
Country | Link |
---|---|
US (1) | US20040002147A1 (zh) |
EP (1) | EP1576108B1 (zh) |
JP (2) | JP4384594B2 (zh) |
CN (3) | CN1849391B (zh) |
AU (2) | AU2003231789C1 (zh) |
CA (2) | CA2857899A1 (zh) |
DK (1) | DK1576108T3 (zh) |
WO (1) | WO2003097810A2 (zh) |
Cited By (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN108486088A (zh) * | 2018-02-14 | 2018-09-04 | 浙江工业大学 | 腈水解酶突变体及其应用 |
CN113151233A (zh) * | 2021-04-13 | 2021-07-23 | 浙江工业大学 | 腈水合酶赖氨酸突变体hba-k2h2、编码基因及应用 |
CN113151234A (zh) * | 2021-04-13 | 2021-07-23 | 浙江工业大学 | 腈水合酶赖氨酸突变体hba-k2h2r、编码基因及应用 |
WO2022073331A1 (zh) * | 2020-10-09 | 2022-04-14 | 浙江工业大学 | 一种腈水解酶突变体及其在催化合成2-氯烟酸中的应用 |
CN114908075A (zh) * | 2022-04-02 | 2022-08-16 | 浙江工业大学 | 一种酶法合成布瓦西坦手性中间体的方法 |
Families Citing this family (13)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7521216B2 (en) | 1999-12-29 | 2009-04-21 | Verenium Corporation | Nitrilases and methods for making and using them |
US20040002147A1 (en) * | 1999-12-29 | 2004-01-01 | Desantis Grace | Nitrilases |
US7300775B2 (en) | 1999-12-29 | 2007-11-27 | Verenium Corporation | Methods for producing α-substituted carboxylic acids using nitrilases and strecker reagents |
US7435562B2 (en) | 2000-07-21 | 2008-10-14 | Modular Genetics, Inc. | Modular vector systems |
WO2003106415A2 (en) * | 2002-06-13 | 2003-12-24 | Diversa Corporation | Processes for making (r)-ethyl 4-cyano-3-hydroxybutyric acid |
CA2590245A1 (en) | 2004-11-11 | 2006-05-18 | Modular Genetics, Inc. | Ladder assembly and system for generating diversity |
US7671231B2 (en) * | 2006-01-18 | 2010-03-02 | Lloyd Michael C | Process for making amino acids |
EP2115153B1 (en) | 2007-03-01 | 2013-06-05 | BP Corporation North America Inc. | Nitrilases, nucleic acids encoding them and methods for making and using them |
FI3522713T3 (fi) * | 2016-10-03 | 2023-01-13 | Amiinien ja hydratsiinien fluoresoiva tunnistus ja niiden määritysmenetelmiä | |
CN106636044B (zh) * | 2017-03-07 | 2019-10-18 | 东莞东阳光药物研发有限公司 | 腈水解酶突变体及其编码基因和应用 |
CN111172140B (zh) * | 2020-01-21 | 2022-04-19 | 浙江工业大学 | 一种腈水解酶突变体及其在制备抗癫痫药物中间体中的应用 |
CN112058241B (zh) * | 2020-09-15 | 2022-11-15 | 太原科技大学 | 一种含大豆蛋白的复合吸附材料的制备方法 |
CN113755477B (zh) * | 2021-08-30 | 2023-08-18 | 上海晖胧生物医药有限公司 | 腈水解酶突变体及其在制备苯乙酮酸类化合物中的应用 |
Family Cites Families (13)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4946778A (en) | 1987-09-21 | 1990-08-07 | Genex Corporation | Single polypeptide chain binding molecules |
US5331573A (en) | 1990-12-14 | 1994-07-19 | Balaji Vitukudi N | Method of design of compounds that mimic conformational features of selected peptides |
JP3154633B2 (ja) * | 1994-12-28 | 2001-04-09 | 三菱レイヨン株式会社 | ニトリラーゼ遺伝子発現のための調節因子およびその遺伝子 |
US5631280A (en) | 1995-03-29 | 1997-05-20 | Merck & Co., Inc. | Inhibitors of farnesyl-protein transferase |
US5958672A (en) | 1995-07-18 | 1999-09-28 | Diversa Corporation | Protein activity screening of clones having DNA from uncultivated microorganisms |
JPH0937788A (ja) * | 1995-07-31 | 1997-02-10 | Nitto Chem Ind Co Ltd | 新規なニトリラーゼ遺伝子 |
US5939250A (en) | 1995-12-07 | 1999-08-17 | Diversa Corporation | Production of enzymes having desired activities by mutagenesis |
DE19848129A1 (de) * | 1998-10-19 | 2000-04-20 | Basf Ag | Verfahren zur Herstellung chiraler Carbonsäuren aus Nitrilen mit Hilfe einer Nitrilase oder Mikroorganismen, die ein Gen für die Nitrilase enthalten |
US6470277B1 (en) * | 1999-07-30 | 2002-10-22 | Agy Therapeutics, Inc. | Techniques for facilitating identification of candidate genes |
US7300775B2 (en) * | 1999-12-29 | 2007-11-27 | Verenium Corporation | Methods for producing α-substituted carboxylic acids using nitrilases and strecker reagents |
US20040002147A1 (en) * | 1999-12-29 | 2004-01-01 | Desantis Grace | Nitrilases |
EP2327765B1 (en) * | 2001-06-21 | 2015-04-01 | BASF Enzymes LLC | Nitrilases |
US9409174B2 (en) | 2013-06-21 | 2016-08-09 | Bio-Rad Laboratories, Inc. | Microfluidic system with fluid pickups |
-
2002
- 2002-09-09 US US10/241,742 patent/US20040002147A1/en not_active Abandoned
-
2003
- 2003-05-15 EP EP03753097.9A patent/EP1576108B1/en not_active Expired - Lifetime
- 2003-05-15 DK DK03753097.9T patent/DK1576108T3/en active
- 2003-05-15 JP JP2004506469A patent/JP4384594B2/ja not_active Expired - Fee Related
- 2003-05-15 AU AU2003231789A patent/AU2003231789C1/en not_active Ceased
- 2003-05-15 CN CN03816682.8A patent/CN1849391B/zh not_active Expired - Fee Related
- 2003-05-15 CA CA2857899A patent/CA2857899A1/en not_active Abandoned
- 2003-05-15 CA CA2486062A patent/CA2486062C/en not_active Expired - Fee Related
- 2003-05-15 CN CN201210217394.7A patent/CN102796750B/zh not_active Expired - Fee Related
- 2003-05-15 WO PCT/US2003/015712 patent/WO2003097810A2/en active Application Filing
- 2003-05-15 CN CN201510514378.8A patent/CN105296512A/zh active Pending
-
2009
- 2009-07-29 AU AU2009203078A patent/AU2009203078A1/en not_active Abandoned
- 2009-07-30 JP JP2009177263A patent/JP4528872B2/ja not_active Expired - Fee Related
Cited By (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN108486088A (zh) * | 2018-02-14 | 2018-09-04 | 浙江工业大学 | 腈水解酶突变体及其应用 |
CN108486088B (zh) * | 2018-02-14 | 2021-02-02 | 浙江工业大学 | 腈水解酶突变体及其应用 |
WO2022073331A1 (zh) * | 2020-10-09 | 2022-04-14 | 浙江工业大学 | 一种腈水解酶突变体及其在催化合成2-氯烟酸中的应用 |
CN113151233A (zh) * | 2021-04-13 | 2021-07-23 | 浙江工业大学 | 腈水合酶赖氨酸突变体hba-k2h2、编码基因及应用 |
CN113151234A (zh) * | 2021-04-13 | 2021-07-23 | 浙江工业大学 | 腈水合酶赖氨酸突变体hba-k2h2r、编码基因及应用 |
CN113151233B (zh) * | 2021-04-13 | 2022-08-12 | 浙江工业大学 | 腈水合酶赖氨酸突变体hba-k2h2、编码基因及应用 |
CN114908075A (zh) * | 2022-04-02 | 2022-08-16 | 浙江工业大学 | 一种酶法合成布瓦西坦手性中间体的方法 |
CN114908075B (zh) * | 2022-04-02 | 2024-03-26 | 浙江工业大学 | 一种酶法合成布瓦西坦手性中间体的方法 |
Also Published As
Publication number | Publication date |
---|---|
CN105296512A (zh) | 2016-02-03 |
AU2009203078A1 (en) | 2009-08-20 |
AU2003231789B2 (en) | 2009-04-30 |
AU2003231789C1 (en) | 2010-02-25 |
AU2003231789B8 (en) | 2009-05-14 |
JP2006511195A (ja) | 2006-04-06 |
WO2003097810A2 (en) | 2003-11-27 |
CA2486062A1 (en) | 2003-11-27 |
EP1576108A4 (en) | 2008-01-09 |
CA2486062C (en) | 2014-10-14 |
JP2009279005A (ja) | 2009-12-03 |
AU2009203078A2 (en) | 2009-10-08 |
US20040002147A1 (en) | 2004-01-01 |
DK1576108T3 (en) | 2015-10-05 |
CN102796750A (zh) | 2012-11-28 |
AU2003231789A1 (en) | 2003-12-02 |
CN1849391B (zh) | 2014-06-11 |
WO2003097810A3 (en) | 2006-03-23 |
EP1576108A2 (en) | 2005-09-21 |
EP1576108B1 (en) | 2015-07-08 |
JP4384594B2 (ja) | 2009-12-16 |
CA2857899A1 (en) | 2003-11-27 |
JP4528872B2 (ja) | 2010-08-25 |
CN102796750B (zh) | 2015-09-16 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
DK2327765T3 (en) | nitrilases | |
CN1849391A (zh) | 腈水解酶、编码腈水解酶的核酸,以及制备和使用它们的方法 | |
US8273547B2 (en) | Engineered ketoreductases and methods for producing stereoisomerically pure statins | |
CN1277843C (zh) | 分枝杆菌比较基因组学作为鉴定分枝杆菌病的诊断、预防或治疗靶的工具 | |
CN1871351A (zh) | 一种新的真菌蛋白及其编码核酸 | |
CN1240833C (zh) | 新型腈水合酶 | |
CN101044243A (zh) | 类异戊二烯的生产方法 | |
CN1723281A (zh) | 突变的d-氨基转移酶和使用它们生产旋光性谷氨酸衍生物的方法 | |
CN1934132A (zh) | 能耐受氰化物的腈水合酶 | |
CN1766111A (zh) | 编码参与内环境稳定和适应的蛋白质的谷氨酸棒杆菌基因 | |
CN1331743A (zh) | 一种通过使用腈水解酶或含有腈水解酶的微生物来从腈类物质中制备手性羧酸的方法 | |
CN1977046A (zh) | 编码参与普拉地内酯生物合成的多肽的dna | |
CN1675371A (zh) | 具有α-H-α-氨基酸酰胺消旋酶活性的多肽和编码该多肽的核酸 | |
CN1571843A (zh) | 腈水合酶和生产酰胺的方法 | |
CN1340101A (zh) | 新型红球菌属细菌、源自于红球菌属细菌的腈水解酶基因、腈水合酶基因和酰胺酶基因,以及利用它们生产羧酸的方法 | |
CN1262651C (zh) | 使植物产生杂草防治化合物耐受性的方法 | |
CN1771323A (zh) | L-肉碱脱氢酶、其衍生物以及取代的(s)-链烷醇的制备 | |
CN101031652A (zh) | 4-氨基-4-去氧分支酸盐/酯(adc)和[3r,4r]-4-氨基-3-羟基环己-1,5-二烯-1-羧酸(3,4-cha)的生物合成生产 | |
CN1639320A (zh) | 冷休克诱导的异源多肽的表达和生产 | |
CN1617931A (zh) | 具有脂肪酶活性的酯酶 | |
CN1298411A (zh) | 与半胱天冬蛋白酶-8相互作用的蛋白 | |
CN1637143A (zh) | 生成合成胆固醇合成抑制剂莫那可林k的相关基因 | |
CN1894408A (zh) | 除草剂代谢蛋白质,其基因及其应用 | |
Asano et al. | Exploiting Natural Diversity for Industrial Enzymatic Applications | |
CN1325959A (zh) | 来自基因簇的基因 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
ASS | Succession or assignment of patent right |
Owner name: WIRAINIM CO.,LTD. Free format text: FORMER OWNER: DIVERSA CORP. Effective date: 20091113 |
|
C41 | Transfer of patent application or patent right or utility model | ||
TA01 | Transfer of patent application right |
Effective date of registration: 20091113 Address after: American California Applicant after: Diversa Corp. Address before: American California Applicant before: Diversa Corp. |
|
C14 | Grant of patent or utility model | ||
GR01 | Patent grant | ||
CF01 | Termination of patent right due to non-payment of annual fee | ||
CF01 | Termination of patent right due to non-payment of annual fee |
Granted publication date: 20140611 Termination date: 20160515 |