I1RVD9 · FUSA2_GIBZE
- Protein: Nonribosomal peptide synthetase 7
- Gene: NRPS7
- Status: UniProtKB reviewed (Swiss-Prot)
- Amino acids: 4423
- Protein existence: Inferred from homology
- Annotation score: 4/5
Function
Nonribosomal peptide synthetase; part of the gene cluster that mediates the biosynthesis of the lipopeptide fusaristatin A (PubMed:25412204).
Fusaristatin A consists of a polyketide chain linked to three amino acid residues glutamine (Gln), dehydroalanine (dehydro-Ala), and beta-aminoisobutyric acid (PubMed:25412204).
The biosynthesis starts with formation of a linear polyketide chain by the highly reducing polyketide synthase PKS6 (PubMed:25412204).
The gene cluster does not contain an acyl-CoA ligase or an acyl-transferase, and it is therefore predicted that the polyketide is transferred directly to the nonribosomal peptide synthetase NRPS7 (Probable). Modules 1-3 from NRPS7 incorporate dehydro-Ala, Gln, and beta-aminoisobutyric acid in the compound, which is released by cyclization (PubMed:25412204).
The beta-aminoisobutyric acid units are most likely not freely available to the NRPS, but can be synthesized from thymine, which requires a dehydrogenase, a monooxygenase, and an aminotransferase. The fusaristatin A cluster contains a cytochrome P450 monooxygenase (FGSG_08207) and an aminotransferase (FGSG_17085), which theoretically can perform two of these enzymatic steps (Probable). These enzymes may, however, also be involved in biosynthesis of dehydroalanine or modification of the polyketide (Probable). The dehydro-Ala residue can be a result of cyclization, where serine is dehydrated (Probable). The last gene of the cluster encodes a protein with an A/B barrel domain that is found in a variety of enzymes, which hampers functional prediction (Probable).
Pathway
Secondary metabolite biosynthesis.
GO annotations
Aspect | Term
---|---
Molecular Function | isomerase activity
Molecular Function | ligase activity
Molecular Function | phosphopantetheine binding
Keywords
- Molecular function
Enzyme and pathway databases
Names & Taxonomy
Protein names
- Recommended name: Nonribosomal peptide synthetase 7
- EC number
- Short names: NRPS 7
- Alternative names
Gene names
Organism names
- Strain
- Taxonomic lineage: Eukaryota > Fungi > Dikarya > Ascomycota > Pezizomycotina > Sordariomycetes > Hypocreomycetidae > Hypocreales > Nectriaceae > Fusarium
Accessions
- Primary accession: I1RVD9
- Secondary accessions
Proteomes
Organism-specific databases
Phenotypes & Variants
Disruption phenotype
Disruption impairs the production of fusaristatin A.
PTM/Processing
Features
Showing features for chain, modified residue.
Type | ID | Position(s) | Description
---|---|---|---
Chain | PRO_0000445374 | 1-4423 | Nonribosomal peptide synthetase 7
Sequence: MQGGPVICNGIPSVTISILPHPRSRARTSLLLRRGGALGPGSGCMSDRLFTTPDMRKNRQRLTLAASVVLKKEENEIDLEKSLVEQGGDMLSGVFFAGQCRELGLIVDIANASSMSLRDMALQLVESNPELLRPSTAMDCRSASVPHTLLQSLYRNFGASTGVTLDVDIPLTLEDVTKMLQRLVERHPILGSTAEPNDKTSYVFSNTVPSPILVSFESDKTHAEMRMLQEDSKSKVAVFSVMAFGEESRITNLSFVADNAAIDAQSWSILLHDIQAFPAGFNDLTTQSNTFPNWIESAFRVTDTTVETAGTTKLPIHSEMSAEDSSPALSSCVFTLNPELTQAIQSEACHRTLRTEVQDVMFGALASTFGTQLPASARYLEIKDGRPQDEGNAWNSVIGCFDEIFELAYECHGDIIDACRSAKDSRKQSSLSPVHYASCRYNLILDTTWLKACIGTSSTGKMRLMDNEPGRYAAEALVKSMGGLCMTPFWQGTGLSFLVVSSTDFGSDEDLKLNSEMFINHVQHISETLPNRSPWPTLSDFPHVSFDYPSLDRMFQQKLLQITQTPLADIHNIYPCTSIQENMLMGNSLDKDAYVCSFTARATTSGAFTHFDAAKWAEAWGRVVEKHSSLRTIFIESEGRPGHFEQVILKSVAAPVDIMTGPSVPSKIEFQDFSVPHHLAIIQEGPGRCLMILTMSHAITDGHSAEVLLGDLCAEAVQTDGTGEEAFAYSEFALTEYQSTNTEVSDYWQDYLLKTQETILPVTREKSDFHDFNTVHSTMPVNVSSMDRICRRHNINLASVCQFAWGVVLRSRLGVDGVCFSYISSLRNKPLKGIMTAVGPLITTLLCSMNLEGERPVLDAIRAVDSEYVESLSHEKELYNITSPRRWCNTVMSFRRRLVQDDGGIPGLSYKLVKAFSPTNYDFSLIVSAGQSDLDIMLDYWGSRMDRDNAQSLLQIFQEVLHSIFRDIDSTVSGLDLISQDDKNRILERNRLVPKGLNRCVHELVNERIQKQPSAVAIDAWDGSLTYSELDNLTSRLAQYLSNIGVGPEVPVGICMDKSKLVPVTVLAILQAGGAVLPIGVEEPEARVEAILADATPVAIVGDGRQVTRLSELGTQVLNVVDILADMSSSLPSSTSKQETRATPDTTAWIFYTSGSTGTPKGVLVEHQALATSMRAHGVALKVLPEDRVLQFAAHTFDVSLSELFTTLIFGGCVCIPDETNRVNDLAGSVHGLQANVLSLTSSMASTIRPRDVPMVRKLVLFGEEVKASVVEAWLGKADIYNAYGPTESSIFASVSKPFQSVDDLSNIGYPMDVNFWVTDPQNPGRLCPPGSPGELLIEGPLLARGYLNDDNKTSTAFIQDPSFASLLGLEKGRRFYRTGDLVRQNSDFSMSYLGRRDTQVKIRGQRLDVSEVEHWITASLEGAVRVVVDLLPGAILFAAVEFSPHAVVVVSGDKTILQTSDEFRHRFSQLKNALQGKLPSYMLPTFYVPFRRIPLTSSAKTDRKMVRLLVSQLDTIILQQYISNDSNESDDLPLTETEQKLKVLWANVLNVTHSSIGAGDHFLYRGGDSLMAIKLVEQARLESISMTVKDVLSFPRLQDLARTIDERNEVGIISSAQMANQHDPPAFSLWKPSSSDRETELADIAMQCGLGTEDIEDVYPCTSLQESMLAATQQRPTAYIVRQMYSLSDSIDLSHFRKIWDVLVQQAPVMRTHILLGQRSGSLQVLSKKPLTWYNHDDLDEYVAADQARAMATGQPLMRLALVQDKSKGVRYFVWTAHHSVYDGWSAQLIYKRLAALYLYEEVPTAVPFTRFIEYQQRTKDDSDINLGSYWRGQLGGDVPSAFPSMPSPFYQPKPVSLHRSEIDTLSFKPSKYGFSLADILRAGWAMTLGQYLGTNDVVFGAILSGRNAPVAEITQLIAPTITTVPVRVTVDREAKVSWYLAAIQSQAVEMIPYEHTGLDEIKRLCPDLQPATDVNHAFVIEPPYAKDG
DAHTILPGLELVDTALDTFDTFALTIQCQLPSKQGGSIKVEARFDAKVVSDAQVTVLLRQFEHWVSQFLDETHHETQLKSLEEITSADLAQIKKQNSRIPVRDMVCLHHLIRDVAKEQPDSPAVCAWDGDFTYEELWTNARRLAQHLSNLGVGPKSRVAVCMDKSRWTVASILGILESGGVVVMLRSQSPLEQAKALVADCQATAMLVNAGHTARFAGSGPRIVEVNDALLASLPDPTVSGPICPALNPGHPAWIVYTSGSTGLPKGCLLIHGGLATSLPAHGRATRWHKESRTLQFASHEFDVTLQEIMTTLIFKGCVCIPSEDQRINSLSQAIRDMNVTQMVLTPTVASMINPVDVPCIVQLQVAGELIKPSVVERWIDHAEVVNIYGPSECSVYSSCGTPMQTIEDAPVIGYPLDNCNFWVTSTTDHNRLCPIGIPGELLIENSWQAWGYVNNPELTAQCFVVEPGFIKQLGLDGSGRRMYRTGDLVQQNPNGSYTYIGRMGSEVKFRGHRVDLGRIEYWIGKLLEGVQTIAVDLVELDTGKKANDLVAVIDFTDDCDLFDLDQTEDIDGVAILTPSTKIRKALCRLRDGLTDKLPSYMVPTAFMPWKKIPFTSSGKTNRKAIRQLLTNLEAGSSLLQRYLADGDVKEVPQTRIGKKLQQLWAEVLSVKVDSIGSQDHFTRLGGDSLAAMKLVASARQVGLELSVTSIFTYPVLEDCARILEADQDSSLVKLPEEDPAPFELMPEDWSTGGFEDRLADFAAQCRVAPSQIEDVYPCTPMQEALFAITARNPTAYTYRQVFRASGEDVDMVRFQTAWETVASILPILRTRIVLDQSGFLQTVIDQPLIWHIGGDLDSYIAADKLVGFEPGTPLLRCAIVEGGGAKYFVLTTHHSMFDKWSIEKIMYRYLIPAYFGQQLPEAVPFPRFVRHVLNIDMDSASQFWTQKLEDDEPFTEFPSLPSVGFYEPKPTGLLSQTFRIDGVNKLETPFPSLLRAAWALTVSQYAGAEDVMFAVNLSGRSAPVADISELAAPTFTTVPVRVRINRSQRVRDFLDGLHRETIAMVPFEHVGLRNIKRFVPTFNPSDLRHLFLVHTAADDTLDDPSFRLPGFEQVHQKAETLDDYPLTILCKLDDHKGEAEVVARFDSTVIPADQIQSVLRQFEHNVVQLAASASSDDQTVGGLPLVSSYDLDRISAWNVTGPPSLGCVHDLFIRSLETRPDSQAVCSWDGEFTYRELDQAARILAQLLVAEGGVGTEVAVGLCMDKSRWAMVAVLAILYAGGAVVPLGVDLPPERISVILQDSSPTMVLCDEAKADRFRSLGCKIAVVNETEIDGVAKSYDGYNPNIPSTSVSAENMAWIIYTSGSTGVPKGVTLEHGGIYNIILNKGTTLGFDSTTRTFQFAAFTFDVSIADPLMAWAFGGCVCLPSEDERMNDLVGSINRLNANFALLTASTAALITPSEVPRMTKLLLGGESNTPSLMEKWLLDSNITVGNSYGPAECSITSTINARVTDKNGCNIIGNPIQGTQAWIADFHDCNRLVPIGAVGELLIEGPHVARGYRNDAVKTMAAFITDPRFTTDVGPKRHGRRMYRSGDLVRYTSDGNIEFLGRGDSQIKIRGQRVDLGEIESCIVKLVPKVRTALVEYLHLSEDQRALIAALEFHNADKDQDVEGLATWLKESLAQQLPAYMIPRAYLQIDMIPKTVSGKTNRKAIRQFMMNKYMQIADENSLNDFQTGKVDTESEYITRTLWAAVLGVDADRIDRHDNFFDIGGDSIIAMKLVAAAKVKGFQIRVLDIFENPVLFKMAVVAQHQTEMALEAVSPPPYYPFQLLDSDDNDIDTILEEFVCPVTGTGKESIQDVFPAPDAIAFGVAGALTAAQPEVNTFVLDAEGDLDLVRLQQSCVLLAHHIEAFRTAFAFDLRSGRLLQIVLKSYQHNVLVVRTRESLEDATERLFEKDIYHEPFRLGTPLVSMTILQEHNSRNTRILLRMSHAIYDAMSL
PIILRTLRSLYHKQDAYKPPLFSFAEYVADLNRHTGNTSYNYWRNLLQGSTMTEVIPTAAYGGQNPVQMAFTNAKMIAVRKSKGDGITTSTIISCAWAHVLAQYTGKPDVVFGDTISGRNLVDPSISSTVVGCCATNVPMRVRFAGDSGEHSILQLLNQVRDQQRSRIPHEGVGVRSLIHECTDWSPEARFTSVVNHRPANDPAVKSISNQIDFKVSTITTENKPFMTWYDLAVISQENNGHVEMSLGYSTTGFHPETAQSLLEDLADTVQILLNAVSSQDEKLALLGTEVMPRSSSKLTKLQRVNSPKEQTLRKDKPTNGVFSDKPDDATLSVLDTIWFSIFTSNRAGVGTLASDELTPDLRYLPFYKVGGDLLDAAWFIALIQRRVKTSGRESNGDGILASHNQLTVDDVLRHPSVVEFAGLLKQKQVELN | ||||||
Modified residue | | 1570 | O-(pantetheine 4'-phosphoryl)serine
Sequence: S
Modified residue | | 2679 | O-(pantetheine 4'-phosphoryl)serine
Sequence: S
Modified residue | | 3765 | O-(pantetheine 4'-phosphoryl)serine
Sequence: S
Keywords
- PTM
Interaction
Protein-protein interaction databases
Family & Domains
Features
Showing features for region, domain, compositional bias.
Type | ID | Position(s) | Description
---|---|---|---
Region | | 572-986 | Condensation 1
Sequence: NIYPCTSIQENMLMGNSLDKDAYVCSFTARATTSGAFTHFDAAKWAEAWGRVVEKHSSLRTIFIESEGRPGHFEQVILKSVAAPVDIMTGPSVPSKIEFQDFSVPHHLAIIQEGPGRCLMILTMSHAITDGHSAEVLLGDLCAEAVQTDGTGEEAFAYSEFALTEYQSTNTEVSDYWQDYLLKTQETILPVTREKSDFHDFNTVHSTMPVNVSSMDRICRRHNINLASVCQFAWGVVLRSRLGVDGVCFSYISSLRNKPLKGIMTAVGPLITTLLCSMNLEGERPVLDAIRAVDSEYVESLSHEKELYNITSPRRWCNTVMSFRRRLVQDDGGIPGLSYKLVKAFSPTNYDFSLIVSAGQSDLDIMLDYWGSRMDRDNAQSLLQIFQEVLHSIFRDIDSTVSGLDLISQDDKNRI | ||||||
Region | | 1007-1404 | Adenylation 1
Sequence: ERIQKQPSAVAIDAWDGSLTYSELDNLTSRLAQYLSNIGVGPEVPVGICMDKSKLVPVTVLAILQAGGAVLPIGVEEPEARVEAILADATPVAIVGDGRQVTRLSELGTQVLNVVDILADMSSSLPSSTSKQETRATPDTTAWIFYTSGSTGTPKGVLVEHQALATSMRAHGVALKVLPEDRVLQFAAHTFDVSLSELFTTLIFGGCVCIPDETNRVNDLAGSVHGLQANVLSLTSSMASTIRPRDVPMVRKLVLFGEEVKASVVEAWLGKADIYNAYGPTESSIFASVSKPFQSVDDLSNIGYPMDVNFWVTDPQNPGRLCPPGSPGELLIEGPLLARGYLNDDNKTSTAFIQDPSFASLLGLEKGRRFYRTGDLVRQNSDFSMSYLGRRDTQVKIR | ||||||
Domain | | 1533-1609 | Carrier 1
Sequence: LPLTETEQKLKVLWANVLNVTHSSIGAGDHFLYRGGDSLMAIKLVEQARLESISMTVKDVLSFPRLQDLARTIDERN | ||||||
Region | | 1657-2066 | Condensation 2
Sequence: EDVYPCTSLQESMLAATQQRPTAYIVRQMYSLSDSIDLSHFRKIWDVLVQQAPVMRTHILLGQRSGSLQVLSKKPLTWYNHDDLDEYVAADQARAMATGQPLMRLALVQDKSKGVRYFVWTAHHSVYDGWSAQLIYKRLAALYLYEEVPTAVPFTRFIEYQQRTKDDSDINLGSYWRGQLGGDVPSAFPSMPSPFYQPKPVSLHRSEIDTLSFKPSKYGFSLADILRAGWAMTLGQYLGTNDVVFGAILSGRNAPVAEITQLIAPTITTVPVRVTVDREAKVSWYLAAIQSQAVEMIPYEHTGLDEIKRLCPDLQPATDVNHAFVIEPPYAKDGDAHTILPGLELVDTALDTFDTFALTIQCQLPSKQGGSIKVEARFDAKVVSDAQVTVLLRQFEHWVSQFLDETHHET | ||||||
Region | | 2102-2499 | Adenylation 2
Sequence: RDVAKEQPDSPAVCAWDGDFTYEELWTNARRLAQHLSNLGVGPKSRVAVCMDKSRWTVASILGILESGGVVVMLRSQSPLEQAKALVADCQATAMLVNAGHTARFAGSGPRIVEVNDALLASLPDPTVSGPICPALNPGHPAWIVYTSGSTGLPKGCLLIHGGLATSLPAHGRATRWHKESRTLQFASHEFDVTLQEIMTTLIFKGCVCIPSEDQRINSLSQAIRDMNVTQMVLTPTVASMINPVDVPCIVQLQVAGELIKPSVVERWIDHAEVVNIYGPSECSVYSSCGTPMQTIEDAPVIGYPLDNCNFWVTSTTDHNRLCPIGIPGELLIENSWQAWGYVNNPELTAQCFVVEPGFIKQLGLDGSGRRMYRTGDLVQQNPNGSYTYIGRMGSEVK | ||||||
Domain | | 2642-2718 | Carrier 2
Sequence: VPQTRIGKKLQQLWAEVLSVKVDSIGSQDHFTRLGGDSLAAMKLVASARQVGLELSVTSIFTYPVLEDCARILEADQ | ||||||
Region | | 2764-3170 | Condensation 3
Sequence: EDVYPCTPMQEALFAITARNPTAYTYRQVFRASGEDVDMVRFQTAWETVASILPILRTRIVLDQSGFLQTVIDQPLIWHIGGDLDSYIAADKLVGFEPGTPLLRCAIVEGGGAKYFVLTTHHSMFDKWSIEKIMYRYLIPAYFGQQLPEAVPFPRFVRHVLNIDMDSASQFWTQKLEDDEPFTEFPSLPSVGFYEPKPTGLLSQTFRIDGVNKLETPFPSLLRAAWALTVSQYAGAEDVMFAVNLSGRSAPVADISELAAPTFTTVPVRVRINRSQRVRDFLDGLHRETIAMVPFEHVGLRNIKRFVPTFNPSDLRHLFLVHTAADDTLDDPSFRLPGFEQVHQKAETLDDYPLTILCKLDDHKGEAEVVARFDSTVIPADQIQSVLRQFEHNVVQLAASASSDDQT | ||||||
Region | | 3205-3609 | Adenylation 3
Sequence: RSLETRPDSQAVCSWDGEFTYRELDQAARILAQLLVAEGGVGTEVAVGLCMDKSRWAMVAVLAILYAGGAVVPLGVDLPPERISVILQDSSPTMVLCDEAKADRFRSLGCKIAVVNETEIDGVAKSYDGYNPNIPSTSVSAENMAWIIYTSGSTGVPKGVTLEHGGIYNIILNKGTTLGFDSTTRTFQFAAFTFDVSIADPLMAWAFGGCVCLPSEDERMNDLVGSINRLNANFALLTASTAALITPSEVPRMTKLLLGGESNTPSLMEKWLLDSNITVGNSYGPAECSITSTINARVTDKNGCNIIGNPIQGTQAWIADFHDCNRLVPIGAVGELLIEGPHVARGYRNDAVKTMAAFITDPRFTTDVGPKRHGRRMYRSGDLVRYTSDGNIEFLGRGDSQIKIR | ||||||
Domain | | 3731-3804 | Carrier 3
Sequence: TESEYITRTLWAAVLGVDADRIDRHDNFFDIGGDSIIAMKLVAAAKVKGFQIRVLDIFENPVLFKMAVVAQHQT | ||||||
Region | | 3875-4278 | Condensation 4
Sequence: TFVLDAEGDLDLVRLQQSCVLLAHHIEAFRTAFAFDLRSGRLLQIVLKSYQHNVLVVRTRESLEDATERLFEKDIYHEPFRLGTPLVSMTILQEHNSRNTRILLRMSHAIYDAMSLPIILRTLRSLYHKQDAYKPPLFSFAEYVADLNRHTGNTSYNYWRNLLQGSTMTEVIPTAAYGGQNPVQMAFTNAKMIAVRKSKGDGITTSTIISCAWAHVLAQYTGKPDVVFGDTISGRNLVDPSISSTVVGCCATNVPMRVRFAGDSGEHSILQLLNQVRDQQRSRIPHEGVGVRSLIHECTDWSPEARFTSVVNHRPANDPAVKSISNQIDFKVSTITTENKPFMTWYDLAVISQENNGHVEMSLGYSTTGFHPETAQSLLEDLADTVQILLNAVSSQDEKLALLG | ||||||
Region | | 4288-4312 | Disordered
Sequence: KLTKLQRVNSPKEQTLRKDKPTNGV
Compositional bias | | 4298-4312 | Basic and acidic residues
Sequence: PKEQTLRKDKPTNGV
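The feature annotations can be cross-checked against each other. The following is a minimal sketch, not part of the UniProt record, that tests whether each O-(pantetheine 4'-phosphoryl)serine position listed under PTM/Processing falls inside one of the three carrier domains and corresponds to a serine in that domain's sequence. The names `CARRIERS`, `PPANT_SITES`, `residue_at`, and `locate` are hypothetical helpers introduced for illustration; the coordinates and sequences are copied from the feature tables of this entry.

```python
# Carrier domain coordinates (1-based, inclusive) and sequences,
# copied from the Family & Domains feature table of this entry.
CARRIERS = {
    "Carrier 1": (1533, 1609,
                  "LPLTETEQKLKVLWANVLNVTHSSIGAGDHFLYRGGDSLMAIKLVEQARLESISMTVKDVLSFPRLQDLARTIDERN"),
    "Carrier 2": (2642, 2718,
                  "VPQTRIGKKLQQLWAEVLSVKVDSIGSQDHFTRLGGDSLAAMKLVASARQVGLELSVTSIFTYPVLEDCARILEADQ"),
    "Carrier 3": (3731, 3804,
                  "TESEYITRTLWAAVLGVDADRIDRHDNFFDIGGDSIIAMKLVAAAKVKGFQIRVLDIFENPVLFKMAVVAQHQT"),
}

# Modified-residue positions from the PTM/Processing feature table.
PPANT_SITES = [1570, 2679, 3765]


def residue_at(position, start, sequence):
    """Return the residue at a 1-based protein position within a domain
    whose first residue sits at 1-based protein position `start`."""
    return sequence[position - start]


def locate(site):
    """Return (domain name, residue) for the carrier domain containing
    `site`, or None if no carrier domain covers that position."""
    for name, (start, end, sequence) in CARRIERS.items():
        if start <= site <= end:
            return name, residue_at(site, start, sequence)
    return None


results = {site: locate(site) for site in PPANT_SITES}
for site, hit in results.items():
    print(site, hit)
```

Each site maps to a serine in a different carrier domain, in every case directly after a Gly-Gly-Asp stretch of the domain sequence.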
Domain
NRP synthetases are composed of discrete domains (adenylation (A), thiolation (T) or peptidyl carrier protein (PCP), and condensation (C) domains) which, when grouped together, are referred to as a single module. Each module is responsible for the recognition (via the A domain) and incorporation of a single amino acid into the growing peptide product. Thus, an NRP synthetase is generally composed of one or more modules and can terminate in a thioesterase (TE) domain that releases the newly synthesized peptide from the enzyme. Occasionally, epimerase (E) domains (responsible for L- to D-amino acid conversion) are present within the NRP synthetase (By similarity).
NRPS7 has the following architecture: C-A-T-C-A-T-C-A-T-Cy. The last condensation domain (Cy) could be responsible for cyclization of the final product (Probable).
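The architecture string can be split into modules to compare it with the feature table. The sketch below is illustrative only: `split_modules` is a hypothetical helper, and it assumes that every condensation-type domain (C or Cy) opens a new module. Pairing the modules with substrates further assumes that the entry lists dehydro-Ala, Gln, and beta-aminoisobutyric acid in module order, which the record does not state explicitly.

```python
from collections import Counter

ARCHITECTURE = "C-A-T-C-A-T-C-A-T-Cy"

# Assumption: substrates are listed in module order in the Function text.
SUBSTRATES = ["dehydro-Ala", "Gln", "beta-aminoisobutyric acid"]


def split_modules(architecture):
    """Group domains into modules, opening a new module at each
    condensation-type domain (C or Cy)."""
    modules = []
    for domain in architecture.split("-"):
        if domain in ("C", "Cy") or not modules:
            modules.append([])
        modules[-1].append(domain)
    return modules


modules = split_modules(ARCHITECTURE)

# Complete elongation modules carry all three core domains.
complete = [m for m in modules if m == ["C", "A", "T"]]
counts = Counter(ARCHITECTURE.split("-"))

for number, (module, substrate) in enumerate(zip(complete, SUBSTRATES), start=1):
    print(f"Module {number}: {'-'.join(module)} -> {substrate}")
print("Terminal domain:", "-".join(modules[-1]))
```

Under these assumptions the string yields three complete C-A-T modules plus a lone terminal Cy domain, which matches the four condensation regions, three adenylation regions, and three carrier domains in the feature table.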
Sequence similarities
Belongs to the NRP synthetase family.
Keywords
- Domain
Phylogenomic databases
Family and domain databases
Sequence
- Sequence status: Complete
- Length: 4,423
- Mass (Da): 488,395
- Last updated: 2012-06-13 v1
- Checksum: E80EB1E8F4402D91
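As a quick plausibility check on the length and mass listed above, the mean mass per residue can be computed and compared with the commonly used rule of thumb of roughly 110 Da per amino acid residue. This is a minimal sketch; `mean_residue_mass` is a hypothetical helper, not a UniProt function.

```python
LENGTH_AA = 4423    # Length, from this entry
MASS_DA = 488_395   # Mass (Da), from this entry


def mean_residue_mass(mass_da, length_aa):
    """Return the average mass per residue in daltons."""
    return mass_da / length_aa


avg = mean_residue_mass(MASS_DA, LENGTH_AA)
print(f"{avg:.1f} Da per residue")
print(f"{MASS_DA / 1000:.1f} kDa total")
```

The result, about 110.4 Da per residue, is in line with that rule of thumb, so the two values are mutually consistent.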
Features
Showing features for compositional bias.
Type | ID | Position(s) | Description
---|---|---|---
Compositional bias | | 4298-4312 | Basic and acidic residues
Sequence: PKEQTLRKDKPTNGV
Keywords
- Technical term
Sequence databases
Nucleotide Sequence | Protein Sequence | Molecule Type | Status
---|---|---|---
HG970333 (EMBL, GenBank, DDBJ) | CEF76489.1 (EMBL, GenBank, DDBJ) | Genomic DNA |