Detailed information    

In silico: bioinformatically predicted

Overview


Name: comA/nlmT
Type: Regulator
Locus tag: FGL21_RS02980
Genome accession: NZ_LR594047
Coordinates: 551124..553277 (-)
Length: 717 a.a.
NCBI ID: WP_014612084.1
Uniprot ID: -
Organism: Streptococcus dysgalactiae subsp. equisimilis strain NCTC11554
Function: transport of ComC (predicted from homology)
Competence regulation
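
The coordinates and the protein length in the overview can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive (which agrees with the 2154 bp nucleotide length reported further down) and that the coding sequence ends in a single stop codon. The function name parse_coordinates is hypothetical.

```python
def parse_coordinates(text):
    """Parse a 'start..end (strand)' string into (start, end, strand)."""
    span, strand = text.split()
    start, end = (int(x) for x in span.split(".."))
    return start, end, strand.strip("()")


start, end, strand = parse_coordinates("551124..553277 (-)")

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length_bp = end - start + 1

# Assumption: one terminal stop codon that is not counted in the protein.
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp, protein_length_aa, strand)
```

Under these assumptions the result matches the 2154 bp and 717 a.a. lengths given in this record.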

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) that VRprofile2 predicted in the genome, as detailed below.

Gene-MGE association summary

MGE type   MGE coordinates   Gene coordinates   Relative position   Distance (bp)
ICE   512319..557179   551124..553277   within   0
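
The "within" and "0" values in the summary row follow from comparing the two coordinate ranges. A minimal sketch of such a comparison is given here. It is not VRprofile2's actual logic: the function name relative_position and the labels "upstream", "downstream" and "overlapping" are assumptions introduced for illustration, and only "within" appears in this record.

```python
def relative_position(mge, gene):
    """Classify a gene range against an MGE range (1-based, inclusive).

    Returns (label, distance_bp). Only the "within" label is taken from
    the record; the other labels are hypothetical placeholders.
    """
    mge_start, mge_end = mge
    gene_start, gene_end = gene
    if mge_start <= gene_start and gene_end <= mge_end:
        return "within", 0
    if gene_end < mge_start:
        return "upstream", mge_start - gene_end
    if gene_start > mge_end:
        return "downstream", gene_start - mge_end
    return "overlapping", 0


# ICE and comA/nlmT coordinates from the summary row.
print(relative_position((512319, 557179), (551124, 553277)))
```

For the coordinates in this record the gene range is fully contained in the ICE range, so the sketch reports the gene as within the element at distance 0.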


Gene organization within MGE regions


Location: 512319..557179
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  FGL21_RS02775 (NCTC11554_00567) - 514907..515509 (+) 603 WP_111681583.1 hypothetical protein -
  FGL21_RS02780 (NCTC11554_00568) - 515532..515804 (+) 273 WP_111681584.1 hypothetical protein -
  FGL21_RS02785 (NCTC11554_00569) - 515804..516586 (+) 783 WP_111681585.1 hypothetical protein -
  FGL21_RS02790 (NCTC11554_00570) - 516608..517099 (+) 492 WP_111681586.1 DUF3801 domain-containing protein -
  FGL21_RS02795 (NCTC11554_00571) - 517099..519144 (+) 2046 WP_111681587.1 type IV secretory system conjugative DNA transfer family protein -
  FGL21_RS02800 (NCTC11554_00572) - 519186..519551 (+) 366 WP_111681588.1 hypothetical protein -
  FGL21_RS02805 (NCTC11554_00573) - 519553..522789 (+) 3237 WP_138127991.1 PBECR4 domain-containing protein -
  FGL21_RS02810 (NCTC11554_00574) - 522841..523140 (+) 300 WP_111681590.1 hypothetical protein -
  FGL21_RS02820 (NCTC11554_00575) mobC 523482..523865 (+) 384 WP_111681591.1 plasmid mobilization relaxosome protein MobC -
  FGL21_RS02825 (NCTC11554_00576) - 523828..525471 (+) 1644 WP_111681592.1 relaxase/mobilization nuclease domain-containing protein -
  FGL21_RS02830 (NCTC11554_00577) - 525483..526733 (+) 1251 WP_138127993.1 site-specific DNA-methyltransferase -
  FGL21_RS02835 (NCTC11554_00578) - 526745..527044 (+) 300 WP_111681594.1 hypothetical protein -
  FGL21_RS02840 (NCTC11554_00579) - 527336..528616 (+) 1281 WP_111681595.1 ISLre2 family transposase -
  FGL21_RS02845 (NCTC11554_00580) rplK 529081..529506 (+) 426 WP_003049875.1 50S ribosomal protein L11 -
  FGL21_RS02850 (NCTC11554_00581) rplA 529612..530301 (+) 690 WP_003049876.1 50S ribosomal protein L1 -
  FGL21_RS11220 (NCTC11554_00582) - 530759..530926 (-) 168 WP_003054374.1 hypothetical protein -
  FGL21_RS02855 (NCTC11554_00583) pyrH 531208..531936 (+) 729 WP_003049878.1 UMP kinase -
  FGL21_RS02860 (NCTC11554_00584) frr 531965..532522 (+) 558 WP_003054381.1 ribosome recycling factor -
  FGL21_RS02865 (NCTC11554_00585) - 532631..533488 (+) 858 WP_003054345.1 S1 RNA-binding domain-containing protein -
  FGL21_RS02870 (NCTC11554_00586) msrA 533561..534070 (+) 510 WP_003054395.1 peptide-methionine (S)-S-oxide reductase MsrA -
  FGL21_RS02875 (NCTC11554_00587) - 534067..534282 (+) 216 WP_003054340.1 YozE family protein -
  FGL21_RS02880 (NCTC11554_00588) - 534442..535692 (+) 1251 WP_138127995.1 LysM domain-containing protein -
  FGL21_RS02885 (NCTC11554_00589) - 536003..537775 (+) 1773 WP_014612076.1 oleate hydratase -
  FGL21_RS02895 (NCTC11554_00590) - 537936..538997 (+) 1062 WP_003054347.1 PhoH family protein -
  FGL21_RS02900 (NCTC11554_00591) - 539044..539619 (+) 576 WP_003054351.1 uracil-DNA glycosylase family protein -
  FGL21_RS02905 (NCTC11554_00592) ybeY 539778..540275 (+) 498 WP_003054373.1 rRNA maturation RNase YbeY -
  FGL21_RS02910 (NCTC11554_00593) - 540256..540663 (+) 408 WP_003054389.1 diacylglycerol kinase -
  FGL21_RS02915 (NCTC11554_00594) era 540784..541680 (+) 897 WP_003054336.1 GTPase Era -
  FGL21_RS11355 (NCTC11554_00595) - 541867..542172 (+) 306 Protein_503 NUDIX domain-containing protein -
  FGL21_RS02920 - 542159..543852 (-) 1694 Protein_504 ABC transporter transmembrane domain-containing protein -
  FGL21_RS02930 (NCTC11554_00598) - 544154..544381 (+) 228 WP_003054378.1 bacteriocin-type signal sequence -
  FGL21_RS02935 (NCTC11554_00599) - 544430..544747 (+) 318 WP_002994587.1 hypothetical protein -
  FGL21_RS02940 (NCTC11554_00600) - 545155..545394 (+) 240 WP_003054359.1 Blp family class II bacteriocin -
  FGL21_RS02945 (NCTC11554_00601) - 545408..545593 (+) 186 WP_003054392.1 hypothetical protein -
  FGL21_RS11225 (NCTC11554_00602) - 545847..546017 (+) 171 WP_014612080.1 hypothetical protein -
  FGL21_RS02955 - 546036..547141 (-) 1106 Protein_510 IS3 family transposase -
  FGL21_RS02960 (NCTC11554_00606) comE/blpR 547388..548137 (+) 750 WP_046159788.1 response regulator transcription factor Regulator
  FGL21_RS02965 (NCTC11554_00607) - 548138..549472 (+) 1335 WP_138127997.1 GHKL domain-containing protein -
  FGL21_RS02970 (NCTC11554_00608) - 549533..549658 (-) 126 WP_011017462.1 ComC/BlpC family leader-containing pheromone/bacteriocin -
  FGL21_RS02975 (NCTC11554_00609) - 549749..551113 (-) 1365 WP_012766651.1 bacteriocin secretion accessory protein -
  FGL21_RS02980 (NCTC11554_00610) comA/nlmT 551124..553277 (-) 2154 WP_014612084.1 peptide cleavage/export ABC transporter Regulator
  FGL21_RS02985 (NCTC11554_00611) - 553564..553791 (+) 228 WP_002992018.1 Blp family class II bacteriocin -
  FGL21_RS02990 (NCTC11554_00612) - 553804..554004 (+) 201 WP_003060803.1 class IIb bacteriocin, lactobin A/cerein 7B family -
  FGL21_RS03000 (NCTC11554_00614) - 554299..554493 (+) 195 WP_002990768.1 hypothetical protein -
  FGL21_RS03010 (NCTC11554_00616) - 554659..555093 (+) 435 WP_014612086.1 helix-turn-helix domain-containing protein -
  FGL21_RS03020 - 556291..556824 (+) 534 Protein_521 IS30 family transposase -
  FGL21_RS11550 - 556855..557094 (+) 240 WP_014612088.1 hypothetical protein -

Sequence


Protein


Length: 717 a.a.        Molecular weight: 80532.38 Da        Isoelectric Point: 6.8704

>NTDB_id=1127851 FGL21_RS02980 WP_014612084.1 551124..553277(-) (comA/nlmT) [Streptococcus dysgalactiae subsp. equisimilis strain NCTC11554]
MISYRKTFVAQIDARDCGVAALASIAKYYGSDYSLAHLRELAKTNKEGTTALGIVKAAKLMGFETRAIQADMTLFDIEDV
PYPFIVHVNKEGKFQHYYVVYQNKKNYLIIGDPDPTVNVTKMTKERFTSEWTGVAIFLAPEPSYKPHKDKKNGLISFLPL
IFKQRSLIFYIILASLLVTLINIVGSYYLQGILDDYIPNQLKSTLGIISIGLIITYILQQMMSFSRDYLLTVLSQRLSID
VILSYIRHIFELPMSFFATRRTGEIISRFTDANAIIDALASTILSLFLDVSILTIVGTVLLVQNTNLFLLSLVSVPLYIV
IIFIFMHPFEKMNNDVMQSNSMVSSAIIEDTNGIETIKSLTSEENRYQKIDSEFVDYLDKSFKLSKYSILQSSLKQGAQL
ILNVIVLWFGAKLVMGGKISVGQLITFNTLLSYFTNPLENIINLQTKLQSAKVANNRLNEVYLVDSEFQEAGTLVNQELL
HGDIQFEELSYKYGFGRDTLSNINLTIKQGDKVSLVGISGSGKTTLAKMIVNFFEPYNGRITINHNDLKMIDKKSLRQHI
NYLPQQAYIFNGSILENLTLGANDCTSHEDILRACEVAEIRQDIEQMPMGYQTELSDGAGLSGGQKQRIALARALLTKAP
VLILDEATSGLDVLTEKRVIDNLLAMTDKTIIFVAHRLSISARTNQVIVLDEGKIIEIGSHQELMTKQGFYHHLFSS

Nucleotide


Length: 2154 bp

>NTDB_id=1127851 FGL21_RS02980 WP_014612084.1 551124..553277(-) (comA/nlmT) [Streptococcus dysgalactiae subsp. equisimilis strain NCTC11554]
ATGATATCCTATCGAAAAACATTTGTTGCTCAGATTGATGCTAGAGACTGTGGTGTGGCTGCTCTTGCTTCCATCGCTAA
ATATTACGGTTCAGACTACTCTTTAGCTCATTTAAGAGAGTTAGCTAAAACAAATAAAGAGGGAACAACAGCGCTTGGCA
TCGTCAAAGCTGCCAAATTAATGGGATTTGAAACAAGAGCTATTCAAGCAGACATGACCCTCTTTGATATAGAAGATGTC
CCTTATCCTTTTATTGTTCATGTTAATAAAGAAGGTAAATTCCAACACTACTATGTCGTCTATCAAAACAAAAAAAATTA
TCTGATAATCGGCGACCCTGACCCGACAGTAAATGTCACTAAGATGACAAAGGAGCGATTTACTTCTGAATGGACGGGTG
TCGCTATTTTTCTAGCACCCGAGCCCAGCTACAAGCCTCATAAGGATAAAAAGAACGGTCTAATAAGCTTTTTACCTCTC
ATTTTTAAACAGCGATCTCTGATTTTCTATATCATATTAGCTAGCCTACTCGTTACTCTGATTAATATTGTAGGTTCTTA
TTACCTACAAGGTATATTAGATGACTATATTCCAAATCAACTGAAATCAACACTCGGCATTATTTCAATTGGTCTTATCA
TTACCTATATCCTCCAACAGATGATGAGTTTTTCGCGAGATTATCTCCTGACTGTACTGAGTCAGCGATTAAGCATTGAT
GTTATTTTATCCTATATTCGTCATATTTTTGAGCTTCCCATGTCTTTCTTTGCAACACGTCGGACAGGAGAAATCATTTC
ACGGTTTACAGATGCTAATGCTATTATTGATGCACTCGCTTCTACTATTTTGTCTCTTTTTCTTGATGTCAGTATTTTAA
CTATCGTAGGGACAGTTCTGTTAGTGCAAAATACCAATCTTTTTCTTCTATCTCTGGTTTCAGTCCCTCTATATATTGTT
ATTATCTTTATATTTATGCACCCATTTGAAAAGATGAATAATGATGTCATGCAGAGTAACTCTATGGTTAGCTCAGCCAT
TATCGAAGATACCAATGGCATTGAAACTATTAAATCACTGACCAGTGAAGAAAATCGTTATCAGAAAATTGATAGTGAAT
TTGTGGACTATCTGGATAAATCTTTCAAGTTAAGTAAGTACTCTATTTTACAAAGCAGTCTCAAACAAGGGGCGCAACTA
ATCCTTAACGTGATAGTCTTATGGTTTGGGGCCAAATTAGTTATGGGAGGAAAAATTTCTGTTGGTCAGTTGATTACGTT
TAATACTCTCTTATCTTATTTTACCAATCCTTTAGAAAATATCATTAATCTTCAAACTAAACTGCAATCAGCCAAGGTAG
CTAATAATCGTCTCAATGAAGTTTACCTAGTCGATTCTGAGTTTCAAGAGGCTGGGACATTAGTCAATCAAGAATTATTA
CACGGAGACATTCAATTTGAAGAGTTATCCTATAAATATGGCTTTGGACGCGATACTCTATCAAATATTAATCTCACTAT
TAAGCAAGGGGACAAAGTTAGCCTTGTTGGTATTAGTGGATCTGGTAAAACGACTCTAGCTAAAATGATTGTTAATTTTT
TTGAGCCTTATAACGGCCGAATTACCATTAATCATAATGACTTAAAAATGATTGATAAAAAAAGTCTTCGCCAACATATT
AATTATCTCCCCCAACAAGCCTATATTTTTAATGGTTCTATTTTAGAAAATTTAACCTTAGGTGCTAATGACTGTACTAG
TCATGAAGATATTCTAAGAGCTTGTGAGGTTGCAGAAATTCGACAAGATATTGAACAAATGCCTATGGGCTATCAAACAG
AGCTTTCTGATGGTGCCGGACTCTCAGGAGGCCAAAAGCAACGAATTGCACTTGCCAGAGCTCTCCTAACCAAGGCGCCT
GTGCTTATTCTAGATGAGGCTACAAGCGGTCTTGACGTACTAACTGAAAAGAGGGTTATTGATAATCTTTTGGCTATGAC
GGATAAAACTATTATTTTCGTAGCTCATCGTCTCAGCATTTCAGCGCGGACCAATCAAGTCATTGTGCTTGATGAGGGTA
AAATTATTGAAATAGGATCTCATCAAGAATTAATGACTAAACAAGGTTTCTATCATCATTTATTTAGTAGTTAA
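
Both FASTA headers in this record share one layout: database ID, locus tag, protein ID, coordinates with strand, gene name in parentheses, and organism in square brackets. The sketch below shows one way to split such a header into fields with a regular expression. The pattern and the dictionary keys are assumptions inferred from these two headers only, not a documented format specification.

```python
import re

# Pattern inferred from the headers in this record (an assumption).
HEADER_RE = re.compile(
    r"^>NTDB_id=(?P<ntdb_id>\d+) "
    r"(?P<locus_tag>\S+) "
    r"(?P<protein_id>\S+) "
    r"(?P<start>\d+)\.\.(?P<end>\d+)\((?P<strand>[+-])\) "
    r"\((?P<gene>[^)]*)\) "
    r"\[(?P<organism>.+)\]$"
)


def parse_header(line):
    """Return the header fields as a dict, or None if the line does not match."""
    match = HEADER_RE.match(line.strip())
    if match is None:
        return None
    fields = match.groupdict()
    fields["start"] = int(fields["start"])
    fields["end"] = int(fields["end"])
    return fields


header = (
    ">NTDB_id=1127851 FGL21_RS02980 WP_014612084.1 551124..553277(-) "
    "(comA/nlmT) [Streptococcus dysgalactiae subsp. equisimilis strain NCTC11554]"
)
fields = parse_header(header)
print(fields["locus_tag"], fields["gene"], fields["strand"])
```

Returning None for non-matching lines keeps the helper usable on files that mix these headers with other FASTA header styles.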


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.





Similar proteins


Only experimentally validated proteins are listed.

Protein   Organism   Identities (%)   Coverage (%)   Ha-value
comA/nlmT   Streptococcus mutans UA159   72.587   99.721   0.724
comA   Streptococcus mitis NCTC 12261   67.367   99.582   0.671
comA   Streptococcus mitis SK321   66.947   99.582   0.667
comA   Streptococcus pneumoniae Rx1   66.947   99.582   0.667
comA   Streptococcus pneumoniae D39   66.947   99.582   0.667
comA   Streptococcus pneumoniae R6   66.947   99.582   0.667
comA   Streptococcus pneumoniae TIGR4   66.667   99.582   0.664
comA   Streptococcus gordonii str. Challis substr. CH1   65.738   100   0.658

