Detailed information    

insolico Bioinformatically predicted

Overview


Name   comA   Type   Regulator
Locus tag   DT351_RS07870 Genome accession   NZ_CP031003
Coordinates   1498356..1500512 (-) Length   718 a.a.
NCBI ID   WP_116843679.1    Uniprot ID   A0A385AFB1
Organism   Latilactobacillus curvatus strain TMW 1.1928     
Function   processing and transport of ComC (predicted from homology)   
Competence regulation

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type MGE coordinates Gene coordinates Relative position Distance (bp)
Prophage 1479521..1508927 1498356..1500512 within 0
IScluster/Tn 1500816..1503883 1498356..1500512 flank 304


Gene organization within MGE regions


Location: 1479521..1508927
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  DT351_RS07740 (DT351_07740) - 1479521..1479832 (-) 312 WP_162875349.1 ArpU family phage packaging/lysis transcriptional regulator -
  DT351_RS07745 (DT351_07745) - 1479867..1480286 (-) 420 WP_076786405.1 ArpU family phage packaging/lysis transcriptional regulator -
  DT351_RS07750 (DT351_07750) - 1480539..1480868 (+) 330 Protein_1512 integrase core domain-containing protein -
  DT351_RS07755 (DT351_07755) - 1480965..1481216 (-) 252 WP_116843666.1 Arm DNA-binding domain-containing protein -
  DT351_RS07760 (DT351_07760) - 1481436..1482290 (-) 855 WP_162875350.1 hypothetical protein -
  DT351_RS07765 (DT351_07765) - 1482283..1483643 (+) 1361 WP_116843668.1 IS3 family transposase -
  DT351_RS11435 - 1483652..1483822 (-) 171 WP_162875351.1 hypothetical protein -
  DT351_RS07770 (DT351_07770) - 1483968..1484528 (-) 561 WP_116843669.1 DUF4352 domain-containing protein -
  DT351_RS07775 (DT351_07775) - 1484623..1485183 (-) 561 WP_116843670.1 Ltp family lipoprotein -
  DT351_RS07785 (DT351_07785) - 1485666..1486004 (+) 339 Protein_1519 IS30 family transposase -
  DT351_RS07790 (DT351_07790) - 1486237..1487157 (-) 921 WP_011373852.1 IS30-like element ISLsa1 family transposase -
  DT351_RS07795 (DT351_07795) - 1487283..1487675 (-) 393 WP_116843671.1 hypothetical protein -
  DT351_RS07800 (DT351_07800) - 1487821..1488752 (-) 932 Protein_1522 IS30-like element ISLpl1 family transposase -
  DT351_RS07810 (DT351_07810) - 1489137..1490918 (+) 1782 WP_116843673.1 glycerophosphodiester phosphodiesterase -
  DT351_RS07815 (DT351_07815) - 1490974..1491249 (-) 276 WP_004271077.1 hypothetical protein -
  DT351_RS07820 (DT351_07820) - 1491325..1492680 (-) 1356 WP_116843674.1 FAD-dependent oxidoreductase -
  DT351_RS07825 (DT351_07825) - 1492827..1493267 (+) 441 WP_240327062.1 hypothetical protein -
  DT351_RS07830 (DT351_07830) - 1493315..1493935 (+) 621 WP_116843676.1 endonuclease III -
  DT351_RS07835 (DT351_07835) tdcB 1493973..1495013 (-) 1041 WP_116843677.1 bifunctional threonine ammonia-lyase/L-serine ammonia-lyase TdcB -
  DT351_RS07840 (DT351_07840) - 1495229..1495561 (+) 333 WP_004271076.1 hypothetical protein -
  DT351_RS07845 (DT351_07845) - 1495607..1495804 (-) 198 WP_004271075.1 hypothetical protein -
  DT351_RS07850 (DT351_07850) - 1495905..1496231 (-) 327 WP_004271083.1 bacteriocin immunity protein -
  DT351_RS07860 (DT351_07860) - 1496559..1496762 (-) 204 WP_076790047.1 Blp family class II bacteriocin -
  DT351_RS07865 (DT351_07865) - 1496965..1498341 (-) 1377 WP_099947277.1 HlyD family secretion protein -
  DT351_RS07870 (DT351_07870) comA 1498356..1500512 (-) 2157 WP_116843679.1 peptide cleavage/export ABC transporter Regulator
  DT351_RS07875 (DT351_07875) - 1500816..1501885 (+) 1070 WP_112234043.1 IS3-like element IS1163 family transposase -
  DT351_RS07880 (DT351_07880) - 1502046..1502342 (-) 297 WP_076787949.1 bacteriocin immunity protein -
  DT351_RS07885 (DT351_07885) - 1502436..1502621 (-) 186 WP_056946897.1 leucocin A/sakacin P family class II bacteriocin -
  DT351_RS07895 (DT351_07895) - 1503948..1504697 (-) 750 WP_116843680.1 response regulator transcription factor -
  DT351_RS07900 (DT351_07900) - 1504699..1505865 (-) 1167 WP_035187109.1 GHKL domain-containing protein -
  DT351_RS07905 (DT351_07905) - 1506064..1506177 (-) 114 WP_004271310.1 bacteriocin -
  DT351_RS07910 (DT351_07910) - 1506352..1507404 (-) 1053 WP_089541951.1 IS30 family transposase -
  DT351_RS07915 (DT351_07915) - 1507621..1507875 (-) 255 WP_004271311.1 hypothetical protein -
  DT351_RS07920 (DT351_07920) - 1508091..1508438 (+) 348 WP_054644151.1 bacteriocin immunity protein -
  DT351_RS07925 (DT351_07925) - 1508637..1508927 (-) 291 WP_004265964.1 hypothetical protein -

Sequence


Protein


Download         Length: 718 a.a.        Molecular weight: 80204.37 Da        Isoelectric Point: 5.8513

>NTDB_id=302782 DT351_RS07870 WP_116843679.1 1498356..1500512(-) (comA) [Latilactobacillus curvatus strain TMW 1.1928]
MIDRKIYIPQVDEMDCGVAALAMILKHYGSSASLAYLRNEAKTDLEGTTALGLVKTAENLGFETKAIQADMSLFEVADLP
FPFIAHVLKNGELLHYYVVLGIKKDAIIIADPDPTVGIAKMTQSQFKSEWTGVALFIAPKPSYQPVKQQTKASLFSFVPT
LMQQKRLVINITLAALLITIISILGSYFLQTVIDTYIPNNMQSTLAIVAIGLIIFYGFQSIFTYAQNFLLAVLGQRLSIE
IILSYLRHVFELPMSFFATRKTGEIISRFNDANKIIDALASAIISIFLDITVVVIMGIILIIQNTTLFWLTLAVIPIYIA
VITLFAKPFEKLNQKEMESNAVLNSAIIEDLHGIETIKALTSESERYQKIDTAFVDYLKKSFDYVKADTLQQAIKLFVQL
GLNVFVLWVGAILVTHNELSIGQLMTYNALLAYFVNPLQNIINLQTKLQSARVANNRLNEVYLVDSEFKTERPIKSESQL
VGPIELQNVSYQYGYGQRVLESISLKIEPNEKLTIVGMSGSGKSTLVKLLVNFFEPTEGKVLFNQQDTQTIDKKTLRQFV
NYIAQTPYIFSGSIWDNLKLGNRPGITQADIMHACELAEIKNDIEKMPLQFETLLDEDGNTLSGGQKQRLTIARALLSPA
RVLIFDESTSGLDTITEGLLVDHLMALEERTIIFIAHRLAIAKKTNNILVLENGSVVETGTHDALVDTQGRYYQLLNH

Nucleotide


Download         Length: 2157 bp        

>NTDB_id=302782 DT351_RS07870 WP_116843679.1 1498356..1500512(-) (comA) [Latilactobacillus curvatus strain TMW 1.1928]
ATGATAGATCGTAAAATTTATATACCACAAGTAGATGAAATGGATTGTGGCGTAGCAGCCTTAGCGATGATTCTTAAACA
TTATGGTTCTTCAGCTTCGCTGGCTTATTTACGTAATGAGGCTAAAACTGATTTAGAAGGAACAACGGCGCTTGGACTAG
TGAAGACAGCTGAGAATTTAGGTTTCGAAACGAAGGCGATTCAAGCTGATATGAGTTTGTTTGAAGTCGCAGATTTACCA
TTTCCTTTTATTGCACATGTTCTAAAAAATGGCGAGCTATTACATTATTATGTTGTACTAGGGATAAAAAAAGATGCCAT
TATAATCGCCGATCCTGATCCAACAGTGGGCATCGCTAAGATGACGCAATCACAGTTTAAATCGGAATGGACGGGCGTTG
CATTATTTATTGCTCCAAAACCGAGCTATCAACCAGTTAAGCAGCAAACAAAAGCGAGTTTATTTTCATTTGTTCCAACT
TTGATGCAGCAAAAGCGATTGGTAATTAATATTACTTTGGCGGCGCTACTCATAACCATTATTAGTATTCTAGGATCTTA
TTTCTTACAGACTGTCATTGATACCTATATCCCAAATAACATGCAGAGCACGCTGGCGATTGTTGCAATTGGTTTAATTA
TCTTTTATGGCTTCCAGTCAATCTTTACATATGCTCAAAATTTTTTATTGGCGGTTTTAGGACAGCGGTTATCAATTGAG
ATTATTTTAAGTTACTTAAGACATGTTTTTGAATTACCAATGTCTTTTTTTGCAACACGTAAAACAGGTGAAATAATTTC
ACGATTTAATGACGCGAATAAAATTATTGATGCGCTAGCAAGCGCAATTATTTCAATTTTTTTAGATATTACAGTCGTTG
TTATTATGGGGATTATTCTGATCATTCAAAATACAACTTTATTCTGGTTAACCTTAGCAGTGATTCCAATCTATATTGCC
GTGATTACTTTATTTGCAAAACCTTTTGAAAAATTAAATCAAAAGGAAATGGAAAGTAATGCAGTATTGAATTCGGCGAT
CATTGAAGATTTACATGGCATTGAAACGATTAAAGCGTTAACTAGTGAAAGTGAACGATATCAAAAAATAGATACAGCAT
TTGTTGATTATCTGAAGAAGAGTTTTGACTATGTAAAAGCTGATACATTGCAACAAGCAATTAAGTTGTTTGTCCAATTA
GGGCTGAATGTCTTTGTGCTGTGGGTAGGCGCTATTTTAGTCACGCATAACGAGTTATCAATTGGGCAATTGATGACCTA
CAATGCCTTGCTGGCGTATTTTGTAAATCCGTTACAGAATATCATCAATTTACAGACTAAGTTACAGAGCGCGAGGGTAG
CCAATAATCGGTTGAATGAAGTTTATCTGGTTGATTCTGAGTTCAAGACAGAACGACCAATTAAATCCGAGTCACAGTTG
GTAGGCCCAATTGAGTTACAAAATGTGAGTTACCAATATGGTTATGGGCAGAGAGTGCTTGAATCGATTAGTTTAAAGAT
TGAGCCGAATGAAAAATTAACAATTGTGGGCATGAGCGGTTCGGGGAAATCAACATTGGTTAAGTTACTCGTTAATTTTT
TTGAACCAACGGAGGGAAAGGTTCTTTTTAATCAACAAGATACACAGACAATTGATAAAAAGACATTGCGGCAGTTTGTT
AACTATATTGCACAGACACCGTACATTTTCTCAGGAAGTATTTGGGATAATCTTAAGTTAGGTAATCGGCCAGGAATCAC
ACAAGCGGACATCATGCATGCGTGTGAGCTAGCAGAGATAAAGAATGATATTGAAAAAATGCCGCTCCAATTTGAAACAT
TATTGGATGAAGATGGAAACACATTGTCAGGTGGTCAAAAACAGCGATTAACAATTGCGCGAGCACTATTATCACCAGCT
CGGGTACTAATTTTTGATGAGTCTACTAGTGGGTTAGATACGATAACAGAGGGGCTATTAGTTGATCATCTAATGGCGTT
AGAAGAAAGAACAATCATTTTCATAGCGCATCGTTTAGCGATTGCTAAGAAAACGAACAATATTCTTGTTTTAGAAAATG
GTAGTGTTGTTGAAACTGGGACACATGATGCGTTGGTGGATACACAAGGGCGATATTACCAATTGCTCAATCATTAG


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB A0A385AFB1

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comA Streptococcus mitis NCTC 12261

58.824

99.443

0.585

  comA Streptococcus pneumoniae TIGR4

58.263

99.443

0.579

  comA Streptococcus mitis SK321

58.263

99.443

0.579

  comA Streptococcus pneumoniae R6

58.123

99.443

0.578

  comA Streptococcus pneumoniae Rx1

58.123

99.443

0.578

  comA Streptococcus pneumoniae D39

58.123

99.443

0.578

  comA Streptococcus gordonii str. Challis substr. CH1

56.723

99.443

0.564

  comA/nlmT Streptococcus mutans UA159

54.366

98.886

0.538


Multiple sequence alignment