Detailed information    

insolico Bioinformatically predicted

Overview


Name   comA   Type   Regulator
Locus tag   LQF73_RS10230 Genome accession   NZ_CP088015
Coordinates   2067028..2069190 (-) Length   720 a.a.
NCBI ID   WP_068813212.1    Uniprot ID   A0A2N5KWZ3
Organism   Lactobacillus crispatus strain M247     
Function   processing and transport of ComC (predicted from homology)   
Competence regulation

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type MGE coordinates Gene coordinates Relative position Distance (bp)
Prophage 2040711..2069193 2067028..2069190 within 0


Gene organization within MGE regions


Location: 2040711..2069193
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  LQF73_RS10055 (LQF73_10005) - 2040711..2041604 (-) 894 WP_005718198.1 C40 family peptidase -
  LQF73_RS10060 (LQF73_10010) - 2041774..2042517 (-) 744 WP_005726403.1 C40 family peptidase -
  LQF73_RS10065 (LQF73_10015) - 2042786..2043334 (-) 549 WP_005722445.1 C40 family peptidase -
  LQF73_RS10070 (LQF73_10020) - 2043505..2044047 (-) 543 WP_005718194.1 guanylate kinase -
  LQF73_RS10075 (LQF73_10025) - 2044144..2044446 (+) 303 WP_005718193.1 DUF2187 family protein -
  LQF73_RS10080 (LQF73_10030) - 2044532..2046040 (+) 1509 WP_005718192.1 O-antigen ligase family protein -
  LQF73_RS10085 (LQF73_10035) - 2046095..2046718 (-) 624 WP_005722447.1 LVIS_2131 family protein -
  LQF73_RS10090 (LQF73_10040) - 2046906..2048057 (-) 1152 WP_020993065.1 RNA-guided endonuclease TnpB family protein -
  LQF73_RS10095 (LQF73_10045) nrdI 2048350..2048796 (+) 447 WP_005718188.1 class Ib ribonucleoside-diphosphate reductase assembly flavoprotein NrdI -
  LQF73_RS10100 (LQF73_10050) - 2048799..2049734 (+) 936 WP_005718186.1 ribonucleotide-diphosphate reductase subunit beta -
  LQF73_RS10105 (LQF73_10055) - 2049774..2050304 (-) 531 WP_005718185.1 hypothetical protein -
  LQF73_RS10110 (LQF73_10060) - 2050616..2051794 (+) 1179 WP_005727505.1 IS256 family transposase -
  LQF73_RS10115 (LQF73_10065) - 2051928..2052515 (+) 588 WP_005729079.1 C39 family peptidase -
  LQF73_RS10120 (LQF73_10070) - 2052712..2053197 (-) 486 WP_005718183.1 threonine/serine exporter family protein -
  LQF73_RS10125 (LQF73_10075) - 2053200..2053970 (-) 771 WP_005718182.1 threonine/serine exporter family protein -
  LQF73_RS10130 (LQF73_10080) - 2053981..2055522 (-) 1542 WP_005718181.1 ABC-F family ATP-binding cassette domain-containing protein -
  LQF73_RS10135 (LQF73_10085) - 2055536..2055913 (-) 378 WP_005718180.1 hypothetical protein -
  LQF73_RS10140 (LQF73_10090) - 2056058..2056609 (+) 552 WP_005722451.1 GNAT family N-acetyltransferase -
  LQF73_RS10145 (LQF73_10095) - 2056924..2057085 (+) 162 WP_005718177.1 hypothetical protein -
  LQF73_RS10150 (LQF73_10100) - 2057099..2057482 (+) 384 WP_005718176.1 PBECR4 domain-containing protein -
  LQF73_RS10155 (LQF73_10105) - 2057479..2058168 (-) 690 WP_005718174.1 DUF554 domain-containing protein -
  LQF73_RS10160 (LQF73_10110) - 2058233..2059471 (-) 1239 WP_005718173.1 LCP family protein -
  LQF73_RS10165 (LQF73_10115) pepF 2059639..2061435 (+) 1797 WP_005718172.1 oligoendopeptidase F Regulator
  LQF73_RS10170 (LQF73_10120) - 2061587..2061802 (+) 216 WP_005723825.1 AbrB/MazE/SpoVT family DNA-binding domain-containing protein -
  LQF73_RS10175 (LQF73_10125) - 2061806..2062213 (+) 408 WP_005723824.1 type II toxin-antitoxin system death-on-curing family toxin -
  LQF73_RS10180 (LQF73_10130) - 2062327..2062602 (-) 276 WP_005723823.1 bacteriocin immunity protein -
  LQF73_RS10185 (LQF73_10135) - 2062606..2062824 (-) 219 WP_005726358.1 TIGR04139 family peptide modification target -
  LQF73_RS10190 (LQF73_10140) - 2062998..2063294 (-) 297 WP_005725464.1 bacteriocin immunity protein -
  LQF73_RS10195 (LQF73_10145) - 2063310..2063570 (-) 261 WP_005726357.1 thioredoxin domain-containing protein -
  LQF73_RS10200 (LQF73_10150) - 2063757..2064935 (+) 1179 WP_005727388.1 IS256 family transposase -
  LQF73_RS10205 (LQF73_10155) - 2065113..2065259 (-) 147 WP_005723821.1 hypothetical protein -
  LQF73_RS10210 (LQF73_10160) - 2065269..2065415 (-) 147 WP_005727423.1 hypothetical protein -
  LQF73_RS10215 (LQF73_10165) - 2065718..2065993 (-) 276 WP_005723819.1 bacteriocin immunity protein -
  LQF73_RS10220 (LQF73_10170) - 2066015..2066212 (-) 198 WP_005723817.1 hypothetical protein -
  LQF73_RS10225 (LQF73_10175) - 2066379..2067017 (-) 639 WP_225520247.1 HlyD family efflux transporter periplasmic adaptor subunit -
  LQF73_RS10230 (LQF73_10180) comA 2067028..2069190 (-) 2163 WP_068813212.1 peptide cleavage/export ABC transporter Regulator

Sequence


Protein


Download         Length: 720 a.a.        Molecular weight: 80952.45 Da        Isoelectric Point: 9.5103

>NTDB_id=632441 LQF73_RS10230 WP_068813212.1 2067028..2069190(-) (comA) [Lactobacillus crispatus strain M247]
MGMLTYKSIYVPQIDETDCGAACLAMILKNYHSQVSIAHLRHIAKTNTEGTTALGLVKTAEKFNMDVQAVKADMSLFNMK
DIQYPFIVHVIKDGGLLHYYVVLKSTKRKIIVADPDPTSGIKKMSKKAFEKEWTGITLFMVPKADFKPVKEKKNNLLSLF
PYMFKQKKLVRNIILAALLMTLISICSSYFVQGLIDTYIPNGTFTTLSVLAIGLLIAYVFNSIFSYGQQFLLNVLGQRLS
IDLNLQYIKHIFELPMEFFTTRKTGEITSRFSDASRIIDALASTVISLFLDLSIVILMGIILAVQNMTLFLITLASLPLY
AIVILGFSKRFERLNNDQMESNAVVSSSIIEDIQGIETIKALNSEQVRYRKIDSQFVDYLKKAFKYSKTEAFQTALKTFI
RLSLNVIILWVGASVVMHNQMSIGELMAFNALLSYFIDPLQNIINLQPTLQAANVAQNRLNEVYLVQSEFKGKTSVNDSH
DLNGDINFSHVDYRYGYGEDVLKDINLTIKDNEKLTIVGMSGSGKSTMVKLLVDFFSPTKGKVTFNGHSTRSINKHVLRS
YVNYVPQTPYIFAGTIKENLLLGSRKNITEKEIEEACQIAEIKNDIEKLPLGFNTQLDENAKILSGGQKQRLTIARALLS
PAKVLIFDEATSGLDTITEKKVVDNLIRLKNKTVIFIAHRLAIAERTDNIVVLNQGQIVEQGNHSELMAKHGYYYDLVKS

Nucleotide


Download         Length: 2163 bp        

>NTDB_id=632441 LQF73_RS10230 WP_068813212.1 2067028..2069190(-) (comA) [Lactobacillus crispatus strain M247]
ATGGGCATGTTAACGTATAAATCAATTTATGTTCCACAAATCGATGAAACAGACTGTGGTGCAGCTTGTTTAGCAATGAT
CTTAAAGAATTATCATTCTCAAGTATCAATAGCACATTTACGTCATATTGCTAAAACCAATACTGAAGGAACTACTGCTT
TAGGATTAGTTAAAACTGCGGAAAAATTCAATATGGATGTTCAAGCGGTAAAAGCAGATATGTCCTTATTCAATATGAAA
GATATTCAATATCCTTTTATTGTTCATGTGATAAAAGATGGAGGATTGTTACATTATTATGTAGTTCTTAAAAGTACTAA
AAGAAAAATAATCGTAGCCGATCCTGATCCTACTAGTGGAATTAAAAAAATGTCCAAAAAGGCGTTTGAAAAAGAGTGGA
CAGGTATTACATTGTTTATGGTGCCTAAGGCAGATTTTAAGCCTGTAAAGGAGAAAAAGAATAATCTATTATCATTATTC
CCTTATATGTTTAAACAGAAAAAGTTAGTTAGAAACATTATTTTAGCTGCTTTGCTGATGACACTGATTAGTATCTGTAG
CTCATATTTTGTACAAGGGTTGATTGATACATATATTCCTAACGGTACATTTACCACGTTATCTGTTTTAGCTATTGGGT
TATTGATTGCATATGTCTTTAACTCAATTTTTTCATATGGTCAGCAATTTTTATTGAATGTTTTGGGGCAGCGGTTATCA
ATCGATCTTAATTTACAATACATTAAGCATATTTTTGAATTACCGATGGAATTTTTTACTACCAGAAAAACTGGTGAGAT
TACTTCTCGTTTTTCTGATGCAAGTAGAATTATAGATGCATTGGCAAGTACGGTGATTTCATTATTCCTTGATCTTTCAA
TTGTAATTTTAATGGGTATAATTTTGGCGGTTCAAAACATGACATTGTTTTTGATTACGCTAGCTTCATTACCACTCTAT
GCTATTGTTATTCTCGGCTTTTCAAAAAGGTTTGAAAGATTAAATAATGACCAGATGGAAAGTAATGCGGTAGTTAGTTC
ATCAATTATTGAAGATATTCAAGGAATTGAGACGATTAAAGCATTAAACAGTGAACAAGTACGTTATCGAAAAATTGATA
GTCAGTTTGTTGATTATTTAAAAAAAGCTTTTAAGTATAGCAAAACAGAAGCTTTTCAAACTGCTTTGAAAACTTTTATT
AGATTATCACTGAATGTCATTATTCTTTGGGTGGGGGCAAGTGTTGTAATGCATAATCAGATGTCTATTGGTGAATTGAT
GGCATTTAATGCATTATTATCATATTTTATTGATCCATTACAAAATATTATTAACTTGCAACCTACACTTCAAGCTGCAA
ATGTAGCTCAGAATCGTTTAAATGAAGTTTATTTAGTTCAAAGTGAATTTAAGGGCAAAACTTCAGTTAATGATAGTCAT
GATTTAAATGGAGACATTAATTTCTCACATGTCGATTATCGTTATGGATATGGTGAAGATGTCTTAAAAGATATTAATTT
GACGATCAAAGATAATGAGAAATTAACTATTGTGGGAATGAGTGGATCAGGTAAGTCAACTATGGTTAAATTGTTGGTTG
ATTTTTTCAGTCCCACAAAAGGAAAAGTAACTTTTAATGGTCATTCGACTAGAAGTATAAATAAGCATGTTTTACGCTCA
TATGTGAATTATGTGCCACAAACTCCATATATTTTTGCAGGTACCATTAAAGAGAATCTTTTGTTGGGAAGTCGTAAGAA
TATAACTGAAAAAGAGATAGAAGAAGCTTGCCAAATCGCGGAGATAAAGAATGACATTGAAAAATTGCCATTAGGATTTA
ATACACAATTAGATGAAAATGCTAAGATTTTGTCAGGTGGTCAGAAGCAAAGATTAACAATTGCTAGGGCCTTACTTTCA
CCTGCTAAGGTATTGATTTTTGATGAAGCAACAAGTGGGCTTGATACAATTACTGAGAAAAAAGTGGTTGATAATTTAAT
TAGGTTAAAGAATAAAACTGTTATATTTATTGCGCATCGTTTAGCTATTGCTGAAAGAACTGACAATATTGTGGTATTGA
ATCAGGGTCAAATTGTAGAACAAGGTAATCACAGTGAATTAATGGCTAAACATGGGTATTATTATGATTTGGTGAAAAGC
TAG


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB A0A2N5KWZ3

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comA Streptococcus mitis NCTC 12261

55.602

99.167

0.551

  comA Streptococcus pneumoniae Rx1

55.462

99.167

0.55

  comA Streptococcus pneumoniae D39

55.462

99.167

0.55

  comA Streptococcus pneumoniae R6

55.462

99.167

0.55

  comA Streptococcus pneumoniae TIGR4

55.322

99.167

0.549

  comA Streptococcus mitis SK321

55.182

99.167

0.547

  comA Streptococcus gordonii str. Challis substr. CH1

54.342

99.167

0.539

  comA/nlmT Streptococcus mutans UA159

54.173

98.194

0.532