Detailed information    

In silico: bioinformatically predicted

Overview


Name: comA
Type: Regulator
Locus tag: CPQ89_RS02585
Genome accession: NZ_CP023566
Coordinates: 510178..512337 (+)
Length: 719 a.a.
NCBI ID: WP_112193848.1
UniProt ID: A0AAD0L3G7
Organism: Ligilactobacillus murinus strain CR141
Function: processing and transport of ComC (predicted from homology)
Competence regulation

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type MGE coordinates Gene coordinates Relative position Distance (bp)
Genomic island 508502..525248 510178..512337 within 0
IScluster/Tn 508900..510072 510178..512337 flank 106
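
The relative position and distance columns above can be reproduced from the coordinates alone. The following is a minimal sketch, assuming 1-based inclusive coordinates and assuming that "flank" means the gene lies entirely outside the MGE, with the distance taken as the difference between the nearest ends. The function name `relate` and the `"overlap"` label are hypothetical; the source does not say how VRprofile2 or the database derives these columns.

```python
def relate(mge, gene):
    """Classify a gene relative to an MGE.

    Both arguments are (start, end) tuples, 1-based and inclusive.
    Returns (relative_position, distance_bp).
    """
    m_start, m_end = mge
    g_start, g_end = gene
    if m_start <= g_start and g_end <= m_end:
        return "within", 0            # gene fully contained in the MGE
    if g_start > m_end:
        return "flank", g_start - m_end   # gene lies downstream of the MGE
    if g_end < m_start:
        return "flank", m_start - g_end   # gene lies upstream of the MGE
    return "overlap", 0               # hypothetical label for partial overlap


gene = (510178, 512337)                    # comA, CPQ89_RS02585
print(relate((508502, 525248), gene))      # genomic island
print(relate((508900, 510072), gene))      # IScluster/Tn
```

With the coordinates from the table, this yields `within` with distance 0 for the genomic island and `flank` with distance 106 for the IScluster/Tn, matching the listed values.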


Gene organization within MGE regions


Location: 508502..525248
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  CPQ89_RS02575 (CPQ89_02560) - 508502..508822 (+) 321 WP_112193846.1 ArsR family transcriptional regulator -
  CPQ89_RS02580 (CPQ89_02565) - 508900..510072 (+) 1173 Protein_528 IS256 family transposase -
  CPQ89_RS02585 (CPQ89_02570) comA 510178..512337 (+) 2160 WP_112193848.1 peptide cleavage/export ABC transporter Regulator
  CPQ89_RS02590 (CPQ89_02575) - 512351..513433 (+) 1083 WP_112193850.1 HlyD family efflux transporter periplasmic adaptor subunit -
  CPQ89_RS02595 (CPQ89_02580) - 513455..514213 (+) 759 WP_112193852.1 LytR/AlgR family response regulator transcription factor -
  CPQ89_RS02600 (CPQ89_02585) - 514390..514530 (+) 141 WP_082613054.1 ComC/BlpC family leader-containing pheromone/bacteriocin -
  CPQ89_RS02605 - 514564..514716 (+) 153 WP_112193854.1 Blp family class II bacteriocin -
  CPQ89_RS11225 - 514735..514908 (+) 174 WP_143441406.1 DUF4175 domain-containing protein -
  CPQ89_RS02610 (CPQ89_02590) - 515006..515968 (-) 963 WP_309299713.1 IS3 family transposase -
  CPQ89_RS02615 (CPQ89_02595) - 515854..516567 (-) 714 WP_191981800.1 helix-turn-helix domain-containing protein -
  CPQ89_RS02620 (CPQ89_02600) - 516723..517706 (-) 984 WP_112193856.1 DUF1002 domain-containing protein -
  CPQ89_RS02625 (CPQ89_02605) pnuC 517986..518756 (+) 771 WP_112193858.1 nicotinamide riboside transporter PnuC -
  CPQ89_RS02630 (CPQ89_02610) - 518998..521013 (+) 2016 WP_039936164.1 DHH family phosphoesterase -
  CPQ89_RS02635 (CPQ89_02615) rplI 521041..521490 (+) 450 WP_004049682.1 50S ribosomal protein L9 -
  CPQ89_RS02640 (CPQ89_02620) - 521975..522859 (-) 885 WP_112193861.1 LysR family transcriptional regulator -
  CPQ89_RS02645 (CPQ89_02625) - 522982..523902 (+) 921 WP_112193863.1 DMT family transporter -
  CPQ89_RS02650 (CPQ89_02630) - 524125..524547 (+) 423 WP_112193865.1 hypothetical protein -
  CPQ89_RS02655 (CPQ89_02635) - 524631..525248 (-) 618 WP_112193867.1 hypothetical protein -

Sequence


Protein


Length: 719 a.a.; Molecular weight: 81071.26 Da; Isoelectric point: 9.2546

>NTDB_id=248493 CPQ89_RS02585 WP_112193848.1 510178..512337(+) (comA) [Ligilactobacillus murinus strain CR141]
MFNFHNYYISQIDESDCGVASLAMLLKYYGSDVSLSYLRNIAKTDKDGTTALGIVKTAQQLNFETKAIKADMRLFDIADL
HYPFIAHVIKDNQLLHYYVVISSNKKYVTIADPDPSVKVKKISKKKFASEWSGVAIFVTPSIEYIPIKERKKGLFSLFNN
VFKNKALLTNIVLSATLMTIISILGSYFIQGIIDTYIPNELSTTLGIVAVGLLVFYMFNSIFMYSKEFLLIILGQRLSID
IILGYIRHVFNLPMEFFATRKTGEIISRFNDASKIIDALASAVLSMFLDVGTVFIIGIVLAFQSLKLFLVTLSVLPVYIL
VILIFANYFEKLNSQRMESNATVSSSIIEDVRGIETIKSLNSEQQRYQHIDIEFVDFLKKSMEYSKKDILQQAIKSFIQL
SLSLIVLWIGSNLVIYNKMSLGQLMTYNALLVYFTTPLQNIINLQPKLQAAKVANNRLNEIYLVKGEHDDKHSSTKSLHN
SSGNIQFNDVSYRYGFGKNVLNDINLTINSNKKVTFVGESGSGKSTLVKLLVNFFEPTSGNITIDNQDVQQFSKKELRSF
ITYVPQNPYVFSGTILENLKLGNRRNISVNDIFKACEIAMIKDDIEKMPLQYDTILDEEGHTLSGGQKQRLTIARALLSP
AKVLILDESTSGLDTLTEKKLIENLIGMSDKTIIFIAHRLAVAEKTDTIFVLNNGTIVEQGTHQELLSKKGFYYKLVNN

Nucleotide


Length: 2160 bp

>NTDB_id=248493 CPQ89_RS02585 WP_112193848.1 510178..512337(+) (comA) [Ligilactobacillus murinus strain CR141]
ATGTTTAATTTTCATAACTACTATATATCACAAATCGATGAGAGTGATTGTGGTGTTGCTAGTTTGGCGATGCTTTTAAA
ATATTACGGCTCAGATGTTTCATTATCTTATTTGAGAAACATAGCTAAAACCGATAAGGATGGCACAACAGCATTAGGAA
TCGTCAAAACTGCTCAGCAATTAAACTTCGAAACTAAAGCTATAAAAGCAGATATGCGATTATTTGATATTGCTGATCTT
CACTATCCTTTTATTGCACATGTTATAAAAGACAATCAGCTTTTGCATTACTATGTTGTAATATCATCAAATAAAAAATA
TGTAACTATTGCTGACCCTGATCCATCAGTAAAGGTAAAAAAAATTTCCAAGAAAAAGTTTGCTTCAGAGTGGAGTGGAG
TAGCTATATTCGTAACACCTTCAATCGAATATATACCTATCAAGGAAAGAAAAAAAGGGTTATTTTCACTATTCAACAAT
GTTTTTAAAAACAAAGCGCTATTAACAAATATAGTGCTATCTGCTACTTTGATGACTATCATTAGTATCTTGGGATCCTA
TTTTATCCAAGGCATAATTGATACATACATACCTAATGAATTAAGTACCACACTAGGCATAGTCGCTGTGGGTCTTCTGG
TGTTCTATATGTTCAATTCTATTTTCATGTACTCTAAAGAATTTTTACTCATAATATTAGGACAACGGTTATCAATAGAT
ATTATTTTGGGATATATTAGGCACGTTTTTAACCTTCCGATGGAATTTTTTGCCACTAGAAAAACTGGTGAAATAATATC
CAGATTTAATGACGCAAGTAAAATTATTGACGCATTAGCGAGTGCTGTTTTATCTATGTTTTTAGACGTAGGCACTGTTT
TTATTATAGGAATAGTTTTAGCTTTCCAAAGCTTAAAGTTATTTTTAGTGACCTTATCCGTACTTCCGGTATACATTTTA
GTGATATTGATATTTGCTAACTATTTTGAAAAACTAAATTCACAACGAATGGAAAGTAATGCTACTGTTAGTTCTTCAAT
TATAGAAGATGTTAGAGGCATTGAAACAATCAAATCTTTGAATAGTGAACAGCAAAGATATCAACACATAGATATAGAAT
TTGTAGACTTTTTAAAAAAGAGTATGGAATATTCCAAAAAAGATATACTGCAACAAGCAATTAAATCTTTTATTCAACTA
AGCTTAAGTCTAATCGTACTTTGGATTGGATCAAACTTAGTCATTTACAATAAAATGTCGTTAGGACAATTGATGACATA
TAATGCATTACTAGTTTATTTTACAACACCCCTACAAAATATTATAAATCTCCAACCCAAATTACAGGCTGCAAAAGTTG
CTAATAATAGACTAAATGAGATATATCTTGTTAAAGGTGAACATGATGATAAACATTCAAGTACAAAAAGTCTCCATAAT
TCTAGTGGAAACATTCAATTTAATGATGTCTCATATAGATATGGATTTGGAAAAAACGTGTTAAACGATATAAATCTAAC
AATTAATTCAAACAAAAAAGTCACCTTTGTCGGTGAAAGTGGTTCCGGAAAATCAACTTTGGTTAAACTACTAGTTAATT
TTTTTGAACCCACTTCGGGAAATATCACAATCGACAATCAAGATGTACAGCAATTTTCTAAAAAGGAACTACGTAGTTTT
ATTACCTATGTTCCCCAAAACCCATATGTTTTTTCAGGAACAATTTTGGAAAATTTAAAGCTTGGAAATAGACGTAATAT
TTCTGTGAATGACATCTTTAAGGCTTGTGAAATAGCAATGATCAAAGATGATATTGAAAAAATGCCTTTGCAGTATGACA
CGATTCTAGATGAGGAAGGACATACATTATCTGGTGGACAAAAACAACGTTTGACGATTGCTAGAGCATTACTATCACCT
GCAAAAGTTTTAATCCTAGATGAATCAACTAGTGGTTTGGATACTCTAACAGAAAAGAAGCTTATCGAAAACCTAATTGG
AATGTCAGATAAAACAATTATTTTCATCGCCCATAGATTAGCTGTAGCAGAAAAAACGGATACTATTTTTGTATTGAATA
ACGGAACTATAGTTGAACAAGGAACCCACCAGGAATTATTATCTAAAAAAGGGTTTTACTATAAATTGGTAAATAATTAA
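
As a consistency check between the nucleotide and protein records, the sketch below derives the gene and protein lengths from the coordinates and translates the first and last codons of the nucleotide sequence. It assumes the standard codon assignments (which the bacterial translation table shares for elongation codons); the helper name `translate` and the choice to check only the sequence ends are illustrative, not part of the source.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna):
    """Translate a DNA string in frame 1; '*' marks a stop codon."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3      # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


start, end = 510178, 512337               # comA coordinates from the record
gene_len = end - start + 1                # inclusive coordinates
protein_len = gene_len // 3 - 1           # subtract the stop codon
print(gene_len, protein_len)              # 2160 719

first_30_nt = "ATGTTTAATTTTCATAACTACTATATATCA"   # start of the nucleotide record
last_18_nt = "AAATTGGTAAATAATTAA"                # end of the nucleotide record
print(translate(first_30_nt))             # MFNFHNYYIS
print(translate(last_18_nt))              # KLVNN*
```

The translated ends agree with the start (`MFNFHNYYIS`) and end (`KLVNN`) of the protein sequence listed above, and the lengths agree with the stated 2160 bp and 719 a.a.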


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure



Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.





Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comA Streptococcus mitis SK321 56.162 99.305 0.558
  comA Streptococcus mitis NCTC 12261 56.162 99.305 0.558
  comA Streptococcus pneumoniae Rx1 56.022 99.305 0.556
  comA Streptococcus pneumoniae D39 56.022 99.305 0.556
  comA Streptococcus pneumoniae R6 56.022 99.305 0.556
  comA Streptococcus pneumoniae TIGR4 56.022 99.305 0.556
  comA Streptococcus gordonii str. Challis substr. CH1 55.742 99.305 0.554
  comA/nlmT Streptococcus mutans UA159 54.406 99.444 0.541
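
The page does not define the Ha-value. The listed numbers are, however, numerically consistent with the product of the identity and coverage fractions rounded to three decimals. The sketch below checks that observation against the rows above; the formula is an assumption inferred from the values, and the function name `ha_value` is hypothetical.

```python
# (organism, identities %, coverage %, listed Ha-value), taken from the table above
hits = [
    ("Streptococcus mitis SK321", 56.162, 99.305, 0.558),
    ("Streptococcus mitis NCTC 12261", 56.162, 99.305, 0.558),
    ("Streptococcus pneumoniae Rx1", 56.022, 99.305, 0.556),
    ("Streptococcus pneumoniae D39", 56.022, 99.305, 0.556),
    ("Streptococcus pneumoniae R6", 56.022, 99.305, 0.556),
    ("Streptococcus pneumoniae TIGR4", 56.022, 99.305, 0.556),
    ("Streptococcus gordonii str. Challis substr. CH1", 55.742, 99.305, 0.554),
    ("Streptococcus mutans UA159", 54.406, 99.444, 0.541),
]


def ha_value(identity_pct, coverage_pct):
    """Assumed relationship: identity fraction times coverage fraction."""
    return round((identity_pct / 100) * (coverage_pct / 100), 3)


for organism, identity, coverage, listed in hits:
    computed = ha_value(identity, coverage)
    matches = abs(computed - listed) < 5e-4
    print(f"{organism}: computed {computed:.3f}, listed {listed:.3f}, match={matches}")
```

If the assumed relationship holds, the value summarizes how much of the query is both aligned and identical, which is why hits with near-identical coverage rank in the same order as their identities.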


Multiple sequence alignment