Detailed information    

insolico Bioinformatically predicted

Overview


Name   comA   Type   Regulator
Locus tag   ACLV7H_RS00450 Genome accession   NZ_OZ217340
Coordinates   65712..67865 (+) Length   717 a.a.
NCBI ID   WP_411866066.1    Uniprot ID   -
Organism   Streptococcus oralis isolate S. oralis A22     
Function   processing and transport of ComC (predicted from homology)   
Competence regulation

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type MGE coordinates Gene coordinates Relative position Distance (bp)
Prophage 11789..79770 65712..67865 within 0


Gene organization within MGE regions


Location: 11789..79770
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  ACLV7H_RS00065 (SORA22_00130) ftsH 11789..13747 (+) 1959 WP_045592596.1 ATP-dependent zinc metalloprotease FtsH -
  ACLV7H_RS00070 - 13787..14047 (+) 261 Protein_13 23S rRNA (uracil-5-)-methyltransferase RumA -
  ACLV7H_RS00105 (SORA22_00170) comW 19587..19823 (+) 237 WP_045593760.1 sigma(X)-activator ComW -
  ACLV7H_RS00110 (SORA22_00180) - 20065..21351 (+) 1287 WP_411866023.1 adenylosuccinate synthase -
  ACLV7H_RS00115 (SORA22_00200) - 21612..22760 (-) 1149 WP_411866024.1 tyrosine-type recombinase/integrase -
  ACLV7H_RS00120 (SORA22_00210) - 22933..23358 (-) 426 WP_411866025.1 Ltp family lipoprotein -
  ACLV7H_RS00125 (SORA22_00220) - 23368..23757 (-) 390 WP_411866026.1 ImmA/IrrE family metallo-endopeptidase -
  ACLV7H_RS00130 (SORA22_00230) - 23770..24135 (-) 366 WP_411866027.1 helix-turn-helix domain-containing protein -
  ACLV7H_RS00135 (SORA22_00240) - 24418..24564 (+) 147 WP_001001334.1 hypothetical protein -
  ACLV7H_RS00140 (SORA22_00250) - 24557..24760 (+) 204 WP_038804347.1 transcriptional regulator -
  ACLV7H_RS00145 (SORA22_00260) - 24903..25616 (+) 714 WP_411866028.1 ORF6C domain-containing protein -
  ACLV7H_RS00150 (SORA22_00270) - 25630..25887 (+) 258 WP_000370959.1 hypothetical protein -
  ACLV7H_RS00155 (SORA22_00280) - 25968..26180 (+) 213 WP_247925709.1 hypothetical protein -
  ACLV7H_RS00160 (SORA22_00290) - 26201..27073 (+) 873 WP_411866029.1 phage replisome organizer N-terminal domain-containing protein -
  ACLV7H_RS00165 - 27073..27267 (+) 195 Protein_26 hypothetical protein -
  ACLV7H_RS00170 (SORA22_00310) - 27264..27725 (+) 462 WP_411866030.1 class I SAM-dependent methyltransferase -
  ACLV7H_RS00175 (SORA22_00320) - 27735..27956 (+) 222 WP_411866031.1 hypothetical protein -
  ACLV7H_RS00180 (SORA22_00330) - 27958..28449 (+) 492 WP_411866032.1 MazG-like family protein -
  ACLV7H_RS00185 (SORA22_00340) - 28451..28717 (+) 267 WP_411866033.1 hypothetical protein -
  ACLV7H_RS00190 (SORA22_00350) - 28732..28881 (+) 150 WP_015997044.1 hypothetical protein -
  ACLV7H_RS00195 (SORA22_00360) - 28878..29201 (+) 324 WP_411866034.1 3-dehydroquinate synthase -
  ACLV7H_RS00200 (SORA22_00380) ssbA 29288..29704 (+) 417 WP_270747357.1 single-stranded DNA-binding protein Machinery gene
  ACLV7H_RS00205 (SORA22_00390) - 29718..30002 (+) 285 WP_124788199.1 hypothetical protein -
  ACLV7H_RS00210 (SORA22_00400) - 30002..30682 (+) 681 WP_411866035.1 DUF1642 domain-containing protein -
  ACLV7H_RS00215 (SORA22_00410) - 30846..31106 (+) 261 WP_411866036.1 DUF1372 family protein -
  ACLV7H_RS00220 (SORA22_00420) - 31106..31357 (+) 252 WP_411866037.1 hypothetical protein -
  ACLV7H_RS00225 (SORA22_00430) - 31348..31686 (+) 339 WP_411866038.1 helix-turn-helix domain-containing protein -
  ACLV7H_RS00230 (SORA22_00440) - 31683..32084 (+) 402 WP_411866039.1 transcriptional regulator -
  ACLV7H_RS00235 (SORA22_00450) - 32277..32819 (+) 543 WP_033585091.1 site-specific integrase -
  ACLV7H_RS00240 (SORA22_00460) - 33280..33600 (+) 321 WP_195594768.1 HNH endonuclease -
  ACLV7H_RS00245 (SORA22_00470) - 33737..34129 (+) 393 WP_001118282.1 P27 family phage terminase small subunit -
  ACLV7H_RS00250 (SORA22_00480) - 34122..35852 (+) 1731 WP_411866040.1 terminase large subunit -
  ACLV7H_RS00255 (SORA22_00490) - 35860..36078 (+) 219 WP_411866041.1 hypothetical protein -
  ACLV7H_RS00260 (SORA22_00500) - 36096..37298 (+) 1203 WP_411866042.1 phage portal protein -
  ACLV7H_RS00265 (SORA22_00510) - 37282..37872 (+) 591 WP_411866043.1 HK97 family phage prohead protease -
  ACLV7H_RS00270 (SORA22_00520) - 37857..39029 (+) 1173 WP_411866044.1 phage major capsid protein -
  ACLV7H_RS00275 (SORA22_00530) - 39041..39331 (+) 291 WP_411866045.1 hypothetical protein -
  ACLV7H_RS00280 (SORA22_00540) - 39334..39615 (+) 282 WP_411866046.1 hypothetical protein -
  ACLV7H_RS00285 (SORA22_00550) - 39602..39901 (+) 300 WP_049505450.1 phage head closure protein -
  ACLV7H_RS00290 (SORA22_00560) - 39898..40245 (+) 348 WP_000063884.1 HK97 gp10 family phage protein -
  ACLV7H_RS00295 (SORA22_00570) - 40242..40565 (+) 324 WP_049505449.1 hypothetical protein -
  ACLV7H_RS00300 (SORA22_00580) - 40577..41155 (+) 579 WP_015997064.1 major tail protein -
  ACLV7H_RS00305 (SORA22_00590) - 41167..41586 (+) 420 WP_411866047.1 hypothetical protein -
  ACLV7H_RS00310 (SORA22_00600) - 41864..44884 (+) 3021 WP_411866048.1 phage tail tape measure protein -
  ACLV7H_RS00315 (SORA22_00610) - 44881..45603 (+) 723 WP_411866049.1 phage tail protein -
  ACLV7H_RS00320 (SORA22_00620) - 45604..48993 (+) 3390 WP_411866050.1 phage tail spike protein -
  ACLV7H_RS00325 (SORA22_00630) - 48990..49202 (+) 213 WP_411866051.1 hypothetical protein -
  ACLV7H_RS00330 (SORA22_00640) - 49189..49797 (+) 609 WP_411866052.1 hypothetical protein -
  ACLV7H_RS00335 (SORA22_00650) - 49807..50229 (+) 423 WP_411866053.1 phage holin family protein -
  ACLV7H_RS00340 (SORA22_00660) - 50233..50565 (+) 333 WP_411866054.1 phage holin -
  ACLV7H_RS00345 (SORA22_00670) - 50568..51536 (+) 969 WP_411866055.1 autolysin -
  ACLV7H_RS00350 (SORA22_00680) - 51630..51821 (-) 192 WP_000109864.1 hypothetical protein -
  ACLV7H_RS00355 (SORA22_00690) tadA 52400..52867 (+) 468 WP_411866056.1 tRNA adenosine(34) deaminase TadA -
  ACLV7H_RS00365 (SORA22_00700) - 53054..53497 (+) 444 WP_045593764.1 dUTP diphosphatase -
  ACLV7H_RS00370 (SORA22_00710) - 53448..54014 (+) 567 WP_411866057.1 histidine phosphatase family protein -
  ACLV7H_RS00375 (SORA22_00720) radA 54028..55389 (+) 1362 WP_411867061.1 DNA repair protein RadA Machinery gene
  ACLV7H_RS00380 (SORA22_00730) - 55462..55956 (+) 495 WP_162279191.1 beta-class carbonic anhydrase -
  ACLV7H_RS00385 (SORA22_00740) - 56171..57139 (+) 969 WP_000010174.1 ribose-phosphate diphosphokinase -
  ACLV7H_RS00390 (SORA22_00750) abpA 57479..58111 (+) 633 WP_411866058.1 amylase-binding adhesin AbpA -
  ACLV7H_RS00395 (SORA22_00760) srtB 58197..59030 (+) 834 WP_411866059.1 class B sortase, LPKTxAVK-specific -
  ACLV7H_RS00400 (SORA22_00770) - 59149..59586 (+) 438 WP_000076506.1 CoA-binding protein -
  ACLV7H_RS00405 (SORA22_00780) - 59615..60625 (-) 1011 WP_411866060.1 YeiH family protein -
  ACLV7H_RS00410 (SORA22_00790) - 60777..61946 (+) 1170 WP_045593785.1 pyridoxal phosphate-dependent aminotransferase -
  ACLV7H_RS00415 (SORA22_00800) recO 61943..62713 (+) 771 WP_411866061.1 DNA repair protein RecO -
  ACLV7H_RS00420 (SORA22_00810) plsX 62710..63702 (+) 993 WP_411866062.1 phosphate acyltransferase PlsX -
  ACLV7H_RS00425 (SORA22_00820) - 63706..63942 (+) 237 WP_411866063.1 acyl carrier protein -
  ACLV7H_RS00430 (SORA22_00830) - 64262..64456 (+) 195 WP_411866064.1 class IIb bacteriocin, lactobin A/cerein 7B family -
  ACLV7H_RS00435 (SORA22_00850) - 64687..64860 (+) 174 WP_001056923.1 hypothetical protein -
  ACLV7H_RS00440 - 64861..65088 (+) 228 WP_080933732.1 bacteriocin immunity protein -
  ACLV7H_RS00445 (SORA22_00860) - 65160..65348 (+) 189 WP_411866065.1 hypothetical protein -
  ACLV7H_RS00450 (SORA22_00880) comA 65712..67865 (+) 2154 WP_411866066.1 peptide cleavage/export ABC transporter ComA Regulator
  ACLV7H_RS00455 (SORA22_00890) comB 67878..69227 (+) 1350 WP_411866067.1 competence pheromone export protein ComB Regulator
  ACLV7H_RS00460 (SORA22_00900) purC 69395..70102 (+) 708 WP_045593802.1 phosphoribosylaminoimidazolesuccinocarboxamide synthase -
  ACLV7H_RS00465 (SORA22_00910) - 70305..74030 (+) 3726 WP_411866068.1 phosphoribosylformylglycinamidine synthase -
  ACLV7H_RS00470 (SORA22_00920) purF 74192..75634 (+) 1443 WP_411866069.1 amidophosphoribosyltransferase -
  ACLV7H_RS00475 (SORA22_00930) purM 75849..76871 (+) 1023 WP_411866070.1 phosphoribosylformylglycinamidine cyclo-ligase -
  ACLV7H_RS00480 (SORA22_00940) purN 76868..77419 (+) 552 WP_411866071.1 phosphoribosylglycinamide formyltransferase -
  ACLV7H_RS00485 (SORA22_00950) - 77474..78181 (+) 708 WP_411866072.1 suppressor of fused domain protein -
  ACLV7H_RS00490 (SORA22_00960) purH 78223..79770 (+) 1548 WP_411866073.1 bifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase -

Sequence


Protein


Download         Length: 717 a.a.        Molecular weight: 80398.65 Da        Isoelectric Point: 7.0210

>NTDB_id=1169924 ACLV7H_RS00450 WP_411866066.1 65712..67865(+) (comA) [Streptococcus oralis isolate S. oralis A22]
MKFRKRHYRPQVDQMDCGVASLAMVFGYYGSYYSLAHLRELAKTTMDGTTALGLVKVAEEIGFETRAIKADMTLFDLPDL
TFPFVAHVLKEGKLLHYYVVIGQDKKHIHIADPDPGVKLTNISRERFAQEWTGVSLFMAPSPDYKPHKEKKQGLLSFLPI
LLKQRGLIANIVLATLLVTLINIVGSYYLQSIIDSYVPDQMRSTLGIISMGLVIVYILQQILSYAQEYLLLVLGQRLSID
VILSYIKHVFHLPMSFFATRRTGGIVSRFTDANSIIDALASTILSIFLDVSTILIISLVLFSQNMTLFFISLLALPIYTV
IIFAFMKPFEKMNRDTMEANAVLSSSIIEDIHGIETIKSLTSESSRYQKIDKEFVAYLKKSFTYSRAESQQKALKKVAQL
LLNVAVLWLGAILVMDGKMSLGQLITYNTLLVYFTNPLENIINLQTKLQTAQVANNRLNEVYLVASEFEEKKTVEDLSMM
KGDMTFKQVHYKYGYGRDVLSDINLTIPQGSKMAFVGISGSGKTTLAKMMVNFYDPSQGEISLGGVNLNQIDKKALRQYI
NYLPQQPYVFNGTILENLLLGAKEGTTQEDILRAVELAEIREDIERMPLNYQTELTSDGAGISGGQRQRIALARAFLTDA
PALILDEATSSLDILTEKRIVDNLMALDKTLIFIAHRLTIAERTEKVVVLDQGKIVEEGTHADLLAQDGFYAHLVNS

Nucleotide


Download         Length: 2154 bp        

>NTDB_id=1169924 ACLV7H_RS00450 WP_411866066.1 65712..67865(+) (comA) [Streptococcus oralis isolate S. oralis A22]
ATGAAATTTAGGAAAAGGCACTATCGTCCTCAGGTGGATCAGATGGATTGTGGCGTGGCTTCCTTGGCCATGGTATTTGG
TTACTACGGTAGTTATTACTCCTTGGCTCACTTGCGAGAGTTAGCCAAGACGACTATGGATGGGACGACTGCTTTGGGAC
TTGTAAAGGTGGCAGAGGAGATTGGGTTTGAGACGCGGGCTATCAAGGCGGATATGACGCTCTTTGATCTGCCAGATTTG
ACCTTTCCTTTTGTGGCTCATGTGCTCAAGGAAGGGAAATTGCTCCACTACTATGTGGTGATAGGTCAGGATAAAAAGCA
CATTCATATCGCTGATCCAGATCCTGGTGTCAAGCTGACTAACATTTCCCGTGAGCGATTTGCGCAAGAATGGACAGGGG
TCAGTCTCTTTATGGCTCCGTCTCCAGACTATAAACCCCATAAGGAGAAAAAACAAGGACTCTTATCATTTTTACCTATC
TTGCTCAAACAGCGTGGCTTGATTGCTAATATCGTTTTGGCAACACTCTTGGTAACCCTGATCAATATCGTAGGCTCCTA
CTATCTTCAGTCCATCATTGACAGTTACGTACCAGATCAGATGCGCTCGACACTGGGTATTATCTCAATGGGGCTGGTCA
TTGTCTATATCCTCCAGCAAATTTTGTCTTATGCGCAGGAATACCTCTTACTTGTTCTGGGTCAACGGTTGTCCATCGAT
GTGATTTTGTCCTACATCAAGCATGTTTTTCACCTGCCGATGTCCTTTTTCGCGACACGCAGGACAGGGGGAATCGTGTC
TCGTTTTACAGATGCCAATAGTATCATCGATGCGTTGGCGTCGACCATTCTGTCGATTTTCCTAGATGTGTCGACGATTT
TGATTATCTCGCTTGTCCTGTTTTCACAAAATATGACACTCTTTTTCATTAGTCTGCTTGCGCTTCCCATCTATACAGTG
ATTATTTTTGCCTTTATGAAGCCTTTTGAAAAGATGAATCGGGATACCATGGAAGCCAATGCGGTTCTGTCTTCTTCCAT
CATCGAGGACATCCACGGTATTGAGACCATTAAGTCTTTGACCAGTGAAAGTTCACGCTATCAAAAGATTGACAAGGAAT
TTGTGGCTTATCTGAAAAAATCCTTTACCTATAGTCGGGCAGAAAGCCAGCAAAAGGCACTGAAAAAAGTTGCCCAGCTC
CTGCTCAATGTTGCCGTTCTCTGGCTAGGAGCCATTCTTGTTATGGATGGGAAAATGAGTTTGGGGCAGTTGATTACCTA
TAATACCTTGCTTGTTTACTTTACCAATCCTTTGGAAAATATCATCAACCTGCAAACCAAACTTCAGACAGCGCAGGTTG
CCAATAATCGCTTAAATGAGGTTTATCTGGTCGCTTCGGAGTTTGAGGAGAAGAAAACAGTCGAAGATTTGAGCATGATG
AAGGGAGATATGACCTTTAAGCAGGTTCACTATAAGTATGGCTATGGTCGTGACGTCTTGTCAGATATCAATTTGACCAT
TCCGCAAGGGTCCAAGATGGCTTTCGTGGGGATTTCTGGGTCAGGTAAGACTACCTTGGCAAAGATGATGGTTAATTTCT
ACGACCCTAGTCAGGGAGAGATTAGTCTGGGTGGGGTCAATCTCAATCAGATTGACAAAAAGGCCCTGCGCCAGTATATC
AACTACCTGCCTCAACAGCCCTATGTCTTTAACGGAACGATTTTGGAGAATCTTCTCCTTGGAGCCAAGGAGGGGACGAC
TCAGGAAGACATCTTACGGGCGGTTGAGTTGGCCGAGATTCGGGAGGATATTGAGCGCATGCCCCTGAATTACCAGACAG
AATTAACTTCGGATGGGGCAGGGATTTCAGGTGGACAACGTCAGAGAATTGCTCTGGCGCGTGCTTTCTTGACAGATGCG
CCTGCCTTGATCTTGGACGAGGCGACCAGCAGTCTGGACATTTTGACAGAGAAGCGGATTGTCGATAATCTCATGGCTTT
GGACAAGACCTTGATTTTCATTGCCCACCGTTTGACCATTGCTGAGCGGACAGAGAAGGTTGTTGTCTTGGATCAGGGCA
AGATTGTCGAAGAAGGCACCCATGCAGACTTACTTGCGCAGGATGGCTTTTACGCCCATTTGGTCAATAGCTAG


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comA Streptococcus mitis NCTC 12261

94.561

100

0.946

  comA Streptococcus pneumoniae Rx1

94.282

100

0.943

  comA Streptococcus pneumoniae D39

94.282

100

0.943

  comA Streptococcus pneumoniae R6

94.282

100

0.943

  comA Streptococcus mitis SK321

94.142

100

0.941

  comA Streptococcus pneumoniae TIGR4

93.863

100

0.939

  comA Streptococcus gordonii str. Challis substr. CH1

79.777

100

0.798

  comA/nlmT Streptococcus mutans UA159

63.877

100

0.639


Multiple sequence alignment