Detailed information    

insolico Bioinformatically predicted

Overview


Name   comA   Type   Regulator
Locus tag   DQM54_RS10450 Genome accession   NZ_LS483375
Coordinates   2106071..2108224 (-) Length   717 a.a.
NCBI ID   WP_111724245.1    Uniprot ID   -
Organism   Streptococcus gordonii strain NCTC3165     
Function   processing and transport of ComC (predicted from homology)   
Competence regulation

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type MGE coordinates Gene coordinates Relative position Distance (bp)
Prophage 2106071..2139803 2106071..2108224 within 0


Gene organization within MGE regions


Location: 2106071..2139803
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  DQM54_RS10450 (NCTC3165_02080) comA 2106071..2108224 (-) 2154 WP_111724245.1 peptide cleavage/export ABC transporter ComA Regulator
  DQM54_RS10455 (NCTC3165_02081) - 2108926..2109135 (-) 210 WP_111724246.1 hypothetical protein -
  DQM54_RS10465 (NCTC3165_02083) - 2109997..2110500 (-) 504 WP_111724247.1 hypothetical protein -
  DQM54_RS10470 (NCTC3165_02084) - 2110573..2111097 (-) 525 WP_111724248.1 hypothetical protein -
  DQM54_RS10475 (NCTC3165_02085) - 2111462..2112901 (-) 1440 WP_111724323.1 phage/plasmid primase, P4 family -
  DQM54_RS10480 (NCTC3165_02086) - 2112981..2113841 (-) 861 WP_111724249.1 primase alpha helix C-terminal domain-containing protein -
  DQM54_RS10485 (NCTC3165_02087) - 2113982..2114258 (-) 277 Protein_2012 XRE family transcriptional regulator -
  DQM54_RS10490 (NCTC3165_02088) - 2114251..2114592 (-) 342 WP_111724250.1 DNA-binding protein -
  DQM54_RS10495 (NCTC3165_02089) - 2114589..2114804 (-) 216 WP_111724251.1 hypothetical protein -
  DQM54_RS10500 (NCTC3165_02090) - 2114806..2115009 (-) 204 WP_111724252.1 hypothetical protein -
  DQM54_RS10505 (NCTC3165_02091) - 2115291..2115917 (-) 627 WP_111724253.1 Rha family transcriptional regulator -
  DQM54_RS10510 (NCTC3165_02092) - 2115932..2116129 (-) 198 WP_111724254.1 helix-turn-helix domain-containing protein -
  DQM54_RS10515 (NCTC3165_02093) - 2116293..2117066 (+) 774 WP_111724255.1 helix-turn-helix domain-containing protein -
  DQM54_RS10520 (NCTC3165_02094) - 2117110..2118099 (+) 990 WP_084849558.1 DUF3644 domain-containing protein -
  DQM54_RS10525 (NCTC3165_02095) - 2118445..2119611 (+) 1167 WP_111724256.1 tyrosine-type recombinase/integrase -
  DQM54_RS10530 (NCTC3165_02096) rpsD 2119700..2120311 (-) 612 WP_008810013.1 30S ribosomal protein S4 -
  DQM54_RS10535 (NCTC3165_02097) - 2120577..2121299 (-) 723 WP_111724257.1 ABC transporter ATP-binding protein -
  DQM54_RS10540 (NCTC3165_02098) - 2121299..2122300 (-) 1002 WP_111724258.1 ABC transporter substrate-binding protein -
  DQM54_RS10545 (NCTC3165_02099) - 2122341..2123093 (-) 753 WP_060554273.1 ABC transporter permease -
  DQM54_RS10550 (NCTC3165_02100) - 2123056..2123346 (-) 291 WP_045504760.1 MTH1187 family thiamine-binding protein -
  DQM54_RS10555 (NCTC3165_02102) - 2123686..2124345 (-) 660 WP_061597425.1 HAD family hydrolase -
  DQM54_RS10560 (NCTC3165_02103) srtB 2124424..2125281 (-) 858 WP_045504765.1 class B sortase, LPKTxAVK-specific -
  DQM54_RS10565 (NCTC3165_02104) abpA 2125369..2125956 (-) 588 WP_045504766.1 amylase-binding adhesin AbpA -
  DQM54_RS10570 (NCTC3165_02105) - 2126173..2127138 (-) 966 WP_111724259.1 ribose-phosphate diphosphokinase -
  DQM54_RS10575 (NCTC3165_02106) pcsB 2127298..2128488 (-) 1191 WP_111724260.1 peptidoglycan hydrolase PcsB -
  DQM54_RS10580 (NCTC3165_02107) mreD 2128588..2129088 (-) 501 WP_111724261.1 rod shape-determining protein MreD -
  DQM54_RS10585 (NCTC3165_02108) mreC 2129090..2129905 (-) 816 WP_111724262.1 rod shape-determining protein MreC -
  DQM54_RS10695 (NCTC3165_02130) comR/comR2 2136810..2137292 (-) 483 WP_045772576.1 sigma-70 family RNA polymerase sigma factor Regulator
  DQM54_RS10710 (NCTC3165_02133) ftsH 2137821..2139803 (-) 1983 WP_046164766.1 ATP-dependent zinc metalloprotease FtsH -

Sequence


Protein


Download         Length: 717 a.a.        Molecular weight: 80754.75 Da        Isoelectric Point: 7.0816

>NTDB_id=1139161 DQM54_RS10450 WP_111724245.1 2106071..2108224(-) (comA) [Streptococcus gordonii strain NCTC3165]
MKFRKRHYRAQVDTRDCGVAALAMVFGYYGSYFSLATLREKAKTTNDGTTALGLVKVAEGLNFETRAFKADMGIFDLEYV
SYPFIVHVLKEGKLLHYYVVTGQDKHTIHIADPDPQVKMTKISRERFEQEWTGITIFLAPSPAYKPSQEKKNGLLDFIPL
LIKQKGLITNIVLATLLVTVINIVGSYYLQSIIDTYVPDHMKTTLGMISIGLIIVYILQQFLSYAQEYLLLVLGQRLSID
VILSYIKHVFQLPMSFFATRRTGEIVSRFTDANRIIDALASTILSIFLDVSIVSIIAIVLFSQNSSLFFLTLLGIPVYAL
IIFLFMKPFEKMNHETMEANSLLSSSIIEDINGIETIKSLTSEKQRYQKIDKEFVTYLKKSFAYGRSESLQKVLKAAARL
ILNVLILWLGATLVMDQKISLGQLITYNTLLVYFTNPLENIINLQTKLQSARVANERLNEVYLVQSEFEEEKLIKDLSHF
QADIDFRGVSYKYGYGANVLSEIDLHIPAGSKTSFVGVSGSGKTTLAKMMVHFYAPNHGDICLGGVNLNQLDKQALRQYI
NYLPQQPYVFNGTILENLLLGAREGTTQEDILRAVELAEIRSDIERMPLNYQTELSADGAGISGGQRQRIALARALLTDA
PVLILDEATSSLDILTEKRIIDNLMALDKTIIFIAHRLTIAERSEQVVVLDQGRIVESGSHKELIEREGFYHHLVNS

Nucleotide


Download         Length: 2154 bp        

>NTDB_id=1139161 DQM54_RS10450 WP_111724245.1 2106071..2108224(-) (comA) [Streptococcus gordonii strain NCTC3165]
ATGAAATTTAGGAAACGGCATTACAGAGCGCAAGTTGATACTAGGGATTGCGGTGTGGCAGCTCTGGCTATGGTTTTTGG
TTATTATGGATCGTATTTTTCCTTAGCTACACTGCGAGAAAAGGCAAAAACTACCAATGATGGCACAACTGCTTTGGGTT
TGGTAAAAGTTGCTGAGGGTCTAAATTTTGAGACAAGGGCTTTTAAAGCTGATATGGGAATCTTTGATCTGGAATATGTA
TCTTATCCTTTCATCGTTCATGTACTTAAGGAAGGGAAGCTCCTGCATTATTATGTTGTGACGGGACAAGATAAACACAC
GATTCATATTGCAGACCCAGACCCTCAAGTTAAGATGACAAAGATTTCGCGAGAGCGTTTTGAACAGGAGTGGACAGGAA
TTACCATTTTCCTAGCTCCTAGTCCAGCCTATAAGCCTAGTCAGGAAAAGAAGAATGGGCTTTTGGACTTCATTCCCTTA
TTGATCAAGCAAAAGGGGTTGATTACTAATATTGTGCTTGCAACTTTGCTGGTAACAGTGATTAATATTGTTGGATCTTA
CTACCTTCAGTCGATTATTGATACTTATGTGCCTGATCATATGAAAACGACGCTGGGTATGATTTCTATCGGCTTGATTA
TTGTCTATATTTTGCAGCAATTCTTGTCTTATGCCCAAGAATATTTGCTTTTGGTTTTAGGGCAGCGCCTATCGATTGAT
GTGATCTTATCCTATATTAAGCATGTTTTTCAGCTACCTATGTCATTTTTTGCGACAAGGCGGACAGGAGAAATCGTCTC
TCGTTTTACAGATGCCAATAGAATTATTGATGCTTTGGCCAGTACAATCTTGTCTATTTTCTTGGATGTATCGATTGTGT
CGATCATTGCGATTGTTCTTTTTTCACAAAATAGCAGTCTTTTCTTTTTGACCTTGTTAGGAATCCCAGTTTACGCTCTC
ATTATTTTTCTTTTTATGAAGCCTTTTGAAAAGATGAATCATGAGACGATGGAAGCAAATAGTTTGTTGTCGTCTTCCAT
CATTGAAGATATCAATGGGATTGAAACCATCAAGTCTTTGACCAGCGAAAAACAACGTTATCAGAAGATTGATAAGGAAT
TTGTGACTTATCTGAAAAAGTCTTTTGCTTACGGGCGTTCTGAGAGCCTACAAAAAGTCTTGAAAGCAGCAGCTCGTTTG
ATTTTAAACGTCTTGATTTTATGGCTAGGAGCTACACTTGTCATGGATCAAAAAATCAGCCTTGGTCAGCTCATTACTTA
TAACACACTTCTAGTCTACTTTACTAATCCTTTAGAAAATATTATCAATCTGCAAACGAAGTTACAGTCTGCACGAGTGG
CCAATGAGCGTCTCAATGAGGTCTATTTAGTCCAGTCTGAATTTGAAGAAGAGAAGCTTATTAAAGACTTGAGCCACTTT
CAGGCTGACATCGATTTTCGTGGAGTCAGCTATAAATATGGCTATGGGGCCAATGTATTATCAGAAATTGACTTGCATAT
TCCTGCTGGAAGTAAGACAAGCTTTGTTGGTGTATCCGGTTCAGGAAAAACAACCTTGGCCAAGATGATGGTGCATTTCT
ACGCTCCGAATCATGGGGATATTTGTTTGGGAGGAGTCAACCTCAATCAGCTGGATAAGCAGGCACTGCGCCAATATATC
AACTATCTTCCCCAGCAACCGTATGTTTTTAATGGAACTATTTTGGAAAATTTACTCTTAGGTGCGCGTGAAGGTACGAC
TCAAGAAGATATTCTGCGGGCGGTAGAATTGGCAGAAATTAGGTCGGATATTGAGCGGATGCCTCTCAATTATCAGACAG
AGCTGTCTGCAGATGGAGCAGGAATCTCGGGTGGTCAACGTCAGCGTATTGCTTTAGCAAGGGCTCTCTTGACAGATGCG
CCAGTGCTTATCTTAGATGAAGCAACGAGTAGTCTGGATATTCTAACAGAGAAGCGGATCATTGATAATCTGATGGCGCT
TGATAAGACGATTATTTTCATCGCGCATCGTTTGACTATTGCGGAACGTTCGGAGCAGGTGGTCGTTTTGGATCAAGGTA
GGATTGTCGAGAGTGGAAGTCATAAGGAGCTGATCGAAAGAGAAGGCTTTTACCACCATTTGGTCAATAGTTAG


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comA Streptococcus gordonii str. Challis substr. CH1

98.187

100

0.982

  comA Streptococcus pneumoniae Rx1

80.893

100

0.809

  comA Streptococcus pneumoniae D39

80.893

100

0.809

  comA Streptococcus pneumoniae R6

80.893

100

0.809

  comA Streptococcus mitis NCTC 12261

80.753

100

0.808

  comA Streptococcus mitis SK321

80.614

100

0.806

  comA Streptococcus pneumoniae TIGR4

80.474

100

0.805

  comA/nlmT Streptococcus mutans UA159

64.714

100

0.647


Multiple sequence alignment