Detailed information    

insolico Bioinformatically predicted

Overview


Name   comGA   Type   Machinery gene
Locus tag   CG481_RS16195 Genome accession   NZ_CP024104
Coordinates   3079597..3080181 (-) Length   194 a.a.
NCBI ID   WP_012095387.1    Uniprot ID   -
Organism   Bacillus cytotoxicus strain CH_23     
Function   dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)   
DNA binding and uptake

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type MGE coordinates Gene coordinates Relative position Distance (bp)
Prophage 3076700..3121649 3079597..3080181 within 0


Gene organization within MGE regions


Location: 3076700..3121649
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  CG481_RS16170 (CG481_016100) comGG 3076700..3077074 (-) 375 WP_048723614.1 competence type IV pilus minor pilin ComGG -
  CG481_RS16175 (CG481_016105) comGF 3077071..3077541 (-) 471 WP_041810544.1 competence type IV pilus minor pilin ComGF -
  CG481_RS23860 - 3077520..3077837 (-) 318 WP_235436438.1 type II secretion system protein -
  CG481_RS16180 (CG481_016110) comGD 3077815..3078261 (-) 447 Protein_3109 competence type IV pilus minor pilin ComGD -
  CG481_RS16185 (CG481_016115) comGC 3078258..3078557 (-) 300 WP_012095385.1 competence type IV pilus major pilin ComGC -
  CG481_RS16190 (CG481_016120) comGB 3078570..3079610 (-) 1041 WP_041810011.1 competence type IV pilus assembly protein ComGB -
  CG481_RS16195 (CG481_016125) comGA 3079597..3080181 (-) 585 WP_012095387.1 ATPase, T2SS/T4P/T4SS family Machinery gene
  CG481_RS16200 (CG481_016130) - 3080075..3080671 (-) 597 WP_338142142.1 DUF1071 domain-containing protein -
  CG481_RS16205 (CG481_016135) - 3080668..3081150 (-) 483 WP_012095420.1 siphovirus Gp157 family protein -
  CG481_RS16210 (CG481_016140) - 3081214..3081582 (-) 369 WP_012095421.1 helix-turn-helix domain-containing protein -
  CG481_RS16215 (CG481_016145) - 3081605..3081871 (-) 267 WP_012095422.1 hypothetical protein -
  CG481_RS22885 - 3082019..3082162 (-) 144 WP_157671227.1 hypothetical protein -
  CG481_RS16220 (CG481_016150) - 3082191..3082385 (-) 195 WP_048723899.1 hypothetical protein -
  CG481_RS23865 (CG481_016155) - 3082414..3083190 (-) 777 WP_048723896.1 ORF6C domain-containing protein -
  CG481_RS16230 (CG481_016160) - 3083187..3083375 (-) 189 WP_012095426.1 helix-turn-helix transcriptional regulator -
  CG481_RS16235 (CG481_016165) - 3083648..3084070 (+) 423 WP_048723893.1 helix-turn-helix domain-containing protein -
  CG481_RS16240 (CG481_016170) - 3084085..3084510 (+) 426 WP_012095428.1 ImmA/IrrE family metallo-endopeptidase -
  CG481_RS16245 (CG481_016175) - 3084524..3085948 (+) 1425 WP_012095429.1 recombinase family protein -
  CG481_RS23245 - 3086044..3086208 (+) 165 WP_164468640.1 hypothetical protein -
  CG481_RS16250 (CG481_016180) - 3086325..3086897 (+) 573 WP_012095388.1 DUF4352 domain-containing protein -
  CG481_RS16255 (CG481_016185) - 3087208..3087648 (+) 441 WP_012095389.1 hypothetical protein -
  CG481_RS16260 (CG481_016190) - 3087723..3088544 (-) 822 WP_012095390.1 GH25 family lysozyme -
  CG481_RS16265 (CG481_016195) - 3088544..3088984 (-) 441 WP_012095391.1 phage holin family protein -
  CG481_RS16270 (CG481_016200) - 3089019..3093512 (-) 4494 WP_094398326.1 phage tail spike protein -
  CG481_RS16275 (CG481_016205) - 3093513..3095006 (-) 1494 WP_048723927.1 distal tail protein Dit -
  CG481_RS16280 (CG481_016210) - 3095019..3097946 (-) 2928 WP_012095394.1 phage tail tape measure protein -
  CG481_RS16285 (CG481_016215) - 3097952..3098149 (-) 198 WP_235436463.1 hypothetical protein -
  CG481_RS16290 (CG481_016220) - 3098254..3098691 (-) 438 WP_012095396.1 hypothetical protein -
  CG481_RS16295 (CG481_016225) - 3098737..3099369 (-) 633 WP_048723925.1 hypothetical protein -
  CG481_RS16300 (CG481_016230) - 3099387..3099767 (-) 381 WP_048723923.1 hypothetical protein -
  CG481_RS16305 (CG481_016235) - 3099764..3100180 (-) 417 WP_012095399.1 hypothetical protein -
  CG481_RS16310 (CG481_016240) - 3100164..3100523 (-) 360 WP_012095400.1 hypothetical protein -
  CG481_RS16315 (CG481_016245) - 3100505..3100843 (-) 339 WP_012095401.1 head-tail connector protein -
  CG481_RS16320 (CG481_016250) - 3100880..3101722 (-) 843 WP_012095402.1 hypothetical protein -
  CG481_RS16325 (CG481_016255) - 3101736..3102326 (-) 591 WP_012095403.1 DUF4355 domain-containing protein -
  CG481_RS16330 (CG481_016260) - 3102396..3103421 (-) 1026 WP_012095404.1 minor capsid protein -
  CG481_RS16335 (CG481_016265) - 3103408..3104763 (-) 1356 WP_012095405.1 phage portal protein -
  CG481_RS16340 (CG481_016270) - 3104775..3106079 (-) 1305 WP_012095406.1 PBSX family phage terminase large subunit -
  CG481_RS16345 (CG481_016275) - 3106066..3106497 (-) 432 WP_048723920.1 terminase small subunit -
  CG481_RS16350 (CG481_016280) - 3107261..3107506 (-) 246 WP_048723940.1 helix-turn-helix domain-containing protein -
  CG481_RS23250 - 3107503..3107670 (-) 168 WP_164468646.1 hypothetical protein -
  CG481_RS22890 - 3107740..3107907 (-) 168 WP_157671230.1 hypothetical protein -
  CG481_RS16355 (CG481_016285) - 3107910..3108290 (-) 381 WP_048723918.1 ArpU family transcriptional regulator -
  CG481_RS16360 (CG481_016290) - 3108295..3108582 (-) 288 WP_048723916.1 hypothetical protein -
  CG481_RS23255 - 3108625..3108792 (-) 168 WP_164468645.1 hypothetical protein -
  CG481_RS22895 - 3108806..3108946 (-) 141 WP_157671229.1 hypothetical protein -
  CG481_RS16365 (CG481_016295) - 3108962..3109213 (-) 252 WP_048723938.1 helix-turn-helix domain-containing protein -
  CG481_RS22900 - 3109300..3109446 (-) 147 WP_157671228.1 hypothetical protein -
  CG481_RS16370 (CG481_016300) - 3109465..3109749 (-) 285 WP_012095411.1 hypothetical protein -
  CG481_RS16375 (CG481_016305) - 3109850..3110059 (+) 210 WP_048723913.1 hypothetical protein -
  CG481_RS16380 (CG481_016310) - 3110120..3110551 (-) 432 WP_048723912.1 hypothetical protein -
  CG481_RS16390 (CG481_016320) - 3110806..3111339 (-) 534 WP_048723909.1 hypothetical protein -
  CG481_RS16395 (CG481_016325) - 3111382..3111588 (-) 207 WP_048723907.1 hypothetical protein -
  CG481_RS16400 (CG481_016330) - 3111667..3111867 (-) 201 WP_048723905.1 hypothetical protein -
  CG481_RS16405 (CG481_016335) - 3111904..3112476 (-) 573 WP_048723936.1 dUTP diphosphatase -
  CG481_RS16410 (CG481_016340) - 3112630..3113421 (-) 792 WP_094398390.1 hypothetical protein -
  CG481_RS16415 (CG481_016345) - 3113423..3113605 (-) 183 WP_162533109.1 hypothetical protein -
  CG481_RS16420 (CG481_016350) - 3113574..3113801 (-) 228 WP_048723903.1 hypothetical protein -
  CG481_RS16425 (CG481_016355) - 3113812..3114714 (-) 903 WP_313928212.1 ATP-binding protein -
  CG481_RS16430 (CG481_016360) - 3114614..3115417 (-) 804 WP_048723901.1 conserved phage C-terminal domain-containing protein -
  CG481_RS16435 (CG481_016365) - 3115418..3115687 (-) 270 WP_012095418.1 hypothetical protein -
  CG481_RS16440 (CG481_016370) - 3115701..3116372 (-) 672 WP_012095419.1 DUF1071 domain-containing protein -
  CG481_RS16445 (CG481_016375) - 3116369..3116851 (-) 483 WP_012095420.1 siphovirus Gp157 family protein -
  CG481_RS16450 (CG481_016380) - 3116915..3117283 (-) 369 WP_012095421.1 helix-turn-helix domain-containing protein -
  CG481_RS16455 (CG481_016385) - 3117306..3117572 (-) 267 WP_012095422.1 hypothetical protein -
  CG481_RS22905 - 3117720..3117863 (-) 144 WP_157671227.1 hypothetical protein -
  CG481_RS16460 (CG481_016390) - 3117892..3118086 (-) 195 WP_048723899.1 hypothetical protein -
  CG481_RS23870 (CG481_016395) - 3118115..3118891 (-) 777 WP_048723896.1 ORF6C domain-containing protein -
  CG481_RS16470 (CG481_016400) - 3118888..3119076 (-) 189 WP_012095426.1 helix-turn-helix transcriptional regulator -
  CG481_RS16475 (CG481_016405) - 3119349..3119771 (+) 423 WP_048723893.1 helix-turn-helix domain-containing protein -
  CG481_RS16480 (CG481_016410) - 3119786..3120211 (+) 426 WP_012095428.1 ImmA/IrrE family metallo-endopeptidase -
  CG481_RS16485 (CG481_016415) - 3120225..3121649 (+) 1425 WP_012095429.1 recombinase family protein -

Sequence


Protein


Download         Length: 194 a.a.        Molecular weight: 21858.48 Da        Isoelectric Point: 9.0444

>NTDB_id=251975 CG481_RS16195 WP_012095387.1 3079597..3080181(-) (comGA) [Bacillus cytotoxicus strain CH_23]
MYALLEVAKKGQTRRIVTLEDPVEKRKNGLLQIQINEKAGITYETGLKAILRHDPDIILVGEIRDEETAKVAVRASLTGH
LVMTTLHTNDAKGAVLRLMDFGISRQEIEQSLLAVAAQRLVELKCPFCKGKCSPLCKSIRQVRQASIYELLYGHELKKAI
REANGERVVYRYETLQSSLKKGYALGFLEEDVYV

Nucleotide


Download         Length: 585 bp        

>NTDB_id=251975 CG481_RS16195 WP_012095387.1 3079597..3080181(-) (comGA) [Bacillus cytotoxicus strain CH_23]
ATGTATGCGTTATTAGAAGTTGCCAAAAAAGGTCAAACACGCCGGATTGTTACATTGGAAGATCCGGTGGAGAAAAGAAA
GAACGGTTTATTACAAATTCAAATTAATGAAAAAGCAGGGATTACGTATGAGACAGGTTTAAAGGCAATTTTACGGCATG
ACCCAGATATTATTTTAGTGGGGGAAATTCGTGATGAAGAAACAGCAAAAGTGGCTGTACGTGCAAGTTTAACAGGTCAT
TTAGTAATGACAACGCTGCATACAAATGATGCGAAAGGTGCTGTATTGCGCTTGATGGATTTTGGAATTAGCAGACAGGA
AATTGAACAGTCGTTATTAGCTGTAGCCGCTCAGCGCCTTGTAGAGTTGAAATGTCCGTTTTGTAAAGGGAAGTGTTCAC
CATTATGTAAATCGATACGCCAAGTACGGCAAGCAAGCATTTATGAGTTGCTATATGGTCATGAATTGAAGAAAGCCATT
CGCGAAGCGAATGGCGAACGTGTGGTGTATCGCTATGAAACATTACAATCTTCATTAAAGAAAGGCTATGCTTTAGGATT
TTTAGAAGAGGATGTTTATGTTTAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comGA Bacillus subtilis subsp. subtilis str. 168

58.763

100

0.588

  pilB Vibrio campbellii strain DS40M4

36

100

0.371


Multiple sequence alignment