Detailed information    

In silico (bioinformatically predicted)

Overview


Name: comGA
Type: Machinery gene
Locus tag: CA207_RS07215
Genome accession: NZ_CP021058
Coordinates: 1379686..1380642 (-)
Length: 318 a.a.
NCBI ID: WP_086038846.1
UniProt ID: -
Organism: Macrococcoides caseolyticum strain IMD0819
Function: dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)
DNA binding and uptake
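The coordinates and the protein length reported above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the coding sequence ends with a single stop codon; the function names are illustrative and not part of the database.

```python
def gene_length_bp(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates."""
    return end - start + 1


def protein_length_aa(cds_bp: int) -> int:
    """Residues encoded by a CDS that includes one terminal stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1


# comGA coordinates as listed in this record.
bp = gene_length_bp(1379686, 1380642)
aa = protein_length_aa(bp)
print(bp, aa)  # 957 318
```

Both values agree with the lengths given in the Sequence section (957 bp and 318 a.a.).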

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type   MGE coordinates     Gene coordinates    Relative position   Distance (bp)
Prophage   1328128..1379735    1379686..1380642    flank               -49
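The page does not state how the Distance column is computed. One reading that reproduces the listed value of -49 is the gene start minus the MGE end, so that a negative number indicates the gene begins inside the predicted prophage boundary. The sketch below works under that assumption; the function names are hypothetical.

```python
from typing import Tuple

Interval = Tuple[int, int]  # (start, end), 1-based and inclusive


def signed_distance(mge: Interval, gene: Interval) -> int:
    """Gene start minus MGE end (assumed definition; negative means overlap)."""
    return gene[0] - mge[1]


def overlap_bp(a: Interval, b: Interval) -> int:
    """Number of positions shared by two inclusive intervals."""
    return max(0, min(a[1], b[1]) - max(a[0], b[0]) + 1)


prophage = (1328128, 1379735)
comga = (1379686, 1380642)

print(signed_distance(prophage, comga))  # -49
print(overlap_bp(prophage, comga))       # 50
```

Under this reading, comGA shares its first 50 positions (1379686..1379735) with the prophage region, which matches its "flank" classification.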


Gene organization within MGE regions


Location: 1328128..1380642
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  CA207_RS06870 (CA207_14260) - 1328128..1328649 (-) 522 WP_235606408.1 IbrB-like domain-containing protein -
  CA207_RS06875 (CA207_14270) - 1328624..1329907 (-) 1284 WP_086038780.1 DUF3440 domain-containing protein -
  CA207_RS06880 (CA207_14280) - 1329907..1331067 (-) 1161 WP_086038781.1 helicase-related protein -
  CA207_RS06885 (CA207_14290) - 1331127..1331492 (-) 366 WP_086038782.1 hypothetical protein -
  CA207_RS06890 (CA207_14300) - 1331470..1332108 (-) 639 WP_086038783.1 hypothetical protein -
  CA207_RS06895 (CA207_14310) - 1332131..1332526 (-) 396 WP_086038784.1 hypothetical protein -
  CA207_RS06900 (CA207_14320) - 1332656..1332925 (-) 270 WP_086038785.1 hypothetical protein -
  CA207_RS06905 (CA207_14330) - 1333035..1333796 (-) 762 WP_086038786.1 N-acetylmuramoyl-L-alanine amidase -
  CA207_RS06910 (CA207_14340) - 1333798..1334064 (-) 267 WP_086038787.1 phage holin -
  CA207_RS06915 (CA207_14350) - 1334107..1334427 (-) 321 WP_086038788.1 hypothetical protein -
  CA207_RS06920 (CA207_14360) - 1334458..1334745 (-) 288 WP_086038789.1 hypothetical protein -
  CA207_RS12200 (CA207_14370) - 1334791..1334958 (-) 168 WP_162485214.1 hypothetical protein -
  CA207_RS06925 (CA207_14380) - 1334951..1339066 (-) 4116 WP_086038790.1 phage tail spike protein -
  CA207_RS06930 (CA207_14390) - 1339066..1339500 (-) 435 WP_086038791.1 hypothetical protein -
  CA207_RS06935 (CA207_14400) - 1339490..1345354 (-) 5865 WP_086038792.1 LysM peptidoglycan-binding domain-containing protein -
  CA207_RS06940 (CA207_14410) - 1345394..1345777 (-) 384 WP_086038793.1 hypothetical protein -
  CA207_RS06945 (CA207_14420) - 1345732..1346253 (-) 522 WP_086038794.1 hypothetical protein -
  CA207_RS06950 (CA207_14430) - 1346329..1346907 (-) 579 WP_086038795.1 hypothetical protein -
  CA207_RS06955 (CA207_14440) - 1346971..1347351 (-) 381 WP_086038796.1 phage tail terminator protein -
  CA207_RS06960 (CA207_14450) - 1347356..1347838 (-) 483 WP_157821166.1 HK97 gp10 family phage protein -
  CA207_RS12205 (CA207_14460) - 1347828..1348172 (-) 345 WP_157821347.1 hypothetical protein -
  CA207_RS06965 (CA207_14470) - 1348172..1348555 (-) 384 WP_086038798.1 hypothetical protein -
  CA207_RS06970 (CA207_14480) - 1348573..1348848 (-) 276 WP_086038799.1 hypothetical protein -
  CA207_RS06975 (CA207_14490) - 1348895..1349908 (-) 1014 WP_086038800.1 major capsid protein -
  CA207_RS06980 (CA207_14500) - 1349928..1350293 (-) 366 WP_086038801.1 hypothetical protein -
  CA207_RS06985 (CA207_14510) - 1350308..1350904 (-) 597 WP_086038802.1 phage scaffolding protein -
  CA207_RS06990 (CA207_14520) - 1351159..1351416 (-) 258 WP_086038803.1 hypothetical protein -
  CA207_RS06995 (CA207_14530) - 1351431..1351646 (-) 216 WP_086038804.1 hypothetical protein -
  CA207_RS07000 (CA207_14540) - 1351624..1352712 (-) 1089 WP_086038805.1 phage minor capsid protein -
  CA207_RS07005 (CA207_14550) - 1352717..1354336 (-) 1620 WP_086038806.1 hypothetical protein -
  CA207_RS07010 (CA207_14560) - 1354349..1355641 (-) 1293 WP_086038807.1 PBSX family phage terminase large subunit -
  CA207_RS07015 (CA207_14570) - 1355638..1356351 (-) 714 WP_086038808.1 hypothetical protein -
  CA207_RS07020 (CA207_14580) - 1356462..1356890 (-) 429 WP_086038809.1 hypothetical protein -
  CA207_RS07025 (CA207_14590) - 1357165..1357545 (-) 381 WP_235606410.1 hypothetical protein -
  CA207_RS07030 (CA207_14600) - 1357586..1358071 (-) 486 WP_086038811.1 Holliday junction resolvase RecU -
  CA207_RS07035 (CA207_14610) - 1358120..1358647 (-) 528 WP_086038812.1 dUTP diphosphatase -
  CA207_RS12210 (CA207_14620) - 1358651..1358821 (-) 171 WP_162485215.1 hypothetical protein -
  CA207_RS07040 (CA207_14630) - 1358818..1359180 (-) 363 WP_235606412.1 hypothetical protein -
  CA207_RS07045 (CA207_14650) - 1359303..1359542 (-) 240 WP_101141824.1 helix-turn-helix domain-containing protein -
  CA207_RS07050 (CA207_14660) - 1359565..1359792 (-) 228 WP_086038815.1 hypothetical protein -
  CA207_RS07055 (CA207_14670) - 1359814..1360032 (-) 219 WP_086038816.1 hypothetical protein -
  CA207_RS07060 (CA207_14680) - 1360054..1360446 (-) 393 WP_086038817.1 hypothetical protein -
  CA207_RS07065 (CA207_14690) - 1360484..1360864 (-) 381 WP_086038818.1 YopX family protein -
  CA207_RS07070 (CA207_14700) - 1360876..1361145 (-) 270 WP_157820873.1 hypothetical protein -
  CA207_RS07075 (CA207_14710) - 1361142..1362299 (-) 1158 WP_086038820.1 DUF3310 domain-containing protein -
  CA207_RS07080 (CA207_14720) - 1362302..1362529 (-) 228 WP_086038821.1 helix-turn-helix domain-containing protein -
  CA207_RS07085 (CA207_14730) - 1362532..1362924 (-) 393 WP_086038822.1 hypothetical protein -
  CA207_RS07090 (CA207_14740) - 1362937..1363116 (-) 180 WP_086038823.1 hypothetical protein -
  CA207_RS07095 (CA207_14750) - 1363113..1363334 (-) 222 WP_086038824.1 hypothetical protein -
  CA207_RS07100 (CA207_14760) - 1363331..1363609 (-) 279 WP_086038825.1 hypothetical protein -
  CA207_RS07105 (CA207_14770) - 1363636..1364130 (-) 495 WP_086038826.1 single-stranded DNA-binding protein -
  CA207_RS07110 (CA207_14780) - 1364127..1364789 (-) 663 WP_162485216.1 NUMOD4 domain-containing protein -
  CA207_RS07115 (CA207_14790) - 1364779..1364979 (-) 201 WP_086038828.1 hypothetical protein -
  CA207_RS12215 (CA207_14800) - 1364976..1365152 (-) 177 WP_162485217.1 hypothetical protein -
  CA207_RS12220 (CA207_14810) - 1365124..1365300 (-) 177 WP_157820871.1 hypothetical protein -
  CA207_RS12225 (CA207_14820) - 1365297..1365473 (-) 177 WP_157821557.1 hypothetical protein -
  CA207_RS07120 (CA207_14830) - 1365466..1366131 (-) 666 WP_235606414.1 hypothetical protein -
  CA207_RS07125 (CA207_14840) - 1366106..1366891 (-) 786 WP_086038830.1 ATP-binding protein -
  CA207_RS07130 (CA207_14850) - 1366869..1367753 (-) 885 WP_086038831.1 phage replisome organizer N-terminal domain-containing protein -
  CA207_RS07135 (CA207_14860) - 1367767..1368483 (-) 717 WP_086038832.1 MBL fold metallo-hydrolase -
  CA207_RS07140 (CA207_14870) bet 1368483..1369235 (-) 753 WP_086038833.1 phage recombination protein Bet -
  CA207_RS07145 (CA207_14880) - 1369237..1371213 (-) 1977 WP_086038834.1 AAA family ATPase -
  CA207_RS07150 (CA207_14890) - 1371213..1371395 (-) 183 WP_086038835.1 hypothetical protein -
  CA207_RS07155 (CA207_14900) - 1371392..1372153 (-) 762 WP_201260159.1 phage antirepressor KilAC domain-containing protein -
  CA207_RS07160 (CA207_14910) - 1372292..1372528 (+) 237 WP_041635943.1 hypothetical protein -
  CA207_RS07165 (CA207_14930) - 1372622..1372813 (-) 192 WP_086038837.1 hypothetical protein -
  CA207_RS07170 (CA207_14940) - 1372815..1373060 (-) 246 WP_086038838.1 hypothetical protein -
  CA207_RS07175 (CA207_14950) - 1373199..1373702 (-) 504 WP_086038839.1 hypothetical protein -
  CA207_RS07180 (CA207_14960) - 1373818..1374078 (-) 261 WP_162485219.1 helix-turn-helix transcriptional regulator -
  CA207_RS07185 (CA207_14970) - 1374231..1374605 (+) 375 WP_086038841.1 helix-turn-helix domain-containing protein -
  CA207_RS07190 (CA207_14980) - 1374630..1375709 (+) 1080 WP_162485220.1 DUF4352 domain-containing protein -
  CA207_RS07195 (CA207_14990) - 1375736..1376683 (+) 948 WP_235606420.1 thermonuclease family protein -
  CA207_RS07200 (CA207_15000) - 1376802..1377263 (+) 462 WP_086038843.1 ImmA/IrrE family metallo-endopeptidase -
  CA207_RS07205 (CA207_15010) - 1377267..1378481 (+) 1215 WP_086038844.1 tyrosine-type recombinase/integrase -
  CA207_RS12230 - 1378569..1378640 (-) 72 WP_232618450.1 prepilin-type N-terminal cleavage/methylation domain-containing protein -
  CA207_RS07210 (CA207_15020) comGB 1378662..1379735 (-) 1074 WP_086038845.1 competence type IV pilus assembly protein ComGB -
  CA207_RS07215 (CA207_15030) comGA 1379686..1380642 (-) 957 WP_086038846.1 ATPase, T2SS/T4P/T4SS family Machinery gene

Sequence


Protein


Length: 318 a.a.; Molecular weight: 36566.13 Da; Isoelectric point: 6.0518

>NTDB_id=227991 CA207_RS07215 WP_086038846.1 1379686..1380642(-) (comGA) [Macrococcoides caseolyticum strain IMD0819]
MEKLFNEIIEQAILQSASDIHFIPCDKNVSIKFRVQGDIEEYSDIDDILFKKLLSYIKFTAHLDVSEKNKAQSGIIHFNL
DNLRYNIRASTLPRSLGDEACVLRIIRQSFIDEYQTDDQILFDQMKKSSGIIIISGPTGSGKSTLMYQLVHFAKDTLKRQ
VISIEDPVEQHLDGIIQVNVNEKAEITYQTAIKAILRCDPDIIMLGEVAQQVINAGLSGHLVLTTLHANDCIGALFRLKE
MGINAVDLYQSINLIINQRLIKKRDEKERILAYEFLTKKDIEKYLKNKHINYRTLADILKEMYETNQISQHEFEKFDL
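The FASTA header above packs several fields into one line. A minimal parsing sketch follows; the field layout is inferred from this single header only, so the regular expression and the field names are assumptions rather than a documented format.

```python
import re

# Layout assumed from the header in this record:
# >NTDB_id=<id> <locus_tag> <protein_id> <start>..<end>(<strand>) (<gene>) [<organism>]
HEADER_RE = re.compile(
    r"^>NTDB_id=(?P<ntdb_id>\d+)\s+"
    r"(?P<locus_tag>\S+)\s+"
    r"(?P<protein_id>\S+)\s+"
    r"(?P<start>\d+)\.\.(?P<end>\d+)\((?P<strand>[+-])\)\s+"
    r"\((?P<gene>[^)]*)\)\s+"
    r"\[(?P<organism>[^\]]*)\]$"
)


def parse_header(line: str) -> dict:
    """Split a header of the assumed layout into named fields."""
    match = HEADER_RE.match(line.strip())
    if match is None:
        raise ValueError("header does not match the assumed layout")
    record = match.groupdict()
    record["start"] = int(record["start"])
    record["end"] = int(record["end"])
    return record


header = (
    ">NTDB_id=227991 CA207_RS07215 WP_086038846.1 "
    "1379686..1380642(-) (comGA) "
    "[Macrococcoides caseolyticum strain IMD0819]"
)
record = parse_header(header)
print(record["gene"], record["strand"], record["end"] - record["start"] + 1)
```

Headers that deviate from this layout raise a `ValueError` rather than being parsed partially.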

Nucleotide


Length: 957 bp

>NTDB_id=227991 CA207_RS07215 WP_086038846.1 1379686..1380642(-) (comGA) [Macrococcoides caseolyticum strain IMD0819]
ATGGAGAAATTATTCAATGAAATAATAGAGCAGGCAATTTTACAAAGTGCATCAGATATACACTTCATTCCTTGTGATAA
AAATGTATCTATTAAATTTAGGGTACAAGGTGATATCGAAGAATATAGTGACATCGACGATATATTATTTAAAAAATTAC
TTTCATATATTAAATTTACAGCACATCTTGATGTATCAGAAAAGAATAAGGCTCAGAGTGGGATAATACATTTTAATCTG
GATAACTTACGATATAATATTCGCGCATCTACTTTACCTCGTTCATTAGGCGATGAAGCATGTGTATTGAGAATCATCAG
ACAAAGTTTTATAGATGAATATCAGACAGATGATCAGATATTGTTCGATCAGATGAAAAAATCAAGCGGTATAATTATTA
TTAGTGGGCCAACTGGAAGTGGTAAGAGTACATTAATGTATCAACTTGTACATTTTGCAAAGGACACATTGAAACGCCAA
GTAATTTCAATAGAAGATCCTGTGGAGCAGCATCTTGACGGTATCATACAAGTTAATGTTAACGAAAAAGCAGAAATAAC
ATATCAAACCGCAATTAAGGCAATCCTTAGATGTGATCCAGATATCATTATGCTAGGTGAAGTAGCACAGCAAGTTATCA
ATGCGGGGCTGAGTGGTCATTTAGTTTTAACGACATTACATGCAAATGATTGTATTGGCGCATTATTTCGGTTAAAAGAA
ATGGGAATTAATGCTGTTGATCTTTATCAAAGTATCAATTTGATAATCAATCAAAGACTGATTAAAAAAAGAGATGAAAA
GGAGCGTATTCTAGCCTATGAATTTCTGACAAAAAAGGATATTGAAAAATATTTAAAGAATAAGCACATAAACTATAGAA
CGTTGGCTGACATACTAAAGGAGATGTATGAAACAAATCAGATTTCACAACATGAATTTGAAAAATTTGATCTTTAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized with seqviz and ECharts.




Similar proteins


Only experimentally validated proteins are listed.

Protein   Organism                                      Identities (%)   Coverage (%)   Ha-value
comGA     Staphylococcus aureus MW2                     48               100            0.491
comGA     Staphylococcus aureus N315                    48               100            0.491
comGA     Bacillus subtilis subsp. subtilis str. 168    33.908           100            0.371


Multiple sequence alignment