Detailed information    

In silico (bioinformatically predicted)

Overview


Name: comGA
Type: Machinery gene
Locus tag: C2I17_RS21145
Genome accession: NZ_CP026040
Coordinates: 4445800..4446591 (+)
Length: 263 a.a.
NCBI ID: WP_338137882.1
UniProt ID: -
Organism: Niallia circulans strain PK3_15
Function: dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)
DNA binding and uptake

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) predicted in the genome by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type   MGE coordinates    Gene coordinates   Relative position   Distance (bp)
Prophage   4444225..4501169   4445800..4446591   within              0
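
The "Relative position" and "Distance (bp)" values follow from comparing the two coordinate ranges. The sketch below shows one way to derive them, assuming 1-based inclusive coordinates. The helper name `relate_gene_to_mge` and every label other than "within" are hypothetical and are not taken from VRprofile2 or the database.

```python
def relate_gene_to_mge(gene, mge):
    """Return (relative_position, distance_bp) for two (start, end) ranges.

    Coordinates are assumed to be 1-based and inclusive. Only the label
    "within" appears in the source table; the other labels are illustrative.
    """
    g_start, g_end = gene
    m_start, m_end = mge
    if m_start <= g_start and g_end <= m_end:
        return "within", 0
    if g_end < m_start:
        # Gene lies entirely at lower coordinates than the MGE.
        return "before", m_start - g_end
    if g_start > m_end:
        # Gene lies entirely at higher coordinates than the MGE.
        return "after", g_start - m_end
    return "overlapping", 0


comGA = (4445800, 4446591)
prophage = (4444225, 4501169)
print(relate_gene_to_mge(comGA, prophage))  # ('within', 0)
```

For the coordinates listed above, the gene range falls entirely inside the prophage range, which matches the "within" and 0 bp entries in the table.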


Gene organization within MGE regions


Location: 4444225..4501169
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  C2I17_RS21135 (C2I17_20965) - 4444225..4445157 (+) 933 WP_249878184.1 cation diffusion facilitator family transporter -
  C2I17_RS21145 (C2I17_20975) comGA 4445800..4446591 (+) 792 WP_338137882.1 ATPase, T2SS/T4P/T4SS family Machinery gene
  C2I17_RS21150 (C2I17_20980) - 4446527..4447981 (-) 1455 WP_249878185.1 recombinase family protein -
  C2I17_RS21155 (C2I17_20985) - 4448003..4449064 (-) 1062 WP_249878186.1 ImmA/IrrE family metallo-endopeptidase -
  C2I17_RS21160 (C2I17_20990) - 4449051..4449410 (-) 360 WP_249878187.1 helix-turn-helix domain-containing protein -
  C2I17_RS21165 (C2I17_20995) - 4449516..4449785 (+) 270 WP_249878188.1 helix-turn-helix transcriptional regulator -
  C2I17_RS21170 - 4449878..4450042 (+) 165 WP_249878189.1 hypothetical protein -
  C2I17_RS21175 (C2I17_21000) - 4449999..4450754 (+) 756 WP_249878190.1 phage regulatory protein/antirepressor Ant -
  C2I17_RS24880 - 4451065..4451190 (+) 126 WP_275068781.1 hypothetical protein -
  C2I17_RS21180 (C2I17_21005) - 4451183..4451488 (+) 306 WP_249878191.1 hypothetical protein -
  C2I17_RS21185 (C2I17_21010) - 4451481..4451690 (+) 210 WP_249878192.1 YqaI family protein -
  C2I17_RS21190 - 4451702..4451854 (+) 153 WP_164463660.1 DUF6906 family protein -
  C2I17_RS21195 - 4451950..4452123 (+) 174 WP_171509798.1 hypothetical protein -
  C2I17_RS21200 (C2I17_21015) - 4452123..4453010 (+) 888 WP_249878193.1 hypothetical protein -
  C2I17_RS21205 (C2I17_21020) - 4452961..4453770 (+) 810 WP_249878194.1 PD-(D/E)XK nuclease-like domain-containing protein -
  C2I17_RS21210 (C2I17_21025) - 4453954..4454778 (+) 825 WP_249879589.1 replication protein -
  C2I17_RS21215 (C2I17_21030) - 4454666..4455532 (+) 867 WP_420916350.1 ATP-binding protein -
  C2I17_RS21220 - 4455525..4455701 (+) 177 WP_249878195.1 hypothetical protein -
  C2I17_RS21225 (C2I17_21035) - 4455691..4455912 (+) 222 WP_249878196.1 hypothetical protein -
  C2I17_RS21230 (C2I17_21040) - 4455956..4456165 (+) 210 WP_249878197.1 hypothetical protein -
  C2I17_RS21235 (C2I17_21045) - 4456162..4456764 (+) 603 WP_249878198.1 hypothetical protein -
  C2I17_RS21240 (C2I17_21050) - 4456734..4456889 (+) 156 WP_249878199.1 Fur-regulated basic protein FbpA -
  C2I17_RS21245 (C2I17_21055) - 4456880..4457323 (+) 444 WP_249878200.1 hypothetical protein -
  C2I17_RS21250 (C2I17_21065) - 4457666..4457968 (+) 303 WP_249878201.1 MazG-like family protein -
  C2I17_RS21255 (C2I17_21070) - 4458288..4458665 (+) 378 WP_249878202.1 hypothetical protein -
  C2I17_RS21260 (C2I17_21075) - 4458762..4459220 (+) 459 WP_249879591.1 ArpU family phage packaging/lysis transcriptional regulator -
  C2I17_RS21265 (C2I17_21080) - 4459247..4459618 (-) 372 WP_249878203.1 GIY-YIG nuclease family protein -
  C2I17_RS21270 (C2I17_21085) - 4460095..4461045 (+) 951 WP_249878204.1 hypothetical protein -
  C2I17_RS21275 (C2I17_21090) - 4461020..4461625 (+) 606 WP_249878205.1 hypothetical protein -
  C2I17_RS21280 - 4461698..4461874 (-) 177 WP_249878206.1 hypothetical protein -
  C2I17_RS21285 (C2I17_21095) - 4462032..4462676 (+) 645 WP_249878207.1 hypothetical protein -
  C2I17_RS21290 (C2I17_21100) tnpA 4462861..4463316 (+) 456 WP_095330205.1 IS200/IS605 family transposase -
  C2I17_RS21295 (C2I17_21105) - 4463701..4463895 (+) 195 WP_249878208.1 hypothetical protein -
  C2I17_RS21300 (C2I17_21110) - 4463901..4464332 (+) 432 WP_249878209.1 hypothetical protein -
  C2I17_RS21305 (C2I17_21115) - 4464361..4464693 (+) 333 WP_249878210.1 hypothetical protein -
  C2I17_RS21310 (C2I17_21120) - 4464693..4465004 (+) 312 WP_249878211.1 HNH endonuclease -
  C2I17_RS21315 (C2I17_21125) - 4465086..4465478 (+) 393 WP_249878212.1 P27 family phage terminase small subunit -
  C2I17_RS21320 (C2I17_21130) - 4465475..4467190 (+) 1716 WP_249878213.1 terminase large subunit -
  C2I17_RS21325 (C2I17_21135) - 4467207..4468379 (+) 1173 WP_249878214.1 phage portal protein -
  C2I17_RS21330 (C2I17_21140) - 4468380..4468949 (+) 570 WP_144545100.1 HK97 family phage prohead protease -
  C2I17_RS21335 (C2I17_21145) - 4468942..4470147 (+) 1206 WP_249878215.1 phage major capsid protein -
  C2I17_RS21340 - 4470170..4470346 (+) 177 WP_249878216.1 hypothetical protein -
  C2I17_RS21345 (C2I17_21150) - 4470339..4470650 (+) 312 WP_249878217.1 hypothetical protein -
  C2I17_RS21350 (C2I17_21155) - 4470619..4470984 (+) 366 WP_249878218.1 hypothetical protein -
  C2I17_RS21355 (C2I17_21160) - 4470977..4471336 (+) 360 WP_420916351.1 HK97 gp10 family phage protein -
  C2I17_RS21360 (C2I17_21165) - 4471333..4471650 (+) 318 WP_249878219.1 hypothetical protein -
  C2I17_RS21365 (C2I17_21170) - 4471652..4472290 (+) 639 WP_249878220.1 major tail protein -
  C2I17_RS21370 (C2I17_21175) - 4472290..4472634 (+) 345 WP_249878221.1 hypothetical protein -
  C2I17_RS21375 (C2I17_21185) - 4472872..4475694 (+) 2823 WP_249878222.1 phage tail protein -
  C2I17_RS21380 (C2I17_21190) - 4475684..4476484 (+) 801 WP_249878223.1 distal tail protein Dit -
  C2I17_RS21385 (C2I17_21195) - 4476496..4478190 (+) 1695 WP_249878224.1 SGNH/GDSL hydrolase family protein -
  C2I17_RS21390 (C2I17_21200) - 4478248..4480119 (+) 1872 WP_249878225.1 phage tail spike protein -
  C2I17_RS21395 (C2I17_21205) - 4480119..4480385 (+) 267 WP_249878226.1 hypothetical protein -
  C2I17_RS21400 (C2I17_21210) - 4480478..4480792 (+) 315 WP_249878227.1 hypothetical protein -
  C2I17_RS21405 (C2I17_21215) - 4480805..4481098 (+) 294 WP_249878228.1 holin -
  C2I17_RS21410 (C2I17_21220) - 4481114..4481956 (+) 843 WP_249878229.1 glycoside hydrolase family protein -
  C2I17_RS21415 (C2I17_21225) - 4482069..4482563 (-) 495 WP_249878230.1 hypothetical protein -
  C2I17_RS24885 - 4482753..4482875 (+) 123 WP_275068782.1 hypothetical protein -
  C2I17_RS21420 (C2I17_21230) - 4482863..4483225 (-) 363 WP_249878231.1 hypothetical protein -
  C2I17_RS21425 (C2I17_21235) - 4483240..4483569 (-) 330 WP_249878232.1 YolD-like family protein -
  C2I17_RS21430 (C2I17_21240) - 4483747..4483998 (+) 252 WP_249878233.1 hypothetical protein -
  C2I17_RS21435 (C2I17_21245) - 4484087..4485085 (-) 999 WP_249878234.1 hypothetical protein -
  C2I17_RS21440 (C2I17_21250) - 4485188..4485508 (-) 321 WP_249878235.1 DUF771 domain-containing protein -
  C2I17_RS21445 (C2I17_21255) - 4485533..4485742 (-) 210 WP_249878236.1 helix-turn-helix domain-containing protein -
  C2I17_RS21450 (C2I17_21260) - 4485900..4486295 (+) 396 WP_249878237.1 ATPase, T2SS/T4P/T4SS family -
  C2I17_RS21455 (C2I17_21265) comGB 4486285..4487319 (+) 1035 WP_249878238.1 competence type IV pilus assembly protein ComGB -
  C2I17_RS21460 (C2I17_21270) comGC 4487332..4487646 (+) 315 WP_095332771.1 competence type IV pilus major pilin ComGC -
  C2I17_RS21465 (C2I17_21275) comGD 4487643..4488080 (+) 438 WP_249878239.1 competence type IV pilus minor pilin ComGD -
  C2I17_RS21470 (C2I17_21280) - 4488064..4488387 (+) 324 WP_163185985.1 hypothetical protein -
  C2I17_RS21475 (C2I17_21285) comGF 4488384..4488827 (+) 444 WP_235973249.1 competence type IV pilus minor pilin ComGF -
  C2I17_RS21480 (C2I17_21290) comGG 4488834..4489214 (+) 381 WP_249878240.1 competence type IV pilus minor pilin ComGG -
  C2I17_RS21485 (C2I17_21295) - 4489269..4489457 (+) 189 WP_249878241.1 YqzE family protein -
  C2I17_RS21490 (C2I17_21300) - 4489522..4490328 (-) 807 WP_095332781.1 YqhG family protein -
  C2I17_RS21495 (C2I17_21305) - 4490297..4492009 (-) 1713 WP_095332783.1 DEAD/DEAH box helicase -
  C2I17_RS21500 (C2I17_21310) gcvT 4492589..4493689 (+) 1101 WP_249878242.1 glycine cleavage system aminomethyltransferase GcvT -
  C2I17_RS21505 (C2I17_21315) gcvPA 4493747..4495093 (+) 1347 WP_163185994.1 aminomethyl-transferring glycine dehydrogenase subunit GcvPA -
  C2I17_RS21510 (C2I17_21320) gcvPB 4495086..4496543 (+) 1458 WP_095332789.1 aminomethyl-transferring glycine dehydrogenase subunit GcvPB -
  C2I17_RS21515 (C2I17_21325) - 4496640..4497017 (-) 378 WP_371000273.1 rhodanese-like domain-containing protein -
  C2I17_RS21520 (C2I17_21330) - 4497226..4498062 (+) 837 WP_095332791.1 lipoate--protein ligase family protein -
  C2I17_RS21525 (C2I17_21340) - 4498617..4501169 (+) 2553 WP_249878243.1 vitamin B12-dependent ribonucleotide reductase -

Sequence


Protein


Length: 263 a.a.; Molecular weight: 29604.39 Da; Isoelectric point: 9.0233

>NTDB_id=265947 C2I17_RS21145 WP_338137882.1 4445800..4446591(+) (comGA) [Niallia circulans strain PK3_15]
MLSIEALANRIICEAVEQNATDIHITPKQKESVLQFRIANHLINHLVISANECDKLISHFKFTASMDIGERRKPQNGSIS
TTVKGQYIGLRLSTLPSHPHESLVIRILSDQNMLPIYHISLFPNISRKLISLLKHAHGLIILTGPTGSGKTTTLYSLLNE
NAHLYQRNVISLEDPIEKTQENVLQIQVNEKAGITYSTGLKAILRHDPDIIMVGEIRDKETAHIAIRASLTGHLVLIIDT
CRVIRRLYFQLQQTQVLVVAQFY
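
The FASTA header of this record packs several fields into one line. The sketch below splits it into named fields with a regular expression. The pattern is inferred from this single header and is an assumption; other records in the database may be laid out differently, and `parse_header` is a hypothetical helper name.

```python
import re

HEADER = (">NTDB_id=265947 C2I17_RS21145 WP_338137882.1 "
          "4445800..4446591(+) (comGA) [Niallia circulans strain PK3_15]")

# Field layout inferred from the one header shown above (an assumption).
HEADER_RE = re.compile(
    r"^>NTDB_id=(?P<ntdb_id>\d+)\s+"
    r"(?P<locus_tag>\S+)\s+"
    r"(?P<protein_id>\S+)\s+"
    r"(?P<start>\d+)\.\.(?P<end>\d+)\((?P<strand>[+-])\)\s+"
    r"\((?P<gene>[^)]*)\)\s+"
    r"\[(?P<organism>[^\]]*)\]$"
)


def parse_header(line):
    """Split a header line into a dict; coordinates are converted to int."""
    match = HEADER_RE.match(line.strip())
    if match is None:
        raise ValueError(f"unrecognized header: {line!r}")
    fields = match.groupdict()
    fields["start"] = int(fields["start"])
    fields["end"] = int(fields["end"])
    return fields


record = parse_header(HEADER)
print(record["gene"], record["start"], record["end"], record["strand"])
```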

Nucleotide


Length: 792 bp

>NTDB_id=265947 C2I17_RS21145 WP_338137882.1 4445800..4446591(+) (comGA) [Niallia circulans strain PK3_15]
GTGTTATCAATTGAAGCTCTAGCAAATCGTATTATTTGTGAGGCAGTAGAACAGAATGCTACTGATATTCACATAACCCC
AAAACAAAAGGAATCTGTCCTCCAATTCCGAATTGCCAATCATTTAATTAATCACTTAGTAATTTCAGCAAATGAATGCG
ACAAGTTGATCTCTCATTTCAAATTTACAGCATCTATGGATATTGGCGAAAGAAGAAAACCGCAAAATGGATCTATTTCC
ACAACAGTCAAAGGTCAGTATATTGGCTTACGACTTTCAACCTTACCATCTCACCCTCATGAGAGTTTAGTAATTCGTAT
ACTTTCCGACCAAAATATGCTTCCAATCTATCACATCTCTTTATTTCCGAATATTTCTCGTAAATTAATCTCCTTGTTAA
AGCATGCACACGGATTAATAATTCTGACCGGTCCAACTGGGTCTGGCAAGACAACAACACTCTACTCTCTCCTAAATGAA
AATGCACATCTTTATCAGCGAAATGTGATTAGCTTAGAAGATCCCATTGAAAAAACGCAGGAAAACGTTTTGCAAATTCA
AGTCAATGAGAAAGCTGGTATTACGTATTCCACTGGCTTAAAAGCAATTCTTCGCCATGATCCAGATATTATTATGGTTG
GAGAAATCCGCGACAAAGAAACTGCTCATATTGCGATAAGGGCGAGCTTAACAGGTCATTTAGTATTGATAATAGACACA
TGCAGGGTTATAAGAAGGCTATACTTTCAATTACAACAGACTCAGGTGTTAGTCGTTGCTCAGTTTTATTAA
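
The reported lengths are consistent with each other: the inclusive coordinate range spans 792 bp, which is 264 codons, or 263 residues once the stop codon is excluded. The sketch below checks this arithmetic and translates the first 30 and last 15 nucleotides of the record with the standard codon table. Reading the GTG start codon as methionine is an assumption based on usual bacterial initiation; `translate` is a hypothetical helper, not the database's own tooling.

```python
from itertools import product

# Standard genetic code with bases ordered T, C, A, G at each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa
               for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}


def translate(dna, is_cds_start=False):
    """Translate DNA in frame 1; '*' marks a stop codon.

    If is_cds_start is True, an initiator codon (ATG, GTG or TTG) in the
    first position is read as M (assumed bacterial initiation).
    """
    protein = [CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna) - 2, 3)]
    if is_cds_start and dna[:3] in {"ATG", "GTG", "TTG"}:
        protein[0] = "M"
    return "".join(protein)


start, end = 4445800, 4446591
gene_length = end - start + 1      # inclusive coordinates
codons = gene_length // 3          # count includes the stop codon
print(gene_length, codons - 1)     # 792 263

five_prime = "GTGTTATCAATTGAAGCTCTAGCAAATCGT"  # first 30 nt of the record
three_prime = "GCTCAGTTTTATTAA"                # last 15 nt of the record
print(translate(five_prime, is_cds_start=True))  # MLSIEALANR
print(translate(three_prime))                    # AQFY*
```

Both translated ends agree with the protein record above, which begins with MLSIEALANR and ends with AQFY.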


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure



Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.





Similar proteins


Only experimentally validated proteins are listed.

Protein   Organism                                     Identities (%)   Coverage (%)   Ha-value
comGA     Bacillus subtilis subsp. subtilis str. 168   51.292           100            0.529
pilB      Glaesserella parasuis strain SC1401          39.147           98.099         0.384
comYA     Streptococcus mutans UA140                   42.373           89.734         0.38
comYA     Streptococcus mutans UA159                   42.373           89.734         0.38
pilB      Haemophilus influenzae Rd KW20               36.33            100            0.369
comGA     Staphylococcus aureus MW2                    40.254           89.734         0.361
comGA     Staphylococcus aureus N315                   40.254           89.734         0.361


Multiple sequence alignment