Detailed information    

insolico Bioinformatically predicted

Overview


Name   comGF/cglF   Type   Machinery gene
Locus tag   SUT_RS01105 Genome accession   NZ_AP025331
Coordinates   168353..168787 (+) Length   144 a.a.
NCBI ID   WP_024532251.1    Uniprot ID   -
Organism   Streptococcus ruminantium strain GUT-183     
Function   dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)   
DNA binding and uptake

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type MGE coordinates Gene coordinates Relative position Distance (bp)
Prophage 126732..167447 168353..168787 flank 906


Gene organization within MGE regions


Location: 126732..168787
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  SUT_RS00800 (GUT183_01290) - 126732..128186 (-) 1455 WP_237373309.1 recombinase family protein -
  SUT_RS00805 (GUT183_01300) - 128314..128982 (-) 669 WP_237373311.1 NYN domain-containing protein -
  SUT_RS00810 (GUT183_01310) - 129146..129928 (-) 783 WP_237373313.1 XRE family transcriptional regulator -
  SUT_RS00815 (GUT183_01340) - 130455..130844 (-) 390 WP_237373315.1 DUF2513 domain-containing protein -
  SUT_RS00820 (GUT183_01350) - 130904..131191 (+) 288 WP_237373317.1 ImmA/IrrE family metallo-endopeptidase -
  SUT_RS10220 (GUT183_01360) - 131181..131309 (-) 129 WP_269089070.1 hypothetical protein -
  SUT_RS00825 (GUT183_01370) - 131445..131627 (+) 183 WP_237373318.1 helix-turn-helix domain-containing protein -
  SUT_RS00830 (GUT183_01380) - 131596..131847 (+) 252 WP_419580030.1 helix-turn-helix transcriptional regulator -
  SUT_RS00835 (GUT183_01390) - 131860..132117 (+) 258 WP_237373319.1 hypothetical protein -
  SUT_RS00840 (GUT183_01400) - 132061..132849 (-) 789 WP_170244289.1 TIGR02391 family protein -
  SUT_RS00845 (GUT183_01410) - 132899..133627 (+) 729 WP_237373320.1 phage antirepressor KilAC domain-containing protein -
  SUT_RS00850 (GUT183_01420) - 133702..133878 (+) 177 WP_237373321.1 BOW99_gp33 family protein -
  SUT_RS00855 (GUT183_01430) - 133889..134242 (+) 354 WP_237373322.1 HTH domain-containing protein -
  SUT_RS00860 (GUT183_01440) - 134481..134741 (+) 261 WP_237373323.1 hypothetical protein -
  SUT_RS00865 (GUT183_01450) - 134753..134890 (+) 138 WP_165437825.1 hypothetical protein -
  SUT_RS00870 (GUT183_01460) - 134894..135079 (+) 186 WP_237373717.1 hypothetical protein -
  SUT_RS00875 (GUT183_01470) bet 135076..135846 (+) 771 WP_237373324.1 phage recombination protein Bet -
  SUT_RS00880 (GUT183_01480) - 135856..136884 (+) 1029 WP_237373326.1 DUF1351 domain-containing protein -
  SUT_RS00885 (GUT183_01490) - 136887..137156 (+) 270 WP_237373328.1 hypothetical protein -
  SUT_RS00890 (GUT183_01500) - 137166..137489 (+) 324 WP_237373330.1 hypothetical protein -
  SUT_RS00895 (GUT183_01510) - 137482..137721 (+) 240 WP_237373332.1 sporulation protein Cse60 -
  SUT_RS00900 (GUT183_01520) ssb 137818..138309 (+) 492 WP_237373334.1 single-stranded DNA-binding protein Machinery gene
  SUT_RS00905 (GUT183_01530) - 138484..138816 (+) 333 WP_237373336.1 hypothetical protein -
  SUT_RS00910 (GUT183_01540) - 138813..139328 (+) 516 WP_237373338.1 hypothetical protein -
  SUT_RS00915 - 139489..139722 (+) 234 WP_237373673.1 DUF3310 domain-containing protein -
  SUT_RS00920 (GUT183_01560) - 139788..140138 (+) 351 WP_237373340.1 hypothetical protein -
  SUT_RS00925 (GUT183_01570) - 140131..140631 (+) 501 WP_237373341.1 DUF1642 domain-containing protein -
  SUT_RS00930 (GUT183_01580) - 140628..140873 (+) 246 WP_237373344.1 hypothetical protein -
  SUT_RS00935 (GUT183_01600) - 140998..141405 (+) 408 WP_272877506.1 YopX family protein -
  SUT_RS00940 (GUT183_01610) - 141407..141625 (+) 219 WP_237373347.1 hypothetical protein -
  SUT_RS00945 (GUT183_01620) - 141622..141894 (+) 273 WP_237373349.1 helix-turn-helix domain-containing protein -
  SUT_RS00950 (GUT183_01630) - 141968..142390 (+) 423 WP_237373351.1 ArpU family phage packaging/lysis transcriptional regulator -
  SUT_RS00955 (GUT183_01640) - 142553..143035 (+) 483 WP_237373353.1 terminase -
  SUT_RS00960 (GUT183_01650) - 142995..144332 (+) 1338 WP_237373355.1 PBSX family phage terminase large subunit -
  SUT_RS00965 (GUT183_01660) - 144399..145883 (+) 1485 WP_237373357.1 phage portal protein -
  SUT_RS00970 (GUT183_01670) - 145876..147531 (+) 1656 WP_237373359.1 phage minor capsid protein -
  SUT_RS00975 (GUT183_01680) - 147524..147781 (+) 258 WP_237373361.1 hypothetical protein -
  SUT_RS00980 (GUT183_01690) - 147974..148219 (+) 246 WP_237373363.1 hypothetical protein -
  SUT_RS00985 (GUT183_01700) - 148347..148895 (+) 549 WP_237373365.1 phage scaffolding protein -
  SUT_RS00990 (GUT183_01710) - 148911..149783 (+) 873 WP_155962137.1 capsid protein -
  SUT_RS00995 (GUT183_01720) - 149794..149991 (+) 198 WP_237373367.1 hypothetical protein -
  SUT_RS01000 (GUT183_01730) - 150023..150400 (+) 378 WP_156009326.1 hypothetical protein -
  SUT_RS01005 (GUT183_01740) - 150400..150741 (+) 342 WP_155962131.1 putative minor capsid protein -
  SUT_RS01010 (GUT183_01750) - 150741..151085 (+) 345 WP_155962129.1 minor capsid protein -
  SUT_RS01015 (GUT183_01760) - 151085..151480 (+) 396 WP_155962127.1 minor capsid protein -
  SUT_RS01020 (GUT183_01770) - 151484..151945 (+) 462 WP_155962125.1 phage tail tube protein -
  SUT_RS01025 (GUT183_01780) - 151984..152436 (+) 453 WP_155962123.1 hypothetical protein -
  SUT_RS01030 (GUT183_01790) - 152445..153026 (+) 582 WP_155962121.1 Gp15 family bacteriophage protein -
  SUT_RS01035 (GUT183_01800) - 153016..156291 (+) 3276 WP_237373369.1 tape measure protein -
  SUT_RS01040 (GUT183_01810) - 156291..157787 (+) 1497 WP_237373371.1 distal tail protein Dit -
  SUT_RS01045 (GUT183_01820) - 157788..160388 (+) 2601 WP_237373372.1 hypothetical protein -
  SUT_RS01050 (GUT183_01830) - 160388..162442 (+) 2055 WP_237373373.1 DUF859 family phage minor structural protein -
  SUT_RS01055 (GUT183_01840) - 162456..162860 (+) 405 WP_155962111.1 DUF1366 domain-containing protein -
  SUT_RS10225 (GUT183_01850) - 162880..163011 (+) 132 WP_265575086.1 hypothetical protein -
  SUT_RS01060 (GUT183_01860) - 163015..163308 (+) 294 WP_155962109.1 DUF7365 family protein -
  SUT_RS01065 (GUT183_01870) - 163310..163549 (+) 240 WP_237373374.1 phage holin -
  SUT_RS01070 (GUT183_01880) - 163646..164893 (+) 1248 WP_237373375.1 CHAP domain-containing protein -
  SUT_RS01075 (GUT183_01890) - 165182..166144 (+) 963 WP_237373376.1 Abi family protein -
  SUT_RS01080 (GUT183_01900) - 166346..167119 (-) 774 WP_336512695.1 DNA/RNA non-specific endonuclease -
  SUT_RS01085 (GUT183_01910) prx 167259..167447 (+) 189 WP_237373378.1 Paratox Regulator
  SUT_RS01090 (GUT183_01920) comYC 167468..167701 (+) 234 WP_156009628.1 competence type IV pilus major pilin ComGC Machinery gene
  SUT_RS01095 (GUT183_01930) comGD 167688..168101 (+) 414 WP_024532253.1 competence type IV pilus minor pilin ComGD -
  SUT_RS01100 (GUT183_01940) comYE 168073..168366 (+) 294 WP_155963927.1 competence type IV pilus minor pilin ComGE Machinery gene
  SUT_RS01105 (GUT183_01950) comGF/cglF 168353..168787 (+) 435 WP_024532251.1 competence type IV pilus minor pilin ComGF Machinery gene

Sequence


Protein


Download         Length: 144 a.a.        Molecular weight: 16473.95 Da        Isoelectric Point: 8.8413

>NTDB_id=91118 SUT_RS01105 WP_024532251.1 168353..168787(+) (comGF/cglF) [Streptococcus ruminantium strain GUT-183]
MLKTRAPAFTLLECLVALVVLSGSLLVFEGLSKLISHEVRYQSKVLQKDWLLFSDQLRAEWAQAALVRVENNRIYINKEG
QGLAFGKSRSDDFRKTNEKGQGYQPMLYGLQATEIVQEGRLVRMDFTFTNGEERTFIYAFEKTG

Nucleotide


Download         Length: 435 bp        

>NTDB_id=91118 SUT_RS01105 WP_024532251.1 168353..168787(+) (comGF/cglF) [Streptococcus ruminantium strain GUT-183]
TTGCTAAAAACTAGGGCTCCAGCTTTTACCCTTTTAGAATGTTTAGTGGCTTTGGTCGTCCTGTCGGGGAGTCTCTTAGT
ATTTGAAGGATTAAGCAAATTGATTTCTCATGAAGTTCGTTACCAAAGCAAGGTACTTCAAAAGGATTGGCTCCTTTTCT
CGGATCAGCTGCGTGCAGAATGGGCTCAGGCGGCTTTGGTTAGAGTTGAGAATAACAGAATCTACATCAATAAGGAAGGT
CAAGGTCTTGCCTTTGGAAAATCACGTTCGGATGATTTTCGAAAGACGAATGAGAAGGGGCAAGGATACCAGCCTATGTT
ATACGGTCTTCAAGCGACGGAAATTGTTCAAGAGGGAAGACTAGTTAGAATGGATTTTACCTTTACAAATGGAGAGGAGC
GAACCTTTATCTATGCTTTTGAAAAAACAGGTTAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comGF/cglF Streptococcus mitis NCTC 12261

53.237

96.528

0.514

  comGF/cglF Streptococcus pneumoniae Rx1

52.899

95.833

0.507

  comGF/cglF Streptococcus pneumoniae D39

52.899

95.833

0.507

  comGF/cglF Streptococcus pneumoniae R6

52.899

95.833

0.507

  comGF/cglF Streptococcus pneumoniae TIGR4

52.899

95.833

0.507

  comGF/cglF Streptococcus mitis SK321

51.799

96.528

0.5

  comYF Streptococcus mutans UA159

50.355

97.917

0.493

  comYF Streptococcus mutans UA140

49.645

97.917

0.486

  comGF Lactococcus lactis subsp. cremoris KW2

45.775

98.611

0.451


Multiple sequence alignment