Detailed information    

insolico Bioinformatically predicted

Overview


Name   comYF   Type   Machinery gene
Locus tag   A4H00_RS03435 Genome accession   NZ_CP015196
Coordinates   677337..677774 (-) Length   145 a.a.
NCBI ID   WP_067087309.1    Uniprot ID   -
Organism   Streptococcus marmotae strain HTS5     
Function   dsDNA binding to the cell surface; assembly of the pseudopilus (predicted from homology)   
DNA binding and uptake

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type MGE coordinates Gene coordinates Relative position Distance (bp)
Prophage 630972..684155 677337..677774 within 0


Gene organization within MGE regions


Location: 630972..684155
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  A4H00_RS03205 - 630972..631925 (+) 954 WP_067086998.1 IS30 family transposase -
  A4H00_RS03210 - 631940..632239 (-) 300 WP_157770977.1 hypothetical protein -
  A4H00_RS03215 - 632525..633385 (-) 861 WP_067087217.1 helix-turn-helix domain-containing protein -
  A4H00_RS03220 leuS 633481..635982 (-) 2502 WP_067087220.1 leucine--tRNA ligase -
  A4H00_RS03225 - 636228..636572 (+) 345 WP_067087222.1 hypothetical protein -
  A4H00_RS03230 - 636648..638540 (-) 1893 WP_067087225.1 M13-type metalloendopeptidase -
  A4H00_RS03235 - 638751..639476 (+) 726 WP_067091429.1 metal ABC transporter ATP-binding protein -
  A4H00_RS03240 - 639473..640324 (+) 852 WP_067087228.1 metal ABC transporter permease -
  A4H00_RS03245 - 640335..641264 (+) 930 WP_067087231.1 metal ABC transporter substrate-binding protein -
  A4H00_RS03250 - 641633..642280 (+) 648 WP_067087233.1 metal-dependent transcriptional regulator -
  A4H00_RS03255 - 642335..642723 (+) 389 Protein_632 transposase -
  A4H00_RS03260 - 642887..644110 (-) 1224 WP_067087237.1 MFS transporter -
  A4H00_RS03265 - 644079..646055 (-) 1977 WP_067087239.1 kinase -
  A4H00_RS11810 - 646058..646225 (-) 168 WP_157770978.1 hypothetical protein -
  A4H00_RS03270 - 646558..647424 (-) 867 WP_067087241.1 Rgg/GadR/MutR family transcriptional regulator -
  A4H00_RS03275 dtd 647940..648383 (-) 444 WP_067087242.1 D-aminoacyl-tRNA deacylase -
  A4H00_RS03280 - 648392..650593 (-) 2202 WP_067087244.1 RelA/SpoT family protein -
  A4H00_RS03290 - 651405..652736 (+) 1332 WP_067087248.1 ATP-binding cassette domain-containing protein -
  A4H00_RS03295 - 652720..653709 (+) 990 WP_067087250.1 ArsR/SmtB family transcription factor -
  A4H00_RS03300 - 654010..654546 (+) 537 WP_067087251.1 STM3941 family protein -
  A4H00_RS12075 - 654569..655312 (-) 744 WP_067087254.1 16S rRNA (uracil(1498)-N(3))-methyltransferase -
  A4H00_RS12080 prmA 655313..656257 (-) 945 WP_067087256.1 50S ribosomal protein L11 methyltransferase -
  A4H00_RS03315 - 656268..657176 (-) 909 WP_067087259.1 GIY-YIG nuclease family protein -
  A4H00_RS03320 - 657179..657649 (-) 471 WP_067087262.1 DUF3013 family protein -
  A4H00_RS03325 - 657735..659020 (+) 1286 Protein_646 replication-associated recombination protein A -
  A4H00_RS03340 - 659918..661462 (+) 1545 WP_067087265.1 quinol oxidase -
  A4H00_RS03345 nrdD 661613..663823 (+) 2211 WP_067087268.1 anaerobic ribonucleoside-triphosphate reductase -
  A4H00_RS12005 - 663801..663938 (+) 138 WP_099092152.1 hypothetical protein -
  A4H00_RS03350 - 664090..664593 (+) 504 WP_067087269.1 GNAT family N-acetyltransferase -
  A4H00_RS03355 - 664597..665106 (+) 510 WP_067087272.1 GNAT family N-acetyltransferase -
  A4H00_RS03360 nrdG 665103..665699 (+) 597 WP_067087275.1 anaerobic ribonucleoside-triphosphate reductase activating protein -
  A4H00_RS03365 glf 665761..666866 (+) 1106 Protein_653 UDP-galactopyranose mutase -
  A4H00_RS03375 - 667284..667625 (-) 342 WP_067087279.1 DUF1033 family protein -
  A4H00_RS03380 groL 667961..669583 (-) 1623 WP_067087281.1 chaperonin GroEL -
  A4H00_RS03385 groES 669597..669881 (-) 285 WP_067087286.1 co-chaperone GroES -
  A4H00_RS03390 ssbB/cilA 670604..670999 (-) 396 WP_067087289.1 single-stranded DNA-binding protein Machinery gene
  A4H00_RS03395 ytpR 671065..671691 (-) 627 WP_067087291.1 YtpR family tRNA-binding protein -
  A4H00_RS03400 - 671695..672015 (-) 321 WP_067087294.1 thioredoxin family protein -
  A4H00_RS03405 - 672012..672296 (-) 285 WP_067091432.1 DUF4651 domain-containing protein -
  A4H00_RS03410 pepA 672348..673421 (+) 1074 WP_067087297.1 glutamyl aminopeptidase -
  A4H00_RS03415 - 673461..674024 (-) 564 WP_067087300.1 folate family ECF transporter S component -
  A4H00_RS03420 - 674641..675831 (-) 1191 WP_067087302.1 acetate kinase -
  A4H00_RS03425 comYH 675880..676833 (-) 954 WP_067087305.1 class I SAM-dependent methyltransferase Machinery gene
  A4H00_RS03430 comGG 676856..677365 (-) 510 WP_067087307.1 competence type IV pilus minor pilin ComGG -
  A4H00_RS03435 comYF 677337..677774 (-) 438 WP_067087309.1 competence type IV pilus minor pilin ComGF Machinery gene
  A4H00_RS03440 comGE 677758..677997 (-) 240 WP_237334209.1 competence type IV pilus minor pilin ComGE -
  A4H00_RS03445 comGD/cglD 678023..678460 (-) 438 WP_067087315.1 competence type IV pilus minor pilin ComGD Machinery gene
  A4H00_RS03450 comGC/cglC 678411..678725 (-) 315 WP_067087318.1 competence type IV pilus major pilin ComGC Machinery gene
  A4H00_RS03455 comYB 678727..679755 (-) 1029 WP_167541367.1 competence type IV pilus assembly protein ComGB Machinery gene
  A4H00_RS03460 comGA/cglA/cilD 679676..680626 (-) 951 WP_067087321.1 competence type IV pilus ATPase ComGA Machinery gene
  A4H00_RS03465 - 681214..681525 (-) 312 WP_067087324.1 DUF1292 domain-containing protein -
  A4H00_RS03470 ruvX 681536..681955 (-) 420 WP_067087326.1 Holliday junction resolvase RuvX -
  A4H00_RS03475 - 681955..682221 (-) 267 WP_067087329.1 IreB family regulatory phosphoprotein -
  A4H00_RS03480 spx 682378..682776 (-) 399 WP_067087332.1 transcriptional regulator Spx -
  A4H00_RS03485 recA 682977..684155 (-) 1179 WP_067087334.1 recombinase RecA Machinery gene

Sequence


Protein


Download         Length: 145 a.a.        Molecular weight: 16901.30 Da        Isoelectric Point: 7.1639

>NTDB_id=177424 A4H00_RS03435 WP_067087309.1 677337..677774(-) (comYF) [Streptococcus marmotae strain HTS5]
MWRKSKVAGFTLLECLVALLILSGGLLVFEGLTKLIHHEIAYQTTNVEKDWLVFVDQFRSELDGVRLVKVENNRLYVDKS
GQQLSFGKSKADDFRKTNDRGRGYQPMLYQVAAATVYQESQYVQIDFVFENGWERSFVYQFEEKK

Nucleotide


Download         Length: 438 bp        

>NTDB_id=177424 A4H00_RS03435 WP_067087309.1 677337..677774(-) (comYF) [Streptococcus marmotae strain HTS5]
ATGTGGCGAAAGAGTAAGGTGGCAGGTTTTACCTTGTTAGAGTGTTTGGTTGCTCTGCTGATTTTATCTGGTGGATTACT
CGTTTTTGAGGGGCTGACGAAGCTGATTCATCATGAAATAGCCTATCAGACGACGAATGTAGAGAAGGACTGGCTTGTTT
TTGTAGACCAGTTTCGCTCGGAATTGGATGGGGTTCGGCTAGTCAAAGTGGAGAACAATCGCCTATATGTCGATAAATCA
GGGCAACAATTATCTTTCGGAAAGTCAAAGGCAGATGATTTTCGCAAGACAAACGATCGTGGTCGTGGTTACCAACCGAT
GCTGTATCAGGTAGCCGCTGCAACTGTCTACCAAGAAAGCCAGTATGTTCAGATTGATTTTGTATTTGAAAATGGTTGGG
AGAGAAGTTTTGTTTATCAATTTGAAGAAAAGAAGTAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comYF Streptococcus mutans UA140

54.61

97.241

0.531

  comYF Streptococcus mutans UA159

53.901

97.241

0.524

  comGF/cglF Streptococcus pneumoniae D39

48.252

98.621

0.476

  comGF/cglF Streptococcus pneumoniae R6

48.252

98.621

0.476

  comGF/cglF Streptococcus pneumoniae TIGR4

48.252

98.621

0.476

  comGF/cglF Streptococcus pneumoniae Rx1

48.252

98.621

0.476

  comGF Lactococcus lactis subsp. cremoris KW2

47.183

97.931

0.462

  comGF/cglF Streptococcus mitis SK321

45.455

98.621

0.448

  comGF/cglF Streptococcus mitis NCTC 12261

45.455

98.621

0.448


Multiple sequence alignment