Detailed information    

insolico Bioinformatically predicted

Overview


Name   comB   Type   Machinery gene
Locus tag   K8P04_RS03275 Genome accession   NZ_CP082856
Coordinates   662953..663459 (-) Length   168 a.a.
NCBI ID   WP_011272049.1    Uniprot ID   Q4QNA7
Organism   Haemophilus influenzae strain FDAARGOS_1479     
Function   type IV pilus biogenesis and function (predicted from homology)   
DNA binding and uptake

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type MGE coordinates Gene coordinates Relative position Distance (bp)
Prophage 654876..689448 662953..663459 within 0


Gene organization within MGE regions


Location: 654876..689448
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  K8P04_RS03225 (K8P04_03200) dsbB 654876..655409 (+) 534 WP_015701641.1 disulfide bond formation protein DsbB -
  K8P04_RS03230 (K8P04_03205) glmS 655464..657296 (-) 1833 WP_015701642.1 glutamine--fructose-6-phosphate transaminase (isomerizing) -
  K8P04_RS03235 (K8P04_03210) - 657409..657681 (-) 273 WP_005630954.1 HU family DNA-binding protein -
  K8P04_RS03240 (K8P04_03215) - 657821..658411 (-) 591 WP_015701643.1 YjaG family protein -
  K8P04_RS03245 (K8P04_03220) nudC 658447..659241 (-) 795 WP_015701644.1 NAD(+) diphosphatase -
  K8P04_RS03250 (K8P04_03225) nfuA 659308..659904 (-) 597 WP_005649300.1 Fe-S biogenesis protein NfuA -
  K8P04_RS03255 (K8P04_03230) - 659980..660666 (-) 687 WP_015701645.1 ComF family protein -
  K8P04_RS03260 (K8P04_03235) comE 660679..662016 (-) 1338 WP_015701646.1 type IV pilus secretin PilQ family protein Machinery gene
  K8P04_RS03265 (K8P04_03240) - 662026..662438 (-) 413 Protein_606 competence protein -
  K8P04_RS03270 (K8P04_03245) comC 662435..662956 (-) 522 WP_077643691.1 competence protein ComC Machinery gene
  K8P04_RS03275 (K8P04_03250) comB 662953..663459 (-) 507 WP_011272049.1 competence protein ComB Machinery gene
  K8P04_RS03280 (K8P04_03255) comA 663460..664257 (-) 798 WP_005693722.1 competence protein A Machinery gene
  K8P04_RS03285 (K8P04_03260) - 664356..666950 (+) 2595 WP_077643690.1 penicillin-binding protein 1A -
  K8P04_RS03290 (K8P04_03265) - 667027..667872 (+) 846 WP_015701650.1 23S rRNA (adenine(2030)-N(6))-methyltransferase RlmJ -
  K8P04_RS03295 (K8P04_03270) - 668025..668354 (+) 330 WP_005629464.1 YbaB/EbfC family nucleoid-associated protein -
  K8P04_RS03300 (K8P04_03275) recR 668487..669089 (+) 603 WP_005626527.1 recombination mediator RecR -
  K8P04_RS03305 (K8P04_03280) - 669105..671060 (+) 1956 WP_015701651.1 DNA topoisomerase III -
  K8P04_RS03310 (K8P04_03285) secG 671169..671510 (+) 342 WP_005649335.1 preprotein translocase subunit SecG -
  K8P04_RS03320 (K8P04_03295) - 671888..673114 (+) 1227 WP_015701652.1 integrase arm-type DNA-binding domain-containing protein -
  K8P04_RS03325 (K8P04_03300) - 673286..675055 (-) 1770 WP_015701653.1 phage/plasmid primase, P4 family -
  K8P04_RS03330 (K8P04_03305) - 675084..675434 (-) 351 WP_015701654.1 hypothetical protein -
  K8P04_RS03335 (K8P04_03310) - 675427..676065 (-) 639 WP_015701655.1 hypothetical protein -
  K8P04_RS03340 (K8P04_03315) - 676052..676258 (-) 207 WP_015701656.1 hypothetical protein -
  K8P04_RS03345 (K8P04_03320) - 676248..676448 (-) 201 WP_015701657.1 hypothetical protein -
  K8P04_RS03350 (K8P04_03325) - 676438..676809 (-) 372 WP_015701658.1 hypothetical protein -
  K8P04_RS03355 (K8P04_03330) - 676802..677731 (-) 930 WP_015701659.1 host cell division inhibitor Icd-like protein -
  K8P04_RS03360 (K8P04_03335) - 677898..678209 (-) 312 WP_015701660.1 hypothetical protein -
  K8P04_RS03365 (K8P04_03340) - 678221..678430 (-) 210 WP_015701661.1 AlpA family transcriptional regulator -
  K8P04_RS03370 (K8P04_03345) - 678603..679565 (-) 963 WP_050792024.1 hypothetical protein -
  K8P04_RS03375 (K8P04_03350) - 679662..680048 (-) 387 WP_015701663.1 hypothetical protein -
  K8P04_RS03380 (K8P04_03355) - 680289..680573 (-) 285 WP_015701664.1 hypothetical protein -
  K8P04_RS03385 (K8P04_03360) - 680570..682237 (-) 1668 WP_015701665.1 terminase large subunit -
  K8P04_RS03390 (K8P04_03365) - 682244..682612 (-) 369 WP_015701666.1 phage terminase small subunit P27 family -
  K8P04_RS03395 (K8P04_03370) - 682809..683195 (-) 387 WP_015701667.1 HNH endonuclease signature motif containing protein -
  K8P04_RS03400 (K8P04_03375) - 683210..683524 (-) 315 WP_015701668.1 head-tail connector protein -
  K8P04_RS03405 (K8P04_03380) - 683511..683855 (-) 345 WP_041175043.1 phage head closure protein -
  K8P04_RS03410 (K8P04_03385) - 683839..685071 (-) 1233 WP_041175044.1 phage portal protein -
  K8P04_RS03415 (K8P04_03390) - 685073..685639 (-) 567 WP_015701671.1 HK97 family phage prohead protease -
  K8P04_RS03420 (K8P04_03395) - 685693..686880 (-) 1188 WP_015701672.1 phage major capsid protein -
  K8P04_RS03425 (K8P04_03400) - 687778..689448 (-) 1671 WP_015701673.1 fructose-specific PTS transporter subunit EIIC -

Sequence


Protein


Download         Length: 168 a.a.        Molecular weight: 19697.54 Da        Isoelectric Point: 7.4684

>NTDB_id=603083 K8P04_RS03275 WP_011272049.1 662953..663459(-) (comB) [Haemophilus influenzae strain FDAARGOS_1479]
MSMNLLPWRTYQHQKRLRRLAFYIALFILLAINLMLAFSNLIEQQKQNLQAQQTSFEQLNQQLHKTTMQIDQLRSAVKVG
EVLTSIPNEQVKKSLQQLSELPFQQGELNKFKQDANNLSLEGNAQDQTEFELIHQFLKKHFPNVKLSQVQPEQDTLFFHF
DVEQGAEK

Nucleotide


Download         Length: 507 bp        

>NTDB_id=603083 K8P04_RS03275 WP_011272049.1 662953..663459(-) (comB) [Haemophilus influenzae strain FDAARGOS_1479]
ATGTCGATGAATTTATTGCCTTGGCGTACTTATCAACATCAAAAGCGTTTACGTCGTTTAGCTTTTTATATCGCTTTATT
TATCTTGCTTGCTATTAATTTAATGTTGGCTTTTAGCAATTTGATTGAACAACAGAAACAAAATTTGCAAGCGCAGCAAA
CATCTTTTGAACAACTTAATCAGCAACTTCACAAAACTACCATGCAAATTGATCAGTTACGCAGTGCGGTGAAAGTTGGT
GAAGTTTTGACATCTATTCCCAACGAGCAAGTAAAAAAGAGTTTACAACAGCTAAGTGAATTACCTTTTCAACAAGGAGA
ACTGAATAAATTTAAACAAGATGCCAATAACTTAAGCTTGGAAGGTAACGCACAAGATCAAACAGAATTTGAACTGATTC
ATCAATTTTTAAAGAAACATTTTCCCAATGTGAAATTAAGTCAGGTTCAACCTGAACAAGATACATTGTTTTTTCACTTT
GATGTGGAACAAGGGGCGGAAAAATGA

Domains



No domain identified.



Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB Q4QNA7

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comB Haemophilus influenzae 86-028NP

100

100

1

  comB Haemophilus influenzae Rd KW20

98.81

100

0.988