Detailed information    

in silico: Bioinformatically predicted

Overview


Name: comD
Type: Machinery gene
Locus tag: DV441_RS04980
Genome accession: NZ_CP031238
Coordinates: 1038709..1039122 (+)
Length: 137 a.a.
NCBI ID: WP_046953214.1
UniProt ID: A0A0M3G6S1
Organism: Haemophilus haemolyticus strain M28486
Function: type IV pilus biogenesis and function (predicted from homology); DNA binding and uptake

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type MGE coordinates Gene coordinates Relative position Distance (bp)
Prophage 1011422..1047175 1038709..1039122 within 0
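
The "Relative position" and "Distance (bp)" values in the row above follow from comparing the two coordinate ranges. A minimal sketch of that comparison is given here; the function name `relate_gene_to_mge` and every label other than "within" are illustrative assumptions, not terms taken from VRprofile2 or from this page.

```python
def relate_gene_to_mge(gene, mge):
    """Classify a gene's position relative to an MGE region.

    Both arguments are (start, end) tuples in 1-based inclusive coordinates.
    Only the "within" label appears in the source table; the other labels
    are hypothetical names chosen for this sketch.
    """
    g_start, g_end = gene
    m_start, m_end = mge
    if m_start <= g_start and g_end <= m_end:
        return "within", 0                      # gene fully inside the MGE
    if g_end < m_start:
        return "upstream", m_start - g_end      # gap before the MGE
    if g_start > m_end:
        return "downstream", g_start - m_end    # gap after the MGE
    return "overlapping", 0                     # partial overlap


# Values from the summary row: prophage 1011422..1047175, comD 1038709..1039122
position, distance = relate_gene_to_mge((1038709, 1039122), (1011422, 1047175))
print(position, distance)
```

For the comD coordinates this classifies the gene as lying inside the predicted prophage region, with a distance of 0 bp.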


Gene organization within MGE regions


Location: 1011422..1047175
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  DV441_RS04805 mutY 1011422..1012558 (+) 1137 WP_046953237.1 A/G-specific adenine glycosylase -
  DV441_RS04810 - 1012536..1012808 (+) 273 WP_005625838.1 oxidative damage protection protein -
  DV441_RS04815 mltC 1012823..1013899 (+) 1077 WP_046953236.1 membrane-bound lytic murein transglycosylase MltC -
  DV441_RS04830 - 1015230..1016420 (+) 1191 WP_046953234.1 phage major capsid protein -
  DV441_RS04835 - 1016465..1017031 (+) 567 WP_046953233.1 HK97 family phage prohead protease -
  DV441_RS04840 - 1017033..1018265 (+) 1233 WP_046953232.1 phage portal protein -
  DV441_RS04845 - 1018249..1018593 (+) 345 WP_046953231.1 phage head closure protein -
  DV441_RS04850 - 1018580..1018888 (+) 309 WP_046953230.1 head-tail connector protein -
  DV441_RS04855 - 1018905..1019267 (+) 363 WP_046953229.1 HNH endonuclease signature motif containing protein -
  DV441_RS04860 - 1019369..1019743 (+) 375 WP_005626376.1 phage terminase small subunit P27 family -
  DV441_RS04865 - 1019751..1021418 (+) 1668 WP_046953228.1 terminase large subunit -
  DV441_RS04870 - 1021739..1022026 (+) 288 WP_230599552.1 hypothetical protein -
  DV441_RS04875 - 1022224..1022607 (+) 384 WP_046953227.1 hypothetical protein -
  DV441_RS04880 - 1022708..1023547 (+) 840 WP_046953226.1 hypothetical protein -
  DV441_RS04885 - 1023719..1023940 (+) 222 WP_005760675.1 helix-turn-helix transcriptional regulator -
  DV441_RS04890 - 1024030..1024629 (+) 600 WP_230599549.1 host cell division inhibitor Icd-like protein -
  DV441_RS04895 - 1024622..1024870 (+) 249 WP_046953225.1 hypothetical protein -
  DV441_RS04900 - 1024863..1025063 (+) 201 WP_046953224.1 hypothetical protein -
  DV441_RS04905 - 1025053..1025259 (+) 207 WP_005629564.1 hypothetical protein -
  DV441_RS04910 - 1025246..1025794 (+) 549 WP_046953223.1 hypothetical protein -
  DV441_RS04915 - 1025802..1026239 (+) 438 WP_046953240.1 hypothetical protein -
  DV441_RS04920 - 1026223..1027991 (+) 1769 Protein_935 DNA primase family protein -
  DV441_RS04925 - 1028163..1029389 (-) 1227 WP_046953221.1 tyrosine-type recombinase/integrase -
  DV441_RS04935 secG 1029688..1030026 (-) 339 WP_046939867.1 preprotein translocase subunit SecG -
  DV441_RS04940 - 1030135..1032090 (-) 1956 WP_046953220.1 DNA topoisomerase III -
  DV441_RS04945 recR 1032106..1032708 (-) 603 WP_005626527.1 recombination mediator RecR -
  DV441_RS04950 - 1032771..1033100 (-) 330 WP_005629464.1 YbaB/EbfC family nucleoid-associated protein -
  DV441_RS04955 - 1033268..1034113 (-) 846 WP_046953219.1 23S rRNA (adenine(2030)-N(6))-methyltransferase RlmJ -
  DV441_RS04960 - 1034187..1036787 (-) 2601 WP_046953218.1 penicillin-binding protein 1A -
  DV441_RS04965 comA 1036887..1037666 (+) 780 WP_114892495.1 competence protein ComA Machinery gene
  DV441_RS04970 comB 1037685..1038194 (+) 510 WP_046953216.1 competence protein B Machinery gene
  DV441_RS04975 comC 1038191..1038712 (+) 522 WP_046953215.1 competence protein ComC Machinery gene
  DV441_RS04980 comD 1038709..1039122 (+) 414 WP_046953214.1 pilus assembly protein PilP Machinery gene
  DV441_RS04985 comE 1039132..1040475 (+) 1344 WP_114892496.1 type IV pilus secretin PilQ family protein Machinery gene
  DV441_RS04990 tnpA 1040472..1040927 (+) 456 WP_005626546.1 IS200/IS605 family transposase -
  DV441_RS04995 - 1041129..1041815 (+) 687 WP_005626547.1 ComF family protein -
  DV441_RS05000 nfuA 1041891..1042487 (+) 597 WP_005626550.1 Fe-S biogenesis protein NfuA -
  DV441_RS05005 nudC 1042556..1043353 (+) 798 WP_005626553.1 NAD(+) diphosphatase -
  DV441_RS05010 - 1043386..1043976 (+) 591 WP_005626555.1 YjaG family protein -
  DV441_RS05015 - 1044114..1044386 (+) 273 WP_005626557.1 HU family DNA-binding protein -
  DV441_RS05020 - 1044514..1045287 (+) 774 WP_005626560.1 DeoR/GlpR family DNA-binding transcription regulator -
  DV441_RS05025 glmS 1045343..1047175 (+) 1833 WP_005626562.1 glutamine--fructose-6-phosphate transaminase (isomerizing) -

Sequence


Protein


Length: 137 a.a.        Molecular weight: 15593.71 Da        Isoelectric Point: 4.4237

>NTDB_id=304398 DV441_RS04980 WP_046953214.1 1038709..1039122(+) (comD) [Haemophilus haemolyticus strain M28486]
MKYWFYLIILFFMNCSWGQDPFDKTQRTHSQFDNAQTVMEQTEIISSDVPNNLCGADENRQAAEIPLNALKLVGVVISKD
KAFALLLDQDLQIYSVLEGVDVAQEGYVVTKINQNKVQFMRKSGAQCDSTEQKELSF

Nucleotide


Length: 414 bp

>NTDB_id=304398 DV441_RS04980 WP_046953214.1 1038709..1039122(+) (comD) [Haemophilus haemolyticus strain M28486]
ATGAAATATTGGTTTTACCTGATTATATTATTTTTTATGAATTGCAGTTGGGGACAAGATCCTTTCGATAAAACACAGCG
TACCCATTCTCAGTTTGATAACGCACAAACAGTAATGGAGCAGACAGAAATAATTTCCTCAGATGTACCTAATAATCTAT
GCGGAGCTGATGAAAATCGCCAAGCAGCTGAAATTCCTTTGAACGCTTTAAAATTGGTGGGCGTAGTGATTTCTAAAGAT
AAAGCGTTTGCGTTATTGCTGGATCAGGATTTGCAAATTTACAGCGTTTTAGAGGGCGTTGATGTAGCTCAAGAAGGATA
TGTTGTAACAAAAATCAATCAAAATAAAGTTCAATTTATGCGTAAGTCTGGTGCTCAATGTGATAGTACTGAACAGAAAG
AATTAAGTTTTTAA
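
The protein and nucleotide records can be cross-checked against the coordinates in the FASTA header: the span 1038709..1039122 covers 414 bp, which is 137 codons plus one stop codon. The sketch below translates the nucleotide record and compares it with the protein record. It assumes the standard codon assignments, which are sufficient for this gene; the helper name `translate` is illustrative.

```python
from itertools import product

# Standard codon table, built in the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Sequences copied from the FASTA records above (line breaks removed).
NUCLEOTIDE = (
    "ATGAAATATTGGTTTTACCTGATTATATTATTTTTTATGAATTGCAGTTGGGGACAAGATCCTTTCGATAAAACACAGCG"
    "TACCCATTCTCAGTTTGATAACGCACAAACAGTAATGGAGCAGACAGAAATAATTTCCTCAGATGTACCTAATAATCTAT"
    "GCGGAGCTGATGAAAATCGCCAAGCAGCTGAAATTCCTTTGAACGCTTTAAAATTGGTGGGCGTAGTGATTTCTAAAGAT"
    "AAAGCGTTTGCGTTATTGCTGGATCAGGATTTGCAAATTTACAGCGTTTTAGAGGGCGTTGATGTAGCTCAAGAAGGATA"
    "TGTTGTAACAAAAATCAATCAAAATAAAGTTCAATTTATGCGTAAGTCTGGTGCTCAATGTGATAGTACTGAACAGAAAG"
    "AATTAAGTTTTTAA"
)
PROTEIN = (
    "MKYWFYLIILFFMNCSWGQDPFDKTQRTHSQFDNAQTVMEQTEIISSDVPNNLCGADENRQAAEIPLNALKLVGVVISKD"
    "KAFALLLDQDLQIYSVLEGVDVAQEGYVVTKINQNKVQFMRKSGAQCDSTEQKELSF"
)


def translate(dna):
    """Translate a DNA string codon by codon; '*' marks a stop codon."""
    usable = len(dna) - len(dna) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


gene_length = 1039122 - 1038709 + 1       # inclusive coordinates from the header
print(gene_length == len(NUCLEOTIDE))     # coordinate span vs. nucleotide record
print(translate(NUCLEOTIDE) == PROTEIN + "*")  # translation vs. protein record
```

Both comparisons should hold if the records are copied without alteration; a mismatch would point to a transcription error rather than a property of the gene.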

Domains


Predicted by InterProScan.

(31-119)


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB A0A0M3G6S1

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.





Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comD Haemophilus influenzae 86-028NP 89.781 100 0.898
  comD Haemophilus influenzae Rd KW20 89.051 100 0.891
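
This page does not define the Ha-value. As an observation only, both listed values are numerically consistent with the product of the identity fraction and the coverage fraction, rounded to three decimals. The sketch below checks that assumed relationship; the function name `ha_value` and the formula itself are assumptions, not a documented definition.

```python
def ha_value(identity_pct, coverage_pct):
    """Assumed formula: (identity / 100) * (coverage / 100), rounded to 3 places.

    This relationship is inferred from the two rows listed above and is
    not stated anywhere in the source record.
    """
    return round((identity_pct / 100) * (coverage_pct / 100), 3)


# (identity %, coverage %, listed Ha-value) for the two similar proteins
rows = [
    (89.781, 100, 0.898),  # comD, Haemophilus influenzae 86-028NP
    (89.051, 100, 0.891),  # comD, Haemophilus influenzae Rd KW20
]
for identity, coverage, listed in rows:
    print(ha_value(identity, coverage), listed)
```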


Multiple sequence alignment