Detailed information    

insolico Bioinformatically predicted

Overview


Name   comA   Type   Machinery gene
Locus tag   DV441_RS04965 Genome accession   NZ_CP031238
Coordinates   1036887..1037666 (+) Length   259 a.a.
NCBI ID   WP_114892495.1    Uniprot ID   -
Organism   Haemophilus haemolyticus strain M28486     
Function   type IV pilus biogenesis and function (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 1031887..1042666
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  DV441_RS04945 recR 1032106..1032708 (-) 603 WP_005626527.1 recombination mediator RecR -
  DV441_RS04950 - 1032771..1033100 (-) 330 WP_005629464.1 YbaB/EbfC family nucleoid-associated protein -
  DV441_RS04955 - 1033268..1034113 (-) 846 WP_046953219.1 23S rRNA (adenine(2030)-N(6))-methyltransferase RlmJ -
  DV441_RS04960 - 1034187..1036787 (-) 2601 WP_046953218.1 penicillin-binding protein 1A -
  DV441_RS04965 comA 1036887..1037666 (+) 780 WP_114892495.1 competence protein ComA Machinery gene
  DV441_RS04970 comB 1037685..1038194 (+) 510 WP_046953216.1 competence protein B Machinery gene
  DV441_RS04975 comC 1038191..1038712 (+) 522 WP_046953215.1 competence protein ComC Machinery gene
  DV441_RS04980 comD 1038709..1039122 (+) 414 WP_046953214.1 pilus assembly protein PilP Machinery gene
  DV441_RS04985 comE 1039132..1040475 (+) 1344 WP_114892496.1 type IV pilus secretin PilQ family protein Machinery gene
  DV441_RS04990 tnpA 1040472..1040927 (+) 456 WP_005626546.1 IS200/IS605 family transposase -
  DV441_RS04995 - 1041129..1041815 (+) 687 WP_005626547.1 ComF family protein -
  DV441_RS05000 nfuA 1041891..1042487 (+) 597 WP_005626550.1 Fe-S biogenesis protein NfuA -

Sequence


Protein


Download         Length: 259 a.a.        Molecular weight: 30780.04 Da        Isoelectric Point: 5.8034

>NTDB_id=304395 DV441_RS04965 WP_114892495.1 1036887..1037666(+) (comA) [Haemophilus haemolyticus strain M28486]
MQFSLKNHRTLQIGVHRKQGYFDFVWFDELEQPQCYQAFVNERDFKNRFLRHLKTQSQGKTFSLQFVASISAHLTWSKVL
MLPQILNAQECHQQCKFVIEKELPIPLEELWFDYISTPLKQGFRLEITAIREASAQTYLQDFQPFKINVLDVLPHSILRA
FQYLLNEQVRSENTLFLFQEDDYCLAICERSQQSQILQSHENLTALYEQFTERFEGQLEQVFVYQIPSSHTPLPENWQRV
ETDLPFIALGNALWQKDLH

Nucleotide


Download         Length: 780 bp        

>NTDB_id=304395 DV441_RS04965 WP_114892495.1 1036887..1037666(+) (comA) [Haemophilus haemolyticus strain M28486]
ATGCAATTCTCCTTGAAAAATCACCGCACTTTACAAATCGGCGTTCATCGTAAGCAGGGTTATTTTGATTTTGTGTGGTT
TGATGAACTTGAACAGCCACAATGTTATCAAGCCTTTGTCAATGAGCGTGATTTTAAAAATCGTTTTTTGCGACATTTAA
AAACACAATCTCAAGGTAAAACCTTTTCTTTGCAGTTTGTGGCAAGTATTTCTGCTCACTTGACTTGGTCGAAAGTATTA
ATGTTGCCGCAAATATTAAATGCTCAAGAATGTCATCAACAATGTAAATTTGTGATTGAGAAAGAGTTGCCCATTCCTTT
AGAGGAATTGTGGTTTGATTATATTTCTACTCCGTTAAAGCAAGGTTTTCGTTTAGAGATTACAGCAATTCGAGAGGCGA
GTGCGCAAACTTATCTGCAGGACTTTCAACCATTTAAAATTAATGTGTTAGATGTTCTTCCTCATAGCATCTTGCGTGCA
TTTCAGTATTTGTTGAATGAACAAGTGCGGTCAGAAAATACCTTATTTTTATTTCAAGAAGATGACTATTGTTTGGCGAT
TTGTGAAAGATCTCAGCAATCGCAAATTTTACAATCTCACGAAAATTTGACCGCACTTTATGAGCAATTTACCGAACGTT
TTGAAGGGCAACTTGAACAAGTTTTTGTGTATCAAATTCCCTCAAGTCATACACCATTACCAGAAAACTGGCAGCGAGTA
GAAACGGATTTACCTTTTATCGCACTTGGCAATGCGCTGTGGCAAAAAGATTTACATTAA

Domains



No domain identified.



Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comA Haemophilus influenzae 86-028NP

89.961

100

0.9

  comA Haemophilus influenzae Rd KW20

89.575

100

0.896


Multiple sequence alignment