Detailed information    

insolico Bioinformatically predicted

Overview


Name   comC   Type   Machinery gene
Locus tag   DV389_RS04150 Genome accession   NZ_CP031250
Coordinates   841943..842443 (-) Length   166 a.a.
NCBI ID   WP_114893184.1    Uniprot ID   -
Organism   Haemophilus influenzae strain M21384     
Function   type IV pilus biogenesis and function (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 836943..847443
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  DV389_RS04120 (DV389_04120) - 837306..837896 (-) 591 WP_114893178.1 DUF416 family protein -
  DV389_RS04125 (DV389_04125) nudC 837932..838726 (-) 795 WP_114893179.1 NAD(+) diphosphatase -
  DV389_RS04130 (DV389_04130) nfuA 838794..839390 (-) 597 WP_114893180.1 Fe-S biogenesis protein NfuA -
  DV389_RS04135 (DV389_04135) - 839466..840152 (-) 687 WP_162816767.1 ComF family protein -
  DV389_RS04140 (DV389_04140) comE 840165..841502 (-) 1338 WP_114893182.1 type IV pilus secretin PilQ Machinery gene
  DV389_RS04145 (DV389_04145) comD 841512..841925 (-) 414 WP_114893183.1 pilus assembly protein PilP Machinery gene
  DV389_RS04150 (DV389_04150) comC 841943..842443 (-) 501 WP_114893184.1 competence protein ComC Machinery gene
  DV389_RS04155 (DV389_04155) comB 842440..842946 (-) 507 WP_114893185.1 competence protein B Machinery gene
  DV389_RS04160 (DV389_04160) comA 842947..843744 (-) 798 WP_114893186.1 pilus assembly protein PilM Machinery gene
  DV389_RS04165 (DV389_04165) - 843843..846443 (+) 2601 WP_114893187.1 penicillin-binding protein 1A -
  DV389_RS04170 (DV389_04170) - 846520..847365 (+) 846 WP_114893188.1 23S rRNA (adenine(2030)-N(6))-methyltransferase RlmJ -

Sequence


Protein


Download         Length: 166 a.a.        Molecular weight: 19153.15 Da        Isoelectric Point: 8.6191

>NTDB_id=304951 DV389_RS04150 WP_114893184.1 841943..842443(-) (comC) [Haemophilus influenzae strain M21384]
MKAFFNDPFSPFGKWLSQPFYVHGLTFLLLLSAVIFRPVLDYIEGNSRLHETEDELAVKRSELLHQQKILTSLQQQSESR
KLSPELATQIMPLNKQIQHLAARNGLSQHLRWEMGQQPILHLQLTGHFEKTKTFLTALLANTSQLSVSRLQFMKPEDSPL
QTEIIF

Nucleotide


Download         Length: 501 bp        

>NTDB_id=304951 DV389_RS04150 WP_114893184.1 841943..842443(-) (comC) [Haemophilus influenzae strain M21384]
GTGAAAGCCTTTTTTAACGATCCTTTTAGTCCTTTTGGCAAATGGCTAAGTCAGCCTTTTTATGTTCATGGATTAACCTT
TTTATTGCTATTAAGTGCGGTAATTTTTCGCCCCGTTTTAGATTATATAGAGGGGAACTCCCGTCTCCATGAAACTGAAG
ATGAGTTAGCGGTGAAGCGTTCAGAATTATTGCATCAACAGAAAATTTTAACTTCTTTACAGCAGCAATCGGAAAGTCGA
AAACTTTCTCCAGAACTGGCTACACAAATTATGCCGTTGAATAAACAAATTCAACATTTAGCTGCACGCAACGGTTTATC
TCAGCATTTACGTTGGGAAATGGGGCAACAGCCTATTTTGCATTTACAGCTTACAGGGCATTTTGAAAAAACGAAGACAT
TTTTAACCGCACTTTTAGCTAATACGTCACAGCTTTCAGTAAGTCGGCTGCAATTTATGAAACCCGAAGACAGCCCATTG
CAAACCGAGATTATTTTTTAG

Domains



No domain identified.



Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comC Haemophilus influenzae 86-028NP

92.771

100

0.928

  comC Haemophilus influenzae Rd KW20

92.771

100

0.928


Multiple sequence alignment