Detailed information    

insolico Bioinformatically predicted

Overview


Name   comE   Type   Machinery gene
Locus tag   C3V42_RS02795 Genome accession   NZ_CP027235
Coordinates   573711..575048 (-) Length   445 a.a.
NCBI ID   WP_065265620.1    Uniprot ID   A0A1B8QNS1
Organism   Haemophilus sp. oral taxon 036 strain F0629     
Function   type IV pilus biogenesis and function (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 568711..580048
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  C3V42_RS02765 (C3V42_02765) - 569394..569666 (-) 273 WP_048952927.1 HU family DNA-binding protein -
  C3V42_RS02770 (C3V42_02770) - 569806..570396 (-) 591 WP_106013559.1 DUF416 family protein -
  C3V42_RS02775 (C3V42_02775) hemE 570406..571470 (-) 1065 WP_106013560.1 uroporphyrinogen decarboxylase -
  C3V42_RS02780 (C3V42_02780) nudC 571471..572271 (-) 801 WP_106013561.1 NAD(+) diphosphatase -
  C3V42_RS02785 (C3V42_02785) nfuA 572340..572936 (-) 597 WP_005634545.1 Fe-S biogenesis protein NfuA -
  C3V42_RS02790 (C3V42_02790) - 573012..573698 (-) 687 WP_164994244.1 ComF family protein -
  C3V42_RS02795 (C3V42_02795) comE 573711..575048 (-) 1338 WP_065265620.1 type IV pilus secretin PilQ family protein Machinery gene
  C3V42_RS02800 (C3V42_02800) comD 575058..575471 (-) 414 WP_048952933.1 pilus assembly protein PilP Machinery gene
  C3V42_RS02805 (C3V42_02805) comC 575468..575989 (-) 522 WP_065265619.1 competence protein ComC Machinery gene
  C3V42_RS02810 (C3V42_02810) comB 575986..576492 (-) 507 WP_065265618.1 competence protein B Machinery gene
  C3V42_RS02815 (C3V42_02815) comA 576493..577290 (-) 798 WP_106013563.1 competence protein ComA Machinery gene
  C3V42_RS02820 (C3V42_02820) - 577389..579992 (+) 2604 WP_106013564.1 penicillin-binding protein 1A -

Sequence


Protein


Download         Length: 445 a.a.        Molecular weight: 48981.23 Da        Isoelectric Point: 6.9209

>NTDB_id=275929 C3V42_RS02795 WP_065265620.1 573711..575048(-) (comE) [Haemophilus sp. oral taxon 036 strain F0629]
MKKYLLKCGCFLMYFWLPLIVFANPKTDNERFFIRLSQAPLAQTLEQLAFQHDVNLVMDETLEGNISLKLDNIDMPRLLQ
IIAKSKKLTLNKDEGIYYLNGGLSGKGQVAGNLATNELHLVSHTVKLHFAKASELMKSLTTGSGSLLSPAGSITFDDRSN
LLVIQDEPRSVQNIKKLISEMDKPIEQIAIEARIVTITDESLKELGVRWGIFNPTENAARVAGSLAGNGFENIADNLNVN
FATTTTPAGSIALQVAKINGRLLDLELSALERENNVEIIASPRLLTTNKKSASIKQGTEIPYVVSNTRNDTQSVEFREAV
LGLEVTPHISKDNNILLDLLVSQNSPGSRVAYGQNEVVSIDKQEINTQVFAKDGETIVLGGVFHDTITKSEDKVPVLGDI
PGIKRLFSKESERHQKRELVIFVTPHILKAGETLDALKQKSAGKK

Nucleotide


Download         Length: 1338 bp        

>NTDB_id=275929 C3V42_RS02795 WP_065265620.1 573711..575048(-) (comE) [Haemophilus sp. oral taxon 036 strain F0629]
ATGAAGAAATATCTTTTAAAGTGCGGTTGTTTTTTAATGTATTTTTGGTTGCCATTAATCGTTTTTGCTAACCCCAAAAC
TGATAACGAACGTTTTTTTATTCGTTTGTCGCAAGCGCCTTTAGCTCAAACTCTGGAGCAATTAGCTTTCCAACATGACG
TCAATTTAGTGATGGATGAGACGCTAGAAGGCAATATTTCTTTGAAATTAGATAATATTGACATGCCACGTTTGCTACAA
ATAATCGCTAAAAGTAAGAAGCTTACTTTGAATAAAGATGAGGGTATTTATTATCTGAATGGAGGGCTATCAGGCAAAGG
TCAAGTTGCAGGAAATCTTGCTACAAATGAACTGCACTTAGTGAGCCATACGGTAAAACTCCATTTTGCTAAAGCCTCTG
AATTAATGAAATCCTTAACAACAGGAAGTGGATCGTTGCTTTCGCCTGCAGGGAGCATTACCTTTGATGATCGTAGTAAT
TTACTGGTTATTCAGGATGAACCTCGTTCTGTGCAAAATATTAAAAAATTAATTTCTGAAATGGATAAACCTATTGAGCA
GATCGCTATTGAAGCTCGTATTGTGACAATTACGGATGAGAGTTTGAAAGAACTTGGCGTTCGGTGGGGGATTTTTAATC
CAACGGAAAATGCTGCGCGTGTAGCCGGCAGCTTAGCCGGTAATGGCTTTGAAAATATTGCGGATAATCTTAATGTAAAT
TTTGCGACAACGACGACACCTGCTGGCTCGATAGCATTACAAGTCGCGAAAATTAATGGGCGATTGCTTGATTTAGAATT
GAGTGCGTTGGAGCGTGAAAATAATGTAGAAATTATCGCTAGTCCTCGCTTACTAACCACGAATAAGAAAAGTGCGAGTA
TTAAACAGGGAACAGAAATTCCTTATGTTGTGAGCAATACTCGTAACGATACGCAATCTGTGGAATTTCGTGAGGCAGTG
CTAGGTTTGGAAGTGACACCACATATTTCTAAAGATAACAATATTTTGCTTGATTTATTGGTGAGCCAAAATTCCCCAGG
TTCTCGTGTCGCTTATGGACAAAACGAGGTGGTTTCTATTGATAAGCAAGAAATTAATACTCAAGTTTTTGCCAAAGATG
GGGAAACCATTGTGCTTGGCGGCGTGTTCCACGATACGATCACAAAAAGCGAAGATAAAGTGCCAGTACTTGGCGATATA
CCAGGGATTAAACGATTGTTTAGTAAAGAAAGTGAACGACATCAAAAACGTGAGCTAGTGATTTTCGTGACGCCGCATAT
TTTAAAAGCAGGAGAAACGTTAGATGCTTTGAAGCAAAAAAGTGCGGGTAAAAAATAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB A0A1B8QNS1

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comE Haemophilus influenzae Rd KW20

93.708

100

0.937

  comE Haemophilus influenzae 86-028NP

93.708

100

0.937

  comE Glaesserella parasuis strain SC1401

51.643

95.73

0.494

  pilQ Vibrio campbellii strain DS40M4

42.721

94.157

0.402

  pilQ Vibrio cholerae O1 biovar El Tor strain E7946

42.476

92.584

0.393

  pilQ Vibrio cholerae strain A1552

42.476

92.584

0.393


Multiple sequence alignment