Detailed information    

insolico Bioinformatically predicted

Overview


Name   comEA   Type   Machinery gene
Locus tag   C4E16_RS06840 Genome accession   NZ_CP026647
Coordinates   1521606..1521917 (-) Length   103 a.a.
NCBI ID   WP_001166105.1    Uniprot ID   Q9KQT1
Organism   Vibrio cholerae O1 biovar El Tor strain HC1037     
Function   dsDNA binding (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 1516606..1526917
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  C4E16_RS06815 (C4E16_06815) lapB 1516800..1517969 (-) 1170 WP_000889971.1 lipopolysaccharide assembly protein LapB -
  C4E16_RS06820 (C4E16_06820) - 1517981..1518262 (-) 282 WP_000691492.1 LapA family protein -
  C4E16_RS06825 (C4E16_06825) ihfB 1518401..1518679 (-) 279 WP_000167341.1 integration host factor subunit beta -
  C4E16_RS06830 (C4E16_06830) rpsA 1518967..1520637 (-) 1671 WP_000140318.1 30S ribosomal protein S1 -
  C4E16_RS06835 (C4E16_06835) cmk 1520746..1521423 (-) 678 WP_000094752.1 (d)CMP kinase -
  C4E16_RS06840 (C4E16_06840) comEA 1521606..1521917 (-) 312 WP_001166105.1 ComEA family DNA-binding protein Machinery gene
  C4E16_RS06845 (C4E16_06845) ppiD 1522052..1523911 (-) 1860 WP_000969331.1 peptidylprolyl isomerase -
  C4E16_RS06850 (C4E16_06850) - 1524064..1524336 (-) 273 WP_001044516.1 HU family DNA-binding protein -
  C4E16_RS06855 (C4E16_06855) lon 1524520..1526880 (-) 2361 WP_001047611.1 endopeptidase La -

Sequence


Protein


Download         Length: 103 a.a.        Molecular weight: 10966.78 Da        Isoelectric Point: 5.0506

>NTDB_id=271180 C4E16_RS06840 WP_001166105.1 1521606..1521917(-) (comEA) [Vibrio cholerae O1 biovar El Tor strain HC1037]
MQIKTKIVTLFLSLCLPTLPLLANAEETAPAAQVEEGIVITVNINTASAEELATLLKGIGLKKAQAIVDYREANGPFTHI
DDLTNVKGIGEATVRNNAARILL

Nucleotide


Download         Length: 312 bp        

>NTDB_id=271180 C4E16_RS06840 WP_001166105.1 1521606..1521917(-) (comEA) [Vibrio cholerae O1 biovar El Tor strain HC1037]
ATGCAAATCAAAACCAAAATAGTGACACTGTTTCTTTCTCTCTGCCTGCCGACATTACCGTTACTGGCCAATGCCGAGGA
AACGGCACCCGCTGCGCAGGTAGAAGAAGGTATTGTGATCACTGTCAATATTAATACCGCTTCTGCAGAAGAGCTGGCGA
CGTTACTCAAAGGCATCGGGCTTAAGAAAGCTCAGGCCATTGTCGATTATCGAGAAGCCAACGGTCCTTTTACTCATATC
GATGATCTGACGAATGTGAAAGGGATTGGTGAAGCGACAGTGCGCAACAATGCCGCAAGGATCTTGTTATAA

Domains


Predicted by InterproScan.

(41-101)


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB Q9KQT1

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comEA Vibrio cholerae C6706

100

100

1

  comEA Vibrio cholerae strain A1552

100

100

1

  comEA Vibrio parahaemolyticus RIMD 2210633

62.766

91.262

0.573

  comEA Vibrio campbellii strain DS40M4

58.333

93.204

0.544

  comE1/comEA Haemophilus influenzae Rd KW20

40.541

100

0.437


Multiple sequence alignment