Detailed information    

insolico Bioinformatically predicted

Overview


Name   comE1/comEA   Type   Machinery gene
Locus tag   ACMCNI_RS04900 Genome accession   NZ_AP038791
Coordinates   1105690..1106025 (+) Length   111 a.a.
NCBI ID   WP_226692204.1    Uniprot ID   -
Organism   Rodentibacter sp. THUN1657     
Function   dsDNA binding (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 1100690..1111025
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  ACMCNI_RS04885 (THUN1657_09410) - 1102962..1103699 (-) 738 WP_226692207.1 ABC transporter ATP-binding protein -
  ACMCNI_RS04890 (THUN1657_09420) - 1103766..1104734 (-) 969 WP_226692206.1 FecCD family ABC transporter permease -
  ACMCNI_RS04895 (THUN1657_09430) - 1104731..1105528 (-) 798 WP_226692205.1 helical backbone metal receptor -
  ACMCNI_RS04900 (THUN1657_09440) comE1/comEA 1105690..1106025 (+) 336 WP_226692204.1 ComEA family DNA-binding protein Machinery gene
  ACMCNI_RS04905 (THUN1657_09450) - 1106078..1106545 (-) 468 WP_226692203.1 surface-adhesin E family protein -
  ACMCNI_RS04910 (THUN1657_09460) pflA 1106579..1107319 (-) 741 WP_226692202.1 pyruvate formate lyase 1-activating protein -
  ACMCNI_RS04915 (THUN1657_09470) pflB 1107447..1109759 (-) 2313 WP_226692201.1 formate C-acetyltransferase -
  ACMCNI_RS04920 (THUN1657_09480) focA 1109809..1110660 (-) 852 WP_226692200.1 formate transporter FocA -

Sequence


Protein


Download         Length: 111 a.a.        Molecular weight: 11913.75 Da        Isoelectric Point: 10.0128

>NTDB_id=111240 ACMCNI_RS04900 WP_226692204.1 1105690..1106025(+) (comE1/comEA) [Rodentibacter sp. THUN1657]
MKLIKTLISSLVLGSALIGSSVLAEEKAVETPTTQTVTEKVPSTNMSNKLNINTATASEIQKSLIGIGAKKAEAIVQYRE
KHGNFTSAEQLLEVQGIGKATLEKNRDRLAF

Nucleotide


Download         Length: 336 bp        

>NTDB_id=111240 ACMCNI_RS04900 WP_226692204.1 1105690..1106025(+) (comE1/comEA) [Rodentibacter sp. THUN1657]
ATGAAATTAATAAAAACCTTAATCAGCTCACTTGTTCTAGGAAGCGCACTAATCGGTTCATCAGTTTTGGCAGAGGAAAA
AGCAGTAGAAACACCAACCACACAAACTGTCACTGAAAAAGTACCTTCAACTAATATGAGCAATAAGCTAAATATCAACA
CGGCGACAGCAAGTGAAATCCAAAAATCCTTAATCGGTATCGGTGCAAAGAAAGCGGAGGCAATCGTGCAATATCGTGAG
AAACACGGTAATTTCACCTCAGCGGAACAATTACTTGAAGTTCAAGGTATCGGTAAAGCGACACTTGAGAAGAATCGTGA
TCGTTTGGCCTTTTAA

Domains


Predicted by InterproScan.

(48-109)


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comE1/comEA Haemophilus influenzae Rd KW20

69.027

100

0.703

  comEA/comE1 Glaesserella parasuis strain SC1401

55.455

99.099

0.55

  comEA Vibrio cholerae C6706

44.34

95.495

0.423

  comEA Vibrio cholerae strain A1552

44.34

95.495

0.423

  comEA Acinetobacter baylyi ADP1

47.126

78.378

0.369

  comEA Legionella pneumophila str. Paris

37.736

95.495

0.36

  comEA Legionella pneumophila strain ERS1305867

37.736

95.495

0.36


Multiple sequence alignment