Detailed information    

insolico Bioinformatically predicted

Overview


Name   comEA/comE1   Type   Machinery gene
Locus tag   A4G17_RS01420 Genome accession   NZ_CP015029
Coordinates   298656..299000 (+) Length   114 a.a.
NCBI ID   WP_123956896.1    Uniprot ID   A0AAE7C225
Organism   Frederiksenia canicola strain HPA 21     
Function   dsDNA binding (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 293656..304000
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  A4G17_RS01395 (A4G17_01380) - 294357..294923 (-) 567 WP_123956891.1 hypothetical protein -
  A4G17_RS01400 (A4G17_01385) waaF 294913..295965 (-) 1053 WP_123956892.1 lipopolysaccharide heptosyltransferase II -
  A4G17_RS01405 (A4G17_01390) - 295969..296370 (-) 402 WP_123956893.1 MliC family protein -
  A4G17_RS01410 (A4G17_01395) - 296437..297117 (-) 681 WP_123956894.1 hypothetical protein -
  A4G17_RS01415 (A4G17_01400) - 297134..298423 (-) 1290 WP_123956895.1 LysM-like peptidoglycan-binding domain-containing protein -
  A4G17_RS01420 (A4G17_01405) comEA/comE1 298656..299000 (+) 345 WP_123956896.1 ComEA family DNA-binding protein Machinery gene
  A4G17_RS01425 (A4G17_01410) asd 299158..300045 (+) 888 WP_123956897.1 archaetidylserine decarboxylase -
  A4G17_RS01430 (A4G17_01415) - 300121..300681 (+) 561 WP_123956898.1 DUF1287 domain-containing protein -
  A4G17_RS01435 (A4G17_01420) - 300694..301233 (+) 540 WP_123956899.1 DsbE family thiol:disulfide interchange protein -
  A4G17_RS01440 (A4G17_01425) - 301233..301691 (+) 459 WP_123956900.1 cytochrome c-type biogenesis protein -
  A4G17_RS01445 (A4G17_01430) ccmI 301688..302605 (+) 918 WP_123956901.1 c-type cytochrome biogenesis protein CcmI -

Sequence


Protein


Download         Length: 114 a.a.        Molecular weight: 12486.54 Da        Isoelectric Point: 10.1659

>NTDB_id=176271 A4G17_RS01420 WP_123956896.1 298656..299000(+) (comEA/comE1) [Frederiksenia canicola strain HPA 21]
MKKITQLALTLLLGVFTTQSFAKEKAQVPTAEPPPMQQQTEAKKVALNPNLVNINTASAAEIQDKLVGIGAKKAQAILEY
REKNGKFLNIEQLTEVSGIGKATLEKNRDRIVLE

Nucleotide


Download         Length: 345 bp        

>NTDB_id=176271 A4G17_RS01420 WP_123956896.1 298656..299000(+) (comEA/comE1) [Frederiksenia canicola strain HPA 21]
ATGAAGAAAATCACACAACTGGCTTTAACACTGCTGCTCGGTGTATTTACCACCCAATCTTTTGCCAAAGAAAAAGCCCA
AGTTCCAACCGCTGAACCGCCTCCAATGCAACAACAGACAGAAGCCAAAAAGGTAGCACTCAATCCGAATTTAGTGAATA
TCAACACGGCTTCTGCGGCAGAGATCCAAGATAAACTGGTCGGAATTGGGGCAAAGAAAGCTCAAGCAATCCTTGAATAT
CGTGAGAAAAACGGCAAATTTCTCAATATTGAGCAATTGACCGAGGTCTCAGGTATCGGTAAAGCAACGCTTGAGAAAAA
CCGCGATCGCATCGTTTTAGAATAA

Domains


Predicted by InterproScan.

(50-111)


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comEA/comE1 Glaesserella parasuis strain SC1401

65.789

100

0.658

  comE1/comEA Haemophilus influenzae Rd KW20

52.83

92.982

0.491

  comEA Vibrio campbellii strain DS40M4

39.815

94.737

0.377


Multiple sequence alignment