Detailed information    

In silico (bioinformatically predicted)

Overview


Name: comEA
Type: Machinery gene
Locus tag: NI390_RS04880
Genome accession: NZ_CP099915
Coordinates: 1036891..1037187 (+)
Length: 98 a.a.
NCBI ID: WP_020327823.1
UniProt ID: S7IAY2
Organism: Vibrio fluvialis strain Isc7A
Function: dsDNA binding (predicted from homology); DNA binding and uptake
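
The coordinates and the protein length listed above agree with each other, which can be checked by hand or with a few lines of Python. The sketch below assumes the range is 1-based and inclusive and that the coding sequence ends in a stop codon that is not counted in the protein length; the helper name `span_length` is illustrative and not part of any database API.

```python
import re


def span_length(coords: str) -> int:
    """Length in bp of a 1-based, inclusive range such as '1036891..1037187 (+)'."""
    match = re.match(r"\s*(\d+)\.\.(\d+)", coords)
    if match is None:
        raise ValueError(f"unrecognized coordinate string: {coords!r}")
    start, end = map(int, match.groups())
    return end - start + 1


cds_bp = span_length("1036891..1037187 (+)")
codons = cds_bp // 3      # codon count, including the terminal stop codon
protein_aa = codons - 1   # the stop codon is not translated

print(cds_bp, codons, protein_aa)  # 297 99 98
```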

Genomic Context


Location: 1031891..1042187
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  NI390_RS04865 (NI390_04850) lon 1031919..1034270 (+) 2352 WP_020327825.1 endopeptidase La -
  NI390_RS04870 (NI390_04855) - 1034465..1034737 (+) 273 WP_004726502.1 HU family DNA-binding protein -
  NI390_RS04875 (NI390_04860) ppiD 1034893..1036752 (+) 1860 WP_202644212.1 peptidylprolyl isomerase -
  NI390_RS04880 (NI390_04865) comEA 1036891..1037187 (+) 297 WP_020327823.1 ComEA family DNA-binding protein Machinery gene
  NI390_RS04885 (NI390_04870) cmk 1037367..1038047 (+) 681 WP_024374736.1 (d)CMP kinase -
  NI390_RS04890 (NI390_04875) rpsA 1038152..1039822 (+) 1671 WP_020327821.1 30S ribosomal protein S1 -
  NI390_RS04895 (NI390_04880) ihfB 1040053..1040337 (+) 285 WP_020430001.1 integration host factor subunit beta -
  NI390_RS04900 (NI390_04885) - 1040477..1040761 (+) 285 WP_020327818.1 LapA family protein -
  NI390_RS04905 (NI390_04890) lapB 1040774..1041943 (+) 1170 WP_020430003.1 lipopolysaccharide assembly protein LapB -
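
As a sanity check on the table above, the sketch below recomputes each gene size from its coordinates, derives the spacing between neighbouring genes, and reproduces the displayed window. The 5000 bp flank is an assumption inferred from the listed location (gene start minus 5000 to gene end plus 5000); the source does not state it. All function and variable names are illustrative.

```python
# (locus tag, gene name, start, end, listed size in bp), copied from the table above
GENES = [
    ("NI390_RS04865", "lon", 1031919, 1034270, 2352),
    ("NI390_RS04870", "-", 1034465, 1034737, 273),
    ("NI390_RS04875", "ppiD", 1034893, 1036752, 1860),
    ("NI390_RS04880", "comEA", 1036891, 1037187, 297),
    ("NI390_RS04885", "cmk", 1037367, 1038047, 681),
    ("NI390_RS04890", "rpsA", 1038152, 1039822, 1671),
    ("NI390_RS04895", "ihfB", 1040053, 1040337, 285),
    ("NI390_RS04900", "-", 1040477, 1040761, 285),
    ("NI390_RS04905", "lapB", 1040774, 1041943, 1170),
]

TARGET = "NI390_RS04880"
FLANK = 5000  # assumption inferred from the listed window, not stated by the source


def mismatched_sizes(genes):
    """Return locus tags whose listed size differs from end - start + 1."""
    return [tag for tag, _, start, end, size in genes if end - start + 1 != size]


def intergenic_gaps(genes):
    """Map each adjacent gene pair to the number of bases lying between them."""
    ordered = sorted(genes, key=lambda g: g[2])
    return {(a[0], b[0]): b[2] - a[3] - 1 for a, b in zip(ordered, ordered[1:])}


target = next(g for g in GENES if g[0] == TARGET)
window = (target[2] - FLANK, target[3] + FLANK)
gaps = intergenic_gaps(GENES)

print(mismatched_sizes(GENES))                   # [] when every size is consistent
print(window)                                    # (1031891, 1042187)
print(gaps[("NI390_RS04875", "NI390_RS04880")])  # bases between ppiD and comEA
```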

Sequence


Protein


Length: 98 a.a.        Molecular weight: 10694.42 Da        Isoelectric point: 7.2029

>NTDB_id=702483 NI390_RS04880 WP_020327823.1 1036891..1037187(+) (comEA) [Vibrio fluvialis strain Isc7A]
MKIHKFWLAILLAVTWPVSHSWAEENVSATEGVEITVNINTAPAEELATLLKGIGLKKAQAIVDYREANGAFKSKEDLTQ
VKGIGPAIVAQNDKRILL

Nucleotide


Length: 297 bp

>NTDB_id=702483 NI390_RS04880 WP_020327823.1 1036891..1037187(+) (comEA) [Vibrio fluvialis strain Isc7A]
ATGAAAATTCACAAATTTTGGTTGGCGATTCTCCTGGCTGTCACTTGGCCGGTGTCCCACAGTTGGGCGGAAGAAAATGT
TTCTGCTACAGAAGGGGTCGAGATAACTGTCAATATCAATACGGCACCAGCGGAAGAGTTGGCGACTTTACTCAAAGGTA
TCGGCCTGAAAAAAGCCCAAGCAATAGTTGACTATCGTGAGGCCAATGGTGCGTTCAAATCTAAAGAAGATCTGACACAA
GTGAAAGGCATTGGGCCAGCTATAGTTGCTCAGAATGATAAACGTATCTTGTTATAA
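
The nucleotide record above can be checked against the protein record by translating it. The following is a minimal sketch that builds the standard genetic code table by hand rather than relying on any library; it assumes the sequence is an in-frame coding sequence ending in a single stop codon. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

NUCLEOTIDE = """
ATGAAAATTCACAAATTTTGGTTGGCGATTCTCCTGGCTGTCACTTGGCCGGTGTCCCACAGTTGGGCGGAAGAAAATGT
TTCTGCTACAGAAGGGGTCGAGATAACTGTCAATATCAATACGGCACCAGCGGAAGAGTTGGCGACTTTACTCAAAGGTA
TCGGCCTGAAAAAAGCCCAAGCAATAGTTGACTATCGTGAGGCCAATGGTGCGTTCAAATCTAAAGAAGATCTGACACAA
GTGAAAGGCATTGGGCCAGCTATAGTTGCTCAGAATGATAAACGTATCTTGTTATAA
"""

PROTEIN = """
MKIHKFWLAILLAVTWPVSHSWAEENVSATEGVEITVNINTAPAEELATLLKGIGLKKAQAIVDYREANGAFKSKEDLTQ
VKGIGPAIVAQNDKRILL
"""


def translate(cds: str) -> str:
    """Translate an in-frame CDS and drop a single terminal stop codon, if present."""
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))
    return protein[:-1] if protein.endswith("*") else protein


translated = translate(NUCLEOTIDE)
expected = "".join(PROTEIN.split())

print(len(translated), translated == expected)  # 98 True
```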

Domains


Predicted by InterProScan.

(36-96)
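
The domain name did not survive on this page, but the residue range did. Assuming the range uses 1-based, inclusive residue positions (an assumption, since the source does not say), the corresponding subsequence can be extracted as in the sketch below. The helper name `residue_range` is illustrative.

```python
PROTEIN = (
    "MKIHKFWLAILLAVTWPVSHSWAEENVSATEGVEITVNINTAPAEELATLLKGIGLKKAQAIVDYREANGAFKSKEDLTQ"
    "VKGIGPAIVAQNDKRILL"
)


def residue_range(sequence: str, start: int, end: int) -> str:
    """Return residues start..end, with both positions 1-based and inclusive."""
    if not 1 <= start <= end <= len(sequence):
        raise ValueError("range falls outside the sequence")
    return sequence[start - 1:end]


domain = residue_range(PROTEIN, 36, 96)

print(len(domain))  # 61
print(domain)
```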


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB S7IAY2

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized with seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein   Organism   Identities (%)   Coverage (%)   Ha-value
  comEA   Vibrio cholerae C6706   63   100   0.643
  comEA   Vibrio cholerae strain A1552   63   100   0.643
  comEA   Vibrio parahaemolyticus RIMD 2210633   57.447   95.918   0.551
  comEA   Vibrio campbellii strain DS40M4   55.208   97.959   0.541
  comEA/comE1   Glaesserella parasuis strain SC1401   48.276   88.776   0.429
  comE1/comEA   Haemophilus influenzae Rd KW20   38.532   100   0.429
  comEA   Legionella pneumophila str. Paris   41.489   95.918   0.398
  comEA   Legionella pneumophila strain ERS1305867   41.489   95.918   0.398