Detailed information    

In silico (bioinformatically predicted)

Overview


Name: comEA
Type: Machinery gene
Locus tag: NAF16_RS09125
Genome accession: NZ_CP098021
Coordinates: 1941627..1941923 (-)
Length: 98 a.a.
NCBI ID: WP_020327823.1
UniProt ID: S7IAY2
Organism: Vibrio fluvialis strain V13
Function: dsDNA binding (predicted from homology); DNA binding and uptake

Genomic Context


Location: 1936627..1946923
Locus tag                    Gene name  Coordinates (strand)  Size (bp)  Protein ID      Product                                   Description
NAF16_RS09100 (NAF16_09095)  lapB       1936871..1938040 (-)  1170       WP_024374735.1  lipopolysaccharide assembly protein LapB  -
NAF16_RS09105 (NAF16_09100)  -          1938053..1938337 (-)  285        WP_020327818.1  LapA family protein                       -
NAF16_RS09110 (NAF16_09105)  ihfB       1938477..1938761 (-)  285        WP_020430001.1  integration host factor subunit beta      -
NAF16_RS09115 (NAF16_09110)  rpsA       1938992..1940662 (-)  1671       WP_020327821.1  30S ribosomal protein S1                  -
NAF16_RS09120 (NAF16_09115)  cmk        1940767..1941447 (-)  681        WP_024374736.1  (d)CMP kinase                             -
NAF16_RS09125 (NAF16_09120)  comEA      1941627..1941923 (-)  297        WP_020327823.1  ComEA family DNA-binding protein          Machinery gene
NAF16_RS09130 (NAF16_09125)  ppiD       1942062..1943921 (-)  1860       WP_323694679.1  peptidylprolyl isomerase                  -
NAF16_RS09135 (NAF16_09130)  -          1944077..1944349 (-)  273        WP_004726502.1  HU family DNA-binding protein             -
NAF16_RS09140 (NAF16_09135)  lon        1944544..1946895 (-)  2352       WP_020327825.1  endopeptidase La                          -
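
The Size (bp) column is consistent with inclusive coordinates (end - start + 1), and the displayed window 1936627..1946923 matches the comEA coordinates extended by 5,000 bp on each side. The following is a minimal Python sketch that checks both and reports the spacing between neighbouring genes. The rows are hard-coded from the table above, and the helper names `size_bp` and `intergenic_gaps` are hypothetical, chosen only for this illustration.

```python
# Rows copied from the Genomic Context table: (locus tag, start, end, size in bp).
GENES = [
    ("NAF16_RS09100", 1936871, 1938040, 1170),
    ("NAF16_RS09105", 1938053, 1938337, 285),
    ("NAF16_RS09110", 1938477, 1938761, 285),
    ("NAF16_RS09115", 1938992, 1940662, 1671),
    ("NAF16_RS09120", 1940767, 1941447, 681),
    ("NAF16_RS09125", 1941627, 1941923, 297),   # comEA
    ("NAF16_RS09130", 1942062, 1943921, 1860),
    ("NAF16_RS09135", 1944077, 1944349, 273),
    ("NAF16_RS09140", 1944544, 1946895, 2352),
]


def size_bp(start, end):
    """Length of a feature given inclusive, 1-based coordinates."""
    return end - start + 1


def intergenic_gaps(genes):
    """Number of bases between each gene and its next neighbour by position."""
    ordered = sorted(genes, key=lambda g: g[1])
    return {
        (a[0], b[0]): b[1] - a[2] - 1
        for a, b in zip(ordered, ordered[1:])
    }


# Every listed size should equal end - start + 1.
sizes_ok = all(size_bp(start, end) == size for _, start, end, size in GENES)

gaps = intergenic_gaps(GENES)

# Assumption of this sketch: the context window is comEA +/- 5,000 bp.
comea = next(g for g in GENES if g[0] == "NAF16_RS09125")
window = (comea[1] - 5000, comea[2] + 5000)

print(sizes_ok)
print(gaps[("NAF16_RS09120", "NAF16_RS09125")], gaps[("NAF16_RS09125", "NAF16_RS09130")])
print(window)
```

The gap values are simple coordinate differences; they say nothing about operon structure or co-transcription on their own.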

Sequence


Protein


Length: 98 a.a.; Molecular weight: 10694.42 Da; Isoelectric point: 7.2029

>NTDB_id=693551 NAF16_RS09125 WP_020327823.1 1941627..1941923(-) (comEA) [Vibrio fluvialis strain V13]
MKIHKFWLAILLAVTWPVSHSWAEENVSATEGVEITVNINTAPAEELATLLKGIGLKKAQAIVDYREANGAFKSKEDLTQ
VKGIGPAIVAQNDKRILL
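
The record does not say how the molecular weight was calculated. The sketch below assumes the common convention of summing average (not monoisotopic) amino acid residue masses and adding one water molecule for the free termini. The mass table and the function name `molecular_weight` are part of this illustration, not of the database.

```python
# Average residue masses in Da (amino acid minus one water).
# Assumption: the listed molecular weight uses average, not monoisotopic, masses.
RESIDUE_MASS = {
    "G": 57.0519, "A": 71.0788, "S": 87.0782, "P": 97.1167, "V": 99.1326,
    "T": 101.1051, "C": 103.1388, "L": 113.1594, "I": 113.1594, "N": 114.1038,
    "D": 115.0886, "Q": 128.1307, "K": 128.1741, "E": 129.1155, "M": 131.1926,
    "H": 137.1411, "F": 147.1766, "R": 156.1875, "Y": 163.1760, "W": 186.2132,
}
WATER = 18.01528  # one water for the free N- and C-termini

# Protein sequence copied from the FASTA record above.
PROTEIN = (
    "MKIHKFWLAILLAVTWPVSHSWAEENVSATEGVEITVNINTAPAEELATLLKGIGLKKAQAIVDYREANGAFKSKEDLTQ"
    "VKGIGPAIVAQNDKRILL"
)


def molecular_weight(seq):
    """Sum of average residue masses plus one water, in Da."""
    return sum(RESIDUE_MASS[aa] for aa in seq) + WATER


mw = molecular_weight(PROTEIN)
print(len(PROTEIN), round(mw, 2))
```

Under these assumptions the result agrees with the listed 10694.42 Da to within rounding. The value is for the full 98-residue precursor; no signal peptide cleavage or modification is taken into account.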

Nucleotide


Length: 297 bp

>NTDB_id=693551 NAF16_RS09125 WP_020327823.1 1941627..1941923(-) (comEA) [Vibrio fluvialis strain V13]
ATGAAAATTCACAAATTTTGGTTGGCGATTCTCCTGGCTGTCACTTGGCCAGTATCTCACAGTTGGGCGGAAGAAAATGT
TTCTGCTACAGAAGGGGTCGAGATAACTGTCAATATCAATACGGCACCAGCGGAAGAGTTGGCAACTTTACTCAAAGGTA
TCGGCCTGAAAAAAGCCCAAGCAATAGTTGACTATCGTGAGGCCAATGGTGCGTTCAAATCTAAAGAAGATCTGACGCAA
GTGAAAGGCATTGGGCCAGCTATCGTTGCTCAGAATGACAAACGTATCTTGTTATAA
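
The nucleotide and protein records can be cross-checked against each other: 297 bp is 99 codons, that is 98 residues plus a stop codon. The sketch below translates the coding sequence with the standard codon assignments (the bacterial table 11 uses the same assignments and differs only in permitted start codons). The names `CODON_TABLE` and `translate` are hypothetical helpers for this illustration.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; third position fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Sequences copied from the FASTA records above (as given, 5' to 3' of the coding strand).
CDS = (
    "ATGAAAATTCACAAATTTTGGTTGGCGATTCTCCTGGCTGTCACTTGGCCAGTATCTCACAGTTGGGCGGAAGAAAATGT"
    "TTCTGCTACAGAAGGGGTCGAGATAACTGTCAATATCAATACGGCACCAGCGGAAGAGTTGGCAACTTTACTCAAAGGTA"
    "TCGGCCTGAAAAAAGCCCAAGCAATAGTTGACTATCGTGAGGCCAATGGTGCGTTCAAATCTAAAGAAGATCTGACGCAA"
    "GTGAAAGGCATTGGGCCAGCTATCGTTGCTCAGAATGACAAACGTATCTTGTTATAA"
)
PROTEIN = (
    "MKIHKFWLAILLAVTWPVSHSWAEENVSATEGVEITVNINTAPAEELATLLKGIGLKKAQAIVDYREANGAFKSKEDLTQ"
    "VKGIGPAIVAQNDKRILL"
)


def translate(cds):
    """Translate a coding sequence codon by codon; '*' marks a stop codon."""
    codons = (cds[i:i + 3] for i in range(0, len(cds), 3))
    return "".join(CODON_TABLE[codon] for codon in codons)


translated = translate(CDS)

# Coordinates 1941627..1941923 are inclusive, so the span should equal the CDS length.
span = 1941923 - 1941627 + 1

print(len(CDS), span, len(translated))
print(translated[:-1] == PROTEIN, translated[-1])
```

The coordinates lie on the minus strand, but the FASTA record is already given in coding orientation, so no reverse complement is needed here.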

Domains


Predicted by InterProScan.

(36-96)


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source: AlphaFold DB
ID: S7IAY2

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.





Similar proteins


Only experimentally validated proteins are listed.

Protein      Organism                                  Identities (%)  Coverage (%)  Ha-value
comEA        Vibrio cholerae C6706                     63              100           0.643
comEA        Vibrio cholerae strain A1552              63              100           0.643
comEA        Vibrio parahaemolyticus RIMD 2210633      57.447          95.918        0.551
comEA        Vibrio campbellii strain DS40M4           55.208          97.959        0.541
comEA/comE1  Glaesserella parasuis strain SC1401       48.276          88.776        0.429
comE1/comEA  Haemophilus influenzae Rd KW20            38.532          100           0.429
comEA        Legionella pneumophila str. Paris         41.489          95.918        0.398
comEA        Legionella pneumophila strain ERS1305867  41.489          95.918        0.398