Detailed information    

insolico Bioinformatically predicted

Overview


Name   comEA   Type   Machinery gene
Locus tag   B5S57_RS15135 Genome accession   NZ_CP020534
Coordinates   2041669..2041968 (-) Length   99 a.a.
NCBI ID   WP_017047776.1    Uniprot ID   A0A1Y0NUT1
Organism   Vibrio anguillarum strain 425     
Function   dsDNA binding (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 2036669..2046968
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  B5S57_RS15110 (B5S57_14985) lapB 2037084..2038253 (-) 1170 WP_026028046.1 lipopolysaccharide assembly protein LapB -
  B5S57_RS15115 (B5S57_14990) - 2038273..2038548 (-) 276 WP_013857177.1 LapA family protein -
  B5S57_RS15120 (B5S57_14995) ihfB 2038690..2038974 (-) 285 WP_010317449.1 integration host factor subunit beta -
  B5S57_RS15125 (B5S57_15000) rpsA 2039051..2040721 (-) 1671 WP_013857179.1 30S ribosomal protein S1 -
  B5S57_RS15130 (B5S57_15005) cmk 2040825..2041505 (-) 681 WP_013857180.1 (d)CMP kinase -
  B5S57_RS15135 (B5S57_15010) comEA 2041669..2041968 (-) 300 WP_017047776.1 ComEA family DNA-binding protein Machinery gene
  B5S57_RS15140 (B5S57_15015) ppiD 2042105..2043970 (-) 1866 WP_013857182.1 peptidylprolyl isomerase -
  B5S57_RS15145 (B5S57_15020) - 2044128..2044400 (-) 273 WP_026027116.1 HU family DNA-binding protein -
  B5S57_RS15150 (B5S57_15025) lon 2044596..2046947 (-) 2352 WP_013857184.1 endopeptidase La -

Sequence


Protein


Download         Length: 99 a.a.        Molecular weight: 10804.74 Da        Isoelectric Point: 9.6909

>NTDB_id=223955 B5S57_RS15135 WP_017047776.1 2041669..2041968(-) (comEA) [Vibrio anguillarum strain 425]
MKLKTKLGLLLLSIVLPLSPTLAEEKVVETHQGIEITVNINQASAEEIATLLKGIGLKKAQAIVEYRQQNGDFKTKEDLS
LVKGIGAVTVRQNAERIIL

Nucleotide


Download         Length: 300 bp        

>NTDB_id=223955 B5S57_RS15135 WP_017047776.1 2041669..2041968(-) (comEA) [Vibrio anguillarum strain 425]
ATGAAATTGAAAACAAAACTCGGGTTGTTACTTCTAAGTATCGTATTGCCTCTTTCACCGACATTAGCAGAAGAAAAGGT
GGTAGAAACGCACCAAGGCATTGAAATTACGGTTAATATAAATCAGGCTTCCGCTGAGGAAATTGCTACTTTGTTGAAAG
GTATTGGACTTAAAAAAGCACAAGCGATTGTTGAATATCGTCAACAAAATGGCGATTTTAAAACCAAAGAAGATCTGAGC
TTGGTGAAAGGCATTGGTGCCGTTACTGTCAGGCAAAACGCTGAGCGTATCATTTTATAA

Domains


Predicted by InterproScan.

(37-97)


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB A0A1Y0NUT1

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comEA Vibrio cholerae C6706

59.223

100

0.616

  comEA Vibrio cholerae strain A1552

59.223

100

0.616

  comEA Vibrio campbellii strain DS40M4

58.163

98.99

0.576

  comEA Vibrio parahaemolyticus RIMD 2210633

58.889

90.909

0.535

  comE1/comEA Haemophilus influenzae Rd KW20

39.286

100

0.444

  comEA/comE1 Glaesserella parasuis strain SC1401

40

100

0.444

  comEA Legionella pneumophila str. Paris

36.735

98.99

0.364

  comEA Legionella pneumophila strain ERS1305867

36.735

98.99

0.364


Multiple sequence alignment