Detailed information    

insolico Bioinformatically predicted

Overview


Name   comEA   Type   Machinery gene
Locus tag   GGC04_RS04775 Genome accession   NZ_CP046065
Coordinates   1044818..1045117 (+) Length   99 a.a.
NCBI ID   WP_019277884.1    Uniprot ID   A0A1V0I6C0
Organism   Vibrio sp. THAF191d     
Function   dsDNA binding (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 1039818..1050117
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  GGC04_RS04760 (GGC04_04810) lon 1039829..1042180 (+) 2352 WP_021457152.1 endopeptidase La -
  GGC04_RS04765 (GGC04_04815) - 1042373..1042645 (+) 273 WP_004414742.1 HU family DNA-binding protein -
  GGC04_RS04770 (GGC04_04820) ppiD 1042819..1044678 (+) 1860 WP_006959344.1 peptidylprolyl isomerase -
  GGC04_RS04775 (GGC04_04825) comEA 1044818..1045117 (+) 300 WP_019277884.1 helix-hairpin-helix domain-containing protein Machinery gene
  GGC04_RS04780 (GGC04_04830) cmk 1045258..1045938 (+) 681 WP_006959340.1 (d)CMP kinase -
  GGC04_RS04785 (GGC04_04835) rpsA 1046042..1047712 (+) 1671 WP_006959338.1 30S ribosomal protein S1 -
  GGC04_RS04790 (GGC04_04840) ihfB 1047923..1048204 (+) 282 WP_006959336.1 integration host factor subunit beta -
  GGC04_RS04795 (GGC04_04845) - 1048343..1048624 (+) 282 WP_006959333.1 LapA family protein -
  GGC04_RS04800 (GGC04_04850) lapB 1048650..1049819 (+) 1170 WP_006959331.1 lipopolysaccharide assembly protein LapB -

Sequence


Protein


Download         Length: 99 a.a.        Molecular weight: 10588.39 Da        Isoelectric Point: 6.2913

>NTDB_id=400850 GGC04_RS04775 WP_019277884.1 1044818..1045117(+) (comEA) [Vibrio sp. THAF191d]
MLRILWTSLLMVLAVYSPTTVSAEKTTNSEYAGIEITVNVNTAEAAELAALLTGVGAKKAEAIVQYRDENGPFLKVTDLA
LVKGIGPALIDKNLSRIKL

Nucleotide


Download         Length: 300 bp        

>NTDB_id=400850 GGC04_RS04775 WP_019277884.1 1044818..1045117(+) (comEA) [Vibrio sp. THAF191d]
ATGCTACGCATACTTTGGACGAGTCTTTTAATGGTGCTCGCGGTATATAGCCCAACTACTGTATCGGCTGAAAAAACCAC
CAACTCAGAATACGCTGGAATAGAGATTACAGTGAATGTTAATACGGCTGAAGCGGCTGAATTAGCGGCGCTACTAACTG
GTGTTGGTGCGAAAAAAGCTGAGGCTATTGTTCAATATAGGGACGAAAACGGTCCCTTTTTGAAGGTGACTGATTTGGCT
TTGGTTAAAGGTATTGGCCCCGCTTTAATAGACAAAAACCTCAGTCGTATAAAGTTGTAA

Domains


Predicted by InterproScan.

(37-97)


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB A0A1V0I6C0

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comEA Vibrio parahaemolyticus RIMD 2210633

52.747

91.919

0.485

  comEA Vibrio campbellii strain DS40M4

51.648

91.919

0.475

  comEA Vibrio cholerae C6706

53.571

84.848

0.455

  comEA Vibrio cholerae strain A1552

53.571

84.848

0.455

  comE1/comEA Haemophilus influenzae Rd KW20

39.091

100

0.434

  comEA/comE1 Glaesserella parasuis strain SC1401

38.679

100

0.414


Multiple sequence alignment