Detailed information    

insolico Bioinformatically predicted

Overview


Name   comEA/comE1   Type   Machinery gene
Locus tag   F544_RS10100 Genome accession   NZ_CP006956
Coordinates   2187907..2188245 (-) Length   112 a.a.
NCBI ID   WP_025289901.1    Uniprot ID   W0R851
Organism   Bibersteinia trehalosi USDA-ARS-USMARC-190     
Function   dsDNA binding (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 2182907..2193245
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  F544_RS10065 (F544_20390) nlpD 2182920..2184128 (+) 1209 WP_025289896.1 murein hydrolase activator NlpD -
  F544_RS10070 (F544_20400) - 2184139..2184510 (+) 372 WP_025289897.1 hypothetical protein -
  F544_RS10075 (F544_20410) - 2184563..2185126 (+) 564 WP_025289898.1 DNA-3-methyladenine glycosylase I -
  F544_RS10080 (F544_20420) - 2185123..2185842 (+) 720 WP_025289899.1 4'-phosphopantetheinyl transferase family protein -
  F544_RS10085 (F544_20430) psiE 2185846..2186256 (+) 411 WP_015433468.1 phosphate-starvation-inducible protein PsiE -
  F544_RS10090 (F544_20440) nudF 2186271..2186882 (+) 612 WP_015433469.1 ADP-ribose diphosphatase -
  F544_RS10095 (F544_20450) asd 2186953..2187843 (-) 891 WP_025289900.1 archaetidylserine decarboxylase -
  F544_RS10100 (F544_20460) comEA/comE1 2187907..2188245 (-) 339 WP_025289901.1 ComEA family DNA-binding protein Machinery gene
  F544_RS10105 (F544_20470) - 2188504..2189868 (+) 1365 WP_025289902.1 LysM-like peptidoglycan-binding domain-containing protein -
  F544_RS10110 (F544_20480) - 2189924..2190325 (+) 402 WP_025289903.1 MliC family protein -
  F544_RS10115 (F544_20490) waaF 2190336..2191373 (+) 1038 WP_025289904.1 lipopolysaccharide heptosyltransferase II -
  F544_RS10120 (F544_20500) hemB 2191383..2192408 (+) 1026 WP_015433475.1 porphobilinogen synthase -
  F544_RS10125 (F544_20510) - 2192418..2192915 (+) 498 WP_015433476.1 YbhB/YbcL family Raf kinase inhibitor-like protein -

Sequence


Protein


Download         Length: 112 a.a.        Molecular weight: 12123.90 Da        Isoelectric Point: 6.9794

>NTDB_id=115257 F544_RS10100 WP_025289901.1 2187907..2188245(-) (comEA/comE1) [Bibersteinia trehalosi USDA-ARS-USMARC-190]
MKKYLRIALSSLVAFCTLSALAQTQGEATHTEPVAIEQVAQAQVQNVNLVNLNTATAAEIQDKLVGIGAKKAQAIVEYRE
KNGNFISLEQLTEVSGIGKATLDKNRDRLVLE

Nucleotide


Download         Length: 339 bp        

>NTDB_id=115257 F544_RS10100 WP_025289901.1 2187907..2188245(-) (comEA/comE1) [Bibersteinia trehalosi USDA-ARS-USMARC-190]
ATGAAAAAATATCTGCGTATTGCGCTTTCTTCCCTTGTGGCATTTTGTACGCTATCTGCTTTAGCACAAACGCAAGGTGA
AGCAACACATACTGAACCGGTTGCTATTGAGCAAGTTGCGCAGGCTCAGGTACAGAATGTCAATTTAGTTAATCTGAATA
CAGCAACGGCGGCTGAAATTCAAGATAAATTAGTTGGTATTGGTGCGAAAAAAGCGCAAGCGATAGTAGAATATCGCGAG
AAAAATGGTAATTTTATTAGCCTTGAACAGCTCACGGAAGTTTCTGGTATTGGTAAAGCAACACTAGATAAGAACCGCGA
TCGCCTTGTATTAGAGTAA

Domains


Predicted by InterproScan.

(46-109)


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB W0R851

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comEA/comE1 Glaesserella parasuis strain SC1401

61.062

100

0.616

  comE1/comEA Haemophilus influenzae Rd KW20

66.216

66.071

0.437

  comEA Vibrio parahaemolyticus RIMD 2210633

45

89.286

0.402


Multiple sequence alignment