Detailed information    

insolico Bioinformatically predicted

Overview


Name   htrA   Type   Regulator
Locus tag   I4Q40_RS01360 Genome accession   NZ_CP065211
Coordinates   274633..275922 (-) Length   429 a.a.
NCBI ID   WP_002290101.1    Uniprot ID   A0A828ZUK2
Organism   Enterococcus lactis strain KCTC 21015     
Function   repress competence development (predicted from homology)   
Competence regulation

Related MGE


Note: This gene co-localizes with putative mobile genetic elements (MGEs) in the genome predicted by VRprofile2, as detailed below.

Gene-MGE association summary

MGE type MGE coordinates Gene coordinates Relative position Distance (bp)
Prophage 276350..294214 274633..275922 flank 428


Gene organization within MGE regions


Location: 274633..294214
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  I4Q40_RS01360 (I4Q40_01360) htrA 274633..275922 (-) 1290 WP_002290101.1 S1C family serine protease Regulator
  I4Q40_RS01365 (I4Q40_01365) rlmH 276350..276829 (+) 480 WP_002315620.1 23S rRNA (pseudouridine(1915)-N(3))-methyltransferase RlmH -
  I4Q40_RS13170 - 276939..277364 (+) 426 WP_231401897.1 recombinase family protein -
  I4Q40_RS13175 - 277368..278084 (+) 717 WP_231401898.1 recombinase family protein -
  I4Q40_RS01375 (I4Q40_01375) - 278081..278503 (+) 423 WP_010721782.1 hypothetical protein -
  I4Q40_RS13180 - 279185..280564 (-) 1380 WP_231401899.1 hypothetical protein -
  I4Q40_RS01385 (I4Q40_01385) - 280717..281025 (+) 309 WP_002324315.1 DUF960 domain-containing protein -
  I4Q40_RS01390 (I4Q40_01390) - 281052..281351 (+) 300 WP_002324316.1 hypothetical protein -
  I4Q40_RS01395 (I4Q40_01395) - 281525..282076 (+) 552 WP_002296259.1 DUF1643 domain-containing protein -
  I4Q40_RS01400 (I4Q40_01400) - 282222..282515 (+) 294 WP_002326407.1 Mor transcription activator family protein -
  I4Q40_RS01405 (I4Q40_01405) - 282508..282804 (+) 297 WP_002336120.1 hypothetical protein -
  I4Q40_RS01410 (I4Q40_01410) - 282879..283769 (+) 891 WP_104661357.1 DUF4767 domain-containing protein -
  I4Q40_RS01415 (I4Q40_01415) - 283854..284983 (+) 1130 WP_104835207.1 IS3 family transposase -
  I4Q40_RS01420 (I4Q40_01420) fic 285736..286341 (+) 606 WP_156272355.1 protein adenylyltransferase Fic -
  I4Q40_RS01425 (I4Q40_01425) - 286637..287590 (+) 954 WP_156272353.1 IS30 family transposase -
  I4Q40_RS01430 (I4Q40_01430) - 287935..288972 (+) 1038 WP_129236504.1 hypothetical protein -
  I4Q40_RS01435 (I4Q40_01435) - 289120..290766 (+) 1647 WP_070770549.1 DUF927 domain-containing protein -
  I4Q40_RS01440 (I4Q40_01440) - 290945..291184 (+) 240 WP_070770548.1 hypothetical protein -
  I4Q40_RS01445 (I4Q40_01445) - 291200..292573 (+) 1374 WP_137264936.1 recombinase family protein -
  I4Q40_RS01450 (I4Q40_01450) - 292577..294214 (+) 1638 WP_137264935.1 recombinase family protein -

Sequence


Protein


Download         Length: 429 a.a.        Molecular weight: 44916.93 Da        Isoelectric Point: 4.5419

>NTDB_id=508088 I4Q40_RS01360 WP_002290101.1 274633..275922(-) (htrA) [Enterococcus lactis strain KCTC 21015]
MDRKNVTPKMKKNKNNSLWRKLGLGLVGGIVGGLLTAGIFYAAMGTGNNSSTATSGNQNSAGETVVENVKVNVDSDITKA
VDKVQDAVVSVINLQSQNQSSGFGGLFGQQEESSSSDDSNLEAYSEGSGVIYKKDGNTAYVVTNNHVVDGQQGLEVMMKD
GTKVKAELVGTDAYSDLAVLKINSDKVETVASFGDSSALKVGEPAIAIGSPLGSEYANSVTSGIISSLNRQVTSTNESNE
TVNINAIQTDAAINPGNSGGPLVNIEGQVIGINSSKIASTSESTSNVSVEGMGFAIPSNDVVNIINQLEKDGKVTRPALG
ITMVDLSAVSTQQQEQILKIPSSVTNGVIVTNVQTATPAEKAGLKQYDVITKIDDKEVSSGVELQSALYKKKVGDSVKIT
YYEGSKEKTTTVELTIDQSALKQNQNSEN

Nucleotide


Download         Length: 1290 bp        

>NTDB_id=508088 I4Q40_RS01360 WP_002290101.1 274633..275922(-) (htrA) [Enterococcus lactis strain KCTC 21015]
ATGGATAGAAAAAATGTGACACCAAAGATGAAAAAAAATAAAAATAATAGCCTCTGGCGTAAGTTGGGTCTGGGGCTTGT
TGGAGGGATCGTTGGAGGACTACTGACAGCTGGCATTTTTTATGCAGCGATGGGCACTGGCAACAATAGTTCTACTGCTA
CAAGCGGCAATCAAAATTCAGCCGGTGAAACAGTTGTCGAAAATGTCAAAGTCAATGTAGATTCTGATATCACCAAAGCG
GTAGATAAAGTTCAAGACGCTGTCGTTTCTGTTATCAACTTACAAAGTCAAAATCAATCAAGCGGTTTTGGCGGATTATT
CGGGCAGCAAGAAGAAAGTAGCAGCTCAGATGATAGTAATTTGGAAGCTTATAGTGAAGGAAGCGGCGTAATCTATAAAA
AAGACGGGAATACTGCTTATGTAGTCACAAACAACCACGTAGTAGATGGACAGCAAGGTTTGGAAGTCATGATGAAAGAC
GGGACGAAAGTAAAAGCTGAATTAGTTGGAACAGATGCTTATTCAGATTTAGCGGTGTTGAAGATCAATTCAGATAAAGT
CGAAACGGTTGCCTCTTTTGGTGACTCAAGCGCTCTGAAGGTTGGTGAACCTGCGATTGCGATTGGTTCCCCATTAGGAT
CTGAATATGCGAATTCTGTCACTTCAGGAATCATTTCTTCTTTGAATCGACAAGTAACAAGTACGAATGAATCGAACGAA
ACAGTCAATATCAATGCGATCCAAACAGATGCAGCGATCAACCCGGGAAATTCTGGTGGTCCTTTAGTGAATATCGAAGG
ACAAGTTATCGGTATCAACTCAAGTAAGATCGCAAGTACTTCTGAATCGACTTCTAATGTAAGCGTAGAAGGAATGGGAT
TTGCTATTCCAAGTAACGATGTAGTGAATATCATCAATCAACTGGAAAAAGACGGAAAAGTGACACGTCCAGCTCTAGGT
ATCACAATGGTCGACTTATCTGCCGTTTCTACACAGCAACAAGAACAAATATTGAAGATTCCTTCATCTGTAACTAACGG
TGTCATCGTGACAAATGTCCAAACAGCTACACCGGCTGAAAAAGCTGGATTGAAACAATATGATGTCATTACTAAGATTG
ATGACAAAGAAGTTTCTTCTGGCGTAGAACTTCAATCTGCTTTGTATAAGAAAAAAGTGGGAGATTCTGTCAAAATCACT
TACTATGAAGGCTCGAAAGAAAAAACAACTACTGTTGAATTGACGATCGATCAATCGGCATTAAAACAAAATCAAAATAG
TGAAAATTAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB A0A828ZUK2

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  htrA Streptococcus gordonii str. Challis substr. CH1

54.569

91.841

0.501

  htrA Streptococcus mutans UA159

60.169

82.517

0.496

  htrA Streptococcus mitis NCTC 12261

52.778

92.308

0.487

  htrA Streptococcus pneumoniae TIGR4

59.05

78.555

0.464

  htrA Streptococcus pneumoniae Rx1

59.05

78.555

0.464

  htrA Streptococcus pneumoniae D39

59.05

78.555

0.464

  htrA Streptococcus pneumoniae R6

59.05

78.555

0.464