Detailed information    

insolico Bioinformatically predicted

Overview


Name   comM   Type   Machinery gene
Locus tag   ACEN42_RS14500 Genome accession   NZ_OZ032133
Coordinates   3091960..3093504 (+) Length   514 a.a.
NCBI ID   WP_371880279.1    Uniprot ID   -
Organism   Caballeronia sp. S22 isolate ena-SAMPLE-TAB-09-04-2024-15:28:32:345-2572     
Function   ssDNA binding (predicted from homology)   
DNA processing

Genomic Context


Location: 3086960..3098504
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  ACEN42_RS14480 ltrA 3087437..3088804 (-) 1368 WP_371880278.1 group II intron reverse transcriptase/maturase -
  ACEN42_RS14485 - 3089313..3090788 (-) 1476 WP_187608679.1 ammonium transporter -
  ACEN42_RS14490 - 3090820..3091158 (-) 339 WP_014190273.1 P-II family nitrogen regulator -
  ACEN42_RS14495 - 3091604..3091870 (+) 267 WP_187608680.1 accessory factor UbiK family protein -
  ACEN42_RS14500 comM 3091960..3093504 (+) 1545 WP_371880279.1 YifB family Mg chelatase-like AAA ATPase Machinery gene
  ACEN42_RS14505 - 3093884..3094510 (-) 627 WP_187608781.1 hypothetical protein -
  ACEN42_RS14510 - 3094639..3095175 (+) 537 WP_061116845.1 peroxiredoxin -
  ACEN42_RS14515 - 3095234..3096568 (-) 1335 WP_371880280.1 sigma-54-dependent transcriptional regulator -

Sequence


Protein


Download         Length: 514 a.a.        Molecular weight: 54854.01 Da        Isoelectric Point: 8.0697

>NTDB_id=1163317 ACEN42_RS14500 WP_371880279.1 3091960..3093504(+) (comM) [Caballeronia sp. S22 isolate ena-SAMPLE-TAB-09-04-2024-15:28:32:345-2572]
MSLAVVRSRAPAPGRAPEVVVEVHLANGLPSFSIVGLPDLEVRESRERVRAALQNCGFDFPVRRITVNLAPADLPKESGR
FDLPIALGILAASGQIPAQSLEHREFAGELSLTGALRPMRGAFAMVCGAARHYASMRAPDTRIPEIYVPLASAAEAALVP
GIEVFGAADLPTLCAHLNDVPDARLRPIRAAPLSPTSTPAADLADVIGHPAARRALEVAAAGGHHLLMVGPPGAGKSMLA
ARLPGLLPPMNDDEALTSAAILSASSIGFTPQQWRRRPFRAPHHSSSAAALVGGRNPPQPGEITLAHLGVLFLDEFPEFD
RKVLETLREPLEVGQITISRAAQQADFPAACQLVAAMNPCPCGWRGDPSGRCRCSPDVAARYLRKLSGPLMDRIDIQIEL
PALSPAELSARVVERGESSRSVAARVAAARDIQMQRQGKINRDLDGREADEVCRPDSAGEALLRAAGERFGWSARAYYRV
LKVVRTIADLARADMPDASHVAEAVQYRRLLSVA

Nucleotide


Download         Length: 1545 bp        

>NTDB_id=1163317 ACEN42_RS14500 WP_371880279.1 3091960..3093504(+) (comM) [Caballeronia sp. S22 isolate ena-SAMPLE-TAB-09-04-2024-15:28:32:345-2572]
ATGTCGCTTGCCGTGGTACGCAGCCGCGCGCCCGCTCCTGGGCGCGCGCCCGAAGTTGTCGTCGAAGTCCATCTCGCCAA
CGGGTTGCCGTCGTTTTCAATCGTCGGACTGCCCGATCTCGAAGTCCGTGAAAGTCGCGAGCGCGTGCGCGCCGCGCTTC
AGAACTGCGGCTTCGACTTTCCCGTCCGCAGAATCACCGTCAATCTCGCCCCCGCCGATCTGCCGAAGGAAAGCGGTCGC
TTCGATCTGCCGATCGCGCTCGGCATACTTGCCGCGAGCGGTCAGATTCCGGCGCAATCGCTGGAGCATCGCGAGTTCGC
CGGCGAGCTGTCGCTGACCGGCGCGCTGCGTCCGATGCGCGGCGCGTTCGCGATGGTCTGCGGCGCGGCGCGTCATTACG
CATCGATGCGCGCTCCGGACACGCGCATCCCGGAAATCTACGTGCCGCTCGCGAGCGCGGCGGAGGCGGCGCTGGTGCCG
GGCATCGAAGTCTTCGGCGCGGCGGACTTGCCGACGCTCTGCGCGCATCTGAACGATGTCCCCGATGCGCGGCTCAGACC
GATTCGCGCCGCACCGCTTTCCCCGACAAGTACGCCAGCCGCCGATCTCGCCGACGTGATCGGCCATCCCGCTGCGCGCC
GCGCACTCGAAGTAGCGGCGGCTGGCGGGCATCACCTGTTGATGGTCGGGCCGCCCGGCGCGGGCAAATCGATGCTCGCC
GCGCGCCTGCCGGGCCTGCTTCCGCCGATGAACGACGACGAAGCGCTCACCTCCGCGGCAATCCTGTCGGCGAGTTCGAT
CGGCTTCACGCCACAGCAGTGGCGGCGGCGGCCATTTCGCGCGCCGCATCACTCGTCGAGCGCGGCCGCGCTCGTCGGCG
GGCGCAATCCGCCGCAGCCGGGCGAGATCACGCTCGCGCATCTCGGCGTTTTGTTTCTTGACGAATTTCCCGAGTTCGAT
CGCAAGGTGCTGGAAACCCTGCGCGAGCCGCTCGAAGTCGGGCAGATCACGATTTCGCGCGCCGCGCAGCAGGCGGATTT
TCCGGCCGCCTGCCAGCTCGTCGCCGCGATGAACCCGTGCCCGTGCGGCTGGCGCGGCGATCCGAGCGGACGTTGCCGCT
GCTCGCCCGATGTTGCCGCGCGTTATCTGCGCAAATTGTCGGGTCCGCTGATGGACCGCATCGATATTCAGATCGAACTG
CCGGCGCTTTCGCCCGCGGAATTGTCGGCGCGTGTGGTGGAGCGCGGCGAGTCGAGCCGCAGCGTCGCGGCGCGTGTCGC
GGCAGCGCGCGACATCCAGATGCAGCGTCAGGGCAAGATCAACCGCGATCTCGATGGCCGCGAAGCCGACGAAGTCTGCC
GCCCCGATTCCGCGGGCGAAGCGTTGCTGCGCGCGGCAGGCGAGCGTTTCGGCTGGTCGGCGCGGGCGTACTACCGCGTG
CTCAAAGTCGTGCGGACGATCGCCGATCTCGCCCGCGCCGATATGCCTGATGCGTCGCATGTCGCCGAAGCGGTTCAGTA
TCGGCGCCTGCTGTCGGTCGCGTGA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comM Vibrio cholerae strain A1552

49.312

99.027

0.488

  comM Glaesserella parasuis strain SC1401

48.438

99.611

0.482

  comM Haemophilus influenzae Rd KW20

47.86

100

0.479

  comM Vibrio campbellii strain DS40M4

48.228

98.833

0.477

  comM Legionella pneumophila str. Paris

46.004

99.805

0.459

  comM Legionella pneumophila strain ERS1305867

46.004

99.805

0.459

  RA0C_RS07335 Riemerella anatipestifer ATCC 11845 = DSM 15868

43.164

99.611

0.43


Multiple sequence alignment