Detailed information    

in silico   Bioinformatically predicted

Overview


Name   comM   Type   Machinery gene
Locus tag   NHAL_RS15910 Genome accession   NC_013960
Coordinates   3373479..3374996 (-) Length   505 a.a.
NCBI ID   WP_013034172.1    Uniprot ID   D5C0K0
Organism   Nitrosococcus halophilus Nc 4     
Function   required for natural transformation (predicted from homology)   
Unclear

Genomic Context


Location: 3368479..3379996
Locus tag   Gene name   Coordinates (strand)   Size (bp)   Protein ID   Product   Description
  NHAL_RS15895 (Nhal_3273)   -   3368531..3369190 (+)   660   WP_238985522.1   TIGR04211 family SH3 domain-containing protein   -
  NHAL_RS15900 (Nhal_3274)   ilvD   3369431..3371302 (-)   1872   WP_013034170.1   dihydroxy-acid dehydratase   -
  NHAL_RS15905 (Nhal_3275)   rep   3371486..3373492 (-)   2007   WP_013034171.1   DNA helicase Rep   -
  NHAL_RS15910 (Nhal_3276)   comM   3373479..3374996 (-)   1518   WP_013034172.1   YifB family Mg chelatase-like AAA ATPase   Machinery gene
  NHAL_RS15915 (Nhal_3277)   ubiK   3375042..3375359 (-)   318   WP_013034173.1   ubiquinone biosynthesis accessory factor UbiK   -
  NHAL_RS15920 (Nhal_3278)   -   3375468..3376262 (-)   795   WP_041355083.1   RNA-guided endonuclease InsQ/TnpB family protein   -
  NHAL_RS15925 (Nhal_3279)   -   3376560..3377798 (+)   1239   WP_013034175.1   porin   -
  NHAL_RS15930 (Nhal_3280)   -   3377865..3379124 (+)   1260   WP_041355870.1   ammonium transporter   -
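
The coordinates in this record can be cross-checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (the usual GenBank convention) and that the coding sequence ends in a single stop codon; the helper names `span_length` and `overlap_length` are illustrative only and are not part of any database tooling.

```python
# Coordinates copied from the record (assumed 1-based, inclusive).
COMM_START, COMM_END = 3373479, 3374996      # comM (NHAL_RS15910)
REP_START, REP_END = 3371486, 3373492        # rep (NHAL_RS15905)
WINDOW_START, WINDOW_END = 3368479, 3379996  # "Location" of the context view


def span_length(start: int, end: int) -> int:
    """Length in bp of an inclusive coordinate span."""
    return end - start + 1


def overlap_length(a_start: int, a_end: int, b_start: int, b_end: int) -> int:
    """Number of bp shared by two inclusive spans (0 if they do not overlap)."""
    return max(0, min(a_end, b_end) - max(a_start, b_start) + 1)


gene_bp = span_length(COMM_START, COMM_END)
codons = gene_bp // 3
protein_aa = codons - 1  # assumption: the last codon is a stop codon

left_flank = COMM_START - WINDOW_START
right_flank = WINDOW_END - COMM_END
rep_overlap = overlap_length(COMM_START, COMM_END, REP_START, REP_END)

print(f"comM length: {gene_bp} bp, {protein_aa} a.a.")
print(f"context flanks: {left_flank} bp / {right_flank} bp")
print(f"comM/rep overlap: {rep_overlap} bp")
```

Under these assumptions the gene spans 1518 bp and encodes 505 a.a., in agreement with the Overview; the context window extends 5000 bp on either side of comM, and the annotated comM and rep coordinates overlap by 14 bp.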

Sequence


Protein


Length: 505 a.a.        Molecular weight: 54641.76 Da        Isoelectric Point: 7.2764

>NTDB_id=36766 NHAL_RS15910 WP_013034172.1 3373479..3374996(-) (comM) [Nitrosococcus halophilus Nc 4]
MSLAIAYSRAQAGVDAPLVTVEVHLSNGLPAFSIVGLPETAVKESRDRVRGALLNCHFEFPARRITVNLAPADLPKEGGR
FDLAIALGILAASGQISPSVLKTYEFVSELALSGEVRGIRGVLAVALQAAKAGRTLVVAEENAPEAALVSTIEVLVASHL
LEVCQHLRGESLLAPFTGNSLEAVPVEEMDIADVRGQYHVKRALEVAAAGAHNLLMIGPPGTGKTMLASRLPGLLPGMTE
VEALDSATVQSISSQGFDFSRWRQRPFRAPHHTASAVALVGGGGQPRPGEISLAHHGVLFLDELPEFERRVLEVLREPLE
SGRIVISRAAQQVEFPACVQLVAAMNPCPCGYLGDPKGRCRCTMEQVQRYRARISGPLLDRIDIQIEVPPVPLKQLRSEP
EGTMETSRQIQVRVEAARQRQLARSGQPNSGLSNREVERTCRLGDKDYRLLDQALEQLGLSARAYHRILKLARTIADLEG
SEAICTPHLSEAIGYRRLDRPHGNP
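
For scripted use, the FASTA header of this record can be split into its fields. The following is a minimal sketch that assumes every header follows the layout seen in this entry (`>NTDB_id=… locus_tag protein_id start..end(strand) (gene) [organism]`); the pattern `HEADER_RE` and the function `parse_header` are hypothetical names, and other records may need a more permissive pattern.

```python
import re

# Pattern inferred from the single header shown in this record (an assumption).
HEADER_RE = re.compile(
    r"^>NTDB_id=(?P<ntdb_id>\d+)\s+"
    r"(?P<locus_tag>\S+)\s+"
    r"(?P<protein_id>\S+)\s+"
    r"(?P<start>\d+)\.\.(?P<end>\d+)\((?P<strand>[+-])\)\s+"
    r"\((?P<gene>[^)]*)\)\s+"
    r"\[(?P<organism>[^\]]*)\]\s*$"
)


def parse_header(line: str) -> dict:
    """Split a header line into named fields; numeric fields become ints."""
    match = HEADER_RE.match(line.strip())
    if match is None:
        raise ValueError(f"unrecognized header layout: {line!r}")
    fields = match.groupdict()
    for key in ("ntdb_id", "start", "end"):
        fields[key] = int(fields[key])
    return fields


header = (
    ">NTDB_id=36766 NHAL_RS15910 WP_013034172.1 "
    "3373479..3374996(-) (comM) [Nitrosococcus halophilus Nc 4]"
)
record = parse_header(header)
print(record["gene"], record["strand"], record["end"] - record["start"] + 1)
```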

Nucleotide


Length: 1518 bp

>NTDB_id=36766 NHAL_RS15910 WP_013034172.1 3373479..3374996(-) (comM) [Nitrosococcus halophilus Nc 4]
ATGTCGCTGGCGATTGCCTACAGCCGTGCTCAGGCAGGAGTTGATGCTCCCTTAGTGACCGTGGAGGTCCATCTTTCTAA
TGGGCTCCCTGCTTTCTCTATCGTGGGCTTGCCGGAAACTGCGGTTAAGGAGAGTAGAGATCGGGTACGGGGCGCGTTGC
TCAATTGCCACTTTGAGTTTCCGGCTCGCCGTATCACGGTAAATCTGGCGCCTGCGGATTTGCCCAAGGAAGGGGGGCGC
TTTGATTTGGCTATCGCCTTAGGCATTTTGGCCGCTTCGGGGCAGATCTCACCGTCGGTATTAAAAACCTATGAATTTGT
CAGCGAGCTTGCTTTAAGTGGCGAGGTCCGGGGCATCCGGGGAGTGTTGGCCGTCGCATTGCAAGCTGCCAAAGCGGGGC
GCACCCTGGTTGTTGCAGAGGAAAATGCCCCCGAAGCGGCCCTAGTATCTACGATTGAGGTGTTAGTGGCTTCCCACCTT
TTAGAGGTCTGTCAGCATCTCCGGGGAGAATCCTTACTGGCTCCCTTTACCGGAAATTCCCTCGAGGCGGTTCCTGTAGA
GGAGATGGATATTGCTGATGTTCGGGGTCAGTATCATGTCAAACGGGCGCTAGAGGTGGCAGCGGCTGGGGCTCATAATC
TCTTAATGATCGGGCCCCCGGGAACGGGCAAGACGATGCTGGCCAGTCGCCTGCCGGGTCTTTTGCCTGGGATGACCGAA
GTCGAAGCGCTGGACAGTGCCACTGTGCAATCCATCAGCAGCCAAGGCTTTGATTTTAGCCGCTGGCGTCAGCGTCCTTT
CCGAGCGCCCCATCATACCGCCTCCGCGGTGGCTTTAGTGGGGGGAGGCGGCCAGCCCCGGCCCGGGGAAATTTCCCTGG
CCCATCATGGGGTATTATTTCTCGATGAGCTGCCAGAGTTTGAGCGTCGTGTATTGGAGGTTCTCAGAGAGCCTTTGGAG
TCGGGCCGTATTGTGATTTCCCGAGCTGCCCAGCAGGTGGAGTTTCCCGCTTGTGTGCAGTTAGTTGCGGCCATGAATCC
CTGTCCCTGCGGCTATTTGGGAGATCCCAAGGGCCGTTGCCGGTGCACCATGGAACAAGTGCAGCGTTATCGGGCGCGGA
TTTCCGGGCCTTTGCTAGACCGCATCGATATACAAATCGAGGTGCCGCCTGTGCCCCTGAAGCAGTTACGGAGTGAACCT
GAAGGCACCATGGAAACTAGCCGCCAGATTCAAGTTCGGGTGGAAGCGGCGCGACAGCGTCAGTTAGCCCGCTCGGGACA
GCCTAACAGTGGGTTAAGCAATCGGGAGGTAGAACGCACTTGCCGCCTCGGTGATAAGGATTATCGTTTACTGGACCAGG
CTTTGGAGCAGCTGGGGCTTTCCGCCCGGGCCTACCACCGGATACTGAAGTTAGCCCGGACGATTGCCGATTTGGAAGGG
AGCGAGGCTATCTGCACCCCCCATCTTTCTGAGGCGATTGGTTATCGGCGATTAGACCGCCCCCATGGCAACCCTTAA
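
The nucleotide and protein sequences above can be checked against each other by translation. The sketch below uses the standard genetic code (the bacterial table 11 shares its codon assignments and differs only in permitted start codons) and assumes the nucleotide record is already given in the coding orientation, as the leading ATG suggests. The function `translate` is an illustrative helper, not the database's own method, and only the first 30 and last 24 bases of the record are used here for brevity.

```python
from itertools import product

# Standard genetic code in the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str, to_stop: bool = True) -> str:
    """Translate a coding-strand DNA string, reading frame starting at base 1."""
    dna = "".join(dna.split()).upper()  # drop line breaks and other whitespace
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")  # X for ambiguous codons
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


first_30 = "ATGTCGCTGGCGATTGCCTACAGCCGTGCT"  # start of the nucleotide record
last_24 = "GACCGCCCCCATGGCAACCCTTAA"         # end of the nucleotide record

print(translate(first_30))  # MSLAIAYSRA
print(translate(last_24))   # DRPHGNP
```

Both fragments reproduce the corresponding ends of the protein record (`MSLAIAYSRA…` and `…DRPHGNP`), with the final TAA acting as the stop codon.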


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB D5C0K0

Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein   Organism   Identities (%)   Coverage (%)   Ha-value
  comM   Haemophilus influenzae Rd KW20   57.312   100   0.574
  comM   Vibrio cholerae strain A1552   57.4   99.01   0.568
  comM   Vibrio campbellii strain DS40M4   56.4   99.01   0.558
  comM   Glaesserella parasuis strain SC1401   55.467   99.604   0.552
  comM   Legionella pneumophila str. Paris   50.495   100   0.505
  comM   Legionella pneumophila strain ERS1305867   50.495   100   0.505
  RA0C_RS07335   Riemerella anatipestifer ATCC 11845 = DSM 15868   46.654   100   0.469


Multiple sequence alignment