Detailed information    

insolico Bioinformatically predicted

Overview


Name   comM   Type   Machinery gene
Locus tag   R2K28_RS00180 Genome accession   NZ_OY734020
Coordinates   41564..43072 (-) Length   502 a.a.
NCBI ID   WP_316367426.1    Uniprot ID   -
Organism   Candidatus Thiodiazotropha sp. CDECU1 isolate 46184     
Function   require for natural transformation (predicted from homology)   
Unclear

Genomic Context


Location: 36564..48072
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  R2K28_RS00165 - 36646..37890 (-) 1245 WP_316367421.1 patatin-like phospholipase family protein -
  R2K28_RS00170 - 37905..39707 (-) 1803 WP_316367423.1 oleate hydratase -
  R2K28_RS00175 - 40345..41334 (+) 990 WP_316367425.1 GGDEF domain-containing protein -
  R2K28_RS00180 comM 41564..43072 (-) 1509 WP_316367426.1 YifB family Mg chelatase-like AAA ATPase Machinery gene
  R2K28_RS00185 - 43118..43474 (-) 357 WP_316367427.1 accessory factor UbiK family protein -
  R2K28_RS00190 - 43716..44393 (+) 678 WP_316367428.1 TorF family putative porin -
  R2K28_RS00195 - 44467..44805 (+) 339 WP_316367429.1 P-II family nitrogen regulator -
  R2K28_RS00200 - 44836..46098 (+) 1263 WP_316367430.1 ammonium transporter -
  R2K28_RS00205 - 46323..46796 (+) 474 WP_316367431.1 DUF4124 domain-containing protein -
  R2K28_RS00210 - 46810..47865 (-) 1056 WP_316367432.1 diguanylate cyclase -

Sequence


Protein


Download         Length: 502 a.a.        Molecular weight: 54789.58 Da        Isoelectric Point: 7.2351

>NTDB_id=1160743 R2K28_RS00180 WP_316367426.1 41564..43072(-) (comM) [Candidatus Thiodiazotropha sp. CDECU1 isolate 46184]
MSLAILYSRAQEGIQAPLVTVEVHLSNGLPGLSIVGLPEMAVRESKDRVRGALINSQFEFPARRITINLAPADLPKEGGR
FDLPIALGILAASNQLAADPLNHYEFTGELALSGEMRPISGILPVALTARDAGRSLILPQQNAEEAGLVSGLQCYPAKHL
LEVCSHINSVNQLEQFKGVRSPANVTKQQLDMADVYGQSHARRALEISAAGAHSLLYIGPPGTGKSMLASRLPGILPPMS
EEEALECAAIHSVANNRAFEPAQWRQRPYRAPHHTASAAALVGGGSNPKPGEISLAHCGVLFLDELPEFDRHTLEVLREP
LENGHITISRANRQVDYPSRFQMIAAMNPCPCGHLGDGSNRCHCTLDRITRYRNRISGPLLDRIDMHVEVPRQPLQINQE
SPTLEEPSDAIRRRVIDARDIQLERQGCTNQALQGVQIEQVAAPGKEGNALLHRAIEKLGLSMRAYHRILKVARTIADLE
ASPKVETAHISEAIGYRRLDRS

Nucleotide


Download         Length: 1509 bp        

>NTDB_id=1160743 R2K28_RS00180 WP_316367426.1 41564..43072(-) (comM) [Candidatus Thiodiazotropha sp. CDECU1 isolate 46184]
ATGTCACTCGCCATTCTCTATTCCCGGGCCCAAGAGGGCATCCAAGCGCCCCTGGTCACCGTCGAGGTCCACCTCTCCAA
CGGCCTGCCCGGCCTCTCCATCGTCGGCTTGCCCGAAATGGCGGTACGCGAGAGCAAGGACCGGGTCAGGGGTGCCCTGA
TCAACAGCCAGTTTGAATTTCCCGCCCGCCGCATAACCATCAACCTGGCGCCCGCCGATCTGCCGAAAGAGGGGGGGCGA
TTCGACCTTCCCATCGCCCTCGGCATCCTTGCCGCATCGAATCAACTGGCGGCGGACCCATTGAACCACTACGAATTCAC
CGGCGAGCTTGCCCTGTCCGGTGAAATGCGCCCGATCAGCGGGATTCTTCCGGTGGCGCTCACAGCCCGTGATGCGGGAC
GCTCCCTCATCCTGCCGCAACAGAATGCCGAGGAGGCGGGTCTGGTGAGCGGGCTCCAATGCTACCCGGCAAAACACCTG
CTCGAGGTCTGTTCGCACATCAATAGCGTCAACCAGCTGGAACAGTTCAAAGGCGTTCGATCACCAGCGAATGTAACAAA
GCAGCAGCTCGATATGGCCGACGTCTATGGTCAGAGCCACGCCCGGCGCGCCTTGGAGATCAGTGCCGCGGGGGCCCACT
CTCTGCTCTATATCGGCCCCCCCGGCACCGGCAAGTCGATGCTCGCCTCCCGCCTGCCCGGGATACTCCCGCCCATGAGC
GAGGAGGAGGCCCTGGAGTGTGCCGCCATCCACTCGGTGGCCAACAACCGGGCATTCGAGCCTGCCCAGTGGCGTCAAAG
ACCCTATCGCGCACCTCACCACACGGCATCGGCAGCCGCCCTGGTGGGCGGTGGCAGTAATCCGAAGCCGGGGGAGATCT
CCCTGGCCCATTGCGGGGTGCTGTTCCTGGACGAATTGCCTGAGTTCGACCGGCATACCCTGGAGGTGTTGCGCGAACCC
CTGGAGAACGGCCATATCACCATCTCCCGGGCCAATCGCCAGGTCGACTACCCATCCCGCTTTCAGATGATAGCGGCCAT
GAATCCCTGCCCCTGCGGCCACCTGGGGGACGGCAGCAACCGCTGTCACTGCACCCTGGACCGCATAACCCGCTACCGCA
ACCGCATCTCCGGCCCCCTGCTGGATCGCATCGACATGCATGTGGAAGTGCCCCGGCAGCCCCTGCAGATCAACCAGGAA
TCACCCACCCTTGAAGAACCGAGCGATGCCATTCGGCGCCGGGTTATAGATGCCCGTGATATCCAATTAGAACGACAAGG
CTGCACCAACCAGGCACTGCAGGGCGTGCAGATCGAACAGGTGGCCGCCCCGGGGAAGGAGGGTAATGCACTACTGCATC
GGGCCATCGAAAAACTCGGCCTCTCGATGCGGGCCTACCACCGGATATTGAAAGTGGCGCGTACCATCGCCGATCTGGAG
GCGAGCCCGAAGGTGGAGACTGCGCATATCAGCGAGGCGATTGGGTATCGGCGTTTGGACAGGAGTTAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comM Haemophilus influenzae Rd KW20

53.968

100

0.542

  comM Vibrio campbellii strain DS40M4

54.092

99.801

0.54

  comM Vibrio cholerae strain A1552

54.092

99.801

0.54

  comM Glaesserella parasuis strain SC1401

53.346

100

0.54

  comM Legionella pneumophila str. Paris

51.503

99.402

0.512

  comM Legionella pneumophila strain ERS1305867

51.503

99.402

0.512

  RA0C_RS07335 Riemerella anatipestifer ATCC 11845 = DSM 15868

47.173

100

0.482


Multiple sequence alignment