Detailed information    

insolico Bioinformatically predicted

Overview


Name   comM   Type   Machinery gene
Locus tag   A6J38_RS04435 Genome accession   NZ_CP020411
Coordinates   904476..906005 (+) Length   509 a.a.
NCBI ID   WP_005693440.1    Uniprot ID   -
Organism   Haemophilus influenzae strain FDAARGOS_199     
Function   require for natural transformation (predicted from homology)   
Unclear

Genomic Context


Location: 899476..911005
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  A6J38_RS04410 (A6J38_04415) oppC 899822..900757 (+) 936 WP_005651799.1 oligopeptide ABC transporter permease OppC -
  A6J38_RS04415 (A6J38_04420) - 900767..901738 (+) 972 WP_005651797.1 ABC transporter ATP-binding protein -
  A6J38_RS04420 (A6J38_04425) oppF 901735..902733 (+) 999 WP_005657282.1 murein tripeptide/oligopeptide ABC transporter ATP binding protein OppF -
  A6J38_RS04425 (A6J38_04430) - 902773..903639 (-) 867 WP_005651791.1 VirK/YbjX family protein -
  A6J38_RS04430 (A6J38_04435) yihA 903744..904361 (+) 618 WP_005651790.1 ribosome biogenesis GTP-binding protein YihA/YsxC -
  A6J38_RS04435 (A6J38_04440) comM 904476..906005 (+) 1530 WP_005693440.1 YifB family Mg chelatase-like AAA ATPase Machinery gene
  A6J38_RS04440 (A6J38_04445) deoC 906039..906710 (-) 672 WP_005693439.1 deoxyribose-phosphate aldolase -
  A6J38_RS04445 (A6J38_04450) - 906821..907324 (+) 504 WP_005693438.1 protein disulfide oxidoreductase -
  A6J38_RS04450 (A6J38_04455) rfaD 907377..908303 (+) 927 WP_005632797.1 ADP-glyceromanno-heptose 6-epimerase -
  A6J38_RS04455 (A6J38_04460) xylB 908354..909835 (-) 1482 WP_032828343.1 xylulokinase -

Sequence


Protein


Download         Length: 509 a.a.        Molecular weight: 55871.35 Da        Isoelectric Point: 9.7683

>NTDB_id=222388 A6J38_RS04435 WP_005693440.1 904476..906005(+) (comM) [Haemophilus influenzae strain FDAARGOS_199]
MSLAIVYSRASMGVQAPLVTIEVHLSNGKPGFTLVGLPEKTVKEAQDRVRSALMNAQFKYPAKRITVNLAPADLPKEGGR
FDLPIAIGILAASDQLDASHLKQFEFVAELALTGQLRGVHGVIPAILAAQKSKRELIIAKQNANEASLVSDQNTYFAQTL
LDVVQFLNGQEKLPLATEIVKESAVNFSGKNTLDLTDIIGQQHAKRALTIAAAGQHNLLFLGPPGTGKTMLASRLTGLLP
EMTDLEAIETASVTSLVQNELNFHNWKQRPFRAPHHSASMPALVGGGTIPKPGEISLATNGVLFLDELPEFERKVLDALR
QPLESGEIIISRANAKIQFPARFQLVAAMNPSPTGHYTGTHNRTSPQQIMRYLNRLSGPFLDRFDLSIEVPLLPQGSLQN
TGDRGETSAQVREKVLKVREIQMERAGKINAYLNSKEIERDCKLNDKDAFFLEKALNKLGLSVRAYHRILKVSRTIADLQ
GEQQISQPHLAEALGYRAMDRLLQKLSNM

Nucleotide


Download         Length: 1530 bp        

>NTDB_id=222388 A6J38_RS04435 WP_005693440.1 904476..906005(+) (comM) [Haemophilus influenzae strain FDAARGOS_199]
ATGTCCCTTGCTATTGTTTACAGCCGTGCCTCTATGGGCGTGCAAGCGCCGCTTGTCACCATTGAGGTGCATTTAAGCAA
CGGAAAACCAGGATTTACACTTGTTGGTTTGCCAGAAAAAACCGTGAAAGAGGCACAAGATAGGGTGCGTAGTGCGTTGA
TGAATGCACAATTTAAATACCCAGCCAAGCGTATCACTGTAAATCTTGCACCTGCGGATTTACCGAAAGAAGGCGGAAGA
TTTGATTTGCCTATCGCCATCGGAATTTTAGCCGCATCAGATCAGCTTGATGCGAGCCACTTAAAGCAATTTGAATTTGT
GGCAGAGCTTGCGCTGACGGGCCAATTGCGTGGCGTACATGGTGTGATTCCCGCTATTCTTGCAGCGCAAAAGTCAAAGC
GAGAGTTAATTATTGCGAAACAAAATGCTAATGAAGCCTCGCTCGTGTCTGATCAAAATACTTATTTTGCTCAAACGCTT
TTAGATGTAGTGCAATTTCTCAATGGTCAGGAAAAATTACCACTCGCCACTGAAATTGTGAAAGAAAGTGCGGTAAATTT
TTCAGGTAAAAATACGTTAGATTTAACGGATATTATCGGACAACAGCACGCTAAACGAGCATTGACCATTGCCGCAGCGG
GGCAGCATAATTTGCTCTTTCTTGGCCCACCGGGTACAGGGAAAACCATGTTAGCCAGCCGTTTGACAGGGCTTTTACCT
GAAATGACAGATTTAGAAGCGATAGAAACGGCATCTGTAACGAGTTTAGTTCAAAACGAGTTAAATTTTCATAATTGGAA
GCAACGTCCTTTTCGTGCACCACATCATAGTGCATCAATGCCTGCTTTAGTTGGTGGCGGAACGATCCCTAAACCTGGCG
AAATATCCTTAGCAACAAATGGGGTACTTTTTCTTGATGAACTTCCAGAATTTGAGCGAAAAGTGTTAGATGCACTACGT
CAGCCTTTGGAAAGTGGTGAGATTATTATTTCTCGTGCTAATGCCAAAATTCAATTTCCGGCTCGTTTTCAATTAGTGGC
AGCAATGAATCCAAGTCCGACAGGCCATTATACAGGAACACATAACCGCACTTCACCACAGCAAATTATGCGTTATTTGA
ATCGACTTTCAGGGCCATTTTTAGATCGTTTTGATTTATCTATTGAAGTGCCTTTACTGCCACAAGGTAGCTTGCAAAAT
ACGGGCGATCGTGGCGAAACCAGCGCGCAAGTTCGTGAAAAAGTGTTAAAAGTACGTGAGATTCAAATGGAAAGAGCGGG
GAAAATCAACGCTTATTTGAACAGTAAAGAGATTGAGCGTGATTGCAAGTTAAATGATAAAGATGCCTTTTTCCTTGAAA
AAGCACTGAATAAACTGGGGCTTTCTGTTCGGGCTTATCATCGTATTTTGAAAGTTTCTCGAACCATTGCCGATCTACAA
GGAGAACAACAAATTTCTCAACCGCACTTGGCGGAGGCCTTGGGCTATCGAGCAATGGATCGTTTGCTGCAGAAATTGTC
GAATATGTAA


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comM Haemophilus influenzae Rd KW20

99.607

100

0.996

  comM Glaesserella parasuis strain SC1401

79.093

99.607

0.788

  comM Vibrio cholerae strain A1552

67.456

99.607

0.672

  comM Vibrio campbellii strain DS40M4

66.075

99.607

0.658

  comM Legionella pneumophila str. Paris

51.093

98.821

0.505

  comM Legionella pneumophila strain ERS1305867

51.093

98.821

0.505

  RA0C_RS07335 Riemerella anatipestifer ATCC 11845 = DSM 15868

46.99

100

0.475


Multiple sequence alignment