Detailed information
Overview
| Name | comM | Type | Machinery gene |
| Locus tag | AzCIB_RS21760 | Genome accession | NZ_CP011072 |
| Coordinates | 4848183..4849682 (-) | Length | 499 a.a. |
| NCBI ID | WP_050417818.1 | Uniprot ID | - |
| Organism | Azoarcus sp. CIB | ||
| Function | require for natural transformation (predicted from homology) Unclear |
||
Genomic Context
Location: 4843183..4854682
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| AzCIB_RS21740 (AzCIB_4349) | - | 4843405..4844349 (-) | 945 | WP_050417814.1 | peptidase U32 family protein | - |
| AzCIB_RS21745 (AzCIB_4350) | - | 4844365..4845357 (-) | 993 | WP_050417815.1 | peptidase U32 family protein | - |
| AzCIB_RS21750 (AzCIB_4351) | - | 4845371..4846738 (-) | 1368 | WP_050417816.1 | aminotransferase class I/II-fold pyridoxal phosphate-dependent enzyme | - |
| AzCIB_RS21755 (AzCIB_4352) | - | 4846925..4848118 (-) | 1194 | WP_050417817.1 | multidrug effflux MFS transporter | - |
| AzCIB_RS21760 (AzCIB_4353) | comM | 4848183..4849682 (-) | 1500 | WP_050417818.1 | YifB family Mg chelatase-like AAA ATPase | Machinery gene |
| AzCIB_RS21765 (AzCIB_4354) | - | 4849795..4850058 (-) | 264 | WP_050417819.1 | accessory factor UbiK family protein | - |
| AzCIB_RS21770 (AzCIB_4355) | - | 4850388..4851131 (+) | 744 | WP_050417820.1 | TorF family putative porin | - |
| AzCIB_RS21775 (AzCIB_4356) | glnK | 4851182..4851520 (+) | 339 | WP_050417821.1 | P-II family nitrogen regulator | - |
| AzCIB_RS21780 (AzCIB_4357) | - | 4851532..4852989 (+) | 1458 | WP_050417822.1 | ammonium transporter | - |
| AzCIB_RS21785 (AzCIB_4358) | - | 4853168..4853614 (+) | 447 | WP_050417823.1 | rhodanese-like domain-containing protein | - |
Sequence
Protein
Download Length: 499 a.a. Molecular weight: 53009.62 Da Isoelectric Point: 8.8417
>NTDB_id=142041 AzCIB_RS21760 WP_050417818.1 4848183..4849682(-) (comM) [Azoarcus sp. CIB]
MSLALVRTRALAGMGAPEVTVEVHLANGLPAFNLVGLPDTEVREARERVRAAIATSQFEFPQRRITVNLAPADLPKEGGR
FDLPIALGILVASGQVDAAGLAKHEFVGELSLDGSLRAVRGGLAMALESGRTGRALVLPAANADEAALARGASVLPAASL
LAVCAHLNSHTALTRRLPPAVAEGRDDEADLAEVKGQLQARRALEVAAAGQHSLLMFGPPGTGKSMLARRLPGLLPPLDE
AEAIECASIQSLEGGFDARRWARRPYRAPHHSASAPALVGGGTTPRPGEISLAHRGVLFLDELPEFDRRVLEALREPLET
GSVTVSRARQRTEFPARFQLVAAMNPCPCGHAGDKGGRCRCTPDQVARYRGRLSGPLLDRMDIVIEVPLLDHAEMTGQAA
GEPSATVRERVAQAWAVQRGRQGGANSHLAPGRVDTLCAPDDQGKALLDHAIRRLNLSARGYHRILKVARTIADLAGAER
VGPAHLAEAIQYRRGLDSR
MSLALVRTRALAGMGAPEVTVEVHLANGLPAFNLVGLPDTEVREARERVRAAIATSQFEFPQRRITVNLAPADLPKEGGR
FDLPIALGILVASGQVDAAGLAKHEFVGELSLDGSLRAVRGGLAMALESGRTGRALVLPAANADEAALARGASVLPAASL
LAVCAHLNSHTALTRRLPPAVAEGRDDEADLAEVKGQLQARRALEVAAAGQHSLLMFGPPGTGKSMLARRLPGLLPPLDE
AEAIECASIQSLEGGFDARRWARRPYRAPHHSASAPALVGGGTTPRPGEISLAHRGVLFLDELPEFDRRVLEALREPLET
GSVTVSRARQRTEFPARFQLVAAMNPCPCGHAGDKGGRCRCTPDQVARYRGRLSGPLLDRMDIVIEVPLLDHAEMTGQAA
GEPSATVRERVAQAWAVQRGRQGGANSHLAPGRVDTLCAPDDQGKALLDHAIRRLNLSARGYHRILKVARTIADLAGAER
VGPAHLAEAIQYRRGLDSR
Nucleotide
Download Length: 1500 bp
>NTDB_id=142041 AzCIB_RS21760 WP_050417818.1 4848183..4849682(-) (comM) [Azoarcus sp. CIB]
ATGTCGCTGGCTCTCGTACGCACCAGGGCGCTCGCAGGCATGGGCGCGCCCGAGGTGACGGTGGAAGTCCATCTCGCCAA
CGGCCTGCCCGCATTCAACCTGGTCGGCCTGCCCGACACCGAAGTCCGCGAGGCGCGCGAACGCGTGCGCGCCGCGATCG
CGACCTCGCAGTTCGAATTCCCGCAACGCCGCATCACCGTCAATCTCGCTCCCGCGGACCTCCCCAAGGAGGGCGGGCGC
TTCGACCTGCCGATCGCGCTGGGGATCCTTGTGGCATCGGGGCAGGTGGATGCGGCGGGGCTCGCGAAGCACGAGTTCGT
CGGCGAACTCTCGCTCGACGGCAGCCTGCGCGCGGTGCGCGGCGGCCTCGCGATGGCGCTGGAGAGCGGTCGCACCGGTC
GCGCGCTGGTGCTGCCGGCCGCCAACGCCGACGAGGCGGCGCTCGCGCGCGGCGCTAGCGTACTGCCCGCCGCCAGCCTG
CTTGCCGTGTGCGCACACCTCAACAGCCACACGGCGCTCACTCGGCGCCTGCCGCCGGCGGTCGCCGAGGGGCGGGACGA
CGAAGCGGATCTGGCCGAGGTGAAGGGGCAGCTGCAGGCCCGCCGCGCGCTCGAGGTCGCGGCTGCCGGGCAGCATTCGC
TGCTGATGTTCGGTCCGCCGGGGACCGGCAAGTCCATGCTCGCGCGCCGCCTTCCGGGCCTGCTGCCGCCGCTCGACGAG
GCCGAGGCGATCGAGTGTGCGTCGATCCAGTCGCTCGAAGGCGGTTTCGACGCCCGCCGCTGGGCCCGTCGGCCGTACCG
CGCGCCGCACCACTCGGCCTCTGCGCCGGCGCTCGTCGGCGGCGGCACGACACCGCGGCCCGGCGAAATCAGCCTTGCGC
ACCGCGGCGTGCTCTTCCTCGACGAGCTTCCGGAGTTCGACCGCCGCGTCCTCGAGGCGCTGCGCGAGCCGCTGGAAACC
GGCTCCGTGACGGTGTCGCGCGCACGCCAGCGCACCGAGTTCCCGGCGCGCTTCCAGCTCGTCGCGGCGATGAATCCCTG
TCCCTGCGGCCACGCCGGCGACAAGGGCGGACGCTGCCGCTGCACGCCGGATCAGGTCGCGCGTTATCGCGGACGCCTGT
CGGGGCCGCTGCTCGACCGTATGGACATCGTCATCGAGGTGCCGCTGCTCGACCATGCCGAGATGACCGGGCAGGCGGCG
GGCGAACCCAGCGCGACCGTGCGCGAGCGCGTCGCGCAGGCCTGGGCGGTGCAGCGCGGGCGCCAGGGCGGGGCGAACAG
CCATCTCGCGCCGGGCCGCGTCGATACGCTGTGCGCGCCCGACGACCAGGGCAAGGCGCTGCTCGATCATGCGATCCGGC
GGCTGAATCTGTCCGCACGCGGGTATCATCGCATCCTGAAGGTCGCCCGCACGATCGCGGACCTGGCCGGCGCCGAGCGC
GTCGGTCCCGCGCATCTGGCCGAGGCGATCCAGTACCGGCGCGGCCTCGACTCGCGCTGA
ATGTCGCTGGCTCTCGTACGCACCAGGGCGCTCGCAGGCATGGGCGCGCCCGAGGTGACGGTGGAAGTCCATCTCGCCAA
CGGCCTGCCCGCATTCAACCTGGTCGGCCTGCCCGACACCGAAGTCCGCGAGGCGCGCGAACGCGTGCGCGCCGCGATCG
CGACCTCGCAGTTCGAATTCCCGCAACGCCGCATCACCGTCAATCTCGCTCCCGCGGACCTCCCCAAGGAGGGCGGGCGC
TTCGACCTGCCGATCGCGCTGGGGATCCTTGTGGCATCGGGGCAGGTGGATGCGGCGGGGCTCGCGAAGCACGAGTTCGT
CGGCGAACTCTCGCTCGACGGCAGCCTGCGCGCGGTGCGCGGCGGCCTCGCGATGGCGCTGGAGAGCGGTCGCACCGGTC
GCGCGCTGGTGCTGCCGGCCGCCAACGCCGACGAGGCGGCGCTCGCGCGCGGCGCTAGCGTACTGCCCGCCGCCAGCCTG
CTTGCCGTGTGCGCACACCTCAACAGCCACACGGCGCTCACTCGGCGCCTGCCGCCGGCGGTCGCCGAGGGGCGGGACGA
CGAAGCGGATCTGGCCGAGGTGAAGGGGCAGCTGCAGGCCCGCCGCGCGCTCGAGGTCGCGGCTGCCGGGCAGCATTCGC
TGCTGATGTTCGGTCCGCCGGGGACCGGCAAGTCCATGCTCGCGCGCCGCCTTCCGGGCCTGCTGCCGCCGCTCGACGAG
GCCGAGGCGATCGAGTGTGCGTCGATCCAGTCGCTCGAAGGCGGTTTCGACGCCCGCCGCTGGGCCCGTCGGCCGTACCG
CGCGCCGCACCACTCGGCCTCTGCGCCGGCGCTCGTCGGCGGCGGCACGACACCGCGGCCCGGCGAAATCAGCCTTGCGC
ACCGCGGCGTGCTCTTCCTCGACGAGCTTCCGGAGTTCGACCGCCGCGTCCTCGAGGCGCTGCGCGAGCCGCTGGAAACC
GGCTCCGTGACGGTGTCGCGCGCACGCCAGCGCACCGAGTTCCCGGCGCGCTTCCAGCTCGTCGCGGCGATGAATCCCTG
TCCCTGCGGCCACGCCGGCGACAAGGGCGGACGCTGCCGCTGCACGCCGGATCAGGTCGCGCGTTATCGCGGACGCCTGT
CGGGGCCGCTGCTCGACCGTATGGACATCGTCATCGAGGTGCCGCTGCTCGACCATGCCGAGATGACCGGGCAGGCGGCG
GGCGAACCCAGCGCGACCGTGCGCGAGCGCGTCGCGCAGGCCTGGGCGGTGCAGCGCGGGCGCCAGGGCGGGGCGAACAG
CCATCTCGCGCCGGGCCGCGTCGATACGCTGTGCGCGCCCGACGACCAGGGCAAGGCGCTGCTCGATCATGCGATCCGGC
GGCTGAATCTGTCCGCACGCGGGTATCATCGCATCCTGAAGGTCGCCCGCACGATCGCGGACCTGGCCGGCGCCGAGCGC
GTCGGTCCCGCGCATCTGGCCGAGGCGATCCAGTACCGGCGCGGCCTCGACTCGCGCTGA
3D structure
| Source | ID | Structure |
|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comM | Haemophilus influenzae Rd KW20 |
52.883 |
100 |
0.533 |
| comM | Vibrio campbellii strain DS40M4 |
53.333 |
99.198 |
0.529 |
| comM | Vibrio cholerae strain A1552 |
52.929 |
99.198 |
0.525 |
| comM | Glaesserella parasuis strain SC1401 |
50.902 |
100 |
0.509 |
| comM | Legionella pneumophila str. Paris |
48.697 |
100 |
0.487 |
| comM | Legionella pneumophila strain ERS1305867 |
48.697 |
100 |
0.487 |
| RA0C_RS07335 | Riemerella anatipestifer ATCC 11845 = DSM 15868 |
43.281 |
100 |
0.439 |