Detailed information
Overview
| Name | comM | Type | Machinery gene |
| Locus tag | GCD22_RS14510 | Genome accession | NZ_CP045571 |
| Coordinates | 2761837..2763348 (+) | Length | 503 a.a. |
| NCBI ID | WP_081577707.1 | Uniprot ID | - |
| Organism | Acidithiobacillus thiooxidans ATCC 19377 | ||
| Function | DNA uptake (predicted from homology) DNA binding and uptake |
||
Genomic Context
Location: 2756837..2768348
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| GCD22_RS14480 (GCD22_03093) | - | 2756982..2757371 (-) | 390 | WP_010637778.1 | SCP2 sterol-binding domain-containing protein | - |
| GCD22_RS14485 (GCD22_03094) | - | 2757456..2758202 (-) | 747 | WP_031568745.1 | tRNA threonylcarbamoyladenosine dehydratase | - |
| GCD22_RS14490 (GCD22_03095) | - | 2758203..2759012 (-) | 810 | WP_031568743.1 | TatD family hydrolase | - |
| GCD22_RS14495 (GCD22_03096) | - | 2759153..2760382 (+) | 1230 | WP_031568741.1 | FAD-dependent monooxygenase | - |
| GCD22_RS14500 (GCD22_03097) | - | 2760372..2761559 (+) | 1188 | WP_031568739.1 | FAD-dependent monooxygenase | - |
| GCD22_RS14505 (GCD22_03098) | - | 2761570..2761833 (+) | 264 | WP_010637786.1 | accessory factor UbiK family protein | - |
| GCD22_RS14510 (GCD22_03099) | comM | 2761837..2763348 (+) | 1512 | WP_081577707.1 | YifB family Mg chelatase-like AAA ATPase | Machinery gene |
| GCD22_RS14515 (GCD22_03101) | - | 2763535..2763978 (-) | 444 | WP_031574014.1 | SEL1-like repeat protein | - |
| GCD22_RS14520 (GCD22_03102) | - | 2764006..2765628 (-) | 1623 | WP_051690695.1 | peptide chain release factor 3 | - |
| GCD22_RS14525 (GCD22_03103) | rimI | 2765638..2766108 (-) | 471 | WP_031574019.1 | ribosomal protein S18-alanine N-acetyltransferase | - |
| GCD22_RS14530 (GCD22_03104) | tsaB | 2766105..2766770 (-) | 666 | WP_031574022.1 | tRNA (adenosine(37)-N6)-threonylcarbamoyltransferase complex dimerization subunit type 1 TsaB | - |
Sequence
Protein
Download Length: 503 a.a. Molecular weight: 53869.75 Da Isoelectric Point: 7.7577
>NTDB_id=395264 GCD22_RS14510 WP_081577707.1 2761837..2763348(+) (comM) [Acidithiobacillus thiooxidans ATCC 19377]
MPLAIVQSRALTGVHAAAVAVECDLGPGLPTFAVVGLAETAVKESRDRVRSAIQNSGFEFPARRMVVNLAPADLPKDGGR
FDLPIAIGILAASGQIPLKALENLEMIGELALDGSLRPVTGALSSSLAAGQAGHALLLPEGNAVEAALARTAPVLACRNL
AEAVAHLRGTQPLPETVPHDEIGINAIPYPDLRDVRGQETVKRALIIAAVGGHHILLSGPPGTGKSMLAARLPGLLPPLR
RSEALEVAAIHSLQSGGFDVQRWGQRPFRSPHHSASSAALVGGGSRPRPGEISLAHHGVLFLDEMPEFSRSVLEVLREPL
ESGEIHISRAARQSTFPARFQLIAAMNPCPCGNLGNPHQACVCTPTQISQYRGRLSGPLLDRMDIQIEVPSLPVELLQKA
PEGESSAYWRERISAAINRQWQRQSCRNAQLQGEALDQHCALQPEGARLLTRAADTLHLSARGYHRILRLARSIADLEES
TTISLQHLSEAIQYRRLAQSFTL
MPLAIVQSRALTGVHAAAVAVECDLGPGLPTFAVVGLAETAVKESRDRVRSAIQNSGFEFPARRMVVNLAPADLPKDGGR
FDLPIAIGILAASGQIPLKALENLEMIGELALDGSLRPVTGALSSSLAAGQAGHALLLPEGNAVEAALARTAPVLACRNL
AEAVAHLRGTQPLPETVPHDEIGINAIPYPDLRDVRGQETVKRALIIAAVGGHHILLSGPPGTGKSMLAARLPGLLPPLR
RSEALEVAAIHSLQSGGFDVQRWGQRPFRSPHHSASSAALVGGGSRPRPGEISLAHHGVLFLDEMPEFSRSVLEVLREPL
ESGEIHISRAARQSTFPARFQLIAAMNPCPCGNLGNPHQACVCTPTQISQYRGRLSGPLLDRMDIQIEVPSLPVELLQKA
PEGESSAYWRERISAAINRQWQRQSCRNAQLQGEALDQHCALQPEGARLLTRAADTLHLSARGYHRILRLARSIADLEES
TTISLQHLSEAIQYRRLAQSFTL
Nucleotide
Download Length: 1512 bp
>NTDB_id=395264 GCD22_RS14510 WP_081577707.1 2761837..2763348(+) (comM) [Acidithiobacillus thiooxidans ATCC 19377]
TTGCCTCTGGCGATTGTCCAGAGTCGTGCGCTCACGGGGGTGCATGCTGCTGCAGTCGCCGTAGAATGCGATCTGGGGCC
GGGTCTACCCACTTTTGCTGTGGTGGGGTTGGCCGAGACTGCCGTCAAGGAATCCCGGGATCGCGTCCGCTCCGCCATTC
AGAACAGTGGATTCGAGTTTCCGGCCCGGCGCATGGTCGTCAATCTGGCGCCCGCCGATCTCCCCAAGGATGGAGGGCGT
TTTGACCTGCCTATTGCCATTGGCATTCTCGCCGCCAGTGGTCAGATTCCTCTGAAAGCCTTGGAAAATCTGGAAATGAT
CGGGGAACTGGCGCTGGATGGCAGTTTGCGTCCGGTGACCGGGGCCCTCTCCAGCAGTCTGGCTGCCGGTCAGGCCGGAC
ATGCTTTACTGCTCCCGGAAGGCAATGCCGTGGAAGCCGCTTTAGCTCGCACTGCGCCCGTATTGGCCTGCCGCAATCTC
GCGGAAGCTGTGGCCCACCTGCGCGGCACCCAACCACTGCCGGAAACGGTGCCTCATGACGAGATCGGCATAAACGCCAT
CCCCTATCCGGACTTGCGGGATGTGCGCGGTCAGGAAACCGTCAAACGGGCACTGATTATTGCGGCCGTCGGAGGTCATC
ATATTCTCCTGTCCGGGCCTCCCGGGACCGGGAAAAGCATGCTGGCCGCGCGTTTGCCCGGATTGCTTCCGCCCCTGCGC
CGCAGCGAAGCCCTGGAAGTGGCCGCCATCCATAGTTTGCAAAGCGGTGGATTTGATGTGCAACGCTGGGGACAACGTCC
CTTTCGCTCCCCTCATCATAGTGCCTCTTCAGCAGCACTGGTGGGTGGGGGATCGCGCCCCCGCCCCGGAGAAATCAGTC
TTGCGCATCATGGCGTTCTTTTTCTGGATGAAATGCCGGAATTCTCGCGGAGTGTTCTCGAAGTGCTTCGGGAACCTCTG
GAGTCCGGCGAAATCCATATTTCCAGAGCCGCCCGGCAAAGCACCTTCCCGGCGCGTTTTCAGTTGATTGCCGCCATGAA
TCCCTGCCCCTGCGGCAATCTCGGTAATCCACATCAGGCCTGTGTGTGCACCCCGACCCAAATTTCCCAGTATCGTGGCC
GTCTTTCCGGCCCTCTGCTGGATCGTATGGACATTCAAATAGAAGTACCGAGTCTTCCGGTAGAACTCCTGCAAAAAGCG
CCAGAAGGCGAAAGTTCTGCCTACTGGCGCGAACGGATCAGCGCGGCCATAAACCGACAGTGGCAACGACAGTCCTGCCG
CAATGCCCAACTGCAGGGCGAAGCGTTGGACCAGCATTGTGCGCTACAGCCCGAGGGCGCCCGCTTACTGACGCGCGCCG
CTGACACCCTGCATCTTTCCGCCCGTGGTTATCACCGCATTTTGCGTCTGGCCCGCAGCATTGCCGATCTGGAAGAAAGC
ACCACGATCAGCCTGCAGCATCTGTCCGAAGCCATCCAGTATCGGCGGCTGGCGCAAAGCTTTACTTTGTAG
TTGCCTCTGGCGATTGTCCAGAGTCGTGCGCTCACGGGGGTGCATGCTGCTGCAGTCGCCGTAGAATGCGATCTGGGGCC
GGGTCTACCCACTTTTGCTGTGGTGGGGTTGGCCGAGACTGCCGTCAAGGAATCCCGGGATCGCGTCCGCTCCGCCATTC
AGAACAGTGGATTCGAGTTTCCGGCCCGGCGCATGGTCGTCAATCTGGCGCCCGCCGATCTCCCCAAGGATGGAGGGCGT
TTTGACCTGCCTATTGCCATTGGCATTCTCGCCGCCAGTGGTCAGATTCCTCTGAAAGCCTTGGAAAATCTGGAAATGAT
CGGGGAACTGGCGCTGGATGGCAGTTTGCGTCCGGTGACCGGGGCCCTCTCCAGCAGTCTGGCTGCCGGTCAGGCCGGAC
ATGCTTTACTGCTCCCGGAAGGCAATGCCGTGGAAGCCGCTTTAGCTCGCACTGCGCCCGTATTGGCCTGCCGCAATCTC
GCGGAAGCTGTGGCCCACCTGCGCGGCACCCAACCACTGCCGGAAACGGTGCCTCATGACGAGATCGGCATAAACGCCAT
CCCCTATCCGGACTTGCGGGATGTGCGCGGTCAGGAAACCGTCAAACGGGCACTGATTATTGCGGCCGTCGGAGGTCATC
ATATTCTCCTGTCCGGGCCTCCCGGGACCGGGAAAAGCATGCTGGCCGCGCGTTTGCCCGGATTGCTTCCGCCCCTGCGC
CGCAGCGAAGCCCTGGAAGTGGCCGCCATCCATAGTTTGCAAAGCGGTGGATTTGATGTGCAACGCTGGGGACAACGTCC
CTTTCGCTCCCCTCATCATAGTGCCTCTTCAGCAGCACTGGTGGGTGGGGGATCGCGCCCCCGCCCCGGAGAAATCAGTC
TTGCGCATCATGGCGTTCTTTTTCTGGATGAAATGCCGGAATTCTCGCGGAGTGTTCTCGAAGTGCTTCGGGAACCTCTG
GAGTCCGGCGAAATCCATATTTCCAGAGCCGCCCGGCAAAGCACCTTCCCGGCGCGTTTTCAGTTGATTGCCGCCATGAA
TCCCTGCCCCTGCGGCAATCTCGGTAATCCACATCAGGCCTGTGTGTGCACCCCGACCCAAATTTCCCAGTATCGTGGCC
GTCTTTCCGGCCCTCTGCTGGATCGTATGGACATTCAAATAGAAGTACCGAGTCTTCCGGTAGAACTCCTGCAAAAAGCG
CCAGAAGGCGAAAGTTCTGCCTACTGGCGCGAACGGATCAGCGCGGCCATAAACCGACAGTGGCAACGACAGTCCTGCCG
CAATGCCCAACTGCAGGGCGAAGCGTTGGACCAGCATTGTGCGCTACAGCCCGAGGGCGCCCGCTTACTGACGCGCGCCG
CTGACACCCTGCATCTTTCCGCCCGTGGTTATCACCGCATTTTGCGTCTGGCCCGCAGCATTGCCGATCTGGAAGAAAGC
ACCACGATCAGCCTGCAGCATCTGTCCGAAGCCATCCAGTATCGGCGGCTGGCGCAAAGCTTTACTTTGTAG
3D structure
| Source | ID | Structure |
|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comM | Vibrio campbellii strain DS40M4 |
51.473 |
100 |
0.521 |
| comM | Vibrio cholerae strain A1552 |
52.515 |
98.807 |
0.519 |
| comM | Legionella pneumophila strain ERS1305867 |
51.503 |
99.205 |
0.511 |
| comM | Legionella pneumophila str. Paris |
51.503 |
99.205 |
0.511 |
| comM | Haemophilus influenzae Rd KW20 |
50.098 |
100 |
0.507 |
| comM | Glaesserella parasuis strain SC1401 |
50.501 |
99.205 |
0.501 |
| RA0C_RS07335 | Riemerella anatipestifer ATCC 11845 = DSM 15868 |
44.51 |
100 |
0.451 |