Detailed information
Overview
| Name | comZ | Type | Machinery gene |
| Locus tag | TCCBUS3UF1_RS04275 | Genome accession | NC_017278 |
| Coordinates | 851262..852929 (-) | Length | 555 a.a. |
| NCBI ID | WP_014515277.1 | Uniprot ID | - |
| Organism | Thermus sp. CCB_US3_UF1 | ||
| Function | assembly of type IV pilus (predicted from homology) DNA binding and uptake |
||
Genomic Context
Location: 846262..857929
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| TCCBUS3UF1_RS04255 | - | 847369..848742 (+) | 1374 | WP_155983259.1 | O-antigen ligase family protein | - |
| TCCBUS3UF1_RS04260 (TCCBUS3UF1_8660) | - | 848674..850254 (-) | 1581 | WP_014515273.1 | O-antigen ligase family protein | - |
| TCCBUS3UF1_RS04265 (TCCBUS3UF1_8670) | - | 850263..850547 (-) | 285 | WP_050802035.1 | hypothetical protein | - |
| TCCBUS3UF1_RS12425 | - | 850566..850616 (-) | 51 | Protein_858 | prepilin-type N-terminal cleavage/methylation domain-containing protein | - |
| TCCBUS3UF1_RS04270 (TCCBUS3UF1_8680) | - | 850686..851045 (-) | 360 | WP_014515275.1 | type II secretion system protein | - |
| TCCBUS3UF1_RS04275 (TCCBUS3UF1_8700) | comZ | 851262..852929 (-) | 1668 | WP_014515277.1 | pilus assembly PilX N-terminal domain-containing protein | Machinery gene |
| TCCBUS3UF1_RS04280 (TCCBUS3UF1_8710) | pilA/pilA3 | 852939..853643 (-) | 705 | WP_014515278.1 | PilW family protein | Machinery gene |
| TCCBUS3UF1_RS04285 (TCCBUS3UF1_8720) | pilA/pilA2 | 853640..854248 (-) | 609 | WP_014515279.1 | prepilin-type N-terminal cleavage/methylation domain-containing protein | Machinery gene |
| TCCBUS3UF1_RS04290 (TCCBUS3UF1_8730) | - | 854245..854718 (-) | 474 | WP_041433756.1 | GspH/FimT family protein | - |
| TCCBUS3UF1_RS04295 (TCCBUS3UF1_8740) | pilA/pilA1 | 854773..855243 (-) | 471 | WP_014515281.1 | GspH/FimT family protein | Machinery gene |
| TCCBUS3UF1_RS04300 (TCCBUS3UF1_8750) | - | 855310..856500 (+) | 1191 | WP_014515282.1 | thiolase family protein | - |
| TCCBUS3UF1_RS04305 | - | 856497..856805 (-) | 309 | WP_041433963.1 | DUF503 domain-containing protein | - |
Sequence
Protein
Download Length: 555 a.a. Molecular weight: 59979.47 Da Isoelectric Point: 7.4035
>NTDB_id=45822 TCCBUS3UF1_RS04275 WP_014515277.1 851262..852929(-) (comZ) [Thermus sp. CCB_US3_UF1]
MRSKGIALVATLALMVLIGLLVFSTFFRTQIELWVTRNDTTSVQAFYAAEAGLQKYKAVLFQQYVWREQQGQTGGGSGCF
TSLVTGLDLDRNGNLLTFVNNQITLATNEVVVDADNRPIGRYTVTLYRDANDGQLFTLVSQGTSGGAKATVQATVRLSNT
GYLEQAIFAGTGQANKWLNGGATIRGGIYIVGSPSNPNQTVIDANGNFELLNWYDLSSYSGIAARVDTAYRQANDLCASL
RVQYGKISVGGSTRLGEPSNKLKGVFVGRGGQDITGQNVDVCQNNKGVCTEAMGPFDLANPPAFPTLDARLNSEACKDYS
TWRACLQDRAALRIQRVSNTVSLAYPLSVTLNASCFNAINNSGVLTLDKSTVDCTYTRLDGSQGGFKYTYASNQGLLEIY
GDVVLEGLNVVFNQPTEYKALSGNEKNATFAVLAKNNQGGNVDLNDNLLPQTNHGLFPNHALGLVAEDDIYQRGQYVMAP
VYAGGTFRVVKDNVLFGSVISNEFCTTSAGNQTNCNAGQKAEVVYIRIPQENRPVLLPAIRGGTPVFQILSYERR
MRSKGIALVATLALMVLIGLLVFSTFFRTQIELWVTRNDTTSVQAFYAAEAGLQKYKAVLFQQYVWREQQGQTGGGSGCF
TSLVTGLDLDRNGNLLTFVNNQITLATNEVVVDADNRPIGRYTVTLYRDANDGQLFTLVSQGTSGGAKATVQATVRLSNT
GYLEQAIFAGTGQANKWLNGGATIRGGIYIVGSPSNPNQTVIDANGNFELLNWYDLSSYSGIAARVDTAYRQANDLCASL
RVQYGKISVGGSTRLGEPSNKLKGVFVGRGGQDITGQNVDVCQNNKGVCTEAMGPFDLANPPAFPTLDARLNSEACKDYS
TWRACLQDRAALRIQRVSNTVSLAYPLSVTLNASCFNAINNSGVLTLDKSTVDCTYTRLDGSQGGFKYTYASNQGLLEIY
GDVVLEGLNVVFNQPTEYKALSGNEKNATFAVLAKNNQGGNVDLNDNLLPQTNHGLFPNHALGLVAEDDIYQRGQYVMAP
VYAGGTFRVVKDNVLFGSVISNEFCTTSAGNQTNCNAGQKAEVVYIRIPQENRPVLLPAIRGGTPVFQILSYERR
Nucleotide
Download Length: 1668 bp
>NTDB_id=45822 TCCBUS3UF1_RS04275 WP_014515277.1 851262..852929(-) (comZ) [Thermus sp. CCB_US3_UF1]
ATGCGGTCTAAAGGCATTGCCCTGGTAGCCACCCTGGCCCTGATGGTGCTGATCGGCCTTTTGGTCTTCAGCACCTTCTT
CCGAACCCAGATAGAACTCTGGGTAACCCGCAACGACACCACCTCGGTCCAGGCCTTTTACGCCGCCGAAGCCGGCCTGC
AAAAGTACAAGGCCGTTCTCTTCCAGCAGTACGTATGGCGGGAACAGCAGGGCCAGACGGGAGGGGGTAGCGGCTGCTTT
ACCTCCTTGGTCACGGGCCTGGACCTGGACCGCAACGGCAACCTCCTCACCTTCGTCAACAACCAGATCACCTTGGCCAC
CAACGAGGTGGTGGTGGACGCGGACAACCGCCCCATCGGCCGCTACACCGTAACCCTCTACCGGGATGCCAACGACGGCC
AGCTCTTTACCCTGGTTTCCCAGGGCACCTCGGGCGGGGCCAAGGCCACGGTGCAGGCCACGGTCCGCCTCAGCAACACG
GGCTACCTGGAGCAGGCCATCTTTGCCGGTACCGGCCAGGCCAACAAGTGGCTGAACGGCGGGGCCACCATCCGCGGGGG
CATCTACATCGTGGGTAGCCCCAGCAACCCCAACCAGACGGTCATAGACGCCAACGGCAACTTTGAGCTGCTCAACTGGT
ACGACCTGAGCAGCTACAGCGGCATCGCCGCCCGGGTGGACACCGCCTACCGCCAAGCCAACGACCTCTGCGCCAGCCTG
CGGGTGCAGTACGGCAAGATTTCCGTGGGCGGCAGCACCCGCTTGGGGGAACCCAGCAACAAGCTCAAAGGGGTCTTTGT
GGGCCGGGGCGGCCAGGACATCACCGGGCAGAACGTGGACGTCTGCCAGAACAACAAGGGGGTCTGCACCGAGGCCATGG
GCCCCTTTGACCTGGCCAACCCCCCGGCCTTCCCCACCCTGGACGCCAGGCTCAACTCCGAAGCCTGCAAGGACTACAGC
ACCTGGCGGGCCTGCTTGCAGGACAGGGCCGCCTTGCGCATCCAGCGCGTGAGCAACACCGTGAGCCTGGCCTATCCCTT
GAGCGTCACCCTCAACGCCTCCTGCTTCAACGCCATCAACAACTCGGGGGTCCTGACCCTGGATAAGAGCACCGTGGACT
GCACCTACACCCGGCTGGACGGCTCCCAGGGCGGCTTCAAGTACACCTACGCCAGCAACCAGGGGCTTCTGGAAATCTAC
GGGGACGTGGTCCTGGAAGGCCTGAACGTGGTCTTCAACCAGCCCACGGAGTACAAAGCCCTTTCCGGAAACGAAAAGAA
CGCCACCTTCGCCGTGCTGGCCAAGAACAACCAAGGCGGCAACGTGGACCTGAACGACAACCTTCTCCCCCAGACCAACC
ACGGGCTTTTCCCCAACCACGCCCTGGGCCTGGTGGCGGAAGACGACATCTACCAAAGGGGCCAGTACGTGATGGCTCCC
GTGTATGCCGGGGGCACGTTCCGCGTGGTGAAGGATAACGTCCTCTTCGGCTCGGTCATCAGCAACGAGTTCTGCACCAC
CAGCGCAGGCAACCAGACGAACTGCAACGCCGGGCAGAAGGCCGAGGTGGTGTACATCCGCATCCCCCAGGAAAACCGCC
CCGTCCTCCTGCCCGCCATCAGGGGCGGCACCCCCGTCTTCCAAATCCTTTCCTATGAGCGGCGCTAG
ATGCGGTCTAAAGGCATTGCCCTGGTAGCCACCCTGGCCCTGATGGTGCTGATCGGCCTTTTGGTCTTCAGCACCTTCTT
CCGAACCCAGATAGAACTCTGGGTAACCCGCAACGACACCACCTCGGTCCAGGCCTTTTACGCCGCCGAAGCCGGCCTGC
AAAAGTACAAGGCCGTTCTCTTCCAGCAGTACGTATGGCGGGAACAGCAGGGCCAGACGGGAGGGGGTAGCGGCTGCTTT
ACCTCCTTGGTCACGGGCCTGGACCTGGACCGCAACGGCAACCTCCTCACCTTCGTCAACAACCAGATCACCTTGGCCAC
CAACGAGGTGGTGGTGGACGCGGACAACCGCCCCATCGGCCGCTACACCGTAACCCTCTACCGGGATGCCAACGACGGCC
AGCTCTTTACCCTGGTTTCCCAGGGCACCTCGGGCGGGGCCAAGGCCACGGTGCAGGCCACGGTCCGCCTCAGCAACACG
GGCTACCTGGAGCAGGCCATCTTTGCCGGTACCGGCCAGGCCAACAAGTGGCTGAACGGCGGGGCCACCATCCGCGGGGG
CATCTACATCGTGGGTAGCCCCAGCAACCCCAACCAGACGGTCATAGACGCCAACGGCAACTTTGAGCTGCTCAACTGGT
ACGACCTGAGCAGCTACAGCGGCATCGCCGCCCGGGTGGACACCGCCTACCGCCAAGCCAACGACCTCTGCGCCAGCCTG
CGGGTGCAGTACGGCAAGATTTCCGTGGGCGGCAGCACCCGCTTGGGGGAACCCAGCAACAAGCTCAAAGGGGTCTTTGT
GGGCCGGGGCGGCCAGGACATCACCGGGCAGAACGTGGACGTCTGCCAGAACAACAAGGGGGTCTGCACCGAGGCCATGG
GCCCCTTTGACCTGGCCAACCCCCCGGCCTTCCCCACCCTGGACGCCAGGCTCAACTCCGAAGCCTGCAAGGACTACAGC
ACCTGGCGGGCCTGCTTGCAGGACAGGGCCGCCTTGCGCATCCAGCGCGTGAGCAACACCGTGAGCCTGGCCTATCCCTT
GAGCGTCACCCTCAACGCCTCCTGCTTCAACGCCATCAACAACTCGGGGGTCCTGACCCTGGATAAGAGCACCGTGGACT
GCACCTACACCCGGCTGGACGGCTCCCAGGGCGGCTTCAAGTACACCTACGCCAGCAACCAGGGGCTTCTGGAAATCTAC
GGGGACGTGGTCCTGGAAGGCCTGAACGTGGTCTTCAACCAGCCCACGGAGTACAAAGCCCTTTCCGGAAACGAAAAGAA
CGCCACCTTCGCCGTGCTGGCCAAGAACAACCAAGGCGGCAACGTGGACCTGAACGACAACCTTCTCCCCCAGACCAACC
ACGGGCTTTTCCCCAACCACGCCCTGGGCCTGGTGGCGGAAGACGACATCTACCAAAGGGGCCAGTACGTGATGGCTCCC
GTGTATGCCGGGGGCACGTTCCGCGTGGTGAAGGATAACGTCCTCTTCGGCTCGGTCATCAGCAACGAGTTCTGCACCAC
CAGCGCAGGCAACCAGACGAACTGCAACGCCGGGCAGAAGGCCGAGGTGGTGTACATCCGCATCCCCCAGGAAAACCGCC
CCGTCCTCCTGCCCGCCATCAGGGGCGGCACCCCCGTCTTCCAAATCCTTTCCTATGAGCGGCGCTAG
Domains
No domain identified.
3D structure
| Source | ID | Structure |
|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comZ | Thermus thermophilus HB27 |
73.333 |
100 |
0.733 |