Detailed information of oriT
oriT
The information of the oriT region
oriTDB ID | 200477 |
Name | oriT_ICEHaeULPAs1-1 |
Organism | Herminiimonas arsenicoxydans |
Sequence Completeness | - |
NCBI accession of oriT (coordinates [strand]) | CU207211 (59314..59788 [-], 475 nt) |
oriT length | 475 nt |
IRs (inverted repeats) | 375..380, 393..398 (GCATTG..CAATGC); 141..147, 154..160 (TGCGGCC..GGCCGCA) |
Location of nic site | _ |
Conserved sequence flanking the nic site | _ |
Note | Predicted by oriTfinder 2.0 |
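The two inverted repeats listed in the IRs field can be sanity-checked by confirming that each right arm is the reverse complement of its left arm. The sketch below is only an illustration: the helper names `reverse_complement` and `is_perfect_ir` are hypothetical and are not part of oriTfinder, and the arm sequences are copied from this record.

```python
# Hypothetical helpers for illustration; not part of oriTfinder.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an uppercase DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


def is_perfect_ir(left_arm: str, right_arm: str) -> bool:
    """True if the right arm is the exact reverse complement of the left arm."""
    return reverse_complement(left_arm) == right_arm


# (left arm, right arm) pairs copied from the IRs field of this record.
ir_arms = [
    ("GCATTG", "CAATGC"),    # 375..380, 393..398
    ("TGCGGCC", "GGCCGCA"),  # 141..147, 154..160
]

results = [is_perfect_ir(left, right) for left, right in ir_arms]
print(results)
```

Both pairs are perfect inverted repeats under this check, which matches how they are reported in the record.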
oriT sequence
Length: 475 nt
>oriT_ICEHaeULPAs1-1
GAAGCGGGAAATTGCTGGACCAGCTCGGCATATCGTTCCAGCGGCGTGCGGTACAGGGTGACGAACTGCTTGCGCGAGAGGGAGGTGCGCTGCCAGATGTATTCCAGCAGCTTCTGCCGGCGCGGCGTGGCCAGCAGCGATGCGGCCGACTCGGGCCGCAGCAGCCCTTTCGGGAGATCGAGGGCGGGCGCTGGCGACGGAGCGGCAGCGACCGGAGGTCGTTTCCGCTGGAACAGAAAGAGCATGCGGGTGTCCTGGTGGTGGGCCGAGCGGGGGGCCTTTTCGCCTTTTCGAGGTAGGGCCTTTCCCCTTGCACCCCATTCCCTTGCCCTTTCGGCCCTTTAGCCTTTAACCATTTGGGTATAGGGCGTGGGGCATTGACCGTCCACGGCCAATGCGAGGTTCGGCGGGCCGGATTGATGCGTGAAGGCTTGGTGTTCCCGCCCTACGATGGGCCGATTCTTGGACTCGGGAG
Visualization of oriT structure
oriT secondary structure
Predicted by RNAfold.
Relaxase
ID | 14514 | GenBank | CAL62175 |
Name | TraI_2_HEAR2031_ICEHaeULPAs1-1 | UniProt ID | _ |
Length | 614 a.a. | PDB ID | _ |
Note | Predicted by oriTfinder 2.0 |
Relaxase protein sequence
Length: 614 a.a. | Molecular weight: 67517.78 Da | Isoelectric point: 6.6258
>CAL62175.1 Conserved hypothetical protein, putative relaxase [Herminiimonas arsenicoxydans]
MLFLFQRKRPPVAAAPSPAPALDLPKGLLRPESAASLLATPRRQKLLEYIWQRTSLSRKQFVTLYRTPLE
RYAELVQQFPASESHHHAYPGGMLDHGLEIVAYSLKLRQSHLLPIGASPEDQAAQSEAWTAAVAYAALLH
DIGKIAVDLHVELADGNTWHPWHGPLLQPYRFRYREDREYRLHSAATGLLYRQLLDRHVLDWLSGYPALW
APLLYVLAGQYEHAGVLGELVVQADRASVAQELGGDPARVMAAPKHALQRKLLDGLRYLLKEELKLNQPE
ASDGWLTEDGLWLVSKTVSDKLRAHLLSQGIDGIPANNTAVFNVLQDHGMLQPTSDGKAVWRATVTSTTG
WSHSFTLLRLAPALIWESGERPAPFAGTVEIDATPAENDACMSAPAPTVSVNPAQGGQEPPIWEGDSTTI
VSPPAAQPVPDVMEDLLAMVGLGESAGVGQDAEEFLHTPAPATAATSIPSPAPAPAPTPGSSATKPSGEQ
FMVWLKQGIASRRLIINDAKALVHTVNETAYLVSPGVFQRYAQEHPEVAALAKQENQQDWQWVQKRFEKL
QLHRKQPNGLNIWTCEVTGPRKSRRLHGYLLENRSLVFAEIPPNNPYLALTQEG
Protein domains
Predicted by InterproScan.
Protein structure
No available structure.
T4CP
ID | 17130 | GenBank | CAL62149 |
Name | t4cp2_HEAR2005_ICEHaeULPAs1-1 | UniProt ID | _ |
Length | 728 a.a. | PDB ID | _ |
Note | Predicted by oriTfinder 2.0 |
T4CP protein sequence
Length: 728 a.a. | Molecular weight: 80698.35 Da | Isoelectric point: 7.3166
>CAL62149.1 Conserved hypothetical protein, putative ATPase domain [Herminiimonas arsenicoxydans]
MSGKQPVEVLLRPAVELYTVAACAGAAFLSLVAPWSLALSPAMGVGSALAFGAYGAIRYRDARVILCYRR
NIRRLPRYVMTSKDVPVSQQRLFVGRGFLWEQKHTHRLMQTYRPEFRRYVEPTPAYRLARRLEERLEFAP
FPLSRLSKLTGWDVPFNPVRPLPPVGGLPRLHGIEPDEVDVSLPLGERVGHSLVLGTTRVGKTRLAELFV
TQDIRRRNAAGEHEVVIVIDPKGDADLLKRMYVEAKRAGREGEFYVFHLGWPDISARYNAVGRFGRISEV
ATRIAGQLSGEGNSAAFREFAWRFVNIIARALVELGQRPDYMLIQRHVINIDALFIEYAQHYFAKTEPKA
WEVIVQIEAKLNEKNIPRNMIGREKRVVALEQYLSQARNYDPVLDGLRSAVRYDKTYFDKIVASLLPLLE
KLTSGKISQLLAPNYSDLADPRPIFDWMQVIRKRAVVYVGLDALSDAEVAAAVGNSMFSDLVSVAGHIYK
HGIDDGLPGASAGTRVPINVHADEFNELMGDEFVPLINKGGGAGLQVTAYTQTLSDIEARIGNRAKAGQV
IGNFNNLFMLRVRETATAELLTRQLPKVEVYTTTIVSGATDSSDIRGATDFTSNTQDRISMSSVPMIEPS
HVVALPKGQCFALLQGGQLWKVRMPLPAPDPDEVMPQDLQQLAGYMRQSYSEATQWWEFTSSPALQDGAL
PDDLLDDAAAAEPGPVATDDSAGNEASP
Protein domains
Predicted by InterproScan.
Protein structure
No available structure.
T4SS
The T4SS gene cluster was predicted using oriTfinder 2.0.
Region 1: 2010192..2036064
Locus tag | Coordinates | Strand | Size (bp) | Protein ID | Product | Description |
---|---|---|---|---|---|---|
HEAR1997 | 2005559..2006338 | + | 780 | CAL62141 | Conserved hypothetical protein, putative competence protein | - |
HEAR1998 | 2006405..2008171 | - | 1767 | CAL62142 | putative DNA helicase II | - |
HEAR1999 | 2008206..2009990 | - | 1785 | CAL62143 | Conserved hypothetical protein, putative ATP-dependent endonuclease | - |
HEAR2000 | 2010192..2010812 | + | 621 | CAL62144 | Conserved hypothetical protein | tfc2 |
HEAR2001 | 2010809..2011459 | + | 651 | CAL62145 | Conserved hypothetical protein | - |
HEAR2002 | 2011474..2012199 | + | 726 | CAL62146 | Conserved hypothetical protein; putative exported protein | tfc3 |
HEAR2003 | 2012181..2012771 | + | 591 | CAL62147 | Conserved hypothetical protein, putative lytic transglycolase | virB1 |
HEAR2004 | 2012768..2013316 | + | 549 | CAL62148 | Conserved hypothetical protein; putative exported protein | tfc5 |
HEAR2005 | 2013321..2015507 | + | 2187 | CAL62149 | Conserved hypothetical protein, putative ATPase domain | - |
HEAR2006 | 2015504..2016253 | + | 750 | CAL62150 | Conserved hypothetical protein; putative membrane protein | tfc7 |
HEAR2007 | 2016283..2018907 | - | 2625 | CAL62151 | Conserved hypothetical protein, putative ATPase involved in DNA repair | - |
HEAR2008 | 2019008..2020060 | - | 1053 | CAL62152 | Conserved hypothetical protein | - |
HEAR2010 | 2020212..2020886 | - | 675 | CAL62154 | Conserved hypothetical protein | - |
HEAR2011 | 2020949..2021893 | + | 945 | CAL62155 | Conserved hypothetical protein | - |
HEAR2012 | 2022030..2022410 | + | 381 | CAL62156 | Conserved hypothetical protein; putative exported protein | tfc8 |
HEAR2013 | 2022407..2022640 | + | 234 | CAL62157 | Conserved hypothetical protein; putative membrane protein | tfc9 |
HEAR2014 | 2022657..2023016 | + | 360 | CAL62158 | Conserved hypothetical protein; putative exported or membrane protein | tfc10 |
HEAR2015 | 2023029..2023439 | + | 411 | CAL62159 | Conserved hypothetical protein; putative membrane protein | tfc11 |
HEAR2016 | 2023436..2024125 | + | 690 | CAL62160 | Conserved hypothetical protein | tfc12 |
HEAR2017 | 2024122..2025039 | + | 918 | CAL62161 | Conserved hypothetical protein; putative exported protein | tfc13 |
HEAR2018 | 2025029..2026459 | + | 1431 | CAL62162 | Conserved hypothetical protein | tfc14 |
HEAR2019 | 2026440..2026880 | + | 441 | CAL62163 | Conserved hypothetical protein; putative exported protein | tfc15 |
HEAR2020 | 2026880..2029762 | + | 2883 | CAL62164 | Conserved hypothetical protein | virB4 |
HEAR2021 | 2029776..2030540 | + | 765 | CAL62165 | Conserved hypothetical protein, putative disulfide isomerase | - |
HEAR2022 | 2030725..2031219 | + | 495 | CAL62166 | putative DNA repair protein RadC | - |
HEAR2023 | 2031378..2031824 | + | 447 | CAL62167 | Conserved hypothetical protein | tfc24 |
HEAR2024 | 2031821..2032768 | + | 948 | CAL62168 | Conserved hypothetical protein | tfc23 |
HEAR2025 | 2032778..2034175 | + | 1398 | CAL62169 | Conserved hypothetical protein; putative exported protein | tfc22 |
HEAR2026 | 2034172..2034531 | + | 360 | CAL62170 | Conserved hypothetical protein; putative exported protein | tfc18 |
HEAR2027 | 2034547..2036064 | + | 1518 | CAL62171 | Conserved hypothetical protein; putative membrane protein | tfc19 |
HEAR2028 | 2036078..2036455 | - | 378 | CAL62172 | Conserved hypothetical protein; putative exported protein | - |
HEAR2029 | 2036483..2036941 | - | 459 | CAL62173 | Conserved hypothetical protein | - |
HEAR2030 | 2036941..2037258 | - | 318 | CAL62174 | Conserved hypothetical protein, putative transcriptional regulator | - |
HEAR2031 | 2037564..2039408 | + | 1845 | CAL62175 | Conserved hypothetical protein, putative relaxase | - |
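The Coordinates and Size (bp) columns above follow the usual inclusive convention, so size equals end minus start plus one. The same arithmetic links a coding sequence to the protein lengths given in the Relaxase and T4CP sections. The sketch below illustrates this on a few rows copied from the table. The function names are hypothetical, and `protein_length` assumes each CDS ends with a single stop codon that is counted in its coordinates.

```python
def span_length(coords: str) -> int:
    """Length of an inclusive 'start..end' coordinate range."""
    start, end = (int(x) for x in coords.split(".."))
    return end - start + 1


def protein_length(cds_bp: int) -> int:
    """Amino-acid count, assuming the CDS includes one terminal stop codon."""
    return cds_bp // 3 - 1


# (locus tag, coordinates, listed size in bp) copied from the table above.
rows = [
    ("HEAR1997", "2005559..2006338", 780),
    ("HEAR2005", "2013321..2015507", 2187),  # T4CP, listed as 728 a.a.
    ("HEAR2031", "2037564..2039408", 1845),  # relaxase, listed as 614 a.a.
]

for tag, coords, listed_bp in rows:
    computed = span_length(coords)
    status = "ok" if computed == listed_bp else "MISMATCH"
    print(tag, computed, protein_length(computed), status)
```

The same `span_length` check applies to the oriT coordinates 59314..59788, which give the 475 nt reported for the oriT region.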
Host bacterium
ID | 422 | Element type | ICE (Integrative and conjugative element) |
Element name | ICEHaeULPAs1-1 | GenBank | CU207211 |
Element size | 3424307 bp | Coordinate of oriT [Strand] | 59314..59788 [-] |
Host bacterium | Herminiimonas arsenicoxydans | Coordinate of element | 1978021..2066138 |
Cargo genes
Drug resistance gene | - |
Virulence gene | - |
Metal resistance gene | arsH, arsC |
Degradation gene | - |
Symbiosis gene | - |
Anti-CRISPR | - |