Detailed information    

In silico: bioinformatically predicted

Overview


Name: comM
Type: Machinery gene
Locus tag: GCD22_RS14510
Genome accession: NZ_CP045571
Coordinates: 2761837..2763348 (+)
Length: 503 a.a.
NCBI ID: WP_081577707.1
UniProt ID: -
Organism: Acidithiobacillus thiooxidans ATCC 19377
Function: DNA uptake (predicted from homology)
DNA binding and uptake
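
The coordinates and the protein length reported above can be cross-checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length; the variable names are illustrative only.

```python
# Coordinates of comM as listed in the record (assumed 1-based, inclusive).
start, end = 2761837, 2763348

# Inclusive span length in base pairs.
length_bp = end - start + 1

# Number of complete codons in the span.
codon_count = length_bp // 3

# Assumption: the final codon is a stop codon and encodes no residue.
protein_length = codon_count - 1

print(length_bp, codon_count, protein_length)
```

Under these assumptions the span is 1512 bp, which matches the nucleotide length given in the Sequence section, and the derived protein length agrees with the 503 a.a. listed here.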

Genomic Context


Location: 2756837..2768348
Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description
GCD22_RS14480 (GCD22_03093) | - | 2756982..2757371 (-) | 390 | WP_010637778.1 | SCP2 sterol-binding domain-containing protein | -
GCD22_RS14485 (GCD22_03094) | - | 2757456..2758202 (-) | 747 | WP_031568745.1 | tRNA threonylcarbamoyladenosine dehydratase | -
GCD22_RS14490 (GCD22_03095) | - | 2758203..2759012 (-) | 810 | WP_031568743.1 | TatD family hydrolase | -
GCD22_RS14495 (GCD22_03096) | - | 2759153..2760382 (+) | 1230 | WP_031568741.1 | FAD-dependent monooxygenase | -
GCD22_RS14500 (GCD22_03097) | - | 2760372..2761559 (+) | 1188 | WP_031568739.1 | FAD-dependent monooxygenase | -
GCD22_RS14505 (GCD22_03098) | - | 2761570..2761833 (+) | 264 | WP_010637786.1 | accessory factor UbiK family protein | -
GCD22_RS14510 (GCD22_03099) | comM | 2761837..2763348 (+) | 1512 | WP_081577707.1 | YifB family Mg chelatase-like AAA ATPase | Machinery gene
GCD22_RS14515 (GCD22_03101) | - | 2763535..2763978 (-) | 444 | WP_031574014.1 | SEL1-like repeat protein | -
GCD22_RS14520 (GCD22_03102) | - | 2764006..2765628 (-) | 1623 | WP_051690695.1 | peptide chain release factor 3 | -
GCD22_RS14525 (GCD22_03103) | rimI | 2765638..2766108 (-) | 471 | WP_031574019.1 | ribosomal protein S18-alanine N-acetyltransferase | -
GCD22_RS14530 (GCD22_03104) | tsaB | 2766105..2766770 (-) | 666 | WP_031574022.1 | tRNA (adenosine(37)-N6)-threonylcarbamoyltransferase complex dimerization subunit type 1 TsaB | -
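
The numbers in this section can be sanity-checked in a few lines. The sketch below rests on one inference that the record does not state: the displayed location looks like the comM coordinates extended by 5000 bp on each side. It also recomputes the size column for three rows, assuming 1-based inclusive coordinates. `FLANK` and `span_length` are hypothetical names introduced for this example.

```python
# comM coordinates from the record.
GENE_START, GENE_END = 2761837, 2763348

# Assumption (inferred from the numbers, not stated by the record):
# the context window extends 5000 bp on each side of the gene.
FLANK = 5000
window = (GENE_START - FLANK, GENE_END + FLANK)

# (locus tag, start, end, reported size in bp) for three rows of the table.
rows = [
    ("GCD22_RS14505", 2761570, 2761833, 264),
    ("GCD22_RS14510", 2761837, 2763348, 1512),
    ("GCD22_RS14515", 2763535, 2763978, 444),
]


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate span."""
    return end - start + 1


# Rows whose reported size disagrees with their coordinates.
mismatches = [tag for tag, s, e, size in rows if span_length(s, e) != size]

print(window, mismatches)
```

With the assumed flank, the computed window equals the listed location 2756837..2768348, and none of the three rows shows a size mismatch.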

Sequence


Protein


Length: 503 a.a.        Molecular weight: 53869.75 Da        Isoelectric Point: 7.7577

>NTDB_id=395264 GCD22_RS14510 WP_081577707.1 2761837..2763348(+) (comM) [Acidithiobacillus thiooxidans ATCC 19377]
MPLAIVQSRALTGVHAAAVAVECDLGPGLPTFAVVGLAETAVKESRDRVRSAIQNSGFEFPARRMVVNLAPADLPKDGGR
FDLPIAIGILAASGQIPLKALENLEMIGELALDGSLRPVTGALSSSLAAGQAGHALLLPEGNAVEAALARTAPVLACRNL
AEAVAHLRGTQPLPETVPHDEIGINAIPYPDLRDVRGQETVKRALIIAAVGGHHILLSGPPGTGKSMLAARLPGLLPPLR
RSEALEVAAIHSLQSGGFDVQRWGQRPFRSPHHSASSAALVGGGSRPRPGEISLAHHGVLFLDEMPEFSRSVLEVLREPL
ESGEIHISRAARQSTFPARFQLIAAMNPCPCGNLGNPHQACVCTPTQISQYRGRLSGPLLDRMDIQIEVPSLPVELLQKA
PEGESSAYWRERISAAINRQWQRQSCRNAQLQGEALDQHCALQPEGARLLTRAADTLHLSARGYHRILRLARSIADLEES
TTISLQHLSEAIQYRRLAQSFTL

Nucleotide


Length: 1512 bp

>NTDB_id=395264 GCD22_RS14510 WP_081577707.1 2761837..2763348(+) (comM) [Acidithiobacillus thiooxidans ATCC 19377]
TTGCCTCTGGCGATTGTCCAGAGTCGTGCGCTCACGGGGGTGCATGCTGCTGCAGTCGCCGTAGAATGCGATCTGGGGCC
GGGTCTACCCACTTTTGCTGTGGTGGGGTTGGCCGAGACTGCCGTCAAGGAATCCCGGGATCGCGTCCGCTCCGCCATTC
AGAACAGTGGATTCGAGTTTCCGGCCCGGCGCATGGTCGTCAATCTGGCGCCCGCCGATCTCCCCAAGGATGGAGGGCGT
TTTGACCTGCCTATTGCCATTGGCATTCTCGCCGCCAGTGGTCAGATTCCTCTGAAAGCCTTGGAAAATCTGGAAATGAT
CGGGGAACTGGCGCTGGATGGCAGTTTGCGTCCGGTGACCGGGGCCCTCTCCAGCAGTCTGGCTGCCGGTCAGGCCGGAC
ATGCTTTACTGCTCCCGGAAGGCAATGCCGTGGAAGCCGCTTTAGCTCGCACTGCGCCCGTATTGGCCTGCCGCAATCTC
GCGGAAGCTGTGGCCCACCTGCGCGGCACCCAACCACTGCCGGAAACGGTGCCTCATGACGAGATCGGCATAAACGCCAT
CCCCTATCCGGACTTGCGGGATGTGCGCGGTCAGGAAACCGTCAAACGGGCACTGATTATTGCGGCCGTCGGAGGTCATC
ATATTCTCCTGTCCGGGCCTCCCGGGACCGGGAAAAGCATGCTGGCCGCGCGTTTGCCCGGATTGCTTCCGCCCCTGCGC
CGCAGCGAAGCCCTGGAAGTGGCCGCCATCCATAGTTTGCAAAGCGGTGGATTTGATGTGCAACGCTGGGGACAACGTCC
CTTTCGCTCCCCTCATCATAGTGCCTCTTCAGCAGCACTGGTGGGTGGGGGATCGCGCCCCCGCCCCGGAGAAATCAGTC
TTGCGCATCATGGCGTTCTTTTTCTGGATGAAATGCCGGAATTCTCGCGGAGTGTTCTCGAAGTGCTTCGGGAACCTCTG
GAGTCCGGCGAAATCCATATTTCCAGAGCCGCCCGGCAAAGCACCTTCCCGGCGCGTTTTCAGTTGATTGCCGCCATGAA
TCCCTGCCCCTGCGGCAATCTCGGTAATCCACATCAGGCCTGTGTGTGCACCCCGACCCAAATTTCCCAGTATCGTGGCC
GTCTTTCCGGCCCTCTGCTGGATCGTATGGACATTCAAATAGAAGTACCGAGTCTTCCGGTAGAACTCCTGCAAAAAGCG
CCAGAAGGCGAAAGTTCTGCCTACTGGCGCGAACGGATCAGCGCGGCCATAAACCGACAGTGGCAACGACAGTCCTGCCG
CAATGCCCAACTGCAGGGCGAAGCGTTGGACCAGCATTGTGCGCTACAGCCCGAGGGCGCCCGCTTACTGACGCGCGCCG
CTGACACCCTGCATCTTTCCGCCCGTGGTTATCACCGCATTTTGCGTCTGGCCCGCAGCATTGCCGATCTGGAAGAAAGC
ACCACGATCAGCCTGCAGCATCTGTCCGAAGCCATCCAGTATCGGCGGCTGGCGCAAAGCTTTACTTTGTAG
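
The nucleotide sequence begins with TTG rather than ATG, while the protein sequence begins with M. This is consistent with TTG acting as an alternative start codon, which is read as methionine at the initiation position. The sketch below illustrates this on two short excerpts of the sequence above (the first 27 nt and the last 18 nt). It uses the standard codon assignments; the set of accepted start codons and the `translate` helper are assumptions made for this example, not part of the record.

```python
from itertools import product

# Standard codon assignments, enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: common bacterial initiation codons, all read as Met at position 1.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(seq: str, first_codon_is_start: bool = True) -> str:
    """Translate an in-frame DNA sequence, stopping at the first stop codon."""
    seq = seq.upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        if i == 0 and first_codon_is_start and codon in START_CODONS:
            residues.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 27 nt and last 18 nt of the comM sequence shown above.
five_prime = "TTGCCTCTGGCGATTGTCCAGAGTCGT"
three_prime = "CAAAGCTTTACTTTGTAG"

print(translate(five_prime))
print(translate(three_prime, first_codon_is_start=False))
```

The two excerpts translate to MPLAIVQSR and QSFTL, which match the first and last residues of the protein sequence listed above.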


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure



Transmembrane helices


Transmembrane helices of the protein were predicted by TMHMM 2.0 and visualized with seqviz and ECharts.





Similar proteins


Only experimentally validated proteins are listed.

Protein | Organism | Identities (%) | Coverage (%) | Ha-value
comM | Vibrio campbellii strain DS40M4 | 51.473 | 100 | 0.521
comM | Vibrio cholerae strain A1552 | 52.515 | 98.807 | 0.519
comM | Legionella pneumophila strain ERS1305867 | 51.503 | 99.205 | 0.511
comM | Legionella pneumophila str. Paris | 51.503 | 99.205 | 0.511
comM | Haemophilus influenzae Rd KW20 | 50.098 | 100 | 0.507
comM | Glaesserella parasuis strain SC1401 | 50.501 | 99.205 | 0.501
RA0C_RS07335 | Riemerella anatipestifer ATCC 11845 = DSM 15868 | 44.51 | 100 | 0.451

