Detailed information    

insolico Bioinformatically predicted

Overview


Name   comEC/celB   Type   Machinery gene
Locus tag   H1W98_RS07865 Genome accession   NZ_LR822026
Coordinates   1529648..1531888 (-) Length   746 a.a.
NCBI ID   WP_179972172.1    Uniprot ID   -
Organism   Streptococcus thermophilus isolate STH_CIRM_967     
Function   ssDNA transport into the cell (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 1524648..1536888
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  H1W98_RS07845 (STHERMO_1778) - 1525512..1526372 (-) 861 WP_002951571.1 methionyl aminopeptidase -
  H1W98_RS07850 (STHERMO_1779) spxR 1526416..1527696 (-) 1281 WP_179972170.1 CBS-HotDog domain-containing transcription factor SpxR -
  H1W98_RS07855 - 1527689..1528234 (-) 546 Protein_1499 GNAT family N-acetyltransferase -
  H1W98_RS07860 (STHERMO_1782) - 1528300..1529562 (-) 1263 WP_179972171.1 UDP-N-acetylglucosamine 1-carboxyvinyltransferase -
  H1W98_RS07865 (STHERMO_1783) comEC/celB 1529648..1531888 (-) 2241 WP_179972172.1 DNA internalization-related competence protein ComEC/Rec2 Machinery gene
  H1W98_RS07870 (STHERMO_1784) comEA 1531878..1532573 (-) 696 WP_180482451.1 helix-hairpin-helix domain-containing protein Machinery gene
  H1W98_RS07875 (STHERMO_1785) - 1532680..1533435 (-) 756 WP_180482450.1 lysophospholipid acyltransferase family protein -
  H1W98_RS11245 - 1533565..1534474 (+) 910 Protein_1504 polysaccharide deacetylase family protein -
  H1W98_RS07885 (STHERMO_1788) - 1534524..1535288 (+) 765 WP_180482449.1 tRNA1(Val) (adenine(37)-N6)-methyltransferase -
  H1W98_RS07890 (STHERMO_1789) - 1535292..1535561 (+) 270 WP_180482448.1 GIY-YIG nuclease family protein -

Sequence


Protein


Download         Length: 746 a.a.        Molecular weight: 84981.00 Da        Isoelectric Point: 9.9593

>NTDB_id=1131294 H1W98_RS07865 WP_179972172.1 1529648..1531888(-) (comEC/celB) [Streptococcus thermophilus isolate STH_CIRM_967]
MWLKKAPINLFSLALLIVALYFTIFVSNFYAIGTFVFLMICFLRHHWKNRAALKLVGLIGGFFLVYFLFLYNIAIMQDKQ
APAEIHQVTLVADTLSVNGERLSAIGKSNGQTYRVFYRLKSDKEQHFFKTTSQTLVLKGKIKLSSATGQRNFQGFDYQSY
LASQGIYRIAQIERLEHVVTPKSISPIAFFHQLRRKALVHIQTHFPNPMRHYMTGLLFGYLDKEFDEQSQLYTSLGIIHL
FALSGMQVGFFLGWFRYGLLRLGLPKNYILAILLPFSIFYGLMTGWTASVLRSLIQSLLAECGIKKLNNMGITLLLFFLV
LPHFLLTVGGVLSCSYAFLLCLFNFEEMPSFKKSIYMSLVLSLGTLPFLTYYYGTFQPLSLILTAIFSLVFDSFLLPVLT
VLFTLSGMVIFSQFNPLFEWMEFFLTWIQSWGVQPLILGKPSLFQFVSMIFVLVLLFDFWKKPQFRISLLMIFSLLMVWV
KHPLINEVTVVDVGQGDSIFLRSMKGETVLIDVGGKVTFVSKEKWQEGSQTSNAEKTLIPYLQERGVSQIDYLVLTHTDT
DHIGDLEEVAKRFKIKEICVSKGALTKPSFAKRIRFLKRPVRTLKAGDKLTMMGSNLQVLYPNKIGDGGNNDSLVLYGKL
LGTSFLFTGDLEKEGEEELMASYPNLKVRVLKAGHHGSKGSSSEAFLDQLKPSLALVSAGENNRYKHPNDETLERFKERH
IKVLRTDLDGAIRFKGWFQLSSETVR

Nucleotide


Download         Length: 2241 bp        

>NTDB_id=1131294 H1W98_RS07865 WP_179972172.1 1529648..1531888(-) (comEC/celB) [Streptococcus thermophilus isolate STH_CIRM_967]
ATGTGGCTTAAAAAAGCTCCAATCAATCTTTTTTCCCTAGCGCTTTTAATAGTTGCTCTTTATTTTACTATTTTTGTATC
TAATTTCTATGCTATTGGGACTTTTGTCTTTCTGATGATATGTTTTTTGAGGCATCATTGGAAGAATAGAGCAGCTCTAA
AGTTGGTGGGACTTATAGGTGGTTTCTTTTTGGTCTACTTTTTATTTTTATACAATATAGCTATCATGCAAGATAAACAA
GCTCCTGCTGAAATTCATCAGGTGACACTGGTTGCAGATACGCTCTCAGTTAATGGTGAGCGATTATCAGCAATCGGGAA
GTCAAATGGACAAACCTATCGGGTCTTTTACCGGCTAAAATCTGATAAGGAGCAGCATTTTTTTAAGACTACGAGTCAAA
CGCTTGTTTTAAAAGGAAAAATAAAGTTGTCCTCGGCAACTGGTCAACGTAATTTTCAAGGCTTTGATTATCAGTCTTAT
CTAGCTAGTCAGGGCATTTATAGGATTGCTCAGATTGAGCGTCTGGAACATGTCGTAACCCCAAAATCGATATCTCCAAT
AGCTTTTTTTCATCAATTGAGGAGGAAGGCTCTAGTTCATATTCAGACGCATTTTCCTAATCCGATGAGACACTACATGA
CAGGACTGCTCTTTGGGTATTTGGACAAGGAGTTTGATGAGCAGAGTCAACTCTACACTAGCTTGGGGATTATTCATCTT
TTTGCGCTATCGGGTATGCAGGTTGGATTTTTTCTTGGCTGGTTTCGCTATGGACTTCTCCGTTTGGGGCTTCCCAAAAA
TTATATCCTTGCTATCTTATTACCTTTTTCGATTTTCTATGGCTTAATGACTGGTTGGACGGCTTCGGTCTTACGTTCTT
TGATTCAAAGCCTCTTGGCTGAGTGTGGTATTAAAAAACTGAACAATATGGGCATAACGCTACTTCTATTTTTTCTAGTC
TTGCCTCATTTTCTTTTAACGGTAGGAGGTGTTTTAAGTTGTTCTTATGCCTTCTTGTTGTGTTTATTTAATTTTGAGGA
GATGCCGTCCTTTAAAAAGTCAATTTATATGAGTTTAGTATTGAGTCTTGGGACTTTGCCTTTTTTGACCTACTATTATG
GAACTTTTCAACCATTGAGTTTGATTCTGACGGCAATCTTCTCTCTAGTTTTTGATAGCTTTCTCTTACCTGTCTTAACA
GTACTTTTTACACTTTCAGGAATGGTAATTTTTTCTCAATTTAATCCACTTTTTGAATGGATGGAGTTCTTTTTGACTTG
GATACAATCTTGGGGAGTCCAGCCATTGATTTTAGGAAAACCTAGCTTGTTTCAGTTTGTCTCAATGATATTTGTATTGG
TTTTGCTCTTTGATTTTTGGAAAAAGCCTCAGTTTAGAATATCCCTCTTGATGATTTTTAGTCTTTTGATGGTCTGGGTC
AAACATCCTTTAATAAATGAAGTAACGGTGGTCGACGTTGGTCAAGGAGATAGTATTTTTTTAAGGAGTATGAAAGGTGA
GACGGTCCTAATAGATGTAGGTGGCAAGGTGACTTTCGTCTCTAAGGAAAAATGGCAAGAGGGGAGCCAGACGAGTAATG
CGGAGAAAACCTTAATTCCCTATTTACAGGAAAGGGGAGTGTCTCAAATTGACTATCTGGTTCTGACTCATACGGATACA
GATCATATTGGTGATTTGGAGGAAGTGGCTAAACGGTTTAAGATTAAGGAAATTTGTGTCAGTAAGGGGGCTTTGACTAA
GCCTAGTTTTGCCAAACGTATTCGATTTCTTAAACGTCCCGTTCGCACTTTAAAGGCTGGTGATAAGCTGACTATGATGG
GAAGTAACCTACAGGTTCTTTATCCTAATAAAATTGGTGATGGTGGTAACAATGATTCACTTGTTCTCTATGGAAAGTTA
TTGGGAACCAGTTTTTTGTTTACTGGTGATTTAGAAAAAGAAGGAGAGGAGGAATTAATGGCTAGCTATCCAAATTTAAA
AGTAAGGGTCCTAAAAGCAGGACATCACGGTTCTAAAGGCTCTTCATCAGAAGCTTTTTTGGATCAGCTAAAGCCATCAC
TTGCTCTTGTCTCAGCTGGTGAAAATAATCGTTATAAGCATCCAAATGATGAAACATTAGAGCGTTTCAAAGAACGTCAC
ATTAAGGTTTTACGCACAGACCTGGATGGTGCCATTCGGTTTAAGGGATGGTTCCAACTATCAAGTGAAACTGTCCGATA
A


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comEC/celB Streptococcus mitis NCTC 12261

48.193

100

0.483

  comEC/celB Streptococcus mitis SK321

46.791

100

0.469

  comEC/celB Streptococcus pneumoniae TIGR4

46.123

100

0.462

  comEC/celB Streptococcus pneumoniae Rx1

45.856

100

0.46

  comEC/celB Streptococcus pneumoniae D39

45.856

100

0.46

  comEC/celB Streptococcus pneumoniae R6

45.856

100

0.46

  comEC Lactococcus lactis subsp. cremoris KW2

44.69

97.185

0.434


Multiple sequence alignment