Detailed information    

insolico Bioinformatically predicted

Overview


Name   comEA   Type   Machinery gene
Locus tag   NSQ79_RS08005 Genome accession   NZ_CP150200
Coordinates   1742710..1743405 (-) Length   231 a.a.
NCBI ID   WP_004183181.1    Uniprot ID   -
Organism   Streptococcus sp. FSL W7-1342     
Function   dsDNA binding to the cell surface (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 1737710..1748405
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  NSQ79_RS07990 (NSQ79_07990) - 1738521..1739066 (-) 546 WP_164332044.1 GNAT family N-acetyltransferase -
  NSQ79_RS07995 (NSQ79_07995) - 1739132..1740394 (-) 1263 WP_013991032.1 UDP-N-acetylglucosamine 1-carboxyvinyltransferase -
  NSQ79_RS08000 (NSQ79_08000) comEC/celB 1740480..1742720 (-) 2241 WP_339313492.1 DNA internalization-related competence protein ComEC/Rec2 Machinery gene
  NSQ79_RS08005 (NSQ79_08005) comEA 1742710..1743405 (-) 696 WP_004183181.1 helix-hairpin-helix domain-containing protein Machinery gene
  NSQ79_RS08010 (NSQ79_08010) - 1743514..1744269 (-) 756 WP_048790321.1 1-acyl-sn-glycerol-3-phosphate acyltransferase -
  NSQ79_RS08015 (NSQ79_08015) - 1744400..1745335 (+) 936 WP_129852028.1 polysaccharide deacetylase family protein -
  NSQ79_RS08020 (NSQ79_08020) - 1745385..1746149 (+) 765 WP_037599202.1 tRNA1(Val) (adenine(37)-N6)-methyltransferase -
  NSQ79_RS08025 (NSQ79_08025) - 1746153..1746422 (+) 270 WP_084870988.1 GIY-YIG nuclease family protein -
  NSQ79_RS08030 (NSQ79_08030) - 1746514..1746900 (-) 387 WP_045768432.1 IS110 family transposase -
  NSQ79_RS08035 (NSQ79_08035) - 1747022..1747681 (-) 660 WP_339313495.1 transposase -

Sequence


Protein


Download         Length: 231 a.a.        Molecular weight: 24284.10 Da        Isoelectric Point: 4.5517

>NTDB_id=966115 NSQ79_RS08005 WP_004183181.1 1742710..1743405(-) (comEA) [Streptococcus sp. FSL W7-1342]
MKEKILAYVKDNCLFVSVIAVLMVIFCFFLWMTYGAGNSMEAETSYTDVTALSTSSSKQSSQSLSEASSQSKTEGSEKGE
SKVTVDVKGAVANPGVYTLKASARVTDAIKAAGGMTEDADAKSVNLAASLSDEEVVYVATKDENLSVLGQSGTGQVSDKG
GQTSAKDGKINLNTATSEELQTISGIGAKRAEDIIAYRESHGGFQSVDDLKNVSGIGDKTLDKIRESLYVA

Nucleotide


Download         Length: 696 bp        

>NTDB_id=966115 NSQ79_RS08005 WP_004183181.1 1742710..1743405(-) (comEA) [Streptococcus sp. FSL W7-1342]
GTGAAGGAAAAGATTCTAGCCTATGTCAAAGATAATTGTCTGTTTGTGAGTGTTATTGCTGTACTGATGGTGATTTTTTG
CTTCTTCCTATGGATGACTTACGGTGCCGGCAACAGCATGGAGGCGGAGACGTCTTATACAGATGTGACAGCTTTGTCAA
CCTCCTCCTCCAAACAAAGCTCACAGTCTCTTTCTGAGGCGTCTTCCCAGTCAAAGACTGAAGGAAGTGAAAAGGGTGAG
TCAAAAGTAACGGTAGATGTTAAGGGGGCTGTGGCTAATCCGGGTGTTTATACCTTAAAAGCAAGCGCTAGGGTGACTGA
TGCCATCAAAGCTGCTGGGGGAATGACTGAGGATGCGGATGCTAAGAGTGTTAATTTAGCCGCAAGTCTGTCGGACGAAG
AGGTTGTCTATGTGGCAACTAAGGATGAAAACCTCTCTGTTCTTGGTCAATCAGGAACTGGTCAGGTCTCTGACAAAGGA
GGGCAAACTAGTGCTAAGGATGGTAAAATCAACTTAAACACAGCGACCTCAGAGGAGTTGCAAACCATTTCTGGAATTGG
AGCTAAGCGGGCAGAGGATATCATTGCCTATCGTGAAAGTCATGGAGGTTTTCAATCCGTAGATGACTTGAAAAATGTCT
CAGGAATTGGTGATAAAACTTTAGATAAAATCAGAGAGTCCCTCTATGTGGCTTAA

Domains


Predicted by InterproScan.

(85-138)

(168-228)


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comEA Streptococcus thermophilus LMD-9

91.775

100

0.918

  comEA/celA/cilE Streptococcus mitis SK321

41.048

99.134

0.407

  comEA/celA/cilE Streptococcus pneumoniae TIGR4

40.611

99.134

0.403


Multiple sequence alignment