Detailed information    

insolico Bioinformatically predicted

Overview


Name   comF   Type   Machinery gene
Locus tag   IMZ16_RS03575 Genome accession   NZ_CP063145
Coordinates   769661..770311 (+) Length   216 a.a.
NCBI ID   WP_193440521.1    Uniprot ID   A0A7M1T5G6
Organism   Cruoricaptor ignavus strain M1214     
Function   ssDNA transport into the cell (predicted from homology)   
DNA binding and uptake

Genomic Context


Location: 764661..775311
Locus tag Gene name Coordinates (strand) Size (bp) Protein ID Product Description
  IMZ16_RS03535 (IMZ16_03535) - 764762..764971 (+) 210 WP_073180683.1 DUF2683 family protein -
  IMZ16_RS03540 (IMZ16_03540) - 764953..765234 (+) 282 WP_193440515.1 Txe/YoeB family addiction module toxin -
  IMZ16_RS03545 (IMZ16_03545) - 765239..765970 (-) 732 WP_193440516.1 DUF4230 domain-containing protein -
  IMZ16_RS03550 (IMZ16_03550) - 766014..766565 (-) 552 WP_193440517.1 hypothetical protein -
  IMZ16_RS03555 (IMZ16_03555) - 766565..767566 (-) 1002 WP_193440518.1 acyl transferase -
  IMZ16_RS03560 (IMZ16_03560) - 767855..768247 (-) 393 WP_317173238.1 GxxExxY protein -
  IMZ16_RS03565 (IMZ16_03565) - 768328..769098 (-) 771 WP_193440519.1 UDP-2,3-diacylglucosamine diphosphatase -
  IMZ16_RS03570 (IMZ16_03570) - 769100..769543 (-) 444 WP_193440520.1 6-carboxytetrahydropterin synthase -
  IMZ16_RS03575 (IMZ16_03575) comF 769661..770311 (+) 651 WP_193440521.1 ComF family protein Machinery gene
  IMZ16_RS03580 (IMZ16_03580) upp 770308..770955 (-) 648 WP_193440522.1 uracil phosphoribosyltransferase -
  IMZ16_RS03585 (IMZ16_03585) der 770967..772274 (-) 1308 WP_193440523.1 ribosome biogenesis GTPase Der -
  IMZ16_RS03590 (IMZ16_03590) - 772751..772957 (-) 207 Protein_709 tyrosine-type recombinase/integrase -
  IMZ16_RS03595 (IMZ16_03595) - 773084..773614 (-) 531 WP_193440524.1 hypothetical protein -
  IMZ16_RS03600 (IMZ16_03600) - 773629..773997 (-) 369 WP_193440525.1 hypothetical protein -
  IMZ16_RS03605 (IMZ16_03605) - 774003..774179 (-) 177 WP_159430207.1 hypothetical protein -

Sequence


Protein


Download         Length: 216 a.a.        Molecular weight: 25097.31 Da        Isoelectric Point: 9.7070

>NTDB_id=493323 IMZ16_RS03575 WP_193440521.1 769661..770311(+) (comF) [Cruoricaptor ignavus strain M1214]
MLLDLLLPNRCLHCNRIIPAAMPVCEACEAQIHYTHWNFDDKNPLAQKCRMLFPTEKAFALMHFGKEGLSRELIHSLKYR
QREILGKMLAERVAERVIFDDDKPQLIASVPLHPKKLRQRGYNQLHLFADALSEKWKIPHNKNLLKRNIHKKSQATSKFD
ERFKTQNIFSLSKTIENQHIMVVDDVFTTGNTMASIAWEFLKSEGNRVSVLVMAMD

Nucleotide


Download         Length: 651 bp        

>NTDB_id=493323 IMZ16_RS03575 WP_193440521.1 769661..770311(+) (comF) [Cruoricaptor ignavus strain M1214]
ATGCTTCTCGATTTGCTGCTACCGAACCGCTGCCTGCATTGCAACCGCATCATCCCTGCTGCAATGCCGGTTTGCGAAGC
CTGCGAAGCGCAAATCCATTACACCCACTGGAATTTCGACGATAAAAATCCGCTGGCGCAAAAATGCCGGATGCTCTTCC
CTACCGAAAAAGCCTTCGCGCTAATGCATTTCGGCAAGGAAGGACTGAGCCGCGAACTCATCCACAGCCTGAAATACAGG
CAGCGCGAAATTCTTGGGAAAATGCTTGCCGAGCGAGTTGCCGAGCGCGTTATTTTTGATGACGACAAACCGCAGCTGAT
AGCATCTGTGCCGCTTCATCCTAAAAAACTCCGACAGCGCGGCTACAATCAGCTGCATCTTTTTGCGGATGCTTTGTCGG
AAAAATGGAAAATTCCGCACAATAAAAATTTGCTGAAAAGAAACATCCATAAAAAATCCCAAGCCACCAGTAAATTTGAC
GAGAGATTTAAAACGCAAAACATTTTTAGCCTAAGCAAAACAATTGAAAATCAGCACATTATGGTTGTAGATGATGTATT
CACCACGGGCAATACGATGGCAAGCATTGCGTGGGAATTCCTGAAATCAGAGGGCAACAGGGTAAGCGTTTTGGTAATGG
CGATGGATTAG

Domains


Predicted by InterproScan.

(2-34)


Secondary structure


Protein secondary structures were predicted by S4PRED and visualized by seqviz.



3D structure


Source ID Structure
  AlphaFold DB A0A7M1T5G6

Transmembrane helices


Transmembrane helices of protein were predicted by TMHMM 2.0 and visualized by seqviz and ECharts.



Visualization of predicted probability:


Similar proteins


Only experimentally validated proteins are listed.

Protein Organism Identities (%) Coverage (%) Ha-value
  comF Riemerella anatipestifer ATCC 11845 = DSM 15868

49.074

100

0.491