Detailed information
Overview
| Name | comFA/cflA | Type | Machinery gene |
| Locus tag | LACPI_RS10055 | Genome accession | NZ_LN774769 |
| Coordinates | 2073480..2074754 (+) | Length | 424 a.a. |
| NCBI ID | WP_047916184.1 | Uniprot ID | - |
| Organism | Lactococcus piscium MKFS47 | | |
| Function | ssDNA transport into the cell (predicted from homology); DNA binding and uptake | | |
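The Coordinates and Length fields are linked by simple arithmetic. The listed sizes are consistent with 1-based, inclusive intervals, so the span in base pairs is `end - start + 1`, and the protein length is the codon count minus the terminal stop codon. A minimal sketch of that check follows; the function names are illustrative and not part of any database API.

```python
def span_bp(coords: str) -> int:
    """Length in bp of a 1-based, inclusive 'start..end' interval."""
    start, end = (int(x) for x in coords.split(".."))
    return end - start + 1


def protein_length(cds_bp: int) -> int:
    """Amino acids encoded by a complete CDS, excluding the stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1


cds_bp = span_bp("2073480..2074754")
print(cds_bp, protein_length(cds_bp))  # 1275 424
```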
Related MGE
Note: This gene co-localizes with putative mobile genetic elements (MGEs) that VRprofile2 predicts in the genome, as detailed below.
Gene-MGE association summary
| MGE type | MGE coordinates | Gene coordinates | Relative position | Distance (bp) |
|---|---|---|---|---|
| Prophage | 2020518..2079520 | 2073480..2074754 | within | 0 |
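The "within" / 0 entry above can be reproduced by comparing the two intervals. The sketch below assumes that a gene fully contained in the MGE is reported as `within` with distance 0, and that distance otherwise means the gap between the nearest edges. The labels `outside` and `overlapping`, and the function name, are hypothetical and do not come from VRprofile2.

```python
def relative_position(mge: tuple[int, int], gene: tuple[int, int]) -> tuple[str, int]:
    """Classify a gene interval against an MGE interval (1-based, inclusive)."""
    mge_start, mge_end = mge
    gene_start, gene_end = gene
    if mge_start <= gene_start and gene_end <= mge_end:
        return "within", 0                      # gene fully inside the MGE
    if gene_end < mge_start:
        return "outside", mge_start - gene_end  # gene lies before the MGE
    if gene_start > mge_end:
        return "outside", gene_start - mge_end  # gene lies after the MGE
    return "overlapping", 0                     # partial overlap (assumed label)


print(relative_position((2020518, 2079520), (2073480, 2074754)))  # ('within', 0)
```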
Gene organization within MGE regions
Location: 2020518..2079520
| Locus tag | Gene name | Coordinates (strand) | Size (bp) | Protein ID | Product | Description |
|---|---|---|---|---|---|---|
| LACPI_RS09685 (LACPI_1910) | - | 2020518..2021261 (-) | 744 | WP_047916120.1 | ABC transporter ATP-binding protein | - |
| LACPI_RS09690 (LACPI_1911) | - | 2021393..2021926 (-) | 534 | WP_047916121.1 | GNAT family N-acetyltransferase | - |
| LACPI_RS09695 (LACPI_1912) | - | 2022051..2022446 (+) | 396 | WP_047916122.1 | HIT family protein | - |
| LACPI_RS09700 (LACPI_1913) | - | 2022488..2022718 (+) | 231 | WP_047916123.1 | hypothetical protein | - |
| LACPI_RS09705 (LACPI_1914) | - | 2022921..2024054 (+) | 1134 | WP_050703037.1 | DNA/RNA non-specific endonuclease | - |
| LACPI_RS09710 (LACPI_1915) | - | 2024451..2025161 (+) | 711 | WP_047916125.1 | MBL fold metallo-hydrolase | - |
| LACPI_RS09715 (LACPI_1916) | - | 2025145..2025705 (+) | 561 | WP_047916126.1 | TetR/AcrR family transcriptional regulator | - |
| LACPI_RS09720 (LACPI_1917) | - | 2025957..2026577 (-) | 621 | WP_047916127.1 | lipoprotein | - |
| LACPI_RS12840 (LACPI_1918) | - | 2026876..2027511 (+) | 636 | WP_047916128.1 | hypothetical protein | - |
| LACPI_RS09730 | - | 2028181..2028738 (+) | 558 | WP_047916129.1 | hypothetical protein | - |
| LACPI_RS09735 (LACPI_1919) | - | 2028786..2030912 (-) | 2127 | WP_047916130.1 | glycoside hydrolase domain-containing protein | - |
| LACPI_RS09740 (LACPI_1920) | - | 2030952..2032055 (-) | 1104 | WP_047916655.1 | GH25 family lysozyme | - |
| LACPI_RS09745 (LACPI_1921) | - | 2032052..2032380 (-) | 329 | Protein_1940 | holin | - |
| LACPI_RS09750 (LACPI_1922) | - | 2032753..2033490 (-) | 738 | WP_050703038.1 | CHAP domain-containing protein | - |
| LACPI_RS09755 (LACPI_1923) | - | 2033656..2034057 (-) | 402 | WP_047916131.1 | hypothetical protein | - |
| LACPI_RS09760 (LACPI_1924) | - | 2034463..2034807 (+) | 345 | WP_047916132.1 | helix-turn-helix domain-containing protein | - |
| LACPI_RS09765 (LACPI_1925) | - | 2034822..2035409 (+) | 588 | WP_047916133.1 | hypothetical protein | - |
| LACPI_RS09770 (LACPI_1926) | - | 2035825..2036205 (-) | 381 | WP_050703039.1 | MAG6450 family protein | - |
| LACPI_RS09775 (LACPI_1927) | - | 2036208..2036660 (-) | 453 | WP_047916134.1 | Panacea domain-containing protein | - |
| LACPI_RS09780 (LACPI_1928) | - | 2036843..2037691 (-) | 849 | WP_047916135.1 | peptidoglycan amidohydrolase family protein | - |
| LACPI_RS09785 (LACPI_1929) | - | 2037681..2037869 (-) | 189 | WP_047916136.1 | hypothetical protein | - |
| LACPI_RS09790 (LACPI_1930) | - | 2037862..2038080 (-) | 219 | WP_047916137.1 | phage holin | - |
| LACPI_RS09795 (LACPI_1931) | - | 2038085..2038327 (-) | 243 | WP_047916138.1 | hemolysin XhlA family protein | - |
| LACPI_RS09805 (LACPI_1932) | - | 2038532..2042146 (-) | 3615 | WP_047916140.1 | phage tail spike protein | - |
| LACPI_RS09810 (LACPI_1933) | - | 2042146..2043708 (-) | 1563 | WP_047916141.1 | distal tail protein Dit | - |
| LACPI_RS09815 (LACPI_1934) | - | 2043709..2046183 (-) | 2475 | WP_047916142.1 | hypothetical protein | - |
| LACPI_RS09820 (LACPI_1935) | - | 2046413..2047018 (-) | 606 | WP_047916143.1 | hypothetical protein | - |
| LACPI_RS09825 (LACPI_1936) | - | 2047018..2047290 (-) | 273 | WP_047916144.1 | hypothetical protein | - |
| LACPI_RS09830 (LACPI_1937) | - | 2047377..2047757 (-) | 381 | WP_047916145.1 | tail assembly chaperone | - |
| LACPI_RS09835 (LACPI_1938) | - | 2047818..2048321 (-) | 504 | WP_047916146.1 | phage major tail protein, TP901-1 family | - |
| LACPI_RS09840 (LACPI_1939) | - | 2048330..2048719 (-) | 390 | WP_047916147.1 | DUF3168 domain-containing protein | - |
| LACPI_RS09845 (LACPI_1940) | - | 2048732..2049097 (-) | 366 | WP_047916148.1 | HK97-gp10 family putative phage morphogenesis protein | - |
| LACPI_RS09850 (LACPI_1941) | - | 2049090..2049392 (-) | 303 | WP_047916149.1 | hypothetical protein | - |
| LACPI_RS09855 (LACPI_1942) | - | 2049389..2049721 (-) | 333 | WP_047916150.1 | phage head-tail connector protein | - |
| LACPI_RS12780 (LACPI_1943) | - | 2049745..2049888 (-) | 144 | WP_157761141.1 | hypothetical protein | - |
| LACPI_RS09860 (LACPI_1944) | - | 2049902..2050738 (-) | 837 | WP_047916151.1 | N4-gp56 family major capsid protein | - |
| LACPI_RS09865 (LACPI_1945) | - | 2050754..2051353 (-) | 600 | WP_047916152.1 | capsid assembly scaffolding protein Gp46 family protein | - |
| LACPI_RS09870 (LACPI_1946) | - | 2051457..2052365 (-) | 909 | WP_047916153.1 | minor capsid protein | - |
| LACPI_RS09875 (LACPI_1947) | - | 2052369..2053685 (-) | 1317 | WP_047916154.1 | phage portal protein | - |
| LACPI_RS09880 (LACPI_1948) | - | 2053685..2054962 (-) | 1278 | WP_047916155.1 | PBSX family phage terminase large subunit | - |
| LACPI_RS09885 (LACPI_1949) | - | 2054959..2055381 (-) | 423 | WP_047916156.1 | terminase small subunit | - |
| LACPI_RS09890 (LACPI_1950) | - | 2055926..2056330 (-) | 405 | WP_047916157.1 | ArpU family phage packaging/lysis transcriptional regulator | - |
| LACPI_RS09895 (LACPI_1951) | - | 2056386..2056661 (-) | 276 | WP_047916158.1 | hypothetical protein | - |
| LACPI_RS09900 (LACPI_1952) | - | 2056758..2056949 (-) | 192 | WP_047916159.1 | hypothetical protein | - |
| LACPI_RS09905 (LACPI_1953) | - | 2056946..2057500 (-) | 555 | WP_050703041.1 | DUF1642 domain-containing protein | - |
| LACPI_RS09910 (LACPI_1954) | - | 2057497..2057787 (-) | 291 | WP_047916160.1 | hypothetical protein | - |
| LACPI_RS09915 | - | 2057784..2058002 (-) | 219 | WP_047916161.1 | hypothetical protein | - |
| LACPI_RS12380 (LACPI_1955) | - | 2057995..2058507 (-) | 513 | WP_068876856.1 | DUF3310 domain-containing protein | - |
| LACPI_RS12785 (LACPI_1956) | - | 2058497..2058634 (-) | 138 | WP_157761142.1 | hypothetical protein | - |
| LACPI_RS09925 (LACPI_1957) | - | 2058631..2059023 (-) | 393 | WP_047916162.1 | YopX family protein | - |
| LACPI_RS09930 (LACPI_1958) | - | 2059020..2059283 (-) | 264 | WP_047916163.1 | winged helix-turn-helix domain-containing protein | - |
| LACPI_RS09935 (LACPI_1959) | - | 2059295..2060146 (-) | 852 | WP_231858752.1 | DNA-methyltransferase | - |
| LACPI_RS09940 (LACPI_1960) | - | 2060152..2060544 (-) | 393 | WP_047916164.1 | RusA family crossover junction endodeoxyribonuclease | - |
| LACPI_RS09945 (LACPI_1961) | - | 2060541..2060729 (-) | 189 | WP_047916165.1 | hypothetical protein | - |
| LACPI_RS09950 (LACPI_1962) | - | 2060738..2061616 (-) | 879 | WP_047916166.1 | ATP-binding protein | - |
| LACPI_RS09955 (LACPI_1963) | - | 2061628..2062395 (-) | 768 | WP_047916167.1 | conserved phage C-terminal domain-containing protein | - |
| LACPI_RS12790 | - | 2062396..2062551 (-) | 156 | WP_157761143.1 | hypothetical protein | - |
| LACPI_RS09965 (LACPI_1964) | ssb | 2062770..2063210 (-) | 441 | WP_047916169.1 | single-stranded DNA-binding protein | Machinery gene |
| LACPI_RS09970 (LACPI_1965) | - | 2063207..2063827 (-) | 621 | WP_047916170.1 | ERF family protein | - |
| LACPI_RS09975 (LACPI_1966) | - | 2063829..2064239 (-) | 411 | WP_047916171.1 | hypothetical protein | - |
| LACPI_RS09980 (LACPI_1967) | - | 2064243..2064503 (-) | 261 | WP_047916172.1 | hypothetical protein | - |
| LACPI_RS09985 (LACPI_1968) | - | 2064752..2065030 (-) | 279 | WP_047916173.1 | helix-turn-helix domain-containing protein | - |
| LACPI_RS09990 (LACPI_1969) | - | 2065103..2065291 (-) | 189 | WP_050703042.1 | helix-turn-helix domain-containing protein | - |
| LACPI_RS09995 (LACPI_1970) | - | 2065408..2065599 (+) | 192 | WP_047916174.1 | hypothetical protein | - |
| LACPI_RS10000 (LACPI_1971) | - | 2065576..2065809 (-) | 234 | WP_047916175.1 | hypothetical protein | - |
| LACPI_RS10005 (LACPI_1972) | - | 2065867..2066052 (-) | 186 | WP_047916176.1 | hypothetical protein | - |
| LACPI_RS10010 (LACPI_1973) | - | 2066045..2066578 (-) | 534 | WP_050703043.1 | ORF6C domain-containing protein | - |
| LACPI_RS10015 (LACPI_1974) | - | 2066575..2067393 (-) | 819 | WP_157761144.1 | hypothetical protein | - |
| LACPI_RS10020 (LACPI_1975) | - | 2067467..2067766 (-) | 300 | WP_047916178.1 | DUF559 domain-containing protein | - |
| LACPI_RS10025 (LACPI_1976) | - | 2067985..2068239 (-) | 255 | WP_047916179.1 | DUF739 family protein | - |
| LACPI_RS10030 (LACPI_1977) | - | 2068420..2069142 (+) | 723 | WP_047916180.1 | LexA family transcriptional regulator | - |
| LACPI_RS12845 (LACPI_1978) | - | 2069202..2069375 (+) | 174 | WP_099046825.1 | hypothetical protein | - |
| LACPI_RS10035 (LACPI_1979) | - | 2069470..2070420 (+) | 951 | WP_047914552.1 | IS30 family transposase | - |
| LACPI_RS10040 (LACPI_1980) | - | 2070549..2071007 (+) | 459 | WP_047916181.1 | hypothetical protein | - |
| LACPI_RS10045 (LACPI_1981) | - | 2071122..2072204 (+) | 1083 | WP_047916182.1 | tyrosine-type recombinase/integrase | - |
| LACPI_RS10050 (LACPI_1982) | - | 2072767..2073390 (-) | 624 | WP_047916183.1 | YigZ family protein | - |
| LACPI_RS10055 (LACPI_1983) | comFA/cflA | 2073480..2074754 (+) | 1275 | WP_047916184.1 | DEAD/DEAH box helicase | Machinery gene |
| LACPI_RS10060 (LACPI_1984) | - | 2074856..2075398 (+) | 543 | WP_157761145.1 | ComF family protein | - |
| LACPI_RS10065 (LACPI_1985) | - | 2075401..2075802 (-) | 402 | WP_047916186.1 | diacylglycerol kinase family protein | - |
| LACPI_RS10070 (LACPI_1986) | ybeY | 2075786..2076277 (-) | 492 | WP_047916187.1 | rRNA maturation RNase YbeY | - |
| LACPI_RS10075 (LACPI_1987) | - | 2076280..2076732 (-) | 453 | WP_047916188.1 | HIT family protein | - |
| LACPI_RS10080 (LACPI_1988) | - | 2076732..2077715 (-) | 984 | WP_047916189.1 | PhoH family protein | - |
| LACPI_RS10085 (LACPI_1989) | - | 2077712..2077903 (-) | 192 | WP_047916190.1 | hypothetical protein | - |
| LACPI_RS10090 (LACPI_1990) | obgE | 2077964..2079283 (-) | 1320 | WP_047916191.1 | GTPase ObgE | - |
Sequence
Protein
Length: 424 a.a.; Molecular weight: 47396.85 Da; Isoelectric point: 9.5348
>NTDB_id=1113994 LACPI_RS10055 WP_047916184.1 2073480..2074754(+) (comFA/cflA) [Lactococcus piscium MKFS47]
MENLFGRLLTQQELGPDMSLLPQDVIQFAGMAITGKNVICKRCGTASSCQTVRLEIPAYFCPECLQLGRVRSDEFLYHLP
QQPFPRKDALLWTGTLTPYQADISKQLVQAVDQKEQILVHAVTGAGKTEMIYAAISRSLASGGAVCIATPRTDVARELYT
RLSKDFAVPISLLHADSPPYFRTPLVISTTHQLLRFREAFDLLIIDEVDAFPFSDNPALYYAAEHAQKTSATLVYLTATS
TDTLDKLVSSDLLKRITLSRRFHGHPLVVPKSIFSYPEKIIYRHIQKQRQTGFPLLLFAPVIRFGQSFTAQLSKLLPHEK
IGFVASTTENRSGIISQFREGELTILVSTTILERGVTFPKVDVFVLESQHRLFTASSLIQIAGRAGRSIERPTGLVYFFH
NGLTKQMTRAISDIRKMNRLGGFS
Nucleotide
Length: 1275 bp
>NTDB_id=1113994 LACPI_RS10055 WP_047916184.1 2073480..2074754(+) (comFA/cflA) [Lactococcus piscium MKFS47]
ATGGAAAACTTATTTGGCCGACTACTCACCCAGCAAGAACTGGGCCCTGATATGTCCCTTTTACCACAGGATGTCATCCA
GTTTGCTGGAATGGCAATAACAGGAAAGAACGTCATCTGTAAGCGGTGTGGTACTGCATCCTCTTGCCAAACAGTCCGAC
TTGAAATTCCGGCCTATTTTTGTCCTGAATGTCTGCAACTAGGTCGCGTTCGGTCTGATGAATTTCTCTATCACTTACCG
CAACAACCTTTCCCAAGGAAAGACGCCTTACTTTGGACTGGCACACTGACGCCTTATCAGGCAGACATCTCTAAGCAACT
CGTACAGGCAGTGGATCAAAAAGAGCAAATTCTGGTTCATGCGGTTACAGGTGCTGGTAAAACAGAGATGATTTATGCAG
CGATTAGCAGAAGTCTCGCTAGTGGTGGTGCTGTTTGTATCGCTACACCTAGAACTGACGTCGCTCGCGAGCTATATACG
CGTCTATCTAAAGATTTTGCTGTCCCTATCTCGCTCCTACATGCCGATAGTCCCCCTTACTTTCGCACTCCCCTTGTTAT
TTCGACAACGCACCAGCTTCTCCGATTTAGAGAAGCATTTGATTTACTAATCATCGACGAAGTAGATGCCTTTCCTTTCT
CTGATAATCCTGCGCTTTACTATGCAGCTGAGCATGCACAAAAAACATCAGCTACTCTGGTTTATCTGACAGCCACTTCT
ACTGATACGCTAGACAAACTCGTCAGCTCAGACCTATTAAAACGGATTACTTTATCACGTCGCTTTCACGGCCACCCACT
CGTTGTTCCCAAATCAATTTTTAGCTATCCGGAAAAAATCATCTACCGACATATCCAAAAACAGCGCCAAACTGGCTTTC
CCTTATTGCTATTTGCACCTGTCATTCGGTTTGGACAATCGTTTACAGCCCAGTTAAGCAAGCTTTTGCCCCACGAGAAG
ATAGGGTTTGTTGCATCGACTACTGAAAATCGCTCTGGGATCATCTCCCAGTTTCGCGAAGGTGAGTTAACGATTCTAGT
TTCTACCACCATATTAGAACGCGGGGTCACGTTTCCGAAAGTTGATGTTTTTGTACTAGAAAGTCAGCATCGACTCTTCA
CAGCCTCTAGTCTGATTCAGATTGCTGGACGTGCTGGACGAAGTATTGAGCGGCCGACTGGTCTTGTCTATTTTTTTCAT
AATGGCCTAACAAAACAGATGACTCGTGCTATTTCTGATATTCGCAAGATGAATCGACTTGGTGGCTTCTCATGA
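The nucleotide and protein records describe the same gene, so translating the coding sequence should reproduce the protein. The sketch below is a minimal translator that uses the standard codon assignments, which I believe the bacterial table (NCBI translation table 11) shares apart from its alternative start codons. It is an illustration, not the database's own pipeline, and the names `CODON_TABLE` and `translate` are made up for this example.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nt of the record above give the first 10 residues of the protein.
print(translate("ATGGAAAACTTATTTGGCCGACTACTCACC"))  # MENLFGRLLT
```

Joining the FASTA lines of the full nucleotide record and passing them to `translate` should, under the same assumption, return the protein sequence shown in the Protein section.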
3D structure
| Source | ID | Structure |
|---|---|---|
Similar proteins
Only experimentally validated proteins are listed.
| Protein | Organism | Identities (%) | Coverage (%) | Ha-value |
|---|---|---|---|---|
| comFA/cflA | Streptococcus mitis SK321 | 53.596 | 100 | 0.545 |
| comFA/cflA | Streptococcus mitis NCTC 12261 | 51.972 | 100 | 0.528 |
| comFA/cflA | Streptococcus pneumoniae D39 | 51.74 | 100 | 0.526 |
| comFA/cflA | Streptococcus pneumoniae Rx1 | 51.74 | 100 | 0.526 |
| comFA/cflA | Streptococcus pneumoniae R6 | 51.74 | 100 | 0.526 |
| comFA/cflA | Streptococcus pneumoniae TIGR4 | 51.508 | 100 | 0.524 |
| comFA | Lactococcus lactis subsp. cremoris KW2 | 55.696 | 93.16 | 0.519 |
| comFA | Bacillus subtilis subsp. subtilis str. 168 | 37.44 | 97.642 | 0.366 |