|
Putative Escherichia coli A39 RM systems
Ref: Kang,Y. et al. Brief Bioinform 22 (4) (2021)
REBASE ref # 34039
Complete sequence: 4,746,311 bp
GenBank #: CP028737
REBASE acronym: EcoA39
Org_num: 56594
All begin AOY92_
Type I | ||||
---|---|---|---|---|
ORF | Gene | Most similar | Specificity | Name |
20850 | 322 aa hypothetical protein | |||
20855 | R | EcoXH993ORF23720P (100% identity) | AAGNNNNNNNRTTTC | EcoA39IP |
20860 | M | M.EcoZ503ORFCP (97% identity) | AAGNNNNNNNRTTTC | M.EcoA39I |
20865 | S | S.Ecoa005ORF2260P (100% identity) | AAGNNNNNNNRTTTC | S.EcoA39I |
20870 | endoribonuclease SymE | |||
21090 | 128 aa hypothetical protein | |||
21095 | M | M.EcoSTEC866ORF20040P (99% identity) | CCAYNNNNNTGT | M.EcoA39II |
21100 | S | S.EcoSTEC640ORF20915P (100% identity) | CCAYNNNNNTGT | S.EcoA39II |
21105 | 433 aa hypothetical protein | |||
Type II | ||||
ORF | Gene | Most similar | Specificity | Name |
2660 | DUF2556 domain-containing protein | |||
2665 | M | M.SspS13ORF21055P (100% identity) | ATGCAT | M.EcoA39ORF2665P |
2670 | Fis family transcriptional regulator | |||
9860 | phosphohydrolase | |||
9865 | M | M.SflLIN6DcmP (100% identity) | CCWGG | M.EcoA39DcmP |
9870 | V | V.SflFF64DcmP (100% identity) | CCWGG | V.EcoA39DcmP |
9875 | EamA family transporter | |||
17255 | LexA family transcriptional regulator | |||
17260 | M | M.Eco0157ORF19825P (100% identity) | GATC | M.EcoA39ORF17260P |
17265 | PerC family transcriptional regulator | |||
18835 | toxin | |||
18840 | M | M.EcoC9120ORF3586P (100% identity) | M.EcoA39ORF18840P | |
18845 | R | EcoC9120ORF3586P (100% identity) | EcoA39ORF18840P | |
18855 | glutamate-5-semialdehyde dehydrogenase | |||
21070 | sialate O-acetylesterase | |||
21075 | R | EcoSTEC640ORF20895P (100% identity) | EcoA39ORF21075P | |
21080 | restriction endonuclease | |||
22505 | 74 aa hypothetical protein | |||
22515 | M | M.EcoM3ORF410P (94% identity) | M.EcoA39ORF22515P | |
22520 | TrmB family transcriptional regulator | |||
22550 | LexA family transcriptional regulator | |||
22555 | M | M.Sso90ORF15820P (99% identity) | GATC | M.EcoA39ORF22555P |
22560 | 164 aa hypothetical protein | |||
Type IV | ||||
ORF | Gene | Most similar | Specificity | Name |
21075 | HNH endonuclease | |||
21080 | R | SflN101McrBP (100% identity) | EcoA39McrBP | |
21085 | R | SflN101McrCP (100% identity) | EcoA39McrCP | |
21090 | 128 aa hypothetical protein |