assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_007750395.1_ASM775039v1	NZ_CP036281	Planctomycetes bacterium Pla110 chromosome, complete genome	1	877359-877511	1	CRISPRCasFinder	no		DinG,WYL,cas3,csa3,RT,DEDDh	Orphan	GCAGCTTCAGGAGCACAGCAGCTGGG	26	0	0	NA	NA	NA	2	2	Orphan	DinG,WYL,cas3,csa3,RT,DEDDh	NA|342aa|up_8|NZ_CP036281.1_869613_870639_+,NA|161aa|up_6|NZ_CP036281.1_872440_872923_-,NA|139aa|up_4|NZ_CP036281.1_874443_874860_-,NA|160aa|up_3|NZ_CP036281.1_874856_875336_-,NA|86aa|up_2|NZ_CP036281.1_875339_875597_-,NA|91aa|up_1|NZ_CP036281.1_875547_875820_-,NA|420aa|down_3|NZ_CP036281.1_881767_883027_+,NA|161aa|down_4|NZ_CP036281.1_883115_883598_+,NA|399aa|down_5|NZ_CP036281.1_883755_884952_+,NA|179aa|down_6|NZ_CP036281.1_885247_885784_+,NA|91aa|down_7|NZ_CP036281.1_885934_886207_+,NA|127aa|down_8|NZ_CP036281.1_886253_886634_+,NA|142aa|down_9|NZ_CP036281.1_886623_887049_+	NA|317aa|up_9|NZ_CP036281.1_868504_869455_-	COG1312, UxuA, D-mannonate dehydratase [Carbohydrate transport and metabolism]	NA|342aa|up_8|NZ_CP036281.1_869613_870639_+	NA	NA|462aa|up_7|NZ_CP036281.1_870845_872231_-	PRK15452, PRK15452, putative protease; Provisional	NA|161aa|up_6|NZ_CP036281.1_872440_872923_-	NA	NA|270aa|up_5|NZ_CP036281.1_872949_873759_-	cd02947, TRX_family, TRX family; composed of two groups: Group I, which includes proteins that exclusively encode a TRX domain; and Group II, which are composed of fusion proteins of TRX and additional domains	NA|139aa|up_4|NZ_CP036281.1_874443_874860_-	NA	NA|160aa|up_3|NZ_CP036281.1_874856_875336_-	NA	NA|86aa|up_2|NZ_CP036281.1_875339_875597_-	NA	NA|91aa|up_1|NZ_CP036281.1_875547_875820_-	NA	NA|84aa|up_0|NZ_CP036281.1_876102_876354_+	PRK10457, PRK10457, hypothetical protein; Provisional	NA|313aa|down_0|NZ_CP036281.1_878632_879571_-	cd05230, UGD_SDR_e, UDP-glucuronate decarboxylase (UGD) and related proteins, extended (e) SDRs	NA|257aa|down_1|NZ_CP036281.1_880164_880935_+	cd07331, M48C_Oma1_like, Peptidase M48C, integral membrane endopeptidase	NA|179aa|down_2|NZ_CP036281.1_881026_881563_-	pfam07609, DUF1572, Protein of unknown function (DUF1572)	NA|420aa|down_3|NZ_CP036281.1_881767_883027_+	NA	NA|161aa|down_4|NZ_CP036281.1_883115_883598_+	NA	NA|399aa|down_5|NZ_CP036281.1_883755_884952_+	NA	NA|179aa|down_6|NZ_CP036281.1_885247_885784_+	NA	NA|91aa|down_7|NZ_CP036281.1_885934_886207_+	NA	NA|127aa|down_8|NZ_CP036281.1_886253_886634_+	NA	NA|142aa|down_9|NZ_CP036281.1_886623_887049_+	NA
GCF_007750395.1_ASM775039v1	NZ_CP036281	Planctomycetes bacterium Pla110 chromosome, complete genome	2	877584-877693	2	CRISPRCasFinder	no		DinG,WYL,cas3,csa3,RT,DEDDh	Orphan	GCAGCTTCAGGAGCACAGCAGCTGGG	26	0	0	NA	NA	NA	1	1	Orphan	DinG,WYL,cas3,csa3,RT,DEDDh	NA|342aa|up_8|NZ_CP036281.1_869613_870639_+,NA|161aa|up_6|NZ_CP036281.1_872440_872923_-,NA|139aa|up_4|NZ_CP036281.1_874443_874860_-,NA|160aa|up_3|NZ_CP036281.1_874856_875336_-,NA|86aa|up_2|NZ_CP036281.1_875339_875597_-,NA|91aa|up_1|NZ_CP036281.1_875547_875820_-,NA|420aa|down_3|NZ_CP036281.1_881767_883027_+,NA|161aa|down_4|NZ_CP036281.1_883115_883598_+,NA|399aa|down_5|NZ_CP036281.1_883755_884952_+,NA|179aa|down_6|NZ_CP036281.1_885247_885784_+,NA|91aa|down_7|NZ_CP036281.1_885934_886207_+,NA|127aa|down_8|NZ_CP036281.1_886253_886634_+,NA|142aa|down_9|NZ_CP036281.1_886623_887049_+	NA|317aa|up_9|NZ_CP036281.1_868504_869455_-	COG1312, UxuA, D-mannonate dehydratase [Carbohydrate transport and metabolism]	NA|342aa|up_8|NZ_CP036281.1_869613_870639_+	NA	NA|462aa|up_7|NZ_CP036281.1_870845_872231_-	PRK15452, PRK15452, putative protease; Provisional	NA|161aa|up_6|NZ_CP036281.1_872440_872923_-	NA	NA|270aa|up_5|NZ_CP036281.1_872949_873759_-	cd02947, TRX_family, TRX family; composed of two groups: Group I, which includes proteins that exclusively encode a TRX domain; and Group II, which are composed of fusion proteins of TRX and additional domains	NA|139aa|up_4|NZ_CP036281.1_874443_874860_-	NA	NA|160aa|up_3|NZ_CP036281.1_874856_875336_-	NA	NA|86aa|up_2|NZ_CP036281.1_875339_875597_-	NA	NA|91aa|up_1|NZ_CP036281.1_875547_875820_-	NA	NA|84aa|up_0|NZ_CP036281.1_876102_876354_+	PRK10457, PRK10457, hypothetical protein; Provisional	NA|313aa|down_0|NZ_CP036281.1_878632_879571_-	cd05230, UGD_SDR_e, UDP-glucuronate decarboxylase (UGD) and related proteins, extended (e) SDRs	NA|257aa|down_1|NZ_CP036281.1_880164_880935_+	cd07331, M48C_Oma1_like, Peptidase M48C, integral membrane endopeptidase	NA|179aa|down_2|NZ_CP036281.1_881026_881563_-	pfam07609, DUF1572, Protein of unknown function (DUF1572)	NA|420aa|down_3|NZ_CP036281.1_881767_883027_+	NA	NA|161aa|down_4|NZ_CP036281.1_883115_883598_+	NA	NA|399aa|down_5|NZ_CP036281.1_883755_884952_+	NA	NA|179aa|down_6|NZ_CP036281.1_885247_885784_+	NA	NA|91aa|down_7|NZ_CP036281.1_885934_886207_+	NA	NA|127aa|down_8|NZ_CP036281.1_886253_886634_+	NA	NA|142aa|down_9|NZ_CP036281.1_886623_887049_+	NA
GCF_007750395.1_ASM775039v1	NZ_CP036281	Planctomycetes bacterium Pla110 chromosome, complete genome	3	4528248-4528366	3	CRISPRCasFinder	no	cas3	DinG,WYL,cas3,csa3,RT,DEDDh	Unclear	AGCAGGAACCACGGGAACCACGGG	24	0	0	NA	NA	NA	2	2	Unclear	DinG,WYL,cas3,csa3,RT,DEDDh	NA|236aa|up_9|NZ_CP036281.1_4508132_4508840_+,NA|233aa|up_8|NZ_CP036281.1_4508947_4509646_+,NA|153aa|up_3|NZ_CP036281.1_4515326_4515785_+,NA|199aa|down_8|NZ_CP036281.1_4538214_4538811_-	NA|236aa|up_9|NZ_CP036281.1_4508132_4508840_+	NA	NA|233aa|up_8|NZ_CP036281.1_4508947_4509646_+	NA	NA|526aa|up_7|NZ_CP036281.1_4509806_4511384_-	cd16031, G6S_like, unchracterized sulfatase homologous to glucosamine (N-acetyl)-6-sulfatase(G6S, GNS)	NA|404aa|up_6|NZ_CP036281.1_4511543_4512755_-	PRK04135, PRK04135, 2,3-bisphosphoglycerate-independent phosphoglycerate mutase	NA|436aa|up_5|NZ_CP036281.1_4512859_4514167_-	cd03801, GT4_PimA-like, phosphatidyl-myo-inositol mannosyltransferase	NA|246aa|up_4|NZ_CP036281.1_4514255_4514993_-	cd01741, GATase1_1, Subgroup of proteins having the Type 1 glutamine amidotransferase (GATase1) domain	NA|153aa|up_3|NZ_CP036281.1_4515326_4515785_+	NA	NA|883aa|up_2|NZ_CP036281.1_4516035_4518684_+	pfam04932, Wzy_C, O-Antigen ligase	cas3|1094aa|up_1|NZ_CP036281.1_4518924_4522206_+	COG1197, Mfd, Transcription-repair coupling factor (superfamily II helicase) [DNA replication, recombination, and repair / Transcription]	NA|425aa|up_0|NZ_CP036281.1_4522437_4523712_+	PRK00059, prsA, peptidylprolyl isomerase; Provisional	NA|318aa|down_0|NZ_CP036281.1_4528875_4529829_-	COG3386, COG3386, Gluconolactonase [Carbohydrate transport and metabolism]	NA|274aa|down_1|NZ_CP036281.1_4530098_4530920_+	pfam02548, Pantoate_transf, Ketopantoate hydroxymethyltransferase	NA|212aa|down_2|NZ_CP036281.1_4531172_4531808_+	PRK09347, folE, GTP cyclohydrolase I; Provisional	NA|997aa|down_3|NZ_CP036281.1_4532118_4535109_+	PRK11168, glpC, anaerobic glycerol-3-phosphate dehydrogenase subunit C	NA|83aa|down_4|NZ_CP036281.1_4535105_4535354_+	cd00754, Ubl_MoaD, ubiquitin-like (Ubl) domain found in molybdenum cofactor biosynthesis protein D (MoaD) and similar proteins	NA|138aa|down_5|NZ_CP036281.1_4535492_4535906_+	cd00756, MoaE, MoaE family	NA|333aa|down_6|NZ_CP036281.1_4535963_4536962_+	PRK00164, moaA, GTP 3',8-cyclase MoaA	NA|385aa|down_7|NZ_CP036281.1_4537028_4538183_+	TIGR01977, am_tr_V_EF2568, cysteine desulfurase family protein	NA|199aa|down_8|NZ_CP036281.1_4538214_4538811_-	NA	NA|215aa|down_9|NZ_CP036281.1_4539052_4539697_-	PRK05591, rplQ, 50S ribosomal protein L17; Validated
GCF_007750395.1_ASM775039v1	NZ_CP036281	Planctomycetes bacterium Pla110 chromosome, complete genome	4	4800093-4800307	4,1	CRISPRCasFinder,CRT	no		DinG,WYL,cas3,csa3,RT,DEDDh	Orphan	CGGTAGCAGGTTGTGTAAACCGGCTTCTGAGTTTTGCA,CGGTAGCAGGTTTTGTAAACCGG	38,23	0	0	NA	NA	NA:NA	1,4	4	Orphan	DinG,WYL,cas3,csa3,RT,DEDDh	NA|86aa|up_9|NZ_CP036281.1_4780849_4781107_-,NA	NA|86aa|up_9|NZ_CP036281.1_4780849_4781107_-	NA	NA|355aa|up_8|NZ_CP036281.1_4781621_4782686_+	cd01166, KdgK, 2-keto-3-deoxygluconate kinase (KdgK) phosphorylates 2-keto-3-deoxygluconate (KDG) to form 2-keto-3-deoxy-6-phosphogluconate (KDGP)	NA|437aa|up_7|NZ_CP036281.1_4782825_4784136_+	PRK05474, PRK05474, xylose isomerase; Provisional	NA|1108aa|up_6|NZ_CP036281.1_4784241_4787565_+	TIGR02604, Piru_Ver_Nterm, putative membrane-bound dehydrogenase domain	NA|333aa|up_5|NZ_CP036281.1_4787762_4788761_-	cd13964, PT_UbiA_1, UbiA family of prenyltransferases (PTases), Unknown subgroup	NA|485aa|up_4|NZ_CP036281.1_4789830_4791285_+	PRK11883, PRK11883, protoporphyrinogen oxidase; Reviewed	NA|375aa|up_3|NZ_CP036281.1_4791300_4792425_-	pfam01264, Chorismate_synt, Chorismate synthase	NA|261aa|up_2|NZ_CP036281.1_4792697_4793480_+	pfam07758, DUF1614, Protein of unknown function (DUF1614)	NA|749aa|up_1|NZ_CP036281.1_4793521_4795768_-	NF033203, entero_EhxA, enterohemolysin EhxA	NA|703aa|up_0|NZ_CP036281.1_4796124_4798233_-	NF033203, entero_EhxA, enterohemolysin EhxA	NA|276aa|down_0|NZ_CP036281.1_4801228_4802056_+	COG1082, IolE, Sugar phosphate isomerases/epimerases [Carbohydrate transport and metabolism]	NA|230aa|down_1|NZ_CP036281.1_4803020_4803710_-	TIGR02191, Ribonuclease_3, ribonuclease III, bacterial	NA|705aa|down_2|NZ_CP036281.1_4804256_4806371_-	PHA03247, PHA03247, large tegument protein UL36; Provisional	NA|331aa|down_3|NZ_CP036281.1_4808412_4809405_+	cd04187, DPM1_like_bac, Bacterial DPM1_like enzymes are related to eukaryotic DPM1	NA|564aa|down_4|NZ_CP036281.1_4809455_4811147_+	COG1807, ArnT, 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family [Cell envelope biogenesis, outer membrane]	NA|371aa|down_5|NZ_CP036281.1_4811316_4812429_+	PRK05764, PRK05764, aspartate aminotransferase; Provisional	NA|66aa|down_6|NZ_CP036281.1_4812676_4812874_+	PRK00270, rpsU, 30S ribosomal protein S21; Reviewed	NA|231aa|down_7|NZ_CP036281.1_4813057_4813750_+	pfam02545, Maf, Maf-like protein	NA|371aa|down_8|NZ_CP036281.1_4813823_4814936_+	cd03801, GT4_PimA-like, phosphatidyl-myo-inositol mannosyltransferase	NA|251aa|down_9|NZ_CP036281.1_4815190_4815943_+	cd14529, TpbA-like, bacterial protein tyrosine and dual-specificity phosphatases related to Pseudomonas aeruginosa TpbA
GCF_007750395.1_ASM775039v1	NZ_CP036281	Planctomycetes bacterium Pla110 chromosome, complete genome	5	5423318-5423459	5	CRISPRCasFinder	no		DinG,WYL,cas3,csa3,RT,DEDDh	Orphan	CGTTCGCCGGGTTGGGTCCGCGATGGGGACGACGCTCTTCATCG	44	0	0	NA	NA	NA	1	1	Orphan	DinG,WYL,cas3,csa3,RT,DEDDh	NA|167aa|up_6|NZ_CP036281.1_5412358_5412859_-,NA|182aa|up_0|NZ_CP036281.1_5422516_5423062_+,NA|377aa|down_5|NZ_CP036281.1_5429782_5430913_-,NA|248aa|down_8|NZ_CP036281.1_5435248_5435992_-	NA|1137aa|up_9|NZ_CP036281.1_5404918_5408329_-	COG0841, AcrB, Cation/multidrug efflux pump [Defense mechanisms]	NA|304aa|up_8|NZ_CP036281.1_5408960_5409872_+	pfam07596, SBP_bac_10, Protein of unknown function (DUF1559)	NA|812aa|up_7|NZ_CP036281.1_5409910_5412346_+	TIGR04039, MXAN_0977_Heme2, di-heme enzyme, MXAN_0977 family	NA|167aa|up_6|NZ_CP036281.1_5412358_5412859_-	NA	NA|519aa|up_5|NZ_CP036281.1_5412903_5414460_-	pfam04293, SpoVR, SpoVR like protein	NA|370aa|up_4|NZ_CP036281.1_5414505_5415615_-	PRK05325, PRK05325, hypothetical protein; Provisional	NA|686aa|up_3|NZ_CP036281.1_5415704_5417762_-	COG2766, PrkA, Putative Ser protein kinase [Signal transduction mechanisms]	NA|634aa|up_2|NZ_CP036281.1_5418991_5420893_+	pfam07583, PSCyt2, Protein of unknown function (DUF1549)	NA|425aa|up_1|NZ_CP036281.1_5421012_5422287_+	pfam07394, DUF1501, Protein of unknown function (DUF1501)	NA|182aa|up_0|NZ_CP036281.1_5422516_5423062_+	NA	NA|340aa|down_0|NZ_CP036281.1_5423962_5424982_+	COG0057, GapA, Glyceraldehyde-3-phosphate dehydrogenase/erythrose-4-phosphate dehydrogenase [Carbohydrate transport and metabolism]	NA|350aa|down_1|NZ_CP036281.1_5425142_5426192_-	cd07399, MPP_YvnB, Bacillus subtilis YvnB and related proteins, metallophosphatase domain	NA|433aa|down_2|NZ_CP036281.1_5426456_5427755_+	cd16913, YkuD_like, L,D-transpeptidases/carboxypeptidases similar to Bacillus YkuD	NA|154aa|down_3|NZ_CP036281.1_5428038_5428500_+	cd12131, HGbI-like, Hell's gate globin I (HGbI) from Methylacidophilum infernorum and related proteins	NA|296aa|down_4|NZ_CP036281.1_5428726_5429614_+	COG1802, GntR, Transcriptional regulators [Transcription]	NA|377aa|down_5|NZ_CP036281.1_5429782_5430913_-	NA	NA|278aa|down_6|NZ_CP036281.1_5432783_5433617_-	cd07185, OmpA_C-like, Peptidoglycan binding domains similar to the C-terminal domain of outer-membrane protein OmpA	NA|402aa|down_7|NZ_CP036281.1_5433992_5435198_-	COG0520, csdA, Selenocysteine lyase/Cysteine desulfurase [Posttranslational modification, protein turnover, chaperones]	NA|248aa|down_8|NZ_CP036281.1_5435248_5435992_-	NA	NA|1432aa|down_9|NZ_CP036281.1_5436144_5440440_+	pfam07631, PSD4, Protein of unknown function (DUF1592)
