assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_000817975.1_ASM81797v1	NZ_CP010415	Azotobacter chroococcum NCIMB 8003 chromosome, complete genome	1	98946-99766	1,1,1	PILER-CR,CRISPRCasFinder,CRT	no	cas3,cas8e,cse2gr11,cas6e,cas7,cas5,cas1,cas2	cas3,cas8e,cse2gr11,cas6e,cas7,cas5,cas1,cas2,cas14j,c2c9_V-U4,csa3,RT,DEDDh,WYL,DinG	Type I-E	GTCTTCCCCACGCCCGTGGGGGTGTTTC,GTCTTCCCCACGCCCGTGGGGGTGTTTC,GTCTTCCCCACGCCCGTGGGGGTGTTTC	28,28,28	0	0	NA	NA	I-B,III-A,III-B:I-B,III-A,III-B:I-B,III-A,III-B	12,13,13	13	TypeI-E	cas3,cas8e,cse2gr11,cas6e,cas7,cas5,cas1,cas2,cas14j,c2c9_V-U4,csa3,RT,DEDDh,WYL,DinG,PD-DExK,csb1gr7,csb2gr5,cas8u1	NA,NA	NA|725aa|up_9|NZ_CP010415.1_85845_88020_-	cd01948, EAL, EAL domain	NA|727aa|up_8|NZ_CP010415.1_88140_90321_+	PRK11773, uvrD, DNA-dependent helicase II; Provisional	cas3|861aa|up_7|NZ_CP010415.1_90713_93296_+	PRK09694, PRK09694, CRISPR-associated helicase/endonuclease Cas3	cas8e|499aa|up_6|NZ_CP010415.1_93308_94805_+	cd09729, Cse1_I-E, CRISPR/Cas system-associated protein Cse1	cse2gr11|175aa|up_5|NZ_CP010415.1_94810_95335_+	cd09731, Cse2_I-E, CRISPR/Cas system-associated protein Cse2	cas6e|217aa|up_4|NZ_CP010415.1_95331_95982_+	cd09727, Cas6_I-E, CRISPR/Cas system-associated RAMP superfamily protein Cas6e	cas7|344aa|up_3|NZ_CP010415.1_95996_97028_+	pfam09344, Cas_CT1975, CT1975-like protein	cas5|239aa|up_2|NZ_CP010415.1_97038_97755_+	TIGR01868, hypothetical_protein, CRISPR-associated protein Cas5/CasD, subtype I-E/ECOLI	cas1|292aa|up_1|NZ_CP010415.1_97757_98633_+	TIGR03638, cas1_ECOLI, CRISPR-associated endonuclease Cas1, subtype I-E/ECOLI	cas2|96aa|up_0|NZ_CP010415.1_98610_98898_+	cd09648, Cas2_I-E, CRISPR/Cas system-associated protein Cas2	NA|198aa|down_0|NZ_CP010415.1_99857_100451_-	PRK03846, PRK03846, adenylylsulfate kinase; Provisional	NA|240aa|down_1|NZ_CP010415.1_100737_101457_+	pfam03567, Sulfotransfer_2, Sulfotransferase family	NA|189aa|down_2|NZ_CP010415.1_101555_102122_+	TIGR02396, conserved_hypothetical_protein, rpsU-divergently transcribed protein	NA|285aa|down_3|NZ_CP010415.1_102279_103134_+	COG4395, COG4395, Uncharacterized protein conserved in bacteria [Function unknown]	NA|466aa|down_4|NZ_CP010415.1_103182_104580_-	cd14757, GS_EcDosC-like_GGDEF, Globin sensor domain of Escherichia coli Direct Oxygen Sensing Cyclase and related proteins; coupled to a C-terminal GGDEF domain	NA|136aa|down_5|NZ_CP010415.1_105072_105480_+	pfam14567, SUKH_5, SMI1-KNR4 cell-wall	NA|410aa|down_6|NZ_CP010415.1_105483_106713_-	COG2715, SpmA, Uncharacterized membrane protein, required for spore maturation in B	NA|199aa|down_7|NZ_CP010415.1_106847_107444_+	PRK03767, PRK03767, NAD(P)H:quinone oxidoreductase; Provisional	NA|524aa|down_8|NZ_CP010415.1_107731_109303_+	cd01949, GGDEF, Diguanylate-cyclase (DGC) or GGDEF domain	NA|105aa|down_9|NZ_CP010415.1_109367_109682_-	COG4654, COG4654, Cytochrome c551/c552 [Energy production and conversion]
GCF_000817975.1_ASM81797v1	NZ_CP010415	Azotobacter chroococcum NCIMB 8003 chromosome, complete genome	2	1636298-1638835	2,2,2	PILER-CR,CRISPRCasFinder,CRT	no	cas3,cas8e,cse2gr11,cas7,cas5,cas6e,cas1,cas2	cas3,cas8e,cse2gr11,cas6e,cas7,cas5,cas1,cas2,cas14j,c2c9_V-U4,csa3,RT,DEDDh,WYL,DinG	Type I-E	GTGTTCCCCGCGCCTGCGGGGATGAACCG,GTGTTCCCCGCGCCTGCGGGGATGAACCG,GTGTTCCCCGCGCCTGCGGGGATGAACCG	29,29,29	0	0	NA	NA	I-E:I-E:I-E	40,41,41	41	TypeI-E	cas3,cas8e,cse2gr11,cas6e,cas7,cas5,cas1,cas2,cas14j,c2c9_V-U4,csa3,RT,DEDDh,WYL,DinG,PD-DExK,csb1gr7,csb2gr5,cas8u1	NA,NA	cas5|221aa|up_9|NZ_CP010415.1_1625776_1626439_+	TIGR01868, hypothetical_protein, CRISPR-associated protein Cas5/CasD, subtype I-E/ECOLI	cas6e|205aa|up_8|NZ_CP010415.1_1626413_1627028_+	cd09727, Cas6_I-E, CRISPR/Cas system-associated RAMP superfamily protein Cas6e	cas3|918aa|up_7|NZ_CP010415.1_1627113_1629867_+	PRK09694, PRK09694, CRISPR-associated helicase/endonuclease Cas3	cas8e|519aa|up_6|NZ_CP010415.1_1630375_1631932_+	cd09729, Cse1_I-E, CRISPR/Cas system-associated protein Cse1	cse2gr11|209aa|up_5|NZ_CP010415.1_1631928_1632555_+	cd09731, Cse2_I-E, CRISPR/Cas system-associated protein Cse2	cas7|345aa|up_4|NZ_CP010415.1_1632551_1633586_+	pfam09344, Cas_CT1975, CT1975-like protein	cas5|255aa|up_3|NZ_CP010415.1_1633589_1634354_+	cd09756, Cas5_I-E, CRISPR/Cas system-associated RAMP superfamily protein Cas5	cas6e|223aa|up_2|NZ_CP010415.1_1634350_1635019_+	smart01101, CRISPR_assoc, This domain forms an anti-parallel beta strand structure with flanking alpha helical regions	cas1|305aa|up_1|NZ_CP010415.1_1635018_1635933_+	TIGR03638, cas1_ECOLI, CRISPR-associated endonuclease Cas1, subtype I-E/ECOLI	cas2|100aa|up_0|NZ_CP010415.1_1635932_1636232_+	PRK11558, PRK11558, putative ssRNA endonuclease; Provisional	NA|237aa|down_0|NZ_CP010415.1_1639097_1639808_-	COG0412, COG0412, Dienelactone hydrolase and related enzymes [Secondary metabolites biosynthesis, transport, and catabolism]	NA|443aa|down_1|NZ_CP010415.1_1640301_1641630_+	pfam03573, OprD, outer membrane porin, OprD family	NA|375aa|down_2|NZ_CP010415.1_1641744_1642869_+	pfam04339, FemAB_like, Peptidogalycan biosysnthesis/recognition	NA|185aa|down_3|NZ_CP010415.1_1642914_1643469_-	PRK10903, PRK10903, peptidylprolyl isomerase A	NA|309aa|down_4|NZ_CP010415.1_1643554_1644481_-	COG0583, LysR, Transcriptional regulator [Transcription]	NA|200aa|down_5|NZ_CP010415.1_1644690_1645290_+	COG1182, AcpD, Acyl carrier protein phosphodiesterase [Lipid metabolism]	NA|138aa|down_6|NZ_CP010415.1_1645317_1645731_-	pfam11776, RcnB, Nickel/cobalt transporter regulator	cas3|440aa|down_7|NZ_CP010415.1_1645908_1647228_+	COG0513, SrmB, Superfamily II DNA and RNA helicases [DNA replication, recombination, and repair / Transcription / Translation, ribosomal structure and biogenesis]	NA|321aa|down_8|NZ_CP010415.1_1647235_1648198_-	TIGR03438, conserved_hypothetical_protein, dimethylhistidine N-methyltransferase	NA|428aa|down_9|NZ_CP010415.1_1648200_1649484_-	TIGR03440, egtB_TIGR03440, ergothioneine biosynthesis protein EgtB
GCF_000817975.1_ASM81797v1	NZ_CP010415	Azotobacter chroococcum NCIMB 8003 chromosome, complete genome	3	2998100-2998211	3	CRISPRCasFinder	no		cas3,cas8e,cse2gr11,cas6e,cas7,cas5,cas1,cas2,cas14j,c2c9_V-U4,csa3,RT,DEDDh,WYL,DinG	Orphan	GACGAACAGGCCAAGCTCGCCAACGAGC	28	0	0	NA	NA	NA	1	1	Orphan	cas3,cas8e,cse2gr11,cas6e,cas7,cas5,cas1,cas2,cas14j,c2c9_V-U4,csa3,RT,DEDDh,WYL,DinG,PD-DExK,csb1gr7,csb2gr5,cas8u1	NA|279aa|up_6|NZ_CP010415.1_2985477_2986314_+,NA|284aa|up_4|NZ_CP010415.1_2987665_2988517_-,NA|209aa|down_0|NZ_CP010415.1_2998521_2999148_+,NA|374aa|down_5|NZ_CP010415.1_3005450_3006572_+	NA|460aa|up_9|NZ_CP010415.1_2980937_2982317_-	TIGR03023, Sugar_transferase	NA|406aa|up_8|NZ_CP010415.1_2982379_2983597_-	cd03818, GT4_ExpC-like, Rhizobium meliloti ExpC and similar proteins	NA|362aa|up_7|NZ_CP010415.1_2984199_2985285_-	pfam05598, DUF772, Transposase domain (DUF772)	NA|279aa|up_6|NZ_CP010415.1_2985477_2986314_+	NA	NA|346aa|up_5|NZ_CP010415.1_2986420_2987458_+	pfam13946, DUF4214, Domain of unknown function (DUF4214)	NA|284aa|up_4|NZ_CP010415.1_2987665_2988517_-	NA	NA|306aa|up_3|NZ_CP010415.1_2988473_2989391_+	COG1216, COG1216, Predicted glycosyltransferases [General function prediction only]	NA|141aa|up_2|NZ_CP010415.1_2991904_2992327_-	pfam12323, HTH_OrfB_IS605, Helix-turn-helix domain	NA|203aa|up_1|NZ_CP010415.1_2992457_2993066_+	pfam08241, Methyltransf_11, Methyltransferase domain	NA|1217aa|up_0|NZ_CP010415.1_2993399_2997050_+	COG1196, Smc, Chromosome segregation ATPases [Cell division and chromosome partitioning]	NA|209aa|down_0|NZ_CP010415.1_2998521_2999148_+	NA	NA|329aa|down_1|NZ_CP010415.1_2999269_3000256_+	COG1087, GalE, UDP-glucose 4-epimerase [Cell envelope biogenesis, outer membrane]	NA|587aa|down_2|NZ_CP010415.1_3000293_3002054_+	COG1216, COG1216, Predicted glycosyltransferases [General function prediction only]	NA|422aa|down_3|NZ_CP010415.1_3002095_3003361_+	cd03823, GT4_ExpE7-like, glycosyltransferase ExpE7 and similar proteins	NA|388aa|down_4|NZ_CP010415.1_3003626_3004790_+	pfam04230, PS_pyruv_trans, Polysaccharide pyruvyl transferase	NA|374aa|down_5|NZ_CP010415.1_3005450_3006572_+	NA	NA|350aa|down_6|NZ_CP010415.1_3006677_3007727_+	pfam04577, DUF563, Protein of unknown function (DUF563)	NA|878aa|down_7|NZ_CP010415.1_3008250_3010884_+	cd04184, GT2_RfbC_Mx_like, Myxococcus xanthus RfbC like proteins are required for O-antigen biosynthesis	NA|377aa|down_8|NZ_CP010415.1_3010924_3012055_+	cd04194, GT8_A4GalT_like, A4GalT_like proteins catalyze the addition of galactose or glucose residues to the lipooligosaccharide (LOS) or lipopolysaccharide (LPS) of the bacterial cell surface	NA|389aa|down_9|NZ_CP010415.1_3012198_3013365_+	PRK15057, PRK15057, UDP-glucose 6-dehydrogenase; Provisional
GCF_000817975.1_ASM81797v1	NZ_CP010420	Azotobacter chroococcum NCIMB 8003 plasmid pAcX50e, complete sequence	1	71137-71286	1	CRISPRCasFinder	no		c2c9_V-U4,csb1gr7,csb2gr5,cas3,cas8u1	Orphan	TGATACGTGATACGTGATACGTGA	24	0	0	NA	NA	NA	3	3	Orphan	cas3,cas8e,cse2gr11,cas6e,cas7,cas5,cas1,cas2,cas14j,c2c9_V-U4,csa3,RT,DEDDh,WYL,DinG,PD-DExK,csb1gr7,csb2gr5,cas8u1	NA|118aa|up_6|NZ_CP010420.1_65580_65934_+,NA|177aa|down_7|NZ_CP010420.1_79333_79864_-,NA|80aa|down_8|NZ_CP010420.1_79878_80118_-	NA|262aa|up_9|NZ_CP010420.1_62064_62850_+	cd07424, MPP_PrpA_PrpB, PrpA and PrpB, metallophosphatase domain	NA|154aa|up_8|NZ_CP010420.1_63477_63939_+	smart00530, HTH_XRE, Helix-turn-helix XRE-family like proteins	NA|408aa|up_7|NZ_CP010420.1_64214_65438_+	cd00801, INT_P4_C, Bacteriophage P4 integrase, C-terminal catalytic domain	NA|118aa|up_6|NZ_CP010420.1_65580_65934_+	NA	NA|226aa|up_5|NZ_CP010420.1_65987_66665_-	COG1670, RimL, Acetyltransferases, including N-acetylases of ribosomal proteins [Translation, ribosomal structure and biogenesis]	NA|211aa|up_4|NZ_CP010420.1_66687_67320_-	COG2808, PaiB, Transcriptional regulator [Transcription]	NA|495aa|up_3|NZ_CP010420.1_67410_68895_+	COG1167, ARO8, Transcriptional regulators containing a DNA-binding HTH domain and an aminotransferase domain (MocR family) and their eukaryotic orthologs [Transcription / Amino acid transport and metabolism]	NA|191aa|up_2|NZ_CP010420.1_68887_69460_-	pfam13302, Acetyltransf_3, Acetyltransferase (GNAT) domain	NA|248aa|up_1|NZ_CP010420.1_69818_70562_+	pfam11740, KfrA_N, Plasmid replication region DNA-binding N-term	NA|91aa|up_0|NZ_CP010420.1_70701_70974_-	cd13831, HU, histone-like DNA-binding protein HU	NA|145aa|down_0|NZ_CP010420.1_71553_71988_+	pfam13590, DUF4136, Domain of unknown function (DUF4136)	NA|348aa|down_1|NZ_CP010420.1_72031_73075_-	COG1752, RssA, Predicted esterase of the alpha-beta hydrolase superfamily [General function prediction only]	NA|93aa|down_2|NZ_CP010420.1_74126_74405_+	pfam10038, DUF2274, Protein of unknown function (DUF2274)	NA|300aa|down_3|NZ_CP010420.1_74379_75279_-	PRK15092, PRK15092, DNA-binding transcriptional repressor LrhA; Provisional	NA|296aa|down_4|NZ_CP010420.1_75382_76270_+	COG0596, MhpC, Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily) [General function prediction only]	NA|349aa|down_5|NZ_CP010420.1_76580_77627_+	cd16283, RomA-like_MBL-fold, Enterobacter cloacae RomA and related proteins; MBL-fold metallo hydrolase domain	NA|503aa|down_6|NZ_CP010420.1_77739_79248_+	cd18807, SF1_C_UvrD, C-terminal helicase domain of UvrD family helicases	NA|177aa|down_7|NZ_CP010420.1_79333_79864_-	NA	NA|80aa|down_8|NZ_CP010420.1_79878_80118_-	NA	NA|231aa|down_9|NZ_CP010420.1_80579_81272_-	pfam09414, RNA_ligase, RNA ligase
