assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_002220155.1_ASM222015v1	NZ_CP011835	Azotobacter chroococcum strain B3, complete genome	1	1012069-1012151	1	CRISPRCasFinder	no		csa3,cas3,DEDDh,cas14j,RT,casR,DinG,cas2,cas6e,cas5,cas7,cse2gr11,cas8e,c2c9_V-U4,WYL	Orphan	TAGGGCCTGTTGACGTTTTGGCGT	24	0	0	NA	NA	NA	1	1	Orphan	csa3,cas3,DEDDh,cas14j,RT,casR,DinG,cas2,cas6e,cas5,cas7,cse2gr11,cas8e,c2c9_V-U4,WYL,PrimPol	NA,NA	NA|74aa|up_9|NZ_CP011835.1_1003703_1003925_-	pfam12088, DUF3565, Protein of unknown function (DUF3565)	NA|162aa|up_8|NZ_CP011835.1_1003939_1004425_+	COG1047, SlpA, FKBP-type peptidyl-prolyl cis-trans isomerases 2 [Posttranslational modification, protein turnover, chaperones]	NA|161aa|up_7|NZ_CP011835.1_1004505_1004988_+	COG0386, BtuE, Glutathione peroxidase [Posttranslational modification, protein turnover, chaperones]	NA|137aa|up_6|NZ_CP011835.1_1004988_1005399_-	PRK00567, mscL, large-conductance mechanosensitive channel protein MscL	NA|324aa|up_5|NZ_CP011835.1_1005489_1006461_-	PRK11537, PRK11537, putative GTP-binding protein YjiA; Provisional	NA|68aa|up_4|NZ_CP011835.1_1006615_1006819_-	COG2879, COG2879, Uncharacterized small protein [Function unknown]	NA|686aa|up_3|NZ_CP011835.1_1006875_1008933_-	PRK15015, PRK15015, carbon starvation protein CstA	NA|459aa|up_2|NZ_CP011835.1_1009300_1010677_+	COG4564, COG4564, Signal transduction histidine kinase [Signal transduction mechanisms]	NA|209aa|up_1|NZ_CP011835.1_1010690_1011317_+	COG2197, CitB, Response regulator containing a CheY-like receiver domain and an HTH DNA-binding domain [Signal transduction mechanisms / Transcription]	NA|154aa|up_0|NZ_CP011835.1_1011327_1011789_-	COG2020, STE14, Putative protein-S-isoprenylcysteine methyltransferase [Posttranslational modification, protein turnover, chaperones]	NA|454aa|down_0|NZ_CP011835.1_1012448_1013810_-	PRK11823, PRK11823, DNA repair protein RadA; Provisional	NA|87aa|down_1|NZ_CP011835.1_1013881_1014142_-	COG2841, COG2841, Uncharacterized protein conserved in bacteria [Function unknown]	NA|173aa|down_2|NZ_CP011835.1_1014385_1014904_-	TIGR03344, VI_effect_Hcp1, type VI secretion system effector, Hcp1 family	NA|418aa|down_3|NZ_CP011835.1_1015232_1016486_-	PRK00011, glyA, serine hydroxymethyltransferase; Reviewed	NA|555aa|down_4|NZ_CP011835.1_1016800_1018465_+	PRK11819, PRK11819, putative ABC transporter ATP-binding protein; Reviewed	NA|509aa|down_5|NZ_CP011835.1_1018594_1020121_-	PRK01297, PRK01297, ATP-dependent RNA helicase RhlB; Provisional	NA|387aa|down_6|NZ_CP011835.1_1020362_1021523_-	cd01301, rDP_like, renal dipeptidase (rDP), best studied in mammals and also called membrane or microsomal dipeptidase, is a membrane-bound glycoprotein hydrolyzing dipeptides and is involved in hydrolytic metabolism of penem and carbapenem beta-lactam antibiotics	NA|220aa|down_7|NZ_CP011835.1_1021547_1022207_-	COG0400, COG0400, Predicted esterase [General function prediction only]	NA|233aa|down_8|NZ_CP011835.1_1022342_1023041_+	pfam17172, GST_N_4, Glutathione S-transferase N-terminal domain	NA|93aa|down_9|NZ_CP011835.1_1023037_1023316_+	PRK00329, PRK00329, GIY-YIG nuclease superfamily protein; Validated
GCF_002220155.1_ASM222015v1	NZ_CP011835	Azotobacter chroococcum strain B3, complete genome	2	1050961-1051066	2	CRISPRCasFinder	no		csa3,cas3,DEDDh,cas14j,RT,casR,DinG,cas2,cas6e,cas5,cas7,cse2gr11,cas8e,c2c9_V-U4,WYL	Orphan	GTCCCATGCACTCGATGGGCTGGGCGAGTTGCTTGAC	37	0	0	NA	NA	NA	1	1	Orphan	csa3,cas3,DEDDh,cas14j,RT,casR,DinG,cas2,cas6e,cas5,cas7,cse2gr11,cas8e,c2c9_V-U4,WYL,PrimPol	NA|112aa|up_8|NZ_CP011835.1_1043321_1043657_-,NA|92aa|up_0|NZ_CP011835.1_1050514_1050790_+,NA|156aa|down_7|NZ_CP011835.1_1060903_1061371_+	NA|945aa|up_9|NZ_CP011835.1_1040351_1043186_-	PRK05729, valS, valyl-tRNA synthetase; Reviewed	NA|112aa|up_8|NZ_CP011835.1_1043321_1043657_-	NA	NA|143aa|up_7|NZ_CP011835.1_1043663_1044092_-	PRK05728, PRK05728, DNA polymerase III subunit chi; Validated	NA|497aa|up_6|NZ_CP011835.1_1044136_1045627_-	PRK00913, PRK00913, multifunctional aminopeptidase A; Provisional	NA|374aa|up_5|NZ_CP011835.1_1045901_1047023_+	TIGR04407, LptF_YjgP, LPS export ABC transporter permease LptF	NA|354aa|up_4|NZ_CP011835.1_1047015_1048077_+	TIGR04408, LptG_lptG, LPS export ABC transporter permease LptG	NA|165aa|up_3|NZ_CP011835.1_1048114_1048609_-	pfam06271, RDD, RDD family	NA|71aa|up_2|NZ_CP011835.1_1048706_1048919_-	COG1278, CspC, Cold shock proteins [Transcription]	NA|106aa|up_1|NZ_CP011835.1_1049925_1050243_+	pfam10976, DUF2790, Protein of unknown function (DUF2790)	NA|92aa|up_0|NZ_CP011835.1_1050514_1050790_+	NA	NA|338aa|down_0|NZ_CP011835.1_1052362_1053376_-	TIGR03302, OM_YfiO, outer membrane assembly lipoprotein YfiO	NA|324aa|down_1|NZ_CP011835.1_1053522_1054494_+	PRK11180, rluD, 23S rRNA pseudouridine(1911/1915/1917) synthase RluD	NA|243aa|down_2|NZ_CP011835.1_1054493_1055222_+	PRK10723, PRK10723, polyphenol oxidase	NA|855aa|down_3|NZ_CP011835.1_1055392_1057957_+	TIGR03346, chaperone_ClpB, ATP-dependent chaperone ClpB	NA|433aa|down_4|NZ_CP011835.1_1058620_1059919_-	COG1252, Ndh, NADH dehydrogenase, FAD-containing subunit [Energy production and conversion]	NA|58aa|down_5|NZ_CP011835.1_1059994_1060168_-	pfam11293, DUF3094, Protein of unknown function (DUF3094)	NA|142aa|down_6|NZ_CP011835.1_1060481_1060907_+	PRK03624, PRK03624, putative acetyltransferase; Provisional	NA|156aa|down_7|NZ_CP011835.1_1060903_1061371_+	NA	NA|229aa|down_8|NZ_CP011835.1_1061367_1062054_+	COG3235, COG3235, Predicted membrane protein [Function unknown]	NA|67aa|down_9|NZ_CP011835.1_1062074_1062275_-	PRK00418, PRK00418, DNA gyrase inhibitor YacG
GCF_002220155.1_ASM222015v1	NZ_CP011835	Azotobacter chroococcum strain B3, complete genome	3	1774489-1774584	3	CRISPRCasFinder	no		csa3,cas3,DEDDh,cas14j,RT,casR,DinG,cas2,cas6e,cas5,cas7,cse2gr11,cas8e,c2c9_V-U4,WYL	Orphan	GCAGCCAGCAAGGCCATCAAGGTGGTCAAC	30	0	0	NA	NA	NA	1	1	Orphan	csa3,cas3,DEDDh,cas14j,RT,casR,DinG,cas2,cas6e,cas5,cas7,cse2gr11,cas8e,c2c9_V-U4,WYL,PrimPol	NA,NA|311aa|down_9|NZ_CP011835.1_1788061_1788994_+	NA|119aa|up_9|NZ_CP011835.1_1758622_1758979_+	PRK05185, rplT, 50S ribosomal protein L20; Provisional	NA|339aa|up_8|NZ_CP011835.1_1759176_1760193_+	PRK00488, pheS, phenylalanyl-tRNA synthetase subunit alpha; Validated	NA|793aa|up_7|NZ_CP011835.1_1760229_1762608_+	PRK00629, pheT, phenylalanyl-tRNA synthetase subunit beta; Reviewed	NA|101aa|up_6|NZ_CP011835.1_1762611_1762914_+	PRK00285, ihfA, integration host factor subunit alpha; Reviewed	NA|119aa|up_5|NZ_CP011835.1_1762894_1763251_+	cd04765, HTH_MlrA-like_sg2, Helix-Turn-Helix DNA binding domain of putative MlrA-like transcription regulators	NA|517aa|up_4|NZ_CP011835.1_1763678_1765229_+	pfam09820, AAA-ATPase_like, Predicted AAA-ATPase	NA|712aa|up_3|NZ_CP011835.1_1766400_1768536_-	PRK05632, PRK05632, phosphate acetyltransferase; Reviewed	NA|396aa|up_2|NZ_CP011835.1_1768672_1769860_-	PRK00180, PRK00180, acetate kinase A/propionate kinase 2; Reviewed	NA|154aa|up_1|NZ_CP011835.1_1770352_1770814_+	COG4396, COG4396, Mu-like prophage host-nuclease inhibitor protein Gam [General function prediction only]	NA|558aa|up_0|NZ_CP011835.1_1770897_1772571_-	sd00006, TPR, Tetratricopeptide repeat	NA|718aa|down_0|NZ_CP011835.1_1775221_1777375_+	COG0145, HyuA, N-methylhydantoinase A/acetone carboxylase, beta subunit [Amino acid transport and metabolism / Secondary metabolites biosynthesis, transport, and catabolism]	NA|773aa|down_1|NZ_CP011835.1_1777411_1779730_+	COG0146, HyuB, N-methylhydantoinase B/acetone carboxylase, alpha subunit [Amino acid transport and metabolism / Secondary metabolites biosynthesis, transport, and catabolism]	NA|169aa|down_2|NZ_CP011835.1_1779740_1780247_+	COG4647, AcxC, Acetone carboxylase, gamma subunit [Secondary metabolites biosynthesis, transport, and catabolism]	NA|665aa|down_3|NZ_CP011835.1_1780537_1782532_+	COG3284, AcoR, Transcriptional activator of acetoin/glycerol metabolism [Secondary metabolites biosynthesis, transport, and catabolism / Transcription]	NA|267aa|down_4|NZ_CP011835.1_1782726_1783527_+	COG0631, PTC1, Serine/threonine protein phosphatase [Signal transduction mechanisms]	NA|582aa|down_5|NZ_CP011835.1_1783542_1785288_+	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|282aa|down_6|NZ_CP011835.1_1785345_1786191_+	cd00060, FHA, Forkhead associated domain (FHA); found in eukaryotic and prokaryotic proteins	NA|174aa|down_7|NZ_CP011835.1_1786426_1786948_+	cd01522, RHOD_1, Member of the Rhodanese Homology Domain superfamily, subgroup 1	NA|314aa|down_8|NZ_CP011835.1_1787068_1788010_+	TIGR01172, Serine_acetyltransferase, serine O-acetyltransferase	NA|311aa|down_9|NZ_CP011835.1_1788061_1788994_+	NA
GCF_002220155.1_ASM222015v1	NZ_CP011835	Azotobacter chroococcum strain B3, complete genome	4	2740496-2740580	4	CRISPRCasFinder	no		csa3,cas3,DEDDh,cas14j,RT,casR,DinG,cas2,cas6e,cas5,cas7,cse2gr11,cas8e,c2c9_V-U4,WYL	Orphan	ATCCGTTACCGACGGCGCAGACACC	25	0	0	NA	NA	NA	1	1	Orphan	csa3,cas3,DEDDh,cas14j,RT,casR,DinG,cas2,cas6e,cas5,cas7,cse2gr11,cas8e,c2c9_V-U4,WYL,PrimPol	NA|132aa|up_9|NZ_CP011835.1_2734512_2734908_+,NA|48aa|up_7|NZ_CP011835.1_2735497_2735641_-,NA|48aa|up_4|NZ_CP011835.1_2736114_2736258_+,NA|81aa|up_3|NZ_CP011835.1_2736545_2736788_-,NA|163aa|up_1|NZ_CP011835.1_2738107_2738596_-,NA|79aa|up_0|NZ_CP011835.1_2738656_2738893_-,NA|583aa|down_0|NZ_CP011835.1_2740636_2742385_-,NA|229aa|down_1|NZ_CP011835.1_2742384_2743071_-,NA|327aa|down_2|NZ_CP011835.1_2743137_2744118_-,NA|1181aa|down_3|NZ_CP011835.1_2744117_2747660_-,NA|366aa|down_8|NZ_CP011835.1_2753444_2754542_+	NA|132aa|up_9|NZ_CP011835.1_2734512_2734908_+	NA	NA|96aa|up_8|NZ_CP011835.1_2735204_2735492_-	COG2944, COG2944, Predicted transcriptional regulator [Transcription]	NA|48aa|up_7|NZ_CP011835.1_2735497_2735641_-	NA	NA|65aa|up_6|NZ_CP011835.1_2735594_2735789_+	pfam07927, HicA_toxin, HicA toxin of bacterial toxin-antitoxin,	NA|111aa|up_5|NZ_CP011835.1_2735785_2736118_+	COG4226, HicB, Predicted nuclease of the RNAse H fold, HicB family [General    function prediction only]	NA|48aa|up_4|NZ_CP011835.1_2736114_2736258_+	NA	NA|81aa|up_3|NZ_CP011835.1_2736545_2736788_-	NA	NA|153aa|up_2|NZ_CP011835.1_2737652_2738111_-	pfam11351, GTA_holin_3TM, Holin of 3TMs, for gene-transfer release	NA|163aa|up_1|NZ_CP011835.1_2738107_2738596_-	NA	NA|79aa|up_0|NZ_CP011835.1_2738656_2738893_-	NA	NA|583aa|down_0|NZ_CP011835.1_2740636_2742385_-	NA	NA|229aa|down_1|NZ_CP011835.1_2742384_2743071_-	NA	NA|327aa|down_2|NZ_CP011835.1_2743137_2744118_-	NA	NA|1181aa|down_3|NZ_CP011835.1_2744117_2747660_-	NA	NA|51aa|down_4|NZ_CP011835.1_2747746_2747899_-	cd14737, PAAR_1, proline-alanine-alanine-arginine (PAAR) domain	NA|263aa|down_5|NZ_CP011835.1_2748009_2748798_+	pfam13340, DUF4096, Putative transposase of IS4/5 family (DUF4096)	NA|711aa|down_6|NZ_CP011835.1_2748922_2751055_-	PRK10044, PRK10044, ferrichrome outer membrane transporter; Provisional	NA|414aa|down_7|NZ_CP011835.1_2751715_2752957_-	COG3617, COG3617, Prophage antirepressor [Transcription]	NA|366aa|down_8|NZ_CP011835.1_2753444_2754542_+	NA	NA|82aa|down_9|NZ_CP011835.1_2754726_2754972_+	PRK09778, PRK09778, type I toxin-antitoxin system antitoxin YafN
GCF_002220155.1_ASM222015v1	NZ_CP011835	Azotobacter chroococcum strain B3, complete genome	5	2751567-2751716	5	CRISPRCasFinder	no		csa3,cas3,DEDDh,cas14j,RT,casR,DinG,cas2,cas6e,cas5,cas7,cse2gr11,cas8e,c2c9_V-U4,WYL	Orphan	CGGTTCATCCCCGCGTACGCGGGGAAAT	28	0	0	NA	NA	I-E	2	2	Orphan	csa3,cas3,DEDDh,cas14j,RT,casR,DinG,cas2,cas6e,cas5,cas7,cse2gr11,cas8e,c2c9_V-U4,WYL,PrimPol	NA|163aa|up_9|NZ_CP011835.1_2738107_2738596_-,NA|79aa|up_8|NZ_CP011835.1_2738656_2738893_-,NA|482aa|up_7|NZ_CP011835.1_2739197_2740643_+,NA|583aa|up_6|NZ_CP011835.1_2740636_2742385_-,NA|229aa|up_5|NZ_CP011835.1_2742384_2743071_-,NA|327aa|up_4|NZ_CP011835.1_2743137_2744118_-,NA|1181aa|up_3|NZ_CP011835.1_2744117_2747660_-,NA|366aa|down_0|NZ_CP011835.1_2753444_2754542_+,NA|131aa|down_3|NZ_CP011835.1_2755304_2755697_-,NA|102aa|down_5|NZ_CP011835.1_2759131_2759437_-,NA|251aa|down_6|NZ_CP011835.1_2759679_2760432_-,NA|146aa|down_7|NZ_CP011835.1_2760447_2760885_-,NA|95aa|down_8|NZ_CP011835.1_2760886_2761171_-	NA|163aa|up_9|NZ_CP011835.1_2738107_2738596_-	NA	NA|79aa|up_8|NZ_CP011835.1_2738656_2738893_-	NA	NA|482aa|up_7|NZ_CP011835.1_2739197_2740643_+	NA	NA|583aa|up_6|NZ_CP011835.1_2740636_2742385_-	NA	NA|229aa|up_5|NZ_CP011835.1_2742384_2743071_-	NA	NA|327aa|up_4|NZ_CP011835.1_2743137_2744118_-	NA	NA|1181aa|up_3|NZ_CP011835.1_2744117_2747660_-	NA	NA|51aa|up_2|NZ_CP011835.1_2747746_2747899_-	cd14737, PAAR_1, proline-alanine-alanine-arginine (PAAR) domain	NA|263aa|up_1|NZ_CP011835.1_2748009_2748798_+	pfam13340, DUF4096, Putative transposase of IS4/5 family (DUF4096)	NA|711aa|up_0|NZ_CP011835.1_2748922_2751055_-	PRK10044, PRK10044, ferrichrome outer membrane transporter; Provisional	NA|366aa|down_0|NZ_CP011835.1_2753444_2754542_+	NA	NA|82aa|down_1|NZ_CP011835.1_2754726_2754972_+	PRK09778, PRK09778, type I toxin-antitoxin system antitoxin YafN	NA|94aa|down_2|NZ_CP011835.1_2754961_2755243_+	COG2026, RelE, Cytotoxic translational repressor of toxin-antitoxin stability system [Translation, ribosomal structure and biogenesis / Cell division and chromosome partitioning]	NA|131aa|down_3|NZ_CP011835.1_2755304_2755697_-	NA	NA|1097aa|down_4|NZ_CP011835.1_2755697_2758988_-	COG3941, COG3941, Mu-like prophage protein [General function prediction only]	NA|102aa|down_5|NZ_CP011835.1_2759131_2759437_-	NA	NA|251aa|down_6|NZ_CP011835.1_2759679_2760432_-	NA	NA|146aa|down_7|NZ_CP011835.1_2760447_2760885_-	NA	NA|95aa|down_8|NZ_CP011835.1_2760886_2761171_-	NA	NA|336aa|down_9|NZ_CP011835.1_2761187_2762195_-	pfam03864, Phage_cap_E, Phage major capsid protein E
