assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_900231165.1_HDIA1	NZ_LT960614	Hartmannibacter diazotrophicus strain E19T chromosome 1	1	590112-590209	1	CRISPRCasFinder	no	csa3	WYL,csa3,cas3,PD-DExK,RT,DEDDh	Type I-A	AGCGCCTCGACCGCAACGGCGACGGC	26	0	0	NA	NA	NA	1	1	Orphan	WYL,csa3,cas3,PD-DExK,RT,DEDDh	NA|106aa|up_1|NZ_LT960614.1_588650_588968_+,NA|96aa|up_0|NZ_LT960614.1_589070_589358_+,NA|179aa|down_5|NZ_LT960614.1_597064_597601_-	NA|734aa|up_9|NZ_LT960614.1_578195_580397_-	PRK05402, PRK05402, 1,4-alpha-glucan branching protein GlgB	NA|359aa|up_8|NZ_LT960614.1_580516_581593_-	COG3842, PotA, ABC-type spermidine/putrescine transport systems, ATPase components [Amino acid transport and metabolism]	NA|263aa|up_7|NZ_LT960614.1_581607_582396_-	COG1177, PotC, ABC-type spermidine/putrescine transport system, permease component II [Amino acid transport and metabolism]	NA|302aa|up_6|NZ_LT960614.1_582392_583298_-	COG4132, COG4132, ABC-type uncharacterized transport system, permease component [General function prediction only]	NA|361aa|up_5|NZ_LT960614.1_583578_584661_-	COG1840, AfuA, ABC-type Fe3+ transport system, periplasmic component [Inorganic ion transport and metabolism]	NA|320aa|up_4|NZ_LT960614.1_585303_586263_+	COG0392, COG0392, Predicted integral membrane protein [Function unknown]	NA|488aa|up_3|NZ_LT960614.1_586288_587752_-	PRK09467, envZ, osmolarity sensor protein; Provisional	NA|243aa|up_2|NZ_LT960614.1_587754_588483_-	PRK09468, ompR, osmolarity response regulator; Provisional	NA|106aa|up_1|NZ_LT960614.1_588650_588968_+	NA	NA|96aa|up_0|NZ_LT960614.1_589070_589358_+	NA	NA|458aa|down_0|NZ_LT960614.1_590521_591895_-	COG0174, GlnA, Glutamine synthetase [Amino acid transport and metabolism]	NA|453aa|down_1|NZ_LT960614.1_591916_593275_-	PRK07046, PRK07046, aminotransferase; Validated	NA|306aa|down_2|NZ_LT960614.1_593418_594336_-	TIGR00950, Uncharacterized_inner_membrane_transporter_YicL, Carboxylate/Amino Acid/Amine Transporter	NA|401aa|down_3|NZ_LT960614.1_594495_595698_+	COG4782, COG4782, Uncharacterized protein conserved in bacteria [Function unknown]	NA|415aa|down_4|NZ_LT960614.1_595730_596975_+	pfam13406, SLT_2, Transglycosylase SLT domain	NA|179aa|down_5|NZ_LT960614.1_597064_597601_-	NA	csa3|116aa|down_6|NZ_LT960614.1_597643_597991_+	cd00090, HTH_ARSR, Arsenical Resistance Operon Repressor and similar prokaryotic, metal regulated homodimeric repressors	NA|401aa|down_7|NZ_LT960614.1_598028_599231_+	cd17355, MFS_YcxA_like, MFS-type transporter YcxA and similar proteins of the Major Facilitator Superfamily of transporters	NA|283aa|down_8|NZ_LT960614.1_599292_600141_-	PRK05198, PRK05198, 2-dehydro-3-deoxyphosphooctonate aldolase; Provisional	NA|336aa|down_9|NZ_LT960614.1_600178_601186_-	PRK10892, PRK10892, arabinose-5-phosphate isomerase KdsD
GCF_900231165.1_HDIA1	NZ_LT960614	Hartmannibacter diazotrophicus strain E19T chromosome 1	2	706420-706506	2	CRISPRCasFinder	no		WYL,csa3,cas3,PD-DExK,RT,DEDDh	Orphan	GCATCCAGCCGGAAAAGTGGTTTCCGGTTTT	31	0	0	NA	NA	NA	1	1	Orphan	WYL,csa3,cas3,PD-DExK,RT,DEDDh	NA|437aa|up_4|NZ_LT960614.1_699312_700623_-,NA	NA|405aa|up_9|NZ_LT960614.1_690689_691904_+	cd17325, MFS_MdtG_SLC18_like, bacterial MdtG-like and eukaryotic solute carrier 18 (SLC18) family of the Major Facilitator Superfamily of transporters	NA|458aa|up_8|NZ_LT960614.1_691974_693348_+	COG0076, GadB, Glutamate decarboxylase and related PLP-dependent proteins [Amino acid transport and metabolism]	NA|308aa|up_7|NZ_LT960614.1_693371_694295_-	cd19920, REC_PA4781-like, phosphoacceptor receiver (REC) domain of cyclic di-GMP phosphodiesterase PA4781 and similar domains	NA|1152aa|up_6|NZ_LT960614.1_694287_697743_-	TIGR02956, sensor_protein_TorS, TMAO reductase sytem sensor TorS	NA|343aa|up_5|NZ_LT960614.1_698025_699054_-	COG0715, TauA, ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components [Inorganic ion transport and metabolism]	NA|437aa|up_4|NZ_LT960614.1_699312_700623_-	NA	NA|437aa|up_3|NZ_LT960614.1_701285_702596_+	pfam13433, Peripla_BP_5, Periplasmic binding protein domain	NA|545aa|up_2|NZ_LT960614.1_702873_704508_+	TIGR03409, urea_trans_UrtB, urea ABC transporter, permease protein UrtB	NA|383aa|up_1|NZ_LT960614.1_704504_705653_+	TIGR03408, urea_trans_UrtC, urea ABC transporter, permease protein UrtC	NA|251aa|up_0|NZ_LT960614.1_705657_706410_+	TIGR03411, urea_trans_UrtD, urea ABC transporter, ATP-binding protein UrtD	NA|232aa|down_0|NZ_LT960614.1_706590_707286_+	TIGR03410, urea_trans_UrtE, urea ABC transporter, ATP-binding protein UrtE	NA|563aa|down_1|NZ_LT960614.1_707932_709621_-	smart00283, MA, Methyl-accepting chemotaxis-like domains (chemotaxis sensory transducer)	NA|823aa|down_2|NZ_LT960614.1_709958_712427_-	TIGR01970, ATP-dependent_RNA_helicase_HrpB, ATP-dependent helicase HrpB	NA|1033aa|down_3|NZ_LT960614.1_712485_715584_-	PRK11904, PRK11904, bifunctional proline dehydrogenase/L-glutamate gamma-semialdehyde dehydrogenase PutA	NA|164aa|down_4|NZ_LT960614.1_715741_716233_+	PRK11169, PRK11169, leucine-responsive transcriptional regulator Lrp	NA|409aa|down_5|NZ_LT960614.1_716335_717562_+	PRK11622, PRK11622, ABC transporter substrate-binding protein	NA|592aa|down_6|NZ_LT960614.1_717548_719324_+	COG4135, COG4135, ABC-type uncharacterized transport system, permease component [General function prediction only]	NA|227aa|down_7|NZ_LT960614.1_719323_720004_+	COG4136, COG4136, ABC-type uncharacterized transport system, ATPase component [General function prediction only]	NA|341aa|down_8|NZ_LT960614.1_720564_721587_-	PRK00292, glk, glucokinase; Provisional	NA|332aa|down_9|NZ_LT960614.1_722014_723010_+	cd06354, PBP1_PrnA-like, periplasmic binding domain of basic membrane lipoprotein, PnrA, in Treponema pallidum and its homologs from other bacteria and Archaea
GCF_900231165.1_HDIA1	NZ_LT960614	Hartmannibacter diazotrophicus strain E19T chromosome 1	3	842650-844630	1,3,1	PILER-CR,CRISPRCasFinder,CRT	no	cas3	WYL,csa3,cas3,PD-DExK,RT,DEDDh	Unclear	CGGTTCAGCCCCGCGTGTGCGGGGAACAC,CGGTTCAGCCCCGCGTGTGCGGGGAACAC,CGGTTCAGCCCCGCGTGTGCGGGGAACAC	29,29,29	0	0	NA	NA	I-E:I-E:I-E	32,32,32	32	Unclear	WYL,csa3,cas3,PD-DExK,RT,DEDDh	NA|72aa|up_3|NZ_LT960614.1_838782_838998_-,NA|165aa|up_1|NZ_LT960614.1_841059_841554_-,NA|125aa|down_1|NZ_LT960614.1_848182_848557_-,NA|83aa|down_2|NZ_LT960614.1_849083_849332_+	NA|73aa|up_9|NZ_LT960614.1_833630_833849_-	PRK00276, infA, translation initiation factor IF-1; Validated	NA|150aa|up_8|NZ_LT960614.1_834172_834622_-	cd16345, LMWP_ArsC, Arsenate reductase of the LMWP family	NA|161aa|up_7|NZ_LT960614.1_834628_835111_-	PRK02853, PRK02853, hypothetical protein; Provisional	NA|431aa|up_6|NZ_LT960614.1_835155_836448_-	PRK00877, hisD, bifunctional histidinal dehydrogenase/ histidinol dehydrogenase; Reviewed	NA|144aa|up_5|NZ_LT960614.1_836613_837045_-	pfam11164, DUF2948, Protein of unknown function (DUF2948)	NA|430aa|up_4|NZ_LT960614.1_837189_838479_-	PRK09369, PRK09369, UDP-N-acetylglucosamine 1-carboxyvinyltransferase; Validated	NA|72aa|up_3|NZ_LT960614.1_838782_838998_-	NA	NA|559aa|up_2|NZ_LT960614.1_839417_841094_+	cd01184, INT_C_like_1, Uncharacterized site-specific tyrosine recombinase, C-terminal catalytic domain	NA|165aa|up_1|NZ_LT960614.1_841059_841554_-	NA	NA|273aa|up_0|NZ_LT960614.1_841770_842589_+	smart00857, Resolvase, Resolvase, N terminal domain	cas3|915aa|down_0|NZ_LT960614.1_844758_847503_-	PRK09694, PRK09694, CRISPR-associated helicase/endonuclease Cas3	NA|125aa|down_1|NZ_LT960614.1_848182_848557_-	NA	NA|83aa|down_2|NZ_LT960614.1_849083_849332_+	NA	NA|628aa|down_3|NZ_LT960614.1_849398_851282_+	pfam05544, Pro_racemase, Proline racemase	NA|281aa|down_4|NZ_LT960614.1_851426_852269_+	cd13621, PBP2_AA_binding_like_3, Substrate-binding domain of putative amino acid-binding protein; the type 2 periplasmic-binding protein fold	NA|216aa|down_5|NZ_LT960614.1_852347_852995_+	COG0765, HisM, ABC-type amino acid transport system, permease component [Amino acid transport and metabolism]	NA|275aa|down_6|NZ_LT960614.1_852991_853816_+	COG1126, GlnQ, ABC-type polar amino acid transport system, ATPase component [Amino acid transport and metabolism]	NA|303aa|down_7|NZ_LT960614.1_853845_854754_+	cd00408, DHDPS-like, Dihydrodipicolinate synthase family	NA|336aa|down_8|NZ_LT960614.1_854758_855766_+	pfam05544, Pro_racemase, Proline racemase	NA|492aa|down_9|NZ_LT960614.1_855789_857265_+	pfam00171, Aldedh, Aldehyde dehydrogenase family
GCF_900231165.1_HDIA1	NZ_LT960614	Hartmannibacter diazotrophicus strain E19T chromosome 1	4	3755696-3755800	4	CRISPRCasFinder	no	WYL	WYL,csa3,cas3,PD-DExK,RT,DEDDh	Unclear	TGGTCCGCCTTCGCGGACCATGACGG	26	0	0	NA	NA	NA	1	1	Orphan	WYL,csa3,cas3,PD-DExK,RT,DEDDh	NA,NA|123aa|down_3|NZ_LT960614.1_3761361_3761730_+,NA|257aa|down_5|NZ_LT960614.1_3764139_3764910_+	NA|104aa|up_9|NZ_LT960614.1_3744284_3744596_-	COG4095, COG4095, Uncharacterized conserved protein [Function unknown]	NA|126aa|up_8|NZ_LT960614.1_3744861_3745239_-	COG0797, RlpA, Lipoproteins [Cell envelope biogenesis, outer membrane]	NA|281aa|up_7|NZ_LT960614.1_3745328_3746171_-	TIGR03340, phn_DUF6, phosphonate utilization associated putative membrane protein	NA|124aa|up_6|NZ_LT960614.1_3746326_3746698_+	COG3474, COG3474, Cytochrome c2 [Energy production and conversion]	NA|300aa|up_5|NZ_LT960614.1_3746876_3747776_-	COG0583, LysR, Transcriptional regulator [Transcription]	NA|285aa|up_4|NZ_LT960614.1_3748016_3748871_+	pfam00797, Acetyltransf_2, N-acetyltransferase	NA|393aa|up_3|NZ_LT960614.1_3748874_3750053_-	pfam00924, MS_channel, Mechanosensitive ion channel	NA|684aa|up_2|NZ_LT960614.1_3750237_3752289_-	COG1835, COG1835, Predicted acyltransferases [Lipid metabolism]	NA|601aa|up_1|NZ_LT960614.1_3753328_3755131_+	cd01646, RT_Bac_retron_I, RT_Bac_retron_I: Reverse transcriptases (RTs) in bacterial retrotransposons or retrons	NA|104aa|up_0|NZ_LT960614.1_3755221_3755533_-	cd10448, GIY-YIG_unchar_3, GIY-YIG domain of uncharacterized hypothetical protein found in bacteria	NA|966aa|down_0|NZ_LT960614.1_3755817_3758715_-	PRK05298, PRK05298, excinuclease ABC subunit UvrB	NA|224aa|down_1|NZ_LT960614.1_3758843_3759515_-	COG4991, COG4991, Uncharacterized protein with a bacterial SH3 domain homologue [Function unknown]	NA|386aa|down_2|NZ_LT960614.1_3759839_3760997_+	COG2206, COG2206, c-di-GMP phosphodiesterase class II (HD-GYP domain) [Signal transduction mechanisms]	NA|123aa|down_3|NZ_LT960614.1_3761361_3761730_+	NA	NA|814aa|down_4|NZ_LT960614.1_3761726_3764168_+	pfam02384, N6_Mtase, N-6 DNA Methylase	NA|257aa|down_5|NZ_LT960614.1_3764139_3764910_+	NA	NA|75aa|down_6|NZ_LT960614.1_3765349_3765574_-	COG1942, COG1942, Uncharacterized protein, 4-oxalocrotonate tautomerase homolog [General function prediction only]	NA|545aa|down_7|NZ_LT960614.1_3765695_3767330_+	COG5616, COG5616, Predicted integral membrane protein [Function unknown]	WYL|233aa|down_8|NZ_LT960614.1_3767504_3768203_-	COG2378, COG2378, Predicted transcriptional regulator [Transcription]	NA|133aa|down_9|NZ_LT960614.1_3768290_3768689_+	cd07264, VOC_like, uncharacterized subfamily of vicinal oxygen chelate (VOC) family
GCF_900231165.1_HDIA1	NZ_LT960614	Hartmannibacter diazotrophicus strain E19T chromosome 1	5	5011314-5011682	5	CRISPRCasFinder	no		WYL,csa3,cas3,PD-DExK,RT,DEDDh	Orphan	CAGTTCGGCAACGGCAACGACGC	23	0	0	NA	NA	NA	5	5	Orphan	WYL,csa3,cas3,PD-DExK,RT,DEDDh	NA|148aa|up_1|NZ_LT960614.1_5009331_5009775_+,NA|137aa|down_0|NZ_LT960614.1_5013427_5013838_+	NA|567aa|up_9|NZ_LT960614.1_4999410_5001111_+	PRK06299, rpsA, 30S ribosomal protein S1; Reviewed	NA|314aa|up_8|NZ_LT960614.1_5001508_5002450_+	PRK05654, PRK05654, acetyl-CoA carboxylase carboxyltransferase subunit beta	NA|442aa|up_7|NZ_LT960614.1_5002466_5003792_+	COG0285, FolC, Folylpolyglutamate synthase [Coenzyme metabolism]	NA|320aa|up_6|NZ_LT960614.1_5004060_5005020_+	PRK05269, PRK05269, transaldolase B; Provisional	NA|363aa|up_5|NZ_LT960614.1_5005306_5006395_+	COG4663, FcbT1, TRAP-type mannitol/chloroaromatic compound transport system, periplasmic component [Secondary metabolites biosynthesis, transport, and catabolism]	NA|178aa|up_4|NZ_LT960614.1_5006391_5006925_+	COG4665, FcbT2, TRAP-type mannitol/chloroaromatic compound transport system, small permease component [Secondary metabolites biosynthesis, transport, and catabolism]	NA|442aa|up_3|NZ_LT960614.1_5006928_5008254_+	COG4664, FcbT3, TRAP-type mannitol/chloroaromatic compound transport system, large permease component [Secondary metabolites biosynthesis, transport, and catabolism]	NA|249aa|up_2|NZ_LT960614.1_5008490_5009237_-	pfam06776, IalB, Invasion associated locus B (IalB) protein	NA|148aa|up_1|NZ_LT960614.1_5009331_5009775_+	NA	NA|282aa|up_0|NZ_LT960614.1_5010015_5010861_+	PRK15184, PRK15184, curli production assembly/transport protein CsgG; Provisional	NA|137aa|down_0|NZ_LT960614.1_5013427_5013838_+	NA	NA|202aa|down_1|NZ_LT960614.1_5013863_5014469_+	PRK10101, csgB, curlin minor subunit CsgB; Provisional	NA|95aa|down_2|NZ_LT960614.1_5014504_5014789_+	pfam10614, CsgF, Type VIII secretion system (T8SS), CsgF protein	NA|137aa|down_3|NZ_LT960614.1_5014837_5015248_-	PRK10101, csgB, curlin minor subunit CsgB; Provisional	NA|307aa|down_4|NZ_LT960614.1_5015347_5016268_-	cd01166, KdgK, 2-keto-3-deoxygluconate kinase (KdgK) phosphorylates 2-keto-3-deoxygluconate (KDG) to form 2-keto-3-deoxy-6-phosphogluconate (KDGP)	NA|234aa|down_5|NZ_LT960614.1_5016267_5016969_-	COG1802, GntR, Transcriptional regulators [Transcription]	NA|469aa|down_6|NZ_LT960614.1_5017106_5018513_+	PRK02925, PRK02925, glucuronate isomerase; Reviewed	NA|328aa|down_7|NZ_LT960614.1_5018612_5019596_+	cd13671, PBP2_TRAP_SBP_like_3, Uncharacterized substrate-binding protein of the Tripartite ATP-independent  Periplasmic transporter family; the type 2 periplasmic-binding protein fold	NA|171aa|down_8|NZ_LT960614.1_5019747_5020260_+	COG3090, DctM, TRAP-type C4-dicarboxylate transport system, small permease component [Carbohydrate transport and metabolism]	NA|427aa|down_9|NZ_LT960614.1_5020265_5021546_+	COG1593, DctQ, TRAP-type C4-dicarboxylate transport system, large permease component [Carbohydrate transport and metabolism]
GCF_900231165.1_HDIA1	NZ_LT960614	Hartmannibacter diazotrophicus strain E19T chromosome 1	6	5011740-5012038	6	CRISPRCasFinder	no		WYL,csa3,cas3,PD-DExK,RT,DEDDh	Orphan	CAGTTCGGCAACGGCAACGACGC	23	0	0	NA	NA	NA	4	4	Orphan	WYL,csa3,cas3,PD-DExK,RT,DEDDh	NA|148aa|up_1|NZ_LT960614.1_5009331_5009775_+,NA|137aa|down_0|NZ_LT960614.1_5013427_5013838_+	NA|567aa|up_9|NZ_LT960614.1_4999410_5001111_+	PRK06299, rpsA, 30S ribosomal protein S1; Reviewed	NA|314aa|up_8|NZ_LT960614.1_5001508_5002450_+	PRK05654, PRK05654, acetyl-CoA carboxylase carboxyltransferase subunit beta	NA|442aa|up_7|NZ_LT960614.1_5002466_5003792_+	COG0285, FolC, Folylpolyglutamate synthase [Coenzyme metabolism]	NA|320aa|up_6|NZ_LT960614.1_5004060_5005020_+	PRK05269, PRK05269, transaldolase B; Provisional	NA|363aa|up_5|NZ_LT960614.1_5005306_5006395_+	COG4663, FcbT1, TRAP-type mannitol/chloroaromatic compound transport system, periplasmic component [Secondary metabolites biosynthesis, transport, and catabolism]	NA|178aa|up_4|NZ_LT960614.1_5006391_5006925_+	COG4665, FcbT2, TRAP-type mannitol/chloroaromatic compound transport system, small permease component [Secondary metabolites biosynthesis, transport, and catabolism]	NA|442aa|up_3|NZ_LT960614.1_5006928_5008254_+	COG4664, FcbT3, TRAP-type mannitol/chloroaromatic compound transport system, large permease component [Secondary metabolites biosynthesis, transport, and catabolism]	NA|249aa|up_2|NZ_LT960614.1_5008490_5009237_-	pfam06776, IalB, Invasion associated locus B (IalB) protein	NA|148aa|up_1|NZ_LT960614.1_5009331_5009775_+	NA	NA|282aa|up_0|NZ_LT960614.1_5010015_5010861_+	PRK15184, PRK15184, curli production assembly/transport protein CsgG; Provisional	NA|137aa|down_0|NZ_LT960614.1_5013427_5013838_+	NA	NA|202aa|down_1|NZ_LT960614.1_5013863_5014469_+	PRK10101, csgB, curlin minor subunit CsgB; Provisional	NA|95aa|down_2|NZ_LT960614.1_5014504_5014789_+	pfam10614, CsgF, Type VIII secretion system (T8SS), CsgF protein	NA|137aa|down_3|NZ_LT960614.1_5014837_5015248_-	PRK10101, csgB, curlin minor subunit CsgB; Provisional	NA|307aa|down_4|NZ_LT960614.1_5015347_5016268_-	cd01166, KdgK, 2-keto-3-deoxygluconate kinase (KdgK) phosphorylates 2-keto-3-deoxygluconate (KDG) to form 2-keto-3-deoxy-6-phosphogluconate (KDGP)	NA|234aa|down_5|NZ_LT960614.1_5016267_5016969_-	COG1802, GntR, Transcriptional regulators [Transcription]	NA|469aa|down_6|NZ_LT960614.1_5017106_5018513_+	PRK02925, PRK02925, glucuronate isomerase; Reviewed	NA|328aa|down_7|NZ_LT960614.1_5018612_5019596_+	cd13671, PBP2_TRAP_SBP_like_3, Uncharacterized substrate-binding protein of the Tripartite ATP-independent  Periplasmic transporter family; the type 2 periplasmic-binding protein fold	NA|171aa|down_8|NZ_LT960614.1_5019747_5020260_+	COG3090, DctM, TRAP-type C4-dicarboxylate transport system, small permease component [Carbohydrate transport and metabolism]	NA|427aa|down_9|NZ_LT960614.1_5020265_5021546_+	COG1593, DctQ, TRAP-type C4-dicarboxylate transport system, large permease component [Carbohydrate transport and metabolism]
