assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_000092105.1_ASM9210v1	NC_014148	Planctopirus limnophila DSM 3776, complete sequence	1	673934-674034	1	CRISPRCasFinder	no		cas10,DinG,cas3,Cas9_archaeal,cas2,cas1,csa3,RT,WYL,cas8u2,cas7,cas5u	Orphan	CTCTCAACCGGCAGCCCCTGGTT	23	0	0	NA	NA	NA	1	1	Orphan	cas10,DinG,cas3,Cas9_archaeal,cas2,cas1,csa3,RT,WYL,cas8u2,cas7,cas5u	NA|973aa|up_7|NC_014148.1_660003_662922_-,NA|300aa|up_5|NC_014148.1_665221_666121_+,NA|630aa|up_3|NC_014148.1_667173_669063_+,NA|201aa|up_2|NC_014148.1_669009_669612_-,NA|239aa|up_1|NC_014148.1_669699_670416_-,NA|147aa|down_1|NC_014148.1_675538_675979_-	NA|249aa|up_9|NC_014148.1_658304_659051_+	PRK00090, bioD, ATP-dependent dethiobiotin synthetase BioD	NA|313aa|up_8|NC_014148.1_659098_660037_-	NF033188, internalin_H, InlH/InlC2 family class 1 internalin	NA|973aa|up_7|NC_014148.1_660003_662922_-	NA	NA|495aa|up_6|NC_014148.1_663252_664737_-	COG1807, ArnT, 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family [Cell envelope biogenesis, outer membrane]	NA|300aa|up_5|NC_014148.1_665221_666121_+	NA	NA|155aa|up_4|NC_014148.1_666339_666804_-	pfam04931, DNA_pol_phi, DNA polymerase phi	NA|630aa|up_3|NC_014148.1_667173_669063_+	NA	NA|201aa|up_2|NC_014148.1_669009_669612_-	NA	NA|239aa|up_1|NC_014148.1_669699_670416_-	NA	NA|949aa|up_0|NC_014148.1_670463_673310_-	COG4232, COG4232, Thiol:disulfide interchange protein [Posttranslational modification, protein turnover, chaperones / Energy production and conversion]	NA|433aa|down_0|NC_014148.1_674210_675509_+	PRK13342, PRK13342, recombination factor protein RarA; Reviewed	NA|147aa|down_1|NC_014148.1_675538_675979_-	NA	NA|233aa|down_2|NC_014148.1_676104_676803_+	pfam00578, AhpC-TSA, AhpC/TSA family	NA|606aa|down_3|NC_014148.1_676866_678684_+	cd14014, STKc_PknB_like, Catalytic domain of bacterial Serine/Threonine kinases, PknB and similar proteins	NA|430aa|down_4|NC_014148.1_679095_680385_+	cd00839, MPP_PAPs, purple acid phosphatases of the metallophosphatase superfamily, metallophosphatase domain	NA|197aa|down_5|NC_014148.1_680414_681005_-	pfam06962, rRNA_methylase, Putative rRNA methylase	NA|318aa|down_6|NC_014148.1_681246_682200_+	cd19084, AKR_AKR11B1-like, AKR11B1/AKR11B2 subfamily of aldo-keto reductase (AKR)	NA|119aa|down_7|NC_014148.1_682341_682698_+	COG3795, COG3795, Uncharacterized protein conserved in bacteria [Function unknown]	NA|430aa|down_8|NC_014148.1_682756_684046_+	COG4941, COG4941, Predicted RNA polymerase sigma factor containing a TPR repeat domain [Transcription]	NA|359aa|down_9|NC_014148.1_684099_685176_-	pfam07596, SBP_bac_10, Protein of unknown function (DUF1559)
GCF_000092105.1_ASM9210v1	NC_014148	Planctopirus limnophila DSM 3776, complete sequence	2	1518423-1518493	2	CRISPRCasFinder	no	Cas9_archaeal	cas10,DinG,cas3,Cas9_archaeal,cas2,cas1,csa3,RT,WYL,cas8u2,cas7,cas5u	Type II-A, or Type II-C?, Type II-B	ACCATCTTGCAGGTGCGAACAGG	23	0	0	NA	NA	NA	1	1	TypeII-A,orTypeII-C?,TypeII-B	cas10,DinG,cas3,Cas9_archaeal,cas2,cas1,csa3,RT,WYL,cas8u2,cas7,cas5u	NA|163aa|up_6|NC_014148.1_1509636_1510125_+,NA	NA|186aa|up_9|NC_014148.1_1504844_1505402_-	COG0071, IbpA, Molecular chaperone (small heat shock protein) [Posttranslational modification, protein turnover, chaperones]	NA|212aa|up_8|NC_014148.1_1505795_1506431_+	pfam00582, Usp, Universal stress protein family	NA|878aa|up_7|NC_014148.1_1506744_1509378_+	pfam13248, zf-ribbon_3, zinc-ribbon domain	NA|163aa|up_6|NC_014148.1_1509636_1510125_+	NA	NA|708aa|up_5|NC_014148.1_1510228_1512352_+	COG1331, COG1331, Highly conserved protein containing a thioredoxin domain [Posttranslational modification, protein turnover, chaperones]	NA|410aa|up_4|NC_014148.1_1512390_1513620_-	PRK08317, PRK08317, hypothetical protein; Provisional	NA|289aa|up_3|NC_014148.1_1513644_1514511_-	pfam01925, TauE, Sulfite exporter TauE/SafE	NA|477aa|up_2|NC_014148.1_1514546_1515977_-	cd07724, POD-like_MBL-fold, ETHE1 (PDO type I), persulfide dioxygenase A (PDOA, PDO type II) and related proteins; MBL-fold metallo-hydrolase domain	NA|195aa|up_1|NC_014148.1_1516079_1516664_-	cd00158, RHOD, Rhodanese Homology Domain (RHOD); an alpha beta fold domain found duplicated in the rhodanese protein	NA|316aa|up_0|NC_014148.1_1517043_1517991_+	TIGR02225, Tyrosine_recombinase_XerD, tyrosine recombinase XerD	NA|271aa|down_0|NC_014148.1_1520229_1521042_+	PRK00346, surE, 5'(3')-nucleotidase/polyphosphatase; Provisional	NA|289aa|down_1|NC_014148.1_1521148_1522015_+	COG0053, MMT1, Predicted Co/Zn/Cd cation transporters [Inorganic ion transport and metabolism]	NA|432aa|down_2|NC_014148.1_1522052_1523348_-	cd03324, rTSbeta_L-fuconate_dehydratase, Human rTS beta is encoded by the rTS gene which, through alternative RNA splicing, also encodes rTS alpha whose mRNA is complementary to thymidylate synthase mRNA	NA|454aa|down_3|NC_014148.1_1523400_1524762_-	COG1624, COG1624, Uncharacterized conserved protein [Function unknown]	NA|182aa|down_4|NC_014148.1_1525896_1526442_+	PRK11895, ilvH, acetolactate synthase 3 regulatory subunit; Reviewed	NA|335aa|down_5|NC_014148.1_1526826_1527831_+	PRK05479, PRK05479, ketol-acid reductoisomerase; Provisional	NA|164aa|down_6|NC_014148.1_1527951_1528443_-	TIGR01241, ATP-dependent_zinc_metalloprotease_FtsH, ATP-dependent metalloprotease FtsH	NA|495aa|down_7|NC_014148.1_1528475_1529960_-	PRK09369, PRK09369, UDP-N-acetylglucosamine 1-carboxyvinyltransferase; Validated	NA|274aa|down_8|NC_014148.1_1530114_1530936_-	pfam13365, Trypsin_2, Trypsin-like peptidase domain	NA|161aa|down_9|NC_014148.1_1531272_1531755_-	pfam04074, DUF386, Domain of unknown function (DUF386)
GCF_000092105.1_ASM9210v1	NC_014148	Planctopirus limnophila DSM 3776, complete sequence	3	1849844-1851259	3,1,1	CRISPRCasFinder,CRT,PILER-CR	no	cas2,cas1	cas10,DinG,cas3,Cas9_archaeal,cas2,cas1,csa3,RT,WYL,cas8u2,cas7,cas5u	Unclear	GCTTCAATGGGGCCGCGCTTGGTTAGCGCGGAAGAC,GCTTCAATGGGGCCGCGCTTGGTTAGCGCGGAAGAC,GCTTCAATGGGGCCGCGCTTGGTTAGCGCGGAAGAC	36,36,36	1	1	1851189-1851223	NC_014149.1_17735-17769	NA:NA:NA	19,19,18	19	Unclear	cas10,DinG,cas3,Cas9_archaeal,cas2,cas1,csa3,RT,WYL,cas8u2,cas7,cas5u	NA|501aa|up_0|NC_014148.1_1848284_1849787_+,NA|207aa|down_5|NC_014148.1_1859274_1859895_+,NA|331aa|down_6|NC_014148.1_1860596_1861589_-	NA|494aa|up_9|NC_014148.1_1833150_1834632_+	TIGR00591, Deoxyribodipyrimidine_photo-lyase, photolyase PhrII	NA|590aa|up_8|NC_014148.1_1834637_1836407_-	cd00400, Voltage_gated_ClC, CLC voltage-gated chloride channel	NA|189aa|up_7|NC_014148.1_1836732_1837299_+	pfam02622, DUF179, Uncharacterized ACR, COG1678	NA|333aa|up_6|NC_014148.1_1837624_1838623_+	pfam14100, PmoA, Methane oxygenase PmoA	NA|295aa|up_5|NC_014148.1_1839046_1839931_+	COG0657, Aes, Esterase/lipase [Lipid metabolism]	NA|188aa|up_4|NC_014148.1_1840178_1840742_+	pfam03745, DUF309, Domain of unknown function (DUF309)	NA|1059aa|up_3|NC_014148.1_1840832_1844009_-	TIGR02604, Piru_Ver_Nterm, putative membrane-bound dehydrogenase domain	NA|419aa|up_2|NC_014148.1_1844473_1845730_+	pfam03629, SASA, Carbohydrate esterase, sialic acid-specific acetylesterase	NA|797aa|up_1|NC_014148.1_1845886_1848277_+	TIGR04485, monoheme_cytochrome_c_SoxX, sulfur oxidation c-type cytochrome SoxX	NA|501aa|up_0|NC_014148.1_1848284_1849787_+	NA	cas2|98aa|down_0|NC_014148.1_1851637_1851931_-	cd09725, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2	cas1|564aa|down_1|NC_014148.1_1851966_1853658_-	pfam01867, Cas_Cas1, CRISPR associated protein Cas1	NA|328aa|down_2|NC_014148.1_1854063_1855047_-	cd05245, SDR_a2, atypical (a) SDRs, subgroup 2	NA|210aa|down_3|NC_014148.1_1855167_1855797_+	pfam09348, DUF1990, Domain of unknown function (DUF1990)	NA|998aa|down_4|NC_014148.1_1856138_1859132_+	cd00081, Hint, Hedgehog/Intein domain, found in Hedgehog proteins as well as proteins which contain inteins and undergo protein splicing (e	NA|207aa|down_5|NC_014148.1_1859274_1859895_+	NA	NA|331aa|down_6|NC_014148.1_1860596_1861589_-	NA	NA|292aa|down_7|NC_014148.1_1861852_1862728_-	pfam06167, Peptidase_M90, Glucose-regulated metallo-peptidase M90	NA|443aa|down_8|NC_014148.1_1863374_1864703_+	pfam07394, DUF1501, Protein of unknown function (DUF1501)	NA|821aa|down_9|NC_014148.1_1865243_1867706_+	pfam04151, PPC, Bacterial pre-peptidase C-terminal domain
