assembly_id	genome_id	genome_def	crispr_array_locus_merge	crispr_array_location_merge	crispr_locus_id	crispr_pred_method	array_in_prot	prot_within_array_20000	prot_in_genome	crispr_type_by_cas_prot	consensus_repeat	repeat_length	self-targeting_spacer_number	self-targeting_target_number	spacer_location	protospacer_location	repeat_type	spacer_locus_num	spacer_num	correct_crispr_type	genome_cas_prots	unknown_protein_around_crispr	L10	L10_domain	L9	L9_domain	L8	L8_domain	L7	L7_domain	L6	L6_domain	L5	L5_domain	L4	L4_domain	L3	L3_domain	L2	L2_domain	L1	L1_domain	R1	R1_domain	R2	R2_domain	R3	R3_domain	R4	R4_domain	R5	R5_domain	R6	R6_domain	R7	R7_domain	R8	R8_domain	R9	R9_domain	R10	R10_domain
GCF_009730515.1_ASM973051v1	NZ_CP046335	Streptococcus mitis strain FDAARGOS_684 chromosome, complete genome	1	365723-365814	1	CRISPRCasFinder	no		DEDDh,cas3,DinG,RT,csa3	Orphan	CAACTGTAGTGGGTTGCAGAAAAGCTAA	28	0	0	NA	NA	NA	1	1	Orphan	DEDDh,cas3,DinG,RT,csa3	NA,NA	NA|106aa|up_9|NZ_CP046335.1_350759_351077_+	cd02947, TRX_family, TRX family; composed of two groups: Group I, which includes proteins that exclusively encode a TRX domain; and Group II, which are composed of fusion proteins of TRX and additional domains	NA|209aa|up_8|NZ_CP046335.1_351092_351719_+	cd02796, tRNA_bind_bactPheRS, tRNA-binding-domain-containing prokaryotic phenylalanly tRNA synthetase (PheRS) beta chain	NA|254aa|up_7|NZ_CP046335.1_351760_352522_+	cd05346, SDR_c5, classical (c) SDR, subgroup 5	NA|132aa|up_6|NZ_CP046335.1_352597_352993_+	PRK07274, PRK07274, single-stranded DNA-binding protein; Provisional	NA|95aa|up_5|NZ_CP046335.1_353148_353433_+	PRK00364, groES, co-chaperonin GroES; Reviewed	NA|541aa|up_4|NZ_CP046335.1_353448_355071_+	PRK00013, groEL, chaperonin GroEL; Reviewed	NA|256aa|up_3|NZ_CP046335.1_355208_355976_+	COG1636, COG1636, Uncharacterized protein conserved in bacteria [Function unknown]	NA|178aa|up_2|NZ_CP046335.1_356046_356580_-	PRK13662, PRK13662, hypothetical protein; Provisional	NA|259aa|up_1|NZ_CP046335.1_356668_357445_-	PRK14135, recX, recombination regulator RecX; Provisional	NA|452aa|up_0|NZ_CP046335.1_357482_358838_+	COG2265, TrmA, SAM-dependent methyltransferases related to tRNA (uracil-5-)-methyltransferase [Translation, ribosomal structure and biogenesis]	NA|312aa|down_0|NZ_CP046335.1_367215_368151_-	PRK11886, PRK11886, bifunctional biotin--[acetyl-CoA-carboxylase] ligase/biotin operon repressor BirA	NA|287aa|down_1|NZ_CP046335.1_368147_369008_-	cd06986, cupin_MmsR-like_N, AraC/XylS family transcriptional regulators similar to MmsR, N-terminal cupin domain	NA|721aa|down_2|NZ_CP046335.1_369115_371278_+	COG3345, GalA, Alpha-galactosidase [Carbohydrate transport and metabolism]	NA|417aa|down_3|NZ_CP046335.1_371312_372563_+	cd14749, PBP2_XBP1_like, The periplasmic-binding component of ABC transport systems specific for xylo-oligosaccharides; possesses type 2 periplasmic binding fold	NA|291aa|down_4|NZ_CP046335.1_372581_373454_+	COG1175, UgpA, ABC-type sugar transport systems, permease components [Carbohydrate transport and metabolism]	NA|278aa|down_5|NZ_CP046335.1_373470_374304_+	COG0395, UgpE, ABC-type sugar transport system, permease component [Carbohydrate transport and metabolism]	NA|481aa|down_6|NZ_CP046335.1_374623_376066_+	TIGR03852, sucrose_gtfA, sucrose phosphorylase	NA|398aa|down_7|NZ_CP046335.1_376203_377397_+	COG3307, RfaL, Lipid A core - O-antigen ligase and related enzymes [Cell envelope biogenesis, outer membrane]	NA|148aa|down_8|NZ_CP046335.1_377542_377986_+	TIGR04472, reg_rSAM_mob, mobile rSAM pair MarR family regulator	NA|50aa|down_9|NZ_CP046335.1_378062_378212_+	COG4922, COG4922, Uncharacterized protein conserved in bacteria [Function unknown]
GCF_009730515.1_ASM973051v1	NZ_CP046335	Streptococcus mitis strain FDAARGOS_684 chromosome, complete genome	2	837974-838131	1	PILER-CR	no		DEDDh,cas3,DinG,RT,csa3	Orphan	ACACCAGTAGATCCAAAAGATCCAACTAAACC	32	0	0	NA	NA	NA	2	2	Orphan	DEDDh,cas3,DinG,RT,csa3	NA|53aa|up_2|NZ_CP046335.1_822844_823003_+,NA|266aa|down_9|NZ_CP046335.1_853998_854796_+	NA|370aa|up_9|NZ_CP046335.1_817111_818221_+	PRK09210, PRK09210, RNA polymerase sigma factor RpoD; Validated	NA|110aa|up_8|NZ_CP046335.1_818235_818565_+	COG2151, PaaD, Predicted metal-sulfur cluster biosynthetic enzyme [General function prediction only]	NA|339aa|up_7|NZ_CP046335.1_818704_819721_+	cd03801, GT4_PimA-like, phosphatidyl-myo-inositol mannosyltransferase	NA|442aa|up_6|NZ_CP046335.1_819736_821062_+	cd03817, GT4_UGDG-like, UDP-Glc:1,2-diacylglycerol 3-a-glucosyltransferase and similar proteins	NA|59aa|up_5|NZ_CP046335.1_821091_821268_+	pfam04024, PspC, PspC domain	NA|45aa|up_4|NZ_CP046335.1_821326_821461_+	pfam13253, DUF4044, Protein of unknown function (DUF4044)	NA|437aa|up_3|NZ_CP046335.1_821524_822835_+	PRK12297, obgE, GTPase CgtA; Reviewed	NA|53aa|up_2|NZ_CP046335.1_822844_823003_+	NA	NA|157aa|up_1|NZ_CP046335.1_823046_823517_-	COG1438, ArgR, Arginine repressor [Transcription]	NA|761aa|up_0|NZ_CP046335.1_823983_826266_-	PRK05371, PRK05371, x-prolyl-dipeptidyl aminopeptidase; Provisional	NA|1043aa|down_0|NZ_CP046335.1_839600_842729_+	PRK07279, dnaE, DNA polymerase III DnaE; Reviewed	NA|336aa|down_1|NZ_CP046335.1_842810_843818_+	PRK03202, PRK03202, ATP-dependent 6-phosphofructokinase	NA|502aa|down_2|NZ_CP046335.1_843874_845380_+	PRK05826, PRK05826, pyruvate kinase; Provisional	NA|329aa|down_3|NZ_CP046335.1_845698_846685_-	PRK00066, ldh, L-lactate dehydrogenase; Reviewed	NA|823aa|down_4|NZ_CP046335.1_846879_849348_+	PRK05560, PRK05560, DNA gyrase subunit A; Validated	NA|248aa|down_5|NZ_CP046335.1_849347_850091_+	cd06165, Sortase_A, Sortase domain found in class A sortases	NA|266aa|down_6|NZ_CP046335.1_850233_851031_+	COG2116, FocA, Formate/nitrite family of transporters [Inorganic ion transport and metabolism]	NA|427aa|down_7|NZ_CP046335.1_851178_852459_+	COG2873, MET17, O-acetylhomoserine sulfhydrylase [Amino acid transport and metabolism]	NA|425aa|down_8|NZ_CP046335.1_852554_853829_+	pfam09903, DUF2130, Uncharacterized protein conserved in bacteria (DUF2130)	NA|266aa|down_9|NZ_CP046335.1_853998_854796_+	NA
GCF_009730515.1_ASM973051v1	NZ_CP046335	Streptococcus mitis strain FDAARGOS_684 chromosome, complete genome	3	1147827-1147976	2	CRISPRCasFinder	no		DEDDh,cas3,DinG,RT,csa3	Orphan	TATTTTTCTTAAGGCACCTTAATTATAACACAAAA	35	0	0	NA	NA	NA	1	1	Orphan	DEDDh,cas3,DinG,RT,csa3	NA,NA	NA|275aa|up_9|NZ_CP046335.1_1134685_1135510_-	COG2819, COG2819, Predicted hydrolase of the alpha/beta superfamily [General function prediction only]	NA|405aa|up_8|NZ_CP046335.1_1135646_1136861_-	PRK01565, PRK01565, thiamine biosynthesis protein ThiI; Provisional	NA|381aa|up_7|NZ_CP046335.1_1136869_1138012_-	COG1104, NifS, Cysteine sulfinate desulfinase/cysteine desulfurase and related enzymes [Amino acid transport and metabolism]	NA|143aa|up_6|NZ_CP046335.1_1138150_1138579_+	PTZ00372, PTZ00372, endonuclease 4-like protein; Provisional	NA|768aa|up_5|NZ_CP046335.1_1138623_1140927_-	COG1674, FtsK, DNA segregation ATPase FtsK/SpoIIIE and related proteins [Cell division and chromosome partitioning]	NA|651aa|up_4|NZ_CP046335.1_1141012_1142965_-	COG1299, FruA, Phosphotransferase system, fructose-specific IIC component [Carbohydrate transport and metabolism]	NA|304aa|up_3|NZ_CP046335.1_1142961_1143873_-	COG1105, FruK, Fructose-1-phosphate kinase and related fructose-6-phosphate kinase (PfkB) [Carbohydrate transport and metabolism]	NA|247aa|up_2|NZ_CP046335.1_1143869_1144610_-	COG1349, GlpR, Transcriptional regulators of sugar metabolism [Transcription / Carbohydrate transport and metabolism]	NA|450aa|up_1|NZ_CP046335.1_1144919_1146268_-	PHA02517, PHA02517, putative transposase OrfB; Reviewed	NA|445aa|up_0|NZ_CP046335.1_1146375_1147710_-	PRK00701, PRK00701, divalent metal cation transporter MntH	NA|212aa|down_0|NZ_CP046335.1_1148539_1149175_-	cd01841, NnaC_like, NnaC (CMP-NeuNAc synthetase) _like subfamily of SGNH_hydrolases, a diverse family of lipases and esterases	NA|276aa|down_1|NZ_CP046335.1_1149176_1150004_-	COG0561, Cof, Predicted hydrolases of the HAD superfamily [General function prediction only]	NA|442aa|down_2|NZ_CP046335.1_1150025_1151351_-	pfam09587, PGA_cap, Bacterial capsule synthesis protein PGA_cap	NA|204aa|down_3|NZ_CP046335.1_1151525_1152137_+	PRK00150, def, peptide deformylase; Reviewed	NA|243aa|down_4|NZ_CP046335.1_1152181_1152910_+	COG0566, SpoU, rRNA methylases [Translation, ribosomal structure and biogenesis]	NA|304aa|down_5|NZ_CP046335.1_1152949_1153861_-	TIGR01292, Thioredoxin_reductase, thioredoxin-disulfide reductase	NA|79aa|down_6|NZ_CP046335.1_1153926_1154163_-	pfam13268, DUF4059, Protein of unknown function (DUF4059)	NA|248aa|down_7|NZ_CP046335.1_1154228_1154972_-	COG1126, GlnQ, ABC-type polar amino acid transport system, ATPase component [Amino acid transport and metabolism]	NA|267aa|down_8|NZ_CP046335.1_1154971_1155772_-	COG0765, HisM, ABC-type amino acid transport system, permease component [Amino acid transport and metabolism]	NA|119aa|down_9|NZ_CP046335.1_1155929_1156286_-	cd03036, ArsC_like, Arsenate Reductase (ArsC) family, unknown subfamily; uncharacterized proteins containing a CXXC motif with similarity to thioredoxin (TRX)-fold arsenic reductases, ArsC
GCF_009730515.1_ASM973051v1	NZ_CP046335	Streptococcus mitis strain FDAARGOS_684 chromosome, complete genome	4	1302196-1302297	3	CRISPRCasFinder	no		DEDDh,cas3,DinG,RT,csa3	Orphan	CTCGTGAAAAAAAGACTCAACCGGGTCT	28	0	0	NA	NA	NA	1	1	Orphan	DEDDh,cas3,DinG,RT,csa3	NA,NA	NA|482aa|up_9|NZ_CP046335.1_1294562_1296008_+	PRK14022, PRK14022, UDP-N-acetylmuramoyl-L-alanyl-D-glutamate--L-lysine ligase	NA|64aa|up_8|NZ_CP046335.1_1296054_1296246_-	COG3237, COG3237, Uncharacterized protein conserved in bacteria [Function unknown]	NA|217aa|up_7|NZ_CP046335.1_1296346_1296997_-	pfam08820, DUF1803, Domain of unknown function (DUF1803)	NA|171aa|up_6|NZ_CP046335.1_1296980_1297493_-	pfam05708, Peptidase_C92, Permuted papain-like amidase enzyme, YaeF/YiiX, C92 family	NA|312aa|up_5|NZ_CP046335.1_1297579_1298515_-	PRK05427, PRK05427, putative manganese-dependent inorganic pyrophosphatase; Provisional	NA|99aa|up_4|NZ_CP046335.1_1298693_1298990_-	COG2827, COG2827, Predicted endonuclease containing a URI domain [DNA replication, recombination, and repair]	NA|250aa|up_3|NZ_CP046335.1_1298979_1299729_-	COG4123, COG4123, Predicted O-methyltransferase [General function prediction only]	NA|121aa|up_2|NZ_CP046335.1_1299781_1300144_-	PRK07252, PRK07252, S1 RNA-binding domain-containing protein	NA|467aa|up_1|NZ_CP046335.1_1300145_1301546_-	COG0652, PpiB, Peptidyl-prolyl cis-trans isomerase (rotamase) - cyclophilin family [Posttranslational modification, protein turnover, chaperones]	NA|168aa|up_0|NZ_CP046335.1_1301625_1302129_-	pfam00583, Acetyltransf_1, Acetyltransferase (GNAT) family	NA|80aa|down_0|NZ_CP046335.1_1302302_1302542_-	PRK00391, rpsR, 30S ribosomal protein S18; Reviewed	NA|157aa|down_1|NZ_CP046335.1_1302573_1303044_-	PRK07275, PRK07275, single-stranded DNA-binding protein; Provisional	NA|97aa|down_2|NZ_CP046335.1_1303055_1303346_-	PRK00453, rpsF, 30S ribosomal protein S6; Reviewed	NA|2172aa|down_3|NZ_CP046335.1_1303509_1310025_-	pfam17966, Mub_B2, Mub B2-like domain	NA|1932aa|down_4|NZ_CP046335.1_1310457_1316253_-	pfam17966, Mub_B2, Mub B2-like domain	NA|448aa|down_5|NZ_CP046335.1_1316512_1317856_-	PRK03932, asnC, asparaginyl-tRNA synthetase; Validated	NA|393aa|down_6|NZ_CP046335.1_1317878_1319057_-	PRK05764, PRK05764, aspartate aminotransferase; Provisional	NA|144aa|down_7|NZ_CP046335.1_1319053_1319485_-	COG5353, COG5353, Uncharacterized protein conserved in bacteria [Function unknown]	NA|182aa|down_8|NZ_CP046335.1_1319901_1320447_+	COG0431, COG0431, Predicted flavoprotein [General function prediction only]	NA|2738aa|down_9|NZ_CP046335.1_1320648_1328862_-	pfam08428, Rib, Rib/alpha-like repeat
GCF_009730515.1_ASM973051v1	NZ_CP046335	Streptococcus mitis strain FDAARGOS_684 chromosome, complete genome	5	1628413-1628484	4	CRISPRCasFinder	no		DEDDh,cas3,DinG,RT,csa3	Orphan	AAGTTTCCATGAGAACTATTTTA	23	0	0	NA	NA	NA	1	1	Orphan	DEDDh,cas3,DinG,RT,csa3	NA,NA|210aa|down_7|NZ_CP046335.1_1636225_1636855_-	NA|567aa|up_9|NZ_CP046335.1_1618175_1619876_-	PRK07282, PRK07282, acetolactate synthase large subunit	NA|556aa|up_8|NZ_CP046335.1_1620076_1621744_-	TIGR03599, YloV, DAK2 domain fusion protein YloV	NA|122aa|up_7|NZ_CP046335.1_1621746_1622112_-	COG1302, COG1302, Uncharacterized protein conserved in bacteria [Function unknown]	NA|63aa|up_6|NZ_CP046335.1_1622267_1622456_-	PRK00359, rpmB, 50S ribosomal protein L28; Reviewed	NA|225aa|up_5|NZ_CP046335.1_1622562_1623237_-	COG4758, COG4758, Predicted membrane protein [Function unknown]	NA|149aa|up_4|NZ_CP046335.1_1623241_1623688_-	COG3279, LytT, Response regulator of the LytR/AlgR family [Transcription / Signal transduction mechanisms]	NA|172aa|up_3|NZ_CP046335.1_1623801_1624317_-	cd01610, PAP2_like, PAP2_like proteins, a super-family of histidine phosphatases and vanadium haloperoxidases, includes type 2 phosphatidic acid phosphatase or lipid phosphate phosphatase (LPP), Glucose-6-phosphatase, Phosphatidylglycerophosphatase B and bacterial acid phosphatase, vanadium chloroperoxidases, vanadium bromoperoxidases, and several other mostly uncharacterized subfamilies	NA|587aa|up_2|NZ_CP046335.1_1624400_1626161_-	COG1132, MdlB, ABC-type multidrug transport system, ATPase and permease components [Defense mechanisms]	NA|590aa|up_1|NZ_CP046335.1_1626150_1627920_-	COG1132, MdlB, ABC-type multidrug transport system, ATPase and permease components [Defense mechanisms]	NA|154aa|up_0|NZ_CP046335.1_1627903_1628365_-	COG1846, MarR, Transcriptional regulators [Transcription]	NA|239aa|down_0|NZ_CP046335.1_1629273_1629990_-	PRK12378, PRK12378, YebC/PmpR family DNA-binding transcriptional regulator	NA|457aa|down_1|NZ_CP046335.1_1630086_1631457_-	cd13138, MATE_yoeA_like, Subfamily of the multidrug and toxic compound extrusion (MATE)-like proteins similar to Bacillus subtilis yoeA	NA|385aa|down_2|NZ_CP046335.1_1631572_1632727_-	PRK09354, recA, recombinase A; Provisional	NA|419aa|down_3|NZ_CP046335.1_1632781_1634038_-	PRK00549, PRK00549, competence damage-inducible protein A; Provisional	NA|352aa|down_4|NZ_CP046335.1_1634115_1635171_-	COG1316, LytR, Transcriptional regulator [Transcription]	NA|173aa|down_5|NZ_CP046335.1_1635178_1635697_-	PRK10140, PRK10140, N-acetyltransferase	NA|148aa|down_6|NZ_CP046335.1_1635686_1636130_-	COG0802, COG0802, Predicted ATPase or kinase [General function prediction only]	NA|210aa|down_7|NZ_CP046335.1_1636225_1636855_-	NA	NA|141aa|down_8|NZ_CP046335.1_1636987_1637410_-	PRK00668, ndk, mulitfunctional nucleoside diphosphate kinase/apyrimidinic endonuclease/3'-; Validated	NA|1217aa|down_9|NZ_CP046335.1_1637715_1641366_-	PRK00566, PRK00566, DNA-directed RNA polymerase subunit beta'; Provisional
GCF_009730515.1_ASM973051v1	NZ_CP046335	Streptococcus mitis strain FDAARGOS_684 chromosome, complete genome	6	1748415-1748510	5	CRISPRCasFinder	no		DEDDh,cas3,DinG,RT,csa3	Orphan	TTATATATAAAAATTTTACACATT	24	0	0	NA	NA	NA	1	1	Orphan	DEDDh,cas3,DinG,RT,csa3	NA|291aa|up_9|NZ_CP046335.1_1738081_1738954_+,NA|196aa|up_8|NZ_CP046335.1_1739139_1739727_-,NA|51aa|down_4|NZ_CP046335.1_1751497_1751650_-	NA|291aa|up_9|NZ_CP046335.1_1738081_1738954_+	NA	NA|196aa|up_8|NZ_CP046335.1_1739139_1739727_-	NA	NA|288aa|up_7|NZ_CP046335.1_1740049_1740913_+	cd00093, HTH_XRE, Helix-turn-helix XRE-family like proteins	NA|217aa|up_6|NZ_CP046335.1_1740963_1741614_-	cd00118, LysM, Lysin Motif is a small domain involved in binding peptidoglycan	NA|224aa|up_5|NZ_CP046335.1_1741853_1742525_+	COG1760, SdaA, L-serine deaminase [Amino acid transport and metabolism]	NA|291aa|up_4|NZ_CP046335.1_1742533_1743406_+	TIGR00718, Probable_L-serine_dehydratase_alpha_chain, L-serine dehydratase, iron-sulfur-dependent, alpha subunit	NA|211aa|up_3|NZ_CP046335.1_1743427_1744060_+	cd04302, HAD_5NT, haloacid dehalogenase (HAD)-like 5'-nucleotidases similar to the Pseudomonas aeruginosa PA0065	NA|617aa|up_2|NZ_CP046335.1_1744195_1746046_-	COG1086, COG1086, Predicted nucleoside-diphosphate sugar epimerases [Cell envelope biogenesis, outer membrane / Carbohydrate transport and metabolism]	NA|277aa|up_1|NZ_CP046335.1_1746237_1747068_-	cd04195, GT2_AmsE_like, GT2_AmsE_like is involved in exopolysaccharide amylovora biosynthesis	NA|389aa|up_0|NZ_CP046335.1_1747222_1748389_+	cd17339, MFS_NIMT_CynX_like, 2-nitroimidazole and cyanate transporters and similar proteins of the Major Facilitator Superfamily of transporters	NA|109aa|down_0|NZ_CP046335.1_1748527_1748854_+	COG1695, COG1695, Predicted transcriptional regulators [Transcription]	NA|198aa|down_1|NZ_CP046335.1_1748840_1749434_+	COG4709, COG4709, Predicted membrane protein [Function unknown]	NA|316aa|down_2|NZ_CP046335.1_1749426_1750374_+	pfam13349, DUF4097, Putative adhesin	NA|355aa|down_3|NZ_CP046335.1_1750436_1751501_+	pfam10310, DUF5427, Family of unknown function (DUF5427)	NA|51aa|down_4|NZ_CP046335.1_1751497_1751650_-	NA	NA|287aa|down_5|NZ_CP046335.1_1751724_1752585_-	pfam14132, DUF4299, Domain of unknown function (DUF4299)	NA|329aa|down_6|NZ_CP046335.1_1752706_1753693_-	PRK00142, PRK00142, rhodanese-related sulfurtransferase	NA|101aa|down_7|NZ_CP046335.1_1753824_1754127_-	pfam08951, EntA_Immun, Enterocin A Immunity	NA|230aa|down_8|NZ_CP046335.1_1754177_1754867_-	pfam02517, Abi, CAAX protease self-immunity	NA|495aa|down_9|NZ_CP046335.1_1755099_1756584_-	pfam12010, DUF3502, Domain of unknown function (DUF3502)
