# 65 CORRECT # 13 ALMOST_CORRECT # 5 MISSED # 3 WRONG # 2 OVERPRED # 1 ACCEPTABLE # # 0 MISSED in Core () # 3 MISSED not in Core # # # 0 MISSED in ChloroDB () # 3 MISSED not in ChloroDB # FILE1 psba psbA 540 1601 R 1 1062 Ok MTAILERRESESLWGRFCNWITSTENRLYIGWFGVLMIPTLLTATSVFIIAFIAAPPVDIDGIREPVSGSLLYGNNIISGAIIPTSAAIGLHFYPIWEAASVDEWLYNGGPYELIVLHFLLGVACYMGREWELSFRLGMRPWIAVAYSAPVAAATAVFLIYPIGQGSFSDGMPLGISGTFNFMIVFQAEHNILMHPFHMLGVAGVFGGSLFSAMHGSLVTSSLIRETTENESANEGYRFGQEEETYNIVAAHGYFGRLIFQYASFNNSRSLHFFLAAWPVVGIWFTALGISTMAFNLNGFNFNQSVVDSQGRVINTWADIINRANLGMEVMHERNAHNFPLDLAAIEAPSTNG photosystem_II_protein_D1 FILE2 psba psbA 540 1601 R 1 1062 Ok MTAILERRESESLWGRFCNWITSTENRLYIGWFGVLMIPTLLTATSVFIIAFIAAPPVDIDGIREPVSGSLLYGNNIISGAIIPTSAAIGLHFYPIWEAASVDEWLYNGGPYELIVLHFLLGVACYMGREWELSFRLGMRPWIAVAYSAPVAAATAVFLIYPIGQGSFSDGMPLGISGTFNFMIVFQAEHNILMHPFHMLGVAGVFGGSLFSAMHGSLVTSSLIRETTENESANEGYRFGQEEETYNIVAAHGYFGRLIFQYASFNNSRSLHFFLAAWPVVGIWFTALGISTMAFNLNGFNFNQSVVDSQGRVINTWADIINRANLGMEVMHERNAHNFPLDLAAIEAPSTNG photosystem_II_protein_D1 MATCH psba ID 100 CORRECT FILE1 matk matK 2127 3656 R 1 1530 Ok MEEIHRYLQPDSSQQHNFLYPLIFQEYIYALAQDHGLNRNRSILLENSGYNNKFSFLIVKRLITRMDQQNHLIISTNDSNKNPFLGCNKSLYSQMISEGFACIVEIPFSIRLISSLSSFEGKKIFKSHNLRSIHSTFPFLEDNFSHLNYVLDILIPYPVHLEILVQTLRYWVKDASSLHLLRFFLHEYCNLNSLITSKKPGYSFSKKNQRFFFFLYNSYVYECESTFVFLRNQSSHLRSTSFGALLERIYFYGKIERLVEAFAKDFQVTLWLFKDPVMHYVRYEGKSILASKGTFPWMNKWKFYLVNFWQCHFSMYFNTGRIHINQLSNHSRDFMGYLSSVRLNHSMVRSQMLENSFLINNPIKKFDTLVPIIPLIGSLAKAHFCTGLGHPISKPVWSDLSDSDIIDRFGRICRNLFHYYSGSSKKKTLYRIKYILRLSCARTLARKHKSTVRTFLKRSGSELLEEFLTSEEEVLSLTFPRASSSLWGVYRSRIWYLDIFCINDLANSQ maturase FILE2 matk matK 2127 3656 R 1 1530 Ok MEEIHRYLQPDSSQQHNFLYPLIFQEYIYALAQDHGLNRNRSILLENSGYNNKFSFLIVKRLITRMDQQNHLIISTNDSNKNPFLGCNKSLYSQMISEGFACIVEIPFSIRLISSLSSFEGKKIFKSHNLRSIHSTFPFLEDNFSHLNYVLDILIPYPVHLEILVQTLRYWVKDASSLHLLRFFLHEYCNLNSLITSKKPGYSFSKKNQRFFFFLYNSYVYECESTFVFLRNQSSHLRSTSFGALLERIYFYGKIERLVEAFAKDFQVTLWLFKDPVMHYVRYEGKSILASKGTFPWMNKWKFYLVNFWQCHFSMYFNTGRIHINQLSNHSRDFMGYLSSVRLNHSMVRSQMLENSFLINNPIKKFDTLVPIIPLIGSLAKAHFCTGLGHPISKPVWSDLSDSDIIDRFGRICRNLFHYYSGSSKKKTLYRIKYILRLSCARTLARKHKSTVRTFLKRSGSELLEEFLTSEEEVLSLTFPRASSSLWGVYRSRIWYLDIFCINDLANSQ maturase_K MATCH matk ID 100 CORRECT FILE1 rps16 rps16 4937 6067 R 2 267 Ok MVKLRLKRCGRKQRAVYRIVAIDVRSRREGKDLQKVGFYDPIKNQTYLNVPAILYFLEKGAQPTETVQDILKKAEVFKELRLNQPKFN ribosomal_protein_S16 FILE2 rps16 rps16 4937 6067 R 2 267 Ok MVKLRLKRCGRKQRAVYRIVAIDVRSRREGKDLQKVGFYDPIKNQTYLNVPAILYFLEKGAQPTETVQDILKKAEVFKELRLNQPKFN ribosomal_protein_S16 MATCH rps16 ID 100 CORRECT FILE1 psbk psbK 7587 7772 D 1 186 Ok MLNTFSLIGICLNSTLYSSSFFFGKLPEAYAFLNPIVDIMPVIPLFFFLLAFVWQAAVSFR photosystem_II_protein_K FILE2 psbk psbK 7587 7772 D 1 186 Ok MLNTFSLIGICLNSTLYSSSFFFGKLPEAYAFLNPIVDIMPVIPLFFFLLAFVWQAAVSFR photosystem_II_protein_K MATCH psbk ID 100 CORRECT FILE1 psbi psbI 8134 8244 D 1 111 Ok MLTLKLFVYTVVIFFVSLFIFGFLSNDPGRNPGREE photosystem_II_protein_I FILE2 psbi psbI 8083 8244 D 1 162 Ok MIYSLFFFQKNHLGDCVMLTLKLFVYTVVIFFVSLFIFGFLSNDPGRNPGREE photosystem_II_protein_I MATCH psbi ID 67 WRONG.BAD_START FILE1 atpa atpA 10224 11747 R 1 1524 Ok MVTIRADEISNIIRERIEQYNREVKIVNTGTVLQVGDGIARIHGLDEVMAGELVEFEEGTIGIALNLESNNVGVVLMGDGLLIQEGSSVKATGRIAQIPVSEAYLGRVVNALAKPIDGRGEISASEFRLIESAAPGIISRRSVYEPLQTGLIAIDSMIPIGRGQRELIIGDRQTGKTAVATDTILNQQGQNVICVYVAIGQKASSVAQVVTTLQERGAMEYTIVVAETADSPATLQYLAPYTGAALAEYFMYRERHTLIIYDDLSKQAQAYRQMSLLLRRPPGREAYPGDVFYLHSRLLERAAKLSSSLGEGSMTALPIVETQSGDVSAYIPTNVISITDGQIFLSADLFNSGIRPAINVGISVSRVGSAAQIKAMKQVAGKLKLELAQFAELEAFAQFASDLDKATQNQLARGQRLRELLKQSQSAPLTVEEQIMTIYTGTNGYLDSLEVGQVRKFLVELRTYLKTTKPQFQEIISSTKTFTEEAEALLKEAIQEQMDRFILQEQA ATP_synthase_CF1_alpha_chain FILE2 atpa atpA 10224 11747 R 1 1524 Ok MVTIRADEISNIIRERIEQYNREVKIVNTGTVLQVGDGIARIHGLDEVMAGELVEFEEGTIGIALNLESNNVGVVLMGDGLLIQEGSSVKATGRIAQIPVSEAYLGRVVNALAKPIDGRGEISASEFRLIESAAPGIISRRSVYEPLQTGLIAIDSMIPIGRGQRELIIGDRQTGKTAVATDTILNQQGQNVICVYVAIGQKASSVAQVVTTLQERGAMEYTIVVAETADSPATLQYLAPYTGAALAEYFMYRERHTLIIYDDLSKQAQAYRQMSLLLRRPPGREAYPGDVFYLHSRLLERAAKLSSSLGEGSMTALPIVETQSGDVSAYIPTNVISITDGQIFLSADLFNSGIRPAINVGISVSRVGSAAQIKAMKQVAGKLKLELAQFAELEAFAQFASDLDKATQNQLARGQRLRELLKQSQSAPLTVEEQIMTIYTGTNGYLDSLEVGQVRKFLVELRTYLKTTKPQFQEIISSTKTFTEEAEALLKEAIQEQMDRFILQEQA ATP_synthase_CF1_alpha_subunit MATCH atpa ID 100 CORRECT FILE1 atpf atpF 11803 13043 R 2 555 Ok MKNVTDSFVSLGHWPSAGSFGFNTDILATNPINLSVVLGVLIFFGKGVLSDLLDNRKQRILNTIRNSEELRGGAIEQLEKARSRLRKVETEAEQFRVNGYSEIEREKLNLINSTYKTLEQLENYKNETIQFEQQRAINQVRQRVFQQALRGALGTLNSCLNNELHLRTISANIGMLGTMKEITD ATP_synthase_CF0_B_chain FILE2 atpf atpF 11803 13043 R 2 555 Ok MKNVTDSFVSLGHWPSAGSFGFNTDILATNPINLSVVLGVLIFFGKGVCGDLLDNRKQRILNTIRNSEELRGGAIEQLEKARSRLRKVETEAEQFRVNGYSEIEREKLNLINSTYKTLEQLENYKNETIQFEQQRAINQVRQRVFQQALRGALGTLNSCLNNELHLRTISANIGMLGTMKEITD ATP_synthase_CF0_B_subunit MATCH atpf ID 98 ALMOST_CORRECT.BAD_JUNCTION FILE1 atph atpH 13442 13687 R 1 246 Ok MNPLISAASVIAAGLAVGLASIGPGVGQGTAAGQAVEGIARQPEAEGKIRGTLLLSLAFMEALTIYGLVVALALLFANPFV ATP_synthase_CF0_C_chain FILE2 atph atpH 13442 13687 R 1 246 Ok MNPLISAASVIAAGLAVGLASIGPGVGQGTAAGQAVEGIARQPEAEGKIRGTLLLSLAFMEALTIYGLVVALALLFANPFV ATP_synthase_CF0_C_subunit MATCH atph ID 100 CORRECT FILE1 atpi atpI 14845 15588 R 1 744 Ok MNVLSCSINTLKGLYDISGVEVGQHFYWQIGGFQVHGQVLITSWVVIAILLGSATIAVRNPQTIPTGGQNFFEYVLEFIRDVSKTQIGEEYGPWVPFIGTMFLFIFVSNWSGALLPWKIIQLPHGELAAPTNDINTTVALALLTSVAYFYAGLTKRGLGYFGKYIQPTPILLPINILEDFTKPLSLSFRLFGNILADELVVVVLVSLVPLVVPIPVMLLGLFTSGIQALIFATLAAAYIGESMEGHH ATP_synthase_CF0_A_chain FILE2 atpi atpI 14845 15588 R 1 744 Ok MNVLSCSINTLKGLYDISGVEVGQHFYWQIGGFQVHGQVLITSWVVIAILLGSATIAVRNPQTIPTGGQNFFEYVLEFIRDVSKTQIGEEYGPWVPFIGTMFLFIFVSNWSGALLPWKIIQLPHGELAAPTNDINTTVALALLTSVAYFYAGLTKRGLGYFGKYIQPTPILLPINILEDFTKPLSLSFRLFGNILADELVVVVLVSLVPLVVPIPVMLLGLFTSGIQALIFATLAAAYIGESMEGHH ATP_synthase_CF0_A_subunit MATCH atpi ID 100 CORRECT FILE1 rps2 rps2 15825 16535 R 1 711 Ok MTRRYWNINLEEMMEAGVHFGHGTRKWNPKMAPYISAKRKGIHITNLTRTARFLSEACDLVFDAASRGKQFLIVGTKNKAADSVEWAAIRARCHYVNKKWLGGMLTNWSTTETRLHKFRDLRMEQKTGRLNRLPKRDAAMLKRQLSRLQTYLGGIKYMTGVPDIVIIVDQHEEYTALRECITLGIPTICLTDTNCDPDLADISIPANDDAISSIRLILNKLVFAICEGRSSYIRNP ribosomal_protein_S2 FILE2 rps2 rps2 15825 16535 R 1 711 Ok MTRRYWNINLEEMMEAGVHFGHGTRKWNPKMAPYISAKRKGIHITNLTRTARFLSEACDLVFDAASRGKQFLIVGTKNKAADSVEWAAIRARCHYVNKKWLGGMLTNWSTTETRLHKFRDLRMEQKTGRLNRLPKRDAAMLKRQLSRLQTYLGGIKYMTGVPDIVIIVDQHEEYTALRECITLGIPTICLTDTNCDPDLADISIPANDDAISSIRLILNKLVFAICEGRSSYIRNP ribosomal_protein_S2 MATCH rps2 ID 100 CORRECT FILE1 rpoc2 rpoC2 16761 20939 R 1 4179 Ok MEVLMAERANLVFHNKAIDGTAMKRLISRLIEHFGMAYTSHILDQVKTLGFQQATATSISLGIDDLLTIPSKGWLVQDAEQQSLILEKHHQYGNVHAVEKLRQSIEIWYATSEYLRQEMNPNFRMTDPFNPVHIMSFSGARGNASQVHQLVGMRGLMSDPQGQMIDLPIQSNLREGLSLTEYIISCYGARKGVVDTAVRTSDAGYLTRRLVEVVQHIVVRRTDCGTARGISVSPRNGIMPERIFSQTLIGRVLADDIYMGSRCIATRNQAIGIGLVNRFITFRAQPISIRTPFTCRSTSWICRLCYGRSPTHGDLVELGEAVGIIAGQSIGEPGTQLTLRTFHTGGVFTGGTAEHVRAPSNGKIKFNEDLVHPTRTRHGHPAFLCSIDLYVTIESEDILHNVNIPPKSLLLVQNDQYVESEQVIAEIRAGISTLNFKEKVRKHIYSDSDGEMHWSTDVYHAPEFTYGNVHLLPKTSHLWILLGGPCRSSLVYLSIHKDQDQMNAHSLSGKRRYTSNLSVTNDQARQKLFSSDFYGQKEDRIPDYSDLNRIICTGQYNLVYSPILHGNSALLSKRRRNKFIIPLHSIQELENELMPCSGISIEIPVNGIFRRNSILAYFDDPRYRRKSSGIIKYGTIETHSVIKKEDLIEYRGVKEFRPKYQMKVDRFFFIPEEVHILPGSSSLMVRNNSIVGVDTQITLNLRSRVGGLVRVERKKKRIELKIFSGDIHFPGETDKISRHTGVLIPPGTGKRNSKEYKKVQNWIYVQRITPSKKRFFVLVRPVVTYEITDGINLGTLFPPDPLQERDNVQLRIVNYILYGNGKPIRGISDTSIQLVRTCLVLNWNQDKKSSSCEEARASFVEIRTNGLIRHFLKINLVKSPISYIGKRNDPSGSGLLSDNGSDCTNINPFSAIYSYSKAKIQQSLNQPQGTIHTLLNRNKECQSLIILSAANCSRMEPFKDVKYHSVIKESIKKDPLIPIRNSLGPLGTCLPIENFYSSYHLITHNQILVTKYLQLDNLKQTFQVIKLKYYLMDENGKIFNPDPCRNIILNPFNLNWSFLHHYYCAETSKIISLGQFICENVCIAKNGPPLKSGQVILVQVDSIVIRSAKPYLATPGATVHGHYGETLYEGDTLVTFIYEKSRSGDITQGLPKVEQVLEVRSIDSISMNLEKRVEGWNKCIPRILGIPWGFLIGAELTIAQSRISLVNKIQQVYRSQGVQIHNRHIEIIVRQITSKVLISEDGMSNVFSPGELIGLLRAERMGRALEEAICYRVVLLGITRASLNTQSFISEASFQETARVLAKAALRGRIDWLKGLKENVVLGGVIPVGTGFKGLVHPSKQHNNIPLETKKTNLFEGEMRDILFHHRKLFDSCLSKKFHDIPEQSFIGFNDS RNA_polymerase_beta''_chain FILE2 rpoc2 rpoC2 16761 20939 R 1 4179 Ok MEVLMAERANLVFHNKAIDGTAMKRLISRLIEHFGMAYTSHILDQVKTLGFQQATATSISLGIDDLLTIPSKGWLVQDAEQQSLILEKHHQYGNVHAVEKLRQSIEIWYATSEYLRQEMNPNFRMTDPFNPVHIMSFSGARGNASQVHQLVGMRGLMSDPQGQMIDLPIQSNLREGLSLTEYIISCYGARKGVVDTAVRTSDAGYLTRRLVEVVQHIVVRRTDCGTARGISVSPRNGIMPERIFSQTLIGRVLADDIYMGSRCIATRNQAIGIGLVNRFITFRAQPISIRTPFTCRSTSWICRLCYGRSPTHGDLVELGEAVGIIAGQSIGEPGTQLTLRTFHTGGVFTGGTAEHVRAPSNGKIKFNEDLVHPTRTRHGHPAFLCSIDLYVTIESEDILHNVNIPPKSLLLVQNDQYVESEQVIAEIRAGISTLNFKEKVRKHIYSDSDGEMHWSTDVYHAPEFTYGNVHLLPKTSHLWILLGGPCRSSLVYLSIHKDQDQMNAHSLSGKRRYTSNLSVTNDQARQKLFSSDFYGQKEDRIPDYSDLNRIICTGQYNLVYSPILHGNSALLSKRRRNKFIIPLHSIQELENELMPCSGISIEIPVNGIFRRNSILAYFDDPRYRRKSSGIIKYGTIETHSVIKKEDLIEYRGVKEFRPKYQMKVDRFFFIPEEVHILPGSSSLMVRNNSIVGVDTQITLNLRSRVGGLVRVERKKKRIELKIFSGDIHFPGETDKISRHTGVLIPPGTGKRNSKEYKKVQNWIYVQRITPSKKRFFVLVRPVVTYEITDGINLGTLFPPDPLQERDNVQLRIVNYILYGNGKPIRGISDTSIQLVRTCLVLNWNQDKKSSSCEEARASFVEIRTNGLIRHFLKINLVKSPISYIGKRNDPSGSGLLSDNGSDCTNINPFSAIYSYSKAKIQQSLNQPQGTIHTLLNRNKECQSLIILSAANCSRMEPFKDVKYHSVIKESIKKDPLIPIRNSLGPLGTCLPIENFYSSYHLITHNQILVTKYLQLDNLKQTFQVIKLKYYLMDENGKIFNPDPCRNIILNPFNLNWSFLHHYYCAETSKIISLGQFICENVCIAKNGPPLKSGQVILVQVDSIVIRSAKPYLATPGATVHGHYGETLYEGDTLVTFIYEKSRSGDITQGLPKVEQVLEVRSIDSISMNLEKRVEGWNKCIPRILGIPWGFLIGAELTIAQSRISLVNKIQQVYRSQGVQIHNRHIEIIVRQITSKVLISEDGMSNVFSPGELIGLLRAERMGRALEEAICYRVVLLGITRASLNTQSFISEASFQETARVLAKAALRGRIDWLKGLKENVVLGGVIPVGTGFKGLVHPSKQHNNIPLETKKTNLFEGEMRDILFHHRKLFDSCLSKKFHDIPEQSFIGFNDS RNA_polymerase_beta''_subunit MATCH rpoc2 ID 100 CORRECT FILE1 rpoc1 rpoC1 21080 23883 R 2 2067 Ok MNNNFSSMIDRYKHQQLRIGSVSPQQISAWATKILPNGEIVGEVTKPYTFHYKTNKPEKDGLFCERIFGPIKSGICACGNYRVIGDEKEDPKFCEQCGVEFVDSRIRRYQMGYIKLACPVTHVWYLKRLPSYIANLLDKPLKELEGLVYCDFSFARPITKKPTFLRLRGLFEYEIQSWKYSIPLFFTTQGFDTFRNREISTGAGAIREQLADLDLRIIIENSLVEWEELGEEGHTGNEWEDRKVGRRKDFLVRRVELAKHFIRTNIEPEWMVLCLLPVLPPELRPIIQIDGGKLMSSDINELYRRVIYRNNTLTDLLTTSRSTPGELVMCQEKLVQEAVDTLLDNGIRGQPMRDGHNKVYKSFSDVIEGKEGRFRETLLGKRVDYSGRSVIVVGPSLSLHRCGLPREIAIELFQTFVIRGLIRQHLASNIGVAKSKIREKEPIVWEILQEVMQGHPVLLNRAPTLHRLGIQAFQPVLVEGRAICLHPLVCKGFNADFDGDQMAVHVPLSLEAQVEARLLMFSHMNLLSPAIGDPISVPTQDMLIGLYVLTSGNHRGICVNRYNPCNRRNYQNQKRSDNSYYKYTKEPFFSNSYDAIGAYRQKRINLDSPLWLRWRLDQRVIASRETPIEVHYESLGTFYEIYGHYLIVRSLKKKILFIYIRTTVGHIALYREIEEAIQGFSRAYSYAT RNA_polymerase_beta'_chain FILE2 rpoc1 rpoC1 21080 23883 R 2 2067 Ok MNNNFSSMIDRYKHQQLRIGSVSPQQISAWATKILPNGEIVGEVTKPYTFHYKTNKPEKDGLFCERIFGPIKSGICACGNYRVIGDEKEDPKFCEQCGVEFVDSRIRRYQMGYIKLACPVTHVWYLKRLPSYIANLLDKPLKELEGLVYCDFSFARPITKKPTFLRLRGLFEYEIQSWKYSIPLFFTTQGFDTFRNREISTGAGAIREQLADLDLRIIIENSLVEWEELGEEGHTGNEWEDRKVGRRKDFLVRRVELAKHFIRTNIEPEWMVLCLLPVLPPELRPIIQIDGGKLMSSDINELYRRVIYRNNTLTDLLTTSRSTPGELVMCQEKLVQEAVDTLLDNGIRGQPMRDGHNKVYKSFSDVIEGKEGRFRETLLGKRVDYSGRSVIVVGPSLSLHRCGLPREIAIELFQTFVIRGLIRQHLASNIGVAKSKIREKEPIVWEILQEVMQGHPVLLNRAPTLHRLGIQAFQPVLVEGRAICLHPLVCKGFNADFDGDQMAVHVPLSLEAQVEARLLMFSHMNLLSPAIGDPISVPTQDMLIGLYVLTSGNHRGICVNRYNPCNRRNYQNQKRSDNSYYKYTKEPFFSNSYDAIGAYRQKRINLDSPLWLRWRLDQRVIASRETPIEVHYESLGTFYEIYGHYLIVRSLKKKILFIYIRTTVGHIALYREIEEAIQGFSRAYSYAT RNA_polymerase_beta'_subunit MATCH rpoc1 ID 100 CORRECT FILE1 rpob rpoB 23889 27101 R 1 3213 Ok MLGDGNEGISTIPGFNQIQFEGFCRFIDQGLTEELYKFPKIEDTDQEIEFQLFVETYQLVEPLIKERDAVYESLTYSSELYVSAGLIWKNSRDMQEQTIFIGNIPLMNSLGTSIVNGIYRIVINQILQSPGIYYRSELDHNGISVYTGTIISDWGGRSELEIDRKARIWARVSRKQKISILVLSSAMGLNLREILENVCYPEIFLSFLNDKERKKIGSKENSILEFYQQFACVGGDPVFSESLCKELQKKFFQQRCELGRIGRRNMNRKLNLDIPQNNTFLLPRDILAAADHLIGLKFGMGALDDMNHLKNKRIRSVADLLQDQFGLALVRLENVVRGTICGAIRHKLIPTPQNLVTSPPLTTTYESFFGLHPLSQVLDRTNPLTQIVHGRKLSYLGPGGLTGRTASFRIRDIHPSHYGRICPIDTSEGINVGLIGSLSIHARIGHWGSLESPFYEISERSTGVRMLYLSPGSDEYYMVAAGNSLALNRDIQEEQVVPARYRQEFLTIAWEQVHLRSIFPFQYFSIGASLIPFIEHNDANRALMSSNMQRQAVPLSRSEKCIVGTGLERQAALDSGALAIAEREGRIVYTNTHKILLAGNGDILSIPLVIYQRSNKNTCMHQKFRVPRGKCIKKGQILADGAATVGGELALGKNVLVAYMPWEGYNSEDAVLISERLVYEDIYTSFHIRKYEIHTHVTSQGPEKVTNEIPHLEAHLLRNLDKKGIVMLGSWVETGDILVGKLTPQVVKESSYAPEDRLLRAILGIQVSTSKETCLKLPIGGRGRVIDVRWIQKRGGSSYNPETIRVYISQKREIKVGDKVAGRHGNKGIISKILPRQDMPYLQDGRSVDMVFNPLGVPSRMNVGQIFECSLGLAGSLLDRHYRIAPFDERYEQEASRKLVFSELYEASKQTANPWVFEPEYPGKSRIFDGRTGNPFEQPVIIGKPYILKLIHQVDDKIHGRSSGHYALVTQQPLRGRAKQGGQRVGEMEVWALEGFGVAHILQEMLTYKSDHIRARQEVLGTTIIGGTIPNPEDAPESFRLLVRELRSLALELNHFLVSEKNFQINRKEA RNA_polymerase_beta_chain FILE2 rpob rpoB 23889 27101 R 1 3213 Ok MLGDGNEGISTIPGFNQIQFEGFCRFIDQGLTEELYKFPKIEDTDQEIEFQLFVETYQLVEPLIKERDAVYESLTYSSELYVSAGLIWKNSRDMQEQTIFIGNIPLMNSLGTSIVNGIYRIVINQILQSPGIYYRSELDHNGISVYTGTIISDWGGRSELEIDRKARIWARVSRKQKISILVLSSAMGLNLREILENVCYPEIFLSFLNDKERKKIGSKENSILEFYQQFACVGGDPVFSESLCKELQKKFFQQRCELGRIGRRNMNRKLNLDIPQNNTFLLPRDILAAADHLIGLKFGMGALDDMNHLKNKRIRSVADLLQDQFGLALVRLENVVRGTICGAIRHKLIPTPQNLVTSPPLTTTYESFFGLHPLSQVLDRTNPLTQIVHGRKLSYLGPGGLTGRTASFRIRDIHPSHYGRICPIDTSEGINVGLIGSLSIHARIGHWGSLESPFYEISERSTGVRMLYLSPGSDEYYMVAAGNSLALNRDIQEEQVVPARYRQEFLTIAWEQVHLRSIFPFQYFSIGASLIPFIEHNDANRALMSSNMQRQAVPLSRSEKCIVGTGLERQAALDSGALAIAEREGRIVYTNTHKILLAGNGDILSIPLVIYQRSNKNTCMHQKFRVPRGKCIKKGQILADGAATVGGELALGKNVLVAYMPWEGYNSEDAVLISERLVYEDIYTSFHIRKYEIHTHVTSQGPEKVTNEIPHLEAHLLRNLDKKGIVMLGSWVETGDILVGKLTPQVVKESSYAPEDRLLRAILGIQVSTSKETCLKLPIGGRGRVIDVRWIQKRGGSSYNPETIRVYISQKREIKVGDKVAGRHGNKGIISKILPRQDMPYLQDGRSVDMVFNPLGVPSRMNVGQIFECSLGLAGSLLDRHYRIAPFDERYEQEASRKLVFSELYEASKQTANPWVFEPEYPGKSRIFDGRTGNPFEQPVIIGKPYILKLIHQVDDKIHGRSSGHYALVTQQPLRGRAKQGGQRVGEMEVWALEGFGVAHILQEMLTYKSDHIRARQEVLGTTIIGGTIPNPEDAPESFRLLVRELRSLALELNHFLVSEKNFQINRKEA RNA_polymerase_beta_subunit MATCH rpob ID 100 CORRECT FILE1 petn petN 29144 29233 D 1 90 Ok MDIVSLAWAALMVVFTFSLSLVVWGRSGL cytochrome_b6_/f_complex_subunit_VIII FILE2 petn petN 29135 29233 D 1 99 Ok MIHMDIVSLAWAALMVVFTFSLSLVVWGRSGL cytochrome_b6/f_complex_subunit_VIII MATCH petn ID 90 ALMOST_CORRECT.BAD_START FILE1 psbm psbM 30341 30445 R 1 105 Ok MEVNILAFIATALFILVPTAFLLIIYVKTVSQND photosystem_II_protein_M FILE2 psbm psbM 30341 30451 R 1 111 Ok EIMEVNILAFIATALFILVPTAFLLIIYVKTVSQND photosystem_II_protein_M MATCH psbm ID 94 ALMOST_CORRECT.BAD_START FILE1 psbd psbD 33585 34646 D 1 1062 Ok MTIAIGKFTKDENDLFDIMDDWLRRDRFVFVGWSGLLLFPCAYFAVGGWFTGTTFVTSWYTHGLASSYLEGCNFLTAAVSTPANSLAHSLLLLWGPEAQGDFTRWCQLGGLWTFVALHGAFGLIGFMLRQFELARSVQLRPYNAIAFSGPIAVFVSVFLIYPLGQSGWFFAPSFGVAAIFRFILFFQGFHNWTLNPFHMMGVAGVLGAALLCAIHGATVENTLFEDGDGANTFRAFNPTQAEETYSMVTANRFWSQIFGVAFSNKRWLHFFMLFVPVTGLWMSALGVVGLALNLRAYDFVSQEIRAAEDPEFETFYTKNILLNEGIRAWMAAQDQPHENLIFPEEVLPRGNAL photosystem_II_protein_D2 FILE2 psbd psbD 33585 34646 D 1 1062 Ok MTIAIGKFTKDENDLFDIMDDWLRRDRFVFVGWSGLLLFPCAYFAVGGWFTGTTFVTSWYTHGLASSYLEGCNFLTAAVSTPANSLAHSLLLLWGPEAQGDFTRWCQLGGLWTFVALHGAFGLIGFMLRQFELARSVQLRPYNAIAFSGPIAVFVSVFLIYPLGQSGWFFAPSFGVAAIFRFILFFQGFHNWTLNPFHMMGVAGVLGAALLCAIHGATVENTLFEDGDGANTFRAFNPTQAEETYSMVTANRFWSQIFGVAFSNKRWLHFFMLFVPVTGLWMSALGVVGLALNLRAYDFVSQEIRAAEDPEFETFYTKNILLNEGIRAWMAAQDQPHENLIFPEEVLPRGNAL photosystem_II_protein_D2 MATCH psbd ID 100 CORRECT FILE1 psbc psbC 34630 36015 D 1 1386 Ok METLFNGTLALAGRDQETTGFAWWAGNARLINLSGKLLGAHVAHAGLIVFWAGAMNLFEVAHFVPEKPMYEQGLILLPHLATLGWGVGPGGEVIDTFPYFVSGVLHLISSAVLGFGGIYHALLGPETLEESFPFFGYVWKDRNKMTTILGIHLILLGIGAFLLVFKALYFGGVYDTWAPGGGDVRKITNLTLSPSIIFGYLLKSPFGGEGWIVSVDDLEDIIGGHVWLGSICILGGIWHILTKPFAWARRALVWSGEAYLSYSLGALAVFGFIACCFVWFNNTAYPSEFYGPTGPEASQAQAFTFLVRDQRLGANVGSAQGPTGLGKYLMRSPTGEVIFGGETMRFWDLRAPWLEPLRGPNGLDLSRLKKDIQPWQERRSAEYMTHAPLGSLNSVGGVATEINAVNYVSPRSWLATSHFVLGFFFFVGHLWHAGRARAAAAGFEKGIDRDFEPVLSMTPLN photosystem_II_44_kDa_protein FILE2 psbc psbC 34552 36015 D 1 1464 Ok MKVFALGWRLKISLMKTLYSLRRFYHVETLFNGTLALAGRDQETTGFAWWAGNARLINLSGKLLGAHVAHAGLIVFWAGAMNLFEVAHFVPEKPMYEQGLILLPHLATLGWGVGPGGEVIDTFPYFVSGVLHLISSAVLGFGGIYHALLGPETLEESFPFFGYVWKDRNKMTTILGIHLILLGIGAFLLVFKALYFGGVYDTWAPGGGDVRKITNLTLSPSIIFGYLLKSPFGGEGWIVSVDDLEDIIGGHVWLGSICILGGIWHILTKPFAWARRALVWSGEAYLSYSLGALAVFGFIACCFVWFNNTAYPSEFYGPTGPEASQAQAFTFLVRDQRLGANVGSAQGPTGLGKYLMRSPTGEVIFGGETMRFWDLRAPWLEPLRGPNGLDLSRLKKDIQPWQERRSAEYMTHAPLGSLNSVGGVATEINAVNYVSPRSWLATSHFVLGFFFFVGHLWHAGRARAAAAGFEKGIDRDFEPVLSMTPLN photosystem_II_44_kDa_protein MATCH psbc ID 94 ALMOST_CORRECT.BAD_START FILE1 ycf9 ycf9 36709 36897 D 1 189 Ok MTLAFQLAVFALIATSLILLISVPVVFASPDGWSSNKNVVFSGTSLWIGLVFLVGILNSLIS Ycf9_protein FILE2 psbz psbZ 36709 36897 D 1 189 Ok MTLAFQLAVFALIATSLILLISVPVVFASPDGWSSNKNVVFSGTSLWIGLVFLVGILNSLIS photosystem_II_protein_Z MATCH ycf9 ID 100 CORRECT FILE1 rps14 rps14 37671 37973 R 1 303 Ok MARKSLIQREKKRQKLEQKYHSIRRSSKKEISKVPSLSDKWEIYGKLQSLPRNSAPTRLHRRCFLTGRPRANYRDFGLSGHILREMVHACLLPGATRSSW ribosomal_protein_S14 FILE2 rps14 rps14 37671 37973 R 1 303 Ok MARKSLIQREKKRQKLEQKYHSIRRSSKKEISKVPSLSDKWEIYGKLQSLPRNSAPTRLHRRCFLTGRPRANYRDFGLSGHILREMVHACLLPGATRSSW ribosomal_protein_S14 MATCH rps14 ID 100 CORRECT FILE1 psab psaB 38092 40296 R 1 2205 Ok MALRFPRFSQGLAQDPTTRRIWFGIATAHDFESHDDITEERLYQNIFASHFGQLAIIFLWTSGNLFHVAWQGNFESWVQDPLHVRPIAHAIWDPHFGQPAVEAFTRGGALGPVNIAYSGVYQWWYTIGLRTNEDLYTGALFLLFLSAISLIAGWLHLQPKWKPSVSWFKNAESRLNHHLSGLFGVSSLAWTGHLVHVAIPASRGEYVRWNNFLDVLPHPQGLGPLFTGQWNLYAQNPDSSSHLFGTAEGAGTAILTLLGGFHPQTQSLWLTDIAHHHLAIAFIFLVAGHMYRTNFGIGHSMKDLLDAHIPPGGRLGRGHKGLYDTINNSLHFQLGLALASLGVITSLVAQHMYSLPAYAFIAQDFTTQAALYTHHQYIAGFIMTGAFAHGAIFFIRDYNPEQNEDNVLARMLDHKEAIISHLSWASLFLGFHTLGLYVHNDVMLAFGTPEKQILIEPIFAQWIQSAHGKTSYGFDVLLSSTTGPAFNAGRSIWLPGWLNAVNENSNSLFLTIGPGDFLVHHAIALGLHTTTLILVKGALDARGSKLMPDKKDFGYSFPCDGPGRGGTCDISAWDAFYLAVFWMLNTIGWVTFYWHWKHITLWQGNVSQFNESSTYLMGWLRDYLWLNSSQLINGYNPFGMNSLSVWAWMFLFGHLVWATGFMFLISWRGYWQELIETLAWAHERTPLANLIRWRDKPVALSIVQARLVGLAHFSVGYIFTYAAFLIASTSGKFG photosystem_I_P700_apoprotein_A2 FILE2 psab psaB_1 38092 40296 R 1 2205 Ok MALRFPRFSQGLAQDPTTRRIWFGIATAHDFESHDDITEERLYQNIFASHFGQLAIIFLWTSGNLFHVAWQGNFESWVQDPLHVRPIAHAIWDPHFGQPAVEAFTRGGALGPVNIAYSGVYQWWYTIGLRTNEDLYTGALFLLFLSAISLIAGWLHLQPKWKPSVSWFKNAESRLNHHLSGLFGVSSLAWTGHLVHVAIPASRGEYVRWNNFLDVLPHPQGLGPLFTGQWNLYAQNPDSSSHLFGTAEGAGTAILTLLGGFHPQTQSLWLTDIAHHHLAIAFIFLVAGHMYRTNFGIGHSMKDLLDAHIPPGGRLGRGHKGLYDTINNSLHFQLGLALASLGVITSLVAQHMYSLPAYAFIAQDFTTQAALYTHHQYIAGFIMTGAFAHGAIFFIRDYNPEQNEDNVLARMLDHKEAIISHLSWASLFLGFHTLGLYVHNDVMLAFGTPEKQILIEPIFAQWIQSAHGKTSYGFDVLLSSTTGPAFNAGRSIWLPGWLNAVNENSNSLFLTIGPGDFLVHHAIALGLHTTTLILVKGALDARGSKLMPDKKDFGYSFPCDGPGRGGTCDISAWDAFYLAVFWMLNTIGWVTFYWHWKHITLWQGNVSQFNESSTYLMGWLRDYLWLNSSQLINGYNPFGMNSLSVWAWMFLFGHLVWATGFMFLISWRGYWQELIETLAWAHERTPLANLIRWRDKPVALSIVQARLVGLAHFSVGYIFTYAAFLIASTSGKFG photosystem_I_P700_chlorophyll_a_apoprotein_A2 MATCH psab ID 100 CORRECT FILE1 psaa psaA 40322 42574 R 1 2253 Ok MIIRSPEPEVKILVDRDPVKTSFEEWARPGHFSRTIAKGPDTTTWIWNLHADAHDFDSHTSDLEEISRKVFSAHFGQLSIIFLWLSGMYFHGARFSNYEAWLSDPTHIGPSAQVVWPIVGQEILNGDVGGGFRGIQITSGFFQLWRASGITSELQLYCTAIGALVFAALMLFAGWFHYHKAAPKLAWFQDVESMLNHHLAGLLGLGSLSWAGHQVHVSLPINQFLNAGVDPKEIPLPHEFILNRDLLAQLYPSFAEGATPFFTLNWSKYADFLTFRGGLDPVTGGLWLTDIAHHHLAIAILFLIAGHMYRTNWGIGHGLKDILEAHKGPFTGQGHKGLYEILTTSWHAQLSLNLAMLGSLTIVVAHHMYSMPPYPYLATDYGTQLSLFTHHMWIGGFLIVGAAAHAAIFMVRDYDPTTRYNDLLDRVLRHRDAIISHLNWACIFLGFHSFGLYIHNDTMSALGRPQDMFSDTAIQLQPVFAQWIQNTHALAPGATAPGATASTSLTWGGGDLVAVGGKVALLPIPLGTADFLVHHIHAFTIHVTVLILLKGVLFARSSRLIPDKANLGFRFPCDGPGRGGTCQVSAWDHVFLGLFWMYNSISVVIFHFSWKMQSDVWGSVSDQGVVTHITGGNFAQSSITINGWLRDFLWAQASQVIQSYGSSLSAYGLFFLGAHFVWAFSLMFLFSGRGYWQELIESIVWAHNKLKVAPATQPRALSIIQGRAVGVTHYLLGGIATTWAFFLARIIAVG photosystem_I_P700_apoprotein_A1 FILE2 psab psaB_2 40322 42574 R 2 2220 Ok MIIRSPEPEVKILVDRDPVKTSFEEWARPGHFSRTIAKGPDTTTWIWNLHADAHDFDSHTSDLEEISRKVFSAHFGQLSIIFLWLSGMYFHGARFSNYEAWLSDIWNLHADAHDFDSHTSDLEEISRKVFSAHFGQLSIIFLWLSGMYFHGARFSNYEAWLSDPTHIGPSAQVVWPIVGQEILNGDVGGGFRGIQITSGFFQLWRASGITSELQLYCTAIGAPTHIGPSAQVVWPIVGQEILNGDVGGGFRGIQITSGFFQLWRASGITSELQLYCTAIGALVFAALMLFAGWFHYHKAAPKLAWFQDVESMLNHHLAGLLGLGSLSWAGHQVHVSLPINLVFAALMLFAGWFHYHKAAPKLAWFQDVESMLNHHLAGLLGLGSLSWAGHQVHVSLPINQFLNAGVDPKEIPLPHEFILNRDLLAQLYPSFAEGATPFFTLNWSKYADFLTFRGGLDPQFLNAGVDPKEIPLPHEFILNRDLLAQLYPSFAEGATPFFTLNWSKYADFLTFRGGLDPVTGGLWLTDIAHHHLAIAILFLIAGHMYRTNWGIGHGLKDILEAHKGPFTGQGHKGLYEVTGGLWLTDIAHHHLAIAILFLIAGHMYRTNWGIGHGLKDILEAHKGPFTGQGHKGLYEILTTSWHAQLSLNLAMLGSLTIVVAHHMYSMPPYPYLATDYGTQLSLFTHHMWIGGFLIILTTSWHAQLSLNLAMLGSLTIVVAHHMYSMPPYPYLATDYGTQLSLFTHHMWIGGFLIVGAAAHAAIFMVRDYDPTTRYNDLLDRVLRHRDAIISHLNWACIFLGFHSFGLYIHNDTVGAAAHAAIFMVRDYDPTTRYNDLLDRVLRHRDAIISHLNWACIFLGFHSFGLYIHNDTMSALGRPQDMFSDTAIQLQPVFAQWIQNTHALAPGATAPGATASTSLTWGGGDLVAVGGMSALGRPQDMFSDTAIQLQPVFAQWIQNTHALAPGATAPGATASTSLTWGGGDLVAVGGKVALLPIPLGTADFLVHHIHAFTIHVTVLILLKGVLFARSSRLIPDKANLGFRFPCDGPKVALLPIPLGTADFLVHHIHAFTIHVTVLILLKGVLFARSSRLIPDKANLGFRFPCDGPGRGGTCQVSAWDHVFLGLFWMYNSISVVIFHFSWKMQSDVWGSVSDFAQSSITINGWLRGRGGTCQVSAWDHVFLGLFWMYNSISVVIFHFSWKMQSDVWGSVSDQGVVTHITGGNFADFLWAQASQVIQSYGSSLSAYGLFFLGAHFVWAFSLMFLFSGRGYWQELIESIVWAHNKQSSITINGWLRDFLWAQASQVIQSYGSSLSAYGLFFLGAHFVWAFSLMFLFSGRGYWQELIESIVWAHNKLKVAPATQPRALSIIQGRAVGVTHYLLGGIATTWAFFLARIIAVGLKVAPATQPRALSIIQGRAVGVTHYLLGGIATTWAFFLARIIAVG photosystem_I_P700_chlorophyll_a_apoprotein_A2 MATCH psaa ID 51 WRONG.BAD_NBEXON.BAD_JUNCTION FILE1 ycf3 ycf3 43333 45318 R 3 507 Ok MPRSRINGNFIDKTFSIVADILLRVIPTTSGEKEAFTYYRDGMSAQSEGNYAEALQNYYEAMRLEIDPYDRSYILYNIGLIHTSNGEHTKALEYYFRALERNPFLPQAFNNMAVICHYRGEQAIQQGDSEIAEAWFDQAAEYWKQAIALTPGNYIEARNWLKITRRFE photosystem_I_assembly_protein_Ycf3 FILE2 ycf3 ycf3 43333 45318 R 3 516 Ok MPRSRINGNFIDKTFSIVADILLRVIPTTSGEKEAFTYYRDGAILSAQSEGNYAEALQNYYEAMRLEIDPYDRSYILYNIGLIHTSNGEHTKALEYYFRALERNPFLPQAFNNMAVICHYVRGEQAIQQGDSEIAEAWFDQAAEYWKQAIALTPGNYIEARNWLKITRRFE photosystem_I_assembly_protein_Ycf3 MATCH ycf3 ID 97 ALMOST_CORRECT.BAD_JUNCTION FILE1 rps4 rps4 46609 47214 R 1 606 Ok MSRYRGPRFKKIRRLGALPGLTNKKPRTGSDLRNQSRSGKKSQYRIRLEEKQKLRFHYGLTERQLLKYVRIARKAKGSTGQVLLQLLEMRLDNILFRLGMASTIPAARQLVNHRHILVNGHIVDIPSYRCKPRDIITAKDEQKSRALIQISLDSSPHEELPNHLTLQPFQYKGLVNQIIDSKWVGLKINELLVVEYYSRQT ribosomal_protein_S4 FILE2 rps4 rps4 46609 47214 R 1 606 Ok MSRYRGPRFKKIRRLGALPGLTNKKPRTGSDLRNQSRSGKKSQYRIRLEEKQKLRFHYGLTERQLLKYVRIARKAKGSTGQVLLQLLEMRLDNILFRLGMASTIPAARQLVNHRHILVNGHIVDIPSYRCKPRDIITAKDEQKSRALIQISLDSSPHEELPNHLTLQPFQYKGLVNQIIDSKWVGLKINELLVVEYYSRQT ribosomal_protein_S4 MATCH rps4 ID 100 CORRECT FILE1 ndhj ndhJ 50074 50550 R 1 477 Ok MQGRLSAWLVKHGLIHRSLGFDYQGIETLQIKPEDWHSIAVIFYVYGYNYLRSQCAYDVAPGGLLASVYHLTRIEDGVAQPEELCIKVFASRRNPRIPSVFWVWKSVDFQERESYDMLGISYDNHPRLKRILMPESWIGWPLRKDYIAPNFYEIQDAH NADH_dehydrogenase_subunit_J FILE2 ndhj ndhJ 50074 50550 R 1 477 Ok MQGRLSAWLVKHGLIHRSLGFDYQGIETLQIKPEDWHSIAVIFYVYGYNYLRSQCAYDVAPGGLLASVYHLTRIEDGVAQPEELCIKVFASRRNPRIPSVFWVWKSVDFQERESYDMLGISYDNHPRLKRILMPESWIGWPLRKDYIAPNFYEIQDAH NADH_dehydrogenase_subunit_J MATCH ndhj ID 100 CORRECT FILE1 ndhk ndhK 50656 51510 R 1 855 Ok MGNEFRRIGCICIYRSFHFRAYLNYWFSLCMAKGGIGMVLAPEYSDNKKKNGKNKIETVMNSIQFPLLDRTAPNSVISTTLNDLSNWSRLSSLWPLLYGTSCCFIEFASLIGSRFDFDRYGLVPRSSPRQSDLILTAGTVTMKMAPSLVRLYEQMPEPKYVIAMGACTITGGMFSTDSYSTVRGVDKLIPVDVYLPGCPPKPEAVIDAITKLRKKISRELYEDRIRSQRANRCFTTNHKFHVRRSIHTGNYDQRVLYQPPSTSEIPTEIFFKYKNSVSSAELVN NADH_dehydrogenase_subunit_K FILE2 ndhk ndhK 50656 51510 R 1 855 Ok MGNEFRRIGCICIYRSFHFRAYLNYWFSLCMAKGGIGMVLAPEYSDNKKKNGKNKIETVMNSIQFPLLDRTAPNSVISTTLNDLSNWSRLSSLWPLLYGTSCCFIEFASLIGSRFDFDRYGLVPRSSPRQSDLILTAGTVTMKMAPSLVRLYEQMPEPKYVIAMGACTITGGMFSTDSYSTVRGVDKLIPVDVYLPGCPPKPEAVIDAITKLRKKISRELYEDRIRSQRANRCFTTNHKFHVRRSIHTGNYDQRVLYQPPSTSEIPTEIFFKYKNSVSSAELVN NADH_dehydrogenase_subunit_K MATCH ndhk ID 100 CORRECT FILE1 ndhc ndhC 51390 51752 R 1 363 Ok MFLLYEYDFFWAFLIISILVPILAFFISGVLAPISKGPEKLSTYESGIEPMGDAWLQFRIRYYMFALVFVVFDVETVFLYPWAMSFDVLGVSVFIEAFIFVLILIIGLVYAWRKGALEWS NADH_dehydrogenase_subunit_3 FILE2 ndhc ndhC 51390 51752 R 1 363 Ok MFLLYEYDFFWAFLIISILVPILAFFISGVLAPISKGPEKLSTYESGIEPMGDAWLQFRIRYYMFALVFVVFDVETVFLYPWAMSFDVLGVSVFIEAFIFVLILIIGLVYAWRKGALEWS NADH_dehydrogenase_subunit_3 MATCH ndhc ID 100 CORRECT FILE1 atpe atpE 53977 54378 R 1 402 Ok MTLNLSVLTPNRIVWDSEVEEIVLSTNSGQIGILPNHAPIATAVDIGILRIRLNDQWLTMALMGGFARIGNNEITVLVNDAEKGSDINPQEAQQTLEIAEANVKKAEGRRQKIEANLALRRARTRVEASNPIS ATP_synthase_CF1_epsilon_chain FILE2 atpe atpE 53977 54378 R 1 402 Ok MTLNLSVLTPNRIVWDSEVEEIVLSTNSGQIGILPNHAPIATAVDIGILRIRLNDQWLTMALMGGFARIGNNEITVLVNDAEKGSDINPQEAQQTLEIAEANVKKAEGRRQKIEANLALRRARTRVEASNPIS ATP_synthase_CF1_epsilon_subunit MATCH atpe ID 100 CORRECT FILE1 atpb atpB 54375 55871 R 1 1497 Ok MRINPTTSGSGVSTLEKKNPGRVVQIIGPVLDVAFPPGKMPNIYNALVVQGRDSVGQPINVACEVQQLLGNNRVRAVAMSATEGLTRGMAVIDTGAPISVPVGGATLGRIFNVLGEPVDNLGPVDTSTTSPIHRSAPAFIQLDTKLSIFETGIKVVDLLAPYRRGGKIGLFGGAGVGKTVLIMELINNIAKAHGGVSVFGGVGERTREGNDLYMEMKESGVINKENIAESKVALVYGQMNEPPGARMRVGLTALTMAEYFRDVNEQDVLLFIDNIFRFVQAGSEVSALLGRMPSAVGYQPTLSTEMGSLQERITSTKEGSITSIQAVYVPADDLTDPAPATTFAHLDATTVLSRGLAAKGIYPAVDPLDSTSTMLQPRIVGEEHYETAQRVKQTLQRYKELQDIIAILGLDELSEEDRLLVARARKIERFLSQPFFVAEVFTGSPGKYVGLAETIRGFQLILSGELDGLPEQAFYLVGTIDEATAKAMNLEMESNLKK ATP_synthase_CF1_beta_chain FILE2 atpb atpB 54375 55871 R 1 1497 Ok MRINPTTSGSGVSTLEKKNPGRVVQIIGPVLDVAFPPGKMPNIYNALVVQGRDSVGQPINVACEVQQLLGNNRVRAVAMSATEGLTRGMAVIDTGAPISVPVGGATLGRIFNVLGEPVDNLGPVDTSTTSPIHRSAPAFIQLDTKLSIFETGIKVVDLLAPYRRGGKIGLFGGAGVGKTVLIMELINNIAKAHGGVSVFGGVGERTREGNDLYMEMKESGVINKENIAESKVALVYGQMNEPPGARMRVGLTALTMAEYFRDVNEQDVLLFIDNIFRFVQAGSEVSALLGRMPSAVGYQPTLSTEMGSLQERITSTKEGSITSIQAVYVPADDLTDPAPATTFAHLDATTVLSRGLAAKGIYPAVDPLDSTSTMLQPRIVGEEHYETAQRVKQTLQRYKELQDIIAILGLDELSEEDRLLVARARKIERFLSQPFFVAEVFTGSPGKYVGLAETIRGFQLILSGELDGLPEQAFYLVGTIDEATAKAMNLEMESNLKK ATP_synthase_CF1_beta_subunit MATCH atpb ID 100 CORRECT FILE1 rbcl rbcL 56686 58119 D 1 1434 Ok MSPQTETKASVGFKAGVKEYKLTYYTPEYQTKDTDILAAFRVTPQPGVPPEEAGAAVAAESSTGTWTTVWTDGLTSLDRYKGRCYRIERVVGEKDQYIAYVAYPLDLFEEGSVTNMFTSIVGNVFGFKALRALRLEDLRIPPAYVKTFQGPPHGIQVERDKLNKYGRPLLGCTIKPKLGLSAKNYGRAVYECLRGGLDFTKDDENVNSQPFMRWRDRFLFCAEALFKAQTETGEIKGHYLNATAGTCEEMIKRAVFARELGVPIVMHDYLTGGFTANTTLAHYCRDNGLLLHIHRAMHAVIDRQKNHGIHFRVLAKALRMSGGDHIHSGTVVGKLEGERDITLGFVDLLRDDFVEQDRSRGIYFTQDWVSLPGVLPVASGGIHVWHMPALTEIFGDDSVLQFGGGTLGHPWGNAPGAVANRVALEACVKARNEGRDLAREGNEIIREACKWSPELAAACEVWKEIVFNFAAVDVLDK ribulose_1,5-bisphosphate_carboxylase_/oxygenase_large_chain FILE2 rbcl rbcL 56686 58119 D 1 1434 Ok MSPQTETKASVGFKAGVKEYKLTYYTPEYQTKDTDILAAFRVTPQPGVPPEEAGAAVAAESSTGTWTTVWTDGLTSLDRYKGRCYRIERVVGEKDQYIAYVAYPLDLFEEGSVTNMFTSIVGNVFGFKALRALRLEDLRIPPAYVKTFQGPPHGIQVERDKLNKYGRPLLGCTIKPKLGLSAKNYGRAVYECLRGGLDFTKDDENVNSQPFMRWRDRFLFCAEALFKAQTETGEIKGHYLNATAGTCEEMIKRAVFARELGVPIVMHDYLTGGFTANTTLAHYCRDNGLLLHIHRAMHAVIDRQKNHGIHFRVLAKALRMSGGDHIHSGTVVGKLEGERDITLGFVDLLRDDFVEQDRSRGIYFTQDWVSLPGVLPVASGGIHVWHMPALTEIFGDDSVLQFGGGTLGHPWGNAPGAVANRVALEACVKARNEGRDLAREGNEIIREACKWSPELAAACEVWKEIVFNFAAVDVLDK ribulose-1,5-bisphosphate_carboxylase/oxygenase_large_subunit MATCH rbcl ID 100 CORRECT FILE1 accd accD 58879 60402 D 1 1524 Ok MTIHLLYFHANRGQENSMERWWFNSMLFKKEFERRCGLNKSMGSLGPIENTSEDPNLKVKNIHSCSNVDYLFGVKDIWNFISNDTFLVSDRNGDSYSIYFDIENHIFEVDNDHSFLSELESSFYSYRNSSYLNNGFRGEDPYYNSYMSYMYDTQYSWNNHINSCIDNYLQSQICIDTSIISGSESNGDSYIYRAICSGQSLNSSENEGSSRRTRTKDSDLTIRESSNDLEVTQKYKHLWVQCENCYGLNYKKFLKSKMNICEQCGYHLKMSSSDRIELLIDPGTWDPMDEDMVSLDPIEFHSEEEPYKDRIDSYQRKTGLTEAVQTGIGQLNGIPVAIGVMDFQFMGGSMGSVVGEKITRLIEHAANQNLPLMIVCASGGARMQEGSLSLMQMAKISSALYDYQLNKKLFYVSILTSPTTGGVTASFGMLGDIIIAEPNAYIAFAGKRVIEQTLNKTVPEGSQAAEYLFQKGLFDLIVPRNLLKSVLSELFKLHAFFPLNQKSSKIK acetyl-CoA_carboxylase_beta_subunit FILE2 accd accD 58879 60402 D 1 1524 Ok MTIHLLYFHANRGQENSMERWWFNSMLFKKEFERRCGLNKSMGSLGPIENTSEDPNLKVKNIHSCSNVDYLFGVKDIWNFISNDTFLVSDRNGDSYSIYFDIENHIFEVDNDHSFLSELESSFYSYRNSSYLNNGFRGEDPYYNSYMSYMYDTQYSWNNHINSCIDNYLQSQICIDTSIISGSESNGDSYIYRAICSGQSLNSSENEGSSRRTRTKDSDLTIRESSNDLEVTQKYKHLWVQCENCYGLNYKKFLKSKMNICEQCGYHLKMSSSDRIELLIDPGTWDPMDEDMVSLDPIEFHSEEEPYKDRIDSYQRKTGLTEAVQTGIGQLNGIPVAIGVMDFQFMGGSMGSVVGEKITRLIEHAANQNLPLMIVCASGGARMQEGSLSLMQMAKISSALYDYQLNKKLFYVSILTSPTTGGVTASFGMLGDIIIAEPNAYIAFAGKRVIEQTLNKTVPEGSQAAEYLFQKGLFDLIVPRNLLKSVLSELFKLHAFFPLNQKSSKIK acetyl-CoA_carboxylase_beta_subunit MATCH accd ID 100 CORRECT FILE1 psai psaI 61149 61259 D 1 111 Ok MTNLNLPSIFVPLVGLVFPAIAMASLFLHVQKNKIV photosystem_I_subunit_VIII FILE2 psai psaI 61149 61259 D 1 111 Ok MTNLNLPSIFVPLVGLVFPAIAMASLFLHVQKNKIV photosystem_I_subunit_VIII MATCH psai ID 100 CORRECT FILE1 ycf4 ycf4 61704 62258 D 1 555 Ok MTWRSDDIWIELITGSRKISNFCWALILFLGSLGFLLVGTSSYLGRNLLSFFPPQQIIFFPQGIVMSFYGIAGLFISSYLWCTISWNVGSGYDRFDRKEGIVCIFRWGFPGKNRRIFLRFLIKDIQSVRIEVKEGIYARRVLYMDIRGQGSIPLTRTDENLTPREIEQKAAELAYFLRVPIEVF photosystem_I_assembly_protein_Ycf4 FILE2 ycf4 ycf4 61704 62258 D 1 555 Ok MTWRSDDIWIELITGSRKISNFCWALILFLGSLGFLLVGTSSYLGRNLLSFFPPQQIIFFPQGIVMSFYGIAGLFISSYLWCTISWNVGSGYDRFDRKEGIVCIFRWGFPGKNRRIFLRFLIKDIQSVRIEVKEGIYARRVLYMDIRGQGSIPLTRTDENLTPREIEQKAAELAYFLRVPIEVF photosystem_I_assembly_protein_Ycf4 MATCH ycf4 ID 100 CORRECT FILE1 ycf10 ycf10 62988 63677 D 1 690 Ok MAKKKAFTPLFYLASIVFLPWWISFSVNKWLESWVTNWWNTGQSQIVLNNIQEKSLLEKFRELEELLFLDEMIKEYSETHLEEFGIGIHKETIQLITIQNENRMDTILHFSTNIIWFGILSGYSILGKEKLVILNSWAQEFLYNLSDTAKALCILLVSEFFLGYHSPPGWEFVIRSIYNEVGVVANEQTITILVCILPVIFDTCFKYWLFRYLTSLSPSILLLYDSITE potential_heme-binding_protein FILE2 cema cemA 62988 63677 D 1 690 Ok MAKKKAFTPLFYLASIVFLPWWISFSVNKWLESWVTNWWNTGQSQIVLNNIQEKSLLEKFRELEELLFLDEMIKEYSETHLEEFGIGIHKETIQLITIQNENRMDTILHFSTNIIWFGILSGYSILGKEKLVILNSWAQEFLYNLSDTAKALCILLVSEFFLGYHSPPGWEFVIRSIYNEVGVVANEQTITILVCILPVIFDTCFKYWLFRYLTSLSPSILLLYDSITE envelope_membrane_protein MATCH ycf10 ID 100 CORRECT FILE1 peta petA 63897 64859 D 1 963 Ok MQTRNAFSWLKKQITRSISVSLMIYILTRTSISSAYPIFAQQGYENPREATGRIVCANCHLANKPVEIEVPQAVLPDTVFEAVVRIPYDMQLKQVLANGKKGGLNVGAVLILPEGFELAPPDRISPEMKEKIGNLSFQSYRPNKTNILVVGPVPGKKYSEITFPILSPDPATKKDVHFLKYPIYVGGNRGRGQIYPDGNKSNNTVYNATAAGIVSKIIRKEKGGYEITITDASEGRQVVDIIPPGPELLVSEGESIKFDQPLTSNPNVGGFGQGDAEIVLQDPLRVQGLLFFLASVILAQIFLVLKKKQFEKVQLAEMNF cytochrome_f FILE2 peta petA 63897 64859 D 1 963 Ok MQTRNAFSWLKKQITRSISVSLMIYILTRTSISSAYPIFAQQGYENPREATGRIVCANCHLANKPVEIEVPQAVLPDTVFEAVVRIPYDMQLKQVLANGKKGGLNVGAVLILPEGFELAPPDRISPEMKEKIGNLSFQSYRPNKTNILVVGPVPGKKYSEITFPILSPDPATKKDVHFLKYPIYVGGNRGRGQIYPDGNKSNNTVYNATAAGIVSKIIRKEKGGYEITITDASEGRQVVDIIPPGPELLVSEGESIKFDQPLTSNPNVGGFGQGDAEIVLQDPLRVQGLLFFLASVILAQIFLVLKKKQFEKVQLAEMNF cytochrome_f MATCH peta ID 100 CORRECT FILE1 psbj psbJ 65928 66050 R 1 123 Ok MADTTGRIPLWIIGTVAGILVIGLIGIFFYGSYSGLGSSL PSII_reaction_center_subunit_X FILE2 psbj psbJ 65928 66050 R 1 123 Ok MADTTGRIPLWIIGTVAGILVIGLIGIFFYGSYSGLGSSL photosystem_II_protein_J MATCH psbj ID 100 CORRECT FILE1 psbl psbL 66175 66291 R 1 117 Ok MTQSNPNEQNVELNRTSLYWGLLLIFVLAVLFSNYFFN photosystem_II_protein_L FILE2 psbl psbL 66175 66291 R 1 117 Ok MTQSNPNEQNVELNRTSLYWGLLLIFVLAVLFSNYFFN photosystem_II_protein_L MATCH psbl ID 100 CORRECT FILE1 psbf psbF 66314 66433 R 1 120 Ok MTIDRTYPIFTVRWLAVHGLAVPTVFFLGSISAMQFIQR cytochrome_b559_beta_chain FILE2 psbf psbF 66314 66433 R 1 120 Ok MTIDRTYPIFTVRWLAVHGLAVPTVFFLGSISAMQFIQR photosystem_II_protein_VI MATCH psbf ID 100 CORRECT FILE1 psbe psbE 66443 66694 R 1 252 Ok MSGSTGERSFADIITSIRYWVIHSITIPSLFIAGWLFVSTGLAYDVFGSPRPNEYFTESRQGIPLITGRFDPLEQLDEFSRSF cytochrome_b559_alpha_chain FILE2 psbe psbE 66443 66724 R 1 282 Ok MTVQEYVELSMSGSTGERSFADIITSIRYWVIHSITIPSLFIAGWLFVSTGLAYDVFGSPRPNEYFTESRQGIPLITGRFDPLEQLDEFSRSF photosystem_II_protein_V MATCH psbe ID 89 ACCEPTABLE.BAD_START FILE1 petl petL 67692 67787 D 1 96 Ok MLTITSYFGFLLAALTITSALFIGLSKIRLI cytochrome_b6_/f_complex_subunit_VI FILE2 petl petL 67692 67787 D 1 96 Ok MLTITSYFGFLLAALTITSALFIGLSKIRLI cytochrome_b6/f_complex_subunit_VI MATCH petl ID 100 CORRECT FILE1 petg petG 67971 68084 D 1 114 Ok MIEVFLFGIVLGLIPITLAGLFVTAYLQYRRGDQLDL cytochrome_b6_/f_complex_subunit_V FILE2 petg petG 67971 68084 D 1 114 Ok MIEVFLFGIVLGLIPITLAGLFVTAYLQYRRGDQLDL cytochrome_b6/f_complex_subunit_V MATCH petg ID 100 CORRECT FILE1 psaj psaJ 68960 69094 D 1 135 Ok MRDLKTYLSVAPVLSTLWFGALAGLLIEINRFFPDALTFPFFSF photosystem_I_subunit_IX FILE2 psaj psaJ 68960 69094 D 1 135 Ok MRDLKTYLSVAPVLSTLWFGALAGLLIEINRFFPDALTFPFFSF photosystem_I_subunit_IX MATCH psaj ID 100 CORRECT FILE1 rpl33 rpl33 69532 69732 D 1 201 Ok MAKGKDVRVTVILECTSCVRNSVDKVSRGISRYITQKNRHNTPNRFELKKFCPYCYKHTIHGEIKK ribosomal_protein_L33 FILE2 rpl33 rpl33 69532 69732 D 1 201 Ok MAKGKDVRVTVILECTSCVRNSVDKVSRGISRYITQKNRHNTPNRFELKKFCPYCYKHTIHGEIKK ribosomal_protein_L33 MATCH rpl33 ID 100 CORRECT FILE1 rps18 rps18 69923 70228 D 1 306 Ok MDKSKRPFLKFKRSFRRRLPPIQSGDRIDYRNMSLISRFISEQGKILSRRVNRLTLKQQRLITLAIKQARILSLLPFLNNEKQFERTESTARTTGFKARNK ribosomal_protein_S18 FILE2 rps18 rps18 69923 70228 D 1 306 Ok MDKSKRPFLKFKRSFRRRLPPIQSGDRIDYRNMSLISRFISEQGKILSRRVNRLTLKQQRLITLAIKQARILSLLPFLNNEKQFERTESTARTTGFKARNK ribosomal_protein_S18 MATCH rps18 ID 100 CORRECT FILE1 rpl20 rpl20 70443 70829 R 1 387 Ok MTRIKRGYIARRRRTKIRLFASSFRGAHSRLTRTITQQKIRALVSAHRDRDRKKRDFRRLWITRINAVIRERGVSYSYSRLIHDLYKRQLLLNRKILAQIAISNRNCLYMISNEIIKEVDWKESTRII ribosomal_protein_L20 FILE2 rpl20 rpl20 70443 70829 R 1 387 Ok MTRIKRGYIARRRRTKIRLFASSFRGAHSRLTRTITQQKIRALVSAHRDRDRKKRDFRRLWITRINAVIRERGVSYSYSRLIHDLYKRQLLLNRKILAQIAISNRNCLYMISNEIIKEVDWKESTRII ribosomal_protein_L20 MATCH rpl20 ID 100 CORRECT FILE1 rps12 rps12 71639 100035 R 3 372 Ok MPTIKQLIRNTRQPIRNVTKSPALRGCPQRRGTCTRVYTITPKKPNSALRKVARVRLTSGFEITAYIPGIGHNSQEHSVVLVRGGRVKDLPGVRYHIVRGTLDAVGVKDRQQGRSKYGVKKPK ribosomal_protein_S12 FILE2 NONE MATCH rps12 ID 0 MISSED.WRONG_STOP FILE1 clpp clpP 71861 73896 R 3 621 Ok MPIGVPRVVFRNPGDPISSWVDIYNRLYRERLLFLGQGIGTELSNQLIGLMLYLSMEDENKDLYLFVNSPGGWVIPGIAIYDTMQFVRPDIHTICLGLAASMGSFILAGGQLTKRIAFPHARVMIHEPYSGFYMAQVGEFVLEAIEMAKLRETLTRVYAEKTGQPVWVIHEDMERDIFMSATEAQAYGIVDFVAVQGKEHGFHADL ATP-dependent_Clp_protease_proteolytic_subunit FILE2 clpp clpP 71861 73896 R 3 621 Ok MPIGVPRVVFRNPGDPISSWVDIYNRLYRERLLFLGQGIGTELSNQLIGLMLYLSMEDENKDLYLFVNSPGGWVIPGIAIYDTMQFVRPDIHTICLGLAASMGSFILAGGQLTKRIAFPHARVMIHEPYSGFYMAQVGEFVLEAIEMAKLRETLTRVYAEKTGQPVWVIHEDMERDIFMSATEAQAYGIVDFVAVQGKEHGFHADL ATP-dependent_Clp_protease_proteolytic_subunit MATCH clpp ID 100 CORRECT FILE1 psbb psbB 74341 75867 D 1 1527 Ok MGLPWYRVHTVVLNDPGRLLSVHIMHTALVAGWAGSMALYELAVFDPSDPVLDPMWRQGMFVIPFMTRLGITNSWGGWSITGGTVTNPGIWSYEGVAGAHIVFSGLCFLAAIWHWVYWDLEIFCDERTGKPSLDLPKIFGIHLFLSGVACFGFGAFHVTGLYGPGIWVSDPYGLTGKVQPVNPAWGVEGFDPFVPGGIASHHIAAGTLGILAGLFHLSVRPPQRLYKGLRMGNIETVLSSSIAAVFFAAFVVAGTMWYGSATTPIELFGPTRYQWDQGYFQQEIYRRVSAGLAENQSLSEAWSKIPEKLAFYDYIGNNPAKGGLFRAGSMDNGDGIAVGWLGHPIFRDKEGRELFVRRMPTFFETFPVVLVDGDGIVRADVPFRRAESKYSVEQVGVTVEFYGGELNGVSYSDPATVKKYARRAQLGEIFELDRATLKSDGVFRSSPRGWFTFGHASFALLFFFGHIWHGARTLFRDVFAGIDPDLDAQVEFGAFQKLGDPTTKRQAA photosystem_II_47_kDa_protein FILE2 psbb psbB 74341 75867 D 1 1527 Ok MGLPWYRVHTVVLNDPGRLLSVHIMHTALVAGWAGSMALYELAVFDPSDPVLDPMWRQGMFVIPFMTRLGITNSWGGWSITGGTVTNPGIWSYEGVAGAHIVFSGLCFLAAIWHWVYWDLEIFCDERTGKPSLDLPKIFGIHLFLSGVACFGFGAFHVTGLYGPGIWVSDPYGLTGKVQPVNPAWGVEGFDPFVPGGIASHHIAAGTLGILAGLFHLSVRPPQRLYKGLRMGNIETVLSSSIAAVFFAAFVVAGTMWYGSATTPIELFGPTRYQWDQGYFQQEIYRRVSAGLAENQSLSEAWSKIPEKLAFYDYIGNNPAKGGLFRAGSMDNGDGIAVGWLGHPIFRDKEGRELFVRRMPTFFETFPVVLVDGDGIVRADVPFRRAESKYSVEQVGVTVEFYGGELNGVSYSDPATVKKYARRAQLGEIFELDRATLKSDGVFRSSPRGWFTFGHASFALLFFFGHIWHGARTLFRDVFAGIDPDLDAQVEFGAFQKLGDPTTKRQAA photosystem_II_47_kDa_protein MATCH psbb ID 100 CORRECT FILE1 psbt psbT 76069 76173 D 1 105 Ok MEALVYTFLLVSTLGIIFFAIFFREPPTIRTKKN photosystem_II_protein_T FILE2 psbt psbT 76069 76173 D 1 105 Ok MEALVYTFLLVSTLGIIFFAIFFREPPTIRTKKN photosystem_II_protein_T MATCH psbt ID 100 CORRECT FILE1 psbn psbN 76251 76382 R 1 132 Ok METATLVAIFISGLLVSFTGYALYTAFGQPSQQLRDPFEEHGD photosystem_II_protein_N FILE2 psbn psbN 76251 76382 R 1 132 Ok METATLVAIFISGLLVSFTGYALYTAFGQPSQQLRDPFEEHGD photosystem_II_protein_N MATCH psbn ID 100 CORRECT FILE1 psbh psbH 76494 76715 D 1 222 Ok MATQTVENSSRSGPRRTAVGDLLKPLNSEYGKVAPGWGTTPLMGVAMALFAVFLSIILEIYNSSVLLDGISMN photosystem_II_phosphoprotein FILE2 psbh psbH 76476 76715 D 1 240 Ok MNTIGFMATQTVENSSRSGPRRTAVGDLLKPLNSEYGKVAPGWGTTPLMGVAMALFAVFLSIILEIYNSSVLLDGISMN photosystem_II_protein_H MATCH psbh ID 92 ALMOST_CORRECT.BAD_START FILE1 petb petB 76845 78239 D 2 648 Ok MSKVYDWFEERLEIQAIADDITSKYVPPHVNIFYCLGGITLTCFLVQVATGFAMTFYYRPTVTEAFASVQYIMTEANFGWLIRSVHRWSASMMVLMMILHVFRVYLTGGFKKPRELTWVTGVVLAVLTASFGVTGYSLPWDQIGYWAVKIVTGVPDAIPVIGSPLVELLRGSASVGQSTLTRFYSLHTFVLPLLTAVFMLMHFPMIRKQGISGPL cytochrome_b6 FILE2 petb petB 77559 78239 D 1 681 Ok MYGSQRGSSAYLNKVYDWFEERLEIQAIADDITSKYVPPHVNIFYCLGGITLTCFLVQVATGFAMTFYYRPTVTEAFASVQYIMTEANFGWLIRSVHRWSASMMVLMMILHVFRVYLTGGFKKPRELTWVTGVVLAVLTASFGVTGYSLPWDQIGYWAVKIVTGVPDAIPVIGSPLVELLRGSASVGQSTLTRFYSLHTFVLPLLTAVFMLMHFPMIRKQGISGPL cytochrome_b6 MATCH petb ID 95 ALMOST_CORRECT.BAD_NBEXON.BAD_START FILE1 petd petD 78434 79654 D 2 483 Ok MGVTKKPDLNDPVLRAKLAKGMGHNYYGEPAWPNDLLYIFPVVILGTIACNVGLAVLEPSMIGEPADPFATPLEILPEWYFFPVFQILRTVPNKLLGVLLMVSVPAGLLTVPFLENVNKFQNPFRRPVATTVFLIGTAVALWLGIGATLPIDKSLTLGLF cytochrome_b6_/f_complex_subunit_IV FILE2 petd petD 79127 79654 D 1 528 Ok MMSSSLGGWIYKNSPIPITKKPDLNDPVLRAKLAKGMGHNYYGEPAWPNDLLYIFPVVILGTIACNVGLAVLEPSMIGEPADPFATPLEILPEWYFFPVFQILRTVPNKLLGVLLMVSVPAGLLTVPFLENVNKFQNPFRRPVATTVFLIGTAVALWLGIGATLPIDKSLTLGLF cytochrome_b6/f_complex_subunit_IV MATCH petd ID 90 ALMOST_CORRECT.BAD_NBEXON.BAD_START FILE1 rpoa rpoA 79846 80859 R 1 1014 Ok MVREKVTVSTRTLQWKCVESRTDSKRLYYGRFILSPLMKGQADTIGIAMRRALLGEIEGTCITRVKSEKVPHEYSTITGIQESVHEILMNLKEIVLRSNLYGTSEASICVKGPGYVTAQDIILPPYVEIVDNTQHIASLTEPIDFCIGLQIERNRGYLIKTPHNFQDGSYPIDAVFMPVRNANHSIHSYGNGNEKQEILFIEIWTNGSLTPKEALHDASRNLIDLFIPFLHMEEDNLYLQDNQHTVPLSPFTFHDKLAKLIKNKKKIALKSIFIDQSELSSRIYNCLKMSNIYTLLDLLNNSQEDLMKIEHFRSEDIKQILDILEKYFVIDLAKNKF RNA_polymerase_alpha_chain FILE2 rpoa rpoA 79846 80859 R 1 1014 Ok MVREKVTVSTRTLQWKCVESRTDSKRLYYGRFILSPLMKGQADTIGIAMRRALLGEIEGTCITRVKSEKVPHEYSTITGIQESVHEILMNLKEIVLRSNLYGTSEASICVKGPGYVTAQDIILPPYVEIVDNTQHIASLTEPIDFCIGLQIERNRGYLIKTPHNFQDGSYPIDAVFMPVRNANHSIHSYGNGNEKQEILFIEIWTNGSLTPKEALHDASRNLIDLFIPFLHMEEDNLYLQDNQHTVPLSPFTFHDKLAKLIKNKKKIALKSIFIDQSELSSRIYNCLKMSNIYTLLDLLNNSQEDLMKIEHFRSEDIKQILDILEKYFVIDLAKNKF RNA_polymerase_alpha_subunit MATCH rpoa ID 100 CORRECT FILE1 rps11 rps11 80925 81341 R 1 417 Ok MAKAIPKISSRRNGRISSRKGARRIPKGVIHVQASFNNTIVTVTDVRGRVVSWSSAGTSGFKGTRRGTPFAAQTAAANAIRTVVDQGMQRAEVMIKGPGLGRDAALRAIRRSGILLTFVRDVTPMPHNGCRPPKKRRV ribosomal_protein_S11 FILE2 rps11 rps11 80925 81341 R 1 417 Ok MAKAIPKISSRRNGRISSRKGARRIPKGVIHVQASFNNTIVTVTDVRGRVVSWSSAGTSGFKGTRRGTPFAAQTAAANAIRTVVDQGMQRAEVMIKGPGLGRDAALRAIRRSGILLTFVRDVTPMPHNGCRPPKKRRV ribosomal_protein_S11 MATCH rps11 ID 100 CORRECT FILE1 rpl36 rpl36 81443 81556 R 1 114 Ok MKIRASVRKICEKCRLIRRRGRIIVICSNPRHKQRQG ribosomal_protein_L36 FILE2 rpl36 rpl36 81443 81556 R 1 114 Ok MKIRASVRKICEKCRLIRRRGRIIVICSNPRHKQRQG ribosomal_protein_L36 MATCH rpl36 ID 100 CORRECT FILE1 NONE FILE2 infa infA 81668 81772 R 1 105 Ok MQILPGDRVKIEVSPYDSTKGHIIYRLHNKDLKD translation_initiation_factor_1 MATCH infa ID 0 OVERPRED.WRONG_STOP FILE1 rps8 rps8 81881 82285 R 1 405 Ok MGRDTIAEIITSIRNADMDRKRVVRIASTNITENIVQILLREGFIENVRKHRENNKYFLVLTLRHRRNRKRPYRNILNLKRISRPGLRIYSNYQRIPRILGGMGIVILSTSRGIMTDREARLEGIGGEILCYIW ribosomal_protein_S8 FILE2 rps8 rps8 81881 82285 R 1 405 Ok MGRDTIAEIITSIRNADMDRKRVVRIASTNITENIVQILLREGFIENVRKHRENNKYFLVLTLRHRRNRKRPYRNILNLKRISRPGLRIYSNYQRIPRILGGMGIVILSTSRGIMTDREARLEGIGGEILCYIW ribosomal_protein_S8 MATCH rps8 ID 100 CORRECT FILE1 rpl14 rpl14 82453 82821 R 1 369 Ok MIQPQTHLNVADNSGARELMCIRIIGASNRRYAHIGDVIVAVIKEAVPNMPLERSEVVRAVIVRTCKELKRDNGMIIRYDDNAAVVIDQEGNPKGTRIFGAIARELRELNFTKIVSLAPEVL ribosomal_protein_L14 FILE2 rpl14 rpl14 82453 82821 R 1 369 Ok MIQPQTHLNVADNSGARELMCIRIIGASNRRYAHIGDVIVAVIKEAVPNMPLERSEVVRAVIVRTCKELKRDNGMIIRYDDNAAVVIDQEGNPKGTRIFGAIARELRELNFTKIVSLAPEVL ribosomal_protein_L14 MATCH rpl14 ID 100 CORRECT FILE1 rpl16 rpl16 82947 84369 R 2 405 Ok MLSPKRTRFRKQHRGRMKGISYRGNRISFGKYALQALEPAWITSRQIEAGRRAMTRNARRGGKIWVRIFPDKPVTLRPAETRMGSGKGSPEYWVAVVKPGRILYEMGGVTENIARRAISLAASKMPIRTQFIIS ribosomal_protein_L16 FILE2 rpl16 rpl16 82947 83354 R 1 408 Ok MNYNPKRTRFRKQHRGRMKGISYRGNRISFGKYALQALEPAWITSRQIEAGRRAMTRNARRGGKIWVRIFPDKPVTLRPAETRMGSGKGSPEYWVAVVKPGRILYEMGGVTENIARRAISLAASKMPIRTQFIIS ribosomal_protein_L16 MATCH rpl16 ID 97 ALMOST_CORRECT.BAD_NBEXON.BAD_START FILE1 rps3 rps3 84531 85187 R 1 657 Ok MGQKINPLGFRLGTTQSHHSLWFSQPKNYSEGLQEDKKIRDCIKNYVQKNMRTSSGIEGIARIEIQKRIDLIQVIIFMGFPKLLIESRPRGIEELQMTLQKEFNCVNRKLNIAVTRIAKPYGNPNILAEFIAGQLKNRVSFRKAMKKAIELTEQADTKGIQIQIAGRIDGKEIARVEWIREGRVPLQTIRAKIDYCSYTVRTIYGILGIKIWIFLDEE ribosomal_protein_S3 FILE2 rps3 rps3 84531 85187 R 1 657 Ok MGQKINPLGFRLGTTQSHHSLWFSQPKNYSEGLQEDKKIRDCIKNYVQKNMRTSSGIEGIARIEIQKRIDLIQVIIFMGFPKLLIESRPRGIEELQMTLQKEFNCVNRKLNIAVTRIAKPYGNPNILAEFIAGQLKNRVSFRKAMKKAIELTEQADTKGIQIQIAGRIDGKEIARVEWIREGRVPLQTIRAKIDYCSYTVRTIYGILGIKIWIFLDEE ribosomal_protein_S3 MATCH rps3 ID 100 CORRECT FILE1 rpl22 rpl22 85172 85639 R 1 468 Ok MLKKKKTEVYALGEHISMSADKARRVIDQIRGRSYEETLMILELMPYRACYPILKLVYSAAANASYNMGSSETNLVISKAEVNEGTTVKKLKPRARGRSFPIKRSTCHITIVMKDISLDDEYGEMSSLKKTRWKKKSTAMTYRDMYNSGGLWDKK ribosomal_protein_L22 FILE2 rpl22 rpl22 85172 85639 R 1 468 Ok MLKKKKTEVYALGEHISMSADKARRVIDQIRGRSYEETLMILELMPYRACYPILKLVYSAAANASYNMGSSETNLVISKAEVNEGTTVKKLKPRARGRSFPIKRSTCHITIVMKDISLDDEYGEMSSLKKTRWKKKSTAMTYRDMYNSGGLWDKK ribosomal_protein_L22 MATCH rpl22 ID 100 CORRECT FILE1 rps19 rps19 85692 85970 R 1 279 Ok MTRSLKKNPFVANHLLKKIDKLNTKAEKEIIVTWSRASTIIPTMIGHTIAIHNGKEHLPIYITDSMVGHKLGEFAPTLNFRGHAKSDNRSRR ribosomal_protein_S19 FILE2 rps19 rps19 85692 85970 R 1 279 Ok MTRSLKKNPFVANHLLKKIDKLNTKAEKEIIVTWSRASTIIPTMIGHTIAIHNGKEHLPIYITDSMVGHKLGEFAPTLNFRGHAKSDNRSRR ribosomal_protein_S19 MATCH rps19 ID 100 CORRECT FILE1 rpl2 rpl2 86038 87528 R 2 825 Ok MAIHLYKTSTPSTRNGTVDSQVKSNPRNNLIYGQRRCGKGRNARGIITARHRGGGHKRLYRKIDFRRNEKDIYGRIVTIEYDPNRNAYICLIHYGDGEKRYILHPRGAIIGDTIVSGTEVPIKMGNALPLTDMPLGTAIHNIEITLGKGGQLARAAGAVAKLIAKEGKSATLKLPSGEVRLISKNCSATVGQVGNVGVNQKSLGRAGSKRWLGKRPVVRGVVMNPVDHPHGGGEGRAPIGRKKPTTPWGYPALGRRSRKRNKYSDNLILRRRSK ribosomal_protein_L2 FILE2 rpl2 rpl2_1 86038 87528 R 2 825 Ok MAIHLYKTSTPSTRNGTVDSQVKSNPRNNLIYGQRRCGKGRNARGIITARHRGGGHKRLYRKIDFRRNEKDIYGRIVTIEYDPNRNAYICLIHYGDGEKRYILHPRGAIIGDTIVSGTEVPIKMGNALPSTDMPLGTAIHNIEITLGKGGQLARAAGAVAKLIAKEGKSATLKLPSGEVRLISKNCSATVGQVGNVGVNQKSLGRAGSKRWLGKRPVVRGVVMNPVDHPHGGGEGRAPIGRKKPTTPWGYPALGRRSRKRNKYSDNLILRRRSK ribosomal_protein_L2 MATCH rpl2 ID 99 ALMOST_CORRECT.BAD_JUNCTION FILE1 rpl23 rpl23 87547 87828 R 1 282 Ok MDGIKYAVFTDKSIRLLGKNQYTSNVESGSTRTEIKHWVELFFGVKVIAMNSHRLPGKSRRMGPIMGHTMHYRRMIITLQPGYSIPPLRKKRT ribosomal_protein_L23 FILE2 rpl23 rpl23_1 87547 87828 R 1 282 Ok MDGIKYAVFTDKSIRLLGKNQYTSNVESGSTRTEIKHWVELFFGVKVIAMNSHRLPGKSRRMGPIMGHTMHYRRMIITLQPGYSIPPLRKKRT ribosomal_protein_L23 MATCH rpl23 ID 100 CORRECT FILE1 ycf2 ycf2 88196 95032 D 1 6837 Ok MRGHQFKSWIFELREILREIKNSHHFLDSWTQFNSVGSFIHIFFHQERFLKLFDPRIWSILLSRNSQGSPSNRYFTIKGVILFVVAVLIYRINNRNMVERKNLYLIGLLPIPMNSIGPRNDTLEESVGSSNINRLIVSLLYLPKGKKISESCFLNPKESTWVLPITKKCSMPESNWGSRWWRNWIGKKRDSSCKISNETVAGIEILFKEKDLKYLEFLFVYYMDDPIRKDHDWELFDRLSLRKSRNRINLNSGPLFEILVKHWISYLMSAFREKIPIEVEGFFKQQGAGSTIQSNDIEHVSHLFSRNKWAISLQNCAQFHMWQFRQDLFVSWGKNPPESDFLRNVSRENWIWLDNVWLVNKDRFFSKVQNVSSNIQYDSTRSSFVQVTDSSQLKGSSDQSRDHLDSISNEDSEYHTLINQREIQQRKERSILWDPSFLQTERKEIESGRFPKCLSGYSSMSRLFTEREKQMINHLFPEEIEEFLGNPTRSVRSFFSDRWSELHLGSNPTERSTRDQKLLKKQQDLSFVPSRRSEKKEMVNIFKIITYLQNTVSIHPISSDPGCDMVPKDEPDMDSSNKISFLNKNPFFDLFHLFHDRNRGGYTLHYDFASEERFQEMADLFTLSITEPDLVYHKGFAFSIDSCGLDQKQFLNEARDESKKKSLLVLPPIFYEENESFSRRIRKKWVRISCGNDLEDPKPKIVVFASNNIMEAVTQYRLIRNLIQIQYSTYGYIRNVLNRFFLMNRSDRNFEYGIQRDQIGKDTLNHRTIMKYTINQYLSNLKKSQKKWFEPLILISRTERSMNRDPDAYRYKWSNGSKSFQEHLEQSVSKQKSRFQVVFDRLRINQYSIDWSEVIDKKDLSKSLRFFLSKSLLFLSKLLLFLSNSLPFFCVSFGNIPIHRSEIYIYEELKGPNDQLCNQLLESIGLQIVHLKKLKPFLLDDHDTSQKSKFLINGGTISPFLFNKIPKWMIDSFHTRNNRRKSFDNPDSYFSMIFHDQDNWLNPVKPFHRSSLISSFYKANRLRFLNNPHHFCFYWNTRFPFSVEKARINNSDFTYGQFLNILFIRNKIFSLCVGKKKHAFWGRDTISPIESQVSNIFIPNDFPQSGDETYNLYKSFHFPSRSDPFVRRAIYSIADISGTPLTEGQIVNFERTYCQPLSDMNLSDSEGKNLHQYLNFNSNMGLIHTPCSEKDLSSEKRKKWSLCLKKCVEKGQTYRTFQRDSAFSTLSKWNLFQTYMPWFLTSTGYKYLNLIFLDTFSDLLPILSSSQKFVSIFPDIMHGSGISWRILQKKLCLPQWNLISEISSKCLHNLLLSEEMIHRNNESPLISTHLRSPNAREFLYSILFLLLVAGYLVRTHLLFVSRASSELQTEFERVKSLMTPSSMIELRKLLDRYPTSEPNSFWLKNLFLVALEQLGDSLEEIRGSASGGNMLGPAYGVKSIRSKKKDWNINLIEIIDLIPNPINRITFSRNTRHLSHTSKEIYSLIRKRKNVNGDWIDEKIESWVANSDSIDDEEREFLVQFSTLTTENRIDQILLSLTHSDHLSKNDSGYQMIEQPGAIYLRYLVDIHKKHLMNYEFNPSCLAERRIFLAHYQTITYSQTSCGENSFHFPSHGKPFSLRLALSPSRGILVIGSIGTGRSYLVKYLATNSYVPFITVFLNKFLDNKSKGFLLDEIDIDDSDDIDDSDNLDASDDIDRDLDTELELLTRMNGLTVDMMPEIDRFYITLQFELAKAMSPCIIWIPNIHDLDVNESNDLSLGLLVNHLSRDCERCSTRNILVIASTHIPQKVDPALIAPNKLNTCIKIRRLLIPQQRKHFFTLSYTRGFHLEKKMFHTNGFGSITMGSNARDLVALTNEVLSISITQKKSIIDTNTIRSALHRQTWDLRSQVRSVQDHGILFYQIGRAVAQNVLLSNCPIDPISIYMKKKSCNEGDSYLYKWYFELGTSMKRLTILLYLLSCSAGSVAQDLWSLSVPDEKNGITSYGLVENDSDLVHGLLEVEGALVGSSRTEKDCSQFDNDRVTLLLRPEPRNPLDMMQKGSWSILDQRFLYEKYESEFEEGEGEGALDPQEDLFNHIVWAPRIWRPWGFLFDCIERPNELGFPYWSRSFRGKRIIYDEEDELQENDSGFLQSGTMQYQTRDRSQGLFRISQFIWDPADPLFFLFKDQPPGSVFSHRELFADEEMSKGLLTSQTDPPTSLYKRWFIKNTQEKHFELLINRQRWLRTNSSLSNGSFRSNTLSESYQYLSNLFLSNGTLLDQMPKTLLRKRWLFPDEMKIGFM Ycf2_protein FILE2 ycf2 ycf2_1 88196 95032 D 1 6837 Ok MRGHQFKSWIFELREILREIKNSHHFLDSWTQFNSVGSFIHIFFHQERFLKLFDPRIWSILLSRNSQGSPSNRYFTIKGVILFVVAVLIYRINNRNMVERKNLYLIGLLPIPMNSIGPRNDTLEESVGSSNINRLIVSLLYLPKGKKISESCFLNPKESTWVLPITKKCSMPESNWGSRWWRNWIGKKRDSSCKISNETVAGIEILFKEKDLKYLEFLFVYYMDDPIRKDHDWELFDRLSLRKSRNRINLNSGPLFEILVKHWISYLMSAFREKIPIEVEGFFKQQGAGSTIQSNDIEHVSHLFSRNKWAISLQNCAQFHMWQFRQDLFVSWGKNPPESDFLRNVSRENWIWLDNVWLVNKDRFFSKVQNVSSNIQYDSTRSSFVQVTDSSQLKGSSDQSRDHLDSISNEDSEYHTLINQREIQQRKERSILWDPSFLQTERKEIESGRFPKCLSGYSSMSRLFTEREKQMINHLFPEEIEEFLGNPTRSVRSFFSDRWSELHLGSNPTERSTRDQKLLKKQQDLSFVPSRRSEKKEMVNIFKIITYLQNTVSIHPISSDPGCDMVPKDEPDMDSSNKISFLNKNPFFDLFHLFHDRNRGGYTLHYDFASEERFQEMADLFTLSITEPDLVYHKGFAFSIDSCGLDQKQFLNEARDESKKKSLLVLPPIFYEENESFSRRIRKKWVRISCGNDLEDPKPKIVVFASNNIMEAVTQYRLIRNLIQIQYSTYGYIRNVLNRFFLMNRSDRNFEYGIQRDQIGKDTLNHRTIMKYTINQYLSNLKKSQKKWFEPLILISRTERSMNRDPDAYRYKWSNGSKSFQEHLEQSVSKQKSRFQVVFDRLRINQYSIDWSEVIDKKDLSKSLRFFLSKSLLFLSKLLLFLSNSLPFFCVSFGNIPIHRSEIYIYEELKGPNDQLCNQLLESIGLQIVHLKKLKPFLLDDHDTSQKSKFLINGGTISPFLFNKIPKWMIDSFHTRNNRRKSFDNPDSYFSMIFHDQDNWLNPVKPFHRSSLISSFYKANRLRFLNNPHHFCFYWNTRFPFSVEKARINNSDFTYGQFLNILFIRNKIFSLCVGKKKHAFWGRDTISPIESQVSNIFIPNDFPQSGDETYNLYKSFHFPSRSDPFVRRAIYSIADISGTPLTEGQIVNFERTYCQPLSDMNLSDSEGKNLHQYLNFNSNMGLIHTPCSEKDLSSEKRKKWSLCLKKCVEKGQTYRTFQRDSAFSTLSKWNLFQTYMPWFLTSTGYKYLNLIFLDTFSDLLPILSSSQKFVSIFPDIMHGSGISWRILQKKLCLPQWNLISEISSKCLHNLLLSEEMIHRNNESPLISTHLRSPNAREFLYSILFLLLVAGYLVRTHLLFVSRASSELQTEFERVKSLMTPSSMIELRKLLDRYPTSEPNSFWLKNLFLVALEQLGDSLEEIRGSASGGNMLGPAYGVKSIRSKKKDWNINLIEIIDLIPNPINRITFSRNTRHLSHTSKEIYSLIRKRKNVNGDWIDEKIESWVANSDSIDDEEREFLVQFSTLTTENRIDQILLSLTHSDHLSKNDSGYQMIEQPGAIYLRYLVDIHKKHLMNYEFNPSCLAERRIFLAHYQTITYSQTSCGENSFHFPSHGKPFSLRLALSPSRGILVIGSIGTGRSYLVKYLATNSYVPFITVFLNKFLDNKSKGFLLDEIDIDDSDDIDDSDNLDASDDIDRDLDTELELLTRMNGLTVDMMPEIDRFYITLQFELAKAMSPCIIWIPNIHDLDVNESNDLSLGLLVNHLSRDCERCSTRNILVIASTHIPQKVDPALIAPNKLNTCIKIRRLLIPQQRKHFFTLSYTRGFHLEKKMFHTNGFGSITMGSNARDLVALTNEVLSISITQKKSIIDTNTIRSALHRQTWDLRSQVRSVQDHGILFYQIGRAVAQNVLLSNCPIDPISIYMKKKSCNEGDSYLYKWYFELGTSMKRLTILLYLLSCSAGSVAQDLWSLSVPDEKNGITSYGLVENDSDLVHGLLEVEGALVGSSRTEKDCSQFDNDRVTLLLRPEPRNPLDMMQKGSWSILDQRFLYEKYESEFEEGEGEGALDPQEDLFNHIVWAPRIWRPWGFLFDCIERPNELGFPYWSRSFRGKRIIYDEEDELQENDSGFLQSGTMQYQTRDRSQGLFRISQFIWDPADPLFFLFKDQPPGSVFSHRELFADEEMSKGLLTSQTDPPTSLYKRWFIKNTQEKHFELLINRQRWLRTNSSLSNGSFRSNTLSESYQYLSNLFLSNGTLLDQMPKTLLRKRWLFPDEMKIGFM Ycf2 MATCH ycf2 ID 100 CORRECT FILE1 ycf15 ycf15 95123 95386 D 1 264 Ok METLVSSIFWTLAPWKNMLLLKHGRIEILDQNTMYGWYELPKQEFLNSKQPVQIFTTKKYWILFRIGPERRRKAGMPIGVYYIEFTR Ycf15_protein FILE2 NONE MATCH ycf15 ID 0 MISSED.WRONG_STOP FILE1 ndhb ndhB 96224 98435 R 2 1533 Ok MIWHVQNENFILDSTRIFMKAFHLLLFDGSLIFPECILIFGLILLLMIDSTSDQKDIPWLYFISSTSLVMSITALLFRWREEPMISFSGNFQTNNFNEIFQFLILLCSTLCIPLSVEYIECTEMAITEFLLFVLTATLGGMFLCGANDLITIFVAPECFSLCSYLLSGYTKKDVRSNEATMKYLLMGGASSSILVHGFSWLYGSSGGEIELQEIVNGLINTQMYNSPGISIALIFITVGIGFKLSPAPSHQWTPDVYEGSPTPVVAFLSVTSKVAASASATRIFNIPFYFSSNEWHLLLEILAILSMILGNLIAITQTSMKRMLAYSSIGQIGYVIIGIIVGDSNDGYASMITYMLFYISMNLGTFACIVLFGLRTGTDNIRDYAGLYTKDPFLALSLALCLLSLGGLPPLAGFFGKLYLFWCGWQAGLYFLVLIGLLTSVVSIYYYLKIIKLLMTGRNQEITPHVRNYRRSPLRSNNSIELSMIVCVIASTIPGISMNPIIAIAQDSLF NADH_dehydrogenase_subunit_2 FILE2 ndhb ndhB_1 96224 98435 R 2 1626 Ok MIWHVQNENFILDSTRIFMKAFHLLLFDGSLIFPECILIFGLILLLMIDSTSDQKDIPWLYFISSTSLVMSITALLFRWREEPMISFSGNFQTNNFNEIFQFLILLCSTLCIPLSVEYIECTEMAITEFLLFVLTATLGGMFLCGANDLITIFVAPECFSLCSYLLSGYTKKDVRSNEATMKYLLMGGASSSILVHGFSWLYGSSGGEIELQEIVNGLINTQMYNSPGISIALIFITVGIGFKLSPAPSHQWTPDVYEGVRFVREIPTSLSISEMFGFFKTPWTCRREMLSPTPVVAFLSVTSKVAASASATRIFNIPFYFSSNEWHLLLEILAILSMILGNLIAITQTSMKRMLAYSSIGQIGYVIIGIIVGDSNDGYASMITYMLFYISMNLGTFACIVLFGLRTGTDNIRDYAGLYTKDPFLALSLALCLLSLGGLPPLAGFFGKLYLFWCGWQAGLYFLVLIGLLTSVVSIYYYLKIIKLLMTGRNQEITPHVRNYRRSPLRSNNSIELSMIVCVIASTIPGISMNPIIAIAQDSLF NADH_dehydrogenase_subunit_2 MATCH ndhb ID 94 ALMOST_CORRECT.BAD_JUNCTION FILE1 rps7 rps7 98721 99188 R 1 468 Ok MSRRGTAEKKTAKSDPIYRNRLVNMLVNRILKHGKKSLAYQIIYRAVKKIQQKTETNPLSVLRQAIRGVTPDITVKARRVGGSTHQVPIEIGSTQGKALAIRWLLAASRKRPGRNMAFKLSSELVDAAKGSGDAIRKKEETHRMAEANRAFAHFR ribosomal_protein_S7 FILE2 rps7 rps7_1 98721 99188 R 1 468 Ok MSRRGTAEKKTAKSDPIYRNRLVNMLVNRILKHGKKSLAYQIIYRAVKKIQQKTETNPLSVLRQAIRGVTPDITVKARRVGGSTHQVPIEIGSTQGKALAIRWLLAASRKRPGRNMAFKLSSELVDAAKGSGDAIRKKEETHRMAEANRAFAHFR ribosomal_protein_S7 MATCH rps7 ID 100 CORRECT FILE1 ndhf ndhF 111508 113721 R 1 2214 Ok MEQTYEYAWIIPFIPLPVPMLIGAGLILFPTATKRFRRMWAFQSVLLLSIVMIFSIYLSIQQINSSSVYQYVWSWIINNDFSLDFGYLIDPLTSIMSILITTVGIMVLIYSDNYMAHDQGYLRFFAYMSFFSTSMLGLVTSSNLIQIYIFWELVGLCSYLLIGFWFTRPVAANACQKAFVTNRVGDFGLLLGILGFYWITGSFEFRDLFEIFNNLIYNNELNFLFVTLCAVLLFAGAVAKSAQFPLHVWLPDAMEGPTPISALIHAATMVAAGIFLVARLLPLFRVIPYIMYLISVIGIITVLLGATLALAQKDIKRGLAYSTMSQLGYMMLALGMGSYRSALFHLITHAYSKALLFLGSGSIIHSMETIVGYSPAKSQNMGLMGGLRKHVPITKITFLLGTLSLCGIPPLACFWSKDEILNDSWLYSPIFAIIAWATAGLTAFYMFRIYLLTFEGHLNAHFQNYGGKQKIPFYSISLWGKNGVKKNSCLLTMNNNESTYFLSKTKYPIAKNGRKMTRPFMTIAHFKHKAVSSYPYESDNTMLFPIFVLGLFTLFVGAIGIPFNQEGVNLDILSKWLAPSINLLHPKSNNSLDWNEFLKDAVVSVSIAYFGIFIASFLYKPIYSSLKNLEFINSFVKKGPKRILWDKILNGIYDWSYNRAYIDAFYTRFFVGGIRGLAEFTHFVDRRVIDGMTNGVGVISFIVGEGIKYIGGGRISSYLFLYLAYVSVFLLVYYLLF NADH_dehydrogenase_subunit_5 FILE2 ndhf ndhF 111508 113721 R 1 2214 Ok MEQTYEYAWIIPFIPLPVPMLIGAGLILFPTATKRFRRMWAFQSVLLLSIVMIFSIYLSIQQINSSSVYQYVWSWIINNDFSLDFGYLIDPLTSIMSILITTVGIMVLIYSDNYMAHDQGYLRFFAYMSFFSTSMLGLVTSSNLIQIYIFWELVGLCSYLLIGFWFTRPVAANACQKAFVTNRVGDFGLLLGILGFYWITGSFEFRDLFEIFNNLIYNNELNFLFVTLCAVLLFAGAVAKSAQFPLHVWLPDAMEGPTPISALIHAATMVAAGIFLVARLLPLFRVIPYIMYLISVIGIITVLLGATLALAQKDIKRGLAYSTMSQLGYMMLALGMGSYRSALFHLITHAYSKALLFLGSGSIIHSMETIVGYSPAKSQNMGLMGGLRKHVPITKITFLLGTLSLCGIPPLACFWSKDEILNDSWLYSPIFAIIAWATAGLTAFYMFRIYLLTFEGHLNAHFQNYGGKQKIPFYSISLWGKNGVKKNSCLLTMNNNESTYFLSKTKYPIAKNGRKMTRPFMTIAHFKHKAVSSYPYESDNTMLFPIFVLGLFTLFVGAIGIPFNQEGVNLDILSKWLAPSINLLHPKSNNSLDWNEFLKDAVVSVSIAYFGIFIASFLYKPIYSSLKNLEFINSFVKKGPKRILWDKILNGIYDWSYNRAYIDAFYTRFFVGGIRGLAEFTHFVDRRVIDGMTNGVGVISFIVGEGIKYIGGGRISSYLFLYLAYVSVFLLVYYLLF NADH_dehydrogenase_subunit_5 MATCH ndhf ID 100 CORRECT FILE1 none none 110372 111511 D 1 1140 Ok MIFQSFLLGNLVSLCMKIINSVVVVGLYYGFLTTFSIGPSYLFLLRALVMEEGTEKKVSATTGFITGQLMMFISIYYAPLHLALGRPHTITVLALPYLLFHFFWNNHKHFFDYGSTTRNSMRNLSIQCVFLNNLIFQLFNHFILPSSMLARLVNIYLFRCNNKILFVTSGFVGWLIGHILFMKWLGLVLVWIRQNHSIRSNKYIRSNKYLVLELRNSMARIFSILLFITCVYYLGRIPSPILTKKLKEASKTEERVESEEERDVEIETASEMKGTKQEQEGSTEEDPYPSPSLFSEEGWDPDKIDETEEIRVNGKDKIKDKFHSHLTETGYNNINTSNSPIYDYQDSYLNNNNTGNLENCKLQLLDKKNENQEFLIQKV hypothetical_protein FILE2 NONE MATCH none ID 0 MISSED.WRONG_STOP FILE1 rpl32 rpl32 114504 114671 D 1 168 Ok MAVPKKRTSTSKKRIRKNIWKRKGYWVALKAFSLAKSLSTGNSKSFFVRQTKINK ribosomal_protein_L32 FILE2 rpl32 rpl32 114504 114671 D 1 168 Ok MAVPKKRTSTSKKRIRKNIWKRKGYWVALKAFSLAKSLSTGNSKSFFVRQTKINK ribosomal_protein_L32 MATCH rpl32 ID 100 CORRECT FILE1 ccsa ccsA 115765 116706 D 1 942 Ok MIFSTLEHILTHISFSIVSIVITIHLITFLVDEIVKLYDSSEKGIIVTFFCITGLLVTRWVSSGHFPLSDLYESLIFLSWSFSLIHIIPYFKKNVLILSKITGPSAILTQGFATSGILTEIHQSGILVPALQSEWLIMHVSMMILGYAALLCGSLLSVALLVITFRKNRKLFSKSNVFLNESFFLGENVVENTSFFCTKNYYRSQLIQQLDYWSYRVISLGFTFLTIGILSGAVWANEAWGSYWNWDPKETWAFITWIVFAIYLHTRTNRNLRGPNSAIVASIGFLIIWICYFGVNLLGIGLHSYGSFPSTFN cytochrome_c_biogenesis_protein FILE2 ccsa ccsA 115765 116706 D 1 942 Ok MIFSTLEHILTHISFSIVSIVITIHLITFLVDEIVKLYDSSEKGIIVTFFCITGLLVTRWVSSGHFPLSDLYESLIFLSWSFSLIHIIPYFKKNVLILSKITGPSAILTQGFATSGILTEIHQSGILVPALQSEWLIMHVSMMILGYAALLCGSLLSVALLVITFRKNRKLFSKSNVFLNESFFLGENVVENTSFFCTKNYYRSQLIQQLDYWSYRVISLGFTFLTIGILSGAVWANEAWGSYWNWDPKETWAFITWIVFAIYLHTRTNRNLRGPNSAIVASIGFLIIWICYFGVNLLGIGLHSYGSFPSTFN cytochrome_c_biogenesis_protein MATCH ccsa ID 100 CORRECT FILE1 ndhd ndhD 116944 118446 R 1 1503 Ok MNYFPWLTIIVVFPIFAGSLIFFLPHKGNRVIRWYTICICILELLLTTYAFCYHFQSDDPLIQLVEDYKWIDFFDFHWRLGIDGLSIGPILLTGFITTLATLAAWPVTRDSRLFHFLMLAMYSGQIGLFSSRDLLLFFIMWELELIPVYLLLAMWGGKKRLYSATKFILYTAGGSVFLLMGVLGVALYGSNEPTLNFETSVNQSYPVVLEIIFYIGFFIAFAVKSPIIPLHTWLPDTHGEAHYSTCMLLAGILLKMGAYGLIRINMELLPHAHSIFSPWLMIIGTIQIIYAASTSLGQRNLKKRIAYSSVSHMGFIIIGISSLTDTGLNGALLQIISHGFIGAALFFLAGTTYDRIRLVYLDEMGGIAIPMPKMFTMFSSFSMASLALPGMSGFVAELIVFFGIITGQKYLLMPKLLITFVMAIGIILTPIYSLSMPRQMFYGYKLFNAPKDSFFDSGPRELFLSISIFLPVIGIGIYPDFVLSLAVDKVEVILSNFFYR NADH_dehydrogenase_subunit_4 FILE2 ndhd ndhD 116944 118446 R 1 1503 Ok MNYFPWLTIIVVFPIFAGSLIFFLPHKGNRVIRWYTICICILELLLTTYAFCYHFQSDDPLIQLVEDYKWIDFFDFHWRLGIDGLSIGPILLTGFITTLATLAAWPVTRDSRLFHFLMLAMYSGQIGLFSSRDLLLFFIMWELELIPVYLLLAMWGGKKRLYSATKFILYTAGGSVFLLMGVLGVALYGSNEPTLNFETSVNQSYPVVLEIIFYIGFFIAFAVKSPIIPLHTWLPDTHGEAHYSTCMLLAGILLKMGAYGLIRINMELLPHAHSIFSPWLMIIGTIQIIYAASTSLGQRNLKKRIAYSSVSHMGFIIIGISSLTDTGLNGALLQIISHGFIGAALFFLAGTTYDRIRLVYLDEMGGIAIPMPKMFTMFSSFSMASLALPGMSGFVAELIVFFGIITGQKYLLMPKLLITFVMAIGIILTPIYSLSMPRQMFYGYKLFNAPKDSFFDSGPRELFLSISIFLPVIGIGIYPDFVLSLAVDKVEVILSNFFYR NADH_dehydrogenase_subunit_4 MATCH ndhd ID 100 CORRECT FILE1 psac psaC 118564 118809 R 1 246 Ok MSHSVKIYDTCIGCTQCVRACPTDVLEMIPWDGCKAKQIASAPRTEDCVGCKRCESACPTDFLSVRVYLWHETTRSMGLAY photosystem_I_subunit_VII FILE2 psac psaC 118564 118809 R 1 246 Ok MSHSVKIYDTCIGCTQCVRACPTDVLEMIPWDGCKAKQIASAPRTEDCVGCKRCESACPTDFLSVRVYLWHETTRSMGLAY photosystem_I_subunit_VII MATCH psac ID 100 CORRECT FILE1 ndhe ndhE 119061 119366 R 1 306 Ok MILEHVLVLSAYLFSIGIYGLITSRNMVRALMCLELILNAVNINFVTFSDFFDNRQLKGDIFSIFVIAIAAAEAAIGLAIVSSIYRNRKSTRINQSNLLNN NADH_dehydrogenase_subunit_4 FILE2 ndhe ndhE 119061 119366 R 1 306 Ok MILEHVLVLSAYLFSIGIYGLITSRNMVRALMCLELILNAVNINFVTFSDFFDNRQLKGDIFSIFVIAIAAAEAAIGLAIVSSIYRNRKSTRINQSNLLNN NADH_dehydrogenase_subunit_4L MATCH ndhe ID 100 CORRECT FILE1 ndhg ndhG 119590 120120 R 1 531 Ok MDLSEPIHDFLLVFLGSGLILGGLGVVLLPNPIYSAFSLGLVLVCTSLFYILSNAYFVAAAQLLIYVGAINVLIIFAVMFMNGSEYYKDFHLWTVGDGITSMVCISLFISLITTISDTSWYGIIWTTRSNQIIEQDFLSNSQQIGIHLSTDFFLPFELISIILLVALIGAIAVARQ NADH_degydrogenase_subunit_6 FILE2 ndhg ndhG 119590 120120 R 1 531 Ok MDLSEPIHDFLLVFLGSGLILGGLGVVLLPNPIYSAFSLGLVLVCTSLFYILSNAYFVAAAQLLIYVGAINVLIIFAVMFMNGSEYYKDFHLWTVGDGITSMVCISLFISLITTISDTSWYGIIWTTRSNQIIEQDFLSNSQQIGIHLSTDFFLPFELISIILLVALIGAIAVARQ NADH_dehydrogenase_subunit_6 MATCH ndhg ID 100 CORRECT FILE1 ndhi ndhI 120525 121028 R 1 504 Ok MLPMITEFINYGQQTIRAARYIGQGFMITLSHANRLPVTIQYPYEKLITSERFRGRIHFEFDKCIACEVCVRVCPIDLPVVDWKLETDIRKKRLLNYSIDFGICIFCGNCVEYCPTNCLSMTEEYELSTYDRHELNYNQIALGRLPMSVIDDYTIRTISNLPQINNE NADH_dehydrogenase_subunit_I FILE2 ndhi ndhI 120525 121028 R 1 504 Ok MLPMITEFINYGQQTIRAARYIGQGFMITLSHANRLPVTIQYPYEKLITSERFRGRIHFEFDKCIACEVCVRVCPIDLPVVDWKLETDIRKKRLLNYSIDFGICIFCGNCVEYCPTNCLSMTEEYELSTYDRHELNYNQIALGRLPMSVIDDYTIRTISNLPQINNE NADH_dehydrogenase_subunit_I MATCH ndhi ID 100 CORRECT FILE1 ndha ndhA 121113 123337 R 2 1092 Ok MIIDTTEIETINSFSKLESLKEVYGIIWMLVPIVTLVLGITIGVLVIVWLEREISAGIQQRIGPEYAGPLGILQALADGTKLLLKENLIPSTGDTRLFSIGPSIAVISIFLSYSVIPFGDHLVLADLSIGVFFWIAISSIAPVGLLMSGYGSNNKYSFLGGLRAAAQSISYEIPLALCVLSISLLSNSLSTVDIVEAQSKYGFWGWNLWRQPIGFIVFLISSLAECERLPFDLPEAEEELVAGYQTEYSGIKFGLFYIASYLNLLVSSLFVTVLYLGGWNLSIPYIFVPDIFGINKGGKVFGTLIGIFITLAKTYLFLFIPIATRWTLPRLRMDQLLNLGWKFLLPISLGNLLLTTSSQLLSL NADH_dehydrogenase_subunit_1 FILE2 ndha ndhA_1 121113 121652 R 1 540 Ok LSNSLSTVDIVEAQSKYGFWGWNLWRQPIGFIVFLISSLAECERLPFDLPEAEEELVAGYQTEYSGIKFGLFYIASYLNLLVSSLFVTVLYLGGWNLSIPYIFVPDIFGINKGGKVFGTLIGIFITLAKTYLFLFIPIATRWTLPRLRMDQLLNLGWKFLLPISLGNLLLTTSSQLLSL NADH_dehydrogenase_subunit_1 MATCH ndha ID 49 WRONG.BAD_NBEXON.BAD_START FILE1 NONE FILE2 ndha ndhA_2 122771 123337 R 1 567 Ok MIIDTTEIETINSFSKLESLKEVYGIIWMLVPIVTLVLGITIGVLVIVWLEREISAGIQQRIGPEYAGPLGILQALADGTKLLLKENLIPSTGDTRLFSIGPSIAVISIFLSYSVIPFGDHLVLADLSIGVFFWIAISSIAPVGLLMSGYGSNNKYSFLGGLRAAAQSISYEIPLALCVLSISLRVIR NADH_dehydrogenase_subunit_1 MATCH ndha ID 0 OVERPRED.WRONG_STOP FILE1 ndhh ndhH 123339 124520 R 1 1182 Ok MTAPTTRKDLMIVNMGPQHPSMHGVLRLIVTLDGEDVVDCEPILGYLHRGMEKIAENRTIIQYLPYVTRWDYLATMFTEAITINGPEQLGNIQVPKRASYIRVIMLELSRIASHLLWLGPFMADIGAQTPFFYIFRERELIYDLFEAATGMRMMHNYFRIGGVAADLPYGWIDKCLDFCDYFLTGVAEYQKLITRNPIFLERVEGVGIIGRDEALNWGLSGPMLRASGIEWDLRKVDHYESYDEFDWQVQWQREGDSLARYLVRIGEMTESIKIIQQALEGIPGGPYENLEMRRFDRLKDPEWNDFEYRFISKKPSPTFELSKQELYVRVEAPKGELGIFLIGDQSVFPWRWKIRPPGFINLQILPQLVKRMKLADIMTILGSIDIIMGEVDR NADH_dehydrogenase_subunit_7 FILE2 ndhh ndhH 123339 124520 R 1 1182 Ok MTAPTTRKDLMIVNMGPQHPSMHGVLRLIVTLDGEDVVDCEPILGYLHRGMEKIAENRTIIQYLPYVTRWDYLATMFTEAITINGPEQLGNIQVPKRASYIRVIMLELSRIASHLLWLGPFMADIGAQTPFFYIFRERELIYDLFEAATGMRMMHNYFRIGGVAADLPYGWIDKCLDFCDYFLTGVAEYQKLITRNPIFLERVEGVGIIGRDEALNWGLSGPMLRASGIEWDLRKVDHYESYDEFDWQVQWQREGDSLARYLVRIGEMTESIKIIQQALEGIPGGPYENLEMRRFDRLKDPEWNDFEYRFISKKPSPTFELSKQELYVRVEAPKGELGIFLIGDQSVFPWRWKIRPPGFINLQILPQLVKRMKLADIMTILGSIDIIMGEVDR NADH_dehydrogenase_subunit_7 MATCH ndhh ID 100 CORRECT FILE1 rps15 rps15 124632 124895 R 1 264 Ok MVKNSVISVISQEEKKGSVEFQVFNFTNKIRRLTSHLELHKKDYLSQRGLKKILGKRQRLLAYLAKKNRVRYKELINRLDIRETKTR ribosomal_protein_S15 FILE2 rps15 rps15 124632 124895 R 1 264 Ok MVKNSVISVISQEEKKGSVEFQVFNFTNKIRRLTSHLELHKKDYLSQRGLKKILGKRQRLLAYLAKKNRVRYKELINRLDIRETKTR ribosomal_protein_S15 MATCH rps15 ID 100 CORRECT FILE1 ycf1 ycf1 125297 130972 R 1 5676 Ok MIFQSFLLGNLVSLCMKIINSVVVVGLYYGFLTTFSIGPSYLFLLRALVMEEGTEKKVSATTGFITGQLMMFISIYYAPLHLALGRPHTITVLALPYLLFHFFWNNHKHFFDYGSTTRNSMRNLSIQCVFLNNLIFQLFNHFILPSSMLARLVNIYLFRCNNKILFVTSGFVGWLIGHILFMKWLGLVLVWIRQNHSIRSNKYIRSNKYLVLELRNSMARIFSILLFITCVYYLGRIPSPILTKKLKEASKTEERVESEEERDVEIETASEMKGTKQEQEGSTEEDPYPSPSLFSEEGWDPDKIDETEEIRVNGKDKIKDKFHSHLTETGYNNINTSNSPIYDYQDSYLNNNNTGNLENCKLQLLDKKNENQEQDLFWFQKPLVSLLFDYNRWNRPFRYIKNNRFEQAVRTEMSQYFFDTCKSDGKQKISFTYPPSLSTFWKMIKRKIPLLSLQKTLPNELDTQWVSTNKEKSNNLNKEFLNRLEILDKESLSLDILETRTRFCNDDTKKEYVPKMYDPLLNGLYRGTIKKGVSSSIINNTLLENWEKRVRLNRIHTIFLPNIDYQEFEQKAYTIDKKPLSTEIDEFLTLINELGNEAKSSLNLKGLSLFSDQEQRRANSEKRTKFVKFVFNALDPNETKSGKKSIGIKEISKKVPRWSHKLITELDQQMGEFKDRASMDHQLRSRKAKRVVIFTDNKATKDAEEEVALISYSQQSDFRRGIITGSMRAQRRKTFISKLFQANVHSPLFVDRITPLRLFSFDISELIKPILKNWTDKEGEFKILESREEQTKREEKKEKDKKEDNKRKEQARIAIEEAWDTIPLAQIIRGYMLITQSILRKYILLPALIIAKNIGRMLFLQLPEWSEDLQEWNREMQIKCTYNGVQLSETEFPKNWLRDGIQIKILFPFCLKPWHISKLYPSRRELMKKQKQKDDFCFLTVWGMEAELPFGSPRKRPSFFEPIFKELEKKIGKFKKKYFLTLKILKGKTKLFRKVSKETTKLFIKSIGFLKKIKKELSKVNLIVLFRFKEISESNETKKEKDYLISNQIINESFRQIESGNWPNSSLIETKMKDLTNRTSTIKNKIERITKEKKKVTPEIDINPNKTNNIKKFESPKKIFQILKSRNTRVIWKFHYFLKLFIQRLYINLFLSIINIPRITTQLFLKSTNKLIEKFISNNEINQEKINNKKKIHFMFISTIKKSLYNISKKNSHILCDLSYLSQAYVFYKLSQTQVINFSKFRSVLQYNTTSCFLKTKIKDYFKTLGIFHSELKHKKLQSYRINQWKNWLRWHYQYDLSQIRWSRLMPKKWRTRVNQSCMAQNKNRNLNKWNSYEKDQLLHYKKENDSELYSLSNEKDNFKKCYGYGLLAYKSINYENKSDSFFSRLPFEVQVKKNLEISYNSNTSKHNFVDMPGNLHINNYLRKGNILDRERNLDRKYFDWKIIHFSLRQKGDIEAWVKIDTNSNPNTKIGINNYQIIDKIEKKGVFYLTTHQNPEKTQKNSKKFFFDWMGMNEKIFNRPILNLEFWFFPEFVLLYNVYKIKPWIIPSKFLLFNLNTNKNVSQNKNQNFFLPSNKKIKIKNRSQEAKEPPSQRERGSDIENKGNLSPVFSKHQTDLEKDYVESDTKKGKNKKQYKSNTEAELDLFLKRYLLFQLRWNGALNQRMFENIKVYCLLLRLINPTKITISSIQRREMSLDIMLIQANLPLTDLMKKGVLIIEPIRLSVKDNGQFIMYQTIGISLIHKSKHQTNQRYREQRYVDKKNFDEFILQPQTQRINTEKTHFGLLVPENILWSRRRRELRIRSFFNSWNWNVVDRNSVFCNETNVKNWSQFLGERKPLYKDKNELIKFKFFFWPNYRLEDLACMNRYWFDTNNGSRFSILRIHMYPRLKIN ycf1_protein FILE2 ycf1 ycf1 125297 130972 R 1 5676 Ok MIFQSFLLGNLVSLCMKIINSVVVVGLYYGFLTTFSIGPSYLFLLRALVMEEGTEKKVSATTGFITGQLMMFISIYYAPLHLALGRPHTITVLALPYLLFHFFWNNHKHFFDYGSTTRNSMRNLSIQCVFLNNLIFQLFNHFILPSSMLARLVNIYLFRCNNKILFVTSGFVGWLIGHILFMKWLGLVLVWIRQNHSIRSNKYIRSNKYLVLELRNSMARIFSILLFITCVYYLGRIPSPILTKKLKEASKTEERVESEEERDVEIETASEMKGTKQEQEGSTEEDPYPSPSLFSEEGWDPDKIDETEEIRVNGKDKIKDKFHSHLTETGYNNINTSNSPIYDYQDSYLNNNNTGNLENCKLQLLDKKNENQEQDLFWFQKPLVSLLFDYNRWNRPFRYIKNNRFEQAVRTEMSQYFFDTCKSDGKQKISFTYPPSLSTFWKMIKRKIPLLSLQKTLPNELDTQWVSTNKEKSNNLNKEFLNRLEILDKESLSLDILETRTRFCNDDTKKEYVPKMYDPLLNGLYRGTIKKGVSSSIINNTLLENWEKRVRLNRIHTIFLPNIDYQEFEQKAYTIDKKPLSTEIDEFLTLINELGNEAKSSLNLKGLSLFSDQEQRRANSEKRTKFVKFVFNALDPNETKSGKKSIGIKEISKKVPRWSHKLITELDQQMGEFKDRASMDHQLRSRKAKRVVIFTDNKATKDAEEEVALISYSQQSDFRRGIITGSMRAQRRKTFISKLFQANVHSPLFVDRITPLRLFSFDISELIKPILKNWTDKEGEFKILESREEQTKREEKKEKDKKEDNKRKEQARIAIEEAWDTIPLAQIIRGYMLITQSILRKYILLPALIIAKNIGRMLFLQLPEWSEDLQEWNREMQIKCTYNGVQLSETEFPKNWLRDGIQIKILFPFCLKPWHISKLYPSRRELMKKQKQKDDFCFLTVWGMEAELPFGSPRKRPSFFEPIFKELEKKIGKFKKKYFLTLKILKGKTKLFRKVSKETTKLFIKSIGFLKKIKKELSKVNLIVLFRFKEISESNETKKEKDYLISNQIINESFRQIESGNWPNSSLIETKMKDLTNRTSTIKNKIERITKEKKKVTPEIDINPNKTNNIKKFESPKKIFQILKSRNTRVIWKFHYFLKLFIQRLYINLFLSIINIPRITTQLFLKSTNKLIEKFISNNEINQEKINNKKKIHFMFISTIKKSLYNISKKNSHILCDLSYLSQAYVFYKLSQTQVINFSKFRSVLQYNTTSCFLKTKIKDYFKTLGIFHSELKHKKLQSYRINQWKNWLRWHYQYDLSQIRWSRLMPKKWRTRVNQSCMAQNKNRNLNKWNSYEKDQLLHYKKENDSELYSLSNEKDNFKKCYGYGLLAYKSINYENKSDSFFSRLPFEVQVKKNLEISYNSNTSKHNFVDMPGNLHINNYLRKGNILDRERNLDRKYFDWKIIHFSLRQKGDIEAWVKIDTNSNPNTKIGINNYQIIDKIEKKGVFYLTTHQNPEKTQKNSKKFFFDWMGMNEKIFNRPILNLEFWFFPEFVLLYNVYKIKPWIIPSKFLLFNLNTNKNVSQNKNQNFFLPSNKKIKIKNRSQEAKEPPSQRERGSDIENKGNLSPVFSKHQTDLEKDYVESDTKKGKNKKQYKSNTEAELDLFLKRYLLFQLRWNGALNQRMFENIKVYCLLLRLINPTKITISSIQRREMSLDIMLIQANLPLTDLMKKGVLIIEPIRLSVKDNGQFIMYQTIGISLIHKSKHQTNQRYREQRYVDKKNFDEFILQPQTQRINTEKTHFGLLVPENILWSRRRRELRIRSFFNSWNWNVVDRNSVFCNETNVKNWSQFLGERKPLYKDKNELIKFKFFFWPNYRLEDLACMNRYWFDTNNGSRFSILRIHMYPRLKIN hypothetical_chloroplast_RF1 MATCH ycf1 ID 100 CORRECT FILE1 rps12 rps12 71639 142102 D 3 372 Error MPTIKQLIRNTRQPIRNVTKSPALRGCPQRRGTCTRVYTITPKKPNSALRKVARVRLTSGFEITAYIPGIGHNSQEHSVVLVRGGRVKDLPGVRYHIVRGTLDAVGVKDRQQGRSKYGVKKPK ribosomal_protein_S12 FILE2 NONE MATCH rps12 ID 0 MISSED.WRONG_STOP FILE1 rps7 rps7 142156 142623 D 1 468 Ok MSRRGTAEKKTAKSDPIYRNRLVNMLVNRILKHGKKSLAYQIIYRAVKKIQQKTETNPLSVLRQAIRGVTPDITVKARRVGGSTHQVPIEIGSTQGKALAIRWLLAASRKRPGRNMAFKLSSELVDAAKGSGDAIRKKEETHRMAEANRAFAHFR ribosomal_protein_S7 FILE2 rps7 rps7_2 142156 142623 D 1 468 Ok MSRRGTAEKKTAKSDPIYRNRLVNMLVNRILKHGKKSLAYQIIYRAVKKIQQKTETNPLSVLRQAIRGVTPDITVKARRVGGSTHQVPIEIGSTQGKALAIRWLLAASRKRPGRNMAFKLSSELVDAAKGSGDAIRKKEETHRMAEANRAFAHFR ribosomal_protein_S7 MATCH rps7 ID 100 CORRECT FILE1 ndhb ndhB 142909 145120 D 2 1533 Ok MIWHVQNENFILDSTRIFMKAFHLLLFDGSLIFPECILIFGLILLLMIDSTSDQKDIPWLYFISSTSLVMSITALLFRWREEPMISFSGNFQTNNFNEIFQFLILLCSTLCIPLSVEYIECTEMAITEFLLFVLTATLGGMFLCGANDLITIFVAPECFSLCSYLLSGYTKKDVRSNEATMKYLLMGGASSSILVHGFSWLYGSSGGEIELQEIVNGLINTQMYNSPGISIALIFITVGIGFKLSPAPSHQWTPDVYEGSPTPVVAFLSVTSKVAASASATRIFNIPFYFSSNEWHLLLEILAILSMILGNLIAITQTSMKRMLAYSSIGQIGYVIIGIIVGDSNDGYASMITYMLFYISMNLGTFACIVLFGLRTGTDNIRDYAGLYTKDPFLALSLALCLLSLGGLPPLAGFFGKLYLFWCGWQAGLYFLVLIGLLTSVVSIYYYLKIIKLLMTGRNQEITPHVRNYRRSPLRSNNSIELSMIVCVIASTIPGISMNPIIAIAQDSLF NADH_dehydrogenase_subunit_2 FILE2 ndhb ndhB_2 142909 145120 D 2 1626 Ok MIWHVQNENFILDSTRIFMKAFHLLLFDGSLIFPECILIFGLILLLMIDSTSDQKDIPWLYFISSTSLVMSITALLFRWREEPMISFSGNFQTNNFNEIFQFLILLCSTLCIPLSVEYIECTEMAITEFLLFVLTATLGGMFLCGANDLITIFVAPECFSLCSYLLSGYTKKDVRSNEATMKYLLMGGASSSILVHGFSWLYGSSGGEIELQEIVNGLINTQMYNSPGISIALIFITVGIGFKLSPAPSHQWTPDVYEGVRFVREIPTSLSISEMFGFFKTPWTCRREMLSPTPVVAFLSVTSKVAASASATRIFNIPFYFSSNEWHLLLEILAILSMILGNLIAITQTSMKRMLAYSSIGQIGYVIIGIIVGDSNDGYASMITYMLFYISMNLGTFACIVLFGLRTGTDNIRDYAGLYTKDPFLALSLALCLLSLGGLPPLAGFFGKLYLFWCGWQAGLYFLVLIGLLTSVVSIYYYLKIIKLLMTGRNQEITPHVRNYRRSPLRSNNSIELSMIVCVIASTIPGISMNPIIAIAQDSLF NADH_dehydrogenase_subunit_2 MATCH ndhb ID 94 ALMOST_CORRECT.BAD_JUNCTION FILE1 ycf15 ycf15 145958 146221 R 1 264 Ok METLVSSIFWTLAPWKNMLLLKHGRIEILDQNTMYGWYELPKQEFLNSKQPVQIFTTKKYWILFRIGPERRRKAGMPIGVYYIEFTR ycf15_protein FILE2 NONE MATCH ycf15 ID 0 MISSED.WRONG_STOP FILE1 ycf2 ycf2 146312 153148 R 1 6837 Ok MRGHQFKSWIFELREILREIKNSHHFLDSWTQFNSVGSFIHIFFHQERFLKLFDPRIWSILLSRNSQGSPSNRYFTIKGVILFVVAVLIYRINNRNMVERKNLYLIGLLPIPMNSIGPRNDTLEESVGSSNINRLIVSLLYLPKGKKISESCFLNPKESTWVLPITKKCSMPESNWGSRWWRNWIGKKRDSSCKISNETVAGIEILFKEKDLKYLEFLFVYYMDDPIRKDHDWELFDRLSLRKSRNRINLNSGPLFEILVKHWISYLMSAFREKIPIEVEGFFKQQGAGSTIQSNDIEHVSHLFSRNKWAISLQNCAQFHMWQFRQDLFVSWGKNPPESDFLRNVSRENWIWLDNVWLVNKDRFFSKVQNVSSNIQYDSTRSSFVQVTDSSQLKGSSDQSRDHLDSISNEDSEYHTLINQREIQQRKERSILWDPSFLQTERKEIESGRFPKCLSGYSSMSRLFTEREKQMINHLFPEEIEEFLGNPTRSVRSFFSDRWSELHLGSNPTERSTRDQKLLKKQQDLSFVPSRRSEKKEMVNIFKIITYLQNTVSIHPISSDPGCDMVPKDEPDMDSSNKISFLNKNPFFDLFHLFHDRNRGGYTLHYDFASEERFQEMADLFTLSITEPDLVYHKGFAFSIDSCGLDQKQFLNEARDESKKKSLLVLPPIFYEENESFSRRIRKKWVRISCGNDLEDPKPKIVVFASNNIMEAVTQYRLIRNLIQIQYSTYGYIRNVLNRFFLMNRSDRNFEYGIQRDQIGKDTLNHRTIMKYTINQYLSNLKKSQKKWFEPLILISRTERSMNRDPDAYRYKWSNGSKSFQEHLEQSVSKQKSRFQVVFDRLRINQYSIDWSEVIDKKDLSKSLRFFLSKSLLFLSKLLLFLSNSLPFFCVSFGNIPIHRSEIYIYEELKGPNDQLCNQLLESIGLQIVHLKKLKPFLLDDHDTSQKSKFLINGGTISPFLFNKIPKWMIDSFHTRNNRRKSFDNPDSYFSMIFHDQDNWLNPVKPFHRSSLISSFYKANRLRFLNNPHHFCFYWNTRFPFSVEKARINNSDFTYGQFLNILFIRNKIFSLCVGKKKHAFWGRDTISPIESQVSNIFIPNDFPQSGDETYNLYKSFHFPSRSDPFVRRAIYSIADISGTPLTEGQIVNFERTYCQPLSDMNLSDSEGKNLHQYLNFNSNMGLIHTPCSEKDLSSEKRKKWSLCLKKCVEKGQTYRTFQRDSAFSTLSKWNLFQTYMPWFLTSTGYKYLNLIFLDTFSDLLPILSSSQKFVSIFPDIMHGSGISWRILQKKLCLPQWNLISEISSKCLHNLLLSEEMIHRNNESPLISTHLRSPNAREFLYSILFLLLVAGYLVRTHLLFVSRASSELQTEFERVKSLMTPSSMIELRKLLDRYPTSEPNSFWLKNLFLVALEQLGDSLEEIRGSASGGNMLGPAYGVKSIRSKKKDWNINLIEIIDLIPNPINRITFSRNTRHLSHTSKEIYSLIRKRKNVNGDWIDEKIESWVANSDSIDDEEREFLVQFSTLTTENRIDQILLSLTHSDHLSKNDSGYQMIEQPGAIYLRYLVDIHKKHLMNYEFNPSCLAERRIFLAHYQTITYSQTSCGENSFHFPSHGKPFSLRLALSPSRGILVIGSIGTGRSYLVKYLATNSYVPFITVFLNKFLDNKSKGFLLDEIDIDDSDDIDDSDNLDASDDIDRDLDTELELLTRMNGLTVDMMPEIDRFYITLQFELAKAMSPCIIWIPNIHDLDVNESNDLSLGLLVNHLSRDCERCSTRNILVIASTHIPQKVDPALIAPNKLNTCIKIRRLLIPQQRKHFFTLSYTRGFHLEKKMFHTNGFGSITMGSNARDLVALTNEVLSISITQKKSIIDTNTIRSALHRQTWDLRSQVRSVQDHGILFYQIGRAVAQNVLLSNCPIDPISIYMKKKSCNEGDSYLYKWYFELGTSMKRLTILLYLLSCSAGSVAQDLWSLSVPDEKNGITSYGLVENDSDLVHGLLEVEGALVGSSRTEKDCSQFDNDRVTLLLRPEPRNPLDMMQKGSWSILDQRFLYEKYESEFEEGEGEGALDPQEDLFNHIVWAPRIWRPWGFLFDCIERPNELGFPYWSRSFRGKRIIYDEEDELQENDSGFLQSGTMQYQTRDRSQGLFRISQFIWDPADPLFFLFKDQPPGSVFSHRELFADEEMSKGLLTSQTDPPTSLYKRWFIKNTQEKHFELLINRQRWLRTNSSLSNGSFRSNTLSESYQYLSNLFLSNGTLLDQMPKTLLRKRWLFPDEMKIGFM Ycf2_protein FILE2 ycf2 ycf2_2 146312 153148 R 1 6837 Ok MRGHQFKSWIFELREILREIKNSHHFLDSWTQFNSVGSFIHIFFHQERFLKLFDPRIWSILLSRNSQGSPSNRYFTIKGVILFVVAVLIYRINNRNMVERKNLYLIGLLPIPMNSIGPRNDTLEESVGSSNINRLIVSLLYLPKGKKISESCFLNPKESTWVLPITKKCSMPESNWGSRWWRNWIGKKRDSSCKISNETVAGIEILFKEKDLKYLEFLFVYYMDDPIRKDHDWELFDRLSLRKSRNRINLNSGPLFEILVKHWISYLMSAFREKIPIEVEGFFKQQGAGSTIQSNDIEHVSHLFSRNKWAISLQNCAQFHMWQFRQDLFVSWGKNPPESDFLRNVSRENWIWLDNVWLVNKDRFFSKVQNVSSNIQYDSTRSSFVQVTDSSQLKGSSDQSRDHLDSISNEDSEYHTLINQREIQQRKERSILWDPSFLQTERKEIESGRFPKCLSGYSSMSRLFTEREKQMINHLFPEEIEEFLGNPTRSVRSFFSDRWSELHLGSNPTERSTRDQKLLKKQQDLSFVPSRRSEKKEMVNIFKIITYLQNTVSIHPISSDPGCDMVPKDEPDMDSSNKISFLNKNPFFDLFHLFHDRNRGGYTLHYDFASEERFQEMADLFTLSITEPDLVYHKGFAFSIDSCGLDQKQFLNEARDESKKKSLLVLPPIFYEENESFSRRIRKKWVRISCGNDLEDPKPKIVVFASNNIMEAVTQYRLIRNLIQIQYSTYGYIRNVLNRFFLMNRSDRNFEYGIQRDQIGKDTLNHRTIMKYTINQYLSNLKKSQKKWFEPLILISRTERSMNRDPDAYRYKWSNGSKSFQEHLEQSVSKQKSRFQVVFDRLRINQYSIDWSEVIDKKDLSKSLRFFLSKSLLFLSKLLLFLSNSLPFFCVSFGNIPIHRSEIYIYEELKGPNDQLCNQLLESIGLQIVHLKKLKPFLLDDHDTSQKSKFLINGGTISPFLFNKIPKWMIDSFHTRNNRRKSFDNPDSYFSMIFHDQDNWLNPVKPFHRSSLISSFYKANRLRFLNNPHHFCFYWNTRFPFSVEKARINNSDFTYGQFLNILFIRNKIFSLCVGKKKHAFWGRDTISPIESQVSNIFIPNDFPQSGDETYNLYKSFHFPSRSDPFVRRAIYSIADISGTPLTEGQIVNFERTYCQPLSDMNLSDSEGKNLHQYLNFNSNMGLIHTPCSEKDLSSEKRKKWSLCLKKCVEKGQTYRTFQRDSAFSTLSKWNLFQTYMPWFLTSTGYKYLNLIFLDTFSDLLPILSSSQKFVSIFPDIMHGSGISWRILQKKLCLPQWNLISEISSKCLHNLLLSEEMIHRNNESPLISTHLRSPNAREFLYSILFLLLVAGYLVRTHLLFVSRASSELQTEFERVKSLMTPSSMIELRKLLDRYPTSEPNSFWLKNLFLVALEQLGDSLEEIRGSASGGNMLGPAYGVKSIRSKKKDWNINLIEIIDLIPNPINRITFSRNTRHLSHTSKEIYSLIRKRKNVNGDWIDEKIESWVANSDSIDDEEREFLVQFSTLTTENRIDQILLSLTHSDHLSKNDSGYQMIEQPGAIYLRYLVDIHKKHLMNYEFNPSCLAERRIFLAHYQTITYSQTSCGENSFHFPSHGKPFSLRLALSPSRGILVIGSIGTGRSYLVKYLATNSYVPFITVFLNKFLDNKSKGFLLDEIDIDDSDDIDDSDNLDASDDIDRDLDTELELLTRMNGLTVDMMPEIDRFYITLQFELAKAMSPCIIWIPNIHDLDVNESNDLSLGLLVNHLSRDCERCSTRNILVIASTHIPQKVDPALIAPNKLNTCIKIRRLLIPQQRKHFFTLSYTRGFHLEKKMFHTNGFGSITMGSNARDLVALTNEVLSISITQKKSIIDTNTIRSALHRQTWDLRSQVRSVQDHGILFYQIGRAVAQNVLLSNCPIDPISIYMKKKSCNEGDSYLYKWYFELGTSMKRLTILLYLLSCSAGSVAQDLWSLSVPDEKNGITSYGLVENDSDLVHGLLEVEGALVGSSRTEKDCSQFDNDRVTLLLRPEPRNPLDMMQKGSWSILDQRFLYEKYESEFEEGEGEGALDPQEDLFNHIVWAPRIWRPWGFLFDCIERPNELGFPYWSRSFRGKRIIYDEEDELQENDSGFLQSGTMQYQTRDRSQGLFRISQFIWDPADPLFFLFKDQPPGSVFSHRELFADEEMSKGLLTSQTDPPTSLYKRWFIKNTQEKHFELLINRQRWLRTNSSLSNGSFRSNTLSESYQYLSNLFLSNGTLLDQMPKTLLRKRWLFPDEMKIGFM Ycf2 MATCH ycf2 ID 100 CORRECT FILE1 rpl23 rpl23 153516 153797 D 1 282 Ok MDGIKYAVFTDKSIRLLGKNQYTSNVESGSTRTEIKHWVELFFGVKVIAMNSHRLPGKSRRMGPIMGHTMHYRRMIITLQPGYSIPPLRKKRT ribosomal_protein_L23 FILE2 rpl23 rpl23_2 153516 153797 D 1 282 Ok MDGIKYAVFTDKSIRLLGKNQYTSNVESGSTRTEIKHWVELFFGVKVIAMNSHRLPGKSRRMGPIMGHTMHYRRMIITLQPGYSIPPLRKKRT ribosomal_protein_L23 MATCH rpl23 ID 100 CORRECT FILE1 rpl2 rpl2 153816 155306 D 2 825 Ok MAIHLYKTSTPSTRNGTVDSQVKSNPRNNLIYGQRRCGKGRNARGIITARHRGGGHKRLYRKIDFRRNEKDIYGRIVTIEYDPNRNAYICLIHYGDGEKRYILHPRGAIIGDTIVSGTEVPIKMGNALPLTDMPLGTAIHNIEITLGKGGQLARAAGAVAKLIAKEGKSATLKLPSGEVRLISKNCSATVGQVGNVGVNQKSLGRAGSKRWLGKRPVVRGVVMNPVDHPHGGGEGRAPIGRKKPTTPWGYPALGRRSRKRNKYSDNLILRRRSK ribosomal_protein_L2 FILE2 rpl2 rpl2_2 153816 155306 D 2 825 Ok MAIHLYKTSTPSTRNGTVDSQVKSNPRNNLIYGQRRCGKGRNARGIITARHRGGGHKRLYRKIDFRRNEKDIYGRIVTIEYDPNRNAYICLIHYGDGEKRYILHPRGAIIGDTIVSGTEVPIKMGNALPSTDMPLGTAIHNIEITLGKGGQLARAAGAVAKLIAKEGKSATLKLPSGEVRLISKNCSATVGQVGNVGVNQKSLGRAGSKRWLGKRPVVRGVVMNPVDHPHGGGEGRAPIGRKKPTTPWGYPALGRRSRKRNKYSDNLILRRRSK ribosomal_protein_L2 MATCH rpl2 ID 99 ALMOST_CORRECT.BAD_JUNCTION