!#!$%&!’ (
!#$%&’(&)*’+, -./0 !!!#1
567 )*+,-! #$%&’%(%. /01-) *&*+%(,. 23456789 :;8<23=>?@AB $CD
AEFG %&9 &!9 #!’HIJKLMNOBPQRSTUVWXY Z[VW\]^_>?‘aCbc defgh
*+,ijklKmnop^_qrNst (#)*+Y uvoB (,-*.+w 01ijklKmnop^_qrNs
t ,*-%,+Y uvost (-*$+c x6Y )yH23>?zL678AE{Aq $C/’|678}~
/dLIJKL $OB $j8Y ij8L #RSTUVWXY |VW\] $
H>?Cb^_Y ^_qrNs (!0c
89:7 678w :;8w 23=>?w ~w VW\]
;<=>?7 1%
@AA1BC DEFG HEI E!!JKJL
!#! $%&’()*%+ *%,%! -./012 ,.03 +450 3226
7 M N
234567892341 :;:<=, >?
@AB23456C:DE45F G $!HIJ4
CKALMNOB .0:45, PQRSTUVW
XYZ?O[:F Q\*]^&:_‘Gab:
45Ccd]^, KAef:gb45?hi2
j=klm:_nF opV, q56rstuv
wx:yz, I{Q7|}~rq5rsc
:@
,
=+s:’56$7F KA4
5:_‘a !p, =
kc, ¡j¢O£¤¥¦§5&7F ;¨,
©ªQ«45¬:|‘@®u¯°±
89 : ²³´rsµ¶ 9;<=1 · ] ¸ >?@:
9A>?@1 c¹ºB:»¼½¾k¿®uÀ B#9: q
«Iabq5·ÁÂÃrs:ÄÅÆÇÈÀ B$9:É
QÊËÌÍÎxÏ92CCD1 ÐÑÒÓÔ|‘
5.6,7F Õ¯ª|‘CE:¾Ö
E:ÉQ×Ø0ÙÚ?ÛOÍÜ:, ÝÞ×
7 F>G:ßàá âã¸Qäåæçè°Ôr
séê:]^5(7F ²«;¨|‘Iq5C:
Ðq5ërsOìíîï, Dð:q
5¢Z?O¤¥:, «?KAñ3rs:Eò0
ÙÚ?óÖÛ@:F ©ª\*0ï?ôõ\*:
q5vö÷øãçù, ÖúYãF
²«û A>?@ü, øã:ýþÆæç}ÿ
óÖ!, 5#¤$q5%&ü, Iøã
úYã’(:KAóÖ)*5&7F +,½¾=ªx
-\*, ./E\0123á 4567Ðq56
89:;Ô0°ú, .!Hð<:23q5rs=
+,q5Y>?@5*7, AèBCr: (& ÷2
3DEq5C, .$ ÷+,:q5àFG 5!7F
HIJEKq5:ÁÂ23DE:ÁÂ/
ELMÆ57, 5#, NOªq56:ºBP
Q\0R&(:STF
UVðx-\*HIJÐ+,q56¾Ý, q
«Wjq56Cøãá úYãÐq5Xrs:O
L%&0°, cq56øãá úYãÐ
q5Xrs:YZÚ|‘F %[³\± /E
%&÷]t:YZÚ%[È^, HIJ©_‘
aNµbcMÖï´B (#-*+, dS
c¾ (,-*.+À +,©_‘aNµbcM
Öï´B ,*-%,+, dSc´B (-*$+F eú,
¾fïÖHIJÐ+,q5CÕ=úYãá C
XúYãÐghúYã, {QYZyÚI¯3úY
ã, ×7ÖiøãúYãæç
OPQI!#!!$I!I#,
’RST!:PjéZR&qkl;B$!%!!#.9
UVWX!:m¨nE:op± B!&,9&**#*.(E:
;IAJKLM: NOLKPAJKL-KAQ-RSQ-TU
!#! # # $ % &
’() *+,-./01’.23456789:
;<=>?@AB’CD45EFGH0IDJ
KL MNOPQRSTURVW’.23D45X
YZ[L \=]^_‘abDGH0Ic
! !#$%&
defghijk !!lmno5pqrs
e/ $%%%ltuD &’()*+c vmno #wxy
z{| ,wxyz}~hiL A/t
u -wxyz~
@DTUR PQR<
5prsDhYs> .c TUR/P
QR@kL 5prs{/0123420/5’67
O~haL =wxyzD
/~
@c =wxyzf
@DT
UR PQR/5prs ¡¢£@¤k
¥D .89 ¦F§xyz¨©ª«¬L @T®
!89rs¦FS¯8°D¨©±²¬c
Y³v;~mno/tu¤wxyz´µ¶r
se·?@D|¨TUR ·TUR/¸
¹TURL \ºwRrs»¼½. ,: ¾¿¹ %
©45/À¼½. 9: ¾¿¹ % ©45¨B@^Á
ÂTURÃÄDrs{;.!< => ÅÆc v
TUR·¡¢ Ç@ .
’()*+
#$ ’(
Ïi ?@A1B0 O .%CD !8ÐÑÒ{E/F23G/1H
/ÐÑ{I2@GJ32< BK< E/F23G/1H ÓÔDÕÖ L$!ML
O×·L$9N$#ML 8ÐÑÒ/ÐÑDÓÔOØ/
PÙ~ÚÛ‘ÜÝc
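Definition 1: In a state space of s dimensions, let n_i denote the number of times the i-th state occurs. For the source of diversity X: {n_1, n_2, ..., n_s}, the measure of diversity is defined as
\[ D(X) = D(n_1, n_2, \ldots, n_s) = N \log_b N - \sum_{i=1}^{s} n_i \log_b n_i \qquad (1) \]
where \( N = \sum_{i=1}^{s} n_i \) and b is the base of the logarithm.
Definition 2: For two sources of diversity in the same s-dimensional state space, X: {n_1, n_2, ..., n_s} and Y: {m_1, m_2, ..., m_s}, the increment of diversity is defined as
\[ ID(X, Y) = D(X + Y) - D(X) - D(Y) = D(M + N) - \sum_{i=1}^{s} D(m_i + n_i) \qquad (2) \]
where \( N = \sum_{i=1}^{s} n_i \), \( M = \sum_{i=1}^{s} m_i \), and
\[ D(M + N) = (M + N) \log_b (M + N) - M \log_b M - N \log_b N \]
\[ D(m_i + n_i) = (m_i + n_i) \log_b (m_i + n_i) - m_i \log_b m_i - n_i \log_b n_i \]
When m_i or n_i is zero, D(m_i + n_i) = 0. The increment of diversity is non-negative, i.e. ID(X, Y) ≥ 0.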
!L ÐÑ÷ #ON< +P#$%ñ¨©#
7&Ò’(Dçc )zè‘Ëqhi /
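As a rough illustration of the measure of diversity and the increment of diversity used in this section, the following Python sketch computes D(X) and ID(X, Y) from two count vectors. The function names, the choice of base 2 for the logarithm, and the convention that a zero count contributes nothing to the sum are assumptions made here for illustration; they are not confirmed by the text.

```python
import math


def diversity(counts, base=2.0):
    """Measure of diversity D(X) = N log N - sum(n_i log n_i).

    Zero counts are skipped, i.e. 0 log 0 is treated as 0 (assumption).
    """
    total = sum(counts)
    if total == 0:
        return 0.0
    return total * math.log(total, base) - sum(
        n * math.log(n, base) for n in counts if n > 0
    )


def increment_of_diversity(x_counts, y_counts, base=2.0):
    """Increment of diversity ID(X, Y) = D(X + Y) - D(X) - D(Y)."""
    if len(x_counts) != len(y_counts):
        raise ValueError("both sources must share the same state space")
    merged = [n + m for n, m in zip(x_counts, y_counts)]
    return (
        diversity(merged, base)
        - diversity(x_counts, base)
        - diversity(y_counts, base)
    )
```

Two sources with proportional counts give an increment of zero, while sources that share no occupied state give a large increment, which is why a small ID can be read as high similarity.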
Table 1  The length distribution of three kinds of sequences in the chromosomes of A. thaliana and C. elegans

                                        Standard set                                  Test set
                           1st subset  2nd subset  3rd subset  Sum     1st subset  2nd subset  3rd subset  Sum     Total
A. thaliana  Exon          13011       2893        740         16644   21897       5123        1071        28091   44735
Chr-         Intron        13989       2857        874         17720   18159       3676        920         22755   40475
             Intergenic    6092        2390        1080        9562    6604        2571        7387        16562   26124
C. elegans   Exon          8943        3626        780         13349   13489       6402        1390        21281   34630
Chr-         Intron        9907        2319        2117        14343   15514       3477        2483        21474   35817
             Intergenic    4376        1210        845         6431    1347        …           …           9329    15760
-$,9< < < < < $D!%
Table 2  The number of sequences in standard set and test set

                             A. thaliana                               C. elegans
               First exon   Mid exon   Last exon        First exon   Mid exon   Last exon
Standard set   1000         1000       1000             1000         1000       1000
Test set       14609        66408      14791            9904         86743      10035
$!-
! ! #$%&’()*+,-./0123456
!7829:;<= !#! !$>?= @ABCD>
9:E
!# !#$%&’()* +,)-./012
34567
%&!&’( (FGHIJKHLM
IT*8+,N\]E N^PQ_‘abc2FG
HIdKHe+,2M
C2 /)-vw= M< )0!))( *+Z2VWXsp
uC2 1)-vwE = M<2m= \
+,2CiW= M+,~NOE
D{PQg= ?+,M<
mL= FGHIdKHe\
+,zUNO ¡M
¯°= _‘abc{±²+,2FGHIJKH³
´[\ªXH¥µ 11ªX¶¦= ·¸X¶2+,N
Ok79©2\ªM
R$j§¤‘abcFG¶2X¶ ’´[M
< !))( *+zZ2 2(1!.ªVWXY 2(32%ªZ[
Xz»M·¸+,= N^¼½P¾p}~+,¿À{\Á
cÂÃ29ÄÅ #= ÆÇÂ\
+,x #l^
Èm2ÉÊ\Ácµ !)ªE · !)ª\ÁcË
ÌÍÎÏÐ\
+,\ÁcNOLÉÊE
Ñ· !)ªÉÊ\ÁcL9ÄÅÒR$j
§¤‘abcFG¶X¶ ’ eFGÓL
ÔÕÖC= Ó×ØÙ !Ù 4
!
’5!
!
!5(Ú5(
!
!)6
¥!7$5( %5( &¦= DÛÜ 8’$= 9©2FGh
ÝÙ
’’8!$7#!9:;(#!!
!)
) 7 ’
!)9:;(!) 8%$
pe #!7
!)
) 7 ’
!)¥)7’5( !5(Ú5( !)¦= $Y %Y &N^Þ
hVWXY Z[X&)*x+,E
!
)hÝ !
+
,2§ )ªÉÊ\Ác29ÄÅE ß
%
*§ !
²ÉÊ\ÁcX¶ ’e}~Z[X{ÂÃ29
ÄÅà
&
’@§¤²ÉÊ\ÁcX¶ ’ e}~
)*x+,{ÂÃ29ÄÅE á
âE
ÑR$j§¤‘abcFG¶2X¶ !ãä{
¾¹;= o¿ÆÇÂX¶ !2 !)ªÉÊÁcE ¯°= åæX¶ ! e2\
+,2
’!8!$5¥!7$5! %5( &¦E qç‘abcFG¶2çª
X¶= ³Ëãä{¾¹è;Ù ÇéêX¶2ÉÊ
\Ác= ëìØíFGÓ= ¾¹p\ªFG
E
%&!&%( ( FG¶e+,2tîJK&JK¶2ïð
JK
R$j§¤‘abcFG¶X¶ ’eñò¤
‘+,= Òß«¾¹Ù P¾ %&!&!eóôõö2
÷qêX¶2 !)ªÉÊ\Ác·‘+,{_t
ÂÃ29ÄÅ = ×åê‘+,2Ó
Ù 4+’5! +!5(Ú5( +!)6= ¾¹p ’’8$à DÛ
Ü8!$Y 8%$åê+,2 ’’8$kX¶ ’2ªFG ’’8%$Y ’’8$$Y ’’8&$7x2
Ù
’8!5!$7’’8!<$!’’8!$,’’8$( ( 8!7$5( %5( &$( ( ( 82$
AªÓ7x2>?= øiAªÓ
7x29:;<ù>mE ·‘+,2
úùû·ª2ü?ýö= þÙ
#’8!5!$7=>?@$’8$5!A5(%’8%5!A5(&’8&5!AB((
(((((((((((((((((((((((((((((((((((((((((((((((((((((((((((8!7$5(%5(&A((((((((((((((((((8.A
XH ’eL}~+,= ÿ{!º= ¤ ¡t
îJKE
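The decision rule described here (assign a sequence to the class whose standard source gives the minimum increment of diversity) can be sketched in Python as follows. This is only a minimal sketch under stated assumptions: trimers are counted with an overlapping sliding window, all 64 trimers are used rather than a selected subset of 40 or 20, the logarithm is base 2, and the function names and toy sequences are hypothetical.

```python
import math
from itertools import product

# All 64 trinucleotides in a fixed order, so count vectors are comparable.
TRIMERS = ["".join(p) for p in product("ACGT", repeat=3)]


def trimer_counts(seq):
    """Count overlapping trimers; windows with non-ACGT symbols are skipped."""
    counts = dict.fromkeys(TRIMERS, 0)
    seq = seq.upper()
    for i in range(len(seq) - 2):
        tri = seq[i:i + 3]
        if tri in counts:
            counts[tri] += 1
    return [counts[t] for t in TRIMERS]


def diversity(counts):
    """D(X) = N log2 N - sum(n_i log2 n_i), skipping zero counts."""
    total = sum(counts)
    if total == 0:
        return 0.0
    return total * math.log2(total) - sum(n * math.log2(n) for n in counts if n > 0)


def increment_of_diversity(x_counts, y_counts):
    """ID(X, Y) = D(X + Y) - D(X) - D(Y)."""
    merged = [n + m for n, m in zip(x_counts, y_counts)]
    return diversity(merged) - diversity(x_counts) - diversity(y_counts)


def classify(seq, class_sources):
    """Return the class with the minimum ID and the ID for every class.

    class_sources maps a class name to the trimer count vector of its
    standard set (hypothetical input format).
    """
    query = trimer_counts(seq)
    scores = {
        name: increment_of_diversity(source, query)
        for name, source in class_sources.items()
    }
    return min(scores, key=scores.get), scores
```

In practice the class sources would be built from the standard sets of exons, introns and intergenic DNA, and each test sequence would be passed to `classify`.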
JK!ºÿq#$+,
úL%
&= ç‘abcJKHLçªXHeLñò¤’
+,5((ÿkp9©LFGHXH}õöLÔÕ
ÖC×Øê)+,LÓ= *¾¹p= ë
+kFGH©XH{L\ªFGÒ
E ê‘+,L
úû\ªLü?ý
ö= ¯°,÷¸+,
úL-$E Ò
= .ÑÓLÔÕÖCªCÇé 2)I 12=
/ ¡o¿L¾¹I-0E
!! !#$%&89’()34567
¯R$jI’|LT*B§¤VWXY exV
WXIü1VWXeÇé23ÓN^¾¹\
VWXFGHL ’’8%AY ’’8-AY ’’8)A5(¾
¹JKHe礑+,L ’’8A5(¯°¾¹
’()*+
!$%!!&!’()%!!*!’!)+!!,#$%!,- - - - - - +!($&- %&! &’-
,-.(/012.()*345 67,-.(/
0189:;<3=> ?@ABCDEF?G-.
()*CH4IJK5 LM
)+!!&!!’(./01#)%!$&!!,&- $)%!%&- !,&- %)%!&&!!,2-
- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - +!($&- %&- &,
NOPQRGDSTU8VW>
!# !#$%
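The sensitivity, the specificity and the correlation coefficient are used to evaluate the predictions [15,16].
Sensitivity: \( S_n = TP / AP \), the proportion of actual positive samples that are predicted correctly.
Specificity: \( S_p = TP / PP \), the proportion of samples predicted as positive that are actually positive.
Correlation coefficient:
\[ CC = \frac{(TP)(TN) - (FP)(FN)}{\sqrt{(PP)(PN)(AP)(AN)}} \]
where PP = TP + FP, PN = TN + FN, AP = TP + FN and AN = TN + FP. TP is the number of positive samples predicted as positive, FN is the number of positive samples predicted as negative, FP is the number of negative samples predicted as positive, and TN is the number of negative samples predicted as negative. PP and PN are the numbers of samples predicted as positive and as negative; AP and AN are the numbers of actual positive and actual negative samples.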
PABy> F\]^_‘a^R
5 9uvwxy|}c~VC
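The three evaluation measures can be computed directly from the four confusion counts, as in the short Python sketch below. The function name and the choice to return NaN when a denominator is zero are assumptions made for illustration.

```python
import math


def prediction_metrics(tp, tn, fp, fn):
    """Return (Sn, Sp, CC) from the confusion counts.

    Sn = TP / AP, Sp = TP / PP,
    CC = (TP*TN - FP*FN) / sqrt(PP * PN * AP * AN).
    """
    pp = tp + fp  # predicted positive
    pn = tn + fn  # predicted negative
    ap = tp + fn  # actual positive
    an = tn + fp  # actual negative
    nan = float("nan")
    sn = tp / ap if ap else nan
    sp = tp / pp if pp else nan
    denom = math.sqrt(pp * pn * ap * an)
    cc = (tp * tn - fp * fn) / denom if denom else nan
    return sn, sp, cc
```

Note that the specificity defined this way is the fraction of positive predictions that are correct, so a class can reach a sensitivity of 100% while its specificity and correlation coefficient stay low.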
# &’()*
#$ +,-./0123456789:;<=>
?@=/23ABCDEF
¡R¢£¤_¥¦§¨©ª@«¬®¯
U° STU_§¨1ABCVW5 ±eqrk7Z
[ 5#-²yCVW³rH´> ¡Rª@«¬
Cµ¶_·¸¶C¹º»¼·¸_½¾·¸5 ¿
ÀÁVWÂÃÄk DÅl ¢£¤ª@«¬
¶¿VWÆÇÈÉÊ E!F$GH 5 ·¸¶{
EIFG4Jo ¥Ëª@«¬¶¿VgÆÇÈÉ
Ê IGF5IJ5 ·¸¶ÉÊ E)FGDJ Ì@«¬C
·¸¶ªÍABÎ<ÏG-Ð1ÏÑVWÅÒyÓ
C§ÔÕ5 Rª@«¬GDEABC·¸¶¿
ÀÁVWÈÅÒCG-ÖIÄk #Ål ¡×
CØÙÚÛABÏ{®¯U_STU,DABÜ
ÝVg5 VgÆÇÈ{ EJKGJÞß3)&)E65 OR
àáâЮ¯U_§¨1CÐÏãäåæ5 ‘Ñ
GçABèéVWêëä´Cqr ÄrÍìíî
ABïe5 ,DABCÆÇÈð{ 4J5 OG
DABCVgÆÇÈñ{ DDFDJ Å5 Lòèé
VgGDABCÆÇÈäèéVg,DABCÆÇ
Èóô5 õä´Cqr
¡RãVgqrC ,,I5 sö÷ø 5#
-²yC.(*Vgqrä´5 ¢£¤ª@«¬
¶¿ÈÉÊ E!F)GJ5 ·¸¶{ EIFG4Jo
¥Ëª@«¬¿CÈÉÊ IGF5IJ5 ·¸¶
ÉÊ E)FGDJ ¢£¤¿CVgÆÇÈiE4F!GJj
ã¥Ë¿CVgÆÇÈiEFDIJj ùú ¡×
ûVgÈäúCØÙüý÷þ9uvwxy
,,CI
STUVWÆÇÈäú5 íÎ<)=Oÿ
úo RàáâABO!5 §¨1AB_®¯UC
VWÆÇÈÁäô5 ®¯UC¿VW³rä´
§¨15 ¥ËC#ãä7T O5 Ì@«
¬àáâABCU¶ Di$¯äΰ -y%&C
ABjC¿VWÈ’ô5 ‘(¢£¤) !@«
¬C¶U¶ D_¥Ë) !@«¬C¶
U¶ D *
+¨5 sh,-.M +)’-áâA
B/0ůC1t2=àáâAB5
Gw
CÏ3‘47To +!’-®¯U§¨C5É67
89s:;CÙ[5 ÄM LE®¯U<=ø>?
@AÙ[° BC>?D.CEF3)G6G5 HIRJY
C#%KbCL*M<ùúN
S C hr
Chr
Chr
Chr
Chr
Chr
C hr
Chr
Chr
Chr
Standard set 86.49 80.87 87.05 82.22 81.34 76.19 78.81 77.50 76.71 78.67
Test set
O verall rate
86.59
86.75
86.50
85.04
86.80
85.83
84.04
85.18
77.46
82.71
76.49
81.16
77.68
74.76
72.02
79.71
75.96
79.55
79.16
81.58
Table 3  The sensitivities with 64 trimers in standard sets and test sets of chromosomes of A. thaliana and C. elegans
+,27*8+,# $%*8+,9:;<=>
?@ABCDEFGHI JKLMNOPQRS
TU
LVWXI YZ[\]^_‘]abcde
fI g>cdhi@jak]lmnopqErU
)*sLtu]lvcdXw>xZyI z{|}
Jcd@I ~XI LX
Q+,CDvWzI cdef
U
!# !#$%&’()*+,-./
j$&’v:Hv
u¡!’(¢ £)k¤I ~g>¤ %u£)ef
¥¦U *§j$&’2:H¨©
2:;<ª« %¬¤® °¯±!w²:H
³´µ¢
)¯¢ *+¶³µ %u£)& %,¶´µ
%u£)®
-.-/ .0-!-1.# .··-# 1.0!.1-# -.-
# # !¯¢ *,¶³µ %u£)& %,¶´µ
%u£)®
-.-# .0-!-1.# .··-# 1.0!.1-# -.-#
# # %¯¢ *,¶³µ¸¹º %u£)& %,¶´
µ¸¹º %u£)®
-.-# .0-!-1.# .··-# 1.0!.1-# -.-
Table 4  Comparison of the accuracy for test sets with 20, 40 and 64 trimers of the A. thaliana and C. elegans genomes

                     20 trinucleotides           40 trinucleotides           64 trinucleotides
         Class       Sn       Tn       CC        Sn       Tn       CC        Sn       Tn       CC
Chr(A)   exon        91.68    90.51    81.24     89.70    90.90    83.64     85.16    88.36    77.41
         intron      82.12    84.94    76.35     79.67    89.71    75.21     86.90    92.62    83.61
         inter       62.70    61.42    54.22     78.17    61.57    61.34     83.79    69.64    70.17
Chr(A)   exon        88.60    88.82    81.60     80.28    87.70    72.56     91.55    87.55    83.14
         intron      71.85    85.48    64.67     76.22    83.45    68.26     75.17    86.30    68.87
         inter       87.80    63.94    68.28     82.91    60.27    62.82     77.73    67.10    63.51
Chr(A)   exon        82.02    86.24    73.25     97.08    91.63    89.80     94.70    88.53    84.70
         intron      77.10    82.07    67.32     86.77    93.53    84.51     78.27    89.86    75.18
         inter       78.99    64.59    63.60     76.10    75.52    70.06     72.64    66.02    61.88
Chr(A)   exon        80.50    84.86    71.41     81.30    89.12    73.74     83.80    97.85    84.79
         intron      78.20    81.42    68.24     78.47    86.81    71.87     84.28    89.46    79.09
         inter       72.54    62.64    57.11     93.84    60.60    70.53     89.91    63.32    68.37
Chr(C)   exon        87.20    82.48    70.97     82.53    85.74    75.50     83.22    89.89    77.77
         intron      87.59    77.54    73.22     85.88    75.99    63.46     82.34    83.22    69.80
         inter       53.53    79.58    58.33     66.36    86.61    71.07     82.07    67.24    68.92
Chr(C)   exon        78.71    82.45    65.63     84.32    86.32    75.54     83.74    88.52    75.78
         intron      65.06    71.74    50.66     79.37    83.47    69.15     88.28    88.55    79.73
         inter       58.62    44.77    38.81     66.93    57.88    52.72     63.41    53.31    51.14
Chr(C)   exon        87.74    87.46    81.36     77.91    86.29    73.35     84.52    87.11    79.05
         intron      74.02    82.13    65.14     76.71    83.10    67.84     78.63    82.02    68.20
         inter       71.34    61.87    51.69     72.68    59.34    48.74     71.59    64.15    53.54
Chr(C)   exon        84.67    89.69    79.27     73.38    84.96    66.05     93.58    86.29    82.76
         intron      74.74    83.00    63.63     77.89    72.00    56.63     80.73    94.51    77.85
         inter       61.05    43.86    39.99     52.34    46.45    37.19     71.56    56.12    57.24
Chr(C)   exon        91.39    79.62    77.63     76.38    85.69    70.69     78.07    91.65    77.04
         intron      63.15    91.45    59.21     80.06    91.40    74.47     81.38    97.07    80.39
         inter       79.91    46.62    51.54     73.69    45.62    47.69     88.02    46.92    55.86
Chr(C)   exon        92.61    86.95    78.92     80.46    75.04    62.54     78.89    88.48    71.97
         intron      70.38    72.23    61.19     78.72    89.21    71.90     77.58    89.63    71.98
         inter       46.11    51.73    34.12     50.96    43.77    36.78     100.00   52.60    68.48
’()*+,-./01 2,-.3 $456
789: $;<=> 9?@A $;<=B3 #+C
D34E> );<=BF #4GE !
!
%H !
!
&H !
!
’H
!
!
(> $ ;<=I )! 4JE !
!
%H !
!
&H !
!
’H !
!
(H
!
%H !
&H !
’H !
(H !
#
%H !
#
&H !
#
’H !
#
(K 8L *!
4JEMNOPQJE> AROPSTOPUS>
VWXYZOPUS[\]^> _‘ab]^cd
5efZ> ghijcdkl +mnopB $$,
ln5qr-sEtu
vl +B’wxyTz{,|}~<]^c
d784#+3*+,-./0
]
cdY> ?B}~<]^:
-,8> N<}~<
6CD3u +,-./0
*H Y¡}~<3]^:¢1 £7¤v(
D¥¦§*¨©8ª«¬¦< %(&®> 8¯
°¬¦< (&%H (%%H (%&c±u
*+,-./0² +,-./03]^
cd³~> ´p*+,-./03 $$µ
³~¢(¡+> ¶³}~<·¸3CD
¹(<·¸CD3u º,+,-
./0]^:»(¼½¾¶³L*
¿u ÀÁwxyTz{63CD[
\Â@A> cdl³
DÅ9> ?<· +. ¸8 &(
®H $.¸8 %&cÆ> LÁCDÅ9¾Ç
È<3 &(/%&ÉÊu wxy<· $.¸
ËB}~
Íz{5Î6ÏÐÌ (u 4#+*}~<
ÑÒÓ»Ô,6T¡Õ6CDÖ}1
6¿Ô3!#H !!H !) 6¿½Ö1 ©Á %0
Å9Ìu 4#+¯°¬¦<6Ï×Ì
(%%u ¯°¬¦<¡346¿CD½5Á1
wxyÌ &(1 z{Ì %(u LØCD½
ÙDÚ°ÛÜ3·9ÝÞov(ßàmá> â
ã@AEXltu
4#+B}~
z{ÊÌ %u (çèéÁº*}~<· $.
¸ê B}~<· +.¸T $.¸H Y¡}~<
· +.¸ -.CD[\Â@A1 cd~nwx
yTz{º*}~
~
6¿3òCD©Å9ÌóôCDu LØ
6¿ÁCD3ÌÎõDB<ö÷3
*ØøÝÞu
ù}çèéÁwxyTz{B}~<· +.
¸N &(ì $.¸N %&3úû[\Â@A1 wxy
120#+! ;BF 20233 ;1 ü ))4++,ï z{ 3202+1
;BF !0-2$ ;1 ü $4$-,u ýØB}~<8
&(®8 %& c±50½ÙLØ}~
3·9*+u Í’(< 6}~<Ó»6¿
]^*,²Ö-.3/0> 12í3GE3
a> Í]^:Våf¢7*38> Í’(,|}~<
3$4]^125Mf6u À]^70B> 8*
3]^GE99OPUS:íu
Table 5  The result of prediction for A. thaliana and C. elegans

                                          A. thaliana                                 C. elegans
Kinds   Class             Sensitivity %   Specificity %   CC %      Sensitivity %   Specificity %   CC %
No.1    the first exon    86.04           73.69           75.66     86.04           69.75           74.93
        the mid exon      92.80           93.24           77.37     95.67           97.04           81.11
        the last exon     81.73           95.54           86.48     87.13           97.72           89.03
No.2    the first exon    89.86           54.38           63.18     82.29           32.91           44.76
        the mid exon      68.30           94.50           54.72     61.50           95.49           38.14
        the last exon     89.01           55.62           63.70     87.34           33.89           47.45
No.3    the first exon    86.36           56.78           63.48     85.57           39.52           52.26
        the mid exon      73.99           93.55           57.89     74.44           96.41           50.05
        the last exon     88.27           62.12           68.49     87.96           49.07           61.19
A Study on the Sequence and Splice Site of A. thaliana and C. elegans Gene
#$%& ’()*(+, LI Qian-zhong, LIN Hao
(Department of Physics, College of Sciences and Technology, Inner Mongolia University, Hohhot 010021, China)
Abstract  The complete sequences of the A. thaliana and C. elegans genomes are divided into three kinds: exons, introns and intergenic DNA. The probabilities of the 64, 40 and 20 trimers in the three kinds of sequences are respectively selected as parameters of the sources of diversity. The classes of these sequences are predicted by the increments of diversity, taking the minimum of the three increments. The results show that the overall prediction accuracies over the chromosomes of A. thaliana are 82.19% and 87.95% for the standard sets and test sets, and the overall prediction accuracies over the chromosomes of C. elegans are 79.67% and 81.93% for the standard sets and test sets, respectively. The exons of genes in the A. thaliana and C. elegans genomes are divided into three types. Based on the frequencies of the 4 kinds of bases in regions near the intron/exon boundary and near the initiation and termination sites for translation, the diversity source is composed of 12 sequence parameters. The three kinds of exons are predicted by an algorithm based on the increment of diversity. Rates of correct prediction higher than 80% are obtained.
Key words  exon; intron; intergenic DNA; measure of diversity; splice site
References
_T‘O O789FO:;
DHIJKKLaMNOKPbFO !LL!FKEKLTcKLJ
[2] Snyder EE, Stormo GD. Identification of protein coding region in genomic DNA. J Mol Biol, 1995, 248: 1-18
[3] Salzberg SL, Delcher AL, Kasif S, White O. Microbial gene identification using interpolated Markov models. Nucleic Acids Res, 1998, 26: 544-548
[4] Rogic S, Ouellette BFF, Mackworth AK. Improving gene recognition accuracy by combining predictions from two gene-finding programs. Bioinformatics, 2002, 18(8): 1034-1045
[5] Meyer IM, Durbin R. Comparative ab initio prediction of gene structure using pair HMMs. Bioinformatics, 2002, 18(10): 1309-1318
[6] Hatzigeorgiou AG. Translation initiation start prediction in human cDNAs with high accuracy. Bioinformatics, 2002, 18(2): 343-350
[7] LIN J, XU JL. Prediction of prokaryotic promoters based on prediction of transcriptional units. 2003, 35(4): 317-324
[8] Morgenstern B, Rinner O. Exon discovery by genomic sequence alignment. Bioinformatics, 2002, 18(6): 777-787
[9] Ahringer J. Turn to the worm. Curr Opin Genet Dev
[10] Bassett DE Jr, Boguski MS, Spencer F, Kim S, Weaver T, Hieter P. Genome cross-referencing and XREFdb: implications for the identification and analysis of genes mutated in human disease. Nat Genet, 1997, 15: 339-344
_TT‘OTUFOVWXIOY$Z)*?56[\IO]QRK^LFO !LLTF
ZEUUcTLT
[12] Laxton RR. The measure of diversity. J Theor Biol, 1978, 71: 51-67
[13] Li QZ, Lu ZQ. The prediction of the structural class of protein: application of the measure of diversity. J Theor Biol, 2001, 213: 493-502
_TK‘O_‘aFOVbcIO OdefghijklmnoIOQRR3K
LFO !LLTFKEWLZcWT!
_TX‘OpqrFOVbcIOdefghistQRukl2vwx0yIO
DHIzKKLoMNOKPbFO !LLZFXEXTLcXTW
[16] Bajic VB, Seah SH. Dragon gene start finder: an advanced system for finding approximate locations of the start of gene transcriptional units.
_TW‘O{|FO}~FOFOFO
IOtQR G%H
2?IOQRSKCQRR3[\FO !LL!FKEXSZcXSW
[18] Pertea M, Lin XY, Salzberg SL. GeneSplicer: a new computational method for splice site prediction. Nucleic Acids Res, 2001, 29: 1185-1190
_TU‘OV‘FO7IO O .oDEB56[\IOQRSKCQRR
3[\FO !LLZFZEZJZcZJS