免费文献传递   相关文献

拟南芥和线虫基因序列及剪切位点的理论预测



全 文 :!#$%&’()*+,-./01234
!#!$%&!’ (
!#$%&’(&)*’+, -./0 !!!#1
567 )*+,-! #$%&’%(%. /01-) *&*+%(,. 23456789 :;8<23=>?@AB $CD
AEFG %&9 &!9 #!’HIJKLMNOBPQRSTUVWXY Z[VW\]^_>?‘aCbc defgh
*+,ijklKmnop^_qrNst (#)*+Y uvoB (,-*.+w 01ijklKmnop^_qrNs
t ,*-%,+Y uvost (-*$+c x6Y )yH23>?zL678AE{Aq $C/’|678}~€ ‚ƒ
„…/d†€‡ˆLIJKL $‰€OB $j8ŠY ‹ij8ŠL #‰RSTUVWXY |VW\]Œ $
H>?CbŽ^_Y ^_qrNs (!0‹c
89:7 678w :;8w 23=>?w ‘~€w VW\]
;<=>?7 1%
@AA1BC DEFG HEI E!!JKJL
!#! $%&’()*%+ *%,%! -./012 ,.03 +450 3226
7 M N
234567892341 :;:<=, >?
@AB23456C:DE45F G $!HIJ4
CKALMNOB .0:45, PQRSTUVW
XYZ?O[:F Q\*]^&:_‘Gab:
45Ccd]^, KAef:gb45?hi2
j=klm:_nF opV, q56rstuv
wx:yz, I{Q7|}~r€q5rsc
‚ƒ„:@…, †‡ˆ‰Š‹ŒŽŒ
Ž‘’‚=+s:’“€56$7F ”•–‡KA4
5:_‘a—˜™‚ !‡p, š›€œž=
kŸc„,  ¡j¢O£¤¥€¦§5&7F ;¨,
©ªQ«€45Ž¬:€|‘­@®u¯°±
89 : ²³´rsµ¶ 9;<=1 · ] ¸ >?@:
9A>?@1 c¹ºB:»¼½¾k¿®uÀ B#9: q
«Iabq5·ÁÂÃrs:ÄÅÆÇÈÀ B$9:É
QÊËÌÍÎxÏ92CCD1 ÐїÒÓÔ|‘‘
’€5.6,7F ŸÕ¯ª|‘€CE:¾ŽÖŒŽ
ŒŽE:ÉQ×Ø0ÙÚ?ÛOÍÜ:, ÝÞ×
7 F>G:ßàá âã¸Qäžåæçè°Ôr
séê:]^5(7F ²«;¨|‘Iq5C:Œ
ŽÐq5ërsOìíîŽï, Dð€:q
5¢Z?O¤¥:, «?KAñ3rs:Eò0
ÙÚ?óÖÛ@:F ©ª\*0ï?ôõ\*:
q5v‡ö‡÷øãçù, ֜–‡úYãF
²«Ÿûœ A>?@ü, øã:ýþÆæç}ÿ
óÖ!, 5#Ÿ€¤$q5%&ü, Iøã
úYã’(:KAóÖ)*5&7F +,½¾=ªx
-\*, .‚/E\0123á 4567Ðq56
89:;Ô0°ú, .!Hð<:23q5rs=
+,q5Y>?@5*7, ŸAèBC€r: (& ÷2
3DEq5C, .$ ÷+,:q5„àFG 5!7F
HIJE–Kq5Œ:Á23DE:ÁÂ/
ELMÆ57, 5#, NOªq56:ˆ‰ºB‚P
Q\0R&(:STF
UVðx-\*HIJÐ+,q56¾Ý, q
«Wjq56Cøãá úYãÐq5Xrs:O
L%&0°, c‚€q56øãá úYãÐ
q5Xrs:YZÚ|‘F €%[³\± /E
%&÷]t:YZÚ€%[È^, HIJ©_‘
aNµbcM€œžÖï´B (#-*+, dS
c¾ (,-*.+À +,©_‘aNµbcM€œ
žÖï´B ,*-%,+, dSc´B (-*$+F eú,
¾‚fïŽÖHIJÐ+,q5CÕ=úYãá C
XúYãÐghúYã, {QYZyÚI¯3úY
㑒‚€, š×7Öi‚øãúYãæç
OPQI!#!!$I!I#,
’RST!:PjéZR&qkl;B$!%!!#.9
UVWX!:m¨nE:op± B!&,9&**#*.(E:
;IAJKLM: NOLKPAJKL-KAQ-RSQ-TU
!#! # # $ % &
’() *+,-./01’.23456789:
;<=>?@AB’CD45EFGH0IDJ
KL MNOPQRSTURVW’.23D45X
YZ[L \=]^_‘abDGH0Ic
! !#$%&
defghijk !!lmno5pqrs
e/ $%%%ltuD &’()*+c vmno #wxy
z{| ,wxyz}~hi€L A‚ƒ„/t
u -wxyz~…@†‡ˆ‰DTURŠ PQR<
5p‹ŒrsDhŽYs> .c TUR/P
QR@k‘’“L 5p‹rs{/0123420/5’67„
O”“~hŽa•L –‚ƒ—˜=™wxyzDš
“/›œž“~…@c =™wxyzf…@DT
URŠ PQR/5p‹ŒrsŸ ¡¢£@¤k
¥ŽD .89 ¦F§xyz¨©ª«¬L @­T®
!89rs¦FS¯8°D¨©±²¬c
Y³v;~mno/tu¤wxyz´µ¶r
se·?@‡ˆ‰D|¨TURŠ ·‹TUR/¸
¹TURL \º™wRrs»¼½. ,: ¾¿¹ %
©45/À¼½. 9: ¾¿¹ % ©45¨B@^Á
ŒTURÃÄDrs{;.!< =>„ ÅÆc vŒ
TUR·¡¢Ÿ Ç@ .{™  .TURDÈ«¬/±²¬ÍÎ !c
’()*+
#$ ’(
Ïi ?@A1B0 O .%CD !8ÐÑÒ{E/F23G/1H„
/Ðю{I2@GJ32< BK< E/F23G/1H„ ÓÔDÕÖ L$!ML
Oׁ·L$9N$#ML 8ÐÑÒ/ÐюDÓÔO‰Ø/
PÙ~ÚÛ‘ÜÝc
‰Ø $Þ 8> ! ©ßàáâDãäå‹ L
æ #$ ç| $ ©ãä†èD©hL éêÐÑë
%Þ L&$N! &!NìN! &!MDÐю‰Ø–Þ
’OPQ’O&.N< &!N<íN! &!PQ(RB4)*!
!
$ Q .
!&$RB4)&$ O.P
î· *Q
!
$ Q .
!&$N<8hDï )@ .{]ðßàŽD”
’ñòó„c
‰Ø !Þ ôIõË©ÐÑë Þ L&.N < &!N < í N
&!Mö +Þ L,.N< ,!N<íN< ,!ML ‰ØÐÑ÷ŽÞ
!O%N+PQ-OS+P!’OP!’O+P
< < < < < < Q’O.S*P!
!
$ Q $
!-O,$S&$P< < < < < < < < < < < < < < < < O!P
î· *Q
!
$ Q $
!&$L .Q
!
$ Q $
!,$L
-O.S*PQO.S*PRB4)O.S*P!.RB4).!*RB4)*
-O,$S&$PQO,$S&$PRB4)O,$S&$P!,$RB4),$!&$RB4)&$
ôI ,$ø &$–ùL ú -O,$N< &$PQTc ÙûüNÐÑ
÷ŽñýþDL ÿÞ ON!+PTc
!L ÐÑ÷Ž #ON< +P#$%ñ¨©#
7&Ò’(D‰Žçc )zè‘Ëqhi /
%&’() * UV2< R2041V(E/G13/=J1/B0< BK< 1V322< W/0EG< BK< G2XJ2052G< /0< 1V2< 5V3BIBGBI2G< BK< /0 123453&3 @0E< 60 74783&!

Standard set Test set


1st
subset
2nd
subset
3rd
subset sum
1st
subset
2nd
subset
3rd
subset sum
Total
Exon 13011 2893 740 16644 21897 5123 1071 28091 44735
Intron 13989 2857 874 17720 18159 3676 920 22755 40475
A.thaliana
Chr-
Intergenic 6092 2390 1080 9562 6604 2571 7387 16562 26124
Exon 8943 3626 780 13349 13489 6402 1390 21281 34630
Intron 9907 2319 2117 14343 15514 3477 2483 21474 35817
C.elegnas
Chr-
Intergenic 4376 1210 845 6431 1347 9329 15760

-$,9< < < < < $D!%
%&’() ! UV2< 0JI=23< BK< G2XJ2052G< /0< G1@0E@3E< G21< @0E< 12G1< G21
A. thaliana C. elegans

The first exon The mid exon The last exon The first exon The mid exon The last exon
Standard set 1000 1000 1000 1000 1000 1000
Test set 14609 66408 14791 9904 86743 10035

$!-
! ! #$%&’()*+,-./0123456
!7829:;<= !#! !$>?= @ABCD>
9:E
!# !#$%&’()* +,)-./012
34567
%&!&’( (FGHIJKHLMR$%I’(ST*B+,UVWXY Z[X
IT*8+,N\]E N^PQ_‘abc2FG
HIdKHe+,2MZ[XkVWX2MM2 ,)-vw= Ms .)-z{E ’|}~+,e= M2Z[XI?q !()))( *+2T*x+,_spu
C2 /)-vw= M< )0!))( *+Z2VWXsp
uC2 1)-vwE €= ‚ƒM<2„m= \…
+,2C†‡iWˆ‰= ŠM+,‹~ŒNOE
ŽD{PQg= ‘’“†ˆ?+,M<”
•–m—˜™„†Lš›= FGHIdKHe\…
+,œzUNOžŸ ¡MXIZ[X£¤M*8+,—©L\ªl8L{«¬_­m ’) ®E
¯°= _‘abc{±²+,2FGHIJKH³
´[\ªXH¥µ 11ªX¶¦= ·¸X¶2+,N
Ok79—©2\ªM%&!&!( (¹º–;
R$j§¤‘abcFG¶2X¶ ’´[’M
< !))( *+zZ2 2(1!.ªVWXY 2(32%ªZ[
Xz»M—·¸+,= N^¼½P¾p}~+,¿À{\Á
cÂÃ29—ÄÅ #= ÆÇÂ\…+,x #l^
Èm2ÉÊ\Ácµ !)ªE · !)ª\Ác˓œ
ÌÍÎÏÐ\…+,\ÁcNOLÉÊE
Ñ· !)ªÉÊ\ÁcL9—ÄÅґR$j
§¤‘abcFG¶X¶ ’ eFG˜™ÓL
ÔÕÖC= ˜™Ó×ؑ٠!Ù 4
!
’5!
!
!5(Ú5(
!
!)6
¥!7$5( %5( &¦= ŽDÛÜ 8’$= 9©2FG˜™†h
ݑÙ
’’8!$7#!9:;(#!!
!)
) 7 ’
!)9:;(!) 8%$
pe #!7
!)
) 7 ’
!)¥)7’5( !5(Ú5( !)¦= $Y %Y &N^Þ
hVWXY Z[X&)*x+,E
!
)hÝ ! …+
,2§ )ªÉÊ\Ác29—ÄÅE ß
%
*Ԥ !
²ÉÊ\ÁcX¶ ’e}~Z[X{ÂÃ29—
ÄÅà
&
’@‘§¤²ÉÊ\ÁcX¶ ’ e}~
)*x+,{ÂÃ29—ÄÅE á…âE
ÑR$j§¤‘abcFG¶2X¶ !ãä{
¾¹–;= o¿œÆÇÂX¶ !2 !)ªÉÊÁcE ¯°= åæX¶ ! e2\…+,2˜™†
’!8!$5¥!7$5! %5( &¦E —qç‘abcFG¶2çª
X¶= ³Ëãä{¾¹è;Ù ÇéêX¶2ÉÊ
\Ác= ëìØíFG˜™Ó= ¾¹p\ªFG˜
™†E
%&!&%( ( FG¶e+,2tîJK&JK¶2ïð
JK
—R$j§¤‘abcFG¶X¶ ’eñò¤
‘+,= Òß«¾¹Ù P¾ %&!&!eóôõö2
÷qêX¶2 !)ªÉÊ\Ác·‘+,{_t
ÂÃ29—ÄÅ = ×åê‘+,2˜™Ó
Ù 4+’5! +!5(Ú5( +!)6= ¾¹p˜™† ’’8$à ŽDÛ
Ü8!$Y 8%$œåê+,2˜™† ’’8$kX¶ ’2ªFG˜™† ’’8%$Y ’’8$$Y ’’8&$7x2˜™„
†Ù
’8!5!$7’’8!<$!’’8!$,’’8$( ( 8!7$5( %5( &$( ( ( 82$
Aª˜™Ó7x2˜™„†>?= øiAª˜™Ó
7x29:;<ù>mE ·‘+,2…úùû·ª˜™„†2ü?Ÿýö= þÙ
#’8!5!$7=>?@$’8$5!A5(%’8%5!A5(&’8&5!AB((
(((((((((((((((((((((((((((((((((((((((((((((((((((((((((((8!7$5(%5(&A((((((((((((((((((8.A
—XH ’eL}~+,= ÿ{!º= ¤ ¡t
îJKE
‘’JK˜™†!ºÿq#$+,…úLœ%
&= —ç‘abcJKHLçªXHeLñò¤’
+,5((ÿkp9—©LFGHXH}õöLÔÕ
ÖC×Øê)+,L˜™Ó= *¾¹p˜™†= ë
+kFGH—©XH{L\ªFG˜™†Ò˜™„
†E ê‘+,L…úû\ª˜™„†Lü?Ÿý
ö= ¯°,җ·¸+,…úL-$E ґ—
= .ј™ÓLÔÕÖCªCÇé‘ 2)I 12=
/ ¡’o¿L¾¹I-0E
!! !#$%&89’()34567
¯R$jI’|LT*B§¤VWXY exV
WXIü1VWXeÇé23˜™ÓN^¾¹\…
VWXFGHL˜™† ’’8%AY ’’8-AY ’’8)A5(¾
¹JKHe礑+,L˜™† ’’8A5(¯°¾¹
’!/
!#! # # $ % &
’()*+
!$%!!&!’()%!!*!’!)+!!,#$%!,- - - - - - +!($&- %&! &’-
,-.(/012.()*345 67,-.(/
0189:;<3=> ?@ABCDEF?G-.
()*CH4IJK5 LM
)+!!&!!’(./01#)%!$&!!,&- $)%!%&- !,&- %)%!&&!!,2-
- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - +!($&- %&- &,
NOPQRGDSTU8VW>
!# !#$%
XYZ[\]^_‘a^CKbcdef8V
g^h3)4&$56>
\]^i7807/9/:/9;jM ’(()*<+*&- kl$mV
W8hno
‘a^p7=8>/?/>/9;jM )(()*<**5 klVgq
r2st9uvwxyp>@AA8BC9/@0- >@8??/>/809j zb
{+ ,,( %)*,%)-,!%.*,%.-,
%**,%*-,%+*,%+-,!
5 |}c~VW
€C‚ƒ
? „ **()**.*5 *-()-*.-5 +*()**.-5
+-()-*.*5 …†5 )* ‡ˆ‰DABŠVW‹Œ
8ABy5 .-‡?‰DAB†ŽŠV‹Œ
CABy5 .*‡‘’…“DAB”ŠV•{–
DABCABy5 )-‡‘’…“DAB—˜V
{…“DABCABy&! **‡V{ˆ‰DAB
CABy5 *-‡V{…“DABCABy5 +*
‡ˆ‰DABCP™ABy5 +-‡…“DABC
P™ABy> F’\]^_‘a^šš‡‰R›
œ5 ‰ž9uvwxyŸ|}c~VC€ƒ
# &’()*
#$ +,-./0123456789:;<=>
?@=/23ABCDEF
 ¡R¢£¤_¥¦§¨©ª@«¬­†®¯
U° STU_§¨1ABCVW5 ±eqrk7Z
[ 5#-²yCVW³rH´>  ¡Rª@«¬­
Cµ‚¶_·¸¶C¹º»¼·¸_½¾·¸5 ¿
­ÀÁVWÂÃÄk DÅlƒ ¢£¤ª@«¬­
‚¶¿VWÆÇÈÉÊ E!F$GH 5 ·¸¶{
EIFG4Jo ¥Ëª@«¬­‚¶¿VgÆÇÈÉ
Ê IGF5IJ5 ·¸¶ÉÊ E)FGDJƒ Ì@«¬­C
·¸¶ªÍABÎ<ÏG-Ð1ÏÑVWÅÒyÓ
C§ÔÕ5 Rª@«¬­GDEABC·¸¶¿­
ÀÁVWÈÅÒCG-ցIÄk #Ålƒ ¡×
CØÙڇÛABÏ{®¯U_STU,DABÜ
ÝVg5 VgÆÇÈ{ EJKGJÞß3)&)E65 OR’
àáâЮ¯U_§¨1CÐÏãäåæ5 ‘ч
GçABèéVWêëä´Cqrƒ ÄrÍìíî
ABŸïe5 ,DABCÆÇÈð{ 4J5 OG
DABCVgÆÇÈñ{ DDFDJƒ Ş5 Lòèé
VgGDABCÆÇÈäèéVg,DABCÆÇ
Èóô5 õ‡ä´Cqrƒ
 ¡RãVgqrC ,,I5 sžö÷ø 5#
-²yC.(*Vgqrä´5 ¢£¤ª@«¬­
‚¶¿‚ŒÈÉÊ E!F)GJ5 ·¸¶{ EIFG4Jo
¥Ëª@«¬­¿C‚ŒÈÉÊ IGF5IJ5 ·¸¶
ÉÊ E)FGDJƒ ¢£¤¿CVgÆÇÈiE4F!GJj
ã¥Ë¿CVgÆÇÈiEFDIJj ùúƒ ¡×‰
ûVg‹ŒÈäúCØÙüý÷þ9uvwxy
,,CIƒ
STUVWÆÇÈäú5 ˜íÎ<)=Oÿ
úo R’àáâABO!5 §¨1AB_®¯UC
VWÆÇÈÁäô5 ®¯UC¿­VW³r䴒
§¨15 ¥ËC#†ãä7Tƒ O˜5 Ì@«
¬­àáâABCU¶ Di$¯äΰ -y%&C
ABjC¿­VWÈ’ô5 ‘(‡¢£¤) !@«
¬­C‚¶U¶ D_¥Ë) !@«¬­C‚¶
U¶ Dƒ *…+¨5 shž,-.M +)’-áâA
B/0ůC1t2=’àáâAB5 …Gw­
CÏ3‘47To +!’-®¯U§¨C5É67†
89s:;CÙ[5 ÄM LE®¯U<=ø>?
@AÙ[° BC>?D.CEF3)G6G5 HIRJY
C#%KbCL*M<ùú’N‰A. thaliana C. elegans
S C hr

Chr

Chr
 Chr
Chr

Chr

C hr

Chr

Chr
 Chr
Standard set 86.49 80.87 87.05 82.22 81.34 76.19 78.81 77.50 76.71 78.67
Test set
O verall rate
86.59
86.75
86.50
85.04
86.80
85.83
84.04
85.18
77.46
82.71
76.49
81.16
77.68
74.76
72.02
79.71
75.96
79.55
79.16
81.58

%&’() ! MN8O 7807/9/:/9/87- P/9N- 5#- 9A/.8A7- /0- 79C0QCAQ- 7897- C0Q- 9879- 7897- @?- >NA@.@7@.87- @?- +/ 0123$2(2 C0Q- ,4 53562(7
)!E
! ! #$%&’()*+,-./0123456
+,27*8+,# $%*8+,9:;<=>
?@ABCDEFGHI JKLMNOPQRS
TU
LVWXI YZ[\]^_‘]abcde
fI g>cdhi@jak]lmnopqErU
)*sLtu]lvcdXw>xZyI z{|}
Jcd@I ~€XI L‚ƒ„X…†‡ˆ‰Š
Q‹Œ+,CDvŽWzI cdef‘ˆ’
‡“U
!# !#$%&’()*+,-./
”•j–$—&’˜v:™šH›œžŸv
 u¡!’(¢ £)k¤I ~g>¤ %u£)ef
¥¦U *§j–$—&’˜2:™šH›œ¨©
2:;<ª« %¬¤­® °¯±!w²:™šH›
³´œµ¢
)¯¢ *+¶³œµž %u£)& %,¶´œµ
Ÿ %u£)®
-.-/ .0-!-1.# .··-# 1.0!.1-# -.-
# # !¯¢ *,¶³œµŸ %u£)& %,¶´œµ
ž %u£)®
-.-# .0-!-1.# .··-# 1.0!.1-# -.-#
# # %¯¢ *,¶³œµ¸¹º %u£)& %,¶´œ
µ¸¹º %u£)®
-.-# .0-!-1.# .··-# 1.0!.1-# -.-
!#$%&’%(%2-&
)*&*+%(,20&
$%&’( ! 134/ 56789:;8/ 6# <69# 84;8# ;48;# ?@83# !AB# CA# :7D# EC# 89@F49;/ 620 trinucleotides 40 trinucleotides 64 trinucleotides
class
Sn

Tn

CC

Sn

Tn

CC

Sn

Tn

CC

exon 91.68 90.51 81.24 89.70 90.90 83.64 85.16 88.36 77.41
Chr(A) intron
inter
82.12
62.70
84.94
61.42
76.35
54.22
79.67
78.17
89.71
61.57
75.21
61.34
86.90
83.79
92.62
69.64
83.61
70.17
exon 88.60 88.82 81.60 80.28 87.70 72.56 91.55 87.55 83.14
Chr(A) intron
inter
71.85
87.80
85.48
63.94
64.67
68.28
76.22
82.91
83.45
60.27
68.26
62.82
75.17
77.73
86.30
67.10
68.87
63.51
exon 82.02 86.24 73.25 97.08 91.63 89.80 94.70 88.53 84.70
Chr(A) intron
inter
77.10
78.99
82.07
64.59
67.32
63.60
86.77
76.10
93.53
75.52
84.51
70.06
78.27
72.64
89.86
66.02
75.18
61.88
exon 80.50 84.86 71.41 81.30 89.12 73.74 83.80 97.85 84.79
Chr(A) intron
inter
78.20
72.54
81.42
62.64
68.24
57.11
78.47
93.84
86.81
60.60
71.87
70.53
84.28
89.91
89.46
63.32
79.09
68.37
exon 87.20 82.48 70.97 82.53 85.74 75.50 83.22 89.89 77.77
Chr(C) intron
inter
87.59
53.53
77.54
79.58
73.22
58.33
85.88
66.36
75.99
86.61
63.46
71.07
82.34
82.07
83.22
67.24
69.80
68.92
exon 78.71 82.45 65.63 84.32 86.32 75.54 83.74 88.52 75.78
Chr(C) intron
inter
65.06
58.62
71.74
44.77
50.66
38.81
79.37
66.93
83.47
57.88
69.15
52.72
88.28
63.41
88.55
53.31
79.73
51.14
exon 87.74 87.46 81.36 77.91 86.29 73.35 84.52 87.11 79.05
Chr(C) intron
inter
74.02
71.34
82.13
61.87
65.14
51.69
76.71
72.68
83.10
59.34
67.84
48.74
78.63
71.59
82.02
64.15
68.20
53.54
exon 84.67 89.69 79.27 73.38 84.96 66.05 93.58 86.29 82.76
Chr(C) intron
inter
74.74
61.05
83.00
43.86
63.63
39.99
77.89
52.34
72.00
46.45
56.63
37.19
80.73
71.56
94.51
56.12
77.85
57.24
exon 91.39 79.62 77.63 76.38 85.69 70.69 78.07 91.65 77.04
Chr(C) intron
inter
63.15
79.91
91.45
46.62
59.21
51.54
80.06
73.69
91.40
45.62
74.47
47.69
81.38
88.02
97.07
46.92
80.39
55.86
exon 92.61 86.95 78.92 80.46 75.04 62.54 78.89 88.48 71.97
Chr(C) intron
inter
70.38
46.11
72.23
51.73
61.19
34.12
78.72
50.96
89.21
43.77
71.90
36.78
77.58
100.00
89.63
52.60
71.98
68.48

)!(
!#! # # $ % &
’()*+,-./01 2,-.3 $456
789: $;<=> 9?@A $;<=B3 #+C
D34E> );<=BF #4GE !
!
%H !
!
&H !
!
’H
!
!
(> $ ;<=I )! 4JE !
!
%H !
!
&H !
!
’H !
!
(H
!

%H !

&H !

’H !

(H !
#
%H !
#
&H !
#
’H !
#
(K 8L *!
4JEMNOPQJE> AROPSTOPUS>
VWXYZOPUS[\]^> _‘ab]^cd
5efZ> ghijcdkl +mnopB $$,
ln5qr-sEtu
vl +B’wxyTz{,|}~<]^c
d78€‚4#+3ƒ*+,-./0„…]†
cdY‡> ˆ?‰BŠ}~<]^:‹ŒŽ
-,8> ‘‰’N“”<•}~<–Š—˜™
6š›œCD3žŸu ƒ +,-./0„…ƒ
*H Y¡}~<3]^:‹Œ¢1 £’7¤‰v(
D’¥¦§*¨©8ª«¬¦< %(&­®> 8¯
°¬¦< (&%H (%%H (%&c±u
ƒ*+,-./0²ƒ +,-./03]^
cd³~‘‡> ´p‰ƒ*+,-./03 $$µ
³~¢(¡‚+> ¶³}~<“·‚¸3CDž
Ÿ¹(“”<“·‚¸CD3žŸu º,+,-
./0]^:‹Œ»(‚¼–Š½¾‡¶³L*
¿u ’ÀÁwxyTz{—˜™6š›œ3CD[
\Â@A> cdl³—“” C
DÅ9ž> ˆ?‰—“”<“· +. ¸8 &( ­
®H $.¸8 %&cÆ> L‚ÁCDÅ9ž¾‡Ç
ȓ”<3 &(/%&ÉÊu wxy“”<“· $.¸
ËBŠ}~
Íz{—5Î6ÏÐ̇ (u ‚4#+ƒ*}~<
ÑÒÓ»šÔƒ,6T¡ƒÕ6CD֝ž}1 ˜
™6¿Ô3!#H !!H !) 6¿½Öž1 ©Á %0
Å9̇u ‚4#+—¯°¬¦<6Ï×̇
(%%u ¯°¬¦<¡3‚46¿CD½5Áž1
wxẏ &(1 z{̇ %(u L؝žCD½
ىD’Ú°ÛÜ3·9ÝÞov(ßàmá> â
ã@AEXltu
—‚4#+BŠ}~¸º*6CDž|äåæ> wxẏ &> Í
z{Ê̇ %u (‰çèéÁº*}~<“· $.
¸ê BŠ}~<“· +.¸T $.¸H Y¡}~<“
· +.¸ -.CD[\Â@A1 cd~nwx
yTz{º*}~8 %&c±ìwxy’p„íîŒÐ¢ï ÍBŠ}
~xẏ &(1 z{ñ̇ %(u v8EX_‘
—˜™6¿3‚òCD©Å9̇óôCDu LØ
6¿ÁCD3̇Îõ‰D’B“”<ö÷˜™3
*Øø‘ÝÞu
ù}çèéÁwxyTz{BŠ}~<“· +.
¸N &(ì $.¸N %&3úû[\Â@A1 wxy
120#+! ;BF 20233 ;1 ü ))4++,ï z{ 3202+1
;BF !0-2$ ;1 ü $4$-,u ýØBŠ}~<8
&(­®8 %& c±50½Ù‰LØ}~õM“”<˜™> #Í$%&’Ÿ()D’l
3·9*+u Í’(“”< 6}~<Ó»˜™6¿
]^*,‰²Ö-.3/0> 12„í3GE3
a> Í]^:‹ŒVåf¢7*38> Í’(,|}~<
3$4]^125Mf6u —À]^70B> 8*
3]^GE99OPUS:„íu
!#$% & (9:0 ;:<=>?0 @A0 B;:CDE?D@F0 A@;0 %&’()*!)+) GFC0 $&,*,-)+.
The result of prediction

A. thaliana C. elegans Kinds Class
Sensitivity % Specificity % CC% Sensitivity % Specificity % CC%
the first exon 86.04 73.69 75.66 86.04 69.75 74.93
the mid exon

92.80

93.24

77.37

95.67

97.04

81.11

No.1
the last exon

81.73

95.54

86.48

87.13

97.72

89.03

the first exon

89.86

54.38

63.18

82.29

32.91

44.76

the mid exon

68.30

94.50

54.72

61.50

95.49

38.14

No.2

the last exon

89.01

55.62

63.70

87.34

33.89

47.45

the first exon

86.36

56.78

63.48

85.57

39.52

52.26

the mid exon

73.99

93.55

57.89

74.44

96.41

50.05

No.3

the last exon

88.27

62.12

68.49

87.96

49.07

61.19


*$H
! ! #$%&’()*+,-./0123456
! #$%& ’( #)* *+$*(,* !(% -./,* /#* ’0 !#$%&’%(% !(% )*&*+%(, 1*(*
#$%& ’()*(+,& & & & -./ 0(+1234516,/ / / / -.%/ #+5/ / /
!#$%&’(#)’ *+ ,-./01/2 3*44#5# *+ 610#)1#/ %)7 8#1-)*4*5.2 9))#& :*)5*40% ;)0<#&/0’.2 =*--*’ >?>>@?A 3-0)%B
!23456748 748/ 95:;<8=8/ >8?’8198>& 5@& CD’-%40%)% +1A& 3D#4#5%)/ 6815:8& +B8& A(C(A8A& (1=5& =4B88& D(1A>E&
8*51>F / (1=B51>/ +1A/ (1=8B681(9G%HI / 748/ JKF / KL/ +1A/ !L/ =B(:8B>M& ;B5N+N(<(=(8>O 5@& =48& =4B88& D(1A>& 5@&
>8?’8198>& +B8& B8>;89=(C88<89=8A& +>& ;+B+:8=8B>& 5@& =48& >5’B98>& 5@& A(C8B>(=PI & 748& 9<+>>8>& 5@& =48>8&
>8?’8198>& +B8& ;B8A(9=8A& NP& =48& (19B8:81=>& 5@& A(C8B>(=P& =48& :(1(:’:& 5@& =48& =4B88& (19B8:81=>I& 748& B8>’<=>&
>45Q1& =4+=& =48& 5C8B+<<& ;B8A(9=(51& +99’B+9(8>& 5@& CD’-%40%)% R>& & 8C8BP& 94B5:5>5:8& +B8& S!ITUV/ +1A/ SWIUXV/
@5B/ =48/ >=+1A+BA2>8=>/ +1A/ =8>=2>8=>Y / =48/ 5C8B+</ 5@/ 3D#4#5%)/R/ 8C8BP/ 94B5:5>5:8/ +B8/
WUIJWV/ +1A/ STIUZV/ @5B/ =48/ >=+1A+BA2>8=>/ +1A/ =8>=2>8=>F/ B8>;89=(C8/ (1/ CD’-%40%)%
+1A/ 3D#4#5%)/ 6815:8/ +B8/ A(C(A8A/ (1=5/ =4B88/ =P;8>I / [+>8A/ 51/ =48/ @B8?’819(8>/ 5@/ K/ D(1A>/ 5@/ N+>8>/ (1/
B86(51>/ 18+B/ (1=B51\8*51/ N5’1A+BPF / (1(=(+=(51/ +1A/ =8B:(1+=(51/ >(=8/ @5B/ =B+1><+=(51F / =48/ A(C8B>(=P/ >5’B98/ (>/
95:;5>8A/ 5@/ T!/ >8?’8198/ ;+B+:8=8B>I/ 748/ =4B88/ D(1A>/ 5@/ 8*51>/ +B8/ ;B8A(9=8A/ NP/ ’>(16/ 5@/ +1/ +<65B(=4:/
N+>8A/ 51/ =48/ (19B8:81=/ 5@/ A(C8B>(=PI/ 748/ B+=8>/ 5@/ 95BB89=/ ;B8A(9=(51/ 4(648B/ =4+1/ SLV/ +B8/ 5N=+(18AI/
/ / / / 9:; <=5>38 $*51Y/ .1=B51Y/ .1=8B681(9G%HY/ ]8+>’B8/ 5@/ A(C8B>(=PY/ ^;<(98O >(=8
!#$%
_T‘O O789FO:;*?@ABCDEB+,FG56IO
DHIJKKLaMNOKPbFO !LL!FKEKLTcKLJ
_!‘ O O ^1PA8BO $$F O ^=5B:5O dGI O .A81=(@(9+=(51O 5@O ;B5=8(1O 95A(16O
B86(51O (1O 6815:(9O G%HIO E (*4 F*04FO TUUXF!KSETcTS
_Z‘O O ^+<3N8B6O ^-FO G8<948BO H-FO e+>(@O ^FO f4(=8O gIO ](9B5N(+(A81=(@(9+=(51O ’>(16O (1=8B;5<+=8AO ]+BD5CO :5A8<>I O GH14#01
C107/ I#/FO TUUSF!EXKKcXKS
_K‘ O O h56(9O ^F O g’8<<8==8O [iiF O ]+9DQ5B=4O HeI O .:;B5C(16O 6818O
B89561(=(51O +O 99’B+9PO NPO 95:N(1(16O ;B8A(9=(51>O @B5:O =Q5O
68182@(1A(16O ;B56B+:>IO F0*0)+*&(%’01/FO !LL!FTSaSbETLZKcTLKX
_X‘ O O ]8P8BO .]F O G’BN(5O hI O 5:;+B+=(C8O +NO (1(=(5O ;B8A(9=(51O 5@O
6818O >=B’9=’B8O ’>(16O ;+(BO #]]>IO F0*0)+*&(%’01/A !LL!FTSaTLbE
TZLUcTZTS
_J‘ O O #+=3(685B6(5’O HdI O 7B+1><+=(51O (1(=(+=(51O >=+B=O ;B8A(9=(51O (1O
4’:+1O 9G%H>O Q(=4O 4(64O +99’B+9PI O F0*0)+*&(%’01/A !LL!F O
TSa!bEZKZcZXL
_W‘O O -.%O jFO klO j-IO mB8A(9=(51O 5@O ;B5D+BP5=(9O ;B5:5=8B>O N+>8AO 51O
;B8A(9=(51O 5@O =B+1>9B(;=(51+IOQRSKCQRR3KLFO
!LLZFZXaKbEZTWcZ!K
_S‘ O O ]5B681>=8B1O [F O h(118BO gI O $*51O A(>95C8BPO NPO 6815:(9O
>8?’8198O +<(61:81=IO F0*0)+*&(%’01/A !LL!FTSaJbEWWWcWSW
_U‘O O H4B(168BO jIO 7’B1O =5O =48O Q5B:IO 3H&& J$0) K#)#’ #WEKTLcKTX
_TL‘O [+>>8==O G$O jBFO [56’>D(O ]^FO ^;8198BO iFO e(:O ^FO f8+C8BO 7FO
#(8=8BO mI O d815:8O 9B5>>2B8@8B819(16O +1AO kh$iANE O
(:;<(9+=(51>O @5BO =48O (A81=(@(9+=(51O +1AO +1+

(>O 5@O 6818>O
:’=+=8AO (1O 4’:+1O A(>8+>8IO G%’ K#)#’A TUUWFTXEZZUcZKK
_TT‘OTUFOVWXIOY$Z)*?56[\IO]QRK^LFO !LLTF
ZEUUcTLT
_T!‘O -+*=51O hhIO 748O :8+>’B8O 5@O A(C8B>(=PIO E 8-#*& F0*4FO TUWSFWTE
XTcJW
_TZ‘ O -(O 0nF O -’O n0I O 748O ;B8A(9=(51O 5@O =48O >=B’9=’B+>O 5@O
;B5=8(1E O +;;<(9+=(51O 5@O =48O :8+>’B8O 5@O A(C8B>(=PI O E 8-#*&
F0*4A !LLTF!TZEKUZcXL!
_TK‘O_‘aFOVbcIO OdefghijklmnoIOQRR3K
LFO !LLTFKEWLZcWT!
_TX‘OpqrFOVbcIOdefghistQRukl2vwx0yIO
DHIzKKLoMNOKPbFO !LLZFXEXTLcXTW
_TJ‘O [+p(9O q[FO ^8+4O ^#I O GB+651O 6818O >=+B=O @(1A8BE O +1O +AC+198AO
>P>=8:O @5BO @(1A(16O +;;B5*(:+=8O <59+=(51>O 5@O =48O >=+B=O 5@O 6818O
=B+1>9B(;=(51+IO K#)*(# I#/#%&1-A !LLZFTZaSbETU!ZcTU!U
_TW‘O{|FO}~FO€FO‚ƒ„FO…†‡IOˆtQR G%H‰Š‹
Œ2?ŽIOQRSKCQRR3[\FO !LL!FKEXSZcXSW
_TS‘ O m8B=8+O ]F O -(1O krF O ^+<3N8B6O ^-I O d818>;<(98BE O +O 18QO
95:;’=+=(51+;<(98O >(=8O ;B8A(9=(51I O GH14#01
C107/ I#/FO !LLTF!UETTSXcTTUL
_TU‘OV‘FO7IO O .oDEB‘’56[\IOQRSKCQRR
3[\FO !LLZFZEZJZcZJS
TZT