Long-range PCR - GC/AT-rich ¹Ýº¹ ¼¿ÀÇ multiplex PCR Àû¿ë (53 kb Ÿ°Ù)
PrimeSTAR LongSeq DNA Polymerase (Code R055A)
- ´ÙÄ«¶óÀÇ µ¶ÀÚ ±â¼ú·Î long fragment (50 kb ÀÌ»ó) ÁõÆø ½ÇÇö
- GC/AT rich ¼¿ÀÇ long fragmentµµ ÁõÆø °¡´É
- ³ôÀº ƯÀ̼º: ºñƯÀÌÀûÀÎ ÁõÆøÀ» ¾ïÁ¦Çϰí long fragmentÀÇ multiplex PCR Àû¿ë °¡´É
- ¹Ýº¹¼¿¿¡ ´ëÇÑ Å¹¿ùÇÑ ÁõÆø
- Long-read NGS¸¦ À§ÇÑ ±ÕÀÏÇÑ coverage »ý¼º °¡´É
¡á Introduction
3¼¼´ë ½ÃÄö½ÌÀ̶ó ºÒ¸®´Â Long-read NGS´Â ÀϹÝÀûÀ¸·Î 20kb ÀÌ»óÀÇ read¸¦ »ý¼ºÇϳª, ÀÌ¿Í ´ëÁ¶ÀûÀ¸·Î ±âÁ¸ NGS ½ÃÄö½Ì ±â¼úÀº ÀϹÝÀûÀ¸·Î 150~300 bpÀÇ read¸¦ »ý¼ºÇÑ´Ù. ÀϹÝÀûÀ¸·Î ªÀº read´Â À¯Àüü ³»ÀÇ ¹Ýº¹¼¿, ±¸Á¶Àû º¯ÀÌ, ±ä polymer ±¸¿ªÀ» Á¤È®ÇÏ°Ô ½Äº°Çϱ⿡ ÇѰ谡 ÀÖ´Ù. ¹Ý¸é Long-read NGS´Â ÀÌ·¯ÇÑ ¹Ýº¹¼¿¿¡ ´ëÇÑ Á¤È®ÇÑ read¸¦ Á¦°øÇÒ ¼ö ÀÖÁö¸¸, ±æ°í º¹ÀâÇÑ ±¸Á¶ÀÇ DNA¸¦ PCR ÁõÆøÇÏ´Â °ÍÀÌ Á¾Á¾ ¾î·Á¿î °æ¿ì°¡ ÀÖ¾î Long-read NGSÀÇ È°¿ë¿¡ ÇѰ谡 ÀÖ°í, ±æ°í ¹Ýº¹ÀûÀÎ GC/AT ¼¿ÀÇ DNA¸¦ ƯÀÌÀûÀ¸·Î ÁõÆøÇØ¾ß ÇÏ´Â ¿¬±¸¿¡´Â ÀûÇÕÇÏÁö ¾Ê´Ù.
ÀÌ·¯ÇÑ ¹®Á¦¸¦ ÇØ°áÇϱâ À§ÇØ ´ÙÄ«¶ó¹ÙÀÌ¿À´Â ±æÀ̰¡ ±æ°í GC/AT richÇÑ ¼¿ (ÃÖ´ë 80% GC)ÀÇ Æ¯À̼º ³ôÀº ÁõÆøÀ» À§ÇØ ÃÖÀûÈµÈ DNA polymerase¸¦ °³¹ßÇÏ¿© Å×½ºÆ®¸¦ ½Ç½ÃÇÏ¿´´Ù.
¡á Result
ÃÖ´ë 53kbÀÇ ÃÊÀå°Å¸® PCR ÁõÆø
PrimeSTAR
¢ç LongSeq DNA Polymerase (ÀÌÇÏ PrimeSTAR
¢ç LongSeq) ¸¦ ÀÌ¿ëÇÏ¿© 0.5 kb~ 53 kb »çÀÌÁîÀÇ human genomic DNA (gDNA)¸¦ ¼º°øÀûÀ¸·Î ÁõÆøÇÏ¿´´Ù.
±×¸²1. 0.5~53 kb±îÁö ´Ù¾çÇÑ ±æÀÌÀÇ Å¸°ÙÀ» ºñƯÀÌÀûÀÎ ÁõÆø ¾øÀÌ ¼º°øÀûÀ¸·Î ÁõÆøÇÑ PrimeSTAR¢ç LongSeq
Lane M1: 1 kb DNA ladder. Lane 0.5 kb:
p53 target. Lane 1 kb:
DCLRE1A target #1. Lane 2 kb:
DCLRE1A target #2. Lane 4 kb:
DCLRE1A target #3. Lane 15 kb: ¥â-globin target #1. Lane 24 kb: ¥â-globin target #2. Lane 30 kb: ¥â-globin target #3. Lane 40 kb:
HBB target. Lane 53 kb:
SLC30A9 target.
Ÿ»çÀÇ ´Ù¼¸ °¡Áö long-range PCR È¿¼Ò¿Í ºñ±³ÇÑ °á°ú, PrimeSTAR
¢ç LongSeq ´Â 52~53 kb Ÿ°Ù ÁõÆøÀÌ ´õ ¿ì¼öÇßÀ¸¸ç ´Ù¾çÇÑ Å¸°Ù¿¡¼ ³ôÀº ÁõÆøÈ¿À²°ú ³·Àº ºñƯÀÌÀûÀÎ ÁõÆøÀ» º¸¿´´Ù (±×¸² 2).
±×¸² 2. ¸Å¿ì ´Ù¾çÇÑ »çÀÌÁîÀÇ Å¸°Ù ÁõÆø¿¡¼ ¾ÐµµÀûÀÎ ÁõÆø È¿À²À» ³ªÅ¸³»´Â PrimeSTAR¢ç LongSeq
Human gDNA¿¡¼ 52~53 kb Ÿ°ÙÀ» ÁõÆøÇϱâ À§ÇØ PCRÀ» ¼öÇàÇÏ¿´°í, PCR »ê¹°Àº magnetic beads·Î Á¤Á¦ÇÏ¿© 100 pgÀ» Femto Pulse system (Agilent Technologies)·Î ºÐ¼®ÇÏ¿´´Ù.
Lane M: Agilent 165 kb DNA Ladder. Lane 1:
PUM1 target (52 kb). Lane 2:
SLC30A9 target (53 kb).
GC/AT rich ¼¿ÀÇ ÁõÆø
GC rich ¼¿Àº hairpin°ú °°Àº 2Â÷ ±¸Á¶¸¦ Çü¼ºÇϱ⠽±°í melting ¿Âµµ°¡ ´õ ³ôÀ¸¸ç, AT-rich ¼¿Àº DNA ÀÌÁß±¸Á¶¸¦ ºÒ¾ÈÁ¤ÇÏ°Ô ¸¸µé°í annealing ¿Âµµ°¡ ´õ ³·´Ù. ÀÌ·¯ÇÑ Æ¯¼ºÀº Á¾Á¾ PCR ¹ÝÀÀ Á¶°Ç ÃÖÀûȸ¦ ÇÊ¿ä·Î Çϸç multiplex PCR ½Ã GC/AT rich ¼¿·Î ÀÎÇØ Á¶°Ç ¼³Á¤¿¡ ¾î·Á¿òÀ» ¾ß±âÇÒ ¼ö ÀÖ´Ù.
±âº» ¹ÝÀÀ Á¶°Ç¿¡¼ PrimeSTAR
¢ç LongSeq´Â GC 65~66%ÀÇ 17~20 kb ¼¿À», AT 65~66%ÀÇ 16~21 kb ¼¿À» ¼º°øÀûÀ¸·Î ÁõÆø½ÃÄ×°í (±×¸² 3), GC/AT rich ¼¿À» ¼¿·ÎÀ¸·Î ÇÑ multiplex PCR¿¡¼µµ ±ú²ýÇÑ ÁõÆø °á°ú¸¦ È®ÀÎÇÏ¿´´Ù (±×¸² 4).
±×¸² 3. ¼º°øÀûÀÎ GC/AT rich ¼¿ ÁõÆø.
ºñƯÀÌÀûÀÎ ÁõÆøÀÌ ºó¹øÇÑ GC/AT rich ¼¿µµ PrimeSTAR
¢ç LongSeqÀ» »ç¿ëÇÏ¸é Æ¯º°ÇÑ buffer³ª º°µµÀÇ ¹ÝÀÀ Á¶°Ç ÃÖÀûÈ ¾øÀÌ long-range PCRÀÌ °¡´ÉÇÏ´Ù.
±×¸² 4. GC/AT rich ¼¿ÀÇ multiplex PCR¿¡¼µµ ¾ÈÁ¤ÀûÀÎ ÁõÆøÀÌ °¡´ÉÇÑ ¢ç LongSeq
µÎ ¼¼Æ®ÀÇ ÇÁ¶óÀ̸ӷΠGC/AT rich ¼¿À» µ¿½Ã¿¡ ÁõÆøÇÏ¿´°í, ÇÁ¶óÀ̸Ӵ ÃÖÁ¾ ³óµµ 0.2 ¥ìMÀ¸·Î »ç¿ëÇÏ¿´´Ù. ÁõÆø ÈÄ 4200 TapeStation System (Agilent Technologies)À¸·Î ÁõÆøÀ» È®ÀÎÇÏ¿´´Ù. Lane 1: 7 kb target (GC: 68%) ¹× 21 kb target (AT: 65%), Lane 2: 7 kb target (GC: 68%), Lane 3: 21 kb target (AT: 65%), Lane M: ladder, Lane 4: 8 kb target (AT: 66%) ¹× 20 kb target (GC: 65%), Lane 5: 20 kb target (GC: 65%), Lane 6: 8 kb target (AT: 66%)
¿Âµµ ¾ÈÁ¤¼º (Temperature stability)
¸ðµç mixture¸¦ ÷°¡ÇÑ ÈÄ PCR cycleÀ» ½ÃÀÛÇϱâ Àü±îÁö 4 ¡É ¶Ç´Â ½Ç¿Â¿¡¼ PCR ¹ÝÀÀ¾×ÀÇ ¾ÈÁ¤¼ºÀ» À¯ÁöÇÏ´Â °ÍÀº ÀÏ»óÀûÀÎ ½ÇÇè Áö¿¬ »óȲÀ̳ª high throughput ½ÇÇè¿¡ Àû¿ëÇϱâ À§ÇÑ Çʼö Á¶°ÇÀÌ´Ù. ÀÌ·¯ÇÑ ¿Âµµ ¾ÈÁ¤¼ºÀº ƯÈ÷ ÀÚµ¿ ºÐÁÖ ÀåÄ¡ (automated liquid handler)¸¦ »ç¿ëÇÏ¿© PCR ¹ÝÀÀÀ» ¼ÂÆÃÇÏ´Â °æ¿ì¿¡ ¸Å¿ì Áß¿äÇÏ´Ù.
PrimeSTAR
¢ç LongSeqÀº PCR ¹ÝÀÀ¾×À» ÁغñÇÑ ÈÄ 4 ¡É¿¡¼ 17½Ã°£ ¶Ç´Â ½Ç¿Â¿¡¼ 1½Ã°£ º¸°ü ÈÄ ¹ÝÀÀ¿¡ »ç¿ëÇØµµ ³ôÀº ƯÀ̼ºÀ» À¯ÁöÇÒ ¼ö ÀÖ¾ú´Ù (±×¸² 5).
±×¸² 5. ´Ù¾çÇÑ Á¶°Ç¿¡¼µµ ³ôÀº ƯÀ̼ºÀ» À¯ÁöÇÏ´Â PrimeSTAR¢ç LongSeq
4¡É¿¡¼ 17½Ã°£ ¶Ç´Â ½Ç¿Â¿¡¼ 1½Ã°£ µ¿¾È PCR mixture¸¦ º¸°üÇÑ °æ¿ì¿¡µµ ºñƯÀÌÀûÀÎ ÁõÆøÀ» ¾ïÁ¦Çϱ⠶§¹®¿¡ ÀÚµ¿ÈµÈ workflow¿¡ ÀûÇÕÇÏ´Ù.
¹Ýº¹¼¿ÀÇ 20-plex PCRÀ» ÅëÇÑ long-read NGS ºÐ¼®
¹Ýº¹µÇ´Â DNA ¼¿À» PCR ÁõÆøÇÏ´Â °ÍÀº Á¾Á¾ ½ÇÇè »óÀÇ ¹®Á¦¸¦ ¾ß±âÇÑ´Ù. ¹Ýº¹ ¼¿Àº hairpin ±¸Á¶¸¦ Çü¼ºÇϱ⠽¬¿îµ¥, ÀÌ´Â DNA polymeraseÀÇ ÇØ¸®¸¦ ÃËÁøÇÏ¿© ÀÌ·Î ÀÎÇØ ÇÕ¼ºµÈ ºÒ¿ÏÀüÇÑ ´ÜÆíÀÌ mega primer·Î¼ ±âÁ¸ primerÀÇ ¿ªÇÒÀ» ´ëüÇÏ°Ô µÈ´Ù.
PrimeSTAR
¢ç LongSeqÀ¸·Î PacBio PureTarget ÆÐ³ÎÀÇ repeat expansion loci 20°³¸¦ Å×½ºÆ® ÇÏ¿´°í (Ç¥ 1), ÀÌ·¯ÇÑ human loci´Â ÇåÆÃÅϺ´ ¹× ±Ù±äÀ强 ÀÌ¿µ¾çÁõÀ» Æ÷ÇÔÇÑ ¹Ýº¹ È®Àå Áúȯ (repeat expansion disorders) (Ibanez
et al. 2024)¿Í °ü·ÃÀÌ ÀÖ´Ù.
Ç¥ 1. ½ÇÇè¿¡ »ç¿ëµÈ repeat expansion ÆÐ³Î
Label |
Chromosome |
Target |
Repeat Definition |
A |
chr12 |
ATN1 |
MOTIFS = CAG; STRUC = (CAG)n |
B |
chrX |
AR |
MOTIFS = GCA; STRUC = (GCA)n |
C |
chr6 |
ATXN1 |
MOTIFS = TGC; STRUC = (TGC)n |
D |
chr22 |
ATXN10 |
MOTIFS = ATTCT; STRUC = (ATTCT)n |
E |
chr12 |
ATXN2 |
MOTIFS = GCT; STRUC = (GCT)n |
F |
chr14 |
ATXN3 |
MOTIFS = GCT; STRUC = (GCT)n |
G |
chr3 |
ATXN7 |
MOTIFS = GCA, GCC; STRUC = (GCA)n(GCC)n |
H |
chr13 |
ATXN8OS |
MOTIFS = CTA, CTG; STRUC = (CTA)n(CTG)n |
I |
chr9 |
C9ORF72 |
MOTIFS = GGCCCC; STRUC = (GGCCCC)n |
J |
chr19 |
CACNA1A |
MOTIFS = CTG; STRUC = (CTG)n |
K |
chr3 |
CNBP |
MOTIFS = CAGG, CAGA, CA; STRUC = (CAGG)n(CAGA)n(CA)n |
L |
chr19 |
DMPK |
MOTIFS = CAG; STRUC=(CAG)n |
M |
chrX |
FMR1 |
MOTIFS = CGG, AGG; STRUC=(CGG)n |
N |
chr9 |
FXN |
MOTIFS = A, GAA; STRUC = (A)n(GAA)n |
O |
chr4 |
HTT |
MOTIFS = CAG, CCG; STRUC = (CAG)nCAACAG(CCG)n |
P |
chr14 |
PABPN1 |
MOTIFS = GCG; STRUC = (GCG)n |
Q |
chr5 |
PPP2R2B |
MOTIFS = GCT; STRUC = (GCT)n |
R |
chr4 |
RFC1 |
MOTIFS = AAAAG, AAAGG, AAGGG, AAGAG, AGAGG, AACGG, GGGAC, AAAGGG; STRUC = <RFC1> |
S |
chr6 |
TBP |
MOTIFS = GCA; STRUC = (GCA)n |
T |
chr18 |
TCF4 |
MOTIFS = CAG; STRUC = (CAG)n |
20°³ÀÇ DNA ¹Ýº¹¼¿À» multiplex PCR (10~12 kb)·Î ÁõÆøÇÑ ÈÄ Native Barcoding Kit 96 V14 (Oxford Nanopore Technologies)·Î NGS library¸¦ Á¦ÀÛÇÏ¿© GridION (Oxford Nanopore Technologies) À¸·Î ºÐ¼®ÇÏ¿´´Ù.
PrimeSTAR
¢ç LongSeqÀ¸·Î ÁõÆøÇÑ »ê¹°Àº ´Ù¸¥ long-read PCR È¿¼Ò·Î ÁõÆøÇÑ »ê¹°º¸´Ù Ÿ°Ù¿¡ ´ëÇÑ 1Â÷ ¸®µå (primary read) ºñÀ²ÀÌ ´õ ³ô¾Ò´Ù (±×¸² 6). Ãß°¡ ºÐ¼® °á°ú PrimeSTAR
¢ç LongSeqÀ¸·Î ÁõÆøÇÑ »ê¹°Àº ¿ì¼öÇÑ ½ÃÄö½Ì coverage¿Í ³ôÀº ±ÕÀϼºÀ» ³ªÅ¸³½ ¹Ý¸é, Ÿ»ç Á¦Ç°µéÀº ´Ù¾çÇÑ Å¸°Ù¿¡ ´ëÇÑ ½ÃÄö½Ì coverage°¡ ¾ø°Å³ª ±ØÈ÷ ³·Àº depth¸¦ ³ªÅ¸³»¾ú´Ù (±×¸² 7).
PrimeSTAR
¢ç LongSeqÀ» »ç¿ëÇÏ¿© bias¸¦ ÁÙÀÌ¸é¼ ±ÕÀÏÇÑ ÁõÆøÀ» ÁøÇàÇÒ ¼ö ÀÖ°í ÃÖÀûÈµÈ multiplex ¹ÝÀÀÀ» ¼³°èÇÒ ¼ö ÀÖ¾î ¼Ò·®ÀÇ »ùÇ÷εµ ½ÃÄö½Ì ºÐ¼®ÀÌ °¡´ÉÇÒ °ÍÀ¸·Î ±â´ëÇÑ´Ù.
±×¸² 6. Ÿ»ç ´ëºñ ³ôÀº on-target primary read ºñÀ² (GridION Ç÷§Æû »ç¿ë)
PrimeSTAR
¢ç LongSeqÀº 99% ÀÌ»óÀÇ on-target ÁõÆøÀ²À» º¸¿´°í, ÀÌ´Â long-range multiplex PCR ¿¡¼µµ ¸Å¿ì ƯÀÌÀûÀ¸·Î ÁõÆøÀÌ Àß ÀÌ·ç¾îÁ³À½À» ÀǹÌÇÑ´Ù.
±×¸² 7. Ÿ»ç ´ëºñ ±ÕÀÏÇÑ 20-plex PCR ampliconÀÇ ½ÃÄö½Ì coverage (GridION Ç÷§Æû »ç¿ë)
ÆÐ³Î A. NGS ºÐ¼® °á°ú, PrimeSTAR
¢ç LongSeq (P)À» »ç¿ëÇÑ multiplex ÁõÆø»ê¹°Àº 20°³ ¸ðµÎ¿¡¼ Ÿ°Ù ¼¿ÀÇ Àüü ¿µ¿ªÀ» ±ÕÀÏÇÏ°Ô ÁõÆøÇÏ´Â µ¥ ¼º°øÇÏ¿´´Ù. ¹Ý¸é, Ÿ»ç long-range PCR È¿¼Ò (N, T)´Â ´Ù¼öÀÇ Å¸°Ù¿¡ ´ëÇØ depth°¡ ¾ø°Å³ª ±Øµµ·Î ³·Àº depthÀÇ °á°ú¸¦ º¸¿´´Ù.
ÆÐ³Î B. Depth ±ÕÀϼº¿¡ ´ëÇÑ Á¤·® ºÐ¼® °á°ú, PrimeSTAR
¢ç LongSeqÀ» »ç¿ëÇÏ¿© 20°³ Ÿ°Ù ¸ðµÎ¿¡¼ full-length ¼¿À» ¾ò¾úÀ½À» Ãß°¡·Î È®ÀÎÇÏ¿´´Ù (n=40, 2¹Ýº¹). ¹Ý¸é Ÿ»ç long-range PCR È¿¼Ò´Â Ÿ°ÙÀÇ 53~68%¿¡ ´ëÇØ¼ ¸¸ full-length ¼¿À» ¾òÀ» ¼ö ÀÖ¾ú°í, ÀÌ´Â ½ÃÄö½Ì read ¼ö¸¦ ´Ã·Áµµ °á°ú°¡ Å©°Ô °³¼±µÇÁö ¾Ê¾Ò´Ù (data not shown).
¡á Conclusions
PrimeSTAR
¢ç LongSeq DNA Polymerase´Â ÃÖ´ë 53 kb±îÁöÀÇ long-range PCR¿¡¼ Ÿ»ç µ¿Á¾Á¦Ç° ´ëºñ ¿ì¼öÇÑ ¼º´ÉÀ» ¹ßÈÖÇÏ¿´´Ù. (1) GC/AT rich ¼¿ ÁõÆø, (2) GC rich ¼¿°ú AT rich ¼¿À» µ¿½Ã¿¡ ÁõÆøÇÏ´Â multiplex PCR, ±×¸®°í (3) 4¡É ¿¡¼ 17½Ã°£ ¶Ç´Â ½Ç¿Â¿¡¼ 1½Ã°£ °æ°ú ÈÄ GC/AT rich ¼¿À» ÁõÆøÇØµµ ¸ðµÎ ³ôÀº Á¤È®µµ¸¦ À¯ÁöÇÏ¿´´Ù.
20Á¾ÀÇ DNA ¹Ýº¹¼¿¿¡ ´ëÇÑ multiplex PCR¿¡¼ PrimeSTAR
¢ç LongSeq DNA Polymerase´Â Ź¿ùÇÑ ½ÃÄö½Ì coverage¿Í ±ÕÀÏÇÑ depth¸¦ º¸¿´À¸¸ç, Oxford Nanopore TechnologiesÀÇ long-read NGS ºÐ¼®À» ÅëÇØ ½ÃÁß¿¡ ÆÇ¸ÅÁßÀΠŸ»ç long-range PCR È¿¼Òº¸´Ù ÇöÀúÈ÷ ¿ì¼öÇÑ °á°ú¸¦ º¸ÀÌ´Â °ÍÀ» È®ÀÎÇÏ¿´´Ù.
¡á Methods
¸ðµç PCR ¹ÝÀÀÀº human genomic DNA (100 ng / 50 §¡ ¹ÝÀÀ)¿Í Clontech PCR Thermal Cycler GP (Code WN400)¸¦ ÀÌ¿ëÇÏ¿© °¢ Á¦Á¶»çÀÇ ±ÇÀå ¸Å´º¾ó¿¡ µû¶ó ÁøÇàÇÏ¿´´Ù.
¡á Reference
Ibanez, K. et al. Increased frequency of repeat expansion mutations across different populations.
Nature Medicine 30, 3357-3368 (2024).
º» ¿¬±¸¿¡ »ç¿ëµÈ ultra long-range PCR È¿¼Ò PrimeSTAR LongSeq DNA Polymerase (Code R055A) Á¦Ç° È®ÀÎÇϱâ