DISCUSSION
The translation machinery is centrally poised to support protein synthesis and biomass expansion to which malignant cells are addicted (38). Despite the wide belief that cap-dependent translation drives tumor progression, it is puzzling to find that inhibiting eIF4F alone does not completely eliminate oncogenic translation. Here, we report an additional checkpoint during translation initiation that involves start codon–associated ribosome pausing. Since ribosome initiation pausing selectively occurs on mRNAs involved in cell growth and proliferation, cancer cells apparently take advantage of relaxed ribosome pausing during oncogenic translation. By releasing the initiation brake, malignant cells achieve the maximum translational output as a result of increased ribosome loading to the 5′ end and rapid transition from initiation to elongation at the start codon. The coordination between eIF4F-mediated cap recognition and start codon–associated ribosome pausing fine-tunes the translational flux, which is co-opted by mutant RAS cancer cells to satisfy their increased anabolic demands.
Factors affecting the residence time of the 80S ribosome at the start codon are likely to be multifaceted. In addition to translation initiation factors such as eIF5B (39), distinct sequence features following individual start codons could contribute to the wide range of ribosome pausing across transcriptome. This possibility partially explains why QTI-seq does not capture the initiating ribosomes with the same efficiency (25, 40). Nevertheless, differential initiation pausing on the same transcript must be due to factors beyond sequence. We unexpectedly found that m6A modification in the start codon vicinity serves as a linchpin in this process. A common feature of mRNA methylation is the asymmetric distribution with most of the m6A sites enriched near the stop codon (41, 42). Notably, methylation levels at stop codons and 3′UTR are relatively static (43). By contrast, 5′UTR methylation is subject to dynamic regulation in response to different growth conditions (44). The start codon–associated m6A enrichment was initially observed in plant species (45, 46). A growing number of recent studies started to appreciate the relative enrichment of m6A near start codons in various tissue samples, including cancers (13, 47). However, the physiological significance of region-specific mRNA methylation remains poorly understood. For many transcripts, the strong correlation between the m6A modification near the start codons and initiation pausing represents an additional layer of translational regulation. How exactly mRNA methylation in the start codon vicinity leads to initiation pausing warrants further investigation. We have searched for potential “readers” in m6A-controlled initiation pausing by examining published crosslinking immunoprecipitation (CLIP)-based datasets. However, all YTH family proteins (DF1, DF2, DF3, DC1, and DC2) do not seem to have their binding sites enriched around start codons (48). One plausible mechanism could involve eIF3 that plays critical roles in ribosome scanning and start codon selection. In particular, 80S ribosome–associated eIF3 has been reported to be crucial in translation reinitiation (49). Since eIF3 has been proposed to interact with m6A (50), we hypothesized that eIF3 could possibly retain the 80S ribosome at the start codon via interaction with m6A and subsequently delay the elongation commitment.
Perhaps the most unexpected finding is the decreased mRNA methylation by oncogenic RAS signaling. The mechanistic connection between m6A and cancer-relevant processes was initially suggested from studies linking m6A to cellular differentiation pathways that control the stem cell fate. A series of recent studies reported that mRNA m6A modification plays a critical role in the development of cancers such as AML and glioblastoma (12–14). While depleting m6A methyltransferases promoted tumorigenesis, knocking down FTO or ALKBH5 suppresses tumor progression. These results clearly point to a link between altered m6A levels and tumorigenesis, although few mechanistic details are currently known. It is noteworthy that the mRNA methylation program has a context-dependent effect on tumorigenesis. The consequences of m6A modifications can be complex and dependent on cellular identity as well as differentiation status (51). Depending on cell types, loss of m6A has been shown to yield different outcomes. In addition, tumor initiation and maintenance could involve different cellular programs. It will be interesting to demonstrate the role of m6A in early and late stages of tumor progression. The reduced METTL3 expression during RAS oncogenic translation is in line with the poor survival in patients with pancreatic cancer (Fig. 4D). In addition, the missense mutations of m6A methyltransferases found in pancreatic and endometrial cancers are associated with reduced methyltransferase activities (15). Collectively, oncogenic RAS signaling drives cancer development and progression by coordinating increased ribosome loading and relaxed initiation pausing.
The complex phenotype of m6A in cancer biology could also be attributed to the pleiotropic effects of mRNA methylation in cellular processes ranging from mRNA splicing, polyadenylation, mRNA export, translation, to degradation. However, in cells with oncogenic RAS signaling, we found limited changes of mRNA steady-state levels despite reduced mRNA methylation (Fig. 2). It is also worth noting that altering global m6A levels has limited effects on tumorigenesis of pancreatic cancer cells with wild-type RAS (Fig. 6D), arguing against the global effects of altered m6A levels. It further substantiates the notion that the translational effects of initiation pausing are coupled with oncogenic signaling. The discovery that m6A modification in the vicinity of start codons influences the behavior of initiating ribosomes greatly expands the breadth of physiological roles of “epitranscriptomics” in translational regulation.
Mutant RAS-driven cancers are extremely refractory to standard chemotherapeutic treatments (52, 53). There is a dire need for new therapeutic approaches that are likely based on a better understanding of the biology of this disease. The functional connection between reduced mRNA methylation and relaxed initiation pausing during oncogenic translation creates vulnerabilities for mutant RAS cancer cells that could be exploited as a therapeutic strategy. In this regard, FTO knockdown or FTO inhibitors offer a promising strategy for reversing oncogenic translation (36, 54). Our study demonstrates the feasibility of attenuated FTO demethylation for the inhibition of oncogenic translation. With the functional diversity of mRNA methylation on the rise, it will be highly desirable to control m6A modification in a transcript- and site-specific manner. We envision that compounds capable of adjusting ribosome initiation pausing may increase the efficacy of current cancer therapy regime.
MATERIALS AND METHODS
Cell lines and reagents
Primary TtH cells were stably infected to generate polyclonal populations with retrovirus derived from pBabe Puro by subcloning into complementary DNA (cDNA) encoding fused ER:HRasG12V. The primary TtH cells and ER:HRasG12V TtH cells were maintained in Dulbecco’s modified Eagle’s medium (DMEM) with 10% fetal bovine serum (FBS). ER:HRasG12V TtH cells were treated with 1 μM 4-OHT (Sigma-Aldrich) for 48 hours for oncogenic HRasG12V expression; BxPC-3 (KRasWT), and AsPC-1 (KRasG12D) cells were grown in RPMI 1640 medium with 10% FBS; CFPAC-1 (KRasG12V) cells were grown in Iscove’s modified Dulbecco’s medium with 10% FBS. Antibodies used in this study are listed as follows: anti–pan-Ras [Milipore MABS195, 1:1000 Western blotting (WB)], anti-Akt (Cell Signaling Technology, 9272; 1:1000 WB), anti–phospho (Ser473)–Akt (Cell Signaling Technology, 9271; 1:1000 WB), anti-p70 S6 kinase (Cell Signaling Technology, 9202; 1:1000 WB), anti–phospho-S6 (Thr389) kinase (Cell Signaling Technology, 9205; 1:1000 WB), anti–4E-BP1 (Cell Signaling Technology, 9452; 1:1000 WB), anti–phospho (Thr37/46) 4E-BP1 (Cell Signaling Technology, 2855; 1:1000 WB), anti-METTL3 (Abnova, H00056339-B01P; 1:1000 WB), anti-METTL14 (Sigma-Aldrich, HPA038002; 1:1000 WB), anti-WTAP (Santa Cruz Biotechnology, sc-374280; 1:1000 WB), anti-FTO (PhosphoSolutions, 597-FTO; 1:1000 WB), anti-ALKBH5 (Proteintech, 16837-1-AP; 1:1000 WB), anti-YTHDF1 (Proteintech, 17479-1-AP; 1:1000 WB), anti-YTHDF2 (Proteintech, 24744-1-AP; 1:1000 WB), anti-m6A (Millipore, ABE572; 1:1000 WB), anti-puromycin (Developmental Studies Hybridoma Bank, PMY-2A4; 1:100 WB) and anti–β-actin (Sigma-Aldrich, A5441; 1:2000 WB).
Lentiviral short hairpin RNAs
Short hairpin RNA (shRNA) targeting sequences based on The RNAi Consortium at Broad Institute (https://portals.broadinstitute.org/gpp/public/) are listed below: METTL3 (human): 5′-ATTCTGTGACTATGGAACCA-3′; FTO (human): 5′-GCCAGTGAAAGGGTCTAATAT-3′; and scramble control sequence: 5′-AACAGTCGCGTTTGCGACTGG-3′. shRNA targeting sequences were cloned into DECIPHER pRSI9-U6-(sh)-UbiC-TagRFP-2A-Puro (Cellecta, CA). Lentiviral particles were packaged using Lenti-X 293 T cells (Clontech). Virus-containing supernatants were collected at 48 hours after transfection and filtered to eliminate cells. Cells were infected by the lentivirus for 48 hours before selection by puromycin (1 to 2 μg ml−1).
Immunoblotting
Immunoblotting was conducted using a method described previously (44). In brief, cells were lysed on ice in tris-buffered saline (TBS) buffer [50 mM tris (pH 7.5), 150 mM NaCl, and 1 mM EDTA] containing a protease inhibitor cocktail tablet, 1% Triton X-100, and deoxyribonuclease (2 U ml−1). After incubating on ice for 30 min, the lysates were heated for 5 min, 95°C in SDS–polyacrylamide gel electrophoresis (SDS-PAGE) sample buffer [50 mM tris (pH 6.8), 100 mM dithiothreitol (DTT), 2% SDS, 0.1% bromophenol blue, and 10% glycerol]. Proteins were separated on an SDS–polyacrylamide gel and transferred to Immobilon-P membranes (Millipore). Membranes were blocked for 1 hour in TBS containing 5% nonfat milk and 0.1% Tween 20, followed by incubation with primary antibodies overnight at 4°C. After incubation with horseradish peroxidase (HRP)–coupled secondary antibodies at room temperature for 1 hour, immunoblots were visualized using enhanced chemiluminescence (ECLPlus, GE Healthcare).
Soft agar colony formation assay
Bottom agar layer was made with 0.8% noble agar (Sigma-Aldrich, A5431) in growth medium with 10% FBS in a six-well plate (1.5 ml per well). Cells were detached with trypsin-EDTA, washed twice with Dulbecco’s Phosphate Buffered Saline (DPBS) and counted, and 0.4% noble agar in growth medium mixed with single suspended cells (104 per well) was placed on top of a solidified bottom agar. Cells were fed twice a week with 500-μl growth medium with 10% FBS. After 3 to 4 weeks of growing, the colonies were counted and imaged using a dissecting microscope.
Xenograft injections
Cells at 70 to 80% confluence were changed with fresh medium to remove the dead cells 3 to 4 hours before harvesting, then detached with trypsin-EDTA, and washed twice with DPBS. Cells were counted after collection, and 5 × 106 cells were suspended in 50-μl DPBS in combination with 50-μl Matrigel (BD Biosciences, 356234). A total of 100 μl was injected subcutaneously into each side of the lower flank of the SCID-Beige mice (6 to 8 weeks of age) using insulin syringes. For testing the therapeutic potential of FTO inhibitor, 100-μl FB23-2 [dissolved in dimethyl sulfoxide (DMSO), 10 mg/kg mouse weight] or same volume of DMSO was injected intraperitoneally into mice every other day. Mice were examined every other day for evidence of tumor growth. Tumor diameters were measured with calipers, and the tumor volume in cubic millimeter was calculated by the formula: Volume = (width)2 × length/2. Tumor tissues were excised after tumors reached a diameter of 10 mm (3 to 4 weeks). Mice were housed in a temperature- and humidity-controlled facility with a 12-hour light-dark cycle. The experimental protocols were approved by Cornell Institutional Animal Care and Use Committee (no. 2008-0167).
Orthotopic mouse model of pancreatic tumor growth
All animals were maintained in facilities approved by the Association for Assessment and Accreditation of Laboratory Animal Care International in accordance with the current regulations and standards of the U.S. Department of Agriculture and Department of Health and Human Services. The sample sizes of the animals were justified by statistical considerations and statistical power analyses. The animals were randomized to different experimental groups. For tumorigenic assay using an orthotopic mouse model, AsPC-1, CFPAC-1, or BxPC-3 cells (1 × 106 cells per mouse, 5 mice per group) in 0.1 ml of Hanks’ balanced salt solution were injected into the pancreases of 7- to 8-week-old female athymic BALB/c nude mice (National Cancer Institute, Fredrick, MD), respectively. The mice were euthanized 4 to 5 weeks after tumor cell inoculation, and their tumors were harvested and weighed. The investigators were blinded to allocation during experiments and outcome assessment.
In vitro m6A methyltransferase activity assay
ER:HRasG12V cells with or without 48-hour 4-OHT treatment (1 μM) were lysed in denaturing lysis buffer [20 mM tris-HCl (pH 7.5), 137 mM NaCl, 1% NP-40, and 2 mM EDTA] after three times wash with cold DPBS, then incubated at 4°C for 30 min with rotation, followed by centrifuging at 12,000 rpm for 15 min. The supernatant were collected and the concentration of total protein was measured using 660-nm protein assay reagent (Pierce). The in vitro methylation assay was performed in a 50-μl reaction mixture containing 400 nM RNA probe (commercially synthesized in vitro, Thermo Fisher Scientific), cell lysate with same amount of total protein, 20 mM tris (pH 7.5), 50 μM ZnCl2, 1 mM DTT, RNaseOUT (0.2 U/μl), 1% glycerol, and 0.5 μCi [methyl-3H]AdoMet (PerkinElmer). The reaction was incubated at 30°C for 1 hour and then stopped by adding TRIzol reagent (Invitrogen). RNA after reaction was precipitated and purified using sodium acetate at −20°C for at least 2 hours. The precipitated RNA was subjected to radioactivity measurement using scintillation counting (Beckman). Levels of 3H-methyl–incorporated RNA are shown as disintegrations per minute.
Real-time quantitative polymerase chain reaction
Real-time quantitative polymerase chain reaction (qPCR) was conducted using a method described previously (44). In brief, total RNA was isolated with TRIzol reagent (Invitrogen), and reverse transcription was performed using the High Capacity cDNA Reverse Transcription Kit (Invitrogen). Real-time PCR analysis was conducted using Power SYBR Green PCR Master Mix (Applied Biosystems) and carried on a LightCycler 480 Real-Time PCR System (Roche Applied Science). All primers used in this study are listed in table S1.
Real-time luciferase assay
Real-time luciferase assay was conducted using a method described previously (44). In brief, cells grown in 35-mm dishes were transfected with 1-μg plasmids including luciferase gene (PGL3-luciferase or pcDNA3-CMV-luciferase) at 70 to 80% confluence. Luciferase substrate d-luciferin (1 mM, Regis Technologies) was added into the culture medium immediately after transfection. Luciferase activity was monitored continuously and recorded using Kronos Dio Luminometer (Atto).
Puromycin labeling
As described previously (55), cells at 70 to 80% confluence were treated with puromycin (10 μg ml−1) for 10 min. After washing twice with ice-cold DPBS, cells were lysed with SDS-PAGE sample buffer, and proteins were separated on SDS-PAGE and transferred to Immobilon-P membranes. Membranes were blocked for 1 hour in TBS containing 5% nonfat milk and 0.1% Tween 20, followed by incubation with puromycin antibodies (1:100 dilution) overnight at 4°C. After incubation with HRP-conjugated anti-mouse immunoglobulin G (IgG) (1:5000 dilution) for 1 hour at room temperature, the membrane was visualized using enhanced chemiluminescence.
[35S] methionine radiolabeling
ER:HRasG12V cells with or without 4-OHT treatment (1 μM for 48 hours) and Torin1 treatment (10 nM or 100 nM for 2 hours) were washed with DPBS before incubation in methionine-free DMEM for 15 min. Cells were resuspended in labeling media {methionine-free DMEM supplemented with 10% FBS and methionine (30 μCi ml−1 [35S])} for 15 min. Labeling was stopped by ice-cold DMEM containing cycloheximide (100 μg ml−1). Cells were washed with DPBS containing cycloheximide (100 μg ml−1), lysed with SDS sample loading buffer. Cell lysates were heated and then resolved on a 10% tris-glycine SDS-PAGE gel, and radiography captured was by Typhoon 9400.
m6A dot blotting
Total cellular RNAs were isolated with TRIzol reagent, and mRNAs were purified using Dynabeads Oligo (dT)25 (Thermo Fisher Scientific). Equal amounts of mRNA were spotted to a Hybond-N+ membrane (GE Healthcare), followed by ultraviolet (UV) cross-linking at UV 254 nm (0.12 J/cm2). After blocking in DPBS containing 5% nonfat milk and 0.1% Tween 20 for 1 hour, the membrane was incubated with anti-m6A antibody (1:1000 dilution) overnight at 4°C. The membrane was incubated with HRP-conjugated anti-rabbit IgG (1:5000 dilution) at room temperature for 1 hour and visualized using enhanced chemiluminescence.
Preparation of cell lysates for Ribo-seq and QTI-seq
As described previously (25), at least four 10-cm dishes of cells were harvested in 400-μl ice-cold polysome buffer [10 mM Hepes (pH 7.4), 100 mM KCl, and 5 mM MgCl2] containing cycloheximide (100 μg ml−1, for Ribo-seq) or lactimidomycin (5 μM, for QTI-seq). As described previously (25), cells were then disrupted by vortexing six times for 20 s using lysing matrix D (Fisher), followed by a 40-s interval each time on ice and then centrifuged at 12,000g, 4°C for 10 min. For QTI-seq, to dissociate noninitiating ribosomes, cell lysates were incubated in a solution containing 16 mM Hepes buffer (pH 7.4), 10 mM creatine phosphate, 0.1 mM spermidine, creatine phosphokinase (40 μg ml−1), 0.8 mM adenosine triphosphate (ATP), and 25 μM puromycin at 35°C for 15 min. Samples were then subjected to sucrose gradient sedimentation.
Polysome profiling
As described previously (25), sucrose solutions were prepared in polysome buffer [10 mM Hepes (pH 7.4), 100 mM KCl, 5 mM MgCl2, and 2% Triton X-100]. Fifteen to 45% (w/v) sucrose density gradients were freshly prepared in SW41 ultracentrifuge tubes (Backman) using a Gradient Master (BioComp Instruments). Five hundred microliters of supernatant from cell lysates prepared as described above was loaded onto sucrose gradients followed by centrifugation for 2.5 hours at 32,000 rpm, 4°C in a SW41 rotor. Separated samples were fractionated at 1.5 ml min−1 through an automated fractionation system (Isco) that continually monitors values of optical density at 254 nm.
RNA-seq and m6A-seq
Both RNA-seq and m6A-seq procedures have been described previously (44). For RNA-seq, total RNA was first isolated using TRIzol reagent followed by fragmentation using freshly prepared RNA fragmentation buffer [10 mM tris-HCl (pH 7.0) and 10 mM ZnCl2]. Five-microgram fragmented RNA was saved as input control. For m6A IP, 1-mg fragmented RNA was incubated with 15-μg anti-m6A antibody (Millipore ABE572) in 1× IP buffer [10 mM tris-HCl (pH 7.4), 150 mM NaCl, and 0.1% Igepal CA-630] for 2 hours at 4°C. The m6A-IP mixture was then incubated with protein A beads for additional 2 hours at 4°C on a rotating wheel. After washing three times with IP buffer, bound RNA was eluted using 100-μl elution buffer (6.7 mM N6-methyladenosine 5′-monophosphate sodium salt in 1× IP buffer), followed by ethanol precipitation. Precipitated RNA was used for cDNA library construction and high-throughput sequencing described below.
SELECT
All primers used in SELECT are listed in table S2. Five-microliter total RNA was mixed with 40 nM Up Primer, 40 nM Down Primer, and 5 μM deoxynucleotide triphosphate (dNTP) in 17-μl 1× CutSmart buffer [50 mM KAc, 20 mM tris-HAc, 10 mM MgAc2, and bovine serum albumin (100 μg/ml; pH 7.9) at 25°C]. The RNA and primers were annealed by incubating at 90°C for 1 min, 80°C for 1 min, 70°C for 1 min, 60°C for 1 min, 50°C for 1 min, and then 40°C for 6 min. Subsequently, 3 μl of mixture containing 0.01 U of Bst 2.0 DNA polymerase, 0.5 U of SplintR ligase, and 10 nmol of ATP was added to the final volume of 20 μl. The final reaction mixture was incubated at 40°C for 20 min and then denatured at 80°C for 20 min. Ten-microliter qPCR reaction was set up with 2× Power SYBR Green PCR Master Mix (Applied Biosystems), 200 nM qPCR-F primer, 200 nM qPCR-R primer, and 2.5 μl of the final reaction mixture. qPCR was run at the following conditions: 95°C, 1 min; (95°C, 10 s; 60°C, 45 s) × 45 cycles.
cDNA library construction
As described previously (25), Escherichia coli ribonuclease I (Ambion, 750 U per 100 units of absorbance at 260 nm) was added into the pooled fractions from ribosome profiling and incubated at 4°C for 1 hour, and then total RNAs were extracted using TRIzol reagent. RNA extracts (Ribo-seq and QTI-seq), fragmented RNAs (RNA-seq), and m6A-IP elutes (m6A-seq) were dephosphorylated for 2 hours at 37°C in a 15-μl reaction (1× T4 polynucleotide kinase buffer, 10 U of SUPERase_In, and 20 U of T4 polynucleotide kinase). The products were separated on a 15% polyacrylamide tris-borate EDTA (TBE)–urea gel (Invitrogen) and visualized using SYBR Gold (Invitrogen). Selected regions in the gel corresponding to 40 to 60 nt (for RNA-seq and m6A-seq) or 25 to 35 nt (for Ribo-seq and QTI-seq) were excised. The gel slices were disrupted by using centrifugation through the holes at the bottom of the tube. RNA fragments were dissolved by soaking overnight in 400-μl RNA elution buffer [300 mM NaOAc (pH 5.2), 1 mM EDTA, and SUPERase_In (0.1 U ml−1)]. The gel debris was removed using a Spin-X column (Corning), followed by ethanol precipitation. Purified RNA fragments were resuspended in nuclease-free water. Poly(A) tailing reaction was carried out for 45 min at 37°C [1× poly(A) polymerase buffer, 1 mM ATP, SUPERase_In (0.75 U μl−1), and 3 U of E. coli poly(A) polymerase].
For reverse transcription, the oligos containing barcodes were listed in table S3. In brief, the tailed-RNA sample was mixed with 0.5 mM dNTP and 2.5 mM synthesized primer and incubated at 65°C for 5 min, followed by incubation on ice for 5 min. The reaction mix was then added with 20 mM tris (pH 8.4), 50 mM KCl, 5 mM MgCl2, 10 mM DTT, 40 U of RNaseOUT, and 200 U of SuperScript III. Reverse transcription reaction was performed according to the manufacturer’s instruction. Reverse transcription products were separated on a 10% polyacrylamide TBE-urea gel as described earlier. The extended first-strand product band was expected to be approximately 100 nt, and the corresponding region was excised. The cDNA was recovered by using DNA gel elution buffer (300 mM NaCl and 1 mM EDTA). First-strand cDNA was circularized in 20 μl of reaction containing 1× CircLigase buffer, 2.5 mM MnCl2, 1 M betaine, and 100 U of CircLigase II (Epicentre). Circularization was performed at 60°C for 1 hour, and the reaction was heat-inactivated at 80°C for 10 min and then was precipitated by ethanol.
Deep sequencing
As described previously (25), circular template was amplified by PCR by using the Phusion high-fidelity (HF) enzyme (New England Biolabs) according to the manufacturer’s instructions. The oligonucleotide primers listed in table S3 were used to create DNA suitable for sequencing. The PCR contains 1× HF buffer, 0.2 mM dNTP, 0.5 μM oligonucleotide primers, and 0.5 U of Phusion polymerase. PCR was carried out with an initial 30-s denaturation at 98°C, followed by 12 cycles of 10-s denaturation at 98°C, 20-s annealing at 60°C, and 10-s extension at 72°C. PCR products were separated on a nondenaturing 8% polyacrylamide TBE gel as described earlier. Expected DNA at 120 base pairs was excised and recovered as described earlier. After quantification by Agilent BioAnalyzer DNA 1000 assay, equal amounts of barcoded samples were pooled into one sample. Approximately 5 pM mixed DNA samples were used for cluster generation followed by sequencing by using sequencing primer 5′-CGACAGGTTCAGAGTTCTACAGTCCGACGATC-3′ (Illumina HiSeq).
Alignment of sequencing reads
The 3′ adapters and low-quality bases were trimmed by Cutadapt as described previously (48). The trimmed reads with length <15 nt were excluded. For Ribo-seq and QTI-seq, reads without adapters were discarded. The remaining reads were mapped to the human transcriptome using Bowtie with parameters: -a –best -m1 –strata. The annotation file downloaded from ENSEMBL database (GRCh38) was used to construct the transcriptome index file. For each gene, the transcript with longest CDS was selected. In the case of equal CDS length, the longest transcript was used. For read alignment, two mismatches were permitted. To avoid ambiguity, the reads mapped to multiple positions were disregarded for further analyses. The 13th, 14th, and 15th positions (12-nt offset from the 5′ end) of the resulting uniquely mapped read was defined as the ribosome P-site position. For Ribo-seq, the reads mapped to CDS were used to calculate the RPKM (reads per kilobase of transcript per million mapped reads) values for translation levels. For RNA-seq, the reads mapped to entire transcript were used to calculate RPKM. Transcripts with RPKM <1 were excluded. Translation efficiency was defined as the ratio of RPKM of Ribo-seq over RPKM of RNA-seq.
Calculation of PI at the start codon
For each transcript, QTI-seq reads with the P-site in a window centering the annotated start codon (−5, +5 nt) were used to represent the abundance of translation initiation signal at the start codon, which we termed aTIS density. PI was defined as the ratio of mean aTIS density in the window over mean read captured by QTI-seq in the remaining CDS region. To reduce the noises from insufficiently translating mRNAs, which usually result in a large variation on PI estimation, we only considered the mRNAs that meet the following criteria: (i) RPKM estimated by Ribo-seq was higher than 1. (ii) aTIS density in the window either in wild-type or 4-OHT treatment sample was higher than 3. For the transcripts with 0 value of aTIS density, a pseudo-number 1 was used. The mean value by averaging PIs of two replicates was used in analyses. The transcripts with top (bottom) 20% of aTIS PI were defined as high (low) aTIS PI transcripts.
m6A sites identification and aggregation plot
m6A sites were identified using the method described previously (48). m6A coverages at individual sites were normalized by the mean coverage of the transcript. To avoid possible biases caused by insufficient coverage, analyses were limited to the transcripts with at least 30 reads per 1000 nt. Then, all transcripts were divided into three segments: 5′UTR, CDS, and 3′UTR. Each segment was then divided into 100 bins with equal length. All transcripts were aggregated by averaging the normalized m6A coverages across all available transcripts embracing the same bins. Besides aggregation plot in normalized mRNA regions, we also aggregated m6A coverages around the start codon (−300, +400), without normalizing the length of mRNA regions.
RNA secondary structure analysis
A sliding window of 30 nt with a step of 3 nt was used to calculate RNA minimum fold free energy (MFE) along transcript. For each window, MFE was calculated by ViennaRNA (56), using default parameters. For aggregation plot, a mean MFE in each window was calculated by averaging MFE values of the windows at the same position. DMS-seq data were used to investigate mRNA folding in vivo. DMS signal and Gini score were calculated using the methods in the work of Rouskin et al. (57). PARS data were used to analyze mRNA secondary structure in vitro (58).
Gene GO analysis
DAVID was used for GO enrichment analysis. The genes with fold change higher than 2 or lower than 0.5 were used for GO analysis. The expressed genes (FPKM >1) were used as the background gene set. Pathway analysis was performed by ClueGO (59) based on the annotation of Reactome (60).
Code availability
All statistical analyses were performed by R. The heatmap plots were made by heatmap.2 in R. All procedures but sequencing mapping were completed using custom Perl scripts. These scripts are available upon request.