HBV RNA pre-genome encodes specific motifs that mediate interactions with the viral core protein that promote nucleocapsid assembly

Patel, Nikesh; White, Simon J.; Thompson, Rebecca F.; Bingham, Richard; Weiß, Eva U.; Maskell, Daniel P.; Zlotnick, Adam; Dykeman, Eric C.; Tuma, Roman; Twarock, Reidun; Ranson, Neil A.; Stockley, Peter G.

doi:10.1038/nmicrobiol.2017.98

Download PDF

Article
Published: 19 June 2017

HBV RNA pre-genome encodes specific motifs that mediate interactions with the viral core protein that promote nucleocapsid assembly

Nikesh Patel¹^na1,
Simon J. White¹^na1,
Rebecca F. Thompson¹,
Richard Bingham²,
Eva U. Weiß²,
Daniel P. Maskell¹,
Adam Zlotnick³,
Eric C. Dykeman²,
Roman Tuma ORCID: orcid.org/0000-0003-0047-0013¹,
Reidun Twarock ORCID: orcid.org/0000-0002-1824-2003²,
Neil A. Ranson¹ &
…
Peter G. Stockley¹

Nature Microbiology volume 2, Article number: 17098 (2017) Cite this article

5403 Accesses
63 Citations
97 Altmetric
Metrics details

Subjects

Abstract

Formation of the hepatitis B virus nucleocapsid is an essential step in the viral lifecycle, but its assembly is not fully understood. We report the discovery of sequence-specific interactions between the viral pre-genome and the hepatitis B core protein that play roles in defining the nucleocapsid assembly pathway. Using RNA SELEX and bioinformatics, we identified multiple regions in the pre-genomic RNA with high affinity for core protein dimers. These RNAs form stem-loops with a conserved loop motif that trigger sequence-specific assembly of virus-like particles (VLPs) at much higher fidelity and yield than in the absence of RNA. The RNA oligos do not interact with preformed RNA-free VLPs, so their effects must occur during particle assembly. Asymmetric cryo-electron microscopy reconstruction of the T = 4 VLPs assembled in the presence of one of the RNAs reveals a unique internal feature connected to the main core protein shell via lobes of density. Biophysical assays suggest that this is a complex involving several RNA oligos interacting with the C-terminal arginine-rich domains of core protein. These core protein–RNA contacts may play one or more roles in regulating the organization of the pre-genome during nucleocapsid assembly, facilitating subsequent reverse transcription and acting as a nucleation complex for nucleocapsid assembly.

In vitro functional analysis of gRNA sites regulating assembly of hepatitis B virus

Article Open access 16 December 2021

Nikesh Patel, Sam Clark, … Peter G. Stockley

Short- and long-range interactions in the HIV-1 5′ UTR regulate genome dimerization and packaging

Article Open access 28 March 2022

Liqing Ye, Anne-Sophie Gribling-Burrer, … Redmond P. Smyth

Structural insight into Marburg virus nucleoprotein–RNA complex formation

Article Open access 04 March 2022

Yoko Fujita-Fujiharu, Yukihiko Sugita, … Takeshi Noda

The World Health Organization reports that hepatitis B virus (HBV) has infected more than 2 billion people worldwide¹. In adults, most infections are acute. However, approximately 240 million people live with a chronic infection that can ultimately lead to liver failure, cirrhosis or cancer, resulting in more than 700,000 deaths annually². The availability of an effective vaccine³ has decreased the spread of HBV but is not curative for chronic infections. Standard treatment using nucleos(t)ide analogues directed against the viral polymerase rarely leads to a cure, and is thus a lifelong therapy⁴. A better understanding of HBV will help identify and characterize additional drug targets that could lead to new curative therapies.

HBV is a para-retrovirus—a DNA virus that initially packages an RNA form of its genome, the pre-genome^5,6. In an infected cell, the basis of infection is viral, covalently closed, circular DNA (cccDNA) in the nucleus, a persistent, chromatinized episome whose protein complement also includes the HBV core or capsid protein (Cp)^7,8. It is 3,200 bp long and encodes four overlapping reading frames for polymerase (P), surface proteins (three different sizes are translated, collectively referred to as HBsAg or surface antigen), the cell regulatory factor protein X, and the core and pre-core proteins (HBcAg and HBeAg, respectively) (Fig. 1a). The P, Cp and HBeAg proteins are translated from the same RNA, the positive-sense, pre-genomic RNA (pgRNA), which also serves as the template for the reverse transcription reaction. The pgRNA is a terminally redundant transcript covering about 3,500 nucleotides, but is otherwise a typical mRNA. Most of the pgRNA is not spliced concomitant with export from the nucleus^9,10, suggesting a novel export mechanism, presumably involving the nuclear import and export signals on Cp^11–13.

In vivo assembly of an HBV nucleocapsid (NC) begins with a pgRNA–P protein complex that is required for pgRNA packaging. A correctly folded P and a functional stem-loop, termed epsilon (ε), located near the 5′ end of pgRNA are necessary for this process^14–19. Cp phosphorylation is associated with RNA packaging^20–22. Once encapsidated, P protein begins reverse transcription by priming DNA synthesis, adding the first three to four deoxynucleotides while bound to ε, before jumping to the 3′ end of the genome to complete synthesis of the minus strand. Three such template transfers are required for synthesis of the relaxed circular, double-stranded DNA (dsDNA) of mature HBV within the NC. Most of the RNA template is digested by the RNaseH domain of P protein during minus strand DNA synthesis. A sequence, phi (ϕ), at the 3′ end of the pre-genome complementary with ε (refs 23,24) is believed to facilitate strand transfer. Low-resolution structural studies show that pre-genomic RNA forms a thin shell associated with the inner surface of the NC and that P is internal, suggesting that it travels on an RNA track to complete DNA reverse transcription^25,26. The mature virion is enveloped by a host-derived membrane containing embedded HBsAg, which encloses an icosahedral NC with either T = 4 (∼95%) or T = 3 (5%) quasi-symmetry²⁷. Similar ratios of T = 4 to T = 3 capsids are observed in many expression systems and following in vitro assembly^28,29. NC is composed of dimers of the ∼183-residue Cp (Fig. 1b,c), organized as a shell-forming N-terminal domain of 149 residues connected via a linker region to a C-terminal arginine-rich domain (ARD).

Although HBV is ostensibly a DNA virus, we reasoned that the physics and functions of Cp–RNA interaction in HBV would resemble those found in RNA viruses. We recently uncovered a previously unsuspected principle of the assembly mechanisms of positive-sense, single-stranded RNA viruses that challenges the prevailing view that genomic RNAs are merely passive passengers in a process driven by viral coat proteins^30–33. Instead, it appears that many viral genomes encompass cryptic, sequence-degenerate, dispersed RNA packaging signals. Packaging signals have affinity for their cognate coat proteins and can act collectively to ensure encapsidation of cognate genomic RNA, while building capsids rapidly and with great fidelity at low concentrations. Mathematical modelling of such packaging signal-mediated assembly³⁴ suggests that it confers many selective advantages and would therefore be expected to occur widely throughout nature. This appears to be the case for viruses infecting humans^35,36, plants³⁷ and bacteria³³.

As HBV packages a pgRNA during assembly, we hypothesize that similar mechanistic constraints may contribute to formation of its NC. HBV RNA must be packaged in a manner that supports reverse transcription and this could be facilitated by packaging signal-like RNA motifs. In HBV we may more accurately redefine a packaging signal as a preferred site (PS) for Cp binding. We therefore investigated whether the HBV pgRNA also encodes such PSs. Due to their nature, preferred sites are difficult to identify by sequence analysis alone. We developed a novel approach that combines experimental and bioinformatics methods. We used RNA SELEX against HBV Cp to generate a library of sequences with affinity for Cp. These aptamer sequences were then aligned across the cognate viral pgRNA, revealing genomic regions with sequence similarity to the aptamer pool capable of forming stem-loop structures, that is, potential PSs. These sites are conserved across strain variants and each displays a RGAG sequence motif in the loop (R = purine). Individual genomic fragments encompassing these PSs show high-affinity, sequence-specific interaction with Cp, as demonstrated by their ability to induce the formation of closed virus-like particles (VLPs) in vitro. Asymmetric cryo-electron microscopy (cryo-EM) reconstruction of these VLPs suggests that they contain a group of PS oligonucleotides interacting with Cp principally via the C-terminal ARDs. Packaging signal-like sites in the pre-genome may therefore play a role in favouring formation of an assembly competent form of Cp, effectively creating an assembly initiation complex for NC and specifying the quasi-symmetry of the capsid. Inhibiting formation of this complex could therefore be an antiviral strategy.

Results

The HBV pgRNA contains preferred Cp binding sites

HBV VLPs assembled from (full-length) Cp subunits expressed in Escherichia coli were purified as described in ref. 36 (Supplementary Fig. 1a and Supplementary Table 1). They form a mixture of T = 3 and predominantly T = 4 shells. These were immobilized onto magnetic beads, disassembled by treatment with guanidinium chloride, and then washed to remove host RNA, resulting in immobilized Cp dimers³⁸ with their ARDs accessible. RNA SELEX was carried out using our standard protocols (Supplementary Fig. 1b) and the aptamer pool from the 10th round was analysed by NextGen DNA sequencing (see Methods).

The RNA sequences that bind Cp in the selected library were aligned to the HBV pre-genome most closely related to the protein used for the SELEX experiments (the laboratory strain, GenBank Seq ID NC_003977.1; ref. 25). Statistically significant matches (a Bernoulli score of 12 or more; see Methods) to the pgRNA of this strain (blue peaks in Fig. 2a) were benchmarked against an alignment of the unselected library (grey curve in Fig. 2a) to identify peaks that occur with significant frequency. This identifies multiple sites dispersed across the pgRNA that have similar sequences/structures to Cp-binding aptamers, consistent with our expectation for PS-like sites across the genome. We applied the same procedure to 14 randomly selected HBV strain variants from GenBank, the current NCBI HBV reference strain (GenBank Seq ID NC_003977.2) as well as the laboratory strain (GenBank Seq ID NC_003977.1) and identified all those peaks that are conserved in at least 80% of these strains (marked with green crosses in Fig. 2a). These genomic regions are thus likely to encompass PSs. The three peaks with the highest conservation (100%) and peak heights, the latter indicating how many aptamers matched these sites, are labelled PS1, PS2 and PS3 in Fig. 2a. For the nine sites with high conservation between strains, we extracted 30 nts 5′ and 3′ to the peak nucleotide in the genomic sequences of three representative strain variants, including the laboratory strain and the reference genome, and considered all their possible secondary structure folds with negative free energy via Mfold (see Methods). A similarity analysis of primary and secondary structure revealed the predicted existence of stem-loops sharing a purine-rich loop recognition motif, RGAG (Fig. 2b).

**Figure 2: Identification of conserved PS motifs in the pgRNA.**

We computed the frequency of this motif in stem-loops across the 16 HBV strains analysed. Across all strains, the RGAG motif occurs in stem-loops on average ∼25.4 times (precisely 25 times in the laboratory strain). Compared to 10,000 randomized versions of the pgRNAs, the frequency of the occurrence of RGAG in the actual genome is 4.68 standard deviations above the average (Fig. 2c), strongly implying a functional role(s).

pgRNA oligonucleotides trigger VLP formation in vitro

PS1, 2 and 3 oligonucleotides (Supplementary Fig. 2a) were tested for their ability to bind Cp dimers using single-molecule fluorescence correlation spectroscopy (smFCS) (Fig. 3 and Supplementary Fig. 2b). This technique yields a real-time estimate of the hydrodynamic radius (R_h) of dye-labelled species. Importantly, it allows reactions to be followed at low nanomolar concentrations, where we have shown that binding specificity more closely reflects the situation in vivo compared to most in vitro reactions. The latter are typically carried out at higher (for example, 0.1–0.8 µM) concentrations³⁸, where the specificity of PS-mediated assembly is reduced or lost. To avoid electrostatic effects due to differing oligo lengths, each PS was produced as part of a 47 nt long fragment, each dye-labelled at its 5′ end (see Methods³⁷). The labelled oligos (∼15 nM) were then titrated with increasing amounts of Cp (5–250 nM Cp dimer) and the R_h values were tracked over time (Fig. 3a). After each addition there was a pause of ∼10 min to allow reactions to equilibrate. The titrations led to distortions in the data collection and averaging, which are visible in the plots as noisy signals. After equilibration at 250 nM Cp, RNase was added to each reaction and the R_h values were monitored for ∼10 min. If these declined steeply, it was assumed that the VLPs produced were incomplete. Negative-stain EM images were obtained for the samples before RNase addition, and the sizes of the complexes present at this point were also assessed by calculation of R_h distribution plots (Supplementary Fig. 2c and Fig. 3b, respectively).

**Figure 3: PSs trigger sequence-specific VLP assembly.**

Each of the PS fragments stimulates the assembly of both T = 3 and T = 4 complete VLPs, with roughly equal efficiency, under these conditions (Fig. 3a,b), with the latter being the dominant product, as expected²⁹. Addition of Cp > 250 nM does not increase the R_h values obtained, implying that by this stage all the RNAs have been incorporated into VLPs. To assess whether these effects are a direct consequence of Cp–PS interaction, we carried out a number of controls. Dye-labelled PS fragments do not bind to pre-formed VLPs and remain RNase-sensitive in their presence (Supplementary Table 2), implying that the PSs only get internalized in assembling VLPs. To determine if the RNA triggers assembly, we compared the assembly efficiency of Cp with and without PS RNA present by adding a protein-modifying dye after incubation of Cp alone or completion of a titration of unlabelled PS1. The R_h distribution plots are shown in Fig. 3b. In the absence of RNA, <5% of Cp assembles under these conditions, in contrast to >80% of the Cp for assembly in the presence of RNA. It appears that the Cp–PS interaction triggers an increase in assembly efficiency. This effect varies with the age of the Cp, consistent with oxidation of an assembly-inhibiting disulfide at the dimer interface³⁹. Comparative statements here are based on the results of both positive and negative control experiments with each batch of Cp.

We then probed the RNA sequence specificity of these reactions (Supplementary Fig. 3a). Test oligos comprised the epsilon stem-loop, as well as loop and bulge variants of PS1. This included a variant in which the bulge region was fully base-paired. In similar assays to the PS1–3 reactions, the R_h values for all three RNAs remain sensitive to nuclease action, implying that assembly of closed shells requires a specific RNA sequence/structure. EM images and distribution plots confirm this interpretation. The sequence sensitivity of the assembly reaction is further highlighted by additional PS1 variants (Supplementary Fig. 3b,c and Supplementary Table 3). Their effects on assembly confirm the importance of the bulge and/or sequences within it and the loop RGAG (here a GGAG) motif. A DNA oligonucleotide encompassing the PS1 sequence (Supplementary Fig. 3d) elicits only aggregation, showing that faithful assembly is a specific property of the PS in its RNA form, that is, with an A helical duplex stem, as well as the Cp-recognition motif in the loop.

The C-terminal ARD of the HBV Cp is believed to mediate interactions with the pgRNA, and the 1–149 Cp fragment that lacks the ARD readily assembles in the absence of nucleic acid⁴⁰. We therefore assessed the ability of Cp₁₄₉ to respond to PSs in the smFCS assay. No RNA-dependent assembly, or PS binding by Cp₁₄₉, occurs under these conditions (Supplementary Fig. 4a), although EM images show that the truncated Cp alone readily assembles, confirming that the ARD is essential for the interaction with RNA. The ARD is extensively phosphorylated in vivo, although the responsible cellular kinase remains unknown⁴¹. Lowering the positive charge on the C terminus of Cp should reduce its ability to bind PS RNAs. We phosphorylated Cp in vitro⁴² (Supplementary Table 1) and tested its properties. EM images show that modified Cp readily assembles but does not bind to PS1 in smFCS assays (Supplementary Fig. 4b).

HBV NC assembly is triggered by formation of a sequence-specific RNA–Cp complex

The VLPs assembled around PS1 were purified on a larger scale, and their structures were determined by cryo-EM, yielding icosahedrally averaged reconstructions of the T = 3 and T = 4 particles (Fig. 4). A significant fraction (∼25%) of the T = 4 particles also contained an asymmetric feature located just below the protein shell. An asymmetric reconstruction of these particles was also calculated (Fig. 5). The result suggests the asymmetric feature represents a complex between PS1 oligonucleotides and the ARD domains of the overlying Cp subunits.

**Figure 4: The structures of T = 3 and T = 4 HBV VLPs suggest a mechanism for the specification of their quasi-conformations.**

**Figure 5: Asymmetric RNA feature in T = 4 HBV VLPs.**

From the EM map at this resolution it is not possible to determine the number of PS oligonucleotides present in the complex. The ratio of the absorbance at 260 and 280 nm (A_260/280) of the purified VLP suggests that the RNA content, assuming T = 4 morphology, is approximately five oligos per particle⁴³. An additional estimate of this stoichiometry was obtained by studying photobleaching of PS1 VLPs (Fig. 4; see Methods). VLPs show multiple bleaching steps, confirming that there are multiple oligos within each shell. Given the labelling efficiency of the oligos, the data are consistent with two to four oligos per VLP. We built a three-dimensional model of PS1 and manually positioned it within the EM map (Fig. 4f; see Methods). From the relative volume of the asymmetric density and the size of the PS1 oligo, it appears that at least two copies of the PS are present within the density. We cannot exclude the possibility that other RNA molecules are bound to the protein shell elsewhere, but are not visible due to mobility or an irregular location with respect to the ordered RNA density. The biochemical and structural data are consistent with the asymmetric structure being an assembly initiation complex, where an RNA PS(s) has initiated assembly, culminating in formation of the T = 4 NC.

The cryo-EM data hint at a further insight into HBV biology. A minority of HBV particles, whether from assembly reactions or wild-type virus infections, assemble with T = 3 quasi-symmetry, and both types of particle are visible in our cryo-EM data. Using two- and three-dimensional classification, the T = 3 (∼11%) and T = 4 (89%) particles are readily separable. Figure 4 shows three-dimensional reconstructions of the two particles with imposed icosahedral symmetry at 5.6 Å and 4.7 Å resolution, respectively. In addition to the obvious differences in size and number of Cp dimers that the two VLP structures contain, the T = 4 and T = 3 maps are different in the features visible on their inner surfaces, where the ARDs are located and where RNA binding occurs. As might be expected for icosahedrally averaged maps of a sub-stoichiometrically occupied VLP, both structures are essentially devoid of density attributable to RNA. The capsid shell of the T = 4 structure is visibly thinner than the T = 3 equivalent, however, and closer examination of the T = 3 map suggests that additional density corresponding to ordered segments of the ARDs is visible (Fig. 4), which is absent in the T = 4 structure (Fig. 4c,d). This difference persists when the T = 4 map is Fourier filtered to be at a similar resolution as the T = 3 (Supplementary Fig. 5). This is consistent with previous studies that showed that the Cp C-terminal region, including the ARD, plays a role(s) in determining capsid geometry^29,44.

Discussion

Previously, we identified multiple RNA PSs within the genomes of positive-sense ssRNA viruses that play essential roles in their assembly^33,37. Here, we have explored whether similar sequence-specific Cp–RNA interaction sites exist within the pre-genomic RNA of HBV. Many such sites emerge from this analysis, encompassing stem-loop structures presenting variations of a loop motif likely to be the Cp-recognition sequence. This motif is highly conserved across all strain variants and is statistically strongly over-represented within the HBV genome. Three of these sites bind Cp in a sequence-specific manner as RNA stem-loops, promoting efficient, high-fidelity assembly into predominantly T = 4 VLPs, with assembly properties similar to the packaging signals of ssRNA viruses. This sequence specificity has not been observed previously in in vitro reassembly reactions³⁸, which suggest that both pre-genomic and non-genomic RNA are packaged cooperatively. However, under similar conditions, Cp alone forms capsid shells, albeit with lower efficiency. This is in marked contrast to what we observe here at low nanomolar concentrations, perhaps mimicking in vivo assembly conditions. Under these conditions, Cp appears stable as dimers, but in the presence of RNA assembles into higher-order structures, forming closed icosahedral shells in a sequence-specific fashion. Such reactions probably mimic events in the cell, providing new insight into the genome packaging specificity of HBV.

In bona fide ssRNA viruses, packaging signals regulate assembly by facilitating the formation of the protein–protein interactions of the (nucleo)capsid, simultaneously collapsing the conformational ensemble of the genomic RNA^30,37. Individual packaging signals can also trigger VLP formation akin to the results seen here³⁷. HBV pgRNA by itself, that is, without bound polymerase, is insufficient to trigger packaging in vivo. Earlier observations indicate that activation and inactivation of assembly is sensitive to Cp conformation and is allosterically triggered^45,46. This is consistent with the findings here. Cp₁₄₉ assembles at low concentration without RNA, in contrast to Cp₁₈₅, implying that the ARD is inhibitory for assembly under these conditions. This inhibition is removed for Cp₁₈₅, either by binding PS RNA or by phosphorylation. Both these routes reduce the net charge on each ARD, implying that electrostatic repulsion might be the origin of the inhibition. It is therefore possible to postulate an assembly pathway (Fig. 6) that accounts for the known properties of HBV Cp. The Cp exists as a dimer with positively charged C-terminal ARDs. The latter create an electrostatic repulsion inhibiting the formation of Cp complexes larger than a dimer. This barrier is not absolute, and some dimers of dimers can form, their concentration increasing with Cp concentration. If that higher-order species is required to trigger NC assembly based on Cp–Cp contacts, then reassembly of Cp, alone or in non-specific RNA–Cp complexes at higher concentrations, can be explained. At low concentration the Cp binds specific PSs within the pre-genome, triggering formation of the critical higher-order species and hence NC formation. That species is likely to correspond to the structure seen in Fig. 5. In the pre-genome the PS sites forming the initiation complex would be different PSs, each folding into a stem-loop presenting the recognition motif rather than the multiple identical copies of PS1 as seen here. The efficient assembly of the closed T = 4 shell with PS1 suggests, however, that the assembly initiation step mediated by the nucleation complex would be similar.

**Figure 6: Proposed model of HBV NC assembly.**

HBV is not an ssRNA virus and has a much more complex lifecycle. Therefore, the roles fulfilled by specific Cp–RNA interactions may also be distinct. Evidence suggests that the polymerase–ε complex plays a critical role in pgRNA selection and NC assembly. Conversely, the PS sequences identified in this work are highly conserved and demonstrably have specific affinity for Cp. For correct assembly the virus needs to achieve the following: (1) identify full-length pgRNA; (2) assemble a quasi-equivalent shell of Cp around that RNA; (3) complete reverse transcription of the pgRNA using the encapsidated P protein while degrading the template; and (4) complete copying of the negative ssDNA strand, creating a partially dsDNA genome. Evidence suggests the polymerase translocates extensively on the pgRNA during these processes²⁵. The 5′ ε (Fig. 1a) can base-pair with 3′ ϕ, effectively circularizing the pgRNA, an interaction that may play a role(s) during both packaging and template transfers. The ϕ site, at nucleotides 3172–3190, is adjacent to PS1. It is therefore possible that the polymerase–ε/ϕ complex favours the folding of PS1 to present its recognition motif contributing to assembly initiation. Such a mechanism would ensure that Cp assembly only occurs on a pre-genome that has recruited polymerase. It would also permit co-localization of P with both ends of the pgRNA, imposing a defined position with respect to the encapsidated genome. The presence of the multiple PS sites would then result in formation of a defined, non-entangled path for the RNA within the NC, that is, corresponding to the track along which the polymerase must travel. The HBV pre-genome has many fewer PS sites than are seen in ssRNA viruses, consistent with the need to have most of the RNA readily available for reverse transcription. There may also be other roles for these specific PS–Cp interactions in HBV. For instance, specific interaction of Cp with pgRNA in the nucleus may facilitate export of unspliced RNAs using the nuclear export signals on the Cp¹³.

Previous in vitro studies of empty capsid assembly have suggested that Cp conformational change is needed to trigger nucleation^45,47. Candidate small-molecule antiviral therapeutics are known that act as allosteric effectors driving the assembly of HBV^48,49. In addition, structural studies have revealed the breadth of HBV Cp conformational flexibility, suggesting that small molecules and/or genomic sequences could restrict an ensemble of structures to particular active, or inactive, forms⁴⁶. The preferred RNA–Cp contacts identified here open new insights into the regulation of assembly around a genome that must be reversed-transcribed and therefore offer additional therapeutic targets.

Methods

Cloning, expression and purification of proteins used

We obtained an E. coli Cp-expressing plasmid (a gift of N. Stonehouse), known to produce assembled HBV VLPs containing host RNAs⁵⁰. The Cp encoded has the following amino-acid sequence differences compared to the current GenBank reference strain (NC_003977.2): A61, E77-FAGAS (single-letter amino-acid code) -D78 insertion, S92N, F102I, I121L, R156-RD-R157 insertion. Because the wild-type C61 has been implicated in assembly³⁹, this was restored to the gene before expression in a PET28b plasmid in BL21(DE3) E. coli cells. The inserted FAGAS epitope was also removed. Induction with 1 mM isopropyl-β-d-thiogalactoside at an optical density (OD₆₀₀) of 0.6 was followed by growth for 20 h at 21 °C. Cells were lysed using a Soniprep 150 with 5 × 30 s bursts on ice. The lysate was then clarified by spinning at 11,000g for 1 h. VLPs were then pelleted by centrifugation at 120,000g for 14 h, resuspended in 20 mM HEPES (pH 7.5), 250 mM NaCl and 5 mM dithiothreitol (DTT) and applied to an XK50 column packed with 25 ml of Capto core 700 resin (GE Life Sciences). Fractions containing VLPs were pooled and precipitated with 40% (wt/vol) ammonium sulfate. The Cp appeared pure on SDS–PAGE, and its identity, and that of variants, was confirmed by mass spectrometry (Supplementary Table 1). Cp lacking the ARD—that is, Cp₁₄₉—was produced by mutagenesis (Q5 site-directed mutagenesis kit, NEB) and prepared similarly. Note that the Cp₁₄₉ VLP expressed in E. coli lacks significant encapsidated cellular RNA. VLPs were visualized by negative-stain transmission electron microscopy (TEM). Full-length Cp VLPs were additionally purified by sucrose density gradient before dye-labelling using Alexa Fluor-488 SDP ester (Invitrogen) over 4 h at room temperature in 200 mM sodium carbonate buffer (pH 8.3), followed by desalting over a NAP5 column. There were two overlapping VLP peaks on the gradient, and it was impossible to separate them. TEM and smFCS confirmed that they are the expected T = 3 and T = 4 shells, with the latter the predominant form (Supplementary Fig. 1a). Cp region 140–148 has been shown to be a determinant of morphology, the shorter versions producing more T = 3 shells²⁹. It is possible that the dipeptide insertion adjacent to the linker region at position 157 may alter the properties of the Cp. However, when we removed the RD insertion, yielding Cp₁₈₃, we found no differences with Cp₁₈₅, either in RNA binding, ability to form VLPs with PS RNAs or preference for the dominant quasi-conformer shell formed. Because longer Cp was used for SELEX and the high-resolution EM work, those are the data shown throughout.

All HBV variants used for assembly assays were dissociated from VLPs into protein dimers as previously described³⁸, with the exception that dissociation was at pH 9.5, as opposed to 7.5. This was done in the presence of complete protease inhibitor tablets (Thermofisher Scientific). HBV core dimer concentration was determined by ultraviolet absorbance. Fractions with an A_260/280 ratio of ∼0.6 or lower were used in assembly assays. SRPKΔ kinase was expressed and purified from a pRSETb plasmid, as previously described⁴².

SELEX protocol

Purified HBV capsids (∼360 µg) were immobilized onto 6 mg of M270 carboxylic acid Dynabeads (Thermofisher Scientific) following the manufacturer's protocol. Beads were washed twice with selection buffer (25 mM HEPES, pH 7.5, 250 mM NaCl, 2 mM DTT, EDTA-free complete protease inhibitor) and unreacted N-hydroxysuccinamide blocked with a 15 m 50 mM Tris-HCl pH 7.4 wash. Beads were washed a further three times with selection buffer. Immobilized capsids were dissociated with a 30 min incubation of 2 M guandinium chloride in 0.5 M LiCl₂. Beads were then washed three times with B&W buffer (10 mM Tris-HCl, pH 7.5, 1 mM EDTA, 2 M NaCl) and then washed three times with selection buffer. Beads were resuspended in selection buffer so that the concentration of beads was 10 mg ml^–1. Negative selection beads were also prepared in the same manner but with no capsids. Ten rounds of SELEX were performed in vitro using a synthetic, combinatorial N40 2′OH RNA library (∼1 × 10²⁴ potential sequences), as described previously⁵¹. The amplified DNA of round 10 was then subjected to next generation sequencing on an Illumina MiSeq platform. This yielded ∼1.6 million sequence reads, in which one sequence occurred 65,802 times, and there were 1,149 aptamers with a multiplicity of 100 or higher. The overall frequencies of the four nucleotides in this aptamer pool were A 34.30%, C 9.09%, G 40.97% and U 15.64%, which compare with the same data for the unselected naive library of A 26.10%, C 22.03%, G 24.64% and U 27.22%. The highest multiplicity for sequences in the latter pool was 4. These data confirm that selection from the naive pool occurred and that the base composition of the selected aptamers is consistent with the RGAG motif identified within the HBV genomes.

PS identification

PS identification was carried out using the laboratory HBV strain (*NC_003977.1). The aptamer library contained 1,664,890 unique sequences, each 40 nt in length, that were aligned against the genome as follows. Each aptamer sequence was slid along the genome in increments of 1 nt. For each such position of the reference frame, the subset of the aptamer sequence with the best alignment to the genome was identified according to the Bernoulli score B, which benchmarks the probability of a non-contiguous alignment to that of a contiguous alignment of B nucleotides. The Bernoulli scores for all reference frames of a given aptamer sequence in the library were rank-ordered starting from the largest score, and all matches with the genome up to a Bernoulli score of 12 were counted. The procedure was then repeated for the other aptamer sequences and corresponding matches added, resulting in the peaks in Fig. 2a.

Identification of a consensus motif

HBV genome sequences with the following accession numbers were randomly extracted from 750 complete HBV genomes found in GenBank: KCS10648.1, *AF223955.1, AY781181.1, *AB116266.1, AB195943.1, KR014086.1, *KR014072.1, KR014055.1, KR013939.1, KR013921.1, KR013816.1, KR013800.1, EU796069.1 and AB540582.1. The NCBI HBV reference strain (GenBank Seq ID *NC_003977.2) and the laboratory strain (GenBank Seq ID NC_003977.1) were added to the ensemble. Sequences used for the statistical analysis in Fig. 2c are marked by an asterisk. Bernoulli peaks, which occurred within at most 10 nt of each other in at least 80% of these 16 HBV strain variants, are marked by a green cross in Fig. 2a to indicate their conservation. To identify the putative PS recognition motif, we extracted sequences of 60 nt, centred around the peak nucleotide of each Bernoulli peak, from three representative strains (AF223955.1, NC_003977.1 and NC_003977.2) and determined all possible stem-loops of negative free energy via Mfold⁵². We carried out a similarity analysis of these stem-loops, comparing both sequence and structure elements, and identified for each peak area the representative with the highest degree of similarity both with secondary structure elements in the other peak areas in the same genome and stem-loops corresponding to the same peak area in the other strains. This returned a stem-loop for each peak. An alignment of the corresponding loop sequences is shown in Fig. 2b.

RNA dye-labelling

PS1, PS2 and PS3 (47 nucleotides long) were purchased from Integrated DNA Technologies with a 5′ C6-amino group. To label RNA, 6 µl of RNA (200 µM) was mixed with 1 µl 1 M sodium borate buffer pH 8 and 3 µl 10 mM Alexa-488-SDP (Thermofisher Scientific) and mixed at room temperature for 4 h. A 10 µl volume of 2× denaturing loading dye was then added to the RNA, boiled for 5 min and loaded onto a pre-warmed denaturing PAGE. RNA was gel-extracted, isopropanol-precipitated and finally re-suspended in diethyl pyrocarbonate (DEPC)-H₂O and frozen at −80 °C until needed.

Assembly assays

Assembly reactions were performed by adding HBV Cp in dissociation buffer (50 mM Tris pH 9.5, 1.5 M guanidinium hydrochloride (GuHCl), 500 mM LiCl and 5 mM DTT) to 15 nM Alexa-488-labelled RNA in a reassembly buffer containing 20 mM HEPES pH 7.5, 250 mM NaCl, 5 mM DTT and 0.05%(vol/vol) Tween-20 at 25 °C. Successive additions of dimer were performed until assembly was deemed complete by the measured R_h value plateauing, but never exceeded 10% of the total reaction volume. Each addition of Cp is marked by a vertical dashed grey line in the titration plots, and the expected hydrodynamic radii of T = 3 and T = 4 particles (as determined for dye-labelled particles expressed in E. coli) are marked by an orange horizontal dashed line within figures.

Manual mixing throughout the reactions caused a delay of ∼1 min at the start of FCS data collection. FCS measurements were carried out using a custom-built FCS set-up with 30 s data accumulation per autocorrelation function (CF)⁵³. Individual CFs were decomposed into triplet state relaxation and diffusion (characterized by diffusion time, TD) components, and the latter was converted into an apparent hydrodynamic radius, R_h (ref. 54). Samples for TEM were taken at the end of each measurement. Plots of R_h over time (thin dashed line) were smoothed (thick solid line) using the FFT filter in Origin Pro-8 with a cutoff percentage of 35%. Plots of R_h distribution were also fitted, using Origin Pro-8 software, to a normal single- or multiple-peak Gaussian function (for example, Fig. 3). Samples taken for negative-stain TEM analysis were placed onto a glow-discharged carbon-coated Formvar 300-mesh Cu grid. Grids were stained with 2% uranyl acetate and dried.

Assembled particle labelling

Assembly was carried out as in smFCS experiments. In particular, Cp was titrated into reassembly buffer with and without 15 nM unlabelled PS1 to a final concentration of 250 nM. This was allowed to incubate at room temperature for 1 h and then buffer exchange was carried out via dialysis to remove any GuHCl. Labelling of protein was then carried out by adding Alexa Fluor-488 SDP ester (1:50 ratio of dye to Cp dimer) and incubating overnight at 4 °C. The resulting sample was then measured via smFCS in 30 s bins for 100 min, and the R_h data were plotted as above in a hydrodynamic radial distribution plot. A sample was then removed for analysis via TEM. Post labelling, Cp dimer became assembly-incompetent, so Cp could not be tracked during real-time assembly.

Photobleaching

HBV VLPs containing Alexa-488-labelled PS1 were assembled as described for smFCS assembly assays. Under those conditions, all RNA is bound to protein, as judged from fluorescence quenching and photon counting in the FCS experiments. VLPs were then added to two glow-discharge-irradiated carbon/Formvar 300-mesh grids (Agar Scientific) and one grid stained with 2% (wt/vol) uranyl acetate, then viewed with a Jeol 1400 microscope at ×40,000 magnification. The remaining, unstained grid was positioned Formvar side down onto a clean microscope coverslip and mounted onto an inverted total internal reflection fluorescence microscope. The laser (Coherent Sapphire, 488 nm, 25 mW) power was adjusted to excite and photobleach the labelled RNA within a timeframe of several minutes. Sequential images were taken with an electron-multiplying charge-coupled device camera (Andor iXon) with 0.2 s exposures and multiplying gain of 200. An unexposed field of view was used for each series.

Fluorescent spots were identified in the collected frames using previously described procedures then converted into time traces⁵⁵. These were inspected and classified according to the number of photobleaching steps. Frequencies of traces with a defined number of steps were collated in a histogram. Several bright spots per field of view exhibited continuous intensity decay, presumably representing larger aggregates. These were used to estimate the overall photobleaching rate (0.003 per frame) and formally included in the histogram as representing ten steps. The histogram without the bin representing continuum events was modelled as a weighted sum of binomial distributions for up to quadruple occupancy and the probability of labelling of 0.56 estimated from UV–vis spectra.

Electron microscopic reconstructions

Large-scale VLP preparation

smFCS experiments were scaled up into 96-well plates. Two 96-well plates (Non-Binding Surface, Corning) were used. PS1 RNA was labelled and gel-purified, and HBV dimer was purified as described. Each well contained 200 µl of 15 nM PS1 in reassembly buffer. As in smFCS, ten 2 µl injections of 2.5 µM dimer in dissociation buffer were performed. A PerkinElmer Envision plate reader was used to carry out the injections and record the anisotropy of the PS1 RNA (FITC excitation and emission filters). VLPs were purified away from free RNA and capsid using a 1.33 g ml^–1 caesium chloride gradient and spun at 113,652g for 90 h using an SW40Ti rotor. A single band was observed and fractionated. The band was dialysed into reassembly buffer to remove caesium chloride. The 2 ml fraction of VLP was concentrated to 200 µl using an Amicon 100 kDa MWCO spin concentrator.

Cryo-EM specimen preparation

After recovery of the PS1-containing VLPs and removal of caesium chloride by dialysis, their structures were analysed using single-particle cryo-EM. VLPs were vitrified. EM grids (200-mesh) with Quantifoil R 2/1 support film and an additional ∼5 nm continuous carbon film were washed using acetone and glow-discharged for 40 s before use. Cryo-EM grids were prepared by placing 3 µl of ∼3.2 mg ml^–1 HepB VLP on the grid, before blotting and plunge-freezing using a Leica EM GP freezing device. Chamber conditions were set at 8 °C and 95% relative humidity, with a liquid ethane temperature of −175 °C. Data were collected on an FEI Titan Krios (eBIC, Diamond Light Source) transmission electron microscope at 300 keV using an electron dose of 27 e⁻ Å⁻² s⁻¹, for a 2.5 s exposure, yielding a total electron dose of 67.5 e⁻ Å⁻². Data were recorded on a 17 Hz FEI Falcon II direct electron detector. The dose was fractionated across 33 frames. Final object sampling was 1.34 Å per pixel. A total of 2,397 micrographs were recorded using EPU (FEI) automated data collection software.

Single-particle image processing

In total, 2,397 micrographs were motion-corrected; averages of each video were generated using MotionCorr⁵⁶ and contrast transfer function (CTF) parameters for each were determined using CTFFIND4⁵⁷. Micrographs with unacceptable astigmatism or charging, as determined by examining the output from CTFFIND4, were discarded, leaving a total data set of 1,710 micrographs. All particle picking, classification and alignment was performed in RELION 1.3 (ref. 58).

Approximately 57,000 particles were manually picked and classified using reference-free two-dimensional classification in RELION 1.3. This classification confirmed the initial visual impression that, although the VLPs were purified as a single band on a caesium gradient, two sizes of VLPs were present. A selection of the resulting two-dimensional class averages were used as templates for automated particle picking. The particle stack generated using autopicking was subject to two-dimensional classification to separate T = 3 and T = 4 particles and to remove particles not corresponding to VLPs. The subsequent particle stacks (5,589 for T = 3 and 42,411 for T = 4) were subject to three-dimensional classification, using a sphere with the approximate diameter of the VLP as a starting model. Subsets of the data were reconstructed including data out to the Nyquist frequency using the three-dimensional autorefine option in RELION with I3 symmetry imposed to generate all structures presented in this work. Within the T = 4, 42,411 particle data set, it was clear that a further subset (10,851 particles) of the data contained a significant asymmetric feature inside the Cp shell where RNA binding would be expected to occur. An asymmetric (C1) reconstruction was performed on a relatively homogenous set of 10,851 such particles, giving the reconstruction at 11.5 Å resolution.

The three-dimensional model of PS1 RNA was made using RNA Composer⁵⁹. The cryo-EM figures were rendered using USCF Chimera⁶⁰.

Data availability

The data that supports the findings of this study are available from the corresponding authors upon request. Correspondence and requests for materials should be addressed to P.G.S. The cryo-EM reconstructions have been deposited in the Electron Microscopy Databank (EMDB) with the following accession codes: EMD-3714 (asymmetric T = 4 HBV VLP), EMD-3715 (T = 4 HBV VLP with I3 symmetry imposed) and EMD-3716 (T = 3 HBV VLP with I3 symmetry imposed).

Additional information

How to cite this article: Patel, N. et al. HBV RNA pre-genome encodes specific motifs that mediate interactions with the viral core protein that promote nucleocapsid assembly. Nat. Microbiol. 2, 17098 (2017).

Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

References

WHO. Weekly epidemiological record. WHO 84, 405–420 (2009).
Tillmann, H. L. Antiviral therapy and resistance with hepatitis B virus infection. World J. Gastroenterol. 13, 125–140 (2007).
Article CAS PubMed PubMed Central Google Scholar
Murray, K. et al. Protective immunisation against hepatitis B with an internal antigen of the virus. J. Med. Virol. 23, 101–107 (1987).
Article CAS PubMed Google Scholar
Nassal, M. Hepatitis B viruses: reverse transcription a different way. Virus Res. 134, 235–249 (2008).
Article CAS PubMed Google Scholar
Seeger, C., Zoulim, F. & Mason, W. S. in Fields Virology Vol. 2 (eds Knipe, D. M. & Howley, P. M. ) 2185–2221 (Lippincott Williams & Wilkins, 2013).
Google Scholar
Selzer, L. & Zlotnick, A. Assembly and Release of Hepatitis B Virus Vol. 5 (Cold Spring Harbor Laboratory Press, 2015).
Book Google Scholar
Bock, C. T. et al. Structural organization of the hepatitis B virus minichromosome. J. Mol. Biol. 307, 183–196 (2001).
Article CAS PubMed Google Scholar
Guo, Y.-H., Li, Y.-N., Zhao, J.-R., Zhang, J. & Yan, Z. HBc binds to the CpG islands of HBV cccDNA and promotes an epigenetic permissive state. Epigenetics 6, 720–726 (2011).
Article CAS PubMed Google Scholar
Günther, S., Sommer, G., Iwanska, A. & Will, H. Heterogeneity and common features of defective hepatitis B virus genomes derived from spliced pregenomic RNA. Virology 238, 363–371 (1997).
Article PubMed Google Scholar
Abraham, T. M., Lewellyn, E. B., Haines, K. M. & Loeb, D. D. Characterization of the contribution of spliced RNAs of hepatitis B virus to DNA synthesis in transfected cultures of Huh7 and HepG2 cells. Virology 379, 30–37 (2008).
Article CAS PubMed Google Scholar
Yeh, C. T., Liaw, Y. F. & Ou, J. H. The arginine-rich domain of hepatitis B virus precore and core proteins contains a signal for nuclear transport. J. Virol. 64, 6141–6147 (1990).
CAS PubMed PubMed Central Google Scholar
Eckhardt, S. G., Milich, D. R. & McLachlan, A. Hepatitis B virus core antigen has two nuclear localization sequences in the arginine-rich carboxyl terminus. J. Virol. 65, 575–582 (1991).
CAS PubMed PubMed Central Google Scholar
Li, H.-C. et al. Nuclear export and import of human hepatitis B virus capsid protein and particles. PLoS Pathog. 6, e1001162 (2010).
Article PubMed PubMed Central Google Scholar
Bartenschlager, R., Junker-Niepmann, M. & Schaller, H. The P gene product of hepatitis B virus is required as a structural component for genomic RNA encapsidation. J. Virol. 64, 5324–5332 (1990).
CAS PubMed PubMed Central Google Scholar
Bartenschlager, R. & Schaller, H. Hepadnaviral assembly is initiated by polymerase binding to the encapsidation signal in the viral RNA genome. EMBO J. 11, 3413–3420 (1992).
Article CAS PubMed PubMed Central Google Scholar
Junker-Niepmann, M., Bartenschlager, R. & Schaller, H. A short cis-acting sequence is required for hepatitis B virus pregenome encapsidation and sufficient for packaging of foreign RNA. EMBO J. 9, 3389–3396 (1990).
Article CAS PubMed PubMed Central Google Scholar
Hirsch, R. C., Lavine, J. E., Chang, L., Varmus, H. E. & Ganem, D. Polymerase gene products of hepatitis B viruses are required for genomic RNA packaging as well as for reverse transcription. Nature 344, 552–555 (1990).
Article CAS PubMed Google Scholar
Pollack, J. R. & Ganem, D. An RNA stem-loop structure directs hepatitis B virus genomic RNA encapsidation. J. Virol. 67, 3254–3263 (1993).
CAS PubMed PubMed Central Google Scholar
Knaus, T. & Nassal, M. The encapsidation signal on the hepatitis B virus RNA pregenome forms a stem-loop structure that is critical for its function. Nucleic Acids Res. 21, 3967–3975 (1993).
Article CAS PubMed PubMed Central Google Scholar
Lan, Y. T., Li, J., Liao, W. & Ou, J. Roles of the three major phosphorylation sites of hepatitis B virus core protein in viral replication. Virology 259, 342–348 (1999).
Article CAS PubMed Google Scholar
Gazina, E. V., Fielding, J. E., Lin, B. & Anderson, D. A. Core protein phosphorylation modulates pregenomic RNA encapsidation to different extents in human and duck hepatitis B viruses. J. Virol. 74, 4721–4728 (2000).
Article CAS PubMed PubMed Central Google Scholar
Köck, J., Nassal, M., Deres, K., Blum, H. E. & von Weizsäcker, F. Hepatitis B virus nucleocapsids formed by carboxy-terminally mutated core proteins contain spliced viral genomes but lack full-size DNA. J. Virol. 78, 13812–13818 (2004).
Article PubMed PubMed Central Google Scholar
Abraham, T. M. & Loeb, D. D. Base pairing between the 5′ half of ε and a cis-acting sequence, Φ, makes a contribution to the synthesis of minus-strand DNA for human hepatitis B virus. J. Virol. 80, 4380–4387 (2006).
Article CAS PubMed PubMed Central Google Scholar
Oropeza, C. E. & McLachlan, A. Complementarity between ε and Φ sequences in pregenomic RNA influences hepatitis B virus replication efficiency. Virology 359, 371–381 (2007).
Article CAS PubMed Google Scholar
Wang, J. C.-Y., Nickens, D. G., Lentz, T. B., Loeb, D. D. & Zlotnick, A. Encapsidated hepatitis B virus reverse transcriptase is poised on an ordered RNA lattice. Proc. Natl Acad. Sci. USA 111, 11329–11334 (2014).
Article PubMed PubMed Central Google Scholar
Wang, J. C.-Y., Dhason, M. S. & Zlotnick, A. Structural organization of pregenomic RNA and the carboxy-terminal domain of the capsid protein of hepatitis B virus. PLoS Pathog. 8, e1002919 (2012).
Article CAS PubMed PubMed Central Google Scholar
Stannard, L. M. & Hodgkiss, M. Morphological irregularities in Dane particle cores. J. Gen. Virol. 45, 509–514 (1979).
Article CAS PubMed Google Scholar
Crowther, R. A. et al. Three-dimensional structure of hepatitis B virus core particles determined by electron cryomicroscopy. Cell 77, 943–950 (1994).
Article CAS PubMed Google Scholar
Zlotnick, A. et al. Dimorphism of hepatitis B virus capsids is strongly influenced by the C-terminus of the capsid protein. Biochemistry 35, 7412–7421 (1996).
Article CAS PubMed Google Scholar
Borodavka, A., Tuma, R. & Stockley, P. G. Evidence that viral RNAs have evolved for efficient, two-stage packaging. Proc. Natl Acad. Sci. USA 109, 15769–15774 (2012).
Article CAS PubMed PubMed Central Google Scholar
Dykeman, E. C., Stockley, P. G. & Twarock, R. Packaging signals in two single-stranded RNA viruses imply a conserved assembly mechanism and geometry of the packaged genome. J. Mol. Biol. 425, 3235–3249 (2013).
Article CAS PubMed Google Scholar
Stockley, P. G. et al. Packaging signals in single-stranded RNA viruses: nature's alternative to a purely electrostatic assembly mechanism. J. Biol. Phys. 39, 277–287 (2013).
Article CAS PubMed PubMed Central Google Scholar
Stockley, P. G. et al. A simple, RNA-mediated allosteric switch controls the pathway to formation of a T = 3 viral capsid. J. Mol. Biol. 369, 541–552 (2007).
Article CAS PubMed PubMed Central Google Scholar
Dykeman, E. C., Stockley, P. G. & Twarock, R. Solving a Levinthal's paradox for virus assembly identifies a unique antiviral strategy. Proc. Natl Acad. Sci. USA 111, 5361–5366 (2014).
Article CAS PubMed PubMed Central Google Scholar
Stewart, H. et al. Identification of novel RNA secondary structures within the hepatitis C virus genome reveals a cooperative involvement in genome packaging. Sci. Rep. 6, 22952 (2016).
Article CAS PubMed PubMed Central Google Scholar
Shakeel, S. et al. Genomic RNA folding mediates assembly of human parechovirus. Nat. Commun. 8, 5 (2017).
Article PubMed PubMed Central Google Scholar
Patel, N. et al. Revealing the density of encoded functions in a viral RNA. Proc. Natl Acad. Sci. USA 112, 2227–2232 (2015).
Article CAS PubMed PubMed Central Google Scholar
Porterfield, J. Z. et al. Full-length hepatitis B virus core protein packages viral and heterologous RNA with similarly high levels of cooperativity. J. Virol. 84, 7174–7184 (2010).
Article CAS PubMed PubMed Central Google Scholar
Selzer, L., Katen, S. P. & Zlotnick, A. The hepatitis B virus core protein intradimer interface modulates capsid assembly and stability. Biochemistry 53, 5496–5504 (2014).
Article CAS PubMed Google Scholar
Birnbaum, F. & Nassal, M. Hepatitis B virus nucleocapsid assembly: primary structure requirements in the core protein. J. Virol. 64, 3319–3330 (1990).
CAS PubMed PubMed Central Google Scholar
Ludgate, L. et al. Cyclin-dependent kinase 2 phosphorylates S/T-P sites in the hepadnavirus core protein C-terminal domain and is incorporated into viral capsids. J. Virol. 86, 12237–12250 (2012).
Article CAS PubMed PubMed Central Google Scholar
Aubol, B. E. et al. Processive phosphorylation of alternative splicing factor/splicing factor 2. Proc. Natl Acad. Sci. USA 100, 12601–12606 (2003).
Article CAS PubMed PubMed Central Google Scholar
Porterfield, J. Z. & Zlotnick, A. A simple and general method for determining the protein and nucleic acid content of viruses by UV absorbance. Virology 407, 281–288 (2010).
Article CAS PubMed Google Scholar
Watts, N. R. et al. The morphogenic linker peptide of HBV capsid protein forms a mobile array on the interior surface. EMBO J. 21, 876–884 (2002).
Article CAS PubMed PubMed Central Google Scholar
Packianathan, C., Katen, S. P., Dann, C. E. & Zlotnick, A. Conformational changes in the hepatitis B virus core protein are consistent with a role for allostery in virus assembly. J. Virol. 84, 1607–1615 (2010).
Article CAS PubMed Google Scholar
Venkatakrishnan, B. et al. Hepatitis B virus capsids have diverse structural responses to small-molecule ligands bound to the heteroaryldihydropyrimidine pocket. J. Virol. 90, 3994–4004 (2016).
Article CAS PubMed PubMed Central Google Scholar
Hilmer, J. K., Zlotnick, A. & Bothner, B. Conformational equilibria and rates of localized motion within hepatitis B virus capsids. J. Mol. Biol. 375, 581–594 (2008).
Article CAS PubMed Google Scholar
Bourne, C. et al. Small-molecule effectors of hepatitis B virus capsid assembly give insight into virus life cycle. J. Virol. 82, 10262–10270 (2008).
Article CAS PubMed PubMed Central Google Scholar
Katen, S. P., Chirapu, S. R., Finn, M. G. & Zlotnick, A. Trapping of hepatitis B virus capsid assembly intermediates by phenylpropenamide assembly accelerators. ACS Chem. Biol. 5, 1125–1136 (2010).
Article CAS PubMed PubMed Central Google Scholar
Holmes, K. et al. Assembly pathway of hepatitis B core virus-like particles from genetically fused dimers. J. Biol. Chem. 290, 16238–16245 (2015).
Article CAS PubMed PubMed Central Google Scholar
Bunka, D. H. J. et al. Degenerate RNA packaging signals in the genome of satellite tobacco necrosis virus: implications for the assembly of a T = 1 capsid. J. Mol. Biol. 413, 51–65 (2011).
Article CAS PubMed Google Scholar
Zuker, M. Mfold web server for nucleic acid folding and hybridization prediction. Nucleic Acids Res. 31, 3406–3415 (2003).
Article CAS PubMed PubMed Central Google Scholar
Gell, et al. Single-molecule fluorescence resonance energy transfer assays reveal heterogeneous folding ensembles in a simple RNA stem-loop. J. Mol. Biol. 384, 264–278 (2008).
Article CAS PubMed Google Scholar
Podjarny, A., Dejaegere, A. P. & Kieffer, B. (eds) in Biophysical Approaches Determining Ligand Binding to Biomolecular Targets: Detection, Measurement and Modelling Ch. 5, 165 (Royal Society of Chemistry, 2011).
Book Google Scholar
Sharma, A. et al. Domain movements of the enhancer-dependent sigma factor drive DNA delivery into the RNA polymerase active site: insights from single molecule studies. Nucleic Acids Res. 42, 5177–5190 (2014).
Article CAS PubMed PubMed Central Google Scholar
Li, X. et al. Electron counting and beam-induced motion correction enable near-atomic-resolution single-particle cryo-EM. Nat. Methods 10, 584–590 (2013).
Article CAS PubMed PubMed Central Google Scholar
Rohou, A. & Grigorieff, N. CTFFIND4 fast and accurate defocus estimation from electron micrographs. J. Struct. Biol. 192, 216–221 (2015).
Article PubMed PubMed Central Google Scholar
Scheres, S. H. W. Semi-automated selection of cryo-EM particles in RELION-1.3. J. Struct. Biol. 189, 114–122 (2015).
Article CAS PubMed PubMed Central Google Scholar
Popenda, M. et al. Automated 3D structure composition for large RNAs. Nucleic Acids Res. 40, e112–e112 (2012).
Article CAS PubMed PubMed Central Google Scholar
Pettersen, E. F. et al. UCSF chimera—a visualization system for exploratory research and analysis. J. Comput. Chem. 25, 1605–1612 (2004).
Article CAS PubMed Google Scholar
Yu, X., Jin, L., Jih, J., Shih, C. & Zhou, Z. H. 3.5Å cryoEM structure of hepatitis B virus core assembled from full-length core protein. PLoS ONE 8, e69729 (2013).
Article CAS PubMed PubMed Central Google Scholar

Download references

Acknowledgements

The authors acknowledge the UK MRC (MR/N021517/1) and the Universities of Leeds and York for financial support for parts of this work, which was also supported in part by grants from the Wellcome Trust (089311/Z/09/Z, 090932/Z/09/Z and 106692). P.G.S. and R.Tw. also acknowledge the Wellcome Trust for financial support for virus work (Joint Investigator Award nos. 110145 and 110146) and R.Tw. acknowledges funding via a Royal Society Leverhulme Trust Senior Research Fellowship (LT130088) and EPSRC grant EP/K028286/1 for R.J.B. E.C.D. acknowledges funding via an Early Career Leverhulme Trust Fellowship (ECF-2013-019). A.Z. acknowledges funding from National Institutes of Health grant R01-AI118933. The authors also thank the eBIC for collection time on the Titan Krios microscopes.

Author information

Nikesh Patel and Simon J. White: These authors contributed equally to this work.

Authors and Affiliations

Astbury Centre for Structural Molecular Biology, University of Leeds, LS2 9JT, Leeds, UK
Nikesh Patel, Simon J. White, Rebecca F. Thompson, Daniel P. Maskell, Roman Tuma, Neil A. Ranson & Peter G. Stockley
Departments of Biology and Mathematics & York Centre for Complex Systems Analysis, University of York, YO10 5DD, York, UK
Richard Bingham, Eva U. Weiß, Eric C. Dykeman & Reidun Twarock
Department of Molecular & Cellular Biochemistry, Indiana University, Bloomington, 47405, Indiana, USA
Adam Zlotnick

Authors

Nikesh Patel
View author publications
You can also search for this author in PubMed Google Scholar
Simon J. White
View author publications
You can also search for this author in PubMed Google Scholar
Rebecca F. Thompson
View author publications
You can also search for this author in PubMed Google Scholar
Richard Bingham
View author publications
You can also search for this author in PubMed Google Scholar
Eva U. Weiß
View author publications
You can also search for this author in PubMed Google Scholar
Daniel P. Maskell
View author publications
You can also search for this author in PubMed Google Scholar
Adam Zlotnick
View author publications
You can also search for this author in PubMed Google Scholar
Eric C. Dykeman
View author publications
You can also search for this author in PubMed Google Scholar
Roman Tuma
View author publications
You can also search for this author in PubMed Google Scholar
Reidun Twarock
View author publications
You can also search for this author in PubMed Google Scholar
Neil A. Ranson
View author publications
You can also search for this author in PubMed Google Scholar
Peter G. Stockley
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

P.G.S. and R.Tw. conceived the project. N.P. performed smFCS and photobleaching experiments and, with R.Tu., analysed the data. S.J.W., R.F.T. and N.A.R. collected and analysed the cryo-EM data. S.J.W. performed SELEX and R.B., E.D., E.U.W. and R.Tw. analysed the resulting sequences to identify the PSs. N.P., S.J.W. and D.P.M. purified HBV VLPs. A.Z. provided reagents. All authors contributed to the writing and editing of the manuscript.

Corresponding authors

Correspondence to Reidun Twarock, Neil A. Ranson or Peter G. Stockley.

Ethics declarations

Competing interests

A.Z. is a co-founder and consultant of Assembly BioSciences. Research in the Zlotnick laboratory is supported by the NIH and Assembly. No Assembly BioSciences employee contributed to A.Z.'s contribution to this work.

Supplementary information

Supplementary Information

Supplementary Figures 1-5, Supplementary Tables 1-3 and Supplementary References. (PDF 4345 kb)

Rights and permissions

Reprints and permissions

About this article

Cite this article

Patel, N., White, S., Thompson, R. et al. HBV RNA pre-genome encodes specific motifs that mediate interactions with the viral core protein that promote nucleocapsid assembly. Nat Microbiol 2, 17098 (2017). https://doi.org/10.1038/nmicrobiol.2017.98

Download citation

Received: 18 August 2016
Accepted: 17 May 2017
Published: 19 June 2017
DOI: https://doi.org/10.1038/nmicrobiol.2017.98

This article is cited by

Molecular elucidation of drug-induced abnormal assemblies of the hepatitis B virus capsid protein by solid-state NMR
- Lauriane Lecoq
- Louis Brigandat
- Anja Böckmann
Nature Communications (2023)
An age-structured model of hepatitis B viral infection highlights the potential of different therapeutic strategies
- Farzad Fatehi
- Richard J. Bingham
- Reidun Twarock
Scientific Reports (2022)
Therapeutic interfering particles exploiting viral replication and assembly mechanisms show promising performance: a modelling study
- Farzad Fatehi
- Richard J. Bingham
- Reidun Twarock
Scientific Reports (2021)
In vitro functional analysis of gRNA sites regulating assembly of hepatitis B virus
- Nikesh Patel
- Sam Clark
- Peter G. Stockley
Communications Biology (2021)
Epidemiological Genetic Study for Novel World Records of Hepatitis B Virus Strains Detected by DNA Sequences in the South of Iraq/Al-Basrah Province
- Awatif H. Issa
- Hisham F. Mohammad
- Munaff J. Abd Al-Abbas
BioNanoScience (2021)