|
|
Efficacy of St. John’s Wort Extract WS 5570 in Major Depression: A Double-Blind, Placebo-Controlled Trial
OBJECTIVE: In a double-blind, randomized, placebo-controlled trial with 375 patients the authors investigated the antidepressant efficacy and safety of 300 mg t.i.d. of hydroalcoholic Hypericum perforatum extract WS 5570. METHOD: The study participants were male and female adult outpatients with mild to moderate major depression (single or recurrent episode, DSM-IV criteria). After a single-blind placebo run-in phase, the patients were randomly assigned, 186 to WS 5570 and 189 to placebo, after which they received double-blind treatment for 6 weeks. Follow-up visits were held after 1, 2, 4, and 6 weeks. The primary outcome measure was the change from baseline in the total score on the 17-item Hamilton Depression Rating Scale. In addition, analyses of responders (patients with at least a 50% reduction in Hamilton total score) and patients with remissions (patients with a total score of 6 or less on the Hamilton scale at treatment end) were carried out, and subscale/subgroup analyses were conducted. The design included an adaptive interim analysis performed after random assignment of 169 patients with options for group size adjustment or early termination. RESULTS: Compared to placebo, WS 5570 produced a significantly greater reduction in total score on the Hamilton depression scale and significantly more patients with treatment response or remission. It was more effective in patients with higher baseline Hamilton scores and led to global reduction of depression-related core symptoms, assessed with the melancholia subscale of the Hamilton scale. The placebo and WS 5570 groups had comparable adverse events. CONCLUSIONS: H. perforatum extract WS 5570 was found to be safe and more effective than placebo for the treatment of mild to moderate depression.
After decades of predominant reliance on synthetic antidepressants, the treatment of mildly and moderately severe forms of major depression with extracts from St. John’s wort (Hypericum perforatum) is becoming increasingly popular, with sales of $86 million in the U.S. market during 2000. Today, preparations based on H. perforatum extract are among the most widely prescribed drugs for depression in many European countries. The efficacy of drugs based on H. perforatum in alleviating mild to moderate depressive states was confirmed in comparisons with placebo and with effective synthetic standard antidepressants (e.g., imipramine, fluoxetine); reviews and meta-analyses have been conducted by Kim et al. (1), Linde et al. (2), and Volz (3). It has been claimed that H. perforatum is associated with fewer and less severe side effects than its active comparators (see references 1, 4–6). Despite the large body of published evidence supporting the efficacy of H. perforatum extract as an antidepressant, reviewers (1, 7) have identified serious design problems in existing studies and have criticized the meagerness of the database. Even in cases of a positive response with a classical scale such as the Hamilton Depression Rating Scale (8, 9), one may question whether the observed score changes reflect a true antidepressant effect. Therefore, quantitative and qualitative data for assessing the antidepressant efficacy of H. perforatum extract in relation to placebo are welcome. An aspect with potentially important clinical implications is the initial severity of the patient’s depression and its relationship to treatment efficacy. Laakmann and colleagues (10) investigated mildly to moderately depressed patients with a pretreatment total score on the Hamilton depression scale (17-item version) of 17 or higher, and they suggested that antidepressant treatment with H. perforatum extract was more efficacious for the more severely depressed subgroup (those with an initial total score on the Hamilton scale of 22 or higher). The aim of the present study was to compare the efficacy of H.
perforatum extract WS 5570 to that of placebo in a large group
of patients suffering from a mild to moderate major depressive episode
according to DSM-IV. In addition, particular attention was
paid to the efficacy observed for the core symptoms of depression, as
measured by the Bech melancholia scale (11),
and to the relationship between the initial severity of
depression and response to treatment.
This was a 6-week double-blind, placebo-controlled, randomized phase III trial comparing the efficacy of WS 5570, 300 mg t.i.d., and placebo. The investigation was conducted by the Hypericum Study Group between July 1997 and June 2000 in 26 clinical centers in France. The European Union’s Good Clinical Practice guidelines, the Declaration of Helsinki, and national regulatory and legal requirements (French Code of Public Health), including approval of the trial protocol by an independent ethics committee, were observed. After complete description of the study, written informed consent was obtained from all subjects. Centers and Subjects A patient was eligible for the study if he or she 1) was an outpatient aged 18 to 65 years at the time of screening, 2) provided written informed consent, 3) had a current major depressive episode of at least 2 weeks’ duration that met the criteria of DSM-IV code 296.21, 296.22, 296.31, or 296.32 (mild or moderate depression, single or recurrent episode), and 4) had a total score on the Hamilton depression scale between 18 and 25 and a score on item 1 ("depressed mood") of 2 or higher at screening and baseline. The reasons for exclusion were depression of any type other than those specified, any serious psychiatric disease other than depression, serious suicidal risk (score of 3 or higher on item 3 of the Hamilton depression scale), or response to placebo during the run-in phase; response was defined as a 25% or greater reduction of the Hamilton depression scale total score. Investigational Treatments The tablets containing placebo were indistinguishable from those containing WS 5570 in all aspects of their outward appearance. Procedure Measures of Efficacy and Safety In addition to the total score on the Hamilton depression scale, the melancholia subscore was analyzed. The melancholia subscale was derived from work by Bech and colleagues (11), who applied three formal psychometric criteria (calibration, ascending monotonicity, dispersion) to each item of the Hamilton depression scale. The additive combination of the six items that fulfilled all three criteria was suggested by the authors as a "valid subscale" of the Hamilton depression scale that primarily includes the items that measure the core symptoms of depression. The secondary analysis of the Hamilton scale also included an assessment of responder rates, which were determined as the percentage of patients in each treatment group whose total score on the Hamilton depression scale at the end of treatment was at least 50% lower than at baseline. Safety measures comprised physical examinations and laboratory tests
before and after double-blind treatment (glucose, sodium, potassium,
aspartate transaminase, alanine transaminase, Statistical Analysis The hypotheses were evaluated by using two-sample t tests. All other analyses were purely descriptive without adjustment for multiplicity. For the assessment of the course of change in Hamilton depression score during treatment, a repeated measures analysis of variance (ANOVA) was applied. To investigate the relationship between treatment efficacy and the severity of depression before the start of treatment, an explorative subgroup analysis was conducted for patients with an initial Hamilton depression scale total score of less than 22 and for those with a score of 22 or higher. The primary analysis of efficacy was based on the intention-to-treat
principle and included all randomly assigned patients. An
additional per-protocol analysis of the patients without
major protocol deviations was conducted for the primary
outcome measure. For all efficacy analyses, the last
observation was carried forward for patients who terminated
the trial prematurely. Safety analyses were based on all
patients who took at least one dose of the study medication
after random assignment.
Patient Accountability In the first part of the trial, 169 patients were randomly assigned to double-blind treatment (WS 5570: N=84; placebo: N=85), and 206 patients were randomly assigned in the second part after the interim analysis (WS 5570: N=102; placebo: N=104). Therefore, totals of 186 and 189 patients were randomly assigned to treatment with WS 5570 and placebo, respectively, and were included in the intention-to-treat analysis. After random assignment, 18 patients in the WS 5570 group (9.7%) and 25 in the placebo group (13.2%) terminated treatment prematurely. The primary reasons for early withdrawal were lack of efficacy (WS 5570: N=10, 5.4%; placebo: N=14, 7.4%), revocation of informed consent (WS 5570: N=4, 2.2%; placebo: N=7, 3.7%), and adverse events (WS 5570: N=2, 1.1%; placebo: N=2, 1.1%). The per-protocol analysis of all patients without major protocol violations included 164 patients in the WS 5570 group and 157 patients in the placebo group. The decisions with respect to the relevance of the protocol deviations were made before the code was broken. Baseline Characteristics
Efficacy
In the confirmatory hypothesis testing for the primary outcome measure for the first study stage, i.e., the interim analysis for the intention-to-treat data set, the null hypothesis relating to the difference between treatment groups in the decrease in total score on the Hamilton depression scale between baseline and day 42 was associated with a one-sided p value of p1=0.037 (t=1.80, df=167). Since this p value lies between the boundaries for early rejection and acceptance, the trial was continued with a second stage. The required number of subjects was reestimated on the basis of the results of the interim analysis. The group in the second stage showed a one-sided p value of p2=0.038 (t=1.78, df=204). Therefore, the product of p values for the final combination test fell below the critical limit (0.037·0.038=0.0014<0.0038), and so the null hypothesis was rejected, and the superiority of extract WS 5570 over placebo was demonstrated for a treatment duration of 6 weeks. For the pooled data from both study stages the one-sided p value for the change in Hamilton depression score between baseline and day 42 was p=0.02 (t=2.50, df=373). For the comparisons of the two treatment groups in terms of change from baseline in Hamilton scale total score at days 28, 14, and 7, the t test results were nonsignificant. A repeated measures ANOVA with independent variables of treatment and time and an interaction term was used to compare the postbaseline Hamilton depression scores of the two treatment groups and demonstrated a significant time-by-treatment interaction (F=3.41, df=4, 1492; Greenhouse-Geisser epsilon=0.58, two-sided Greenhouse-Geisser-corrected p=0.03). These results were confirmed in the per-protocol analysis, in which both treatment groups had the same mean decreases in Hamilton depression scale total score between baseline and day 42 as in the intention-to-treat analysis (t=2.31, df=319, p=0.02, two-sided t test). In the intention-to-treat study group, the percentage of responders
(those with at least 50% decreases in Hamilton score between baseline
and treatment end) was significantly higher for WS 5570
(52.7%, 98 of 186) than for placebo (42.3%, 80 of 189) ( A secondary outcome measure was the change in total score on the Montgomery-Åsberg Depression Rating Scale between baseline and treatment end. The mean decrease was 11.7 points (SD=9.0) for the WS 5570 group and 9.9 points (SD=9.2) for the placebo group (intention-to-treat analysis: t=1.90, df=373, p=0.06, two-sided t test). The depression subscore of the SCL-58 (11 items) showed a mean reduction of 7.9 points (SD=8.7) for the WS 5570 group and 6.5 points (SD=8.4) for the placebo group (intention-to-treat analysis: t=1.57, df=366, p=0.12, two-sided t test). Table
2 indicates the relationship between the initial severity of
depression and the magnitude of the treatment effect. Among the
patients receiving WS 5570, the difference in the decrease in
Hamilton depression scale total score between baseline and the
final visit was larger in the subgroup of patients with initial
scores equal to or above the median value of 22 points. Their
decrease was significantly greater than the decrease for the
patients receiving placebo (t=2.08, df=225, p=0.04, two-sided t
test), but the decrease for the patients with initial Hamilton depression
scores between 18 and 21 did not differ significantly from
that for the placebo group (t=1.50, df=146, p=0.14, two-sided t
test).
The score on the Bech melancholia subscale decreased by a mean of 5.5 points (SD=4.2) in the WS 5570 group and by 4.4 points (SD=4.1) in the placebo group (intention-to-treat analysis: t=2.60, df=373, p=0.001, two-sided t test). Safety and Tolerability
Both study drugs did not relevantly or systematically modify the
biological measures assessed—neither with regard to a
general trend nor on an individual patient basis.
This study demonstrates the antidepressant efficacy of 300 t.i.d. of H. perforatum extract WS 5570, as compared to placebo, for mildly to moderately depressed patients after a treatment duration of 6 weeks. The median difference in change in the Hamilton depression scale score between WS 5570 and placebo was 3.0 points, and the mean difference was 1.8 points. The percentage of responders in the group receiving H. perforatum was 52.7%, whereas the percentage in the placebo group was 42.3%; the difference of 10.4% was significant. This effect size is moderate, but the size of the effect observed in placebo-controlled phase III trials hardly reflects the real therapeutic potential. As observed by Montgomery (22), during recent years an increasing number of clinical trials testing the efficacy of new potential antidepressants have failed to demonstrate a difference from placebo even for established reference drugs (e.g., imipramine). The rise in placebo response rates has not been paralleled by a rise in drug response. Similar tendencies have been observed in trials for other psychiatric indications (e.g., obsessive-compulsive disorder, social phobia, panic disorder). Placebo-controlled phase III trials control for different sources of bias, making the therapeutic management artificial, e.g., no associated treatment in case of anxiety or insomnia, no dose adaptation depending on tolerance and efficacy. These methodological constraints are justified but likely to decrease the effect size of the active drug. On the other hand, the effect of the "therapeutic alliance" with the physician, including confidence, empathy, and helpful attitude, is increased by the awareness of the possible random assignment to a new innovative drug and the existence of weekly visits, thus increasing the placebo effect (23, 24). One of the strategies proposed to increase the difference between active drug and placebo was to recruit rather severely depressed patients (e.g., having a Hamilton depression scale score above 25), although it is not clear whether such a threshold effect is due to a limited placebo response in severely ill patients or to a mechanical effect yielding a larger difference between the evaluations before and after treatment. Therefore, the superiority observed in this study of mildly and moderately depressed patients was probably more difficult to evidence than it would have been with a group including more severely ill patients. In a recent well-designed and carefully conducted study (25), H. perforatum was compared to placebo and to a reference drug, and a significant effect was not demonstrated even for the reference compound. After reviewing the methods of the two studies, we consider only two major differences potentially relevant: our study had a small number of patients at each center, but all patients recruited were spontaneously seeking care. The difference in care-seeking behavior, reflecting a subjective perception of severity by the patient, is likely to explain the better detection of an advantage of active drug over placebo. A low dropout rate, possibly due to a low rate of adverse events during the first weeks of treatment, may have contributed to this positive result. The existence of a run-in phase was obviously of limited value, as already claimed by others (22, 26, 27). However, as suggested early on by Paykel et al. (28), we observed that severity did have an impact on outcome. The difference between treatment groups in the change from baseline in Hamilton depression score was significant for the subgroup of patients with baseline scores of 22 or higher but not for those with baseline scores less than 22, confirming the results of Laakmann et al. (10). These authors found in the H. perforatum group (extract WS 5572) responder rates of 50% for those with baseline Hamilton depression scores of 22 or higher and 49% for all patients, compared to 25% and 33%, respectively, in the placebo group. The significant effect on the Bech melancholia subscale score and a larger effect in more severely ill patients suggest a true antidepressant effect of H. perforatum. The results observed with the Montgomery-Åsberg Depression Rating Scale were consistent with those of the Hamilton depression scale and shortly missed significance (p=0.06). The duration of the study was 6 weeks because this duration was recommended for placebo-controlled phase III studies of acutely depressed patients. Greater improvement in patients receiving active drug than in those receiving placebo is commonly observed thereafter. Indeed, the improvement shown by the H. perforatum group in Figure 1 has not yet reached a plateau. In the present trial, the two withdrawals related to adverse events in the H. perforatum group were attributable to worsening depressive symptoms due to a lack of efficacy in these particular patients, rather than to problems regarding tolerability. The herbal extract was well tolerated by all 186 patients who received it, and tolerability-related withdrawal from treatment was not indicated in a single case. Our data therefore contribute to the overall favorable safety assessment of preparations from H. perforatum extract. In conclusion, this study demonstrates the existence of an
antidepressant effect of H. perforatum in mildly and
moderately depressed patients.
Received July 26, 2001; revision received April 4, 2002; accepted March 27, 2002. From the Unité Institut National de la Santé et de la Recherche Médicale 302, Hôpital Pitié Salpêtrière; the Centre Hospitalier Spécialisé en Psychiatrie de Pontorson, Pontorson, France; the Centre Hospitalier Spécialisé en Psychiatrie de La Chartreuse, Dijon, France; and the Biometrical Department, Dr. Willmar Schwabe Pharmaceuticals, Karlsruhe, Germany. Address reprint requests to Dr. Lecrubier, Unité INSERM 302, Hôpital Pitié Salpêtrière, Pavillon Clérambault, 75013 Paris, France; lecru@ext.jussieu.fr (e-mail). Sponsored by Dr. Willmar Schwabe Pharmaceuticals, Karlsruhe, Germany.Hypericum Study Group: D. Arnoux, G. Aspe, J.J. Ausseill, C. Bagot, A. Benoit, P. Bern, D. Bonnaffoux, B. Bonnet Guerin, J. Charbaut, J.Y. Charlot, C. Claden, G. Clerc, H. de Verbigier, R. Didi, M. Faure, F. Gheysen, S. Guibert, P. Khalifa, P. Le Goubey, D. Liegaut, Y. Mouhot, A. Pargade Moradell, L. Rochard, G. Saint Mard, H. Sauret, and J.R. Zekri.
Click here to read about the most researched St. John's Wort formulation in America. Click here to read about other natural antidepressant/emotional mood balancers
|