Clinical Evidence of Acupoint Stimulation for Primary Dysmenorrhea: A Systematic Review and Updated Meta-Analysis

Introduction

Primary dysmenorrhea (PD) is a widespread condition distinguished by the recurrent occurrence of menstrual pain in the absence of any detectable pelvic pathology, according to established guidelines.1 This condition affects a substantial portion of young women, ranging from approximately 71% to 94%, with a notable proportion exceeding 30%, enduring moderate-to-severe menstrual pain.2–5 Furthermore, approximately 4.6% of PD cases result in functional disability during the critical period of young adulthood.3 The repercussions of menstrual pain transcend mere personal discomfort, which significantly impedes the daily routines of approximately 15.5% of working women and disrupts the educational endeavors of over 10–40% of students. Consequently, this translates into an average productivity loss of 33% and an annual cumulative total of 8.9 lost productive days.3,4,6,7 Moreover, PD-induced central sensitization heightens individuals’ susceptibility to functional pain disorders such as irritable bowel syndrome and low back pain.8 Even more concerning, dysmenorrhea can exacerbate the prevalence of sleep disorders and elevate the risk of mental health issues including anxiety, depression, and attention deficit hyperactivity disorder (ADHD),9–11 with extreme cases leading to suicidal behavior.12

Currently, Medical treatment remains the mainstream approach for PD, encompassing both non-hormonal pharmacotherapy and hormonal interventions. Nonsteroidal anti-inflammatory drugs (NSAIDs) serve as the primary non-hormonal pharmacotherapy for PD. However, despite some authors advocating the use of NSAIDs before the onset of menstrual pain, irregular adherence to medication can result in the development of NSAID-resistant dysmenorrhea, with 18% of women with dysmenorrhea reporting ineffectiveness of these drugs.13 Furthermore, the gastrointestinal adverse reaction rate associated with NSAIDs can be as high as 32%, significantly hindering their widespread use.14 Hormone therapy is another medical treatment option; however, due to adverse reaction and cultural factors, fewer patients have reported the use of combined oral contraceptives (COC) for non-contraceptive purposes.15 COCs act by suppressing ovulation and lessening the endometrial lining of the uterus, which may lead to irregular uterine bleeding. They are also unsuitable for patients with fertility needs.16 In addition to pharmacological treatments, evidence-based guidelines recommend complementary management including exercise, Transcutaneous Electrical Nerve Stimulation (TENS), acupoint stimulation, behavioral interventions, topical heat, and dietary supplements, with surgery considered for refractory cases.1

As an effective alternative therapy for PD patients seeking non-pharmacological options,17 acupoint stimulation may exert its effects through: (1) Regulating the release of prostaglandins to relieve excessive uterine contraction;18 (2) Improving uterine microcirculation;19 (3) Regulating uterine oxidative stress response;20 (4) Promoting the release of central analgesic substances.21 This therapeutic effect is also rooted in the meridian theory of Traditional Chinese Medicine. Acupoint stimulation can restore the balance of yin and yang in the body, thereby optimizing the functions of the internal organs.22

Research on acupuncture and acupoint stimulation for PD has been growing; however, the findings have been inconsistent and contradictory. A Cochrane review published in 2011 tentatively concluded that acupuncture was more effective than placebo or Chinese herbs in alleviating menstrual pain.23

However, subsequent meta-analyses have expressed reservations, noting that current low-quality randomized controlled trials (RCTs) and single well-designed studies are insufficient to conclusively demonstrate the efficacy of acupuncture.24 Furthermore, recently published RCTs on acupoint stimulation were not included in previous systematic reviews. For instance, a systematic review of RCTs on acupoint stimulation for PD conducted in 2012 had several methodological shortcomings, including a lack of registration, unclear inclusion criteria for participants, and no assessment of quality of life.25

In light of the growing number of RCTs on acupoint stimulation for PD and the subsequent demand for nonpharmacological interventions, we conducted a comprehensive systematic review and meta-analysis of the existing evidence to guide clinical practice. Our specific research questions were as follows: (1) Is acupoint stimulation therapy effective in the management of PD? (2) Subgroup analysis was conducted to assess the efficacy of invasive and non-invasive acupoint stimulation compared with sham acupuncture or NSAIDs in the treatment of PD.

Methods

The PROSPERO registration number of this study is CRD42024586857 and complies with the PRISMA declaration requirements.

Search Strategy

We searched PubMed, Excerpta Medica Database (Embase), Allied and Complementary Medicine Database (AMED), Cochrane Central Register of Controlled Trials (CENTRAL), Web of Science, Scopus, VIP Database, China National Knowledge Infrastructure (CNKI), and Wanfang Database spanning up to May 2024. The search strategy was developed by building on the methodologies employed in previous similar reviews. Keywords were searched from both the medical subject headings (MeSH) and Chinese Biomedical Literature Service System (Sinomed) databases. The search strategy included the MeSH terms in combination with free words. The searched MeSH terms were as follows: Acupuncture; Acupuncture Therapy; Acupuncture Analgesia; Acupuncture Points; Acupuncture, Ear; Moxibustion; Acupressure; Electroacupuncture; Dysmenorrhea. Examples of the search formulas are provided in Appendix 1. To avoid overlooking any relevant studies, we did not apply any filter. For incomplete data, we contacted the authors to either exclude these studies or obtain missing information.

Study Selection

Published and unpublished studies were included if they met the following PICOS (participants, intervention, comparison, outcome, study type) criteria:

Participants

Inclusion criteria included:

1. Female participants were of fertile age, had no history of childbirth, and were below 40 years of age.

2. Regular menstrual cycles ranging from 21 to 35 days (28±7 days).

3. A history of primary dysmenorrhea lasting for at least one year.

4. Exclusion of secondary dysmenorrhea confirmed by imaging such as B-ultrasound.1

Exclusion criteria included:

1. Participants with compromised medical conditions or diagnosed with mental illnesses.

2. Participants suffering from other gynecological disorders.

Interventions

Inclusion criteria included all kinds of commonly used acupoint stimulation methods (including manual acupuncture, electroacupuncture, ear acupuncture, acupressure, and transcutaneous electrical acupoint stimulation (TEAS)), or a combination of the above.

The exclusion criteria were rare treatment methods or interventions that incorporated other therapies in addition to acupoint stimulation.

Comparison

Placebo controls (sham acupuncture), blank controls (no acupuncture, usual care, or health education), or clinically recommended treatments (oral medicine) without acupoint stimulation.

Outcome Measures Primary Outcomes

Visual Analogue Scale (VAS), effective rate and adverse reaction rate.

Secondary outcomes: dysmenorrhea symptom scores, Cox Menstrual Symptom Scale (CMSS), Pittsburgh Sleep Quality Index (PSQI), uterine hemodynamics, and serum prostaglandin contents.

Other Outcomes

Resting-State Scale (RSS), Verbal Rating Scale (VRS), Short Form - 36 Health Survey (SF-36), total pain duration, Self-Rating Anxiety Scale (SAS) and all reported outcomes.

Detailed descriptions of primary and secondary outcome variables, including reliability and validity of the scales, are provided in Appendix 3.18,26–29

Study Type

Inclusion criteria included randomized controlled trials (RCTs) in English or Chinese.

The exclusion criteria were studies of other types or languages, non-randomized allocation methods, animal experiments, and those with conflicts of interest.

Outcome Measures and Data Extraction

Title and abstract screening were independently conducted by two reviewers, and discrepancies were resolved through a consensus between them. Full-text screening for the inclusion of eligible studies was completed by two independent reviewers, and discrepancies were resolved through consensus. In case of any disagreement, a third researcher was consulted for the final adjudication. Following the completion of the screening process, the researchers extracted all data including the first author’s name, year of publication, sample size, age of the participants, intervention measures, duration of treatment, control measures, and outcome measures. Short-term efficacy was evaluated within one day of treatment, whereas long-term efficacy was assessed after at least one menstrual cycle of treatment.

Quality Assessment

For each time point and each outcome eligible for meta-analysis of each study, we assessed the risk of bias using the revised Cochrane risk of bias, version 2 (RoB2) tool.30 Two review authors independently judged whether each risk-of-bias criterion was adequately met. We used the Grade of Recommendations Assessment, Development and Evaluation (GRADE; Grade Pro version 3.6.1) to assess the quality of the evidence, which was categorized as high, moderate, low, and very low. We used Kappa scores to assess interrater agreements.

Data Synthesis and Statistical Analysis

Statistical analyses were conducted using Review Manager 5.4 and R studio 4.4.1. Considering the variety of treatment and duration, most effect pooling was conducted using a random effects model and presented with forest plots. For dichotomous variables, Relative Risk (RR) was used as the outcome indicator with Mantel-Haenszel method. For continuous outcomes, between-group differences were compared based on post-intervention values. The Mean Difference (MD) was used via inverse-variance. In cases where the units differed, Standardized Mean Difference (SMD) was adopted. We used Cohen’s classification of SMD to assess the effect as small (SMD < 0.5), medium (SMD 0.5 to 0.8), or large (SMD > 0.8).31 When judging the effect size of pain intensity, we adopted a clinically important difference of 15 points on a 0 to 100 scale, 2 points on a 0 to 10 scale, or 30% change of the scores between two comparison groups.32 RR < 1.25 or RR > 0.8 was considered clinically significant. The level of significance was set at α = 0.05.

Heterogeneity between studies was based on the results of I2, and an I2 > 50 was considered to indicate high heterogeneity. Sensitivity, subgroup, and meta-regression analyses were performed to reduce heterogeneity. A sensitivity analysis was conducted using the leave-one-out method for each outcome. Results were considered unstable if exclusion of any single study altered the direction of the pooled effect size. For unstable cases, we calculated and reported all leave-one-out iterations in Appendix 2 to identify potential sources of heterogeneity. For outcomes lacking standard deviation (SD) data, estimated SDs were used exclusively in sensitivity analyses to verify the robustness of primary results, with all such cases clearly flagged in supplementary materials. To explore potential sources of heterogeneity in cases where the sensitivity analysis suggested instability, we performed meta-regression analyses. Meta-regression analyses primarily based on treatment sessions.

For small samples (no more than 3) with poor stability, a Bayes analysis was performed. We chose a half-normal distribution with a shape parameter of 1.0, as the prior distribution when the heterogeneity was large (Tau2 > 0.4), whereas when the heterogeneity was small (Tau2 < 0.4), we selected a half-normal distribution with a shape parameter of 0.3 as the prior distribution.33

For meta-analyses that included more than 10 studies, funnel plots were used to detect publication bias, whereas Egger’s test was employed to detect potential publication bias.34 The trim and fill method were used to adjust for publication bias.

Trial Registration

This systematic review was registered in the PROSPERO International Prospective Register of Systematic Reviews (registration number CRD42024586857). Preferred reporting items were used for systematic reviews and meta-analyses.

Protocol Amendments

All pre-specified outcomes from our registered protocol were reported. Outcomes with single-study data were quantitatively presented following PRISMA guidelines.34

Results Study Selection

Searches were performed on May 20, 2024, resulting in 9168 returns (PubMed: 309; Embase: 189; AMED: 383; CENTRAL: 385; Web of Science: 333; Scopus: 777; CNKI: 2400; WanFang: 2636; VIP: 1756). After excluding duplicate articles and briefly reviewing the titles and abstracts, 810 articles remained and the full texts were read. Twenty-six articles were initially selected based on the inclusion and exclusion criteria; a total of 26 articles were initially selected. However, three of these articles were excluded because of their low quality (two of them neglected the outcome indicators,35,36 one had an error in the sample size of the outcome,37 and one trial had a duplicate reference,38 resulting in the final inclusion of 22 articles.39–60 A flow diagram representing the flow of studies through the selection process is shown in Figure 1.

Figure 1 Flow diagram.

Study Characteristics

Twenty-two RCTs involving a total of 1955 participants (median of 89, with a range from 44 to 216) met our updated selection criteria. The characteristics of the included studies are presented in Tables 1 and 2, respectively. Four studies were published in English44,50,56,59 while the other 18 studies were published in Chinese. Three trials were published between 2002 and 2014,44,50,51 and 19 trials were published between 2014 and 2024. Sixteen studies were two-armed, five were three-armed48,50,55,59,60 and one was four-armed.44 We attempted to contact the authors of one trial due to a lack of detailed data,44 yet we did not obtain additional information from them. All trials used parallel-group designs; there were no crossover studies; 21 trials were performed in a single center, and one was a multi-center trial.44

Table 1 Characteristics of Included Studies

Table 2 Detailed Characteristics of Included Studies

All participants were young, nulliparous females. The participants were 14–35 years old. Ten studies included minors under 18 years of age, whereas 12 did not. Acupoint stimulation interventions tested in the included trials varied significantly. Electroacupuncture was used in one trial,44 manual acupuncture was used in nine trials, including body acupuncture,40,55–57,60 abdominal acupuncture,43,47 auricular acupuncture48 and buccal acupuncture;59 moxibustion was used in eight trials, including herb-partition moxibustion,41,47,52 heat sensitive moxibustion,49 suspended moxibustion,42,51,55 ginger-partition moxibustion,53 long snake moxibustion,60 warm needle was used in one trial,54 acupressure was used in one trial,50 staging acupoint catgut embedment was used in one trial,39 transcutaneous electrical acupoint stimulation was used in one trial,46 and auricular plaster was used in two trials.45,48 Eighteen trials standardized acupuncture treatments (all participants were treated at the same points) and four semi-standardized treatments (either all participants were treated at some basic points and additional individualized points, or there were different predefined needling schemes depending on symptom patterns).39,56,58,59 Three trials chose to use the Sanyinjiao (SP6) acupoint alone.40,42,44 The most frequently used acupoints were Sanyinjiao (SP6, 13 trials) and Guanyuan (CV4, 11 trials). An overview of the acupoints that have been used more than once is presented in Figure 2. The course of most trials was three menstrual cycles, whereas two trials only treated participants for one menstrual cycle.44,50 Two trials were followed up for one menstrual cycle;47,48 six trials were followed up for three menstrual cycles39,43,46,49,57,58 and one trial was followed up for six menstrual cycles.59 Nineteen trials conducted prophylactic treatment before the onset of menstrual pain, while three trials initiated treatment at the start of menstrual pain.40,42,54

Figure 2 Acupoints used more than 2 times.

The comparator interventions tested in the included trials varied significantly between studies. Fourteen trials compared acupoint stimulation with NSAIDs treatment using ibuprofen,39,43,47,50,51,53–57,60 compound paracetamol41,59 or meloxicam.52 Two trials compared acupoint stimulation with Chinese medicine using Yuanhuzhitong tablets45 or Yueyueshu Granule.49 The techniques varied considerably in the six trials with sham controls. Two trials used “placebo” needles (telescopic needles with blunt tips not penetrating the skin).40,48 Two trials stimulated non-acupoints,44,50 and one trial stimulated acupoints unrelated to the PD.44 Another trial used a lactose pill (similar to the analgesic pill).59 Four trials included a group that received treatment for acute attacks only or routine care.31,34,37,42,44,46,58

Methodological Quality Assessment

The risk of bias in the included studies is shown in Figures 3 and 4, respectively. Most of the articles employed intention-to-treat (ITT) analysis, while 5 studies utilized per-protocol (PP) analysis.42,43,46,55,57 All 22 studies mentioned randomization. Fourteen trials used random number tables,39,40,42,43,45,47,52,53,55–60 while two trials used a computer-generated sequence.44,49 One trial used block randomization,48 one trial used simple randomization54 and one trial used “Envelope method”.51 Three trials were assessed as “some concerns” for the non-reported detailed method.41,46,50 Five studies reported appropriate allocation concealment40,44,49,56,59 using sealed envelopes, computer programs, and central telephone controls, which were considered to have a “low risk of bias”. However, most studies did not mention the allocation concealment strategy. Owing to the difficulty of blinding both participants and practitioners, only five studies tried blinding to avoid bias.40,42–44,56 The lack of blinding may introduce bias in subjective outcomes such as VAS scores; therefore these outcomes were rated as having “some concerns” in the “measure of the outcome” domain. Three studies had unclear definitions of effective rate, resulting in a high risk of bias.51,52,54 Most studies had no missing data or similar numbers or reasons for dropout. However, one article adopted the per-protocol (PP) approach but did not report the detailed reasons of drop-outs, so that was considered as “high risk of bias”.43 Only a few RCTs completed clinical registration,44,46,48,59 yet one article reported outcomes not mentioned in its protocol.48 The remaining articles lacked publicly available protocols to control for publication bias, thus receiving a “some concerns” rating.

Figure 3 Risk of bias summary.

Figure 4 Risk of bias graph.

A formal grading of evidence was conducted using the GRADEpro 3.6.1 to provide an overall level of evidence. The GRADE certainty of the evidence mainly ranged from low to moderate, owing to high risk of bias and high heterogeneity. It is presented in Table 3. The Kappa scores for the RoB2 bias domains ranged from 0.483 to 0.963, with an overall bias Kappa score of 0.831. The Kappa score for GRADE assessments was 0.899. Overall, the inter-rater agreement between the two reviewers was good.

Table 3 GRADE Evidence Profile for Outcomes

Primary Outcome Comparison with No Treatment or Sham Treatment

Eight trials that compared acupoint stimulation with a control group that received either no treatment or sham treatment were clinically heterogeneous. Five trials included a group that received a sham treatment. Two trials used “placebo” needles (telescopic needles with blunt tips not penetrating the skin).40,48 One trial used sham acupressure stimulating a non-acupoint.50 Another trial performed a four-armed trial (n = 194), including control groups that stimulated non-related acupoints (Xuanzhong, GB39), stimulating non-acupoints, and just waiting.44 Three trials included a group receiving emergency medication (ibuprofen46,58 or aspirin44) only when pain was unbearable (VAS ≥ 8), whereas the control group of one trial only received health education.42 Nevertheless, the findings show that acupoint stimulation is associated with a significant reduction in VAS scores compared to no treatment or sham treatment.

After one menstrual cycle of treatment (Figure 5A), among the four trials the pooled random-effects standardized mean difference (SMD) was −2.46 (95% CI −4.34 to −0.58, 326 participants), indicating a significant reduction in VAS scores (P = 0.01).48,50,58,59 Heterogeneity was high (I2 = 97%) and unstable, possibly owing to the different measures of each control group. Among these, Xu SW 2014’s control group received no treatment, so the effect size was larger.58 The revised results are unstable.

Figure 5 Forest plot of comparison: Acupoint stimulation vs no treatment or sham treatment, outcome: VAS (A) after one menstrual cycle; (B) after two menstrual cycles; (C) after three menstrual cycles; (D) after following three menstrual cycles; (E) after following three menstrual cycles (revised); (F) 30 minutes after the first session.

After two menstrual cycles of treatment (Figure 5B), the meta-analysis of pain intensity yielded a pooled SMD estimate of −5.34 (95% CI −8.96 to 1.73, P = 0.004; 182 participants), indicating a statistically significant reduction in VAS scores for participants in the acupoint stimulation group compared to those in the sham treatment or no treatment group.48,58,59 The heterogeneity was high (I² = 95%), suggesting considerable variation. The sensitivity analysis indicated that the results were stable.

After three menstrual cycles of treatment (Figure 5C), a meta-analysis of six studies yielded a pooled SMD estimate of −2.96 (95% CI −4.39 to −1.53, P < 0.0001, I2 = 97%, 453 participants), indicating a statistically significant reduction in VAS scores for participants in the acupoint stimulation group compared to those in the placebo group or no treatment group.42,46,48,50,58,59 The sensitivity analysis confirmed that the results were stable.

Two studies assessed VAS scores for one menstrual cycle after the end of treatment. One showed that compared to the placebo group, both auricular intradermal needling (P < 0.001) and auricular point sticking (P < 0.05) were more effective.48 Another showed that acupuncture was associated with a significant reduction in VAS scores compared to no treatment (t = −10.69, P < 0.01).58 Three studies were followed for three menstrual cycles (Figure 5D).46,58,59 There was a trend toward lower acupoint stimulation scores, but the sensitivity analysis indicated that the results were unstable (Appendix 2A). The source of high heterogeneity may be the proportion of unreported use of emergency medications. We excluded Xu SW, 2014 (Figure 5E), resulting in a revised SMD estimate of −1.36 (95% CI −2.13 to −0.59, P = 0.0005, I2 = 68%, 107 participants).

Immediately after the first session, a trial reported that acupoint stimulation had a trend toward lower VAS scores than placebo treatment (F=87.12, P < 0.001).59 Two trials compared the VAS scores of acupoint stimulation and sham treatment 30 minutes after the first session.40,59 The meta-analysis (Figure 5F) yielded a pooled MD estimate of −3.64 (95% CI −5.65 to −1.63, P = 0.0004, I2 = 92%, 107 participants), indicating a statistically significant reduction in pain intensity for participants in the acupoint stimulation group compared with those in the control group. The sensitivity analysis indicated that the results were stable Another trial that did not mention standard deviation (SD) reported that repeated-measures analysis of variance at baseline, 5, 10, 30, and 60 minutes following the start of the first intervention revealed a treatment effect (P < 0.001) and a treatment * time interaction (P < 0.001), but no effect of time (P = 0.267).44 The estimated SD was used as a sensitivity analysis with unstable results (Appendix 4), highlighting the need for more precise measurements and additional high-quality evidence to confirm the short-term efficacy.

Only two trials, including a control group that received sham treatment or no treatment, reported effective rates. One trial reported the effective rate of acupoint stimulation in sham treatment groups.48 The total effective rates of the auricular intradermal needling group and auricular point sticking group were 93.3% and 80.0%, respectively, which were higher than that of the placebo group (P < 0.05). A trial reported that the effective rate of the acupuncture group was 100%, whereas the effective rate of the control group was not mentioned.58

Comparison with Oral Medicine

Sixteen trials that compared acupoint stimulation with a control group receiving oral medical treatment were clinically heterogeneous. Among these, fourteen trials used NSAIDs, including using Ibuprofen,39,43,47,50,51,53–57,60 Compound paracetamol41,59 or Meloxicam.52 Sustained-release ibuprofen capsules were the most frequently used, with the most common dose being 300 mg twice a day. Previous research has suggested that NSAIDs are usually not required for more than two or three days if effective treatment is initiated with the onset of bleeding and/or associated symptoms.1 However, one trial included a control group receiving ibuprofen sustained-release capsules only once per cycle, which may have exaggerated the effect in the intervention group.54 Another trial started using ibuprofen sustained-release capsules five days before the onset of menstruation, lasting six days.56 The prolonged treatment may have increased the rate of adverse reactions in the control group.

The meta-analysis compared the VAS scores of the acupoint stimulation and medicine groups after treatment and during the follow-up period, showing that acupoint stimulation may have a more rapid and long-lasting effect, especially in RCTs with invasive stimulation.

After one menstrual cycle of treatment (Figure 6A),43,47,50,57,59 non-significant differences between the two groups are present for acupoint stimulation and NSAIDs with a SMD of −0.29 (95% CI −1.18 to 0.61, P = 0.53, I2 = 95%, 427 participants).

Figure 6 Continued.

Figure 6 Forest plot of comparison: Acupoint stimulation vs oral medicine, outcome: VAS (A) after one menstrual cycle; (B) after two menstrual cycles; (C) after three menstrual cycles; (D) non-invasive stimulation after three menstrual cycles.

After two menstrual cycles of treatment, the VAS scores were not significantly different (SMD = −0.63, 95% CI −1.61 to 0.35, P = 0.21, I2 = 93%, 283 participants; see Figure 6B). 43,47,57,59 However, the sensitivity analysis indicated that the results were unstable (Appendix 2B), possibly because of variations in acupoints. Subgroup analysis and meta-regression failed to reduce heterogeneity.

After three menstrual cycles of treatment, as shown in Figure 6C, acupoint stimulation was more effective than NSAIDs (nine RCTs, 699 participants, SMD = −1.58, 95% CI −2.43 to −0.73, P = 0.0003, I2 = 96%)43,47,50,53,55–57,59,60 and Chinese medicine (two RCTs, 141 participants, SMD = −1.39, 95% CI −2.18 to −0.60, P = 0.0006, I2 = 78%).45,49 Sensitivity analysis confirmed that the results were stable Compared with medical treatment (all NSAIDs), invasive acupoint stimulation was more effective (seven RCTs, 465 participants, SMD = −1.41, 95% CI −2.11 to −0.72, P < 0.00001, I2 = 91%).43,47,55–57,59,60 As for non-invasive acupoint, there was a trend toward lower VAS scores for non-invasive acupoint stimulation with a SMD of −1.75 (5 RCTs, 465 participants, 95% CI −2.73 to −0.11, P = 0.03, I2= 97%),45,49,50,53,55 but the sensitivity analysis indicated that the results were unstable Subgroup analysis (Figure 6D) showed that non-invasive acupoint stimulation reduced VAS scores more than Chinese medicine after treatment (two RCTs, 141 participants; SMD = −1.53, 95% CI −2.73 to −0.32, P = 0.01, I2=91%),45,49 yet the result did not meet the threshold for clinical significance. Moreover, compared to NSAIDs, the difference were not significant (three RCTs, 324 participants, SMD = −0.76, 95% CI −2.18 to 0.67, P = 0.30, I2=96%).50,53,55 Sensitivity analysis confirmed that the results were stable.

Two studies reported VAS scores after one menstrual cycle of follow-up (all treatment courses were 3 menstrual cycles). Results showed that (Figure 7A) after one menstrual cycle of follow-up, acupoint stimulation were still more effective than NSAIDs (SMD = −3.03, 95% CI −5.16 to −0.90, P = 0.005, I2 = 94%, 160 participants).43,47 The sensitivity analysis indicated that the results were stable After three menstrual cycles of follow-up (Figure 7B), acupoint stimulation was more effective than medicine (SMD = −3.74, 95% CI −5.57 to −1.90, P < 0.00001, I2 = 93%, 183 participants).43,57,59 One trial followed up for six menstrual cycles, indicating that at six menstrual cycles follow-up, the differences in VAS scores between the observation (buccal acupuncture) and control (NSAIDs) groups were −3.92, which were statistically significant (P < 0.01).59

Figure 7 Forest plot of comparison: Acupoint stimulation vs oral medicine, outcome: VAS (A) after following one menstrual cycle; (B) after following three menstrual cycles; (C) immediately / 5 minutes after the first session; (D) 30 minutes after the first session; (E) 30 minutes after the first session (revised).

Immediately (within five minutes) after the first session, meta-analysis of VAS scores of two studies (Figure 7C) yielded a pooled MD estimate of −2.85 (95% CI −4.06 to −1.64, P < 0.00001, I2 = 86%, 123 participants),57,59 indicating a statistically significant reduction in VAS scores for participants in the acupoint stimulation group compared to those in the NSAIDs group instantly. The sensitivity analysis indicated that the results were stable.

Three trials compared the VAS scores of acupoint stimulation and medicine after 30 minutes of the first session.54,57,59 Seemingly, non-significant differences between the two schedules were observed for acupoint stimulation and medicine (Figure 7D), but the sensitivity analysis indicated that the results were unstable (Appendix 2C). Shi ZH was excluded because the treatment received by the control group in this trial was irregular, which resulted in greater heterogeneity. The revised results indicated a trend toward lower VAS scores for acupoint stimulation (two RCTs, 123 participants; SMD = −0.50, 95% CI: −0.73, −0.26, P < 0.0001, I² = 0%, see Figure 7E), yet the result did not meet the threshold for clinical significance.54

Shi Z H, 2017 reported VAS scores at other points.54 The treatment group showed significant differences from the control group at all time points after 30 min of treatment (P < 0.05), with a faster onset (15 min vs 30 min) and an earlier peak analgesic effect (30 min vs 60 min), both lasting up to 180 min.

After treatment, 346 (89.18%) of the 388 participants receiving acupoint stimulation were more effective than 255 (72.24%) of the 353 participants receiving oral medicine treatment. The efficacy rate of the NSAIDs was 70.03%. As shown in Figure 8A, both invasive stimulation (6 RCTs, 396 participants; SMD = 1.29, 95% CI: 1.05 to 1.58, P = 0.02, I² = 73%)39,47,54–56,60 and non-invasive stimulation (5 RCTs, 375 participants; RR = 1.14, 95% CI: 1.02 to 1.29, P = 0.02, I² = 55%)41,49,51,52,55 were more effective than oral medicine. The efficacy rates of invasive and noninvasive stimulation were 87.89% and 90.53%, respectively. However, the results of the comparison between the non-invasive and medicine groups were not clinically significant. A study reported the efficacy rate in the follow-up period, which showed that the long-term efficacy of staging acupoint catgut embedding was superior to that of ibuprofen (P < 0.01).39

Figure 8 Continued.

Figure 8 Forest plot of dichotomous variables (A) comparison: Acupoint stimulation vs oral medicine, outcome: effective rate; (B) comparison: Acupoint stimulation vs control, outcome: adverse reaction rate; (C) adverse effective rate (revised).

Safety of Acupoint Stimulation

Six trials compared the adverse reaction rates of acupoint stimulation and control.43,44,49,56,57,59 The main adverse reactions to acupoint stimulation include faintness44 and subcutaneous hemorrhage43,56 while those of the control groups include gastrointestinal irritation43,56,57 and dizziness.57 The adverse reaction rate for acupoint stimulation was 4.74% (10 of 211).

Because there was data with a value of 0 in the two sets of studies, we conducted a continuity correction. There was a significant difference between the two group (Figure 8B), with an RR of 0.50 (95% CI 0.27 to 0.95, P = 0.0332, I2 = 0), but the sensitivity analysis indicated that the results were unstable (Appendix 2D). The main source of heterogeneity was reported by Wang HJ, 2019).56 We found that the number of adverse reactions reported in the control group (oral administration of ibuprofen, 300 mg/bid, from 5 days before menstruation until menstruation began) was significantly higher than that in the other groups (17 cases, all gastrointestinal reactions caused by ibuprofen). It is speculated that this bias may be due to longer treatment duration and excessive dosage. Therefore, this study was excluded from analysis.

A revised meta-analysis reported that non-significant differences between the two groups were present for acupoint stimulation and control (Figure 8C, with an RR of 0.94 (95% CI 0.26 to 3.33, P = 0.9204, I2 = 0).43,44,49,57,59

Secondary Outcomes Objective Indicators Serum Prostaglandin Contents

Three trials compared the serum levels of Prostaglandins E2 (PGE2) in the acupoint stimulation and control groups.47,48,60 As shown (Figure 9A), participants in the acupoint stimulation group had significantly higher PGE2 levels (SMD = 0.90, 95% CI 0.63 to 1.18, P < 0.00001, I2 = 0%, 220 participants). The sensitivity analysis indicated that the results were stable.

Figure 9 Forest plot of objective indicators (A) comparison: Acupoint stimulation vs control, outcome: PGE2; (B) comparison: Acupoint stimulation vs control, outcome: PGF2α; (C) comparison: Acupoint stimulation vs control, outcome: RI; (D) comparison: Acupoint stimulation vs control, outcome: PI.

Five trials compared the serum levels of PGF2α.47,48,51,57,60 The results (Figure 9B) showed that participants in the acupoint stimulation group had moderately lower Prostaglandins F2α (PGF2α) levels (SMD = −0.69, 95% CI −1.04 to −0.34, P = 0.0001, I2 = 63%, 375 participants). The sensitivity analysis indicated that the results were stable.

Uterine Hemodynamics

Two trials compared the uterine hemodynamics of acupoint stimulation and controls.46,52 The results showed that after treatment in the acupoint stimulation group, the uterine artery resistance index (RI) (Figure 9C, MD = −0.04, 95% CI −0.07 to −0.02, P < 0.0001, I2 = 0%, 181 participants) and pulsatility index (PI) (Figure 9D, MD = −0.37, 95% CI −0.57, −0.16, P = 0.0005, I2 = 0%, 181 participants) were significantly reduced compared to those in the control group, indicating that acupoint stimulation could improve uterine hemodynamics. Sensitivity analysis indicated that the results were stable.

Participant-Reported Outcomes CMSS

Three trials compared the total CMSS scores of the acupoint stimulation and control groups.48,59,60 The meta-analysis (Figure 10A) yielded a pooled MD estimate of −2.88 (95% CI −4.80 to −0.97, P = 0.003, I2 = 73%, 168 participants), but the sensitivity analysis indicated that the results were unstable Treatment sessions of three trials were 8, 4, 8.5. The results of the meta-regression (Figure 10B) indicated that the total CMSS scores of CMSS significantly correlated with the treatment sessions (P < 0.0001).

Figure 10 Continued.

Figure 10 Forest plot of participant-reported outcomes (A) comparison: Acupoint stimulation vs control, outcome: CMSS; (B) meta regression: Acupoint stimulation vs control, outcome: CMSS; (C) comparison: Acupoint stimulation vs control, outcome: severity of CMSS; (D) comparison: Acupoint stimulation vs control, outcome: duration of CMSS; (E) comparison: Acupoint stimulation vs medicine, outcome: dysmenorrhea symptom scores after treatment; (F) comparison: Acupoint stimulation vs control, outcome: dysmenorrhea symptom scores during follow-up period; (G) comparison: Acupoint stimulation vs control, outcome: PSQI.

A study reported the total CMSS after 6-months of follow-up, which supported that acupoint stimulation could still improve dysmenorrhea symptoms during the follow-up period (MD = −13.92, P < 0.01).59

Two studies separately reported CMSS scores for each group in detail in terms of severity and duration.42,53 Compared with those in the control groups, participants in the acupoint stimulation groups had lower scores for severity (Figure 10C, MD = −2.43, 95% CI −3.28 to −1.59, P < 0.00001, I2 = 0%, 187 participants) and duration (Figure 10D, MD = −6.84, 95% CI −10.05 to −3.62, P < 0.0001, I2 = 60%, 187 participants). The sensitivity analysis indicated that the results were stable.

Dysmenorrhea Symptom Scores

Four trials compared the dysmenorrhea symptom scores of acupoint stimulation and medicine after three menstrual cycles of treatment.41,43,49,55 Meta-analysis of dysmenorrhea symptom scores (Figure 10E) yielded a pooled MD estimate of −1.65 (95% CI −2.56 to −0.73, P = 0.0004, I2 = 53%, 254 participants), indicating a statistically significant reduction in symptom scores for participants in the acupoint stimulation group. The sensitivity analysis indicated that the results were stable.

Two trials also compared the dysmenorrhea symptom scores for acupoint stimulation and medicine during the follow-up period.43,49 Meta-analysis of dysmenorrhea symptom scores (Figure 10F) yielded a pooled MD estimate of −3.77 (95% CI −6.92 to −0.61, P = 0.02, I2 = 90%, 134 participants), indicating a statistically significant reduction in symptom scores for participants in the acupoint stimulation group compared to those in the medicine group. The sensitivity analysis indicated that the results were stable.

PSQI

Two trials compared the PSQI scores of acupoint stimulation and controls.42,53 There was a trend toward lower PSQI scores for acupoint stimulation (Figure 10G), but sensitivity analysis indicated that the results were unstable Given the small sample size, we tried bayes to explain this, resulting in a posterior probability of MD of −0.516 (95% CI −0.978 to −0.049), which indicated a trend toward lower PSQI scores for acupoint stimulation.

Other Outcomes Short Form 36 Health Survey (SF-36)

Two studies collected the data of SF-36.39,43 This table consists of 36 items, each of which was assigned a different score. The higher the score, the better is the health status and quality of life represented. Bi Y, 2014 argued that both acupoint stimulation and fenbid could improve the health status and quality of participants (t stimulation after treatment = −6.41, t stimulation during follow-up= −6.91, t Fenbid after treatment = −3.92, P < 0.01) and that acupoint stimulation performed better (t after treatment = 2.535, P &l

Comments (0)

No login
gif