Evidence‐based treatment recommendations for neck and low back pain across Europe: A systematic review of guidelines

Abstract Background and objective This systematic review synthesized evidence from European neck and low back pain (NLBP) clinical practice guidelines (CPGs) to identify recommended treatment options for use across Europe. Databases and Data Treatment Comprehensive searches of thirteen databases were conducted, from 1st January 2013 to 4th May 2020 to identify up‐to‐date evidence‐based European CPGs for primary care management of NLBP, issued by professional bodies/organizations. Data extracted included; aim and target population, methods for development and implementation and treatment recommendations. The AGREE II checklist was used to critically appraise guidelines. Criteria were devised to summarize and synthesize the direction and strength of recommendations across guidelines. Results Seventeen CPGs (11 low back; 5 neck; 1 both) from eight European countries were identified, of which seven were high quality. For neck pain, there were consistent weak or moderate strength recommendations for: reassurance, advice and education, manual therapy, referral for exercise therapy/programme, oral analgesics and topical medications, plus psychological therapies or multidisciplinary treatment for specific subgroups. Notable recommendation differences between back and neck pain included, i) analgesics for neck pain (not for back pain); ii) options for back pain‐specific subgroups—work‐based interventions, return to work advice/programmes and surgical interventions (but not for neck pain) and iii) a greater strength of recommendations (generally moderate or strong) for back pain than those for neck pain. Conclusions This review of European CPGs identified a range of mainly non‐pharmacological recommended treatment options for NLBP that have broad consensus for use across Europe. Significance Consensus regarding evidence‐based treatment recommendations for patients with neck and low back pain (NLBP) from recent European clinical practice guidelines identifies a wide range of predominantly non‐pharmacological treatment options. This includes options potentially applicable to all patients with NLBP and those applicable to only specific patient subgroups. Future work within our Back‐UP research team will transfer these evidence‐based treatment options to an accessible clinician decision support tool for first contact clinicians.


| INTRODUCTION
Neck and low back pain (NLBP) are among the most frequent reasons for visiting a general practitioner (GP) or physiotherapist in primary care in Europe (Bot et al., 2005;Jordan et al., 2010). The substantial burden of illness from these conditions was shown by the most recent Lancet-Global Burden of Disease study which highlighted low back pain (LBP) as the single highest cause of years lived with disability (out of 354 conditions studied), with neck pain ranked eighth (female) and twelfth (male) (GBD 2017 Disease andInjury Incidence andPrevalence Collaborators, 2018). Outlining potential ways to address this societal burden, the recent Lancet series on LBP  recommended a greater focus on improving decision making in first-contact consultations as current treatment is highly variable (Maserejian et al., 2014) and often not in line with clinical guidelines (Darlow et al., 2014;Somerville et al., 2008), leading to suboptimal treatment outcomes (Maher et al., 2017). For example referrals to secondary care specialists are too common, provision of self-management advice and education can be limited, opioids and imaging are over-prescribed, and sign-posting to locally available non-pharmacological options such as exercise groups are limited (Chou, et al., 2017a;Koes et al., 2010;Maserejian et al., 2014). Finding solutions that promote best-practice care for patients with NLBP in first-contact consultations is therefore a priority (Foster et al., 2012).
Our team is part of Back-UP, a European programme of research developing a digital health technology to support clinical decision making for patients with NLBP based on a stratified care approach for first-contact consultations [http:// backu p-proje ct.eu/]. Decision support tools have demonstrated promising results for helping clinicians to translate the most up to date recommended evidence into their practice (Murphy et al., 2014). For example a systematic review of over 160 randomized controlled trials testing clinical decision-support systems identified improved processes of clinical care (e.g. diagnosis, treatment, disease monitoring) or patient outcomes (e.g. clinical events, quality of life) in over half of the included studies (Roshanov et al., 2013).
The Keele STarT Back stratified care Tool for back pain has recently been superseded by the Keele STarT MSK Tool (Dunn et al., submited), which has been validated in UK primary care and shown to be predictive of pain and disability across a range of common musculoskeletal (MSK) pain sites, including NLBP. In addition, a new set of recommended matched treatment options for MSK patients at low, medium and high-risk of poor outcomes (Babatunde et al., 2017;Protheroe et al., 2019) have been piloted in a feasibility trial (Hill et al., 2020). However, these matched treatments were designed to evaluate stratified care in UK general practice rather than for use across European countries by a broader range of first-contact clinicians such as occupational health physicians. We therefore felt the matched treatments should be further refined for the specific context of this European project.
Recent systematic reviews of clinical practice guidelines (CPGs) for musculoskeletal pain (Lin et al., 2020), and back pain (Oliveira et al., 2018;Wong et al., 2017) aimed to summarize recommended treatments for either LBP or neck pain. However, less emphasis was placed on improving decision making in first-contact consultations, identifying specific CPG recommendations for patient subgroups defined by their risk of persistent pain and disability and the potential relevance, and on improving the referral process. To our knowledge, no prior reviews of CPGs have assessed treatment recommendations for both neck and low back pain and explored consistencies or similarities between recommendations for these common spinal pain presentations.
The aim of this study was therefore to conduct a systematic review of published European back and neck pain clinical guidelines to describe and synthesize the evidence of recommended treatment options with broad consensus for use across Europe.
Intercollegiate Guidelines (SIGN), WHO Guidelines, Guidelines International Network (G-I-N) and DynaMed Plus) from 1st January 2013 to 4th May 2020.
The search strategy utilized both text word searching in the title, abstract or keywords and database subject headings, combining terms for neck or back pain and practice guidelines (see Appendix S1: full-search strategy for OVID MEDLINE). For the other databases, search terms were adapted to the search capabilities of the platform.
In addition, our Back UP research partners were asked to identify any relevant guidelines from their country. Reference lists of included guidelines were checked to identify additional documents relevant to the methodology of the guideline.

| Inclusion criteria
• Recent evidence-based European clinical guidelines issued by professional bodies or organizations for guideline development [published from 2013 onwards]. We included recently published guidance only, to ensure treatment recommendations emerging from the review would be based on relatively up-to-date evidence; • Guidelines concern adult populations (18 years or over), with NLBP (including patients presenting to first contact health professionals with symptoms of whiplash-related disorders or symptoms of radiculopathy such as radicular pain); • Guidelines that include recommendations regarding treatment options for patients presenting with NLBP, in particular: a. Treatments deliverable within primary care (as broadly considered across Europe, including occupational healthcare) or referral pathways from primary to secondary care recommended for clinical practice (in at least two European countries). b. Treatments aiming to reduce pain, improve function and/or support return to work. Relevant outcomes also included evidence-based recommendations regarding factors (patient, clinician, environment) that may be associated with the effectiveness of treatment, and recommendations regarding clinical prediction rules or decision tools supporting the selection of treatment for specific patient subgroups (where mentioned in the guideline).

| Exclusion criteria
• Non-European guidelines; • All publications that are not evidence-based clinical practice guidelines, including guidelines based solely on consensus or without an explicit literature search, and other publication types: systematic reviews, randomized trials, cohort studies, case series, editorials, protocols, letters; • Paediatric only populations (under 18 years); • NLBP as a result of severe trauma, for example fracture and spinal cord injury, inflammatory arthritis including spondyloarthropathies, and those that focused on broader conditions, for example (chronic) pain that may encompass spinal pain; • Guidelines focused on patients managed in secondary care with an established diagnosis of radiculopathy; • Guidelines focused specifically on surgical treatment options/comparisons or on specific interventions not limited to spinal pain, for example analgesics in older adults; • Guidelines that involved populations admitted to hospital (not ambulatory care); • Guidelines for which translations could not be obtained.

| Guideline selection
Results from all searches were imported into EndNote X9 (reference management software, Clarivate Analytics. Available at https://endno te.com/) and duplicates removed. Unique citations were then imported into Covidence (Veritas Health Innovation, Melbourne, Australia. Available at https:// www.covid ence.org/) to manage the screening process.
Two reviewers (NC and GM) independently screened all titles and abstracts for relevance against eligibility criteria and excluded ineligible publications by agreement. Full texts were independently assessed for inclusion by pairs of independent reviewers (NC, GM and DvdW). Disagreements were noted and resolved between pairs of reviewers and where necessary the involvement of a third reviewer. Reasons for exclusion at the full-text stage were recorded.

| Data extraction
A data extraction form was purposively designed in Excel to record relevant information from each of the clinical practice guidelines included in the review. Complementary documents were sourced where relevant, such as methodological reports and evidence syntheses. Information was extracted regarding general guideline information (e.g. country, healthcare setting, publication year, target population and presenting symptoms); methods regarding guideline development and implementation (e.g. multidisciplinary group/single profession; how strength of evidence determined; details regarding consensus methods) and intervention recommendations, specifically only those that can be offered in primary care, and guidance for referral (e.g. [strength of] recommendations, any details regarding subgroups).
One reviewer extracted data from each guideline; in the case of guidelines in English, this was independently checked by a second reviewer with any disagreements resolved through discussion. For non-English guidelines, no independent check with a second experienced reviewer was feasible within the timeline of conducting this review.

| Assessment of guideline quality
The AGREE II (Appraisal of Guidelines Research and Evaluation) reporting checklist was used to critically appraise guidelines (Brouwers et al., 2010a). Internationally, this is the most widely used tool for assessing guideline quality (Siering et al., 2013), with good construct validity and reliability (Brouwers et al., 2010b(Brouwers et al., , 2010c. The instrument focuses on guideline development and reporting and consists of 23 items addressing 6 domains (1. Scope and purpose; 2. Stakeholder involvement; 3. Rigour of development; 4. Clarity of presentation; 5. Applicability and 6. Editorial independence). Each item is rated on a 7-point Likert scale from 1 (Strongly disagree) to 7 (Strongly agree). In addition, there are two final items that ask appraisers to give an overall judgement in light of ratings given for the 23 items.
The web-based platform My AGREE PLUS (https:// www.agree trust.org/my-agree/) was used to complete appraisals online, based on the user manual. Each item is presented for scoring alongside detailed guidance on how to score the item, including where to find relevant information and what to consider when deciding on the score for each item.
Critical appraisal was conducted concurrent to data extraction by the same reviewer(s). One reviewer appraised each guideline; in the case of guidelines in English, this was independently checked by a second reviewer with any disagreements resolved through discussion. For non-English guidelines no independent check was feasible.
No set thresholds exist for determining high-/low-quality guidelines, however, AGREE II guidance suggests users decide these according to their specific context (AGREE Next Steps Consortium, 2017). On the basis of the examples given in the AGREE II user manual, and with reference to previous studies (Bouwmeester et al., 2009;Lin et al., 2020), we considered guidelines to be of high quality if AGREE II Domain 3, that is 'Rigour of development' scored at least 70%, and the remaining five domains, along with the overall assessment, scored at least 50%.

| Synthesis of guidelines and identification of consistent recommendations
All recommendations extracted from the included guidelines were collated based on the way the treatment option was described in/translated from the guideline and then grouped according to treatment theme. Tables were drawn up to present all the recommendations, alongside details regarding the context of the guideline (i.e. professional organization(s), country and target population/diagnostic classification). Members of the review team, which included researchers with academic and clinical expertise in musculoskeletal pain, were presented with these tables for review. Following discussion of the many very specific intervention options, for example different forms of exercise, nuanced and/or inconsistently used terms and translation anomalies/country-specific terminology (often reported in only 1 or 2 guidelines), interventions were merged by treatment type/modality. A general practitioner (physician) was invited to review the recommendations relating to medications specifically and undertook a similar process of refining the grouping of treatment options.
The direction (i.e. for, against or open) and strength of recommendations were harmonized, taking into consideration the array of methods and terminologies used across included guidelines (see Appendix S2). The resulting nomenclature enables the reader to distinguish between strong or weak recommendations based on a formal grading system, for example GRADE; those where no formal grading system was applied and recommendations based on consensus/expert opinion. Treatment or referral options for which a recommendation was formulated in one guideline only, were not further considered.
To summarize and synthesize the direction and strength of recommendations across guidelines, a set of criteria was devised and followed, such that: • Strong FOR/AGAINST recommendation (should do/ should not do): consistent recommendations in at least two high-quality guidelines from different countries (at least one guideline of which reports a 'strong' i.e.// or XX recommendation). • Moderate FOR/AGAINST recommendation (could do/ could not do): consistent recommendations in at least one high quality (where the recommendation is not based on expert opinion i.e. O + or O-) and if only one high quality, then one or more low-quality guidelines. • Weak FOR/AGAINST recommendation: recommendations from high-quality guidelines but based on expert opinion only and/or recommendations from multiple low-quality guidelines.

| Guideline characteristics
An overview of characteristics of included CPGs and the methods used in their development and implementation is presented in Table 1, with guideline-specific details provided in Appendices S3 & S4. The 17 contemporary CPGs originate from eight European countries (Belgium, Denmark, France, Germany, Italy, The Netherlands, Poland and the UK). The majority address low back pain and/or radicular pain (n = 12; 71%), whereas six (35%) are concerned with neck pain and/or radicular pain. Five guidelines (29%) focus specifically on patients presenting with symptoms of radiculopathy. Three of these guidelines (Schaafstra et al., 2015;Sundhedsstyrelsen, 2015;2016b) are specifically developed Records idenƟfied through database searching (n = 5448)

Reasons:
Not (non-specific) back or neck pain focused (n = 39) • • Not a full guideline e.g. review, editorial, guideline summary (n = 86) • Non-evidence based guideline $ (n = 7) • Non-European guideline $ (n = 24) • Guideline not issued by professional body or organisaƟons for guideline development including arƟcle by same team to further disseminate guidance (n = 11). for the management of radiculopathy in general practice or primary care. The two other guidelines were designed for healthcare professionals responsible for the management of acute lumbar (Glocker et al., 2018) or cervical (Pohl et al., 2018) radiculopathy in any ambulant, outpatient or secondary care setting. Conversely, three CPGs (18%) explicitly exclude radiculopathy. A large majority of CPGs were developed by multidisciplinary groups (n = 14, 82%), employed formal grading of evidence and/or recommendations (n = 13, 76%). Just over half the guidelines detailed timeframes for future revisions (n = 10, 59%), whereas just under half detailed or undertook a consensus process (n = 8, 47%).

| Quality appraisal
The AGREE II domain scores for each guideline are presented in Table 2, along with our designation of the overall quality, that is high/low based on domain scores. Notably, one guideline (Kassolik et al., 2017) was not rated highly on any of the domains, achieving at its best 44% for clarity of presentation. With the exception of this guideline, the remaining 16/17 CPGs were all highly rated, achieved at least 50% of the maximum possible score, for Domains 1 (scope and purpose) and 4 (clarity of presentation). Conversely, a minority of CPGs (n = 7, 41%) achieved high ratings for Domain 5 (applicability). Domains 2 (stakeholder involvement) and 6 (Editorial independence), together with overall assessment score, were each reported to a high quality in a large majority of studies (n = 14, 82%). Domain 3 (rigour of development) with its higher cut-off point of 70% determining high quality was achieved by just over half the CPGs (n = 9, 53%).

| Consistency of CPG recommendations for neck pain
Six guidelines provided treatment recommendations for neck pain (Bier et al., 2016;Kassolik et al., 2017;Monticone et al., 2013;Pohl et al., 2018;Sundhedsstyrelsen, 2015;2016c). Appendix S5 details the specific treatment options or intervention modalities identified in each guideline together with the direction and strength of each recommendation. In total, recommendations were provided that covered a wide range of treatment options: reassurance; advice and education; medication; injection/infiltration; acupuncture; thermotherapy; manual therapy; exercise therapy; postural therapy; traction; electrotherapy; orthotics; ergonomic interventions; taping/strapping; psychological interventions; multidisciplinary treatments; referral for imaging and referral for specialist opinion; plus a disparate group of interventions that were labelled 'miscellaneous'.
Positive (weak to moderate) recommendations from high quality (Sundhedsstyrelsen, 2015;2016c) or multiple low quality (Bier et al., 2016;Kassolik et al., 2017;Monticone et al., 2013;Pohl et al., 2018) guidelines supported the use of reassurance; advice and education with the specific mention of physical activity and exercise; prescription of oral analgesic medications including for neuropathic pain, and specifically paracetamol, NSAIDs and opioids including tramadol; topical medication; exercise interventions alone or in combination with other treatments and manual therapy in combination with another (exercise) intervention.
Psychological or multimodal (multidisciplinary) interventions were recommended for specific subgroups of patients with neck pain, with either psychosocial risk factors or for those with more persistent neck pain or disability.
Recommendations were inconsistent or inconclusive regarding manual therapies (delivered without additional active treatment); traction; electrotherapies; thermotherapies; cervical orthoses; acupuncture/dry needling and referral for imaging.

| Consistency of CPG recommendations for low back pain
Twelve guidelines provided treatment recommendations for back pain (Bons et al., 2017;BÄK et al., 2017;Glocker et al., 2018;Kassolik et al., 2017;NICE, 2016;Regione Toscana, 2015;Schaafstra et al., 2015;SFMT, 2013;Staal et al., 2017;Sundhedsstyrelsen, 2016aSundhedsstyrelsen, , 2016bvan Wambeke et al., 2017). Details regarding the specific treatment options or intervention modalities identified from each guideline can be found in Appendix S6, along with the direction and strength of each recommendation. Similar to guidelines for neck pain, recommendations were provided that covered a wide range of treatment and referral options. For many of these treatment options, the body of evidence underpinning recommendations was larger compared to neck pain, although often still inconsistent or of low quality. Table 4 presents the summary of recommendations from high-and low-quality guidelines and the overall recommendations derived from our synthesis. A range of treatment options (n = 26) were only mentioned in one guideline, and these were not considered further. Positive (weak to strong) recommendations from high quality (BÄK et al., 2017;NICE, 2016;Sundhedsstyrelsen, 2016a;2016b;van Wambeke et al., 2017) or multiple low quality (Bons et al., 2017;Glocker et al., 2018;Kassolik et al., 2017;Regione Toscana, 2015;Schaafstra et al., 2015;SFMT, 2013;Staal et al., 2017) guidelines supported the use of 14 treatment options, including the following: reassurance; advice and education with specifics for physical activity, exercises and work; manual therapy in combination with active treatment; exercise interventions; group exercise programmes including back schools; psychological therapies including cognitive behavioural interventions as standalone interventions or in combination with exercise; work-based rehabilitation and return to work programmes.
Psychological therapies are mainly recommended for subgroups of patients with increased psychosocial risks, mood problems, or more complex, persistent back pain; whereas referral for surgery is only supported for cases with signs of specific pathology.
Overall, guidelines recommended strongly against the use of more than a couple of days bedrest for patients with low back pain. Referral for imaging is only supported for those with red flags, such as increased risk of fracture, infection, (metastatic) cancer, neurological emergencies including cauda equina syndrome, aortic aneurysm or systemic inflammatory arthritis (detailed in Appendix S7), or deterioration of symptoms. And although mixed, moderate to strong recommendations were also given against the use of paracetamol, anti-depressants, anticonvulsants and muscle relaxants; spinal injections for non-specific LBP; traction; orthoses and a range of applications (e.g. electrotherapies, shortwave, laser).
Recommendations were inconsistent or inconclusive with respect to medication (NSAIDs, opioids; topical); epidural steroid and other injections; acupuncture and manual, postural and thermotherapies.

| Comparison of CPG recommendations for neck and low back pain
In order to examine the consistency of CPG recommendations across neck and low back pain, overall strengths of recommendation for each identified intervention (see Tables 3&4), were assessed (Table 5). Despite a larger body of evidence for the effectiveness of treatment for back pain and a larger number of back pain guidelines, recommendations were generally consistent for neck and back pain (Table 5), in particular regarding support for the use of advice and education, reassurance, certain oral and topical pharmacologic treatments (with the exception of paracetamol), exercise interventions, manual therapy when combined with active treatment and psychological interventions. Guidance was also consistent in terms of the limited use of imaging (only for patients with red flags or where imaging is likely to change management), and recommendations against the use of bed rest, orthoses, traction and a range of modalities (laser therapy, electrotherapy, shortwave). Referral for imaging or surgical intervention, bed rest, antidepressant and muscle relaxant medications, psychological or multidisciplinary interventions are recommended for specific subgroups of patients (FOR 'SPECIFIC SUBGROUPS' or AGAINST 'WITH EXCEPTIONS' in Table 5).

| DISCUSSION
In this review, we have systematically identified, synthesized and graded 17 European clinical guidelines relating to the management of NLBP. On the basis of the quality of the evidence we have identified a short list of treatment options recommended for the management of NLBP (see Table 5). This information is aimed to provide clinicians, healthcare managers, funders, policymakers and researchers with a comprehensive summary of the current consensus from clinical guidelines across Europe on the management of NLBP.
The guidelines included in our review came from eight European countries (UK, Germany, France, Italy, Denmark, Poland, Belgium and the Netherlands). Eleven of them addressed low back pain, five neck pain and one both LBP and neck pain. Data extraction showed considerable variation in guideline development processes with seven guidelines (5 back, 2 neck) considered as high quality, based on their development rigour, strong stakeholder involvement and the applicability of their recommendations.
For neck pain, high-quality guidelines consistently recommended the following evidence-based treatment options: reassurance, advice and education (including to remain active and exercise), manual therapy in combination with other treatment, referral for exercise therapy/programme and a range of oral analgesics and topical medications, plus psychological therapies or multidisciplinary treatment for specific subgroups of patients. There was no strong evidence for use across Europe (as shown in Table 3). In contrast to the recommendations for low back pain, the neck pain guidelines included the use of painkillers such as paracetamol, NSAIDs (for acute pain only), opioids (for acute pain only) and neuropathic pain medication. However, these were only based on weak evidence (meaning the recommendations were based on expert opinion only from high-quality guidelines, and/or multiple low-quality guidelines) and it should be noted that these medications are no longer consistently recommended for low back pain within the recent European guidelines. In fact, for low back pain the guidelines recommended entirely non-pharmacological treatments, additionally including work-based interventions, advice/programmes to return to work and surgical intervention for specific subgroups. These recommendations were based on stronger evidence than those for neck pain.
In relation to previous literature, the findings of this review summarising the consensus from European guidelines, are consistent with recommendations in The Lancet back pain series  which advocated for greater use of non-pharmacological options for patients with back pain. The treatment options identified in this study are also broadly similar and consistent with two recent systematic reviews of clinical practice guidelines for musculoskeletal pain (Lin et al., 2020) and back pain (Oliveira et al., 2018) which identified similar key management recommendations (patient information, physical activity advice, return to work interventions, exercise interventions), although Oliveira et al., additionally identified antidepressants (for chronic LBP), NSAIDs and weak opioids for short periods of time (for acute LBP) to be frequently recommended across guidelines. Recommendations from the European guidelines included in our review contrast notably with a systematic review of non-invasive treatments for low back pain conducted to inform the American College of Physicians Clinical Practice Guideline (Chou et al., 2016) which not only recommended three medication options (NSAIDs, opioids, duloxetine) with moderate to strong evidence (Chou, et al., 2017b), but also included acupuncture within a group of 5 recommended non-pharmacological options (superficial heat, multi-disciplinary rehabilitation, acupuncture, exercise and manual therapy) (Chou, et al., 2017a).
Many of the European guidelines included treatment recommendations related to patient subgroups: psychological therapies, multi-disciplinary treatment and referral for surgery were recommended for specific subgroups only; and very strict indications (strong recommendation against with exception given for bed rest, anti-depressants and muscle relaxants). However, it was notable that clear assessment criteria to facilitate clinician decision making about when to use these treatment options for specific patient subgroups were largely lacking. Similar to Lin et al. who highlighted that guidelines for patients with thoracic pain are lacking (Lin et al., 2020), we only identified one (low quality) guideline (Kassolik et al., 2017) that specifically addressed thoracic pain. We would also highlight that most guidelines lacked detail about the specific dose, duration and other detail around the delivery of the recommended treatments. For example, there was little clarity on the delivery of physical exercise or the recommended components of patient education or reassurance.

Strength and limitations
The strength of this review is that it provides a helpful overall summary of the treatment and referral recommendations from recent European guidelines for NLBP. This overview enabled us to identify treatment options that have been consistently recommended across eight different countries and can therefore be considered to have broad European consensus. To facilitate the rigour of this evidence summary, we pre-specified inclusion and exclusion criteria for screening, quality-appraised guidelines using the AGREE II checklist, and devised a set of clear criteria to summarize and synthesize the direction and strength of recommendations across guidelines. Further strengths included independent assessment of eligibility for inclusion, data extraction and appraisal of the quality of guidelines, and a standardized approach to synthesizing evidence.
The guidelines included in our systematic review predominantly originate from northern and western European countries (except for the Italian guidelines), which can be considered a limitation. This may be partly explained by fewer guidelines being produced in southern or eastern Europe, but also by the fact that we only included guidelines published in the past 7 years. While focusing on contemporary guidelines (2013 onwards) ensured that we identified the most relevant treatment options for current practice, we acknowledge that this meant that some earlier European guidelines, were not included. However, for the purposes of this review, we felt it was important to exclude guidelines that may not be based on up-to-date evidence of effectiveness. Although we included guidelines written in any European language, one limitation was that we were not able to carry out independent data extraction and quality appraisal by a second reviewer for guidelines not available in English. However, for most of these guidelines, the reviewer had the advantage of being involved in data extraction for English language guidelines, which helped to ensure consistency of data extraction and interpretation.
Only seven CPGs (41%) were considered to be of overall high quality, with limitations mainly related to rigour of development (e.g. use of transparent methods to link evidence to recommendations, or processes to gain consensus regarding the strength of recommendations); and to applicability with few guidelines providing guidance on how to apply recommendations or taking into account practical and financial implications of their recommendations. Variation in the methods used to grade evidence and agree the strength of recommendations may potentially explain some of the variability in treatment recommendations across guidelines. We tried to incorporate quality as well as consistency in our synthesis of CPGs, aiming to arrive at a transparent and systematic approach for summarizing and grading recommendations across guidelines.
Future work within the Back-UP research project will embed these evidence-based treatment options in an accessible clinician decision-support tool for first contact clinicians, aiming to offer patients with NLBP treatment options better matched to the risk of persistent pain and disability.

| CONCLUSION
In conclusion, this systematic review identified seventeen contemporary clinical guidelines regarding NLBP (5 neck; 11 low back; 1 both) from 8 European countries, of which seven were considered high quality. Recommendations were notably consistent for neck and low back pain, despite the larger evidence base and more guidelines for the latter. The implications of this review are that clinicians have a broad range of mostly non-pharmacological evidence-based treatment options to consider for their patients with NLBP. These include some treatments which are a) potentially applicable to all patients such as advice and education and b) those applicable only to certain patient subgroups (e.g. referral to surgery).