Introduction

Bisphosphonate-associated osteonecrosis of the jaw (BONJ) is a concerning side effect of bisphosphonates. The use of these drugs has increased for cancer-related bone metastases, where dosages are relatively high (i.e., zoledronic acid, 4 mg every 4 weeks) compared with osteoporosis dosages (i.e., zoledronic acid, 5 mg annually). They are widely used among patients with osteoporosis; during 2009, 14.8 million prescriptions were written for non-generic (“branded”) osteoporosis treatments [1]. During this period, there have been substantial concerns expressed in the lay press about osteonecrosis of the jaw [2, 3]. These concerns paired with the very large number of patients using bisphosphonates, especially for osteoporosis, makes understanding the epidemiology of BONJ critical.

There are important challenges to epidemiologic studies of BONJ. First, BONJ associated with osteoporosis dosing seems to be relatively uncommon [4]. Determining the epidemiology of rare diseases presents difficulties, but these problems are not insurmountable. Second, since potent amino-bisphosphonates have only been prescribed since 1995, BONJ is a relatively “new” condition without clear clinical standards. Prior conditions of the jaw have included osteonecrosis, such as phosphorous or radiation exposure; however, these have become very rare [5]. Third, the American Academy of Oral and Maxillofacial Surgeons’ (AAOMS) definition of BONJ [6] has likely improved the consistency of clinical diagnosis; however, it is unclear how consistently this is applied. Moreover, until 2007, there was no specific diagnosis code in the International Classification of Diagnoses (ICD) for ONJ. This makes finding BONJ in databases that use ICD diagnosis codes challenging, a fact that is compounded by the new ICD for ONJ being very similar to a number of osteoporosis diagnosis codes.

In an attempt to estimate the risk of BONJ among osteoporosis patients and to assess the relative risk of BONJ among bisphosphonate users, we undertook two separate but related types of studies. First, we conducted a systematic review of the epidemiologic literature on BONJ associated with osteoporosis. Then, we applied this knowledge to perform cohort studies of BONJ using insurance claims data to find cases, hoping to further refine incidence rates and risk estimates for BONJ.

Methods

Systematic literature review

We searched Medline from 1999 to 2012 using the following search terms: osteonecrosis of the jaw and bisphosphonates and osteoporosis. Titles and abstracts were reviewed to determine an article’s relevance. We then hand-searched articles’ citation lists to find further references (Fig. 1).

Fig. 1
figure 1

Describes the selection process for articles included in the systematic review

Two authors (DHS and EM) independently reviewed each article selected for inclusion. Articles were included if they contained primary epidemiologic data regarding BONJ among non-cancer patients. Case series were not included. Cohorts as well were excluded if the denominator or source population was not clear, or if it was a selected group of dental patients, i.e., dental implants. The articles were reviewed to determine their methodologic quality using a system adapted from recommendation regarding the conduct of systematic reviews (see Table 1 for quality assessment form) [7]. We abstracted, as well, information about the study procedures and results. Their designs and methodologic quality were so heterogeneous that no attempt was made to meta-analyze the studies’ results.

Table 1 Quality assessment measures for studies included in systematic review

Cohort studies: design

Two separate attempts were made to study the epidemiology of BONJ using large cohorts—cohorts 1 and 2. For both cohort studies, we began with large insurance-based health care claims; cohort 1 used data from a single state Blue Cross Blue Shield Plan using data from 2005–2007 and cohort 2 used data from a multi-state Commercial Insurance Program including data from 2007–2010. Both databases included health care and pharmacy claims for all inpatient and outpatient medical services, such as physician and surgeon visits, radiology procedures, laboratory tests, and all drug dispensings. The cohort 1 database did not include dental claims but did include all claims from participating oral and maxillofacial surgeons, whereas the database for cohort 2 included dental claims. Moreover, in cohort 2, we required eligibility for both medical and dental insurance as a pre-requisite.

Both cohorts included three groups of patients, all who could not have diagnoses for solid organ malignancy or multiple myeloma during the complete period of medical benefits: (1) those who initiated bisphosphonates; (2) those who initiated another medication for osteoporosis, such as calcitonin, raloxifene, or hormone replacement therapy; and (3) those who had a diagnosis of osteoporosis or a fracture but started no osteoporosis-related medications (see Supplementary Table 1). Thus, included subjects could not have used a bisphosphonate or another medication during the 1 year prior to the start of follow-up. This broad definition was intended to allow us to estimate the risk of ONJ in persons exposed and unexposed to bisphosphonates. These cohorts were mutually exclusive. All subjects must have had at least 12 months of concurrent medical and pharmacy eligibility prior to entering the cohort.

The cohort studies were approved by the Partners Healthcare Institutional Review Board.

Osteonecrosis of the jaw: endpoint definition

In the first cohort, we used a very broad set of diagnosis and procedure codes to define possible cases of ONJ (see Supplementary Table 2 ). This list of codes was developed based on a comprehensive review of codes that had been used for known cases of BONJ that were evaluated clinically by two of the authors (NT and SBW). After applying these codes, we then attempted to improve the accuracy of possible cases by examining the sequence of diagnosis and procedure codes. Confirmation of the final list of possible cases identified through the claims-based analysis was attempted by contacting oral and maxillofacial surgeons to confirm the diagnosis of ONJ.

In the second cohort, we attempted to find cases of ONJ using the ICD code 733.45 which had been recently developed, prior to the establishment of this cohort. The broad set of codes were not used in cohort 1 were not applied in cohort 2. We further attempted to refine the possible cases by examining the sequence of diagnosis and procedure codes surrounding the diagnosis of ONJ. In cases of visits coded with the diagnosis code ICD 733.45, we examined the claims records for a visit to an oral and maxillofacial surgeon, radiologic studies of the head and/or oral cavity, and oral procedures. If none were found, these potential cases were considered unlikely. If any evidence of possible ONJ was found, we attempted to confirm the cases through dental records.

Statistical analyses

We considered a variety of covariates, but because of the few confirmed cases, we did not attempt any analyses adjusted for patient characteristics.

We calculated the frequency with 95 % confidence intervals (CI) of confirmed and possible ONJ cases in both cohorts. Follow-up of subjects continued until death, loss of benefits, end of study period, or ONJ diagnosis. The prevalence ratio was calculated based on comparing the prevalence among exposed cases of ONJ with unexposed cases.

Results

Systematic literature review

Our search yielded 374 potentially relevant papers. Further review of the titles and abstracts narrowed this to 17 papers. Full review revealed nine papers that provided primary epidemiologic data for osteoporosis-related BONJ.

The quality of these papers and their methods were variable (see Table 2). Several used a clearly defined source population to estimate the denominator of people at risk for BONJ [813] while others used less well-defined cohorts (i.e., marketing data for bisphosphonates to determine the at-risk group) [1417]. The definitions of BONJ ranged from the AAOMS criteria [17] to other clinical criteria [1012, 1416] or not actually BONJ, but other jaw disorders requiring surgery [8, 9, 13]. Some studies examined a broad range of patient characteristics (such as comorbidities and comedications) [810, 14, 15] while others did not [11, 12, 13, 16, 17].

Table 2 Epidemiologic aspects and overall quality of observational studies of osteonecrosis of the jaw

The estimates of the incidence of BONJ from prior studies range from 0.028 % to 4.0 % (see Table 3). Two prior studies estimated the relative risk of ONJ among bisphosphonate users compared with non-users [10, 15]. These studies were conducted in large networks of dental practices with careful case-definitions but not the AAOMS definition (i.e., did not require 8 weeks of exposed bone). Similar odds ratios for the probability of bisphosphonate use among cases of ONJ were calculated for both studies, 7.2 and 9.2. Three studies using insurance claims data examined the relationship of bisphosphonate use to miscellaneous jaw lesions and found relative risks that ranged from protective (odds ratio 0.65) to elevated (odds ratio 7.8) [8, 9, 13].

Table 3 Results of prior observational studies of bisphosphonate-related osteonecrosis of the jaw

Cohort studies

The cohort 1 study had a source population of 41,957 with a mean duration of follow-up of 13 months; cohort 2 had a source population of 466,645 and a mean follow-up of 13 months as well. The mean age of cohort 1 was 38.1 years, and it was 43.9 in cohort 2. From this source population, we found 84 who had a diagnosis or procedure code that had could be associated with BONJ (see Appendix I). We further refined this group through searching each potential case for sequences of diagnosis and/or procedure codes and narrowed the list to 51 possible cases. From this list, we were only able to confirm one case from dental records and dentist responses. Thus, from our list of 4,934 bisphosphonate users, we found one case of dentist-confirmed BONJ, for a prevalence of 0.02 % (95 % CI 0.004 %–0.11 %). With only one case, we were unable to pursue further analyses of relative risks or risk factors.

The second cohort was from a study period that included the period after the ICD diagnosis code for ONJ was established. From our source population of 466,645, we found 13 with a possible diagnosis of BONJ. We hand-searched the diagnosis and procedure codes for these possible cases and further defined a small list as probable cases, based on visits with relevant providers (dentists or oral and maxillofacial surgeons), procedures related to the oral cavity, or radiographs of the head and mouth areas. However, we could not confirm any of these cases as definite BONJ based on further review of dental records. We were not able to get complete records for nine of the 13 possible cases, and none of the dentists or oral and maxillofacial surgeons confirmed BONJ when surveyed.

We calculated prevalence rates using possible (unconfirmed) cases based on the ICD diagnosis (see Table 4). The estimated prevalence of BONJ among bisphosphonate users was 0.007 % (95 % CI 0.0007 %–0.014 %). No possible cases were found among the non-bisphosphonate osteoporosis drug users. The estimated incidence rate among the osteoporosis and fracture cases which did not start a medication was 0.002 % (95 % CI 0.0008 %–0.004 %). The unadjusted prevalence ratio was 4.10 (95 % CI 1.37–12.2).

Table 4 Prevalence rates of confirmed and possible osteonecrosis of the jaw cases from two different health care claims-based cohorts

Discussion

Osteonecrosis of the jaw remains a major concern for many patients using bisphosphonates. While there have been important advances in the diagnosis and treatment of BONJ, the epidemiology is still poorly understood. We conducted a systematic review of the BONJ epidemiology literature related to osteoporosis and found variable quality in prior studies. Moreover, estimates of the incidence rate vary considerably, likely because of very different methodologies. Our attempt to further this field through using large insurance claims databases for case-finding proved largely unsuccessful.

Our literature search found several examples of other methods for studying BONJ. Studies that surveyed oral medicine specialists and maxillofacial surgeons have the advantage of available and relevant records, however, it is unclear if all cases really met similar criteria for BONJ and whether all cases were included [11, 12]. Many times, the sampling was done in a convenience sample and not at a population-level [11, 16, 17]. Furthermore, such studies often relied on imprecise estimates of exposure based on bisphosphonate sales data [11]. The large study from Kaiser-Permanente may provide the most reliable incidence rate estimates [12]. The investigators found potential cases through a survey of patients with possible BONJ based on diagnoses. Cases were then confirmed by oral medicine specialists or the dental records. Bisphosphonate exposure data were obtained directly from pharmacy dispensing data from Kaiser-Permanente.

One of the included studies deserves special discussion. This study came from one academic dental practice and estimated an incidence rate ten times larger than any other study (see Table 3) [16]. The investigators described including consecutive dental patients and having pharmaceutical information available on all included patients. While the availability of dental records was a clear strength of this study, sampling patients in a dental practice may have biased towards bisphosphonate users with oral pathology (i.e., referral bias). As well, this study examined a relatively small sample, reducing the stability of the estimate. This outlier result should be viewed with caution.

The vagaries of the BONJ diagnosis code and the difficulty obtaining all relevant medical and dental records made our cohort study challenging. Even after the ONJ ICD code establishment in 2007, we found that this was not a useful way of identifying BONJ cases as the claim sequence for many of these potential cases had nothing to suggest oral pathology, i.e., no visits with oral medicine specialists, no procedures involving the oral cavity, and no radiographs of this anatomic area. This project speaks to the importance of validation studies with adequate medical and/or dental records, especially early after establishment of a diagnosis code. We also found it surprising that the majority of the dental claims that we reviewed had no diagnosis codes, but only Common Procedural Terminology used for billing. This severely limits the ability to find BONJ cases primarily seen at a dental practice. As more research is performed using insurance claims data, it will be important to attempt to link dental claims including diagnoses with other insurance claims. Furthermore, linking dental and medical records with full insurance claims would facilitate the study we conducted.

It is worthwhile to consider other methods for studies of uncommon or poorly defined outcomes like BONJ. The importance of a standard and well-accepted case definition cannot be under-estimated. While the AAOMS and others [6, 18] developed case definitions of BONJ early on, it is unclear whether this has been widely used in practice and whether it adequately excludes other diagnoses, such as a retained sequestrum. Post-marketing surveillance registries—using combination of written, telephone, and/or Internet communication—of all patients exposed to a given drug can be very useful tools to identify potential cases which can then be verified with review of appropriate records. Such registries have been used for many vaccines and biologics. It is not clear why such an effort was never undertaken for bisphosphonates, despite the substantial concerns raised regarding their potential association with various adverse events.

Six case series prospectively followed patients undergoing dental procedures for post-operative BONJ [14, 1923]; they were not included in the primary results because of this specialized source population. While these studies cannot be used to estimate incidence rates because they do not represent a general population, they do offer some insights into the potential relationship between dental procedures and BONJ. Four of the six studies found no cases of BONJ, and two found a total of five cases; thus the incidence of BONJ ranged from 0.0 % to 7.8 %. The large discrepancy in incidence rates may be explained by random variation across studies with small populations, or it may be that differences in the source populations (i.e., lower versus higher risk) explained the heterogeneity in results.

We attempted to estimate the incidence rate of BONJ related to the use of bisphosphonates for osteoporosis. The literature was systematically reviewed, and two cohort studies were attempted. We encountered important impediments to the cohort studies that we could not overcome in the setting of this study, such as difficulty defining the BONJ outcome using claims as a sampling method. It is likely that as the diagnosis code for ONJ is used in a more standard manner over time, health care claims will provide opportunities for BONJ epidemiology. Currently, access to oral medicine records is critical for confirming the diagnosis of BONJ.