ratios 3te

Upload: tyas-agustina

Post on 14-Apr-2018

224 views

Category:

Documents


0 download

TRANSCRIPT

  • 7/27/2019 Ratios 3te

    1/32

    u m e o anagemen c ence

    manuscript MS-

    Nurse-To-Patient Ratios in Hospital Staffing:a Queuing Perspective

    Francis de VericourtFuqua School of Business, Duke University, Durham, NC 27708 [email protected]

    Otis B. JenningsFuqua School of Business, Duke University, Durham, NC 27708, [email protected]

    The immediate motivation of this paper is California Bill AB 394, a piece of legislation which mandates

    fixed nurse-to-patient staffing ratios as a means to address the current crisis in the quality of health care

    delivery. With a predictive queuing analytic approach we seek to determine whether or not ratio policies are

    effective and, if not, to provide an alternative staffing paradigm.

    We assume a significant correlation between delays in addressing patient needs and averse medical out-

    comes and we promote using the frequency of excessive delay as a measure of staffing policy performance.

    By applying new many-server asymptotic results, we develop two heuristic staffing policies that perform

    very well and are easy to implement. This is the first many-server asymptotic analysis of health care issues.

    Among the insights gained from the heuristics is the realization that no ratio policy can provide consistentlygood quality of service across medical units of different sizes. Moreover, the optimal staffing levels for larger

    systems display a type of super pooling effect in which the requisite workforce is significantly smaller than

    the nominal patient load.

    Key words: Queuing System, Health Care, Public Policy, Nursing, Staffing, Many-Server Limit Theorems

    1. Introduction

    In 1999, California introduced the nations first law mandating nurse-to-patient ratios in hospitals,Bill AB 394. The legislation, implemented in 2004, specifies the minimum number of nurses that

    should be staffed for each hospital unit, given the current number of patients therein. The min-

    imum number of nurses is a fixed, unit-specific fraction of the number of patients. For instance,

    according to AB 394 at least one registered nurse for every four patients should now be present at

    any time in any pediatrics unit within California. This legislation has inspired other states to con-

    sider establishing similar requirements. Further, two House bills in a recent Congressional session

    proposed regulating nurse staffing among Medicare-participating hospitals (see for instance Spetz

    2005).

    The rationale for implementing these ratios stems from the association between nurse staffing

    level and patient safety. Research studies suggest a significant connection between nurse workload

    and clinical outcomes. For instance, Aiken et al. (2002) conclude that the addition of one surgical

    1

  • 7/27/2019 Ratios 3te

    2/32

  • 7/27/2019 Ratios 3te

    3/32

    Article submitted to Management Science; manuscript no. MS- 3

    where n is the fixed number patients, who alternate between stable and needy states, and where

    s is the number of nurses. Each needy patient is served by a nurse, if available. Otherwise, the

    patient must wait in a (virtual) queue until service can be provided.

    In our framework, the principal metric of concern is the probability of excessive delay, i.e., the

    probability that the delay between the onset of neediness and the provision of care from a nurse

    exceeds a given time constraint. To our knowledge, no empirical studies have explored the impact

    of this metric on patient care and safety. Nonetheless, we argue that excessive delays are akin to

    possible adverse events from the supply side (pressure experienced by the nurses) and from the

    demand side (waiting patients), both of which factor into the overall quality of care. Accordingly,

    we pose the nurse staffing problem in terms of finding staffing levels that guarantee a bound

    on a specified probability of excessive delay. We prove new many-server asymptotic results to

    approximate this probability of excessive delay. To our knowledge, this constitutes the first many-

    server asymptotic analysis of health care related issues. Based on these findings we derive two

    heuristics corresponding to the so-called efficiency-driven (ED) and quality- and efficiency-driven

    (QED) staffing regimes. In addition to performing well, the heuristics provide managerial insights.

    In particular, we observe a super pooling effect in large medical units where the optimal number

    of required nurses is dramatically less than the nominal patient workload.

    It is worth noting that we do not try to find what an acceptable probability of excessive delay

    should be. Neither do we determine the precise values of the parameters describing our model (i.e.,

    average nurse service times and frequency of patient needs). Nevertheless, we argue that, whatever

    constitutes an excessive delay and our tolerance thereof, mandating nurse-to-patient ratios cannot

    ensure uniform quality of care across all hospitals.

    We present our basic model in Section 3. The impact of Bill AB 394 on probabilities of excessive

    delay is explored in Section 4. We formulate the optimal nurse staffing problem in Section 5, and

    derive and test two heuristics in Section 6. Section 7 presents our main managerial insights, which

    make use of the heuristics provided earlier. We conclude by discussing our results and suggesting

    future analytical and empirical research. The following section contains a short survey of the

    relevant literature.

    2. Literature Review

    Recent years have witnessed considerable debate in health-care practice, both within government

    and throughout academia, over the issue of administrating nurse staffing practices in health-care

    facilities. Spetz (2004) provides a comprehensive overview of the history and surveys some prelim-

    inary effects of nurse-to-patient ratios in California. Spetz (2005) also describes several alternative

  • 7/27/2019 Ratios 3te

    4/32

    4 Article submitted to Management Science; manuscript no. MS-

    policies, including the development of pay-for-performance systems in hospitals. The concept of

    mandatory nurse-to-patient ratios, which should help improve nurse working conditions, is moti-

    vated in part by the persistent shortage of registered nurses in the United States (Sochalski 2004,

    Kaestner 2005). But the primary justification for these ratios lies in the assumption that nurse

    staffing levels have a significant impact on patient safety and outcomes. The bulk of research in this

    field focuses on trying to establish (or negate) this basic premise. Data that are usually collected

    to explore this link include such outcome measures as mortality rates, adverse incidents, length of

    stay, and patient and nurse dissatisfaction (Neisner and Raymond 2002). Lang et al. (2004) provide

    a fairly exhaustive and systematic review of articles investigating this link.2

    The link between workload and quality of care is often thought of in terms of adverse events.

    For instance, in evaluating how hospitals improve quality of care, Tucker and Edmondson (2003)

    identify failures that occur in care delivery processes, such as tasks that are unnecessarily or

    incompletely performed. In our setting, staffing levels are set to prevent the assistance of patients in

    need from being delayed longer than a specified time constraint. We assume that the frequency at

    which delays exceed this constraint is significantly correlated with the frequency of adverse events.

    This is a reasonable assumption, since delaying certain procedures can endanger patient health.

    For instance, the medical guidelines for certain myocardial infarctions recommend the immediate

    administration of aspirin (American College of Cardiology 2002). Delays also give rise to unfinished

    tasks, either because nurses fail to remember them later or because they abandon them in order

    to take care of more urgent procedures. Unfinished care has been identified as a strong factor

    impacting the quality of nursing care (Sochalski 2004).

    Note also that in business settings, timely service has long been recognized as a main driver ofcustomer satisfaction and is widely used as a service quality indicator (see for instance Gans et al.

    2003) for applications in call centers). For businesses, excessive delays can lead to lost business

    opportunities.

    This paper deals exclusively with the nurse staffing regulatory issue. However, there are several

    other factors that go into nursing workforce management. For example, Kao and Tung (1980) use

    statistics to forecast patient loads a year into the future. Such forecasts are necessary for under-

    standing nursing requirements and for effectively setting budgets to cover nurse salaries. Another

    aspect of workforce management is the setting of nurse schedules, which typically is completed in

    the days or weeks preceding the actual work shifts. Approaches to solving the scheduling problem

    2 The study was commissioned by the CDHS and was included as part of the final statement of evidence justifyingthe implementation of California Bill AB 394 in 2003.

  • 7/27/2019 Ratios 3te

    5/32

    Article submitted to Management Science; manuscript no. MS- 5

    include linear programming (Jaumard et al. 1998), math programming (Warner and Prawda 1972,

    Miller et al. 1976), and genetic algorithms (Aickelin and Downsland 2003, Bailey et al. 1997). In

    some hospital settings, patients are matched with nurses; see Punnakitikashem et al. (2006) for a

    stochastic programming approach to the nurse assignment problem.

    Operations management studies also extend to more general hospital capacity issues beyond

    the nurse staffing problem. Green (2004) presents a recent overview of queuing models used in

    capacity planning and management of hospitals (see also Smith-Daniels et al. 1988). Typically,

    medical units are modelled as multi-server, open-loop, M/M/s queues, where the arrival process

    represents the stream of incoming patients and the service time captures the average length of stay.

    Studies of these models provide, for instance, the number of required beds to achieve availability

    targets in a given unit. On the other hand, our model seeks to capture the workload variability

    experienced by the nurses. During a time period when the number of patients is relatively constant,

    the dynamics of the system are better captured by a closed queue with a finite population of

    patients (an M/M/s//n queue), where each patient can independently require processing by a

    nurse. In a previous paper (de Vericourt and Jennings 2006a) we provide preliminary asymptotic

    results for this system via many-server limit theorems. Those results, combined with extensions

    herein, allow us to derive efficient staffing rules with general targets for the probability of excessive

    delay, as well as to evaluate the performance of nurse-to-patient ratio policies relative to this metric.

    Many-server asymptotic analysis has proven exceptionally useful for other service-oriented capac-

    ity management problems, specifically those focusing on call centers. Many recent call center papers

    can be directly linked to the conceptual breakthrough of Halfin and Whitt (1981), who recognized

    that, at least for large open queueing systems, one can staff service operations that achieve not only

    a high level of utilization but also very fast customer response times. In other words, the staffing

    policies studied by Halfin and Whitt are efficiency-driven (ED) from the standpoint of the service

    provider and quality-driven (QD) from the perspective of the customer, yielding a hybrid quality-

    and efficiency-driven (QED) staffing regime. Garnett et al. (2002) later studied the distribution of

    the virtual waiting time of call centers staffed in the QED regime in more detail. They also observed

    that by including abandonment in many-server queueing models, one can consider the ED case, for

    which the nominal load exceeds the processing capacity. Whitt (2004), Mandelbaum and Zeltyn

    (2006) and Baron and Milner (2006) have all extended the study of open many-server systems with

    abandonment and staffed in the ED regime. The latter two, both contemporaneous papers, also

    consider minimizing service capacity, subject to constraints on the frequency of excessive delay, as

    the primary objective of system design.

  • 7/27/2019 Ratios 3te

    6/32

    6 Article submitted to Management Science; manuscript no. MS-

    3. The Queuing Model

    Consider a medical unit of s nurses serving n patients. Over time, each patient repeatedly requires

    the assistance of a nurse for various care procedures. When a patient requires the assistance of a

    nurse, we refer to the patients state as needy. Otherwise the patient is said to be stable. We assume

    that the time for a stable patient to become needy, referred to as the activation time, is random

    and independent of all other system dynamics. Upon becoming needy, a patient is immediately

    treated by a nurse, if an idle nurse is available. Otherwise, a needy patient waits for an available

    nurse; queued patients are assisted in a first-come-first-served fashion. The nurses service time is

    also random and independent of other system features. Once treated, a patient becomes stable

    again, until she needs other care procedures. We assume that nurses work as a team so that any

    one of them can serve any patient. In some medical units however, a nurse is assigned to a specific

    set of patients. Our model remains relevant in such settings, provided that nurses whose patients

    temporarily require less care will help their busier colleagues, as is often the case in practice.

    In general, activation times and nurse service times depend on patient acuity levels and nurses

    skill levels. Unfortunately, when setting public policy, the possible combinations of these levels

    are too vast to accommodate fully using specific guidelines. Instead, policy makers may want to

    set nurse-to-patient ratios assuming either the lowest or the highest possible acuity level for all

    patients in the unit.3 For practical modeling purposes, the possible discrepancy in acuity levels

    and different skill levels of nurses can be captured by adjusting the probability distribution of the

    activation time and the nurse service time accordingly.

    To our knowledge, no empirical studies have tried to estimate the probability distributions of

    activation and service times. We suspect that these distributions can be quite general in practice.

    For analytical tractability, however, we assume that activation and service times are exponentially

    distributed with rates and , respectively. In some respects, the exponential assumption is a

    reasonable one. The wide range of possible patient needs and nursing tasks suggests a high degree

    of variability in the health care process, which is consistent with the high coefficient of variation

    of the exponential distribution.

    In a system in which each patient has a dedicated nurse, the long run fraction of nursing time

    required by a typical patient is equal to r := /( + ) while the long run fraction of time a patient

    is stable is given by r := 1 r . The quantity rn then is the nominal patient load (also referred3 In fact, California Bill AB 394 specifically advises that, in order to account for case mix and patient acuity levels,hospitals use the mandated minimum staffing ratio recommendations in conjunction with staffing policies and proce-dures developed locally at the hospitals. We consider the case of homogeneous patients for which such adjustmentsare not required.

  • 7/27/2019 Ratios 3te

    7/32

  • 7/27/2019 Ratios 3te

    8/32

    8 Article submitted to Management Science; manuscript no. MS-

    rate intensity. Letting k := (n k) denote the activation rate when k patients are needy (andsuppressing the parameter n except when necessary), we obtain

    pn(s, t) =n

    k=s

    kkni=0 ii

    P(V > t|N= k) = estn

    k=s

    (n k)kni=0(n i)i

    ksj=0

    (st)j

    j!. (2)

    Recall that the parameter T 0 delineates between acceptable and excessive delays. The probabil-ity of excessive delay for a system with n patients and s nurses is denoted pn(s, T), or simply pn(T)

    when no confusion is possible, and is a point along the function pn(s, ). An interesting property ofthe expression (2) when evaluated for a specific T is that, given the number of nurses and patients

    in the unit, the probability of excessive delay is entirely described by the parameters r and the

    target T since k in (1) only depends on = r/(1 r). For ease of exposition, we will alwaysspecify the time target T as an integer multiple of the service time (T = i/).

    In the next section we study the impact of nurse-to-patient ratios on the probability of excessive

    delay.

    4. A Queueing Model Perspective on California Bill AB 394

    Let us consider California Bill AB 394, which has governed nurse staffing over the past few years,

    and pediatric units in the state. The law requires that at all times, regardless of the size of a

    hospital, at least one nurse must be staffed for every four patients under care (see Table 2 in the

    appendix for the full list of these mandated ratios). In the following, we investigate the theoretical

    effect of the mandated ratios, as predicted by our queueing model. We focus our investigation on

    pediatric units exclusively, with the understanding that our analysis extends to all units for which

    pooling of nurses is a reasonable operating assumption.

    The main idea when deriving the collection of staffing ratios was to evaluate, for each hospital

    unit, the proportion of nursing time a patient requires during a typical shift (controlling for patients

    acuity level and nurses skills). One should note that this proportion of time is precisely the same

    as the load factor r in our queuing framework. Because quantitative data are lacking, the interest

    groups and research teams who framed Bill AB 394 and ultimately set the ratio values, relied

    heavily on expert panels comprised primarily of highly qualified and experienced registered nurses

    and nurse administrators (see California Department of Health Services 2003, Institute for Health

    & Socio-Economic Policy 2001). The suggested ratio values from these panels differed greatly (see

    Table 2), perhaps as a result of the various constituents involved in the process. Nevertheless,

    compromises among the various suggestions were made and the results are our best estimates for

    the nominal patient load for each hospital unit.

  • 7/27/2019 Ratios 3te

    9/32

    Article submitted to Management Science; manuscript no. MS- 9

    In this paper we consider only pediatric units, primarily b ecause of the relative consensus among

    the expert panels suggested pediatric unit ratio values. The mandated ratio policy sets the number

    of nurses sn equal to the nominal patient load rn, when there are n patients in the unit:

    sRn = rn, (3)

    with r = 1/4 (for pediatric units) and where

    x

    is equal to the smallest integer larger or equal to

    x. We also refer to this staffing rule as the nominal ratio policy.

    Given the size n of the pediatric unit, we can compute the required number of nurses sn based

    on (3) and deduce the resulting probability of excessive delay using (2) for different time thresholds

    T. Figures 13 show the impact of the size of the pediatric unit on the probability of excessive

    delay experienced by the needy patients for T = 0 (any delay), 1/ (delay longer than an average

    nurse service time), and 2/ (delay longer than two average nurse service times), respectively. The

    sharp oscillations in performance can be explained by the step-wise increase in the staffing level

    that occurs at four-patient intervals when rn is rounded up to the nearest integer. The magnitudeof these oscillations decreases with n. For T = 0, Figure 1 suggests that when disregarding these

    oscillation effects, the excessive delay probability remains roughly stable around 65% for most n.

    On the other hand, for T > 0, Figures 2 and 3 indicate that as the size of the unit grows, patients

    can expect to see progressively lower probabilities of excessive delay. In other words, patients in

    larger pediatric units appear to be better off than patients in smaller units, although all units are

    consistent in the implementation of the 1:4 staffing ratio dictated by law.

    In order to explore the magnitude of this inconsistency in California we need information on the

    distribution of the number of patients in Californian pediatric units, which can significantly vary

    over time. Unfortunately such detailed information is not publicly available. As an alternative,

    we use the data provided by the California Office of Statewide Health Planning and Development

    (OSHPD) as part of the state-wide hospital utilization data it collects annually. 5 In this study

    we use the data from 2003 which contains the patient numbers reported by all pediatric units of

    Californian hospitals, averaged over the year. These census averages are used as if they represent

    the typical sizes of corresponding units at any given time. For our sample, a total of 134 pediatric

    units in California with average occupancies between 2 and 154 were identified (see Table 3 in theappendix).

    5 The database was and continues to be used by many of the interest groups and research teams that framed Bill AB394 and its implementation.

  • 7/27/2019 Ratios 3te

    10/32

    10 Article submitted to Management Science; manuscript no. MS-

    Figure 1 Delay Probability under Nominal Ratio Staffing, T = 0

    0 10 20 30 40 50 600.25

    0.3

    0.35

    0.4

    0.45

    0.5

    0.55

    0.6

    0.65

    0.7

    n

    pn(

    T)

    Figure 2 Excessive Delay Probability under Nominal Ratio Staffing, T =

    1

    ,

    0 10 20 30 40 50 600

    0.05

    0.1

    0.15

    0.2

    0.25

    0.3

    0.35

    0.4

    n

    pn

    (T)

    The first two rows of Table 1 contain our chosen partition of the pediatric unit sizes in California.

    Interestingly, the average pediatric unit occupancies are considerably skewed, with a majority of

    hospitals reporting average occupancies below 20 patients; only 11% of the 134 pediatric units

    reported average occupancies greater than 20. The remaining rows of the table provide excessive

    delay probabilities, as predicted by our model, for the specified choice of time constraint parameter

    T. The data is presented in a fashion that controls for the rounding-up effect. Namely, for each unit

  • 7/27/2019 Ratios 3te

    11/32

    Article submitted to Management Science; manuscript no. MS- 11

    Figure 3 Excessive Delay Probability under Nominal Ratio Staffing, T = 2

    ,

    0 10 20 30 40 50 600

    0.02

    0.04

    0.06

    0.08

    0.1

    0.12

    0.14

    0.16

    0.18

    0.2

    n

    pn(

    T)

    Table 1 Excessive Delay Probabilities in Californian

    Pediatric Units

    n n 5 5 < n 10 10 < n 20 20 < n

    %Units 36% 38% 15% 11%

    T = 0 66% 67% 67% 67%

    T = 1/ 38% 28% 24% 15%

    T = 2/ 20% 10% 6% 2%

    size partition / delay threshold pair, the table reports the largest predicted probability of excessive

    delay among all unit sizes in the partition. The table summarizes the theoretical performance

    of the current law under alternative definitions of acceptable delay. For instance, when T = 0,

    the probability of delay hovers around 66% for all n. Although the performance is consistent,

    it is consistently poor. For acceptable delays that are strictly positive (T > 0), Table 1 suggests

    significant discrepancies in service quality across the different hospitals in California. For instance,

    in 36% of pediatric units (for which n 5) the probability that delays exceed a time equivalentto a nurses average service time (T = 1/) can be as high as 38%, whereas in another 26% of

    the units (for which n > 10) this probability is less than 24%. When T = 2/, the probability of

    excessive delay appears to be reasonable (less than 6%) for 26% of the units, but can still reach

    20% in small units.

    The nominal ratio policy is but one of many within the spectrum of possible ratio policies, each

    having the form sn = n for some real number > 0. Studies have argued that decreasing the

  • 7/27/2019 Ratios 3te

    12/32

    12 Article submitted to Management Science; manuscript no. MS-

    number of patients per nurse leads to a reduction in adverse events. In Figure 4, we provide the

    predicted probability of delay (T = 0) when r = 1/4 and a ratio policy with = 1/3 is employed

    (see Figure 10 in the appendix for the case where T = 1/). As expected, performance is improved,

    but exhibits a significant inconsistency across unit sizes. On the other hand, when = 1/5, the

    ratio policy provides consistent but poor performance for the delay threshold T = 1/, disregarding

    again the oscillation effect; see Figure 5 (see also Figure 11 in the appendix for T =0).

    Figure 4 Delay Probability under Ratio Staffing, r = 14

    , = 13

    , T = 0

    0 10 20 30 40 50 600.05

    0.1

    0.15

    0.2

    0.25

    0.3

    0.35

    0.4

    0.45

    0.5

    n

    pn(T

    )

    In summary, the nurse-to-patient staffing ratios

    appear to have different performance implications for smaller units than for units with largerpatient populations, with the trend showing that patients in smaller units will see higher proba-

    bilities of excessive delay in relation to units of larger size,

    can, in many cases, result in very high probabilities of excessive delays which in turn mightlead to poor levels of quality of care.

    In the remainder of this paper, we use a more analytic approach to formalize and generalizethe findings of this section. In particular, we will show that these insights depend neither on the

    precise values of the different parameters which describe our system, nor on the choice of the ratios

    mandated by law.

  • 7/27/2019 Ratios 3te

    13/32

  • 7/27/2019 Ratios 3te

    14/32

    14 Article submitted to Management Science; manuscript no. MS-

    Figure 6 Optimal versus nominal ratio staffing, r = 14

    , = 1%, T = 0

    0 10 20 30 40 50 600

    5

    10

    15

    20

    25

    n

    sn

    sn

    *

    sn

    R

    system exhibits any level of variability, such a target can only be achieved with a dedicated nurse

    for each patient, an unreasonable expense in most units. Determining acceptable values of and

    T is a subject for future research. On the other hand, our analytic approach, and in particular

    our test for a range of values for and T, allows one to develop insights into the form of the best

    possible staffing policy, as well as pointing out the shortcomings of the current legislation, without

    assuming the exact levels of the probability and/or defining what constitutes excessive delays.

    For each n 1, the optimal staffing level sn satisfying (4) can be obtained using a simpleenumerative search algorithm over all possible values of sn. For each step of the procedure, the

    probability of excessive delay pn(sn, T) is calculated using equation (2) and compared to the bound. Of course, the optimum staffing level is an increasing function of n. So sn provides a natural

    starting point in the search for sn+1.

    Figures 68 compare the staffing of pediatric units (r = 1/4) under the mandated nominal ratio

    policy (sRn = rn) with the optimal staffing policy sn, for = 1% and for T = 0, 1/ and 2/,respectively. When T = 0, Figure 6 shows that the nominal ratio policy underestimates the staffing

    requirements that ensure that the excessive delay probability is less than 1%. Furthermore, the gap

    between the ratio staffing rule and the optimum one seems to be increasing with n. For T = 1/,

    the number of nurses scheduled in the nominal ratio policy is still not large enough to guarantee

    that at most 1% of needy patients will wait more than a time equivalent to a nurses average service

    time, 1/. The deviation from the optimal staffing rule quantity seems however to decrease with

    n, at least within the range of unit sizes considered. Figure 8 provides a more intriguing situation

  • 7/27/2019 Ratios 3te

    15/32

    Article submitted to Management Science; manuscript no. MS- 15

    Figure 7 Optimal versus nominal ratio staffing, r = 14

    , = 1%, T = 1

    0 10 20 30 40 50 600

    2

    4

    6

    8

    10

    12

    14

    16

    n

    sn

    sn

    *

    sn

    R

    Figure 8 Optimal versus nominal ratio staffing, r =

    1

    4 , = 1%, T =

    2

    0 10 20 30 40 50 600

    5

    10

    15

    n

    sn

    sn

    *

    sn

    R

    in which the nominal ratio policy underestimates the number of required nurses needed by small

    units (n 25), but is conservative relative to the optimal number needed by larger units. In effect,

    only larger units achieve the target probability of excessive delay of at most 1%, but the staffing

    for these situations is needlessly excessive.

    The poor performance of the nominal ratio policy is also confirmed by the significant gap in the

    probability of excessive delay for the nominal ratio policy versus the optimum policy. For instance

  • 7/27/2019 Ratios 3te

    16/32

    16 Article submitted to Management Science; manuscript no. MS-

    Figure 9 Excessive Delay Probability as a Function of n, r = 0.25, = 0.05, T = 1/

    0 10 20 30 40 50 600

    0.05

    0.1

    0.15

    0.2

    0.25

    0.3

    0.35

    0.4

    n

    pn(

    T)

    p*

    n(T)

    pR

    n(T)

    Figure 9 illustrates the significant gap in the probability of excessive delay for the nominal ratiopolicy versus the optimum policy for r = 1/4, = 5% and T = 1/. As shown by the figure, the

    nominal ratio policy meets the service level requirement consistently only for large unit size (n 45)(see de Vericourt and Jennings 2006b, for more details).

    These results suggest that nominal ratio policies do not possess the structure of the optimal

    staffing rule. In particular, the unit size n seems to affect the slop of sn as a function of n, while

    the slop of a ratio policy is independent of n. This will be confirmed in the following section.

    6. Heuristics

    In the following, we introduce and test alternative staffing rules that perform well and are easy

    to implement. The heuristics provide important insights into the structure of appropriate nurse

    staffing rules that cannot be directly obtained from the optimization problem in (4).

    We derive the heuristics via many-server asymptotics, an approach that investigates properties

    of families of staffing rules (i.e., mappings n sn) as the number of patients becomes very large. Itturns out that depending on the limit s =limn sn/n, the system exhibits two distinct behaviors.

    When s < r, for large values of n, all nurses are practically always busy. Hence we refer to such

    policies as efficiency-driven (ED), or belonging to the ED staffing regime. Another phenomenon in

    this setting is that all needy patients must wait a non-trivial amount of time before being assisted.

    This suggests that, when staffing in the ED regime, the time delay threshold T must be strictly

    positive. On the other hand, when s > r , needy patients experience little or no wait and, hence,

  • 7/27/2019 Ratios 3te

    17/32

    Article submitted to Management Science; manuscript no. MS- 17

    such policies are referred to as quality-driven. Moreover, the corresponding value of T is small or

    zero. In this case there will often be some idle nurses. The so-called QED regime, arising when

    s = r, exhibits the defining characteristics of both the ED and QD regimes. (See de Vericourt and

    Jennings (2006b) for more details on the many-server framework.)

    6.1. QED-Based Heuristics

    Our first heuristic is based on the assumption that optimal staffing levels are slight deviations from

    those provided by the nominal ratio policy. Under this assumption, the resulting staffing policy

    can initially be thought of as being in the QED regime.

    Suppose we have a QED staffing policy sn. To capture the microscopic dynamics of the system,

    we consider the following process

    Nn(t) :=Nn(t) rn

    n, t 0, (5)

    where the process Nn tracks the number of needy patients and its initial value Nn(0) is assumed

    to be approximately equal to the centering constant rn. We refer to the transformation in (5) asdiffusive scaling. In de Vericourt and Jennings (2006a), functional central limit theorems (FCLTs)

    are taken for this sequence of processes. The result, summarized below, is a diffusion process which

    serves to approximate the stochastic dynamics of Nn.

    It was shown in de Vericourt and Jennings (2006a) that the sequence of virtual hitting time

    processes {Vn, n 1} converges to zero at the same rate as 1/

    n converges. It follows that, in order

    to approximate needy patient waiting times, we must zoom in on this process. Define the inflated

    virtual hitting time processes{Vn, n 1}, where Vn :=

    nVn, for each n 1. As Proposition 1states below, under the QED staffing regime, the diffusion-scaled needy patient process and the

    inflated hitting time process converge together to scalar multiples of the same diffusion process.

    Consider first the following series {n, n 1} associated with the staffing rule n sn,

    n :=sn

    n r

    n. (6)

    Modulo round-off error, the quantity n

    n is the deviation of sn from the nominal ratio policy.

    The following proposition states that if n converges to a finite value , which also requires that

    s = r (confirming that indeed the policy operates in the QED limiting regime), then the inflated

    virtual hitting time converges in distribution to a diffusion process.

    Proposition 1. Consider a staffing rule n sn such that n where (,) as n . If Nn(0) converges in distribution to a proper random variable N(0) as n , then Vn and

  • 7/27/2019 Ratios 3te

    18/32

    18 Article submitted to Management Science; manuscript no. MS-

    (Nn )+/(sn/n) converge jointly in distribution to (N )+/(r). Moreover, the steady statedistribution of N has probability density

    fQED(x) =

    1()rr

    xrr

    rr

    x < ,

    ()r

    rx+rrr

    rr

    x .

    (7)

    with

    () =

    1 + e2

    2r2r

    rrrr

    1

    , (8)

    and where the function () is the standard normal cumulative distribution function.

    Proof. The result is a direct consequence of Theorem 2 and Corollary 2 of de Vericourt and

    Jennings (2006a).

    To derive our first heuristic, consider a staffing rule of the form sn = rn + n

    n and assume

    that n converges to some constant . The excessive delay probability can be approximated by the

    tail of the virtual hitting time distribution: pn(sn, T) P(Vn >

    nT). 6 Using Proposition 1, we

    approximate the distribution of Vn with that of (N)+/(sn/n):

    P(Vn >

    n T) P

    (Nn)+sn/n

    > T

    n

    .

    We then consider the distribution of N in steady state given by (7) so that

    pn(sn, T) (n)r

    1

    nrr

    n+

    snTn

    rx + rn

    r

    r

    dx

    = (n) n

    r

    r snT

    n

    n

    r

    r

    . (9)

    where () is the standard normal distribution function. Our first heuristic consists of finding nsuch that the approximation in (9) is equal to the target :

    Heuristic 1. (H1)

    For a given > 0 and T 0,sH1n = rn + n

    n (10)

    where n satisfies

    = (n) n(1 + rT)rr rTnrnrr , (11)and where (n) is given by (8).

    6 Notice the implicit asymptotic PASTA assumption.

  • 7/27/2019 Ratios 3te

    19/32

    Article submitted to Management Science; manuscript no. MS- 19

    Note that when T = 0, (11) simplifies considerably: = (). It follows that, when T = 0, we

    have n = 1() for each n. When T = 0, our first heuristic is equivalent to the one developed in

    Section 5 of de Vericourt and Jennings (2006a) and performs exceptionally well. In particular, the

    heuristic policy never deviates from the optimal one by more than one nurse; i.e., for each n 1,|sn sH1n | 1.

    When T > 0, finding sH1n involves solving equation (11) in its general form. The next proposition

    guarantees the existence and uniqueness of the solution. It also leads to efficient numerical methodsto determine n.

    Proposition 2. The function : + such that

    (x) = (x)xrr

    (1 + rT)rTnr

    xrr

    is strictly decreasing in x.

    Proof. Note first that () can be written as

    (x) = (x)(ax b)

    (cx) ,

    where a, b and c are positive constants. Denote by h() the hazard rate of the standard normaldistribution, i.e., h(x) = (x)/(x) for every real x. We can restate (8) as

    (x) =

    1 +r h

    x

    rr

    hxrr

    1

    .

    Because the hazard rate of the normal distribution is increasing, the function () is decreasing. It

    remains to show that x (ax b)/(cx) is also decreasing. Taking the first order condition,this is equivalent to showing that,

    a(ax b)(cx) > c(ax b)(cx),

    or equivalently, because (x) = (x) for the standardized normal distribution, that

    h(ax + b) >c

    ah(cx).

    This final relation follows because h() is increasing, c/a = 1/(1 + rT) < 1, and ax + b > c x for any

    real x.

    The monotonicity of allows us to use a simple bisection search algorithm in order to solve

    equation (11). This procedure is more efficient than solving the optimal problem in (4) but still

    requires some computation effort.

  • 7/27/2019 Ratios 3te

    20/32

    20 Article submitted to Management Science; manuscript no. MS-

    6.2. ED-Based Heuristics

    In designing Heuristic 1 we assumed that the optimal staffing policy was a slight deviation from the

    nominal ratio policy. This approach is valid for small values of T, a feature that suggests staffing in

    the QED regime. However, when T is large we should staff in the ED regime because the nominal

    ratio policy is no longer close to the optimal one. Our second heuristic is based on the assumption

    that, for every value of T > 0, there exists a unique quantity rT < r such that the optimal policy is

    an order n deviation from rTn.Heuristic 2. (H2) For a given > 0 and T > 0,

    sH2n = rTn + T

    n (12)

    where

    rT =r

    1 + rT(13)

    and

    T =r 1

    () r + rT

    (1 + rT)3 . (14)

    The following argues that Heuristic 2 is asymptotically optimal as n , thereby justifying itsuse for large values of n. As discussed later, it works well for all values of n.

    Proposition 3. For any given T > 0, the probability of excessive delay pn(sn, T) has a nonde-

    generate limit (0, 1) if and only ifsn

    n rT

    n , as n,

    for some

    (

    ,

    ), with

    =

    r

    (1 + rT)

    3

    r + rT

    . (15)

    Proof. The probability of excessive delay can be written as

    pn(sn, T) =

    n

    k=0

    k(n k)1 n

    k=sn

    k(n k)esTksnj=0

    (sT)j

    j!

    =DnBn

    1 +

    AnBn

    1,

    where

    An sn1k=0

    nk

    (n k)k,

    Bnn

    k=sn

    n!

    sn!(n k)ssnn

    sn

    k

  • 7/27/2019 Ratios 3te

    21/32

    Article submitted to Management Science; manuscript no. MS- 21

    and

    Dn n

    k=sn

    n!

    (n k)!ssnnsn!

    sn

    kesT

    ksnj=0

    (snT)j

    j!.

    The quantity An can be expressed as

    An = n(1 + )n1

    sn1k=0

    n 1

    k

    rkrnk

    = n(1 + )n1P(Xn sn 1),

    where for each n, Xn is a binomial random variable with parameters n 1 and r. Similarly, onecan express Bn as

    Bn =n!

    sn!ssnn

    sn

    n1esn/

    nsn1k=0

    sn

    k1

    k!esn/

    =n!

    sn!ssnn

    sn

    n1esn/ P(Yn n sn 1)

    and Dn as

    Dn = n!sn!

    ssnn

    sn

    n1esn/

    nsn1k=0

    nsnk1j=0

    snk

    1k!

    (snT)j

    j!e(sn/+snT)

    =

    n!

    sn!ssnn

    sn

    n1esn/ P(Zn n sn 1),

    where Yn and Zn are independent Poisson random variables with parameters sn/ and sn/+ snT,

    respectively. For each sequence of the random variables {Xn}, {Yn} and {Zn}, we generate a centrallimit theorem. Starting with Xn, we center and rescale to obtain

    P(Xn sn 1) = P

    Xn (n 1)r(n 1)rr

    sn 1 (n 1)r(n 1)rr

    , (16)

    P(Yn n sn 1) = P

    Yn sn/sn/

    n 1 sn/rsn/

    (17)

    and

    P(Zn n sn 1) = PZn sn( 1 + T)

    sn(1

    + T) n 1 sn(

    1r

    + T)sn(

    1

    + T)

    . (18)

    The sequences (Xn (n 1)r)/

    (n 1)rr, (Yn sn/)/

    sn/ and (Zn sn( 1 +T))/

    sn(

    1

    + T) each converge in distribution to standard normal random variables (i.e., with

    mean zero and variance one). It is given that the staffing level as a function of n is sn = rn/(1 +rT) + o(n). It follows that [sn 1 (n 1)r]/

    (n 1)rr as n . By (16) one can

    conclude that

    limn

    P(Xn sn 1 ) = 0. (19)

  • 7/27/2019 Ratios 3te

    22/32

    22 Article submitted to Management Science; manuscript no. MS-

    Likewise, (n 1 sn/r)/

    sn/ as n, so that by (17),

    limn

    P(Yn n sn 1 ) = 1. (20)

    Finally, by the conditions provided for sn,

    n 1 sn( 1r + T)

    sn( 1 + T)

    r

    (1 + rT)

    3

    r + rT

    as n, and by (18) we have

    limn

    P(Zn n sn 1 ) =

    r

    (1 + rT)

    3

    r + rT

    . (21)

    The ratio An/Bn can be written as

    AnBn

    = CnP(X sn t)

    P(Y n sn 1) ,

    where

    Cn = sn!

    n!

    sn

    n1 (1 + )n1ssnn

    esn/.

    Following the analysis in the proof of Proposition 1 of de Vericourt and Jennings (2006a), we have

    Cn r

    1r

    + T

    e2/2r2 as n. The relevant feature is that the limit of Cn is finite. It follows

    then by (19) and (20) that An/Bn 0. Finally, notice that Dn/Bn = P(Zn n sn 1)/P(Yn n sn 1) so that by (20) and (21),

    limn

    pn(sn, T) =

    r

    (1 + rT)

    3

    r + rT

    .

    6.3. Numerical Study

    We explore the performance of the previously introduced heuristics through numerical examples.

    Tables 4-6 report the differences sn sH1n and sn sH2n for different values of T, n and for r =0.1, 0.25, 0.9 and = 1%. (de Vericourt and Jennings (2006b) provides supplementary numerical

    results.)

    For T = 0, H1 performs extremely well with |sn sH1n | 1 for all tested cases. This is consistent

    with the results in de Vericourt and Jennings (2006a). (Note that since H2 is only defined for

    positive T, the tables do not report its performance for T = 0.) For T > 0, H1 also performs well

    with |snsH1n | 1 for r = 0.1 and 0.25. However the performance of H1 deteriorates as n increasesfor r = 0.9, as shown by Table 6.

  • 7/27/2019 Ratios 3te

    23/32

    Article submitted to Management Science; manuscript no. MS- 23

    This is not the case for H2, which exhibits stable performance. In all tested cases, we find that

    |sn sH2n | 1 except in Table 4, where r = 0.1 and = 1%, for which |sn sH2n | 2. Further, forsmall r (r = 0.1 and r = 0.25), H1 seems to dominate H2 when T = 1/.

    Overall both heuristics perform well for reasonable values of r and n. When T and r are small

    (T 1/ and r 0.25) H1 outperforms H2 and guarantees that |sn sH1n | 1; H2 outperformsH1 otherwise, with |sn sH2n | 1.

    7. Managerial Insights

    In the previous section we developed heuristics for nurse staffing and, using numerical examples,

    demonstrated that the heuristics do well in approximating actual optimal staffing levels. In this

    section we use the heuristics to evaluate the performance of ratio policies. In particular, we find

    that only in limited, and arguably impractical, circumstances does the ratio policy mandated by

    California law i.e., the nominal ratio policy provide consistent quality of service. We also use the

    heuristics to investigate the structure of the optimal policy. Of particular interest is the unusually

    strong (super) pooling effects for large units that the optimal staffing policy exhibits.

    Recall that when the delay constraint is T = 0, we are forced to use Heuristic 1; Heuristic 2 is

    undefined. For strictly positive excessive delay thresholds (T > 0), the previous section suggests

    that we use Heuristic 1 for small values of T and Heuristic 2 for large values.

    Consider now the probability of delay (T = 0). If we assume that staffing levels and unit sizes

    take real values (i.e. we ignore round-up effects), the optimal staffing rule can be approximated

    by sn = rn + n

    n where n is the solution of (11). One interpretation is that the nominal ratio

    policy is close to optimal whenever n = 0; that is, when plugging n = 0 and T = 0 into (11) and

    solving for the probability of delay, we obtain

    =2

    1 +

    r(0) =

    1

    1 +

    r. (22)

    In this case, the probability of delay is at least as large as 50%. In fact, for pediatric units in

    California, this probability of delay under the ratio policy is predicted by our heuristic to be 2 /3,

    a quantity consistent with Figure 1: when controlling for the oscillation effects (due to the staffing

    level round-up), the ratio policy seems to guarantee a consistent delay probability of roughly 65%.

    Now consider another ratio policy sn = rTn with rT = r. If rT < r then for any reasonable targetprobability of delay , this policy will lead to understaffing. Supposing instead that rT > r, then

    for only one value of n (using real values for n), will the ratio policy deliver the target probability

    of delay:

    n =

    rT r2

    ,

  • 7/27/2019 Ratios 3te

    24/32

    24 Article submitted to Management Science; manuscript no. MS-

    where is chosen so that () as expressed in (8) yields the target delay probability. Of course,

    this yields only one unit size for which this particular ratio policy is optimal. This leads to our first

    observation.

    Observation 1. For any given unit type, only one ratio policy can yield a consistent probability

    of delay across all hospitals, the nominal one mandated by California legislation. However, the

    nominal ratio policy cannot guarantee a probability of delay smaller than 50%.

    Consider the probability of excessive delays with T > 0. Suppose that the nominal patient need

    is r, the threshold for delay is T > 0, and a ratio staffing policy is implemented for which (ignoring

    round-up effects) sn = rTn, where rT satisfies (13). For a given target probability of excessive delay

    , the approximately optimal staffing policy is, by Heuristic 2, rTn + T

    n, where T is given by

    (14). This implies that for sn to be optimal, T = 0, and hence, by (15), the probability of excessive

    delay must be 50%.

    Suppose a different ratio policy is chosen perhaps even the nominal policy and can be

    expressed as n, for some

    = rT. Then, given any target excessive delay probability , this ratio

    policy is optimal for at most one value of n. This leads to our second observation.

    Observation 2. For any given unit type and specified delay threshold T > 0, only one ratio

    policy can yield a consistent probability of excessive delay across all hospitals, and it is not the

    nominal ratio policy mandated by California legislation. Moreover, the consistent ratio policy cannot

    guarantee a probability of delay smaller than 50%.

    Suppose staffing conforms to the nominal ratios dictated by California Bill AB 394. When the

    delay threshold is strictly positive and the target probability of excessive delay is less than 50%,

    solvingrn = rTn + T

    n

    for n leads to a unique solution

    n =

    1()

    T

    2r + rT

    1 + rt

    .

    If > 50%, then the nominal ratio policy exceeds the optimal policy for all values of n. Otherwise,

    the nominal ratio policy understaffs for n < n and staffs more than the optimal staffing policy does

    for n > n. This leads to the following observation.

    Observation 3. For a strictly positive delay threshold (T > 0) and a reasonable target proba-

    bility ( n).

  • 7/27/2019 Ratios 3te

    25/32

    Article submitted to Management Science; manuscript no. MS- 25

    This observation is consistent with Figure 3 where n is roughly equal to 28. In Figure 2 the crossing

    point n is larger than 60.

    The next observation summarizes the trends in quality of service and identifies the limitation of

    the ratio policy mandated by California law.

    Observation 4. For any medical unit, any delay threshold, and a ratio policy sn = n, the

    probability of excessive delay increases (resp. decreases) monotonically with the size of the unit andasymptotically approaches 1 (resp. 0) if is less than (resp. greater than) the nominal patient load

    of that unit.

    Observation 4 is consistent with Figure 11 (in the appendix) which shows an increasing trend of

    the probability of excessive delay for = 1/5 < r = 1/4.

    In short, according to Observations 1-4 the Californian legislation appears to provide consistent

    performances only for poor service levels when the target times are restricted to zero. The previous

    discussion also sheds light on pooling effects under the optimal staffing rule. Pooling effects appear

    when the number of additional nurses required per new patient is decreasing in the unit size n,

    that is when sn n is concave in n. When unit size and the staffing levels are assumed to take realvalues, Heuristic 2 can easily be shown to be concave, which suggests that the optimal staffing level

    exhibits this pooling effect7. However, the effects are more dramatic than just concavity. Recall

    that rn represents the nominal patient load, that is, the long run cumulative fraction of nursing

    time required in the unit provided nurses are always available when patients become needy. For

    large system (n > n), Heuristics 1 and 2 actually specify a number of nurses less than the baseline

    rn (since n < 0 under Heuristic). In fact the deviation from the baseline is order n.

    Observation 5. When the time target is positive (T > 0), the optimal staffing rule shows super

    pooling effects for large units (n > n): the staffing level is less than the nominal patient load (sn 1. When n = 1, most reasonable staffing policies

    will dictate that one nurse is present.

    Table 4 Performance of Heuristics H1 and H2; r = 0.1; = 0.01

    T = 0 T = 1/ T = 2/

    n sn sn s

    H1n s

    n s

    n s

    H1n s

    n s

    H2n s

    n s

    n s

    H1n s

    n s

    H2n

    2 2 0 2 1 1 2 1 1

    3 3 1 2 1 1 2 1 1

    4 3 1 2 0 1 2 1 1

    5 3 0 2 0 1 2 0 1

    6 3 0 3 1 1 2 0 1

    7 4 1 3 1 1 2 0 0

    8 4 1 3 1 1 2 0 0

    9 4 0 3 1 1 3 1 1

    10 4 0 3 0 1 3 1 1

    15 5 0 4 1 1 3 0 1

    20 6 0 5 1 2 4 1 1

    25 7 0 5 1 1 4 0 0

    50 11 0 8 1 2 7 1 1

    100 18 0 13 1 1 11 0 0

    200 31 0 23 1 1 20 0 0

    500 67 1 51 0 1 46 0 0

  • 7/27/2019 Ratios 3te

    32/32