Patient Waiting Lists & Healthcare
In the previous page we saw how to apply
replenishment to log marshalling. Of
course people are not logs – we know that.
People are much more perishable, especially when they are unwell. However, is there something that we can
learn from log marshalling in general that can also be applied to
healthcare? Let’s run a test, let’s
compare the patient waiting/referral process against log marshalling and see
if it is different or similar. If it
is similar then maybe we already have a reference environment from which we
can extract the principles and apply to healthcare. Let’s see.
Firstly, however, if you have arrived at this page
directly rather than sequentially through the replenishment page and the
distribution page, then please consider reading the replenishment page
first. This will ensure your
understanding of the technical solution (the planning and control system)
that we are going to apply in this case.
Forearmed with such knowledge you will be in a much better position to
evaluate the description of the current problem and also the potential for
the detail of the solution.
The “producers” or the source node in this system –
the general practitioner, or local doctor – is in this case a service
operation having no set-up and appears to produce stock for the system –
patients – at a random rate and in units of one. Now, however, there is no longer a
geographic many-to-one relationship between producer and the next node, the
specialist. But rather there is a many
to many relationship. Any general
practitioner may wish to refer a patient to any one of a number of different
specialists depending upon the nature of the illness.
Let’s try to draw this network.
All we have done here is to have
changed the relationship from one-to-one in log marshalling to many-to-one in
public healthcare and changed the labels.
As a generalization then, the marshalling network seems to describe
the flow of patients. There is however
one interesting and critical difference in this network, the actual
production part isn’t at the source nodes any more, here labeled the local
doctor. The actual production is at
the end node, the intervention/admission.
Health professionals “bristle” when manufacturing
analogies are used in health, “we don’t make cans of beans you know!” However, we need to use manufacturing
terminology for a moment to describe part of this system in consistent
terms. When the intervention is
carried out in a theatre, that part of the process is a production
process. It has a set-up, it has
particular equipment and staff for particular procedures, and it may even
have particular rooms. We need to know
this to distinguish it from the supply chain portion, the referral and
waiting list part, because the way in which the supply chain portion and
production parts are managed are intrinsically different.
How can we be sure that intervention is a production
process and not a project process?
Well, if we look at an operating list we can characterize it in terms
of patients per day, rather than days per patient. Therefore we can be quite certain that from
the system’s perspective the intervention node is a production process. This is important because it is likely that
the intervention node will always be the control point – the drum in
drum-buffer-rope terminology – regardless of whether the constraint is
internal (we don’t have enough capacity) or external (we don’t have enough
patients).
How can we be sure that this node will always be the
control point? Well, at a guess, it is
the most capital extensive step, either a new theater in surgery or a new
ward in medical cases. It certainly
will be the operationally most expensive step in terms of on-going staffing
and support. Therefore it is unlikely
that new theaters will be built in rapid response to demand. The control point is therefore unlikely to
shift somewhere else as a consequence of additional theatres being built.
So it looks as though there is some validity in
investigating marshalling or a convergent supply chain as a descriptor for
patient waiting lists. However, this
brings us to a significant issue – waiting.
On the very first page, the introduction to these webpages, we
mentioned that we can batch in time or we can batch in quantity. Supply chains are dominated by batching in
time, and the patient waiting list is a supply chain. Batching in time is so pervasive in
healthcare that we absolutely accept it as normal. We fail to even question the reasons for
its existence. Whenever we batch in
time we cause waiting to occur.
Reducing batching in time, along with removing the policy constraints
that limit output; will substantially reduced patient waiting lists. Would this be a worthwhile cause to
pursue? I think so. Are you interested?
Good, then we need a plan of attack.
Plan Of Attack
There is only one plan of attack, the 5 step focusing
process that we have used to date in the analysis of all of our logistical
endeavors. Let’s repeat it here for
good measure;
(1) Identify the system’s constraints.
(2) Decide how to Exploit the system’s constraints.
(3) Subordinate everything else
to the above decisions.
(4) Elevate the system’s constraints.
(5) If in the previous steps a constraint has been broken Go back to step 1, but do not allow inertia to cause a
system constraint. In other words; Don’t
Stop.
What is the constraint in this system? What are we trying to protect? The constraint isn’t a lack of “customers”
in this instance; there is no shortage of patients. It something else; it is expensive and
finite capacity – limited physical space somewhere, possibly “funded beds”
(as opposed to beds that exist but are deemed to be unfunded) for medical
conditions or theatre space for surgical conditions. That almost answers the second question –
how to exploit the constraint. We lose
output from the system whenever we have patients who need intervention, but
who are not in the right place at the right time to receive that
intervention. Healthcare is a service;
we can’t store the intervention for use at a later time.
In order to exploit the system we need to ensure
that we can never waste an opportunity to carry out an intervention. We will need to deduce how to best do this
– a strategy for exploitation. We will
also need to develop how to best subordinate the rest of the system once the
exploitation strategy is in place.
To properly and successfully exploit and subordinate
the valuable capacity in this system would mean that we begin to have shorter
waiting times and eventually spare capacity.
First however in order to be able to determine the exploitation and
subordination strategy we need to examine the properties of waiting list
networks in a little more detail.
Let’s do that.
The Waiting List Network
Let’s acknowledge right at the outset that there are
two ways that we could effectively exploit the constraint – one way is to
decrease the input and the other is to increase the output. We could express this as follows;
(1) Approaches that avoid illness
in the first place.
(2) Approaches that mitigate or cure illness
once contracted.
Although there is now an increased emphasis on
pro-active prevention rather than reactive intervention, the fact remains
that much illness still requires active intervention and moreover there is a
backlog of work. This backlog arises
from rising general expectations from taxpayers and improved levels of
care/technology from practitioners.
Technology is a double edge sword here, it both substantially reduces
the effort in simple interventions freeing up bed space that was unimaginable
20 or 40 years ago, and at the same time making possible interventions that
tie up bed space that was unimaginable 20 or 40 years ago.
We need to recognize that the patient waiting list
network in the main is concerned with non-acute admissions although how we
handle this network strongly impinges upon acute work also. Public health systems must deal with
non-acute, acute, and emergency patients; all at the same time. But this is not unusual. I have not yet seen a process that didn’t
operate a concurrent but differential priority system of some sort. Concurrent differential priorities are the
rule not the exception in serial processes and health professionals need to
recognize this commonality. In any
system; manufacturing, service, or supply chain, the only way to manage
differential priorities concurrently is to have adequate sprint capacity
and/or buffering.
Let’s examine our non-acute network then, from the
perspective of a patient/taxpayer (tired pun intended). It is a bit radical to take a patient’s
perspective but let’s press on. The
patient books an appointment with the local general practitioner who
determines that the problem more correctly needs specialist assessment. The doctor might ask “public or private”
implying some differential service but we won’t go there. Our patient paid taxes damn it and is going
to go public. Well, please wait 2
weeks and you will get a letter from the hospital telling you when to attend
an outpatient’s clinic. Two weeks pass
and the letter arrives – please attend a clinic in 6 weeks time! Six weeks pass and the specialist
appointment date arrives. And isn’t it
funny how all the other people in the clinic seemed to have similar
conditions. Anyway an assessment is
made and intervention is recommended within 6 months. Ah, that doesn’t mean 3 or 4 months, that
means something like 5½ months or 6 months. That’s 6 months waiting plus 1½ months waiting plus ½ a months waiting. That’s 8 months waiting all up. That is, if the specialist didn’t refer you
back to the general practitioner.
Why all the waiting?
Well, we are just trying to be efficient didn’t you know!
If you look at the marshalling system it looks like
an “A” – upside down I grant you.
However A-plants describe a situation where there is general
convergence in manufacturing.
Patients’ waiting lists don’t manufacture anything, but as a supply
chain they seem to exhibit the same behavior as A-plants. Let’s look at an A-plant description; under
traditional management practices in A-plants the tendency is to misallocate
resource time in an attempt to maximize efficiency and utilization
figures. Large batches are used to
keep the measurements high resulting in a poor component mix and constant
shortage of the right parts (1).
Furthermore these large batches move in waves throughout the plant
causing temporary bottlenecks to wander from resource to resource. Since material is constantly out of
balance, overtime is used to ‘catch up’ so that shipments can be made on time
(1). We can expect similar things to
happen in patient waiting lists.
What evidence do we have of local efficiency measures
to substantiate this assertion? Well
just reflect for a moment on the number of measures that we must compile for
reporting and how many of these directly relate to improving wellness in the
community. Aren’t most really local
efficiency figures? Aren’t most of
them sheer frustration to substantiate even on a good day?
What, then, if we were to increase the frequency of
clinics in such as system – and indeed interventions as well. Nothing major, moving things from once a
fortnight to once a week or from once a week to twice a week. What effect would that have if we could
simultaneously address the backlog?
Think about it.
But Demand Will Increase!
Well, there is a very peculiar notion in healthcare
that if we improve patient service levels then demand will also
increase. We need to examine
this. This notion belongs on another
planet.
Would you ever wish upon yourself a serious
illness? Pretty damn unlikely. So if patient waiting lists decrease, and
service improves, are people going to become ill more often just to avail
themselves to the new levels of service?
Pretty damn unlikely also. So
where does this curious notion arise from?
One situation where it might arise from is if we are
currently failing to meet a real need (as opposed to a desire, or a want, or
what the marketers would call a latent need – one when you go out and buy
something you didn’t even know that you wanted). If we are failing to meet a real need and
we increase availability then of course there will be an apparent rise in
demand. However, that demand was
already present, it simply wasn’t being met.
Failure to meet a present and real demand is not a reason to restrict
services if it is possible to improve access those services using existing
resources.
There is another aspect to this apparent
paradox. Healthcare has very strong
negative reinforcing loops operating in it.
Failure to respond to a need at an early stage means that the need when
eventually met consumes far more resources.
This might help explain why demand is perceived to increase. It is not necessarily the total incidence
that is increasing but rather the severity of the individual incidences once
they reach an agreed level for intervention.
We will return to this thought later.
You Don’t Understand We Have Too Many Acute Patients
Yes but, we have too many acute patients. This is an interesting problem. Some people become acutely ill
suddenly. Others become acutely ill
over time – time spent waiting for non-acute intervention. Governments find it hard not to fund acute
work and not so hard not to fund non-acute work, so we can guess why acute
work load is, in part, so significant.
Moreover, we already know how to break this vicious cycle – earlier
intervention. Too many acute patients
is really just an excuse. There are
others too.
The local population is too old; the local
population is too young. We have
diverse socio-economic challenges in our area, the area is too rural and
dispersed, the area is too urban and condensed. We can’t retain good staff, we can’t attract
talented doctors to our specialty, our specialty is under-recognized, our staff are older and more expensive than the mean and
so forth.
What about; our young doctors are attracted to the
large cities, or (if you are in a large city) our young doctors are attracted
overseas. And you can’t get good
locums anymore. Our buildings are over
40 years old, our buildings are an earthquake risk, the air conditioning is
antiquated, our total corridor length is much greater than anywhere else (its
true trust me). The flu season was
early/late this year – but never on time.
We have an unreasonable orthopedic load, we are a national center for
……… but this isn’t recognized in the funding.
In fact, you have probably worked out that there is
no end to this list. But don’t worry
these are not the problems either.
They may be symptoms of people’s frustrations, but they are not the
core problem, and therefore solving them is not the solution (but that has
never stopped anyone yet).
Let’s move on.
Cart Before The Horse
Improving the patient waiting list network is indeed
a fine ideal, especially if it ensures that we don’t miss an opportunity to
do an intervention because we didn’t have a patient ready – even though there
are 100’s of patients on the list (it happens!). However, the reason supply chain follows
production on these webpages is that we have to sort out our production first
if we are to improve our supply chain.
Now there are exceptions to this; for instance where we don’t own the
production stage or it is beyond our span of control or sphere of
influence. In these instances then,
yes, indeed we have to do our best in spite of the limitations. However, this
is not the case in public health systems, the production stage and the supply
chain stage are integral. So we should
address the production side first. And
if we can’t win there, then that shouldn’t be an excuse not to look at the supply
chain mechanics nonetheless.
In order to increase the production side we must
address a policy constraint. Let’s
have a look then at that.
There Is No Goal In Public Health
It has been said that if there is no goal then the
absence of a goal is the
constraint. We also noted in the
measurements section that the goal of a system is in fact open-ended. You can’t have enough of the goal. In contrast the necessary conditions that
support the goal can be viewed as having limits. Once we satisfy a necessary condition
additional satisfaction does not improve the rate at which the organization
moves towards its goal.
A problem arises
however in not-for-profit organizations of which a public health service is
just one example. Scheinkopf notes
that in not-for-profit organizations “there is a tendency to believe that the
measures are so intangible and that attainment of purpose is such a
subjective call, that such measures are simply not discussed. The focus ends up to be on measuring and
managing the things we call ‘tangible,’ such as money (2).”
New Zealand
health boards must currently meet an 11% “capital charge” on some types of
new investment (the rate-of-return incidentally is one that some public
companies in the free market can currently only dream of). This means that the Government must pay the
health boards pro rata 11% too much to cover this capital charge, which the
boards then pay back to the Government showing that they are indeed
efficient. This financial efficiency
can be met by restricting access by raising the level (points) for non-acute
admissions. At best, meeting the
capital charge is a necessary condition and a perverse one at that.
Because the
capital charge is imposed upon the system from outside and must be met, it is a necessary condition for success. However, necessary condition aside, it is
just a Government policy, this doesn’t mean that its validity shouldn’t be
challenged, nor the cost mentality behind it.
However the very real danger is that once the necessary condition has
been satisfied (boards run a “balanced” budget) there is no driver for further
improvement. The necessary condition,
due to its prominence, is mistaken as a goal – which emphatically it is not.
It is no
exaggeration to say that public health is missing a goal. Instead it has, as an objective, a
necessary condition – meet budget. We
can illustrate this further.
“The
Nelson-Marlborough District Health Board confirmed all elective surgery will
be postponed for about six weeks over summer.
The
moves come at a time when some patients are waiting up to five years for
non-urgent surgery, and the board is preparing to cut people from its waiting
lists if their conditions are not considered serious enough to warrant
treatment in the public health system.”
“The
Health Ministry contracted the board to do fewer operations than it had the capacity
to perform. As a result it was already
significantly over budget less than four months into the financial year.
The
purpose of the cuts was to reduce surgery to contracted levels and save money
(3).”
If you believe
in reductionist/local optima viewpoint you will also believe that each
operation has a cost and by avoiding operations we can avoid all the costs
associated with them and thereby save money.
If you understand the systemic/global optimum viewpoint then you know
that such efforts will hardly save a penny.
Sure it will save on some variable costs. However, using the quote above as an
example, we should ask what will the buildings do for 6 weeks, what will the
staff do for 6 weeks, and what will the air conditioners do for 6 weeks? They are not variable expenses. And of course what is the final cost to the
system when the work is finally undertaken – is it more or is it less?
Let’s have a
look then at evidence of, not of postponement, but of removal from a list.
“Many
gallstone patients in Auckland must now suffer at least four attacks of
severe pain and vomiting in a year to qualify for surgery.
Or
they must have two attacks of gall bladder inflammation, or experience worse
symptoms or complications.
Less
that four pain and vomiting episodes, called biliary colic, and you would
probably fall below the cut-off point – set in response to Government funding
levels – for elective surgery at North Shore Hospital (4).”
In the same
article but a different hospital.
“Waitemata
officials started to introduce their new scheme in November after struggling,
like all district health boards, with having more patients than can be
treated. They hope to extend it to
other types of surgery later.
Under
it, about 40 patients have already been taken off the surgery waiting list
because they are not considered sick enough.”
Note that the
qualifier is 4 attacks in one year – and then you go onto an “elective”
waiting list; but there was no mention of how long the list is until
intervention. Removing people from
waiting lists who are deemed not ill enough to warrant treatment conforms
exactly to one of Senge’s system thinking archetypes – eroding goals (5).
So I think it
is safe to say that the objective illustrated here is characterized by a
limiting necessary condition and that necessary
condition is to meet budget. We don’t
have a goal at present.
Who Should Set The Goal Then?
So if we don’t
have a goal at present in the health system, who should set the goal
then? Well, the answer is clear, the owner
of the system should set the goal. And
the owners are the taxpayers aren’t they?
Sure, but the Government of the day administers the health service on
behalf of the taxpayers; so it is the Government who is the proxy owner of
the system in this instance, and it is the Government that
should set the goal.
The Government
currently sets a number of necessary conditions that are financial in nature
because articulating a non-financial goal and the fundamental measures to support
it is deemed to be too difficult. But
is it really that difficult? Let’s
try.
How Do We Set The Goal?
Let’s try and
set a goal for public health so that we can move along back to our objective
of showing marshalling as a viable model for public waiting lists. How do we do that? How do we set the goal? I guess that we need to ask where we want
the public health service to be at the present. That would be a good place to start.
A politically
correct goal might then become; a timely and appropriate outcome. But what is the outcome? Is it community wellness? If it is community wellness, are we seeking
to maximize it? That certainly seems
open ended as a goal should be.
However, it might also imply incorrectly that funding should be
maximized and clearly there is a problem here because most people don’t want
taxes to increase which is exactly where the funding must come from.
Then, how
about; improve community wellness, as an appropriate outcome? Improving community wellness seems
sufficiently open-ended at this point in time (maybe even bottomless), we
could certainly do with a lot, lot, more of it. Why don’t we run with this for a while and
see if it will work for us. Thus the
trial goal for a public health system is to; improve community wellness now
and in the future. Let’s write that.
Establishing a goal is fine; however, we now need to ask what are the
absolute necessary conditions or inputs that will give rise to this
goal. In order to obtain this goal it
seems that there are at least 2 necessary conditions that we must
satisfy. We alluded to these in
defining the goal. A timely and
appropriate outcome implies a timely and appropriate input. The appropriate input could be pro-active prevention or the reactive
intervention that we carry out.
The timeliness depends more upon availability at this moment than
anything else. So let’s add these two
necessary conditions to our goal.
It seems that the appropriateness of the intervention isn’t so much in
contention as the timeliness. It seems
then that one necessary condition is currently satisfied – the
appropriateness. Medical professionals
do not appear averse to taking up new approaches or technologies in either
treatment or prevention. However, the
other necessary condition – timeliness, isn’t currently satisfied.
In fact,
satisfying this non-financial necessary condition looks a little
untenable. The proverbial rock and a
hard place. We need to improve the
outcome – community wellness – with a level of availability and therefore
timeliness that many would consider is currently “insufficient.” It therefore would be too easy to write
another necessary condition leading into the current one saying “secure
sufficient funding” in order to increase the level of availability and
therefore increase the timeliness – however, it would be quite another thing
to actually receive that funding.
We should also
remember from the measurements page that a not-for-profit organization such
as a public health service must watch its operating expenditure against its
existing fixed level of funding least it runs a deficit (6). So running in the red and hoping is out as well. How then do we ensure sufficient timeliness
and maintain our operating expenditure at the same time?
Let’s go back
to one of the most important statements in Theory of Constraints;
Productivity = Throughput / Operating
Expense
We have
muddied the water a little because our goal is now non-financial and
throughput, as defined, is a financial measure (sales – totally variable
costs excluding direct labor).
However, we can jury-rig another equation that will do just about as
well – we will substitute output for throughput;
Productivity = Output / Operating
Expense
We can measure
our output – patients. We can measure
our input – operating expense. If
output goes up and input goes down or stays the same then we have increased
our productivity and we have also moved towards our goal.
Let’s be clear
however, increased productivity doesn’t mean working harder. It does mean though, knowing sufficient
about the system, its dependencies, and the variability in and between
dependencies that we can protect the most valuable or most important part,
that part that we have the least capacity to spare. Let’s make “not working harder” an explicit
necessary condition to our goal so that this aspect is not misinterpreted or
misrepresented. Let’s draw it.
This then is the goal and the necessary conditions for a public health
system. We have identified a
non-financial necessary condition – timeliness – that is currently not being
satisfied.
How Then Do We Measure Progress Towards The Goal?
Using
productivity as a measure of progress towards the goal is a bit of a blunt
weapon – in fact it is more an indication of the method than the measurement
that we should use. The fundamental measurement then is our non-financial
necessary condition, the one that we are failing to meet currently –
timeliness.
The Government – the owner of the system – must set maximum national patient wait-times that must be
met. We can measure this performance
and it is non-financial. Moreover we
can see that meeting an increase in demand at static maximum wait criteria
and funding must mean an increase in productivity. Also meeting a lowered maximum wait
criteria at static demand and funding means an increase in productivity. We can measure progress towards or away
from the fundamental measurement with two local measures; patient-days-wait,
and patient-days-late.
Yes But, The Government Already Uses Maximum Patient Wait Times
So how is our
proposal different then; the Government already uses maximum patient wait
times for many aspects of healthcare?
That is true, but how is the issue managed at present? Timeliness is currently managed not by
increasing productivity but by decreasing productivity. It is managed by raising the criteria for
consideration, so that the patient wait times may remain high and constant
but the level of unwellness in the waiting list becomes greater over time and
the number of people treated becomes fewer and fewer – we saw direct evidence
of this in the earlier quotes.
Moreover, the
maximum wait times are measured purely by the number of patients. Our local measures; patient-days-wait and
patient-days-late are much more revealing about the true nature of the
waiting list. But we ourselves must
wait a little before we can investigate this aspect further.
Broader Issues
First,
however, there is a broader aspect to productivity that applies to a public
health system. Public health systems
are not “stand-alone,” public health productivity impinges upon the productivity
of the whole nation/state. Consider
for instance a country with first class productivity in one of the primary
industries such as; agriculture, fisheries, forestry, or mining, or first
class productivity in any one of the secondary manufacturing industries. These activities generate national
income. Why do we constantly strive to
increase the effectiveness of these national income generating activities if
a major consumer of this income, healthcare, operates on assumptions once
thought valid in a previous century – and I mean the 19th not the 20th
century. Other sections of the economy
have moved on.
Currently most
hospitals are implementing some form of patient information management system
and some form of enterprise-wide scheduling system. Enterprise-wide scheduling systems were
described in the section on production, essentially they are finite
scheduling solutions based upon a reductionist/local optima approach. As we know from manufacturing, reliance on
these techniques depends on excellent data integrity but generally results in
increased work-in-process because they fail to protect the system from
variation even through they have ample protection embedded within the
schedule – in short they fail to protect the constraint – output goes down,
work-in-process increases. Increased
work-in-process in this environment means more patients-in-waiting and
waiting for longer.
The reality is
that in both manual and automated scheduling systems many theater
opportunities are lost due to poor protection of the constraint. These losses are buried in the general
theater utilization hours, we have to scratch the surface to find them, but
they are real, and they do present a real opportunity to improve output at
current operating expense. And that
brings us to our critical erroneous assumption.
A Critical Erroneous Assumption
There is an
assumption that we totally fail to challenge – the assumption that we are
sufficiently productive and that we can not improve further. The pervasiveness of this assumption can be
demonstrated every time someone says; ”yes we could
process more patients if only we had access to more funding.” The hospital in the earlier quotation is
very likely to have sufficient productivity – it could do more operations
than contracted for (don’t be fooled by contracted cost, you need to see the
flow of money in and out of the system).
The real issue is that if one hospital provides a better level of
service than others it is defying a charter that requires equitable access to
all people in all parts of the country.
This means other hospitals are currently not as productive. The most productive hospital and all other
hospitals in between must be hobbled to the level of the least productive
hospital in the system in order to ensure equitable access. Think about it.
Yes but, all
the other hospitals could improve to the same higher level couldn’t
they? Well you would think so; this
would be the ideal situation. However,
there are two reasons why this doesn’t happen. Firstly under a reductionist/local optima
costing process, if we improve our productivity our unit costs will go down
and next funding round we will receive less to do the same number of
procedures rather than the same amount to do more. This is a very real fear of hospital
management.
The other
reason is more important. Currently in
the health system there is little knowledge of the rules of engagement that
we first saw in the measurements section.
Let’s repeat them here;
(1) Define the system.
(2) Define the goal
of the system.
(3) Define the necessary
conditions.
(4) Define the fundamental
measurements.
(5) Define the role of the constraints.
As you can
see, in health at present we have just a few financial-based necessary
conditions; we are missing so much of the whole picture. Why won’t we do this if it is so
simple? Are we scared? No, I don’t think so. It might be that many people simply don’t
know how to evaluate the role of the constraints in this system yet – or that
they do know how to but common practice runs counter to this.
Well,
fortunately we are using our common sense rather than common practice, so
let’s continue with our examination of patient waiting lists and
marshalling. We really ought to stop
looking at the problem and start looking at the solution.
Semantics
How can we describe the actions of the nodes in the
patient waiting list network? We have
suggested that the supply chain here is a marshalling supply chain, or more
accurately marshalling and consolidation. Patients are marshaled in by referral from
numerous local doctors and consolidated into specialties and then lists. The consolidation is carried out in
accordance to a push-to-need basis.
General practitioners feel that a particular patient
needs specialist expertise (and it is the expertise of the general
practitioner to know when this is required) and “launches” the patient into
the process and hopes that the outcome will be favorable (and timely). As in all other supply chain solutions here
we need to replace this with some sort of pull-and-replace system. The constraint, the most valuable and
limited resource, must pull the patients via the waiting list network to a
position where they are ready to receive intervention as soon as possible. Maybe we should call this a pull-to-cure or
a pull-and-cure system.
If at some future point in time there are
insufficient patients to fully load the system then we are moving in the
right direction. And if currently we
can at least stop waiting list expansion (without fiddling with the criteria)
and affect a contraction then we know that we will eventually reach that
future point. The key is that the system must initially pull at a faster rate than the incidence
rate of the problem. How
are we going to achieve that?
Well, unlike the distribution problem or the log
marshalling problem, where the constraints in the system were non-production
constraints, here the constraint is a production constraint. The intervention produces something; it
produces favorable outcomes – but currently it produces an insufficient
number of them. Thus we need to break
our solution into two subsystems;
(1) Production subsystem – Intervention.
(2) Supply chain subsystem – Patient Waiting List Network.
And as you know, common sense tells us that the
answers are already in the system. So
let’s have a look.
General Solution – Part One; Intervention & Drum-Buffer-Rope
A solution with an unusual name and very powerful
consequences. Drum-buffer-rope is the
Theory of Constraints production solution, it is a logistical solution. It is fully described in the section on
production; it is really a way of thinking more than anything else – a way of
thinking that enables substantially increased output from constrained
situations without recourse to additional funding or manpower. There is a good example from neurosurgery
in the United Kingdom (7).
The Radcliffe Infirmary went from canceling 64
elective neuro-surgical procedures over a 3 month period to canceling none in
the same period the next year.
Out-of-hours operations were drastically cut and output went up by
16%. Would a reduction in non-acute
cancellations be useful to you? Would
reduced out-of-hours operating be useful to you? Would an increase in output be useful to
you? This is not a trivial solution.
We could get away here with just briefly mentioning
some aspects of the drum and the buffer.
The drum is the constraint, it beats out the rate at which the system
works at. In our case the constraint
is most probably a surgical theater or a medical bed. Let’s draw this using our systemic model
that we developed earlier. The
constraint – our drum – is the rate limiting step.
A buffer is quite tightly defined
– in this situation it is a measure of time, the time for a patient from the
moment of admission to the beginning of intervention. To properly exploit our scarce
resource in surgical cases we must admit patients in good time so that they
are always ready for intervention.
However, to properly subordinate the scarce resource we must
also not admit too many patients at any one time.
Watch the distinction; it is very, very,
important. After all, one of our local
ward measures is average “bed nights” or some such similar measure. Having a lot of patients for a short time
is locally positive; having few patients at any one time for longer is
locally negative. The current local
measures do not support the global objective of the system. If we have too many patients waiting for
too short a period we will absolutely miss some interventions “because the
patient wasn’t ready”. Hell, the
patient was ready. It was the system
that wasn’t ready. Our output goes
down.
If we have fewer patients waiting longer between
admission and intervention we won’t miss an intervention, output goes
up. System operating expense remains
the same. It seems counterintuitive,
but if it was intuitive we would have done it – right?
We could summarize this as follows;
Introducing constraint buffers and decreasing
process batch size automatically
aligns the process with the goal
What do we mean by process batch? Well, I guess that an operating list is a
process batch. The other sort of batch
size that we might refer to is a transfer batch, and in a service operation
like this a transfer batch will be in units of 1 – the patient. To decrease the size of the process batch
means that instead of operating all day only on Tuesdays for instance – and
causing uneven ward work-load, how about Tuesday and Monday and Wednesday
mornings instead. Forget the detail,
it is simply that we are trying to decrease the number of patients at any one
point and increase the frequency.
Really we are trying to better balance the
flow. Again be careful, we never
balance capacity but we always try to balance the flow – just the opposite
from local optimization. Of course
there are practical limits to this, but we should make sure that the limits
are real and not policy. We need to
make sure that the policy is not some assumption rooted in the 1960’s or the
1950’s. Increasing the production
frequency is the primary driver that flows on back up into the supply chain –
the patient waiting list network. We
had better look at that next.
General Solution – Part Two; Patient Waiting Lists & Replenishment
The constraints in this system are in the
intervention stage, the stage located within a hospital, and this is the
stage that we must exploit. Therefore,
all other stages are non-constraints and we must subordinate these to the
constraint. The patient waiting list
network, like the log marshalling network, must subordinate to the constraint
until such time as there is a substantially reduced waiting list and
additional patients present at admission at a rate that is less than rate of
intervention.
To properly subordinate we must ensure that the
waiting list network never “starves” the production node. It starves the production node when it fails
to produce a patient for admission in good time. It happens.
Talk to a scheduling clerk and you will hear stories
like “I need a patient for the operating list on Tuesday fortnight – and I
have rung and rung around the patients on the waiting list but do you think
that I can find one!” It’s amazing,
but true, and very frustrating for those trying to do their very best. Thus our intuition as well as a good dose
of common sense suggests that we should move patients through the waiting
list network as quickly as possible to the place of greatest aggregate safety
for both the patient and the system – just prior to admission.
In fact, in medical cases, it is likely that the
supply chain prior to admission will also form part of the constraint
buffer. This type of situation is not
so uncommon in manufacturing especially where the first step in the process
is so capital-intensive that to “buy another one” is prohibitive. The expense in both cases here is the
bricks and mortar and the considerable number of skilled staff required to
run the facilities around the clock.
Let’s consider some questions then;
What would happen if we could increase the frequency
of clinics prior to acceptance for intervention? As an example, instead of holding a clinic
once a month for a day (because it is efficient for staff) what about holding
a clinic for half a day every fortnight, or until mid-morning every
week? It is kind of like waiting for one 747 or one of two 737’s. The total waiting time is less for the
smaller more frequent service.
What potential could that have?
What about if we could remove nodes completely or
combine nodes so that they occur at the same time and place, maybe carry out
some tests on the same day in the same place for instance? What potential could that have?
Hold on to these thoughts for a moment.
As in linear supply chain, distribution, and log
marshalling, we need to introduce into this system the Theory of Constraints
supply chain solution – replenishment.
If you are unfamiliar with fixed-frequency variable-quantity
replenishment then please check the explanation on the replenishment page –
it is important.
Each node in the waiting list network becomes a
buffer for the next node containing sufficient patients to ensure that it can
supply the next node down while it pulls patients from the next node up. The constraint, the drum, in the production
portion is the originator of this pull signal. Let’s draw the supply chain portion then.
If we carry out replenishment
correctly we will move safety to the area that is most important, the area
closest to admission. Let’s draw that.
And if we increase the frequency
of the clinics and other waiting list processes then these buffers can be
very small indeed and passage from one end of the list to the other will be
very rapid. That way once a referral is
made the patient can move through the system quickly and be available to be
“worked upon” – either an operation or a medical treatment if required as
soon as possible. We can summarize
this;
Introducing replenishment buffers and increasing
resupply frequency automatically
aligns the process with the goal
Now if we return to those thoughts that you are
holding on to, there is probably a big red flag saying’ “yes but there are
too many patients-in-waiting in the system now to make such a process
work.” Yes there are. But unless we get the appropriate mechanism
in place even before it is apparently needed things simply can’t
improve. If we were to size our
buffers today we would find that they are way over-full. But at least we know where we are heading.
In every situation where the system is drowning in
work-in-process, people are reluctant to give up the system that causes the
work-in-process that drowns them; “because there is so much work in the
system that doing this will have no effect.”
Exactly wrong.
We recognize how chaotic huge numbers of
patients-in-waiting are because periodically we “fiddle” with the criteria to
try and reduce the numbers.
Unfortunately that just feeds a negative reinforcing loop – we get more
acute patients. The only solution is
to maintain the criteria and increase productivity. You will be very, very surprised at the
effects. Patients are not logs, and
they are not cars, but that doesn’t mean that we can dismiss the
principles. Well in fact we can
dismiss them, but they won’t dismiss us.
We need to cut the strong negative reinforcing loops
and replace them with strong positive reinforcing loops. We need to look for systemic/global optimum
solutions not reductionist/local optima solutions. We need to look at trying to reframe the
environment and not to continually applying band-aids. The solutions are already in the system,
and those solutions although they represent change, represent a change in
meaning only.
Let’s Put It All Together
Let’s try and pull all of this together by showing
the system in its proper order; the supply chain patient waiting list network
feeding into the intervention stage.
Likely as not there is another supply chain at the other end –
district nursing, but let’s leave that for another day.
Doing this it becomes clear that
there is a feedback between the two.
We need to make sure that we don’t ever waste our scarce intervention
stage, and at the same time we need to ensure that the supply chain doesn’t
ever fail to provide an appropriate patient at an appropriate time.
Local Performance Measures
Earlier we parachuted in a goal for public service
healthcare and looked at how to measure whether we are moving towards the goal
or away from it. The goal and
necessary conditions might provide a measure for a whole system, but how do
we know in a system as complicated as a large public hospital or a district
health board that the parts – the subsystems – are also aligned and moving in
the right direction? Really we are
asking; how do we know that the non-constraints are subordinated to the
constraints. For this we need local
performance measures.
Another way of looking at local performance measurements is that they should judge the quality of
the execution of the exploitation plan (8).
What is the plan in this case?
Surely it is to provide a timely and appropriate outcome. We can’t comment here on the
appropriateness but we certainly can on the timeliness.
Timeliness is
reflected in two particular measures;
(1) Unit-days-wait.
(2) Unit-days-late.
In fact of the two, late-days is more important, but
waiting-days always seems easier to explain first. These two measures are simply a
re-verbalization of the two measures that we have consistently applied to any
subsystems in production or supply chain processes. In fact, we used these exact measures to
introduce the concept of local performance in the measurements section. Let’s have a look at these again in detail.
Let’s say for
instance that a certain outpatients’ clinic for referrals has 50 people on
the waiting list at any one time and last year these people waited on average
for 12 weeks, this year we still have 50 people on the waiting list at any
one time but they now wait on average for 16 weeks. What is the total waiting time here?
Well, we know
that last year that there was on average 12 weeks by 5 days per week by 50
people = 3000 patient-days-wait. In
comparison, this year there are 4000 patient-days-wait on the list. Is the performance better or worse? It’s worse of course. If we can stop patient-waiting-days from
increasing, or better still reduce it, then we must have improved the system. Let’s add this measurement to a linear
representation of our health system (both patient waiting list network and
hospital intervention).
So waiting-days is one measure that we can use to evaluate a subsystem
with, or indeed even departments within a subsystem.
Another aspect
of timeliness is that regardless of how long we must wait, do we still
receive attention “in time” at the end of the wait or are we late? Let’s continue with our analogy. Let’s assume that last year our patients
were expected to be seen by a specialist within a recommended guideline of 12
weeks of referral. Some, however,
weren’t seen within this time-frame.
Let’s say that 3 patients were seen after 13 weeks and 2 were seen
after 14 weeks. Again we might argue
that just 5 out of 50 or 1 in 10 patients were not seen within the
recommended guidelines. However, a
more realistic measure is that 3 were 1 week late and 2 were 2 weeks
late. This gives us 1 week by 5 days
per week by 3 patients plus 2 weeks by 5 days per week by 2 patients = 35
patient-days-late. Is this bad? Of course it is, it should be zero. We can add this measurement to our system.
So unit-days-late is another measure that we can use to evaluate the
performance of a subsystem with – anywhere that there is a clear hand-off to
another subsystem.
If the subsystems are aligned to the goal of the system we should
expect patient-days-wait to decline and patient-days-late to be zero. Now, these measures are excellent at
monitoring subsystems – nodes in the waiting list network for instance – but
there is no reason why they can not be used for the whole system as
well. If we use them for the whole
system, maybe divided by specialization, then they also provide us with a
non-financial measure of system success.
They don’t measure wellness in the community directly but rather
indirectly as the absence or decrease in unwellness. We should strive to reduce the unwellness,
wouldn’t you agree?
Let’s hope
that one day we can see in district health board meetings a 12-24 month
running graph tabled for each major subsystem showing patient-days-wait
and patient-days-late. Then we will know at a
glance whether we are all moving in the right direction or not.
We can test
for obfuscation
with a simple graph.
In the graph below we have some initial criteria for
admission to an elective list – patients who have managed to reach the
“access threshold.” Over time the
total number of patient-days-wait increases as the effects of system
dependency, variability, and an absence of knowledge of how to protect the
constraint cause output to be lower than input into the list.
At some time the length of wait
and the number of patients waiting becomes too great. There is a reassessment of the “access
threshold” and a limit is imposed.
The limit is imposed via new
criteria for the access threshold.
Some of the previous patients are “parked” in new categories such as
the “residual waiting list.” Nevertheless,
the patient-days-wait continues to increase as before, and for the same
reasons, but now artificially depressed for a time by the adjustment.
Patient-days-wait increases, that
is until, once again, the length of wait and the number of patients waiting
becomes too great. There is another
reassessment of the “access threshold” and a new limit is imposed.
The new limit is imposed via new
criteria for the access threshold.
Some of the previous patients are once again “parked” in new
categories with new and different names such as “active review.”
And of course once again, patient-days-wait
continues to increase as before, and for the same reasons, because we have
still failed to address the fundamentals underlying the problem.
The important point is that if we
extrapolate from the earlier data we should be able to get a good estimate of
the “real” patient-days-wait had the criteria remained unchanged. An apple with apple comparison. Let’ draw what we mean below.
So even if people say the we can’t
compare different sets of criteria, because the criteria themselves have
change, that is the make-up of the rules in the “access threshold,” then we
don’t need to allow that to divert attention away from the very real
conclusion that the wait list based upon the initial criteria is much greater
and can be estimated with relative ease.
What should we see then, if we instead address the
fundamentals in the system? What
should we see if we identify, exploit, and subordinate to the system’s
constraints? Let’s have a look.
Rather than impose a new access
threshold and “bump” people off the list, we have identified and protected
our weakest link and increased its output.
Patient-days-wait haven’t been reduced to zero, but they have been brought
under control, the backlog removed and a new, lower, and stable rate
achieved. At each steep downward step
indicates that a constraint in the system has been overcome and output has
risen (and days-wait decreased). Then
after some time the cycle “flicks” from being negative and vicious to being
positive and virtuous (removal of excessive waiting time induced acute load)
and the rate of intervention becomes greater than the rate of admission to
the list.
Don’t allow the “we can’t compare criteria” to obfuscate
the real issues.
Yes But, Do We Do Tonsillectomies Or Hip Operations?
So far we have
mainly considered patient-waiting-days as some undifferentiated mass. Of course different specializations will
“own” different lists. Different lists
have different degrees of difficulty.
We could knock-off the easy ones first and substantially reduce total
patient-waiting-days. Well such things
do happen. But what do you think will
happen next? We will run out of easy
jobs to pick. Then we get down to more
serious matters. The more responsible
and boring approach is to work away at the total mix.
Patient Waiting Lists Are Not Simple Replenishment
Patient waiting list networks are not simple linear
replenishment through a series of dependent nodes. They are first and foremost a convergent
marshalling and consolidation supply chain.
Moreover, the characteristics of the supply chain are strongly
affected by the characteristics of the integral production step – the intervention. Why can we be so sure that this isn’t
simple replenishment? Let’s have a
look;
(1) There is a many to many
relationship from the source nodes, the local doctor, to the specialist
assessment nodes.
(2) We must consolidate from numerous
source nodes to a limited number of intermediate nodes.
(3) We must currently subordinate the
whole system to the intervention stage.
(4) We must position the maximum
buffer protection in the place that best protects the whole system, just
before the intervention node.
(5) We must protect the more
disaggregated source nodes differentially by a higher frequency of resupply.
Of these, positioning the protection for the system
in the place that does the most good is probably the most important; this
helps avoid a major problem, having a lot of patients on the list but no one
immediately available.
Let’s Think It Through
So what do we
do first? Two things; a chicken and
egg scenario. Here goes.
We must raise
productivity at the intervention node.
We do this by overcoming our key policy constraint (or simply ignoring
it), implementing a proper global production approach – drum-buffer-rope (a
spreadsheet will do), and improving flow from our patient waiting list
network. However, to improve our flow
from our patient waiting list network we must raise productivity at the intervention
node and implement replenishment and buffer management in the supply
chain. Yes it’s a circular
argument. And do you know how to break
it? Just go out and do something,
somewhere, anywhere. It’s shockingly
simple, do something rather than talking it to death and you will begin to
move the system towards its goal.
Once you have
tackled this crux, increasing productivity will become self-reinforcing;
improving flow in the patient waiting list network will also become
self-reinforcing. Think smaller
batches, either in time or quantity and you are on your way, think global
results not local efficiency and you can deduce the solution for yourself.
What Are The Unavoidable Outcomes?
If we approach
the problem as suggested then the unavoidable outcomes are;
(1) Increased patient output.
(2) Absence of “no-shows” in
wards/theater (zero patient-late-days).
(3) Decreased waiting lists (decreased
patient-waiting-days).
(4) Decreased waiting times on the
list (decreased patient-waiting-days).
(5) Lower severity of illness at
intervention.
(6) Shorter overall bed stay.
(7) Lower overall operational
expenditure.
We will arrive at a position where we can
pull-to-need or pull-to-cure with increasing rapidity.
But Wait, Reality Isn’t This Simple!
If this explanation seems quite simple and
straightforward then that it excellent; then we know that we have developed a
broad understanding of patient waiting list networks with replenishment. If experience tells us that reality is much
more complicated than this generalized explanation, then that too is
excellent. Now we are in a position to
better investigate how to apply this methodology to our own particular
situation.
Give Us Some Examples
The Radcliffe Infirmary whose results we have
already discussed is one of the published examples of Theory of Constraints
in healthcare. The closest
approximation to this approach in New Zealand that I am aware of is the
operational and logistical expertise that the Health-Med Group has brought to
the privately run Venturo public urology service. Although Theory of Constraints was not used
there is so much plain common sense that this example should be very much
better known.
In 1993 Venturo was awarded a population-based
contract to undertake the treatment of all referred patients in need of
specialist urology services within its operating area. Traditionally services had been (and still
are) contracted for in-advance by the number of procedures or pre-determined
volume rather than a fixed fee based upon the total population served.
During the first year the service sought to minimize
its risk by dealing with the existing waiting list backlog. Outpatient visits increased by 32%,
inpatient admissions increased 9%, and day-patients increased 16%. In total 70% more people were treated from
the waiting list than the previous services had in the previous year. Total waiting time between GP referral and
operation was reduced by 44% from 37 weeks to 26 weeks (9).
How did they do this?
Good Management.
Clinics, wards, and operating theatres have dedicated urology staff
and the booking systems ensure maximum utilization of both staff and
facilities. Some investment was made
in ultrasound and laser equipment which has improved the ease and timeliness
of diagnoses and treatments. Protocols
were developed for GP referral.
However, one of the most significant gains was the standardization of
clinical practices amongst urologists (9).
Now read back through the earlier arguments. It appears that Venturo ensured that
patient and system risk was reduced and managed by moving patients to the
place of maximum safety – pre-admission or thereabouts, and by increasing the
output of the intervention stage. Its
just common sense. More people were
treated sooner, and access to urology specialist services increased (9). Moreover the contract fee for this service
has remained fixed for the last 7 years.
Summary
There is a very powerful policy constraint in place
at the moment which we must break. We
must be prepared to demand that each hospital tries to raise its productivity
to the level of the very best. Clearly
we can’t all be the very best. But the
spread, or tail, down from the very best must be quite tight. Currently, rather than raise productivity,
the system policy actively hobbles each hospital to the level of the very
worst.
We don’t’ need “free” enterprise to make this
work. We do need, however, some
enterprise of thought. If we look at
this at a national level we buffer our health risk through our taxes. We gain maximal benefit from minimal
payment. It would be a shame to allow
a profit margin to accrue to “free” enterprise just because the public
enterprise is constrained by its own well intentioned policy.
In order to raise productivity we must adequately
exploit and subordinate the very valuable and limited capacity that we
already do have. We can do this
without recourse to additional funding.
Part of the exploitation and subordination is to have patients ready
for intervention from the patient waiting list network as soon as
possible. We can do this by
implementing a change in mind-set away from local efficiency measures and
towards global effectiveness measures.
A crude initial indicator is whether our total
patient-waiting-days on a list is decreasing or not. There is no other measure that doesn’t seek
to obfuscate reality is some fashion.
Returning to the Radcliffe Infirmary example. One further outcome was produced; "The
difference was visible, the staff just looked much better … and happy staff
means happy patients (7)." One of
our necessary conditions for success in Theory of Constraints is to provide
employees with a secure and satisfying workplace now and in the future. It seems that at Radcliffe they achieved
all of the necessary conditions.
Now why can’t you?
References
(1) Stein, R. E., (1994) The next phase of
total quality management: TQM II and the focus on profitability. Marcel Dekker, pg 36.
(2) Scheinkopf, L., (1999) Thinking
for a change: putting the TOC thinking processes to use. St Lucie Press/APICS
series on constraint management, pg 25.
(3) Surgery slashed in bid to reduce hospitals’ costs. New Zealand Herald, 20th October 2003.
(4) Galling
delays for surgery. New Zealand Herald, 26 February 2004.
(5) Senge, P. M., (1990) The fifth
discipline: the art & practice of the learning organization. Random House, pp 383-384.
(6) Newbold,
R. C. (1998) Project management in the fast lane: applying the Theory of
Constraints. St. Lucie Press, pg 228.
(7) Phipps,
B., (1999) Hitting the bottleneck. Health
Management Magazine, February, pp 16-17.
(8) Goldratt,
E. M., (1990) The
haystack syndrome: sifting information out of the data ocean. North River Press, pp 144-155.
(9) Hoskins,
R., Blaxall, B., and Sceats, J., (1996) Venturo, evaluation of a pilot
specialist budget-holding contract: report of the first two years. Midland Health, Health and disability
Analysis Unit Evaluations Series Number 1, 29 pp.
This Webpage Copyright © 2003-2009 by Dr K. J.
Youngman
|