The Bristol Royal Infirmary Inquiry Logo


bullet list decorationHome Page

bullet list decorationSearch

bullet list decorationFinal Report

bullet list decorationInterim Report

bullet list decorationEvidence

bullet list decorationInquiry Seminars

bullet list decorationAbout the Inquiry

bullet list decorationHelp

Seperator Bar

Hearing summary

3rd November 1999

Today, analysts commissioned by the Inquiry and members of the Inquiry’s Expert Group, who have assisted with the evaluation, fed back their findings from the analysis of six relevant data sources.

Brian Langstaff QC, Counsel to the Inquiry, opened the hearing this morning by explaining that the statistical review, analysis and synthesis of six key data sources relevant to the work of the Inquiry is part of the exploratory phase of the Inquiry's strategy for using statistics to inform its investigation of children's heart surgery at Bristol. He highlighted the need to place figures in context and stressed that today’s evidence is one part in the jigsaw of the Inquiry’s investigation.

The following analysts and members of the Inquiry’s expert group presented their findings to the panel during the course of the days hearings:

Professor Michael Campbell, Professor of Medical Statistics, University of Sheffield, explained how to convey complex statistical concepts and identified the techniques used during the analyses.

Professor Stephen Evans, Principle Consultant Statistician, Quintiles, reported on reported on the analysis of local data relating to children who received cardiac surgery under the terms of reference of the Bristol Royal Infirmary Inquiry.

Dr Paul Aylin, Senior Clinical Lecturer, Imperial College School of Medicine, London, reported on the analysis of Hospital Episode Statistics.

Professor Gordon Murray, Professor of Medical Statistics, University of Edinburgh, reported on the analysis of the UK Cardiac Surgical Register and the South West Congenital Heart Register.

Dr David Spiegelhalter, Senior Scientist, MRC Biostatistics Unit, University of Cambridge, presented a synthesis of all the statistical sources concerning the nature of the outcomes of paediatric cardiac surgical services at Bristol relative to other specialist centres from 1984 to 1995.

Dr Eric Silove, Paediatric Cardiologist, Birmingham Children’s Hospital,

Mr Leslie Hamilton, Paediatric Cardiac Surgeon, Newcastle Upon Tyne Hospitals, members of the Inquiry’s Expert Group, also attended the hearings to comment on the presentation of the reports.

 

FULL TRANSCRIPT

 

   1              Day 70, Wednesday, 3rd November 1999
   2   (10.00 am)
   3   THE CHAIRMAN: Good morning, everyone. Good morning,
   4     Mr Langstaff. An apology is due from me that we are
   5     starting somewhat later than we should. I accept
   6     responsibility for that. We had a number of things we
   7     had to deal with elsewhere. Forgive me. We are now
   8     ready to hear you, Mr Langstaff.
   9          INTRODUCTION TO TODAY'S EVIDENCE
  10   MR LANGSTAFF: Sir, thank you. Sir, today, as I have
  11     indicated on more than one occasion, we are renewing
  12     the acquaintance in evidence with statistics. Back in
  13     July we examined the way in which this Inquiry would
  14     deal with statistics and to an extent the part that
  15     statistics would play in the Inquiry, and outlined
  16     a strategy which involved an exploratory phase. This is
  17     today to be the report of that exploratory phase.
  18        At the outset the Inquiry outlined the process
  19     which statistical analysis would take. Figures can be
  20     seductive. It is tempting to regard the numbers of
  21     operations and the numbers of deaths recorded in any
  22     statistical analysis of Bristol as answering
  23     the question whether care was adequate or not. That
  24     would be entirely wrong.
  25        After all, if 40 years ago 8 out of 10 patients
0001
   1     suffering from a particular condition died, that might
   2     be regarded if the condition were particularly complex
   3     as not being a cause for investigation and
   4     recrimination, but rather a cause for saying, "In this
   5     operation for this condition in this hospital we managed
   6     to save 2 out of 10", and be a cause for celebration.
   7     Figures have to be placed in context.
   8        Although the evidence you will hear today will
   9     emphasise statistics and analyses of figures and
  10     databases, it is important from first to last that
  11     the context be remembered. The statistics are only part
  12     of the jigsaw. Figures alone can never say whether they
  13     represent an adequate state of affairs or not. They can
  14     only be part of the picture. They can suggest that an
  15     explanation is called for; they cannot themselves give
  16     the explanation.
  17        From first to last today it must be remembered
  18     that the Inquiry will approach its task set out in its
  19     terms of reference of assessing the adequacy of care in
  20     four principal ways. First is the evidence, written and
  21     oral. Part of the adequacy of care is, for instance,
  22     the way in which parents and children were treated as
  23     people. How can figures tell us that?
  24        Just as figures can tell us little about
  25     the approach of clinical staff to patients, because that
0002
   1     is a matter for qualitative assessment by the Panel, so
   2     too a qualitative assessment can be made on the basis of
   3     the evidence provided as to the level of care and its
   4     adequacy delivered at Bristol.
   5        Second, there is evidence given by experts and
   6     others as to the adoption of procedures and policies
   7     which in their expert view from long experience in
   8     the field might have made a difference. This too is
   9     part of the picture.
  10        Third, there is a mosaic of individual cases and
  11     individual stories which makes up the pattern of care.
  12     The Inquiry has already heard from many parents, each
  13     telling from their own individual perspective of
  14     the care given to their son or daughter. This is
  15     invaluable. It is part of the picture.
  16        In each of those three areas which I have already
  17     dealt with statistics play no part.
  18        Next the clinical case note review which we will
  19     look at in evidence tomorrow forms a basis for
  20     a representative snapshot of the care given to children
  21     at Bristol over the years with which the Inquiry is
  22     concerned. We shall hear more of that.
  23        Finally, and you will by now I hope appreciate
  24     that it is part and part only of the picture, part of
  25     the complex jigsaw that makes up the picture of adequacy
0003
   1     of care, there is that which the figures can tell us; if
   2     you excuse the expression, "a health warning".
   3        Figures in this area must be treated with
   4     caution. If I take a coin and toss it once, there is
   5     a 50 per cent chance that it will come down heads and a
   6     50 per cent chance that it will come down tails. If
   7     I toss it twice therefore it should come down heads on
   8     one occasion and tails on the other. If it however came
   9     down twice as heads, one could not safely conclude that
  10     I was using a double-headed coin. That is life.
  11        The true chance of it coming down heads or tails
  12     would still be 50 per cent and it would remain 50
  13     per cent on the next occasion, no matter what the run of
  14     heads or the run of tails had been. If I take the coin
  15     and toss it ten times, then on average, but only on
  16     average, you might expect it to come down heads five
  17     times and tails five times. But, if it came down eight
  18     times one way and twice the other, you would not
  19     immediately say there is something wrong with the coin.
  20        Everyone has in daily life experience of tossing
  21     a coin. If, however, all the evidence that we had to go
  22     on to compare two coins was that one when tossed ten
  23     times came down heads on eight occasions and the other
  24     when tossed ten times came down heads up on four
  25     occasions, if that was all we had to go on, it might be
0004
   1     tempting to think there was some real difference between
   2     the coins that made one more likely to come down heads.
   3        Yet the truth, as we know from everyday
   4     experience, will be that there was actually no
   5     difference at all between them, despite the twofold
   6     difference on the figures.
   7        I should say that I am grateful to our experts for
   8     giving me this everyday simple example in order to
   9     make two serious points about figures. First of all
  10     there is an inevitable variability about figures which
  11     can be very misleading. Secondly, any picture which is
  12     painted by figures must necessarily be general.
  13        We in this Inquiry must never forget when
  14     a general picture is painted, and perhaps particularly
  15     when it is painted by statistics, that at the centre of
  16     the figures are cases of individual children, some of
  17     whom will have been treated and will thankfully have
  18     survived, some of whom will have been treated and sadly
  19     will either have suffered subsequently in consequence of
  20     the treatment or of the condition which led to the
  21     treatment and others who will have lost their lives,
  22     whether because of the underlying condition for which
  23     they suffered or because the hospital failed to care for
  24     them as it might have done.
  25        Statistics are only part of the picture and any
0005
   1     figures are variable. It is false to treat a particular
   2     figure as being any more than within a range of likely
   3     figures. They can distract attention from
   4     the individual case which matters.
   5        But perhaps most important of all is the message
   6     which our statisticians have given us repeatedly:
   7     statistics on their own cannot show by demonstrating let
   8     us suppose low mortality rates at Bristol that surgical
   9     competence or any other factor was the cause of that
  10     success. If it shows the opposite, it cannot show what
  11     the cause was of the high mortality rate. What
  12     the statistics can do is show that there was
  13     a difference. They cannot on their own show what was
  14     the reason for that difference.
  15        We will explore in evidence today six data sets.
  16     The evidence you will hear will first of all be that of
  17     Professor Campbell, who will explain far better than
  18     I can possibly hope to do and much more authoritatively
  19     some of the statistical concepts which underlie
  20     the examination which others have conducted.
  21        Secondly, we will hear from Professor Stephen
  22     Evans as to his analysis of three local data sets:
  23     the patient administration system in Bristol;
  24     the Inquiry's own examination of the clinical case
  25     records and what they have shown -- bear in mind that
0006
   1     some 2,000 cases have been looked at, coded and
   2     analysed, and his is that analysis -- and the surgeon's
   3     logs from Mr Wisheart and Mr Dhasmana.
   4        We shall then hear from Dr Aylin as to
   5     the hospital episode statistics a national data set.
   6     That will be related by him and by Professor Gordon
   7     Murray to the Cardiothoracic Surgical Register of
   8     the United Kingdom. Both those data sets provide
   9     something of a national picture. Professor Murray will
  10     also compare the Cardiothoracic Register with a register
  11     kept locally by cardiologists, the South Western
  12     Congenital Heart Register.
  13        Each of them in turn will present their findings
  14     to you, each of them will be asked some questions at
  15     the end of their presentation, and each of them expects
  16     to show you various slides to demonstrate the points
  17     that they have to make.
  18        At the end of the day we shall have a synthesis of
  19     those various different data sources prepared for us by
  20     Dr David Spiegelhalter. He will indicate what in his
  21     view, and I expect the other statistical analysts to
  22     contribute to this in discussion, is the way forward and
  23     what the Inquiry needs to do next in order to examine
  24     what it is that the statistics are actually telling us
  25     so far as they can.
0007
   1        One of the difficulties with having six data sets
   2     is that there are inevitably different ways in which
   3     each of the groups that compile those data sets have
   4     gone about their task. If you take six people looking
   5     at the same data but separately, they will approach it
   6     in different ways. They will group, for instance,
   7     certain ages, they will take certain age ranges, they
   8     will look at certain operations in a certain way and
   9     immediately you have a problem of comparing one data set
  10     with another because the classification, the approach of
  11     each to the underlying data, is different.
  12        So if any message is to be learned from
  13     the available data, it is necessary that they look at
  14     that data in the same way if they can. The problems
  15     that occurred to the Inquiry in, as it were, getting
  16     the data sets to speak the same language so that they
  17     could usefully be compared -- and comparison is
  18     important if one is to gain credibility from the data,
  19     given the particular problems that a number of the data
  20     sets had that were identified to us back in July --
  21     the essential problems were those of definition: how
  22     does one define the operative procedure that is used?
  23     Each operation is, as Professor Anderson indicated to
  24     us, to an extent unique because the anatomy of each
  25     heart will be different from the anatomy of each other
0008
   1     heart. So although one may say, "This is a truncus,
   2     this is a case of transposition of the great arteries",
   3     it is never quite so simple.
   4        One of the problems therefore is comparing an
   5     analysis, for instance, of the PAS system locally which
   6     was made by administrative clerks with no particular
   7     clinical expertise looking at the discharge letters and
   8     working out from those what the nature of the operation
   9     was and under which heading which group it should be
  10     placed, comparing that with the surgeon's logs, where
  11     the surgeons have their own view as to how the operation
  12     should be classified and, when one comes to look at
  13     the whole position nationally, assessing how on earth
  14     one compares Bristol with the rest of the country
  15     without knowing for certain that people in one centre
  16     compared to people in another have adopted exactly
  17     the same pattern and the same degree of approach,
  18     the same approach, to similar data.
  19        What the Inquiry decided to do in order to allow
  20     a synthesis was to decide that it was necessary first of
  21     all to class procedures as either open or closed, and to
  22     class them as open if the procedure was one which was
  23     performed on cardiopulmonary bypass.
  24        Secondly, it was necessary to group the procedures
  25     in a valid way, valid both clinically and
0009
   1     statistically. Grouping is important for statistical
   2     purposes. If you go back to my example of each heart
   3     being unique, it would be unhelpful in any form of
   4     analysis or comparison to say, "The most one has in any
   5     particular data set is one case, because every case is
   6     unique." Although it is right, there has to be a measure
   7     of similarity -- only a measure, not complete
   8     similarity -- to enable one to have a sufficient number
   9     to conduct an analysis.
  10        You will hear in a moment or two that the Inquiry
  11     decided, on having taken advice -- and it has to be said
  12     that the advice was by no means entirely consistent,
  13     entirely with one voice -- to rank the procedures under
  14     13 headings; classify them under 13 headings; but,
  15     having classified them, to rank them. Let me
  16     demonstrate what I mean by showing you the first slide
  17     of the day, INQ 13/54. What you have here is a list of
  18     the groupings which the Inquiry has adopted for
  19     the purpose of the various analyses. You will see that
  20     each of the analyses approaches the data sets in
  21     the same way.
  22        There are 13 groups, "G" for group, and then
  23     the number. The second box, OPCS 4, procedure code,
  24     a word of explanation: throughout much of the period
  25     with which we have been concerned the large national
0010
   1     data set hospital episode statistics has described
   2     operations by a mixture of letters and numbers, an
   3     "operation code". The relevant codes are K and L
   4     covering cardiac and vascular conditions.
   5        In the list of codes a separate number is given
   6     for the description of different procedures. Bear in
   7     mind these are procedures and not diagnoses. That is
   8     a distinction which again has to be borne in mind in
   9     comparing data sets, some of which look at diagnoses and
  10     some of which look at procedures.
  11        What is under OPCS 4 procedure codes you can see,
  12     and the detail for the moment does not matter, is
  13     attributed to one or other group and then a description
  14     is given. G2 and G3 demonstrate, as you will hear in
  15     evidence, some of the difficulties of allocating
  16     a particular operation to a particular group. They
  17     demonstrate one of the problems with data sets, because
  18     of necessity once there has been a description in order
  19     to understand data over a period of time the same
  20     description has to be applied to the same procedure
  21     throughout history.
  22        The problem is that life is not like that.
  23     Medicine moves on. Procedures change. They develop.
  24     For instance, when the data sets reported upon
  25     the switch operation in the early 1980s, they would have
0011
   1     been reporting much more on what we know as Mustard and
   2     Sennings operations, interatrial switches for the same
   3     condition than the data set of the 1990s which will be
   4     reporting much more on arterial switch operations about
   5     which we have already heard in evidence.
   6        You will see when the data is examined that it
   7     appears that there is quite a grey area around
   8     the margins as to which operation, which procedure,
   9     should in the data sets that we will look at be put
  10     under which heading. To that extent the data has to be,
  11     it is, uncertain. To that extent, any conclusions that
  12     might be drawn in looking simply at group 2 in isolation
  13     or group 3 in isolation would have to be very cautious
  14     indeed.
  15        It may be that sense can be made by grouping both
  16     or looking at group 2 and group 3 together. But bear in
  17     mind the purpose of dividing it up into 13 groups was to
  18     look at each of the 13 groups individually at
  19     the outset.
  20        The fourth heading across the page, "Primary
  21     procedure ranking", has a number of numbers attributed
  22     to the various operations. The problem here is
  23     essentially a practical one. When the clerk,
  24     the administrative clerk or the person compiling
  25     the data set, comes to record his particular operation
0012
   1     as one or the other, and he looks at the list
   2     of procedures which may have been followed, if he has
   3     one code and one code alone to use, which does he
   4     choose? If you might, for instance, have something
   5     which will be classed as a Fontan type operation, G9,
   6     and if in association with that there needed to be
   7     a closure of an atrial septal defect, which of those two
   8     procedures would be coded?
   9        Here the Inquiry had to take clinical advice from,
  10     it has to be said, the bulk of our expert team of
  11     cardiologists, cardiac surgeons and others who were
  12     identified in a note giving full details of this
  13     procedure published today, had to identify which should
  14     take precedence. That is the ranking procedure that is
  15     indicated by the list which is third from the right on
  16     the chart as you look at it.
  17        So that, if there was a truncus arteriosus, that
  18     would in precedence to any other condition identified be
  19     the grouping that was chosen. You will see that
  20     therefore, if one comes down to the last of the open
  21     operations, there are 13 groupings in total, two closed,
  22     11 open; 11, group 6, closure of secundum and sinus
  23     venosus atrial septal defects. But that is not going to
  24     be coded, except where it occurs on its own as an open
  25     condition.
0013
   1        Of course one of the further problems that anyone
   2     may have in trying to group operations under one of
   3     these headings, the primary procedure groupings, is that
   4     it may just not fit. Take a transplant -- not in fact
   5     performed in Bristol, but bear in mind that there will
   6     be comparisons with other centres where transplants may
   7     have been performed. There are three in the country.
   8     There is no heading under which something such as
   9     a transplant can be placed. As a result, there has to
  10     be a residual coding, and there was indeed in
  11     the national data sets a residual grouping for
  12     procedures which could not be coded under one of
  13     the more precise letters and numbers used by OPCS.
  14        So for our purposes the statistical analysts have
  15     not only looked at 13 procedure groups, but also looked
  16     at open operations, all open operations, whether the 13
  17     or more, 14, 15, 16, 20, whatever it might be, and
  18     compared those with open operations elsewhere, and all
  19     closed operations with all closed operations elsewhere.
  20        When comparing one centre with another there can
  21     of course be a comparison made across each of these
  22     individual groupings, each of the 13, or indeed across
  23     the general open and general closed categories. But
  24     please remember that the boundaries may not always be
  25     clear between one group and another, and there may be
0014
   1     some groups in some centres which are not shown here.
   2        One of the reasons why these particular groupings
   3     were chosen is explained by the far right-hand column on
   4     the chart, "Map to UKCSR". UKCSR stands for United
   5     Kingdom Cardiac Surgical Register. As we already heard
   6     in the Inquiry, this register was throughout the period
   7     that the Inquiry is concerned with, except for a few
   8     months in the middle 1990s, collecting data sent into it
   9     by centres and allowing any one centre to compare its
  10     performance against the national performance of the year
  11     or two before.
  12        The precise data which gave rise to those reports
  13     is of course not now readily available. What we have is
  14     simply the groupings which the Cardiothoracic Register
  15     had. It is a valuable data source, particularly since
  16     it was reported to the Cardiothoracic Register by or on
  17     behalf of the clinicians most closely connected with
  18     the care of the particular child at the time.
  19        In order to make use of that data set there has to
  20     be a means of bringing, if you like, the other data sets
  21     into line with it. This, in my simple understanding, is
  22     what is meant by "mapping", trying to fit the procedures
  23     identified in the other data sets to the diagnoses with
  24     which the Cardiothoracic Surgical Register deals. That
  25     is some explanation as to how one can take the groupings
0015
   1     which you see here and make sense of them in
   2     a comparison with the Cardiothoracic Surgical Register.
   3        The statistical evidence that you will hear was
   4     essentially of two sorts. One will establish Bristol's
   5     performance as best the data can tell us. Secondly, it
   6     will compare that performance with elsewhere.
   7        In looking at the local data, there is some
   8     difference of numbers; bear in mind that the PAS,
   9     patient administration system, is coded by
  10     administrative clerks who are not medically qualified,
  11     the surgeons' logs are not formally validated, but
  12     the data may be in small groups and therefore subject to
  13     large statistical variation -- go back to my example of
  14     tossing the coin -- and much of the data was not
  15     collected for the purpose of assessing the adequacy of
  16     care. That is the local figures.
  17        In comparing data with elsewhere it needs to be
  18     borne in mind that the data may not have been validated
  19     and may therefore be inaccurate. Where a comparison is
  20     based on data from the Cardiac Surgical Register for
  21     the period from 1985 to 1991, there is no cross-check
  22     with the national HES, hospital episode statistics,
  23     system.
  24        Professor Murray will tell you that it is clear
  25     that the primary data quality of the UKCSR is poor, and
0016
   1     accordingly the ordinary degree of variability,
   2     the range within which you might expect figures to fall,
   3     is all the greater. Professor Murray will emphasise
   4     that he had been unable to visit cardiac units elsewhere
   5     in the country to assess the quality of their primary
   6     data, and further work to assess that quality will be
   7     needed before one could assess the weight which should
   8     be placed on the results of comparative analyses.
   9        When comparing Bristol with other centres
  10     a twofold comparison is adopted. A word of explanation
  11     of this before the evidence is called. First, there is
  12     a comparison of Bristol children with all children
  13     elsewhere. This is, you will hear, compares the rates
  14     of mortality which appear on the face of the data with
  15     those collected from the whole of the rest of
  16     the country grouped together.
  17        Secondly, Bristol as a centre will be compared
  18     with other centres. For this purpose the average of
  19     the performance by centre was determined. It is obvious
  20     that no centre will be exactly average. Some will, by
  21     reason of simple chance variability, in any one year
  22     have more or less deaths in any given operation than any
  23     other centre. The greater the number of operations and
  24     the periods considered, the greater the chance that
  25     the centre will both appear to have above average number
0017
   1     of deaths in some procedures and below average in
   2     others.
   3        It may be distressing to many parents to hear
   4     the terms in which our statistical analysts will express
   5     this. They will talk of excess deaths. It is essential
   6     to realise that this is simply a way of expressing an
   7     apparent difference, and it is important to remember
   8     from everything that I have already said that
   9     the difference may not be real and that some difference
  10     is inevitable. It will be most unfortunate if a simple
  11     and ordinary way in which statisticians express
  12     themselves may seem be to callous, unfeeling and
  13     upsetting.
  14        To say, for instance, that in any particular
  15     centre there have been, say, six excess deaths is not
  16     passing a judgment as one would in a court of law
  17     assessing whether there had been clinical incompetence
  18     or not. If this is not understood, there is a very
  19     great risk that someone might ask, "Which were the six
  20     deaths?" or even worse, "Was it my child?" To ask those
  21     questions will be to misunderstand the purpose of
  22     the statisticians expressing themselves as they do.
  23     They are painting a picture with a variability, with
  24     data which may not be entirely reliable in order to
  25     provide a generalised comparison.
0018
   1        If the point needed to be made, you will see that
   2     when they present their papers that quite often
   3     the excess deaths in one or other centre are presented
   4     with a decimal point. To say that there were six excess
   5     deaths in this or that centre might understandably give
   6     rise to the question, "Which were they?" But if the
   7     expression used is, for instance, 6.9, then I hope it
   8     will be clear to all that this is one way of expressing
   9     a difference. It is not an attempt to draw conclusions
  10     in any individual case.
  11        A word about the groupings which make it almost
  12     inevitable that there is likely to be an expression of
  13     excess deaths used in the statistical sense that I have
  14     mentioned in almost any centre. The data have been
  15     grouped in epochs, taking us from January 1984 to
  16     December 1987, epoch number 1; January 1988 to December
  17     1990, number 2; January 1991 to March 1995, number 3;
  18     and April 1995 to December 1995, number 4.
  19        Bear in mind, please, when we come to epoch number
  20     4 that the nature of the operations performed may not
  21     have been of the same type of case, say, mixture as in
  22     the preceding two or three epochs. The data have also
  23     been grouped by age. The ages chosen for the purposes
  24     of analyses are nought to 90 days, 90 days to 365 days,
  25     and 1 year to 15 years.
0019
   1        Again, you come back to points of difference
   2     between the data sets. The Cardiothoracic Register, for
   3     instance, has no cut off in age. There is no way of
   4     knowing simply from the figures presented whether it is
   5     looking at any particular number of children over
   6     the age of 16.
   7        There are therefore in total four epochs, three
   8     age groups, that itself will give you 12 comparisons,
   9     and we have 13 procedure groupings plus the open/closed
  10     difference.
  11        You will hear that there was no significant
  12     difference between Bristol and children elsewhere in
  13     England, and indeed between Bristol as a centre and
  14     centres elsewhere in England so far as closed procedures
  15     are concerned.
  16        So far as open procedures are concerned, you will
  17     hear a considerable amount of evidence. The overall
  18     picture is perhaps best summarised by showing you my
  19     second slide taken from the reports, INQ 15-4. Can we
  20     scroll up a bit, thank you. This comes from David
  21     Spiegelhalter's paper. It shows that over the period
  22     1988 to March 1995 there appears to be approximately two
  23     times the rate of death in open operations across all
  24     age groups combined. You will see immediately that that
  25     is expressed in the third column as excess deaths. You
0020
   1     will notice, to emphasise the point I have made that
   2     this is simply a way of demonstrating a difference, that
   3     the excess deaths recorded have decimal points to them.
   4        You will see from Dr Aylin's evidence, when he
   5     gives it, that the mortality is highest in the youngest
   6     age group, nought to 90 days. Bear in mind that, since
   7     there is also evidence that Bristol operated on a lower
   8     percentage of young babies than other centres did and on
   9     a greater proportion of Downs Syndrome children than
  10     other centres did, these factors may mean that there has
  11     to be further examination of the difference. It would
  12     not be right without further careful consideration to
  13     assume that the fact of difference implied a reason for
  14     the difference.
  15        Dr Aylin will present a comparison of the apparent
  16     mortality rate of Bristol compared to the 11 other
  17     principal centres in England. Bristol is significantly
  18     different. It is an outlier compared with all other
  19     centres, save one which is an outlier in the 1 to 15
  20     year age group. That other centre is known as centre
  21     number 10 in the analyses you will see.
  22        When our statistical analysts did the work, they
  23     did not know which centre it was. However, this Inquiry
  24     has made a commitment to openness. Since it may well be
  25     there are problems and difficulties with the figures,
0021
   1     even beyond those that have so far been identified, and
   2     they need to be stressed, it is important that anyone in
   3     a position to contribute should know the identity of all
   4     the other centres referred to by number in the figures
   5     and diagrams that you will see.
   6        Accordingly let me give you alphabetically
   7     the centres and their identifying numbers. Birmingham
   8     Children's Hospital will be known in the figures that
   9     you will see as centre 11; Freeman Hospital, which is
  10     now part of the Newcastle upon Tyne hospital's NHS Trust
  11     as centre number 9; Great Ormond Street Hospital, which
  12     is now part of the Great Ormond Street Hospital for
  13     children NHS Trust, as centre number 8; Guys Hospital,
  14     now part of the Guys and St Thomas Hospital NHS Trust,
  15     number 5. I should emphasise that these numbers are
  16     arbitrary numbers given by the Inquiry to these
  17     centres. They do not imply any form of ranking
  18     whatsoever. It is not a question of number 1 being
  19     best, number 11 being worst or anything of that sort.
  20        Harefield Hospital, now part of the Royal Brompton
  21     and Harefield NHS Trust, is centre number 10;
  22     Killingbeck Hospital, Leeds, part of the Leeds Teaching
  23     Hospital's NHS Trust, is centre number 3; the Royal
  24     Brompton SHA, special health authority, now part of
  25     the Royal Brompton and Harefield NHS Trust, is centre
0022
   1     number 12; the Royal Liverpool Children's Hospital, part
   2     of the Alderhey Children's Hospital and Royal Liverpool
   3     Children's NHS Trust, as centre number 6; Southampton
   4     University Trust, Southampton University Hospital's NHS
   5     Trust, centre number 7; Glenfield Hospital, Leicester,
   6     not a designated centre, but it did a significant number
   7     of operations, now part of the Glenfield Hospital NHS
   8     Trust, centre number 2; the Radcliffe Infirmary at
   9     Oxford, the Oxford Radcliffe Hospital NHS Trust, centre
  10     number 4; the United Bristol Healthcare NHS Trust,
  11     Bristol Royal Infirmary, as you might expect in this
  12     Inquiry, centre number 1.
  13        You will see that centre number 10 is now part of
  14     the Royal Brompton and Harefield NHS Trust. It was
  15     Harefield Hospital. A word of caution: it would be
  16     unduly alarmist to conclude that Harefield was a poor
  17     hospital or to suggest that this Inquiry had uncovered
  18     something that had not been appreciated to some extent
  19     before.
  20        Moreover, in drawing comparisons between Bristol
  21     and centre 10, it is known that centre 10 performed a
  22     much higher proportion of its procedures on children
  23     over one year of age, a much higher proportion of its
  24     work was outside the 13 procedure groups that I have
  25     mentioned and fell into the other category, and both of
0023
   1     those features are potentially associated with higher
   2     risks and hence with poorer outcomes.
   3        This makes a point which needs to be emphasised
   4     throughout. The purpose of statistics at their best is
   5     to compare like with like. The statistical analysts
   6     have in the case of Bristol done what they can to deal
   7     with the question of case mix. There is not much
   8     information to be given. Because the subject of our
   9     Inquiry is not centrally centre 10 or any other centre,
  10     it would be wrong to assume without Inquiry that
  11     the case mix was the same.
  12        Let me give an example. If a particular hospital
  13     had -- this is purely a hypothetical example, I hasten
  14     to had -- a policy of treating any case, however poor
  15     the outcome was likely to be, even though it might be as
  16     low as 1 per cent, the mortality figures produced by
  17     that hospital would be very different from the other
  18     hospitals faced with the same type of patient which
  19     declined to operate, perhaps on perfectly good clinical
  20     grounds, upon the same group of patients.
  21        The case mix of the patients would be different.
  22     The case mix would in such an example be an obvious
  23     possible explanation for any statistical difference
  24     the figures showed.
  25        I do not suggest that this is essentially
0024
   1     the difference between Harefield and Bristol. I am not
   2     in a position to make a suggestion one way or
   3     the other. But it must be borne in mind that there may
   4     well prove to be a difference which is of the greatest
   5     significance in interpreting these figures.
   6        I should emphasise that the statistical analysts
   7     have not been able to visit the other cardiac units to
   8     assess their primary data quality. Further work to
   9     assess that quality would be required before one could
  10     assess the weight which could be placed upon it.
  11        Moreover, as you will come to see from
  12     the figures, centre 10 is not the only centre which has
  13     some outlying results. Accordingly, the figures should
  14     be treated with some caution. Anyone listening must be
  15     remember that the figures at Bristol have been examined
  16     across a range of data sets and in much greater detail
  17     than has been possible with other centres. That said,
  18     if there is to be any explanation as to the reasons why
  19     Harefield appears to be an outlier as well as Bristol,
  20     that is a matter for Harefield to explain.
  21        The Department of Health knows today this Inquiry
  22     will reveal the performance of the various centres
  23     relative to each other. It is likely to be a matter of
  24     interest in the locality in which each centre is placed
  25     as to how that centre has performed. I would hope that
0025
   1     any comment which is made, perhaps in the press, perhaps
   2     by others who read this on the Internet, must be subject
   3     to the caveats which I have already entered and which
   4     will be entered in the course of the evidence today both
   5     orally and in the written papers submitted.
   6        The Royal Brompton and Harefield Trust has been
   7     told of the figures, as have the other centres.
   8     I should emphasise as one would hope to be the case that
   9     it is open and forthcoming about its figures.
  10        With that introduction, which I hope is not so
  11     obvious as to be simplistic, I shall call the evidence
  12     before you to put further detail on that which I have
  13     given to you in outline.
  14        Sir, the way in which we propose to arrange
  15     matters is that we shall begin with Professor Campbell,
  16     if he would come up to the desk, and for the purpose of
  17     enabling discussion if necessary between all
  18     statisticians, we shall arrange two chairs at the front
  19     and he will be joined by Professor Evans, whose
  20     presentation will be next.
  21        If the expert table to my right could then be
  22     filled, please, by Dr Aylin, Dr Spiegelhalter and
  23     Professor Murray. Sir, it would be convenient I think
  24     to swear each in turn as they come to give their
  25     presentation rather than all now. If therefore we may
0026
   1     begin, please, with Professor Campbell.
   2   THE CHAIRMAN: I think that is right. Thank you for that.
   3     I was just making sure everyone was comfortable and able
   4     to see. Thank you.
   5        PROFESSOR MICHAEL JOSEPH CAMPBELL (SWORN):
   6            Examined by MR LANGSTAFF:
   7   Q. Professor Campbell, your full name, please?
   8   A. Michael Joseph Campbell.
   9   Q. Can you tell us a bit about yourself and why you are an
  10     expert, briefly?
  11   A. I am Professor of Medical Statistics in the University
  12     of Sheffield. Previously I used to work in
  13     the University of Southampton. I have no idea why I am
  14     an expert, except I have written two books. One is
  15     called "Medical Statistics - a Commonsense Approach" and
  16     one is called "Statistics from Square One".
  17   Q. What I am going to ask you to do, you have read through
  18     the statistical evidence which is later to be presented
  19     to us, I think, and you have formed a view of those
  20     matters which may need to be explained first of all in
  21     greater detail to what one might call the wider audience
  22     and any explanation that may be needed for a more
  23     scientific and technical audience. You are in
  24     a position to do that for us, are you?
  25   A. I am.
0027
   1   Q. What role, as you see it, does statistics play in this
   2     Inquiry?
   3   A. I think statistics has a number of roles. I think
   4     the first and probably most important one is to
   5     ascertain the data validity and to make sure that
   6     the data that we are actually examining is
   7     representative of what really actually happened in
   8     the various centres.
   9        Data validity and design of studies is central in
  10     statistics to make sure that we are actually basing our
  11     conclusions on solid evidence. Having collected
  12     the data, the next question that statisticians have to
  13     ask is how these differences could have arisen and
  14     whether the differences are purely due to chance or
  15     whether there is some inherent difference between
  16     different centres that cannot be attributed to chance,
  17     and that is where the statistical analysis comes in.
  18        Finally, I think quite importantly, statistics
  19     need to be able to present these results in a way that
  20     is easily understood by the general public. I think
  21     data presentation is also very important.
  22   Q. So collection, analysis and presentation?
  23   A. Correct.
  24   Q. So far as the analysis is concerned, to what extent in
  25     that analysis does the statistician try to demonstrate
0028
   1     whether the results are beyond what one would expect as
   2     a matter of chance or not?
   3   A. Well, statisticians generally try and set up a null
   4     hypothesis which they then try and ascertain whether
   5     the data are congruent with this particular hypothesis.
   6     The example that you gave earlier on tossing a coin, for
   7     example, you toss the coin a number of times. Your
   8     null hypothesis might be that the coin is unbiased, and
   9     this would be your normal null hypothesis, that
  10     the probability of a head is a half. Then you collect
  11     your data and then you decide, "I assume that the coin
  12     is unbiased. What is the probability of getting
  13     the observed number of heads that I have seen?" This is
  14     where assessing whether the probability is due to chance
  15     or not.
  16   Q. By unbiased we mean that the coin is unweighted or it is
  17     not a double headed coin; there is nothing odd about it?
  18   A. Correct.
  19   Q. The word "bias" is something we may come across. Would
  20     you like to say a couple of words about how
  21     statisticians read the word?
  22   A. There are a large number of ways that you can interpret
  23     bias. I think one of the most important ones in
  24     the Bristol Health Inquiry is whether the way that
  25     the data were presented in Bristol is different from
0029
   1     the way it was presented or collected in other areas.
   2     So, for example, were Bristol more or less scrupulous in
   3     reporting their mortality data and were they more or
   4     less scrupulous in collecting all the babies that came
   5     to the hospital? I think that is where you can get
   6     bias, is where you have differential reporting of
   7     results from different centres.
   8   Q. Suppose, for example, that Bristol were scrupulous about
   9     collecting data on deaths and other centres were not.
  10     What effect would that have upon the apparent
  11     performance of Bristol compared to these other centres?
  12   A. Clearly if Bristol was scrupulous and reported all their
  13     deaths and the other centres did not report all their
  14     deaths, then in fact it would make Bristol appear worse
  15     than the other centres.
  16   Q. And is there, as it happens, some suggestion in what you
  17     have seen of the statistics that Bristol were
  18     scrupulous?
  19   A. There is some suggestion, but it seems difficult to
  20     understand how -- very difficult to understand how there
  21     could be loss from the other centres to explain
  22     the differences that we have actually observed.
  23   Q. We will come back to that in a little while, if we may.
  24     You mentioned that the statistician begins by taking an
  25     null hypothesis, and you demonstrated that in terms of
0030
   1     the coin and say, "What we want to assume is that
   2     the coin is a perfectly ordinary coin with heads on one
   3     side and tails on the other." What conclusion would you
   4     be likely to draw if, let us say, you tossed the coin
   5     100 times and on 98 occasions out of the 100 it came
   6     down heads?
   7   A. You could work out the probability of observing 98 heads
   8     or 99 heads or 100 heads assuming that the probability
   9     of a head was still a half, and you would find that
  10     this probability was actually very small and therefore
  11     you might conclude that the coin was in fact not an even
  12     balanced coin, that in fact it was biased in some way.
  13        So you make up the hypothesis, which is that
  14     the coin is unbiased, and then you look at the data and
  15     then you reject the hypothesis. That is essentially
  16     the statistical method which is very comparable to
  17     the scientific method where you make predictions from a
  18     theory, you then observe the data and then, if your data
  19     and the theory do not correspond, you reject your
  20     theory.
  21   Q. You have a slide for us. I think it is INQ 17-1.
  22   A. I wanted to emphasise here, and in fact since having
  23     discussions with Dr Spiegelhalter I think perhaps
  24     the emphasis is not necessarily warranted too much, in
  25     the different types of statistical inference there are
0031
   1     two basic approaches to statistical inference which are
   2     called the Bayesian approach and the frequentist
   3     approach. I put Thomas Bayes here to show that in fact
   4     Bayesian methods have a long and well-established
   5     tradition and that Thomas Bayes published his results
   6     posthumously and it has been said that it is a shame a
   7     lot of other statisticians did not follow his example.
   8        The important difference is that frequentists tend
   9     to think in terms of repeatedly running a study and
  10     the long-term proportion of successes. So this is
  11     the way you tend to think about probability. You tend
  12     to think of the probability of a boy being born, you
  13     observe a large number of successes or failures or
  14     whatever, and you end up with say 52 boys out of 100.
  15     This means that the probability of a boy being born is
  16     52 out of 100 or 0.52.
  17        What a frequentist thinks after an investigation
  18     is exactly what I said about the coin tossing. They
  19     ask, "What is the probability of getting these data if
  20     my model, if my hypothesis, is correct?"
  21        Bayesians tend to give probabilities of things
  22     that are not directly observable and that they can
  23     attach probabilities to hypotheses such as the true
  24     underlying mortality rate being 10 per cent.
  25     Essentially they have what are called prior probability
0032
   1     before the experiment and posterior probabilities after
   2     the experiment.
   3        They tend to attach probabilities like the
   4     probability that it is going to rain tomorrow is 20
   5     per cent. You attach a probability to that. But you
   6     cannot imagine running tomorrow 100 times to see if it
   7     rains ten times.
   8        They are able to give -- after an investigation
   9     they say, "Given what I believed about my model before
  10     the investigation and given the data I have collected,
  11     what is the probability that my model is now correct?"
  12     This is a very common way of arguing in medicine as well
  13     where a patient comes in to a surgery and the doctor
  14     will assess their possibility of having a certain
  15     disease. They will then question the patient and elicit
  16     certain symptoms and then they will modify their
  17     probability that the patient has a particular disease.
  18     If their probability is sufficiently high, they might
  19     refer them on for investigation.
  20   Q. In the surgeon's case or the GP's case, he may have what
  21     is called a differential diagnosis, have a number of
  22     ideas in his mind and begin to discard them as he gets
  23     more evidence?
  24   A. Yes, correct. He will discard them depending on
  25     the probabilities that he has -- the symptoms he has
0033
   1     elicited from the patient.
   2   Q. You have used words on your slide "prior probabilities"
   3     and "posterior probabilities". The posterior
   4     probability is what?
   5   A. The reason I put this on the slide is the posterior
   6     probability is the measure of how much you believe your
   7     model is actually correct, your hypothesis is actually
   8     true, given that you have collected some data to either
   9     refute or accept this particular hypothesis.
  10   Q. The statisticians in their reports to us have used
  11     a Bayesian approach, not a frequentist approach. Does
  12     it, do you think, make a difference? If so, to what
  13     extent?
  14   A. I have had long, long discussions, especially with
  15     Dr Spiegelhalter on this particular issue. Although
  16     the method is ostensibly Bayesian, in fact you can
  17     demonstrate that in fact it is largely a frequentist
  18     approach.
  19   Q. Best of both worlds.
  20   A. In fact if we go on to the next overhead.
  21   Q. Number 2, please.
  22   A. I said we have two hypotheses, H0, which is that Bristol
  23     is no different from the other centres, and H1 that
  24     Bristol is quantitatively different from the other
  25     centres. The data that was collected is the mortality
0034
   1     from 12 centres.
   2        So what the frequentist can answer is, "What is
   3     the probability of getting our data" -- and
   4     unfortunately I have to put in brackets (or data more
   5     extreme from H0), which means further away from the null
   6     hypothesis -- "assuming that Bristol is in fact no
   7     different from the other centres?"
   8        In the tables that are given in the evidence this
   9     is the probability -- that big P means
  10     the possibility -- of the data given the
  11     null hypothesis, or it is called the P value. In some
  12     of the earlier tables you will find mentions of P
  13     values. That is what the P value is telling us.
  14        What the Bayesian inference can answer is, "What
  15     is the probability that Bristol is different from
  16     the other centres given the data?" That is
  17     the probability of H1, which is the hypothesis that
  18     Bristol is different, given the data. That is also
  19     given in the tables as well, of the probability attached
  20     to the excess deaths or the deaths greater than
  21     expected.
  22        In fact it turns out, because the way the prior
  23     distributions were chosen and also because of the way
  24     that the hypothesis was set up, which was that we take
  25     Bristol out of the equation and we then try to predict
0035
   1     from the other centres what we should have expected in
   2     Bristol, which is the null hypothesis, we can in fact
   3     interpret the probabilities given in the Bayesian method
   4     in a very similar way to the probabilities we get under
   5     the frequentist approach, which I think is a very nice
   6     synthesis.
   7   Q. So essentially which ever approach had been taken is
   8     likely to have produced pretty much the same results,
   9     given the data?
  10   A. That is my opinion, yes.
  11   Q. You mention in the middle of that slide the P value.
  12     I wonder if you would like to say some words by way of
  13     explanation as to what a P value is and what, if any,
  14     comfort or the opposite we may take from it?
  15   A. If we take your example before of tossing a coin 100
  16     times and observing 98 heads, we can work out either by
  17     computer simulation or by mathematics the probability of
  18     observing this particular run of events if we assume
  19     the coin is unbiased. This probability is the P value.
  20     It is convention to reject the null hypothesis, which is
  21     the coin is unbiased, if the P value is sufficiently
  22     small. The usual accepted level for rejection is 5
  23     per cent. So we say that, if there is only a 1 in 20
  24     chance of getting our observed data or, more extreme, if
  25     the null hypothesis is true, then we are going to reject
0036
   1     the null hypothesis.
   2   Q. Would it be the case that if the coin had been tossed 90
   3     to 100 times and come down heads on 94 occasions, not
   4     95, that you could not safely assume statistically that
   5     the coin was biased?
   6   A. It could be that if you tossed the coin 94 times your P
   7     value might be 0.06 -- sorry, if you tossed it 100 times
   8     and it came up heads 94 times, your P value would be
   9     0.06. If you tossed it 100 times and it came up 95
  10     times, your P value would be 0.04. Therefore you could
  11     reject it on the latter case but not on the former.
  12   Q. If we think in terms of tossing a coin, this is the
  13     example we have used, if anything has or is given
  14     statistical significance, does it mean that as a matter
  15     of numbers and mathematical calculation it is
  16     the equivalent of, as it were, tossing the coin and
  17     finding that on less than 5 occasions out of 100 it came
  18     down one way as opposed to the other?
  19   A. Sorry, could you repeat that?
  20   Q. Yes, I was trying to make the example simple. It is my
  21     fault. When we see in the course of the analyses
  22     statistical significance or a P value of less than 0.05
  23     ascribed to a particular data set, does that mean that
  24     you can be as confident that that is not due to chance
  25     as you would be in the case of a coin which had been
0037
   1     tossed and on 95 occasions out of 100 it came down on
   2     one side only?
   3   A. You would have to -- I do not know what the actual
   4     probability of getting a 95 -- perhaps one of my learned
   5     colleagues here could tell me. Very, very small. But
   6     essentially you interpret the probability of -- if you
   7     have a P value of less than 5 per cent, it would mean
   8     that if you were to run the whole scenario 100 times you
   9     would only expect to see this particular event 5 times
  10     out of 100 or less.
  11   Q. Thank you. You heard what I said about excess deaths in
  12     the course of the introductory remarks that I have
  13     made. Broadly, and please do not be frighten to offend
  14     me, was I right in the approach that I took to it?
  15   A. Yes, I believe you were. One important thing to realise
  16     is that statistically speaking excess deaths could be
  17     negative as well as positive. It just means that they
  18     are different from predicted.
  19   Q. So by "excess" we should really read "different number
  20     of" rather than "excessive" in the ordinary colloquial
  21     sense?
  22   A. Yes.
  23   Q. Through the statistics which we will see, then we may
  24     have talk of P values and we have talk of excess
  25     deaths. So far as P values are concerned, there is some
0038
   1     reference that we will see to 95 per cent intervals.
   2     Could you give us an explanation of that?
   3   A. Yes. The authors have been quite careful to refer to 95
   4     per cent intervals. The frequentist approach is to call
   5     things 95 per cent confidence intervals. Essentially
   6     that enables us to -- if we have a 95 per cent
   7     confidence interval, it says -- let us go back to
   8     the coin tossing experiment again, since it seems to be
   9     generally understood.
  10        We observe a certain number of heads, say 80
  11     per cent, 80 out of 100. We can calculate about that 80
  12     per cent an interval. If we actually ran this
  13     experiment 100 times we could each time we ran
  14     the experiment calculate this interval, then 95 per cent
  15     of the intervals that we calculate would include
  16     the true probability of the coin coming up. So if
  17     the coin was actually unbiased, then we would expect
  18     that 95 per cent of these intervals would include 50
  19     per cent or P as 0.5 as the null hypothesis. That is
  20     a 95 per cent confidence interval.
  21        When you go on to 95 per cent interval, then this
  22     is a slightly different approach, but it is essentially
  23     the same. But it means that we have 95 per cent
  24     confidence that the true value, the value underlying
  25     the hypothesis, is somewhere within that interval;
0039
   1     although it is important to realise that most of
   2     the emphasis will lie in the centre of the interval and
   3     not at the extremes.
   4   Q. Excess deaths was the other part that I was going to ask
   5     you about. Is there any sense in which statistically
   6     one can identify which deaths are excess?
   7   A. No, no. Essentially, as I was saying before about
   8     the frequentist approach to probability, out of 100
   9     people you could say that 10 per cent are likely to die,
  10     but there is no way of identifying which particular
  11     10 per cent. It is a bit like the lottery. We cannot
  12     identify in advance who is going to win the lottery.
  13   Q. I said in my introductory remarks that if you take
  14     a comparison amongst 12 centres, given any particular
  15     performance table, one is bound to be best and the other
  16     is bound to be worst. That as a matter of logic must be
  17     right. How do statistics help in describing
  18     the difference so that one may be satisfied either that
  19     the apparent top or bottomness of one centre is chance
  20     or that it is more than chance, probably?
  21   A. This is a relatively new area of statistics, and
  22     Dr Spiegelhalter has been fundamental in doing research
  23     on this. But essentially we can say that if you rank
  24     the centres and one of them is well below what would
  25     have been predicted from the others, then it is unlikely
0040
   1     to have happened by chance and we can put
   2     the probability on it being at the bottom, even
   3     though -- so you might observe something which is at
   4     the bottom, but you might get a confidence interval
   5     which is between say 8 and 12. So it could have been
   6     that in fact in reality it was only 8 out of 12. But
   7     sometimes you can get the confidence interval is only
   8     12, in which case we can be 95 per cent sure that if we
   9     had run this thing a large number of times it would have
  10     come out at the bottom 95 per cent of the time.
  11   Q. Because the gap or the difference is so great that it is
  12     likely to be repeated on that number of occasions?
  13   A. Correct, yes.
  14   Q. Professor Campbell, I have asked you a number of
  15     questions. Is there anything that you would wish to add
  16     by way of introductory remarks, familiar as you are with
  17     the data which is going to be introduced, so that we
  18     have a proper perspective of what is going to be
  19     presented?
  20   A. I suppose I would re-emphasise the remarks that you made
  21     earlier, that the statistics and the probability are
  22     just part of a jigsaw and that one of the most important
  23     things is that we have very few explanatory variables
  24     because of the way the data was collected; we do not
  25     have things such as the clinical condition of
0041
   1     the babies. So we cannot provide any explanation as to
   2     why these -- the way that the data have evolved. Simply
   3     we can point to the fact that it is most unlikely to
   4     have arisen purely by chance. So I think it needs to be
   5     just part of a general picture.
   6   MR LANGSTAFF: What I am going to do now is to suggest to
   7     our chairman that we have a short break, perhaps 10 or
   8     15 minutes, and then Professor Evans will introduce
   9     the Bristol picture and the Bristol rates and present
  10     that to us.
  11   THE CHAIRMAN: Yes, Mr Langstaff. Just before we break for
  12     15 minutes, that introduction was extremely helpful
  13     I think in setting out the language, the terms of
  14     reference by reference to which we have to read
  15     the material before us. I hope it was helpful to
  16     everyone to hear what these various terms mean so that
  17     we can translate what we see. It was extremely helpful;
  18     thank you. Shall we now break and reconvene at 11.30.
  19   (11.15 am)
  20   (11.40 am)
  21   MR LANGSTAFF: Sir, Professor Evans, may he take the oath?
  22          PROFESSOR STEPHEN EVANS (SWORN):
  23            Examined by MR LANGSTAFF:
  24   Q. Professor Evans, your full names, please, and essential
  25     qualifications?
0042
   1   A. I am Stephen James Weston Evans. I have a BSc and MSc.
   2     I am a chartered statistician. I was Professor of
   3     Medical Statistics at the London Hospital Medical
   4     College, part of London University. I was there for 25
   5     years, and then I was head of epidemiology for
   6     Medicine's Control Agency. I now work locally, not far
   7     from Tonbridge Wells, and I would describe myself as
   8     a statistical epidemiologist.
   9   Q. Did you, for the purposes of this Inquiry, prepare
  10     a report which we have, the first page presently on the
  11     screen, beginning at INQ 12/1?
  12   A. I did.
  13   Q. And does the text of the report finish at INQ 12/33?
  14   A. Yes.
  15   Q. Let us look at this a minute, the very bottom of the
  16     page. Then that is followed by a number of tables and
  17     figures and an annex. The report as a whole ends, does
  18     it, at page 49, INQ 12/49?
  19   A. Yes.
  20   Q. You are prepared to present your findings, are you, to
  21     the Inquiry, and would you perhaps like to do so?
  22   A. Yes. I am happy to do that. I would like to in some
  23     senses preface my remarks, well aware of the parents,
  24     those who are concerned about this Inquiry, concerned
  25     health professionals, and say to them -- I am sure I can
0043
   1     speak on behalf of my colleagues -- we as statisticians
   2     are very aware of the personal trauma that many of you
   3     have gone through and when we talk about children and
   4     procedures and deaths and numbers, we do so in the
   5     knowledge that each one of those is someone who is
   6     important. When we talk about surgeons and centres and
   7     health professionals, we are aware that they are
   8     important.
   9        It may seem that sometimes our numbers are
  10     expressed in a way that is cold and uncaring. That to
  11     some degree is the lot of the statistician, that they
  12     will be seen in that way, but we would wish to say that
  13     we are aware of your hurts; we are aware of your current
  14     concerns, and we are doing our best to try and help this
  15     Inquiry in the terms of the purposes of the Inquiry, to
  16     come to answers that are as helpful as possible.
  17        So please do remember, when we use terms that may
  18     be distressing, I am sure that we would be happy to
  19     accept from the Panel or others, advice as to how
  20     perhaps we can ameliorate that distress, but we are
  21     conscious of it and we will do our best. But
  22     nevertheless, we are dealing in numbers and these things
  23     may appear hard. You said this yourself at the
  24     beginning, Mr Langstaff, but we wish to reiterate it.
  25        I think that we have to remember that when the
0044
   1     Inquiry was set up, there were about 2,000 sets of
   2     medical records, and each of these consisted of perhaps
   3     one folder, perhaps a number of folders, that related to
   4     the children who had received cardiac surgery between
   5     1st January 1984 and 31st December 1995. It was clearly
   6     impossible to scrutinise all of these records -- they
   7     form a very, very large volume in great detail -- using
   8     teams of medical, surgical and nursing experts, but at
   9     the same time, it was very important to be able to take
  10     into account the records of every child who had received
  11     care under the terms of the Inquiry. This again is
  12     important to the parents. All of the children have been
  13     considered and we have analysed data that relates to all
  14     of them.
  15        The Panel decided to take a sample of these
  16     records and more details, as you have heard, will be
  17     given of this tomorrow. However, if we are to take
  18     a representative sample that reflects the concerns that
  19     led to the Inquiry, it is important to have summary
  20     information available on every child. This was done
  21     using a team of people who are used to summarising
  22     medical records.
  23        One has to remember that the language in which
  24     a medical record is written may vary from time to time
  25     with the same clinician, describing the same sort of
0045
   1     conditions, and will vary between one clinician and
   2     another for the same conditions. They will write down
   3     and use words slightly differently.
   4        But when we come to try and summarise the data, we
   5     have to use a language that is common to all. It may be
   6     that this language is not the most appropriate one for
   7     ordinary conversation. The team of people who are doing
   8     the coding of the records in the hospitals use
   9     nationally and internationally agreed terms for these
  10     operations and diagnoses, and the groupings, as you have
  11     been shown, are labelled using a code number with
  12     a letter and this process is described as "coding". The
  13     people who carry out the task are called "coders". This
  14     process was described in evidence given in July.
  15        In generating a summary for each child, we entered
  16     this summary on to a computer and it is described as the
  17     "clinical coded record" database, the CCR.
  18        To check that these records did cover the relevant
  19     children and to compare the quality of the data recorded
  20     there, we have made comparison with two other sources
  21     and the first, as we have heard, is the Patient
  22     Administration System, known as PAS, a routine computer
  23     system used by the local Bristol Health Trust for
  24     administrative purposes.
  25        It is also the basis for preparing returns to be
0046
   1     submitted to the Department of Health of England,
   2     notably for the production of hospital episode
   3     statistics, and more details of this will be provided by
   4     Dr Aylin.
   5        The third source of local data is the logs of the
   6     operations done at the Bristol Royal Infirmary by
   7     Mr Wisheart and Mr Dhasmana, and they have been
   8     described by them in evidence given to the Inquiry.
   9        These logs used words that were typed or
  10     handwritten and they have also been coded by a coder
  11     from the team who coded the clinical records.
  12        Each of the sources has some basic information on
  13     the children and if we look at my report, INQ 0012/18,
  14     a summary of the information that is in common across
  15     the sources is given at the top there. This has patient
  16     name, date of operation, BRI number, whether the patient
  17     died, the surgeon, the date of death, the diagnosis, the
  18     age (derived from the date of birth that we recorded for
  19     the CCR and the PAS) and OPCS codes for operative
  20     procedures.
  21        So there is other information in each of the
  22     sources, but we have concentrated on the information
  23     that is in common across the sources because we want to
  24     carry out comparisons.
  25        So we had three purposes of bringing together
0047
   1     these sets of data, and I think my first overhead slide,
   2     which is number 50, described the purposes.
   3        If we can have that rotated and enlarged, the
   4     purposes of our looking at these sources of data was
   5     firstly, as we have said, to describe the overall care
   6     and to describe the children receiving that care in
   7     Bristol. That was an important purpose.
   8        The second one was to allow us to carefully select
   9     a sample to allow for detailed examination of the
  10     medical records in that sample, because we could not do
  11     it for all 2,000.
  12        If we are to do it for a sample, we have to select
  13     that in such a way that every child concerned has an
  14     equal chance, as every other similar child, of being
  15     included in the sample.
  16        The other thing that is important is that when we
  17     begin to make national comparisons, and Dr Aylin and
  18     Professor Murray will be making national comparisons, we
  19     need to be sure that the local data really reflect what
  20     actually is recorded nationally as far as we can. So we
  21     want to check that the Patient Administration System is
  22     a reasonable reflection of the care given and the
  23     outcome of the care.
  24        We also want to make sure that the surgeons' logs,
  25     which were the underlying source for the UK cardiac
0048
   1     surgeons' register, should also have similar results as
   2     the medical records.
   3        If we look at the processes of care, I think it is
   4     probably at this point helpful to look at Annex 1 of my
   5     report on pages 47 and 48. We will look at page 47
   6     first of all.
   7        If we look here, we see that there can be
   8     a child -- and each child must have been admitted to
   9     hospital to be included in any of the sets of data.
  10     They have to be admitted to the hospital.
  11        A child may have a single admission. Within that
  12     admission, for all practical purposes, they must also
  13     have an operation. Within one operation, when they go
  14     to the operating theatre, they may have more than one
  15     operative procedure carried out by the surgeon.
  16        These operative procedures are what are coded
  17     using the Office of Population Census and Surveys coding
  18     system. They use labels. This is the language that is
  19     used to describe the operations that the children had.
  20        If you go to the next page, we see that some
  21     children, of course, did not only have one operation, or
  22     one admission. Within an operation, they have more than
  23     one procedure. During a single admission to hospital,
  24     they may actually have more than one operation. You
  25     have here an example with two operations, and in each of
0049
   1     those two operations, if we scroll down a little
   2     further, we have two procedures.
   3        So the question is, what should we count? It is
   4     not going to be easy to count children in a way that is
   5     comparable nationally. In the hospital episode
   6     statistics, what they count is essentially an episode of
   7     care, and Dr Aylin will explain a little more about
   8     that. In terms of the cardiac surgeons register, they
   9     record operations. They do not identify individual
  10     children. So we have to make a decision as to how we
  11     are going to do our counting. It is very difficult to
  12     count only children. We could count admissions, except
  13     that, in the clinical coded notes, we do not have dates
  14     of admission and discharge. Nor do we have them in the
  15     surgeons' logs, but we do have operations, and so what
  16     we have done is, where we can count admissions, we count
  17     them; where we can only count operations, we count
  18     them. Hence, as Mr Langstaff was explaining, in
  19     a situation here, where we have two operations and in
  20     each of them we have two procedures, we do not want to
  21     count both those procedures, so we need to have a system
  22     whereby we decide which of these procedures we should
  23     count.
  24        So we can see a slightly more complex situation
  25     further down, where there are two admissions and, if we
0050
   1     scroll down, each of those can have two operations and
   2     two procedures.
   3        If we look at the language that has been used for
   4     doing the coding of these operations, as Mr Langstaff
   5     said, K codes are for the heart and the L codes are for
   6     arteries and veins, what he described as "vascular"
   7     operations. For example, I have not got an immediate
   8     example for you, but KO 1 stands for transplantation of
   9     heart and lung. Then there are a series of individual
  10     codes under that that go KO 1.1, which is
  11     allotransplantation of heart and lung; KO 1.2 is
  12     revision of transplantation of heart and lung; KO 1.8 is
  13     "other specified"; KO 1.9 is unspecified.
  14        We have these detailed codes. The language that
  15     is then used for that is not necessarily exactly the
  16     same language that a surgeon will write down. They do
  17     not write these codes down. They use their medical
  18     terminology, and somebody has to translate that medical
  19     terminology into these codes.
  20        For example, if we look at KO 4, that is the
  21     overall code for correction of tetralogy of Fallot. But
  22     KO 4.1 is correction of tetralogy of Fallot, using valve
  23     right ventricle outflow conduit, so there is a level of
  24     detail there.
  25        We will come to some of these groups of operations
0051
   1     where we have to use the greatest detail, and some where
   2     we can use a broader category.
   3        The consequence is, in doing this translation
   4     between whatever is written, on the surgeons' logs or in
   5     the medical notes, or perhaps in a discharge letter that
   6     is coded by the Patient Administration System, there
   7     will be inevitable differences in the way the
   8     translation happens. If I was speaking in French and
   9     somebody were carrying out simultaneous translation, you
  10     would realise that my use of French could be translated
  11     in more than one way in English and in some instances in
  12     French we have "tu" and "vous", both of which in English
  13     are "you", but they have subtly different meanings in
  14     French. We do not have any longer in English "thee" and
  15     "thou" that would reflect the "tu" of French. So we
  16     may find that there are things written in the record
  17     that are incapable of being translated.
  18        Having gone on at considerable length about that,
  19     we need to see that the Cardiac Surgeons' Register does
  20     not have those kind of operation codes. If we look at
  21     table 2.1 in Dr Aylin's report of his report INQ 13,
  22     page 54, which we have already seen, and there is
  23     a similar table 6 of Professor Murray's report, we will
  24     see these operation codes. I hope this has perhaps made
  25     it a little clearer. So group one is KO 4, tetralogy of
0052
   1     Fallot, and that means KO 4.1, point 2, point 3 and so
   2     on, and point 8 and point 9, and that is a broad
   3     grouping. If we look down to group 8 for truncus, that
   4     is specifically LO 1.1. So that if something was coded
   5     LO 1.8, which is an unspecified operation, it will not
   6     then get counted.
   7        So you can see the subtle differences here.
   8        This makes it amazing that there can be any
   9     comparability between the records. I expected, when
  10     I looked at these data, to find that when I started
  11     looking at the numbers, I would not get much agreement.
  12        There are also a larger group of procedures, as
  13     Mr Langstaff said, classified by whether they were open
  14     or closed.
  15        Again, if we look at INQ 13/82, again from
  16     Dr Aylin's report, here we see a list of procedures
  17     beginning at KO 2.1 which was an open procedure, and
  18     K 15.1 and K 15.2 that were closed procedures, and then
  19     on the right-hand side is K 16.1 and point 2 and so on
  20     all the way to point 8 which are excluded. It does not
  21     mean the children were not counted, we have looked at
  22     those, but in terms of classifying whether an operation
  23     was open or closed, we were unable to decide on the
  24     advice of the clinicians whether it was unequivocally
  25     open or closed. It could be either.
0053
   1        So, if we classed it as open we may make
   2     a mistake, if we class it as closed we may make
   3     a mistake. If we leave it out, we are reasonably sure
   4     we are comparing similar things.
   5        That explains, when we then move on to my next
   6     slide, essentially, which comes from my report OO12/43,
   7     in the top half of that page, we will see a figure and
   8     this shows us something of the data that we have.
   9        We see here a diagram that shows to us that in the
  10     Patient Administration System, which only ran from
  11     1st January 1988 -- it does not cover the whole period
  12     of the Inquiry -- we have nearly 2,000 children who were
  13     identified as being admitted under the care of
  14     cardiologists or cardiac surgeons, or paediatric
  15     cardiologists, or paediatric cardiac surgeons, and we
  16     have 2,000 children.
  17        We see there that they have nearly 4,000
  18     admissions. You can see that the number of operations
  19     and the number of admissions is rather similar; this
  20     number here is similar to this number here. So that
  21     when we decide to count operations or admissions, we are
  22     not making a big error. The number of operations per
  23     admission is close to 1 in the children we have.
  24        But in each operation, there are perhaps 1.5 or
  25     a little more procedures: nearly 6,000 procedures
0054
   1     received by those 2,000 children. Of course some
   2     children received only one procedure and some received
   3     a very large number indeed. There is enormous
   4     variability in that number.
   5        We when come to group these, if we group them by
   6     open and closed, it turns out that a lot of the
   7     admissions cannot be grouped by either open or closed in
   8     an unequivocal way.
   9        Most of them are open. There are a smaller number
  10     that are closed. If we look at the procedures, we can
  11     classify those by, for example, in the black here, these
  12     are non-cardiac procedures, so that a child who is
  13     admitted for cardiac surgery may have a procedure that
  14     is not cardiac.
  15        We also have a large number of procedures that are
  16     not grouped, which are nevertheless cardiac, and, for
  17     example, cardiac catheterisation, or contrast radiology
  18     of the heart, were not grouped, but yet those are
  19     recorded in the records and recorded in the Patient
  20     Administration System, and many of them will have had
  21     those, but they are not sufficiently serious procedures,
  22     and will nearly always be accompanied by other
  23     procedures, and then this rather smaller number, which
  24     makes it look as though we are only looking at a few,
  25     but we are looking at the principal procedures that
0055
   1     children who had heart surgery had. This is the pattern
   2     we see for the Patient Administration System.
   3        If we now move on down to the next page, page 44,
   4     to the top half of that, we see the same pattern for the
   5     clinically coded records. Here we cannot have
   6     admissions, we can only have operations. We see down
   7     here in procedures we have quite a reasonably large
   8     number that are non-cardiac, a considerable number that
   9     are not grouped and then still a large number that are
  10     grouped.
  11        If we move on down to the bottom of that page, we
  12     will see the same thing for the surgeons' logs, on
  13     a slightly different scale. We have got rather smaller
  14     numbers; we are talking about only something like 1,300
  15     children, with slightly more operations, but we now see
  16     that among these the classification is that the vast
  17     majority are open operations, and so they should be,
  18     because to go to the BRI meant that you were going there
  19     to have open-heart surgery; you did not need to go to
  20     the BRI if you were only having closed surgery.
  21        Here we see the procedures, the distribution of
  22     them is rather different; we have a very small number of
  23     non-cardiac procedures that are recorded. We have
  24     relatively few not grouped because the catheterisation
  25     and the contrast radiology of the heart would not have
0056
   1     been done there generally, and most of our procedures
   2     are grouped.
   3        So we end up comparing them and on the surface of
   4     it, it looks as though they are very different.
   5        The other thing we need to be aware of, and if we
   6     can go back a little to the previous page, page 43, to
   7     the bottom half, we need to think about how we have
   8     classified whether a child was alive or dead. Many of
   9     the children will have died relatively quickly after an
  10     operation, and this curve here shows us that immediately
  11     after the operation, there is a certain mortality. This
  12     shows us that all the children are alive here, but by
  13     this point (indicating) 95 per cent; 5 per cent have
  14     died quite rapidly.
  15        Then the deaths go on occurring during the first
  16     30 days. They do not stop occurring at the end of 30
  17     days; they go on happening, and you can see that the
  18     curve continues there. But we, in most of the sources
  19     of data, do not have long-term follow-up of the
  20     children. We have that in the medical notes, we have it
  21     to a degree in the Patient Administration System, and we
  22     have it in Bristol rather better than we have it
  23     elsewhere.
  24        But we have chosen to make sure that we can make
  25     comparisons between children in one centre and another.
0057
   1     We have chosen the 30-day. This is something that
   2     surgeons do not only for children but also in other
   3     operative areas, not only in paediatric cardiac surgery
   4     but elsewhere. But we need to be aware that sometimes
   5     we will be talking about children alive, meaning they
   6     had survived for 30 days. Sadly, they may have died
   7     a short time after that or in some instances they may
   8     have survived until 15 and have died at that point.
   9        So when we talk about that, we need to be aware
  10     that we have perhaps left some things out.
  11        Remember the Patient Administration System covers
  12     1st January 1988 to the end of 1995, but the coded
  13     records and the surgeons' logs cover the whole period.
  14     As Mr Langstaff said, we divided time into epochs and if
  15     we look at page 45, we see only open operations.
  16        If we go to page 39, which is table 5.1, if we go
  17     to the top, we see here this is from the surgeons'
  18     logs. We have age at which the operation took place,
  19     grouped into 0 to 90 days and so on, and 1 to 15 years
  20     and we see that the death rate is really a great deal
  21     higher, very sick children are operated on between 0 and
  22     90 days, so their death rate is much higher than older
  23     ages.
  24        So when we make our comparisons, it is important
  25     not only to make comparisons that are for similar
0058
   1     operations; they have to be for similar ages, because
   2     the death rate is varying.
   3        If we move further down to table 5.1 -- and I am
   4     coming shortly to an end, Mr Langstaff -- we see here
   5     comparisons for the three major data sources, the
   6     Patient Administration System, the clinical coded
   7     records, and the surgeons' logs.
   8        We find here that although the period of time is
   9     really rather different, for tetralogy of Fallot we have
  10     as it happens exactly the same death rate, even though
  11     the numbers differ somewhat. The Patient Administration
  12     System we would expect to be less in that it covers
  13     a shorter period of time.
  14        We look at the intra-atrial transposition
  15     operations. We have 10 per cent, 5 per cent and 1 per
  16     cent. There is a bit of a difference there, and we may
  17     need to examine that. You may recall that there is some
  18     difficulty in distinguishing between those and the
  19     arterial switches that people do not necessarily code
  20     them correctly and we see here also differences, 38 per
  21     cent, 30 and 41 per cent. If we were to average those,
  22     it turns out that we would have rather better
  23     comparability of the data.
  24        The other problem is that these are not making
  25     a comparison for the same epochs. If we can go to my
0059
   1     second overhead, which is my last one for the moment,
   2     which is, I think, page 51, if we can rotate that, we
   3     have here a similar table where we have the 11 groups
   4     only. We are just looking at the open operations
   5     because the surgeons' logs do not cover the closed ones,
   6     and we look at the Patient Administration System, the
   7     clinical coded records and the surgeons' logs, we find
   8     that the death rates are relatively similar, except in
   9     groups 2 and 3 and we find considerable agreement and
  10     down the bottom here, in terms of the totality, we have
  11     similar numbers of deaths and similar numbers of
  12     operations.
  13        The general conclusion from this is that we cannot
  14     use any of these sources of data to decide exactly what
  15     happened to individual children. It would be dangerous
  16     to go to one of those sources and say, "That gives us
  17     the truth about that".
  18        The medical records should give us a full idea,
  19     but do not forget, we have had to translate from a pile
  20     that may be one foot high into one or two sheets of
  21     information using our translation process to get the
  22     codes. That process is inevitably subject to error.
  23        When we look at it, the comparability of these
  24     different sources -- I think we can probably finish with
  25     the overheads now -- is really remarkably good. We will
0060
   1     see that later on. If we return to our purposes and we
   2     perhaps -- I am sorry, we could perhaps go to my
   3     conclusions which are on page 32, I can read out
   4     a little bit from paragraph 6.3 there in the middle:
   5        "The exercise of coding the medical records was
   6     a considerable task, and although the individual coding
   7     has been of high quality, there have been many minor
   8     problems with the data.
   9        "The Patient Administration System does seem to
  10     provide an adequate method for an overview of the amount
  11     of care and the mortality."
  12        It is very difficult to cross-check these. One of
  13     the things that came across to me was the number of
  14     changes of names of children. The family tragedies that
  15     are involved in that are really very considerable, and
  16     I think that one has to be aware of that. We are
  17     talking about children and families here that are very
  18     affected, but nevertheless, when we look at the whole
  19     thing, the coding and entry and so on has not been
  20     perfect.
  21        From a clinical perspective, as I have written at
  22     the bottom of the page there, each child is unique and
  23     not simply to be pigeon-holed, but we have to make
  24     comparisons, either over time or between surgeons in
  25     a unit, or between units, and if we are to do that, we
0061
   1     have to put things into categories. We cannot make
   2     progress without it and there will be imperfection. The
   3     great majority of the children who received care did not
   4     die, but there is also very strong evidence, to me, that
   5     these different sources of data suggest that none of
   6     them is perfect and none of them has a major problem in
   7     the way that it has described the deaths in the care.
   8     I am sorry to go on rather.
   9   THE CHAIRMAN: Thank you very much, Professor Evans.
  10   MR LANGSTAFF: Professor Evans, to what extent do the
  11     figures produced from one data source, in your view,
  12     support and give strength to the figures from another
  13     data source?
  14   A. I think the first thing to remember is that if they did
  15     not, then the exercise of national comparison would be
  16     a waste of time. So it is a necessary condition, but
  17     not a sufficient condition that the comparison is valid.
  18        They do complement one another. In looking at the
  19     details those numbers have errors in them. There are
  20     errors when people have typed them into the computers.
  21     There are errors in programming, doing the grouping,
  22     that sort of thing, and we still have to try and get
  23     that better. They are not perfect, even though they
  24     have been put in that report. But as I have looked at
  25     it more and more, the more detail I have, the more they
0062
   1     agree with one another and that is encouraging. If they
   2     did not, we would be wasting our time doing an analysis
   3     of the data, but it has not said that we should believe
   4     the numbers perfectly.
   5   Q. An alternative way of looking at it might be that the
   6     data that we had simply was not good enough and had not
   7     been collected in a way that was good enough to enable
   8     any valid comparison to be made?
   9   A. I certainly approached the data with that as
  10     a possibility, but I do not hold that view any longer.
  11     I am amazed at how consistent they are and this will
  12     come in Professor Murray's and Dr Spiegelhalter's view
  13     of these things. The consistency is very much greater
  14     than I would have expected.
  15   Q. Can I ask you about specific tables? If we go to
  16      INQ 12/35, table 4.3, what we are looking at there is
  17     from the PAS system so it only covers 3 epochs, because
  18     the Patient Administration System as you point out gives
  19     us data only from January 1988 onwards.
  20        If one compares the death rate from 1988 to 1995,
  21     13 per cent and 11 per cent, with the death rate from
  22     April 1995 to December 1995, 2 per cent, there is a very
  23     obvious drop. First of all, is that a drop in rate
  24     which is consistent across the data sources?
  25   A. Yes.
0063
   1   Q. Secondly, in looking at the material before you, did you
   2     notice any difference in the mix or nature of the
   3     operations being performed during that last period
   4     compared to the two earlier epochs?
   5   A. Yes. There were certainly very many fewer operations
   6     overall. There are obviously only 132 in total in the
   7     PAS for that period. We are only talking about a short
   8     period. The earlier epochs cover several years and this
   9     only covers part of the year, so the overall numbers are
  10     smaller, and some of the high risk operations almost
  11     disappeared in 1995; they were not there. So we are not
  12     comparing like with like across those epochs, the last
  13     epochs.
  14   Q. Does it follow that it would not be a safe conclusion to
  15     say, "April 1995 the surgeon was changed, and look what
  16     a difference it made to the death rate"?
  17   A. I think that one ought to examine the question of
  18     whether that is so, but I think that those figures on
  19     their own should certainly not be used to draw that
  20     conclusion.
  21   Q. If you would turn, please, to INQ 12/41, this is
  22     a comparative table which you have derived from the
  23     surgeons' logs which compares the rate attributable to
  24     operations which were conducted by Mr Dhasmana and
  25     Mr Wisheart.
0064
   1        The first question needs to be asked: is there any
   2     statistical basis for ascribing any particular success
   3     in the sense of survival, or failure if in the sense of
   4     death, to the surgeon as opposed to the whole process
   5     involving the whole of the surgical team?
   6   A. I do not think that I have data to answer that
   7     question. If we look at the overall rates, they are
   8     similar for the two. There are individual operations
   9     where there is a difference, for example, with truncus
  10     arteriosus, just over halfway down the table, but these
  11     are based on very small numbers.
  12        The consequence is that you cannot be certain of
  13     the differences. They are compatible with differences,
  14     but overall, there is no evidence for a systematically
  15     higher rate with one surgeon than another.
  16        That would be compatible with some system failure,
  17     if Bristol were shown to be different to other centres,
  18     but it does not mean it is system failure and you cannot
  19     use these data alone to draw that conclusion.
  20   Q. Can I take you to INQ 12/38? The last table we looked
  21     at, as indeed this table, comes as the "SL" suggests
  22     from the surgeons' logs. Am I right in thinking from
  23     your earlier description that these descriptions are not
  24     lifted straight, as it were, from the wording in the
  25     surgeons' logs, but go through a process whereby the
0065
   1     information in the logs has been coded and the same
   2     coding process applied to that as to other data?
   3   A. Yes.
   4   Q. So the surgeon himself may -- and may with force -- say
   5     "That is not how I would have described this particular
   6     operation"?
   7   A. Yes. I think that that is so. What we have here in
   8     this table, of course, though, is totals, and perhaps
   9     the interesting thing is that during the first three
  10     epochs there is no evidence of any particular trend, and
  11     while in the fourth epoch it appears to be much lower,
  12     we are based on very small numbers and there is some
  13     suggestion that the type of operations changed in that
  14     period.
  15   Q. You are talking of table 4.10 there, I think?
  16   A. I am sorry, yes.
  17   Q. Can we scroll down so we see what you are looking at
  18     there?
  19   A. I am saying that in those three periods, the 12, the
  20     15 per cent and the 13 per cent, they are from
  21     a statistical point of view rather similar.
  22   Q. Can I ask you to go back up the top of the page, which
  23     deals with individual operations? What I was asking
  24     you, and I think you were accepting, is that the
  25     description which an individual surgeon might give to
0066
   1     whether a particular operation should be classed as one
   2     or other of these groups, that may very well be said by
   3     a surgeon, might it, with force, because the nature of
   4     your data is to put it through a coding process where
   5     coders look at the data and make of it as best they can
   6     from their particular perspective?
   7   A. Yes. I think that one of the things is that in the
   8     surgeons' logs there we see a very different death rate
   9     for inter-atrial transposition of the great arteries,
  10     the second line of that table, from that for other
  11     transposition which is the arterial switches.
  12        To me, that demonstrates that the coding of the
  13     switches within the surgeons' logs was very much better
  14     than it was in, for example, the Cardiac Surgeons'
  15     Register, where the difference between the second and
  16     the third group, there was some confusion between them.
  17        My view is that this is reasonably reliable. It
  18     has not been done for the surgeons' logs, but for the
  19     clinically coded records, we actually carried out
  20     a re-sample and a re-look at the data, and found
  21     considerable consistency.
  22        So while a surgeon may say, for any one case this
  23     is not reliable, the overall pattern, I think, would
  24     have to be said to be reliable.
  25   THE CHAIRMAN: Mr Langstaff, I for one did not quite
0067
   1     understand that last answer. I wonder whether Professor
   2     Evans could go through it again? I am looking at "that
   3     demonstrates that the coding of the switches within the
   4     surgeons' logs was very much better than it was, even
   5     for example if the Cardiac Surgeons' Register, where the
   6     difference between the second and the third group, there
   7     was some confusion between them."
   8        That is what you said.
   9        That induces in me a degree of confusion also.
  10   A. Yes, I am sorry, because I have read the report on the
  11     Cardiac Surgeons' Register that has not yet been given
  12     in evidence, and I think it will become clearer when
  13     that does, but what was said by Mr Langstaff in his
  14     opening remarks was that the coding of the switches
  15     between the Mustard and Senning and the arterial
  16     switches, sometimes got confused. The consequence of
  17     a confusion of that kind would lead to the death rate in
  18     each of the groups ending up looking similar. Whereas
  19     in fact, the death rate in the inter-atrial
  20     transposition is very low, and the death rate in the
  21     arterial switches is rather high, and that is shown by
  22     the coding of the surgeons' logs, implying that that has
  23     been done relatively reliably, rather than muddling
  24     between the two.
  25   THE CHAIRMAN: That is very helpful, I am grateful.
0068
   1   MR LANGSTAFF: You say in fact, it is as we see in the
   2     surgeons' log. How can you say that when all you have
   3     to go on is figures?
   4   A. I think that the point is that if the death rates in the
   5     two had both been 8 per cent, which you might see if
   6     there was a lot of error, if you are randomly putting
   7     down one of those codes and assigning deaths to it, we
   8     would then see that the difference in the risk of those
   9     two different operations would be blurred and they would
  10     move together.
  11        Just from the figures, I can see a big separation
  12     in those, and in those it looks that the coding of those
  13     particular operations was done reasonably in the
  14     surgeons' logs and the surgeons' logs, in their words,
  15     also, were clear. Your problem of translation can be
  16     caused by failure to describe clearly, but it looks as
  17     though the combination of the translation, the coding
  18     and the original statement in the surgeons' logs, was
  19     sufficiently clear for distinctions to be made between
  20     them.
  21   Q. You, from your perspective, are reporting upon death
  22     rates here at Bristol shown by the process that you have
  23     gone through by a comparison of three different sets of
  24     data.
  25        Am I right in thinking that you, in your report,
0069
   1     do not make any comparison with national rates and so we
   2     have no way of knowing from your data alone how these
   3     particular rates may compare or do compare with
   4     elsewhere?
   5   A. No, you are absolutely correct: no national comparison
   6     was made.
   7   Q. The next area which I want to explore with you is
   8     whether you, as a statistician, had any input into the
   9     90 day cut-off, the grouping of the age from 0 to 90
  10     days, and then from 91 to 365?
  11   A. I had a little input into it, in that, within our data,
  12     we could group it by individual day for each of these
  13     sources, because we had exact dates of death and we had
  14     exact dates of operation. So we could look at that.
  15        Certainly my looking at the data was that in the
  16     first month, the first 30 days, which is traditionally
  17     regarded by many as being the highest risk time, this
  18     was not substantially different from the risks in the
  19     first 90 days, in the data that I looked at.
  20        So it seemed more sensible to compare that.
  21        Outside paediatric cardiac surgery, death rates
  22     fall very dramatically after the first week of life. So
  23     there is a tendency to want to group things in the first
  24     week, or at least the first month of life and not the
  25     rest. In paediatric cardiac surgery, in looking at the
0070
   1     data -- and I think perhaps others may be able to
   2     comment on that -- there is a pattern that suggests that
   3     the death rate in operations that are during the first
   4     90 days are relatively similar to one another, rather
   5     than just in the first 30 days.
   6   Q. I do not know whether there is a comment from my right,
   7     and our panel of experts, as to the degree of
   8     justification there is for having a 0 to 90 degree
   9     category, which plainly on the figures that you produce,
  10     shows a much higher death rate, and therefore, perhaps,
  11     may have a tendency to mislead by skewing, given what
  12     you say about early deaths being inevitably more likely
  13     than later ones.
  14        Do we have a comment that any of you would wish to
  15     make?
  16   THE CHAIRMAN: I think the way you ended that sentence
  17     slightly itself skewed our understanding, Mr Langstaff,
  18     with respect. Professor Evans, would you comment on the
  19     question?
  20   A. Can I just comment that the alternatives were to group
  21     0 to 30 days and 30 days to 1 year. Had we done that,
  22     we would have found a rate in the 0 to 30 days that was
  23     high, and perhaps slightly higher than the 0 to 90 days,
  24     but only slightly higher. If we had the 30 days to
  25     1 year, that would muddle a group who had a high rate
0071
   1     from 31 days to 90 with a group that had a notably lower
   2     rate from 91 days to the end of the year.
   3        So grouping it in the way that says "I will only
   4     put 30" ends up muddling things and not exaggerating the
   5     effect. We have not, by having the group at 90 days,
   6     exaggerated any effect.
   7   MR LANGSTAFF: I see nods from my right so that is
   8     a comment, I think, in itself.
   9        Professor Evans, thank you very much indeed for
  10     your presentation. Sir, I am in your hands. I think it
  11     may be convenient to proceed with Dr Aylin's
  12     presentation following on from that, as it does quite
  13     naturally, and review where we are at, say, 1 o'clock to
  14     a quarter past 1?
  15   THE CHAIRMAN: Yes.
  16   MR LANGSTAFF: I do not know if you are happy to change
  17     places, so Dr Aylin is nearer the screen, if he wishes
  18     to use it?
  19        Dr Aylin, can you remain standing so you can take
  20     the oath?
  21             DR PAUL AYLIN (SWORN):
  22            Examined by MR LANGSTAFF:
  23   Q. Dr Aylin, you have given evidence to us before. Have
  24     you, for the purposes of this part of the Inquiry's
  25     proceedings, prepared a report which we find at
0072
   1     INQ 13/1?
   2   A. Yes, I have.
   3   Q. Does it conclude with a number of tables and figures
   4     which begin at 13/53 --
   5   A. Yes, that is correct.
   6   Q. -- as the screen now shows us, and ends with
   7     a statistical appendix setting out the details of the
   8     statistical methodology you used, ending at page 86?
   9   A. Yes, it does.
  10   Q. You, I think, have prepared a presentation for us of
  11     your investigations. Would you like to give us the
  12     benefits of that, please?
  13   A. Yes. I would like to talk about our analysis of
  14     Hospital Episode Statistics, which was commissioned by
  15     the Inquiry. The report that we presented is in five
  16     sessions, essentially. First of all we looked at the
  17     quality of hospital activity data available to us. Then
  18     we looked at outcomes of surgery in terms of mortality
  19     and one or two other outcomes that we looked at,
  20     comparing the United Bristol Healthcare NHS Trust with
  21     the rest of England.
  22        We also looked at comparisons of UBHT with
  23     individual centres in England as well.
  24        We were asked also to look at activity rates and
  25     referrals, and patterns of these, and also we attempted
0073
   1     to examine co-morbidity or co-existing disease and case
   2     mix in Bristol, and comparing it with the rest of
   3     England.
   4        I will just briefly go over the review of hospital
   5     activity data. There are two main bodies of data that
   6     we could have looked at. The first was HIPE, the
   7     Hospital Inpatient Enquiry, which ran from 1985 up until
   8     1986. This was only based on a 10 per cent sample, and
   9     we felt that the numbers involved in the very small part
  10     of the time period that was covered by the Inquiry was
  11     not suitable for analysis.
  12        The next major set of data was the Hospital
  13     Episode Statistics, which was brought in in 1987, and it
  14     is running to this day.
  15        During its introduction in 1987 and for the
  16     following few years, the literature that we reviewed on
  17     this suggested that the data quality was very poor.
  18        The other problem with the data quality was that
  19     the surgical codes that we were interested in, OPCS 4
  20     codes, were not fully established nationally until 1991.
  21        So we felt that because of this patchy
  22     implementation of the coding for operations and
  23     procedures and because of the poor quality of data
  24     before 1991, that the HES data before 1991 was not
  25     suitable to inform the Inquiry. So this analysis
0074
   1     concentrates on Hospital Episode Statistics, beginning
   2     from the financial year 1991 to 1992, and going up to
   3     December 1995.
   4        I am just going to tell you a little bit about the
   5     methods we used for pulling this data off the national
   6     data system.
   7        We extracted all episodes between the period of
   8     1st April 1991 to 31st December 1995 for all children
   9     aged under 16, with a mention of a K or an L code.
  10     I can illustrate the process of this, if we go to
  11      INQ 13/96, I can show you some of the figures involved
  12     in this selection process.
  13        Can we just move down the page a little bit?
  14        If we look at the top, the first extract was based
  15     on years and as I say, we picked out all episodes in
  16     children aged under 16 which had a mention of a K and L
  17     procedure, and we also looked at all other episodes that
  18     belonged to children which we had pulled out beforehand.
  19        I ought to explain about the Hospital Episode
  20     Statistics, that they are based on episodes of care, and
  21     an episode of care is a continuous spell -- a continuous
  22     episode of care spent under a consultant.
  23        It may be possible that you have one or more
  24     episodes of care within an admission.
  25        Can I just show you slide 97? This just
0075
   1     illustrates how we managed to bring episodes together.
   2        Can we scan down a little bit? This is just
   3     a little diagram that may help to explain about episodes
   4     of care.
   5        One of the problems with Hospital Episode
   6     Statistics is that we do not have a patient identifier
   7     on the data that we had obtained, so we were not able to
   8     identify, through an identifying code, individual
   9     children.
  10        What we had to use instead was to use date of
  11     birth, sex and postcode, to try and link episodes of
  12     care to children, and link them together. We also had
  13     the date of admission, and we were therefore able to
  14     link episodes of care into admissions.
  15        This is a hypothetical example of one admission
  16     with three episodes of care, and there may have been an
  17     operation in the first episode, and a subsequent
  18     episode 2 and episode 3. That series of episodes, which
  19     we will call an "admission", ended in either a discharge
  20     home, a transfer to another hospital, or, in some cases,
  21     a death. Our linkage system was not perfect because we
  22     used date of birth, sex and postcode. During the course
  23     of an admission there may be a postcode change or there
  24     may be an error in data entry of a date of birth or sex
  25     or postcode which would make us unable to be able to
0076
   1     link episodes together to form an admission.
   2        In certain admissions, we were unable to find the
   3     final episode and therefore we were unable to ascertain
   4     whether a patient went home or was transferred or died
   5     and in that case, the outcome of the admission was
   6     unknown.
   7        I just want to go back to that page 96 now,
   8     please, and take you through this a little bit further.
   9        For this first extract, we pulled out all episodes
  10     of care which were linked to episodes which had a K or
  11     an L procedure in children aged under 16, for the
  12     financial years 1991/92 to 1995/96.
  13        For most of the tables and the analyses that we
  14     are going to present, they relate to epoch 3, which is
  15     1st April 1991 to 31st March 1995 and so this
  16     subselection here refers to this particular period in
  17     time.
  18        We identify 216,000 episodes out of those 289,000
  19     episodes.
  20        At this point, we started to link the episodes
  21     together, to form admissions. We identified 41,000
  22     admissions, with a mention of a K or an L procedure.
  23        There were many other episodes that may have been
  24     related to the same child but did not have a mention of
  25     a K and L procedure, so we may have an admission in the
0077
   1     early part of the period of a child who had a procedure
   2     with a K or an L code mentioned.
   3        Subsequently there may have been an admission for
   4     asthma or another surgical procedure later on in this
   5     period which we were not interested in, so we discarded
   6     those, so we kept all admissions with a mention of K or
   7     L procedures and that came to 41,000.
   8        We were then able to divide these up into
   9     admissions to UBHT, Bristol, and those admissions to the
  10     rest of the English admissions.
  11        Then, at this lowest point, we used the procedure
  12     groupings which we have talked about a little bit
  13     earlier, and the broad classes of procedures, the open
  14     and closed classes of procedures, to extract these more
  15     relevant procedures from the overall numbers of
  16     admissions with K and L codes.
  17        So these admissions that were discarded had
  18     mentions of K and L codes that were not included in our
  19     13 key procedure groups, or in our open and closed
  20     operations.
  21        I must say, I just want to make a point of
  22     clarification on the way in which the procedure
  23     groupings were actually used. On the HES database,
  24     there are four fields for operation codes and these use
  25     the OPCS 4 operation coding systems. There may be just
0078
   1     one procedure mentioned or there may be two, three or
   2     four procedures mentioned. If we are looking at an
   3     admission with a number of episodes, there may be
   4     a number of procedures in each of the episodes.
   5        The grouping systems were devised by our team and
   6     the other teams that are talking today, in order to make
   7     sense of these OPCS 4 coding systems.
   8        So the 13 procedure groupings and the broad class
   9     of "open" and "closed" were not known to the coders when
  10     coding the data; they just put the individual OPCS 4
  11     code in there. It was us that used the groupings in
  12     order to be able to have broadly clinical similar groups
  13     of operations so that we could make comparisons.
  14        The ranking system was devised by us in
  15     collaboration with the surgical experts, to enable us to
  16     pick a primary procedure out of a number of procedures,
  17     so the most important procedure out of a number of
  18     procedures in a particular admission.
  19        So these were developed by us for the analysis,
  20     and are not known to the coders in the hospitals who
  21     code these things up.
  22        Once we had our extract of data, we analysed the
  23     data in a number of different ways. If we could go to
  24     slide 100. I am going to present to you, if we could
  25     move up a little bit, four or three main kinds of
0079
   1     analysis, and I am going to talk a little bit about
   2     confidence intervals again, although we had a very good
   3     explanation earlier this morning.
   4        We looked at mortality rates for individual
   5     operations in individual age groups for our procedure
   6     groups of operations. We looked at the proportion of
   7     admissions which ended in a death as far as we could
   8     tell.
   9        You remember I talked to you about linking
  10     episodes together and in a few percentages of episodes,
  11     we were unable to find out what actually happened at the
  12     end of the admission. We were unable to find out
  13     whether they were discharged home or transferred or
  14     died.
  15        In calculating our mortality rates, we have
  16     excluded those unknown outcomes in our analysis, so we
  17     simply, in calculating mortality rates or the proportion
  18     of admissions that ended in death, looked at those
  19     admissions where we have known the outcome.
  20        We have calculated these rates both for UBHT and
  21     for the rest of England as a whole, and I will present
  22     those in a minute.
  23        We also looked at the ratio between mortality at
  24     UBHT and the rest of England. For instance, if we had
  25     10 deaths out of 100 admissions in the UBHT, and, say,
0080
   1     50 deaths out of 1,000 deaths in the rest of England,
   2     the ratio of the mortality rates between the UBHT and
   3     the rest of England would be 2, that is, 10 per cent
   4     mortality in UBHT and 5 per cent in the rest of
   5     England. So we gave the ratio of that and expressed
   6     that as a mortality ratio.
   7        We also calculated excess deaths according to the
   8     Bayesian principles that have been described before, and
   9     if we could go to slide 102, we looked at the difference
  10     in performance or in mortality between UBHT and
  11     a typical centre, quantified by predicting the numbers
  12     of deaths expected if UBHT had the typical mortality
  13     rate, or the mortality rate of a typical centre, and
  14     compared this to the observed numbers of deaths, so we
  15     worked out the numbers of deaths we would expect if UBHT
  16     had the mortality rate of a typical centre in the rest
  17     of England, and then we had the actual observed numbers
  18     of deaths and worked out this difference. This we
  19     expressed as the excess deaths.
  20        This estimate takes into account the variation in
  21     mortality rates between the other 11 centres.
  22        So this is perhaps a slightly more sophisticated
  23     analysis. This is a Bayesian figure that we come out
  24     with, whereas the mortality rates and mortality ratios
  25     are Frequentist calculations.
0081
   1        Also for the mortality rates and for the excess
   2     deaths we gave a confidence interval, a 95 per cent
   3     confidence interval, which reflected our uncertainty of
   4     the true mortality rate. When comparing a mortality
   5     rate between Bristol and the rest of England, if the
   6     95 per cent confidence intervals did not overlap, we
   7     could be fairly confident that the difference in
   8     mortality was not due to chance.
   9        Now I would like to start presenting some of the
  10     results. If we could go to the graph on page 73,
  11      INQ 13/73. If we could rotate that around, it is
  12     a shame that the copy has not come out too well, but
  13     I think we can still see the main results here.
  14        Along the bottom of the graph, you can see the
  15     various procedure groups and there are the 13 procedure
  16     groups, and these are open and the 12 and 13 are closed
  17     procedure groups.
  18        In the remaining two bars we have the broad
  19     classes of open and closed procedures. The open
  20     procedures include many of these open procedures
  21     mentioned in the groups 1 to 11, and the closed
  22     procedures also overlap on to groups 12 and 13, so they
  23     are not exclusive by any means, those two different
  24     kinds of classifications.
  25        The bars that you see here and here and here
0082
   1     (indicating) represent the mortality rate, the mortality
   2     experienced by the rest of England, excluding UBHT. You
   3     can see the mortality percentages up on the left-hand
   4     corner.
   5        So, for instance, if we look at group 3, other
   6     transposition of the great arteries, you can see that
   7     the national mortality is round about 10 per cent.
   8        The little diamonds which you will see here
   9     represent the mortality in UBHT for this particular
  10     period of time, and this graph refers to children aged
  11     under 90 days.
  12        The figure next to the little dot represents the
  13     number of valid cases, i.e. the cases where we knew the
  14     outcome, so that gives you an idea of the kind of
  15     numbers that we are dealing with.
  16        For the sake of clarity, we have not put
  17     confidence intervals on this graph for the national
  18     rates, but I will show you a table next which includes
  19     those confidence intervals.
  20        Let us take a look at this graph. As we go down
  21     the groups here, we can see that there were no
  22     admissions with a mention of a Fallot, a tetralogy of
  23     Fallot operation in children under 90 days during this
  24     time period in Bristol.
  25        If we look at inter-atrial TGAs, we can see that
0083
   1     there was only one case and this case, unfortunately,
   2     died, and that gives a mortality of 100 per cent, but
   3     this is a very small figure. When you are talking about
   4     very small numbers of cases of admissions, perhaps less
   5     than 5, certainly, it is very difficult to make
   6     inferences from these figures because we are talking
   7     about very, very small numbers, and one death might make
   8     a lot of difference to the mortality.
   9        The figure that stands out in this graph is the
  10     group 3, "other transposition of great arteries". We
  11     can see here that in the rest of England, the mortality
  12     is 10 per cent. If you look at the mortality rate in
  13     UBHT, based on 10 cases, 9 out of those 10 cases died
  14     and that gives us a mortality of 90 per cent. Round
  15     about this estimate, we have given confidence intervals,
  16     and you can see these two bars there, which, as I said,
  17     reflect our uncertainty about this mortality rate and
  18     give a range of values in which we can be fairly
  19     confident that the true estimate lies.
  20        So from this we can say that this is unlikely to
  21     be due to chance, this mortality.
  22        If we look at group 4, TAPVD, we can see that the
  23     mortality rate there is 50 per cent compared to
  24     a mortality in the rest of England of somewhat less than
  25     20 per cent -- I think it is 16 per cent.
0084
   1        The confidence intervals come very close to the
   2     national mortality, and actually, if you plot the
   3     confidence intervals of the national mortality, these
   4     two confidence intervals overlap, so we cannot be
   5     confident that this mortality rate has not occurred by
   6     chance.
   7        If we go down the rest of the operations, they are
   8     based on very, very small figures of one and two
   9     admissions, and our uncertainty in the mortality rates
  10     based on those small figures is reflected in the large
  11     confidence intervals.
  12        If we go to the broader class of open and closed
  13     operations, we can see that in the broad class of open
  14     operations, the mortality for UBHT is over 60 per cent
  15     while, in the rest of the country, it is under 20 per
  16     cent.
  17        The confidence intervals clearly do not overlap
  18     and therefore we can be fairly confident that this
  19     difference in mortality is unlikely to be due to chance.
  20        For closed operations, the mortality in Bristol is
  21     very, very similar to the rest of the country.
  22        Could we call up table 2.2 on INQ 13/55, please?
  23     This is the table on which this graph is based.
  24        In the first column we can see the total numbers
  25     of admissions for UBHT and elsewhere combined for this
0085
   1     period in time, 1st April 1991 to 31st March 1995, of
   2     children aged less than 90 days.
   3        In the second column here we have figures for
   4     Bristol, for UBHT. In the first subcolumn, we can see
   5     the total numbers of admissions, with mentions of these
   6     13 procedure groups, and our open and closed classes of
   7     operations.
   8        In the second column, we can see the numbers of
   9     admissions for which we knew the outcome, and in
  10     brackets, there is the percentage of that against the
  11     total number of admissions. We can see perhaps looking
  12     at the open operations, if we run along here, we knew 30
  13     outcomes out of a total of 37 total admissions, meaning
  14     that 81 per cent of our admissions we knew the outcome
  15     to.
  16        The third column here is the number of deaths for
  17     each of these operation groups, and the fourth column is
  18     the mortality and in the next column, we have given the
  19     95 per cent confidence intervals for each of those.
  20        In the third larger column, we have the same
  21     figures for operations carried out in hospitals other
  22     than UBHT and we have valid numbers, we have the numbers
  23     that have died, we have the percentage that have died,
  24     the mortality and the confidence intervals around this.
  25        In the last column, we have expressed the ratio of
0086
   1     the percentage that died in Bristol to the percentage
   2     that died elsewhere, as a ratio, as a mortality ratio.
   3        So just in summary, if we can go to page 103,
   4     [INQ 13/103] for children under 90 days for open
   5     procedures and I am just going to talk about the broad
   6     open class of procedures here, mortality was 63 per cent
   7     in the UBHT with a confidence interval of 44 to 80 per
   8     cent, which was four times higher than elsewhere in
   9     England, whose mortality was 16 per cent, with
  10     confidence intervals 14 to 18.
  11        In our analysis where we estimated the number of
  12     excess deaths for this group, using Bayesian methods, we
  13     estimated that there were 13.9 excess deaths out of
  14     a total of 19 deaths that occurred in Bristol for
  15     children aged under 90 days.
  16        To the next page, 104. For specific procedure
  17     groups, other TGAs, transposition of the great
  18     arteries -- this was group 3 -- had a mortality of
  19     90 per cent, although this was just based on 10 cases,
  20     which was 9 times higher than elsewhere, where the
  21     mortality was 10 per cent, with a confidence interval,
  22     8 to 13 per cent.
  23        We estimated that there were an excess of 7.8
  24     deaths with an interval of 5 to 9 out of a total of 10
  25     deaths in this group.
0087
   1        One thing to emphasise is that many of the other
   2     groups also had high mortalities, but they were based on
   3     only one or two cases, and it is difficult, if not
   4     impossible, to draw statistical inferences from those
   5     very small numbers.
   6        For closed procedures, if we go to page 105, just
   7     to summarise, mortality, GBHT was 5 per cent, the same
   8     as elsewhere, running at 5 per cent.
   9        If we can go to the next age group, 90 days to
  10     1 year and if you can put up the graph on page 74, and
  11     rotate that around? We can see that the groups with
  12     a high mortality are AVSDs and ASDs in groups 5 and 6.
  13     Where you can see that the mortality of group 5, for
  14     atrial ventricular septal defects is over 40 per cent in
  15     Bristol and at under 10 per cent in the rest of the
  16     country.
  17        For ASDs, the mortality rate was over 60 per cent
  18     in Bristol and less than 5 per cent in the rest of the
  19     country.
  20        All the other operations again are based on fairly
  21     small numbers, apart from these closed operations and
  22     also this closure of VSD. The small numbers, the
  23     confidence intervals are fairly wide and tend to overlap
  24     the mortality rate in the rest of the country.
  25        For open classes of procedures, you can see that
0088
   1     the mortality rate is round about 20 per cent in Bristol
   2     and well under 10 per cent in the rest of the country.
   3     I will give you the exact figures in a minute, although,
   4     for closed operations, again, the mortality was exactly
   5     the same as it was in the rest of the country.
   6        Could we have table 2.3 on page 56? Again, this
   7     just shows the figures the graph is based on. I will
   8     not dwell too much on that, except to look at perhaps
   9     just the open procedures here, and we can see that there
  10     was a mortality of 19 per cent in Bristol, with
  11     confidence intervals 13 to 28 per cent, compared to
  12     a mortality in the rest of the country of 7 per cent
  13     with confidence intervals of 5 to 8 per cent, with
  14     a ratio of the mortality in Bristol compared to
  15     elsewhere of 3. The mortality rate was three times
  16     higher.
  17        So just to summarise the children aged under
  18     90 days to 1 year, if we could go to page 106, as I have
  19     just said, the open class of operations, the mortality
  20     was three times higher than elsewhere in England, and
  21     our estimated numbers of excess deaths were 14.4 amongst
  22     a total of 22 deaths, with a 95 per cent interval
  23     between 7 and 20.
  24        To page 107. The procedure groups which stand out
  25     as having a high mortality in this age band, 90 days to
0089
   1     1 year, was AVSD procedures, and closure of atrial
   2     septal defects as well.
   3        For AVSDs, the mortality was 43 per cent, which
   4     was 4.6 times higher than elsewhere in England, with an
   5     estimated 7 excess deaths amongst a total of 9 deaths in
   6     that age group, and for closure of ASDs, the mortality
   7     was 63 per cent, which had a very much higher mortality
   8     than elsewhere, which was 4 per cent. That is 17 times
   9     higher, although you must remember that these are based
  10     on relatively small figures.
  11        The estimated excess numbers of deaths was 4.7 out
  12     of a total of 5 deaths.
  13        If we go to page 108, again we note that closed
  14     procedures had a very similar mortality in Bristol as
  15     elsewhere, at 4 per cent.
  16        For children aged 1 to 15 years, if we could call
  17     up graph 75 on page 75, and rotate that, you will note
  18     that none of the mortality rates for any of the
  19     operation groups or the broad open or closed classes of
  20     procedures had a significantly higher mortality than the
  21     average compared to the rest of England, although you
  22     will note in the open class of operations the confidence
  23     interval is very close to the national rate.
  24        For all ages, we were able to add up the total
  25     excess deaths which, for each of the age groups, for
0090
   1     each of the procedure groups, and for the open and
   2     closed classes of operations, which included minus
   3     numbers and plus numbers, representing the difference in
   4     the deaths that we would expect, and if we go to
   5     page 110, I can just summarise the figures here. So for
   6     all children aged under 16 years, based on the 13
   7     procedure groups, there were 35.3 total excess deaths
   8     out of 67, and that should be UBHT rather than BRI.
   9        With a confidence interval of between 21 and 48,
  10     a 95 per cent interval of 21 to 48.
  11        That is based on the 13 procedure groups. If we
  12     look alternatively at the other classification of open
  13     and closed class of procedures, we will find a similar
  14     figure of 32.9, total excess deaths, out of 69 in the
  15     UBHT, with a 95 per cent interval of between 9 and 49.
  16        For the moment I have spent quite a bit of time
  17     presenting data on mortality, but we wanted to look at
  18     other outcomes in a very limited way, limited by the
  19     limitations of the HES data, but we wanted to look at
  20     complications from surgery.
  21        There is a diagnosis code based on the
  22     international classification of diseases, the 9th
  23     revision, and this is a series of codes which is used to
  24     code for the diagnosis for each admission. There are
  25     several of these codes which refer to complications from
0091
   1     surgery.
   2        If we could go to page 58 and first of all look at
   3     table 2.5, so the top table, the ICD 9 codes for these
   4     complications are 997.0, which indicates complications
   5     to the central nervous system, which includes brain
   6     damage, cardiac complications, which is ICD 9 code 997.1
   7     and respiratory complications, 997.3, and also urinary
   8     complications, 997.5, which would include renal
   9     failure.
  10        We can see, looking at the central nervous system
  11     figures here, that out of -- this is looking at all
  12     ages, 0 to 15 -- 505 admissions, 8 admissions had
  13     a mention of complications from surgery to the central
  14     nervous system. That is 1.6 per cent of admissions,
  15     with confidence intervals of 0.7 to 3.1.
  16        If you look at the rate of complications mentioned
  17     elsewhere in hospitals other than UBHT, we can see that
  18     there are 28 admissions with a mention of this
  19     complication, a 0.4 per cent rate of complications, or
  20     rates of recorded complications, I should say, on this.
  21     We can see the confidence intervals here are 0.2 to 0.5.
  22        Because these two confidence intervals do not
  23     overlap, we can be reasonably sure that these
  24     differences are not due to chance. What they are due to
  25     we do not know.
0092
   1        We can go down the other complications here as
   2     well. We can see that for cardiac complications the
   3     rate is 11.1 per cent compared to 4.3 per cent, which
   4     has significantly more -- there is a significant
   5     difference there. For respiratory complications, again,
   6     there is a significant difference in the complication
   7     rate between Bristol and elsewhere. For renal failure
   8     or urinary complications the differences are not
   9     significant. So that is for open procedures.
  10        If we look at table 2.6, we can see the same table
  11     for closed procedures. The complication rates are
  12     a little lower than in open procedures. We can see that
  13     there are, certainly for cardiac and for -- for cardiac
  14     operations there is a significant difference in the
  15     complications. But, otherwise, there is no significant
  16     difference in the -- I am sorry, and also for
  17     respiratory. But otherwise there is no significant
  18     difference.
  19        One of the problems with looking at this diagnosis
  20     code is that it is not possible to distinguish between
  21     pre-existing disease or illness and complications
  22     resulting from the operation in question. So these
  23     complications may have existed as a result of another
  24     previous operation, an unrelated operation. We are
  25     unable to tell whether the diagnosis was made before the
0093
   1     operation or after the operation. So that is one
   2     warning with this data.
   3        The other warning, and we will see some evidence
   4     of this a little bit later, is that Bristol may have
   5     been slightly better at recording diagnosis, and this
   6     may help to explain this difference. So it may be
   7     a data quality issue rather than an actual complication
   8     rate from the operation.
   9        We also looked at length of stay as a possible
  10     indicator of outcome. If we could go to table 2.7 on
  11     page 58, this table 2.7 gives the proportion of patients
  12     going home for each week after admission. We can see,
  13     for Bristol and for elsewhere. We can see that this is
  14     the first seven days after admission. In Bristol only
  15     2 per cent go home within the first week for open
  16     operations. Elsewhere, 27 per cent go home in the first
  17     week.
  18        If we look at closed operations, we again see
  19     a big difference in the proportion of patients going
  20     home in the first week. For Bristol, we have 14 per
  21     cent of patients going home in the first week, and
  22     elsewhere we have 50 per cent of patients going home in
  23     the first week.
  24        If you look at the other weeks of admissions,
  25     there is not so much of a difference, and actually the
0094
   1     main difference appears to be in the first week after
   2     admission.
   3        There could be several explanations for this. We
   4     only looked at length of stay from admission, and it may
   5     be that Bristol may have had a longer period of time
   6     before the operation, a longer pre-op' time than
   7     elsewhere, which would account for less going home in
   8     the first week. It may also be true that one might
   9     argue that having a longer length of stay meant that
  10     Bristol hung on to their patients for longer and perhaps
  11     that might be an indication of better care or not so
  12     good care. It is very difficult to say, so I can only
  13     present the figures here and perhaps someone else might
  14     be able to interpret that in the light of other
  15     information.
  16        I have talked about outcomes and we have looked at
  17     mortality, we have looked at length of stay and we have
  18     looked at complication rates.
  19        One of the other things that we were asked to look
  20     at was activity, or operation rates.
  21        One of the problems of looking at operation rates
  22     for a geographical area, for a catchment area, is that
  23     you need to define a catchment area for the hospital.
  24     You need to know the population that each centre was
  25     working on in order to get a rate of operations.
0095
   1        There are problems with defining catchment areas
   2     for hospitals. There are some guidance on
   3     administrative catchment areas, but these can sometimes
   4     be a little vague, referring to parts of health
   5     authorities or parts of a region, and are sometimes
   6     difficult to interpret.
   7        So we wanted to look at activity for each of the
   8     centres, each of the 12 centres that we looked at, and
   9     therefore we decided to define our catchment areas in
  10     two different ways.
  11        We looked at a geographical catchment area based
  12     on geographical proximity to each centre. So we looked
  13     at each Health Authority across the country and
  14     whichever centre that Health Authority was closest to,
  15     we said that was within that centre's catchment area.
  16        We also looked empirically at where the majority
  17     of patients from each Health Authority were treated and
  18     defined catchment areas in that.
  19        I will just show you a map, although it does not
  20     come up very clearly on this reproduction. If we go to
  21     page 80 and rotate. Sadly it does not come up very well
  22     at all. The originals were actually produced in
  23     colour. But you can see the different centres dotted
  24     around the country, and this is Liverpool, Freeman
  25     Hospital, Killingbeck, Glenfield and the number of
0096
   1     hospitals that we looked at. Then this small section is
   2     enlarged to give the four London hospitals: Harefield,
   3     Royal Brompton, Great Ormond Street and Guy's.
   4        This map of England is divided up into health
   5     authorities, and each health authority is shaded
   6     according to its geographical proximity to the centre.
   7        You will notice that Wales is not shaded here, and
   8     that is because the data that we were able to use was
   9     based on admissions to English hospitals. Admissions to
  10     Welsh hospitals are recorded using a different system
  11     called PEDW, or the patient episode database in Wales.
  12     We did not have access to that, and therefore we did not
  13     include Wales in our analysis.
  14        In fact in the activity based analysis we excluded
  15     all Welsh children who were treated in Bristol, simply
  16     because we did not have the denominator for the rest of
  17     Wales. But in all the other analysis we have included
  18     all Welsh children treated in Bristol. But just for the
  19     activity analysis, we have excluded those so we can get
  20     rates.
  21        Could we go to table 4.1 on page 62 --
  22   Q. Could I ask, Dr Aylin, if it would be convenient to deal
  23     very quickly with this and then have lunch, or whether
  24     you prefer to come back to this after a lunch break?
  25   A. I would prefer to get it all over and done with. I can
0097
   1     go through it fairly quickly.
   2   Q. Can we deal with this table, and then we will have
   3     a break, please?
   4   A. All right, yes. Page 62, please.
   5        This table gives us activity based on the
   6     populations as we have defined geographically for each
   7     of the main centres. Because London has a very
   8     complicated catchment area arrangement and it is almost
   9     impossible to define a catchment area for London, we
  10     have grouped London into one large centre and one large
  11     catchment area.
  12        This first column represents the number of
  13     admissions for which we did not have a postcode, so we
  14     were therefore unable to identify where they had come
  15     from.
  16        This second column is the number of admissions.
  17     The third column is the population. When we are looking
  18     at children aged under 1 year, we base the population on
  19     the numbers of live births per year.
  20        Looking at activity, the first column is the
  21     centre activity. So that is the rate of operations
  22     carried out by Bristol in this first line on the Bristol
  23     population, and the same for the other centres. That
  24     gives an operation rate for open operations carried out
  25     by Bristol on its own catchment area of population,
0098
   1     which is 0.72.
   2        If we look at the second column we can look at the
   3     total number of operations, so that is operations
   4     carried out by Bristol and other centres on the Bristol
   5     catchment area, which is 1.09.
   6        Interestingly to note here, for Bristol the centre
   7     rate activity for open classes of procedures was
   8     relatively low compared to the other centres in the
   9     country. This was mainly, it seems, because many
  10     patients were actually going out of the geographical
  11     catchment area. We can actually see the number of
  12     patients who went out of the area to be treated, and
  13     that is 73. Very interestingly, one can note that no
  14     patients came in from other catchment areas to Bristol
  15     to be operated on by UBHT.
  16        That is interesting, as all the others do seem to
  17     have a flow in and out. So the one thing to note from
  18     this table is that there seemed to be a low centre rate
  19     activity, that is numbers of operations carried out by
  20     Bristol, which is probably explained by patients going
  21     out of the area to be operated on by other centres.
  22        Table 4.3 looks at another age band; those
  23     patients over 1 year. That is on page 64. But the main
  24     thing to note is that for open operations for children
  25     under 1 there seem to be a low operation rate by Bristol
0099
   1     and many of the patients seem to go out of the catchment
   2     area.
   3        For children aged 1 to 15, you can see a similar
   4     flow in that 90 patients went out and only 9 patients
   5     came in.
   6        So I think in terms of activity, I think we can
   7     pause there. Thank you.
   8   MR LANGSTAFF: Dr Aylin, thank you very much thus far. We
   9     will take a break. I know the Chairman will want to
  10     have one now. Can I say before the break is announced
  11     that there may well be, from those who are in the
  12     immediate vicinity, questions that they may have which
  13     it may be appropriate should be addressed to you through
  14     me when you have finished your presentation. If that is
  15     the case, I would be grateful to hear of them so I can
  16     deal with any question or queries that arise. But thank
  17     you thus far.
  18   THE CHAIRMAN: Shall we break now until 2 o'clock? Thank
  19     you, Professor Evans and Dr Aylin. We will come back
  20     and continue after the break.
  21   (1.25 pm)
  22            (Adjourned until 2.00 pm)
  23   (2.00 pm)
  24   MR LANGSTAFF: Sir, Dr Aylin had just dealt with the
  25     catchment areas, with activity rates and inflows and
0100
   1     outflows from the various different centres.
   2   A. I think that has covered activity for the moment. The
   3     next section of our report attempted to look at case
   4     mix, or disease severity, and co-existing diseases.
   5        It is important to look at this because high
   6     mortality or low mortality can be explained in a number
   7     of different ways, through chance and we have looked at
   8     our analysis incorporating confidence intervals in that,
   9     through aspect of data quality, perhaps I will talk
  10     a little more about that when I sum up, and perhaps more
  11     importantly about the severity of the cases actually
  12     seen.
  13        This is why we looked at case mix.
  14        The first thing we looked at was age at operation,
  15     to get an idea of the proportion of children in each age
  16     group that UBHT was operating on, compared to the rest
  17     of England.
  18        If we go to table 5.1 on page 69, [INQ 13/69] we
  19     can see this is the age breakdown of children operated
  20     on with open procedures between the time period that we
  21     were looking at.
  22        We can see that under 90 days here, in Bristol
  23     only 7 per cent of the total number of operations was
  24     made up of children under 90 days in the UBHT, compared
  25     with 22 per cent elsewhere. So that is for open
0101
   1     operations.
   2        For closed operations which we can see in
   3     table 5.2, we can see that the proportions are very much
   4     more similar, with 40 per cent of the patients that UBHT
   5     saw under 90 days, compared to 45 per cent elsewhere.
   6        There does appear to be a different age mix in
   7     open operations for this period of time.
   8        We also looked at the diagnosis given to the child
   9     on the Hospital Episode Statistics data set, and the
  10     primary diagnosis recorded in each episode. If we can
  11     go to table 5.3 on page 70, table 5.3 gives the most
  12     common primary diagnosis given in admissions with open
  13     procedures between the time period that we are looking
  14     at.
  15        Actually, certainly as far as the general pattern
  16     is concerned, the only difference that really stands out
  17     to my mind is this category here (indicating) which is
  18     the other ill-defined and unknown causes of morbidity,
  19     or ICD code 799. This is often used as a dump code
  20     where diagnosis is not clearly recorded either on the
  21     notes or by the coder, and instead of putting no
  22     diagnosis on there, this rather vague catch-all
  23     diagnosis of 799 is given.
  24        You can see here that in Bristol in the UBHT, none
  25     of the patients seen at Bristol were given this code for
0102
   1     open procedures, this is for all ages 1 to 15, whereas
   2     elsewhere, approximately 9 per cent of cases were given
   3     this rather vague code, which suggests to me that in
   4     Bristol, coding was of a higher quality than elsewhere.
   5        The other diagnoses seemed to have fairly similar
   6     proportions, and so, as far as that is concerned, it
   7     does not look as though the diagnosis are greatly
   8     different in Bristol than elsewhere, but the 799 code
   9     sticks out. You can see that to a lesser extent in
  10     table 5.4, which looks at closed procedures and for all
  11     ages, you can see very few, only one admission has this
  12     code of 799, compared to 7 per cent elsewhere, again
  13     suggesting better diagnostic coding.
  14        The last thing we looked at was co-existing
  15     disease and we looked at admissions with a diagnosis of
  16     Down's syndrome. If we could go to table 5.5 a little
  17     bit lower down on that page, we can see within the open
  18     category that in Bristol 10.3 per cent of admissions had
  19     a diagnosis of Down's syndrome, with confidence
  20     intervals 7.8 to 13.3, whereas elsewhere 7 per cent had
  21     a diagnosis of Down's syndrome, with confidence
  22     intervals 6.4 to 7.5.
  23        The confidence intervals do not overlap here, so
  24     there is a significant difference there, but as we have
  25     already mentioned earlier, it does seem that Bristol was
0103
   1     slightly better at recording their diagnosis and this
   2     may explain this difference.
   3        For closed operations, there is a difference
   4     between the diagnosis, but the confidence intervals
   5     overlap, so there is no significantly statistical
   6     difference there.
   7        We looked at deprivation as well, socio-economic
   8     deprivation, because for each admission we had
   9     a postcode. We could look at the area in which that
  10     postcode was and use 1991 census data to try and
  11     determine the kind of area that the admission came
  12     from. There is a well-used score of socio-economic
  13     deprivation called the Carstairs score and we used five
  14     divisions of this score, with division 5 or quintile 5
  15     being the most deprived group of areas, and Carstairs 1
  16     or the first quintile being the least deprived of
  17     areas.
  18        The idea behind this was, in many diseases the
  19     outcome for these diseases is worse for people living in
  20     deprived areas rather than less deprived areas. We
  21     thought if Bristol was operating on many more people
  22     from deprived areas, this may perhaps account for a high
  23     mortality.
  24        Let us just look at table 5.6 on page 71, first.
  25     We can see our Carstairs scores down here, 1 to 5, so
0104
   1     these are admissions from most deprived areas and
   2     admissions from least deprived areas, and the in-between
   3     scores, and we have worked out the percentage of
   4     patients from each of the areas that were operated on by
   5     Bristol.
   6        You can see that Bristol, out of all its cases,
   7     only operated on 11 per cent of its patients which were
   8     from the most deprived areas, whereas elsewhere 22 per
   9     cent came from the most deprived areas.
  10        This may be a result of several things. It may be
  11     that there are just less deprived people in the Bristol
  12     catchment area and that may explain why there are less
  13     operations there. I do not know. But there are several
  14     explanations for that.
  15        Table 5.7 looks at mortality rate for open
  16     procedures by Carstairs quintile for all centres
  17     combined excluding Bristol, to try and get an idea of
  18     whether, if you come from a deprived area, you have
  19     a greater chance of dying.
  20        Actually, the figures do not suggest this. The
  21     figures down the five Carstairs quintiles, 1, 2, 3, 4
  22     and 5, certainly for open operations are quite similar,
  23     so it does not look like the mortality is any different
  24     if the patient comes from a less deprived area or from
  25     a deprived area.
0105
   1        If we look at table 5.8, this is for closed
   2     operations and mortality. We can see that figures are
   3     fairly similar up in the most deprived area, where there
   4     is a slightly higher mortality there.
   5        That is essentially our analysis. I just want to
   6     sum up a little bit and talk about one or two of the
   7     problems with the data. First of all, I just wanted to
   8     talk about our mortality analysis. There were
   9     differences in mortality, quite large differences in
  10     mortality, for open operations in children under 1 year,
  11     particularly in children aged under 90 days.
  12        It seems, looking back at other studies that have
  13     looked at how well mortality is recorded on HES data,
  14     that the fact of death is well recorded on HES data, but
  15     we are a little bit concerned about the admissions in
  16     which we did not know whether the patient had been sent
  17     home or whether the patient had died.
  18        Certainly for some of the operation groups this
  19     was not an insignificant percentage.
  20        It could be that if Bristol were very good at
  21     recording mortality and everyone who died was actually
  22     recorded on there, then all these other admissions when
  23     the outcome was not recorded may have survived. What
  24     would make things worse was that elsewhere, if other
  25     centres in the country were not so good at recording
0106
   1     mortality and did not pick up all the cases that had
   2     died, then Bristol might appear to have a high mortality
   3     rate than elsewhere, simply because it was better at
   4     recording deaths.
   5        We actually did a little experiment and looked at
   6     this, and assumed that in Bristol all the cases where
   7     the outcome was unknown survived and for the rest of the
   8     country, we assumed that all the cases where we did not
   9     know the outcome had died. Then we did a re-analysis of
  10     this data.
  11        In doing that, in fact in being generous to
  12     Bristol in saying that it had picked up all its deaths
  13     and all the missing cases had survived and the opposite
  14     with the rest of the country, we still noted these
  15     differences in open operations in children under 90
  16     days; they were still there.
  17        So we think that actually mortality is probably
  18     quite well recorded. As far as complications are
  19     concerned, the complications from surgery we do not
  20     think are very well recorded. I think one could think
  21     of reasons why hospitals would not be recording that
  22     diagnosis in the HES data set, because often this
  23     diagnosis comes out some time after the admission. So
  24     we are not completely confident about what the
  25     complication rate actually tells us.
0107
   1        For the length of stay as well, that can be
   2     affected by many things, not just quality of care but
   3     administrative procedures. This was length of stay from
   4     admission to discharge, and it may be that there could
   5     have been a longer period before the operation for
   6     administration, for blood tests and things which may
   7     have accounted for this length of stay.
   8        So again with length of stay, I think that has to
   9     be interpreted with a great deal of caution.
  10        For our activity rates, which we found were on the
  11     lowish side for open operations and the patient flows --
  12     patients tend to go out of the catchment area rather
  13     than into the catchment area -- again, the methods by
  14     which we defined catchment areas, using the two methods
  15     empirically and geographically, did tend to change these
  16     results, and because of the difficulty in defining
  17     catchment areas, we would be rather cautious about
  18     interpreting those results.
  19        As far as case mix is concerned, we were only able
  20     to look in a very limited way at case mix, and I think
  21     there is more work that needs to be done on case mix,
  22     and possibly HES is not the right data set to explain or
  23     to go into detail about case mix.
  24        Certainly, as far as the Down's syndrome, it did
  25     seem as though Bristol was better at recording diagnosis
0108
   1     and that could account for the higher proportion of
   2     patients with Down's syndrome. That seems to be shown
   3     in the primary diagnosis tables which we looked at,
   4     which did suggest that UBHT was better at recording
   5     diagnosis.
   6        I think that is all I have to say.
   7   MR LANGSTAFF: Not quite all you have to say, I hope,
   8     because there are one or two questions for you, which
   9     I hope may not detain you long.
  10        There is a large chunk at the start of your
  11     report, and I should have told you at the outset of your
  12     evidence -- I am telling you now and you understand, and
  13     I know, that your report is of course received and taken
  14     as read, and you have been highlighting certain parts of
  15     it, but there is a large chunk at the start of your
  16     report which explores the quality of the data.
  17        What, in your view, are the weaknesses of the
  18     data, in just two or three sentences?
  19   A. I think there may be problems with misclassification of
  20     procedures, in that the coding process in the PAS
  21     systems which feed into the HES systems, and the
  22     difficulty in interpreting some of the diagnosis on the
  23     clinical records, and the procedure coding, could lead
  24     to some misclassification of procedures.
  25        So I think that is one problem that we have to
0109
   1     bear in mind.
   2   Q. That presumably is not only as across every centre;
   3     even if they had coders of equal experience dealing with
   4     matters in exactly the same way, you do not know, do
   5     you, necessarily, whether coders did have the same
   6     experience and did operates in the same way?
   7   A. I think that is one of the problems, but to add to the
   8     misclassification problem, if there was random
   9     misclassification, if high risk operations were coded as
  10     low risk operations and some low risk operations were
  11     coded as high risk operations, that would tend to blur
  12     the findings that we find. Actually, if the coding was
  13     perfect, we may see even greater differences in
  14     mortality than we see here, because the picture is
  15     somewhat clouded by misclassification. That is not
  16     necessarily an argument to discount the findings in this
  17     report.
  18   Q. Are there, do you think, any particular weaknesses, not
  19     so much in the data as in the analysis which you have
  20     conducted?
  21   A. This HES data has not been used very often for this kind
  22     of work. This is a first. I think actually this
  23     analysis has to be seen in the context of all the other
  24     analyses that will be presented to you and have been
  25     presented to you.
0110
   1   Q. Are you saying that without those other analyses, one
   2     might not be able to draw firm conclusions?
   3   A. I think the findings are quite strong, and before
   4     I started the analysis, I was not sure that I was
   5     actually going to find anything here, but the findings
   6     are so strong, I think it would be difficult to account
   7     for the findings in terms of data quality, in terms of
   8     case mix.
   9   Q. I was going to ask you about both of those things:
  10     strength and data quality. In terms of strength, for
  11     instance, you showed us slide 13/107, the bottom of the
  12     page, please, where the point figures might suggest that
  13     closure of ASD had a mortality nearly 18 times higher
  14     than elsewhere, which is a startling finding, if it be
  15     right?
  16   A. Yes.
  17   Q. One would have to read that subject, would one, to the
  18     confidence intervals, which might suggest that in fact
  19     the difference might be between 24 per cent at the
  20     bottom of one range and 10 per cent at the top of the
  21     other?
  22   A. Yes, that is true.
  23   Q. So it may in fact be only 2.4 times?
  24   A. Yes. We have to be very careful about the point
  25     estimates there and take into consideration the range of
0111
   1     values, because these are based on fairly small numbers.
   2   Q. On the same theme, you showed us a bar graph, 13/73.
   3     Turn it sideways. We are comparing, here, are we,
   4     Bristol against first of all the average of other
   5     centres?
   6   A. This is the average for the whole country.
   7   Q. So this is not other centres. But each of those bars
   8     would, itself, have to have a range, as it were an
   9     extension bar?
  10   A. Yes.
  11   Q. In order to demonstrate whether or not there was or was
  12     not 95 per cent statistical confidence that the results
  13     were different?
  14   A. Yes. The national figures are also based on numbers
  15     which may fluctuate from year to year and therefore we
  16     need a confidence interval around that.
  17        We have not included it on the graph, but we have
  18     included it on the tables.
  19   Q. It is the clarity for presentation that needs to be
  20     emphasised. Rightly you mentioned it at the time, but
  21     I give it emphasis now in this question.
  22        You do give us some data. Let us look at
  23      INQ 13/61. I am not going to ask you about this in
  24     detail, because Dr Spiegelhalter will have that
  25     pleasure, but you set out here 12 other centres, or 11
0112
   1     other centres, and their rates in respect of operations?
   2   A. These are excess numbers of deaths, yes.
   3   Q. In relation to those, you have drawn attention
   4     nationally, something like 9 per cent of all operations
   5     are ascribed to what you call a "dump code",
   6     unclassified?
   7   A. Yes -- no. No. The procedures, the diagnosis was
   8     assigned to a dump code, yes.
   9   Q. May it be the case that variability as to which centre
  10     is dumped and therefore had poor quality data may be
  11     quite great? So some centres may diagnose very well and
  12     record very well; some may not?
  13   A. Yes, that is true.
  14   Q. To the extent one is looking at a centre which may not,
  15     there may be quite a degree of variability around the
  16     results of an individual centre?
  17   A. That is true, although we are looking at the variability
  18     of diagnosis coding and probably variability of
  19     procedure coding, but the fact of death I think is
  20     probably fairly well recorded. So you would get less
  21     cases perhaps if you were trying to pull out all
  22     operations that you were interested in, because some
  23     people may have had operations that would not be coded
  24     to that in the records and you would not therefore be
  25     able to pull them out and look at them.
0113
   1   Q. Is this a question for you or for Dr Spiegelhalter:
   2     Bristol, as we have explored in the evidence, had
   3     relatively low numbers of operations. Some other
   4     centres did, too, and some had much higher numbers. We
   5     have been told that there was a feeling that less in
   6     number meant less success in terms of having mortality
   7     rates.
   8        Are you able to form any view as to whether your
   9     data justifies that conclusion or not? If so, should
  10     one not perhaps be comparing Bristol with those other
  11     centres of comparable size?
  12   A. We have not looked at that in the data. We have not
  13     compared Bristol to other centres of comparable size.
  14     That is a possibility. We are still talking about
  15     rather large differences in that particular age group of
  16     under 1s and I am not sure that would explain the
  17     difference. I think probably you might need a clinician
  18     to comment on that better than myself.
  19   Q. The last thing I need to ask you about is this: when we
  20     dealt with complications, you very fairly indicated that
  21     one cannot say, from looking at the data, whether the
  22     incidence of complication is recorded in respect of
  23     Bristol patients meant that they came to Bristol in
  24     poorer condition and were operated upon, or whether they
  25     were operated upon and thereby, or afterwards, developed
0114
   1     poorer condition; or whether it is a mixture of the
   2     two. The data simply does not tell you. Have I got
   3     that right?
   4   A. Yes.
   5   Q. But you did indicate that the statistical difference
   6     there may be a product of the better data quality which
   7     Bristol had.
   8        If that is true for morbidity, why is it not true
   9     for mortality?
  10   A. As I said before, from other literature and I have
  11     recorded something in my report, quoted one study which
  12     looked at recording of mortality. Henderson et al
  13     looked at data from six health districts between 1979
  14     and 1985 -- this is an old study -- but found that
  15     98.2 per cent of hospital records which had specified
  16     that death occurred in hospital could be matched with
  17     the corresponding death certificate.
  18   Q. That was 1985 and before, so that is looking at the
  19     pre-HES data set?
  20   A. Yes, and it is looking at the clinical records rather
  21     than the actual coding system.
  22   Q. So one would need to match HES up with national records?
  23   A. I think to be sure, if it was possible to link HES
  24     records with mortality records and follow up all these
  25     cases to see whether they had died or not, that might be
0115
   1     an interesting analysis to do and could lend perhaps
   2     some support to the analysis of the data.
   3   Q. Interesting, I have no doubt; useful for the Inquiry?
   4   A. Yes, I think it would be useful.
   5   MR LANGSTAFF: Sir, I do not know whether the Panel have any
   6     questions?
   7   THE CHAIRMAN: We have a number of detailed matters which we
   8     may deal with by exchange of correspondence and put in
   9     the public record over time. I think it would be
  10     advantageous to move along at the present moment. Thank
  11     you very much indeed.
  12   MR LANGSTAFF: Thank you, Dr Aylin. Would you like to
  13     remain at the front, as it were, while we hear from
  14     Professor Murray?
  15          PROFESSOR GORDON MURRAY (SWORN):
  16            Examined by MR LANGSTAFF:
  17   Q. Professor Murray, your full name, please?
  18   A. Gordon Douglas Murray.
  19   Q. And as before, briefly your qualifications?
  20   A. I am a Professor of Medical Statistics at Edinburgh
  21     University. I have an MA, a Diploma in Mathematical
  22     Statistics, a PhD. I am a chartered statistician and
  23     although I am not clinically qualified or trained, I am
  24     a Fellow of The Royal College of Physicians of
  25     Edinburgh.
0116
   1   Q. You prepared a report for the purposes of this Inquiry,
   2     together with Audrey Lawrence and John Pollock, which we
   3     find at Inquiry 14/1, do we?
   4   A. That is correct.
   5   Q. After a number of tables and figures and appendices, it
   6     finishes on page 78. Would you like to tell us what you
   7     found?
   8   A. I would like to present the results of the work that was
   9     undertaken by myself and colleagues at Edinburgh
  10     University.
  11        We were commissioned by the Inquiry to look at two
  12     of the relevant data sources. There was the United
  13     Kingdom Cardiac Surgical Register and the South West
  14     Congenital Heart Register.
  15        For both of these sources, we were asked to
  16     comment on the data quality and for the UK register,
  17     which is a national register, we were asked to look at
  18     surgical activity and outcomes in Bristol relative to
  19     elsewhere in England.
  20        I shall be presenting a summary of our findings.
  21     The report and my presentation this afternoon will very
  22     much focus on data quality. I will talk about activity
  23     and outcomes at Bristol relative to elsewhere, but
  24     I think Dr Spiegelhalter's report looks at this far more
  25     comprehensively than we have been able to do, and
0117
   1     I think I would defer to Dr Spiegelhalter's report for
   2     most of that analysis.
   3        The two data sources we were looking at have been
   4     described in some detail already at the Inquiry, but
   5     I think it is probably useful just to give a quick
   6     summary.
   7        The UK register was a voluntary system, an
   8     anonymous system that was run by the Society of
   9     Cardiothoracic Surgeons of Great Britain and Ireland.
  10     It was established in 1977, and cardiac units each year
  11     submitted summaries of their activity and of the
  12     outcome.
  13        If we could maybe turn to page 14/63, [INQ 14/63],
  14     this is an example of the form that was used to record
  15     the information. There was information on adult
  16     surgery, but I am not looking at that at the moment.
  17     I am focusing on the congenital surgery. For each unit
  18     for each year, there were three pages similar to the one
  19     we see here. We can see that the data are broken down,
  20     first by age, so we are looking at children under 1
  21     year, and then for individuals over 1 year. One aspect
  22     we have already discussed is that there was no upper age
  23     limit so there are adults included, unlike the other
  24     data we have been looking at. As well as that breakdown
  25     by age, there was a breakdown by open versus closed
0118
   1     surgery.
   2        We have had a lot of discussion this morning about
   3     the problems in classifying open versus closed. This
   4     essentially does not apply to this particular data
   5     source, because that is not something that we as
   6     analysts imposed upon the data. It is actually there;
   7     it is an intrinsic part of the data collection and the
   8     surgeons themselves have assigned cases as being
   9     actually open or closed.
  10        Within any of these categories, the open, closed,
  11     under 1 year, we have a huge number of diagnostic
  12     groups. A number of these diagnostic groups are then
  13     split further by the type of procedure that was done,
  14     whether it was palliative or corrective.
  15        So, within each of these categories, the surgeons
  16     returned the number of cases and the number of deaths.
  17     So, for example, in the form, for ASD, individuals aged
  18     over 1 year, undergoing open surgery, there were 19
  19     individuals treated that year, none of whom died.
  20        So that is another key difference between this
  21     source and other data sources; we are not actually
  22     getting data on identifiable individuals, we are getting
  23     aggregate figures, saying how many of a particular type
  24     of case and of those, how many died. So essentially,
  25     that is the information we were dealing with there.
0119
   1        That was actually a computerised database, and
   2     that was maintained by the cardiologists working at
   3     Bristol as against the surgeons who were responsible for
   4     the Cardiac Register. That database was used to record
   5     all patients who were referred from the South West
   6     Region to the Bristol cardiologists.
   7        That system actually dates back to 1966, and was
   8     certainly maintained up until May 1993, when the
   9     consultant largely responsible for running the system
  10     retired, and beyond 1993, up to the end of the Inquiry
  11     period, the data were still updated to some extent, but
  12     clearly the extent to which it was updated has decreased
  13     over time, as the one individual who wholly owned the
  14     system was no longer working there.
  15        So those were the sources we were looking at. Our
  16     first task was to examine the data quality, and when
  17     thinking about quality, it can be very helpful to
  18     separate out two different aspects, and it is what we
  19     have called the "primary quality" versus the "secondary
  20     quality".
  21        The primary quality essentially relates to the
  22     context under which the data were collected. Were the
  23     individuals collecting the data suitably trained? Were
  24     they suitably motivated? Were there clear written
  25     procedures explaining how the data ought to be
0120
   1     recorded? How did one handle ambiguous codings? Were
   2     the data checked when they came in? Were they
   3     validated? Were they actually fed back to the
   4     individuals who collected the data so they could be
   5     validated and feed back comments? Were the data
   6     actually used? If one collects data and sticks them in
   7     a filing cabinet, they are not going to be checked; they
   8     are not going to be used.
   9        This aspect of data quality is crucial. If one
  10     wants to collect reliable data there needs to be
  11     adequate systems. The staff needs to be trained,
  12     motivated. You need good validation, the data actually
  13     to be used, so any problems that are there are found and
  14     resolved.
  15        One of the problems with primary data quality is
  16     that one cannot actually observe it simply by looking at
  17     the data themselves. I can be given a large body of
  18     data and that tells me nothing about the motivation of
  19     the individual who collected it. I like to visit the
  20     cardiac units and speak to the people involved in
  21     collecting the data, find out more about the background,
  22     but I was not able to do that systematically, so instead
  23     we have had to rely on information already presented to
  24     the Inquiry and also on a large number of fairly
  25     informal conversations that we had with people who are
0121
   1     active in the area and knowledgeable about the
   2     background.
   3        The secondary quality issues really relate to
   4     things one can observe by looking at the data
   5     themselves. If, for argument's sake, the sex of the
   6     individual ought to be recorded, is it always recorded?
   7     Are dates always in sequence? So would a date of
   8     operation come after a date of birth, for instance? So
   9     one can look at the consistency and the completeness of
  10     the data.
  11        Those were the two aspects of quality that we
  12     looked at.
  13        If I start with the Cardiac Register, again here
  14     based on evidence that has already been presented to the
  15     Inquiry as well as our conversations, I think that it is
  16     very clear that the primary data quality was rather
  17     poor. In particular, the guidelines that were available
  18     for the data collection process were rather limited.
  19     There was next to no attempt to validate the data. The
  20     feedback to the centres that provided the data was
  21     fairly slow and not very systematic, and certainly,
  22     speaking to a large number of individuals involved in
  23     the process, they had the perception that the data were
  24     going into a black hole; nothing useful was going to
  25     emerge, so where was their motivation to collect
0122
   1     reliable data?
   2        I think one can get into a vicious circle: if the
   3     data are not useful and they are not used, then why
   4     bother collecting them? I think that perception
   5     actually pervades a lot of the routine Health Service
   6     data. It goes beyond the Cardiac Register.
   7        I do not know the HES data particularly well
   8     because I have worked much more with the Scottish
   9     equivalent, the Scottish morbidity records, but I know
  10     a lot of my clinical colleagues in Scotland are very
  11     sceptical about these systems, again because they have
  12     no sense of ownership and they feel they are putting
  13     data into a black hole and there is no motivation for
  14     them to collect reliable data.
  15        If we turn instead to the data from the South West
  16     register, again, superficially, the primary data quality
  17     there is not ideal. There did not seem to be very clear
  18     written guidelines, but what distinguishes it from the
  19     national register is a very clear sense of ownership.
  20     There was essentially one consultant who drove this
  21     process for over 30 years and the staff working with him
  22     were also very stable over that whole period, so clearly
  23     there would be a consistency through the very fact it
  24     was the same individuals collecting the data.
  25        There were a number of specific problems that we
0123
   1     encountered. With the Cardiac Register, one problem was
   2     the anonymity codes. The whole system was set up on the
   3     basis of anonymity and this was done by the Secretary to
   4     the Society allocating an anonymised code to each return
   5     before they were passed on for analysis.
   6        As we got to work with the data, it became
   7     apparent that there seemed to have been a change in
   8     those anonymous codes.
   9        We looked at this in quite a bit of detail and the
  10     Society also looked quite hard, but we could not find
  11     any documentation to support that change in coding,
  12     although the individual who is doing the analysis did
  13     recall that somewhere in the mid-1980s there was
  14     a change. It is our very strong belief from looking at
  15     the data that there was a change in those codes going
  16     from 1984 to 1985, and this has required us essentially
  17     to discard the data from 1984. We cannot be certain
  18     which return goes with which centre. I think from 1985
  19     onwards we are in a much more secure position.
  20        Another specific problem which I think relates to
  21     both data sources and we have discussed it at some
  22     length already today, is the question of reporting
  23     mortality. The guidelines that did exist for the
  24     national register said that the units ought to record
  25     all deaths within 30 days of operation. That is a very
0124
   1     standard definition for mortality.
   2        Mr Keogh, in his evidence that he presented to the
   3     Inquiry on Day 38 explained that this actually changed
   4     and later on, I think very recently, the register has
   5     gone for in-hospital mortality rather than 30-day
   6     mortality.
   7        That is obviously much easier to track. It is
   8     interesting too, that Mr Wisheart in his evidence, on
   9     I think Day 41 to the Inquiry, actually implied that the
  10     definition he was using was deaths within 30 days or
  11     beyond 30 days, but during the same admission. So if,
  12     for argument's sake, Bristol was using that definition,
  13     it would tend to over-report mortality; other centres
  14     were maybe using in-hospital mortality which would tend
  15     to under-report.
  16        So I think there is a real possibility, a real
  17     risk of systematic differences in the way that mortality
  18     has been reported in different units.
  19        Again, Mr Stark's evidence on Day 50, he was
  20     obviously very concerned about the reliability of the
  21     mortality data and he clearly believes that it is
  22     under-reported systematically in the register.
  23        Again, just to elaborate on how difficult this
  24     problem is, I think there are also questions that there
  25     may be double-counting. The surgeons filling in the
0125
   1     register were not meant to double-count procedures, but
   2     just thinking about how the return could have been
   3     completed, I think there is the potential for certain
   4     patients who had undergone several procedures, maybe to
   5     have each procedure counted each time, as it were, and
   6     if that patient died, maybe that death could have been
   7     recorded two or three times.
   8        Again, this is very anecdotal, but yet another
   9     surgeon we spoke to suggested that in his unit, deaths
  10     were only reported to the register if they were clearly
  11     a direct result of the operation, so we are not looking
  12     at all mortality but operative deaths.
  13        You would think death is so obvious and so easy to
  14     define, but that is not the case. There are all sorts
  15     of nuances in the way it can be detected, recorded, so
  16     I think we must be aware that there is a real question
  17     mark there.
  18        Looking at data quality, there was another
  19     exercise we were able to do which I think was very
  20     valuable and that was to compare the UK register with
  21     the report that was produced by a Working Party of the
  22     Royal College of Surgeons.
  23        In 1992, they were asked to prepare a report by
  24     the Department of Health, to look at supra-regional
  25     funding, and the Working Party from the College of
0126
   1     Surgeons approached each of the different cardiac units
   2     and asked them to report on their activity from 1988
   3     through to 1991.
   4        I think it is an interesting commentary on the
   5     register that they did not actually go there to get
   6     their data because the information they wanted was
   7     precisely what was available in the national register.
   8     But that is not what they did: they gathered data
   9     independently and if we could turn to page INQ 14/39,
  10     this is a comparison which we did looking at the returns
  11     to the national register, that is the ones under
  12      "return", versus the ones that were made to the Working
  13     Party.
  14        The original of this was in colour, which helped
  15     things a little. If we can see, for example, here is an
  16     example of a centre where there is a fairly large
  17     discrepancy between the information that was returned to
  18     the Working Party and the information that was returned
  19     to the UK register.
  20        If we could page down to the bottom of that table,
  21     it takes a little work, but there is actually only one
  22     centre, centre J here, that returned identical figures
  23     to the UK Registry and to the Working Party.
  24     Interestingly, that is Bristol, which may be another
  25     indication that Bristol were somewhat fastidious in
0127
   1     their data collection. Here, just an example, another
   2     centre, there in the last two years of their reporting,
   3     they are reporting substantially more activity to the
   4     Working Party than they did to the UK Register.
   5        I think that is just an example of how
   6     intrinsically difficult it is to report and record
   7     activity.
   8        You see two exercises which should have produced
   9     exactly the same information but raised some quite
  10     surprising discrepancies.
  11        If I could go on now to look at page 45 of my
  12     report, please, [INQ 14/45] this was a comparison
  13     between the returns to the UK register and the HES
  14     data. I want to just focus on a small part of the
  15     table. What I am doing there is looking just at the
  16     activity reported, the numbers here are the numbers per
  17     centre returned to the UK register and the corresponding
  18     numbers reported to HES. The column here to the right
  19     gives the ratio of those two numbers. You can see just
  20     by eye that the UK returns tend to be larger than the
  21     HES returns. 10 of the 12 centres report more activity
  22     to the register than was recorded through HES.
  23        You can see typically a centre will be returning
  24     maybe 20 to 30 per cent more activity to the other
  25     register than to HES. Centre 3 is a striking exception
0128
   1     to that, which, according to HES, is far more active
   2     than the data returned to the register.
   3        I have similar information there on deaths, and
   4     also looking at the relative reporting of deaths. There
   5     is a lot of information there, but essentially the
   6     reporting of mortality rates is more consistent than the
   7     reporting of activity, but it certainly does raise in my
   8     mind some concerns.
   9        I would imagine that the HES data are recorded
  10     more consistently from centre to centre, and so the
  11     variability here probably reflects the difference in the
  12     rigour with which the data was returned to the register,
  13     but that is speculation, it is not something supported
  14     by the data.
  15        If we could move on to page 47, please
  16     [INQ 14/47], this is looking at similar information but
  17     for Bristol alone. If we home in here just in the
  18     children aged under 1 undergoing open surgery, again, we
  19     see a fairly consistent pattern of more activity
  20     reported through the register than through HES. The
  21     numbers of deaths are actually rather similar, of the
  22     two different mechanisms.
  23        One part of the explanation for the
  24     over-reporting, as I have already mentioned, is that for
  25     the UK register there is no upper age limit, whereas the
0129
   1     HES analysis was restricted to 15, but obviously that is
   2     not a problem (we were looking at under 1s) that we
   3     should be getting similar activity. That is not what is
   4     being observed.
   5        If we turn now to look at the South West register,
   6     this was specific to Bristol. One of the major
   7     strengths of that register is that the records related
   8     to individual children. We were not looking at
   9     admissions, procedures or episodes. There is none of
  10     this possible worry over double-counting. The register
  11     was looking at children and following them through time,
  12     maybe reporting multiple operations. So intrinsically,
  13     it has the possibility to give good data with far less
  14     ambiguity to do with possible double-counting, although
  15     again, it is very likely to be under-reporting mortality
  16     because there were no systematic procedures to follow up
  17     patients once they were discharged.
  18        If we could look at page 50, please, of my report
  19     [INQ 14/50] this is a comparison between the UK register
  20     and the South West register. I have done this according
  21     to the consensus groups that we have already heard about
  22     in quite a bit of detail.
  23        Looked at sort of in the whole, just looking at
  24     the sort of volume of activity being reported according
  25     to the South West register and according to the national
0130
   1     register, there was actually pretty good agreement.
   2        Here I am looking at the fine detail and obviously
   3     the agreement is far from perfect. There are a number
   4     of the consensus groups where there is reasonable
   5     agreement, but there are others where there are striking
   6     discrepancies. One of the areas where there is
   7     particular interest is obviously to do with the switch
   8     procedures, the G2 was the earlier, the intra-atrial
   9     operation and the G3 was the switches. You can see
  10     there is almost a complete discordance there.
  11        This was a pattern that came out in quite a lot of
  12     our comparisons, so we went on to look at this in much
  13     more detail.
  14        If we could turn to page 51, please, this will
  15     need to be turned on its side. This is a very busy
  16     table with a lot of information in it, so I will try and
  17     take you through it slowly, because I think it shows
  18     quite an important caveat about the consensus groupings,
  19     especially in relation to the UK register.
  20        If we start with the South West register, what we
  21     see -- these are, if you want, the old-fashioned, the
  22     Mustard, the Senning procedures, and we can see that
  23     from 1984 onwards, the numbers are reasonably
  24     consistent. We have the numbers here with the number of
  25     deaths in brackets, so the numbers of deaths are small,
0131
   1     but suddenly it seems to go out of fashion and the
   2     numbers drop off quite dramatically. As that is
   3     happening, the number of switches is gradually taking
   4     off.
   5        So we can see that the Mustard, the Senning
   6     procedure that was the appropriate way to tackle
   7     transposition of the great arteries in the mid-1980s
   8     became the switch operation. So we are seeing a change
   9     in practice.
  10        If we look, for instance, at the coded clinical
  11     records, we see a very similar, very consistent
  12     pattern. The numbers are slightly down relative to the
  13     South West register, but we see this same pattern, the
  14     Senning, the Mustards, dropping off, being replaced by
  15     switches and similarly in the surgeons log, the older
  16     operation going out of fashion to be replaced by
  17     switches.
  18        All three sources are showing the relatively high
  19     number of deaths that go with the switches.
  20        If we look at the Cardiac Register, we really do
  21     not see anything remotely like that. There seems to be
  22     no activity to speak of under the column that was meant
  23     to be the Sennings, the Mustards, and it looks as if
  24     switches have been going on right from the start. The
  25     problem there is that the national register was based on
0132
   1     diagnoses rather than operations. We tried to tease
   2     these out by saying that the Mustard and Senning was
   3     a palliative operation for the TGA and the switch was
   4     the definitive, the corrective operation, but that
   5     clearly was not the perception. Clearly, at the time
   6     these data were being reported, the Senning and the
   7     Mustard was regarded as the curative procedure.
   8        So we have completely lost that distinction with
   9     the UK register, because we have diagnoses and not
  10     procedures.
  11        If you were to lump those two groups together, you
  12     get the children with transposition of the great
  13     arteries, and the figures become remarkably consistent,
  14     but we have lost that distinction between the different
  15     operative approaches.
  16        So that is a limitation we have to be aware of.
  17     It is a problem in getting the UK data to map directly
  18     on to the other data sources so we can compare like with
  19     like.
  20        If, for argument's sake, the UK register showed
  21     that certain procedures had an excess mortality in
  22     Bristol, those procedures we would not necessarily
  23     expect to match up with problem areas in HES, because we
  24     are not quite comparing like with like.
  25        If I can try and momentarily regroup and say what
0133
   1     does this tell us about data quality, the first thing,
   2     a priori, we would not expect the register to reveal
   3     reliable data and in particular, there is a real
   4     possibility of mortality being recorded systematically
   5     differently between different centres. But if we go on
   6     from there and look at the outcomes in the different
   7     centres, and if we could start, maybe, on page 54 of my
   8     report, again, that will need to flip around
   9     [INQ 14/54].
  10        I would be the first to say this is a very
  11     simplistic way of looking at these data. What I have
  12     done here is to look at the activity in Bristol and
  13     compare it with all the activity outside Bristol, just
  14     pooling it together as if there were one centre, i.e.
  15     not Bristol.
  16        This completely masks any variability from any
  17     centre outside Bristol. This rightly is called the
  18     "naive analysis" in Dr Spiegelhalter's report. I hope
  19     it does not reflect my naivety that I am showing you
  20     this because I think it is helpful, but it is not the
  21     definitive way to look at this data. What I am saying
  22     will be very much superseded by what Dr Spiegelhalter
  23     will tell you later. There are still some very striking
  24     patterns.
  25        If we look, for instance, here the open surgery in
0134
   1     the children aged under 1, the mortality rate at Bristol
   2     over the Inquiry period is sort of remarkably stable at
   3     around 25 per cent, whereas outside Bristol, it starts
   4     at a similar figure, but almost halves over the Inquiry
   5     period. So there seems to be a pattern outside Bristol
   6     for outcomes to improve, but that is not seen in
   7     Bristol. There are some odds ratios and confidence
   8     intervals here essentially saying that that difference
   9     is more than can be explained by chance. Whatever the
  10     explanation is, chance is not a good explanation.
  11        If we look at the individuals aged over 1, the
  12     mortality rates are lower, but we still see a similar
  13     pattern with rates in Bristol that are relatively high
  14     against a national pattern of falling mortalities. The
  15     numbers of deaths for children or for adults undergoing
  16     closed procedures are much smaller, so it is difficult
  17     to say anything definitive there, but even here, the
  18     point estimate, the mortality rates, the observed rates,
  19     are slightly higher in Bristol, but the numbers there
  20     are really too small to make very much of that.
  21        If we could move on to page 55, please,
  22     [INQ 14/55], and if we could page down just a little,
  23     this is lumping everything together over time, so I am
  24     running from 1985 right up to the financial year
  25     1994/95, but picking out the children aged under 1.
0135
   1     This is really telling us the same story as we have seen
   2     before. For the open surgery, a substantially higher
   3     mortality rate in Bristol than elsewhere, and the same
   4     trend amongst the closed procedures.
   5        I have already explained the difficulty of looking
   6     at the consensus groups with the Bristol detail, but
   7     there is still a reasonable consensus. The particular
   8     areas where we are seeing an excess are groups 4, 5 and
   9     6. That was the TAPVD, the AVSD and closure of ASD.
  10     These are actually the same areas of concern that seem
  11     to be cropping up in the other data sources.
  12        So in spite of all the limitations I have been
  13     talking about, there is a remarkable consistency of the
  14     sort of signal that is coming through against the noise.
  15        If we are to move on to the next page, page 56,
  16     again, if we could page down, this is now the same
  17     information but for individuals aged over 1 year, and
  18     the mortality rates are much lower in that age group,
  19     but again, we see this pattern of a mortality rate in
  20     Bristol that is at least 50 per cent higher, whether we
  21     are looking at the open or the closed surgery. The
  22     numbers with the closed surgery are too low to draw firm
  23     conclusions, but the open surgery, again, this is too
  24     big a difference to be explained by chance. There is
  25     not a huge consistency now in the areas where the excess
0136
   1     seems to be occurring, but one area is the Fallot and,
   2     again, group 5, the AVSD. That is coming out again,
   3     just as we saw with the earlier data.
   4        Those are very much descriptive, very much "naive"
   5     ways of looking at the data, but I think they are
   6     informative.
   7        If I could move on to the next page, 57, this is
   8     an analysis which I do not think I could possibly
   9     describe as naive, because it is based on a methodology
  10     devised and published by Dr Spiegelhalter!
  11        This is the ways of ranking the different
  12     centres. I want to focus on two lines here. This is
  13     looking at the open surgery in the children aged under
  14     a year, taking the whole time period for which the UK
  15     data are available. We can see that Bristol is ranked
  16     12 out of the 12, but the confidence interval for that
  17     ranking actually spans from 10 up to 12, so they are
  18     certainly in the lower quarter of the distribution, but
  19     even with these data, there is no firm evidence that
  20     their mortality rate is lower than all other centres.
  21        If we are to look at the open surgery --
  22   THE CHAIRMAN: Interrupting you just for a minute, do you
  23     mean higher rather than lower?
  24   MR LANGSTAFF: The mortality rate being higher is what you
  25     meant to say, I think, rather than lower?
0137
   1   A. Yes, apologies. If we look at the open surgery in the
   2     individuals aged over a year, out of the 12 centres,
   3     Bristol comes out as being 11th out of 12, and in fact,
   4     the confidence interval is from 8 to 11, so it actually
   5     excludes the possibility of Bristol being 12th out of
   6     12. This is highlighting the Harefield problem that has
   7     already been discussed.
   8        So although there is fairly strong evidence that
   9     the reported mortality rates from Bristol are higher
  10     than national norms, they are not necessarily an extreme
  11     outlier on the basis of this analysis.
  12        If I try and pull things together now in terms of
  13     my interpretation of what we have been looking at,
  14     I think the first thing we have to say is that if you
  15     are simply to take the data from the UK register on face
  16     value, then there is very strong evidence that the
  17     mortality rate at Bristol is higher than the norm
  18     elsewhere. There are at least three explanations why
  19     that might be the case. It could genuinely be that the
  20     clinical care delivered at Bristol was poorer than
  21     anywhere else. But it could indicate systematic
  22     differences in the way the data were collected and
  23     reported, or it could indicate systematic differences in
  24     case mix. Maybe the children being operated on at
  25     Bristol were systematically more ill than elsewhere.
0138
   1        There are a number of holes, I think, in the
   2     data. In my own opinion, by far the most serious one
   3     relates to the reporting of mortality. I think there
   4     could very well be systematic differences, but
   5     fortunately, that is the hole that is most easily
   6     addressed. There are national registries of mortality,
   7     and these data must be linked to get rid of that as an
   8     explanation.
   9        It might explain part of the difference, possibly
  10     not all. We need to know how much of the difference can
  11     be explained in that way.
  12        Another concern in my mind is this discrepancy
  13     between the activity reported by HES and that reported
  14     by the Register and by the South West Congenital Heart
  15     Register as well. I do not know whether HES is
  16     under-reporting or whether the other sources are over
  17     reporting. The way that the South West Register was
  18     developed, it seems very unlikely that it would
  19     over-report, because it is based on children, so that
  20     raises the question in my mind, is it HES that is
  21     under-reporting, maybe through quite substantial
  22     miscoding? Maybe we are not picking up children who
  23     have been through and been operated upon. I think that
  24     needs further work.
  25        The South West Register would be a good starting
0139
   1     point, because the children there, they are actually
   2     named, they have dates of birth, they are easily
   3     traceable and they could be linked to other sources to
   4     see if there is a systematic problem with HES, at least
   5     for the Bristol data.
   6        The final problem we have to wrestle with is case
   7     mix. I think all the statisticians working on the data
   8     agree that we do not have a good handle on case mix.
   9     I think all I can say in mitigation is all the
  10     experience I have in looking at adjustments for case mix
  11     in lots of medical contexts is that they tend to be
  12     subtle. We are looking here for observed differences of
  13     maybe 50 per cent or even 100 per cent and in general,
  14     case mix adjustments do not lead to, you know, effects
  15     of that size. So I do not believe that case mix is
  16     a complete explanation of what we are observing here.
  17     It might explain part of it, but I think that there is
  18     a need to look further to try and get a handle on case
  19     mix.
  20   THE CHAIRMAN: Thank you very much indeed. Mr Langstaff?
  21   MR LANGSTAFF: Does it follow from those last remarks that
  22     your preferred explanations of the perceived difference
  23     are either that it is a question of data collection
  24     differing, or there is some other explanation, other
  25     than case mix, and other than chance, and other than
0140
   1     data collection, for the apparent difference to
   2     Bristol's disadvantage in the data that you have
   3     collected?
   4   A. My own view, which is very much subjective, not
   5     evidence-based, is that we are probably over-estimating
   6     the extent of the excess mortality. I think the extent
   7     is such that it is probably not explained by an artefact
   8     of data collection or case mix. A two-fold difference,
   9     and not only that, but one where there are systematic
  10     patterns. The fact that mortality outside Bristol fell
  11     systematically over time, but in Bristol it did not, it
  12     is difficult to think of a mechanism, a systematic flaw
  13     in the data recording that would generate that effect
  14     spuriously.
  15        So I believe that there is a real difference, but
  16     probably, the magnitude of that difference is being
  17     influenced by various biases, some of which we could get
  18     a handle on; others of which we can never quantify.
  19   Q. One of the handles that you are recommending, I think
  20     you agree with Dr Aylin, is that we need to have some
  21     idea of deaths?
  22   A. Yes. I think that is absolutely mandatory, that that is
  23     looked at.
  24   Q. Can I ask you to go to a table you have not shown us,
  25     which is on page 52, table 15?
0141
   1   THE CHAIRMAN: Mr Langstaff, while that is being found, can
   2     I just again explore an answer? You said "my own view,
   3     which was very much subjective, and not evidence-based,
   4     is that we are probably over-estimating the extent of
   5     the excess mortality."
   6        You go on then to explain why it may it not be due
   7     to an artefact, and so on.
   8        Do you mean we are probably not over-estimating?
   9   A. I think we are over-estimating. I believe --
  10   Q. Notwithstanding what you then say?
  11   A. Notwithstanding that. There are a lot of indications
  12     that Bristol was meticulous in collecting data, and the
  13     whole problem in this area is that to be meticulous one
  14     is penalised.
  15   MR LANGSTAFF: Do I take it from what then followed that
  16     although there is a degree of over-estimation,
  17     nonetheless, one is left with a real difference?
  18   A. That is what I believe. That is my interpretation of
  19     the data.
  20   Q. Table 15: can we go down to the very bottom, death
  21     rates? You here are looking at five different data
  22     sources as they appeared to you. Each of these data
  23     sources would be collected in the various different ways
  24     you have described, subject to, depending on who was
  25     collecting it, different systematic problems?
0142
   1   A. Yes.
   2   Q. If one looks at the one line in which all five report
   3     death rates, albeit that they have different levels of
   4     activity recorded, can you say, as an expert, whether
   5     those rates are dissimilar -- after all, there may be
   6     something of a 20 per cent difference between UKCSR and
   7     HES -- or whether they are surprisingly similar?
   8   A. I think they are strikingly similar.
   9   Q. To what extent, if at all, does that answer any problems
  10     that you may have in terms of your worries about the
  11     difficulty of data collection for any one source?
  12   A. I think there is a very important issue there, in that
  13     although we are looking at five sources, we are not
  14     looking at five independent sources. If, for argument's
  15     sake, the culture at Bristol is that one is fastidious
  16     about following up patients so that all deaths are
  17     recorded, then every data collection system in Bristol
  18     is going to have good mortality, whereas, say in
  19     centre X there is a culture that "we get rid of the
  20     patients and we are not interested once they are out of
  21     the care of the surgeons", then every single data source
  22     from that centre will tend to report low mortality.
  23        So it is the same core of people who are
  24     collecting these data from all the sources. There are
  25     different individuals involved, there are people coding
0143
   1     data differently and so on, but fundamentally, it is the
   2     same children of which we are collecting data, and if
   3     there are good data available at a centre, all the
   4     sources will tend to reflect that, or, if there are poor
   5     data, all the sources will tend to reflect that.
   6        So the reinforcement could be illusory to some
   7     extent.
   8   Q. So the secretary of the department or surgeon who sends
   9     off the returns to the UKCSR will be likely to have the
  10     same approach in your experience as the administrative
  11     clerk who looks at the HES data and makes the data
  12     entry, PAS is probably similar to HES, and they in turn
  13     are similar to the information to be gained from the
  14     case notes themselves? That is the thesis, is it?
  15   A. Yes --
  16   THE CHAIRMAN: I did not think that was the thesis.
  17     I thought the thesis was, if, for example, at X you have
  18     the clerks who are not part of the culture who are
  19     over-reporting, or you have staff who move patients on
  20     rather than report them as mortality statistics,
  21     compared with an institution where that does not happen,
  22     then that will taint the data throughout?
  23   A. Yes. I mean, if we say for argument's sake that the
  24     cardiac surgeons see every patient five weeks
  25     post-operatively, you could say a unit might have that
0144
   1     as a policy. Then that unit is going to be remarkably
   2     good at picking up 30-day mortality. If there is no
   3     systematic follow-up, or only appropriate patients are
   4     followed up, then they could miss 30-day deaths.
   5   THE CHAIRMAN: It may be that there is no difference between
   6     Mr Langstaff and myself, save that I misunderstood his
   7     intervention. I apologise both to you and to him.
   8   MR LANGSTAFF: That is a good note, I think, to end your
   9     evidence on, unless there are further questions from the
  10     Panel?
  11   THE CHAIRMAN: Mrs Maclean?
  12            Examined by THE PANEL:
  13   MRS MACLEAN: It is really following on from your points
  14     about meticulous record-keeping and its potential
  15     impact. We have been addressing ourselves to the
  16     thought that there may be an impact following on from
  17     miscoding as open or closed. I would be interested in
  18     your comments on the possible consequences. For
  19     example, if a procedure is miscoded as closed, then the
  20     effect on the death rate for open procedures would be to
  21     reduce it, and similarly if a procedure was miscoded as
  22     open, the effect would go in the same direction.
  23        So again, is this an example of the kind of
  24     penalising effect of meticulous coding? I would be
  25     interested in your comments on our thoughts.
0145
   1   A. Yes, certainly, any errors of coding tend to blunt
   2     distinctions and groups are brought together, so as you
   3     say, the open cases would appear to have lower
   4     mortality; the closed cases would appear to have higher
   5     mortality. That is an example.
   6        I was thinking much more of, you know, the figure
   7     with which deaths are sought, complications are sought,
   8     these sorts of issues. If one is scrupulous about
   9     recording complications, then your unit will have a much
  10     higher complication rate than a unit that does not do it
  11     with the same rigour.
  12   MRS MACLEAN: Thank you.
  13   THE CHAIRMAN: There are no further questions from the
  14     Panel.
  15   MR LANGSTAFF: Sir, I wonder if there may be a very short
  16     break, perhaps no more than five minutes? I think it is
  17     principally for the convenience of the stenographers,
  18     before we proceed with the last session of the
  19     afternoon, in which Dr Spiegelhalter will present his
  20     figures.
  21   THE CHAIRMAN: Yes. Shall we take a short break of about
  22     five minutes? I understand that some of the witnesses
  23     do have to get away later in the day, so let us make it
  24     a short break of five minutes.
  25   (3.25 pm)
0146
   1               (A short break)
   2   (3.30 pm)
   3   MR LANGSTAFF: Dr Spiegelhalter, would you stand to take the
   4     oath, please?
   5          DR DAVID SPIEGELHALTER (AFFIRMED):
   6            Examined by MR LANGSTAFF:
   7   Q. Dr Spiegelhalter, we have heard from you before in this
   8     Inquiry so I only need, I think, ask you: is the report
   9     which is shown on your screen the first page of a report
  10     which you have produced for us?
  11   A. Yes.
  12   Q. Of which the tables and figures, INQ 15/102, is the end?
  13   A. Yes.
  14   Q. What you have done is synthesised the statistical
  15     sources we have been exploring today and we explored
  16     last July, in order to come to some overall statistical
  17     view at this stage of this Inquiry's investigation into
  18     the adequacy of care in Bristol?
  19   A. Yes. If I could try to put that in context, if I could
  20     have the Inquiry document INQ 15/104, please, this
  21     briefly summarises the statistical strategy that was
  22     laid out last July. It highlights Phase II, the
  23     exploratory investigation of available data sources,
  24     which is what we are engaged in at the moment. I put
  25     this up partly to emphasise that at this stage we can
0147
   1     neither confirm any apparent divergencies, as we would
   2     call them in performance, that may be found at this
   3     stage, and in particular, we are not seeking to explain
   4     these away. This will be the subject of further
   5     investigation.
   6        If I could have 106 [INQ 15/106], this attempts to
   7     protect myself slightly against the acknowledged
   8     limitations of the synthesising exercise that I have
   9     been engaged in.
  10        First of all, I should say, we are incredibly
  11     grateful to the analyst teams who have been producing
  12     all the data. It has been a pleasure to work together
  13     on this difficult problem, but there are definite
  14     limitations. I am only going to be talking today about
  15     short-term mortality and the whole issue of morbidity
  16     and long-term complications will not be dealt with.
  17     I think we fully acknowledge that mortality is an
  18     inadequate measure of performance for comparison.
  19        As we have already heard today, we have imperfect
  20     data sources that we are attempting to learn from, and
  21     risk stratification, although we have been trying to do
  22     our best by breaking children up into age groups,
  23     stratified by type of operation and by period or epoch
  24     of when the operation took place, it is still very
  25     limited. We acknowledge that further clinical
0148
   1     stratification would be an advantage.
   2        We are also careful to avoid at this stage saying
   3     anything about what might have been known at the time of
   4     the operations. We are acting completely in hindsight,
   5     trying to work out what actually went on and we will not
   6     be covering at all what could have been known at the
   7     time from the sources of information.
   8        Again to emphasise at this stage, we are not
   9     giving any causal reasons for any differences that are
  10     found.
  11   Q. It is sometimes said that a statistical survey is best
  12     done prospectively rather than retrospectively. How far
  13     is that then true of a situation such as this?
  14   A. I think in the situation, even more than any other
  15     situation, that is true, and I think when we look at all
  16     the data sources, I think we can see that having six
  17     data sources essentially covering much the same issue
  18     shows a certain redundancy and that if good information
  19     systems have been set up in the first place, we would
  20     not have to try to do this exercise now.
  21        Again, that is with the value of hindsight.
  22        If I could jump to page 45, a table in my report,
  23     although it is rather dense, that tries to summarise --
  24     if it is readable, I would like to talk about the first
  25     couple of lines in that. It tries to summarise very
0149
   1     briefly the six sources of information that we have
   2     heard about already today and there is no point in my
   3     repeating what has been said.
   4        If we look at the top two lines if I can just
   5     point with the pen, we can see that the purposes of the
   6     different systems vary enormously from PAS which is an
   7     administration system that feeds returns to the national
   8     HES system, to the code of clinical records, which are
   9     medical records completed by medical personnel, to the
  10     surgeons' logs which were used for their own personal
  11     record for audit and for constructing returns to the
  12     Cardiac Register, essentially owned by the surgeons, to
  13     the South West Congenital Heart Register, which was for
  14     epidemiological information and clinical backup,
  15     essentially owned by the cardiologists. The Hospital
  16     Episode Statistics, the national administration system,
  17     not a clinical system, now in fact being used by the
  18     Department of Health as a basis for its own high-level
  19     and clinical performance indicators.
  20        That is derived from Patient Administration
  21     Systems in various hospitals.
  22        Finally, the Cardiac Register, which was derived
  23     from a professional body for comparative but anonymous
  24     audit and that was essentially owned by the surgical
  25     team in Bristol.
0150
   1        The rest of the table shows the fact that these
   2     different systems had different definitions of activity,
   3     different times at which they are available, different
   4     possibilities for different ages.
   5        Perhaps I draw attention to the comment at the
   6     bottom: none of these six sources have established
   7     validation procedures or ability to have systematic
   8     follow-up, even as people have mentioned a couple of
   9     times already today, even to record deaths.
  10        That is just a summary showing diversity of
  11     sources, and I suppose I should say, quite a statistical
  12     comment, none of these sources are the kind of data that
  13     normally statisticians would be very happy doing their
  14     research on, or even making a statement about at all.
  15   Q. May I then ask why you have done it?
  16   A. We were requested to do so by the Inquiry team, who are
  17     very persuasive.
  18   Q. But did you think it was a valid exercise to attempt
  19     it?
  20   A. Given that these are the only available sources of data,
  21     I think we all went into it with great trepidation, and
  22     frankly, as some people have mentioned, with
  23     considerable expectation that these would come up with
  24     very different answers since none of these systems would
  25     normally be considered reliable enough for making secure
0151
   1     statements about mortality rates.
   2   Q. Do you still, having done the analyses, have that same
   3     view, jumping ahead, perhaps?
   4   A. Jumping ahead, I would say that I, like others, have
   5     been surprised certainly in terms of the fact that for
   6     Bristol we have all these sources, have been surprised
   7     at the degree of agreement between them.
   8   Q. Has that changed your view as to the degree of reliance
   9     that might be placed on the degree of reliability of
  10     these sources when combined?
  11   A. Yes. If there had been huge disagreement between these
  12     sources, it would make any generalisation from them,
  13     even if one source showed divergent performance, rather
  14     difficult to feel confident about. We are looking for
  15     corroborative evidence. As Professor Murray mentioned,
  16     we should not assume the systems are totally
  17     independent. There are obviously connections between
  18     them. The ethos of good data collection will obviously
  19     influence them all in common.
  20        If we go to page 48 of my report, this is a rather
  21     dense table. Could we just concentrate on the first
  22     part? Could we go back to the table?
  23   THE CHAIRMAN: I think there is a general premise that only
  24     one of you can touch it. Tony will do it at the back.
  25   A. If we can get back to the table, if we concentrate on
0152
   1     the first area of it, where it says "number of
   2     admissions", but could we include the left-hand column
   3     as well, that has the type of operation, the G1, and so
   4     on.
   5        The aim of this table is to compare all the six
   6     sources in the third epoch, that is when they are all
   7     available, that is 1991 to March 1995 -- maybe it is
   8     readable in its current form, so perhaps we should leave
   9     it as it is, then I can try to highlight things.
  10        First of all, I am going to look at this area
  11     here. We are looking at the 13 types of operation and
  12     procedure groups and the open and closed. This is just
  13     the number of admissions or operations, as some of the
  14     data sources are recording. These are the numbers. So
  15     the Fallot type operations, these are just 54 from PAS,
  16     56 from the clinical records, 63 from the surgeons'
  17     logs, and so on.
  18        This kind of data has been presented already
  19     today.
  20        I produced this in a bit of magical statistics at
  21     the end, a CV, a co-efficient of variation, which is
  22     simply a measure of the degree of agreement between
  23     those numbers. Technically it is the standard deviation
  24     divided by the mean. It is a measure of how well those
  25     numbers agree. Values of less than 20 per cent would
0153
   1     generally be considered as reasonable agreement. Values
   2     less than 10 would be considered fairly good agreement.
   3        This enables us quite quickly to run our eye down
   4     and see where the agreement seems to be quite good.
   5        So in terms of number of admissions, there is
   6     quite good agreement on Fallots, very poor agreement on
   7     the TGAs, because we have already heard that the Cardiac
   8     Register's coding of group 2 and group 3 is full of
   9     problems.
  10        A fairly good agreement on TAPVDs and on AVSDs and
  11     looking down further, really quite good agreement on the
  12     number of open operations, not perfect agreement, but
  13     not too bad.
  14   Q. That falls into your better than 10?
  15   A. Yes. If we can jump to the area of most interest, the
  16     mortality rates, we can look across again and see within
  17     Fallot operations the agreement in terms of the
  18     mortality rates between all six sources is really quite
  19     good. It is not good on say the switches, group 3,
  20     because the Cardiac Register is messed up. If we took
  21     that out, the agreement is not so bad at all. It is not
  22     too bad on TAPVDs and on AVSDs and in terms of the
  23     mortality rate for open surgery, it is extremely good.
  24     I actually use the word "extraordinary" agreement
  25     between these different sources.
0154
   1   Q. You used two superlatives there, "extremely" and
   2     "extraordinary"?
   3   A. I think it is surprising, given our initial scepticism
   4     about these sources of position.
   5   Q. One can see, looking at the figures, on five data sets
   6     there is a range of 13 or 14?
   7   A. Yes. Some areas the agreement is not so good. There
   8     are problems in coding clearly that have existed in the
   9     Fontans and surgeons' logs. There have been some
  10     problems. For the truncus, the numbers are so small
  11     even though they all come up with fairly high mortality
  12     rates, the agreement is not wonderful.
  13        I suppose I would like to call attention to the
  14     agreement on the open operations and the Fallots, the
  15     TAPVDs, the AVSDs, because it is actually where quite
  16     a lot of attention is going to focus.
  17        There are some more tables that compare the other
  18     epochs in my report, but I think that gives us some
  19     confidence in proceeding with an analysis of the
  20     national data on the basis of the Cardiac Register and
  21     HES, with the known limitations in those data sets.
  22        If I can go to page 110 which is a summary, really
  23     based on Professor Murray's presentation just now on the
  24     HES and CSR, when compared on the whole national data,
  25     not just on Bristol.
0155
   1        To repeat what Professor Murray said, there is
   2     more activity and deaths tend to be recorded in Cardiac
   3     Register than in HES. It is not clear whether that is
   4     under-reporting of HES or under-reporting of the Cardiac
   5     Register.
   6        However, with some notable exceptions where there
   7     are coding problems, the mortality rates are reasonably
   8     similar.
   9        Similarly, from centre to centre, the reasonably
  10     consistent pattern and level of agreement between the
  11     Cardiac Register and HES and Bristol is in the middle of
  12     that. Professor Murray pointed out 73 seemed to have
  13     very odd reporting on the Cardiac Register. Bristol
  14     seems to be unremarkable in its degree of agreement.
  15        The agreement between individual procedure groups
  16     is not always so good, but really, that is hardly
  17     surprising, given that the way in which these 13
  18     procedure groups have been mapped onto is so different
  19     in the two sources.
  20        As we have seen, the Cardiac Register has been an
  21     attempt to map from the diagnostic categories onto the
  22     13 groups, and from HES it has had to work through
  23     a two-stage system of the ops codes and being mapped
  24     onto the 13 groups.
  25   Q. So is a simple way to understand that to say the
0156
   1     distinction between an open operation and a closed one
   2     is comparatively obvious and if one is faced with that
   3     distinction across the whole range of cases done in
   4     a centre, taking any difficulty there may be in
   5     establishing mortality, nonetheless, one is looking at
   6     a rate which is much less variable than rates for
   7     individual operations about which there may be
   8     disagreement, a question of when one you code,
   9     a question of which code applies, which subcode applies,
  10     and so on?
  11   A. Yes and there will be more chance of disagreement
  12     because the numbers are small on the individual
  13     procedure groups.
  14        If I can now jump to page 51 of my report, this
  15     essentially puts in graphical form the data that
  16     Professor Murray showed in one of his slides. This is
  17     really a descriptive analysis. There is no, for
  18     instance, statistical inference going on here, but it
  19     enables us to compare what HES and the Cardiac Register
  20     is saying about open operations on children aged less
  21     than 1 year old, compared with in the three epochs.
  22        The fourth one, which is just the 9 months from
  23     April 1995 to December 1995. For the Cardiac Register,
  24     since it comes in units of a year, this epoch actually
  25     includes up to March 1996, which is strictly speaking
0157
   1     outside the Inquiry's period, but you have to include
   2     that.
   3        This demonstrates, I think, what Professor Murray
   4     said in his presentation, that while back in epoch 1,
   5     which is 1985 to 1987, for CSR, the mortality rates
   6     between Bristol and the rest of the country really are
   7     quite simple. This is what has been reported to the
   8     Cardiac Register since then, in the rest of the country
   9     is a steady decline in mortality. What apparently has
  10     happened in Bristol -- it does seem rather
  11     superficial -- is that this decline has not occurred, it
  12     has stayed at a fairly constant level, showing an
  13     increasing divergence with the rest of the country.
  14        What is quite nice here in being able to put both
  15     the Cardiac Register data and the HES data together is
  16     that in epoch 3 and epoch 4 we can see that really there
  17     is quite good agreement between these two sources, first
  18     of all on what was happening in the rest of the country
  19     in epoch 3 and epoch 4, and here this is the HES and
  20     Cardiac Register in Bristol, fairly good agreement with
  21     what was going on in Bristol in epoch 3 and epoch 4.
  22        The rather simplistic analysis we ought to be
  23     cautious about is that between epoch 3 and epoch 4,
  24     after March 1995, there was a decline in mortality.
  25     I would like to come back to that in a moment.
0158
   1        If I could go on to 7.2.2, at the bottom of the
   2     page, which shows the picture in children aged more than
   3     1 year old, we can see again this is now with actual
   4     lower mortality rate, but a steady decline in the rest
   5     of the country, and again, while Bristol appears to be
   6     rather typical in the early part of the period of the
   7     Inquiry, than in the intermediate time, although to
   8     a slightly lesser extent than in the younger children,
   9     there has been a divergence of performance which has
  10     declined in epoch 4.
  11        I would like to point out the level of agreement
  12     between the two data sources between what was going on
  13     in the rest of the country and what was going on in
  14     Bristol.
  15        There has been caution expressed about
  16     interpreting this apparent decline into epoch 4, because
  17     as Professor Evans mentioned, there is a possibility
  18     that the pattern of operations changed and I have done
  19     a little bit of analysis that is not in my report but
  20     will now be available.
  21        If we look at page 113, if we can get all of that
  22     on the screen at once, what this does is show, in the
  23     epoch 3, using the HES data, what the profile of
  24     operations was in terms of the number and in terms of
  25     the survival rate. Cases that unfortunately died are
0159
   1     coloured black and survivors are coloured grey. We can
   2     see immediately which were the more common operations,
   3     the fact that the high mortality rates are being
   4     observed under switches, TAPVD, AVSD and truncus
   5     operations in particular, which are also very much the
   6     rarer operations.
   7        If we look now at what was going on, just in the
   8     9 months after April 1995, clearly the scale is a lot
   9     smaller and so there is more variability attached to
  10     these numbers, but basically, the profile, in terms of
  11     the type of surgery being carried out, is broadly
  12     similar except if we look in these particular areas down
  13     here. There were no TAPVD operations recorded, only
  14     a couple of switches and one or two truncuses, so
  15     although there is a very low mortality rate, there is
  16     definitely a decline in the amount of the more severe
  17     high risk surgery being carried out.
  18        If we jump to 115, we can see whether the Cardiac
  19     Register is telling the same story or not, if we could
  20     blow that up.
  21        This is the Cardiac Register. Again, we see an
  22     agreement between the Cardiac Register and what has been
  23     reported to the Cardiac Register and HES, we see a very
  24     similar picture. The profile is rather similar, except
  25     that we can see a big decline in the number of switches,
0160
   1     the TAPVD, the proportion of switches, the TAPVDs none,
   2     the AVSDs being carried out.
   3        So again there is a low mortality rate, but the
   4     high risk operations are not being done in that period.
   5        So my feeling is, although there is an apparent
   6     strong decline after April 1995, I would not be happy to
   7     make absolutely firm conclusions on the basis of that,
   8     unless the period after April 1995 could be extended.
   9     If we could look at data which would now be outside the
  10     Inquiry's remit, of course, but I would not be happy to
  11     compare the mortality rates in this profile unless these
  12     numbers were considerably larger, because it is clear
  13     that the type of operations over that period did change.
  14        I suppose I am expressing considerable caution
  15     about over-interpreting the decline after April 1995.
  16   Q. But you suggest that it might be one way of assessing
  17     what happened before 1995 and before, to discover what
  18     has happened in 1996/97?
  19   A. I think if that was a straightforward exercise, it would
  20     be valuable to increase the sample size in that period,
  21     because that is telling one something about the
  22     institutional factors and about the population that
  23     Bristol is serving, which presumably will not have
  24     changed over these periods.
  25        If I could go on now to what -- these are really
0161
   1     just descriptive analyses. Now we get more tricky. If
   2     we go to page 108 which is a summary taken from my
   3     report of the strategy that we carried out, you have
   4     heard a considerable amount about this already, but I am
   5     afraid I am going to talk about it yet again.
   6        The basic idea is to try to investigate whether
   7     Bristol could be considered as divergent from the
   8     remainder of the units in the country: in which case, in
   9     which regard?
  10        Our basic strategy has been to remove Bristol from
  11     the database. Can I draw on this?
  12   Q. Yes.
  13   A. Remove Bristol from the database, and then analyse what
  14     was happening in the remainder of the centres. In
  15     particular, calculate the activity and mortality rates
  16     in the other centres, and then, within each stratum,
  17     which just means a particular set of patients with
  18     a particular operative procedure, a particular age
  19     group, particular epoch of operations, so this is the
  20     subgroup we want to look at individually, work out what
  21     was going on in the remaining centres, and essentially,
  22     characterise what would be considered a typical
  23     performance and the variability around that performance.
  24   Q. We may be used to the word "average". What is the
  25     difference between "average" and "typical" in this
0162
   1     context?
   2   A. I have avoided the word "average" because you then would
   3     tend to think of perhaps an average patient as if there
   4     was a great homogenous pool of patients out there. By
   5     using "typical" it is much more thinking of the fact
   6     that there are 11 other centres, we do not expect them
   7     to be exactly the same, they are bound to differ in ways
   8     we cannot measure, even if we could adjust for case mix,
   9     so we are always going to expect some variability. What
  10     we want to know is whether Bristol could be considered
  11     as typical of the sort of variability that inevitably
  12     exists between centres.
  13        So what we can do, then, is to obtain an interval,
  14     a confidence interval, an uncertainty interval, for what
  15     would be the true underlying performance of a typical
  16     centre, that is almost as if a centre had been drawn at
  17     random from the other 11, and compare that with what
  18     appears to be the true performance in Bristol.
  19   Q. And again, true performance is not the actual observed
  20     performance; it is an idea that there is, underlying
  21     that, some true figure from which each year there may be
  22     a random departure?
  23   A. Exactly. It is a slightly hypothetical idea. It is
  24     almost as, if they were to do a million operations, what
  25     would be the eventual long-term mortality rate?
0163
   1     Obviously, that is not realistic, but we can think of
   2     it, to use the rather unfortunate coin-tossing analogy,
   3     if each operation is if you were tossing a coin, one
   4     would observe the theory of heads or tails but when
   5     trying to assess the bias of the coin, it is reasonable
   6     to think of what is the true likelihood that it will
   7     come up heads. We can do that with an interval
   8     elsewhere and in Bristol and compare those graphically.
   9     We will see those in a minute.
  10        Then we will get on to the next stage, which we
  11     have already heard about today, to do with this
  12     unfortunate term, excess number of deaths. At stage 5
  13     we can say that with Bristol's activity, were Bristol to
  14     be essentially representative of the centres in the rest
  15     of the country, were it to be essentially
  16     indistinguishable, how many deaths would we expect to
  17     occur? We do not know how many deaths would have
  18     occurred had Bristol been really similar to the other
  19     centres in the country so that is an unknown quantity,
  20     but we can make essentially a prediction of what we
  21     would expect to have occurred in terms of the number of
  22     deaths.
  23        Since we know how many deaths actually occurred,
  24     we can subtract from the observed the number that were
  25     actually expected to occur were Bristol typical and call
0164
   1     that the excess deaths. We can call it the difference
   2     in deaths, but typically in statistics, something that
   3     can be derived from the expected would be called an
   4     excess. That excess could very well be called negative
   5     if in fact less deaths were observed than would have
   6     been expected were Bristol to be a typical centre.
   7        But since we have an interval on the number of
   8     excess deaths, we can calculate the probability that the
   9     true underlying excess, in other words, having an
  10     enormous number of operations, is actually greater than
  11     zero. These figures will give us some idea so we can
  12     say we are 99 per cent sure that really there was an
  13     excess mortality, in other words, there is only less
  14     than 1 per cent chance that were Bristol typical, we
  15     would have observed such an extreme number of deaths.
  16        So we are looking for figures like 95, 99 per cent
  17     certainty or confidence that there really is excess
  18     mortality.
  19        We can also do a ranking exercise where we see
  20     within that particular subgroup of patients where
  21     Bristol was, was it at the bottom, was it second to
  22     bottom or whatever, and rather than just quoting where
  23     it lay in the rank, we can give an interval around that,
  24     to say whether we are confident that Bristol was where
  25     it was in that range of the centres.
0165
   1        Finally, what we have done is repeat this entire
   2     analysis for every centre in turn, done a totally
   3     symmetrical analysis, because we felt we would not be
   4     able to say anything about Bristol's degree of
   5     divergence unless we could also say something about the
   6     divergence of all the other centres, so we have repeated
   7     the entire analysis 12 times.
   8        That is why I am afraid my report is filled up
   9     with a large number of stodgy tables which I have tried
  10     to extract the main things from.
  11        If we could jump to a table which I could talk
  12     through, because I would like to go through these
  13     tables. There is one table for each procedure group.
  14     Since the conclusions really depend on the very specific
  15     procedure groups, apart from the open and closed
  16     categories, it would be useful to go through these in
  17     turn.
  18        If we could have page 68 up, this is starts off
  19     looking at tetralogy of Fallot. If we could blow that
  20     up a bit. I hope this is readable as it stands.
  21     Essentially this page is trying to summarise all the
  22     information in both the Cardiac Register and the HES
  23     database that we have available about tetralogy of
  24     Fallot, so that is why it is rather dense, in trying to
  25     put everything we can on one page. If I could write on
0166
   1     this, to highlight some of this, to illustrate what
   2     I mean by analysis, this is data taken from 1984 to 1987
   3     from the Cardiac Register, on Fallot type operations
   4     from patients aged 1 to 15 years. First of all, we can
   5     just summarise what the mortality was in 11 other
   6     centres. That is 6 per cent, 24 deaths out of 404
   7     reported to the Cardiac Register.
   8        We can compare it in Bristol, they reported
   9     7 deaths out of 49 procedures, as 14 per cent. So it is
  10     over double the mortality rate.
  11        The little pictures after that, illustrate what we
  12     feel might be the true underlying mortality rate at
  13     Bristol, that is the solid line with the dot in the
  14     middle, which is round about 14 per cent but with quite
  15     a wide interval on each side and compare it with what we
  16     feel would be the mortality rate in a typical centre
  17     elsewhere in the country, which is round about 6 per
  18     cent but with quite a range as well. We are looking for
  19     divergence in those two intervals. If they do not
  20     overlap, then really that is pretty strong evidence we
  21     are not talking about Bristol -- it cannot be considered
  22     typical or representative of the other centres in the
  23     country. So we are looking for and we have a case here
  24     in the second epoch, 1988 to 1990, in the under 1 years,
  25     where there was a 10 per cent mortality in the rest of
0167
   1     the country, but out of the two cases in Bristol, both
   2     unfortunately died and in that case, really, we have got
   3     quite a strong confidence that Bristol is not as
   4     a centre representative of those other centres, only
   5     based on two cases.
   6        The next column actually tries to quantify the
   7     degree of divergence between Bristol and the rest of the
   8     country by saying how many deaths there were in Bristol
   9     in excess of what would have been expected had Bristol
  10     been typical. I think we can see there that out of
  11     these 49 deaths, had the mortality rate been around
  12     6 per cent, the national rate, we would have expected,
  13     instead of seven deaths, about 3, less than 3 and
  14     a half. So that means that our excess numbers of deaths
  15     is 3.6. I hasten to add again, as has been mentioned
  16     already, that these are not 3.6 identifiable
  17     individuals; this is purely saying that we would have
  18     expected round about 3.5 deaths were Bristol typical,
  19     and in fact we have observed 7.
  20        That 3.6 in itself, you cannot claim that that
  21     could not have been due to chance alone. The interval
  22     around that goes from 7 to minus 4. Essentially, we are
  23     looking for that interval to exclude zero before we
  24     could say really we are confident that that excess
  25     mortality was not due to chance alone. So essentially
0168
   1     getting 7 deaths out of 49 is not totally incompatible
   2     with having a 6 per cent mortality. There is excess,
   3     but we cannot be confident that it is real and it really
   4     reflects an increased risk in Bristol.
   5   THE CHAIRMAN: Dr Spiegelhalter, if I may interrupt for
   6     a moment, can you slow down sometimes just a little?
   7     Our stenographer has to ensure we have a complete
   8     record, and sometimes you have to slow down.
   9   DR SPIEGELHALTER: I will do my best, I am sorry.
  10   THE CHAIRMAN: Not at all?
  11   A. I assure you, I am not going to go through every line on
  12     every table.
  13        Essentially I have been going through this line,
  14     and then pointing directly at results.
  15        This column here, which is the probability that
  16     the excess is greater than zero, which one can think of
  17     as a probability that there really is something
  18     different, 88 per cent would not usually be considered
  19     sufficient to warrant very long evidence. Okay, there
  20     is an excess, but we are looking for 95/99 per cent
  21     really before we can be confident that there is
  22     something going on.
  23        Similarly, although in terms of its mortality
  24     Bristol ranks 10th out of 11 centres, there was only one
  25     in that particular time period, age group and for
0169
   1     Fallot, there was only one centre in the country that
   2     had a higher mortality than Bristol. Bristol is
   3     observed to be 10th or second from bottom, but the
   4     95 per cent interval goes between 5th and 11th, so
   5     essentially all we can do is say we are confident it is
   6     in the bottom half of the centres in the country, but we
   7     cannot say it too much stronger than that.
   8        So that essentially is warning us not to take too
   9     much attention of something just because it is bottom.
  10     Someone has to be bottom, so we are looking for
  11     consistency of pattern and we are looking for really
  12     strongly divergent behaviour.
  13        When one is confronted with a page like this, all
  14     these comparative pictures are based on different groups
  15     of people. This is the first epoch, the two age groups,
  16     this is the second and this is third, 1991 to 1995 in
  17     the two age groups. We are looking for consistency of
  18     pattern. In other words, is the line from Bristol
  19     consistently lying on the left of the line from the
  20     other centres?
  21        We can see that while each stratum on its own is
  22     not too exciting, although in fact on this particular
  23     group here this really is, one can be 99 per cent sure
  24     that this is not due to chance, that there really is
  25     something different in the performance. When you are
0170
   1     getting a similar pattern across different time periods,
   2     or age groups, it is only reasonable to accumulate them
   3     and look at what is happening in larger groups.
   4        So, for example, at the limit we can aggregate
   5     over all the time periods and all the ages, so this line
   6     here, which I will just highlight, is essentially adding
   7     up over all these six subgroups here.
   8        This says, the Cardiac Register data on Fallot
   9     operations reported double the mortality in Bristol as
  10     elsewhere in the country, and this represents an excess
  11     deaths of 11.4 out of 22 estimated to be about half, and
  12     the interval for that goes from 0 to 19, so essentially
  13     we can be 97 per cent sure that that excess is not due
  14     to chance alone but there really is excess mortality in
  15     Bristol compared with whether it is a typical centre.
  16        That is essentially by aggregating, we can say and
  17     it is one of the things I say in the report, if we do
  18     aggregate over the Cardiac Register, we find that there
  19     is reasonable evidence of excess mortality in Fallot
  20     operations.
  21   MR LANGSTAFF: Can I take you up to the line immediately
  22     above that one? You said a moment ago that looking at
  23     the excess deaths, one would be looking for a figure
  24     which would start no lower than 0 in order to see that
  25     there was a sufficient probability of an excess.
0171
   1        There we have a minus 1 figure, and then, on the
   2     right-hand column, in the probability of excess, we have
   3     0.96, which I had understood from what you were saying
   4     represented a 96 per cent exclusion of chance as
   5     relating to the probability of an excess?
   6   A. Yes. One would not like to draw an absolute line of
   7     what is in quotes a "significant" excess and what is
   8     not. The point is, it is a 95 per cent interval, so you
   9     can think of it as 2.5 per cent on each side. If the
  10     bottom end of the interval is just 0, that really means
  11     there is a 97.5 per cent chance that actually the excess
  12     is positive. So slightly below 0 will correspond to
  13     something like 96/95 per cent.
  14        I do not want to make an absolute strict
  15     division. I would prefer to look for things really up
  16     high, 98/99 per cent before getting too confident.
  17   Q. If you are looking for 98/99 per cent, what about
  18     looking for 97 per cent on the bottom line?
  19   A. That would be classically considered a significant
  20     result, but the bottom end is just on 0. It is a 1 in
  21     40 chance that -- the fact there is now excess
  22     mortality.
  23        So some caution on that, and I hope I have been
  24     suitably cautious on the conclusions.
  25        One is also looking not only for consistency
0172
   1     within the data sources but consistency across the data
   2     sources. The HES data is only available for the last
   3     epoch, so it is very useful to compare what was going on
   4     with HES in the last epoch with what was going on pooled
   5     over all the ages in the Cardiac Register for the last
   6     epoch, so that line and that line are often a very
   7     useful thing to compare. We can see in this case there
   8     is really quite strong agreement between what the
   9     Cardiac Register was reporting and what HES was
  10     reporting in the Fallot operations.
  11   Q. I think you may have said "last epoch". I think you may
  12     have meant the third?
  13   A. Yes, I am not using epoch 4 at all. All my analyses are
  14     1, 2 and 3 from now on.
  15        What I have been trying to look for as I have gone
  16     through these tables is to look for where the Cardiac
  17     Register and HES can be directly compared looking for
  18     consistency, in other words, corroboration between these
  19     two sources, and then also looking for consistency in
  20     the Cardiac Register across the periods of time, in
  21     particular, in epoch 2 and 3, when we feel there was an
  22     excess.
  23        What I would like to do, I hope without being too
  24     pedantic, would be to go through the procedure groups
  25     where there is some consistent evidence of interest. Is
0173
   1     that all right?
   2   Q. Yes, certainly.
   3   A. If I could jump to 70, this is the group 3, the
   4     inter-atrial -- I am sorry, the arterial solution for
   5     transposition of the great arteries, so these are
   6     supposed to be the switch operations.
   7        The problem with this is that we have already
   8     heard that the Cardiac Register -- I will just look at
   9     the -- it will be useful to look at the pooled Cardiac
  10     Register data. We have already heard that the Cardiac
  11     Register does not code this group well; that the earlier
  12     Mustard and Senning operations have been coded up as
  13     corrective procedures and therefore are all mixed in.
  14        We find, looking at the Cardiac Register, in this
  15     group we do not find a distinction between Bristol and
  16     the rest of the country, except that there is perhaps
  17     some evidence in the more recent era -- this is perhaps
  18     interesting -- where most of these will be switches
  19     now. We do see some evidence of excess mortality in the
  20     under 1 years, 13 per cent in the rest of the country,
  21     28 per cent in Bristol, in the under 1 years.
  22        In the HES data, this is the data down here, we
  23     see a much stronger pattern, partly due to their ability
  24     to divide up the age groups into under 90 days, 90 days
  25     to 1 year, and 1 to 15 years. Dr Aylin has already gone
0174
   1     through this data in detail, but I think it illustrates
   2     quite nicely -- "nicely" is not perhaps the right
   3     word -- the divergence between these two intervals.
   4     This is the national pattern elsewhere and the pattern
   5     in Bristol, with 90 per cent mortality, 9 out of 10
   6     deaths.
   7        When that is accumulated, there is apparently some
   8     excess, although very marginal in the other two age
   9     groups, and when accumulated over all ages for switch
  10     operations in the HES database, there were 11 deaths and
  11     they would -- in a typical centre I would only have
  12     expected 2, so one can estimate the excess deaths as 9,
  13     with an interval really well away from 0. I can be
  14     confident about that. The 1 should perhaps be 1.999,
  15     but it would not fit on the page.
  16        So there is an area where there is limited
  17     consistency between CSR and HES, but where we know the
  18     CSR has a lot of difficulty in the coding.
  19        If I could go on to group 4, TAPVD, page 71, this
  20     is really under 1 years of the - where these have been
  21     carried out, so that is the area of interest.
  22        We can see in CSR, if we look at these pictures,
  23     in the under 1 years, that is the top line, a consistent
  24     pattern of excess mortality over that observed in the
  25     other centres, over each era, each epoch in turn. So in
0175
   1     the first one, 19 per cent versus 37; 13 versus 45; 14
   2     versus 33. So that suggests that is cumulating over the
   3     whole period of the cardiac register, in under 1 years,
   4     of what mortality might be. We find that the difference
   5     between 15 per cent and 38 per cent, and here we have
   6     excess mortality of 9.9 out of 17, with an interval that
   7     confidentially excludes 0.
   8        We want to see if we have consistency between HES
   9     data and the Cardiac Register, so perhaps it is good to
  10     compare what was going on in 1991/95 in HES with what
  11     was going on in 1991/95 in Bristol, reported to the
  12     Cardiac Register, and we find very strong consistency
  13     between these two sources in terms of the excess
  14     mortality.
  15        So there is a situation there where one seems to
  16     have strong consistency between the sources over the
  17     period which are being measured, consistency in the
  18     Cardiac Register across the time periods. So the TAPVD,
  19     which is in the area where coding is quite clear, comes
  20     out as an area where we can be really quite confident of
  21     excess mortality.
  22        I think that is not unrelated to the fact that we
  23     have seen, looking right back at comparing the sources,
  24     that TAPVDs were the areas with the best agreement,
  25     which also indicates consistency of coding.
0176
   1        If we could go on to AVSD on page 72, here, if
   2     I could highlight, if we look at the pictures, we can
   3     again see a very consistent pattern, both in all the age
   4     groups and in all the periods. We are looking all the
   5     time to see whether the apparent mortality in Bristol,
   6     allowing for the uncertainty, is separate from the
   7     mortality in other centres.
   8        That kind of consistency again suggests perhaps
   9     looking directly at data pooled over the entire Cardiac
  10     Register period, and there we see mortality in 32 per
  11     cent AVSD, 14 per cent in other centres, so we are
  12     estimating about half of these deaths were in excess of
  13     what would have been expected in a typical centre, and
  14     again at intervals well away from 0. I am very
  15     confident that this is not due to chance.
  16        If we look at some consistency in the HES data and
  17     look at what was going on in the third epoch compared
  18     with what the Cardiac Register was reporting in the
  19     third epoch, we see again, while the agreement is not
  20     wonderful, it is reasonable good agreement between these
  21     two sources of information about what was going on
  22     between 1991 and 1995.
  23        So, again, both sources point to a very confident
  24     conclusion that mortality was in excess of what would
  25     have been expected.
0177
   1        As I said, I will not go through all the 13 like
   2     this. I am just pointing out the ones I think show the
   3     clearest patterns and the ones I would feel most
   4     confident about drawing some conclusion from.
   5        If we go to 73, the closure of ASD, here again we
   6     are seeing in the under 1s a consistent pattern across
   7     the three epochs of what has been reported to the
   8     Cardiac Register of mortality in excess of the remaining
   9     centres. So if we look at over the period 1984 to 1995
  10     under 1 years, although this is only based on 10
  11     operations, we see a 50 per cent mortality compared with
  12     6 per cent in the rest of the country, which again
  13     suggests, although in not a large number of operations
  14     at all, that the mortality is in excess.
  15        For 1 to 15 years, there is no evidence at all
  16     that there was any problem whatsoever.
  17        Again, we would want to check, compare that data
  18     in the under 1 years for what -- one would want to
  19     compare consistency across the sources so one would want
  20     to compare that evidence from the Cardiac Register with
  21     this evidence from HES. We can see there is not
  22     a wonderful agreement there, but they are pointing
  23     definitely in the same direction. The agreement is not
  24     tremendous.
  25        So the ASD findings I think must be more
0178
   1     tentative, but there is really evidence in here and in
   2     here of excess mortality just in the under 1 years.
   3        I am going to go through VSD, which there is no
   4     evidence of any difference, but I would like to go on to
   5     75, the truncus operations. That is of considerable
   6     interest.
   7        Following the way in which I have looked at these
   8     things before, one is looking for consistency of
   9     evidence across time periods and age groups, and here,
  10     although I think I could just look over the whole of the
  11     Cardiac Register data, although the mortality rate
  12     reported to the Cardiac Register is 9 out of 19, 47 per
  13     cent, the crucial thing is that the mortality rate
  14     reported by the rest of the country is very high as
  15     well.
  16        We can see -- I think this is quite an informative
  17     thing to look at, is what was being reported in the rest
  18     of the country over the time period. We see a mortality
  19     rate of around 50 per cent, which has now apparently
  20     dropped in the more recent period from 1991 to 1995, but
  21     still very high; the point being, and this is reflected
  22     as well in the HES data, that this is an area where,
  23     although there is apparently excess mortality in
  24     Bristol, there is observed to be a higher mortality
  25     rate, one really cannot be confident that is not due to
0179
   1     chance alone. The numbers are too small and the
   2     mortality in the rest of the country is so high that
   3     truncus is something where we would not want to be
   4     confident that there was -- we would not like to be
   5     dogmatic that there was excess mortality there.
   6        I would now like to jump to just the open
   7     operations, 81.
   8        Again, we are looking for consistency across the
   9     time periods. We can see that in the under 1 years, not
  10     so much in the first epoch, but really in the first
  11     epoch, as we saw in those early graphs, really Bristol
  12     could be considered as representative. But the
  13     divergence increases over time in the second and the
  14     third epoch, with the result that in the second two
  15     epochs we have some evidence of excess mortality in what
  16     is reported to the Cardiac Register. These intervals
  17     include 0.
  18        I am sorry, I should not be looking at all ages,
  19     it is the under 1 years that are important. If I look
  20     at this one, if I look at over the whole period, under 1
  21     year, then the mortality rate of 26 per cent reported to
  22     the CSR as compared to 16 per cent, and that represents
  23     excess mortality of 29.6 out of 90, and we can be
  24     borderline confident that that is not due to chance
  25     alone.
0180
   1        In fact in the first epoch there is no evidence of
   2     any difference, so the effect -- this confidence is
   3     slightly damped down by that first epoch.
   4        In the HES data, as we have already heard from
   5     Dr Aylin, in the under 1 years there really is quite
   6     strong evidence of excess mortality, and over all age
   7     groups, if we just pool over all age groups, we are
   8     seeing out of 62 deaths in open operations in 1991 to
   9     1995, one would only have expected round about 30 were
  10     Bristol to be typical. So we are getting an interval
  11     well away from 0 in this situation.
  12        We should note that the ranking for this,
  13     particularly the under 90 day mortality, we can actually
  14     be pretty well confident that Bristol was the worst, had
  15     the highest mortality in the country at that time.
  16   Q. In the next category, one can be confident that it was
  17     either the worst or the second worst?
  18   A. Yes. Unless there is something I could clarify, if
  19     I could stop going through all those -- I have gone
  20     through the operations where I feel there is
  21     a consistent pattern which I summarise at the end.
  22   Q. Others can look at the charts and the tables you have
  23     produced?
  24   A. I feel apologetic about the density of these tables.
  25     I just hope I have explained slightly how to read them.
0181
   1   THE CHAIRMAN: I would interject that no apology is called
   2     for. I find them very clear.
   3   A. Thank you very much. If I could then jump to the
   4     delicate issue of the fact that this entire analysis has
   5     been repeated for all the other centres and if I could
   6     then look at 99. If we could blow that up a little
   7     bit. What this looks at is HES data, open operations in
   8     epoch 3, broken up into the three age groups. Now we
   9     are actually seeing, centre number 1 is Bristol, so this
  10     is the data we have seen before. But, instead of all
  11     the rest being lumped together into one great elsewhere,
  12     we are now seeing the mortality rates in the individual
  13     centres.
  14        The reason for this is because we felt that we
  15     were not looking for divergent -- it would not be
  16     considered divergent performance if Bristol, or any
  17     centre, is just sitting on the outside of the
  18     distribution inevitably that exists. We are looking for
  19     something that even allowing for the interval around it
  20     is substantially way from the bulk of the other
  21     centres. For under 90 days and 90 days to 1 year, while
  22     there is rather good homogeneity between the remaining
  23     centres, we have centre number 1 sitting really
  24     substantially outside those intervals. That separation
  25     is reflected in the confidence with which we can state
0182
   1     that there was excess mortality.
   2   Q. If one looks at the left-hand table, and goes back to
   3     the question which I asked you earlier about the size of
   4     the centres, there is some suggestion, is there, that
   5     the smaller centres are lagging behind the larger
   6     centres?
   7   A. That is a standard analysis to do, and I have to say we
   8     have not done it. I would say very quickly, looking at
   9     the left here, you are right. If we look at the biggest
  10     centres, centres 11 and 8 and 3, they are the centres
  11     with the lowest mortality rate. The smaller centres,
  12     centre 4 and 1 and 5, 9 and 10, are the centres with the
  13     highest mortality rate. It seems to me that under 90
  14     days picture in particular is a very strong indication
  15     of the relationship of volume with mortality rate.
  16        We have not carried out a confirmatory analysis on
  17     that. In that particular instance, that seems to be the
  18     case.
  19   Q. If we find that picture is repeated across the board, it
  20     may actually provide some of the evidence we have been
  21     told elsewhere in this Inquiry has been looked for
  22     because it does not exist in this country, that small
  23     size may be inimical to good performance, or best
  24     performance?
  25   A. I think that data such as this could stand further
0183
   1     analysis in that regard, yes.
   2        This is looking at the HES data. If you go to
   3     page 101, it is repeating this for the Cardiac Register
   4     data.
   5        If we look up the two left-hand ones, I wonder if
   6     those could be made any bigger. That is looking at the
   7     first epoch. The point about these, the first epoch is
   8     when the Cardiac Register really did not report that
   9     Bristol was in any way different from the others. I do
  10     not want to make a strong interpretation of these,
  11     except a point perhaps already at centre 10 that in the
  12     1 to 15 year old open operations has a mortality rate of
  13     15 per cent compared with 8 per cent in the whole
  14     country.
  15        If we go on to the next column of the two graphs,
  16     the next bit along, if we go to the centre column, what
  17     has been reported to the Cardiac Register in epoch 2,
  18     between 1988 and 1990, now we are seeing Bristol with
  19     quite high mortality being reported, but not sitting out
  20     on its own at all. This is the kind of analysis that
  21     Professor Murray did showing that, although Bristol is
  22     in this instance essentially on the edge, we cannot be
  23     confident that it is really beyond the distribution of
  24     the others. I think perhaps we might point out again
  25     centre 10 in the under 1s and in the over 1s.
0184
   1        If we go on to the last epoch, the cardiac
   2     register, the final column of that picture, again it is
   3     only in the under 1s where Bristol, now really looking
   4     in this final period, is looking really substantially
   5     different from the others. We have already been through
   6     all that. Now I would like to repeat again centre 10 is
   7     sitting out here on the 1 to 15 years.
   8        So, building on that, I would like to jump to
   9     page 97. There are pages of tables like this which are
  10     very dull. What they do say is that for each of these
  11     centres what was the apparent -- the estimated excess
  12     mortality in all these operation groups.
  13        This particular table combines all the cardiac
  14     register data from 1984 to 1995, but just in the age
  15     group 1 to 15 years.
  16        As I say in my report, when you search through
  17     these tables, you find very few -- I have given a star
  18     when we can be 99 per cent confident that the excess
  19     mortality is real, that there really is differing
  20     performance, so when that figure that was in the
  21     previous tables reached 99.
  22        This is in 1 to 15 years. What I would like to
  23     point out here is centre 10, where there really is
  24     apparently substantial excess mortality reported to the
  25     Cardiac Register in 1 to 15 years.
0185
   1        That is in the report, I just say, centre 10,
   2     there is some consistent evidence about centre 10.
   3     I have actually done a little bit more analysis of
   4     that. If I could jump to page 123, I have done
   5     a further analysis of what was going on -- what was
   6     being reported to the Cardiac Register between 1984 and
   7     1995 by centre 10 in age 1 to 15 years.
   8        If I could jump to down here, we can see that
   9     centre 10 was reporting in open operations a mortality
  10     of 14 per cent compared with 6 per cent over the rest of
  11     the country. That represents an apparent excess
  12     mortality of around 60, and we can be very confident
  13     that is real. That is the figure I showed earlier on.
  14        What I am attempting to do here is to see some
  15     sort of explanation of what the source of that excess
  16     mortality might have been. I think the very strong
  17     finding here is that, when we look at these 13 groups,
  18     there is no real evidence of excess mortality. These 13
  19     groups also only cover 485 of those operations compared
  20     with 747 open operations. What this is saying is that
  21     in centre 10 there are a large number of open operations
  22     being reported in the Cardiac Register with high
  23     mortality rates that do not fit in with the now 13
  24     consensus groups.
  25   Q. So if it were to be the case that Harefield did a large
0186
   1     number of transplant operations, you would not see it in
   2     1 to 13, but you might see it in the difference between
   3     485 operations conducted and 747 open operations in
   4     total conducted?
   5   A. Exactly, and this period from 1984 would cover the
   6     developmental period of paediatric transplantation,
   7     where I imagine, although I do not have the data, one
   8     would expect to have a high mortality rate.
   9   Q. I have chosen transplants because we know Harefield does
  10     them.
  11   A. When I did the analysis I had no idea what centre 10
  12     was, but when I did find out, that provided some of the
  13     explanation which I think we had taken into account.
  14        If we can just jump to page 127 -- I am nearly
  15     finished now; this is my second to last -- this is again
  16     looking in more detail at centre 10 what the HES data is
  17     saying about them, again from ages 1 to 15 years.
  18        The HES data again is reporting really
  19     a substantially increased mortality in open operations
  20     with, out of 35 deaths, they would only have expected,
  21     were they typical, about 12.5.
  22        In HES it is a slightly different picture to the
  23     cardiac register, in that these open operations really
  24     are substantially falling within the coded up groups.
  25     We see when we run up our eye along here that really
0187
   1     there is considerable evidence of excess mortality in
   2     Fallot operations, 19 per cent mortality, and
   3     considerable evidence in Fontan operations, 9 per cent
   4     failure, 27 per cent in excess is 6. It is a very
   5     difficult pattern to that observed in Bristol. I am
   6     very anxious I do not give the impression in my report
   7     that the apparent estimated excess mortality in
   8     centre 10 reflects a similar pattern to in Bristol. It
   9     is a very, very different pattern being observed.
  10        This again suggests that there needs to be
  11     considerably more investigation of the case mix in
  12     centre 10, in case they are accepting patients who have
  13     already been operated on, perhaps have been left rather
  14     late, and so on.
  15   Q. Could I raise one matter with you in relation to
  16     centre 10? You say in the body of your report -- I am
  17     not going to take you to it -- that there is consistent
  18     evidence from both HES and the Cardiothoracic Surgical
  19     Register that Harefield has excess mortality similar to
  20     Bristol.
  21        I think that may convey the impression that that
  22     is true of all outcomes?
  23   A. Yes.
  24   Q. The consistency is limited to the 1 to 15 year group?
  25   A. Yes, exactly. That is what I wanted to apologise for,
0188
   1     that rather perhaps poor wording. First of all, when
   2     I said "similar to Bristol", I meant actually the
   3     extent, the actual numbers, not the pattern. It is very
   4     different. 1 to 15 year olds seems to be in very
   5     specific areas.
   6   Q. There is in fact an inconsistency in the picture we get
   7     from HES on the one hand and the CSR on the other?
   8   A. Not in terms of the excess. Both report strong excess
   9     mortality in open operations. There are obviously a lot
  10     of operations not coded up on in the CSR that are
  11     contributing to high mortality.
  12        If I might finish, I had not realised how late it
  13     was, I am sorry.
  14        If you go to page 129, I have my final
  15     conclusions.
  16        Essentially, this is just summarising the
  17     findings, that these are imperfect data. Professor
  18     Murray has expressed strong doubts about the cardiac
  19     register's lack of agreed procedures. However, the 6
  20     sources on Bristol do agree well, especially in the
  21     areas identified by the analysis as being of primary
  22     interest. HES and CSR do show a reasonably strong
  23     degree of consistency, and in that exercise of going
  24     through the operations, particularly identified on open
  25     procedures on children less than 1, there is some
0189
   1     consistency. For switches, TAPVDs, AVSDs and, although
   2     rare, ASDs. Truncus operations could have been due to
   3     chance alone. In over 1s, Fallot and AVSD, there is
   4     some evidence of excess mortality there. No source can
   5     be considered as the truth; however the consistency
   6     suggests the findings reinforce each other.
   7        I would agree with the others, the fact this data
   8     is imperfect, the imperfections are not sufficient to
   9     cast serious doubt on the fact there is divergence of
  10     performance in fairly identifiable areas.
  11   Q. You take that a shade further on page 131.
  12   A. Yes, I am sorry.
  13   Q. It is the "but" at the end of the page. You are echoing
  14     there what Professor Murray said?
  15   A. Exactly. I have some experience as well of using risk
  16     adjustment procedures in analysis, and while they do
  17     explain a certain percentage of the variability, I would
  18     find it remarkable if they could explain a substantial
  19     amount.
  20   Q. Although it is late, I want to take a couple of minutes
  21     to ask you, and for that matter the other experts we
  22     have heard from today, whether, in the light of the
  23     substantial measure of agreement, despite their
  24     imperfections, between the data sources, it is necessary
  25     and if so to what extent it is necessary, to further
0190
   1     confirm the data that we have.
   2        What would you say to that?
   3   A. I am not sure. I have been pondering this. I am not
   4     convinced that actually finding out yet more precisely
   5     what happened in Bristol, even what happened elsewhere,
   6     is going to add a huge amount. I think it would be very
   7     good, as people mentioned, to validate the mortality
   8     being reported elsewhere, possibly by linkage with
   9     national mortality records.
  10   Q. Can that be easily, speedily and economically done?
  11   A. If you can identify the patients, we can tag them with
  12     the national mortality records reasonably easily and
  13     cheaply and that seems to me a valuable thing to do.
  14   Q. You have given your view. May I ask our panel here
  15     whether they take the same view, or whether they would
  16     differ?
  17   DR AYLIN: I think in the case of the CSR, because it is
  18     based on aggregate tables, you cannot track those down,
  19     but with HES, given that we have a date of birth, a sex
  20     and a postcode, it may indeed be possible to link it
  21     with the national death registration figures.
  22   MR LANGSTAFF: You are dealing with linkage. Is there any
  23     other need you would see to further confirm the data, or
  24     are you happy with what Dr Spiegelhalter has said that
  25     essentially we have extracted a surprising amount from
0191
   1     the data and need very little further.
   2   PROFESSOR EVANS: I think I would agree with that in
   3     general. I think that there is a possibility that with
   4     the very different age distribution of children seen in
   5     the different centres that there could be that age
   6     difference having some effect. I do not think it does.
   7     In order to be rigorous, if one is going to look at
   8     other centres, it is very important to see that the age
   9     at which children got their operation was similar and if
  10     it is not, as we think it is not, there needs to be some
  11     adjustment for that. It is very difficult. It seems to
  12     me if we are going to go to other centres it could be
  13     that the key thing and the sensible thing to do would be
  14     to do it in one or two high risk groups.
  15   MR LANGSTAFF: Professor Murray?
  16   PROFESSOR MURRAY: I agree completely.
  17   MR LANGSTAFF: With Professor Evans.
  18   PROFESSOR MURRAY: With Professor Evans. I think the
  19     mortality is the biggest hole in the argument and most
  20     easily tackled. I think as Professor Evans says, there
  21     are a number of high risk procedures that we could
  22     follow up. I think that would provide added value, at
  23     relatively little extra work.
  24   MR LANGSTAFF: Professor Campbell?
  25   PROFESSOR CAMPBELL: I would agree as well. I mean, one of
0192
   1     the big problems would be the very major source of work
   2     to go through this sort of data validation exercise in
   3     the six different sources for all 11 other centres,
   4     which I think is a possible source of bias, but I think
   5     it is a very small one. I think the amount of work
   6     required to do that would not be justified.
   7   MR LANGSTAFF: So essentially, the message you are giving is
   8     that there is some benefit in the two steps you have
   9     identified, but otherwise very little that is likely to
  10     be gained from any further in-depth examination?
  11     I thank you for that.
  12        Dr Spiegelhalter, we have not dealt with the five
  13     further bullet points you make at the top of the page as
  14     to where we go from here. This is where you were left
  15     in July when you said this is where we go, and now you
  16     are reprising.
  17   DR SPIEGELHALTER: The original strategy, as I pointed out,
  18     did envisage the possibility of really confirming the
  19     accuracy of findings and then seeking an explanation,
  20     and I suppose my feeling is that the picture is quite
  21     clear. I would like to see the mortality data
  22     validated. I would very much like to see a move towards
  23     explanation, which I think while there is some
  24     statistical input into that, we need a lot of clinical
  25     input and clinical insight and the attempts to relate
0193
   1     outcomes to the process of care in Bristol would seem to
   2     be vital.
   3   MR LANGSTAFF: You have identified various other things,
   4     which I think probably speak for themselves. We
   5     approach tomorrow, as you know, the beginning of looking
   6     at the clinical case review which has been conducted,
   7     and getting a view from that: is that the sort of
   8     clinical input that you have in mind as perhaps giving
   9     some clue towards an explanation as to why the figures
  10     are as they appear to be?
  11   DR SPIEGELHALTER: Yes, very much so and I think the aim of
  12     this exploratory exercise was largely to identify the
  13     areas that would be worth deeper study and I think in
  14     that sense it has been quite successful in narrowing
  15     down where attention ought to be focused.
  16   PROFESSOR EVANS: May I make a comment that some people have
  17     not understood some of the key things Dr Spiegelhalter
  18     has done.
  19        Dr Spiegelhalter has done something very clever,
  20     effectively with league tables. If you imagine
  21     a football league table -- I am sorry, I am not wishing
  22     to trivialise this, but in order to try and help people
  23     understand, if after 10 games nearly all of the teams
  24     have had 10 draws, and one team has had 9 draws and
  25     a win, so they have 12 points and one team has had 9
0194
   1     draws and a loss, so they only have 9 points, the
   2     difference between the top and the bottom of the league
   3     is clearly due to chance and somebody would see that
   4     very clearly.
   5        But if after 10 games there was one team that had
   6     had 10 wins, and one team that had had 10 losses, it
   7     would be very clear that there was a really big
   8     difference between those.
   9        What Dr Spiegelhalter has done is by doing
  10     something very clever, he has allowed for the fact that
  11     whatever we do, there will always be someone top and
  12     bottom and the really important question is, if Bristol
  13     were bottom, are they bottom by a small amount and could
  14     they have just as easily been in the middle or near the
  15     top, or not? That is the key statistical thing that has
  16     been done very cleverly, and which is able to sort that
  17     out. It means that a lot of the medical world out there
  18     also believes that Bristol was just bottom by chance and
  19     that it was just bottom of the league and they know that
  20     league tables are capable of misinterpretation.
  21        What Dr Spiegelhalter and what Nicky Best and Paul
  22     Aylin have done is to make sure that kind of
  23     interpretation of it merely being bottom of the league
  24     is not done. I am sorry to take your time --
  25   MR LANGSTAFF: Not at all. I am very grateful.
0195
   1   PROFESSOR EVANS: I do not know whether they would agree
   2     with that.
   3   MRS MACLEAN: I just wanted to go back to the point of the
   4     size of centres. Would it be of value to investigate
   5     that further? I think it might be helpful to us.
   6   DR SPIEGELHALTER: I think just using the data available,
   7     that could be investigated further. I think in a sense,
   8     that is going beyond what we were asked to do, because
   9     it is not specifically Bristol, it is the whole idea of
  10     performance, but I do feel that having gone through
  11     enormous data together, it could be mined for
  12     considerably more.
  13   MR LANGSTAFF: Sir, can I thank all our participants today.
  14     Can I say, I am conscious of the hour and that there may
  15     be a number of those who have watched and listened today
  16     who may have questions to ask. If I can remind them of
  17     the procedure which is open in this Inquiry, for the
  18     Inquiry itself to ask further questions of witnesses, in
  19     writing, orally if need be, in order to clarify or
  20     expand, and if it seems to the legal team that you, the
  21     Panel, would benefit by an expansion, an exploration, or
  22     even a contradiction of what has been put before you
  23     today, that we would be happy to ask the appropriate
  24     questions of the appropriate or indeed all experts, and
  25     I am sure that they would be happy to answer in a way
0196
   1     which would help the Inquiry.
   2   THE CHAIRMAN: Thank you, Mr Langstaff. If we are coming to
   3     the end of the day, I just wanted to say a couple of
   4     things.
   5        We have seen today another piece of the jigsaw.
   6        I will make one comment if I may: data such as
   7     that which we have seen today explains nothing. It
   8     suggests, however, that we should seek an explanation,
   9     or explanations. Thus, it would be quite wrong yet to
  10     draw any firm conclusions about any individual or in
  11     general. We still have some work to do and some way to
  12     go and tomorrow we take the next step, taking I think
  13     quite properly the advice that we see in
  14     Dr Spiegelhalter's paper, that some clinical insights
  15     would now be valuable. That is indeed what we will see
  16     tomorrow: another piece of the jigsaw. But before
  17     I close today's proceedings, I must praise and thank the
  18     expert panel, both our statisticians and the clinicians
  19     who have sat quietly today but I know have been so
  20     important in the background.
  21        I notice in your papers that you have also
  22     generously thanked members of the Inquiry's Secretariat,
  23     and we echo our gratitude to all of them, especially
  24     Ruth Chadwick and Liz Baldock, and the whole of the
  25     Inquiry's Secretariat. They are the backroom girls,
0197
   1     rarely noticed and we praise them and thank them.
   2        The Inquiry, indeed, I would venture to suggest,
   3     the public generally, owes you, our experts, a great
   4     debt of gratitude. You have performed an outstanding
   5     service and you have done so brilliantly, if I may say
   6     so. May no-one underestimate the challenge you face,
   7     not least because of the time-scale we impose.
   8     I express our thanks.
   9        One further point, more generally. I have from
  10     time to time asked for patience as the Inquiry has
  11     proceeded. What we have seen today demonstrates the
  12     need for that patience. If we are to do our job
  13     properly, do our duty to the public, we must, amongst
  14     other things, do it thoroughly. Only in that way, will
  15     we be fair to everyone.
  16        You have seen today what "thoroughness" means.
  17        A remark, you have also seen what we meant when we
  18     committed ourselves to carrying out our role openly and
  19     in public. What we have sought to do today is to
  20     explain what to many is a mystery: we have sought not to
  21     sustain the mystery and keep it to ourselves. Good
  22     afternoon to you.
  23   MR LANGSTAFF: Before we part company and in the light of
  24     making it completely transparent what material is before
  25     the Panel in terms of statistics, I should say, perhaps
0198
   1     I should have said earlier, that today there has been
   2     published a witness statement of Professor Robert Curnow
   3     of the Department of Statistics of the University of
   4     Reading which is available as WIT 361 as to the
   5     contribution that statistical evidence can make to the
   6     Inquiry. Because of his position in the Statistical
   7     Society of Great Britain, he is particularly well placed
   8     to give that advice.
   9        WIT 375, Mr Nigel Bell of the National Health
  10     Service information authority, dealing with the process
  11     and nature of the coding exercise; and as I mentioned
  12     earlier, but did not identify by reference, INQ 18,
  13     a paper from the Inquiry Secretariat from Ruth Chadwick
  14     and her assistants to whom you have referred, who have
  15     explained, so that it is clear and transparent, the
  16     process by which the various groupings and rankings were
  17     reached.
  18   THE CHAIRMAN: Thank you for reminding us of that. Good
  19     afternoon again.
  20   (5.06 pm)
  21       (Adjourned until 9.30 am on 4th November 1999)
  22
  23
  24
  25
0199
   1                I N D E X
   2
   3
   4     INTRODUCTION TO TODAY'S EVIDENCE ................... 1
   5
   6     PROFESSOR MICHAEL JOSEPH CAMPBELL (sworn)
   7        Examined by MR LANGSTAFF ..................... 27
   8
   9     PROFESSOR STEPHEN EVANS (sworn)
  10        Examined by MR LANGSTAFF ..................... 42
  11
  12     DR PAUL AYLIN (sworn)
  13        Examined by MR LANGSTAFF ..................... 72
  14
  15     PROFESSOR GORDON MURRAY (sworn) .................... 116
  16        Examined by MR LANGSTAFF ..................... 116
  17        Examined by THE PANEL ........................ 145
  18
  19     DR DAVID SPIEGELHALTER (affirmed)
  20        Examined by MR LANGSTAFF ..................... 147
  21
  22
  23
  24
  25
0200

Published by the Bristol Royal Infirmary Inquiry, July 2001
© Crown Copyright 2001