The Bristol Royal Infirmary Inquiry Logo


bullet list decorationHome Page

bullet list decorationSearch

bullet list decorationFinal Report

bullet list decorationInterim Report

bullet list decorationEvidence

bullet list decorationInquiry Seminars

bullet list decorationAbout the Inquiry

bullet list decorationHelp

Seperator Bar

 

Hearing summary

 

12 July 1999

 

Inquiry hearings today heard evidence from Mr Hugh Ross, current Chief Executive, United Bristol Healthcare NHS Trust (UBHT) and Mr Ian Barrington, General Manager, Directorate of Children’s Services, UBHT. Issues under discussion today were post-mortems and inquests. Mr Ross began by saying that procedures relating to post mortems (PMs) at UBHT during the period of the Inquiry were consistent with elsewhere in the NHS. He went on to describe the difference between a hospital PM and a Coroner’s PM and explained the responsibility of clinicians to obtain informed consent. He also discussed the issue of retention of tissue following PM and the keeping of records in relation to this. Mr Ross then described UBHT’s reaction to the disclosure of information to parents about their child’s PM and the co-operation of the Bristol Heart Children Action Group in this matter.

 

Mr Ian Barrington then went on to describe his responsibility in relation to PM, saying that he would have only been involved if a concern had been brought to his attention. He then talked about his role in dealing with requests from parents for information about their child’s PM. He described the setting up of a help line and subsequent investigations assisted by Professor Berry (Consultant Paediatric Pathologist) to deal sympathetically and swiftly, but as accurately as possible, with the queries from parents. He concluded by discussing the information surrounding PM that is now discussed with parents following bereavement.

 

FULL TRANSCRIPT

     1                                          Day 38, 13th July, 1999
     2      (9.45 am)
     3      THE CHAIRMAN:  Good morning, everyone.  Good morning,
     4          Mr Langstaff.  Today we have a somewhat different
     5          configuration of the Inquiry hearing chamber.  This is
     6          in conformity with our desire to use whatever method we
     7          can so as to enable us to understand the matters into
     8          which we are enquiring.
     9                Mr Langstaff, you have the floor.
    10                MR LANGSTAFF: INTRODUCTION RE STATISTICS
    11      MR LANGSTAFF:  Thank you, sir.  Sir, some time ago, when
    12          I stood up to open this case, as part of what I had to
    13          say, I showed a slide on the screen.  That slide has now
    14          been called up again.  It shows, in simple numbered
    15          form, the stages by which this Inquiry proposes to lay
    16          the evidence before you, the Panel, so far as statistics
    17          can assist.
    18                What we are at is essentially Stage 1, the
    19          preliminary critical overview of sources of data.
    20                Statistics are, of course, essentially figures and
    21          conclusions that can be drawn from the figures as
    22          figures.  I would not wish anything to be taken from the
    23          next two days which would in any way suggest that
    24          figures can tell the whole picture.  They can be
    25          helpful, but statistics in themselves can never be the
0001
     1          whole picture.  Indeed, they can be dangerously
     2          seductive.
     3                The conclusion which you have to draw at the end
     4          of this Inquiry as to the adequacy of care is
     5          a summation of many different strands of evidence,
     6          principal among them, the stories which you have heard
     7          from individual parents, each one of which tells its own
     8          tale; the conclusions that you are able to draw from the
     9          evidence of the way in which matters were structured at
    10          Bristol and elsewhere in the country; the conclusions
    11          that you may wish to draw from seeing whether or not
    12          national policies which are thought now to have force
    13          and importance behind them were or were not adopted here
    14          locally in the years 1983 to 1995; the conclusions which
    15          we may reach from hearing individual people who were, as
    16          it were, main players in what happened in Bristol; and,
    17          as part of picture, but only as part of the picture, the
    18          conclusions one gets from looking at figures as a whole.
    19                As I said when I opened this Inquiry to you, it
    20          would be very wrong to forget that at the heart of the
    21          figures are the cases of individual children, some of
    22          whom will have been treated and will thankfully have
    23          survived; some of whom will have been treated and sadly
    24          will have either suffered subsequently in consequence of
    25          the treatment or of the condition which led to the
0002
     1          treatment, and others, tragically, because we have
     2          always said it was a tragedy, will have lost their
     3          lives.
     4                The GMC, when it examined the cases against the
     5          two surgeons and Dr Roylance, looked at and examined in
     6          detail 29 cases.  We, for our part, have obtained the
     7          clinical records of every child that has been identified
     8          to us as having had cardiac surgery, closed or open,
     9          from 1983 until 1995, and will, in a way which I shall
    10          describe, attempt to draw such conclusions as we can
    11          from them.
    12                In that sense, every single child, survivor or
    13          not, will have his or her case painted as part of the
    14          picture which you will have to determine your findings
    15          upon.
    16                Today and tomorrow we will hear from a number of
    17          experts who have different roles to perform in respect
    18          of the various data sources which will deal with the
    19          statistical side.  That is part of the picture.  First,
    20          after the few words that I have to say by way of
    21          introduction, I shall ask Dr David Spiegelhalter to come
    22          and tell us about the way in which, as an expert
    23          statistician, he sees statistics helping the Inquiry as
    24          best they can towards reaching a conclusion in respect
    25          of the terms of reference.
0003
     1                Once he has identified essentially what the
     2          process is, we will then concentrate on this part of the
     3          process, the preliminary critical overview and will hear
     4          first from Mr Richard Willmer, the chief statistician at
     5          the Department of Health.
     6                When he gives evidence, I hope that what will
     7          develop will be a symposium or seminar, if you like, in
     8          which, far from asking all the questions myself, Dr Paul
     9          Aylin, who sits nearest the screen to my right, of
    10          Imperial College, will intervene and develop a dialogue
    11          with Richard Willmer in which three things essentially
    12          will be explored, three essential messages for the
    13          Inquiry.  That is, so far as the data set with which
    14          Mr Willmer is concerned -- I should say data sets in the
    15          plural and I will explain why in a moment -- whether
    16          they have weaknesses, for our purposes; secondly,
    17          whether they have strengths for our purposes; and
    18          thirdly, what use the Inquiry can potentially make of
    19          the information which the data set can tell us.
    20                After we have examined the data sets to which he
    21          can speak, the national data sets, we will hear from
    22          Mr Bruce Keogh, who will tell us about the
    23          Cardiothoracic Register, a register which has been in
    24          existence since 1977.  Essentially, I anticipate that it
    25          should be possible to develop a dialogue in the same way
0004
     1          principally between him and Dr Spiegelhalter, who sits
     2          a little bit further away from the screen to my right,
     3          examining exactly the same things: the strengths, the
     4          weaknesses and the utility of that data source for our
     5          purposes.
     6                Tomorrow we expect to narrow down the field from
     7          the national to the local, and look at the major data
     8          set available locally.
     9                So far as the available data is concerned, if
    10          I just run through what is available, and their apparent
    11          utility to the Inquiry, the first of those data sets
    12          that you will hear of from Mr Willmer is that contained
    13          in the hospital episode statistics system, or HES for
    14          simplicity.  This was set up by the Department of Health
    15          to collect information about activity in all National
    16          Health Service hospitals in England.  Immediately,
    17          anyone listening with a critical ear will have detected
    18          one gap: it deals with NHS hospitals and not others.
    19          Anyone listening might already have picked up that it
    20          deals with inpatient activity and it was not essentially
    21          designed to tell surgeons or hospitals, putting it
    22          crudely, how well or how badly they were doing by
    23          comparison with others; that was not its function.
    24                One of the matters which we will wish to explore
    25          is whether it can, despite having been designed for
0005
     1          other purposes, nonetheless be used for that function in
     2          order to retrospectively inform us on the Inquiry.  The
     3          data should, in theory, be complete, but what it
     4          measures is not operations; it is not directly
     5          morbidity; it is what are called "finished consultant
     6          episodes" and a finished consultant episode, as we have
     7          already heard in evidence at an earlier stage in this
     8          Inquiry, is an occasion when a patient comes under the
     9          care of a particular named consultant.  For as long as
    10          that patient is an inpatient, under the care of that
    11          particular consultant, that is one unfinished consultant
    12          episode.  When he is discharged from that care, it
    13          becomes a finished consultant episode.
    14                What it means, because finished consultant
    15          episodes are picked up each as a unit, is that if
    16          a particular patient is in the course of his inpatient
    17          stay under the care of more than one consultant, he
    18          moves from one department to another to another, let us
    19          suppose -- it sometimes happens as we know -- then there
    20          will be three finished consultant episodes although
    21          there will be only one inpatient stay -- one course of
    22          treatment, if you like.  Looked at from the patient's
    23          point of view it is one hospital stay; looked at from
    24          the national statistics, it is three.
    25                Similarly, if the surgery, let us suppose, is
0006
     1          broken by a period of discharge from hospital followed
     2          by a return of that patient, maybe within months,
     3          perhaps, because the surgery needs to be revised, that
     4          gets counted as a second and separate consultant
     5          episode.
     6                So immediately, one can begin to see something
     7          of the advantages, which is in the completeness of the
     8          data, and something of the disadvantages, which is that
     9          it was designed for purposes which are not ours.
    10                I do not want to say too much more about those
    11          statistics because it is best dealt with, I think, by
    12          explanation from Mr Willmer and in particular because
    13          I am conscious here that he and our experts know far
    14          more about HES than I could ever hope to.
    15                The second source is the Cardiothoracic Register,
    16          and again, a brief description will help.  This was set
    17          up by, it happens, Sir Terence English who gave evidence
    18          to us, in 1977 following a pilot project in 1976, and it
    19          aimed to provide information to cardiothoracic surgeons
    20          on the volume, the type and the distribution of work
    21          performed, to promote the rational use and future
    22          planning of cardiothoracic services and to improve the
    23          overall quality of the cardiac services delivered by
    24          enabling individual surgeons to compare themselves
    25          against national data.
0007
     1                 The way in which the data was collected was for
     2          each unit to be asked, invited -- cajoled perhaps --
     3          into supplying data to the Society centrally in the
     4          country.  There was no compulsion about it.  You will
     5          hear, when Mr Keogh gives his evidence, that in fact
     6          some units did not provide data.  It might appear, at
     7          least from one of the charts which he has helpfully
     8          produced for us, that some units consistently did not
     9          provide data.  If they had data to give and did not
    10          consistently provide it, one has to ask whether there
    11          may be some reason for that, whether it was shame at
    12          poor results, for instance; whether it was simply an
    13          unwillingness to be co-operative for some reason of
    14          principle; whether it was chronic indolence; whether it
    15          was some other reason.  It is very difficult to
    16          speculate.  What one can say in respect of the figures
    17          is that to the extent that individual units did not
    18          provide figures to the central register, the central
    19          figures give one less than the full story.  The extent
    20          to which that is less than the full story, that may
    21          affect the use we in the Inquiry can make of those
    22          figures, will be a matter for your judgment, having
    23          heard what has to be said about them.
    24                The register was kept carefully from 1977
    25          throughout the period with which we are interested.  In
0008
     1          that respect, it differs from the HES data to which
     2          I have already referred, because the HES data began as
     3          a process in 1987/88; it evolved as a data collection
     4          exercise becoming, we think, as a preliminary
     5          observation, at its most reliable for various reasons in
     6          1991/92.  So whereas one may use the register for all
     7          years from 1983 to 1995, one can only use HES, really,
     8          probably, from 1991/92 onwards.
     9                One of the problems with the register is that in
    10          order to get it off the ground -- this is my probably
    11          imperfect lay understanding -- the Society had to
    12          promise the reporting surgeons that the data they
    13          supplied would be confidential; it would be anonymised
    14          so that the individual unit or surgeon would know what
    15          his results or the unit's results were, but that would
    16          be information kept to the unit or the surgeon.  The
    17          only point of comparison would be with the whole of the
    18          rest of the country generally, such as was reported to
    19          the register.
    20                Considerable lengths were taken to secure that
    21          anonymity, so that there was no fear among surgeons that
    22          the data would be used for any purpose to which they did
    23          not consent.
    24                To an extent, that means that one cannot look at
    25          the register and compare Bristol with, let us say,
0009
     1          Southampton or Great Ormond Street or Birmingham,
     2          because the data is not constructed to enable us to do
     3          that.
     4                So much for those two data sources.
     5                The other four main sources which we believe we
     6          have identified are these: there was a computerised
     7          information system kept in the South West by consultant
     8          cardiologists at the Bristol Royal Infirmary and the
     9          Bristol Children's Hospital throughout the period of the
    10          Inquiry.  The evidence that we have so far suggests that
    11          this system was designed to allow the cardiologists
    12          ready access to essential summary information on
    13          patients when clinical records were not available for
    14          audit purposes and to facilitate the presentation of
    15          clinical data at meetings.
    16                The hardware and the software were provided by the
    17          Bristol and South West Children's Heart Circle and the
    18          current version is held on a personal computer using
    19          Borland Paradox software.  Most of the data were entered
    20          by a secretary working for the cardiologists.
    21                We have an electronic copy of that register and
    22          plans are in hand to commission an independent review
    23          with statistical analysis of that which it shows.
    24          A full assessment of the data will necessarily involve
    25          cross-comparisons with other relevant data sources so
0010
     1          far as is feasible.
     2                Next, we have the Trust Patient Administration
     3          System.  It is a computer system for the purpose of
     4          tracking hospital activity, and it forms the basis for
     5          the returns to the Department of Health under the HES
     6          system.
     7                We do not have data before 1st January 1988.  The
     8          period which it covers for our purposes is 1st January
     9          1988 until 31st December 1995, and it again needs to be
    10          emphasised that this system was not designed for audit
    11          purposes; it was an information system to help with the
    12          tracking of patients, so that the hospital knew who was
    13          where, if you like, it had the ability to print off
    14          labels and addresses to keep in touch with patients and
    15          so on: essentially administrative purposes, also
    16          planning purposes and when the purchaser/provider split
    17          began, for contracting purposes.  It was not used for
    18          clinical purposes and therefore the usefulness of the
    19          data may be limited when one looks to see whether it can
    20          tell us anything about adequacy or quality of care.
    21                The data was generated by clinical coders using
    22          standard classification systems.
    23                The quality of the data appears to be good.  We
    24          think that probably it will mainly serve as a useful
    25          descriptor of what happened in respect of children
0011
     1          receiving heart surgery from 1988 onwards, and to
     2          cross-validate other information.  One of the great
     3          advantages of having a number of different sources of
     4          data is that if they all end up saying pretty much the
     5          same thing, one can draw strength from that.
     6          Individually there may be weaknesses which would not
     7          allow anyone with any integrity to place reliance upon
     8          that individual source, but if the fact is that one
     9          source cross-refers to another, or one source produces
    10          the same result as another, one can draw strength from
    11          that and the two together may be worth much more than
    12          the sum of the individuals.
    13                The next major data source is the records
    14          themselves, the clinical records.  These, the Inquiry
    15          has in great number.  They have all been copied, kept
    16          and stored at the Inquiry offices, and each set has been
    17          coded by clinical coders, an experienced team, using
    18          codes to demonstrate what the diagnosis and what the
    19          procedure in each case was.
    20                A fuller description of the way in which the
    21          Inquiry has dealt with the clinical case records, and
    22          indeed, a fuller description of each of the data sources
    23          to which I have briefly referred in this introduction,
    24          is contained in a paper which the Inquiry published last
    25          week entitled "A Preliminary Overview of the Data
0012
     1          Sources."
     2                Perhaps it is not necessary for me therefore to
     3          deal in any particular detail at this stage with the
     4          clinical cases, save to say this: that one of the great
     5          advantages is that we can be confident that, unlike any
     6          other researcher of whom we are aware who has looked at
     7          Bristol over the years 1983 to 1995, we have as full
     8          a picture as one possibly can by going back to the
     9          original source data, as it were, the clinical records.
    10          We have to accept that clinical records can of course
    11          have pages missing; they can, of course, be incomplete.
    12          But in general, they are kept for the purposes of care
    13          of patients, in general they are of good quality, and
    14          what is true in general throughout the Health Service we
    15          have found, I am told, to be true in respect of these
    16          records in Bristol.
    17                Of course, one of the advantages of having the
    18          available records is that one can look at those
    19          diagnoses which have led to successful surgery from 1984
    20          to 1995, those which have led to unsuccessful surgery,
    21          and unsuccessful not only in terms of mortality at some
    22          later date, but also in terms of morbidity.  So the
    23          whole range of outcomes it is possible to study from
    24          looking at the clinical case records.
    25                One of the hopes and expectations in the next
0013
     1          couple of days is that those who are listening and have
     2          an interest in the Inquiry's procedures and its findings
     3          may be able to make to us critical, constructive
     4          comments as to what may best be found from a combination
     5          of the data sources I have mentioned thus far.
     6                One cannot forget, as the last of the major
     7          sources, the surgeons' logs.  We have taken and we have
     8          or will publish the statements of the doctors describing
     9          the logs of the surgeons who were concerned in the
    10          operations.  Those logs, generally speaking, identify
    11          the procedures which those surgeons carried out.  They
    12          are, of course, a useful means of cross-checking against
    13          other data to see how complete and accurate that other
    14          data source is.
    15                Having spent some time dealing with the six main
    16          sources as they appear to us, I do not want to take too
    17          much time dealing with what might be thought to be
    18          lesser sources, but lest it be thought that this Inquiry
    19          is leaving any stone unturned, if I can simply run
    20          through and identify thus far the 10 principal other
    21          data sources, and invite anyone who is listening who
    22          knows of any further data which may help us, at any
    23          stage during our terms of reference, 1984 to 1995, to
    24          let us know.
    25                We have:
0014
     1                (1) a card index system which the cardiologists
     2          kept for part of the period;
     3                (2) the logs which were kept in the operating
     4          theatres;
     5                (3) records which were maintained by clinical
     6          perfusionists;
     7                (4) logs which were kept by anaesthetists;
     8                (5) the cardiac surgeons in Bristol themselves had
     9          a system, a patient analysis and tracing system, PATS,
    10          kept on a METASA system;
    11                (6) the NHS regional data archives;
    12                (7) something we will explore this morning with
    13          the help of Mr Willmer: the hospital inpatient data
    14          system, HIPE;
    15                (8) the Confidential Enquiry into Stillbirths and
    16          Deaths in Infancy, a national source but only, we think,
    17          of partial relevance to us, sometimes known by its
    18          acronym CESDI;
    19                (9) the National Confidential Enquiry into
    20          Peri-operative Deaths, NCEPOD;
    21                (10) the national reporting system for congenital
    22          abnormalities, which has an obvious relevance to the
    23          conditions which led to operation in Bristol.
    24                Those are the 10 principal lesser sources.  I do
    25          not wish to imply from that list that is all that we
0015
     1          have actually covered; there is more, but nonetheless,
     2          it will be unwise of anyone listening to assume that we
     3          know of the data source that he or she may know of and
     4          so if there is any further information which we have
     5          overlooked or which no-one has told us about relating to
     6          the years 1984 to 1995, to paediatric cardiac care, then
     7          please tell us about it.
     8                Sir, the purpose as I have indicated in looking at
     9          each of the data sources is to identify so far as this
    10          part of the picture is concerned what are the what are
    11          the strengths, what are the weaknesses, what is the
    12          usefulness for us.  This is part of the overall picture
    13          which the statistical investigation should help to show
    14          us, as indeed indicated on the screen in the four
    15          points, and it is perhaps helpful if then, at this
    16          stage, I ask Dr David Spiegelhalter to come forward and
    17          to explain from his perspective the content of each
    18          these stages and the overall strategy by which figures
    19          can assist, even though they may not control our
    20          findings.
    21                Dr Spiegelhalter, you would wish, I think, to
    22          affirm?
    23                    DR DAVID SPIEGELHALTER (AFFIRMED):
    24                       Examined by MR LANGSTAFF:
    25      Q.  Dr Spiegelhalter, can you give us your full name and
0016
     1          your qualifications, please?
     2      A.  My name is David John Spiegelhalter.  I have an MA in
     3          mathematics from Oxford University, an MSc and PhD in
     4          mathematical statistics from London University.  I am
     5          a Chartered Statistician with the Royal Statistical
     6          Society and I am currently senior scientist working for
     7          the Medical Research Council at the Medical Research
     8          Council Biostatistics Unit in Cambridge.
     9      Q.  You have had some involvement, I think, with the subject
    10          matter of the Inquiry before, having given evidence at
    11          the General Medical Council?
    12      A.  Yes.
    13      Q.  If we look at the screen, it sets out in very bare
    14          bones, does it, the processes as you would see them by
    15          which the statistics can inform the process of the
    16          Inquiry?
    17      A.  Yes.  Could I make a few preliminary points?  I would
    18          first like to say, as a professional statistician, that
    19          one must really acknowledge this is a very tricky area
    20          and for a number of reasons.
    21                I think the first is that we are looking backwards
    22          into a time when the whole process of collecting data in
    23          a rigorous way was rudimentary, to say the least, not
    24          least because of the lack of easy computer facilities,
    25          but also just because of the culture, which has now
0017
     1          changed quite dramatically.
     2                So as you have made clear, there are a variety of
     3          data sources, but none of them are what we would
     4          consider as really perfect, and of course if we were
     5          doing this prospectively, it would be a very different
     6          matter.
     7                The other crucial issue is that while, as you have
     8          said, there can be very detailed analysis of what went
     9          on in Bristol by going back through the case records,
    10          that is not the only issue.  One of the crucial things
    11          is comparison with what, perhaps, might be considered as
    12          acceptable practice or standard practice in levels of
    13          performance in the country at the time, and while I do
    14          not think the Inquiry has quite the resources to go
    15          through every record in the country, it means that there
    16          is going to be a problem which has to be faced of making
    17          comparisons with other centres.  That obviously has to
    18          be faced by looking at the available data, as you have
    19          described it.
    20                The final point I would like to make, which
    21          I suppose is slightly more technical, is what in the
    22          trade is known as "multiple comparisons".  As one looks
    23          at finer and finer level in terms of, say, disease
    24          categories or maybe over a short period of time or maybe
    25          just looking at what one surgeon is doing, the numbers
0018
     1          are going to get smaller and smaller and that means from
     2          a statistical point of view, the play of chance, as bad
     3          luck, is that starts having more and more of a role.
     4          From a statistical point of view one would like big
     5          numbers so everything is stable so one can attribute
     6          differences to some underlying cause rather than just
     7          the play of chance.  So there is always a danger as one
     8          looks more and more into these data sources and at finer
     9          and finer detail that apparent differences are going to
    10          come up which it is very difficult to say whether these
    11          are real or not.
    12                So inevitably, there has to be some sort of
    13          grouping by diagnostic category or over time in order to
    14          try to smooth out some of these potential I suppose, you
    15          know, random or chance occurrences.  I think we will
    16          come on to that, but any grouping like that obviously
    17          should be done in an unbiased way and should be done
    18          without looking at the data first.
    19      Q.  You used the word "unbiased" there in the statistical
    20          sense?
    21      A.  I could mean it in a somewhat non-technical sense as
    22          well.  One should do that sort of grouping before
    23          looking at the results so one is not just picking out
    24          the worst things one can find over a long period.
    25      Q.  So one is beginning, as it were, with the groupings, the
0019
     1          criteria, and then looking at the data in the light of
     2          those groupings and criteria?
     3      A.  That is the idea.
     4      Q.  The purpose, then, of this stage of the investigation is
     5          to establish what, essentially?  What would you hope
     6          that the Inquiry will be able to take away from the next
     7          day and a half?
     8      A.  We are starting on these four stages, really, and
     9          bearing in mind the points I made about the difficulty
    10          of drawing firm conclusions from the data available, the
    11          first stage is really to identify the sources and to get
    12          some idea of what their quality is and what conclusions
    13          might be able to be drawn from them.  As I said, it is
    14          best to do that before we actually see what that data is
    15          saying, so that we can make a fair judgment as to what
    16          might be drawn without knowing what actually is found
    17          inside those data sets.
    18                The kind of information that would be very good to
    19          know is who completed the data, who was responsible for
    20          submitting this data, was it different people for each
    21          of the different data sources?  Because in a sense, as
    22          you said, it would be very nice if these data sources
    23          corroborated each other, we would feel happier about the
    24          evidence, but only in as much as that evidence perhaps
    25          came from different sources.  If they are all coming
0020
     1          from the same source, then that corroboration might be
     2          somewhat illusory.
     3                I think it would be very useful to know about what
     4          the judgment is about the quality of the data sources
     5          and that quality can be judged in terms of the coverage,
     6          you know, are all the centres looking across the country
     7          now contributing their data, and in terms of the
     8          completeness of the records being given and in terms of
     9          the accuracy whether we can actually believe what we can
    10          see in front of us.
    11      Q.  Just distinguishing those three, because I think those
    12          are words we will probably come across a number of
    13          times, "coverage" you use to mean whether it covers
    14          every part of the country: geographical, as it were?
    15      A.  Yes whether some centres are just systematically not
    16          providing data.
    17      Q.  "Completeness" is giving the questions one is asked on
    18          the form which is filled in and sent back to the data
    19          centre: have all the questions been answered?
    20      A.  Yes, completeness is to do with missing data on some of
    21          the aspects of the cases being submitted.
    22      Q.  So for instance if one had a data set in which the date
    23          of birth was quite often missed off, or the date of the
    24          operation was quite often missed out, that data source
    25          to that extent would be incomplete, even though the
0021
     1          coverage might be 100 per cent across the country?
     2      A.  Yes, and even more important would be data sources in
     3          which the outcome was not recorded accurately.
     4      Q.  "Accuracy" is the last?
     5      A.  Accuracy is saying, given the data provided, can we
     6          actually believe it, that it is appropriate?  Are all
     7          the cases being included from that centre; are all the
     8          deaths being recorded appropriately; are the details
     9          correct?
    10      Q.  I stopped you in mid-flow.
    11      A.  I think that, I suppose, covers the main aspects of what
    12          we are trying to establish at this stage.  In the case
    13          of Bristol, as we will come on to later, by going deeper
    14          into the clinical records as has been done, we will
    15          actually be able to get a very good idea of these
    16          matters, because we will be able to cross-check the
    17          detailed data from the clinical records with what has
    18          been reported to the various sources that you have
    19          described.
    20                One of the problems, of course, is how much can we
    21          trust the data coming from other centres with which we
    22          would like to make comparison.  That is obviously
    23          something that will require somewhat more of a judgment,
    24          although there may be some other sources we could use.
    25          We will come on to that later.
0022
     1      Q.  Can I just explore with you for a moment the need of the
     2          data sources, if there is a need, for the data sources
     3          to speak as it were a common language, so when each of
     4          them talks about death, for instance, they mean the same
     5          thing.  It might be quite surprising to the listener to
     6          think "death" might mean something different in
     7          different data sources, but one might define it as
     8          a death within 30 days of operation, one might record
     9          death as "ever died" and receive that information from
    10          the local register of deaths, for instance, and so on?
    11      A.  And another might record it as deaths in hospital, which
    12          is a common one.
    13      Q.  So for the purposes of cross-reference, corroboration
    14          between the data sources, would one need to know they
    15          were talking a common language or could be made to talk
    16          a common language?
    17      A.  That is obviously very crucial, to try to do that.
    18          I think equally crucial, something that really has to be
    19          tackled is to make sure in terms of the coding, the
    20          diagnostic categories that the groupings are as similar
    21          as possible.  That has to be faced, that the different
    22          sources of information use different ways of describing
    23          patients, different ways of categorising them, and we
    24          would really like to be able to project all these
    25          sources on to a common framework, and I think I have to
0023
     1          admit that framework might not be, you know, the latest
     2          way if one was setting up a database from scratch we
     3          would use, but one that would allow at least common
     4          comparisons between the data sources.
     5      Q.  So if one has a particular operation -- whatever it may
     6          be, it would be wrong to suggest one -- one would need
     7          to know that in Bristol in the various data sources it
     8          was described in the same way, or one could derive the
     9          same description for the purposes of comparison from
    10          each of the data sources in order to corroborate one
    11          against the other.  One would need to know that the same
    12          description was applied by others elsewhere if one was
    13          going to compare Bristol with elsewhere?
    14      A.  Ideally one would, but it will be difficult both because
    15          of the types of data that have been collected and
    16          because of this area which is intrinsically extremely
    17          complex, as has been pointed out.  The fact that there
    18          may be multiple diagnoses in the child and the fact that
    19          there may be multiple operations will mean that this
    20          will be a difficult matter.
    21      Q.  I shall ask you to return to this theme tomorrow, but
    22          given that this is Stage 1 and you have briefly
    23          described some of the problems that are faced and what
    24          we might hope to go away with, what do you see the next
    25          three stages as involving, in brief?
0024
     1      A.  The next phase, which is labelled here as "Exploration",
     2          is to look at the available sources of data and conduct
     3          preliminary analyses of those, as independently as
     4          possible, except that the only sort of common ground
     5          between those analyses should be that they should be
     6          trying to use as common a coding categorisation as is
     7          feasible.  You have indicated already that such analyses
     8          have taken place or will be taking place for the main
     9          sources such as the HES data, the cardiac register and
    10          the data and the medical records themselves on the
    11          Bristol patient.
    12                That will enable one, one would hope, after that,
    13          to get an idea -- first of all within Bristol itself, we
    14          should be able to get a very good idea of what was going
    15          on and what the outcomes were, in the descriptive sense,
    16          for Bristol alone.  That, when cross-checked against the
    17          various other sources such as the logs you have
    18          suggested, should enable a very accurate picture of what
    19          went on in Bristol to be known.  That also can be used
    20          as a frame, to enable sampling for more detailed
    21          examination of particular cases.
    22                In terms of the data available for comparative
    23          purposes for other centres, using the main sources will
    24          be the HES and the Cardiac Register, then analysis of
    25          those, keeping always in mind the limitations we are to
0025
     1          hear about today, should reveal first of all what was
     2          the variability being reported between centres even
     3          without knowing the identity of any centres.  The
     4          background variability is extremely important to know:
     5          what were the standards being reported for various
     6          procedures at that time, and what was the reported
     7          variability between centres?  That requires some fairly
     8          careful analysis.  Then, by identifying Bristol and
     9          other centres, one should be able to see whether, on the
    10          face of it, there are some categories, some periods of
    11          time when Bristol or other centres appeared to be pretty
    12          systematically differing -- having differing outcomes to
    13          other centres, even allowing for the sort of variability
    14          that is inevitable, that there will be between centres.
    15                That really just provides -- whether there are
    16          variations, whether Bristol figured prominently in those
    17          variations with regard to particular categories, and so
    18          to provide whether there is any prima facie evidence for
    19          further, more detailed examination of the data.
    20      Q.  So the further, more detailed examination of the data
    21          is, is it, at the confirmational stage?
    22      A.  Yes.  You can say the exploration, stage, really, is
    23          seeking to identify whether Bristol is an outlier at
    24          some time for some procedures without in any way
    25          attributing any reason for it and keeping always in mind
0026
     1          the quality of the data.
     2      Q.  So simply saying, it appears that Bristol is different?
     3      A.  Yes.
     4      Q.  That is the most one could conclude at that stage?
     5      A.  Yes.  And of course, one has to keep in mind that, you
     6          know, an analysis like that which will inevitably
     7          involve some grouping, and bearing in mind the
     8          limitations of the data and the lack of any risk
     9          adjustment, severity adjustment in the data,
    10          differences, you know, small differences in particular
    11          areas, could be obscured by such approach, and so one
    12          needs to look really quite carefully at that data.
    13      Q.  One could equally, looking ahead, come to the conclusion
    14          that Bristol could be shown to be no different on
    15          available data?
    16      A.  Certainly.
    17      Q.  In which case, one could not ask any further, could one?
    18      A.  Unless there was other evidence that existed that
    19          perhaps the grouping had been too broad in certain
    20          categories, so one could cautiously look at perhaps
    21          finer categorisation, but with great caution, because as
    22          I said, if you really dig deep enough and finely enough,
    23          you will always find some variability.  There are
    24          statistics -- that is really what this subject about,
    25          trying to sort out what can be explained about chance
0027
     1          alone, luck, from what is systematically different.
     2      Q.  So if it appears after the exploratory stage that there
     3          is a systematic difference as opposed to a chance
     4          difference between Bristol and other centres, then one
     5          goes on to a third stage of confirmation?
     6      A.  Yes.  One of the things that is easiest to confirm is
     7          whether Bristol's performance and true performance was
     8          as reported in the national databases that are
     9          available, because we only get a very clear idea of that
    10          from the clinical records, so certainly Bristol's true
    11          performance can be calibrated against what has come out
    12          in the exploratory phase, which I regard as a sort of
    13          screening phase.
    14                The problem comes in making a judgment about
    15          whether the data reported from the other centres is
    16          accurate enough to be confident that there really are
    17          the differences that showed up on the preliminary
    18          screening.  That will rely, I would imagine, to some
    19          extent on judgments that we are going to hear about
    20          today on the accuracy of the data being recorded, and if
    21          some particular findings have been established at the
    22          screening stage, quite strong findings, there might be
    23          an argument that perhaps the fuller sources of
    24          information from other centres might be useful as well
    25          to confirm their results.
0028
     1      Q.  Following confirmation, one would then be in a position
     2          to say either what appeared at the end of the
     3          exploratory phase still appears to hold good; one would
     4          then be looking for an explanation as to why it should,
     5          because there would then be an identified systematic
     6          difference?
     7      A.  Yes.  The confirmatory phase still does not give any
     8          explanation or reason for variability, it is saying we
     9          can feel reasonably confident that there is something,
    10          some cause for Bristol to be an outlier or to be
    11          systematically different in some respects, because there
    12          is a limit to how much that is a statistical issue.
    13          Obviously by looking more carefully at the Bristol
    14          notes, one might be able to identify reasons for this
    15          variability: by that time, if these things have come
    16          about, the discussion will be much more focused and so
    17          I would imagine that would be the time that expertise
    18          from the substantive areas, the clinical expertise,
    19          would come in, and using perhaps the sampling that has
    20          gone on in the case notes, to identify reasons for the
    21          confirmed performance.
    22      Q.  Can I take a crude example and see whether I have
    23          understood what you are saying.
    24                Let us suppose it were shown at the end of stage 3
    25          that Bristol were different from other centres; let us
0029
     1          suppose the surgery was less successful in terms of
     2          death following surgery in Bristol than elsewhere.
     3                It might be said, "Well, the cases that were dealt
     4          with at Bristol were actually much more difficult than
     5          elsewhere", and if that were indeed the case, it would
     6          be a perfectly rational and sensible explanation of the
     7          difference.
     8                One would then want to explore that difference and
     9          one would do that by means of other expertise than the
    10          statisticians?
    11      A.  Yes.  I mean, if one had wonderful data available on all
    12          centres, then there are very established statistical
    13          methods of doing that risk adjustment which can be
    14          carried out by adult cardiac surgery by the more uniform
    15          conditions and the larger numbers involved.
    16                In the area, there is only rather rudimentary
    17          development of risk adjustment procedures, so those will
    18          be carried out with more caution.
    19                So in any identified area, though, one could
    20          certainly identify the factors in Bristol from the case
    21          notes, and then one perhaps could use other information
    22          from other centres to have an idea of whether there was
    23          systematic difference in the types of patient being
    24          treated in Bristol compared with other centres.  But
    25          that would have to be a very focused activity in an area
0030
     1          previously identified.
     2      Q.  In the example that I gave you, you would be focusing
     3          upon the question whether or not the case mix was
     4          different in Bristol than it was elsewhere generally,
     5          for instance?
     6      A.  Yes, in that particular area, yes.
     7      Q.  So what it means is that even if one gets to stage 4,
     8          one cannot simply say, well, the statistics show that
     9          Bristol was -- I use the word to sum up -- "bad", it is
    10          a very poor word sometimes to use, but one cannot draw
    11          that conclusion; one has to look for the explanation and
    12          evaluate the explanation there might be?
    13      A.  Statistics can never show a causal factor in this sense,
    14          and I think even at the end of this stage one would say
    15          that this variability -- if it is a negative result --
    16          it cannot be explained by variations in case mix or by
    17          chance alone, then it is left up to --
    18      Q.  There has to be some other explanation?
    19      A.  And other expertise to identify it.
    20      Q.  Whatever it may be?
    21      A.  Yes.
    22      MR LANGSTAFF:  Thank you, Dr Spiegelhalter.  Sir, I am
    23          conscious that it is time we might normally have a short
    24          break, and perhaps it would be convenient to have one
    25          now before we hear from Mr Willmer?
0031
     1      THE CHAIRMAN:  Yes, Mr Langstaff.  Shall we say 15 minutes
     2          and reconvene, therefore, at 11.00?  Thank you.
     3      (10.43 am)
     4                             (A short break)
     5      (11.07 am)
     6      MR LANGSTAFF:  Mr Richard Willmer, please.  Mr Willmer, if
     7          you would like to stand to take the affirmation?
     8                       MR RICHARD WILLMER (Affirmed):
     9                       Examined by MR LANGSTAFF:
    10      Q.  Mr Willmer, we have your statement which I hope will
    11          come up on the screen to your right, WIT 189/1: is that
    12          a statement which you have provided for the purposes of
    13          this Inquiry?
    14      A.  It is.
    15      Q.  And together with appendices, it runs for 132 pages.
    16          You have, I think, signed it for us at -- I do not know
    17          that you have signed it for us.  In any event, is it
    18          true and accurate?
    19      A.  It is, yes.
    20      Q.  You signed on the bottom of the front page, in fact,
    21          I think?
    22      A.  Yes, on the front page.
    23      Q.  In that statement, do you deal with two sets of national
    24          statistics of interest to us: HIPE and HES?
    25      A.  Yes.  The questions that were put to us related both to
0032
     1          HIPE and HES.  As has been mentioned previously, HIPE
     2          was the predecessor of HES, and our knowledge of HIPE is
     3          very much less than it was of HES but we have tried to
     4          answer the questions in relation to HIPE as best we
     5          can.  If the Inquiry requires further details, we can
     6          explore it and gather further information if that is
     7          required.  The majority of the information relates to
     8          HES.
     9      Q.  Can I deal with the two chronologically and deal first
    10          of all with the Hospital Inpatient Enquiry, HIPE.  First
    11          of all, to get a broad idea of when that began, that
    12          began, I think, in 1949, did it?
    13      A.  I think very close after the NHS actually came into
    14          being.
    15      Q.  So it was, if you like, the first national tool which
    16          the NHS developed?
    17      A.  Yes.
    18      Q.  A sea change came about in the way in which it was
    19          compiled in the 1960s, in which a sample was drawn from
    20          regional data sets?
    21      A.  That was the basis of the system from about the 1960s
    22          through to the mid-1980s, as I understand it.
    23      Q.  It stopped when -- in 1985, did it?
    24      A.  Yes.
    25      Q.  So, so far as we are concerned, we only have the 1984 to
0033
     1          1985 period which is of any relevance to us, directly.
     2                Did it attempt to cover the entire activity in NHS
     3          hospitals?
     4      A.  It was designed as a 10 per cent sample of activity
     5          right across the NHS and it was actually put into three
     6          areas, so it actually had a separate system for
     7          psychiatric data and for maternity data.
     8      Q.  So, so far as paediatric cardiac surgery is concerned,
     9          that would be in one of the three data sets?
    10      A.  It would be.
    11      Q.  And the data was collected from the regions?
    12      A.  Yes, from the HAA systems that were in the region at the
    13          time.
    14      Q.  You have used an acronym there.
    15      A.  HAA?  The hospital activity analysis brought together by
    16          the regions at that time.
    17      Q.  In order to get the national picture, one would have to
    18          depend upon the regional picture?
    19      A.  If I wanted a 100 per cent sample, one would presumably
    20          have to go back to the regions if they still had the
    21          information.
    22      Q.  And is it likely -- I do not expect you have carried out
    23          any particular Inquiry -- that they do hold data for
    24          a time as long ago as 1984 to 1995?
    25      A.  I would have to explore that, but off the top of my
0034
     1          head, it is unlikely they would have that information.
     2      Q.  So one does have a 10 per cent sample drawn from the
     3          regions?
     4      A.  Yes.
     5      Q.  Could you pull the microphone a little bit closer?  It
     6          is pretty good at picking up your voice, but your voice
     7          is naturally quiet.
     8                So in terms of what it can tell us, I identified
     9          three main areas we wanted to focus on: the strengths,
    10          the weaknesses, and any useful information that we can
    11          gain from a data set.
    12                What would you say were the principal weaknesses
    13          for our purposes -- you have heard me describe them
    14          briefly this morning -- of using the HIPE data set for
    15          1984 to 1985?
    16      A.  Probably the main weakness is the fact that there were
    17          very few analyses produced at the time down to the level
    18          of detail that would probably be of interest to this
    19          Inquiry.  I think a lot of the information was produced
    20          at regional level and probably down to the specialty and
    21          some operative procedure codes and diagnosis codes, but
    22          probably not to the level this Inquiry would be
    23          interested in.
    24      Q.  So it would not enable one to focus on particular
    25          operation codes in particular hospitals?
0035
     1      A.  As I understand it, no.
     2      Q.  Dealing with what one might achieve from it, would one
     3          get a broad picture of level of activity across the
     4          country in particular areas?
     5      A.  Yes.
     6      Q.  So far as one would be able to know, would one get
     7          a broad picture of activity generally in cardiac
     8          surgery, leaving aside for the moment whether it is
     9          adult or paediatric?
    10      A.  I think we would have to look at the tables that were
    11          produced at the time, and let you know exactly what
    12          level of detail is available on those tables.
    13      Q.  Can I put HIPE on one side, given what you have said
    14          about it, save to ask you this: at page 11 of your
    15          statement, the bottom of the page, you deal with the
    16          HIPE data and you use a quotation there which is why it
    17          is in italics.  The third line:
    18                "It is pointed out by the author that nevertheless
    19          the initial inclusion of some erroneous data is
    20          inevitable."
    21                If we go down to the next paragraph, we can see
    22          some examples of "totally unacceptable information".
    23                So far as you know, are those dreamt up or are
    24          those actual examples of unacceptable information that
    25          was in fact found in HIPE?
0036
     1      A.  I do not have personal knowledge of whether they are
     2          actual or fictitious examples, but they look like
     3          examples that probably occur.
     4      Q.  Non-existent hospitals?
     5      A.  Yes.
     6      Q.  So that happens?
     7      A.  Yes.  I mean, you may get an incorrect hospital code or
     8          something like that.  I think that would be what they
     9          mean.
    10      Q.  And males with female diseases is an obvious one?
    11      A.  Yes.
    12      Q.  Speaking generally -- this is probably true of all data
    13          collection, but what probably gives rise to errors in
    14          accuracy such as that?
    15      A.  It may be coding at source; it may be problems in the
    16          transmission of the data from the hospital through
    17          whatever route it goes nationally.  When we receive
    18          information nationally we ask for it to come in in
    19          a common format with common codes and sometimes the
    20          translation to local codes from national ones can create
    21          problems.
    22      Q.  So in the days of HIPE, at any rate, just focusing on
    23          old times before we come up to date, there would have to
    24          be a human being who would actually translate from some
    25          record on to a summary which constituted the HIPE
0037
     1          record, presumably?
     2      A.  It would either be a person or maybe there is a computer
     3          system that translates codes from the local system on to
     4          something that is required for the national purposes and
     5          transmitted.  I would think in the days of HIPE, a lot
     6          of the work was done manually.
     7      Q.  So there would be human input first of all, coding at
     8          local source; secondly translating from local to
     9          national?
    10      A.  Yes.
    11      Q.  Are you able to say anything as to the incentive that
    12          there may have been for any particular individual doing
    13          either the coding or the translation to get it right?
    14      A.  It is difficult to say at the time, because I think it
    15          depends on the ethos at the time.  I mean, we think
    16          there is a great incentive if you can pass the
    17          information back to the Health Service for their own
    18          purposes so that they can see the importance of getting
    19          the coding right, which is a philosophy we are following
    20          at the moment.
    21      Q.  Taking it up to today: today it is all, presumably,
    22          computerised, except at the point of the creation of the
    23          computer record, is it?
    24      A.  Yes.  Currently now information is put on to the Patient
    25          Administration Systems locally in hospitals.  It then
0038
     1          passes through --
     2      Q.  Can I ask you to stop there: an individual, a human
     3          being, puts the information from some source on to the
     4          database?
     5      A.  Yes.
     6      Q.  Are you able to say what training, what pay, what level
     7          of qualification that individual might have and how
     8          prestigious a job it is?
     9      A.  I do not know a lot about the work that is done by
    10          medical records clerks and clinical coding clerks within
    11          the NHS.  I do not know a lot about their work.  We do
    12          work with them to stress the importance of getting
    13          coding accurate by showing what uses we make of the
    14          information.  I do know that when a change was made to
    15          a new coding of international classification of diseases
    16          called ICD 10 there was extensive training of coders in
    17          order that they understood what the new codings were and
    18          the importance of getting them right, so the training is
    19          undertaken for people, but it would probably be better
    20          to talk to the people who, as it were, trained those
    21          people and get that information.
    22      Q.  Today, is there any feedback from the centre to the
    23          individual?  Let me tell you the point I have in mind.
    24          If one has an individual who is, let us say, doing an
    25          examination, there is a feedback in the sense of the
0039
     1          results, so the individual might try harder to get it
     2          right because he gets a better result if he does.
     3                If there is no feedback and what matters is that
     4          the piece of work is completed rather than that it is
     5          completed and completed well, then it may not in fact be
     6          completed as well, or as carefully.
     7                So the difference between those two examples is
     8          the feedback in the example I posed, the examiner's
     9          result; in other situations, it may be a reward for
    10          accuracy by promotion or pay or whatever.
    11                Is there any sort of feedback that you know of to
    12          those individuals who, at the moment, put the initial
    13          codes into the system?
    14      A.  I do not know what sort of feedback there is.  I know
    15          some hospitals have introduced data accreditation, data
    16          audit processes to check the accuracy of the information
    17          held on their computer records, and as part of the
    18          information for health strategy which was published last
    19          year, the aim is to make that mandatory across the
    20          Health Service in the acute sector.  I think it is for
    21          next year, 2000/2001, so there will be a greater
    22          incentive in the future to get it right than in the
    23          past.
    24      Q.  So we can say as at today what is being introduced is
    25          a system which, if you like, may not bring rewards, but
0040
     1          it will bring shame if you get it wrong?  I am speaking,
     2          again, generally.  That is the effect of it, is it: to
     3          make people realise that someone is checking on the
     4          quality of their work?
     5      A.  I think it is checking on data accuracy, and I suppose
     6          by the results that come out, it would be looking at the
     7          quality of the information that goes into the computer,
     8          but that may not be, as it were, reflecting on the
     9          coders themselves; it may reflect on the information
    10          they have in order to do that coding work.  So there may
    11          be a number of individuals involved locally in that
    12          process.
    13      Q.  But the picture I have given you is just one person
    14          coding.  You are pointing out that it could be a number
    15          of people before the information actually gets to the
    16          person who makes the record?
    17      A.  Yes.
    18      Q.  So there is more than one opportunity for what one might
    19          call "human error" to creep in?
    20      A.  Yes.
    21      THE CHAIRMAN:  May I interject and ask a question, just for
    22          my own understanding?  Given that coding involves to
    23          a degree a judgment, how does the concept of accuracy
    24          bear on that?  Does that question make sense?  The
    25          ultimate decision as to what code to ascribe to a degree
0041
     1          involves some judgment.  Can one therefore use the
     2          notion of accuracy there, or is it merely a question of
     3          whether other people agree with the judgment made?
     4      A.  This is an area I do not have a lot of personal
     5          knowledge about, but I think it is an area that this
     6          Inquiry will wish to look at: the degree to which the
     7          particular codings that are of interest to the Inquiry
     8          are actually a matter of subjective or objective methods
     9          when it is actually coded on the computer system.
    10      MR LANGSTAFF:  Where we began this discussion was
    11          looking at males who were recorded as having female
    12          diseases, which is unlikely ever to be a matter of
    13          judgment?
    14      A.  Sometimes you find problems arise with the quality of
    15          the data because you look at more than one field, like
    16          the operation was a hysterectomy and the person was male
    17          and you know that is wrong, but it is not until you
    18          bring those two pieces of information together that you
    19          find out that there is something that is incorrect
    20          there.
    21                As I said, whether that was coding at the hospital
    22          or in something that was translating the information or
    23          transmitting it to the centre, it is something you need
    24          to explore once you find there is a problem.
    25      Q.  The author of the piece that we have on the screen
0042
     1          plainly was aware that there could be unacceptable
     2          information in HIPE.  That must either have been because
     3          of a process of checking which was part of the system,
     4          or by review after the event by the commentator.
     5                Do you know which it was?
     6      A.  No, I do not.
     7      Q.  Are you able to say at this stage, in 1999, whether
     8          there were any, if I can loosely describe them as
     9          "quality checks" to cover both questions of accuracy
    10          and the stability of judgment?
    11      A.  I think the information we have are the quotes that we
    12          found from the documents available at the time.  That is
    13          basically the information we have.
    14      MR LANGSTAFF:  I do not know, Dr Aylin, if you want to come
    15          in and say anything?
    16      DR AYLIN:  There is this issue of accuracy and perhaps
    17          a better word, or another way of thinking about it is
    18          perhaps consistency.  Certainly, there have been studies
    19          looking at accuracy of diagnostic coding and procedural
    20          coding, comparing a particular hospital, the coders
    21          residing in a particular hospital with external coders
    22          and comparing their agreement rates.  The agreement
    23          rates, I am thinking of a study by Dixon et al in 1988,
    24          looked at coding of procedure codes and they showed
    25          between about a 70 and 83 per cent agreement on
0043
     1          procedure codes.  I have this summarised on INQ 2/14.
     2      MR LANGSTAFF:  Can we have on the screen, please,
     3          a summary chart and turn it round, please.
     4      DR AYLIN:  This was a study that compared local coders at
     5          two large hospitals and they had a set of records that
     6          they were both asked to code.  The level of agreement
     7          between the local coders and the external coders varied
     8          by 70 per cent in one hospital and 83 per cent in the
     9          second hospital at the 3 digit coding level, which
    10          I guess you could -- I mean, it depends how positive you
    11          want to look at that.  One might say that is actually
    12          quite good, but it is obviously not perfect.  There were
    13          similar differences in diagnostic codes.
    14      MR LANGSTAFF:  Could I just pause there?  At the bottom of
    15          that sheet there is the question posed, "Perhaps
    16          a higher level of agreement in later years?"
    17                Dixon's report which you are citing is 1998.  The
    18          questions we began with were in relation to this now
    19          very old data set of HIPE.
    20                Can I ask Mr Willmer: is it the general feeling
    21          from where you sit that accuracy has got better
    22          over time, or not?
    23      A.  I think certainly we can say that completeness has
    24          improved of many of the fields that we use, and the
    25          general quality of HES data seems to be improving, but
0044
     1          we do not have, as it were, a rigorous study such as the
     2          one quoted to show that the accuracy of these particular
     3          fields has increased from A to B, or anything like that
     4          to go by.  We just have a general impression that the
     5          completeness and usefulness of the data is improving
     6          over time and it is an evolving process.
     7      Q.  If it is improving over time, if one goes back from 1999
     8          to the last year of HIPE in 1985, one would expect the
     9          level of accuracy of those who were coding at the time,
    10          other things being equal, to be very much less reliable
    11          than data produced today?
    12      A.  It is difficult to judge, I think, at this point in
    13          time.
    14      Q.  But a big question mark one would have to put over it,
    15          would one?
    16      A.  Yes.
    17      Q.  Can you help, again really for general public
    18          consumption, as to the difference between the 3 and
    19          4 digit level of the OPCS codes?  These are procedure
    20          codes, are they?
    21      DR AYLIN:  Yes.
    22      MR LANGSTAFF:  Each code has 4 characters assigned to it,
    23          4 digits?
    24      A.  Yes, it can be up to 4 digits.
    25      Q.  Is it a question of, as it were, refining down so one
0045
     1          begins with the first digit is very general; the second
     2          digit is bringing from the general towards the
     3          particular; the third digit more so; the fourth digit is
     4          absolutely specific?  I am using layman's terms, but do
     5          I have the general picture right?
     6      A.  I think the 4-digit tends to be more specific, but I do
     7          not think it is a completely hierarchical system in that
     8          sense.
     9      Q.  How much does the fourth digit actually matter for
    10          comparative purposes?
    11      A.  It might depend on the areas you are looking at.
    12      DR AYLIN:  Yes.  I think it will not go down to individual
    13          named procedures at a 4-digit level.  It is still not as
    14          specific.  For instance, if you look at the K5 and K6
    15          codes which are operation procedure codes for
    16          transposition of the great vessels --
    17      MR LANGSTAFF:  That is only two digits --
    18      DR AYLIN:  K05 and K06, and then you have a fourth digit at
    19          the end of each one, so K052, K053.  So although these
    20          can be grouped K05 whatever and K056 into operations on
    21          transposition of the great vessels, and they will go
    22          down to a little bit more detail if you look at the
    23          4-digit level, you will not get named procedures like
    24          the switch operation or anything like that.  They are
    25          more generic terms of operations rather than named
0046
     1          operations.
     2      MR LANGSTAFF:  So what is one looking at in terms of the
     3          3-digit level in terms of what is going to be of use to
     4          us?
     5      DR AYLIN:  I think to give an example, the KO5 and KO6
     6          codes, those are the 3 digit levels.  You can look at
     7          this group of operations on a transposition of the great
     8          vessels as a fairly broad group, but it is difficult to
     9          relate the OPCS 4 procedure codings to other coding
    10          systems that might be in existence because they might go
    11          to a finer level or use a different hierarchy or
    12          whatever.
    13      MR LANGSTAFF:  It is plain what is shown on the screen there
    14          is a comparison as to judgment.  It is variability or
    15          a measure of variability of judgment of different people
    16          who do the work.  I was going to say "experts", but
    17          people who do the work is perhaps a better description
    18          from what information we have.
    19      DR AYLIN:  Yes.
    20      MR LANGSTAFF:  If there is a measure of agreement which is
    21          either better than two out of three or better than four
    22          out of five cases, is that -- again, I translate it into
    23          terms which might not be entirely appropriate, evaluated
    24          terms: is it good or is it bad?
    25      A.  It partly depends on the uses to which you want to put
0047
     1          the data.  I think a point which is worth making about
     2          any data source, but perhaps particularly about HES and
     3          probably HIPE, is that you want to make it fit for
     4          a purpose.  Where you have a wide range of purposes, it
     5          may be good for some but not other purposes.
     6      Q.  So if one were looking to compare, as it were, mortality
     7          rates from a particular surgical procedure, conducted in
     8          two separate hospitals, to what extent does it put
     9          a question mark over your conclusions to know that if
    10          you looked at one of those hospitals, you would, by
    11          using two different people looking at the same data,
    12          produce something which could be as much as 1 in 3 of
    13          a difference?  You get, at the 3 digit level,
    14          a variability of some 30 per cent?
    15      DR AYLIN:  May I say there, I think some fields are
    16          better coded than others and better mortality is
    17          probably well coded, but if you want to relate it back
    18          to the operations that they had, then you would have
    19          this problem.  Comparing one hospital with another, you
    20          would have to make a decision as to whether you were
    21          comparing actual differences from mortality rates in
    22          operations or differences in coding procedures.
    23      MR LANGSTAFF:  You have introduced another very important
    24          point by pointing out that mortality is a function of
    25          someone actually recording it accurately, and they may
0048
     1          be wrong on that?
     2      DR AYLIN:  Yes.
     3      MR LANGSTAFF:  So one may begin with, I do not know, 95,
     4          99, 98 per cent accuracy in getting it right that
     5          someone has died, but if you look, of that 98 per cent
     6          there may be a variability of 70 per cent or 83 per cent
     7          between two different coders looking at the same data.
     8          Is that the position?
     9      DR AYLIN:  Yes.
    10      MR LANGSTAFF:  Which makes the overall position rather
    11          less reliable?
    12      A.  I think it goes back to the point the previous people
    13          mentioned: that once you have produced the information,
    14          it raises questions rather than provides answers, and
    15          one of those questions is about the quality of debate
    16          and the quality of the coding behind the information,
    17          and that is something that needs to be explored before
    18          you start drawing conclusions from it.
    19      Q.  So let me move on from that.  We will probably come back
    20          to that and related issues in a moment or two, but
    21          I have been asking you principally about HIPE and we got
    22          into this discussion because we were looking at the
    23          origins of the data and how it came into the central
    24          database.
    25                If one were to concentrate on HES, and again, can
0049
     1          I deal with it in stages, I think from the history of
     2          the database as it is understood.
     3                After HIPE finished, was there any database for
     4          a couple of years which was in regular use nationally?
     5      A.  There was no detailed database like HIPE or HES for that
     6          period.  The department did receive some sort of broad
     7          aggregate returns which gave total numbers of finished
     8          consultant episodes that you have mentioned by
     9          specialty, but nothing down to specific operative
    10          procedures.
    11      Q.  That, I think you explain in your statement to us, was
    12          really because the system was changing and it was more
    13          difficult to change than had originally been
    14          anticipated?
    15      A.  The changes came about as a result of quite
    16          a wide-ranging review of the Health Service's
    17          information in the early 1980s by Mrs Korner, and
    18          I think the changes were found to be quite substantial
    19          for systems.  I think once, as it were, there started to
    20          be delays in the system, there was in some ways a sort
    21          of downward spiral in the perceived importance of
    22          information, because if you cannot get it in and turn it
    23          round and make it timely, people do not see why it is
    24          important to get it in.  So you actually have a downward
    25          spiral in terms of the use and presentation of
0050
     1          information and that is probably what happened in the
     2          late 1980s and early 1990s.  What we tried to do was
     3          actually turn that round and have an upward spiral so we
     4          can get the data back out to the Health Service and
     5          study use quickly, and hence people see it as important
     6          and hence they have a greater incentive to get it right.
     7      Q.  We mentioned a moment ago the OPCS 4 codes.  OPCS 4, the
     8          "4" suggests there may have been a 3, a 2 and a 1
     9          before that.  Am I right?
    10      A.  There was certainly a 3.  I assume there was a 2 and
    11          a 1, but my memory does not go back that far.
    12      Q.  The OPCS codes are procedures as opposed to diagnoses?
    13      A.  Yes.
    14      Q.  Diagnoses had their own coding system, did they?
    15      A.  Yes, the ICD, International Classification of Diseases.
    16      Q.  Currently it is ICD 10?
    17      A.  Yes.
    18      Q.  And it was throughout the rest of the period with which
    19          we are concerned ICD 9?
    20      A.  Yes, for the period you are interested in.
    21      Q.  Although ICD 10 was first promulgated in the early
    22          1990s, it was first used for HES purposes in the 1994 to
    23          1995 years?
    24      A.  I think it was 1995 it came in as the first year.
    25      Q.  So prior to 1995, ICD 9 was diagnosis code?
0051
     1      A.  Yes.
     2      Q.  So far as the procedure code, what was actually done in
     3          response to the particular diagnosis, one would look for
     4          OPCS 4, obviously given Dr Aylin's slide, some of the
     5          time, and OPCS 3 for some of the time?
     6      A.  I think OPCS 3 was probably connected with the latter
     7          half of HIPE, so I think you would probably only be
     8          interested in OPCS 3 if you were looking at the HIPE
     9          data.
    10      Q.  Did the changeover happen in 1988 to 1989?
    11      A.  It was in the late 1980s that the OPCS 4 was
    12          introduced.  As I have mentioned in my statement, there
    13          were a number of revisions to that which added
    14          complications, I think, for the people coding.
    15      Q.  What you have said, I think is that really, when HES
    16          first developed and began, it started off using OPCS 3,
    17          OPCS 4 began and there was what you described as an
    18          unstable mix of the two, partly because 4 came in during
    19          the course of a year when data was collected, so one of
    20          the problems with data for that year, the late 1980s,
    21          was the fact that you had two different sets of criteria
    22          for establishing what procedures you had used?
    23      A.  Yes, and I think in some areas OPCS 4 was subject to
    24          further revision during the year and unfortunately some
    25          of the same codes were used for old procedures and new
0052
     1          procedures, which again added complications.
     2                For the late 1980s, the information is not very
     3          good for operative procedures.
     4      Q.  So if we go back to the person putting the data in who
     5          would have been trained on OPCS 3, they would think in
     6          those terms, might not be familiar with OPCS 4 and might
     7          therefore tend to make errors more easily?
     8      A.  I would not like to say what was going through the minds
     9          of coders about 10 or 12 years ago, but certainly, if
    10          a coding system changes, that can obviously create
    11          problems for those feeding the information into systems.
    12      Q.  Is there evidence that when a coding system changes,
    13          there are problems with data quality for a while?
    14      DR AYLIN:  I think, just thinking about the way that this
    15          change happened, HIPE was based on hospital activity
    16          analysis data which was provided by each hospital at
    17          100 per cent level to the regions, and then the 10 per
    18          cent sample was given to HIPE.
    19                After HIPE finished in 1985, the hospital activity
    20          analysis still continued into 1986 and 1987 and I think
    21          it was adapted to provide information for HES.  So the
    22          system was still there, it was a gradual transformation
    23          at different times in different areas, so I think it is
    24          quite likely that there would have been problems
    25          implementing new codes, and I think this would be
0053
     1          a common problem in introducing new code systems, and
     2          new computer systems as well, that different hospitals
     3          would be using different systems of coding.
     4                What also happened, an initial coding system,
     5          OPCS 4 was introduced I think in April 1987, but there
     6          are a number of revisions of that coding system which
     7          went up to 1990, I think was the last consolidated
     8          revision of OPCS 4.  There were different versions of
     9          OPCS 4 being used in different hospitals around the
    10          country at that time.  So it does not make it a very
    11          reliable source of information.
    12      MR LANGSTAFF:  Some of what you are saying, I think, is
    13          speculation: "people would have behaved in this way
    14          because it is reasonable to think that is what would
    15          have happened".  So really what you are telling us is
    16          that you do not know of any definitive research which
    17          shows that they did, but it is a reasonable supposition.
    18      DR AYLIN:  I think it is a reasonable supposition.  I think
    19          it would be relatively easy to look back at the very
    20          early HES data sets and see whether there was this
    21          mixture of OPCS 3 and OPCS 4 codes.  I do not know
    22          whether that would be possible.  I assume that is what
    23          this statement is based on.
    24      A.  As much as what may be confusion for the individual
    25          coder, obviously if individual hospitals were moving at
0054
     1          different paces in their implementation, they may each
     2          think they were doing the right thing, but the right
     3          thing may be different in different places.
     4      Q.  If one is looking at the 1987/88/89/90 period, there
     5          would be, if you use the expression in the current
     6          context, a very big "health warning" over it?
     7      A.  Yes.
     8      DR AYLIN:  Yes.
     9      MR LANGSTAFF:  The disadvantages of HES as a system, then,
    10          from 1987 to 1990, let us say, are they then such,
    11          because of this confusion over the coding -- I say
    12          "confusion", you know what I mean, describing the
    13          process you have been talking about -- that really we
    14          should not place reliance upon it for the purposes of
    15          this Inquiry.  Does it go that far?
    16      DR AYLIN:  I think there are not only the questions about
    17          the coding, and as you say there is a big health warning
    18          there, but also the question of coverage.  If you look
    19          at the national coverage, comparing your HES records
    20          with the KP70 returns -- I do not know whether you want
    21          to get into this now?  The KP70 returns were based on
    22          a paper system.  I think essentially in the early days
    23          certainly they were based on head counts by the night
    24          nurse going through the hospital in the middle of the
    25          night, and these are compared to the electronic records,
0055
     1          the HES records.
     2                If you look back at 1987/88, you have a coverage
     3          of about 75 per cent, a national coverage, so if you
     4          compare your paper returns with your electronic returns,
     5          you are only catching three quarters of them on HES.
     6      MR LANGSTAFF:  Do you have a slide to show us this?
     7      MR WILLMER:  It is in report document 7.  I do not know
     8          whether that can easily be searched for, but it is near
     9          the back of the statement.
    10      THE CHAIRMAN:  Mr Langstaff, while we are looking for
    11          that, can I, just as a matter of housekeeping, make sure
    12          that the stenographers are able happy to deal with the
    13          innovation of today's hearing, voices coming thick and
    14          fast from various quarters?  I need to clarify that,
    15          because others in other places would wish to read our
    16          evidence, and I want to make sure that you are
    17          comfortable in the process.  I receive a nod and I am
    18          grateful.
    19      MR LANGSTAFF:  If we could have WIT 189/110 on the
    20          screen.
    21      DR AYLIN:  I am sorry, I was talking about diagnostic
    22          coverage there, quoting those figures of 75 per cent.
    23      MR LANGSTAFF:  I thought you had been talking about KP70.
    24      DR AYLIN:  For KP70 it is actually 1987/88 coverage.
    25      MR LANGSTAFF:  Let us concentrate, since we have it on
0056
     1          the screen, on what this tells us and shows us.  You
     2          have introduced the idea of KP70 for a moment.  KP70 has
     3          been called a "gold standard", has it not?
     4      A.  Yes.
     5      Q.  Does the "KP" stand for anything?
     6      A.  I think the K stands for Mrs Korner.  I cannot remember
     7          what the P stands for now.  There is a whole series of
     8          returns introduced as part of the Korner review and they
     9          would have an initial K.
    10      Q.  So perhaps there is no magic in the initials but that is
    11          what we have to work with.  KP70 was actually a physical
    12          head count, was it?
    13      A.  I am not sure whether it was a head count, but it
    14          certainly was an aggregate set of information to tell us
    15          how many finished consultant episodes there were for
    16          each -- at that time it would have been by District
    17          Health Authority, by specialty, which was seen as
    18          a baseline count, if you like, some sort of "gold
    19          standard" against which we could compare the number of
    20          records that were coming in on HES.
    21      Q.  One could only do the comparison if one knew that the
    22          KP70 figures were actually separately compiled from the
    23          HES figures?
    24      A.  Yes.
    25      Q.  Does one know that?
0057
     1      A.  One does not know exactly how people did them, but
     2          sometimes people would run off the information from
     3          similar systems to HES, but then could do separate audit
     4          checks.  Because it is an aggregate number, you could
     5          look at it and say "I had 3,000 finished consultant
     6          episodes in the last year; yes, that looks about
     7          right".  If it is only 1,500, you know you have only
     8          50 per cent of your records there.  With something like
     9          HES you cannot easily check because you are sending
    10          a huge amount of information on individual records by
    11          computer.
    12                So people may or may not have run off the
    13          information from a similar system to HES, but the idea
    14          of the aggregate return as a standard is that that is
    15          something you can look at and eyeball and say "It looks
    16          about right", or does not.
    17      Q.  So you can pick up the obvious error?
    18      A.  People should hopefully pick up the obvious errors on
    19          it.
    20      Q.  Is there a definition somewhere of the KP70, because --
    21      A.  Yes.
    22      Q.  There is a distinction in the way you have approached
    23          it, talking about it showing finished consultant
    24          episodes.  Dr Aylin has described it as a "head count"?
    25      DR AYLIN:  I was trying to make a point that it is a -- in
0058
     1          the early days, I assumed it was a manual count, the
     2          computer systems were not involved in counting it, but
     3          I think nowadays it is done on computer systems in
     4          a number of hospitals.
     5      MR LANGSTAFF:  Can we take it in stages?  In the older
     6          days, do you actually know how it was done?
     7      DR AYLIN:  The description I have heard of it is that it
     8          is much more of a manual system than an electronic
     9          system.  That is where you can make your comparison
    10          between the HES data and the KP.
    11      MR LANGSTAFF:  Can you help us, because there is a slight
    12          difference of information you both have about it, as to
    13          what the manual system consisted of?
    14      A.  I think the system for actually producing KP70 was
    15          different in different places.  I think Mr Aylin is
    16          right, there were manual systems in many places to do
    17          it.  What actually happens is that in terms of national
    18          standards, we asked the NHS for information, but the
    19          actual method of compiling it may not necessarily be set
    20          out for people, so it is up to people locally to decide
    21          the best way to actually produce that information.
    22      Q.  So we have got to the stage that KP70 constitutes some
    23          form of independent check, albeit broadbrush check --
    24      A.  Yes.
    25      Q.  -- I think you are agreeing with that, of the general
0059
     1          accuracy of the picture, completeness of coverage --
     2      A.  Yes.
     3      Q.  -- of the HES data?
     4      A.  Yes.
     5      Q.  Presumably, if one looks at the figures that you quote
     6          here on page 110, a percentage like 88 per cent, even
     7          though one is dealing with large numbers, is actually
     8          quite specific.  If we look to the independent count of
     9          finished episodes, KP70, we have 8,052,319, so someone
    10          has been able to go to the last one and say "This is not
    11          broadbrush, we have a figure we can put on KP70 here".
    12                Is that an actual accuracy, or is it a spurious
    13          accuracy?  Can you comment?
    14      A.  I mean, I think probably going down through the last
    15          consultant episode, perhaps either on a HES or KP70
    16          would introduce spurious accuracy.  I think the key
    17          thing from the Inquiry's point of view is to perhaps
    18          note the 88 per cent as something which actually could
    19          have been a lot worse, but at least we did have 88 per
    20          cent of the records in, but was an indication to us that
    21          we needed to get the complete numbers of records in as
    22          far as we could, and the figures for later years
    23          actually show that the overall coverage of HES compared
    24          with KP70 has been pretty good.
    25      Q.  So again, looking down the lists that we have on the
0060
     1          screen, there are two percentage columns -- one
     2          percentage coverage, one percentage diagnostic
     3          coverage.  Looking at them separately: percentage
     4          coverage obviously improves dramatically in the sense
     5          that KP70 and HES begin to correspond much more exactly
     6          up until -- between 1987 and 1990, when it gets up to
     7          98 per cent, between there or thereabouts, certainly
     8          within a 5 per cent margin of error of 100 per cent ever
     9          since?
    10      A.  Yes.  I mean, there may be variations between regions,
    11          as I think Mr Aylin brought out in one or two of his
    12          slides, but in general, the overall coverage of HES from
    13          about 1989/90 onwards has been good.
    14      Q.  Talking about regions, if we go to 189/113 and just turn
    15          it sideways, in 1991 to 1992 we can pick up South
    16          Western Regional Health Authority there, with its
    17          630,389, and there the correspondence is pretty good,
    18          2 per cent margin of error.  I say "pretty good"; is it
    19          pretty good?  Would one expect that figure to be 100 per
    20          cent, or is 102 per cent actually pretty good?
    21      A.  In theory, I suppose, it should not be above 100 per
    22          cent, but it highlights a problem we have on HES
    23          sometimes that actually we receive duplicate records, so
    24          sometimes regions have sent us the same records twice,
    25          but because we cannot separately identify the records,
0061
     1          you get two lots of them.
     2                So this sort of analysis is as good for us at
     3          spotting the duplicate record as it is for spotting
     4          under-reporting.
     5      Q.  The change in completeness appears to come about at
     6          about the same time as the purchaser/provider split.  Is
     7          this possibly because of any impact of contracting in
     8          the Health Service upon the need to produce better
     9          quality data so that the numbers contracted for could be
    10          followed through with the purchasers?
    11      A.  Certainly, the overall coverage is at about what one
    12          might term an acceptable level, 98 per cent or around
    13          that mark, from 1989 to 1990 onwards, which is prior to
    14          contracting.
    15                If one goes back to the previous table, 110, the
    16          percentage of diagnostic coverage did seem to do
    17          a further step improvement, that is the far right-hand
    18          column, where we notice an improvement from 1987/88 from
    19          75 per cent up to 88 per cent in 1990/91, but from
    20          1991/92 onwards when contracting was introduced it went
    21          up to 95 per cent, so that may have been a step change
    22          as a result of the contracting being introduced.
    23      Q.  The figure appears to be derived from the column
    24          immediately before it, the number of HES records without
    25          a usable primary diagnosis, and the informed speculation
0062
     1          would be, would it, that you need a usable primary
     2          diagnosis if that is what people have contracted for?
     3      A.  Yes.
     4      Q.  If there is no contract involved, then the need is not
     5          as great, so one might reasonably hypothesise that the
     6          change to purchaser/provider meant that people took
     7          greater care over the records.
     8                Is that a reasonable supposition?
     9      DR AYLIN:  I think so, yes.
    10      A.  Presumably if somebody goes into hospital, there is some
    11          diagnosis associated with why they have gone into
    12          hospital.  So they should all have a primary diagnosis,
    13          if not a hospital procedure.
    14      MR LANGSTAFF:  The incentive there is for people putting
    15          information into the system to get it right?
    16      A.  Yes.
    17      Q.  That is what the purchaser/provider split made
    18          a difference to, is the suggestion.
    19      A.  It is a possibility, yes.
    20      Q.  Going back for a moment, if I can, to 113 and turning it
    21          sideways, we picked up that the South West Regional
    22          Health Authority had a 102 per cent correspondence.  Its
    23          HES was showing rather more patients than its KP70 count
    24          in 1991/92.  If we look at 1992/93, it is 100 per cent.
    25          If we turn over the page, we will find it is 100 per
0063
     1          cent again in both the KP70, the HES and the diagnostic
     2          coverage and the following year, 101 and 99
     3          respectively.  Just completing the picture so far as you
     4          are able to show us, page 115, 101 and 98, that is the
     5          regional office, now?
     6      A.  Yes.
     7      Q.  Where the difference may be because the numbers are
     8          larger and obviously there are less regional offices
     9          than there were Regional Health Authorities?
    10      A.  Yes.  The figure of 1995/96 will include probably what
    11          was most of Wessex.
    12      Q.  Yes, and Wessex's results are slightly more variable
    13          throughout the period.  What those figures would suggest
    14          is that there appears to be, in the South Western
    15          region, very great correspondence, viewed at over
    16          650,000 admissions, between the numbers counted by HES
    17          and the numbers counted independently by KP70, and an
    18          appearance that of the records coming through, there is
    19          a diagnosis recorded for each and every episode?
    20      A.  Yes.
    21      Q.  That would tend to give one a bit of confidence, would
    22          it, at least as a starting point?
    23      A.  In overall terms, yes, it would give some confidence.
    24      MR LANGSTAFF:  You are agreeing with that?
    25      DR AYLIN:  I am agreeing with that.
0064
     1      MR LANGSTAFF:  If one looks at the problems which you
     2          identify in your statement to us, and the difficulties
     3          with the HES data, you, I think, have pointed out
     4          a number of things.  You have indicated that there may
     5          be regional variations and that would be important if
     6          one was trying to compare one region with another,
     7          presumably?
     8      A.  Yes, and a variation at regional level, or a figure that
     9          varies markedly from 100 per cent at regional level may
    10          mean at Trust level there are even greater variations,
    11          because you could have some Trusts that have sent in
    12          very little information at all, whilst a lot of them
    13          have achieved 100 per cent.  So variations below
    14          regional level may be even greater, so it tells you
    15          something about that as well.
    16      Q.  So that if, for instance, one has 100 per cent
    17          correspondence, it would argue that most Trusts have
    18          sent in full data; if it is 98 per cent, the two per
    19          cent may actually be a very bad Trust; it may be
    20          responsible for virtually all of that?
    21      A.  Yes.
    22      Q.  That is the point.  Do we have any evidence that
    23          that inequality of data actually happened?
    24      A.  Certainly when we did analyses in the early 1990s,
    25          because we were trying to decide how to make sense of
0065
     1          what was coming in.  We did find greater variability at
     2          the Trust level than we did at regional level, which led
     3          us to believe that it was safer to use the information
     4          at regional level rather than down at individual Trust
     5          level for a lot of purposes at that time.
     6      Q.  You make the point that the provision of data was not
     7          compulsory?
     8      A.  There was no legal obligation to provide it; it was part
     9          of what has been termed over the years a "mandatory"
    10          set of information which all parts of the NHS should
    11          submit.  There was no, as it were, legal statutory
    12          requirement.
    13      Q.  Does that actually matter if we see that, taken on the
    14          whole, there is something like 98, 100 per cent
    15          correspondence?  It is not really a problem for us, is
    16          it, if by and large it seems that everyone has actually
    17          provided the data?
    18      A.  That shows us that everyone has provided the number of
    19          records we would expect, and the vast majority have
    20          added diagnosis in.  It does not tell us anything about
    21          the other 40 or so data items that we collect on HES and
    22          the quality of those.  That is in many areas more
    23          difficult to assess until you start to use the HES
    24          data.  It is getting back to the male hysterectomies.
    25          You do not pick those up until you start to run analyses
0066
     1          of the system.
     2      Q.  In terms of usability for our purposes, moving from that
     3          particular defect to utility, can you help at all with
     4          what we need out of the data and whether it will provide
     5          it to a sufficient degree, it not being compulsory to
     6          supply all the data?
     7      DR AYLIN:  Given the problems with data quality, there are
     8          specific problems involved in actually using HES
     9          information for the kind of information that we want to
    10          find out, and I have summarised in INQ 2/15 some of the
    11          problems in actually analysing HES.
    12                The first point that we brought up was this
    13          problem of episodes versus spells.  The next page, 16,
    14          has a summary of episodes and spells, and perhaps
    15          illustrates some of the problems, although it does not
    16          come across very well.  You can have a number of
    17          episodes in a spell.  If you have an operation in
    18          a first episode within a stay or a spell in hospital and
    19          you want to find out what happened to that person,
    20          whether they died, whether they were sent home or
    21          whether they were transferred to another hospital, you
    22          have to be able to link these episodes together in order
    23          to be able to get an outcome for the end of the hospital
    24          spell.
    25                There are problems with linking these --
0067
     1      Q.  Can I stop you there?  What you are saying is --
     2          underneath the black bar at the bottom, incidentally, on
     3          the screen, if you have a photocopy, you would detect
     4          the words "episode 1, episode 2" in the middle and
     5          "episode 3" to the right, so that people watching can
     6          follow it a bit more easily.
     7                Are you saying that, taking episode 1, that is
     8          a finished consultant episode because somebody comes in
     9          under the care of a surgeon, let us say it is
    10          orthopaedics, and because it is quite obvious in the
    11          hospital he has a cardiac problem he is transferred to
    12          a cardiologist?
    13      A.  Yes.
    14      Q.  So that is episode 2.  If the result of the
    15          cardiologist's intervention is that he goes for surgery
    16          in a third discipline and then dies, is it the position
    17          that each of the first two finished consultant episodes
    18          will not record death as an outcome?
    19      A.  They will not have it directly on the record, yes,
    20          because we will get separate records in for each of
    21          those three episodes that are shown there.
    22                What we have started doing now is what we call
    23          "fuzzy matching the records": we cannot identify the
    24          person, but identify whether it is the same person in
    25          episode 1, 2 and 3, and hence, bring the records
0068
     1          together.  That forms the basis of some recent
     2          publications, clinical indicators which came out
     3          a couple of weeks ago.
     4                The basic building bricks of HES as they came in
     5          and were available, particularly in the early 1990s, as
     6          Mr Aylin described, they were the finished consultant
     7          episodes, the separate building bricks shown there of
     8          episode 1, episode 2 and episode 3.  You would not
     9          necessarily know, if someone died as a result, in
    10          episode 3, that they were linked with episode 1.  It
    11          would be a separate record that came in, and we would
    12          not be able to identify that they were the same person.
    13      Q.  You quote a percentage in your evidence to us of the
    14          number of people who have more than one finished
    15          consultant episode in a spell?
    16      A.  Yes.
    17      Q.  I think it is 5 per cent?
    18      A.  Yes.
    19      Q.  How do you know that it is 5 per cent if, looking
    20          back at the data, there is no way of saying it is the
    21          same patient in each of these episodes?
    22      A.  We do get a field of information on the record, actually
    23          called "episode order" and it tells us whether it is the
    24          first in a spell, the second in a spell, the third or
    25          a subsequent one.  So we can count up all the episodes
0069
     1          and see how many first episodes we have and see how many
     2          people would have subsequently had second episodes.
     3      Q.  Looking back historically at the data, if it is 5 per
     4          cent -- I do not know whether you want to ask any
     5          questions of Mr Willmer about the percentage here?
     6      DR AYLIN:  I think that 5 per cent would refer to all
     7          episodes, but I think there would be differences between
     8          different specialties and probably between different age
     9          groups and things like that, and perhaps within
    10          different hospitals and different specialist units, so
    11          that would be an overall figure.  But you would expect
    12          perhaps some specialties, if you go into a particular
    13          specialty, you are more likely to be transferred to
    14          another specialty or another consultant than other
    15          specialties, so I think there would be differences.
    16      A.  I think that is absolutely right.  If you look at basic
    17          elective admissions for, say it was a hip operation or
    18          something like that, the ratio is quite low.  If you
    19          start to look at emergency admissions, if you start to
    20          look at the very elderly, then the rates start to rise
    21          quite high, so there are quite wide variations.
    22                It is one of the things you have to bear in mind
    23          when you are using HES as to whether it is likely to be
    24          a significant problem or not.
    25      MR LANGSTAFF:  So can either of you say, or would you
0070
     1          like to come to a view to help us what the position is
     2          likely to be when you are looking at the specialty of
     3          cardiac surgery, paediatric cardiac surgery, which in
     4          essence is unlikely to be anything other than the
     5          rectification of a congenital heart problem, one might
     6          have hypothesised that it would be admitted as such?
     7      A.  I do not know what the figures are, but we could
     8          probably run something to find out what it is.
     9      DR AYLIN:  I do not know the figures offhand.
    10      MR LANGSTAFF:  But between you, you think that can be done?
    11      A.  We can run the HES data by specialty and look at the
    12          ratio by specialty.
    13      Q.  Because if one were to show that the 5 per cent overall
    14          figure, although it may be composed of a greater
    15          percentage in some specialties than others, was a much
    16          lesser percentage in paediatric cardiac surgery, then it
    17          would not be a problem for us, whatever the problem may
    18          be elsewhere?
    19      A.  Yes.
    20      DR AYLIN:  Yes.
    21      MR LANGSTAFF:  You were taking us, Dr Aylin, through the
    22          previous page.
    23      DR AYLIN:  Yes, the list of problems in analysing HES
    24          data.  The second point, that there is no actual patient
    25          identifier on the records, there is no patient
0071
     1          identification number, is a problem in both looking at
     2          joining these episodes to spells, and also not only
     3          counting the number of spells and their outcomes that
     4          are going through a particular unit, but counting the
     5          number of individuals going through a particular unit.
     6          It is very difficult to count individuals, the number of
     7          children or people that go through a particular unit,
     8          without this identifier.
     9                So that is another issue which you have to take
    10          into consideration when looking at the data.
    11      A.  I agree with that.  It was actually a specific
    12          recommendation of the review in the 1980s that the
    13          Department of Health should not have identifiers, purely
    14          for reasons of confidentiality, and it was not seen as
    15          important at the time, but in order to do the type of
    16          analysis that I think the Inquiry is interested in and
    17          Mr Aylin has mentioned, you want to be able to
    18          differentiate between individuals even if you do not
    19          want to identify them.  You want to see what happens to
    20          the same individual over a period of time.  And actually
    21          from 1997, I think it is, we are getting a new NHS
    22          number, so in future analyses of HES we will be able to
    23          do that, subject to maintaining confidentiality, which
    24          is a very firm thing we have to do all the time,
    25          clearly.  But for the period of the Inquiry, we would
0072
     1          have to rely on the "fuzzy" matching process which
     2          I have mentioned previously to identify or hopefully
     3          identify the same people.
     4      Q.  Just exploring how great a problem this is for us in
     5          terms of the utility of it, we dealt with the question
     6          of episodes and spells, which is one spell consisting of
     7          1, 2, 3, 4, 5, 6, however many, different episodes under
     8          different consultants.
     9                What the patient identifier would resolve is more
    10          than one spell, is it?
    11      DR AYLIN:  If a patient was admitted into hospital, had
    12          an operation, was sent home and then came back some time
    13          later and died, if we had a patient identifier, we would
    14          be able to pick up that outcome, but it is very
    15          difficult at the moment without a patient identifier.
    16      MR LANGSTAFF:  So, to put a sort of character to it, if
    17          one had a patient who came in for surgery, the surgery
    18          was not particularly successful but the patient was able
    19          to go home -- we had such an example earlier in one of
    20          the parents who gave evidence before us -- but
    21          subsequently then required further surgery which may or
    22          may not have been a complication of the first surgery,
    23          as well as of the condition, one would not pick it up
    24          from this data.
    25      DR AYLIN:  It would be difficult without a patient
0073
     1          identifier.  You could use data, birth, sex and
     2          postcode, but it might be that that person has moved
     3          between the two admissions or there might be error on
     4          inputting the postcode or the date of birth, so it would
     5          be more difficult to pick up different spells within the
     6          same children.
     7      Q.  But if one had a view from, let us suppose, clinical
     8          experts that in general, with cardiac surgery in
     9          infants, a patient was unlikely to come back within
    10          a matter of years, except for those operations where,
    11          necessarily, there would have to be further operations
    12          conducted -- one thinks of the Sennings operation as one
    13          we have heard and will hear of -- if the general view
    14          was that it was an operation which was unlikely
    15          therefore to bring someone back because, in crude terms,
    16          the outcome was likely to be survival for a long time
    17          or, sadly, death, then if that were the general
    18          evidence, this would not be a particular problem for us?
    19      DR AYLIN:  No.  You would have to identify what you
    20          wanted to look at.  If you were just interested in
    21          deaths within a spell, then that would be easier to look
    22          at than if you wanted to look at people coming back and
    23          being readmitted.
    24      Q.  Again, if one were making a comparison using the
    25          national database to make a comparison between, say,
0074
     1          Bristol and other hospitals or another hospital, then if
     2          in general terms within the same specialty surgery was
     3          conducted in the same way, on that assumption, this
     4          would not be a problem: you would still be able to make
     5          the comparison across crude rates of mortality?
     6      DR AYLIN:  Yes.
     7      A.  If you compiled the information in the same way, and you
     8          could make an assumption about the way the care is
     9          provided, then that would not be so much of a problem.
    10      Q.  But the conclusion would be falsified if the assumption
    11          is wrong, obviously?
    12      A.  Yes.
    13      Q.  One further matter before we have a break for lunch.  It
    14          is the next item that you have on your slide.  What is
    15          the point there?
    16      DR AYLIN:  We have already talked a little bit about the
    17          limits of the OPCS 4 coding system and the inaccuracies
    18          or the differences in the way that they are coded and
    19          disagreement between different hospitals, and if we
    20          wanted to look at comparing one unit with the rest of
    21          the country or with other specialist centres, we would
    22          have to think about this.  Certainly we would not want
    23          to be using data before 1991, say, because of thes