logo_2Cy.gif
Home About us Media Research Consultancy Training Site map Contact

Home » Anti-doping

The World Anti-Doping Agency (WADA) has been created in 1999 to improve and harmonize the global fight against doping.

Here it is stressed that, in our opinion, this timely and commendable initiative overly relies on the questionable science of anti-doping laboratories.

logo_WADA.gif

This caveat also applies to the biological passport, which has recently been introduced by the UCI, see presentation below. N.B. All these claims are substantiated in peer-reviewed articles.

logo_UCI.gif

Questionable science yields sub-optimal procedures and poorly defined decision rules. As a result, the risk of a false positive declaration is often unknown. Note, however, that WADA should currently also be prepared to accept an unnecessarily high risk of a false negative declaration. Since the false negative problem is obviously difficult to assess in practice, criticism will be illustrated with the former deficit.

Presentation

For practical examples of potentially flawed science, see:

  • N.M. Faber
    A think tank for anti-doping research
    Play the Game 2009
    Download (icon_ppt.gif=1,079 kB)

N.B. Researchers themselves question the scientific background of the IRMS test failed by Floyd Landis, see p.2 of internal document (icon_pdf.gif=118 kB).

Play the Game 2009.jpg
A 'worked example' from the Ghent laboratory

Now follows the treatment of a real case that should even embarrass a lay person. Interestingly, in a 2008 interview the director of this laboratory, Frans Delbeke, noted that he was proud ('fier') that his laboratory ranked second-best in the world after Cologne.

Briefly, the suspect sample is declared 'positive' because it sufficiently matches a reference sample for three (3) signals. Since measurement results cannot be obtained without negligible uncertainty, a match range is calculated for the reference signals. The procedure is summarized in:

Calculation of match between reference and suspect sample.

Ghent example.gif


N.B. As a first step, all signals for a sample (reference/suspect) are scaled to the largest one, the so-called base peak. The scaled signal for the base peak follows as 100 (%) for both samples - a fixed number without a match range. The laboratory documentation package gives a nonsensical range 85-115, but this is not repeated here.

For the remaining ones, the match range depends on scaled signal size, and is given in a WADA technical document (icon_pdf.gif=129 kB):

  • if scaled signal < 25
  • if 25 ≤ scaled signal ≤ 50
  • if scaled signal > 50
  • match range is ±10 (fixed size)
  • match range is ±25% of the scaled signal (relative size)
  • match range is ±15 (fixed size)

One observes that the relative-sized match range for signal 2 is calculated after rounding the scaled signal to integer, which is obviously the incorrect order.

Although the 'selected procedure' did not affect the decision in this particular case, it should be clear that it is not indicative of the sophistication expected in an anti-doping laboratory. In our opinion, it is nothing less than either thoughtless or careless.

Finally, it is noted that WADA's decision criteria lack a rigorous statistical underpinning, which can also be easily understood by a lay person:

  • The match range is the same in size, regardless whether the results are obtained from peak heigths or areas, i.e. different quantities with entirely different uncertainties.
  • The criteria do not provide a smooth transition for match range at scaled signal values 25 and 50. From a measurement science point of view, it does not make any sense at all to attach a different uncertainty to the indistinguishable signals 24.999 and 25.000 (say), namely 10 and 6.25 respectively.
  • The number of signals deemed sufficient for identification for all possible target compounds, namely three (3), is based on results obtained for a single model compound (diethylstilbestrol) and a single technique (EI-GC-MS), as stressed in:

    • R.A. de Zeeuw
      Substance identification: the weak link in analytical toxicology
      Journal of Chromatography B, 811 (2004) 312

It cannot be denied that procedures are carried out in either a thoughtless or careless manner.

General critique

Questionable science may affect critical stages of a doping test, for example:

  1. The processing of the analytical data
    Instruments deliver large amounts of data that need to be processed to arrive at an analytical finding. This task is commonly executed using commercial software that is not state-of-the art from a scientific point of view. It is simply embarrassing to see that this software (e.g. used for standard operations such as background correction) often lags behind by more than 20 years.

  2. The assessment of an analytical finding to be 'adverse'
    The analytical finding needs to be assessed as 'normal' or 'adverse'. Often, decision criteria are encountered that lack a rigorous statistical underpinning. N.B. For mass spectrometric detection, one of the work horses of anti-doping research, the statistical solution has been available in the scientific literature for more than a decade, but remains to be ignored (see 'worked example' for an illustration).

  3. The conclusion that an 'adverse analytical finding' is due to doping
    Anti-doping researchers often compare elite athletes with 'normal controls', which, as has been pointed out by others, is a basic flaw. Elite athletes, e.g. by their different genetic make-up, do not necessarily come out as 'normal' in a test although being 'clean'.

For written accounts of our scientific criticism, see:

  • N.M. Faber and R. Boqué
    On the calculation of decision limits in doping control
    Accreditation and Quality Assurance, 11 (2006) 536-538
  • N.M. Faber
    The limit of detection is not the analyte level for deciding between "detected" and "not detected"
    Accreditation and Quality Assurance, 13 (2008) 277-278
  • N.M. Faber
    Doping: using flexible criteria could reduce false positives
    Nature, 455 (2008) 1176
  • N.M. Faber
    Regulations in the field of residue and doping analysis should ensure the risk of false positive declaration is well-defined
    Accreditation and Quality Assurance, 14 (2009) 111-115
  • N.M. Faber
    Towards more fair and effective doping tests based on chromatography with mass spectrometric detection
    Accreditation and Quality Assurance, 14 (2009) 223-226
  • K. Faber
    On the unacceptable reporting of results in doping control
    Accreditation and Quality Assurance, 14 (2009) 341
  • N.M. Faber
    Validation of specificity in doping control: problems and prospects
    Accreditation and Quality Assurance, 14 (2009) 399
  • N.M. Faber
    Pleidooi voor een eerlijk proces in dopingzaken
    Nederlands Juristenblad, 84 (2009) 2880-2883
  • K. Faber and M. Sjerps
    Anti-doping researchers should conform to certain statistical standards from forensic science
    Science and Justice, 49 (2009) 214-215
  • N.M. Faber
    Floyd Landis: an unsafe conviction, regardless of the quality of the data
    Clinica Chimica Acta, 411 (2010) 117-118
  • N.M. Faber and B.G.M. Vandeginste
    Flawed science 'legalized' in the fight against doping: the example of the biological passport
    Accreditation and Quality Assurance, 15 (2010) 373-374
  • N.M. Faber
    Het biologisch paspoort: veelbelovende opsporingstechniek of juridisch wankel?
    Tijdschrift voor Sport & Recht, 2010, 2: 79-89
  • K. Faber
    Failing intuition and lying statistics have condemned Claudia Pechstein
    Deutsche Zeitschrift für Sportmedizin, 61 (2010) 245-246
  • M.G Burke and K. Faber
    A plea for thresholds, i.e., maximum allowed levels for prohibited substances, to prevent questionable doping convictions
    Science and Justice, 52 (2012) 199-201
  • N.M. Faber
    Reflections on a recent report of the World Anti-Doping Agency
    Accreditation and Quality Assurance, 19 (2014) 139