Electronic discovery company Applied Discovery and KPMG are amongst those who have recently partnered with Equivio to integrate Equivio>Relevance into their existing eDiscovery applications. These two recent announcements give me an opportunity to return to the subject of software-assisted document review using what is generally known (but see below) as predictive coding. Recent discussions with some lawyers have shown scope for fundamental misunderstandings about what this kind of software does, and a look at the explanations produced by Applied Discovery and KPMG, as well as those of some other players in this space, may help.
My primary objective is clarity for those who come across the names and the terminology but are not necessarily clear as to the functionality being offered or its purpose. There seem to be three aspects which some lawyers find difficult about this kind of software, each of which is worth challenging. These are the following:
- They think it is being sold as a substitute for human review of the documents which are to be disclosed. That is not its purpose. It provides (amongst other benefits) a means of identifying irrelevant (or less relevant) documents or, to put it the other way round, a way of prioritising documents so that those identified provisionally by the software as the most relevant or most important are brought to the top. The important word here is “provisionally”, with its clear implication that the lawyers get every opportunity to double-check both what has been ranked as important and what has not.
- It is the subject of debate about judicial acceptability which, again, follows from a misunderstanding both of what it does and of what the courts expect. You may care to read an article called Judge Peck and Predictive Coding at the Carmel eDiscovery Retreat which reports on what I describe as “one of the clearest statements yet by a judge that the use of new technology like predictive coding is an acceptable way to conduct search”. Judge Peck’s immediate context may have been the US Federal Rules of Civil Procedure, but the principles which he covers apply anywhere, and the UK’s Senior Master Whitaker has said much the same both in conference speeches and in his judgment in Goodale v Ministry of Justice. Master Whitaker has also emphasised repeatedly the first point made above – that none of this software is intended as a substitute for human review of documents to be disclosed; he has heard this reaction as well.
- There is a paradox inherent in the nature of lawyers. They crave certainty but prefer it expressed in words rather than in numbers (a point which comes up in my recent report about the risk management function of corporate counsel). The statistics-based evidence of an application’s accuracy may underpin a decision to use this type of software, but lawyers still prefer the evidence of their own eyes. In fact, the applications give every opportunity for output to be validated by humans, but (in a second paradox) this may be getting lost in the marketing literature’s emphasis on the science.
The second of these points, judicial acceptability, is adequately covered in my Judge Peck article. In this article, I will focus on the lawyers’ own confidence in these applications as an aid to lawyerly judgement, not a substitute for it. To do that, I will look at the descriptions of what these applications do rather than on the science behind them, supporting the relevant parts of the Applied Discovery and KPMG materials and by extracts from those of other products with similar functionality.
First, however, it is worth saying a few words about terminology, not so much to define the labels as to pick the generic purpose out from the proprietary or product-specific names given to this kind of functionality. They all work slightly differently, and I defeat my object of simplicity if I qualify my deliberately broad descriptions with any attempt to describe the distinguishing features of each of them. Read the rest of this entry »
Posted by Chris Dale


