Presented by Rebecca Shwayri. Introduction to Predictive Coding and its benefits How can records managers use Predictive Coding Predictive Coding in Action.

Slides:



Advertisements
Similar presentations
E-Discovery and Predictive Coding A Conversation With In-House and Outside Counsel.
Advertisements

Wiginton v. CB Richard Ellis, Inc.
Litigation Holds: Don’t Live in Fear of Spoliation Jason CISO – University of Connecticut October 30, 2014 Information Security Office.
Technology Assisted Review: Trick or Treat? Ralph Losey, Esq., Jackson Lewis 1.
© 2007 Morrison & Foerster LLP All Rights Reserved Attorney Advertising The Global Law Firm for Israeli Companies Dispute Resolution in the United States.
Privileged & Confidential Ready or Not, Here it Comes…. Why It’s Likely that You’ll be Employing Predictive Coding Technology In 2013 Texas Lawyer In-House.
Cache La Poudre Feeds, LLC v. Land O’Lakes, Inc.  Motion Hearing before a Magistrate Judge in Federal Court  District of Colorado  Decided in 2007.
Litigation and Alternatives for Settling Civil Disputes CHAPTER FIVE.
Strategies for Preserving the Attorney-Client Privilege in the World of Electronic Discovery Beth Rose Ford Motor Company.
Ethical Issues in the Electronic Age Ethical Issues in the Electronic Age Frost Brown Todd LLC Seminar May 24, 2007 Frost Brown.
A PROACTIVE APPROACH TO E-DISCOVERY March 4, 2009 Presented to the Corporate Counsel Section of the Tarrant County Bar Association Carl C. Butzer Jackson.
Intellectual Property Section Lunch & CLE July 10 Intellectual Property Section Lunch & CLE July 10 "What Can Be Done About the High Costs of IP Litigation."
5 Vital Components of Every Custodian Interview David Meadows, PMP, Managing Director – Discovery Consulting, Kroll Ontrack Dave Canfield, EJD, Managing.
E-Discovery for System Administrators Russell M. Shumway.
Project Planning and Management in E-Discovery DAVID A. ELLIS – MAYER BROWN BROWNING E. MAREAN – DLA PIPER.
Privilege, Privacy, and Waiver. Privilege Attorney/Client In the law of evidence, a client's privilege to refuse to disclose, and to prevent any other.
The Real Cost of Privilege Review Patrick Oot, Esq. - Verizon, Director of Electronic Discovery and Senior Litigation Counsel Anne E. Kershaw, Esq. – A.
1 Best Practices in Legal Holds Effectively Managing the e-Discovery Process and Associated Costs.
Avoiding Sanctions & Surprises The ethics of discovery Kat Meyer, Esq. President of Conquest eDiscovery, LLC.
Heartland Surgical Specialty Hospital, LLC v. Midwest Division, Inc. (D. Kan. Apr. 9, 2007) Andrew S. Lo E-Discovery 10/6/09.
The E-Discovery Games: A Closer Look at Technology Assisted Document Review David D. Lewis, Ph.D., Information Retrieval Consultant Kara M. Kirkeby, Esq.,
Technology-Assisted Review Can be More Effective and More Efficient Than Exhaustive Manual Review Gordon V. Cormack University of Waterloo
INFO 624 Week 3 Retrieval System Evaluation
Career number 1. E- Discovery Professional  Electronic Discovery- Technology based Identifies, preserves and manages electronically stored information.
Finding Your Way around the Courthouse. Court Records Online – PACER - Public Access to Court Electronic Records – Case Management: CM/ECF – State Courts.
Search and Retrieval: Relevance and Evaluation Prof. Marti Hearst SIMS 202, Lecture 20.
EMPLOYEE USE OF COMPANY MONITORING & PRIVACY ISSUES.
Victor Stanley, Inc. v. Creative Pipe, Inc. 250 F.R.D. 251 (D. Md. 2008)
Page 1 Records Management – 911 Case Study on Information Retention and Retrievability Rachel Verdugo March 23, 2010 Williamsburg, VA.
©2013 Foley & Lardner LLP 1 Discovery About Discovery Joanne Lee September 11, 2013.
TAR, CAR, Predictive Coding, Integrated Analytics What Does It All Mean? Huron Legal provides advisory and business services to assist law departments.
Civil Law Resolutions to disputes between people..
E-Discovery: Not Grandfather’s Litigation Finding Your Client’s Narrative.
January 20, 2007© 2007 The Prinz Law Office.1 HOTTEST TOPICS IN CYBERSPACE: CYBERINSURANCE, BLOGS, AND ON-LINE ADVERTISING By Kristie D. Prinz, Founder.
Discovery III Expert Witness Disclosure And Discovery Motions & Sanctions.
Rule 17 Originally promulgated in 1992, amended in 1996 to take effect in 1997 Subject to local adoption Jackson County adopted as Rule 25.
2009 CHANGES IN CALIFORNIA DISCOVERY RULES The California Electronic Discovery Act Batya Swenson E-discovery Task Force
Against: The Liberal Definition and use of Litigation Holds Team 9.
P RINCIPLES 1-7 FOR E LECTRONIC D OCUMENT P RODUCTION Maryanne Post.
The Challenge of Rule 26(f) Magistrate Judge Craig B. Shaffer July 15, 2011.
Material Covered in Assignment 4-1: The Attorney-Client Privilege A. Rationale for the Attorney-Client Privilege (p. 318) B. Criteria for Attorney-Client.
Cache La Poudre Feeds, LLC v. Land O’Lakes, Inc. 224 F.R.D. 614 (D. Colo. 2007) By: Sara Alsaleh Case starts on page 136 of the book!
Chapter 16.1 Civil Cases. Types of Civil Lawsuits In civil cases the plaintiff – the party bringing the lawsuit – claims to have suffered a loss and usually.
Conducting Modern Investigations Analytics & Predictive Coding for Investigations & Regulatory Matters.
The Risks of Waiver and the Costs of Pre- Production Privilege Review of Electronic Data 232 F.R.D. 228 (D. Md. 2005) Magistrate Judge, Grimm.
Defensible Records Retention and Preservation Linda Starek-McKinley Director, Records and Information Management Edward Jones
SEM A - Promotion PE – Develop content for use in marketing communications to create interest in product/business/idea PI – Write marketing.
Print your Name Print your Street Address Print your City, State, and Zip Code Self-Represented Print Courthouse Street Address Print Courthouse City,
Records Management for Paper and ESI Document Retention Policies addressing creation, management and disposition Minimize the risk and exposure Information.
Is Your Background Check Process Compliant?. 2 © Copyright 2015 ADP, LLC. Proprietary and Confidential Information. Agenda Privileged & Confidential.
Emerging Case Law and Recent eDiscovery Decisions.
Defensible Quality Control for E-Discovery Geoff Black and Albert Barsocchini.
© 2014, TERIS, Applying Analytics: Effective & Efficient Data Management in Litigation and Beyond.
Chapter. 3: Retrieval Evaluation 1/2/2016Dr. Almetwally Mostafa 1.
DISTRIBUTED INFORMATION RETRIEVAL Lee Won Hee.
Title of Presentation Technology and the Attorney-Client Relationship: Risks and Opportunities Jay Glunt, Ogletree DeakinsJohn Unice, Covestro LLC Jennifer.
Heartland Surgical Specialty Hospital, LLC v. Midwest Division, Inc 2007 WL (D. Kan. Apr. 9, 2007)
Residential Funding Corp. v. DeGeorge Financial Corp., 306 F.3d 99 (2d. Cir. 2002).
Purpose and Operation.   Pre-trial procedures – procedures taken before a trial and may result in the dispute being settled  If the dispute is not.
Electronic Discovery Guidelines FRCP 26(f) mandates that parties “meaningfully meet and confer” to consider the nature of their respective claims and defenses.
#16PACE Preparing For The Inevitable... How To Be Ready When The Lawsuit Comes And Steps To Proactively Limit Corporate Inconvenience And Liability Mitchell.
Major Issues n Information is mostly online n Information is increasing available in full-text (full-content) n There is an explosion in the amount of.
Data Minimization Framework
6/22/2018 2:09 PM BRK3102 How Microsoft Legal drives down eDiscovery costs with machine learning in Office 365 Rachi Messing Senior Program Manager, O365.
Speakers: Ian Campbell, Claire Hass,
Bennett B. Borden Conor R. Crowley Wendy Butler Curtis
An Introduction to Public Records Office of the General Counsel
Litigation Holds: Don’t Live in Fear of Spoliation
The Team Players and Play in a Complex Document Review Project: Past, Present and Future Ralph C. Losey, Jackson Lewis Principal and National e-Discovery.
Electronic Discovery Sabrina Jones 4/14/2011.
Presentation transcript:

Presented by Rebecca Shwayri

Introduction to Predictive Coding and its benefits How can records managers use Predictive Coding Predictive Coding in Action Limitations of keyword searches & human review Predictive Coding Defensibility

What is predictive coding? How does it work?

NOT Magic NOT a cure for cancer NOT based on voodoo

Keyword searching Concept searching threading These methods can be useful but do not predict relevance of future documents based on past documents

Expert (you) develops an understanding of the documents and classifies the documents Old tech In common use today Example: Spam Filter, Amazon.com Math and Statistics

Algorithms Mathematical model built Accuracy depends on quality of training set

Random Sample Single person reviews & codes the Sample Non- Responsive Responsive Computer learns & predicts Computer categorizes all remaining documents ResponsiveNon-Responsive Repeat as needed Review randomly selected documents One person’s time for hours Predictive Coding in Practice

Dramatic Reduction in e-discovery costs More accurate than human review and keyword search Light years faster than human review and keyword search

Fact driven, not fear driven, settlements Learn the facts of the case in a few days rather than over months or years using traditional methods of review Helps avoid litigation – uncovers the facts more quickly Use as an information governance tool

MethodRecall RatioCostSpeed Keywords20 percentHigh $$$Slow - Misses content Human Review60 percentVery High $$$$60 docs / hr Predictive Coding75-98 percentLow $>80-250x faster

Information Governance Tool (proactive) Litigation Tool (reactive)

Encompasses a variety of disciplines Records Management Knowledge Management Information Security and Privacy

Data breach risks E-discovery costs Unable to locate documents needed for the business units

Standardized IG policies Reduce the need to review every single document to determine the importance of the document to the company Locate data within the company’s IT infrastructure and categorize it appropriately for the business units Locate data that needs to be destroyed

Example: Company is sued in a dispute involving fraud and breach of contract Custodians: 20 Potential Custodians with average e- mail box of 40 GB each (800 total GB of data) Other electronic Files: 200 GB Total Data: 1 Terabyte

Company is served with a Request for Production of Documents by Plaintiffs’ Counsel Plaintiffs’ Counsel demands searching through ESI of custodians Plaintiffs’ Counsel makes a broad demand for accounting records

What do you do? Keyword search 1TB of data? How do you keyword search fraud? Information disadvantage! Human review? It will take many, many months and millions of dollars to review 1TB of data!

Use Predictive Coding Should you disclose? One school of thought suggests disclosing use of predictive coding to opposing counsel, agreeing to precision and recall rates (Full Agreement and Full Disclosure) The other school of thought suggests making no disclosures (Avoid litigation associated with use of predictive coding)

Recall (Completeness) Recall measures how successful the system was in finding all of the responsive documents. If 1,000 documents in the full set were actually responsive, but the system only marked 750 of those documents responsive, then the recall would be 75 percent. Precision (Accuracy) Precision measures how often the documents that were marked responsive were actually responsive. If the system marked 10 documents responsive, and only six of them were actually responsive, then the precision would be 60 percent.

Depends on collection “richness” 2-5 days – one person & one only! documents reviewed Stop when system exhibits: High rates of Precision & Recall – above the agreed to rates No longer discovering new topics to teach the computer about Computer is predicting with consistency

It is like Exit Polling…. Statistics Truth: Sample of a certain size yields a certain level of confidence and a certain margin of error. 400 randomly selected docs provides 95% confidence level in the estimate of Predictive Coding accuracy, with a ± 5% margin of error. Reference: Cochran, WG Sampling Techniques, 3 rd Ed. John Wiley & Sons, New York, New York, USA.

When you are out of time If you want to save money Consider using CAR for cases involving 5 GB or more of data Predictive coding makes sense when you have 20,000 documents or more

Judge Facciola (D.DC): “If you are practicing e-discovery without a clawback, you are committing malpractice.” Parties agree in writing that inadvertent production of privileged material does not automatically constitute a waiver

What if the other side won’t agree to the clawback agreement? Go to the Court! Rajala v. McGuire Woods, 2010 WL (D. Kan. July 22, 2010): Court issued clawback order with no need to show reasonable efforts

Consider Clawback Agreement during “meet and confer” conference Embody agreement in Court Order (Rule 502(d))

Predictive coding should be used to cull down data set to a manageable level This should occur AFTER predictive coding Attorneys should conduct privilege review Attorneys need to decide what is privileged: Do not put this on auto-pilot

Why Linear Review is Ineffective Linear Review compared to other methods

Catches only 20 percent of relevant evidence Therefore…misses 80 percent The “Google” phenomenon Limitations of Keywords

Failure of imagination (Example: Nasdaq versus Stock Market) How many synonyms for the word “think”? Precise Terms of Art Misspellings (Example: Mangment, Mangemnt…) Problems With Keywords

Human problem People express concepts differently Difficulties in learning to adopt another party’s language style TREC (Text Retrieval Conference) was a competition and it showed a complete failure in keyword searches

Human keyword based review is expensive It is slow & inaccurate It unnecessarily complicates a simple process Is widely used as until now, there were no alternatives Predictive coding – when “done right” – can save a corporation 80-90% of review costs.

Keyword searches missed 96 percent of relevant documents (recall ratio averaged less than 4 percent) TREC Legal Track Study 2009

97 percent of relevant documents not found Only a 3 percent recall ratio (76,373 relevant documents not discovered) Boolean searches reduced the initial corpus from 685,592 to 2,715 documents 87 percent precision ratio (2,362 documents out of 2,715 are relevant) TREC Legal Track Study 2010

Involved a San Francisco Bay Area Rapid Transit Accident Discovery database contained 40,000 documents and 350,000 pages Attorneys believed keyword searches uncovered 75 percent of relevant documents In reality: Only 20 percent of relevant documents uncovered Blair and Maron Study

Human eyeballs on every document Judge Peck: The “gold” standard does not have any gold Human assessors disagree on the relevance of a document to a single topic The “Gold” Standard

TREC Conclusion: 65% Recall and 65% Precision is best retrieval effectiveness for human reviewers Human eyeballs on every document is not working Reviewers disagree as frequently as 50 percent

Monique Da Silva Moore v. Publicis Groupe & MSL Group (SDNY) (endorsed using predictive coding) Complicated and confusing protocol – DO NOT USE Defendants offered plaintiffs everything they wanted – protocol was so confusing they could not see they got everything they ask for – so they went after the Judge. Global Aerospace, Inc. v. Landow Aviation Limited Partnership (Circuit Court of Loudoun County Virginia) (authorized use of predictive coding over objection) Nothing in news – as no controversy – everything worked!

Expensive Kleen case – 1400 attorney hours to determine search terms – and plaintiff was not satisfied – and neither was aware of overall effectiveness of terms Not effective Over or Under produces Known to be very problematic “Ostrich approach” is no longer advisable – technology has evolved Judges know it exists, plaintiffs know it exists and ask for it

EORHB, Inc., et. al. v. HOA Holdings, LLC (Delaware Chancery Court) Court ordered the parties sua sponte to use predictive coding and ordered the parties to use the same vendor Judge may have over stepped bounds

Technology is your friend Make data driven decisions We are living in the “MoneyBall” age If you are unsure, please ask – this is not going away

For more information contact Rebecca Shwayri Tel: (813)