November 7, 2007TREC 2007 Overview of the TREC 2007 Legal Track Stephen Tomlinson Douglas W. Oard Jason R. Baron Paul Thompson.

Slides:



Advertisements
Similar presentations
© 2008 Oracle Corporation – Proprietary and Confidential.
Advertisements

Due Diligence for Directors Martin Elliott Kovnats Jeffrey Kyle Merk.
Assurance Services Independent professional services that “improve the quality of information, or its context, for decision makers” Assurance service encompass.
Chapter 20 Legal Liability McGraw-Hill/IrwinCopyright © 2012 by The McGraw-Hill Companies, Inc. All rights reserved.
Evidence 2/8/2014 Evidence 1 Evidence What is it? Where to find it?
Basel Convention Secretariat United Nations Environmental Programme ___________________________________ Key Elements of the Protocol Laura Thompson Legal.
Foreign Air Operator Validation & Surveillance
New Jersey Election Law Enforcement Commission Pay-to-Play January 2013.
Threshold System Presented by Jan Stanley, State Title I Director Office of Assessment and Accountability Fall Title I Directors Conference October 23-25,
Michelle L. Doyle For Catapult Learning 1.  What is IDEA?  Who is eligible?  How do they get identified?  How do they get services? ◦ Who pays? ◦
A GIA is a contract between a surety company and a contractor (or subcontractor)/principal. A GIA is a standard, typical document in the construction.
Webinar: June 6, :00am – 11:30am EDT The Community Eligibility Option.
Ch. 36, Promotion Is Communication
© 2007 Prentice Hall, Business Law, sixth edition, Henry R. Cheeseman Chapter 16: Remedies for Breach of Traditional and Online Contracts.
Copyright © 2004 by Prentice-Hall. All rights reserved. PowerPoint Slides to Accompany BUSINESS LAW E-Commerce and Digital Law International Law and Ethics.
1 Legal Principles of Insurance Contracts. Requirements of a Valid Insurance Contract 2 Legality Capacity Offer and Acceptance Consideration contracts.
© 2012 McNees Wallace & Nurick LLC CONTRACT ESSENTIALS Diane M. Tokarsky Chair, Construction Law 100 Pine Street, PO Box 1166 Harrisburg, PA
Chapter 17 Completing the Audit Engagement McGraw-Hill/IrwinCopyright © 2012 by The McGraw-Hill Companies, Inc. All rights reserved.
TOPIC 7: SHAREHOLDERS’ RIGHTS AND REMEDIES….contd
McGraw-Hill/Irwin Copyright © 2013 by The McGraw-Hill Companies, Inc. All rights reserved.
Basics of Insurance Law PLI: Bridge the Gap II Robert H. Friedman May 26, 2005 Robert H. Friedman May 26,
NC State University Office of General Counsel April 2010.
Chapter 7 Review Economics.
Minimum Necessary Standard Version 1.0
Considerations for Records and Information Management Programs in Light of the Pension Committee and Rimkus Consulting 2010 Decisions.
International Accounting Standard 33
Copyright © 2004 by Prentice-Hall. All rights reserved. © 2007 Prentice Hall, Business Law, sixth edition, Henry R. Cheeseman Chapter 6 Strict Liability.
Experiments with the Negotiated Boolean Queries of the TREC 2007 Legal Discovery Track Stephen Tomlinson Open Text Corporation 2007 Nov 8.
October 6, 2008iSchool Colloquium No, Not That PMI: Creating Search Technology for E-Discovery Jason Baron, 1,4 Douglas W. Oard, 1,3 Tamer Elsayed 2,3.
Q3 LAW NOTES 1 TORTS.
© 2013 Sri U-Thong Limited. All rights reserved. This presentation has been prepared by Sri U-Thong Limited and its holding company (collectively, “Sri.
Q UINCY COLLEGE Paralegal Studies Program Paralegal Studies Program Litigation & Procedure Introduction To Litigation Litigation & Procedure Introduction.
Confidentiality and HIPAA
Chapter 18: Torts A Civil Wrong
Law I Chapter 18.
Tort Law Part 2 Negligence and Liability. Negligence Most common tort Accidental or Unintentional Tort Failure to show a degree of care that a “reasonable”
Chapter 18 Torts.
“In the vast area of legal jurisprudence, there are undoubtedly many instances where being the first, or only, jurisdiction to grant rights to persons.
The Special Investigations Unit Act Presentation to Ad-hoc Committee on the President’s response to the Public Protector.
©2003 Prentice Hall Business Publishing, Auditing and Assurance Services 9/e, Arens/Elder/Beasley Legal Liability Chapter 5.
Planning for the TREC 2008 Legal Track Douglas Oard Stephen Tomlinson Jason Baron.
Interactive Task of the TREC Legal Track: Theory meets Practice Douglas W. Oard College of Information Studies and Institute for Advanced Computer Studies.
Chapter 7.1 – An introduction to civil law
 A body of rights, obligations, and remedies that is applied by courts in civil proceedings to provide relief for persons who have suffered harm from.
4Chapter SECTION OPENER / CLOSER: INSERT BOOK COVER ART Intentional Torts Section 4.1.
HIPAA Trading Partners, Legal Relationships October 2, 2001 presented by Peter B. Goldstein, Esq. Cap Gemini Ernst & Young, US LLC.
Liability of Environmental Health Specialists Aimee Wall County Attorneys’ Winter Conference February 11, 2006.
© 2009 Foley Hoag LLP. All Rights Reserved.Presentation Title Final Massachusetts Pharmaceutical and Medical Device Regulations Penalties and Enforcement.
Chapter 5 Torts and Civil Law.
Part 2 – The Law of Torts Chapter 5 – Negligence and Unintentional Torts Prepared by Michael Bozzo, Mohawk College © 2015 McGraw-Hill Ryerson Limited 5-1.
Woods et al., Professional Front Office Management © 2007 Pearson Education, Upper Saddle River, NJ All Rights Reserved. 1 Front Office Manager.
Chapter 04 Legal Liability of CPAs McGraw-Hill/IrwinCopyright © 2014 by The McGraw-Hill Companies, Inc. All rights reserved.
Traffic Control & Tort Liability
McGraw-Hill/Irwin Copyright © 2011 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 22 Criminal Law and Procedure in Business.
 Development of Strict Liability.  Defendant’s liability for strict liability is without regard to: Fault, Foreseeability, Standard of Care or Causation.
Chapter 09 Negligence and Strict Liability Copyright © 2012 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin.
LAW for Business and Personal Use © 2012 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible.
The Role of the Courts.
TORTS: A Civil Wrong. Fairplay.org What is a Tort? A civil wrong A breach of some obligation Causing harm or injury to someone –Negligence –Libel Plaintiff.
Introduced some basic knowledge of the contract First, what is the contract? Contract, also known as contract. China's definition of the contract, the.
Complex legal requirements in electronic discovery search and retrieval: A NARA case study NSF Collaborative Expedition Workshop # 45 Advancing Information.
Published by Flat World Knowledge, Inc. © 2014 by Flat World Knowledge, Inc. All rights reserved. Your use of this work is subject to the License Agreement.
4Chapter SECTION OPENER / CLOSER: INSERT BOOK COVER ART Intentional Torts Section 4.1.
CHAPTER 18 PART I Torts: A Civil Wrong. A Civil Wrong In criminal law, when someone commits a wrong, we call it a crime. In civil law, when someone commits.
McGraw-Hill/Irwin ©2008 The McGraw-Hill Companies, All Rights Reserved Chapter 20 Legal Liability.
Negligence Tort law establishes standards for the care that people must show to one another. Negligence is the conduct that falls below this standard.
Law-Related Ch Notes I. Torts: 1. A tort is a civil wrong.
Chapter 13: Product Liability
CHAPTER 9 THE CORPORATE ORGANIZATION © 2013 Delmar Cengage Learning.
© 2013 Sri U-Thong Limited. All rights reserved
Presentation transcript:

November 7, 2007TREC 2007 Overview of the TREC 2007 Legal Track Stephen Tomlinson Douglas W. Oard Jason R. Baron Paul Thompson

Since TREC 2006 … 12/1/06: Amended Federal Rules of Civil Procedure go into effect, expressly allowing lawsuits to include in their discovery phase requests to produce “electronically stored information” or ESI. 6/1/07: First published legal opinion in U.S. discussing difference between “keyword” and “concept” searching. Disability Rights Council of Greater Washington, et al. v. Washington Metropolitan Transit Authority, 242 F.R.D. 139 (D.D.C. 2007)

Since TREC 2006 … 6/4/07 Workshop held on Supporting Search and Sensemaking For Electronically Stored Information in Discovery Proceedings (“DESI Workshop”), Eleventh International Conference on Artificial Intelligence and Law, Palo Alto, 8/7/07 Issuance of Sedona Best Practices Commentary on the Use of Search and Information Retrieval Methods in E-Discovery (August 2007 public draft),

Documents Same collection of 6,910,912 documents as 2006 –IIT Complex Document Information Processing (CDIP) test collection Docs from the tobacco Master Settlement Agreement (MSA) –Attorneys General of several US states settled lawsuits against 7 US tobacco organizations (5 tobacco companies, 2 research institutes). MSA required them to make public all documents produced in: –Discovery proceedings in the lawsuits by the states –A number of other smoking and health-related lawsuits. Organizations were required to provide: –Scanned documents –Metadata Assembled by UCSF and IIT –Including OCR for about half the scanned documents

Example Document Title: CIGNA WELL-BEING NEWSLETTER - FUTURE STRATEGY Organization Authors: PMUSA, PHILIP MORRIS USA Person Authors: HALLE, L Document Date: Document Type: MEMO, MEMORANDUM Bates Number: /9377 Page Count: 2 Collection: Philip Morris Philip Moxx's. U.S.A. x.dr~am~c. cvrrespoaa.aa Benffrts Departmext Rieh>pwna, Yfe&ia Ta: Dishlbutfon Data aday 90,1997. From: Lisa Fislla Sabj.csr CIGNA WeWedng Newsbttsr - Yntsre StratsU During our last CIGNA Aatfoa Plan meadng, tlu iasuo of wLetSae to i0op per'Irw+ng artieles aod discontinue mndia6 CIGNA Well-Being aawslener to om employees was a msiter of disanision. I Imvm done somme reaearc>>, and wanted to pruedt you with my Sadings and pcdiminary recwmmeadatioa for PM's atratezy Ieprding l4aas aewelattee*. I believe.vayone'a input is valusble, and would epproolate hoarlng fmaa aaeh of you on whetlne you concur with my reeommendatioa … ScannedOCRMetadata

4 “Complaints” for 43 Topics Drafted by The Sedona Conference® lawyers: (1)Wrongful death and products liability action based on the use of a certain type of radioactive phosphates resulting in contaminated candy as well as in drinking water; (2)Patent infringement action on a device named “Suck out the Bad, Blow in the Good,” designed to ventilate smoke; (3)Shareholder class action suit alleging securities fraud and false advertising in connection with a fictional “Smoke Longer, Feel Younger” campaign relying on ‘60s-era folk music; (4)Fictional Justice Department antitrust investigation looking in to a planned merger and acquisition of a casualty and property insurance company by a tobacco company.

- 52 Please produce any and all documents that discuss the use or introduction of high-phosphate fertilizers (HPF) for the specific purpose of boosting crop yield in commercial agriculture. - (("high-phosphat! fertiliz!" OR hpf) OR ((phosphat! OR phosphorus) w/15 (fertiliz! OR soil))) AND (boost! OR increas! OR rais! OR augment! OR affect! OR effect! OR multipl! OR doubl! OR tripl! OR high! OR greater) AND (yield! OR output OR produc! OR crop OR crops) - "high-phosphate fertilizer!" AND (boost! w/5 "crop yield") AND (commercial w/5 agricultur!) (phosphat! OR hpf OR phosphorus OR fertiliz!) AND (yield! OR output OR produc! OR crop OR crops) A These requests require the production of all responsive documents within the sole or joint possession, custody or control of the Defendant, including their agents, departments, attorneys, directors, officers, employees, consultants, investigators, insurance companies, or other persons subject to Defendant's custody or control. 2. All documents that respond, in whole or in part, to any portion of these Requests must be produced in their entirety, including all attachments and enclosures. 3. For purposes of these requests, the words used are considered to have, or should be understood to have their ordinary, everyday meanings. Plaintiffs refer Defendant to any dictionary in the event that Defendant asserts that the wording of a request is vague, ambiguous, unintelligible, or confusing The words "and," "or," "each," "any," "all," "refer," and "discuss," shall be construed in their broadest form and the singular shall include the plural and the plural shall include the singular whenever necessary so as to bring within the scope of these Requests all documents (defined below) that might otherwise be construed to be outside their scope. 5. Solely for the purpose of the TREC 2007 legal track, the term "Defendant" shall include the named defendant companies in this complaint as well as all other companies whose records are found in the TREC collection database. 6. Solely for the purpose of the TREC 2007 legal track, "document" means all data, information or writings stored in the TREC legal database, including, without limitation: any written, electronic or computerized files, data or software; memoranda, s correspondence, OCR scanned images, communications, reports, summaries, studies, analyses, evaluations, notes or notebooks, indices, spreadsheets, logs, books, pamphlets, binders, calendar or diary entries, ledger entries, press clippings, graphs, tables, charts, printouts, drawings, maps, meeting minutes, and transcripts. The term document encompasses all metadata associated with the document. The term also includes all drafts associated with any particular document. The term is also intended to include all electronically stored information as the term is used in the Federal Rules of Civil Procedure, 7. The terms "relating to," "regarding," ‘discussing," or "concerning," shall be synonymous and should be taken to mean in whole or in part constituting, containing, concerning, discussing, describing, analyzing, identifying or stating. 8. The term "high-phosphate fertilizers" (HPF) shall refer to any high phosphate fertilizer, including, but not limited to calcium phosphate fertilizers and superphosphate fertilizers. In some instances, "high-phosphate" fertilizers will be subsumed in the definition of "phosphatic fertlizers." However, phosphatic fertilizers are a more general term for fertilizers containing phosphate and the phosphate concentration of various phosphatic fertilizers is likely to vary. 9. The term "Maleic Hydrazide" (MH) refers to a pesticide that is sprayed on sugar beets for the purpose of decreasing sugar loss in beet roots A July 1, 2007 U.S. DISTRICT COURT SOUTHERN DISTRICT OF GLADSHEIM MR & MRS. N. EINHERJAR, individually and on behalf of the Estate of DRIFA EINHERJAR, a minor, and the CITY AND COUNTY OF VALHALLA, a government entity. GULLINKAMBI CANDY CO., a Gladsheim corporation; VIKING SUGAR FARMS, a Gladsheim corporation; and U.S. BEET SUGAR ASSOCIATION, a nationwide association with local chapters in Gladsheim Plaintiffs Mr. and Mrs. N. Einherjar bring this action individually and on behalf of the estate of their deceased daughter Drifa Einherjar. These plaintiffs and the City and County of Valhalla (collectively referred to as "Plaintiffs") bring this action against Defendants Gullinkambi Candy Co. (GCC), Viking Sugar Farms (VSF), and the U.S. Beet Sugar Association (BSA) (hereinafter referred to collectively as "Defendants," or individually by their respective acronyms). This complaint seeks equitable and injunctive relief for the use of lethal substances in the production of VSF sugar, resulting in the death of a child and contamination of the Valhalla County groundwater. This complaint additionally seeks damages for strict products liability and failure to warn against GCC for the use of and failure to disclose lethal substances contained in its candy. Finally, this complaint seeks treble and punitive damages for fraud and conspiracy in violation of the Racketeer Influenced and Corrupt Organizations Act (RICO), 18 U.S.C. (sec) 1962 for Defendants' collective and organized concealment of lethal substances from Plaintiffs, resulting in the death of a child and massive contamination of Valhalla County's sole source of drinking water. - Plaintiffs, Mr. and Mrs. N. Einherjar, are residents of Valhalla, Gladsheim, and their deceased daughter, on whose behalf they are suing, was also a Valhalla resident. 2. Defendants GCC and VSF are both Gladsheim Corporations with principal places of business in Valhalla, Gladsheim. The U.S. Beet Sugar Association has local chapters in Valhalla, Gladsheim, and directs the actions of VSF. - All events giving rise to this incident took place in Valhalla, Gladsheim. Therefore, jurisdiction of this court is proper Defendant VSF uses high-phosphate fertilizers (HPF) (sometimes referenced as phosphate fertilizers) to increase the flavor of its sugar beets. HPF contains traces of radioactive elements that remain as a byproduct of phosphate extraction. Phosphate used in HPF is taken from a rock mineral called Apatite which also contains radioactive radium. The resulting Apatite powder therefore contains traces of radioactive elements that become incorporated into HPF. Studies have shown that health problems caused by HPF include immune disorders, toxic myopathy, chronic fatigue syndrome, liver dysfunctions, irregular heart-beat, reactive depression, and memory loss. In addition to using HPF, VSF sprays its sugar beets with Maleic Hydrazide (MH) to decrease the loss of sugar content in its sugar beet crop. MH has been shown to cause renal dysfunction in laboratory mice and to eventually lead to death. 4. In 1933, the U.S. Beet Sugar Association conspired with cane-growers in Hawaii to form a powerful sugar cartel that controlled Congress through a strong sugar lobby. Together, the American sugar growers united to create an underground sugar-trade brotherhood secretly referred to as "The Sugar Program." Members of the brotherhood contributed large sums of money to hire sugar-interest lobbyists who successfully brought about a series of favorable Sugar Acts beginning in 1934 and continuing to the present day. The Sugar Program brotherhood has also been successful in preventing Congress from regulating HPF or MH. 5. For the past five years, the BSA has served as elected leader of The Sugar Program, and has been given the responsibility for regulating the actions of the brotherhood members and for approving all major contracts and actions taken by members under its control. 6. Defendant GCC is a candy company that uses VSF sugar in all of its candy. As part of its contract with VSF, GCC agreed to conceal the levels of HPF and MH contained in VSF sugar from its consumers in exchange for an exclusivity provision and a discount on the wholesale price of its sugar. GCC therefore omitted warnings about HPF and MH from its candy labels. 7. As a result of Defendants' collective actions and omissions an eight-year old girl died from consuming a piece of GCC candy and the Valhalla community as a whole has been harmed by the contamination of their drinking water with HPF and MH. - FIRST CAUSE OF ACTION Wrongful Death 8. On March 23, 2007, decedent Drifa Einherjar (hereinafter "Decedent") purchased a piece of GCC candy for $0.67 from the GCC store on Main Street, Valhalla, Gladsheim. At the time of purchase, Decedent was not warned or informed of any dangers of eating the candy and there were no warnings on the candy wrapper or labels of the candy bag. 9. GCC knew that VSF used HPF and MH in its sugar production process. Despite this knowledge, GCC contractually agreed to conceal the presence of HPF and MH in its candy as a condition of its agreement with VSF, in exchange for a discount on its bulk sugar purchases. 10. As a direct and proximate result of these stated acts and omissions, Decedent consumed a piece of GCC candy containing HPF and MH, resulting in her death on March 24, Decedent ate the candy in a manner in which it was intended to be eaten, and received no instructions from any agents of GCC to exercise caution or to eat the candy in any other way. SECOND CAUSE OF ACTION Strict Tort Liability 11. The aforementioned candy and VSF sugar used as a primary ingredient in the candy were unreasonably dangerous to human health due to their high content of HPF and MH. 12. Defendants GCC and VSF knew of this health risk and notwithstanding that knowledge, concealed these dangers from the consuming public. 13. As a result of the HPF and MH contained in GCC candy, Decedent died within 24 hours of consuming a single piece of GCC candy. THIRD CAUSE OF ACTION Public Nuisance (Against Defendant VSF only) 14. Defendant VSF's method of sugar beet farming creates a public nuisance that unreasonably endangers the health of all Valhalla residents by contaminating their groundwater. 15. By continuing to use HPF and MH in its sugar beet production and by failing to use the standard method of limestone quicklime phosphate precipitation in the treatment of its waste-water, VSF continues to contaminate the groundwater and will continue to endanger the health of Valhalla residents. The harm to Valhalla residents will continue until an injunction is issued to stop the use of HPF and MH or to require implementation of the limestone quicklime wastewater treatment to minimize contamination. 16. As a direct and proximate cause of Defendant's acts and omissions, residents of Valhalla have unknowingly ingested harmful substances from their contaminated water supply. FOURTH CAUSE OF ACTION Failure to Warn 17. VSF, as a sugar beet farm that uses HPF and MH, had a duty to issue warnings to Plaintiffs and the general public about the presence of HPF and MH in its sugar and the corresponding health risks that these substances posed in groundwater or direct consumption. 18. Defendants VSF and GCC knew, or with the exercise of reasonable care, should have known that HPF contained radioactive substances and that MH added to the diet of mice, resulted in renal dysfunction and eventual death. Despite this knowledge, no information was offered to the Valhalla Community about the potential hazards of HPF, the lethal nature of MH used in VSF's sugar production, or the presence of HPF or MH in GCC candy. 19. At all times relevant to this litigation, Defendants VSF and GCC had actual and/or constructive knowledge of the dangers mentioned above. Despite this knowledge, VSF continued to operate its sugar beet plant with reckless disregard for the community around it by contaminating their groundwater and GCC continued to sell candy containing HPF and MH in reckless disregard for the life of children whom it targeted in its advertising campaigns and who therefore could be expected to purchase and consume GCC candy. 20. VSF breached its duty to warn the community about HPF and MH groundwater contamination and GCC breached its duty to warn consumers of the HPF and MH in its candy. 21. Defendant VSF's failure to warn has resulted in the contamination of Valhalla County's drinking water and the endangerment of the health of Valhalla residents. 22. GCC's failure to warn resulted in the death of a child and the illness of several others. FIFTH CAUSE OF ACTION Conspiracy and Fraud in Violation of the Racketeer Influenced and Corrupt Organizations Act (RICO), 18 U.S.C. (sec) 1962, and Request for Treble Damages. 23. Defendants VSF, GCC, and BSA engaged in a conspiracy to defraud by collectively agreeing to conceal the presence and adverse health effects of HPF and MH from the American public, the Valhalla community and Plaintiffs in particular. 24. In 1933, Defendants formed a sugar cartel secretly known as "The Sugar Program" which successfully lobbied Congress in passing favorable sugar laws and prevented the regulation of HPF and MH in commercial agriculture. 25. All three Defendants contributed financially to a lobbying fund aimed at fighting HPF and MH regulation and obtaining the passage of favorable "Sugar Acts." 26. For the past five years, the BSA has lead lobbying efforts and approved all actions of The Sugar Program brotherhood. 27. BSA spearheaded the movement to discourage written warnings about HPF and MH, and approved the VSF contract with GCC which provided for a reduction of GCC's wholesale sugar price, and a favorable exclusivity provision between VSF and GCC, under the condition that GCC refrain from publishing warnings about HPF and MH on its product labels. 28. As a result of this collective action to defraud the public, Plaintiffs have suffered injuries indicated above. Treble damages are therefore appropriate under RICO to punish the conspiratorial nature of Defendants' planned concealment of known health risks presented by HPF and MH from the Valhalla community and from Plaintiffs, resulting in the death of a child. SIXTH CAUSE OF ACTION Negligence 29. Defendant VSF had a duty to the Valhalla community and to Plaintiffs to refrain from contaminating their groundwater and to provide warnings about the known health hazards associated with HPF and MH which it used in the production of its sugar beets. 30. Defendant GCC had a duty to the Valhalla community and to Plaintiffs to disclose the known levels of HPF and MH in VSF sugar which it used as a primary ingredient in its candy. 31. Defendant BSA had a duty to compel members of the brotherhood under its control to require lawful disclosures of HPF and MH. 32. All Defendants breached their respective duties to the Valhalla community and to Plaintiffs. As a result, Plaintiffs have suffered damages indicated above. Punitive Damages 33. The conduct of Defendants described above is outrageous. Defendants' conduct demonstrates a reckless disregard for human life and a conscious disregard for public safety. The acts and omissions described above were willful and performed with actual or implied malice. Punitive and exemplary damages are therefore appropriate and should be imposed in this instance. - WHEREFORE, Plaintiffs respectfully pray for a judgment against Defendants for: 1. Injunctive and equitable relief as the Court deems appropriate including: i) Requiring Defendant VSF to test and to monitor the water near its sugar plant; ii) Requiring Defendant VSF to use the quicklime limestone method for processing wastewater to minimize phosphate contamination of Valhalla groundwater, if it is permitted to continue operation of its plant and to continue use of HPF and MH in its sugar beet production; iii) Compelling Defendant VSF to remove existing HPF from the groundwater by any means necessary; and 2. Compensatory damages to be paid by all Defendants, according to proof at trial; 3. Punitive damages as the court deems appropriate; 4. Costs and attorneys fees of this lawsuit, with interest; 5. Any other relief as the court deems appropriate. Topic 52 (Long Form)

- 52 Please produce any and all documents that discuss the use or introduction of high-phosphate fertilizers (HPF) for the specific purpose of boosting crop yield in commercial agriculture. - (("high-phosphat! fertiliz!" OR hpf) OR ((phosphat! OR phosphorus) w/15 (fertiliz! OR soil))) AND (boost! OR increas! OR rais! OR augment! OR affect! OR effect! OR multipl! OR doubl! OR tripl! OR high! OR greater) AND (yield! OR output OR produc! OR crop OR crops) - "high-phosphate fertilizer!" AND (boost! w/5 "crop yield") AND (commercial w/5 agricultur!) (phosphat! OR hpf OR phosphorus OR fertiliz!) AND (yield! OR output OR produc! OR crop OR crops) A-1 Topic 52 (Short Form) Free Text Baseline Boolean Baseline

“Relevancy” Assessors Call for participation to law schools nationwide Enthusiastic response due to law school requirement for pro bono service hours 42 law school student volunteers from: Loyola-L.A. (23); University of Indiana-Indianapolis (5), George Washington (3), Case Western Reserve (3), Loyola-New Orleans (2), Boston University (2), University of Dayton (2), University of Maryland (1), University of Texas (1) + 1 DOJ attorney, 1 NARA archivist

Ad Hoc Task Participants Carnegie Mellon U Dartmouth College Fudan University Open Text Corporation Sabir Research, Inc. U of Amsterdam U of Iowa/Eichmann U of Iowa/Srinivasian U of Mass, Amherst U of Missouri, KC U of Waterloo Ursinus College

Reference Boolean Run Results of final negotiated Boolean query were available to the participants this year –refL07B run B was between 100 and 25,000 for all topics this year –“B” is the number of documents matching the final negotiated Boolean query Estimated was main measure

Background on Estimating Recall TREC 2006 findings (last year): Legal Track (sampling experiments) –Estimated <18% of relevant documents assessed –Marginal precision 4% at depth 9000 Terabyte Track –“inferred average precision” (infAP) based on random samples of 200 documents from (up to) depth-1252 pools –Good correlation with MAP from depth-50 pooling L07 Method (this year): Adapted infAP approach to support deeper pooling (depth ) by using non-uniform probabilities

Ad Hoc Pooling Overview 68 runs submitted by 12 groups –Max 25,000 documents submitted per topic Pool sizes (before sampling) ranged from 195,688 (topic 76) to 476,252 (topic 84) –added 100 random documents from the 6.5 million unsubmitted documents per topic (randomL07 run) ~500 documents judged per topic Measures estimated from samples

Which 500 Documents to Assess? Let hiRank(d) be the highest rank at which any system retrieved document d. Let p(d) be the probability of choosing document d: If hiRank(d) <= 5 p(d) = 1.0 else if (hiRank(d) <= B) p(d) = min(1.0, ((5/B) + (C/hiRank(d)))) else p(d) = min(1.0, ((5/25000) + (C/hiRank(d)))) C chosen so that p(d) summed to 500 for all documents d in the pool (0.34 <= C <= 2.42)

Extra Assessment Bins Bin 1 (required) –500 documents –Completed by 43 of 50 assessors Bins 2 through 6 (optional) –100 documents each Set C so that p(d) sum to 1,000 Draw 1000 from the pool Set C so p(d) sum to 900 Draw 900 from original 1,000, leftover are “bin 6”, etc. –8 of 43 assessors did at least 1 optional bin 5 assessors completed all 5 optional bins

Estimating Recall at B (or 25000) estRecall(S) = estRel(S) / estRel(pool) estPrecision(S) = estRel(S) / (estRel(S) + estNonrel(S))

Estimated # of Rel Docs in Pool Mean per Topic: Relevant: 16,904 Non-rel.: 298,678 Gray: 4,303 Topic 71 (bromhidrosis): Relevant: 77,467 Topic 63 (sugar contract): Relevant: 18

Boolean Run Estimated Recall Mean 0.22 Boolean run missed 78% of the relevant documents (on average per topic) Topic 84 (1960’s films) Topic 77 (smoke NOT tobacco) 0%

Failure Analysis Topic 74: “All scientific studies expressly referencing health effects tied to indoor air quality.” Boolean Query (77% precision, 22% recall): (scien! OR stud! OR research) AND ("air quality" w/15 health) Passages in Missed Relevant Documents: … Lowrey A.H. (1980). Indoor air pollution … … assessment … entitled “Respiratory Health Effects of Passive Smoking” … … study … funded by the Center for Indoor Air Research …

Median vs. Boolean Median won 8 of 43 Boolean won 31 of 43 (4 tied) Topic 99: 0.31 vs (natural disasters) Topic 58: 0.07 vs (phosphates and health) Boolean run had higher mean than all submitted runs. Boolean Better Median Better

Median vs. Boolean Median won 33 of 43 Boolean won 9 of 43 (1 tied) Topic 60: 0.91 vs (phosphate precip.) Topic 58: 0.09 vs (phosphates and health) Highest mean 47% (wat1fuse run). Boolean Better Median Better

Marginal Precision by Depth Band Depths : median P=18% Depths : median P=13% Depths : median P=11% Depths : median P=10% Depths : median P=10% 3 of 446 (0.7%) of random (unsubmitted) documents were judged relevant –another 50,000 relevant documents per topic?

Median “Run” Marginal Precision (Depths 20,001-25,000, by Topic) only 6 of 43 topics Marg. Prec. > 10% Topic 69: MP = 100% (indoor smoke vent.) Topic 74: MP = 46% (indoor air quality) Topic 71: MP = 21% (bromhidrosis)

Relevance Feedback Task Re-used 10 topics from last year (w/new judging) –Good kappa and ≥ 50 relevant in 2006, and “interesting” Residual evaluation –Documents judged in 2006 were discarded for 2007 RF Three participating groups: Carnegie Mellon U Open Text Corporation Sabir Research, Inc. 8 submitted runs –Only 5 used feedback from 2006 relevance assessments

Feedback vs. Boolean resid ) Feedback won 7 of 10 Boolean won 3 of 10 (0 tied) Topic 45: 0.94 vs (pigeon deaths) Topic 51: 0.24 vs (memory loss) (“Feedback” is median of 5 feedback runs.)

Interactive Task Exploratory task, new in 2007 –Goal: Find as many relevant docs as possible –Some RF topics from 2006 (all teams ≥3 topics) –All runs completely assessed (max depth 100) Documents judged in 2006 were re-judged in 2007 Three participating teams –Long Island U: 1 team, 3 searchers/topic, manual queries –Sabir Research: Multi-iteration RF, ~automatic queries –U Washington: 6 teams, 1 searcher/topic, manual queries Utility function:

Interactive Task Results

Interactive Task Lessons Learned Excellent inter-annotator agreement –103/135 from top-scoring team judged relevant More time  Better results –115 hours to find those 135 documents (!) A bit more standardization would be helpful –Particularly submission format guidelines

Looking Back Unique test collection –7 million documents with OCR and metadata –83 rich topics (Boolean, free text, context) –Recall-oriented evaluation measure Moderately robust research community –16 research teams from 4 countries –Attracting attention (and investment) in the law

Some Open Questions Test collection reusability –Unbiased estimates? Tight error bars? Why can’t we beat Boolean??? –Different strategies? Detailed failure analysis? Is OCR masking effects we need to see? –Is it time for a new collection? –Must it be de-duped? Is metadata needed? Can we improve topic formulation? –Structured relevance feedback? What should the interactive task measure?

Plans for 2008 New collection for Ad Hoc Task? >250,000 Enron s 550,000 State Department cables ( ) Continue Interactive and RF tasks Using the 2006/2007 (tobacco) collection

State Department Cables