Posts Tagged ‘Statistical methods in legal informatics’
May 17, 2013
Abstracts have been posted of papers presented at the Conference: The Many Faces of Contemporary Philosophy and Theory of Law, held 23-24 March 2013, at Jagellonian University, Cracow, Poland. The conference included a special working group on Bayesian analysis in law, abstracts of papers of which begin on page 6 of the abstracts volume and are excerpted below:
Dr Jeroen Keppens: Bayesian Perspectives on the Value of Evidence. Abstract:
Given the interdisciplinary audience, I would like to introduce the Bayesian approach to evidential reasoning in Law. Then I plan to move on the Bayesian modeling techniques and the various concerns and difficulties that arise from it.
Paweł Banaś and Krzysztof Kasparek: Some remarks about controversies concerning applying Bayes theorem to criminal policy-making. Abstract:
The following paper aims at summarizing a discussion concerning the exploitation of Baysesian analysis within criminal policy-making, namely problems with the so called postprison civil commitment of sex offenders as sexually violent predators (SVPs) employed currently in some of the US states. During this process it is determined whether a former convict will be “classified” as SVP. Typically, actuarial instruments are used in order to help decide on this issue. Recently, Richard Wollert has pointed out that exploitation of Bayesian theorem may prove useful in this type of cases when addressing at least some of the questions that may arise. However, his ideas were met with much criticism within risk-assessment community. In this paper we want to present main arguments of both sides of the debate and point to some of the possible problems with Bayesian analysis as used in forensic psychology.
Piotr Bystranowski: Czy da się nauczyć prawników statystyki? Sieci bayesowskie a unikanie błędów probabilistycznych w rozumowaniach prawniczych. Abstract:
Od lat siedemdziesiątych i czasów przełomowych eksperymentów Kahnemana i Tversky’ego powszechnym stało się przekonanie, iż ludzkie osądy w warunkach niepewności często dają rezultaty systematycznie i rażąco niezgodne z regułami matematycznego rachunku prawdopodobieństwa, w tym zwłaszcza z tzw. wzorem Bayesa. Od błędów tego rodzaju nie jest wolna sala sądowa. Przeciwnie – wyniki szeregu procesów karnych pokazują, że wymiar sprawiedliwości jest podatny na wiele błędów w rozumowaniach probabilistycznych (z tzw. złudzeniem prokuratora na czele). Ich skutkiem bywa, na przykład, przypisanie zbyt dużej pewności materiałowi dowodowemu, który z formalnego punktu widzenia zdaje się być dalece nierozstrzygający. Pociąga to za sobą pytanie, w jaki sposób rozwiązać ową ewidentną niezgodność mię-dzy intuicyjnymi rozumowaniami w warunkach niepewności a formalnymi metodami probabilistycznymi. [...] Tych mankamentów zdaje się unikać proponowana przez Normana Fentona i Martina Neila wizualizacja przy pomocy sieci bayesowskich. W ten sposób można modelować nawet najbardziej skomplikowany materiał dowodowy w sposób przejrzysty dla laika. Rola stron procesu ograniczałaby się tu do sprecyzowania prawdopodobieństw a priori i zależności między poszczególnymi zmiennymi, zaś zadanie skonstruowania architektury sieci pozostawiano by ekspertom. O prawomocności obliczeń dokonywanych „pod spodem” można by przekonać strony na prostych przykładach, z wykorzystaniem np. drzewek zdarzeniowych. Zatem zastosowanie sieci bayesowskich w procesie miałoby być, zdaniem Fentona i Neila, najprostszym sposobem uniknięcia błędów probabilistycznych bez konieczności podejmowania beznadziejnego zadania, jakim jest nauczenie prawników statystyki.
Bartosz Janik: Some remarks concerning Bayesian rationality in Law. Abstract:
This paper aims at providing some remarks concerning Bayesian decision theory (BDT) and rationality in the legal perspective. As a first point I would like to provide a philosophical account of rationality and I will try to, while focusing on most appropriate meaning of it, to judge it from a legal point of view. It will be clear that the general notion of legal rationality is very complicated and we must set some particular goals to achieve a more global perspective. In my paper, I will focus on legal reasoning and will try to adopt Rescher’s distinction of cognitive/practical/evaluative rationality for the purpose of this analysis. The main point of this part will be the evaluation, to what extent risk aversion is connected with rationality. The thesis will be formulated in the following manner: the mechanisms of risk avoidance could serve as local rationality–triggers (as to oppose skepticism in Rescher’s terminology and deal with imperfection of our cognitive resources). The second point will be the attempt to show the connection between Bayesian decision theory (which focuses on error minimizing and thus, risk avoidance) and rationality. I will introduce basic formalism of BDT and show how, on that basis, we could formulate the local rationality for legal decision making. Again, the central notion will be the risk and I will present formal mechanism of risk avoidance in BDT. The notion of rationality, as a risk optimizer, will be proposed for this local environment. The last point of the analysis will be the answer to the question to what extent we are free from legal–theoretic assumptions in formulations of rationality. It turns out that the choice of an underlying theory of law will always determine our global notion of rationality but in the local perspective we could formulate general tools and concepts.
Izabela Skoczeń: Why should a lawyer calculate the probability of implicature formation? Abstract:
This paper aims at providing examples of possible applications of methods for calculating the probability of implicature formation (with the use of the bayesian method) in legal situations. The basis for the present considerations will be the notion of scalar implicatures, based on the gricean approach to Pragmatics. Scalars are based on conventional meanings attributed to words with the use of lexical scales (Horn). Placing a word in a definite position in a scale enables the speakers to attribute it a definite meaning, that does not have to be consistent with the lexical meaning that would be understood with the use of classical logic. [...] As experiments have proven, in contexts with data deficit the probability of definite implicature formation is rather not intuitive. A quite striking example is the following situation: if while describing three objects, the speakers has information concerning the features of only two of them, the hearer seems more prone to infer, that the third item disposes of the same feature while hearing an utterance with the numeral “two”, rather than “some”. This surprising result seems most vital for lawyers, as it conveys a hidden pattern of linguistic manipulation. The conventional implicature that should be cancelled due to pragmatic reasons is so strong, that it still influences the meaning. Imagine, that we have three suspects A,B,C and we know that A and B were at the crime scene that day. We don’t know, whether C was at the crime scene. If the probability of omitting scalar implicature cancellation is higher when using expressions like some, rather than numerals, C’s defendant should rather say “Some of the suspects were at the crime scene.” rather than “Two were at the crime scene.”. The later formulation, according to Goodman and Stuhlm¨uller calculations, would boost the probability of the court inferring the implicature that C was also at the crime scene that day. This observation opens an entire new range of possibilities of manipulating implicature formation in contexts, where the hearer is aware of the speaker’s data being insufficient. It is often the in judicial environments, when the provided evidence is too scarce.
For full text of the papers, please contact the authors.
HT Bartosz Janik
Like this:
Like Loading...
Tags:Artificial intelligence and law, Bartosz Janik, Bayesian analysis in law, Bayesian inference in legal informatics, Bayesian statistics in legal informatics, Bayesian statistics in legal prediction, Bayesian statistics in quantitative legal prediction, Legal decision support systems, Legal reasoning, Modeling legal reasoning, Quantitative legal prediction, Statistical methods in legal informatics, Statistical methods in legal reasoning, The Many Faces of Contemporary Philosophy and Theory of Law
Posted in Abstracts, Applications, Articles and papers, Conference papers, Conference proceedings | Leave a Comment »
January 27, 2013
Dr. Joshua Tauberer has created a new bill prognosis methodology page for GovTrack, his U.S. federal open legislative data service.
The page includes (on right screen) three tabs of charts demonstrating output and functioning of the prognosis methodology. The third tab shows charts of precision vs. recall results, and is intended expressly “for machine learning researchers.”
HT @JoshData
Like this:
Like Loading...
Tags:GovTrack, Joshua Tauberer, Legislative information systems, Prediction in legal informatics, Prediction in legal information systems, Quantitative legal prediction, Statistical analysis of legislative data, Statistical methods in legal informatics, Statistical prediction of bill passage success, Statistical prediction of committee approval of a bill
Posted in Applications, Technology developments | Leave a Comment »
December 21, 2012
Simone Gittelson of l’Institut de Police Scientifique, Ecole des Sciences Criminelles, Université de Lausanne, and colleagues, have published Modeling the forensic two-trace problem with Bayesian networks, forthcoming in Artificial Intelligence and Law.
Here is the abstract:
The forensic two-trace problem is a perplexing inference problem introduced by Evett (J Forensic Sci Soc 27:375–381, 1987). Different possible ways of wording the competing pair of propositions (i.e., one proposition advanced by the prosecution and one proposition advanced by the defence) led to different quantifications of the value of the evidence (Meester and Sjerps in Biometrics 59:727–732, 2003). Here, we re-examine this scenario with the aim of clarifying the interrelationships that exist between the different solutions, and in this way, produce a global vision of the problem. We propose to investigate the different expressions for evaluating the value of the evidence by using a graphical approach, i.e. Bayesian networks, to model the rationale behind each of the proposed solutions and the assumptions made on the unknown parameters in this problem.
Like this:
Like Loading...
Tags:Alex Biedermann, Artificial intelligence and law, Bayesian analysis, Bayesian inference, Bayesian networks, Bayesian networks and legal evidence, Bayesian statistics and legal evidence, Forensic two-stain problem, Forensic two-trace problem, Franco Taroni, Legal evidence information systems, Silvia Bozza, Simone Gittelson, Statistical analysis of legal evidence, Statistical methods in criminal justice investigation, Statistical methods in forensic analysis, Statistical methods in legal evidentiary reasoning, Statistical methods in legal informatics, Two-stain problem, Two-trace problem
Posted in Applications, Articles and papers, Methodology | Leave a Comment »
December 19, 2012
Dr. Joshua Tauberer of GovTrack has posted Bill prognosis gets a few improvements, at the GovTrack Blog.
He writes:
…I’m adding three new factors to GovTrack’s analysis: whether the bill was introduced in the first 90 days of the Congress, whether it was introduced in the first year, and whether it was introduced in the last 90 days of the Congress. You can now see that last one in the factors for S. 3637, for example.
The post describes his decision to incorporate a component of the model described in: Yano et al.: Textual Predictors of Bill Survival in Congressional Committees.
For more details please see the complete post.
HT @JoshData
Like this:
Like Loading...
Tags:GovTrack, John D. Wilkerson, Joshua Tauberer, Legislative information systems, New Directions in Analyzing Text as Data, New Directions in Analyzing Text as Data 2012, Noah A. Smith, Prediction in legal informatics, Prediction in legal information systems, Quantitative legal prediction, Statistical analysis of legislative data, Statistical methods in legal informatics, Statistical prediction of bill passage success, Statistical prediction of committee approval of a bill, Tae Yano, Text as Data Conference, Text as Data Conference 2012
Posted in Applications, Technology developments | Leave a Comment »
December 14, 2012
Professor Dr. Daniel Martin Katz of the Michigan State University College of Law and the ReInvent Law Laboratory has published Quantitative Legal Prediction – or – How I Learned to Stop Worrying and Start Preparing for the Data Driven Future of the Legal Services Industry, forthcoming in Emory Law Journal.
Here is the abstract:
Do I Have a Case? What is Our Likely Exposure? How much is this Going to Cost? What will happen if we leave this particular provision out of this contract? How can we best staff this particular legal matter? These are core questions asked by sophisticated clients such as general counsels as well as consumers at the retail level. Whether generated by a mental model or a sophisticated algorithm, prediction is a core component of the guidance that lawyers offer. Indeed, it is by generating informed answers to these types of questions that many lawyers earn their respective wage.
Every single day lawyers and law firms are providing predictions to their clients regarding their prospects in litigation and the cost associated with its pursuit (defense). How are these predictions being generated? Precisely what data or model is being leveraged? Could a subset of these predictions be improved by access to outcome data in a large number of ‘similar’ cases. Simply put, the answer is yes. Quantitative legal prediction already plays a significant role in certain practice areas and this role is likely increase as greater access to appropriate legal data becomes available.
This article is dedicated to highlighting the coming age of Quantitative Legal Prediction with hopes that practicing lawyers, law students and law schools will take heed and prepare to survive (thrive) in this new ordering. Simply put, most lawyers, law schools and law students are going to have to do more to prepare for the data driven future of this industry. In other words, welcome to Law’s Information Revolution and yeah – there is going to be math on the exam.
Click here for slides from the presentation version of the paper, Quantitative Legal Prediction.
Like this:
Like Loading...
Tags:Artificial intelligence and law, Automatic creation of legal documents, Automatic creation of legal memoranda, Automatic creation of legal memos, Computation Legal Studies, Daniel Martin Katz, Disintermediation of lawyers, Emory Law Journal, Free access to law, Law.gov, PACER, Public access to legal information, Quantitative legal prediction, RECAP, Statistical methods in legal informatics, Statistical modeling of court decisions, Statistical modeling of legal information, Statistical modeling of similar court decisions, Statistical modeling of statutes
Posted in Applications, Articles and papers, Technology developments, Technology tools | Leave a Comment »
December 2, 2012
Tae Yano and Professor Dr. Noah A. Smith, both of Carnegie Mellon University Language Technologies Institute, and Professor Dr. John D. Wilkerson of the University of Washington Deaprtment of Political Science, presented a paper entitled Textual Predictors of Bill Survival in Congressional Committees, at New Directions in Analyzing Text as Data 2012, a conference held 5-6 October 2012 at the Harvard University Institute for Quantitative Social Science.
Here is the abstract:
A U.S. Congressional bill is a textual artifact that must pass through a series of hurdles to become a law. In this paper, we focus on one of the most precarious and least understood stages in a bill’s life: its consideration, behind closed doors, by a Congressional committee. We construct predictive models of whether a bill will survive committee, starting with a strong, novel baseline that uses features of the bill’s sponsor and the committee it is referred to. We augment the model with information from the contents of bills, comparing different hypotheses about how a committee decides a bill’s fate. These models give significant reductions in prediction error and highlight the importance of bill substance in explanations of policy-making and agenda-setting.
A notable technology related to this research is the new probability-of-bill-passage feature on Dr. Joshua Tauberer’s GovTrack service.
Click here for Dr. Tauberer’s comment on the Yano et al. paper.
An interesting discussion among academics and developers arose on Twitter in response to a tweet about this paper.
Like this:
Like Loading...
Tags:GovTrack, John D. Wilkerson, Joshua Tauberer, Legislative information systems, New Directions in Analyzing Text as Data, New Directions in Analyzing Text as Data 2012, Noah A. Smith, Prediction in legal informatics, Prediction in legal information systems, Quantitative legal prediction, Statistical analysis of legislative data, Statistical methods in legal informatics, Statistical prediction of bill passage success, Statistical prediction of committee approval of a bill, Tae Yano, Text as Data Conference, Text as Data Conference 2012
Posted in Applications, Articles and papers, Conference papers, Technology developments | Leave a Comment »
September 23, 2012
A call for presentation proposals — with submission deadline of 15 October 2012 — has been issued for ReInventLaw Dubai 2012: “an ‘un’conference devoted to law, technology, innovation, and entrepreneurship” — to be held 10 December 2012 at Media City in Dubai.
The organizers particularly welcome presentations about innovations in legal services or legal education. Presentations can take the form of 6 Minute Ignite Style Presentations or 12 Minute “TED Style” Presentations.
Registration is free.
The event Website describes the event as follows:
ReInvent Law Dubai is an “un”conference devoted to law, technology, innovation, and entrepreneurship.
Anyone interested in the future of law or technology or entrepreneurship will want to participate. Come hear about the innovative ideas generated by the highly-engaging atmosphere of the event!
The event is being sponsored by The ReInventLaw Laboratory at Michigan State University College of Law, and is modeled on the LawTechCamp London 2012 event held last summer.
For more information, please see the ReInventLaw Dubai 2012 Website.
HT @computational.
Like this:
Like Loading...
Tags:Big data and legal technology, Cloud computing and legal information, Daniel Martin Katz, Dubai Knowledge Village, Innovation in legal services delivery, Innovation in legal technology, Innovations in law practice, Law practice technology, lawTechcamp, LawTechCamp London, LawTechCamp London 2012, Legal education reform, Legal educational technology, Legal ethics, Legal instructional technology, Legal text processing, Quantitative legal prediction, ReInvent Law, ReInvent Law Dubai, ReInvent Law Dubai 2012, ReInvent Law London 2012, ReInventLaw Laboratory, Renee Newman Knake, Semantic annotation of legal texts, Semantic processing of legal texts, Statistical methods in legal informatics, Technology and access to justice, Technology and legal ethics
Posted in Calls for participation, Calls for proposals, Conference Announcements | 1 Comment »
September 15, 2012
Professor Edward K. Cheng of Vanderbilt University School of Law has posted Reconceptualizing the Burden of Proof, forthcoming in Yale Law Journal.
Here is the abstract:
The burden of proof is conventionally described as an absolute probability threshold – for example, the preponderance standard is commonly equated to anything greater than 0.5. In this Essay, I argue that this characterization of the burden of proof is wrong. Rather than focus on an absolute threshold, the Essay reconceptualizes the preponderance standard as a probability ratio, and I show how doing so eliminates many of the classical problems associated with probabilistic theories of evidence. Using probability ratios eliminates the so-called Conjunction Paradox, and developing the ratio tests under a Bayesian perspective further explains the Blue Bus problem and other puzzles surrounding statistical evidence. By harmonizing probabilistic theories of proof with recent critiques advocating for abductive models (inference to the best explanation), the Essay hopes the bridge a gap in current evidence scholarship.
HT Professor Dr. Dan Kahan
Like this:
Like Loading...
Tags:Bayesian statistics and legal evidence, Burden of proof, Edward K. Cheng, Judges evidentiary decision making, Judges' legal decision making, Legal burden of proof, Legal evidence information systems, Legal statistical evidence, Quantifying the legal burden of proof, Statistical evidence in law, Statistical methods in legal decision making, Statistical methods in legal informatics, Yale Law Journal
Posted in Articles and papers | Leave a Comment »
September 8, 2012
Professor Chad M. Oldfather of Marquette University School of Law, Professor Dr. Joseph P. Bockhorst of the University of Wisconsin Madison Department of Electrical Engineering and Computer Science, and Brian P. Dimmer, Esq., have published Triangulating Judicial Responsiveness: Automated Content Analysis, Judicial Opinions, and the Methodology of Legal Scholarship, Florida Law Review, 64, 1189-1242 (2012).
Here is the abstract:
The increasing availability of digital versions of court documents, coupled with increases in the power and sophistication of computational methods of textual analysis, promises to enable both the creation of new avenues of scholarly inquiry and the refinement of old ones. This Article advances that project in three respects. First, it examines the potential for automated content analysis to mitigate one of the methodological problems that afflicts both content analysis and traditional legal scholarship — their acceptance on faith of the proposition that judicial opinions accurately report information about the cases they resolve and courts’ decisional processes. Because automated methods can quickly process large amounts of text, they allow for assessment of the correspondence between opinions and other documents in the case, thereby providing a window into how closely opinions track the information provided by the litigants. Second, it explores one such novel measure — the responsiveness of opinions to briefs — in terms of its connection to both adjudicative theory and existing scholarship on the behavior of courts and judges. Finally, it reports our efforts to test the viability of automated methods for assessing responsiveness on a sample of briefs and opinions from the United States Court of Appeals for the First Circuit. Though we are focused primarily on validating our methodology, rather than on the results it generates, our initial investigation confirms that even basic approaches to automated content analysis provide useful information about responsiveness, and generates intriguing results that suggest avenues for further study.
Like this:
Like Loading...
Tags:Automated legal content analysis, Brian P. Dimmer, Chad M. Oldfather, Cosine similarity in legal content analysis, Florida Law Review, Joseph P. Bockhorst, Judicial responsiveness, Legal content analysis, Legal natural language processing, Legal text mining, Legal text processing, Natural language processing and law, Relationship between court decisions and briefs, Relationship between judicial decisions and briefs, Semantic analysis of court decisions, Semantic analysis of judicial decisions, Semantic analysis of legal briefs, Semantic analysis of legal documents, Semantic analysis of legal texts, Semantic processing of court decisions, Semantic processing of judicial decisions, Semantic processing of legal briefs, Semantic processing of legal texts, Statistical methods in legal informatics
Posted in Applications, Articles and papers, Research findings | Leave a Comment »
August 3, 2012
Professor Dr. Kevin R. Murphy and Professor Dr. Rick R. Jacobs, both of the Penn State University Department of Psychology, have published Using effect size measures to reform the determination of adverse impact in equal employment litigation, Psychology, Public Policy, and Law, 18, 477-499 (2012).
Here is the abstract:
A critical issue in large-scale employment discrimination litigation is whether the challenged practices have a systematic adverse impact on protected groups of applicants or employees. The methods that are currently used to detect adverse impact are seriously flawed, but these flaws could be addressed by putting more emphasis on commonly used research statistics that measure effect size, such as standardized difference ( d) or the percentage of variance (PV) explained. We recommend incorporating these indices as a baseline for detecting the presence of adverse impact, and that when PV or d is small, observed group differences should not be taken as evidence of adverse impact, regardless of the results of significance tests. Such an approach eliminates spurious charges of unfair discrimination, spares employers and the courts from dealing with claims that are frivolous and based on minutia, and helps enforcement agencies to concentrate their efforts on employment practices that are truly discriminatory. More generally, it will improve the use and interpretation of statistical evidence in litigation.
Like this:
Like Loading...
Tags:Effect size in legal evidence, Employment discrimination law information systems, Employment law information systems, Kevin R. Murphy, Legal evidence information systems, Legal statistical evidence, Psychology Public Policy and Law, Rick R. Jacobs, Statistical evidence, Statistical methods in legal communication, Statistical methods in legal evidence, Statistical methods in legal informatics
Posted in Applications, Methodology, Research findings | Leave a Comment »