Archive for the ‘Commentary’ Category

CFR in XML Available for Bulk Download from GPO

December 26, 2009

[NOTE: Updated on 27 December 2009 to add the final four paragraphs.]

The U.S. Code of Federal Regulations (CFR) for 2007-2009 is now available for bulk download in XML from U.S. Government Printing Office (GPO) FDsys. Click here for the user guide.

Why is this of potential interest to the legal community? First, attorneys, legal IT personnel, and law librarians can download the CFR in XML and process it so that they, their colleagues, and the communities they serve can use it free of charge, without incurring the costs of using a for-fee online service. The XML markup enables the code to be output in a wide range of formats or integrated with a number of other information resources.

Second, many organizations that publish legal information for free on the Internet or at low cost — such as the Legal Information Institute at Cornell University and Princeton University’s Center for Information Technology Policy and its FedThread federal regulations publishing project — can download the CFR, process it, and make it available to the legal community and the public. This should result in greater competition in the market for legal information and ultimately lower costs for users of legal information.

This is the second major legal information access initiative at GPO this year. Earlier in 2009, GPO began making available the Federal Register for bulk download in XML.

These data access initiatives are consistent with such law-related Open Government Data activities as the U.S. Government’s Open Government Directive and the Law.gov project, which will be the subject of a number of public meetings throughout the U.S. in the first half of 2010.

Legal Information & the Ithaka Report on the Federal Depository Library Program

December 26, 2009

[NOTE: Updated on 27 December 2009 to refer to the Law.gov project.]

A report recommending changes to the U.S. Federal Depository Library Program (FDLP) was issued this month by ITHAKA S+R: Roger C. Schonfeld & Ross Housewright, Documents for a Digital Democracy: A Model for the Federal Depository Library Program in the 21st Century (Dec. 17, 2009). The report was funded by the Association of Research Libraries (ARL) and the Chief Officers of State Library Agencies (COSLA), many of whose members are Federal Depository Libraries or administrators of such libraries. This post will discuss the report’s treatment of U.S. federal legal information.

The ITHAKA report recommends that the FDLP be reformed to meet five objectives:

  1. “Newly issued [U.S. federal] government information must be made freely available in digital form and must be preserved for the long term.”
  2. “To provide [] permanent public access for the historical collection [of U.S. federal government documents], a significant program of retrospective digitization is required.”
  3. “…[S]ome original print copies [of U.S. federal government documents] must continue to be preserved even though fewer depository library collections overall will be required”
  4. “The print format will continue to have advantages for certain subsets of material types and user communities, so the [FDLP] must provide appropriate access to certain historical and new [U.S. federal government documents] in print form, where appropriate via print on demand.”
  5. “Depository libraries must reemphasize their commitment to serving user needs for outreach, discovery, and access.”

Five issues respecting U.S. federal legal information seem especially relevant for the legal informatics/communication community: public access, metadata, long-term preservation, authentication, and usability. Happily, the ITHAKA report addresses all of these issues to some extent, and identifies potential roles for legal informatics developers and researchers in providing services related to U.S. federal legal information.

Respecting newly issued U.S. federal legal information, commonly called “born digital,” the report addresses public access, preservation, authentication, and discovery. While Congressional and some U.S. executive branch materials are discoverable, freely available to the public, and preserved via the U.S. Government Printing Office (GPO)‘s FDsys content management system, or via GPO’s partnerships with federal agencies, many executive and judicial branch law-related materials are either not freely accessible to the public or not treated for long-term preservation. Further, only a portion of the law-related information in FDsys is authenticated by GPO, while none of the law-related federal information distributed outside of FDsys appears to be authenticated. Respecting federal law-related information not in FDsys or under a GPO-agency partnership, the report urges GPO to implement technology “to harvest content directly from agency websites for incorporation into the FDLP.”

Yet whether GPO will implement such a harvesting system is unclear. Further, problems and delays with the U.S. National Archives and Records Administration (NARA) Electronic Records Archive (ERA) mean that NARA is now unable, and will likely be unable in the foreseeable future, to reliably preserve or authenticate law-related digital federal government information.

These recommendations and circumstances indicate several roles for civil society organizations, including free-access-to-law organizations such as the Legal Information Institute at Cornell University Law School, law libraries, other not-for-profit entities, and for-profit enterprises.

  • First, these entities could continue to publish and preserve current law-related federal digital information.
  • Second, these entities could continue to develop services to enable discovery of federal law-related information.

To foster these first two activities, the ITHAKA report urges GPO’s FDsys to offer “bulk download methods” (such as those already available for the Federal Register and Code of Federal Regulations, and consistent with the goals of the Law.gov project) and publicly available APIs (application programming interfaces) to enable civil-society participants to access and distribute full text and metadata of law-related U.S. government information.

  • Third, members of civil society might advocate for more extensive authentication of federal legal information by the appropriate federal bodies, and oversight of federal authentication and preservation efforts by third parties, such as by an entity using the Center for Research Libraries’ Trustworthy Repositories Audit & Certification (TRAC) process. The ITHAKA report recommends that “greater assurance about [GPO’s work in th[e] area [of digital preservation and authentication] should be provided via outside auditing and certification.”

The ITHAKA report also calls for “a significant program of retrospective digitization” of U.S. federal documents, in which historical documents would “be digitized comprehensively and at a sufficiently high level of quality.” The report recommends that civil society organizations play a key role in retrospective digitization. Since the U.S. federal government is unlikely to fund high-quality digitization of the entire historical federal documents collection, the report asserts that such digitization must involve “library digitization projects [and] mass digitization initiatives (involving a number of libraries sometimes in cooperation with Google, the Internet Archive, or other non-library partners).” The report urges GPO to coordinate these efforts.

As the ITHAKA report notes that most retrospective digitization efforts in the coming years are likely to use low-quality mass digitization methods — that tend to produce files exhibiting problems with legibility and usability by humans or machines, and that are usually unsuitable for long-term preservation — there appear to be rich opportunities for civil society organizations to undertake the digitization of historical law-related federal information, by means of high-quality digitization methods consistent with current long-term digital preservation standards. This suggests the continued value of efforts such as those of the Rutgers University Camden Law Library Congressional Documents and Federal District Court Decisions collections.

To ensure permanent public access to these digitized materials, the report recommends that “[d]igitized materials [] be deposited whenever possible into FDsys, documenting chain of custody and digitization standards.” “When [deposit into FDsys] is not possible, GPO should provide coordination including standard bibliographic control via formal partnership agreements with other government agencies and outside institutions.” Thus the report envisions civil society organizations’ playing a role not only in digitization but also in long-term preservation of digitized historical federal legal information. This recommendation suggests a prominent role for legal information archives coordinated by the Legal Information Preservation Alliance (LIPA), such as the Chesapeake Project.

The ITHAKA report’s recommendation that “some original print copies [of U.S. federal government documents] must continue to be preserved” suggests the value of LIPA’s efforts to coordinate preservation of U.S. federal legal resources in print format among U.S. law libraries.

Finally, the ITHAKA report’s recommendation that “[d]epository libraries [] reemphasize their commitment to serving user needs for outreach, discovery, and access” underlines the importance of making federal legal information usable. This suggests the value of adding good-quality secondary resources to open collections of primary legal resources — as Cornell’s Legal Information Institute has done with its Wex legal encyclopedia, and as Google has begun to do within Google Scholar.

That said, the ITHAKA report exhibits certain shortcomings respecting its treatment of U.S. federal legal information. First, because the report is not based on a formal user needs assessment, the report’s findings and recommendations respecting the identity and needs of users of federal legal information lack empirical support. For example, respecting uses of federal legal information by persons not connected with the legal system, the report appears to mention only those uses related to “legal disputes.” The report thus seems to neglect the wide range of non-dispute-related purposes for which nonlawyers use federal legal information, such as for engaging in personal or business transactions, personal financial planning, estate planning, starting a business, or expanding one’s existing business.

Second, the report neglects to mention several important categories of frequently used federal legal information, including:

  • Legislative history materials;
  • Federal court documents other than court decisions — such as pleadings, motions, briefs, and docket information;
  • Federal court publications, such as Federal Judicial Center reports, treatises, and handbooks; periodic reports, policy documents, and statistical reports published by the Administrative Office of the United States Courts or the federal judicial conferences; and reports of committees and commissions appointed by the federal courts;
  • Administrative decisions;
  • Regulatory history materials;
  • Policy documents issued by executive or legislative agencies;
  • Reports of executive or legislative commissions; and
  • General Accountability Office reports.

Nonetheless, the ITHAKA report offers a vision for the FDLP in which U.S. federal legal information is more likely to be found, authenticated, used, made more useful, and preserved for future generations.

Interlaken Conference on the Future of the European Court of Human Rights

December 25, 2009

[NOTE added on 28 February 2010:

]

A Conference on the Future of the European Court of Human Rights, among States that are subject to the jurisdiction of that court, will be held February 18-19, 2010, in Interlaken, Switzerland. The purpose of the conference is to enable the Member States to engage in strategic planning for the Court respecting the next decade. According to the official preparatory memorandum, the conference will address two key issues:

  • The Court’s large and growing caseload; and
  • Problems with enforcement and effect of the Court’s rulings at the Member State level.

The Court identifies the following primary causes for these problems:

  • [M]any applicants are not familiar with either the substantive limits of the Convention or the procedural conditions for admissibility.”
  • “The high number of repetitive applications before the Court is an indication that the subsidiarity principle does not operate adequately” (e.g., many Member States allegedly:
    • are not providing adequate remedies for human rights violations,
    • are permitting individuals to apply to the Court without having exhausted national remedies,
    • are failing to enforce the Court’s judgments, or
    • are declining to conform their law and practices to Court rulings applicable to other Member States).

To address these problems, the Court proposes several reforms. Of those reform proposals the following involve legal information or communication issues:

  • “[S]et up special sections, an applications division …, or another filtering body, … the Court proper ruling only on those cases found admissible”;
  • Create a “Human Rights Tribunal [similar to the Court of First Instance of the ECJ] subordinate to the Court” which Tribunal “would deal with admissibility [while] the Court would rule on the merits”;
  • Create “a preliminary reference mechanism or possibly an extension of the Court’s advisory competence”;
  • Require each State to implement the following corrective measures “at the national level”:
    • providing human rights training to its citizens;
    • translating the Court’s judgments into the languages of its citizens;
    • “execution of national judgments”;
    • “solutions for the excessive length of proceedings”; and
    • “reopening of proceedings following Strasbourg judgments.”
  • Require States to “execute the Court’s judgments promptly”;
  • Furnish “human rights training” to the States (see, e.g., the Court’s Warsaw Pilot Project, in which the Court educates potential applicants about the Court’s procedures and substantive human rights law);
  • “[E]nsur[e] better dissemination of the Court’s case-law”;
  • Authorize “’[c]lass actions’ or collective applications”; and
  • Refer “purely repetitive cases to the Committee of Ministers and/or to the States concerned.”

Also of interest to legal informatics researchers is the Court’s description of information- or communication-related steps it has already taken to reduce its caseload or expedite case processing:

  • Adding a fifth section (or panel) of judges to the court;
  • Deciding on admissibility and the merits at the same time;
  • Encouraging settlement;
  • Accepting “unilateral declarations of violations”;
  • Implementing a “pilot judgment” procedure;
  • Simplifying the drafting of judgments;
  • Creating “a new order for processing applications based on well-defined criteria”;
  • Implementing better data processing tools, and
  • “Develop[ing] its Research [and Library] Division.”

For more information on the Interlaken conference and stakeholders’ preparations for it, please see Antoine Buyse’s very informative posts on the ECHR Blog.

Drupal for Courts

December 22, 2009

A group of legal informatics researchers, of which I’m one, is currently trying to identify courts that use the Drupal open-source content management system for their Websites or other information systems.

So far, we have identified just one court Website that uses Drupal: the emergency preparedness site of the U.S. District Court for the Central District of California.

If you know of other uses of Drupal by courts, in the U.S. or in any other jurisdiction, we would be most grateful if you would please identify them in the comments.

Interest in Drupal in connection with law-related Websites has grown in recent months in the wake of the U.S. Government’s decision to use Drupal for a number of Executive Branch Websites, including the White House, and a number of sites hosted by the U.S. Department of Commerce.

Legal information systems administrators on the TEKNOIDS listserv recently identified the following potential benefits of using Drupal for court information systems:

  • Because Drupal is open source and free of charge, use of Drupal:
    • entails lower cost;
    • gives one access to a depth of community support, via a large and growing community of developers;
    • gives one the option to make use of professional support, but at a lower cost than support for proprietary systems;
    • offers access to the code; and
    • offers the ability to freely make modifications.
  • As a content management system, Drupal facilitates a higher quality of Web publishing with greater functionality than does a collection of inert HTML pages.
  • Drupal is more secure than most other Website platforms.
  • Drupal sites can be of any size, and can be designed for scale.
  • Drupal sites can be hosted in the cloud. They do not require that a court or court system maintain its own servers.
  • Drupal accords with courts’ character as communities:
    • Drupal provides tools for exchanging information easily and quickly between personnel of the same court or court system.
    • Drupal creates a culture in which personnel from different courts or court systems can share expertise.
  • Drupal enables the creation of a uniform look and feel for all court Websites within a given system, while also allowing individual courts to have custom applications and remain part of their larger community:
    • Drupal allows all of the courts within a given system to share information, while enabling each individual court to maintain a unique web presence.
  • Drupal provides excellent tools for managing users and their access
    to data on the system:
    • Drupal allows access to sections of a court’s Website to be restricted to members of particular groups, such as attorneys who practice before that court, or pro se litigants.
    • Drupal enables single sign on.
  • Drupal has tools that can manage the flow of information to
    interested parties:
    • Drupal enables court personnel to easily publish current information via Websites and blogs.
    • Information published through Drupal can then be distributed — widely or in targeted fashion — via a multitude of RSS feeds, email notification, etc.
  • Drupal enables the interoperability of government information:
    • With more and more government agencies using Drupal, Drupal use in the judicial branch would increase the interoperability of information between branches of government.
    • As Drupal is being adopted by federal, state, and local government agencies, Drupal fosters interoperability of information between different levels of government.
    • Data published through Drupal is often stored in a MySQL database, making the data portable and increasing the likelihood of standardization.
  • Drupal enables courts, if they wish, to allow the public to provide input via court Websites.
  • Bulk data can be made available to the public via Drupal.

Many thanks to Elmer Masters, John Mayer, Dan Nagy, and Stuart Sierra for their comments.

Update on Legal Information Preservation: 2009 LIPA Minutes Available

November 23, 2009

Minutes are now available of the July 26, 2009 meeting of LIPA, the Legal Information Preservation Alliance, held in Washington, DC, at the 2009 AALL Annual Meeting. LIPA coordinates legal information preservation efforts in the United States. Here are some highlights of the July 2009 meeting:

For more information, please see the minutes, or visit the LIPA Website.

New Blog on Developing Digital Law Collections: Hacked Librarian

November 18, 2009

John Joergensen, Esq., the award-winning digital law library developer, and creator of the Rutgers University Camden Law Library Digital Collections, has launched a new blog, called Hacked Librarian, focused on the details of creating digital legal collections.

The first post, Supreme Court Documents, lays out the procedures John uses to process U.S. Supreme Court decisions to make them available in the Rutgers Camden Federal Courts Database.

I understand that John plans to use this blog to publish a range of procedures used in processing documents for the Rutgers University Camden Law Library Digital Collections. This blog complements the Cornell Legal Information Institute’s LexCraft Wiki, where developers of digital law collections share technical information and advice. Both Hacked Librarian and LexCraft are excellent resources for those developing or maintaining digital legal collections.


Follow

Get every new post delivered to your Inbox.

Join 106 other followers

%d bloggers like this: