Internal:Public Policy/Orphan works

From Wikimedia District of Columbia
< Internal:Public Policy
Revision as of 15:04, 27 March 2014 by Geraldshields11 (talk | contribs) (→‎Definitions: spelled out the word page per the precedent of earlier use in the document)
Jump to navigation Jump to search

Status: Draft

This is a draft of the Public Policy Committee's statement for the U. S. Copyright Office workshop on Orphan Works and Mass Digitization. We participated in the roundtable on March 10–11, 2014; the then-current guidance we prepared for our panelist is here.


  • The draft below is organized, as requested by the Copyright office, along the lines of the topics addressed by the panels, per the original inquiry. Please consult with User:Antony-22 before reorganizing sections.
  • They also want us to not just reiterate our spoken comments in our written statement, but want us to respond to what others said.
  • Should probably aim for about 5 pages
  • Want to get this done in advance of WMDC board meeting on March 29



Wikimedia District of Columbia respectfully submits these comments in response to the Copyright Office’s Notice of Inquiry dated February 10, 2014. We are grateful for the opportunity to participate in these roundtables and give the unique viewpoint of the Wikimedia movement, which we do believe has not yet been reflected in previous comments to the Copyright Office.

The Wikimedia community of volunteers worldwide is dedicated to creating "a world in which every single human being can freely share in the sum of all knowledge" through its educational projects, such as Wikipedia, Wikimedia Commons, and Wikisource. Wikipedia is the fifth-most-visited website on the Internet and the largest reference work ever assembled, with 20 million articles across 285 language editions. Wikimedia Commons is a media file repository of public-domain and freely-licensed educational media content containing over 20 million images, sound clips, and video clips. Wikisource is an online digital library of free-content texts, 300,000 of which are in English. All the content in these Wikimedia projects is freely available to be seen and used legally by anyone in the world because of a free-content license. Wikimedia content is created and maintained not by paid archivists, but by volunteers from around the world and from all walks of life who contribute by editing and uploading files to Wikimedia-supported wikis on the Internet.

Wikimedia District of Columbia is the regional Wikimedia chapter serving Maryland, Virginia, Delaware, West Virginia, and the District of Columbia. We are a nonprofit charitable organization dedicated to the advancement of general knowledge and the collection, development, and availability of educational content under a free license or in the public domain. As an official Wikimedia chapter, we support the goals of the Wikimedia Foundation, including the work of volunteers on Wikipedia and other free knowledge initiatives. We hold events, including training sessions and edit-a-thons, where experienced Wikipedians and newcomers alike have the opportunity to network and learn from each other. We work with museums, libraries, and other specialized institutions to share their knowledge with the rest of the world through Wikipedia, the world's most widely visited not-for-profit website. We are an all-volunteer organization.

Wikimedia's educational mission would benefit from orphan works legislation that is designed to minimize the burden on its volunteer contributors. First, the law should provide simple and clear guidelines on what satisfies the requirement of a diligent search, to avoid the chilling effect of uncertainty among users as to when orphan works can freely be uploaded. Furthermore, the law should not require users to register or to pay an escrow fee, which would unduly discourage users and make it difficult or impossible for Wikimedia projects to host orphan works. Most important, the law should reasonably limit liability for non-profit users of orphan works, and it should exempt educational institutions such as the Wikimedia Foundation from liability, for the same public-goods reasons that libraries, archives, and museums are exempt.

  • (refers to Google Books project and associated legal cases) We believe that a legislative solution is still necessary, as recent technological and legal developments are not useful for the types of works and uses that Wikimedia projects seek to make use of. For Wikimedia purposes, the 2006 orphan works proposals in the Copyright Office report would help us in our public mission.
  • We want any reform to minimize burdens on people who contribute to Wikimedia projects. The volunteers include high school students, preeminent scientists, and people from developing countries. The amount of burden in any reform will affect how we are able to use orphan works in the pursuit of our educational mission. Any reform that imposes large burdens such as an escrow fee, or that does not limit or remove monetary remedies for non-profit educational use, would likely prevent us from using them.

On the need for legislation

Wikimedia projects have long used public-domain and freely-licensed works to support Wikimedia's educational mission. These projects include using textual and visual media as part of encyclopedia articles and making them available for reuse by others. However, very few works first published after 1922 are in the public domain, and these very few are due to technicalities that are often hard to verify (partually due to lack of copyright records modernization). Wikimedia projects also use user-provided content granted under a free license, such as those of Creative Commons or the GFDL (GNU Free Documentation License), but these are almost all of very recent publication. Most media created from 1923 to the last ~10 years are therefore unavailable for use in Wikimedia projects.

  • Thousands of our volunteers, across the globe, have confronted these issues. They arise frequently. Some volunteers have built up a lot of expertise about it and documented procedures and recommendations online. We could quantify some of this effort if it would serve reformers to have evidence on the costs of the current system.
  • These works are also unlikely to be unorphaned using improved search tools or databases/registries of owner information, regardless of how much these are improved.
  • Since our goal is provide material that is freely available, the Wikimedia projects make very limited use of fair use. In fact, our internal criteria for non-free content are purposely much stricter than the law would allow for (see Wikipedia:Non-free content policy). We only use works under fair use if no freely licensed equivalent could possibly be created. For example, we generally do not host non-free photographs of living persons or of existing buildings, because a free equivalent could be created by a volunteer taking a new photograph. Thus an expansion of fair use, alone, does not enable readers of Wikimedia to benefit from most orphan works.

On defining "reasonably diligent search"

Upload to wp1.png
Upload to wp2.png
  • We want the procedure to be simple enough that our contributors can follow it easily by following a checklist on a web form (see examples at right). This includes a spectrum of Internet users of all ages and English fluency. The guidelines need to be acessible to individuals, not just to sophisticated players with specialized skills and/or teams of professionals supporting them. Wikimedians are systematically careful about copyright, and to do that the systems have to be easy for volunteers to learn and use, and for software to verify.
  • The law should provide simple and clear guidelines on what satisfies the requirement of a diligent search, to avoid the chilling effect of uncertainty among users as to when orphan works can freely be uploaded. These guidelines could be provided through either legislation or Copyright Office regulations. It is reasonable to have different guidelines for different classes of works.
  • Author groups at the roundtable expressed the concern that their works could be stripped of metadata and posted to the internet on sites like Twitter, thus becoming "instant orphans" exploitable by downstream users. Wikipedia's policies are careful about copyright; we require source and author information and actively delete uploads that to not provide this metadata. (w:WP:IUP#Copyright and licensing and w:WP:GID#Addressing suspected copyright infringement) In addition, the pages on which users can upload to the English Wikipedia require users to choose one of the three options that it is demonstrably a free work, it meets Wikipedia's criteria for fair use (which are much more stringent than what is legally allowable), or "I found this file somewhere, but I don't really know who made it or who owns it." Selecting the last of these brings up a message sternly warning the user not to upload it!
  • There is the possibility that we can internally verify orphan work status, either automatically or through some process involving other volunteers. This would be subject to internal comunity discussion.
  • Should comment on 2008 bill

On the role of registries

  • Mandatory registration of uses of orphan works would be a burden on us, but if the details are properly thought out this could potenitally be overcome. It would be especially nice if registration could be automated through our own upload form. Optional registration would be fine.
  • We recognize the interest of copyright holders in avoiding a situation where works could labeled as orphans then their uses could not be found again or tracked. Unlike other categories of reuse, any orphan work hosted on Wikimedia projects is widely available on the Internet and easily findable by a creator wishing to reclaim their rights. Each work classified as an orphan and stored in the Wikimedia servers could be found by searching categories of licenses. Such searches can be conducted quickly and easily for free from regular Web browsers anywhere without any need to log in. We can illustrate our mechanisms for such searches if it is of interest to orphan-works reformers. (Two such mechanisms are: search by category; and what-links-here from license templates). Wikimedia could therefore constitute a "registry" for orphan works it stores, and would certainly it easy for other registries to incorporate lists of the orphan works stored there.

On types of works, including photographs

  • Photographs should not be excluded from orphan works reform; in fact, these are the works we probably want the most!
  • The European orphan works reform directive (details here) did not cover photographs to the extent we need. It would not allow commercial use, and therefore orphan works could not be freely licensed in the way usable in Wikimedia projects.
  • The types of works we are interested are mainly historical photos and works such as those found in archives and libraries. These may be either attached to larger works like books, or be stand-alone items. We think that the body of works that the author groups are concerned about are different from the ones we want, and perhaps some line can be drawn to separate these (such as providing evidence that the work is at least 20 or 30 years old).
  • Orphan work status does not override privacy concerns and we already have guidelines about privacy
  • Our contributors are both individuals and partner organizations such as galleries, libraries, archives, and museums.
  • It can be unclear whether a copyright exists on a work, if for example its creation date is unclear (notably if it might be before or after 1923), the country in which it was created is unclear, or if it is unclear whether its creator was a U.S. federal employee on duty, or off duty, or a government contractor. It would be helpful for our mission to allow such cases to be searched, documented, and treated as orphan works and allow some use of them.

Wikimedia projects currently can use orphan works only when they would qualify for "fair use." One of the constraints on works used in a fair-use context is that they are available only for English-language projects. Where possible we use images and other works that are in the public domain globally; this helps serve readers of languages with less Web content. Some of those readers are Americans. (is it correct to say that "fair use" does not apply to native american languages? peter to check with Kristin)

Here we list examples where reform would help serve our mission.

  • Wikimedia DC's upcoming photo hunt in the South. Individually created photos will be under a free license, but buildings that no longer exist or had a different appearance need historical photos. We're seeking to partner with local archives and historical societies, but their photos often have no metadata. Without author and publication data, we have no way to determine if they are in the public domain and thus can't use them. Orphan works reform would however allow us to use these historical works to illustrate articles, and make them available for wider reuse by others.
  • Ezra Pound photo. A photo of the young Ezra Pound from 1898 (or maybe 1897) has no known copyright-holder and its status is unclear, partly because its date of first publication is unclear. Permission does not appear to have been given by any copyright holder. A dozen Wikimedians have worked on its provenance and copyright status. For the extensive and good-hearted and well-educated discussion, see [1], [2], and [3]. Plainly the public interest would be served by clarifying the issue and allowing the photo simply to be used.
    • Original comment from WP:VPM: "A particularly interesting (and still current) case concerns w:File:EzraPound&IsabelPound1898.jpg, a image that has been extremely carefully researched and for which quite a lot is known about its provenance. Its "non-free" categorisation has been raised here, spilling over to commons. Issues have arisen of whether the photographer, his employer, or the subject might have been the original copyright holder. It seems for early US photos, the subject was generally the default copyright holder. Bearing in mind that possibly no one ever knew who the copyright holder was it cannot be determined whether any of its publications over the years have been lawful and so copyright expiry cannot be determined. However, it seems to be accepted that WP's current use of the image (if it is still in copyright) meets WP's non-free use policy, Thincat (talk) 23:31, 8 March 2014 (UTC)"
    • Note that only works published before 1923 are in the public domain. If they were created before then but published after that date, or never published, they are still subject to copyright. See w:User:Antony-22/Drafts#PD table.
  • Family photos. I struggled with the decision about whether to use a photo of economist Bert Hoselitz that came from his family to illustrate the article about him. It wasn't clear what to do and I gave up, to the disappointment of people who had helped me. The original photographer could not be identified. Who held the copyright? How would I label it on Commons? I didn't want to end up in an argument. I'd already had to fight pretty hard to kill off a photo of someone else that was incorrectly labeled as a photo of Hoselitz. The photo was also quite dark--a side issue. To the point: if orphan works were a nicely available permission category on Commons, that would have helped me. -- Econterms (talk)
  • US government works where it is ambiguous whether they might have been produced by federal employees on duty (in which case the work is public domain) or by contractors (in which case the work is under copyright). National labs are a big examples of this. One of our users (Antony–22) found photos of scientists at a conference on the website of Oak Ridge National Laboratory. Luckily, ORNL has specifically clarified that all material on their website is public domain, but if this clarification had not been made the copyright status of these works would be unclear without specific information about the author, and would not have been able to been used.
  • Users avoiding specific topics. One member is often paralyzed with fear regarding materials since 1923, which might be under copyright, and retreats to the safe harbor of post-copyright materials before 1923. This user reports a chilling effect on writing on topics that requires sources written after 1923, due to the complex copyright laws. She would really like to write new articles about topics that are more modern than clipper ships and defunct Tibetan monasteries, and is eagerly awaiting reforms in the law

On types of users and uses

  • The Wikimedia Foundation is a non-profit educational organization, but it is not a traditional gallery, library, archive, or museum. Any legislation that treats these types of organization differently should be worded broadly enough that organizations like the Wikimedia Foundation would be included in that special treatment.
  • We do not consider licenses that ban commercial use to be free licenses; these must instead fall under our strict fair use restrictions to be used. If orphan works reform does not apply to commercial uses, that may affect our ability to use them. Wikimedia community discussion online will determine whether we create a class of restrictions intermediate between fair use and free content.
  • Public access is a core part of our mission and must be allowed by orphan works reform.

On remedies

  • We support low caps on remedies (licenses/infringement paid to copyright owners) for not-for-profit uses of works judged as orphans whose owner surfaces later. The draft legislations lists of institutions that have a low cap or zero on how much they would owe for infringement. We would like Wikimedia projects to be part of that list. We would like the law to include educational nonprofits generally in the legislation and supporting documents not only identifiable museums, archives, and galleries.
  • We are also, of course, happy to remove an orphan work if the creator requests it. We make it easy for rightsholders to notice and request removal of their work through non-legalistic means.

On extended collective licensing

  • Requiring payment to use an orphan work, e.g, a self-defense fee in escrow would likely prevent ("be prohibitive for") most uses on Wikimedia, as the Wikimedia Foundation and many of our volunteers have limited financial resources.
  • We have partnerships with GLAM organizations in which they digitize their own holdings but post them on Wikimedia Commons intead of their own website; any reform should not impose any new burdens on this.


Source for most of these: 2006 copyright office Report on Orphan Works

  • orphan works -- These are works whose copyright owners cannot be identified or cannot be located, and therefore permission cannot be obtained from them for using the works. (The 2006 report, page 35, excludes works for which it is not known whether a copyright currently applies, e.g. because creation date or location or creator are unknown. That report refers to such situations as "problems determining copyright status. Wikimedia DC would definitely prefer that the legislation includes such cases, which are just like orphans in our practice. See our Oak Ridge example for a case like that. However, for clarity of communication, note that the Copyright Office report did not call those orphan works.)
  • registry -- In this context, this refers to databases where works and their copyright ownership are listed and searchable. There are many registries of copyrighted works whose copyright-owner is known; there are, or could be, registries of orphan works which identify someone who identify those works as orphans, describe the search they conducted to find an owner, and how the orphan-work was used.
  • 1976 Copyright Act -- This U.S. law says a copyright applies as soon as a work is "fixed in a tangible medium of expression." It removed the requirement of the 1909 Act that a work be registered with the Copyright Office. The 1976 law was more compatible with the international Berne Convention. (These statements are simplifications of a more complex legal situation.)
  • Berne Convention -- This treaty on copyrights forbids national laws from imposing the requirement that creators of works would have to register them explicitly, or renew them, or give certain kinds of notice to users. Apparently, it says copyrights must apply mostly automatically. (There had been a lot of complaints that it was hard to track and keep up the copyrights by freelance essayists, for example.) It is implied that the U.S. didn't sign on to this convention, or not early anyway.
  • diligent search -- a likely requirement of a law permitting the use of orphan works is that the user has tried to search for the copyright owner, and the law introduces less friction to the extent if makes a clear explicit feasible to-do list for that user to satisfy the search requirement. So one "diligent search" requirement might be that the user has conducted certain queries to specific registries to see if the works were listed there. A search can be more or less deep or diligent (it's both a quantitative and qualitative spectrum), so people say sometimes refer to a "reasonably diligent search" when referring to that threshold that is sufficient for a user to use the orphan work.
  • no response case -- Suppose a copyright owner is identified but does not respond to requests by potential users of the work? The Copyright Office report recommends against calling this an "orphan works" case, because the copyright owner is not obliged to respond. However this is in the class of marginal cases that commentators of the orphan-works cases sometimes mention or make recommendations about.
  • protection and protected -- In Copyright Office reports, a work is protected if it is copyrighted, and its owner is protected. In general, I recommend our documents avoid this term, because it's subject to instant misinterpretion: Wikimedians often think about whether a user or use of a work is protected from claims/accusations/prosecution for copyright infringement. The purpose of an orphan works rule would be to "protect" users and uses -- but this is the wrong use of the word to a copyright expert. It is better for broad communication to say that we want a "safe harbor" for users and uses. (The term comes from User:Slowking4 and may need a brief definition in our documents if the copyright people do not recognize it.) If asked about "protection," think thrice before answering; it's trappy.
  • safe harbor: the idea that a user of a work can follow some procedures to sharply reduce risks of prosecution and payment of damages; this is what an orphan works law would deliver.
  • Collecting societies or Reproduction Rights Organizations act as agents for a large number of copyright owners. Collecting societies administer copyrights, and collect and distribute income, in relation to copyrights owned by their members. (Source: Society of American Archivists paper below.) These are talked about a lot in the copyright and orphan works realms, especially internationally. They don't seem relevant to Wikimedia cases.
  • "Formalities" -- The Berne Convention requires that the countries do not impose a filing or registration requirement on creators of works to get them copyright; basically it must be easy or automatically. The US got on board with this in 1988 more or less. (2006, page 60)
  • The TRIPS Agreement is part of the WTO agreement and explicitly bans formalities to get a copyright on a created work. (2006, page 60)


Statements opposing orphan works legislation
  • Lawrence Lessig. Little Orphan Artworks. Op-Ed, New York Times, May 20, 2008. -- Lessig opposed the Copyright Office's 2008 proposal (possibly not the same as the 2006 proposal) because it is not clear what a diligent search is, except that it involves expensive copyright specialists, that the new freedom for users would interfere with the expectations of copyright holders and creators; and that he had a better design which was that for U.S. works copyright should be automatic for 14 years or so then require registration. Econterms's view: While one hesitates to disagree with Lessig, the 2006 proposal would be at least be better for Wikimedia users than the present situation, and his proposal which would also be an improvement for Wikimedia users is not apparently being broadly considered (although we might think it through as an alternative in the future).
  • NPPA Cannot Support Orphan Works Legislation. National Press Photographers Association. May 7, 2008. -- They oppose it, describing it as a kind of threat, and focused on commercial use of their works without licensing. Not convincing, and they do not address the purposes of educational nonprofits. They say the 2008 version is better than the 2006 version, which suggests that from our point of view it was worse, although I didn't see specifics.
  • Orphan Works statements by Library Copyright Alliance. Key observation: In their latest statement, the LCA supports expanded fair-use, not the 2006 recommendations. We prefer the 2006 recommendations. Their latest statement says that the widespread support of the 2006 proposals has shattered, and they give examples ; if true this is unfortunate.

See also

  • /Notes for summaries and notes on some of the sources above
  • This page's talk page for discussions and earlier drafts of sections of this page