Open Access Faces Many Problems; Here's One That The Indispensable Internet Archive Is Helping To Solve

from the now-would-be-a-good-time-to-make-a-donation dept

As Techdirt has reported many times, open access is a self-evidently great idea, but one that is still beset with many problems. Not least among them is that academic publishers are keen to remain in control of any transition to open access, and aim to maintain their extremely high profit margins whatever the publishing model. But there's one problem for open access that ironically derives from its greatest strength -- the fact that anyone can access journals at any time, for free. Because material is always available, librarians have tended not to worry about making backups. That's not the case for traditional journals, where a cancelled subscription is potentially a big problem: the end of a subscription often means that readers lose access even to material they could previously read. To address this, librarians have come up with a variety of ways to ensure "post-cancellation access", explained well in a 2007 post on a blog about digital preservation, written by David Rosenthal. A recent article on the Internet Archive site provides some interesting statistics on the scale of the problem of creating permanent copies of open access titles:

Of the 14.8 million known open access articles published since 1996, the Internet Archive has archived, identified, and made available through the Wayback Machine 9.1 million of them... In the jargon of Open Access, we are counting only "gold" and "hybrid" articles which we expect to be available directly from the publisher, as opposed to preprints, such as in arxiv.org or institutional repositories. Another 3.2 million are believed to be preserved by one or more contracted preservation organizations, based on records kept by Keepers Registry... These copies are not intended to be accessible to anybody unless the publisher becomes inaccessible, in which case they are "triggered" and become accessible.

This leaves at least 2.4 million Open Access articles at risk of vanishing from the web... While many of these are still on publisher's websites, these have proven difficult to archive.

That's a pretty serious problem, and one which the Internet Archive is taking steps to address, for example by trawling through the petabytes of Web content that it has built up since 1996. There's an editable catalog with an open API that aims to provide "Perpetual Access to Millions of Open Research Publications From Around The World". The Internet Archive has also created a full-text search index of more than 25 million research articles and other scholarly documents.

Although few people are aware of this project, it is vital work. There is little point in publishing open access titles, theoretically available to all, if their contents simply disappear at some point in the future. The Internet Archive's copies will ensure that doesn't happen. They are yet another indication of the invaluable and unique role the site plays in the online world. Without it, we would already have lost so much of the amazing material that was once online, but which has since vanished except for the copies held by the Wayback Machine. That's another good reason to support this incredible, free resource financially, and to help defend it from incredibly selfish attacks by publishers.

Follow me @glynmoody on Twitter, Diaspora, or Mastodon.

Filed Under: culture, open access, research
Companies: internet archive


Reader Comments


  • Pixelation, 5 Oct 2020 @ 9:36pm

    What's in a name

    When I hear Open Access, I don't immediately know what that is. I wonder if a name change would make a difference? I guess, when I hear Open Access, I think, "Open Access to what?" Perhaps I'm being pedantic.

  • Brewster Kahle, 6 Oct 2020 @ 8:33am

    Open Access institutions are building a new ecosystem

    Thank you for the hat-tip to the Internet Archive and the project to support the commons. There are many of us supporting open access.

    When materials are open access, institutions can more easily cooperate because we do not need NDAs, contracts, lawyers, firewalls, etc.

    Open Access journal articles are cited more and apparently read more, so this is a great way to move forward. The Internet Archive can serve the role of backup, but also bulk access for researchers.

    I am looking forward to the meta-science, the science of studying scholarly output, that can be more easily done because the materials are publicly accessible and in bulk.

    -brewster

  • Samuel Abram, 6 Oct 2020 @ 12:14pm

    Internet Archive under siege

    The very facts that

    1. The Internet Archive is being sued for billions in © infringement by the major publishers, and
    2. the publishers are likely to win

    are why I donate as much money as I can to them (and legally upload all I can). The Internet Archive is far too valuable a resource to perish.

    • jersey111, 25 Jan 2021 @ 6:39pm

      Re: Internet Archive under siege

      The lawsuit has no real merit, and the publishers know this. The Internet Archive is going to be fine, judging by recent events.

  • Crafty Coyote, 6 Oct 2020 @ 3:27pm

    If copyright infringement is the same as stealing a car, then the Internet Archive is the dealership that has an infinite number of cars in its lot, and is currently asking for people to send in more and "steal" more.
