RESOURCE: Writing an extension to add new GREL functions to OpenRefine

Owen Stephens (Open University) has authored a post documenting the development of the Google Refine Expression Language (GREL) extension he wrote to add functionality to OpenRefine. He writes: [W]hile OpenRefine extensions could be complex, writing one that simply added a new GREL function was quite straightforward with a little boilerplate code and the actual GREL function written in ...

RESOURCE: Endangered Accountability: A DLF-Sponsored Webinar on FOIA, Government Data + Transparency

The audio and slides from the Digital Library Federation’s #EndangeredData week webinar are now available. “Endangered Accountability: A DLF-Sponsored Webinar on FOIA, Government Data, and Transparency” marked the close of a week (April 17-21, 2017) dedicated to “raising awareness of threats to publicly available data; exploring the power dynamics of data creation, sharing, and retention; ...

RESOURCE: Data Privacy Project Learning Modules

The Data Privacy Project has launched Learning Modules on the building blocks of privacy protection and digital security. The three modules, Mapping Data Flows, Risk Assessment, and Historical Overview, offer background and details on threats and protections. Additionally, the Curriculum section provides teaching materials, strategies, and hands-on guides for librarians interested in exploring ...

RESOURCE: Perseids: Experimenting with Infrastructure for Creating and Sharing Research Data in the Digital Humanities

The latest issue of Data Science Journal includes an article from Bridget Almas (Tufts University), “Perseids: Experimenting with Infrastructure for Creating and Sharing Research Data in the Digital Humanities.” From the abstract: The Perseids project provides a platform for creating, publishing, and sharing research data, in the form of textual transcriptions, annotations and analyses. An ...

RESOURCE: “What Constitutes Peer Review of Data: A Survey of Published Peer Review Guidelines”

Todd A. Carpenter (National Information Standards Organization) uploaded “What Constitutes Peer Review of Data: A Survey of Published Peer Review Guidelines” to the Cornell University Repository, where it is available to download. The paper reviews the policies around publication of data sets and how peer review applies to them. Carpenter writes: The process of peer review of articles ...

RESOURCE: NMC Horizon Report, 2017 Library Edition

The New Media Consortium (NMC) Horizon Report > 2017 Library Edition was jointly released by the NMC, University of Applied Sciences (HTW) Chur, Technische Informationsbibliothek (TIB), ETH Library, and the Association of College & Research Libraries (ACRL) at the ACRL 2017 Conference last week. The report identifies “trends and technology developments” that drive the five-year ...

RESOURCE: New and Improv(ing) Digitization Cost Calculator

Joyce Chapman (Duke University) and Genya O’Gara (Virtual Library of Virginia) contributed an update about the Digitization Cost Calculator developed by the Digital Library Federation Assessment Interest Group (DLF AIG) in the March/April Edition of D-Lib Magazine. The purpose of the Calculator is to support institutions in estimating the costs of digitization projects before they ...

RESOURCE: Using Text Analysis to Discover Work in JSTOR

Jason B. Jones (Trinity College) wrote a ProfHacker post discussing the JSTOR Labs Text Analyzer, a new tool that allows researchers to locate relevant related articles from the JSTOR archive by uploading a source text of the user’s choice. JSTOR says that it compares the text to a list of 40,000 topics and a set ...

RESOURCE: Teaching Yourself to Code in DH

In “Teaching Yourself to Code in DH,” Scott Weingart (Carnegie Mellon University) has compiled an annotated list of “book-length introductions to analytic programming in DH.” Weingart invited participation via Twitter and a Google spreadsheet as part of a larger project collecting humanities research methodologies. He then culled the most relevant from these to form an ...

RESOURCE: Computation and the Humanities: Towards an Oral History of DH

In a recent news post, the European Association for Digital Humanities (EADH) draws attention to a new open access book by Julianne Nyhan and Andrew Flinn (both at University College London), Computation and the Humanities: Towards an Oral History of Digital Humanities. As EADH notes, “By taking an oral history approach, this book explores questions ...

RESOURCE: On a Collections as Data Imperative

The Library of Congress has published “On a Collections as Data Imperative” by Thomas Padilla (UC Santa Barbara). The article focuses on rethinking and reframing collections as data, which then opens up the “potential of a collections as data imperative”: To see collections as data begins with reframing all digital objects as data. Data are defined ...