* To visit the full conference website click here!
Back To Schedule
Wednesday, September 20 • 11:45 - 12:45
Panel - Let the Computer do the work! Use of Computational Tools in Audiovisual Cataloguing

Sign up or log in to save this to your schedule, view media, leave feedback and see who's attending!

Feedback form is now closed.
The session will present the use of computational tools to enhance discoverability of digital audiovisual collections. In the digital age, large A/V collections face the dilemma necessary cataloguing for search and discovery with search engines requiring text to retrieve relevant items. But as large numbers of analog collections are digitized, and with the influx of more digital materials, how can archives and libraries keep up? Computational tools are improving with the recognition of speech, images, and audio waveform patterns. Within the next five years, archives and libraries could begin to utilize these tools to automate much of the cataloguing of their digital files, or at the very least identify relevant people, topics, and locations associated with the content.

The American Archive of Public Broadcasting, with support from IMLS and in collaboration with Pop Up Archive, created more than 71,000 speech-to-text transcripts using a specially-trained version of the open source Kaldi software. We will show an online game called FixIt to crowdsource correction of speech-to-text transcripts.

The Netherlands Institute of Sound and Vision has been using automatic speaker labeling and thesaurus label extraction from subtitles to achieve fine-grained access. Currently we are at the stage of implementing 2.0 versions of these annotation techniques to improve quantity and quality. We intend to combine techniques and automate our workflows to enhance the results with minimal manual effort.

Brandeis University will demonstrate the suite of Natural Language processing tools available on the LAPPS grid. The Galaxy platform and workflow tools will also be demonstrated.

avatar for Karen Cariani

Karen Cariani

Senior Director Media Library and Archives, WGBH Educational Foundation
I am passionate about making media archives accessible on-line. This goes hand in hand with digital preservation, metadata processes, and systems to manage both. I seek to use technology as much as possible to help archivists and librarians with their work.

avatar for Karin van Arkel

Karin van Arkel

Ingest team coordinator, Sound & Vision
Karin van Arkel is manager of the Archive at Sound & Vision in The Netherlands. She is responsible for the selection, ingest, digitisation and access of all collections in the institute. Sound & Vision do not only archive content from broadcasters, but also websites, webvideo, podcasts... Read More →
avatar for Casey Davis Kaufman

Casey Davis Kaufman

Associate Director, WGBH Educational Foundation
Casey Davis Kaufman (she/they) is Associate Director of the GBH Archives & Project Manager for the American Archive of Public Broadcasting.
avatar for Tim Manders

Tim Manders

Advisor, Netherlands Institute for Sound and Vision
Tim Manders works at the Exploration department of the Netherlands Institute for Sound and Vision. He is experienced in operationalising automatic annotation techniques such as speaker labeling, thesaurus label extraction and face recognition, applied on daily ingest into the archive... Read More →

Wednesday September 20, 2017 11:45 - 12:45 CEST
Ethnologisches Museum, Room 1 (Auditorium, off the Lower Foyer) Lansstraße 8, Berlin, Germany