Loading…
Wednesday October 7, 2026 12:30pm - 1:00pm GMT-03
Many waves in technology development are stories of powerful entities becoming more powerful.  However, our work with audiovisual archives aims technology development along an alternative trajectory: rather than reinforcing the global monoculture, preserving the diversity of human experience and reducing the risk that less that less visible communities are absent from the historical record.


In this presentation, we describe work that follows this alternative trajectory to address such “archival absences". First, we introduce the CLAMS platform for computational processing of audiovisual archives. CLAMS is an open-source software designed to operate on modest, local hardware, permitting data sovereignty. With this computational context, we present two case studies from the collaboration between archivists at the American Archive of Public Broadcasting and researchers at Brandeis University on using technology to repair archival absences.  


The first case study focuses on cataloging items in broadcast television archives, demonstrating a multi-stage AI processing pipeline used by archivists to surface lesser-known individuals who appeared in old broadcast television programs in order to enrich metadata and improve discoverability.


The second case study focuses on linguistic absence and diversity. While ASR systems such as Whisper yield high quality transcripts of speech in high-resource languages, these systems offer no benefit for the unsupported languages and even produce misleading and nonsensical outputs. We present ongoing work on language identification for multilingual broadcast content, focusing on indigenous languages such as Yupik and Samoan. This includes challenges in segmentation, annotation, and evaluation, as well as the design of annotation workflows in collaboration with community experts.
These case studies illustrate how computational tools, when designed in collaboration with archivists and communities, can support more inclusive and representative audiovisual archives.  We close the presentation with an invitation to consider other sources of archival absence that can be resisted through creative collaborations between archivists and AI.
Speakers
avatar for Yangyang Chen

Yangyang Chen

PhD Student, Brandeis University
Yangyang Chen is a PhD student in computer science at Brandeis University. Her research focuses on natural language processing, with an emphasis on speech technology, NLP for low-resource languages, and meaning representation. 
avatar for Caroline Mango

Caroline Mango

Archivist, GBH Archives
Caroline Mango (she/her) is an archivist originally from Brazil, currently residing in New York City. She holds a Master of Arts in Moving Image Preservation and Archiving and a Bachelor’s Degree in Film and Television Production from New York University. Her work includes ingestion... Read More →
Wednesday October 7, 2026 12:30pm - 1:00pm GMT-03
Grande Otelo Room

Sign up or log in to save this to your schedule, view media, leave feedback and see who's attending!

Share Modal

Share this link via

Or copy link