Thursday, January 29, 2026

2025 at the Digitization Desk

 

 A Year in Review - Digitization Desk


We are delighted to announce our first ever end-of-year tally for the digitization desk. In sum, our volunteers digitized 414 videotapes constituting 410 hours of home movies and 9.1 terabytes of data.

That's a real cause for celebration for us! We are an entirely volunteer-run organization, without funding from any grants or large institutions. We crowdfund and we charge accessible prices for workshops ($20 - $100), screenings ($10), and digitization services ($5/hour of content).

With about 30 volunteers working at the digitization desk, we were able to accomplish so much. It's our first year that we are able to distribute surplus to our volunteers, who are shaping the worker-cooperative model that allows for sustainable service. 

The numbers don't even reflect how many times we've digitized tapes twice, three times, or more to get the best possible transfer. Nor do the reflect the hours baking, cleaning, and repairing damaged tapes to get tapes playable.

In addition to digitizing, we also trim, create lightweight MP4 files, and create digital preservation checksums using FFMPEG, an open source tool for making and changing video files using the command line. We catalog tapes in depth, created detailed spreadsheets with descriptive metadata (information written on the tape), technical metadata (information about the deck used, settings on the processing amplifier, and file size), and we robustly quality control each file. 



WHY DO WE DO THIS?

The level of care and labor we invest in home videos parallels the value they represent to communities, families, friends, and individuals. Home videos are living records of everyday life. The sound of someone's voice or their embodied mannerisms are so evocative, a simple error can interrupt the memory. Therefore, we want to make sure the record of that person's life is treated with care. That's why home video preservation is so important to us, and core to our work. 

Frankly, the consumer market for home video digitization is bleak! Primarily it's hugely expensive, often costing $30 / per tape. It is also quite factory style, with digitizations often running in the hundreds on cheap consumer VCRs in bulk. 

Literally a promo

I'm not necessarily disparaging scale, but I am suggesting that this costly yet large-scale model for digitization simply cannot address the unique issues each tape presents nor is capable of the level of care that small batch work enables. I mean how do you clean all the heads?? How do you deal with damage? Plus these consumer decks create a softer image than is inherent to the VHS. There is quality loss inherent in this setup. 

Compression artifact - WikipediaAdditionally there is profound quality loss when digitizing straight to MP4 with high compression rates and digital artifacting. Literally the detail is lost and digital artifacts are permanently introduced when they don't have to be! 

I have also heard that some companies functionally hold your digital files hostage. The "Netflix" model where they offer streaming on your smart TV means that you have to keep paying for a subscription model to access the digitized videos each year. Frankly unethical. One of our volunteers even downloaded the files off the streamer from his MIL's subscription, making sure she had permanent access to them. 


Do I dream of a TAPE with several parallel digitization stations? Yess! But it will still be under the same rigorous ideas of quality and care we maintain today. 

I also fundamentally am interested in the development and access to preservation knowledge for the public. I recognize the knowledge barriers inherent in building and maintaining a digitization station. But I also believe that these are entirely surmountable knowledge barriers. With support and access to greater documentation, so much more is possible. T.A.P.E. founder Jess often talks about this beautiful vision they have of sitting in traffic and imagining how each car represents a person and story, with videotapes sitting at home. The scale of that could be terrifying. And for many institutional archives, the scale of digitizing videotape is simply too large. But with technical training for the public and a broader access to equipment, people can build digitization racks everywhere, all over the city to help support the preservation of moving and emotional memories. 


PROJECTS OF NOTE

One of the most exciting and honoring projects we've been a part of has been our work with EZTV, Elizabeth Purchell, and Hollywood Entertainment. In March 2025, we were partners on 40 Years of EZTV, a retrospective of the remarkably dynamic video gallery, collective, DIY production space, home to performance art, video, and emergent digital technologies. EZTV has been at the cutting edge of media technologies and the fostering of an artist scene. 



Read more about EZTV at https://eztvmuseum.com/

I would also recommend https://eztv-beginnings.com/ and a great article  "Out to See Video": EZTV' Queer Microcinema in West Hollywood.

Over the past year, we've taken baby steps to assess and digitize tapes covering the broad universe of EZTV - from the camera tapes of various video makers to the exhibition tapes from the video gallery. Our first goal is to catalog existing tapes and identify unique and master material. From there, we are digitizing select materials and remastering what features we can from original camera tapes. 

A great example is the work Elizabeth Purchell is doing to exhibit Highway Hypnosis, and incredible SOV feature from Ken Camp. Ken, who held onto the original camera tapes, dug the original elements up for Liz. We've been able to digitize the original camera tapes as well as edited U-matic master copies to begin the larger remastering process. The improvement from the existing VHS dubs are astronomical, demonstrating the profound beauty and richness of videotape. 



In meeting Michael J. Masucci and the incredibly kind and talented video makers from EZTV, as well as numerous visits to the ONE Gay & Lesbian National Archives to see the paper collections, we've been able to see the innovation, care, and community at the heart of this expansive space. So much more to come.

We've also been so lucky to meet Jeaneen Lund, a photographer and filmmaker, and begin digitizing her collection of tapes. Jeaneen's mother, Sharon Lund, is an HIV/AIDS advocate, especially for women with HIV, fighting for care and against stigma. Jeaneen and her mother travelled around the world advocating for safe sex, support, and education around HIV/AIDS prevention. These tapes represent numerous lectures, television appearances, and panel conversations of Jeaneen and her mother. 

Jeaneen is making a film about her mother and generously shared a preview of her work. You can watch more and hear about their story here. https://vimeo.com/1152959501



EDUCATION & WORKSHOPS

In 2025, we quite aggressively ramped up our AV Preservation Workshop. I'm so glad Liz & Alex pushed me to do a Video Digitization Workshop as part of the EZTV Series. It has been one of the most rewarding and packed of all. 


We've also offered Audio Cassette Digitization, Audio Deck Maintenance, VCR Maintenance, Digital Preservation (FFmpeg, Intro to Digital Preservation) & Digital Autonomy (like Get Off Of iCloud & Message Mori), and Data Moshing Workshops! 

We've been so lucky to even offer these for free at the public library - Los Angeles Public Library's Central Branch. A literal literal dream come true for me to lug our digitization deck, computer, and analog to digital converter to set up for demonstration at the library.  


For volunteers, we also held our first ever DECK-A-THON! We repaired and cleaned close to 10 decks in one night!





FRAME SYNC/TBC INSTALL & REPAIR

Thanks to the incredible fundraising support we received last year (EOY 2024) we were able to make a big purchase that immediately upgraded our digitization capabilities. We purchased a Leitch 575 Frame Sync. We spent $1300 on the device, with a built in warranty.  

S/O to Sean McGuirk for this incredible promo for our fundraiser

We like to name our equipment after our favorite video artists, so we named the Frame Sync after Shigeko Kubota. Kubota was a prominent video and performance artist, whose expressive and political work with portable video was a revolutionary intervention into the formalist art world. She was also the first video curator at Anthology Film Archives, giving legitimacy and opening avenues for exhibition of video. 

This device can fix a number of issues like the frame falling out of sync, flashing / dropping frames, skew, and other stabilization issues. You can read more about a TBC in our blog on the topic.

However, haffway through the year we began noticing a cross hatching pattern in the image. There are two issues visible here. If you look closely, you can see a cross-hatching pattern visible in the image. 



If you look closely, you can see a cross-hatching pattern visible in the image. 

The pulsing bars running horizontally and vertically is the result of lack of black burst sync in the JVC VCR. By running the black burst generator directly into the deck, we fixed this issue. 

As the component continued to degrade, the issue got worse and worse. We tried to push through, but eventually I called it and pulled all the files that had this issue for re-digitization after the TBC was repaired. 


Additionally, the screen would work and then go black. And it would stop processing the black levels, so it would be crushing and the adjustments wouldn't work. 




When we bought the Leitch 575 we bought a warranty for an extra fee that was great. Southern Advantage was totally lovely to work with on this repair. 

While this put our digitization setup out of order for a few weeks, we notified patrons about this update and were able to get orders done for everyone as soon as the frame sync was returned.


PANASONIC REPAIR

Shortly after the return of the Frame Sync, I began noticing an issue with our main digitizing deck. For Standard Play tapes, we primarily use a Panasonic AG-7750, donated to us from Whammy! Analog Media. This thing is a workhorse! It has super high quality heads, an internal time base corrector, manual tracking, and so much more. One of the things I love most about this deck is the crispness of the image. I feel like this deck simply breathes life into videotape.

During transfers I began noticing the cross hatching reminiscent of the issues with the Frame Sync. Uh oh!

Well, the good news I could pretty much identify the issue immediately. When I spoke with the folks at Southern Advantage they said that a component went bad on the board. Well, crosshatching then = bad component.

The good news also was that the issue wasn't particular to the TBC that is built into this deck. The issue was present even with the TBC off (so none of those boards). It was visible when just running a composite signal passively through the machine via another deck (so not the video heads). That made the problem much much easier to locate.

Luckily, this model does have a free publicly accessible manual. I used it a bit to find the boards, but I also just broke the thing open. Adjacent to the chassis there are boards for the TBC and other parts, so I inspected each of those. I was looking specifically for any sign of burning. They all looked clear and I have all of them a quick dusting too. 

I unscrewed the bottom of the deck and found two boards affixed to the bottom. The Video PB board ( or the video playback board) - now that started to make sense. The issue was specific to playback.


Lo and behold - a burnt component! There was also this weird gooey strands of plastic almost like spider webs all over the board. I cleaned those off. 

See the brown bubbles! That is burnt!

Rather than try to find this obsolete part (although I did track down the part number and try). I thought it best to simply try to find a matching cheap replacement deck. I went on Ebay and found an exact copy with an unknown condition for only about $150. It was a bit of a risk that the board would be missing or broken, but at the very least I would have another deck that could be working or be used for other parts and boards. 

While the heads looked trashed and the motors were all disconnected in the chassis (not necessarily unfixable issues), the boards were in great condition. I simply replaced the Video PB board and reinstalled the old into the parts deck (just in case we need a single component from that board). Now we have back up TBC boards & other parts! Super exciting. 



EXPERIMENTS IN BAKING

This year we placed a greater emphasis on baking tapes, to significant results. Baking is process to remediate "sticky shed syndrome" or "binder hydrolosis." In this form of deterioration, the glue that holds the magnetic particles to the plastic tape swells upon exposure to moisture, causing the magnetic particles to shed from the surface of the plastic tape. 

Read more about sticky shed: 

Baking is a remediation process most commonly used with Umatic (3/4") video tapes of particular brands (Ampex) or polyester black reel-to-reel 1/4" audio tape. However, it can impact any black (polyester) tape, be it data, audio, or video. "Baking" is really dehydration, performed (with equal results) in a consumer dehydration or scientific oven.

Depending on climate the temperature can range from 55°C (131°F) all the way up to 70°C (160°F) in highly humid areas like Texas (s/o TAMI!) Minimum bake time is 12 hours, but standards today are 3 days, but I've seen it go for a week or longer. As tapes age and environmental conditions worsen, bake temperatures and times have increased steadily, with some believing there is a fast approaching timeline in which tapes will be unplayable. 

VHS tapes are highly understudied in the world of archival baking because 1) many collections attached to institutions work predominately with broadcast standard video (Umatic, BetacamSP, etc.) not consumer formats 2) VHS has posed significantly fewer shedding risks than Umatic. Today, Ampex brand Umatic tapes almost exclusively have to be baked. 

Sticky shed can be identified in the following ways:

1. Visible tape shed inside the cassette shell or in the VCR during playback (not ideal)

2. Tape refuses to advance, or does so slowly (stuck to itself) either manually or in the VCR (such as the timecode running in stops and starts, squealing noise)

3. During playback tape shows significant stabilization issues despite running through a line TBC and frame sync

It was that third category we really began researching, as we hoped that baking could be a solution to tapes that seemingly wouldn't play. With assistance from some of our friends with access to professional digitization labs, we began baking more tapes that had significant playback issues. 



Despite the physical damage to the tape : seen below, the image post bake was strong! In short - tape is more robust than we give it credit for. 



The results are clear: the capture on the left is the pre-bake, capture on the right is post-bake. We also adjusted settings on the Frame Sync to only address line level issues (TBC mode). 



Here's another great before and after demonstrating improved playback post bake. 

While not all issues were fixed, such as physical damage to the tape that produced errors present in both, we were able to get the tape looking significantly better. The left is functionally unwatchable, while the right captures the precious family memory as best as possible. 

We also collaborated with UCLA Library's Audio Visual Preservation Department to produce a zine introducing the public to baking concepts! You can read or print it out yourself via the Internet Archive!




CLEANING TAPES & CLEANING HEADS

 Head clog is a pervasive issue in digitizing tapes and can be hard to identify, especially in comparison to damaged tapes or head clog that is "baked into" the tape itself from a poor dub. 


Above is a clear example of head clog - you can see the bands of sync loss in the upper part of the image, which progressively turns into bands of static, with total static covering the image. 

If the static disappears quickly and doesn't return, likely it was damage to the tape or something baked into the transfer. But if the static persists, head clog can be diagnosed. From there, the transfer would be stopped, the tape taken out, the tape cleaned, and the deck cleaned too. 

Inside the drum are these small heads that read and write information. They are characterized by two iron bars with a gap between then, surrounded by coiled copper wires. The two iron bars/coiled wires setups create an electromagnet that can circulate magnetic fluxes. 




Blank video tape is made up of magnetic particles with a randomized polarity. When those randomized particles in front of a magnetic flux, they close the field, as it creates the path of least resistance (in contrast to air). Therefore, the magnetic particles get fixed into a pattern of polarity. 



When the tape is read back, those polarity patterns induce changes in the magnetic field, which can be read by the heads and decoded inside the machine's circuitry. 



Head Clog occurs when dirt, debris, or magnetic deposits shed from the tape into the very fine heads inside the video drum. That gunk impedes the heads' ability to read magnetic flux, weakening the signal, and leading to signal drop outs and information loss.


This problem can be quite easily fixed through a manual cleaning of the video heads using 99% Isopropyl alcohol and either blank white strips of paper OR TEK wipes. (nothing that is fibrous or foam as that will rip the delicate copper wires - if you feel resistance stop!)

While this is standard practice for us to not only regularly clean heads but to stop when we suspect head clog, cleaning tapes before playback is a more recent introduction. 

We had persistent issues with head clog for EP (Extended Play) tapes. The reason why is that Extended Play tapes move more slowly in the deck, giving more time for particles to shed and the slowness creates a rough texture against which particles grind. Additionally, the magnetic information is denser on the tape, meaning a smaller head clog could block the entirety of the image. 

Some professional labs have RTI tape cleaners, which use a specialized diamond tip and pellons to gently scrape microns off the surface of the tape. While fabulous - those cost thousands of dollars. 

One method of cleaning tapes has been to "Dry Clean" - remove shedding particles and dirt by playing a tape in the VCR all the way through before digitization. For this particular tape, I had to clean it in the VCR 3 times to get the image stable enough for playback. The left is the first digitization attempt. The top is the final product after several rounds of cleaning. The right is the final product after several rounds of cleaning.





CLEANING MOLDY TAPES

Another method is a wet clean, using 99% Isopropyl alcohol and a tape cleaner, which is needed for things like mold, but can also be useful for dirty tapes. 

Mold can also be brown!

As we have talked about in previous blogs, we use two different mold cleaners, which are essentially just motorized transport systems with pellons attached, like the RTI system. I haven't seen too much resulting damage to the image as the result of mold. 

One cleaner we use is the VHS Cleaner from 4Mob (shipped from Brazil) -  this one has a lot of different adapters for tape formats and I like the double pellon 




You can also DIY your own tape cleaner using a tape rewinder like Whammy! did last year. 

This year we really had a lot of moldy tapes come our way, so we borrowed Whammy' VHS is Life cleaner to do mold cleaning on dozens of tapes.


One critique I have of this device is the tape path is a little short. The alcohol should dry completely before the tape is wound onto the other spool, or it can cause sticking or warping. My solution is to do this cleaning outside in the bright sunlight so it dries quickly, but also that's good for the tape to be slightly warmed and my health to clean outside!





TRACKING

Another tool I began using more was the manual tracking on the Panasonic Pro Deck. I used this most for tapes for Liz Purchell from the EZTV collection, specifically the camera tapes for the Ken Camp feature films. Because these tapes were so old and had so many tapings over content, they often needed manual tracking to remove visual errors. In this example I introduce tracking errors by increasing and decreasing the tension excessively. 


A combination of good manual tracking and a time base corrector can also cause issue like skew, where the top portion of the video is flagging / being pulled. 



ONGOING MYSTERIES

The biggest ongoing mystery are these VHS-C tapes that were recorded in EP mode. During playback I couldn't get any image, but the sound was great. This is because the audio and video heads are separated and two different methods of encoding (linear versus helical respectively). To even get an image I had to run the entire signal chain in reverse! Going from the camcorder & deck with adapter (I tried both) into the Frame Sync and the into the Pansonic Pro Deck to use its line TBC then out to the analog to digital converter. Even then the image was so badly out of sync. 



I have never seen anything like this and still haven't been able to figure it out!

Thank you to everyone who has supported this work. We do it because of the generosity of everyday people who make this possible!!!

T.A.P.E. is a 501(c) 3 non-profit dedicated to facilitating access to analog media making, preservation, and exhibition. To support our work and access great benefits, join our patreon at just $5/month. You'll get access to exclusive rates for our rental equipment library, access to our digital and physical videotape library, and other member benefits like free workshops. 

If you'd like to help make essential digitization equipment purchases, you can donate directly our Go Fund Me (Ongoing!) to provide more archival transfer services for more tape formats. A donation will advance the work of people-oriented digitization services!

info@tapeanalog.org

Blog is written by Jackie Forsyte, T.A.P.E.'s Technical Director, and an audio-visual archivist. 

Wednesday, October 22, 2025

T.A.P.E. at No Time to Wait 9 - Part 1

T.A.P.E.'s Technical Director Jackie Forsyte got the opportunity to travel to Dublin on scholarship to attend 🅽🅾 🆃🅸🅼🅴 🆃🅾 🆆🅰🅸🆃 - a free conference on open standards, open source tools, and digital audiovisual preservation. I was funded by Pratt Institute's DPOE-N Professional Development Stipend, who generously granted me funds to attend a digital preservation conference of my choice, special thanks to Kirk Mudle of that organization! 

No Time to Wait is in its 9th year, focusing on showcasing and developing open source digital preservation standards and tools. 

𝒲𝒽𝒶𝓉'𝓈 𝑜𝓅𝑒𝓃 𝓈𝑜𝓊𝓇𝒸𝑒? Open-source is a philosophy for making, using, and documenting tools and workflows that are public-facing, available for the public to at minimum inspect, but also often reuse, deconstruct, and remix. This philosophy is in start contrast to many computing universes which "black-box" their software, making it impossible to understand how it is made and thus impossible to use outside of a licensed, supported, and often paid for environments controlled by a corporation. Open source is hand-in-hand with digital preservation as we are often fighting for autonomy over the stability of our files, the ability to emulate digital environments, and need as preservationists to get under the hood of the code making up our environment. You probably have a story about locked, lost, corrupted, or proprietary files, which can be destabilizing and painful. 

No Time to Wait intervenes in these increasingly proprietary and closed box digital environments, advocacating for policy action, tools, and workflows rooted in such a philosophy.

I've been highly interested in open source tools for personal digital preservation and the courses I teach with TAPE. Seeing the enthusiastic response to courses like Get Off Of iCloud, Preserving Texts & Voicemails, Intro to Digital Preservation, Hard Drive Office Hours .... affirm that there is a desperate need for autonomy over our personal media. A real frustration is bubbling, that demands a collective channeling methods, tools, and learning to facilitate greater control over our personal digital universes. Questions like:

  • How do I get my files out of closed sandboxes like Apple's iCloud?
  • What file formats am I even dealing with on my hard drives and mobile devices? Will they last? 
  • How do I care for the hard drives and storage devices I've accumulated? 
  • And most frequently, how do I manage the mountain of personal files that feels like it's teetering over? 

TAPE has taught Digital Preservation workshops for the past year at Whammy! Analog Media and increasingly at Los Angeles Public Library's Central Branch, teaching the public how to use their command line to do things like mass download text messages, batch rename files, or generate checksums to fingerprint files for long term storage. A lot of the first workshops came out of a conference I attended in October 2024 called Digital POWRR (Preserving digital Objects With Restricted Resources). You can read those blogs here! 

Sharing knowledge is at the core of my personal ethos and the Conference, so I've outlined my notes and some exciting takeaways that you should expect to see in upcoming workshops! 

🅵🅸🆁🅴 🆂🆃🅰🆃🅸🅾🅽 🅰🆁🆃🅸🆂🆃🆂 🆂🆃🆄🅳🅸🅾🆂


Before arriving in Dublin, I reached out on Instagram to get in touch with people doing experimental film and preservation locally. Through the magical international network of Erica Sheu (our favorite experimental filmmaker) I got in contact with Ruairí, who lives in Belfast and is a musician, artist, and researcher on Independent Irish films. Ruairí edits Ultra Dogma, a film journal and virtual cinematheque, check out their writing. Ruairí put me in contact with Helena, who is the Digital Media Manager at the Fire Station Artists Studios. 





A 33-year old live-in artist residence and teaching space inside a 19th century Fire Station, the space is a hub for arts-making outside of the university system, serving as a workshop, studio, and exhibition space. Helena was kind enough to show me around the space and talk about efforts to build a motion picture processing darkroom, which will become the only space in Ireland for artists to process their own films. The lack of national labs - professional or amateur - has created a real gap in what artists can do at home, often requiring travel to other EU countries. It was so exciting to talk with Helena, who has a really infectious joy for filmmaking and the fostering of an experimental culture locally. We got really excited talking about the preservation of the Fire Station's own videotape archive, and instantly clicked over sharing resources to set up an in-house digitization station. Ireland has a rich history of alternative filmmaking and videomaking, and I was excited to learn about a number of collectives who were making work like the Derry Film & Video Workshop. 


Experimental filmmakers have one of the most incredible networks of open source knowledge. They are a real model for preservationists to think about knowledge sharing and resource sharing. And most importantly, about creating work and resources that are outward facing, assisting the broader public in the creation of their own work!

I also took the train out to Howth and visited the Ye Olde Hurdy Gurdy Museum of Vintage Radio which houses unique radio equipment and ephemera inside of a Napoleonic Wars defense tower on a cliff! My favorite item was a radio hidden inside a portrait of Rita Hayworth created by the French underground during Nazi occupation. 



Conference Day 1 python, rawcooked, and floppys!

🅴🆇🆃🆁🅰🅲🆃🅸🅽🅶 🅵🅸🅻🅴 🅼🅴🆃🅰🅳🅰🆃🅰 🆆🅸🆃🅷 🅿🆈🆃🅷🅾🅽

The first workshop I attended was from Joanna White who is an in-house developer for the British Film Archive, Extracting file metadata using Python, MediaInfo and other open-source tools. You can read her blog about it here!

White is building and building upon open source tools for managing the BFI's massive collection of moving image materials - I mean how cool that a moving image archive has an in-house developer -  in stark contrast to how most archives rely on IT departments that lock us into proprietary and ready-made software.

The tool Joanna built allows archivists to search across the entire database by very specific metadata fields. 𝒲𝒽𝒶𝓉'𝓈 𝓂𝑒𝓉𝒶𝒹𝒶𝓉𝒶? It's data about the data - a tautology I know - it's the technical information that surrounds the audio visual information. Technical information like how long is the video, what file format is it, what frame rate... is now searchable as fields across the BFI's internal catalog. This is a technical feat! 

Joanna walked us through a snippet of how the code works. Python is the language of the code and it calls upon Media Info software to do some heavy lifting of the file analysis. 

MediaInfo is a project by the people behind the No Time to Wait Conference. It is a free, open-source tool that analyzes and reads out metadata in a human readable way. At TAPE I use Media Info allllll the time for simple tasks like how many Gigabytes is a file to investigate more complicated problems like why won't my file open in Quicktime (hint, it's always Apple's fault). MediaInfo can point me to embedded information about video files that are obscured to the user because your computer assumes you'll never need it (hint, you do!) 

What Joanna's script does is point the computer to a universe of instructions - giving it access to a library of how MediaInfo analyzes, sorts, and presents technical media, then telling the computer to grab specific fields of information from the file (like file format, duration, number of audio channels), and then presents that metadata in a human-readable CSV (comma separated value) file that can be imported into a spreadsheet or even a database with ease! 

This script would be especially helpful for someone who is cataloging and "ingesting" digital files into an archive. You have no idea how the file was made and so that technical information is critical to figuring out how to sustainably care for and potentially migrate the file onto newer hardware or even more sustainable file formats. For TAPE, we aren't quite dealing with digital files in this way, we are creating digital files to give to people.

🆁🅰🆆🅲🅾🅾🅺🅴🅳 🅰🅽🅳 🅾🅿🆃🅸🅼🅸🆉🅸🅽🅶 🆂🆃🅾🆁🅰🅶🅴 🅵🅾🆁 🅵🅸🅻🅼 🆂🅲🅰🅽🆂

Jérôme Martinez is the founder of MediaArea, one of the backers of NTTW and an open source software company with a suite of tools for analyzing digital media. He lead the conversation about Optimizing Storage for DPX film scans using an open source program he worked to develop called RawCooked

DPX is an image sequence format used for film scans. In the same way a gif is a string of still images that are in sequence, DPX are a string of uncompressed images that have metadata that relates them together so they can be transformed into a moving image. Particular to film scanners, DPX are extremely large files - from my experience in scanning an ~2 hour feature film scanned at 4K Resolution (4,000 pixels running horizontally across the active image area) is usually about 10 TB of data. As fetishists increasingly become obsessed with 6K , 8K, 12K and other absurd resolutions, storage becomes a definite crisis. For context, the UCLA Film & Television Archive's Digital Lab outputs about 50TB a week of digitized film & video, with holdings of about 9 Petabytes of unique data. 

In moving image preservation, we often rely on LTO (Linear Tape Open) storage to address this high demand for storage. LTO is more "shelf stable" data tape, i.e. 0s and 1s encoded on magnetic tape - yes ! most long term storage still relies on tape! There are lots of wonderful things about LTO and a lot of annoying things, but the format itself is reliable and LTO 8 can support 10TB of storage per tape. LTO will come up again in this blog.

RawCooked is an open source program to transform DPX sequences into a preservation stable, open source file format - Matroska container & FFV1 codec. Patrons of tape will know that we digitize all home movies to Matroska FFV1 as it is a preferred format of a number of leading archives such as the British Film Institute and Library of Congress. Matroska is an open source container, meaning it isn't owned by a particular company but rather maintained by a large international community. FFV1 is similarly open source, and is a codec that relies on lossless compression, like a zip file it reduces file size without reducing quality by re-encoding data with high-efficiency standards. 

To explain container and codec I use the analogy of a book. The content of the book is like the video and audio streams. A container is like the front cover, back cover, and table of contents. It contains the content and tells you what the book is and where to find the content within it. The codec is like the language of the book, it tells you information about how to read the book. You must be able to read the language to understand the content, in the same way the computer needs to be able to be able to interpret the codec to decode the content. 

Rawcooked is a remarkable tool for preservation. It transforms DPX sequences into Matroska FFV1 files, reducing file size by 2/3, i.e. reducing the number of LTO tapes you need to buy. And also reducing the number of tapes you need to migrate each time an older generation becomes obsolete. Later in the conference Joanna White from the BFI revealed that Rawcooked saved the BFI ---------- in just 6 months from storage costs.

And critically, the Rawcooked function is fully reversible. You can reverse the process of transformation from DPX to MKV FFV1 back to DPX and your file will mathematically be the exact same. This is the same as a ZIP file, when you unzip your file is identical to pre-zipping. How do we know it is identical? Using checksums! 

𝒞𝒽𝑒𝒸𝓀𝓈𝓊𝓂𝓈 are a mathematical fingerprint applied to a file. You run an algorithm over a file and it generates a string of numbers to represent the file as is. When you run the algorithm over the file again, you check to see if there is fixity --- If the string of numbers changes, the file has changed. If it is the same, the file is the same. 

FFV1 and Matroska are quite good at handling checksums and managing corruption. There is a checksum embedded in the file by slice (more on that later) but it effectively means that many many portions of a single frame are assigned a checksum, which is detectable. Therefore, a corrupt slice can be replaced by a complete frame and does not corrupt the entire frame or the entire file. Therefore, FFV1 may be safer from corruption than even DPX which will corrupt on the frame level, and therefore that frame is gone. Just like if you scratched a frame up on a physical film, that frame is gone.

Rawcooked has two problems that are in active development. The first:

Rawcooked currently supports DPX "flavors" for scanners that conform to particular standards. The scanners that I personally use - the Lasergraphics machine - and the ones that organizations like Origins Archival use (the Film Fabriek) use two additional "flavors" of DPX file not currently supported by RawCooked. The RGGB color space and Bayer Pattern. This a particular philosophy and method of film scanner's sensors that use Red, Green, and Blue filter layers to filter incoming light to active elements that are sensitive to each of those colors. This is a mimicking of the human eye and developed by Kodak, thus closely matching the color technology of their color film that has red, green, and blue separated dye layers. To get RawCooked to support this DPX flavor, it is simply money for developing so that the program can understand and encode such flavors.


The other problem is speed. What Rawcooked does in saving money, it does consume in time. We are in the midst of a liminal period in computing power history in between the CPU and GPU support for various video codecs.


If you're a gamer, this isn't news to you. A CPU handles the general parts of the computer, its applications, its web searches, its's operating system, all the things the computer is really good at doing efficiently. It has a small number of cores that are little engines that work in sequence. A GPU is really good at things like exporting video, rendering video, or rendering complex images (like video games) because it has a lot of cores (engines) that work at the same time (parallel processing). 

Codecs as I talked about above are like the "language" of a video file, requiring translation to encode and decode by a computer. Like anything else we have to tell the computer how to do that. Right now most computers have been given instructions to do video functions on their GPU only for certain codecs. Whereas CPUs have instructions for more codecs. But CPUs are slower and so for open source codecs like FFV1, the computer takes much longer than if it were a codec like ProRes which can use the GPU. 

I wonder why! It's almost like Apple owns ProRes and thus supports faster encoding for its proprietary codecs ??? 

Therefore, right now programs like FFMPEG and codecs like FFV1 have only been given permission to run on most CPUs, and don't have access to GPUs. Even though they are generally pretty lightweight in the grand scheme of things, the CPU thing slows them down significantly. Even when running Rawcooked on 6K files, the CPU will not exceed 3 mb of usage, so it can take 8 hours to process 1 hour of DPX. But that's changing - see more later in this blog. 

One really exciting thing is I feel like I finally got a straight answer on slice count for FFV1. If you've been trained by me at the desk, I frankly sort of shrug my shoulders and say no one smarter than me has explained this in a way I understand so IDK ! In FFV1, specifically with vrecord too, you can select the slice count. But someone smarter than me finally explained it!


Slices are the number of times you divide an image during compression. “Multi-thread” or “parallel processing” is a way to understand it. Your CPU can use multiple cores to process more than one part of a single frame at a time, increasing processing speed. DV (Digital Video) tape works similarly, creating blocks of the image that are processed in parallel per frame rather than trying to process the entire frame as one big image. Speeds it up and reduces file size. 

Fewer slices = fewer segments inside the frame. Easier for compression. Faster but a little bit clunkier.

More slices = more segments inside the frame. Harder for compression. Slower but more precise. 

For 6K overscans, raw cooked automatically indexes at 1024 slices. Probably overkill, but it does preserve the rich nuances between slices. 

🅵🅻🅾🅿🅿🆈 🅳🅸🆂🅺🆂

This was probably one of my favorite parts of the conference. I’m literally so ready to set up a floppy disk transfer station for TAPE. 

Leontien Talboom works for Cambridge University Libraries overseeing the transfer station. She had a floppy disk sweater and a PCB board dress at the conference it was so sick.

We first walked through identifying the major types of floppy disk starting with the earliest models called “Stiffies” (not floppy at all) and the different sizes 3, 3.5,
5.25, and 8” sizes.
 

The interior construction of a floppy disk is made up of a disk covered in iron-xoide, housed inside of a paper or plastic shell, sometimes with a spring and thin metal shutter. 

Information is encoded onto the disk in two ways. First, there are sectors or pie slice areas that are separated from other sectors. Within each sector there are tracks that contain the written information. 


There are also sides to disks, where sometimes the disk is formatted to be single or double sided. In addition to size there are three density modes for disks, which represent how frequently each track is encoded - 20 track (double density) 40 track, 80 track (high density). Much like videotape formats, these density types are specific to the players, but are backwards compatible, meaning high density drives will play 80, 40, and 20 track disks, but 20 track drives will only play 20 track disks. You can typically identify the type of track configuration by the number and location of notches, with double holes indicating high density.

Of course, there are lots of weird formats and propriety formats that require specific drives like Mac formatted disks. And there are event things called flippy disks where people would DIY cut second notches into floppy disks to trick the drive into writing to the disk in high density. This can be particularly tricky to read back because it is formatted to write information in previously blank areas. Plus if the disk is actually single sided, the other side will be encoded backwards.

If you’re counting there are three main compatibility issues with floppy disks - the drive must be able to support 

a) the size of the disk

b) density / number of tracks

c) single or double sided disks

In addition to any number of proprietary, early, DIY, or non-standard formats. Floppys, flippys, and stiffies! But newer drives, high density, are more likely to support the greatest number of disks. 

Once you find a drive that can support the floppy disks of interest it’s time to clean it. It likely has never been cleaned so be sure to open the drive and give it a deep dusting with compressed air. You a clean the reader heads with isopropyl alcohol, as any dust on the heads will scratch the disk.

The drive usually goes bad with the rubber belt that powers the motor - if the drive won't move that belt needs replacement. If the drive is squealing, it needs lubrication. 

You may also need to clean the disk itself. Dirt or mold on a floppy disk causes the drive to squeal, because the head is tracking the contaminant across the disk and scratching it up. You can clean it with alcohol and a non-abrasive wipe like PEC pad. 

Then you’ll need a ribbon that connects to your drive. Each drive has specific number of pins, usually 34 or 26 pins which are easy to find on eBay. You’ll need power for the drive, which can be a portable unit, usually 5 or 12V, but 8” floppy drives have 240V. 

The most important tool is a floppy controller, which allows the older drive to communicate with a modern computer. There are bunch of different models, usually built by the retro gaming community such as the Supercard Pro or Cryoflux, but Leontien recommended the Grease Weazle because it has a lot of support for various formats, an active development team, and an active community supporting and providing documentation. The Grease Weazle connects to the computer via USB. 


There are two ways to read a floppy disk - flux steam and logical image. A flux stream is a capture of the fluctuations in the magnetic field on the disk. The logical image is a transformation of that fluctuations into human readable files. The flux stream is necessary to figure out how the disk is encoded and to capture an accurate full image of the disk. The logical image is based on the correct assessment of the disk's formatting from the flux stream so we can get the files off of it. 

Leontien then demo'd the Grease Weazle and terminal commands needed to image the disks. Opening the terminal, she ran the command to read the flux stream in the .scp format. The command is:




gw read Workshop/test.scp --drive=B

which translates to:

GreaseWeazle Read Drive B and write it to Workshop folder and the filename test.scp

One challenge she mentioned is figuring out how the drive sees itself. In the command you have to specify the location of the drive you want to access. Each drive sees itself as a different input and she recommended trying them in this order 

--Drive=A / --Drive=B / --Drive=0 / --Drive=1

Then opening HVC Floppy Emulator allows us to examine the flux streams. From there, you can determine the number of tracks and even tell the logical format read to skip over empty tracks. This is particularly useful when reading a disk that has fewer tracks than the drive can read (i.e. a 20 track in an 80 track drive). You can tell it to skip empty tracks otherwise your image turns up a bunch of red for empty (unreadable) tracks.

All that red is empty information from not specifying the number of tracks



You can also tell it the formatting of the drive - in this example it was an IBM 360 format. How do we know? We have to match the size, track number, and other technical information to the logical format using this handy wikipedia page!

Because the Grease Weazle was designed for retro gaming, it allows the ability to write over disks, which you wouldn't want for preservation. It doesn't respect write blockers (i.e. stickers covering the read write notch). Plus stickers tend to get stuck in the drive so they aren't particularly useful. Instead, you have to explicitly tell the GW to write in the command line, which you would never do because you're always going to give it the gw read command.

I am so excited to get starting building a lab for TAPE! Leontien, as part of her grant funded project is writing a guide for the Digital Preservation Coalition, so there will a comprehensive guide on floppy disks that we can all reference. This is particularly exciting because much of the information is scattered across forums and there is no centralized place to hold it. 

🅸🆁🅸🆂🅷 🅵🅸🅻🅼 🅰🆁🅲🅷🅸🆅🅴

After the talks, I snuck onto the booked up tour of the Irish Film Institute (IFI). I wasn’t going to miss it, sorry. I first learned about the IFI through their incredible python scripts - thanks to AV Preservation at UCLA Library’s Special Collections, I use the copyit.py script all the time! It moves files from one location to another, generating a file level checksum and then checking the checksum upon delivery to ensure that the files match in both locations. But there are dozens of tasks and full workflows the IFI uses, that I highly recommend.

The Archive is housed in a 17th century building in the heart of Dublin, right next to their cinema for the public. They have a full bookstore and DVD collection, a bar, a cafe, and a 3-screen cinema showing independent and repertory cinema. Their programming is closest to the Laemmele in Los Angeles. We walked through their workspaces, including their digitization lab for film which uses a lot of the same equipment I am familiar with (like Cintels (Blackmagic) and through their collections workroom which has 4 Steenbecks used for print traffic inspection. We also got to learn from their Special Collections department (anything but moving image) about their print and 3-d object collection. 


We talked a bit about the challenges with repairing Steenbecks, especially in Ireland. Steenbeck was bought 2 years ago and now there are no official technicians or parts associated with the company, only an aging group of engineers and repair people who want and need to pass on their knowledge. It’s a similar issue to VCRs and other video electronic equipment. 

I missed the afternoon talks and workshops unfortunately and I did skip the social dinner (sorry!). Instead I returned to the IFI after some dinner for a screening organized by the Archive of Nightshift (1981) dir. Robina Rose preceded by Meshes of the Afternoon and a conversation with the legendary Irish experimental filmmaker Pat Murphy. I met up with Ruairí who came down from Belfast for the screening and some other filmmakers and archivists at the IFI. It was a real gathering of the local scene of artists, filmmakers, programmers, and archivists! 

Nightshift played in LA earlier this year but I missed it, so was excited to see it and with the additional context provided by Murphy who was a peer of Rose. 


After the screening, I hung out with a group of people who I met and we had a Guinness outside the Archive. We talked a lot about the emergent microcinema scene in Dublin as many of them were the organizers behind Fanvid, a DIY film club showcasing artist-made work and experimental film in roving locations. I met Alexa who works at the IFI as an archivist I also met Frank who makes some really cool work with Ruairí around media censorship and building a Pirate Television archive particularly in Northern Ireland. Frank is part of the Repeater Collective which is a DIY collective for artists and musicians in Ireland. I am super excited to check out his film Few Can See and potentially bring some work to the Whammy! microcinema!

Gifs made using FFMPEG - Instructions sourced from