Embracing AI in Audio Editing: A Professional's Perspective

Introduction

As an audio editor with extensive experience not only in editing but also in producing and overseeing audio productions, I am intrigued by recent advances in AI-driven tools like Adobe's AI podcast editor. These innovations are remarkable, but they neither scare me nor diminish my role as an editor. Rather, they make me reflect on the multifaceted nature of audio editing and on how human intuition still plays an irreplaceable part in the process.

Breaking Down Audio Editing

Audio editing can be broken down into three essential components:

  1. The Technical Bits: This includes the actual splicing, retiming, and rearranging of audio segments.

  2. The High-Level Bits: Mastering, echo removal, and other intricate sound enhancements fall under this category.

  3. The Editorial Skill: This is the meta-skill that transcends the previous two and is often overlooked. It encompasses understanding the symbolic content, including capturing a guest's emotions or recognizing the importance of a seemingly trivial mistake.

The Rise of AI in Editing

Adobe's new AI tool and older systems like Descript have shown prowess in mastering, in removing background noise, and even in automatically identifying and removing crutch words and long silences. These tools make audio editing more accessible, and many audio editors, myself included, warmly welcome them.

However, the power of human intuition and creativity cannot be overstated. This is most evident in the third component: the editorial skill.

The Human Touch in Editing

The title "editor" is not merely a functional term; it encompasses a deep understanding of content. When a guest repeats themselves or takes a long pause before expressing something heartfelt, a human editor sees depth and meaning. To AI, these might be mere repetitions or silences to be removed.
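
To make the contrast concrete, here is a minimal Python sketch of a purely rule-based cleanup pass. It is a toy, not a description of how Adobe's or Descript's tools actually work: the Word type, the filler list, the 1.5-second threshold, and the sample transcript are all assumptions invented for illustration.

```python
from dataclasses import dataclass

# Hypothetical values, chosen only for illustration.
FILLERS = {"um", "uh"}
MAX_PAUSE_SECONDS = 1.5


@dataclass
class Word:
    text: str
    start: float  # seconds from the start of the recording
    end: float    # seconds from the start of the recording


def naive_cleanup(words):
    """Apply two fixed rules and return (kept_words, cut_pauses)."""
    kept = []
    cut_pauses = []
    previous_end = None
    for word in words:
        # Rule 1: flag every gap longer than the threshold for removal,
        # with no notion of what the pause might mean.
        if previous_end is not None and word.start - previous_end > MAX_PAUSE_SECONDS:
            cut_pauses.append((previous_end, word.start))
        previous_end = word.end
        # Rule 2: drop filler words by exact match.
        if word.text.lower().strip(".,!?") in FILLERS:
            continue
        kept.append(word)
    return kept, cut_pauses


# Invented sample: a guest hesitates before saying something heartfelt.
transcript = [
    Word("My", 0.0, 0.2),
    Word("father", 0.3, 0.7),
    Word("um", 0.8, 1.0),
    Word("taught", 4.2, 4.6),  # preceded by a 3.2-second pause
    Word("me", 4.7, 4.8),
    Word("everything.", 4.9, 5.6),
]

kept, cut_pauses = naive_cleanup(transcript)
print(" ".join(w.text for w in kept))
print(cut_pauses)
```

Under these fixed rules, the long pause before "taught" is flagged for removal exactly like dead air. Nothing in the rules can tell whether the silence carried meaning; that judgment is the editorial skill.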

A professional editor knows when a mistake carries symbolic content beyond being a simple error. AI, without human guidance, cannot recognize the significance of these moments. When you hire an editor, you ensure that the same type of being who will consume your content, a human, is involved in crafting it.

Mass-Produced or Meticulously Crafted?

Audio producers must reflect on their content's nature: is it akin to a mass-produced Ford or a meticulously crafted Ferrari? Is it a product to be assembled en masse by AI, or a unique piece of art guided by human hands?

These questions are not rhetorical. Both mass-produced and artisan products have their markets, and even luxury brands like Ferrari utilize mass-production elements.

Conclusion

In 2023, most people have experienced art created by both humans and machines, and they often know which they prefer, quirks and all. AI systems make mass assembly easier but represent an alternative to artisan crafts, not a replacement.

I think most people would do well to inform themselves about the limits of both AI and the human mind. To that end, reading "What Computers Still Can't Do" by Hubert Dreyfus and "Mind: A Brief Introduction" by John Searle could provide valuable insights.

P.S. Embracing AI's potential without losing sight of human creativity and intuition is key to enriching the audio editing landscape. Tools like Adobe's AI-driven podcast editor enhance the process, but they can never replace the discernment of symbolic meta-content that only a human editor can provide. In the ever-changing world of media production, understanding this balance ensures we create content that resonates with its human audience, one nuanced edit at a time.