How to use AI in audio
Asbjoern Andersen


What does AI mean for us in sound? Many are understandably worried that it might take our jobs away, but in this Sound Opinions piece, sound designer and recordist Ana Monte argues that AI offers plenty of ways we can use it creatively - without making ourselves redundant:
By Ana Monte, with additional research by Wieland Müller


As a sound designer, I’ve been fascinated by the growing role AI is playing in the world of audio.
But here’s my 2 cents: AI is not here to replace us. It’s a powerful tool that, when placed in the right hands, unlocks new creative possibilities.

AI can assist in a variety of ways:

– Restoring damaged audio: AI can remove clicks, hums, and background noise automatically, saving you time on tedious manual fixes. (Examples of tools to explore: Accentize dxRevive, Supertone, iZotope RX)

– Generating sound sketches: AI can create sketches based on prompts, giving you a quick starting point for sound design. (Examples of tools to explore: Krotos Studio Pro, Sonic Alchemist)

– Smart EQ adjustments: AI can analyze your audio and suggest custom EQ settings, streamlining the mix process. (Examples of tools to explore: Soundtheory Gullfoss, Sonible smart:EQ 3, Oeksound Bloom)

– Editing audio through text: AI lets you edit audio by editing its transcript, making dialogue editing faster. (Examples of tools to explore: Descript, Spext)

– Finding similar sounds: AI can analyze your sound library and help you discover similar sounds quickly, saving time on browsing. (Examples of tools to explore: Sononym, Waves COSMOS Sample Finder)

– Isolating stems from a mix: AI can separate individual stems from a full mix, allowing you to extract elements for remixing or editing. (Examples of tools to explore: audioshake.ai, Audionamix ADX TRAX, LALAL.AI)

– Simulating reverb environments: AI can recreate realistic reverb spaces, allowing you to apply different acoustic environments to your audio without complex manual adjustments. (Examples of tools to explore: Accentize Chameleon, Zynaptiq Adaptiverb)

– Processing voice recordings: AI can process voice recordings, refining clarity, adjusting tone, and improving intelligibility in real time. It can also adapt to different voice profiles, creating personalized audio outputs. (Examples of tools to explore: Auphonic, SoundID VoiceAI, Supertone Shift)

But here’s the thing: AI doesn’t have the creative intuition or understanding of human emotion that we, as sound engineers and sound designers, bring to every project (editors have seen the same with writing). It’s our artistry, vision, and expertise that guide these tools to produce something special.

Let’s embrace these technologies, not fear them! This way we can focus on what really matters: creativity, storytelling, and emotional impact. If you have suggestions for AI audio tools that empower us, be sure to share them in the comments!

About Ana Monte:

Ana Monte, co-founder and sound designer at DELTA Soundworks, is a leading expert in spatial audio. She specialises in creating immersive content for Fulldome, XR, and Themed Attractions.

 

