...

Audio tools

Audio tools

Generate voices, music, and sound effects for your projects.

Magnific includes four audio tools for generating and transforming sound: a Voice Generator for text-to-speech voiceovers, a Voice Changer to transform existing recordings, a Sound Effects Generator for synced sound design, and a Music Generator for AI-composed tracks. This guide covers all four in one place.

In this article

Voice Generator

The Voice Generator turns any written text into a spoken voiceover. Write your script, choose a voice, select a model, and generate audio ready to download or use directly in a video project.

The script field is not a creative prompt. It is the exact text the voice will speak. Write it as you would a script: punctuation affects pauses and rhythm, so use it intentionally.

How to generate a voiceover

  1. Open the Voice Generator

    Go to magnific.ebundletools.com/ai/voice-generator, or find it in the Audio section of the AI Suite menu.

  2. Write your script

    Type or paste the text you want spoken. What you write is exactly what the voice will say.

  3. Choose a model

    Select ElevenLabs v2, ElevenLabs v3, or Gemini 2.5 Pro.

  4. Pick a voice

    Click the voice selector to open the voice library. Browse by category, search by name, or preview voices before selecting one.

  5. Adjust parameters

    Set speed, stability, and similarity boost if needed.

  6. Generate

    Click Generate. Preview the result and download or use it in a video project.

Voice parameters

ParameterRangeWhat it does
Speed0.7x to 1.2xSpeaking rate. Lower is slower, higher is faster.
Stability0 to 1Voice consistency. Lower is more expressive, higher is more stable.
Similarity Boost0 to 1How closely the output matches the selected voice.

Voice models

Three AI models are available for voiceover generation. Each has different strengths.

ModelProviderSpeedQualityBest for
ElevenLabs v2 TurboElevenLabsFastGoodQuick narration, batch processing
ElevenLabs v3ElevenLabsModerateHighFinal production, emotional narration
Gemini 2.5 ProGoogleModerateHighMulti-language content, conversational tone

ElevenLabs v2 is the fastest option. Use it when you need to generate many voiceovers quickly or iterate on scripts during production. It supports Speed, Stability, and Similarity Boost parameters. Script limit: 5,000 characters.

ElevenLabs v3 delivers more natural prosody and intonation than v2. Pacing, emphasis, and emotional tone feel closer to a human read. Use it for final output when quality matters most. Same parameters as v2. Script limit: 5,000 characters.

Gemini 2.5 Pro excels at multi-language content and conversational delivery. It supports Temperature, System Instruction, and Language selection. Script limit: 40,000 characters, ideal for long narration.

Voice library

The voice library contains hundreds of AI voices organized by category. You can browse, search, and preview any voice before using it.

Multi-speaker voiceovers

Generate dialogue between two speakers in a single generation. This is useful for conversations, interviews, podcast-style content, or any script with more than one voice.

  1. Open the Voice Generator

    Go to the Voice Generator.

  2. Enable Multi-speaker

    Toggle Multi-speaker mode on.

  3. Assign voices

    Select a different voice for Speaker 1 and Speaker 2.

  4. Write your script

    Use the speaker labels to indicate which lines belong to each speaker.

  5. Generate

    Click Generate. Both voices are rendered in a single audio file with natural turn-taking.

Voice Cloning

Voice Cloning lets you create a custom AI voice from audio samples of a real voice. Once cloned, the voice appears in My Voices and can be used in any audio tool.

  1. Open Voice Cloning

    In the Voice Generator, go to My Voices and click Create Voice.

  2. Upload audio samples

    Upload between 1 and 5 audio recordings of the voice you want to clone. For best results, use clear recordings with no background noise, up to 3 minutes total.

  3. Name and customize

    Give your voice a name and add any notes to help you identify it later.

  4. Create the voice

    Click Create voice. Processing happens in the background. When ready, the cloned voice appears in My Voices.

  5. Use it

    Select the cloned voice from My Voices in any audio tool and generate as normal.

You can create up to 100 custom voices per account. Voice Cloning supports 37+ languages. The cloned voice will adapt to whichever language you write your script in.

Voice Changer

The Voice Changer transforms an existing audio recording into a different voice while preserving the spoken content. Upload a recording, choose a target voice from the library, and the AI re-voices it. Useful for adapting recorded content to a different speaker, changing tone, or applying a cloned voice to existing audio.

  1. Open the Voice Changer

    Go to magnific.ebundletools.com/pikaso/tools/voice-changer, or find it in the Audio section of the AI Suite menu.

  2. Upload your audio

    Upload the recording you want to transform. Accepted formats: MP3, WAV, WebM, MP4, OGG.

  3. Choose a target voice

    Select the voice you want the audio converted to. You can use voices from the library or your own cloned voices from My Voices.

  4. Generate

    Click Generate. The output is delivered as an MP3 file at 44.1kHz, 128kbps.

Voice Changer costs 4 credits per second of audio. Make sure your recording is the final version before converting. Trimming audio beforehand helps keep credit usage efficient.

Sound Effects Generator

The Sound Effects Generator creates audio from short, descriptive text prompts. Use it for video sound design, game audio, UI sounds, or any project that needs custom sound effects.

  1. Open the Sound Effects Generator

    Go to magnific.ebundletools.com/ai/sound-effect-generator, or find it in the Audio section of the AI Suite menu.

  2. Write your prompt

    Describe the sound you want using short, clear language. Focus on what you hear, not why it happens. Example: metal door slams shut in an empty hallway.

  3. Set Prompt Influence

    Adjust the Prompt Influence slider. Higher values make the sound follow your prompt more closely. Lower values introduce more variation.

  4. Set duration

    Choose how long the sound effect should be. Match the duration to the type of sound. A door slam needs less time than ambient crowd noise.

  5. Generate

    Click Generate and preview the result. Adjust the prompt or slider and regenerate for variations.

Prompting tips

Keep prompts under one sentence. Use clear nouns and verbs: sound source + action + optional environment. Avoid storytelling. Crowd cheering in a stadium works better than the sound you might hear when a team scores a goal at a sports event.

Music Generator

The Music Generator composes original music tracks from text prompts. Think of the prompt as a producer brief: describe the genre, mood, instrumentation, tempo, and whether you want vocals or an instrumental track. Two models are available: ElevenLabs Music and Google Lyria.

How to generate music

  1. Open the Music Generator

    Go to magnific.ebundletools.com/pikaso/music, or find it in the Audio section of the AI Suite menu.

  2. Choose a model

    Select ElevenLabs Music or Google Lyria. ElevenLabs Music offers more precise control over structure. Google Lyria works well for ambient and mood-based tracks.

  3. Write your prompt

    Describe the track you want. Include genre, mood, instrumentation, tempo, and whether it should have vocals or be instrumental-only.

  4. Generate

    Click Generate. Preview the result and download it, or use it directly in your video project.

Music models

ModelProviderMax durationBest for
ElevenLabs MusicElevenLabsUp to 3 minutesProduction-ready tracks, precise control over BPM and key
Google LyriaGoogleUp to 30 secondsBackground music, ambient, mood-based and atmospheric tracks

ElevenLabs Music is best overall for production-ready music. It offers control over structure, BPM, key, and vocal intent. Tracks can be up to 3 minutes long.

Google Lyria excels at ambient, mood-based, and atmospheric tracks where feel matters more than technical structure. Max 30 seconds.

Quick guide: which model to choose

You needUse this
More than 30 secondsElevenLabs Music
Short ambient pieceGoogle Lyria
Control over BPM and keyElevenLabs Music
Quick generationGoogle Lyria
Soundtrack or background musicGoogle Lyria
Jingle, intro, or sound logoElevenLabs Music

Prompting tips for music

Structure your music prompts like a producer brief. Include as many of these elements as relevant:

ElementExamples
GenreLo-fi hip hop, cinematic orchestral, indie folk, synthwave
MoodUplifting, melancholic, tense, dreamy, energetic
InstrumentsAcoustic guitar, soft piano, electronic drums, strings
TempoSlow, moderate, fast, 120 BPM
VocalsInstrumental only, female vocals, humming, choir
PurposeBackground for product video, intro jingle, podcast outro

Managing your creations

All generated audio is saved automatically to your account. Access it from Recent creations on the AI Suite homepage or from the History tab inside each tool.

All AI-generated audio content is subject to Magnific Terms and Conditions for AI Products.

Possible issues

Generated voice sounds robotic or unnatural

Try switching to a different model — ElevenLabs v3 generally produces the most natural results. Make sure your script uses proper punctuation: commas create short pauses, periods create longer ones. Avoid ALL CAPS or unusual formatting. If a specific word is mispronounced, try spelling it phonetically.

Voice Cloning result doesn't sound like the original

The quality of your input samples matters more than anything else. Use clear recordings with no background noise or music, at least 30 seconds total. Speak naturally — don't read in a monotone. If the result still doesn't match, try uploading different samples or adding more (up to 5).

Script is too long for the selected model

ElevenLabs v2 and v3 support up to approximately 5,000 characters per generation. Gemini 2.5 Pro supports up to 40,000 characters. If your script exceeds the limit, either switch to Gemini or split it into shorter segments and generate each one separately.

Voice Changer output has artifacts or distortion

The input audio must be clean — background noise, music, or overlapping speakers will degrade the result. Use MP3 or WAV format for best quality. If the output still has issues, try a shorter clip first to test before converting longer recordings.

Sound effect doesn't match the prompt

Be specific. Instead of "explosion," try "distant muffled explosion with debris falling on concrete." Include details about environment, intensity, and duration. Avoid abstract or emotional descriptions — the model works best with concrete, physical sounds.

Music track loops awkwardly or ends abruptly

Set the duration to match your needs before generating. If you need a seamless loop, mention "loopable" or "seamless loop" in your prompt. For tracks that need a clean ending, include "with fade out" or "with natural ending" in the description.

Credits consumed but generation failed

If a generation fails due to a system error, credits are refunded automatically. Check your credit balance in your account settings. If credits were deducted and you received no output, contact support.