Google’s Gemini app now accepts audio file uploads, answering what the corporate acknowledges was its most requested function.
For entrepreneurs and content material groups, it means you may push recordings straight into Gemini for evaluation, summaries, and repurposed content material with out leaping between instruments.
Josh Woodward, VP at Google Labs and Gemini, introduced the change on X:
“Now you can add any file to @GeminiApp. Together with the #1 request: audio information are actually supported!”
What’s New
Gemini can now ingest audio information in the identical multi-file workflow you already use for paperwork and pictures.
You possibly can connect as much as 10 information per immediate, and information inside ZIP archives are supported, which helps while you wish to add uncooked tracks or a number of interview takes collectively.
Limits
- Free plan: whole audio size as much as 10 minutes per immediate; as much as 5 prompts per day.
- AI Professional and AI Extremely: whole audio size as much as 3 hours per immediate.
- Per immediate: as much as 10 information throughout supported codecs. Particulars are listed in Google’s Assist Middle.
Why This Issues
In case your group works with podcasts, webinars, interviews, or buyer calls, this closes a niche that usually pressured a separate transcription step.
You possibly can add a full interview and switch it into present notes, pull quotes, or a working draft in a single place. It additionally helps meeting-heavy groups: a recorded technique session can turn into motion gadgets and a short with out exporting to a different software first.
For companies and networks, batching a number of episodes or takes into one immediate reduces friction in weekly workflows.
The sensible win is fewer handoffs: supply audio goes in, and the outlines, summaries, and excerpts you want come out. Inside the identical system you already use for textual content prompting.
Fast Tip
Add your audio along with any supporting context in the identical immediate. That provides Gemini the grounding it wants to supply cleaner summaries and extra correct excerpts.
Should you’re testing on the free tier, plan across the 10-minute ceiling; longer content material is greatest on AI Professional or Extremely.
Trying Forward
Google’s limits pages do change, so control whole size, file-count guidelines, and any new guardrails that have an effect on longer recordings or bigger groups. Additionally look ahead to deeper Workspace tie-ins (for instance, simpler handoffs from Meet recordings) that will streamline getting audio into Gemini with out handbook uploads.
Featured Picture: Photograph Company/Shutterstock