Skip to content

Latest commit

ย 

History

History
39 lines (27 loc) ยท 1.23 KB

File metadata and controls

39 lines (27 loc) ยท 1.23 KB

AI Audio Transcriber (Gemini 2.5)

A Next.js application that transcribes audio files using the Google Gemini 2.5 Flash model, featuring automatic speaker detection.

๐Ÿš€ Deployment (Vercel)

  1. Push to GitHub: Create a new repository on GitHub and push your code.
  2. Import to Vercel: Go to Vercel and import your repository.
  3. Environment Variables: Add the following variable in the Vercel project settings:
    • GEMINI_API_KEY: Your Google AI Studio API key.

โš ๏ธ Important Note on File Sizes

Vercel's Serverless Functions (Hobby plan) have a 4.5MB payload limit.

  • Local Development: Works for large files (up to 100MB as configured).
  • Vercel: Uploads larger than 4.5MB may fail on the standard plan. For production with large files, consider using Vercel Blob or direct client-side uploads to a storage bucket (S3/GCS).

๐Ÿ› ๏ธ Local Development

  1. Install dependencies:

    npm install
  2. Setup environment: Create a .env.local file and add:

    GEMINI_API_KEY=your_actual_api_key_here
  3. Run the app:

    npm run dev

๐Ÿ“„ License

MIT