Skip to content

norikokono/Bolt-Elevenlabs-Netlify-hackathon

Repository files navigation

๐ŸŽค Voice AI Assistant - Advanced Speech Recognition with AI Responses

Built with Bolt.new - A production-ready voice AI application featuring real-time speech recognition, intelligent AI responses, and premium voice synthesis.

๐ŸŒŸ Live Demo

๐Ÿš€ Try it live: https://bolt-hackathon-voice-ai.netlify.app

๐Ÿš€ Project Overview

This Voice AI Assistant represents cutting-edge conversational AI technology, seamlessly integrating real-time speech recognition, intelligent response generation, and premium voice synthesis. Built with modern web technologies, this application demonstrates production-ready AI capabilities with an exceptional user experience.

โœจ Key Features

๐ŸŽค Advanced Voice Processing

  • Real-time Speech Recognition with automatic retry and error recovery
  • Natural Voice Synthesis using ElevenLabs' cutting-edge technology
  • Context-Aware Conversations with memory across interactions
  • ESL Optimizations with support for various English accents
  • Mobile-Optimized speech recognition with battery-conscious design

๐Ÿง  Intelligent AI Capabilities

  • Comprehensive AI Knowledge Base covering ML, deep learning, NLP, computer vision
  • Contextual Understanding that remembers conversation history
  • Technical Expertise in artificial intelligence and machine learning
  • Real-time Processing with sub-second response times
  • Personality-Driven Responses for engaging interactions

๐ŸŽจ Production-Ready Design

  • Apple-level Aesthetics with attention to micro-interactions
  • Fully Responsive Design optimized for all devices and screen sizes
  • Accessibility Features for inclusive user experience
  • Performance Optimized for smooth real-time interactions
  • Advanced Animations including shooting stars and smooth transitions

๐Ÿ—๏ธ Enterprise Architecture

  • Modular Component Design for scalability and maintainability
  • TypeScript Implementation for type safety and better development experience
  • Comprehensive Error Handling for reliability across all features
  • Cloud-Ready Infrastructure deployed on Netlify
  • Clean Code Structure following React best practices

๐Ÿ› ๏ธ Technology Stack

  • Frontend: React 18, TypeScript, Tailwind CSS
  • Voice Processing: Web Speech API, ElevenLabs API
  • AI Integration: Advanced conversation engine with comprehensive knowledge base
  • Build Tool: Vite for optimal performance and fast development
  • Icons: Lucide React for consistent, beautiful iconography
  • Deployment: Netlify with automatic CI/CD
  • Styling: Custom CSS animations and Tailwind utilities

๐ŸŽฏ Technical Highlights

Innovation Points

  • โœ… Advanced Speech Recognition - Browser-native with ESL optimizations
  • โœ… Premium Voice Synthesis - ElevenLabs integration with customizable settings
  • โœ… Intelligent AI Responses - Comprehensive knowledge base covering AI/ML topics
  • โœ… Mobile-First Design - Optimized for touch devices and mobile browsers
  • โœ… Real-time Interaction - Sub-second response times across all features
  • โœ… Production Architecture - Scalable, maintainable, and well-documented code

Competitive Advantages

  1. Comprehensive AI Knowledge - Deep expertise in machine learning, neural networks, and modern AI
  2. ESL-Friendly Recognition - Optimized for non-native English speakers
  3. Mobile Excellence - Seamless experience across all devices
  4. Beautiful Design - Award-worthy user interface with smooth animations
  5. Robust Error Handling - Graceful failure recovery for all services
  6. Extensible Platform - Ready for advanced AI feature expansion

๐Ÿš€ Getting Started

Prerequisites

  • Node.js 18+ and npm
  • ElevenLabs API key (optional, for voice synthesis)

Installation

  1. Clone the repository

    git clone <repository-url>
    cd voice-ai-assistant
  2. Install dependencies

    npm install
  3. Set up environment variables (optional)

    # Create .env file for automatic ElevenLabs connection
    VITE_ELEVENLABS_API_KEY=your_elevenlabs_key_here
  4. Start development server

    npm run dev
  5. Open in browser Navigate to http://localhost:5173

ElevenLabs Setup (Optional)

  1. Visit ElevenLabs and create an account
  2. Navigate to Profile โ†’ API Keys
  3. Create a new API key or copy your existing one
  4. Either add it to your .env file or enter it in the app's setup modal

๐ŸŽฎ How to Use

Basic Voice Interaction

  1. Grant microphone permissions when prompted by your browser
  2. Click the large microphone button to start speaking
  3. Speak naturally - the AI will process your voice in real-time
  4. Listen to AI responses with natural voice synthesis (if ElevenLabs is configured)

Advanced Features

  • Voice Settings - Customize voice parameters like stability, similarity, and speed
  • Text Input - Type messages as an alternative to voice input
  • Message History - Review and replay previous conversations
  • Fix/Erase - Correct or remove your last message
  • ESL Tips - Guidance for non-native English speakers
  • Mobile Optimization - Seamless experience on phones and tablets

Voice Settings Customization

  • Voice Selection - Choose from available ElevenLabs voices
  • Stability - Control voice consistency (0.0 = variable, 1.0 = stable)
  • Similarity - Adjust voice similarity to original (0.0 = creative, 1.0 = similar)
  • Speed - Modify speech rate (0.5x to 2.0x)
  • Presets - Quick settings for Natural, Expressive, Stable, or Dynamic voices

๐Ÿ† What This Project Demonstrates

Technical Excellence

  • Modern Web Development with React 18, TypeScript, and Vite
  • Advanced API Integration with ElevenLabs for voice synthesis
  • Browser API Mastery using Web Speech API for recognition
  • Responsive Design with Tailwind CSS and custom animations
  • Performance Optimization for real-time voice processing

AI/ML Knowledge Integration

  • Comprehensive AI Expertise covering machine learning, deep learning, NLP
  • Technical Accuracy with up-to-date information on AI technologies
  • Educational Value making complex AI concepts accessible
  • Practical Applications demonstrating real-world AI implementations

User Experience Excellence

  • Intuitive Interface that anyone can use immediately
  • Accessibility Features for inclusive interaction
  • Error Recovery that handles real-world scenarios gracefully
  • Engaging Personality that makes AI interaction enjoyable
  • Mobile-First Approach ensuring great experience on all devices

๐Ÿ“ Project Structure

src/
โ”œโ”€โ”€ components/           # React components
โ”‚   โ”œโ”€โ”€ HeroSection.tsx          # Landing section with animations
โ”‚   โ”œโ”€โ”€ VoiceButton.tsx          # Main voice interaction button
โ”‚   โ”œโ”€โ”€ ConversationHistory.tsx  # Message display and management
โ”‚   โ”œโ”€โ”€ VoiceSettingsPanel.tsx   # Voice customization interface
โ”‚   โ”œโ”€โ”€ TextInputPanel.tsx       # Text input with fix/erase features
โ”‚   โ”œโ”€โ”€ ElevenLabsShowcase.tsx   # ElevenLabs integration showcase
โ”‚   โ”œโ”€โ”€ ESLTipsModal.tsx         # ESL user guidance
โ”‚   โ””โ”€โ”€ ...                     # Additional UI components
โ”œโ”€โ”€ utils/                # Core services and utilities
โ”‚   โ”œโ”€โ”€ speechRecognition.ts     # Speech recognition service
โ”‚   โ”œโ”€โ”€ elevenLabsService.ts     # ElevenLabs API integration
โ”‚   โ””โ”€โ”€ aiService.ts             # AI conversation engine
โ”œโ”€โ”€ hooks/                # Custom React hooks
โ”‚   โ””โ”€โ”€ useScrollAnimation.ts    # Scroll-triggered animations
โ”œโ”€โ”€ types/                # TypeScript type definitions
โ””โ”€โ”€ App.tsx              # Main application component

๐ŸŒŸ Key Components

Speech Recognition Service

  • ESL Optimizations - Lower confidence thresholds and pronunciation corrections
  • Mobile Support - Battery-conscious design with gesture detection
  • Error Recovery - Comprehensive error handling with user-friendly messages
  • Real-time Processing - Interim and final transcript handling

AI Conversation Engine

  • Knowledge Base - Comprehensive coverage of AI/ML topics
  • Context Awareness - Maintains conversation history and context
  • Personality System - Configurable enthusiasm, helpfulness, and technical depth
  • Speech Optimization - Emoji removal and text processing for voice synthesis

Voice Synthesis Integration

  • ElevenLabs API - Premium voice synthesis with multiple voice options
  • Customizable Settings - Stability, similarity, and speed controls
  • Error Handling - Graceful fallbacks and user feedback
  • Audio Management - Proper audio lifecycle and memory management

๐Ÿ”ง Configuration Options

Environment Variables

# Optional: Auto-connect to ElevenLabs
VITE_ELEVENLABS_API_KEY=your_api_key_here

Voice Recognition Settings

  • Confidence Threshold - Adjustable for different user needs
  • Auto-stop Delay - Configurable silence detection
  • Language Support - Primary and alternative language codes
  • ESL Optimizations - Enable/disable pronunciation corrections

Voice Synthesis Settings

  • Voice Selection - Choose from available ElevenLabs voices
  • Quality Settings - Stability and similarity boost controls
  • Speed Control - Adjustable speech rate
  • Preset Configurations - Quick settings for different use cases

๐Ÿš€ Deployment

This application is deployed on Netlify with automatic CI/CD:

  • Live URL: https://bolt-hackathon-voice-ai.netlify.app
  • Automatic Builds - Deploys on every push to main branch
  • Environment Variables - Configured in Netlify dashboard
  • Custom Domain - Ready for custom domain configuration

Manual Deployment

# Build for production
npm run build

# Preview build locally
npm run preview

# Deploy to Netlify (if Netlify CLI is configured)
netlify deploy --prod --dir=dist

๐Ÿค Contributing

This project showcases modern AI-powered web development. Contributions are welcome to further enhance the voice AI capabilities:

  1. Fork the repository
  2. Create a feature branch (git checkout -b feature/amazing-feature)
  3. Commit your changes (git commit -m 'Add amazing feature')
  4. Push to the branch (git push origin feature/amazing-feature)
  5. Open a Pull Request

๐Ÿ“„ License

Built with โค๏ธ using Bolt.new - Representing the future of AI-powered development.

๐Ÿ™ Acknowledgments

  • ElevenLabs - For providing exceptional voice synthesis technology
  • Bolt.new - For the powerful development platform
  • React Team - For the excellent React framework
  • Tailwind CSS - For the utility-first CSS framework
  • Lucide - For the beautiful icon library

๐ŸŽค Voice AI Assistant: This project demonstrates cutting-edge AI technology, production-ready architecture, and exceptional user experience - showcasing the true potential of modern voice-enabled AI applications.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages