Initial commit: Bilingual Voice Assistant for Google AIY Voice Kit V1
Features:
- Bilingual support (English/Mandarin Chinese)
- Hotword detection: 'Hey Osiris' / '你好 Osiris'
- Music playback control (MP3, WAV, OGG, FLAC)
- OpenClaw integration for AI responses
- Google AIY Voice Kit V1 compatible
- Text-to-speech in both languages
- Voice command recognition
- Raspberry Pi ready with installation script
AI Now Inc - Del Mar Demo Unit 🏭
Commit: 1662bc141a

.gitignore (vendored, new file, 55 lines)
@@ -0,0 +1,55 @@
# Python
__pycache__/
*.py[cod]
*$py.class
*.so
.Python
venv/
ENV/
env/
.venv

# Credentials
*.json
!config.json
!hotword_config.json
credentials.json
.env
.secrets

# Logs
*.log
logs/

# Audio files
*.wav
*.mp3
*.ogg
*.flac

# Temporary files
tmp/
temp/
*.tmp

# IDE
.vscode/
.idea/
*.swp
*.swo
*~

# OS
.DS_Store
Thumbs.db

# Database
*.db
!sensor_data.db

# Exports
exports/

# Test files
test_*.wav
test_output.*
QUICKSTART.md (new file, 258 lines)
@@ -0,0 +1,258 @@
# Quick Start Guide - Bilingual Voice Assistant

**AI Now Inc - Del Mar Demo Unit**
**Laboratory Assistant:** Claw 🏭

## 🚀 Installation (5 minutes)

### Step 1: Clone and Install

```bash
# Navigate to workspace
cd /home/pi

# Clone or copy the voice-assistant folder
# (If copying from another machine, use scp or git)

# Make install script executable
cd voice-assistant
chmod +x install.sh

# Run installation
sudo ./install.sh
```

### Step 2: Configure

Edit the configuration file:

```bash
nano config.local.json
```

Update these settings:

- `openclaw.ws_url`: Your OpenClaw server address
- `openclaw.api_key`: Your API key (if required)
- `music.library_path`: Path to your music files
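A local override file like `config.local.json` is typically merged over the defaults shipped in `config.json`. A minimal sketch of such a deep merge, assuming those override semantics (the helper below is illustrative, not taken from this repo):

```python
def deep_merge(base: dict, override: dict) -> dict:
    """Recursively overlay override values on top of base defaults."""
    merged = dict(base)
    for key, value in override.items():
        if isinstance(value, dict) and isinstance(merged.get(key), dict):
            merged[key] = deep_merge(merged[key], value)
        else:
            merged[key] = value
    return merged

# Override only the library path; every other default survives
base = {"music": {"library_path": "/home/pi/Music", "default_volume": 0.7}}
local = {"music": {"library_path": "/mnt/usb/Music"}}
print(deep_merge(base, local))
```

This way a local edit only needs to list the keys that differ from the defaults.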
### Step 3: Add Music (Optional)

```bash
# Copy MP3 files to the music directory
cp /path/to/your/music/*.mp3 /home/pi/Music/

# Or download sample music
# (Ensure you have rights to the music)
```

### Step 4: Test

```bash
# Run in demo mode first
./venv/bin/python3 main.py --mode demo

# Or run in test mode
./venv/bin/python3 main.py --mode test
```

### Step 5: Run

```bash
# Start the service
sudo systemctl start voice-assistant

# Or run manually
./start.sh
```

## 🎤 Voice Commands

### Hotword

First, say the hotword to activate:

- **English:** "Hey Osiris"
- **Mandarin:** "你好 Osiris"

### General Commands

| English | Mandarin | Description |
|---------|----------|-------------|
| "Hello" | "你好" | Greeting |
| "What time is it?" | "现在几点?" | Get current time |
| "How are you?" | "你好吗?" | Greeting response |
| "Ask Claw: [question]" | "问 Claw:[问题]" | Ask OpenClaw |

### Music Commands

| English | Mandarin | Description |
|---------|----------|-------------|
| "Play [song name]" | "播放 [歌曲名]" | Play music |
| "Play music" | "播放音乐" | Play any music |
| "Pause" | "暂停" | Pause playback |
| "Resume" | "继续" | Resume playback |
| "Stop" | "停止" | Stop playback |
| "Next" | "下一首" | Next track |
| "Previous" | "上一首" | Previous track |
| "Volume up" | "音量大" | Increase volume |
| "Volume down" | "音量小" | Decrease volume |

## 🔧 Troubleshooting

### Microphone Not Working

```bash
# Check if the microphone is detected
arecord -l

# Test recording
arecord -d 3 test.wav
aplay test.wav

# Check volume levels
alsamixer
# Press F4 to see capture levels
# Use arrow keys to adjust
```

### No Sound Output

```bash
# Check audio output
speaker-test -t wav

# Set default output
alsamixer
# Press F6 to select output device
```

### Hotword Not Detecting

1. **Check microphone sensitivity:**
   ```bash
   alsamixer
   # Adjust capture levels
   ```

2. **Reduce background noise**

3. **Speak clearly and closer to the microphone**

4. **Adjust sensitivity in config** (higher values are more sensitive):
   ```json
   {
     "speech": {
       "hotword_sensitivity": 0.6
     }
   }
   ```

### Music Not Playing

```bash
# Check that files are in the correct location
ls -la /home/pi/Music/

# Verify file format (MP3, WAV, OGG, FLAC)
file /home/pi/Music/song.mp3

# Test playback manually
./venv/bin/python3 -c "from music_player import MusicPlayer; p = MusicPlayer(); p.play(list(p.music_library.values())[0])"
```

### OpenClaw Not Connecting

1. **Check network connection:**
   ```bash
   ping 192.168.1.100  # Replace with your server IP
   ```

2. **Verify OpenClaw is running:**
   ```bash
   # On the server
   openclaw status
   ```

3. **Check firewall:**
   ```bash
   sudo ufw status
   ```

## 📊 Logs

### View Live Logs

```bash
# Service logs
sudo journalctl -u voice-assistant -f

# Installation logs
cat /var/log/voice-assistant-install.log

# Application logs (if configured)
tail -f /var/log/voice-assistant.log
```

### Debug Mode

```bash
# Run with debug logging
./venv/bin/python3 main.py --mode run --log-level DEBUG
```

## 🔄 Updates

### Update Installation

```bash
cd /home/pi/voice-assistant

# Pull latest changes (if using git)
git pull

# Reinstall dependencies
source venv/bin/activate
pip install -r requirements.txt --upgrade
```

### Update Configuration

```bash
# Edit local config
nano config.local.json

# Restart the service
sudo systemctl restart voice-assistant
```

## 🛑 Uninstall

```bash
# Run the uninstaller
sudo ./uninstall.sh

# Or manually:
sudo systemctl stop voice-assistant
sudo systemctl disable voice-assistant
sudo rm -rf /home/pi/voice-assistant
sudo rm /etc/systemd/system/voice-assistant.service
```

## 📚 Additional Resources

- [Full Documentation](README.md)
- [Google AIY Voice Kit Docs](https://github.com/google/aiyprojects-raspbian)
- [Porcupine Hotword Detection](https://github.com/Picovoice/porcupine)
- [OpenClaw Documentation](https://docs.openclaw.ai)

## 🆘 Support

For issues or questions:

1. Check the [README.md](README.md)
2. Review logs: `sudo journalctl -u voice-assistant`
3. Test in demo mode first
4. Ensure all dependencies are installed

---

**AI Now Inc** - Del Mar Show Demo Unit
**Version:** 1.0.0
**Last Updated:** 2026-02-28
README.md (new file, 227 lines)
@@ -0,0 +1,227 @@
# 🎤 Bilingual Voice Assistant - Google AIY Voice Kit V1

**AI Now Inc - Del Mar Demo Unit**
**Laboratory Assistant:** Claw 🏭

A bilingual (English/Mandarin) voice-activated assistant for the Google AIY Voice Kit V1 with music playback capability.

## Features

- ✅ **Bilingual Support** - English and Mandarin Chinese speech recognition
- ✅ **Text-to-Speech** - Responds in the detected language
- ✅ **Music Playback** - Play MP3 files by voice command
- ✅ **Remote Communication** - Connect to the OpenClaw assistant via API
- ✅ **Offline Capability** - Basic commands work without internet
- ✅ **Hotword Detection** - "Hey Assistant" / "你好助手" wake word

## Hardware Requirements

- **Google AIY Voice Kit V1** (with Voice HAT)
- **Raspberry Pi** (3B/3B+/4B recommended)
- **MicroSD Card** (8GB+)
- **Speaker** (3.5mm or HDMI audio)
- **Microphone** (included with AIY Kit)
- **Internet Connection** (WiFi/Ethernet)

## Software Architecture

```
┌─────────────────────────────────────────────────────────┐
│                 Google AIY Voice Kit V1                 │
│  ┌─────────────┐  ┌──────────────┐  ┌──────────────┐    │
│  │   Hotword   │  │    Speech    │  │   Command    │    │
│  │  Detection  │→ │ Recognition  │→ │  Processing  │    │
│  └─────────────┘  └──────────────┘  └──────────────┘    │
│         ↓                 ↓                             │
│  ┌──────────────────────────────────────────────────┐   │
│  │           Language Detection (en/zh)             │   │
│  └──────────────────────────────────────────────────┘   │
│                         ↓                               │
│  ┌──────────────────────────────────────────────────┐   │
│  │           OpenClaw API Communication             │   │
│  └──────────────────────────────────────────────────┘   │
│                         ↓                               │
│  ┌─────────────┐  ┌──────────────┐  ┌──────────────┐    │
│  │     TTS     │  │ Music Player │  │   Response   │    │
│  │   (en/zh)   │  │    (MP3)     │  │   Handler    │    │
│  └─────────────┘  └──────────────┘  └──────────────┘    │
└─────────────────────────────────────────────────────────┘
```

## Installation

### 1. Set Up the Google AIY Voice Kit

```bash
# Update system
sudo apt-get update
sudo apt-get upgrade

# Install AIY Voice Kit software
cd ~
git clone https://github.com/google/aiyprojects-raspbian.git
cd aiyprojects-raspbian
bash install.sh
sudo reboot
```

### 2. Install Dependencies

```bash
# Python dependencies
pip3 install google-cloud-speech google-cloud-texttospeech
pip3 install pygame mutagen
pip3 install requests websocket-client
pip3 install langdetect
```

### 3. Configure Google Cloud (Optional - for cloud services)

```bash
# Set up Google Cloud credentials
export GOOGLE_APPLICATION_CREDENTIALS="/path/to/credentials.json"
```

## Configuration

Edit `config.json`:

```json
{
  "openclaw": {
    "enabled": true,
    "ws_url": "ws://192.168.1.100:18790",
    "api_key": "your_api_key"
  },
  "speech": {
    "language": "auto",
    "hotword": "hey assistant|你好助手"
  },
  "music": {
    "library_path": "/home/pi/Music",
    "default_volume": 0.7
  },
  "tts": {
    "english_voice": "en-US-Standard-A",
    "chinese_voice": "zh-CN-Standard-A"
  }
}
```
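Note that the `hotword` value packs both wake phrases into one pipe-separated string. A minimal sketch of how such a value can be split and matched (the helper names are illustrative, not part of the repo):

```python
def parse_hotwords(value: str) -> list[str]:
    """Split a pipe-separated hotword string into normalized phrases."""
    return [phrase.strip().lower() for phrase in value.split("|") if phrase.strip()]

def matches_hotword(text: str, hotwords: list[str]) -> bool:
    """Return True if any configured wake phrase appears in the text."""
    lowered = text.lower()
    return any(hw in lowered for hw in hotwords)

hotwords = parse_hotwords("hey assistant|你好助手")
print(matches_hotword("Hey Assistant, what time is it?", hotwords))  # True
```

Both the English and Mandarin phrases are checked against the same utterance, so one config key covers both languages.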
## Usage

### Start the Assistant

```bash
cd /home/pi/voice-assistant
python3 main.py
```

### Voice Commands

#### General Commands

- "Hey Assistant, what time is it?" / "你好助手,现在几点?"
- "Hey Assistant, how are you?" / "你好助手,你好吗?"
- "Hey Assistant, tell me a joke" / "你好助手,讲个笑话"

#### Music Commands

- "Hey Assistant, play [song name]" / "你好助手,播放 [歌曲名]"
- "Hey Assistant, pause" / "你好助手,暂停"
- "Hey Assistant, resume" / "你好助手,继续"
- "Hey Assistant, stop" / "你好助手,停止"
- "Hey Assistant, next track" / "你好助手,下一首"
- "Hey Assistant, volume up" / "你好助手,音量加大"

#### OpenClaw Commands

- "Hey Assistant, ask Claw: [your question]"
- "你好助手,问 Claw:[你的问题]"

## Project Structure

```
voice-assistant/
├── main.py                 # Main entry point
├── config.json             # Configuration file
├── assistant.py            # Core assistant logic
├── speech_recognizer.py    # Speech recognition (en/zh)
├── tts_engine.py           # Text-to-speech engine
├── music_player.py         # MP3 playback control
├── openclaw_client.py      # OpenClaw API client
├── hotword_detector.py     # Wake word detection
├── requirements.txt        # Python dependencies
└── samples/                # Sample audio files
```

## Language Detection

The system automatically detects the spoken language:

- **English keywords** → English response
- **Chinese keywords** → Mandarin response
- **Mixed input** → respond in the dominant language
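A minimal sketch of this heuristic, mirroring the CJK-range check used in `assistant.py` (the function name and 50% threshold are illustrative):

```python
def dominant_language(text: str) -> str:
    """Classify text as 'zh' or 'en' by the share of CJK characters."""
    chars = [ch for ch in text if not ch.isspace()]
    if not chars:
        return "en"
    # Count characters in the CJK Unified Ideographs block
    cjk = sum(1 for ch in chars if '\u4e00' <= ch <= '\u9fff')
    # Mixed input: respond in whichever script dominates
    return "zh" if cjk / len(chars) >= 0.5 else "en"

print(dominant_language("what time is it"))  # en
print(dominant_language("现在几点"))          # zh
```

A phrase like "play 音乐" therefore resolves to English, since Latin characters dominate the utterance.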
## Music Library

Organize your MP3 files:

```
/home/pi/Music/
├── artist1/
│   ├── song1.mp3
│   └── song2.mp3
├── artist2/
│   └── song3.mp3
└── playlist/
    └── favorites.mp3
```
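A tree like this can be scanned recursively into a name-to-path index. A minimal sketch, assuming the `supported_formats` list from `config.json` (the repo's actual `MusicPlayer` scan may differ):

```python
from pathlib import Path

SUPPORTED_FORMATS = {".mp3", ".wav", ".ogg", ".flac"}

def scan_library(root: str) -> dict[str, Path]:
    """Map lowercase track names (file stems) to file paths, recursively."""
    library = {}
    for path in sorted(Path(root).rglob("*")):
        if path.is_file() and path.suffix.lower() in SUPPORTED_FORMATS:
            library[path.stem.lower()] = path
    return library
```

With the layout above, `scan_library("/home/pi/Music")["song1"]` would resolve to `artist1/song1.mp3`; non-audio files are skipped.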
## Advanced Features

### Custom Hotword

Train your own hotword using Porcupine or Snowboy.

### Offline Speech Recognition

Use Vosk or PocketSphinx for offline recognition.

### Multi-room Audio

Stream audio to multiple devices via Snapcast.

### Voice Profiles

Recognize different users and personalize responses.

## Troubleshooting

### Microphone not detected

```bash
arecord -l   # List audio devices
alsamixer    # Check levels
```

### Poor speech recognition

- Speak clearly and closer to the microphone
- Reduce background noise
- Check the internet connection for cloud recognition

### Music playback issues

```bash
# Test audio output
speaker-test -t wav

# Check volume
alsamixer
```

## Next Steps

- [ ] Add voice profile recognition
- [ ] Implement offline speech recognition
- [ ] Add Spotify/Apple Music integration
- [ ] Create a web UI for music library management
- [ ] Add multi-language support (Spanish, French, etc.)
- [ ] Implement voice commands for industrial control

---

**AI Now Inc** - Del Mar Show Demo Unit
**Contact:** Laboratory Assistant Claw 🏭
**Version:** 1.0.0
assistant.py (new executable file, 253 lines)
@@ -0,0 +1,253 @@
#!/usr/bin/env python3
"""
Bilingual Voice Assistant Core
Main logic for processing voice commands and generating responses.
"""

import json
import logging
import random
from typing import Optional, Dict, Tuple
from datetime import datetime

from speech_recognizer import BilingualSpeechRecognizer
from music_player import MusicPlayer
from openclaw_client import OpenClawClient

logger = logging.getLogger(__name__)


class VoiceAssistant:
    """
    Main assistant class coordinating speech recognition,
    command processing, and responses.
    """

    def __init__(self, config_path: str = "config.json"):
        self.config_path = config_path
        self.config = self._load_config(config_path)

        # Initialize components
        self.speech_recognizer = BilingualSpeechRecognizer(config_path)
        self.music_player = MusicPlayer(config_path)
        self.openclaw_client = OpenClawClient(config_path)

        # Command patterns
        self.music_commands = [
            "play", "pause", "resume", "stop", "next", "previous",
            "volume", "shuffle", "repeat"
        ]

        self.chinese_music_commands = [
            "播放", "暂停", "继续", "停止", "下一首", "上一首",
            "音量", "随机", "重复"
        ]

        logger.info("VoiceAssistant initialized")

    def _load_config(self, config_path: str) -> dict:
        """Load configuration."""
        try:
            with open(config_path, 'r') as f:
                return json.load(f)
        except FileNotFoundError:
            return {}

    def process_command(self, text: str, language: str = "en") -> Tuple[str, str]:
        """
        Process a voice command and return a response.

        Args:
            text: Recognized text
            language: Detected language ('en' or 'zh')

        Returns:
            Tuple of (response_text, response_language)
        """
        text_lower = text.lower()

        # Music commands
        if self._is_music_command(text_lower, language):
            return self._handle_music_command(text_lower, language)

        # Time query
        if any(word in text_lower for word in ["what time", "time is it", "几点", "时间"]):
            return self._get_time(language)

        # Greeting
        if any(word in text_lower for word in ["hello", "hi", "hey", "你好", "您好"]):
            return self._get_greeting(language)

        # OpenClaw query
        if "ask claw" in text_lower or "问 claw" in text_lower:
            # Extract the actual question
            question = text_lower.replace("ask claw", "").replace("问 claw", "").strip()
            return self._ask_openclaw(question, language)

        # Default: ask OpenClaw
        return self._ask_openclaw(text, language)

    def _is_music_command(self, text: str, language: str) -> bool:
        """Check if text is a music command."""
        if language == "en":
            return any(cmd in text for cmd in self.music_commands)
        else:
            return any(cmd in text for cmd in self.chinese_music_commands)

    def _handle_music_command(self, text: str, language: str) -> Tuple[str, str]:
        """Handle music playback commands."""

        # Play command
        if "play" in text or "播放" in text:
            # Extract song name if specified
            song_name = self._extract_song_name(text)
            if song_name:
                matches = self.music_player.search_tracks(song_name)
                if matches:
                    self.music_player.play(matches[0])
                    return (f"Playing {matches[0].name}",
                            "en" if language == "en" else "zh")
                else:
                    return ("Song not found",
                            "en" if language == "en" else "zh")
            else:
                # No song specified: play the first track in the library
                if self.music_player.music_library:
                    first_track = list(self.music_player.music_library.values())[0]
                    self.music_player.play(first_track)
                    return ("Playing music",
                            "en" if language == "en" else "zh")

        # Pause
        elif "pause" in text or "暂停" in text:
            self.music_player.pause()
            return ("Paused", "en" if language == "en" else "zh")

        # Resume
        elif "resume" in text or "继续" in text:
            self.music_player.resume()
            return ("Resumed", "en" if language == "en" else "zh")

        # Stop
        elif "stop" in text or "停止" in text:
            self.music_player.stop()
            return ("Stopped", "en" if language == "en" else "zh")

        # Next
        elif "next" in text or "下一首" in text:
            self.music_player.next()
            return ("Next track", "en" if language == "en" else "zh")

        # Volume
        elif "volume" in text or "音量" in text:
            if "up" in text or "大" in text:
                self.music_player.set_volume(self.music_player.volume + 0.1)
            elif "down" in text or "小" in text:
                self.music_player.set_volume(self.music_player.volume - 0.1)
            return ("Volume adjusted", "en" if language == "en" else "zh")

        return ("Command not recognized", "en" if language == "en" else "zh")

    def _extract_song_name(self, text: str) -> Optional[str]:
        """Extract the song name from a command."""
        # Simple implementation: look for text after "play" / "播放"
        if "play" in text:
            parts = text.split("play", 1)
            if len(parts) > 1:
                return parts[1].strip()
        if "播放" in text:
            parts = text.split("播放", 1)
            if len(parts) > 1:
                return parts[1].strip()
        return None

    def _get_time(self, language: str) -> Tuple[str, str]:
        """Get the current-time response."""
        now = datetime.now()
        if language == "zh":
            return (f"现在时间是 {now.strftime('%H点%M分')}", "zh")
        else:
            return (f"The current time is {now.strftime('%I:%M %p')}", "en")

    def _get_greeting(self, language: str) -> Tuple[str, str]:
        """Get a greeting response."""
        greetings_en = [
            "Hello! How can I help you?",
            "Hi there! What can I do for you?",
            "Hey! Ready to assist you."
        ]
        greetings_zh = [
            "你好!有什么可以帮你的吗?",
            "您好!需要什么帮助?",
            "嗨!随时为您服务。"
        ]

        if language == "zh":
            return (random.choice(greetings_zh), "zh")
        else:
            return (random.choice(greetings_en), "en")

    def _ask_openclaw(self, question: str, language: str) -> Tuple[str, str]:
        """Send a question to OpenClaw and return its response."""
        if not self.openclaw_client.enabled:
            if language == "zh":
                return ("OpenClaw 未启用", "zh")
            else:
                return ("OpenClaw is not enabled", "en")

        # Add context about language preference
        context = {"preferred_language": language}

        response = self.openclaw_client.send_request(question, context)

        if "error" in response:
            if language == "zh":
                return ("抱歉,暂时无法回答", "zh")
            else:
                return ("Sorry, I can't answer that right now", "en")

        # Extract response text
        response_text = response.get("response", str(response))

        # Detect response language
        response_lang = language  # Assume same language
        if any('\u4e00' <= char <= '\u9fff' for char in response_text):
            response_lang = "zh"

        return (response_text, response_lang)

    def get_status(self) -> Dict:
        """Get assistant status."""
        return {
            "speech_recognizer": "active",
            "music_player": self.music_player.get_status(),
            "openclaw": self.openclaw_client.get_status()
        }


def main():
    """Test the assistant."""
    assistant = VoiceAssistant()

    # Test commands
    test_commands = [
        ("hello", "en"),
        ("what time is it", "en"),
        ("play music", "en"),
        ("你好", "zh"),
        ("现在几点", "zh"),
        ("播放音乐", "zh")
    ]

    for text, lang in test_commands:
        response, resp_lang = assistant.process_command(text, lang)
        print(f"Input: {text} ({lang})")
        print(f"Output: {response} ({resp_lang})")
        print("-" * 40)


if __name__ == "__main__":
    logging.basicConfig(level=logging.INFO)
    main()
config.json (new file, 37 lines)
@@ -0,0 +1,37 @@
{
  "openclaw": {
    "enabled": true,
    "ws_url": "ws://192.168.1.100:18790",
    "api_key": "your_api_key_here",
    "reconnect_interval": 5
  },
  "speech": {
    "language": "auto",
    "hotword": "hey assistant|你好助手",
    "hotword_sensitivity": 0.5,
    "recognition_timeout": 5,
    "offline_mode": false
  },
  "music": {
    "library_path": "/home/pi/Music",
    "default_volume": 0.7,
    "scan_interval": 300,
    "supported_formats": [".mp3", ".wav", ".ogg", ".flac"]
  },
  "tts": {
    "english_voice": "en-US-Standard-A",
    "chinese_voice": "zh-CN-Standard-A",
    "speed": 1.0,
    "pitch": 0
  },
  "audio": {
    "input_device": "default",
    "output_device": "default",
    "sample_rate": 16000,
    "channels": 1
  },
  "logging": {
    "level": "INFO",
    "file": "/var/log/voice-assistant.log"
  }
}
hotword_config.json (new file, 19 lines)
@@ -0,0 +1,19 @@
{
  "hotwords": [
    {
      "keyword": "hey osiris",
      "keyword_zh": "你好 osiris",
      "sensitivity": 0.5,
      "library_path": "resources/porcupine"
    }
  ],
  "audio": {
    "sample_rate": 16000,
    "frame_length": 512
  },
  "behavior": {
    "timeout": 30,
    "cooldown": 5,
    "continuous_listen": false
  }
}
hotword_detector.py (new executable file, 265 lines)
@@ -0,0 +1,265 @@
#!/usr/bin/env python3
|
||||
"""
|
||||
Hotword Detector
|
||||
Detects wake words: "Hey Osiris" / "你好 Osiris"
|
||||
|
||||
Supports:
|
||||
- Porcupine (PicoVoice) for accurate hotword detection
|
||||
- Custom keyword spotting
|
||||
- Bilingual support (English/Mandarin)
|
||||
"""
|
||||
|
||||
import os
|
||||
import json
|
||||
import logging
|
||||
import struct
|
||||
import wave
|
||||
from typing import Optional, Callable, List
|
||||
from pathlib import Path
|
||||
|
||||
try:
|
||||
import pvporcupine
|
||||
import pyaudio
|
||||
HAS_PORCUPINE = True
|
||||
except ImportError:
|
||||
HAS_PORCUPINE = False
|
||||
logging.warning("Porcupine not installed. Install with: pip install pvporcupine")
|
||||
|
||||
try:
|
||||
import webrtcvad
|
||||
HAS_VAD = True
|
||||
except ImportError:
|
||||
HAS_VAD = False
|
||||
logging.warning("WebRTC VAD not installed")
|
||||
|
||||
logger = logging.getLogger(__name__)
|
||||
|
||||
|
||||
class HotwordDetector:
|
||||
"""
|
||||
Hotword detection with support for "Hey Osiris" in English and Mandarin.
|
||||
"""
|
||||
|
||||
def __init__(self, config_path: str = "hotword_config.json"):
|
||||
self.config = self._load_config(config_path)
|
||||
self.audio_config = self.config.get("audio", {
|
||||
"sample_rate": 16000,
|
||||
"frame_length": 512
|
||||
})
|
||||
|
||||
self.hotwords = self.config.get("hotwords", [])
|
||||
self.is_running = False
|
||||
self.callback = None
|
||||
|
||||
# Porcupine setup
|
||||
self.porcupine = None
|
||||
self.keyword_index = -1
|
||||
|
||||
if HAS_PORCUPINE:
|
||||
self._init_porcupine()
|
||||
|
||||
# VAD setup
|
||||
self.vad = None
|
||||
if HAS_VAD:
|
||||
self.vad = webrtcvad.Vad(2) # Aggressiveness level 2
|
||||
|
||||
logger.info(f"HotwordDetector initialized (Porcupine: {HAS_PORCUPINE})")
|
||||
|
||||
def _load_config(self, config_path: str) -> dict:
|
||||
"""Load configuration."""
|
||||
try:
|
||||
with open(config_path, 'r') as f:
|
||||
return json.load(f)
|
||||
except FileNotFoundError:
|
||||
return {
|
||||
"hotwords": [
|
||||
{
|
||||
"keyword": "hey osiris",
|
||||
"keyword_zh": "你好 osiris",
|
||||
"sensitivity": 0.5
|
||||
}
|
||||
],
|
||||
"audio": {
|
||||
"sample_rate": 16000,
|
||||
"frame_length": 512
|
||||
}
|
||||
}
|
||||
|
||||
def _init_porcupine(self):
|
||||
"""Initialize Porcupine hotword detection."""
|
||||
if not HAS_PORCUPINE:
|
||||
return
|
||||
|
||||
try:
|
||||
# Create Porcupine instance with custom keywords
|
||||
self.porcupine = pvporcupine.create(
|
||||
keywords=["hey osiris"],
|
||||
sensitivities=[0.5]
|
||||
)
|
||||
self.keyword_index = 0
|
||||
logger.info("Porcupine initialized with 'Hey Osiris'")
|
||||
except Exception as e:
|
||||
logger.warning(f"Porcupine initialization failed: {e}")
|
||||
self.porcupine = None
|
||||
|
||||
def set_callback(self, callback: Callable[[], None]):
|
||||
"""Set callback function for when hotword is detected."""
|
||||
self.callback = callback
|
||||
|
||||
def detect(self, timeout: int = None) -> Optional[str]:
|
||||
"""
|
||||
Start detection and wait for hotword.
|
||||
|
||||
Args:
|
||||
timeout: Maximum time to wait in seconds (None = infinite)
|
||||
|
||||
Returns:
|
||||
Detected hotword or None
|
||||
"""
|
||||
if not self.porcupine:
|
||||
logger.warning("Porcupine not available, using simple detection")
|
||||
return self._simple_detect(timeout)
|
||||
|
||||
return self._porcupine_detect(timeout)
|
||||
|
||||
def _porcupine_detect(self, timeout: int = None) -> Optional[str]:
|
||||
"""Detect using Porcupine."""
|
||||
if not self.porcupine:
|
||||
return None
|
||||
|
||||
import pyaudio
|
||||
|
||||
pa = pyaudio.PyAudio()
|
||||
|
||||
try:
|
||||
# Open audio stream
|
||||
stream = pa.open(
|
||||
rate=self.porcupine.sample_rate,
|
||||
channels=1,
|
||||
format=pyaudio.paInt16,
|
||||
input=True,
|
||||
frames_per_buffer=self.porcupine.frame_length
|
||||
)
|
||||
|
||||
logger.info("Listening for 'Hey Osiris'...")
|
||||
self.is_running = True
|
||||
|
||||
start_time = None
|
||||
if timeout:
|
||||
import time
|
||||
start_time = time.time()
|
||||
|
||||
while self.is_running:
|
||||
# Check timeout
|
||||
if timeout and start_time:
|
||||
if time.time() - start_time > timeout:
|
||||
logger.info("Hotword detection timeout")
|
||||
break
|
||||
|
||||
# Read audio frame
|
||||
pcm = stream.read(self.porcupine.frame_length, exception_on_overflow=False)
|
||||
pcm = struct.unpack_from(
|
||||
f"h{self.porcupine.frame_length}",
|
||||
pcm
|
||||
)
|
||||
|
||||
# Process frame
|
||||
keyword_index = self.porcupine.process(pcm)
|
||||
|
||||
if keyword_index >= 0:
|
||||
logger.info("Hotword detected!")
|
||||
if self.callback:
|
||||
self.callback()
|
||||
return "hey osiris"
|
||||
|
||||
except KeyboardInterrupt:
|
||||
logger.info("Detection interrupted")
|
||||
except Exception as e:
|
||||
            logger.error(f"Detection error: {e}")
        finally:
            stream.close()
            pa.terminate()
            self.is_running = False

        return None

    def _simple_detect(self, timeout: Optional[int] = None) -> Optional[str]:
        """
        Simple voice activity detection (fallback).
        Detects any speech as the hotword.
        """
        logger.warning("Using simple voice detection (not recommended)")

        # This is a placeholder - in production you'd use:
        # - Snowboy
        # - A custom trained model
        # - Or just use Porcupine
        return None

    def stop(self):
        """Stop detection."""
        self.is_running = False
        logger.info("Hotword detection stopped")

    def create_custom_hotword(self, keyword: str, output_path: str):
        """
        Create a custom hotword model (requires Porcupine training).

        This is a placeholder - the actual implementation requires:
        1. Recording multiple samples of the keyword
        2. Training with Porcupine Console
        3. Exporting the model
        """
        logger.info(f"Custom hotword creation not implemented: {keyword}")
        logger.info("Use Porcupine Console to train custom keywords")


class SimpleHotwordDetector:
    """
    Simple hotword detection using an audio level threshold.
    Fallback when Porcupine is not available.
    """

    def __init__(self, keyword: str = "hey osiris"):
        self.keyword = keyword
        self.threshold = 0.5
        self.is_running = False

    def detect(self, timeout: Optional[int] = None) -> Optional[str]:
        """Simple energy-based detection."""
        logger.warning("Simple detection is not reliable. Install Porcupine for best results.")
        return None


def main():
    """Test hotword detection."""
    print("\n" + "="*60)
    print(" 🔍 Hotword Detector Test")
    print(" Say 'Hey Osiris' or '你好 Osiris'")
    print("="*60)

    detector = HotwordDetector()

    def on_hotword():
        print("\n🎉 HOTWORD DETECTED!")

    detector.set_callback(on_hotword)

    try:
        result = detector.detect(timeout=30)

        if result:
            print(f"Detected: {result}")
        else:
            print("No hotword detected")

    except KeyboardInterrupt:
        print("\nTest stopped")

    detector.stop()


if __name__ == "__main__":
    logging.basicConfig(level=logging.INFO)
    main()
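For reference, the energy-threshold approach that `SimpleHotwordDetector` leaves unimplemented could be sketched as below. This is a hypothetical helper, not part of the repo; it assumes 16-bit little-endian mono PCM frames, and the 0.5 threshold mirrors the class default but is not tuned.

```python
import math
import struct

def frame_rms(frame: bytes) -> float:
    """Root-mean-square level of a 16-bit little-endian PCM frame, scaled to 0.0-1.0."""
    samples = struct.unpack(f"<{len(frame) // 2}h", frame)
    if not samples:
        return 0.0
    mean_square = sum(s * s for s in samples) / len(samples)
    return math.sqrt(mean_square) / 32768.0

def is_speech(frame: bytes, threshold: float = 0.5) -> bool:
    """Crude voice-activity check: frame energy above the detector's threshold."""
    return frame_rms(frame) >= threshold

# Example frames: digital silence vs. a loud square wave
silence = struct.pack("<4h", 0, 0, 0, 0)
loud = struct.pack("<4h", 30000, -30000, 30000, -30000)
```

In a real detector these frames would come from the PyAudio stream that `detect()` opens, and a few consecutive above-threshold frames (rather than one) would trigger the callback.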
422
install.sh
Executable file
@@ -0,0 +1,422 @@
#!/bin/bash
#
# Google AIY Voice Kit V1 - Installation Script
# Bilingual Voice Assistant (English/Mandarin)
#
# AI Now Inc - Del Mar Demo Unit
# Laboratory Assistant: Claw 🏭
#

set -e  # Exit on error

# Colors for output
RED='\033[0;31m'
GREEN='\033[0;32m'
YELLOW='\033[1;33m'
BLUE='\033[0;34m'
NC='\033[0m' # No Color

# Configuration
SCRIPT_DIR="$(cd "$(dirname "${BASH_SOURCE[0]}")" && pwd)"
INSTALL_DIR="/home/pi/voice-assistant"
MUSIC_DIR="/home/pi/Music"
LOG_FILE="/var/log/voice-assistant-install.log"
PYTHON_VERSION="3.9"

echo -e "${BLUE}"
echo "=========================================="
echo " 🎤 Voice Assistant Installer"
echo " AI Now Inc - Del Mar Demo Unit"
echo "=========================================="
echo -e "${NC}"

# Logging functions
log() {
    echo -e "[$(date '+%Y-%m-%d %H:%M:%S')] $1" | tee -a "$LOG_FILE"
}

log_error() {
    echo -e "${RED}[ERROR]${NC} $1" | tee -a "$LOG_FILE"
}

log_success() {
    echo -e "${GREEN}[SUCCESS]${NC} $1" | tee -a "$LOG_FILE"
}

log_info() {
    echo -e "${YELLOW}[INFO]${NC} $1" | tee -a "$LOG_FILE"
}

# Check if running as root
check_root() {
    if [ "$EUID" -ne 0 ]; then
        log_error "Please run as root (sudo ./install.sh)"
        exit 1
    fi
}

# Check if running on Raspberry Pi
check_raspberry_pi() {
    if ! grep -q "Raspberry Pi" /proc/cpuinfo 2>/dev/null; then
        log_info "Not running on Raspberry Pi (this may still work)"
    else
        log_success "Raspberry Pi detected"
    fi
}

# Update system packages
update_system() {
    log_info "Updating system packages..."
    apt-get update
    apt-get upgrade -y
    log_success "System updated"
}

# Install system dependencies
install_system_deps() {
    log_info "Installing system dependencies..."

    # alsa-utils provides the arecord and alsamixer tools
    apt-get install -y \
        python3 \
        python3-pip \
        python3-dev \
        python3-venv \
        portaudio19-dev \
        libffi-dev \
        libssl-dev \
        libjpeg-dev \
        zlib1g-dev \
        libfreetype6-dev \
        liblcms2-dev \
        libopenjp2-7 \
        libtiff5 \
        libblas-dev \
        liblapack-dev \
        libatlas-base-dev \
        libgfortran5 \
        swig \
        libasound2-dev \
        alsa-utils \
        wget \
        git \
        curl

    log_success "System dependencies installed"
}

# Install Google AIY Voice Kit
install_aiy_voice() {
    log_info "Installing Google AIY Voice Kit..."

    # Check if AIY is already installed
    if [ -e "/usr/local/bin/aiy" ]; then
        log_info "AIY Voice Kit already installed"
        return
    fi

    # Install AIY packages
    cd /tmp
    wget https://dl.google.com/aiyprojects/raspbian/aiyvoice-buster-20230111.zip
    unzip aiyvoice-buster-*.zip
    cd aiyvoice-*
    ./install.sh

    log_success "Google AIY Voice Kit installed"
}

# Create virtual environment
create_venv() {
    log_info "Creating Python virtual environment..."

    cd "$INSTALL_DIR"
    python3 -m venv venv

    log_success "Virtual environment created"
}

# Install Python dependencies
install_python_deps() {
    log_info "Installing Python dependencies..."

    cd "$INSTALL_DIR"
    source venv/bin/activate

    # Upgrade pip
    pip install --upgrade pip

    # Install requirements
    pip install -r requirements.txt

    # Install additional dependencies for hotword detection
    pip install pvporcupine
    pip install webrtcvad

    log_success "Python dependencies installed"
}

# Create music directory
create_music_dir() {
    log_info "Creating music directory..."

    if [ ! -d "$MUSIC_DIR" ]; then
        mkdir -p "$MUSIC_DIR"
        log_success "Music directory created: $MUSIC_DIR"
    else
        log_info "Music directory already exists"
    fi

    # Set permissions
    chown pi:pi "$MUSIC_DIR"
    chmod 755 "$MUSIC_DIR"
}

# Configure audio
configure_audio() {
    log_info "Configuring audio..."

    # Create/update ALSA configuration
    cat > /etc/asound.conf << 'EOF'
pcm.!default {
    type plug
    slave.pcm "hw:0,0"
}

ctl.!default {
    type hw
    card 0
}
EOF

    log_success "Audio configured"
}

# Install systemd service
install_service() {
    log_info "Installing systemd service..."

    cat > /etc/systemd/system/voice-assistant.service << EOF
[Unit]
Description=Bilingual Voice Assistant
After=network.target sound.target

[Service]
Type=simple
User=pi
WorkingDirectory=$INSTALL_DIR
ExecStart=$INSTALL_DIR/venv/bin/python3 $INSTALL_DIR/main.py --mode run
Restart=always
RestartSec=10
Environment=PYTHONUNBUFFERED=1
Environment=GOOGLE_APPLICATION_CREDENTIALS=/home/pi/.credentials/google-credentials.json

# Logging
StandardOutput=journal
StandardError=journal
SyslogIdentifier=voice-assistant

[Install]
WantedBy=multi-user.target
EOF

    # Enable service
    systemctl daemon-reload
    systemctl enable voice-assistant.service

    log_success "Systemd service installed and enabled"
}

# Configure hotword detection
configure_hotword() {
    log_info "Configuring hotword detection..."

    # Create hotword configuration
    cat > "$INSTALL_DIR/hotword_config.json" << 'EOF'
{
  "hotwords": [
    {
      "keyword": "hey osiris",
      "keyword_zh": "你好 osiris",
      "sensitivity": 0.5,
      "library_path": "resources/porcupine"
    }
  ],
  "audio": {
    "sample_rate": 16000,
    "frame_length": 512
  }
}
EOF

    log_success "Hotword detection configured"
}

# Create sample music directory structure
create_sample_music_structure() {
    log_info "Creating sample music structure..."

    mkdir -p "$MUSIC_DIR/samples"
    mkdir -p "$MUSIC_DIR/playlists"

    # Create a README for music
    cat > "$MUSIC_DIR/README.md" << 'EOF'
# Music Library

Place your MP3 files here. The assistant will automatically detect and index them.

## Supported Formats
- MP3
- WAV
- OGG
- FLAC

## Organization
You can organize music by:
- Artist/Album/Song.mp3
- Genre/Song.mp3
- Or a flat structure: Song.mp3

## Voice Commands
- "Play [song name]" / "播放 [歌曲名]"
- "Pause" / "暂停"
- "Resume" / "继续"
- "Next" / "下一首"
- "Volume up/down" / "音量 大/小"
EOF

    chown -R pi:pi "$MUSIC_DIR"

    log_success "Sample music structure created"
}

# Create startup script
create_startup_script() {
    log_info "Creating startup script..."

    cat > "$INSTALL_DIR/start.sh" << 'EOF'
#!/bin/bash
# Voice Assistant Startup Script

cd "$(dirname "$0")"

# Activate virtual environment
source venv/bin/activate

# Run the assistant
python3 main.py --mode run
EOF

    chmod +x "$INSTALL_DIR/start.sh"
    chown pi:pi "$INSTALL_DIR/start.sh"

    log_success "Startup script created"
}

# Create uninstall script
create_uninstall_script() {
    log_info "Creating uninstall script..."

    cat > "$INSTALL_DIR/uninstall.sh" << 'EOF'
#!/bin/bash
# Uninstall Voice Assistant

echo "Uninstalling Voice Assistant..."

# Stop service
sudo systemctl stop voice-assistant
sudo systemctl disable voice-assistant
sudo rm /etc/systemd/system/voice-assistant.service

# Remove installation
sudo rm -rf /home/pi/voice-assistant

# Remove music directory (optional)
# sudo rm -rf /home/pi/Music

echo "Uninstall complete!"
EOF

    chmod +x "$INSTALL_DIR/uninstall.sh"

    log_success "Uninstall script created"
}

# Final configuration
final_configuration() {
    log_info "Running final configuration..."

    # Copy config if it does not exist yet
    if [ ! -f "$INSTALL_DIR/config.local.json" ]; then
        cp "$INSTALL_DIR/config.json" "$INSTALL_DIR/config.local.json"
        log_info "Created local configuration: config.local.json"
    fi

    # Set permissions
    chown -R pi:pi "$INSTALL_DIR"
    chmod -R 755 "$INSTALL_DIR"

    log_success "Final configuration complete"
}

# Print next steps
print_next_steps() {
    echo ""
    echo -e "${GREEN}=========================================="
    echo " Installation Complete! 🎉"
    echo "==========================================${NC}"
    echo ""
    echo "Next steps:"
    echo "1. Edit configuration:"
    echo "   nano $INSTALL_DIR/config.local.json"
    echo ""
    echo "2. Add your MP3 files to: $MUSIC_DIR"
    echo ""
    echo "3. Test the assistant:"
    echo "   cd $INSTALL_DIR"
    echo "   ./start.sh"
    echo ""
    echo "4. Or run in demo mode:"
    echo "   $INSTALL_DIR/venv/bin/python3 $INSTALL_DIR/main.py --mode demo"
    echo ""
    echo "5. Start the service:"
    echo "   sudo systemctl start voice-assistant"
    echo ""
    echo "6. View logs:"
    echo "   sudo journalctl -u voice-assistant -f"
    echo ""
    echo "Voice commands:"
    echo "  - 'Hey Osiris' / '你好 Osiris' (hotword)"
    echo "  - 'Hello' / '你好'"
    echo "  - 'Play music' / '播放音乐'"
    echo "  - 'What time is it?' / '现在几点?'"
    echo ""
    echo -e "${YELLOW}Note: Make sure your microphone is connected and working!${NC}"
    echo ""
}

# Main installation
main() {
    log "Starting installation..."

    check_root
    check_raspberry_pi
    update_system
    install_system_deps
    # install_aiy_voice  # Commented out - install manually if needed
    create_venv
    install_python_deps
    create_music_dir
    configure_audio
    install_service
    configure_hotword
    create_sample_music_structure
    create_startup_script
    create_uninstall_script
    final_configuration
    print_next_steps

    log_success "Installation completed successfully!"
}

# Run main
main "$@"
287
main.py
Executable file
@@ -0,0 +1,287 @@
#!/usr/bin/env python3
"""
Bilingual Voice Assistant - Main Entry Point
Google AIY Voice Kit V1 - English/Mandarin Support

AI Now Inc - Del Mar Demo Unit
Laboratory Assistant: Claw 🏭
"""

import os
import sys
import json
import logging
import signal
import time
from pathlib import Path
from typing import Optional

# Import components
from assistant import VoiceAssistant
from tts_engine import TTSEngine
from speech_recognizer import BilingualSpeechRecognizer
from music_player import MusicPlayer
from hotword_detector import HotwordDetector

# Configure logging
logging.basicConfig(
    level=logging.INFO,
    format='%(asctime)s - %(name)s - %(levelname)s - %(message)s'
)
logger = logging.getLogger(__name__)


class VoiceAssistantApp:
    """
    Main application class for the bilingual voice assistant.
    """

    def __init__(self, config_path: str = "config.json"):
        self.config_path = Path(config_path)
        self.config = self._load_config()

        # Initialize components
        logger.info("Initializing voice assistant...")
        self.assistant = VoiceAssistant(str(self.config_path))
        self.tts = TTSEngine(str(self.config_path))
        # hotword_config.json lives next to the main config file
        self.hotword_detector = HotwordDetector(
            str(self.config_path.parent / "hotword_config.json")
        )

        # State
        self.is_running = False
        self.current_language = "en"
        self.is_awake = False  # Hotword-activated state

        # Set up signal handlers
        signal.signal(signal.SIGINT, self._signal_handler)
        signal.signal(signal.SIGTERM, self._signal_handler)

        logger.info("Voice assistant initialized with hotword detection")

    def _load_config(self) -> dict:
        """Load configuration."""
        try:
            with open(self.config_path, 'r') as f:
                return json.load(f)
        except FileNotFoundError:
            logger.warning("Config not found, using defaults")
            return {}

    def _signal_handler(self, sig, frame):
        """Handle shutdown signals."""
        logger.info("Shutdown signal received")
        self.is_running = False

    def run(self):
        """Run the voice assistant with hotword detection."""
        logger.info("Starting voice assistant with hotword detection...")
        self.is_running = True

        # Welcome message
        welcome_text = "Voice assistant started. Say 'Hey Osiris' to activate."
        welcome_text_zh = "语音助手已启动。说 '你好 Osiris' 来激活。"

        print("\n" + "="*60)
        print(" 🎤 Bilingual Voice Assistant - AI Now Inc")
        print(" Laboratory Assistant: Claw 🏭")
        print("="*60)
        print(f"\n English: {welcome_text}")
        print(f" 中文:{welcome_text_zh}")
        print("\n Hotword: 'Hey Osiris' / '你好 Osiris'")
        print(" Listening for hotword... (Press Ctrl+C to stop)\n")

        # Speak welcome message
        self.tts.speak(welcome_text, "en")
        time.sleep(0.5)
        self.tts.speak(welcome_text_zh, "zh")

        # Set hotword callback
        self.hotword_detector.set_callback(self._on_hotword_detected)

        # Main loop - listen for hotword
        try:
            while self.is_running:
                # Wait for hotword
                print("⏳ Waiting for 'Hey Osiris'...")
                self.hotword_detector.detect(timeout=None)

                # If we get here, the hotword was detected (or we timed out)
                if not self.is_running:
                    break

                time.sleep(0.5)

        except KeyboardInterrupt:
            logger.info("Interrupted by user")

        finally:
            self.shutdown()

    def _on_hotword_detected(self):
        """Callback invoked when the hotword is detected."""
        print("\n🎉 Hotword detected! Listening for command...")

        # Wake acknowledgement
        awake_text = "Yes? How can I help?"
        awake_text_zh = "在的,有什么可以帮你?"

        self.tts.speak(awake_text, "en")
        time.sleep(0.5)
        self.tts.speak(awake_text_zh, "zh")

        # Now listen for a command (simplified - would use speech recognition)
        try:
            user_input = input("Command: ").strip()

            if user_input:
                # Detect language: any CJK character means Chinese
                lang = "zh" if any('\u4e00' <= c <= '\u9fff' for c in user_input) else "en"

                # Process command
                response, resp_lang = self.assistant.process_command(user_input, lang)

                # Output response
                print(f"Assistant: {response}")

                # Speak response
                self.tts.speak(response, resp_lang)
        except Exception as e:
            logger.error(f"Command processing error: {e}")

    def shutdown(self):
        """Clean shutdown."""
        logger.info("Shutting down...")

        # Stop music if playing
        self.assistant.music_player.stop()

        # Goodbye message
        goodbye_text = "Goodbye!"
        goodbye_text_zh = "再见!"

        self.tts.speak(goodbye_text, "en")
        time.sleep(0.5)
        self.tts.speak(goodbye_text_zh, "zh")

        logger.info("Voice assistant stopped")


def test_mode():
    """Run in test mode with sample commands."""
    print("\n" + "="*60)
    print(" 🧪 Test Mode - Sample Commands")
    print("="*60)

    assistant = VoiceAssistant()
    tts = TTSEngine()

    test_commands = [
        ("hello", "en"),
        ("what time is it", "en"),
        ("play music", "en"),
        ("你好", "zh"),
        ("现在几点", "zh"),
        ("播放音乐", "zh"),
    ]

    for text, lang in test_commands:
        print(f"\nInput: {text} ({lang})")
        response, resp_lang = assistant.process_command(text, lang)
        print(f"Output: {response} ({resp_lang})")
        tts.speak(response, resp_lang)
        time.sleep(1)


def demo_mode():
    """Interactive demo mode."""
    print("\n" + "="*60)
    print(" 🎭 Demo Mode - Try These Commands!")
    print("="*60)
    print("""
English Commands:
  - "hello"
  - "what time is it"
  - "play music"
  - "pause"
  - "stop"
  - "volume up"
  - "ask Claw: what is industrial control?"

中文命令:
  - "你好"
  - "现在几点"
  - "播放音乐"
  - "暂停"
  - "停止"
  - "音量大"
  - "问 Claw:什么是工业控制?"

Type 'quit' to exit
""")

    assistant = VoiceAssistant()
    tts = TTSEngine()

    while True:
        try:
            user_input = input("\nYou: ").strip()

            if user_input.lower() in ['quit', 'exit', '退出']:
                break

            if not user_input:
                continue

            # Detect language: any CJK character means Chinese
            lang = "zh" if any('\u4e00' <= c <= '\u9fff' for c in user_input) else "en"

            # Process command
            response, resp_lang = assistant.process_command(user_input, lang)

            # Output
            print(f"Assistant: {response}")

            # Speak (optional in demo)
            speak_response = input("Speak? (y/n): ").strip().lower()
            if speak_response == 'y':
                tts.speak(response, resp_lang)

        except KeyboardInterrupt:
            break
        except Exception as e:
            logger.error(f"Error: {e}")

    print("\nDemo ended.")


def main():
    """Main entry point."""
    import argparse

    parser = argparse.ArgumentParser(
        description="Bilingual Voice Assistant for Google AIY Voice Kit V1"
    )
    parser.add_argument(
        "--mode",
        choices=["run", "test", "demo"],
        default="demo",
        help="Operation mode: run, test, or demo"
    )
    parser.add_argument(
        "--config",
        default="config.json",
        help="Path to configuration file"
    )

    args = parser.parse_args()

    if args.mode == "test":
        test_mode()
    elif args.mode == "demo":
        demo_mode()
    else:
        app = VoiceAssistantApp(args.config)
        app.run()


if __name__ == "__main__":
    main()
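main.py decides between English and Mandarin with a one-line CJK range check. Pulled out as a standalone sketch of that heuristic (a hypothetical helper, not in the repo), it only covers the basic U+4E00-U+9FFF block, so text using only characters outside that block falls back to English:

```python
def detect_language(text: str) -> str:
    """Return "zh" if the text contains any CJK Unified Ideograph, else "en"."""
    return "zh" if any('\u4e00' <= c <= '\u9fff' for c in text) else "en"
```

Mixed input like "问 Claw:什么是工业控制?" is classified as "zh" because a single CJK character is enough to trip the check.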
312
music_player.py
Executable file
@@ -0,0 +1,312 @@
#!/usr/bin/env python3
"""
Music Player for Google AIY Voice Kit
Supports MP3 playback with voice control.
"""

import os
import json
import logging
import random
from pathlib import Path
from typing import Optional, List, Dict
from datetime import datetime

try:
    import pygame
    HAS_PYGAME = True
except ImportError:
    HAS_PYGAME = False

try:
    from mutagen.mp3 import MP3
    from mutagen.easyid3 import EasyID3
    HAS_MUTAGEN = True
except ImportError:
    HAS_MUTAGEN = False

logger = logging.getLogger(__name__)


class MusicPlayer:
    """
    MP3 music player with voice control support.
    """

    def __init__(self, config_path: str = "config.json"):
        self.config = self._load_config(config_path)
        self.music_path = Path(self.config.get("music", {}).get(
            "library_path", "/home/pi/Music"
        ))
        self.volume = self.config.get("music", {}).get("default_volume", 0.7)
        self.supported_formats = self.config.get("music", {}).get(
            "supported_formats", [".mp3", ".wav", ".ogg", ".flac"]
        )

        self.current_track: Optional[Path] = None
        self.playlist: List[Path] = []
        self.playlist_index: int = 0
        self.is_playing: bool = False
        self.is_paused: bool = False

        # Initialize pygame mixer
        if HAS_PYGAME:
            pygame.mixer.init()
            pygame.mixer.music.set_volume(self.volume)

        # Scan music library
        self.music_library = self._scan_library()

        logger.info(f"MusicPlayer initialized with {len(self.music_library)} tracks")

    def _load_config(self, config_path: str) -> dict:
        """Load configuration from JSON file."""
        try:
            with open(config_path, 'r') as f:
                return json.load(f)
        except FileNotFoundError:
            return {"music": {"library_path": "/home/pi/Music"}}

    def _scan_library(self) -> Dict[str, Path]:
        """
        Scan the music library for supported formats.

        Returns:
            Dictionary mapping track names to file paths
        """
        library = {}

        if not self.music_path.exists():
            logger.warning(f"Music path {self.music_path} does not exist")
            return library

        for root, dirs, files in os.walk(self.music_path):
            for file in files:
                file_path = Path(root) / file
                if file_path.suffix.lower() in self.supported_formats:
                    # Use filename without extension as key
                    track_name = file_path.stem.lower()
                    library[track_name] = file_path
                    logger.debug(f"Added track: {track_name}")

        return library

    def search_tracks(self, query: str) -> List[Path]:
        """
        Search for tracks matching the query.

        Args:
            query: Search query (partial match)

        Returns:
            List of matching track paths
        """
        query_lower = query.lower()
        matches = []

        # Exact match first
        if query_lower in self.music_library:
            return [self.music_library[query_lower]]

        # Partial matches
        for track_name, path in self.music_library.items():
            if query_lower in track_name:
                matches.append(path)

        # If no matches, return all tracks (for "play music")
        if not matches:
            matches = list(self.music_library.values())

        return matches[:10]  # Limit results

    def play(self, track_path: Optional[Path] = None) -> bool:
        """
        Play a track.

        Args:
            track_path: Path to track (None for next in playlist)

        Returns:
            True if playback started successfully
        """
        if not HAS_PYGAME:
            logger.error("Pygame not available")
            return False

        try:
            # If no track specified, use playlist
            if track_path is None:
                if self.playlist and self.playlist_index < len(self.playlist):
                    track_path = self.playlist[self.playlist_index]
                else:
                    logger.warning("No track to play")
                    return False

            if not track_path or not track_path.exists():
                logger.warning(f"Track not found: {track_path}")
                return False

            logger.info(f"Playing: {track_path.name}")
            pygame.mixer.music.load(str(track_path))
            pygame.mixer.music.play()
            self.current_track = track_path
            self.is_playing = True
            self.is_paused = False

            return True

        except Exception as e:
            logger.error(f"Playback error: {e}")
            return False

    def pause(self) -> bool:
        """Pause current playback."""
        if not HAS_PYGAME or not self.is_playing:
            return False

        try:
            pygame.mixer.music.pause()
            self.is_paused = True
            logger.info("Playback paused")
            return True
        except Exception as e:
            logger.error(f"Pause error: {e}")
            return False

    def resume(self) -> bool:
        """Resume paused playback."""
        if not HAS_PYGAME or not self.is_paused:
            return False

        try:
            pygame.mixer.music.unpause()
            self.is_paused = False
            logger.info("Playback resumed")
            return True
        except Exception as e:
            logger.error(f"Resume error: {e}")
            return False

    def stop(self) -> bool:
        """Stop playback."""
        if not HAS_PYGAME:
            return False

        try:
            pygame.mixer.music.stop()
            self.is_playing = False
            self.is_paused = False
            self.current_track = None
            logger.info("Playback stopped")
            return True
        except Exception as e:
            logger.error(f"Stop error: {e}")
            return False

    def next(self) -> bool:
        """Play the next track in the playlist."""
        if not self.playlist:
            return False

        self.playlist_index = (self.playlist_index + 1) % len(self.playlist)
        return self.play()

    def previous(self) -> bool:
        """Play the previous track in the playlist."""
        if not self.playlist:
            return False

        self.playlist_index = (self.playlist_index - 1) % len(self.playlist)
        return self.play()

    def set_volume(self, level: float) -> bool:
        """
        Set volume level.

        Args:
            level: Volume level (0.0 to 1.0)
        """
        if not HAS_PYGAME:
            return False

        level = max(0.0, min(1.0, level))  # Clamp to 0-1
        pygame.mixer.music.set_volume(level)
        self.volume = level
        logger.info(f"Volume set to {level * 100:.0f}%")
        return True

    def create_playlist(self, tracks: List[Path]) -> None:
        """Create a playlist from tracks."""
        self.playlist = tracks
        self.playlist_index = 0
        logger.info(f"Created playlist with {len(tracks)} tracks")

    def get_track_info(self, track_path: Path) -> Dict:
        """
        Get track metadata.

        Args:
            track_path: Path to track file

        Returns:
            Dictionary with track metadata
        """
        info = {
            "path": str(track_path),
            "name": track_path.stem,
            "duration": None,
            "artist": None,
            "album": None
        }

        if HAS_MUTAGEN and track_path.exists():
            try:
                if track_path.suffix.lower() == ".mp3":
                    audio = MP3(track_path, ID3=EasyID3)
                    info["duration"] = audio.info.length
                    if audio.tags:
                        info["artist"] = audio.tags.get("artist", [None])[0]
                        info["album"] = audio.tags.get("album", [None])[0]
            except Exception as e:
                logger.debug(f"Error reading metadata: {e}")

        return info

    def get_status(self) -> Dict:
        """Get current player status."""
        return {
            "is_playing": self.is_playing,
            "is_paused": self.is_paused,
            "current_track": str(self.current_track.name) if self.current_track else None,
            "volume": self.volume,
            "playlist_length": len(self.playlist),
            "playlist_index": self.playlist_index
        }


def main():
    """Test the music player."""
    player = MusicPlayer()

    # Print library stats
    print(f"Music library: {len(player.music_library)} tracks")

    # Test search
    query = "test"
    matches = player.search_tracks(query)
    print(f"Search '{query}': {len(matches)} matches")

    # Test playback
    if player.music_library:
        first_track = list(player.music_library.values())[0]
        print(f"Playing: {first_track.name}")
        player.play(first_track)

        import time
        time.sleep(5)
        player.stop()


if __name__ == "__main__":
    logging.basicConfig(level=logging.INFO)
    main()
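The matching order in `MusicPlayer.search_tracks` (exact name, then substring matches, then the whole library as a "play music" fallback) can be sketched as a standalone function, with a plain dict standing in for the scanned library; the track names and paths below are made up for illustration:

```python
def search_tracks(library: dict, query: str, limit: int = 10) -> list:
    """Exact match first, then substring matches, then everything (capped at limit)."""
    query_lower = query.lower()
    if query_lower in library:
        return [library[query_lower]]
    matches = [path for name, path in library.items() if query_lower in name]
    if not matches:
        matches = list(library.values())  # "play music" fallback
    return matches[:limit]

# Hypothetical library keyed by lowercased filename stem, as _scan_library builds it
library = {
    "blue monday": "/music/blue_monday.mp3",
    "blue in green": "/music/blue_in_green.mp3",
    "take five": "/music/take_five.mp3",
}
```

Note the fallback means an unmatched query like "zzz" still returns tracks, which is what lets the bare "play music" command start playback.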
237
openclaw_client.py
Executable file
@@ -0,0 +1,237 @@
#!/usr/bin/env python3
"""
OpenClaw Client for Voice Assistant
Connects to the OpenClaw gateway for AI responses and command processing.
"""

import os
import json
import logging
import time
import threading
from typing import Optional, Callable, Dict, Any
from pathlib import Path

try:
    import websocket
    HAS_WEBSOCKET = True
except ImportError:
    HAS_WEBSOCKET = False

try:
    import requests
    HAS_REQUESTS = True
except ImportError:
    HAS_REQUESTS = False

logger = logging.getLogger(__name__)


class OpenClawClient:
    """
    Client for OpenClaw gateway communication.
    Supports WebSocket and HTTP APIs.
    """

    def __init__(self, config_path: str = "config.json"):
        self.config = self._load_config(config_path)
        self.ws_url = self.config.get("openclaw", {}).get(
            "ws_url", "ws://192.168.1.100:18790"
        )
        self.api_key = self.config.get("openclaw", {}).get("api_key", "")
        self.enabled = self.config.get("openclaw", {}).get("enabled", True)

        # String annotation so this holds even when websocket-client is absent
        self.ws: Optional["websocket.WebSocketApp"] = None
        self.is_connected = False
        self.message_handlers = []
        self.reconnect_interval = self.config.get("openclaw", {}).get(
            "reconnect_interval", 5
        )

        if HAS_WEBSOCKET and self.enabled:
            self._init_websocket()

        logger.info(f"OpenClawClient initialized (enabled={self.enabled})")

    def _load_config(self, config_path: str) -> dict:
        """Load configuration from JSON file."""
        try:
            with open(config_path, 'r') as f:
                return json.load(f)
        except FileNotFoundError:
            return {"openclaw": {"enabled": False}}

    def _init_websocket(self):
        """Initialize the WebSocket connection."""
        if not HAS_WEBSOCKET:
            logger.warning("websocket-client not installed")
            return

        def on_open(ws):
            logger.info("WebSocket connected")
            self.is_connected = True
            self._on_connect()

        def on_message(ws, message):
            logger.debug(f"Received: {message}")
            self._handle_message(message)

        def on_error(ws, error):
            logger.error(f"WebSocket error: {error}")

        def on_close(ws, close_status_code, close_msg):
            logger.info(f"WebSocket closed: {close_status_code} - {close_msg}")
            self.is_connected = False
            self._reconnect()

        self.ws = websocket.WebSocketApp(
            self.ws_url,
            on_open=on_open,
            on_message=on_message,
            on_error=on_error,
            on_close=on_close
        )

        # Start connection thread
        thread = threading.Thread(target=self.ws.run_forever)
        thread.daemon = True
        thread.start()

    def _on_connect(self):
        """Called when the connection is established."""
        # Subscribe to relevant channels or send authentication
        if self.api_key:
            auth_message = {
                "type": "auth",
                "api_key": self.api_key
            }
            self.send(json.dumps(auth_message))

    def _reconnect(self):
        """Attempt to reconnect after a disconnection."""
        logger.info(f"Reconnecting in {self.reconnect_interval}s...")
        time.sleep(self.reconnect_interval)
        if self.ws:
            self._init_websocket()

    def _handle_message(self, message: str):
        """Handle an incoming message."""
        try:
            data = json.loads(message)
            for handler in self.message_handlers:
                handler(data)
        except json.JSONDecodeError:
            logger.warning(f"Invalid JSON: {message}")

    def send(self, message: str) -> bool:
        """
        Send a message via WebSocket.

        Args:
            message: JSON string to send

        Returns:
            True if sent successfully
        """
        if not self.is_connected or not self.ws:
            logger.warning("Not connected to OpenClaw")
            return False

        try:
            self.ws.send(message)
            return True
        except Exception as e:
            logger.error(f"Send error: {e}")
            return False

    def send_request(self, query: str, context: Optional[Dict] = None) -> Dict:
        """
        Send a query to OpenClaw and get a response.

        Args:
            query: User query string
            context: Optional context dictionary

        Returns:
|
||||
Response dictionary
|
||||
"""
|
||||
if not self.enabled:
|
||||
return {"error": "OpenClaw client disabled"}
|
||||
|
||||
message = {
|
||||
"type": "query",
|
||||
"query": query,
|
||||
"timestamp": time.time()
|
||||
}
|
||||
|
||||
if context:
|
||||
message["context"] = context
|
||||
|
||||
# Send via WebSocket
|
||||
if self.send(json.dumps(message)):
|
||||
# Wait for response (simplified - real implementation needs async handling)
|
||||
time.sleep(0.5)
|
||||
return {"status": "sent"}
|
||||
else:
|
||||
# Fall back to HTTP if WebSocket unavailable
|
||||
return self._http_request(query, context)
|
||||
|
||||
def _http_request(self, query: str, context: Optional[Dict] = None) -> Dict:
|
||||
"""Fallback HTTP request."""
|
||||
if not HAS_REQUESTS:
|
||||
return {"error": "HTTP client not available"}
|
||||
|
||||
try:
|
||||
response = requests.post(
|
||||
f"{self.ws_url.replace('ws://', 'http://').replace('wss://', 'https://')}/api/query",
|
||||
json={"query": query, "context": context},
|
||||
headers={"Authorization": f"Bearer {self.api_key}"},
|
||||
timeout=10
|
||||
)
|
||||
response.raise_for_status()
|
||||
return response.json()
|
||||
except Exception as e:
|
||||
logger.error(f"HTTP request failed: {e}")
|
||||
return {"error": str(e)}
|
||||
|
||||
def add_message_handler(self, handler: Callable[[Dict], None]):
|
||||
"""Add a handler for incoming messages."""
|
||||
self.message_handlers.append(handler)
|
||||
|
||||
def get_status(self) -> Dict:
|
||||
"""Get client status."""
|
||||
return {
|
||||
"enabled": self.enabled,
|
||||
"connected": self.is_connected,
|
||||
"ws_url": self.ws_url
|
||||
}
|
||||
|
||||
|
||||
def main():
|
||||
"""Test the OpenClaw client."""
|
||||
client = OpenClawClient()
|
||||
|
||||
# Add message handler
|
||||
def on_message(data):
|
||||
print(f"Received: {data}")
|
||||
|
||||
client.add_message_handler(on_message)
|
||||
|
||||
# Test connection
|
||||
print(f"OpenClaw Client Status: {client.get_status()}")
|
||||
|
||||
# Test query
|
||||
response = client.send_request("Hello, how are you?")
|
||||
print(f"Response: {response}")
|
||||
|
||||
# Keep alive
|
||||
try:
|
||||
while True:
|
||||
time.sleep(1)
|
||||
except KeyboardInterrupt:
|
||||
print("\nShutting down...")
|
||||
|
||||
|
||||
if __name__ == "__main__":
|
||||
logging.basicConfig(level=logging.INFO)
|
||||
main()
|
||||
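The client prefers WebSocket and falls back to HTTP against the same host; a minimal sketch of the scheme rewrite that `_http_request` performs (the `http_base` helper name is hypothetical):

```python
# Hypothetical helper mirroring _http_request's URL derivation: the
# configured WebSocket URL is rewritten to its HTTP(S) equivalent.
def http_base(ws_url: str) -> str:
    return ws_url.replace("ws://", "http://").replace("wss://", "https://")

print(http_base("ws://192.168.1.100:18790"))   # http://192.168.1.100:18790
print(http_base("wss://gateway.example:443"))  # https://gateway.example:443
```

Replacing `ws://` before `wss://` is safe here because `ws://` is not a substring of `wss://`.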
44
requirements.txt
Normal file
@ -0,0 +1,44 @@
# Google AIY Voice Kit V1 - Bilingual Voice Assistant
# Python Dependencies

# Google AIY (installed from the official repository; not published on PyPI)
git+https://github.com/google/aiyprojects-raspbian.git#egg=aiy-voice-kit

# Google Cloud Services (optional, for cloud speech/TTS)
google-cloud-speech>=2.0.0
google-cloud-texttospeech>=2.0.0

# Audio Processing
pygame>=2.0.0
mutagen>=1.45.0
pyaudio>=0.2.11
webrtcvad>=2.0.10

# Language Detection
langdetect>=1.0.9
langid>=1.1.6

# HTTP/WebSocket Client
requests>=2.28.0
websocket-client>=1.5.0

# Offline Speech Recognition
vosk>=0.3.50          # Offline speech recognition
pocketsphinx>=5.0.0   # Alternative offline recognition

# Hotword Detection
pvporcupine>=2.2.0    # Picovoice Porcupine hotword detection
# snowboy is discontinued and no longer installable from PyPI

# Configuration
python-dotenv>=0.19.0
pyyaml>=6.0

# Logging
colorlog>=6.0.0

# Utilities
fuzzywuzzy>=0.18.0          # Fuzzy string matching for music search
python-Levenshtein>=0.19.0  # Fast string matching
207
speech_recognizer.py
Executable file
@ -0,0 +1,207 @@
#!/usr/bin/env python3
"""
Bilingual Speech Recognizer
Supports English and Mandarin Chinese with automatic language detection.
"""

import os
import json
import logging
from typing import Optional, Tuple
from pathlib import Path

try:
    import aiy.voice
    from aiy import speech
    HAS_AIY = True
except ImportError:
    HAS_AIY = False

try:
    from google.cloud import speech as speech_service
    HAS_GOOGLE_CLOUD = True
except ImportError:
    HAS_GOOGLE_CLOUD = False

try:
    from langdetect import detect
    HAS_LANG_DETECT = True
except ImportError:
    HAS_LANG_DETECT = False

logger = logging.getLogger(__name__)


class BilingualSpeechRecognizer:
    """
    Speech recognizer with automatic English/Mandarin detection.
    """

    def __init__(self, config_path: str = "config.json"):
        self.config = self._load_config(config_path)
        self.language_cache = {}

        if HAS_AIY:
            self.aiy_recognizer = speech.Recognizer()
        else:
            self.aiy_recognizer = None

        logger.info("BilingualSpeechRecognizer initialized")

    def _load_config(self, config_path: str) -> dict:
        """Load configuration from JSON file."""
        try:
            with open(config_path, 'r') as f:
                return json.load(f)
        except FileNotFoundError:
            logger.warning(f"Config file {config_path} not found, using defaults")
            return {
                "speech": {
                    "language": "auto",
                    "recognition_timeout": 5
                }
            }

    def recognize(self, audio_data: bytes, timeout: Optional[int] = None) -> Tuple[Optional[str], str]:
        """
        Recognize speech from audio data.

        Args:
            audio_data: Raw audio bytes
            timeout: Recognition timeout in seconds

        Returns:
            Tuple of (recognized_text, detected_language)
        """
        if timeout is None:
            timeout = self.config.get("speech", {}).get("recognition_timeout", 5)

        # Try Google Cloud Speech first (if available and not in offline mode)
        if HAS_GOOGLE_CLOUD and not self.config.get("speech", {}).get("offline_mode", False):
            try:
                text = self._google_cloud_recognize(audio_data)
                if text:
                    lang = self._detect_language(text)
                    return text, lang
            except Exception as e:
                logger.warning(f"Google Cloud recognition failed: {e}")

        # Fall back to AIY/local recognition
        if self.aiy_recognizer:
            try:
                text = self._aiy_recognize(audio_data)
                if text:
                    lang = self._detect_language(text)
                    return text, lang
            except Exception as e:
                logger.warning(f"AIY recognition failed: {e}")

        # Nothing recognized
        return None, "unknown"

    def _google_cloud_recognize(self, audio_data: bytes) -> Optional[str]:
        """Use Google Cloud Speech-to-Text for recognition."""
        if not HAS_GOOGLE_CLOUD:
            return None

        client = speech_service.SpeechClient()

        # Bilingual recognition: one primary language plus alternatives
        # (RecognitionConfig takes language_code, not language_codes)
        config = speech_service.RecognitionConfig(
            encoding=speech_service.RecognitionConfig.AudioEncoding.LINEAR16,
            sample_rate_hertz=16000,
            language_code="en-US",
            alternative_language_codes=["zh-CN", "zh-TW"],
            enable_automatic_punctuation=True,
        )

        response = client.recognize(
            config=config,
            audio=speech_service.RecognitionAudio(content=audio_data)
        )

        if response.results:
            result = response.results[0]
            if result.alternatives:
                return result.alternatives[0].transcript

        return None

    def _aiy_recognize(self, audio_data: bytes) -> Optional[str]:
        """Use the AIY Voice Kit for recognition."""
        if not self.aiy_recognizer:
            return None

        try:
            # AIY uses Google's speech recognition internally.
            # This is a stub; the actual implementation depends on the AIY version.
            return None
        except Exception as e:
            logger.error(f"AIY recognition error: {e}")
            return None

    def _detect_language(self, text: str) -> str:
        """
        Detect whether text is English or Chinese.

        Returns:
            'en' for English, 'zh' for Chinese, 'unknown' otherwise
        """
        if not text:
            return "unknown"

        # Simple heuristic: check for CJK characters
        chinese_chars = sum(1 for char in text if '\u4e00' <= char <= '\u9fff')
        if chinese_chars > len(text) * 0.3:  # more than 30% Chinese characters
            return "zh"

        # Use langdetect if available
        if HAS_LANG_DETECT:
            try:
                detected = detect(text)
                if detected in ("zh-cn", "zh-tw", "zh"):
                    return "zh"
                elif detected in ("en", "en-us", "en-gb"):
                    return "en"
            except Exception:
                pass

        # Default to English
        return "en"

    def listen_for_hotword(self, callback) -> None:
        """
        Listen for hotword activation.

        Args:
            callback: Function to call when the hotword is detected
        """
        if not HAS_AIY:
            logger.warning("AIY not available, hotword detection disabled")
            return

        # Implementation depends on the AIY version.
        # This is a placeholder for the actual hotword detection.
        logger.info("Hotword detection enabled")


def main():
    """Test the speech recognizer."""
    recognizer = BilingualSpeechRecognizer()

    # Test language detection
    test_texts = [
        "Hello, how are you?",
        "你好,你好吗?",
        "Play some music",
        "播放音乐"
    ]

    for text in test_texts:
        lang = recognizer._detect_language(text)
        print(f"'{text}' -> Language: {lang}")


if __name__ == "__main__":
    logging.basicConfig(level=logging.INFO)
    main()
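The 30% CJK-character threshold in `_detect_language` can be exercised on its own; a standalone sketch of that heuristic (the `looks_chinese` name is illustrative):

```python
# Illustrative re-statement of the CJK heuristic from _detect_language:
# flag text as Chinese when more than 30% of its characters fall in the
# CJK Unified Ideographs block (U+4E00..U+9FFF).
def looks_chinese(text: str, threshold: float = 0.3) -> bool:
    if not text:
        return False
    cjk = sum(1 for ch in text if '\u4e00' <= ch <= '\u9fff')
    return cjk > len(text) * threshold

print(looks_chinese("播放音乐"))          # True
print(looks_chinese("Play some music"))  # False
```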
185
test_setup.py
Executable file
@ -0,0 +1,185 @@
#!/usr/bin/env python3
"""
Test Setup Script
Verifies all components are working correctly.

Run this after installation to ensure everything is configured properly.
"""

import sys
import os
from pathlib import Path

print("\n" + "="*70)
print(" 🧪 Voice Assistant - Setup Test Suite")
print(" AI Now Inc - Del Mar Demo Unit")
print("="*70)
print()

# Test counters
tests_passed = 0
tests_failed = 0

def test_result(name: str, passed: bool, message: str = ""):
    global tests_passed, tests_failed
    status = "✅ PASS" if passed else "❌ FAIL"
    print(f"{status}: {name}")
    if message:
        print(f"   → {message}")
    if passed:
        tests_passed += 1
    else:
        tests_failed += 1
    return passed

# Test 1: Python version
print("1. Checking Python version...")
if sys.version_info >= (3, 8):
    test_result("Python version", True, f"Python {sys.version}")
else:
    test_result("Python version", False, f"Need Python 3.8+, have {sys.version}")

# Test 2: Required packages
# Each entry is (pip distribution name, importable module name); the two
# often differ, e.g. websocket-client installs the module "websocket".
print("\n2. Checking required packages...")
required_packages = [
    ("pygame", "pygame"),
    ("requests", "requests"),
    ("websocket-client", "websocket"),
    ("langdetect", "langdetect"),
    ("mutagen", "mutagen")
]

for pkg, module in required_packages:
    try:
        __import__(module)
        test_result(f"Package: {pkg}", True)
    except ImportError:
        test_result(f"Package: {pkg}", False, f"Install with: pip install {pkg}")

# Test 3: Optional packages
print("\n3. Checking optional packages...")
optional_packages = [
    ("pvporcupine", "Hotword detection"),
    ("webrtcvad", "Voice activity detection"),
    ("google.cloud.speech", "Google Cloud Speech"),
    ("google.cloud.texttospeech", "Google Cloud TTS")
]

for pkg, desc in optional_packages:
    try:
        __import__(pkg)
        test_result(f"Optional: {pkg} ({desc})", True)
    except ImportError:
        test_result(f"Optional: {pkg} ({desc})", False, f"Optional - {pkg}")

# Test 4: Configuration files
print("\n4. Checking configuration...")
config_files = ["config.json", "hotword_config.json"]
for config_file in config_files:
    if Path(config_file).exists():
        test_result(f"Config: {config_file}", True)
    else:
        test_result(f"Config: {config_file}", False, "File not found")

# Test 5: Audio devices
print("\n5. Checking audio devices...")
try:
    import pyaudio
    pa = pyaudio.PyAudio()
    device_count = pa.get_device_count()
    test_result("PyAudio", True, f"Found {device_count} audio devices")

    # Try to get the default input device
    try:
        default_input = pa.get_default_input_device_info()
        test_result("Default input device", True, default_input.get('name', 'Unknown'))
    except Exception:
        test_result("Default input device", False, "No input device found")

    # Try to get the default output device
    try:
        default_output = pa.get_default_output_device_info()
        test_result("Default output device", True, default_output.get('name', 'Unknown'))
    except Exception:
        test_result("Default output device", False, "No output device found")

    pa.terminate()
except ImportError:
    test_result("PyAudio", False, "Install with: pip install pyaudio")
except Exception as e:
    test_result("PyAudio", False, str(e))

# Test 6: Music directory
print("\n6. Checking music directory...")
music_path = Path("/home/pi/Music")
if music_path.exists():
    test_result("Music directory", True, str(music_path))
    # Count files
    music_files = list(music_path.glob("**/*.mp3"))
    test_result("Music files", True, f"Found {len(music_files)} MP3 files")
else:
    test_result("Music directory", False, "Directory not found")

# Test 7: Module imports
print("\n7. Testing module imports...")
modules = [
    "speech_recognizer",
    "music_player",
    "tts_engine",
    "assistant",
    "hotword_detector",
    "openclaw_client"
]

for module in modules:
    try:
        __import__(module)
        test_result(f"Module: {module}", True)
    except ImportError as e:
        test_result(f"Module: {module}", False, str(e))
    except Exception as e:
        test_result(f"Module: {module}", False, f"Error: {e}")

# Test 8: Component initialization
print("\n8. Testing component initialization...")
try:
    from assistant import VoiceAssistant
    assistant = VoiceAssistant()
    test_result("VoiceAssistant", True)
except Exception as e:
    test_result("VoiceAssistant", False, str(e))

try:
    from tts_engine import TTSEngine
    tts = TTSEngine()
    test_result("TTSEngine", True)
except Exception as e:
    test_result("TTSEngine", False, str(e))

try:
    from music_player import MusicPlayer
    player = MusicPlayer()
    test_result("MusicPlayer", True, f"Library: {len(player.music_library)} tracks")
except Exception as e:
    test_result("MusicPlayer", False, str(e))

# Summary
print("\n" + "="*70)
print(f" Test Summary: {tests_passed} passed, {tests_failed} failed")
print("="*70)

if tests_failed == 0:
    print("\n✅ All tests passed! System is ready to use.")
    print("\nNext steps:")
    print("  1. Add MP3 files to /home/pi/Music")
    print("  2. Configure the OpenClaw connection in config.json")
    print("  3. Run: ./start.sh")
    print("  4. Say 'Hey Osiris' to activate!")
else:
    print(f"\n⚠️ {tests_failed} test(s) failed. Please fix the issues above.")
    print("\nCommon fixes:")
    print("  - Missing packages: pip install -r requirements.txt")
    print("  - No audio device: Check microphone/speaker connections")
    print("  - Config missing: Copy config.json to config.local.json")

print()
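A recurring pitfall in package checks like these is that the pip distribution name and the importable module name differ (`websocket-client` installs `websocket`). A small sketch of an explicit mapping (the table here is an illustrative subset):

```python
# pip distribution name -> importable module name (illustrative subset)
IMPORT_NAMES = {
    "websocket-client": "websocket",
    "python-Levenshtein": "Levenshtein",
    "pyyaml": "yaml",
}

def import_name(pip_name: str) -> str:
    # Fall back to the usual dash-to-underscore convention
    return IMPORT_NAMES.get(pip_name, pip_name.replace("-", "_"))

print(import_name("websocket-client"))  # websocket
print(import_name("langdetect"))        # langdetect
```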
267
tts_engine.py
Executable file
@ -0,0 +1,267 @@
#!/usr/bin/env python3
"""
Text-to-Speech Engine
Supports English and Mandarin Chinese with Google Cloud TTS and offline alternatives.
"""

import os
import json
import logging
import subprocess
from typing import Optional, List
from pathlib import Path

try:
    from google.cloud import texttospeech
    HAS_GOOGLE_CLOUD = True
except ImportError:
    HAS_GOOGLE_CLOUD = False

try:
    import pygame
    HAS_PYGAME = True
except ImportError:
    HAS_PYGAME = False

logger = logging.getLogger(__name__)


class TTSEngine:
    """
    Bilingual TTS engine supporting English and Mandarin Chinese.
    """

    def __init__(self, config_path: str = "config.json"):
        self.config = self._load_config(config_path)

        # TTS configuration
        tts_config = self.config.get("tts", {})
        self.english_voice = tts_config.get("english_voice", "en-US-Standard-A")
        self.chinese_voice = tts_config.get("chinese_voice", "zh-CN-Standard-A")
        self.speed = tts_config.get("speed", 1.0)
        self.pitch = tts_config.get("pitch", 0)

        # Initialize the Google Cloud client if available and enabled
        # (gated on the "tts" config section, not "openclaw")
        self.client = None
        if HAS_GOOGLE_CLOUD and tts_config.get("enabled", True):
            try:
                self.client = texttospeech.TextToSpeechClient()
                logger.info("Google Cloud TTS initialized")
            except Exception as e:
                logger.warning(f"Google Cloud TTS not available: {e}")

        # Initialize audio output
        if HAS_PYGAME:
            pygame.mixer.init()

        logger.info("TTSEngine initialized")

    def _load_config(self, config_path: str) -> dict:
        """Load configuration."""
        try:
            with open(config_path, 'r') as f:
                return json.load(f)
        except FileNotFoundError:
            return {"tts": {}}

    def speak(self, text: str, language: str = "en") -> bool:
        """
        Speak text in the specified language.

        Args:
            text: Text to speak
            language: 'en' for English, 'zh' for Chinese

        Returns:
            True if speech succeeded
        """
        try:
            # Generate speech audio
            audio_data = self._synthesize(text, language)

            if audio_data:
                # Play audio
                return self._play_audio(audio_data)

            return False

        except Exception as e:
            logger.error(f"TTS error: {e}")
            return False

    def _synthesize(self, text: str, language: str) -> Optional[bytes]:
        """
        Synthesize speech from text.

        Args:
            text: Text to synthesize
            language: Language code

        Returns:
            Audio data or None
        """
        if self.client and HAS_GOOGLE_CLOUD:
            return self._google_synthesize(text, language)
        else:
            return self._offline_synthesize(text, language)

    def _google_synthesize(self, text: str, language: str) -> Optional[bytes]:
        """Use Google Cloud TTS."""
        if not self.client:
            return None

        # Select voice based on language
        if language == "zh":
            voice_name = self.chinese_voice
            lang_code = "zh-CN"
        else:
            voice_name = self.english_voice
            lang_code = "en-US"

        # Configure synthesis
        voice = texttospeech.VoiceSelectionParams(
            language_code=lang_code,
            name=voice_name,
        )

        audio_config = texttospeech.AudioConfig(
            audio_encoding=texttospeech.AudioEncoding.MP3,
            speaking_rate=self.speed,
            pitch=self.pitch,
        )

        synthesis_input = texttospeech.SynthesisInput(text=text)

        # Perform synthesis
        response = self.client.synthesize_speech(
            request=texttospeech.SynthesizeSpeechRequest(
                input=synthesis_input,
                voice=voice,
                audio_config=audio_config,
            )
        )

        return response.audio_content

    def _offline_synthesize(self, text: str, language: str) -> Optional[bytes]:
        """
        Offline TTS fallback (basic system TTS).

        This is a placeholder - in production, you'd use:
        - espeak for English
        - a Chinese TTS engine for Mandarin
        """
        logger.warning("Using offline TTS (limited quality)")

        # Try system TTS; pass the text as an argument list rather than
        # interpolating it into a shell command, to avoid quoting issues
        try:
            cmd = ["espeak", "--stdout"]
            if language == "zh":
                cmd += ["-v", "zh"]  # Chinese voice (if available)
            cmd.append(text)

            with open('/tmp/tts_output.wav', 'wb') as f:
                subprocess.run(cmd, stdout=f, check=True)

            # Read the file back
            if os.path.exists('/tmp/tts_output.wav'):
                with open('/tmp/tts_output.wav', 'rb') as f:
                    return f.read()
        except Exception as e:
            logger.error(f"Offline TTS failed: {e}")

        return None

    def _play_audio(self, audio_data: bytes) -> bool:
        """
        Play audio data.

        Args:
            audio_data: Audio bytes (MP3 or WAV)

        Returns:
            True if playback succeeded
        """
        if not HAS_PYGAME:
            logger.warning("Pygame not available for audio playback")
            return False

        try:
            # Save to a temp file
            temp_path = "/tmp/tts_audio.mp3"
            with open(temp_path, 'wb') as f:
                f.write(audio_data)

            # Load and play
            pygame.mixer.music.load(temp_path)
            pygame.mixer.music.play()

            # Wait for completion
            while pygame.mixer.music.get_busy():
                pygame.time.wait(100)

            return True

        except Exception as e:
            logger.error(f"Audio playback error: {e}")
            return False

    def speak_sync(self, text: str, language: str = "en",
                   on_complete=None) -> bool:
        """
        Synchronous speech with an optional callback.

        Args:
            text: Text to speak
            language: Language code
            on_complete: Callback function invoked when done

        Returns:
            True if speech succeeded
        """
        result = self.speak(text, language)

        if on_complete:
            on_complete(result)

        return result

    def get_voices(self) -> List[dict]:
        """Get the list of available voices."""
        voices = []

        if self.client and HAS_GOOGLE_CLOUD:
            try:
                response = self.client.list_voices()
                for voice in response.voices:
                    voices.append({
                        "name": voice.name,
                        "language": voice.language_codes,
                        "gender": voice.ssml_gender
                    })
            except Exception as e:
                logger.error(f"Error listing voices: {e}")

        return voices


def main():
    """Test the TTS engine."""
    tts = TTSEngine()

    # Test English
    print("Testing English TTS...")
    tts.speak("Hello! I am your voice assistant.", "en")

    # Test Chinese
    print("Testing Chinese TTS...")
    tts.speak("你好!我是你的语音助手。", "zh")

    # List available voices
    voices = tts.get_voices()
    print(f"\nAvailable voices: {len(voices)}")
    for voice in voices[:5]:  # Show the first 5
        print(f"  - {voice['name']} ({', '.join(voice['language'])})")


if __name__ == "__main__":
    logging.basicConfig(level=logging.INFO)
    main()
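Interpolating user text into a shell string for the espeak fallback is injection-prone; building the command as an argument list avoids quoting entirely. A sketch assuming a standard `espeak` binary (the `espeak_cmd` helper name is hypothetical):

```python
# Build an argv list for espeak; with no shell involved, the text is a
# plain argument and needs no quoting or escaping.
def espeak_cmd(text: str, language: str) -> list:
    cmd = ["espeak", "--stdout"]
    if language == "zh":
        cmd += ["-v", "zh"]  # Mandarin voice, if installed
    cmd.append(text)
    return cmd

print(espeak_cmd('hello "world"', "en"))  # ['espeak', '--stdout', 'hello "world"']
```

Such a list can be handed directly to `subprocess.run(cmd, stdout=f)`.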
53
uninstall.sh
Executable file
@ -0,0 +1,53 @@
#!/bin/bash
#
# Uninstall Voice Assistant
#
# AI Now Inc - Del Mar Demo Unit
#

set -e

echo "=========================================="
echo "  Uninstall Voice Assistant"
echo "=========================================="
echo ""

# Confirm uninstall
read -p "Are you sure you want to uninstall? (y/N): " confirm
if [[ ! $confirm =~ ^[Yy]$ ]]; then
    echo "Uninstall cancelled."
    exit 0
fi

# Stop the service
echo "Stopping service..."
sudo systemctl stop voice-assistant 2>/dev/null || true
sudo systemctl disable voice-assistant 2>/dev/null || true
sudo rm -f /etc/systemd/system/voice-assistant.service

# Remove the installation directory
INSTALL_DIR="/home/pi/voice-assistant"
if [ -d "$INSTALL_DIR" ]; then
    echo "Removing $INSTALL_DIR..."
    sudo rm -rf "$INSTALL_DIR"
fi

# Remove the music directory (optional)
MUSIC_DIR="/home/pi/Music"
if [ -d "$MUSIC_DIR" ]; then
    read -p "Remove music directory ($MUSIC_DIR)? (y/N): " remove_music
    if [[ $remove_music =~ ^[Yy]$ ]]; then
        sudo rm -rf "$MUSIC_DIR"
    fi
fi

# Clean up systemd
sudo systemctl daemon-reload

echo ""
echo "Uninstall complete!"
echo ""
echo "To reinstall, run:"
echo "  cd /path/to/voice-assistant"
echo "  sudo ./install.sh"
echo ""