diff --git a/AI_PROVIDERS_GUIDE.md b/AI_PROVIDERS_GUIDE.md index 5fbb555..482943c 100644 --- a/AI_PROVIDERS_GUIDE.md +++ b/AI_PROVIDERS_GUIDE.md @@ -22,6 +22,12 @@ Your AI Interview Assistant now supports multiple AI providers! Here's how to se - **Recommended Model**: Gemini-1.5-Flash (fast and efficient) - **Cost**: Free tier available, then pay per token +## 🌊 **DeepSeek** +- **Models Available**: DeepSeek-Chat, DeepSeek-Reasoner +- **API Key**: Get from [DeepSeek Platform](https://platform.deepseek.com/) +- **Recommended Model**: DeepSeek-Chat (general use) +- **Cost**: Pay per token usage + ## 🏠 **Ollama (Local)** - **Models Available**: Llama3.2, Llama3.1, Mistral, CodeLlama, Phi3 - **Setup**: Install [Ollama](https://ollama.ai/) locally diff --git a/Plans_and_Todo.md b/Plans_and_Todo.md new file mode 100644 index 0000000..9f933b4 --- /dev/null +++ b/Plans_and_Todo.md @@ -0,0 +1,78 @@ +# Personal Browser Companion - Plans & To-Do + +## Goals +- Start local-first with an option to sync to cloud. +- Online-only operation (LLM required for decisions). +- Auto-start mode during meetings. +- Integrations: calendar, email, Discord, Nextcloud. + +## Phase Plan + +### Phase 1: Local MVP (Foundation) +- Local storage for sessions, summaries, and user profile. +- Meeting/interview modes with manual start and overlay UI. +- Basic memory retrieval: recent session summaries + user profile. +- Audio capture + STT pipeline (mic + tab) and transcript display. +- Privacy controls: store/forget, per-session toggle. + +### Phase 2: Smart Auto-Start +- Detect meeting tabs (Google Meet, Zoom, Teams) and prompt to start. +- Auto-start rules (domain allowlist, time-based, calendar hints). +- Lightweight on-device heuristics for meeting detection. + +### Phase 3: Cloud Sync (Optional) +- Opt-in cloud sync for memory + settings. +- Conflict resolution strategy (last-write wins + merge for summaries). +- Encryption at rest, user-controlled delete/export. + +### Phase 4: Integrations (MCP) +- Calendar: read upcoming meetings, attach context. +- Email: draft follow-ups, summaries. +- Discord: post meeting summary or action items to a channel. +- Nextcloud: store meeting notes, transcripts, and attachments. + +## MVP To-Do (Local) + +### Core +- Define memory schema (profile, session, summary, action items). +- Implement local RAG: index summaries + profile into embeddings. +- Add session lifecycle: start, pause, end, summarize. + +### Audio + STT +- Implement reliable STT for tab audio (server-side if needed). +- Keep mic-only STT as fallback. +- Add device selection + live mic monitor. + +### UI/UX +- Overlay controls: resize, hide/show, minimize. +- Auto-start toggle in side panel. +- Session summary view with “save to memory” toggle. + +### Privacy +- Per-session storage consent prompt. +- “Forget session” button. + +## Integration To-Do (MCP) + +### MCP Server Options +- Build a local MCP server as a bridge for integrations. +- Use MCP tool registry for calendar/email/Discord/Nextcloud. + +### Calendar +- Read upcoming meetings and titles. +- Auto-attach relevant context packs. + +### Email +- Generate follow-up drafts from summary + action items. + +### Discord +- Post meeting summary/action items to a selected channel. + +### Nextcloud +- Upload meeting notes and transcripts. + +## Open Questions +- Preferred cloud provider for sync? +- How long should session memories persist by default? +- Should auto-start be opt-in per domain or global? +- What data should be redacted before sync? diff --git a/README.md b/README.md index 99b474a..2b7f40a 100644 --- a/README.md +++ b/README.md @@ -10,12 +10,16 @@ The AI Interview Assistant is a Chrome extension designed to help users during i ## Features -- Real-time audio capture from the current tab -- Speech-to-text transcription -- Question detection -- AI-powered responses using OpenAI's GPT-3.5-turbo model +- Real-time audio capture (tab, mic, or mixed mode) +- Speech-to-text transcription with live overlay +- AI-powered responses with multiple providers (OpenAI, Anthropic, Google, DeepSeek, Ollama) - Persistent side panel interface - Secure API key storage +- Context management (upload or paste documents for better answers) +- Speed mode (faster, shorter responses) +- Multi-device demo mode for remote access +- Overlay controls: drag, resize, minimize, detach, hide/show +- Mic monitor with input device selection and live level meter ## Installation @@ -50,6 +54,18 @@ The AI Interview Assistant is a Chrome extension designed to help users during i 6. Click "Stop Listening" to end the audio capture. +## Plans & Roadmap + +- See the evolving roadmap and to-do list in `Plans_and_Todo.md`. + +## Recent Improvements + +- Larger, lighter overlay with a visible resize handle. +- Overlay hide/show controls. +- Mic monitor with input device selection and live level meter. +- Auto-open assistant window option after Start Listening. +- Better async message handling in content scripts. + ## Privacy and Security - The extension only captures audio from the current tab when actively listening. diff --git a/assistant.html b/assistant.html index 77ef3bc..35261d1 100644 --- a/assistant.html +++ b/assistant.html @@ -9,12 +9,10 @@