Skip to content

Changelog

Version history and release notes for VoiceOtaku.


v1.0.0

2026-04-24 — Initial release

The first public release of VoiceOtaku — a voice-driven AI anime recommendation assistant.

Features

Voice Call Interface

  • One-tap call flow from a single screen with no account or registration required
  • Browser-based microphone recording sends audio to the AI pipeline in real time
  • AI response played back as synthesised speech and displayed as transcript simultaneously
  • Audio waveform visualiser shows microphone activity during a call

AI Recommendation Engine

  • Conversational AI understands natural language requests — genres, moods, titles, themes
  • Supports follow-up questions within a single session
  • Detects conversation end signals (RECOMMENDATION_COMPLETE, GOODBYE) and closes the call gracefully

Queue System

  • Single-slot FIFO queue ensures one caller at a time for consistent quality
  • Queue position and estimated wait time displayed in real time, auto-refreshed every few seconds
  • JWT-authenticated session tokens scoped to each queue slot
  • Processing slot released automatically after the call ends or after the 2-minute timeout — abandoned tabs never stall the queue

Call Timer

  • Per-call 2-minute hard cap with a visible countdown timer
  • Warning shown at 20 seconds remaining so callers can wrap up naturally
  • Session ends automatically when the timer expires

Microphone Selector

  • Lists all available audio input devices recognised by the browser
  • Device selection persists for the duration of the session

Transcript Display

  • User speech and AI responses appear on screen as text alongside the voice output
  • Transcript clears between sessions — nothing is stored after the call ends

End Screen

  • Brief summary screen shown after each call before returning to the main screen

Deployment

  • Single docker compose up --build command builds and serves the frontend on port 8080
  • Build-time environment variables configure the N8N backend URL, call duration, queue polling interval, and request timeout

Version Format

VoiceOtaku follows Semantic Versioning:

SegmentMeaning
MAJORBreaking changes to the API or user flow
MINORNew features, backwards-compatible
PATCHBug fixes and minor improvements

Upcoming

See the FAQ for a note on planned improvements, or reach out to the team running your instance with feature requests.

Made with ❤️by Aldrick Bonaobra