A conversational voice agent system that generates immersive, self-guided audio tours based on the user’s location, areas of interest, and tour duration. Built on a multi-agent architecture using OpenAI Agents SDK, real-time information retrieval, and expressive TTS for natural speech output.
-
Orchestrator Agent
Coordinates the overall tour flow, manages transitions, and assembles content from all expert agents. -
History Agent
Delivers insightful historical narratives with an authoritative voice. -
Architecture Agent
Highlights architectural details, styles, and design elements using a descriptive and technical tone. -
Culture Agent
Explores local customs, traditions, and artistic heritage with an enthusiastic voice. -
Culinary Agent
Describes iconic dishes and food culture in a passionate and engaging tone.
- Dynamic content generation based on user-input location
- Real-time web search integration to fetch relevant, up-to-date details
- Personalized content delivery filtered by user interest categories
- Selectable tour length: 15, 30, or 60 minutes
- Time allocations adapt to user interest weights and location relevance
- Ensures well-paced and proportioned narratives across sections
- High-quality audio generated using Gpt-4o Mini Audio
- Clone the GitHub repository
git clone https://github.com/Shubhamsaboo/awesome-llm-apps.git
cd ai_agent_tutorials/ai_audio_tour_agent
- Install the required dependencies:
pip install -r requirements.txt
- Get your OpenAI API Key
- Sign up for an OpenAI account (or the LLM provider of your choice) and obtain your API key.
- Run the Streamlit App
streamlit run ai_audio_tour_agent.py