MCP (Model Context Protocol) server that utilizes the Google Gemini Vision API to interact with YouTube videos.