Building an Image Analysis Agent: OCR, Object Detection, and Visual QA
Build a Python-based image analysis agent that performs OCR text extraction, object detection, and visual question answering. Includes preprocessing pipelines and structured output formatting.