Having the ability so speak to and upload images that the agent can reason with would be amazing. Would be even better if the agent can speak back!