GPT4Tools is a new way to teach language models to use visual tools and models for comprehending and generating images. The researchers' key insight is to leverage advanced LLMs like ChatGPT as powerful "teacher models" to provide visual grounding data for training other LLMs. They propose metrics to evaluate whether models can accurately determine when tools should be used.