Robust, Efficient, and Adaptable Multimodal Artificial Intelligence for Vertical Applications