Explore open access research and scholarly works from NERC Open Research Archive

Advanced Search

Towards deployment-centric multimodal AI beyond vision and language

Liu, Xianyuan ORCID: https://orcid.org/0000-0002-3084-519X; Zhang, Jiayang; Zhou, Shuo ORCID: https://orcid.org/0000-0002-8069-2814; van der Plas, Thijs L. ORCID: https://orcid.org/0000-0001-5490-1785; Vijayaraghavan, Avish ORCID: https://orcid.org/0009-0007-2821-1917; Grishina, Anastasiia; Zhuang, Mengdie; Schofield, Daniel; Tomlinson, Christopher ORCID: https://orcid.org/0000-0002-0903-5395; Wang, Yuhan ORCID: https://orcid.org/0000-0003-0718-3359; Li, Ruizhe ORCID: https://orcid.org/0000-0003-2512-845X; van Zeeland, Louisa ORCID: https://orcid.org/0009-0005-0392-4377; Tabakhi, Sina ORCID: https://orcid.org/0000-0002-3075-7907; Demeocq, Cyndie ORCID: https://orcid.org/0009-0000-7713-3707; Li, Xiang; Das, Arunav ORCID: https://orcid.org/0009-0008-9989-1718; Timmerman, Orlando; Baldwin-McDonald, Thomas ORCID: https://orcid.org/0000-0001-7301-4399; Wu, Jinge; Bai, Peizhen ORCID: https://orcid.org/0000-0003-3027-5518; Al Sahili, Zahraa; Alwazzan, Omnia ORCID: https://orcid.org/0000-0001-7416-1622; Do, Thao N. ORCID: https://orcid.org/0000-0002-0015-892X; Suvon, Mohammod N. I. ORCID: https://orcid.org/0000-0001-9962-315X; Wang, Angeline ORCID: https://orcid.org/0009-0002-0845-6136; Cipolina-Kun, Lucia; Moretti, Luigi A. ORCID: https://orcid.org/0009-0002-6180-0565; Farndale, Lucas ORCID: https://orcid.org/0009-0003-3667-2001; Jain, Nitisha ORCID: https://orcid.org/0000-0002-7429-7949; Efremova, Natalia ORCID: https://orcid.org/0000-0003-4853-9550; Ge, Yan; Varela, Marta; Lam, Hak-Keung; Celiktutan, Oya; Evans, Ben R. ORCID: https://orcid.org/0000-0003-0643-526X; Coca-Castro, Alejandro ORCID: https://orcid.org/0000-0002-9264-1539; Wu, Honghan; Abdallah, Zahraa S.; Chen, Chen; Danchev, Valentin ORCID: https://orcid.org/0000-0002-7563-0168; Tkachenko, Nataliya; Lu, Lei; Zhu, Tingting ORCID: https://orcid.org/0000-0002-1552-5630; Slabaugh, Gregory G.; Moore, Roger K.; Cheung, William K. ORCID: https://orcid.org/0000-0002-7428-2050; Charlton, Peter H.; Lu, Haiping ORCID: https://orcid.org/0000-0002-0349-2181. 2025 Towards deployment-centric multimodal AI beyond vision and language. Nature Machine Intelligence, 7 (10). 1612-1624. 10.1038/s42256-025-01116-5

Abstract
Multimodal artificial intelligence (AI) integrates diverse types of data via machine learning to improve understanding, prediction and decision-making across disciplines such as healthcare, science and engineering. However, most multimodal AI advances focus on models for vision and language data, and their deployability remains a key challenge. We advocate a deployment-centric workflow that incorporates deployment constraints early on to reduce the likelihood of undeployable solutions, complementing data-centric and model-centric approaches. We also emphasize deeper integration across multiple levels of multimodality through stakeholder engagement and interdisciplinary collaboration to broaden the research scope beyond vision and language. To facilitate this approach, we identify common multimodal-AI-specific challenges shared across disciplines and examine three real-world use cases: pandemic response, self-driving car design and climate change adaptation, drawing expertise from healthcare, social science, engineering, science, sustainability and finance. By fostering interdisciplinary dialogue and open research practices, our community can accelerate deployment-centric development for broad societal impact.
Documents
Full text not available from this repository. (Request a copy)
Information
Programmes:
BAS Programmes 2015 > AI Lab (2022-)
Library
Metrics

Altmetric Badge

Dimensions Badge

Share
Add to AnyAdd to TwitterAdd to FacebookAdd to LinkedinAdd to PinterestAdd to Email
View Item