PromptBio: A Multi-Agent AI Platform for Bioinformatics Data Analysis
PromptBio: A Multi-Agent AI Platform for Bioinformatics Data Analysis
Yang, X.; Shashidhar, K.; Zhang, M.; Gu, W.; Han, B.; Guo, V.; Zheng, J.; Lin, X.; Addoni, C.; Zheng, Y.; Chen, J.; Li, K.; Wang, J.; Yu, L.; Wu, L.; Shi, S.; Wang, W.; Leng, Y.; Ma, Y.
AbstractPromptBio is a modular AI platform for scalable, reproducible, and user-adaptable bioinformatics analysis, powered by generative AI and natural language interaction. It supports three complementary modes of analysis designed to meet diverse research needs. PromptGenie is a multi-agent system that enables stepwise, human-in-the-loop workflows using prevalidated domain-standard tools. Within PromptGenie, specialized agents, including DataAgent, OmicsAgent, AnalysisAgent, and QAgent, collaborate to manage tasks such as data ingestion, pipeline execution, statistical analysis, and interactive summarization. DiscoverFlow provides integrated, automated workflows for large-scale multi-omics analysis, offering end-to-end execution and streamlined orchestration. ToolsGenie complements these modes by dynamically generating executable bioinformatics code for custom, user-defined analyses, enabling flexibility beyond standardized workflows. PromptGenie and DiscoverFlow leverage a suite of domain-specific tools, including Omics Tools for standardized omics pipelines, Analysis Tools for downstream statistical interpretation, and MLGenie for machine learning and multi-omics modeling. We present the design, capabilities, and validation of these components, highlight their integration into automated and customizable workflows, and discuss extensibility, monitoring, and compliance. PromptBio aims to democratize high-throughput bioinformatics through a large language model-powered, natural language understanding, workflow generation and agent orchestration.