HuatuoGPT Medical AI Ecosystem
Open medical language, vision, reasoning, data, and deployment infrastructure for clinical AI.
HuatuoGPT is the lab's flagship medical AI program: an open ecosystem that moves from Chinese medical QA data to doctor-facing chat models, medical complex reasoning, multimodal medical vision, and real-world triage and pre-consultation systems.
Research Storyline
Huatuo-26M and the early HuatuoGPT work created a reusable Chinese medical question-answering substrate for medical instruction tuning.
HuatuoGPT-II simplifies medical adaptation with a one-stage recipe and public checkpoints, making the training pipeline easier to reproduce.
HuatuoGPT-o1 introduces medical complex reasoning data and models, pushing beyond fluent consultation toward step-by-step clinical reasoning.
HuatuoGPT-Vision and PubMedVision connect medical images, reports, and dialogue so the ecosystem can handle multimodal clinical questions.
Longgang triage and pre-consultation deployment turns evaluation pressure back into product and research questions.
What It Builds
HuatuoGPT and HuatuoGPT-II adapt general LLMs to medical consultation, QA, and instruction-following with domain-specific supervision and evaluation.
HuatuoGPT-o1 introduces verifiable medical problems and reasoning SFT data, targeting multi-step diagnosis, treatment reasoning, and medical decision support.
HuatuoGPT-Vision and PubMedVision inject medical visual knowledge into multimodal LLMs for image-grounded medical dialogue and visual QA.
HuatuoGPT-powered intelligent triage and pre-consultation systems have been used in Longgang, Shenzhen public hospital and community health settings.
Project Gallery
Representative Work
A large Chinese medical QA dataset that seeded the early HuatuoGPT training pipeline and made Chinese medical instruction data reusable.
Code and dataMedical complex reasoning models trained with medical-o1-reasoning-SFT and verifiable medical problem data.
ModelA multimodal medical model line connected to PubMedVision, a large image-text medical visual instruction dataset.
ModelPaper Trail
Builds the data foundation for Chinese medical instruction tuning and evaluation.
Code and dataTurns the data foundation into an open medical dialogue model and establishes the first HuatuoGPT model family.
RepositorySimplifies the adaptation pipeline and releases public checkpoints for downstream medical LLM research.
RepositoryAdds verifier-style medical reasoning data and models to make medical outputs more structured and testable.
RepositoryExtends the ecosystem from text-only medical QA to image-grounded medical dialogue and multimodal knowledge.
RepositoryWhy It Matters
- It turns medical AI from a demo into a reproducible stack of datasets, models, benchmarks, and deployment artifacts.
- It connects text-only consultation, visual medical understanding, and explicit reasoning rather than treating them as isolated model families.
- It provides open entry points for the community: GitHub repositories, Hugging Face checkpoints, and medical reasoning datasets.