HuatuoGPT-II
One-stage medical adaptation for open medical language models.
HuatuoGPT-II studies how to adapt large language models to medicine with a cleaner one-stage training recipe, reducing pipeline complexity while preserving strong medical instruction-following and exam-style performance.
Storyline
Early medical LLM pipelines mixed domain corpora, instruction tuning, and dialogue alignment in separate steps, which made reproduction and ablation harder.
HuatuoGPT-II asks whether medical ability can be learned through a cleaner one-stage format that combines medical instructions and general dialogue behavior.
The 7B/34B checkpoints and HuatuoGPT2-SFT-GPT4-140K data turn the idea into a reusable baseline for later medical reasoning and vision work.
Core Idea
Training examples are organized so general instruction following and medical domain adaptation can be learned together rather than through brittle multi-stage handoffs.
The system uses curated medical QA and GPT-assisted medical SFT data to teach medical language, clinical question answering, and medical dialogue behavior.
HuatuoGPT2-7B and HuatuoGPT2-34B provide reusable model checkpoints for downstream research and comparison.
Display Figures
Project Outputs
Training, evaluation, and release notes for the one-stage medical adaptation approach.
RepositoryA medical SFT dataset connected to the HuatuoGPT-II training pipeline.
DatasetOpen Hugging Face checkpoints for Chinese medical language modeling and medical QA experiments.
ModelPaper Trail
Defines the medical dialogue direction and shows why domain data matters for Chinese medical QA.
HuatuoGPTThe focused contribution: simpler one-stage medical adaptation with released checkpoints and SFT data.
PaperLater projects reuse the medical model stack for complex reasoning and medical multimodal understanding.
Full ecosystemWhere It Fits
HuatuoGPT-II is the practical adaptation layer inside the broader HuatuoGPT ecosystem. HuatuoGPT starts the open medical dialogue agenda, HuatuoGPT-II simplifies medical adaptation, HuatuoGPT-o1 pushes complex reasoning, and HuatuoGPT-Vision expands the stack to multimodal medical knowledge.