Publications
No publications found for this tag.
2026
- CALM Before the STORM: Unlocking Native Reasoning for Optimization ModelingIn ICML 2026 2026
- OnePO: Direct One-stage Policy Optimization for SFT-free Domain AdaptationIn ICML 2026 2026
- CRPO: Character-centric Group Relative Policy Optimization for Role-aware ReasoningIn ICML 2026 2026
- MicroVerse: A Preliminary Exploration Toward a Micro-World SimulationIn ICLR 2026 2026
- LiveClin: A Live Clinical Benchmark without LeakageIn ICLR 2026 2026
- Human or Machine? A Preliminary Turing Test for Speech-to-Speech InteractionIn ICLR 2026 2026
- Does Higher Interpretability Imply Better Utility? A Pairwise Analysis on Sparse AutoencodersIn ICLR 2026 2026
- EchoMind: An Interrelated Multi-level Benchmark for Evaluating Empathetic Speech Language ModelsIn ICLR 2026 2026
- Cut Your Losses! Learning to Prune Paths Early for Efficient Parallel ReasoningIn ACL 2026 2026
- Human or LLM as Standardized Patients? A Comparative Study in Medical EducationIn ACL 2026 2026
- Towards Fine-grained Audio Captioning with Multimodal Contextual FusionIn ACL 2026 2026
- S2S-Arena: Evaluating Paralinguistic Instruction Following in Speech-to-Speech ModelsIn ACL 2026 2026
- It Talks Like a Patient, But Feels Different: Co-Designing AI Standardized PatientsIn CHI 2026 2026
- Can Audio Language Models Listen Between the Lines? A Study on Metaphorical ReasoningIn ACM MM 2026 2026
2025
- TwinMarket: A Scalable Behavioral and Social Simulation for Financial MarketsIn NeurIPS 2025 2025
- Question-Free Fine-Tuning: Towards Efficient and Adaptive Reasoning in LLMsIn NeurIPS 2025 Spotlight 2025
- Video-R1: Reinforcing Video Reasoning in MLLMsIn NeurIPS 2025 2025
- The First Few Tokens Are All You Need: An Efficient Unsupervised Prefix Fine-Tuning MethodIn NeurIPS 2025 2025
- CoRT: Code-integrated Reasoning within ThinkingIn NeurIPS 2025 2025
- Enabling Doctor-Centric Medical AI with LLMs through Workflow-Aligned Tasksnpj Health Systems 2025
- MotionLLM: Understanding Human Behaviors from Human Motions and VideosIEEE TPAMI 2025
- Guidelines for Reporting Studies on Large Language Models in RadiologyRadiology 2025
- RAG-Instruct: Boosting LLMs with Diverse Retrieval-Augmented InstructionsIn EMNLP 2025 Oral 2025
- From Word to World: Evaluate and Mitigate Culture Bias in LLMs via Word Association TestIn EMNLP 2025 2025
- Model Unlearning via Sparse Autoencoder Subspace Guided ProjectionsIn EMNLP 2025 2025
- LongLLaVA: Scaling Multi-modal LLMs to 1000 Images Efficiently via Hybrid ArchitectureIn EMNLP 2025 Findings 2025
- Towards Understanding Fine-Tuning Mechanisms of LLMs via Circuit AnalysisIn ICML 2025 2025
- Self-Evolving Critique Abilities in Large Language ModelsIn COLM 2025 2025
- E2-RAG: Towards Editable Efficient RAG by Editing Compressed KV CachesIn COLM 2025 2025
- Periodical Moving Average Accelerates Gradient Accumulation for Post-TrainingIn UAI 2025 2025
-
- Second Language (Arabic) Acquisition of LLMs via Progressive Vocabulary ExpansionIn ACL 2025 2025
- Exploring Compositional Generalization of Multimodal LLMs for Medical ImagingIn ACL 2025 2025
- Soundwave: Less is More for Speech-Text Alignment in LLMsIn ACL 2025 2025
- CoD, Towards an Interpretable Medical Agent using Chain of DiagnosisIn ACL 2025 Findings 2025
- Towards Medical Complex Reasoning with LLMs through Medical Verifiable ProblemsIn ACL 2025 Findings 2025
- Unlocking LLMs’ Self-Improvement Capacity with Autonomous Learning for Domain AdaptationIn ACL 2025 Findings 2025
- Efficiently Democratizing Medical LLMs for 50 Languages via Mixture of Language Family ExpertsIn ICLR 2025 2025
- Omni-MATH: A Universal Olympiad Level Mathematic Benchmark for Large Language ModelsIn ICLR 2025 2025
- ORLM: A Customizable Framework in Training Large Models for Automated Optimization ModelingOperations Research 2025
- Smurfs: Multi-Agent System using Context-Efficient DFSDT for Tool PlanningIn NAACL 2025 2025
- Is Your LLM Outdated? Evaluating LLMs at Temporal GeneralizationIn NAACL 2025 2025
- MLLM-Bench: Evaluating Multimodal LLMs with Per-sample CriteriaIn NAACL 2025 2025
- Less is More: A Simple yet Effective Token Reduction Method for Efficient Multi-modal LLMsIn COLING 2025 2025
2024
- Large Language Model as a User SimulatorACL 2024 2024
- Synthetic Data (Almost) from Scratch: Generalized Instruction Tuning for Language ModelsTMLR 2024
- Alignment at Pre-training! Towards Native Alignment for Arabic LLMsIn NeurIPS 2024 2024
- GMAI-MMBench: A Comprehensive Multimodal Evaluation Benchmark Towards General Medical AIIn NeurIPS 2024 Datasets and Benchmarks Track 2024
- FinBen: An Holistic Financial Benchmark for Large Language ModelsIn NeurIPS 2024 Datasets and Benchmarks Track 2024
- HuatuoGPT-Vision, Towards Injecting Medical Visual Knowledge into Multimodal LLMs at ScaleIn EMNLP 2024 2024
- Humans or LLMs as the Judge? A Study on Judgement BiasesIn EMNLP 2024 2024
- VLFeedback: A Large-Scale AI Feedback Dataset for Large Vision-Language Models AlignmentIn EMNLP 2024 2024
- HuatuoGPT-II, One-stage Training for Medical Adaption of LLMsIn COLM 2024 2024
- MileBench: Benchmarking MLLMs in Long ContextIn COLM 2024 2024
- MathScale: Scaling Instruction Tuning for Mathematical ReasoningIn ICML 2024 2024
- Rethinking the Uniformity Metric in Self-Supervised LearningIn ICLR 2024 2024
- CMB: A Comprehensive Medical Benchmark in ChineseIn NAACL 2024 2024
- AceGPT, Localizing Large Language Models in ArabicIn NAACL 2024 2024
- Outcome-supervised verifiers for planning in mathematical reasoningIn NAACL 2024 Findings 2024
-
2023
- Pre-trained Language Models in Biomedical Domain: A Survey from Multiscale PerspectiveACM Computing Surveys 2023
- Can Language Models Make Fun? A Case Study in Chinese Comical CrosstalkACL 2023 2023
- Adapting Pre-trained Language Models for Quantum Natural Language ProcessingarXiv preprint arXiv:2302.13812 2023
- Towards Unifying Medical Vision-and-Language Pre-training via Soft PromptsICCV 2023 2023
- Modular Retrieval for Generalization and InterpretationarXiv preprint arXiv:2303.13419 2023
- Effective Open Intent Classification with K-center Contrastive Learning and Adjustable Decision BoundaryIn AAAI 2023
- Phoenix: Democratizing chatgpt across languagesarXiv preprint arXiv:2304.10453 2023
- Lifting the Curse of Capacity Gap in Distilling Language ModelsACL 2023 2023
- One Cannot Stand for Everyone! Leveraging Multiple User Simulators to train Task-oriented Dialogue SystemsACL 2023 2023
- On the Difference of BERT-style and CLIP-style Text EncodersACL 2023 findings 2023
- Few-Shot Class-Incremental Learning for Medical Time Series ClassificationIEEE Journal of Biomedical and Health Informatics 2023
- Natural language reasoning, a surveyarXiv preprint arXiv:2303.14725 2023
- Huatuo-26M, a Large-scale Chinese Medical QA DatasetarXiv preprint arXiv:2305.01526 2023
- Is Information Extraction Solved by ChatGPT? An Analysis of Performance, Evaluation Criteria, Robustness and ErrorsarXiv preprint arXiv:2305.14450 2023
- HuatuoGPT, towards Taming Language Model to Be a DoctorarXiv preprint arXiv:2305.15075 2023
- Injecting Knowledge into Biomedical Pre-trained Models via Polymorphism and Synonymous SubstitutionarXiv preprint arXiv:2305.15010 2023
- Spatio-Temporal Contrastive Learning Enhanced GNNs for Session-based RecommendationACM TOIS 2023
- Med-UniC: Unifying Cross-Lingual Medical Vision-Language Pre-Training by Diminishing BiasIn NeurIPS 2023 2023
- All In One: A Chinese Multi-Modal Dataset for Multi-Affection Detection in ConversationsIn NeurIPS 2023 Datasets and Benchmarks Track 2023
2022
- Exploring extreme parameter compression for pre-trained language modelsIn ICLR 2022 2022
- DPTDR: Deep Prompt Tuning for Dense Passage RetrievalCOLING 2022 2022
- Doge Tickets: Uncovering Domain-general Language Models by Playing Lottery TicketsNLPCC 2022 Best Paper 2022
- MorphTE: Injecting Morphology in Tensorized EmbeddingsNeurIPS 2022 2022
- Document-level Relation Extraction with Relation CorrelationsarXiv preprint arXiv:2212.10171 2022
- Hypoformer: Hybrid decomposition transformer for edge-friendly neural machine translationIn Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing 2022
- Complex-valued Neural Network-based Quantum Language ModelsACM TOIS 2022
2021
- Word2fun, modeling words as functions for dynamic word embeddingsIn NeurIPS 2021 2021
-
2020
- Encoding word order in complex embeddingsIn ICLR 2020 Spotlight 2020
2019
- A Complex-valued Network for MatchingIn NAACL 2019 Best Explainable NLP Paper 2019
- Dynamic content monitoring and exploration using vector spacesIn SIGIR 2019 Doctoral Consortium 2019
- Semantic Hilbert Space for Text Representation LearningIn WWW 2019 2019
- Leveraging Long and Short-term Information in Content-aware Movie Recommendation via Adversarial TrainingIEEE Transactions on Cybernetics 2019
2018
- A Quantum Many-body Wave Function Inspired Language Modeling ApproachIn CIKM 2018 2018
- PLASTIC: Prioritize Long and Short-term Information in Top-n Recommendation using Adversarial TrainingIn IJCAI 2018 2018
- A Multi-task Learning Approach for Image CaptioningIn IJCAI 2018 2018
- End-to-End Quantum-like Language Models with Application to Question AnsweringIn AAAI 2018 2018
2017
- IRGAN: A Minimax Game for Unifying Generative and Discriminative Information Retrieval ModelsIn SIGIR 2017 Best Paper Award Honorable Mentions 2017
2016
- Exploration of quantum interference in document relevance judgement discrepancyEntropy 2016