Skip to content

Research History

2026 — Year 4 (Ph.D.)

Month Content
1 1. Logic paper resubmitted to ICML
2. LRM: overall algorithm design and basic engineering plan completed; initial draft formed
3. Projects/Applications: National Natural Science Foundation materials (evidence-based medicine + LLM collaborative decision-making)

2025 — Year 3 (Ph.D.)

  1. Doctoral Year 3; Main research focus: LLM research surpassing top human expert levels, focusing on large model logical reasoning capability enhancement
  2. Core work: Logic project (140,000+ data construction, 47+ SFT/CPT models trained), benchmark development, meta-evaluation (poetry and other artistic scenarios), o1-like model generalization evaluation
  3. Paper submissions: Logic work ARR→ICLR→ICML in submission; 1 collaborative paper, 1 in progress; disease causality discovery (SCI Q2) finalized; AI4Sports EDMIT paper; survey writing in progress
  4. Honors: Tsinghua University Bodybuilding Competition 5th place, Doctoral Academic Forum 1st place (oral + poster)
  5. Other: Tencent Qingyun Scholarship application, thesis proposal preparation, Wild Goose Migration Plan "Large Model-Driven Digital China Construction" project implementation (11,000-word report)
Month Content
12 1. LRM literature review
2. ICML submission preparation
3. Bodybuilding competition 5th place
11 1. Logic revisions; 2. Survey; 3. Algorithm research; 4. Thesis proposal preparation
1. Tencent Qingyun Scholarship application
2. Logic training
3. ICLR decision received; Logic supplementary experiments
10 1. ICLR 2026 submission + ICLR reviewing
2. Open-source SFT+RL method reproduction, evaluation, data synthesis experiment design, training results
3. Poster and oral preparation: Doctoral Forum 1st place
9 Logic work submitted to ICLR: supplementary experiments / paper refinement
8 1. Logic: paper revisions
2. Survey: outline; paper collection
3. Other papers: AI4Sports article EDMIT: An End-to-End Agentic Framework for Enhanced Decision-Making in Interactive Motion Tutoring
Brainstorming: dLLM diffusion language model, Universal Model general large model
Paper finalized: Towards Artificial Intelligence for Science: A Case Study of Using ChatGPT for Disease Causality Discovery from Biomedical Literature (SCI Q2)
4. Other: Party-building paper; Wild Goose Migration Plan: "Digital-Real Integration New Engine · Intelligent Creation · Industrial Future" — Large Model-Driven Digital China Construction project implementation, 11,000-word report + external publicity
7 1. Benchmark work preliminarily completed
6 1. Overall paper writing framework established; core sections draft largely completed
2. Code development core features preliminarily completed
3. 10 metadata items collected
5 1. [Research] Benchmark coding, paper draft
2. [Paper] Towards Artificial Intelligence for Science: A Case Study of Using ChatGPT for Disease Causality Discovery — review response consideration
4 1. Logical reasoning training: 28 SFT models, 2 CPT models
2. Revisions to 1 paper
3. 140,000 data construction
4. 2 article frameworks preliminarily constructed
3 1. Base logic reasoning capability enhancement: 47 models trained
2 1. Base logic reasoning capability enhancement project launched
2. Survey
3. Basic data construction
4. Basic training attempts
1 1. o1-like model generalization evaluation and research — first month
2. LLM research surpassing top human expert levels — research direction determined
3. Meta-evaluation — poetry scenario algorithm implementation

2024 — Year 2 (Ph.D.)

  1. Doctoral Year 2; Advisor: Jie Tang; Intern at Zhipu AI; Teaching assistant for AML & ML course; KEG Large Model Bootcamp instructor (Deep Learning Fundamentals)
  2. Core research: ChatGLM mathematical reasoning (PRM, full RLHF pipeline), multimodal mathematical reasoning, MalayGLM internationalization, ChatGLM mixed Chinese-English response issues, meta-evaluation (poetry and other artistic text evaluation)
  3. Papers and outcomes: 6 submissions (IJCAI, CogSci, ICML, etc.); 2 publications (federated learning, medical knowledge base); AiMed software copyright; Served as session chair at two paper conferences
  4. Honors: Challenge Cup Capital University Student Entrepreneurship Competition Gold (Beijing 1st), National 3rd; Social Practice Gold Award (2nd university-wide); Outstanding Communist Youth League Member, Computer Science Department Outstanding Student Cadre; External expert at Public Security Bureau; 2 municipal government thank-you letters
  5. Scholarships: Social Practice Scholarship, University Huiyan Elite Scholarship (Second Class)
  6. Social work: Computer Science Department Party Branch Secretary, Class Assistant; responsible for university Youth League "Tongxing" platform
  7. Application deployment: Public security system, medical system, LLMDailyDigest website, AML course public website (aminer.cn/aml2024)
Paper Title Submission Venue
ChatFUV: Chat Chain for Follow-Up Visit — Developing Personalized Follow-up Plans with Chat Chain IJCAI AI
AiMed: Artificial Intelligence Large Language Model for Chinese Medicine IJCAI AI
NewMed: Large Language Modeling Technology Enables Full Process Digital Intelligence in Medical Care CogSci Cognitive Science
MedRad: A Reliable Assisted Decision Making Framework for Medical Large Language Models ICML Machine Learning
Med-Eval: Benchmarks for the Medical Large Language Model ICML Machine Learning
Doctor: The Most Reliable Digital Intelligence Healthcare Large Language Model System -
OpenMonet: Open Model Orchestration Network -
MedLib: Research on the Construction of a Knowledge Library for Medical Large Language Modeling -
Month Summary
12 KEG Large Model Bootcamp instructor — Deep Learning Fundamentals
Malay LLM
AML course conclusion: homework baseline, panel, grading standards, final project submission, paper session application, AML book preparation
11 Course public website: https://www.aminer.cn/aml2024
AML computing platform setup
Computing Platform tutorial
Meta Evaluation: Use LLM to evaluate the LLM evaluator
Project proposal — poetry and other artistic text evaluation platform
10 Reinforcement Learning Survey, Self-Learning: Evaluation & Data & New Scaling Law course materials
Post_Training_Scaling_Laws_Survey survey revision
9 Enhancing Mathematical Reasoning in Multimodal Large Language Models
8 Social practice summary; math literature review
7 ChatGLM mathematical reasoning | Project progress month 4: math2-prm evaluation fix; Summer doctoral required practice project
6 ChatGLM mathematical reasoning | Project progress month 3: model | PPO training; RLHF model training; model validation | PRM
5 ChatGLM mathematical reasoning | Project progress month 2: model | PRM Inference; model | PRM Training; model | PRM Evaluation
4 ChatGLM mathematical reasoning | Project progress month 1: data construction | automated step-by-step annotation; human feedback algorithm | forward auto-annotation and backward scoring feedback for process reward computation
3 1. ChatGLM internationalization 2. Mixed Chinese-English handling
2 Personal materials preparation
1 Paper submissions × 6

2023 — Year 1 (Ph.D.)

  1. Research focus: medical large models and knowledge engineering
  2. Core outcomes: AiMed 1.0 open-source release, Doctor 1.0 deployment, Med-Eval benchmark construction launched
  3. Paper directions: AiMed, ChatFUV, NewMed, Med-Eval, MedRad and other medical large model work in progress
  4. Courses: Advanced Machine Learning (RLHF, RAG assignments and projects), CSE paper reports (KrNER, PoKG), Chinese Marxism and Contemporary
  5. Data construction: Preprocessed 80,000 electronic medical record entries; built guideline library and medical record library; drug instructions and lab knowledge bases
  6. Other: Department practice review, Zhipu AI events, Journal of Software materials, federated learning paper, etc.
# Task Details
1 Model selection Separate links
2 Knowledge base external Similar patients batch 1: preprocessed 20,000 electronic medical records; SQL export to formatted JSON
3 Knowledge base statistics AiMed current data
4 Knowledge base external Guidelines
5 Department practice review Materials preparation
6 CSE paper report KrNER: PPT
7 CSE paper report PoKG: PPT
8 Department practice review On-site review
9 AiMed 1.0 release: copyright Copyright issues
10 CSE paper report KrNER: script preparation, video recording
11 CSE paper report PoKG: script preparation, video recording
12 AiMed 1.0 release: service Sensitive information filtering
13 CSE paper report On-site presentation
14 Multi-model chain Related research survey
15 Advanced Machine Learning HW1 — Tokenization and compression ratio comparison (5 papers / 5 experiments)
16 AiMed 1.0 release AiMed 1.0 project open-source
17 AiMed 1.0 release AiMed 1.0-chat parameters release
18 AiMed 1.0 release AiMed 1.0-paperabs parameters release
19 AiMed 1.0 release AiMed 1.0 frontend integration
20 AiMed 1.0 release AiMed 1.0 backend integration
21 AiMed 1.0 release: parameters AiMed-Base full-process model parameters release
22 Chinese Marxism and Contemporary Topic selection
23 Social work software + Zhipu AI joint event
24 AiMed 1.0 release Institute of Medical Information integration
25 LLM survey
26 AiMed 2.0 Data preparation
27 LLM Related survey
28 Advanced Machine Learning Project proposal
29 Advanced Machine Learning Project proposal
30 AiMed 2.0 training AiMed 2.0-Chat dialogue model training — round 1
31 AiMed 2.0 training AiMed 2.0-Chat dialogue model training — round 1 testing
32 Advanced Machine Learning Project proposal PPT
33 Chinese Marxism and Contemporary PPT
34 Patent Senior Qiu patent revision
35 Advanced Machine Learning Project proposal PPT
36 Chinese Marxism and Contemporary PPT
37 Patent Senior Qiu patent revision
38 Advanced Machine Learning Project proposal PPT
39 Advanced Machine Learning Project proposal PPT
40 Advanced Machine Learning Project proposal PPT script
41 Institute of Medical Information report PPT
42 Advanced Machine Learning Project proposal PPT script
43 Advanced Machine Learning Project discussion
44 Senior Zhang Ruilin Journal of Software materials preparation
45 Advanced Machine Learning Group discussion preparation
46 AiMed interface optimization Sensitive information
47 Doctor 1.0 deployment Model deployment to Changsha server
48 LLM Related survey
49 Senior Zhang Ruilin Journal of Software materials preparation
50 Advanced Machine Learning Project discussion
51 Doctor Guideline library and medical record library development
52 AiMed 1.0 release: service Similar patients batch 2: preprocessed 60,000 electronic medical records; SQL export to formatted JSON
53 Engineering library and retrieval Drug instructions, lab tests, guideline library
54 Changsha document handling Switch to Changsha permissions for access
55 Advanced Machine Learning Assignment 2: RLHF application in multimodal domain
56 Med-eval Overall implementation plan
57 Med-eval Colleague task allocation
58 Advanced Machine Learning Poster
59 Advanced Machine Learning PPT
60 Med-eval Overall implementation plan
61 Med-eval Colleague task allocation
62 Med-eval Dataset construction: 3 datasets
63 Med-eval Point-to-point and individual task allocation
64 Med-eval RAG-related organization
65 Advanced Machine Learning Assignment 3: RAG
66 Blockchain Final project
67 Advanced Machine Learning MedRad: paper maintenance location
68 AiMed 1.0 AiMed: Artificial Intelligence Large Language Model for Chinese Medicine
69 ChatFUV 1.0 ChatFUV: Chat Chain for Follow-Up Visit — Developing Personalized Follow-up Plans with Chat Chain
70 NewMed 1.0 NewMed: Large Language Modeling Technology Enables Full Process Digital Intelligence in Medical Care
71 Med-Eval 1.0 Med-Eval: Benchmarks for the Medical Large Language Model
72 MedRad 1.0 MedRad: paper maintenance location
73 Federated learning paper Journal of Software materials mailing

2022 — Year 0 (Ph.D.)

  1. Undergraduate graduation + Ph.D. admission: Completed undergraduate thesis Research on Deep Learning Models for Chinese Electronic Medical Record Named Entity Recognition (department and university defense); entered Tsinghua University Computer Science Department for Ph.D. in September
  2. Research papers: MoNER (medical-oriented named entity recognition) research paper submission and multiple revisions; survey completed
  3. Competitions and survey: CBLUE/CBLUE2 leaderboard; BioNLP benchmark survey; CCKS/EMNLP conference analysis; blockchain key R&D indicators survey
  4. Summer practice: Tsinghua & Central South University medical-engineering crossover summer camp; Changsha project (AI diagnostic robot, entity recognition, disease prediction, Xiangya Hospital testing)
  5. Ph.D. exploration: Knowledge graph survey; patient-centered knowledge graph; data and knowledge joint-driven intelligent patient management
  6. Other: Personal wiki setup; exchanges with Chinese Academy of Medical Sciences / Institute of Medical Information, CAS
Month Week Content
1 1 Undergraduate thesis Research on Deep Learning Models for Chinese Electronic Medical Record Named Entity Recognition proposal defense
2 Personal wiki setup / BERT experiments / survey revision
3 CBLUE Chinese medical information processing challenge leaderboard / survey revision
2 1 CBLUE2 Chinese medical information processing challenge leaderboard / survey revision
2 International BioNLP benchmark survey / domestic Chinese medical information processing conferences and competitions survey / medical data source survey / survey revision / experiments
3 Survey theoretical analysis / experiments
3 1 In-depth reading of 2 papers
2 In-depth reading of 16 papers
3 Survey completed / blockchain key R&D indicators analysis survey
4 Blockchain key R&D indicators analysis survey 2 / experiments
4 1 Undergraduate thesis Research on Deep Learning Models for Chinese Electronic Medical Record Named Entity Recognition midterm defense
2 CCKS conference analysis / EMNLP conference analysis
3 EMNLP detailed survey / Institute of Medical Information, Chinese Academy of Medical Sciences meeting summary / survey / experiments
4 Survey finalized / blockchain key R&D indicators supplementary survey
5 1 Research paper initial draft / medical knowledge graph survey / lab homepage survey and design
2 Lab detail page survey and design / graduation design
3 Undergraduate thesis Research on Deep Learning Models for Chinese Electronic Medical Record Named Entity Recognition finalized
4 Undergraduate thesis Research on Deep Learning Models for Chinese Electronic Medical Record Named Entity Recognition PPT
6 1 Undergraduate thesis Research on Deep Learning Models for Chinese Electronic Medical Record Named Entity Recognition defense (department and university); nested NER research
2 Chronic kidney disease full-course service system BRD v1.0 technical assessment / digital therapy insight report
3 MoNER: A Novel Remote Supervision-based Medical-oriented Named Entity Recognition Method — research paper writing
4 Research paper submission / survey revision / NER interface encapsulation and API documentation / lexicon update / Senior Zou work handover / Central South University summer research plan
7 1 Word segmentation lexicon update / research analysis platform analysis / pre-diagnosis interface update / Central South University summer camp cooperation plan / disease prediction prior research summary / Institute of Medical Information cooperation paper directions
2 Tsinghua & Central South University summer camp medical-engineering crossover project introduction
3 Research accumulation planning / Changsha project accumulation planning / academic journal and conference survey / Changsha work refinement / entity recognition & disease prediction & diagnostic app deployment architecture / disease-specific knowledge graph research and progress / disease prediction algorithm survey
4 Nature Medicine journal survey / disease prediction work refinement / Changsha work plan / disease-specific knowledge graph survey
5 Changsha environment deployment / entity recognition & disease prediction & diagnostic app deployment architecture / data flow / GPU usage / dynamic lexicon update mechanism / handover clarification / patient-centered knowledge graph construction research
8 1 AI diagnostic robot: requirements analysis / overall architecture / detailed design / implementation plan
2 Named entity recognition model training / disease prediction model training / dynamic lexicon update / Changsha data familiarization
3 Xiangya Hospital No. 1 testing / Xiangya Hospital No. 2 testing / work handover
4 Summer league school / electronic medical record data requirements analysis / research paper revision
9 1 * Basic workflow established, adjustments underway * Goals still unclear — New student orientation / research paper author response
2 Classes / Open Week report / journal survey / MoNER research paper writing
3 Classes / knowledge graph survey
4 Classes / Institute of Medical Information, CAS exchange
10 1 Knowledge graph experiments
2 Classes / MoNER research paper revision 2
3 Classes / MoNER research paper overall revision plan
4 Classes / data and knowledge joint-driven intelligent patient management
11 1 Classes / MoNER research paper revision 3 / patient-centered knowledge graph survey
2 Classes / elderly renal function decline clinical assessment and early warning — NLP exchange meeting / pre-trained language model as knowledge graph research trends
12 1 First half: COVID; Second half: exam period — Classes / MoNER research paper revision 4 / data and knowledge joint-driven intelligent patient management 2.0 / nested entity recognition experiments