Research History

2026 — Year 4 (Ph.D.)

Month	Content
1	1. Logic paper resubmitted to ICML 2. LRM: overall algorithm design and basic engineering plan completed; initial draft formed 3. Projects/Applications: National Natural Science Foundation materials (evidence-based medicine + LLM collaborative decision-making)

2025 — Year 3 (Ph.D.)

Doctoral Year 3; main research focus: LLMs that surpass top human experts, with emphasis on strengthening the logical reasoning capability of large models
Core work: Logic project (140,000+ data construction, 47+ SFT/CPT models trained), benchmark development, meta-evaluation (poetry and other artistic scenarios), o1-like model generalization evaluation
Paper submissions: Logic work ARR→ICLR→ICML in submission; 1 collaborative paper, 1 in progress; disease causality discovery (SCI Q2) finalized; AI4Sports EDMIT paper; survey writing in progress
Honors: Tsinghua University Bodybuilding Competition 5th place, Doctoral Academic Forum 1st place (oral + poster)
Other: Tencent Qingyun Scholarship application, thesis proposal preparation, Wild Goose Migration Plan "Large Model-Driven Digital China Construction" project implementation (11,000-word report)

Month	Content
12	1. LRM literature review 2. ICML submission preparation 3. Bodybuilding competition 5th place
11	1. Logic revisions 2. Survey 3. Algorithm research 4. Thesis proposal preparation 5. Tencent Qingyun Scholarship application 6. Logic training 7. ICLR decision received; Logic supplementary experiments
10	1. ICLR 2026 submission + ICLR reviewing 2. Open-source SFT+RL method reproduction, evaluation, data synthesis experiment design, training results 3. Poster and oral preparation: Doctoral Forum 1st place
9	Logic work submitted to ICLR: supplementary experiments / paper refinement
8	1. Logic: paper revisions 2. Survey: outline; paper collection 3. Other papers: AI4Sports article "EDMIT: An End-to-End Agentic Framework for Enhanced Decision-Making in Interactive Motion Tutoring"; brainstorming: dLLM (diffusion language model), Universal Model (general large model); paper finalized: "Towards Artificial Intelligence for Science: A Case Study of Using ChatGPT for Disease Causality Discovery from Biomedical Literature" (SCI Q2) 4. Other: Party-building paper; Wild Goose Migration Plan: "Digital-Real Integration New Engine · Intelligent Creation · Industrial Future", the Large Model-Driven Digital China Construction project implementation, 11,000-word report + external publicity
7	1. Benchmark work preliminarily completed
6	1. Overall paper writing framework established; core sections draft largely completed 2. Code development core features preliminarily completed 3. 10 metadata items collected
5	1. [Research] Benchmark coding, paper draft 2. [Paper] "Towards Artificial Intelligence for Science: A Case Study of Using ChatGPT for Disease Causality Discovery": considering the response to reviewers
4	1. Logical reasoning training: 28 SFT models, 2 CPT models 2. Revisions to 1 paper 3. 140,000 data entries constructed 4. Preliminary outlines drafted for 2 articles
3	1. Base logic reasoning capability enhancement: 47 models trained
2	1. Base logic reasoning capability enhancement project launched 2. Survey 3. Basic data construction 4. Basic training attempts
1	1. o1-like model generalization evaluation and research — first month 2. LLM research surpassing top human expert levels — research direction determined 3. Meta-evaluation — poetry scenario algorithm implementation

2024 — Year 2 (Ph.D.)

Doctoral Year 2; Advisor: Jie Tang; Intern at Zhipu AI; Teaching assistant for AML & ML course; KEG Large Model Bootcamp instructor (Deep Learning Fundamentals)
Core research: ChatGLM mathematical reasoning (PRM, full RLHF pipeline), multimodal mathematical reasoning, MalayGLM internationalization, ChatGLM mixed Chinese-English response issues, meta-evaluation (poetry and other artistic text evaluation)
Papers and outcomes: 6 submissions (IJCAI, CogSci, ICML, etc.); 2 publications (federated learning, medical knowledge base); AiMed software copyright; Served as session chair at two paper conferences
Honors: Challenge Cup Capital University Student Entrepreneurship Competition Gold (Beijing 1st), National 3rd; Social Practice Gold Award (2nd university-wide); Outstanding Communist Youth League Member, Computer Science Department Outstanding Student Cadre; External expert at Public Security Bureau; 2 municipal government thank-you letters
Scholarships: Social Practice Scholarship, University Huiyan Elite Scholarship (Second Class)
Social work: Computer Science Department Party Branch Secretary, Class Assistant; responsible for university Youth League "Tongxing" platform
Application deployment: Public security system, medical system, LLMDailyDigest website, AML course public website (aminer.cn/aml2024)

Paper Title	Submission Venue
ChatFUV: Chat Chain for Follow-Up Visit — Developing Personalized Follow-up Plans with Chat Chain	IJCAI (AI)
AiMed: Artificial Intelligence Large Language Model for Chinese Medicine	IJCAI (AI)
NewMed: Large Language Modeling Technology Enables Full Process Digital Intelligence in Medical Care	CogSci (Cognitive Science)
MedRad: A Reliable Assisted Decision Making Framework for Medical Large Language Models	ICML (Machine Learning)
Med-Eval: Benchmarks for the Medical Large Language Model	ICML (Machine Learning)
Doctor: The Most Reliable Digital Intelligence Healthcare Large Language Model System	-
OpenMonet: Open Model Orchestration Network	-
MedLib: Research on the Construction of a Knowledge Library for Medical Large Language Modeling	-

Month	Summary
12	KEG Large Model Bootcamp instructor (Deep Learning Fundamentals); Malay LLM; AML course wrap-up: homework baseline, panel, grading standards, final project submission, paper session application, AML book preparation
11	Course public website: https://www.aminer.cn/aml2024; AML computing platform setup; Computing Platform tutorial; Meta Evaluation: Use LLM to evaluate the LLM evaluator; project proposal: poetry and other artistic text evaluation platform
10	Course materials: Reinforcement Learning Survey; Self-Learning: Evaluation & Data & New Scaling Law; Post_Training_Scaling_Laws_Survey revision
9	Enhancing Mathematical Reasoning in Multimodal Large Language Models
8	Social practice summary; math literature review
7	ChatGLM mathematical reasoning | Project progress month 4: math2-prm evaluation fix; Summer doctoral required practice project
6	ChatGLM mathematical reasoning | Project progress month 3: model | PPO training; RLHF model training; model validation | PRM
5	ChatGLM mathematical reasoning | Project progress month 2: model | PRM Inference; model | PRM Training; model | PRM Evaluation
4	ChatGLM mathematical reasoning | Project progress month 1: data construction | automated step-by-step annotation; human feedback algorithm | forward auto-annotation and backward scoring feedback for process reward computation
3	1. ChatGLM internationalization 2. Mixed Chinese-English handling
2	Personal materials preparation
1	Paper submissions × 6

2023 — Year 1 (Ph.D.)

Research focus: medical large models and knowledge engineering
Core outcomes: AiMed 1.0 open-source release, Doctor 1.0 deployment, Med-Eval benchmark construction launched
Paper directions: AiMed, ChatFUV, NewMed, Med-Eval, MedRad and other medical large model work in progress
Courses: Advanced Machine Learning (RLHF, RAG assignments and projects), CSE paper reports (KrNER, PoKG), Chinese Marxism and Contemporary
Data construction: Preprocessed 80,000 electronic medical record entries; built guideline library and medical record library; drug instructions and lab knowledge bases
Other: Department practice review, Zhipu AI events, Journal of Software materials, federated learning paper, etc.

#	Task	Details
1	Model selection	Separate links
2	External knowledge base	Similar patients batch 1: preprocessed 20,000 electronic medical records; SQL export to formatted JSON
3	Knowledge base statistics	AiMed current data
4	External knowledge base	Guidelines
5	Department practice review	Materials preparation
6	CSE paper report	KrNER: PPT
7	CSE paper report	PoKG: PPT
8	Department practice review	On-site review
9	AiMed 1.0 release: copyright	Copyright issues
10	CSE paper report	KrNER: script preparation, video recording
11	CSE paper report	PoKG: script preparation, video recording
12	AiMed 1.0 release: service	Sensitive information filtering
13	CSE paper report	On-site presentation
14	Multi-model chain	Related research survey
15	Advanced Machine Learning	HW1 — Tokenization and compression ratio comparison (5 papers / 5 experiments)
16	AiMed 1.0 release	AiMed 1.0 project open-source
17	AiMed 1.0 release	AiMed 1.0-chat parameters release
18	AiMed 1.0 release	AiMed 1.0-paperabs parameters release
19	AiMed 1.0 release	AiMed 1.0 frontend integration
20	AiMed 1.0 release	AiMed 1.0 backend integration
21	AiMed 1.0 release: parameters	AiMed-Base full-process model parameters release
22	Chinese Marxism and Contemporary	Topic selection
23	Social work	software + Zhipu AI joint event
24	AiMed 1.0 release	Institute of Medical Information integration
25	LLM	Survey
26	AiMed 2.0	Data preparation
27	LLM	Related survey
28	Advanced Machine Learning	Project proposal
29	Advanced Machine Learning	Project proposal
30	AiMed 2.0 training	AiMed 2.0-Chat dialogue model training — round 1
31	AiMed 2.0 training	AiMed 2.0-Chat dialogue model training — round 1 testing
32	Advanced Machine Learning	Project proposal PPT
33	Chinese Marxism and Contemporary	PPT
34	Patent	Senior Qiu patent revision
35	Advanced Machine Learning	Project proposal PPT
36	Chinese Marxism and Contemporary	PPT
37	Patent	Senior Qiu patent revision
38	Advanced Machine Learning	Project proposal PPT
39	Advanced Machine Learning	Project proposal PPT
40	Advanced Machine Learning	Project proposal PPT script
41	Institute of Medical Information	Report PPT
42	Advanced Machine Learning	Project proposal PPT script
43	Advanced Machine Learning	Project discussion
44	Senior Zhang Ruilin	Journal of Software materials preparation
45	Advanced Machine Learning	Group discussion preparation
46	AiMed interface optimization	Sensitive information
47	Doctor 1.0 deployment	Model deployment to Changsha server
48	LLM	Related survey
49	Senior Zhang Ruilin	Journal of Software materials preparation
50	Advanced Machine Learning	Project discussion
51	Doctor	Guideline library and medical record library development
52	AiMed 1.0 release: service	Similar patients batch 2: preprocessed 60,000 electronic medical records; SQL export to formatted JSON
53	Engineering library and retrieval	Drug instructions, lab tests, guideline library
54	Changsha document handling	Switch to Changsha permissions for access
55	Advanced Machine Learning	Assignment 2: RLHF application in multimodal domain
56	Med-eval	Overall implementation plan
57	Med-eval	Colleague task allocation
58	Advanced Machine Learning	Poster
59	Advanced Machine Learning	PPT
60	Med-eval	Overall implementation plan
61	Med-eval	Colleague task allocation
62	Med-eval	Dataset construction: 3 datasets
63	Med-eval	Point-to-point and individual task allocation
64	Med-eval	RAG-related organization
65	Advanced Machine Learning	Assignment 3: RAG
66	Blockchain	Final project
67	Advanced Machine Learning	MedRad: paper maintenance location
68	AiMed 1.0	AiMed: Artificial Intelligence Large Language Model for Chinese Medicine
69	ChatFUV 1.0	ChatFUV: Chat Chain for Follow-Up Visit — Developing Personalized Follow-up Plans with Chat Chain
70	NewMed 1.0	NewMed: Large Language Modeling Technology Enables Full Process Digital Intelligence in Medical Care
71	Med-Eval 1.0	Med-Eval: Benchmarks for the Medical Large Language Model
72	MedRad 1.0	MedRad: paper maintenance location
73	Federated learning paper	Journal of Software materials mailing

2022 — Year 0 (Ph.D.)

Undergraduate graduation + Ph.D. admission: Completed undergraduate thesis "Research on Deep Learning Models for Chinese Electronic Medical Record Named Entity Recognition" (department and university defense); entered Tsinghua University Computer Science Department for Ph.D. in September
Research papers: MoNER (medical-oriented named entity recognition) research paper submission and multiple revisions; survey completed
Competitions and survey: CBLUE/CBLUE2 leaderboard; BioNLP benchmark survey; CCKS/EMNLP conference analysis; blockchain key R&D indicators survey
Summer practice: Tsinghua & Central South University medical-engineering crossover summer camp; Changsha project (AI diagnostic robot, entity recognition, disease prediction, Xiangya Hospital testing)
Ph.D. exploration: Knowledge graph survey; patient-centered knowledge graph; data and knowledge joint-driven intelligent patient management
Other: Personal wiki setup; exchanges with Chinese Academy of Medical Sciences / Institute of Medical Information, CAS

Month	Week	Content
1	1	Undergraduate thesis "Research on Deep Learning Models for Chinese Electronic Medical Record Named Entity Recognition": proposal defense
	2	Personal wiki setup / BERT experiments / survey revision
	3	CBLUE Chinese medical information processing challenge leaderboard / survey revision
2	1	CBLUE2 Chinese medical information processing challenge leaderboard / survey revision
	2	International BioNLP benchmark survey / domestic Chinese medical information processing conferences and competitions survey / medical data source survey / survey revision / experiments
	3	Survey theoretical analysis / experiments
3	1	In-depth reading of 2 papers
	2	In-depth reading of 16 papers
	3	Survey completed / blockchain key R&D indicators analysis survey
	4	Blockchain key R&D indicators analysis survey 2 / experiments
4	1	Undergraduate thesis "Research on Deep Learning Models for Chinese Electronic Medical Record Named Entity Recognition": midterm defense
	2	CCKS conference analysis / EMNLP conference analysis
	3	EMNLP detailed survey / Institute of Medical Information, Chinese Academy of Medical Sciences meeting summary / survey / experiments
	4	Survey finalized / blockchain key R&D indicators supplementary survey
5	1	Research paper initial draft / medical knowledge graph survey / lab homepage survey and design
	2	Lab detail page survey and design / graduation design
	3	Undergraduate thesis "Research on Deep Learning Models for Chinese Electronic Medical Record Named Entity Recognition": finalized
	4	Undergraduate thesis "Research on Deep Learning Models for Chinese Electronic Medical Record Named Entity Recognition": PPT
6	1	Undergraduate thesis "Research on Deep Learning Models for Chinese Electronic Medical Record Named Entity Recognition": defense (department and university); nested NER research
	2	Chronic kidney disease full-course service system BRD v1.0 technical assessment / digital therapy insight report
	3	MoNER: A Novel Remote Supervision-based Medical-oriented Named Entity Recognition Method — research paper writing
	4	Research paper submission / survey revision / NER interface encapsulation and API documentation / lexicon update / Senior Zou work handover / Central South University summer research plan
7	1	Word segmentation lexicon update / research analysis platform analysis / pre-diagnosis interface update / Central South University summer camp cooperation plan / disease prediction prior research summary / Institute of Medical Information cooperation paper directions
	2	Tsinghua & Central South University summer camp medical-engineering crossover project introduction
	3	Research accumulation planning / Changsha project accumulation planning / academic journal and conference survey / Changsha work refinement / entity recognition & disease prediction & diagnostic app deployment architecture / disease-specific knowledge graph research and progress / disease prediction algorithm survey
	4	Nature Medicine journal survey / disease prediction work refinement / Changsha work plan / disease-specific knowledge graph survey
	5	Changsha environment deployment / entity recognition & disease prediction & diagnostic app deployment architecture / data flow / GPU usage / dynamic lexicon update mechanism / handover clarification / patient-centered knowledge graph construction research
8	1	AI diagnostic robot: requirements analysis / overall architecture / detailed design / implementation plan
	2	Named entity recognition model training / disease prediction model training / dynamic lexicon update / Changsha data familiarization
	3	Xiangya Hospital No. 1 testing / Xiangya Hospital No. 2 testing / work handover
	4	Summer league school / electronic medical record data requirements analysis / research paper revision
9	1	Basic workflow established, adjustments underway; goals still unclear. New student orientation / research paper author response
	2	Classes / Open Week report / journal survey / MoNER research paper writing
	3	Classes / knowledge graph survey
	4	Classes / exchange with the Institute of Medical Information, Chinese Academy of Medical Sciences
10	1	Knowledge graph experiments
	2	Classes / MoNER research paper revision 2
	3	Classes / MoNER research paper overall revision plan
	4	Classes / data and knowledge joint-driven intelligent patient management
11	1	Classes / MoNER research paper revision 3 / patient-centered knowledge graph survey
	2	Classes / elderly renal function decline clinical assessment and early warning — NLP exchange meeting / pre-trained language model as knowledge graph research trends
12	1	First half: COVID; Second half: exam period — Classes / MoNER research paper revision 4 / data and knowledge joint-driven intelligent patient management 2.0 / nested entity recognition experiments