Team


Name: 曹亚男 Yanan Cao
Title: Research Fellow
Education: Ph.D.
Research Direction: NLP, SNA, ML
Email: caoyanan@iie.ac.cn

Introduction

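Yanan Cao is Deputy Director of the Fourth Faculty of the Institute of Information Engineering, Chinese Academy of Sciences, a research fellow, a professor at the University of Chinese Academy of Sciences, a doctoral supervisor, and the principal investigator of a Young Scientist Project under the National Key R&D Program. Her research focuses on information content security, including text content security and social media analysis, and she works on applying natural language processing, graph neural networks and related techniques to content security problems involving text, relations and other information in cyberspace. She was selected as a Youth Star of the Institute of Information Engineering, Chinese Academy of Sciences in 2016, joined the Youth Innovation Promotion Association of the Chinese Academy of Sciences in 2018, and was named an Outstanding Teacher of UCAS in 2021 and an Outstanding Supervisor of the Chinese Academy of Sciences in 2022. She won first place in the Baidu Star competition in 2017, the sole Best Paper Award at PAKDD 2020, first place in the CCL 2022 news storyline analysis evaluation, and first place in the NLPCC 2025 shared task on detecting LLM-generated text. She also received a Second Prize of the Science and Technology Progress Award of the Chinese Institute of Electronics in 2022. To date she has published more than 50 papers in CCF-A/B conferences and journals such as WWW, AAAI, ICDM and CIKM. She has served for many consecutive years as an area chair for ACL and COLING and as a senior program committee member for AAAI and IJCAI, and she is a reviewer for international journals such as TKDE and TOIS. She has led more than 20 national and provincial or ministerial research projects, including projects under the National Key R&D Program, the National Natural Science Foundation of China, national defense pre-research, and the National Information Security Special Program, and has extensive research and project experience. At UCAS she teaches the courses "Natural Language Processing", "Natural Language Processing in Practice" and "Deep Learning and Natural Language Processing", which are well liked and highly rated by students.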

Prof. Yanan Cao, Deputy Director of the Fourth Faculty, is a research fellow at the Institute of Information Engineering, Chinese Academy of Sciences (IIE, CAS) and a Ph.D. supervisor at the University of Chinese Academy of Sciences (UCAS). She received her bachelor's degree from the School of Computer Science and Technology of Shandong University in 2006 and her Ph.D. from the Institute of Computing Technology of the Chinese Academy of Sciences in 2012. Her research directions include natural language processing (NLP), social network analysis (SNA) and machine learning (ML), with specific interests in knowledge graph construction and reasoning, text generation, and graph neural networks. She was selected as a Youth Star of the Institute of Information Engineering, Chinese Academy of Sciences in 2016 and as a member of the Youth Innovation Promotion Association of CAS in 2018. She won first place in Astar in 2017, third place in the OAG-WHOISWHO competition in 2019, and the Best Paper Award of PAKDD in 2020. She has published more than 60 EI- and SCI-indexed papers, many of them in top international conferences and journals including WWW, AAAI, ICDM and CIKM. She has served for many years on the program committees of ACL, AAAI, EMNLP and CIKM and on the senior program committee of IJCAI. She is a peer reviewer for domestic and international journals such as TKDE, TOIS, FCS, Journal of Computer Science and Technology, and Journal of Chinese Information Processing, and serves as a review expert for the National Natural Science Foundation of China and the Beijing Municipal Natural Science Foundation. She has led and participated in a number of projects supported by the National Natural Science Foundation of China, the National Key Research and Development Program of China, and the Strategic Priority Research Program of the Chinese Academy of Sciences, and has extensive research and project experience.
She has taught the postgraduate courses "Deep Learning and Natural Language Processing" and "Intelligent Question Answering Frontier Technology" at the University of Chinese Academy of Sciences, both of which were well received by students.

Social Service

1.ACL 2022 AC, AAAI 2022 SPC, KDD 2022 PC
2.Program committee member of ACL, IJCAI, EMNLP, WWW, SIGIR and other international conferences for many years
3.Member of CCF YOCSEF and of the Youth Working Committee of the Chinese Information Processing Society of China (CIPS)
4.Review expert of the National Natural Science Foundation of China and the Beijing Municipal Natural Science Foundation

Awards and Honors

1.Special Contribution Award of the National Engineering Laboratory for Information Content Security Technology, 2013
2.Outstanding Communist Party Member of the Institute of Information Engineering, Chinese Academy of Sciences, 2015
3.Youth Star of the Institute of Information Engineering, Chinese Academy of Sciences, 2016
4.1st place in Astar, 2017
5.Outstanding Staff of the Institute of Information Engineering, Chinese Academy of Sciences, 2017
6.Member of the Youth Innovation Promotion Association of the Chinese Academy of Sciences, 2018
7.3rd place in OAG-WHOISWHO, 2019
8.Best Paper Award of PAKDD, 2020
9.Excellent Staff at the office level of the Institute of Information Engineering, Chinese Academy of Sciences, 2021

Representative Works

ECAI-2025 CCF-B Yixuan Nan, Xixun Lin, Yanmin Shang, Zhuofan Li, Can Zhao, Yanan Cao. RANA: Robust Active Learning for Noisy Network Alignment. [pdf][code]
NLPCC-2025 CCF-C Zhuoshang Wang, Yubing Ren, Guoyu Zhao, Xiaowei Zhu, Hao Li, Yanan Cao. EnsemJudge: Enhancing Reliability in Chinese LLM-Generated Text Detection through Diverse Model Ensembles. [pdf][code]
ECML-2025 CCF-B Nan Sun, Xixun Lin*, Zhiheng Zhou, Yanmin Shang, Zhenlin Cheng, Yanan Cao. Evidential Spectrum-Aware Contrastive Learning for OOD Detection in Dynamic Graphs. [pdf]
ACL-2025 CCF-A Yidan Wang, Yubing Ren, Yanan Cao, Binxing Fang. From Trade-off to Synergy: A Versatile Symbiotic Watermarking Framework for Large Language Models. [pdf]
ACL-2025 CCF-A Yidan Wang, Yanan Cao, Yubing Ren, Fang Fang, Zheng Lin, Binxing Fang. PIG: Privacy Jailbreak Attack on LLMs via Gradient-based Iterative In-Context Optimization. [pdf]
ACL-2025 CCF-A Lanxue Zhang, Yanan Cao, Yuqiang Xie, Fang Fang, Yangxi Li. Dynamic Evaluation with Cognitive Reasoning for Multi-turn Safety of Large Language Models. [pdf]
ACL-2025 CCF-A Xiaowei Zhu, Yubing Ren, Yanan Cao, Xixun Lin, Fang Fang, Yangxi Li. Reliably Bounding False Positives: A Zero-Shot Machine-Generated Text Detection Framework via Multiscaled Conformal Prediction. [pdf]
KDD-2025 CCF-A Yongxuan Wu, Yang Liu, Xixun Lin, Hong Zhou, Yanan Cao, Lixin Zou, Yanmin Shang, Yanbing Liu. FairCDR: Transferring Fairness and User Preferences for Cross-Domain Recommendation. [pdf]
TOIS-2025 CCF-A Xixun Lin, Rui Liu, Yanan Cao, Lixin Zou, Qian Li, Yongxuan Wu, Yang Liu, Dawei Yin, Guandong Xu. Contrastive Modality-Disentangled Learning for Multimodal Recommendation. [pdf]
WWW-2025 CCF-A Hao Li, Yubing Ren, Yanan Cao, Yingjie Li, Fang Fang, Zheng Lin, Shi Wang. Bridging the Gap: Aligning Language Model Generation with Structured Information Extraction via Controllable State Transition. [pdf]
WWW-2025 CCF-A Xixun Lin, Yanan Cao, Nan Sun, Lixin Zou, Chuan Zhou, Peng Zhang, Shuai Zhang, Ge Zhang, Jia Wu. Conformal Graph-level Out-of-distribution Detection with Adaptive Data Augmentation. [pdf]
AAAI-2025 CCF-A Chuancheng Song, Xixun Lin, Hanyang Shen, Yanmin Shang, Yanan Cao. UniFORM: Towards Unified Framework for Anomaly Detection on Graphs. (Oral) [pdf]
WWWJ-2024 CCF-B Yanan Cao, Xixun Lin, Yongxuan Wu, Fengzhao Shi, Yanmin Shang, Qingfeng Tan, Chuan Zhou, Peng Zhang. A Data-centric Framework of Improving Graph Neural Networks for Knowledge Graph Embedding. [pdf][code]
WWWJ-2024 CCF-B Fengzhao Shi, Yanan Cao, Ren Li, Xixun Lin, Yanmin Shang, Chuan Zhou, Jia Wu, Shirui Pan. VR-GNN: Variational Relation Vector Graph Neural Network for Modeling Homophily and Heterophily. [pdf][code]
ICML-2024 CCF-A Xixun Lin, Wenxiao Zhang, Fengzhao Shi, Chuan Zhou, Lixin Zou, Xiangyu Zhao, Dawei Yin, Shirui Pan, Yanan Cao. Graph Neural Stochastic Diffusion for Estimating Uncertainty in Node Classification. [pdf]
ACL-2024 CCF-A Yubing Ren, Ping Guo, Yanan Cao, Wei Ma. Subtle Signatures, Strong Shields: Advancing Robust and Imperceptible Watermarking in Large Language Models. [pdf][code]
NAACL-2024 CCF-B Yanhe Fu, Yanan Cao*, Qingyue Wang, Yi Liu. TISE: A Tripartite In-context Selection Method for Event Argument Extraction. [pdf][code]
WSDM-2024 CCF-B Yu Liu, Yanan Cao, Shi Wang, Qingyue Wang, Guanqun Bi. Generative Models for Complex Logical Reasoning over Knowledge Graphs. [pdf][code]
COLING-2024 CCF-B Yubing Ren, Yanan Cao, Hao Li, Yingjie Li, Zixuan Ma, Fang Fang, Ping Guo and Wei Ma. DEIE: Benchmarking Document-level Event Information Extraction with a Large-scale Chinese News Dataset. [pdf][code]
ICASSP-2024 CCF-B Hao Li, Yanan Cao, Yubing Ren, Fang Fang, Lanxue Zhang, Yingjie Li, Shi Wang. Sorting, Reasoning, and Extraction: an Easy-to-Hard Reasoning Framework for Document-level Event Argument Extraction. [pdf][code]
EMNLP-2023 CCF-B Hao Li, Yanan Cao, Yubing Ren, Fang Fang, Lanxue Zhang, Yingjie Li, Shi Wang. Intra-Event and Inter-Event Dependency-Aware Graph Network for Event Argument Extraction. [pdf]
NLPCC-2023 CCF-C Yanhe Fu, Yi Liu, Yanan Cao, Yubing Ren, Qingyue Wang, Fang Fang, Cong Cao. A Multi-granularity Similarity Enhanced Model for Implicit Event Argument Extraction. [pdf][code]
ACL-2023 CCF-A Yi Liu, Yuan Tian, Jianxun Lian, Xinlong Wang, Yanan Cao, Fang Fang, Wen Zhang, Haizhen Huang, Denvy Deng, Qi Zhang. Towards Better Entity Linking with Multi-View enhanced Distillation. [pdf][code]
ACL-2023 CCF-A Yubing Ren, Yanan Cao, Ping Guo, Fang Fang, Wei Ma, Zheng Lin. Retrieve-and-Sample: Document-level Event Argument Extraction via Hybrid Retrieval Augmentation. [pdf]
ACL-2023 CCF-A Guanqun Bi, Lei Shen, Yanan Cao, Meng Chen, Yuqiang Xie, Zheng Lin, Xiaodong He. DiffusEmp: A Diffusion Model-Based Framework with Multi-Grained Control for Empathetic Response Generation. [pdf][code]
ACL-2023 CCF-A Qingyue Wang, Liang Ding, Yanan Cao, Yibing Zhan, Zheng Lin, Shi Wang, Dacheng Tao, Li Guo. Divide, Conquer, and Combine: Mixture of Semantic-Independent Experts for Zero-Shot Dialogue State Tracking. [pdf][code]
WWW-2023 CCF-A Yuchen Zhou, Yanan Cao, Yongchao Liu, Yanmin Shang, Peng Zhang, Zheng Lin, Yun Yue, Baokun Wang, Xing Fu, Weiqiang Wang. Multi-Aspect Heterogeneous Graph Augmentation. [pdf][code]
TOIS-2023 CCF-A Yuchen Zhou, Yanan Cao, Yanmin Shang, Chuan Zhou, Shirui Pan, Zheng Lin, Qian Li. Explainable Hyperbolic Temporal Point Process for User-Item Interaction Sequence Generation. [pdf][code]
ACL-2022 CCF-A Ruipeng Jia, Xingxing Zhang, Yanan Cao*, Shi Wang, Zheng Lin, Furu Wei. Neural Label Search for Zero-Shot Multi-Lingual Extractive Summarization. [pdf][code]
WWW-2022 CCF-A Fengzhao Shi, Yanan Cao, Yanmin Shang*, Yuchen Zhou, Chuan Zhou, Jia Wu. H2-FDetector: A GNN-based Fraud Detector with Homophilic and Heterophilic Connections. [pdf][code]
AAAI-2022 CCF-A Ren Li, Yanan Cao, Qiannan Zhu, Guanqun Bi, Fang Fang*, Yi Liu, Qian Li. How Does Knowledge Graph Embedding Extrapolate to Unseen Data: a Semantic Evidence View. [pdf][code]
ICDM-2022 CCF-B Yuchen Zhou, Yanan Cao, Yanmin Shang, Chuan Zhou, Chuancheng Song, Fengzhao Shi, Qian Li. Task-level Relations Modelling for Graph Meta-learning. [pdf][code]
WWWJ-2022 CCF-B Yuchen Zhou, Yanmin Shang, Yanan Cao, Qian Li, Chuan Zhou, Guandong Xu. API-GNN: Attribute Preserving Oriented Interactive Graph Neural Network. [pdf]
COLING-2022 CCF-B Yubing Ren, Yanan Cao, Fang Fang, Ping Guo, Zheng Lin, Wei Ma, Yi Liu. CLIO: Role-interactive Multi-event Head Attention Network for Document-level Event Extraction. [pdf]
COLING-2022 CCF-B Qingyue Wang, Yanan Cao, Piji Li and Li Guo. Slot Dependency Modeling for Zero-shot Cross-domain Dialogue State Tracking. [pdf]
IJCNN-2021 CCF-C Qingyue Wang, Yanan Cao, Junyan Jiang, Yafang Wang and Li Guo. Incorporating Specific Knowledge into End-to-End Task-oriented Dialogue Systems. [pdf]
AAAI-2021 CCF-A Ruipeng Jia, Yanan Cao*, Haichao Shi, Fang Fang, Pengfei Yin, Shi Wang. Flexible Non-Autoregressive Extractive Summarization with Threshold: How to Extract a Non-Fixed Number of Summary Sentences? [pdf][code]
ACL-2021 CCF-A Ruipeng Jia, Yanan Cao*, Fang Fang, Yuchen Zhou, Zheng Fang, Yanbing Liu, Shi Wang. Deep Differential Amplifier for Extractive Summarization. [pdf][code]
ICASSP-2021 CCF-B Hengzhu Tang, Yanan Cao*, Zhenyu Zhang, Ruipeng Jia, Fang Fang, Shi Wang. Multi-Granularity Heterogeneous Graph For Document-level Relation Extraction.
EMNLP-2021 CCF-B Zheng Fang, Yanan Cao, Tai Li, Ruipeng Jia, Fang Fang, Yanmin Shang, Yuhai Lu. TEBNER: Domain Specific Named Entity Recognition with Type Expanded Boundary-aware Network.
WWWJ-2021 CCF-B Xiaoxue Li, Yanan Cao*, Yanmin Shang, Yangxi Li, Qian Li, Guandong Xu. RLINK: Deep Reinforcement Learning for User Identity Linkage. [pdf]
EMNLP-2020 CCF-B Ruipeng Jia, Yanan Cao*, Hengzhu Tang, Fang Fang, Cong Cao, Shi Wang. Neural Extractive Summarization with Hierarchical Attentive Heterogeneous Graph Network. [pdf][code]
CIKM-2020 CCF-B Ruipeng Jia, Yanan Cao*, Haichao Shi, Fang Fang, Yanbing Liu, Jianlong Tan. DistilSum: Distilling the Knowledge for Extractive Summarization. [pdf][code]
WWW-2020 CCF-A Zheng Fang, Yanan Cao*, Ren Li, Zhenyu Zhang, Yanbing Liu, Shi Wang. High quality Candidate Generation and Sequential Graph Attention Network for Entity Linking. [pdf][code]
AAAI-2020 CCF-A Xiaoxue Li, Yanan Cao*, Yanmin Shang, Yangxi Li, Yanbing Liu, Jianlong Tan. Type-aware Anchor Link Prediction across Heterogeneous Networks based on Graph Attention Network. [pdf]
WWW-2019 CCF-A Zheng Fang, Yanan Cao*, Qian Li, Dongjie Zhang, Zhenyu Zhang, Yanbing Liu. Joint Entity Linking with Deep Reinforcement Learning. [code]
ICME-2019 CCF-B Yanmin Shang, Zhezhou Kang, Yanan Cao*, Yanbing Liu, Jianlong Tan. PAAE: A Unified Framework for Predicting Anchor Links with Adversarial Embedding.
PAKDD-2020 CCF-C Hengzhu Tang, Yanan Cao, Zhenyu Zhang, Jiangxia Cao, Fang Fang, Shi Wang, Pengfei Yin. HIN: Hierarchical Inference Network for Document-Level Relation Extraction. (Best Paper Award)
KSEM-2018 CCF-C Qingyue Wang, Yanjing Song, Hao Liu, Yanan Cao and Li Guo. A Sequence Transformation Model for Chinese Named Entity Recognition. [pdf]

Research Projects Undertaken

1.2021.06~2022.12, pre-research project "Character Analysis Technology", project leader
2.2018.07~2021.06, National Key R&D Program sub-project "Precise Expert Recommendation and Recommendation Basis Visual Presentation Technology", sub-project leader
3.2018.07~2021.06, National Key R&D Program sub-project "Reliable Traceability of Scientific Research Behavior Data and Privacy Protection Technology", sub-project executive director
4.2016.12~2019.11, National Key R&D Program sub-project "Virtual User Profile and Association Analysis Technology Research", sub-project leader
5.2015.01~2018.12, National Natural Science Foundation of China Youth Fund Project "Causal Knowledge Discovery, Verification and Inference Research for Event Prediction", project leader
6.2014.01~2017.12, National Natural Science Foundation of China General Project "Research on the Next Generation Big Data Stream Classification System", key participant
7.2013.05~2014.04, forward-looking project of the Institute of Information Engineering, Chinese Academy of Sciences "Research on the Key Technologies of Network Information Source Discovery and Information Dissemination", key participant
8.2012.01~2016.12, Strategic Priority Research Program of the Chinese Academy of Sciences "Social Situation Awareness and Handling", key participant
9.Led or participated in more than 10 contract projects commissioned by national departments