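Professor, doctoral supervisor, Director of the Shanghai Key Laboratory of Data Science
Address: Interdisciplinary Building No. 2, Jiangwan Campus, 2005 Songhu Road, Yangpu District, Shanghai, China
E-mail: shawyh@fudan.edu.cn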
Tel: +86-021-51355548
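Biography
Professor and doctoral supervisor at the School of Computer Science and Technology, Fudan University, and Director of the Shanghai Key Laboratory of Data Science. Received a Ph.D. from Fudan University in 2009 and stayed on as faculty, serving successively as lecturer, associate professor, and professor (2017).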
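2024 ICDE Ten-Year Influential Paper Award
2023 ACL Outstanding Paper Award
2023 Huawei Spark Award
2022 Meituan Research Collaboration Innovation Award
2021 Shanghai Computer Society Science and Technology Award, Second Prize
2020 Huawei Outstanding Partner Award
2019, 2020 CCF Outstanding Academic Director; CCF Distinguished Speaker
2019 Named one of 20 benchmark figures in knowledge graphs empowering China's artificial intelligence
2018 Alibaba Academic Cooperation Best Partner
2017 Alibaba Research Fellowship Award
2017 Ministry of Education University Scientific Research Achievement Award, Second Prize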
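Knowledge graphs: knowledge management, construction, analysis, and applications centered on knowledge graphs
Large models: cognitive evaluation and enhancement of large models; domain adaptation of large models
Artificial intelligence inspired by the social sciences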
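Big-data knowledge engineering centered on knowledge graphs is one of the main ways of putting the artificial intelligence development strategy into practice. A knowledge graph is a large-scale semantic network and one of the important forms for representing human knowledge. The team has systematically proposed a series of data-driven methods for knowledge graph construction, including knowledge completion, knowledge cleaning, knowledge inference, and knowledge updating, enabling the automated construction and updating of large-scale knowledge graphs. This research has produced more than 30 papers in CCF-A conferences and journals, won first place in the "Information Extraction" track of the 2019 Language and Intelligence Technology Competition, and received an ACL 2023 Outstanding Paper Award. The team built and released CN-DBpedia, a hundred-million-scale Chinese encyclopedic knowledge graph, which has become one of the most downloaded public knowledge base resources in China, with more than 20,000 cumulative downloads. The team also built the Knowledge Works (知识工场) platform, which has released 5 large knowledge graphs, more than 10 tools supporting knowledge graph operation and maintenance, and more than 20 knowledge graph data service and cognitive service APIs. These services are called at scale: more than 500 companies and individual users, from over 6,500 distinct IP addresses, have made nearly 2 billion calls in total. Also served as chief editor of the knowledge graph textbook 《知识图谱:概念与技术》 (Knowledge Graph: Concepts and Techniques), which has been adopted as a textbook by more than 20 universities and described by readers as "essential reading for learning knowledge graphs." The knowledge updating solution for large-scale knowledge graphs won the Huawei Spark Award. The construction solution for a large-scale everyday commonsense knowledge graph won the Meituan Research Collaboration Innovation Award.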
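Large models have become the infrastructure through which machines achieve cognitive intelligence. Focusing on the evaluation and optimization of large models' cognitive capabilities, the team has built a systematic evaluation framework and proposed data-driven methods for optimizing large-model cognition. Evaluation datasets and fine-tuning instruction sets have been built for subject knowledge, instruction understanding, planning ability, humor cognition, metaphor cognition, category cognition, concept cognition, and dimension (unit of measure) cognition. Over the past two years the team has published more than 30 CCF-A papers. The team has built an analogical reasoning dataset, a structured analogy evaluation benchmark, a Chinese humor cognition dataset, an all-discipline knowledge evaluation benchmark for language models (Xiezhi), and a complex instruction understanding evaluation dataset. Among these, Xiezhi has been rated by the large-model development community as one of the most valuable Chinese language model evaluation benchmarks for reference. The team released CuteGPT, a large model with enhanced emotional cognition. Building on this work, and based on knowledge graphs and large models, the team released the Digital-Intelligent Textbook Platform, which has so far been used for study several thousand times and has drawn wide attention from universities and teachers.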
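The paper "Xiezhi: An Ever-Updating Benchmark for Holistic Domain Knowledge Evaluation," published at AAAI 2024, built Xiezhi, an all-discipline evaluation benchmark for language models (Figure a), which AINLP rated as one of the most valuable Chinese language model evaluation benchmarks for reference (Figure b).
Digital-Intelligent Textbook Platform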
1) Xintao Wang, Yunze Xiao, Jen-tse Huang, Siyu Yuan, Rui Xu, Haoran Guo, Quan Tu, Yaying Fei, Ziang Leng, Wei Wang, Jiangjie Chen, Cheng Li, Yanghua Xiao. InCharacter: Evaluating Personality Fidelity in Role-Playing Agents through Psychological Interviews. ACL 2024
2) Yikai Zhang, Siyu Yuan, Caiyu Hu, Kyle Richardson, Yanghua Xiao, Jiangjie Chen. TimeArena: Shaping Efficient Multitasking Language Agents in a Time-Aware Simulation. ACL 2024
3) Jiayi Fu, Xuandong Zhao, Ruihan Yang, Yuansen Zhang, Jiangjie Chen, Yanghua Xiao. GumbelSoft: Diversified Language Model Watermarking via the GumbelMax-trick. ACL 2024
4) Nianqi Li, Jingping Liu, Sihang Jiang, Haiyun Jiang, Yanghua Xiao, Jiaqing Liang, Zujie Liang, Feng Wei, Jinglei Chen. CR-LLM: A Dataset and Optimization for Concept Reasoning of Large Language Models. ACL Findings 2024
5) Yikai Zhang, Qianyu He, Xintao Wang, Siyu Yuan, Jiaqing Liang, Yanghua Xiao. Light Up the Shadows: Enhance Long-Tail Entity Grounding with Concept-Guided Vision-Language Models. ACL Findings 2024
6) Chenghua Huang, Shisong Chen, Zhixu Li, Jianfeng Qu, Yanghua Xiao, Jiaxin Liu, Zhigang Chen. GeoAgent: To Empower LLMs using Geospatial Tools for Address Standardization. ACL Findings 2024
7) Jian Xie, Kai Zhang, Jiangjie Chen, Tinghui Zhu, Renze Lou, Yuandong Tian, Yanghua Xiao, Yu Su. TravelPlanner: A Benchmark for Real-World Planning with Language Agents. ICML 2024
8) Yaoxian Song, Penglei Sun, Haoyu Liu, Zhixu Li, Wei Song, Yanghua Xiao, Xiaofang Zhou. Scene-Driven Multimodal Knowledge Graph Construction for Embodied AI. ICDE 2024
9) Yidan Xu, Jiaqing Liang, Yaoyao Zhuo, Lei Liu, Yanghua Xiao, Lingxiao Zhou. TDASD: Generating medically significant fine-grained lung adenocarcinoma nodule CT images based on stable diffusion models with limited sample size. Computer Methods and Programs in Biomedicine 248 (2024): 108103.
10) Jingping Liu, Tao Chen, Hao Guo, Chao Wang, Haiyun Jiang, Yanghua Xiao, Xiang Xu, Baohua Wu. Exploiting Duality in Aspect Sentiment Triplet Extraction with Sequential Prompting. TKDE 2024
11) Qianyu He, Jie Zeng, Wenhao Huang, Lina Chen, Jin Xiao, Qianxi He, Xunzhe Zhou, Lida Chen, Xintao Wang, Yuncheng Huang, Haoning Ye, Zihan Li, Shisong Chen, Yikai Zhang, Zhouhong Gu, Jiaqing Liang, Yanghua Xiao. Can Large Language Models Understand Real-World Complex Instructions? AAAI 2024
12) Zhouhong Gu, Xiaoxuan Zhu, Haoning Ye, Lin Zhang, Jianchen Wang, Sihang Jiang, Zhuozhi Xiong, Zihan Li, Qianyu He, Rui Xu, Wenhao Huang, Zili Wang, Shusen Wang, Weiguo Zheng, Hongwei Feng, Yanghua Xiao. Xiezhi: An Ever-Updating Benchmark for Holistic Domain Knowledge Evaluation. AAAI 2024
13) Haixia Han, Jiaqing Liang, Jie Shi, Qianyu He, Yanghua Xiao. Small Language Model Can Self-correct. AAAI 2024
14) Yuyan Chen, Yichen Yuan, Panjun Liu, Dayiheng Liu, Qinghao Guan, Mengfei Guo, Haiming Peng, Zhixu Li, Yanghua Xiao, Bang Liu. Talk Funny! A Large-scale Humor Response Dataset with Chain-of-Humor Interpretation. AAAI 2024
15) Jingping Liu, Mingchuan Zhang, Weichen Li, Chao Wang, Shuang Li, Shuang Li, Sihang Jiang, Yanghua Xiao, Yunwen Chen. Beyond Entities: A Large-Scale Multi-Modal Knowledge Graph with Triplet Fact Grounding. AAAI 2024
16) Shuang Li, Jiangjie Chen, Siyu Yuan, Xinyi Wu, Hao Yang, Shimin Tao, Yanghua Xiao. Translate Meanings, Not Just Words: IdiomKB's Role in Optimizing Idiomatic Translation with Language Models. AAAI 2024
17) Yuncheng Huang, Qianyu He, Jiaqing Liang, Sihang Jiang, Yanghua Xiao, Yunwen Chen. Enhancing Quantitative Reasoning Skills of Large Language Models through Dimension Perception. ICDE 2024
18) Lipeng Ma, Weidong Yang, Bo Xu, Sihang Jiang, Ben Fei, Jiaqing Liang, Mingjie Zhou, Yanghua Xiao. KnowLog: Knowledge Enhanced Pre-trained Language Model for Log Understanding. ICSE 2024
19) Chao Wang, Juntao Liu, Jingping Liu, Sihang Jiang, Zhixu Li, Yanghua Xiao. Sweet Apple, company? or food? Adjective-centric commonsense knowledge acquisition with taxonomy-guided induction. Knowledge Based Systems 2023
20) Xuwu Wang, Lihan Chen, Wei Zhu, Yuan Ni, Guotong Xie, Deqing Yang, Yanghua Xiao. Multi-task entity linking with supervision from a taxonomy. Knowledge Based Systems 2023
21) Tinghui Zhu, Jingping Liu, Jiaqing Liang, Haiyun Jiang, Yanghua Xiao, ZongYu Wang, Rui Xie, Yunsen Xian. Towards Visual Taxonomy Expansion. MM 2023
22) Yuyan Chen, Yanghua Xiao, Zhixu Li, Bang Liu. XMQAs: Constructing Complex-Modified Question-Answering Dataset for Robust Question Understanding. TKDE 2023
23) Zhiang Yue, Jingping Liu, Cong Zhang, Chao Wang, Haiyun Jiang, Yue Zhang, Xianyang Tian, Zhedong Cen, Yanghua Xiao, Tong Ruan. MA-MRC: A Multi-answer Machine Reading Comprehension Dataset. SIGIR 2023
24) Sihang Jiang, Jianchuan Feng, Chao Wang, Jingping Liu, Zhuozhi Xiong, Chaofeng Sha, Weiguo Zheng, Jiaqing Liang, Yanghua Xiao. EASC: An Exception-Aware Semantic Compression Framework for Real-World Knowledge Graphs. Knowledge Based Systems 2023
25) Siyu Yuan, Jiangjie Chen, Ziquan Fu, Xuyang Ge, Soham Shah, C. R. Jankowski, Deqing Yang, Yanghua Xiao. Distilling Script Knowledge from Large Language Models for Constrained Language Planning. ACL 2023 (Outstanding Paper)
26) Siyu Yuan, Deqing Yang, Jinxi Liu, Shuyu Tian, Jiaqing Liang, Yanghua Xiao, Rui Xie. Causality-aware Concept Extraction based on Knowledge-guided Prompting. ACL 2023
27) Jiangjie Chen, Wei Shi, Ziquan Fu, Sijie Cheng, Lei Li, Yanghua Xiao. Say What You Mean! Large Language Models Speak Too Positively about Negative Commonsense Knowledge. ACL 2023
28) Qianyu He, Yikai Zhang, Jiaqing Liang, Yuncheng Huang, Yanghua Xiao, Yunwen Chen. HAUSER: Towards Holistic and Automatic Evaluation of Simile Generation. ACL 2023
29) Wenhao Huang, Jiaqing Liang, Zhixu Li, Yanghua Xiao, Chuanjun Ji. Adaptive Ordered Information Extraction with Deep Reinforcement Learning. ACL 2023
30) Xiaodan Wang, Chengyu Wang, Lei Li, Zhixu Li, Ben Chen, Linbo Jin, Jun Huang, Yanghua Xiao, Ming Gao. FashionKLIP: Enhancing E-Commerce Image-Text Retrieval with Fashion Multi-Modal Conceptual Knowledge Graph. ACL 2023
31) Jian Xie, Yidan Liang, Jingping Liu, Yanghua Xiao, Baohua Wu, Shenghua Ni. QUERT: Continual Pre-training of Language Model for Query Understanding in Travel Domain Search. KDD 2023
32) Jingsong Yang, Guanzhou Han, Deqing Yang, Jingping Liu, Yanghua Xiao, Xiang Xu, Baohua Wu, Shenghua Ni. M3PT: A Multi-Modal Model for POI Tagging. KDD 2023
33) Mingxi Zhang, Yanghua Xiao, Wei Wang. Efficient single-source SimRank query by path aggregation. KDD 2023
34) Sheng-Chi You, Chao Wang, Baohua Wu, Jingping Liu, Quan Lu, Guanzhou Han, Yanghua Xiao. What Image do You Need? A Two-stage Framework for Image Selection in E-commerce. WWW 2023
35) Jiangjie Chen, Rui Xu, Wenxuan Zeng, Changzhi Sun, Lei Li, Yanghua Xiao*. Factual Error Correction via Iterative Constrained Editing. AAAI 2023
36) Lihan Chen, Tinghui Zhu, Jingping Liu, Jiaqing Liang, Yanghua Xiao*. End-to-end Entity Linking with Hierarchical Reinforcement Learning. AAAI 2023
37) Zhouhong Gu, Sihang Jiang, Jingping Liu, Yanghua Xiao*, Hongwei Feng, Zhixu Li, Jiaqing Liang, Jian Zhong. GANTEE: Generative Adversarial Network for Taxonomy Entering Evaluation. AAAI 2023
38) Yu Hong, Jiahang Li, Jianchuang Feng, Chenghua Huang, Zhixu Li, Jianfeng Qu, Yanghua Xiao*, Wei Wang. Competition or Cooperation? Exploring Unlabeled Data via Challenging Minimax Game for Semi-Supervised Relation Extraction. AAAI 2023
39) Qianyu He, Xintao Wang, Jiaqing Liang, Yanghua Xiao*. MAPS-KB: A Million-scale Probabilistic Simile Knowledge Base. AAAI 2023
40) Shuoyao Zhai, Baichuan Liu, Deqing Yang, Yanghua Xiao. Group Buying Recommendation Model Based on Multi-task Learning. ICDE 2023
41) Lyuxin Xue, Deqing Yang, Shuoyao Zhai, Yuxin Li, Yanghua Xiao. Learning Dual-view User Representations for Enhanced Sequential Recommendation. TOIS 2023
42) Jingping Liu, Tao Chen, Chao Wang, Jiaqing Liang, Lihan Chen, Yanghua Xiao*, Yunwen Chen, Ke Jin. VoCSK: Verb-oriented commonsense knowledge mining with taxonomy-guided induction. Artificial Intelligence Journal 2022
43) Qianyu He, Sijie Cheng, Zhixu Li, Rui Xie, Yanghua Xiao*. Can Pre-trained Language Models Interpret Similes as Smart as Human? ACL 2022
44) Xuwu Wang, Junfeng Tian, Min Gui, Zhixu Li, Rui Wang, Ming Yan, Lihan Chen, Yanghua Xiao. WikiDiverse: A Multimodal Entity Linking Dataset with Diversified Contextual Topics and Entity Types, ACL 2022
45) Chao Wang, Haiyun Jiang, Tao Chen, Jingping Liu, Menghui Wang, Sihang Jiang, Zhixu Li, Yanghua Xiao. Entity Understanding with Hierarchical Graph Learning for Enhanced Text Classification, Knowledge Based Systems 2022
46) Sijie Cheng, Zhouhong Gu, Bang Liu, Rui Xie, Wei Wu, Yanghua Xiao. Learning What You Need from What You Did: Product Taxonomy Expansion with User Behaviors Supervision, ICDE 2022
47) Xintao Wang, Qianyu He, Jiaqing Liang, Yanghua Xiao. Language Models as Knowledge Embeddings, IJCAI 2022
48) Jiangjie Chen, Qiaoben Bao, Changzhi Sun, Xinbo Zhang, Jiaze Chen, Hao Zhou, Yanghua Xiao, Lei Li. LOREN: Logic-Regularized Reasoning for Interpretable Fact Verification, AAAI 2022
49) Jiangjie Chen, Chun Gan, Sijie Cheng, Hao Zhou, Yanghua Xiao, Lei Li. Unsupervised Editing for Counterfactual Stories, AAAI 2022
50) Yuyan Chen, Yanghua Xiao, Bang Liu. Grow-and-Clip: Informative-yet-Concise Evidence Distillation for Answer Explanation, ICDE 2022
51) Jiangjie Chen, Rui Xu, Ziquan Fu, Wei Shi, Zhongqiao Li, Xinbo Zhang, Changzhi Sun, Lei Li, Yanghua Xiao, Hao Zhou. E-KAR: A Benchmark for Rationalizing Natural Language Analogical Reasoning, ACL 2022
52) Chao Wang, Jingping Liu, Tianyi Zhuang, Jiahang Li, Juntao Liu, Yanghua Xiao, Wei Wang, Rui Xie. A Sequence-to-Sequence Model for Large-scale Chinese Abbreviation Database Construction. WSDM 2022 (selected for Best of WSDM 2022)
53) Lihan Chen, Sihang Jiang, Jingping Liu, Chao Wang, Sheng Zhang, Chenhao Xie, Jiaqing Liang, Yanghua Xiao, Rui Song. Rule Mining over Knowledge Graphs via Reinforcement Learning, Knowledge Based Systems 2022