个人简介
地址:中国上海杨浦区淞沪路2005号江湾校区2号交叉学科楼
E-mail: yunx@fudan.edu.cn
Tel: +86-21-51355518
Fax: +86-21-51355518
复旦大学计算机科学技术学院教授、博士生导师。研究兴趣包括数据科学、数据挖掘和大数据处理。长期从事数据领域研究,在数据科学学科、特异群组挖掘等方面做出了开拓性、系统性工作。研究工作获得了国家自然科学基金、国家863计划、国家科技支撑计划、上海市科委基金等资助,研究成果在包括TKDE、KDD、AAAI、ICDM、CIKM等数据领域国际权威期刊和会议上发表论文50余篇。在国内率先开展大数据和数据科学的研究工作,2009年发表论文“Data Explosion, Data Nature and Dataology”并出版第一本数据科学专著《数据学》(《Dataology and Data Science》),原创性地提出了数据科学的定义和内涵;提出了一类大数据形态--特异群组挖掘,提出了一批新型的大数据挖掘算法,这些算法已被应用于上海市医联平台、上海市证券交易所、上海市交通信息中心等上海市大数据建设重要领域。
研究兴趣
数据学和数据科学
数据挖掘及其应用(金融、保险、生物医疗、智能交通等领域)
大数据分析
数据云
主讲课程
本科生专业课程《数据库与数据仓库设计》、《大数据引论》
研究生专业课程《数据仓库与数据挖掘》、《高级数据库》
科研项目
[1]国家863项目,2015AA020105-10,基于区域医疗与健康大数据处理分析与应用研究,2015/05-2017/12,在研,主持(子课题)
[2]国家863项目子课题.基于区域医疗与健康大数据处理分析与应用研究.2015AA020105-10. 2015-01-01 ~~ 2017-12-31
[3]国家863高技术研究发展计划基金项目.转录因子结合位点及其组合模块的挖掘(2006AA02Z329).(项目副组长)
[4]国家自然科学基金项目.生物序列数据库数据模型、索引、体系结构研究(60573093).(骨干参与)
[5]大数据环境下的证券市场操纵行为发现机理、模型与方法.国家自然科学基金重大研究计划培育项目.91546105. 2016-01-01 ~~ 2018-12-30
[6]国家自然科学基金面上项目,61170096,特异群组挖掘算法研究,2012/01-2015/12,已结题,项目副组长
[7]国家自然科学基金重大研究计划培育项目,91546105,大数据环境下的证券市场操纵行为发现机理、模型与方法,2016/01-2018/12,在研,主持
[8]国家自然科学基金青年项目,60903075,基于特征挖掘的生物网络构建算法研究,2010/01-2012/12,已结题,主持
[9]国家科技支撑计划,2012BAH13F02,证券业云平台研发与运营、2012/01-2014/12,已结题,主持(子课题)
[10]上海市科委项目.音视频样本库自动化标注关键技术研究.17511101702. 2017-07-01 ~~ 2019-6-30
[11]上海市科委项目.大数据试验场数据挖掘算法库开发.17511105502. 2017-07-01 ~~ 2019-06-30.
[12]上海市科委项目.轨道交通大数据服务平台关键技术研究与示范.16511102204. 2016-07-01 ~~ 2018-06-30
[13]上海市科委项目.基于区域医疗大数据的数据挖掘技术与示范应用.12511502403. 2013-07-01 ~~ 2014-09-30
[14]上海市科委发展基金项目.面向医保基金风险防控的数据分析关键技术研究与实现(08511500203,已结题,主持
[15]上海市科委重点实验室建设专项,13dz2260200,上海市数据科学重点实验室,2013/09-2015/08,已结题,参加
[16]上海市科委项目,13511504300,数据科学基础理论研究与学科体系研究,2013/07-2015/07,已结题,主要参与
[17]上海市科委项目,12511502403,基于区域医疗大数据的数据挖掘技术研究,2012/09-2014/09,已结题,主持
[18]企业委托.深度学习与投资者画像.2018-05-01 ~~2018-12-31
[19]企业合作项目(骨干参与).短线操纵行为的模式与监管研究
[20]企业委托.岩土工程大数据分析方法研究及应用.2017-06-01 ~~2019-12-31
[21]企业委托.燃气轮机工程设计智能辅助推荐系统.2017-04-01 ~~2020-4-1
[22]企业委托.神经系统检测及血压等参数数据分析咨询.2017-03-01 ~~2017-11-30
[23]企业委托.基于E卡通的苏州教育大数据五年行动计划研究.2017-04-17 ~~2017-07-30
[24]企业委托.教育大数据分析咨询.2016-10-30 ~~2017-09-15
[25]企业委托.证券金融大数据应用研究.2016-05-20 ~~2016-12-31
[26]企业委托.上海市科技资源数据加工建设咨询.2015-12-09 ~~2016-11-30
[27]企业合作项目(项目负责人).Web3.0和数据挖掘技术研究
[28]企业合作项目(项目负责人).中小企业数据挖掘技术预研
[29]企业合作项目(项目负责人).智能交通数据挖掘技术
[30]企业合作项目(骨干参与).某医院信息化发展规划
代表论文
[1]Tang Chunlei, Joseph M. Plasekd, Xiong Yun, Zhou Li: Clustering Similar Clinical Documents in Electronic Health Records. ICDS: 2018(6).
[2]Yuanliang Zhang, Shanshan Li, Xiangyang Xu, Xiangke Liao, Shazhou Yang, Yun Xiong: ConfigFile++: Automatic comment enhancement for misconfiguration prevention. MaLTeSQuE@SANER2018: 37-42.
[3]Yizhou Zhang, Yun Xiong, Xiangnan Kong, Shanshan Li, Jinhong Mi, Yangyong Zhu: Deep Collective Classification in Heterogeneous Information Networks.WWW2018: 399-408.
[4]Chang Liao, Yun Xiong, Xiangnan Kong, Yangyong Zhu, Shimin Zhao, Shanshan Li: Functional-Oriented Relationship Strength Estimation: From Online Events to Offline Interactions.DASFAA,2018(1): 442-459
[5]Yun Xiong, Yizhou Zhang, Xiangnan Kong, Yangyong Zhu: NetCycle+: A Framework for Collective Evolution Inference in Dynamic Heterogeneous Networks.IEEE Trans. Knowl. Data Eng. 2018, 30(8): 1547-1560
[6]Chang Liao, Yun Xiong, Xiangnan Kong, Yangyong Zhu: Tracking Dynamic Magnet Communities: Insights from a Network Perspective.DASFAA, 2018(1): 406-424
[7]Ting Li, Tao Yu, Lyu Li, Lunbo Lu, Yaoyao Zhuo, Jingge Lian, Yun Xiong, Dexing Kong, Kangan Li: Use of Diffusion Kurtosis Imaging and Quantitative Dynamic Contrast-Enhanced MRI for the Differentiation of Breast Tumors. Journal of Magnetic Resonance Imaging : JMRI, 2018(2): 1-9.
[8]Chunlei Tang, Haohan Zhang, Kenneth H. Lai, Yuxuan She, Yun Xiong, Li Zhou: Developing a regional classifier to track patient needs in medical literature using spiral timelines on a geographical map.BIBM2017: 874-879.
[9]Shulin Zhou, Shanshan Li, Xiaodong Liu, Xiangyang Xu, Si Zheng, Xiangke Liao, Yun Xiong: Easier Said Than Done: Diagnosing Misconfiguration via Configuration Constraints Analysis: A Study of the Variance of Configuration Constraints in Source Code.EASE2017: 196-201.
[10]Guandong Sun, Yun Xiong, Yangyong Zhu: How the Passengers Flow in Complex Metro Networks?SSDBM2017: 23:1-23:6.
[11]Yao Zhang, Yun Xiong, Xiangnan Kong, Yangyong Zhu: Learning Node Embeddings in Interaction Graphs.ACM on Conference on Information and Knowledge Management, 2017: 397-406.
[12]Yao Zhang, Yun Xiong, Xinyue Liu, Xiangnan Kong, Yangyong Zhu: Meta-Path Graphical Lasso for Learning Heterogeneous Connectivities.SDM2017: 642-650
[13]Shulin Zhou, Xiaodong Liu, Shanshan Li, Wei Dong, Xiangke Liao, Yun Xiong: ConfMapper: Automated Variable Finding for Configuration Items in Source Code.IEEE International Conference on Software Quality, Reliability and Security Companion, 2016: 228-235.
[14]Yizhou Zhang, Yun Xiong, Xiangnan Kong, Yangyong Zhu: NetCycle: Collective Evolution Inference in Heterogeneous Information Networks.KDD2016: 1365-1374
[15]Yangyong Zhu, Yun Xiong: Defining Data Science. Computer Science, 2015,1501.05039.
[16]Jing Sun, Yun Xiong, Yangyong Zhu, Junming Liu, Chu Guan, Hui Xiong: Multi-source Information Fusion for Personalized Restaurant Recommendation.SIGIR, 2015: 983-986
[17]Yun Xiong, Yangyong Zhu, Philip S. Yu: Top-k Similarity Join in Heterogeneous Information Networks.IEEE Trans. Knowl. Data Eng.2015, 27(6): 1710-1723
[18]Zhu Yangyong, Xiong Yun. Towards Data Science. Data Science Journal. 2015, 14:1-7.
[19]Congcong Li, Jing Sun, Yun Xiong, Guangyong Zheng: An Efficient Drug-Target Interaction Mining Algorithm in Heterogeneous Biological Networks.PAKDD Workshops, 2014: 65-76
[20]Zhicheng Liao,Yun Xiong,Yangyong Zhu. DacIoT: A data cloud for the internet of things. Computer Science and Applications - Proceedings of the Asia-Pacific Conference on Computer Science and Applications (CSAC). 2014: 181-186.
[21]Xue Bai, Yitong Wang, Heng Jiang, Zhicheng Liao, Yun Xiong, Xibin Shi: Mining high-temperature event space-time regions in geo-referenced temperature series data.FSKD, 2014: 671-676
[22]Xue Bai, Yun Xiong, Yangyong Zhu, Qi Liu, Zhiyuan Chen: Co-anomaly Event Detection in Multiple Temperature Series.KSEM, 2013: 1-14.
[23]Li Xue, Yun Xiong, Yangyong Zhu, Jianfeng Wu, Zhiyuan Chen: Stock Trend Prediction by Classifying Aggregative Web Topic-Opinion.PAKDD, 2013(2): 173-184.
[24]Xue Bai, Yun Xiong, Yangyong Zhu, Hengshu Zhu: Time Series Representation: A Random Shifting Perspective.WAIM, 2013: 37-50.
[25]Yun Xiong, Yangyong Zhu, Philip S. Yu, Jian Pei: Towards Cohesive Anomaly Mining.Twenty-Seventh AAAI Conference on Artificial Intelligence. AAAI Press, 2013: 984-990.
[26]Xue Bai, Zezhen Lin, Yun Xiong, Yangyong Zhu: Clustering Based on Yukawa Potential.SDM, 2012: 672-683.
[27]Zhicheng Liao, Yun Xiong, Yangyong Zhu: FIND-A Data Cloud Platform for Financial Data Services.DATA, 2012: 117-122.
[28]Zhijie Lin, Yun Xiong, Yangyong Zhu: Link Prediction Using BenefitRanks in Weighted Networks.Web Intelligence, 2012: 423-430.
[29]Zhijie Lin, Yan Chen, Shiwei Wu, Yun Xiong, Yangyong Zhu, Guangyong Zheng: Mining hub-based protein complexes in massive biological networks.IEEE, 2012: 166-173.
[30]Li Xue, Yun Xiong, Yangyong Zhu: A Brain Data Integration Model Based on Multiple Ontology and Semantic Similarity.Brain Informatics, 2010: 192-199.
[31]Li Xue, Ming Chen, Yun Xiong, Yangyong Zhu: User Navigation Behavior Mining Using Multiple Data Domain Description.Web Intelligence/IAT Workshops, 2010: 132-135.
[32]Yun Xiong, Guangyong Zheng, Qing Yang, Yangyong Zhu: A Collaborative Multiagent System for Mining Transcriptional Regulatory Elements.IEEE Intelligent Systems, 2009, 24(3): 26-37.
[33]Gang Zhao, Yun Xiong, Longbing Cao, Dan Luo, Xuchun Su, Yangyong Zhu: A Cost-Effective LSH Filter for Fast Pairwise Mining.ICDM, 2009: 1088-1093.
[34]Yangyong Zhu, Ning Zhong, Yun Xiong: Data Explosion, Data Nature and Dataology.Brain Informatics, 2009: 147-158
[35]Yun Xiong,Yangyong Zhu. Mining Peculiarity Groups in Day-by-Day Behavioral Datasets. International Conference on Data Mining. 2009: 578-587.
[36]Yun Xiong, Guangyong Zheng, Qing Yang, Yangyong Zhu: An Agent-Based Approach to Mine Transcriptional Regulatory Elements. International Conference on Web Intelligence and Intelligent Agent Technology, 2008: 567-570.
[37]Guangyong Zheng, Kang Tu, Qing Yang, Yun Xiong, Chaochun Wei, Lu Xie, Yangyong Zhu, Yixue Li: ITFP: an integrated platform of mammalian transcription factors.Bioinformatics, 2008, 24(20): 2416-2417.
[38]Yue Chen, Jiankui Guo, Yaqin Wang, Yun Xiong, Yangyong Zhu: Incremental Mining of Sequential Patterns Using Prefix Tree.Pacific-Asia Conference on Advances in Knowledge Discovery and Data Mining, PAKDD, 2007: 433-440.
[39]Yun Xiong, Yangyong Zhu: A Multi-Supports-Based Sequential Pattern Mining Algorithm.CIT, 2005: 170-174.
[40]王晓霞,蒋伏松,王宇,熊赟.基于ICD-10诊断编码的慢性病并发症聚类算法[J].大数据,2018(3): 37-45.
[41]于广军,熊贇,彭思佳,等.基于环境气象因素影响的异常就诊量预测[J].大数据,2018(3).
[42]李婷,鲁伦博,卓瑶瑶,连婧阁,孔德兴,熊贇,李康安.扩散峰度成像联合动态对比增强MRI诊断乳腺良恶性病变的价值[J].中华放射学杂志,2018(6): 436-441.
[43]李婷,鲁伦博,卓瑶瑶,连靖阁,孔德兴,熊赟,李康安.扩散峰度成像与扩散加权成像对乳腺癌诊断价值的对比研究[J].中华放射学杂志,2018(3): 177-182.
[44]熊贇,朱扬勇.面向数据自治开放的数据盒模型[J].大数据,2018(2).
[45]朱扬勇,熊贇,廖志成,等.数据自治开放模式[J].大数据,2018(2).
[46]朱扬勇,熊贇.大数据的若干基础研究方向[J].大数据,2017,3(2):104-114.
[47]孙冠东,张兵,刘禹岍,等.基于载客数据的出租车热门区域功能发现[J].计算机工程,2017, 43(5):16-22.
[48]李婷,李康安,熊赟.乳腺癌计算机辅助诊断技术研究进展[J].中华放射学杂志,2017, 51(11): 887-890.
[49]周靖,佘玉轩,熊赟.MaLDA:基于LDA的用药分析[J].计算机工程与应用.2016,(18):8-13.
[50]朱扬勇,熊贇.大数据人才培养的基础条件初探[J].大数据,2016,2(5):107-114.
[51]朱扬勇,熊贇.大数据时代的数据科学家培养[J].大数据,2016,2(3):106-112.
[52]何友,朱扬勇,赵鹏,柴勇,廖志成,周伟,周向东,王海鹏,汪卫,熊赟,许舟军,彭煊,孟晖,王生进.国防大数据概论[J].2016, 38(6):1300-1305.
[53]张一舟,曾剑平,孙婧,熊赟.OMisy:一个面向股吧数据的观点挖掘系统[J].计算机研究与发展.2015, (1): 123-128.
[54]张一舟,曾剑平,孙婧,熊赟.OMisy:一个面向股吧数据的观点挖掘系统[J].计算机研究与发展.2015,(1):123-128.
[55]朱扬勇,熊赟.大数据是数据、技术,还是应用[J].大数据.2015,1(1): 1-12.
[56]吴世伟,熊赟.基于概率矩阵分解的个性化链接预测算法[J].计算机应用与软件.2015,(8): 243-247.
[57]熊赟,朱扬勇.特异群组挖掘:框架与应用[J].大数据.2015,1(2):1-12.
[58]孙婧,熊赟.CMMDI:一个中医多元数据整合平台[C].第31届中国数据库学术会议论文集复旦大学.2014:364-367.
[59]杨林瑞,陈磊,熊赟,先梦涵,朱扬勇.PuMA:面向微博话题分析的舆情数据服务平台[C].中国数据库学术会议.2014.
[60]陈磊,于广军,程思远,王飞,熊赟,刘卉.大数据挖掘平台的构建及其在医疗领域的应用[C].第31届中国数据库学术会议论文集.2014:334-337.
[61]周健雯,李聪聪,熊赟,朱扬勇. 一种基于R*树的自相似性连接算法[J].计算机应用与软件,2014, 31(8):50-53.
[62]徐宇明,陈诚,熊赟,朱扬勇.APT-KNN:一种面向分类问题的高效缺失值填充算法[J].计算机应用与软件,2011, 28(4):135-139.
[63]周坤,王爱荣,张敬谊,熊赟,朱扬勇.VPM:一个就医行为模式挖掘算法[J].计算机应用与软件,2011, 28(8):123-125.
[64]陈鸣,薛慧君,熊赟,朱扬勇. 基于多数据域描述的转录因子结合位点识别[J].计算机应用与软件,2011, 28(6):1-4.
[65]高臻耀,张敬谊,林志杰,熊赟,朱扬勇.一个医保基金风险防控平台中的数据挖掘技术[J].计算机应用与软件,2011, 28(8):120-122.
[66]秦德霖,王爱荣,陈诚,林志杰,熊赟,朱扬勇.一个医保基金风险防控数据仓库的设计和实现[J].计算机应用与软件,2011, 28(7):75-78.
[67]王月,王伟俊,童庆,熊赟,朱扬勇. 一个医保数据仓库的元数据管理解决方案[J]. 计算机应用与软件,2011, 28(8):126-129.
[68]何俊华,张静谊,熊赟,朱扬勇.医保就医聚集行为挖掘[J].计算机应用与软件,2011,28(7):79-81.
[69]戴东波,熊赟,朱扬勇.基于参考集索引的高效序列相似性查找算法[J].软件学报,2010,21(4):718-731.
[70]戴东波,汤春蕾,熊赟.基于整体和局部相似性的序列聚类算法[J].软件学报.2010, (4): 702-717.
[71]朱扬勇,戴东波,熊赟.序列数据相似性查询技术研究综述[J].计算机研究与发展.2010.47(2):264-276.
[72]戴东波,汤春蕾,邱伯仁,熊赟,朱扬勇.一种优化多重过滤得序列查询算法[J].计算机研究与发展.2010,42(10):1785-1796.
[73]邱伯仁,熊赟,朱扬勇.BioIndex:一种生物序列相似性查询的高效索引[J].计算机应用与软件.2009, 26(10): 1-4.
[74]戴东波,汤春蕾,邱伯仁,熊赟,朱扬勇.一种优化多重过滤的序列查询算法[J].第三届中国数据挖掘学术会议(CCDM2009)论文集复旦大学.2009:80.
[75]朱扬勇,熊赟.BioSeg: 一个生物序列数据模型[J].计算机科学与探索.2008, 1:77-96.
[76]熊赟,邱伯仁,张坤,朱扬勇.Gen-Cluster:一个基因表达数据的高维聚类算法[J].复旦学报(自然科学版).2008,47(2):135-146.
[77]杨青,郑广勇,熊赟,朱扬勇.Qnet-BSTM:一个转录因子结合位点文本挖掘算法[J].第二十五届中国数据库学术会议(NDBC2008)论文集复旦大学.2008:562-568.
[78]李金玖,熊赟,朱扬勇.不确定数据集上的k-Skyline查询[J].计算机研究与发展.2008,45(suppl.):315-322.
[79]朱扬勇,熊赟.DNA序列数据挖掘技术[J].软件学报.2007,18(11):2766-2781.
[80]熊赟,陈越,朱扬勇.ProFaM:一个蛋白质序列家族挖掘算法[J].计算机研究与发展.2007,44(7):1160-1168.
[81]阮璐,熊赟.基于网络表示学习的miRNA功能相似性研究[J].计算机研究发展,2019(3).
[82]Yao Zhang, Yun Xiong, Yun Ye, Tengfei Liu, Weiqiang Wang, Yangyong Zhu, Philip S. Yu. SEAL: Learning Heuristics for Community Detection with Generative Adversarial Networks AUTHORS. ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD), 2020.
[83]Dan Shi ,Chunlei Tang, Suzanne V Blackley, Liqin Wang, Jiahong Yang, Yanming He, Samuel I Bennett, Yun Xiong, Xiao Shi, Li Zhou, David W Bates. An annotated dataset of tongue images supporting geriatric disease diagnosis. Data in Brief, 2020.08.
[84]Huidi Chen,Yun Xiong,Yangyong Zhu,Philip S. Yu. Highly Liquid Temporal Interaction Graph Embeddings. WWW'2,2021.04.
[85]Tianqi Zhang, Yun Xiong, Jiawei Zhang, Yao Zhang, Yizhu Jiao, Yangyong Zhu. CommDGI: Community Detection Oriented Deep Graph Infomax. ACM International Conference on Information and Knowledge Management (CIKM), 2020.
[86]Tao Lei, Yun Xiong, Peng Tian, Yangyong Zhu. MinSR: Multi-level Interests Network for Session-based Recommendation. DASFAA, 2020.
[87]Huidi Chen, Yun Xiong, Changdong Wang, Yangyong Zhu, Wei Wang. SpEC: Sparse Embedding-based Community Detection in Attributed Graphs. DASFAA, 2020.
[88]Yi Xie, Yun Xiong, Yangyong Zhu. SAST-GNN: A Self-Attention based Spatio-Temporal Graph Neural Network for Traffic Prediction. DASFAA, 2020.
[89]Yizhu Jiao, Yun Xiong , Jiawei Zhang, Yao Zhang, Tianqi Zhang, Yangyong Zhu. Sub-graph Contrast for Scalable Self-Superxised Graph Representation Learning. ICDM, 2020.
[90]Yali Xiang, Yun Xiong, Yangyong Zhu.TI-GCN: A Dynamic Network Embedding Method with Time Interval Information. IEEE International Conference on Big Data, 2020.
代表专著
[1]熊贇,朱扬勇.《特异群组挖掘》.人民邮电出版社.2020. ISBN:978-115-54326-4.
[2]朱扬勇,熊赟.《生物数据整合与挖掘》.复旦大学出版社.2009.ISBN978-7-309-06614-2/Q.72.
[3]朱扬勇,熊赟.《数据学》.复旦大学出版社.2009.ISBN978-7-309-06956-3/T.350.
[4]熊赟,朱扬勇.《大数据挖掘》.上海科学技术出版社.2016.ISBN:9787547829615.