教务管理

资源下载

English

师资队伍

首页 > 师资队伍 > ALL > L

刘宏

职称：教授

电话：

办公室：A329

Email：liuh@pkusz.edu.cn

实验室网站：http://robotics.pkusz.edu.cn

研究方向：1、计算机视觉与智能机器人； 2、机器学习与智能人机交互。

导师与研究领域、方向：

刘宏，北京大学教授，国家级领军人才，北京大学人工智能研究院-具身智能与机器人中心主任，兼任重庆理工大学两江人工智能学院院长；科技部国家重点研发计划“智能机器人”总体专家组专家，JCR一区重要国际学术期刊CAAI Transactions on Intelligence Technology创刊主编、执行主编。长期担任中国人工智能学会副理事长(2014年-2025年)，现任学会监事，CAAI Fellow。

长期从事计算机视觉与智能机器人、机器学习与智能人机交互等领域的教学和科研工作，先后师从蔡鹤皋院士、王选院士和石青云院士等知名专家学者，多次赴美国、加拿大、日本和新加坡等国家的著名大学和研究机构访问交流。先后承担国家863、973、国家重点研发计划和新一代人工智能重大项目，国家自然科学基金重点项目等二十余项重要科研项目。发表学术论文300余篇，近年的研究成果被国内外同行研究机构引用18000余次。相关成果申报/获得国家发明专利40余项。先后获国家航天科技进步奖、吴文俊人工智能自然科学一等奖；近年来，以第一完成人获得深圳市自然科学一等奖、广东省自然科学一等奖，并获2025年度国家自然科学奖提名。

作为深圳教育界首位国家科技创新领军人才，二十多年来刘宏教授专注于所热爱的人工智能和机器人事业，先后培养了100多名硕士博士研究生和博士后，是我国“智能科学与技术”新学科建设的积极倡导者、智能机器人领域科技创新的潜心实践者。

近年来，刘宏教授的学术.研究主要聚焦在“机器人视觉感知与自主学习”等领域。

近年承担的主要科研项目:

1）国家重点研发计划项目：轻量化智能双臂护理机器人系统研发与应用示范（项目组副组长，在研）

2）国家自然科学基金项目：面向多机器人视觉感知的自主学习机制（项目负责人，在研）

3）广东省新一代人工智能旗舰项目：复杂场景自适应智能体基础理论和关键技术研究（项目负责人，在研）

4）科技创新2030国家新一代人工智能重大项目：基于数字孪生的室内服务机器人自主学习与进化关键技术（课题负责人，已完成）

5）国家重点研发计划项目：面向智能手机制造的柔性机器人自动化生产线研制及示范应用（技术指导，已完成）

6）科技部创新人才推进计划项目：面向复杂场景人机交互的嵌入式仿生视觉技术（项目负责人，已完成）

7）国家863课题：面向HRI的机器人视听觉注意机制及运动规划技术（项目负责人，已完成）

8）国家863课题：基于多源信息融合的交通事件自动检测技术（项目组副组长，已完成）

9）国家自然科学基金项目：面向混合增强智能运动规划的机器人位姿空间建模方法（项目负责人，已完成）

10）国家自然科学基金(联合基金)重点项目：面向服务机器人的视听感知融合与多模态人机交互关键技术（项目负责人，已完成）

11）国家自然科学基金项目：基于麦克风阵列的移动机器人实时声源定位方法研究（项目负责人，已完成）

12）国家自然科学基金项目：人机互动环境下机器人实时运动规划研究（项目负责人，已完成）

13）国家自然科学基金项目：面向人体目标实时跟踪的视觉注意转移机制研究（项目负责人，已完成）

14）教育部博士点基金课题：面向显著事件主动感知的仿生立体视觉研究（项目负责人，已完成）

15）广东省重大产业攻关项目：新一代家用服务机器人关键技术突破及集成应用示范（课题负责人，已完成）

16）深圳市高等院校稳定支持计划重点项目：面向复杂场景机器人高效作业的混合增强智能（项目负责人，已完成）

17）深圳市基础研究重点项目：基于情感计算和机器人交互模型研究（项目负责人，已完成）

18）深圳市“创新链-产业链”双链融合重大产业化项目：智能电视生产流水线的视觉检测和定位方法（课题负责人，已完成）

19）深圳市战略新兴产业项目：网络环境下智能监控系统公共技术平台（项目负责人，已完成）

20）深圳市战略新兴产业项目：物联网智能感知技术工程实验室（项目负责人，已完成）

21）深圳市基础研究重点项目：面向复杂场景人机交互的仿生视觉技术与系统（项目负责人，已完成）

近年发表的主要学术论文:

[1]J.Liu, H.Liu(刘宏), X. Li, J. Ren, X. Xu,MilNet: Multiplex Interactive Learning Network for RGB-T Semantic Segmentation, IEEE Transactions onImage Processing(TIP), 2025. (图像处理领域顶级国际学术期刊)

[2] W.Li, M.Liu, H.Liu(刘宏), T.Guo, T.Wang, H.Tang, N.Sebe, GraphMLP:A graph MLP-like architecture for 3D human pose estimation, Pattern Recognition (PR), 2025.（模式识别领域顶级国际学术期刊）

[3] W.Li, M.Liu, H.Liu(刘宏), B.Ren, X.Li, Y.You, N.Sebe, HYRE: Hybrid Regressor for 3D Human Pose and Shape Estimation, IEEE Transactions on Image Processing (TIP), 2024.（图像处理领域顶级国际学术期刊）

[4] Y.Li, H.Liu(刘宏), B.Yang, STNet: Deep Audio-Visual Fusion Network for Robust Speaker Tracking, IEEE Transactions on Multimedia (TMM), 2024.（多媒体领域顶级国际学术期刊）

[5] M.Liu, H.Liu(刘宏), T.Guo, Cross-model cross-stream learning for self-supervised human action recognition, IEEE Transactions on Human-Machine Systems (THMS), 2024. (智能系统领域重要国际学术期刊)

[6] G.Wang, M.Liu, H.Liu(刘宏), P.Guo, T.Wang, J.Guo, R.Fan, Augmented skeleton sequences with hypergraph network for self-supervised group activity recognition, Pattern Recognition (PR), 2024.（模式识别领域顶级国际学术期刊）

[7] S.Yan, M.Liu, Y.Wang, Y.Liu, H.Liu(刘宏), MLP: Motion Label Prior for Temporal Sentence Localization in Untrimmed 3D Human Motions, IEEE Transactions on Circuits and Systems for Video Technology (TCSVT), 2024.（多媒体领域顶级国际学术期刊）

[8] T.Guo, M.Liu, H.Liu(刘宏), G.Wang, W.Li, Improving self-supervised action recognition from extremely augmented skeleton sequences, Pattern Recognition (PR), 2024.（模式识别领域顶级国际学术期刊）

[9] L.Dai, H.Liu(刘宏), P.Song, M.Liu, A gated cross-domain collaborative network for underwater object detection, Pattern Recognition (PR), 2024.（模式识别领域顶级国际学术期刊）

[10] T.Wang, M.Liu, H.Liu(刘宏), W.Li, M.Ban, T.Guo, Y.Li, Feature completion transformer for occluded person re-identification, IEEE Transactions on Multimedia (TMM), 2024.（多媒体领域顶级国际学术期刊）

[11] W.Li, H.Liu(刘宏), H.Tang, P.Wang, Multi-hypothesis representation learning for transformer-based 3D human pose estimation, Pattern Recognition (PR), 2023.（模式识别领域顶级国际学术期刊）

[12] J.Wu, H.Liu(刘宏), W.Shi, M.Liu, W.Li, Style-agnostic representation learning for visible-infrared person re-identification, IEEE Transactions on Multimedia (TMM), 2023.（多媒体领域顶级国际学术期刊）

[13] Y.Liu, H.Liu(刘宏), H.Wang, F.Meng, M.Liu, BCAN: Bidirectional correct attention network for cross-modal retrieval, IEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2023.（人工智能领域顶级国际学术期刊）

[14] L.Dai, H.Liu(刘宏), H.Tang, Z.Wu, P.Song, Ao2-detr: Arbitrary-oriented object detection transformer, IEEE Transactions on Circuits and Systems for Video Technology (TCSVT), 2022.（多媒体领域顶级国际学术期刊）

[15] Z.Chen, H.Liu(刘宏), L.Zhang, X.Liao, Multi-dimensional attention with similarity constraint for weakly-supervised temporal action localization, IEEE Transactions on Multimedia (TMM), 2022.（多媒体领域顶级国际学术期刊）

[16] G.Hua, H.Liu(刘宏), W.Li, Q.Zhang, R.Ding, X.Xu, Weakly-supervised 3D human pose estimation with cross-view U-shaped graph convolutional network, IEEE Transactions on Multimedia (TMM), 2022.（多媒体领域顶级国际学术期刊）

[17] Wei Shi, Hong Liu(刘宏), and Mengyuan Liu. Image-to-video person re-identification using three-dimensional semantic appearance alignment and cross-modal interactive learning. Pattern Recognition(PR), 2022.（模式识别领域顶级国际学术期刊）

[18] Wenhao Li, Hong Liu(刘宏), Runwei Ding, Mengyuan Liu, Pichao Wang and Wenming Yang. Exploiting temporal contexts with strided transformer for 3d human pose estimation. IEEE Transactions on Multimedia (TMM), 2022.（多媒体领域顶级国际学术期刊）

[19] Weibo Huang and Hong Liu(刘宏). A robust pixel-aware gyro-aided KLT feature tracker for large camera motions. IEEE Transactions on Instrumentation and Measurement(TIM), 2022. (机器智能领域重要国际学术期刊)

[20] Meijia Song, Hong Liu(刘宏), Wei Shi, and Xia Li. PCLoss: Fashion Landmark Estimation with Position Constraint Loss. Pattern Recognition(PR), 2021.（模式识别领域顶级国际学术期刊）

[21] Hao Tang, Hong Liu(刘宏), Dan Xu, Philip HS Torr, and Nicu Sebe. Attentiongan: Unpaired image-to-image translation using attention-guided generative adversarial networks. IEEE Transactions on Neural Networks and Learning Systems(TNNLS), 2021.（人工智能领域顶级国际学术期刊）

[22] Hanrong Ye, Hong Liu(刘宏), Fanyang Meng, and Xia Li. Bi-Directional Exponential Angular Triplet Loss for RGB-Infrared Person Re-Identification. IEEE Transactions on Image Processing(TIP), 2021. （图像处理领域顶级国际学术期刊）

[23] Bing Yang, Hong Liu(刘宏), and Xiaofei Li. Learning deep direct-path relative transfer function for binaural sound source localization. IEEE/ACM Transactions on Audio, Speech, and Language Processing(TASLP), 2021.（机器听觉领域顶级国际学术期刊）

[24] Hao Tang, Hong Liu(刘宏), and Nicu Sebe. Unified Generative Adversarial Networks for Controllable Image-to-Image Translation. IEEE Transactions on Image Processing(TIP), 2020.（图像处理领域顶级国际学术期刊）

[25] C.Tian, Y.Xu, Z.Li, W.Zuo, L.Fei, H.Liu(刘宏), Attention-guided CNN for image denoising, Neural Networks (NN), 2020.（人工智能领域重要国际学术期刊）

[26] Weibo Huang, Hong Liu(刘宏), and Weiwei Wan. An Online Initialization and Self-Calibration Method for Stereo Visual-Inertial Odometry. IEEE Transactions on Robotics(TRO), 2020.（机器人领域顶级国际学术期刊）

[27] Hao Tang, Hong Liu(刘宏), Wei Xiao, and Nicu Sebe. When Dictionary Learning Meets Deep Learning: Deep Dictionary Learning and Coding Network for Image Recognition With Limited Data. IEEE Transactions on Neural Networks and Learning Systems(TNNLS), 2020.（人工智能领域顶级国际学术期刊）

[28] Jie Wen, Yong Xu, and Hong Liu(刘宏). Incomplete Multiview Spectral Clustering With Adaptive Graph Learning. IEEE Transactions on Cybernetics(TCYB), 2020.（智能控制领域顶级国际学术期刊）

[29] Bing Yang, Hong Liu(刘宏), Cheng Pang and Xiaofei Li. Multiple Sound Source Counting and Localization Based on TF-Wise Spatial Spectrum Clustering. IEEE/ACM Transactions on Audio, Speech and Language Processing(TASLP), 2019.（机器听觉领域顶级国际学术期刊）

[30] Fanyang Meng, Hong Liu(刘宏), Yongsheng Liang, Juanhui Tu and Mengyuan Liu. Sample Fusion Network: An End-to-End Data Augmentation Network for Skeleton-based Human Action Recognition. IEEE Transactions on Image Processing(TIP), 2019.（图像处理领域顶级国际学术期刊）

[31] Jiayao Ma, Weiwei Wan, Kensuke Harada, Qiuguo Zhu, and Hong Liu(刘宏). Regrasp Planning Using Stable Object Poses Supported by Complex Structures. IEEE Transactions on Cognitive and Developmental Systems(TCDS), 2019.（人工智能领域重要国际学术期刊）

[32] Mengyuan Liu, Hong Liu(刘宏), Chen Chen, Robust 3D Action Recognition Through Sampling Local Appearances and Global Distributions. IEEE Transaction on Multimedia(TMM) , 2018.（多媒体领域顶级国际学术期刊）

[33] Mengyuan Liu, Hong Liu(刘宏), and Chen Chen, 3D action recognition using multi-scale energy-based global ternary image, Accepted by IEEE Transactions on Circuits and Systems for Video Technology(TCSVT), 2018.（多媒体领域顶级国际学术期刊）

[34] M.Liu, H.Liu(刘宏), C.Chen, Robust 3D action recognition through sampling local appearances and global distributions, IEEE Transactions on Multimedia (TMM), 2017.（多媒体领域顶级国际学术期刊）

[35] Mengyuan Liu, Hong Liu(刘宏), and Chen Chen, Enhanced skeleton visualization for view invariant human action recognition, Pattern Recognition(PR), 2017.（模式识别领域顶级国际学术期刊）

[36] Chen Pang, Hong Liu(刘宏), Jie Zhang, Binaural Sound Localization Based on Reverberation Weighting and Generalized Parametric Mapping, Accepted by IEEE Transaction on Audio, Speech and Language Processing(TASLP), 2017.（机器听觉领域顶级国际学术期刊）

[37] Qianru Su, Hong Liu(刘宏), Tatsuya Harada, Online Growing Neural Gas for Anomaly Detection in Changing Surveillance Scenes, Pattern Recognition(PR), 2017.（模式识别领域顶级国际学术期刊）

[38] Pingping Wu, Hong Liu (刘宏), Xiaofei Li, A Novel Lip Descriptor for Audio-Visual Keyword Spotting Based on Adaptive Decision Fusion, IEEE Transactions on Multimedia (TMM), 2016.（多媒体领域顶级国际学术期刊）

[39] M.Shi, X.Sun, D.Tao, C.Xu, G.Baciu, H.Liu (刘宏), Exploring spatial correlation for visual object retrieval, ACM Transactions on Intelligent Systems and Technology , 2015.（智能系统领域重要国际学术期刊）

[40] Jie Zhang, Hong Liu (刘宏), Robust Acoustic Localization Via Time-Delay Compensation and Interaural Matching Filter, IEEE Transactions on Signal Processing(TSP), 2015.（信号处理领域顶级国际学术期刊）

对计划招收研究生的基本要求：

1）专业范围：智能科学与技术/人工智能、计算机科学与技术、自动化等相关学科

2）外语/数学能力：英语六级；数学（高数、线代、概率、组合等）基础好。

3）研究/开发能力：探索能力、创新精神、动手能力强，愿意按高标准严格要求自己。

智能机器人开放实验室（点击进入）

职称	教授	电话
办公室	A329	Email	liuh@pkusz.edu.cn
研究方向	1、计算机视觉与智能机器人； 2、机器学习与智能人机交互。	实验室网站	http://robotics.pkusz.edu.cn