Wei Gao (高伟)

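Title: Assistant Professor
Phone: 0755-26033202
Office: A214
Email: gaowei262@pku.edu.cn
Lab website:
Research areas: (1) multimedia coding; (2) multimedia processing; (3) deep learning and artificial intelligence.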

Supervisor and research areas:

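Wei Gao, Ph.D., is an Assistant Professor, Associate Researcher, and Ph.D. supervisor at the School of Electronic and Computer Engineering (信息工程学院), Peking University. He is an IEEE Senior Member and a Shenzhen Overseas High-Caliber Talent. He received his Ph.D. in Computer Science from City University of Hong Kong, was a visiting scholar at the University of California, Los Angeles (UCLA), and did postdoctoral research at City University of Hong Kong and Nanyang Technological University, Singapore. He also worked as an ASIC design engineer for image signal processors at OmniVision Technologies, a CMOS image sensor manufacturer. His research addresses information processing for immersive and 3D visual media, including point clouds, light fields, panoramic video, and multi-view/stereoscopic 3D. His main research interests are: (1) multimedia coding: point cloud and video coding, and deep-learning-based intelligent coding; (2) multimedia processing and computer vision: point cloud processing and analysis, image/video processing and analysis, and computer vision for immersive and 3D visual media (including perceptual quality assessment, visual attention and saliency segmentation, restoration and enhancement, and 3D object detection); (3) deep learning and artificial intelligence: machine learning/deep learning and optimization theory and applications, interpretable deep neural network theory, and lightweight deep learning algorithms with software/hardware acceleration. In recent years he has published more than 70 papers in leading international journals (such as IEEE TIP, TCSVT, TMM, TNNLS, and TCYB) and conferences (such as CVPR, ECCV, AAAI, ACM MM, and DCC), has filed or been granted more than 50 U.S., Chinese, and PCT patents, and actively contributes standardization proposals for multimedia and artificial intelligence technologies.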

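For his research on 3D immersive media he received the 2021 IEEE Multimedia Rising Star award (one of only 4 recipients worldwide). His other honors include the 2021 CCF-Tencent Rhino-Bird Outstanding Patent Award, the CCF-Tencent Rhino-Bird Fund in two consecutive years, 2019 and 2020 (a research fund with a 12% global selection rate), and the 2019 First Prize for Outstanding Paper from the Guangdong Computer Society (as first author). He has been invited to serve as Associate Editor of four major international SCI journals in multimedia computing and machine learning, including Signal Processing (Elsevier, JCR Q1) and Neural Processing Letters (Springer, JCR Q2). He is also a member of the Image, Video, and Multimedia Technical Committee of the Asia-Pacific Signal and Information Processing Association (APSIPA IVM TC), an executive member of the China Computer Federation Technical Committee on Multimedia (CCF TCMM), and a member of the Multimedia Technical Committee of the China Society of Image and Graphics (CSIG TCMM). He organized workshops and special sessions on interactive media and on point cloud coding and processing at IEEE ICME 2021, IEEE VCIP 2022, and ACM MM 2022. He serves as a review expert for the National Natural Science Foundation of China and for Guangdong Province and Shenzhen projects, as a reviewer for top international journals (IEEE TIP, TCSVT, TMM, TNNLS, TCYB, and others) and major international conferences (CVPR, ECCV, ACM MM, IJCAI, and others), and as a program committee member and organizer for several international conferences. The research group collaborates widely with industry on technology development and, together with Peng Cheng Laboratory, is building and maintaining open-source algorithm libraries for point cloud technology and visual information compression. These include OpenPointCloud (the first open-source project for point cloud coding and processing), OpenHardwareVC (the first open-source project for an AVS3 8K hardware encoder), OpenAICoding, OpenCompression, and OpenVision.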

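The research group works to improve the viewing experience and industrial application of immersive and 3D visual media, and to advance emerging and future multimedia and visual information processing technologies. Outstanding undergraduate and master's students are welcome to apply, by recommendation or by examination, to the master's and Ph.D. programs of the School of Electronic and Computer Engineering (信息工程学院), Peking University. Applications for postdoctoral and visiting positions in the group are also welcome, for research on current and frontier topics in multimedia computing and artificial intelligence. For the latest admissions and research information, see the homepage: https://gaowei262.github.io/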


Major research projects

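In recent years, as principal investigator, he has led or is leading more than 10 major national and provincial/ministerial research projects. These include 2 projects/sub-projects under the National Key R&D Program of China, 2 projects/sub-projects funded by the National Natural Science Foundation of China (1 sub-project of a Key Program project and 1 Young Scientists Fund project), 1 Guangdong Natural Science Foundation project (General Program), 2 Shenzhen basic research projects (1 key project and 1 general project), and 5 industry-commissioned projects (Tencent, Huawei, Lenovo). As a key researcher, he has participated in 3 National Natural Science Foundation of China projects, 1 General Research Fund project of the Hong Kong Research Grants Council, and 1 project of the Hong Kong Innovation and Technology Commission.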


Selected recent journal and conference papers (more than 30, in IEEE/ACM Transactions and top-tier conferences)

1. Songlin Fan, Wei Gao, Ge Li, “Salient Object Detection for Point Clouds,” European Conference on Computer Vision (ECCV), 2022.

2. Hang Yuan, Wei Gao, Ge Li, Zhu Li, “Rate-Distortion-Guided Learning Approach with Cross-Projection Information for V-PCC Fast CU Decision,” ACM International Conference on Multimedia (ACM MM), 2022.

3. Guanghui Yue, Siying Li, Tianwei Zhou, Miaohui Wang, Jingfeng Du, Tianfu Wang, Qiuping Jiang, Wei Gao, “Adaptive Context Exploration Network for Polyp Segmentation in Colonoscopy Images,” IEEE Transactions on Emerging Topics in Computational Intelligence (TETCI), 2022.

4. Runmin Cong, Haowei Yang, Qiuping Jiang, Wei Gao, Haisheng Li, Yao Zhao, and Sam Kwong, “BCS-Net: Boundary, Context and Semantic for Automatic COVID-19 Lung Infection Segmentation from CT Images,” IEEE Transactions on Instrumentation and Measurement (TIM), 2022.

5. Wei Gao, Hua Ye, Ge Li, Huiming Zheng, Yuyang Wu and Liang Xie, “OpenPointCloud: An Open-Source Algorithm Library of Deep Learning Based Point Cloud Compression,” ACM International Conference on Multimedia (ACM MM), 2022.

6. Guibiao Liao, Wei Gao, Ge Li, Junle Wang, Sam Kwong, “Cross-Collaborative Fusion-Encoder Network for Robust RGB-T Salient Object Detection,” IEEE Transactions on Circuits and Systems for Video Technology (TCSVT), 2022.

7. Wei Gao, Yang Guo, Siwei Ma, Ge Li, and Sam Kwong, “NNCS: Neural Network Compression via Compressive Sensing,” IEEE Transactions on Neural Networks and Learning Systems (TNNLS), 202.

8. Dinghao Yang, Wei Gao, Hui Yuan, Junhui Hou, Ge Li, Sam Kwong, “3D Point Cloud Classification via Exploiting Efficient Manifold Learning Based Feature Representation,” ACM Transactions on Multimedia Computing, Communications, and Applications (TOMM), 2022.

9. Ruonan Zhang, Jingyi Chen, Wei Gao, Ge Li, Thomas Li, “PointOT: Interpretable Geometry-Inspired Point Cloud Generative Model via Optimal Transport,” IEEE Transactions on Circuits and Systems for Video Technology (TCSVT), 2022.

10. Wei Gao, Hang Yuan, Yang Guo, Lvfang Tao, Zhanyuan Cai, Ge Li, “OpenHardwareVC: An Open Source Library for 8K UHD Video Coding Hardware Implementation,” ACM International Conference on Multimedia (ACM MM), 2022.

11. Xiaoyu Zhang, Wei Gao, Ge Li, Qiuping Jiang, Runmin Cong, “Image Quality Assessment Driven Reinforcement Learning for Mixed Distorted Image Restoration,” ACM Transactions on Multimedia Computing, Communications, and Applications (TOMM), 2022.

12. Zhuangzi Li, Ge Li, Thomas Li, Shan Liu, Wei Gao, “Semantic Point Cloud Upsampling,” IEEE Transactions on Multimedia (TMM), 2022.

13. Xianghao Zang, Ge Li, Wei Gao, “Multi-dimension and Multi-scale Pyramid in Transformer for Video-based Pedestrian Retrieval,” IEEE Transactions on Industrial Informatics (TII), 2022.

14. Yang Guo, Wei Gao, Siwei Ma, Ge Li, “Accelerating Transform Algorithm Implementation for Efficient Intra Encoder of 8K UHD Videos,” ACM Transactions on Multimedia Computing, Communications, and Applications (TOMM), vol. 18, no. 4, pp. 1-20, 2022.

15. Zhenyu Peng, Qiuping Jiang, Feng Shao, Wei Gao, Weisi Lin, “LGGD+: Image Retargeting Quality Assessment by Measuring Local and Global Geometric Distortions,” IEEE Transactions on Circuits and Systems for Video Technology (TCSVT), September, 2021.

16. Fei Song, Yiting Shao, Wei Gao, Haiqiang Wang, and Thomas Li, “Layer-Wise Geometry Aggregation Framework for Lossless LiDAR Point Cloud Compression,” IEEE Transactions on Circuits and Systems for Video Technology (TCSVT), vol. 31, no. 12, pp. 4603-4616, Dec. 2021.

17. Yudong Mao, Qiuping Jiang, Runmin Cong, Wei Gao, Feng Shao, Sam Kwong, “Cross-modality Fusion and Progressive Integration Network for Saliency Prediction on Stereoscopic 3D Images,” IEEE Transactions on Multimedia (TMM), 2021.

18. Wei Gao, Guibiao Liao, Siwei Ma, Ge Li, Yongsheng Liang, and Weisi Lin, “Unified Information Fusion Network for Multi-Modal RGB-D and RGB-T Salient Object Detection,” IEEE Transactions on Circuits and Systems for Video Technology (TCSVT), vol. 32, no. 4, pp. 2091-2106, April 2022.

19. Wei Gao, Qiuping Jiang, Ronggang Wang, Siwei Ma, Ge Li, and Sam Kwong, “Consistent Quality Oriented Rate Control in HEVC via Balancing Intra and Inter Frame Coding,” IEEE Transactions on Industrial Informatics (TII), vol. 18, no. 3, pp. 1594-1604, March 2022.

20. Wei Gao, Linjie Zhou, and Lvfang Tao, “A Fast View Synthesis Implementation Method for Light Field Applications,” ACM Transactions on Multimedia Computing, Communications, and Applications (TOMM), vol. 17, no. 4, pp. 1-20, 2021.

21. Wei Gao, Sam Kwong, Qiuping Jiang, Chi-Keung Fong, Peter H. W. Wong, Wilson Y. F. Yuen, “Data-Driven Rate Control for Rate-Distortion Optimization in HEVC Based on Simplified Effective Initial QP Learning,” IEEE Transactions on Broadcasting (TBC), vol. 65, no. 1, pp. 94-108, March 2019.

22. Wei Gao, Sam Kwong, and Yuheng Jia, “Joint Machine Learning and Game Theory for Rate Control in High Efficiency Video Coding,” IEEE Transactions on Image Processing (TIP), vol. 26, no. 12, pp. 6074-6089, Dec. 2017.

23. Wei Gao, Sam Kwong, Yu Zhou, and Hui Yuan, “SSIM-Based Game Theory Approach for Rate-Distortion Optimized Intra Frame CTU-level Bit Allocation,” IEEE Transactions on Multimedia (TMM), vol. 18, no. 6, pp. 988-999, June 2016.

24. Wei Gao, Sam Kwong, Hui Yuan, and Xu Wang, “DCT Coefficient Distribution Modeling and Quality Dependency Analysis Based Frame-level Bit Allocation for HEVC,” IEEE Transactions on Circuits and Systems for Video Technology (TCSVT), vol. 26, no. 1, pp. 139-153, Jan. 2016.

25. Qiuping Jiang, Wei Gao, Shiqi Wang, Guanghui Yue, Feng Shao, Yo-Sung Ho, Sam Kwong, “Blind Image Quality Measurement by Exploiting High Order Statistics with Deep Dictionary Encoding Network,” IEEE Transactions on Instrumentation and Measurement (TIM), vol. 69, no. 10, pp. 7398-7410, Oct. 2020.

26. Mingliang Zhou, Xuekai Wei, Chi-Keung Fong, Peter H. W. Wong, Wilson Y. F. Yuen, Shiqi Wang, Sam Kwong, and Wei Gao, “SSIM-Based Global Optimization for CTU-Level Rate Control in HEVC,” IEEE Transactions on Multimedia (TMM), vol. 21, no. 8, pp. 1921-1933, Aug. 2019.

27. Qiuping Jiang, Feng Shao, Wei Gao, Zhuo Chen, and Gangyi Jiang, “Unified No-Reference Quality Assessment of Singly and Multiply Distorted Stereoscopic Images,” IEEE Transactions on Image Processing (TIP), vol. 28, no. 4, April 2019.

28. Yuheng Jia, Sam Kwong, Wenhui Wu, Ran Wang, and Wei Gao, “Sparse Bayesian Learning Based Kernel Poisson Regression,” IEEE Transactions on Cybernetics (TCYB), vol. 49, no. 1, Jan. 2019.

29. Hui Yuan, Sam Kwong, Xu Wang, Wei Gao, Yun Zhang, “Rate Distortion Optimized Inter View Frame Level Bit Allocation Method for MV-HEVC,” IEEE Transactions on Multimedia (TMM), vol. 17, no. 12, pp. 2134-2146, Dec. 2015.

30. Wenbo Zhao, Xianming Liu, Zhiwei Zhong, Junjun Jiang, Wei Gao, Ge Li, Xiangyang Ji, “Self-Supervised Arbitrary-Scale Point Clouds Upsampling via Implicit Neural Representation,” IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), New Orleans, Louisiana, June 21-24, 2022.

31. Chunyang Fu, Ge Li, Rui Song, Wei Gao, Shan Liu, “OctAttention: Octree-based Large-scale Contexts Model for Point Cloud Compression,” AAAI Conference on Artificial Intelligence (AAAI), Vancouver, Canada, February 22 to March 1, 2022.

32. Zhuangzi Li, Ge Li, Thomas Li, Shan Liu, Wei Gao, “Information-Growth Attention Network for Image Super-Resolution,” ACM International Conference on Multimedia (ACM MM), Chengdu, China, 2021.

33. Guibiao Liao, Wei Gao, Qiuping Jiang, Ronggang Wang, Ge Li, “MMNet: Multi-Stage and Multi-Scale Fusion Network for RGB-D Salient Object Detection,” ACM International Conference on Multimedia (ACM MM), Seattle, WA, USA, pp. 2436-2444, October 2020.


Chinese, U.S., and PCT patents (more than 50 filed or granted)

1. Systems and Methods for Rate Control in Video Coding using Joint Machine Learning and Game Theory, United States Patent, US10542262B2, Jan. 21, 2020.

2. Method for Initial Quantization Parameter Optimization in Video Coding, United States Patent, US10560696B2, Feb. 11, 2020.

3. Methods, Apparatus, and Computer Readable Storage Mediums for Determination of Neural Network Pruning, United States Patent, filed Dec. 9, 2021.

4. Methods, Apparatus, Devices, Mediums and Products for Object Detection Network Design, United States Patent, filed May 14, 2021.

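5. Method, Apparatus, Device, Medium, and Product for Constructing and Optimizing an Object Detection Network, PCT International Patent, PCT/CN2021/093911, May 14, 2021.

6. Neural Network Model Compression Method Based on Compressive Sensing, Device, and Storage Medium, PCT International Patent, WO2022000373A1, July 1, 2020.

7. Method, Apparatus, Device, and Storage Medium for Optimizing the Smoothness of Video Coding Quality, PCT International Patent, WO2020042177A1, March 5, 2020.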


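Courses taught:

(In recent years, the following two elective courses have been offered to graduate students in Computer Application Technology and have been well received.)

1. 3D Vision and Computational Photography (Fall Semester, elective)

2. Topics in Modern Video Processing (Spring Semester, elective)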


Basic requirements for prospective master's and Ph.D. students:

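1. Academic background: bachelor's or master's graduates in computer science, electronic information, automation, or other information science majors.

2. Foreign language / mathematics: College English Test Band 6 (CET-6).

3. Research and development ability: strong programming skills, with a capacity for exploration and innovation.

4. Other requirements: enthusiasm for and interest in research, and strong self-motivation.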