HE Zhongjun, a Professor-level Senior Engineer and CCF Distinguished Member, has long been dedicated to the research and development of artificial intelligence, natural language processing, and machine translation.
He has been the Chair of Baidu Artificial Intelligence Technical Committee. He spearheaded the development of the world’s first internet-based neural machine translation (NMT) system and a semantic unit-driven machine simultaneous interpretation system. In recent years, he has focused his efforts on the research and development of Large Language Models (LLMs). As a core R&D member, he has contributed to the success of ERNIE-5.0 and led the development of its multi-modal capabilities, which ranked #1 in China and #8 globally on the Vision Arena (Visual Understanding) leaderboard in January, 2026.
He has published over 30 academic papers in leading conferences and journals and holds more than 160 granted invention patents. His contributions have been recognized with numerous honors, including the Second Prize of the National Scientific and Technological Progress Award, the First Prize of the Beijing Science and Technology Progress Award, the First Prize of the CIE Science and Technology Progress Award, the China Patent Silver Award. He has also been named a Beijing Youth Role Model and an Outstanding Science and Technology Professional of the Chinese Institute of Electronics.
Workshop
I’m the co-organizer of the Workshop on Automatic Simultaneous Translation.
Tutorial
- Tutorial in Simultaneous Translation at EMNLP 2020. [video]
Selected Publications
- Haifeng Wang, Hua Wu, Tian Wu, Yu Sun, Jing Liu, Dianhai Yu, Yanjun Ma, Jingzhou He, Zhongjun He, et al. ERNIE 5.0 Technical Report.
- Pengzhi Gao, Liwen Zhang, Zhongjun He, Hua Wu, Haifeng Wang. Learning Multilingual Sentence Representations with Cross-lingual Consistency Regularization. In Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing: Industry Track, pages 243–262.
- Pengzhi Gao, Liwen Zhang, Zhongjun He, Hua Wu, Haifeng Wang. Improving zero-shot multilingual neural machine translation by leveraging cross-lingual consistency regularization. In Findings of the Association for Computational Linguistics: ACL 2023, pages 12103–12119.
Zheng Fang, Ruiqing Zhang, Zhongjun He, Hua Wu, Yanan Cao. Non-Autoregressive Chinese ASR Error Correction with Phonological Training. In Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pages 5907-5917, Seattle, United States. Association for Computational Linguistics.
Pengzhi Gao, Zhongjun He, Hua Wu, Haifeng Wang. 2022. Bi-SimCut: A Simple Strategy for Boosting Neural Machine Translation. In Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pages 3938–3948, Seattle, United States. Association for Computational Linguistics.
Ruiqing Zhang, Zhongjun He, Hua Wu, Haifeng Wang. 2022. Learning Adaptive Segmentation Policy for End-to-End Simultaneous Translation. In Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 7862–7874, Dublin, Ireland. Association for Computational Linguistics.
Haifeng Wang, Hua Wu, Zhongjun He, Liang Huang, and Kenneth Ward Church. 2021. Progress in Machine Translation. Engineering.
- Ruiqing Zhang, Chao Pang, Chuanqiang Zhang, Shuohuan Wang, Zhongjun He, Yu Sun, Hua Wu and Haifeng Wang. 2021. Correcting Chinese Spelling Errors with Phonetic Pre-training. In Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021, pages 2250–2261, August 1–6, 2021.
- Ruiqing Zhang, Chuanqiang Zhang, Zhongjun He, Hua Wu, and Haifeng Wang. 2020. Learning Adaptive Segmentation Policy for Simultaneous Translation. In Proceedings of EMNLP 2020, pages 2280–2289, Online, November 16-20, 2020.
- Yuchen Liu, Jiajun Zhang, Hao Xiong, Long Zhou, Zhongjun He, Hua Wu, Haifeng Wang, and Chengqing Zong. 2020. Synchronous Speech Recognition and Speech-to-Text Translation with Interactive Decoding. In Proceedings of The Thirty-Fourth AAAI Conference on Artificial Intelligence (AAAI-20), pages 8417-8424, New York, USA, February 7-12.
- Tianchi Bi, Hao Xiong, Zhongjun He, Hua Wu, and Haifeng Wang. 2019. Multi-agent Learning for Neural Machine Translation. In Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, pages 856–865, Hong Kong, China, November 3–7, 2019.
- Yuchen Liu, Hao Xiong, Jiajun Zhang, Zhongjun He, Hua Wu, Haifeng Wang, and Chengqing Zong. 2019. End-to-End Speech Translation with Knowledge Distillation. In Proceedings of Interspeech 2019, pages 1128-1132, Graz, Austria, September 15–19, 2019.
- Meng Sun, Bojian Jiang, Hao Xiong, Zhongjun He, Hua Wu, and Haifeng Wang. 2019. Baidu Neural Machine Translation Systems for WMT19. In Proceedings of the Fourth Conference on Machine Translation (WMT), Volume 2: Shared Task Papers (Day 1), pages 374–381, Florence, Italy, August 1-2, 2019. (Ranked the 1st in Chinese-English Human Evaluation)
- Hao Xiong, Zhongjun He, Hua Wu, and Haifeng Wang. 2019. Modeling Coherence for Discourse Neural Machine Translation. In Proceedings of The Thirty-Third AAAI Conference on Artificial Intelligence (AAAI-19), pages 7338-7345, Hawaii, USA, January 27 - February 1, 2019.
- Yang Zhao, Jiajun Zhang, Chengqing Zong, Zhongjun He, and Hua Wu. 2019. Addressing the Under-translation Problem from the Entropy Perspective. In Proceedings of The Thirty-Third AAAI Conference on Artificial Intelligence (AAAI-19), pages 451-458, Hawaii, USA, January 27 - February 1, 2019.
- Hao Xiong, Zhongjun He, Xiaoguang Hu, and Hua Wu. 2018. Multi-channel Encoder for Neural Machine Translation. In Proceedings of The Thirty-Second AAAI Conference on Artificial Intelligence (AAAI-18), pages 4962-4969, New Orleans, USA, February 2-7, 2018.
- Yang Zhao, Jiajun Zhang, Zhongjun He, Chengqing Zong, and Hua Wu. 2018. Addressing Troublesome Words in Neural Machine Translation. In Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, pages 391–400, Brussels, Belgium, October 31 - November 4, 2018.
- Wei He, Zhongjun He, Hua Wu, and Haifeng Wang. 2016. Improved Neural Machine Translation with SMT Features. In Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence (AAAI-16), pages 151-157, Phoenix, USA, February 12–17, 2016.
- Shiqi Shen, Yong Cheng, Zhongjun He, Wei He, Hua Wu, Maosong Sun, and Yang Liu. 2016. Minimum Risk Training for Neural Machine Translation. In Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, pages 1683–1692, Berlin, Germany, August 7-12, 2016.
- Yong Cheng, Wei Xu, Zhongjun He, Wei He, Hua Wu, Maosong Sun, and Yang Liu. 2016. Semi-Supervised Learning for Neural Machine Translation. In Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, pages 1683–1692, Berlin, Germany, August 7-12, 2016.
- Yong Cheng, Shiqi Shen, Zhongjun He, Wei He, Hua Wu, Maosong Sun, and Yang Liu. 2016. Agreement-based Joint Training for Bidirectional Attention-based Neural Machine Translation. In Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence (IJCAI-16), pages 2761-2767, New York City, USA, July 9-15, 2016.
- Xiaoning Zhu, Zhongjun He, Hua Wu, Conghui Zhu, Haifeng Wang, and Tiejun Zhao. 2014. Improving Pivot-Based Statistical Machine Translation by Pivoting the Co-occurrence Frequency of Phrase Pairs. In Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP), pages 1665–1675, Doha, Qatar, October 25-29, 2014.
- Zhongjun He, Hua Wu, Haifeng Wang, Ting Liu. 2014. Transformation from Discontinuous to Continuous Word Alignment Improves Translation Quality. In Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP), pages 147–152, Doha, Qatar, October 25-29, 2014.
- Xiaoning Zhu, Zhongjun He, Hua Wu, Haifeng Wang, Conghui Zhu, Tiejun Zhao. 2013. Improving Pivot-Based Statistical Machine Translation Using Random Walk. In Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, pages 524–534, Seattle, Washington, USA, October 18-21, 2013.
- Zhongjun He, Qun Liu, and Shouxun Lin. 2008. Improving Statistical Machine Translation Using Lexicalized Rule Selection. In Proceedings of the 22nd International Conference on Computational Linguistics (Coling 2008), pages 321–328, Manchester, August 2008.
- Qun Liu, Zhongjun He, Yang Liu, and Shouxun Lin. 2008. Maximum entropy based rule selection model for syntax-based statistical machine translation. In Proceedings of the 2008 Conference on Empirical Methods in Natural Language Processing, pages 89–97, Honolulu, October, 2008.
Awards
- CIE Science and Technology Progress Award (First Prize), 2024
- Beijing Youth Role Model, 2021
- Beijing Science and Technology Progress Award (First Prize), 2020
- CIE Science and Technology Progress Award (First Prize), 2019
- The 1st place in WMT-19 (Chinese-English Track)
- CIE Outstanding Science and Technology Professional, 2018
- China Patent Silver Award, 2018
- National Scientific and Technological Progress Award (Second Prize), 2015
- CIE Science and Technology Progress Award (First Prize), 2014
Research Activity
- ACL Rolling Review: Action Editor
- Sponsorship Co-Chair: COLING 2022
- Remote Presentation Co-Chair: ACL-IJCNLP 2021
- Area Co-Chair: AACL-2020
