I am a Ph.D. candidate at Nankai University (NKU) in China and a visiting student at Nanyang Technological University (NTU) in Singapore. My research areas include AI-based compression for models and data, parallel computing, and bioinformatics. I serve as a reviewer for ICASSP-25, IJCNN-25, and WWW-25, among others.

I am seeking positions as a postdoctoral researcher, as an assistant professor, or in industry. My current research focuses on models and data in the era of large models: I explore AI-driven methods that establish intrinsic connections across data, algorithms, and systems in order to rebuild compression systems. If you know of a relevant opportunity, please do not hesitate to contact me at sunh@nbjl.nankai.edu.cn. Thanks!

Papers and Patents

Lead-Author Publications

  • [11] Hui Sun$^\dagger$, Liping Yi$^\dagger$, Huidong Ma, Yongxia Sun, Yingfeng Zheng, Wenwen Cui, Meng Yan, Gang Wang$^{\star}$, Xiaoguang Liu$^{\star}$. Genomics data lossless compression with $(s,k)$-mer encoding and deep neural networks, Published in The 39th Annual AAAI Conference on Artificial Intelligence (AAAI), 2025, CCF-A.
  • [10] Huidong Ma$^\dagger$, Hui Sun$^\dagger$, Liping Yi, Yanfeng Ding, Xiaoguang Liu$^{\star}$, Gang Wang$^{\star}$. MSDZip: Universal Lossless Compression for Multi-source Data via Stepwise-parallel and Learning-based Prediction, Published in The Web Conference (WWW), 2025, CCF-A.
  • [9] Hui Sun$^\dagger$, Yanfeng Ding$^\dagger$, Liping Yi, Huidong Ma, Haonan Xie, Gang Wang$^{\star}$, Xiaoguang Liu$^{\star}$. Adaptive lossless compression for genomics data by multiple ($s, k$)-mer encoding and xLSTM, Published in 2025 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2025, CCF-B.
  • [8] Hui Sun, Yingfeng Zheng, Haonan Xie, Huidong Ma, Cheng Zhong, Meng Yan$^{\star}$, Xiaoguang Liu, Gang Wang. PQSDC: a novel parallel quality scores data compressor via sequences partition and run-length prediction mapping, Published in Bioinformatics journal, 2024, CCF-B, JCR-Q1, (top journal in the field).
  • [7] Hui Sun$^\dagger$, Huidong Ma$^\dagger$, Feng Ling, Haonan Xie, Yongxia Sun, Liping Yi, Meng Yan$^{\star}$, Cheng Zhong, Xiaoguang Liu, Gang Wang$^{\star}$. A survey and benchmark evaluation for neural-network-based lossless universal compressors toward multi-source data, Published in FCS journal, 2024, CCF-B, JCR-Q2.
  • [6] Hui Sun$^\dagger$, Huidong Ma$^\dagger$, Yingfeng Zheng, Haonan Xie, Cheng Zhong, Xiaoguang Liu$^{\star}$, Gang Wang$^{\star}$. LRCB: A comprehensive benchmark evaluation of reference-free lossless compression tools for genomics sequencing long reads data, Published in 2024 Data Compression Conference (DCC), CCF-B, EI.
  • [5] Cheng Zhong, Hui Sun$^{\star}$. Parallel algorithm for sensitive sequence identification in long-read genomic data with high error rates, Journal on Communications (in Chinese), 2023, CCF-A, EI.
  • [4] Hui Sun$^\dagger$, Huidong Ma$^\dagger$, Yingfeng Zheng$^\dagger$, Haonan Xie, Xiaofei Wang, Xiaoguang Liu$^{\star}$, Gang Wang$^{\star}$. SR2C: A structurally redundant short reads collapser for optimizing DNA data compression, Published in 2023 IEEE 29th International Conference on Parallel and Distributed Systems (ICPADS), CCF-C.
  • [3] Hui Sun, Yingfeng Zheng, Haonan Xie, Huidong Ma, Xiaoguang Liu$^{\star}$, Gang Wang$^{\star}$. PMFFRC: a large-scale genomic short reads compression optimizer via memory modeling and redundant clustering, Published in BMC Bioinformatics journal, 2023, CCF-C, JCR-Q2.
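  • [2] Hui Sun, Yanfeng Ding, Gang Wang$^{\star}$, Zhenrong Li. An aesthetics evaluation algorithm for schematic diagrams combining clustering and convolutional neural networks, Journal of Huazhong University of Science and Technology (in Chinese), 2024, EI.
  • [1] Hui Sun, Cheng Zhong$^{\star}$. Sensitive sequence identification in high-error-rate genomic data combining filtering and similarity calculation, Journal of Chinese Computer Systems (in Chinese), 2022, CCF-B.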

Co-Authored Publications

  • [5] Huidong Ma, Hui Sun, Liping Yi, Gang Wang$^{\star}$, Xiaoguang Liu$^{\star}$. Multi-source data lossless compression via parallel expansion mapping and xLSTM, Published in 2025 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2025, CCF-B.
  • [4] Huidong Ma, Cheng Zhong$^{\star}$, Hui Sun, Danyang Chen, Haixiang Lin. ricME: long-read based mobile element variant detection using sequence realignment and identity calculation, Published in International Symposium on Bioinformatics Research and Applications (ISBRA), 2023, CCF-C.
  • [3] Haonan Xie, Renhao Huang, Hui Sun, Zepeng Han, Meihui Jiang, Dongdong Zhang$^{\star}$, Hui Hwang Goh$^{\star}$, Tonni Agustiono Kurniawan, Fei Han, Hui Liu, Thomas Wu. Wireless energy: Paving the way for smart cities and a greener future, Energy and Buildings, 2023, JCR-Q1.
  • [2] Haonan Xie, Hui Hwang Goh$^{\star}$, Dongdong Zhang$^{\star}$, Hui Sun, Wei Dai, Tonni Agustiono Kurniawan, Dennis WL Wong, Kenneth Tze Kin Teo, Kai Chen Goh. Eco-Energetical analysis of circular economy and community-based virtual power plants (CE-cVPP): A systems engineering-engaged life cycle assessment (SE-LCA) method for sustainable renewable energy development, Applied Energy, 2024, JCR-Q1.
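  • [1] Yinhu Tang, Jun Liu$^{\star}$, Lin Wang, Hui Sun, Fengjiao Zhao, Cheng Zhong. A multi-dimensional credit evaluation model and algorithm for responsible entities in science and technology work, Journal of Guangxi University (in Chinese), 2021, CSCD Core.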

Patents and Software Copyrights

  • [8] A Heuristic Layout and Routing Method for Circuit Schematic Diagrams, CN Patent, CN117057302A, Gang Wang (supervisor), Hui Sun, Xiaoguang Liu, et al.
  • [7] Method for Evaluating the Aesthetics Level of Circuit Layout, Routing, and Schematic Diagram, CN Patent, CN117058096A, (ranked 2nd).
  • [6] A Parallel Optimization Method for High-Throughput Genomics Sequencing Reads Data Compression, CN Patent, CN117059181A, (ranked 2nd).
  • [5] A Parallel Compression Method for High-Throughput Genomic Sequencing Quality Score Data, CN Patent, CN117133365A, (ranked 2nd).
  • [4] A Non-Intrusive Identification Method, System, Device, and Storage Medium for Multi-Parameter Identification of Electromagnetic Equipment Based on Ensemble Algorithms, CN Patent, CN113723495A, (ranked 3rd).
  • [3] Research Integrity Evaluation Software System, Software Copyrights, 2021SR06838011, (ranked 2nd).
  • [2] DNA Sensitive Sequence Filtering Software, Software Copyrights, 2021SR068771, (ranked 1st).
  • [1] Improved RC4 Cryptography Auxiliary Teaching Platform, Software Copyrights, 2020SR0283880, (ranked 1st).

Honors

  • 2024-2025 First Prize Doctoral Graduate Scholarship of Nankai University
  • 2023-2024 Doctoral Graduate “GongNeng” Scholarship of Nankai University
  • 2022-2023 Doctoral Graduate “GongNeng” Scholarship of Nankai University
  • 2021-2022 Outstanding Graduate Student of Guangxi University
  • 2021-2022 Annual Outstanding Student Scholarship of Guangxi University
  • 2021-2022 First Prize Graduate Academic Scholarship of Guangxi University
  • 2020-2021 First Prize Graduate Academic Scholarship of Guangxi University
  • 2019-2020 Outstanding Graduate Cadre of Guangxi University
  • 2017-2018 National Inspirational Scholarship of China

Competitions

  • 2025 Computer Achievement Award of Guangxi Zhuang Autonomous Region, Key Participant, Second Prize
  • 2024 11th National Parallel Application Challenge, Team Leader, National Second Prize, Top-2 in China
  • 2024 Gold Prize in Tianjin’s ‘Challenge Cup’ Innovation and Entrepreneurship Plan, Technical Director, Top-1 in Tianjin
  • 2024 First Prize in Nankai University’s ‘President’s Cup’ Innovation and Entrepreneurship Competition, Technical Director
  • 2023 Second Prize in the “Huawei Cup” National College Student Storage Modeling Competition, Top-2 in China
  • 2020 Parallel Fund Award in the 8th National Parallel Application Challenge of the “Intel Cup”, Top-9 in China
  • 2020 Third Prize in the Guangxi College Student Computer Design Competition
  • 2019 First Prize for Outstanding Paper at the 2019 Conference of the Guangxi Computer Society
  • 2017 First Prize in “Data Creation Cup” National College Student Mathematical Modeling Challenge