I am a Ph.D. candidate from the Nankai-Baidu Joint Laboratory, as well as Parallel and Distributed Software Technology Lab. As the principal investigator, we are conducting interdisciplinary research in the fields of high-performance computing, information security, and data compression. Our ultimate goal is to develop a high-performance compression framework tailored for large-scale biological data. I, along with Dr. HuiD Ma, Dr. HaoN Xie, Master YingF Zheng, and Master YanF Ding, have jointly initiated an open-source project called BioCompressor. All the developed algorithm components can be found on my GitHub Site.

🔥 News

  • 2024.05: 🎉 Our PQSDC work published in Bioinformatics (Top journal in computational biology)
  • 2023.12: 🎉 Our work of LRCB has been accepted by DCC-2024 (Top conference in data compression)
  • 2023.12: 🎉 Participating in ICPADS-2023 and giving an oral presentation
  • 2023.11: 🔥 The PMFFRC work has been published in BMC Bioinformatics

💻 Papers and Patents

My full papers and patents list are shown at my personal homepage.

🎙 Parallel and Security Data Compression

  • 2025 Genomics Data Lossless Compression with (s,k)-mer Encoding and Deep Neural Networks, Submitted to NIPS-2024 conference, (ranked 1st).
  • 2024 A Survey and Benchmark Evaluation for Neural-Network Based Lossless Universal Compressors Toward Multi-Source Data, Submitted to FCS journal, (ranked 1st).
  • 2024 PQSDC: A Novel Parallel Quality Scores Data Compressor via Sequences Partition and Run-length Prediction Mapping, Published in Bioinformatics, (ranked 1st).
  • 2024 LRCB: A Comprehensive Benchmark Evaluation of Reference-free Lossless Compression Tools for Genomics Sequencing Long Reads Data, Accepted by DCC 2024, (ranked 1st).
  • 2023 SR2C: A Structurally Redundant Short Reads Collapser for Optimizing DNA Data Compression, Published in ICPADS 2024, (ranked 1st).
  • 2023 PMFFRC: a large-scale genomic short reads compression optimizer via memory modeling and redundant clustering, Published in BMC Bioinformatics, (ranked 1st).
  • 2023 Parallel Algorithm for Sensitive Sequence Recognition from Long-read Genome Data with High Error Rate, Published in Journal on Communications, (ranked 2nd, Corresponding Author)
  • 2022 Recognizing Sensitive Sequences from Genomic Data with High Error Rate Integrating Filter and Similarity Calculation, Published in Journal of Chinese Computer Systems, (ranked 1st).

👄 Electronic Design Automation (EDA) Circuit Layout and Routing.

  • 2024 A Schematic Aesthetics Evaluation Algorithm Integrating Clustering and Convolutional Neural Networks, Accepted by Huazhong University of Science and Technology, (ranked 2nd).
  • 2024 Heuristic Placement and Routing Algorithm for Optimizing Logical Clarity of Schematics, Accepted by Huazhong University of Science and Technology, (ranked 4th).

📚 Other Co-authored Publications.

  • 2023 Paving the Way for Smart Cities and a Greener Future, Published in Energy and Buildings, (ranked 3rd).
  • 2023 ricME: Long-read based Mobile Element Variant Detection using Sequence Realignment and Identity Calculation, (ranked 3rd).
  • 2020 Credit Model and Algorithm for Multi-Dimensional Evaluating Responsible Subjects of Scientific and Technological Activities, (ranked 4th).

🧑‍🎨 Patents and Software Copyrights

  • 2023 A Heuristic Layout and Routing Method for Circuit Schematic Diagrams, CN Patent, CN117057302A, Gang Wang (supervisor), Hui Sun, XiaoG Liu, et al.
  • 2023 Method for Evaluating the Aesthetics Level of Circuit Layout, Routing, and Schematic Diagram, CN Patent, CN117058096A, (ranked 2nd).
  • 2023 A Parallel Optimization Method for High-Throughput Genomics Sequencing reads Data Compression, CN Patent, CN117059181A, (ranked 2nd).
  • 2023 A Parallel Compression Method for High-Throughput Genomic Sequencing Quality Score Data, CN Patent, CN117133365A, (ranked 2nd).
  • 2022 A Non-Intrusive Identification Method, System, Device, and Storage Medium for Multi-Parameter Identification of Electromagnetic Equipment Based on Ensemble Algorithms, CN Patent, CN113723495A, (ranked 3rd).
  • 2021 Research Integrity Evaluation Software System, Software Copyrights, 2021SR06838011, (ranked 2nd).
  • 2021 DNA Sensitive Sequence Filtering Software, Software Copyrights, 2021SR068771, (ranked 1st).
  • 2020 Improved RC4 Cryptography Auxiliary Teaching Platform, Software Copyrights, 2020SR0283880, (ranked 1st).

🎖 Honors

  • 2023-2024 Doctoral Graduate “GongNeng” Scholarship of Nankai University.
  • 2022-2023 Doctoral Graduate “GongNeng” Scholarship of Nankai University.
  • 2021-2022 Outstanding Graduate Student of Guangxi University.
  • 2021-2022 Annual Outstanding Student Scholarship of Guangxi University.
  • 2021-2022 First Prize Graduate Academic Scholarship of Guangxi University.
  • 2020-2021 First Prize Graduate Academic Scholarship of Guangxi University.
  • 2019-2020 Outstanding Graduate Cadre of Guangxi University.
  • 2017-2018 National Inspirational Scholarship of China.

💬 Competitions

  • 2023 Second Prize in the “Huawei Cup” National College Student Storage Modeling Competition, top 2 in China. | [Linkage]
  • 2020 Parallel Fund Award in the 8th National Parallel Application Challenge of the “Intel Cup”, top 9 in China. | [Linkage]
  • 2020 Third Prize in the Guangxi College Student Computer Design Competition.
  • 2019 First Prize for Outstanding Paper at the 2019 Conference of the Guangxi Computer Society.
  • 2017 First Prize in “Data Creation Cup” National College Student Mathematical Modeling Challenge.