I am a Ph.D. candidate at the Nankai-Baidu Joint Laboratory and the Parallel and Distributed Software Technology Lab. As the principal investigator, I lead our interdisciplinary research in high-performance computing, information security, and data compression. Our ultimate goal is to develop a high-performance compression framework tailored for large-scale biological data. Together with Dr. HuiD Ma, Dr. HaoN Xie, Master YingF Zheng, and Master YanF Ding, I have initiated an open-source project called BioCompressor. All of the developed algorithm components can be found on my GitHub site.
🔥 News
- 2024.05: 🎉 Our PQSDC work was published in Bioinformatics (a top journal in computational biology)
- 2023.12: 🎉 Our LRCB work has been accepted by DCC-2024 (a top conference in data compression)
- 2023.12: 🎉 Participated in ICPADS-2023 and gave an oral presentation
- 2023.11: 🔥 The PMFFRC work has been published in BMC Bioinformatics
💻 Papers and Patents
The full list of my papers and patents is available on my personal homepage.
🎙 Parallel and Security Data Compression
2025
Genomics Data Lossless Compression with (s,k)-mer Encoding and Deep Neural Networks, Submitted to NIPS-2024 conference, (ranked 1st).
2024
A Survey and Benchmark Evaluation for Neural-Network Based Lossless Universal Compressors Toward Multi-Source Data, Submitted to FCS journal, (ranked 1st).
2024
PQSDC: A Novel Parallel Quality Scores Data Compressor via Sequences Partition and Run-length Prediction Mapping, Published in Bioinformatics, (ranked 1st).
2024
LRCB: A Comprehensive Benchmark Evaluation of Reference-free Lossless Compression Tools for Genomics Sequencing Long Reads Data, Accepted by DCC 2024, (ranked 1st).
2023
SR2C: A Structurally Redundant Short Reads Collapser for Optimizing DNA Data Compression, Published in ICPADS 2024, (ranked 1st).
2023
PMFFRC: a large-scale genomic short reads compression optimizer via memory modeling and redundant clustering, Published in BMC Bioinformatics, (ranked 1st).
2023
Parallel Algorithm for Sensitive Sequence Recognition from Long-read Genome Data with High Error Rate, Published in Journal on Communications, (ranked 2nd, Corresponding Author).
2022
Recognizing Sensitive Sequences from Genomic Data with High Error Rate Integrating Filter and Similarity Calculation, Published in Journal of Chinese Computer Systems, (ranked 1st).
👄 Electronic Design Automation (EDA) Circuit Layout and Routing.
2024
A Schematic Aesthetics Evaluation Algorithm Integrating Clustering and Convolutional Neural Networks, Accepted by Huazhong University of Science and Technology, (ranked 2nd).
2024
Heuristic Placement and Routing Algorithm for Optimizing Logical Clarity of Schematics, Accepted by Huazhong University of Science and Technology, (ranked 4th).
📚 Other Co-authored Publications.
2023
Paving the Way for Smart Cities and a Greener Future, Published in Energy and Buildings, (ranked 3rd).
2023
ricME: Long-read based Mobile Element Variant Detection using Sequence Realignment and Identity Calculation, (ranked 3rd).
2020
Credit Model and Algorithm for Multi-Dimensional Evaluating Responsible Subjects of Scientific and Technological Activities, (ranked 4th).
🧑🎨 Patents and Software Copyrights
2023
A Heuristic Layout and Routing Method for Circuit Schematic Diagrams, CN Patent, CN117057302A, Gang Wang (supervisor), Hui Sun, XiaoG Liu, et al.
2023
Method for Evaluating the Aesthetics Level of Circuit Layout, Routing, and Schematic Diagram, CN Patent, CN117058096A, (ranked 2nd).
2023
A Parallel Optimization Method for High-Throughput Genomics Sequencing Reads Data Compression, CN Patent, CN117059181A, (ranked 2nd).
2023
A Parallel Compression Method for High-Throughput Genomic Sequencing Quality Score Data, CN Patent, CN117133365A, (ranked 2nd).
2022
A Non-Intrusive Identification Method, System, Device, and Storage Medium for Multi-Parameter Identification of Electromagnetic Equipment Based on Ensemble Algorithms, CN Patent, CN113723495A, (ranked 3rd).
2021
Research Integrity Evaluation Software System, Software Copyrights, 2021SR06838011, (ranked 2nd).
2021
DNA Sensitive Sequence Filtering Software, Software Copyrights, 2021SR068771, (ranked 1st).
2020
Improved RC4 Cryptography Auxiliary Teaching Platform, Software Copyrights, 2020SR0283880, (ranked 1st).
🎖 Honors
2023-2024
Doctoral Graduate “GongNeng” Scholarship of Nankai University.
2022-2023
Doctoral Graduate “GongNeng” Scholarship of Nankai University.
2021-2022
Outstanding Graduate Student of Guangxi University.
2021-2022
Annual Outstanding Student Scholarship of Guangxi University.
2021-2022
First Prize Graduate Academic Scholarship of Guangxi University.
2020-2021
First Prize Graduate Academic Scholarship of Guangxi University.
2019-2020
Outstanding Graduate Cadre of Guangxi University.
2017-2018
National Inspirational Scholarship of China.
💬 Competitions
2023
Second Prize in the “Huawei Cup” National College Student Storage Modeling Competition, top 2 in China. | [Linkage]
2020
Parallel Fund Award in the 8th National Parallel Application Challenge of the “Intel Cup”, top 9 in China. | [Linkage]
2020
Third Prize in the Guangxi College Student Computer Design Competition.
2019
First Prize for Outstanding Paper at the 2019 Conference of the Guangxi Computer Society.
2017
First Prize in the “Data Creation Cup” National College Student Mathematical Modeling Challenge.