Publications

2024

  1. ATC’24
    88. Efficient Large Graph Processing with Chunk-Based Graph Representation Model
    Rui Wang, Weixu Zong, Shuibing He, Xinyu Chen, Zhenxin Li, and Zheng Dang
    In Proceedings of the USENIX Annual Technical Conference, 2024
  2. MSST’24
    87. Dissecting I/O Burstiness in Machine Learning Cloud Platform: A Case Study on Alibaba’s MLaaS
    Qiang Zou, Yuhui Deng, Yifeng Zhu, Yi Zhou, Jianghe Cai, and Shuibing He
    In Proceedings of the 38th International Conference on Massive Storage Systems and Technology, 2024
  3. TOCS’24
    86. PMAlloc: A Holistic Approach to Improving Persistent Memory Allocation
    Zheng Dang, Shuibing He, Xuechen Zhang, Peiyi Hong, Zhenxin Li, Xinyu Chen, Haozhe Song, Xian-He Sun, and Gang Chen
    ACM Transactions on Computer Systems, 2024
  4. EuroSys’24
    85. CCL-BTree: A Crash-Consistent Locality-Aware B+-Tree for Reducing XPBuffer-Induced Write Amplification in Persistent Memory
    Zhenxin Li, Shuibing He, Zheng Dang, Peiyi Hong, Xuechen Zhang, Rui Wang, and Fei Wu
    In Proceedings of the European Conference on Computer Systems, 2024

2023

  1. TPDS
    84. APQ: Automated DNN Pruning and Quantization for ReRAM-based Accelerators
    Siling Yang, Shuibing He, Hexiao Duan, Weijian Chen, Xuechen Zhang, Tong Wu, and Yanlong Yin
    IEEE Transactions on Parallel and Distributed Systems, 2023
  2. SC’23
    83. Efficient Maximal Biclique Enumeration on GPUs
    Zhe Pan, Shuibing He, Xu Li, Xuechen Zhang, Rui Wang, and Gang Chen
    In the ACM/IEEE International Conference for High Performance Computing, Networking, Storage and Analysis, 2023
  3. ASPLOS’23
    82. Mapping Very Large Scale Spiking Neuron Network to Neuromorphic Hardware
    Ouwen Jin, Qinghui Xing, Ying Li, Shuiguang Deng, Shuibing He, and Gang Pan
    In 28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Volume 3, 2023
  4. HPCA’23
    81. iCache: An Importance-Sampling-Informed Cache for Accelerating I/O-Bound DNN Model Training
    Weijian Chen, Shuibing He, Yaowen Xu, Xuechen Zhang, Siling Yang, Shuang Hu, Xian-He Sun, and Gang Chen
    In 2023 IEEE International Symposium on High-Performance Computer Architecture, 2023
  5. HPCA’23
    80. BM-Store: A Transparent and High-Performance Local Storage Architecture for Bare-Metal Clouds Enabling Large-Scale Deployment
    Yiquan Chen, Jiexiong Xu, Chengkun Wei, Yijing Wang, Xin Yuan, Yangming Zhang, Xulin Yu, Yi Chen, Zeke Wang, Shuibing He, and Wenzhi Chen
    In 2023 IEEE International Symposium on High-Performance Computer Architecture, 2023
  6. TC
    79. HOME: A Holistic GPU Memory Management Framework for Deep Learning
    Shuibing He, Ping Chen, Shuaiben Chen, Zheng Li, Siling Yang, Weijian Chen, and Lidan Shou
    IEEE Transactions on Computers, 2023
  7. CPE
    68. Accelerating Real-Time Object Detection in High-Resolution Video Surveillance
    Yuefeng Wang, Kuang Mao, Tong Chen, Yanlong Yin, Shuibing He, and Gang Chen
    Concurrency and Computation: Practice and Experience, 2023
  8. CLUS
    78. Workload Time Series Prediction in Storage Systems: A Deep Learning Based Approach
    Li Ruan, Yu Bai, Shaoning Li, Shuibing He, and Limin Xiao
    Cluster Computing, 2023

2022

  1. MICRO’22
    77. XPGraph: XPline-Friendly Persistent Memory Graph Stores for Large-Scale Evolving Graphs
    Rui Wang, Shuibing He, Weixu Zong, Yongkun Li, and Yinlong Xu
    In 2022 55th IEEE/ACM International Symposium on Microarchitecture, 2022
  2. ASPLOS’22
    76. NVAlloc: Rethinking Heap Metadata Management in Persistent Memory Allocators
    Zheng Dang, Shuibing He, Peiyi Hong, Zhenxin Li, Xuechen Zhang, Xian-He Sun, and Gang Chen
    In 27th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2022
  3. TPDS
    75. Accelerating Tensor Swapping in GPUs With Self-Tuning Compression
    Ping Chen, Shuibing He, Xuechen Zhang, Shuaiben Chen, Peiyi Hong, Yanlong Yin, and Xian-He Sun
    IEEE Transactions on Parallel and Distributed Systems, 2022
  4. NAS’22
    74. WAFLASH: Taming Unaligned Writes in Solid-State Disks
    Shuibing He, Matthew Myers, Xuehao Duan, Keegan Sanchez, and Xuechen Zhang
    In 2022 IEEE International Conference on Networking, Architecture and Storage, 2022
  5. TPDS
    73. PHAST: Hierarchical Concurrent Log-Free Skip List for Persistent Memory
    Zhenxin Li, Bing Jiao, Shuibing He, and Weikuan Yu
    IEEE Transactions on Parallel and Distributed Systems, 2022
  6. TOS
    72. Toward Fast and Scalable Random Walks over Disk-Resident Graphs via Efficient I/O Management
    Rui Wang, Yongkun Li, Yinlong Xu, Hong Xie, John C. S. Lui, and Shuibing He
    ACM Transactions on Storage, 2022

2021

  1. CLUSTER’21
    71. CSWAP: A Self-Tuning Compression Framework for Accelerating Tensor Swapping in GPUs
    Ping Chen, Shuibing He, Xuechen Zhang, Shuaiben Chen, Peiyi Hong, Yanlong Yin, Xian-He Sun, and Gang Chen
    In 2021 IEEE International Conference on Cluster Computing, 2021
  2. ICPP’21
    70. A Novel Multi-CPU/GPU Collaborative Computing Framework for SGD-Based Matrix Factorization
    Yizhi Huang, Yanlong Yin, Yan Liu, Shuibing He, Yang Bai, and Renfa Li
    In 50th International Conference on Parallel Processing, 2021
  3. ICS’21
    69. AUTO-PRUNE: Automated DNN Pruning and Mapping for ReRAM-Based Accelerator
    Siling Yang, Weijian Chen, Xuechen Zhang, Shuibing He, Yanlong Yin, and Xian-He Sun
    In 2021 International Conference on Supercomputing, 2021

2020

  1. TPDS
    67. Sova: A Software-Defined Optimization Framework for Virtual Network Allocations
    Zhiyong Ye, Yang Wang, Shuibing He, Chengzhong Xu, and Xian-He Sun
    IEEE Transactions on Parallel and Distributed Systems, 2020
  2. ICS’20
    66. Compiler Aided Checkpointing using Crash-Consistent Data Structures in NVMM Systems
    Tyler Coy, Shuibing He, Bin Ren, and Xuechen Zhang
    In 2020 International Conference on Supercomputing, 2020
  3. JCSC
    65. Application and Storage-Aware Data Placement and Job Scheduling for Hadoop Clusters
    Tao Li, Shuibing He, Ping Chen, Siling Yang, Yanlong Yin, and Cheng Xu
    Journal of Circuits, Systems and Computers, 2020
  4. VTC’20
    64. Budget Feasible Roadside Unit Allocation Mechanism in Vehicular Ad-Hoc Networks
    Xiaohua Xu, Shuibing He, Meng Han, Reza M. Parizi, and Gautam Srivastava
    In 2020 IEEE 91st Vehicular Technology Conference, 2020

2019

  1. TC
    63. PRS: A Pattern-Directed Replication Scheme for Heterogeneous Object-Based Storage
    Jiang Zhou, Yong Chen, Wei Xie, Dong Dai, Shuibing He, and Weiping Wang
    IEEE Transactions on Computers, 2019
  2. TPDS
    62. A Holistic Heterogeneity-Aware Data Placement Scheme for Hybrid Parallel I/O Systems
    Shuibing He, Zheng Li, Jiang Zhou, Yanlong Yin, Xiaohua Xu, Yong Chen, and Xian-He Sun
    IEEE Transactions on Parallel and Distributed Systems, 2019
  3. TC
    61. Optimizing Parallel I/O Accesses Through Pattern-Directed and Layout-Aware Replication
    Shuibing He, Yanlong Yin, Xian-He Sun, Xuechen Zhang, and Zongpeng Li
    IEEE Transactions on Computers, 2019
  4. TPDS
    60. A Highly Reliable Metadata Service for Large-Scale Distributed File Systems
    Jiang Zhou, Yong Chen, Weiping Wang, Shuibing He, and Dan Meng
    IEEE Transactions on Parallel and Distributed Systems, 2019
  5. CLUSTER’19
    59. DP_Greedy: A Two-Phase Caching Algorithm for Mobile Cloud Services
    Dong Huang, Xiaopeng Fan, Yang Wang, Shuibing He, and Chengzhong Xu
    In 2019 IEEE International Conference on Cluster Computing, 2019
  6. NAS’19
    58. Towards Cluster-Wide Deduplication Based on Ceph
    Jinpeng Wang, Yang Wang, Hekang Wang, Kejiang Ye, Chengzhong Xu, Shuibing He, and Lingfang Zeng
    In 2019 IEEE International Conference on Networking, Architecture and Storage, 2019
  7. ICMIC’19
    57. Delay Efficient D2D Communications Over 5G Edge-Computing Mobile Networks
    Xiaohua Xu, Yuanfang Chen, Yanxiao Zhao, Shuibing He, and Houbing Song
    In 11th International Conference on Modelling, Identification and Control, 2019
  8. ICPP’19
    56. Integration of Appends and Merges in Log-Structured Merge Trees
    Caixin Gong, Shuibing He, Yili Gong, and Yingchun Lei
    In 48th International Conference on Parallel Processing, 2019
  9. WASA’19
    55. OWL: Fast Opportunistic Wireless Link Scheduling with SINR Constraints
    Shuibing He, Xiaohua Xu, and Patrick Otoo Bobbie
    In the 14th International Conference on Wireless Algorithms, Systems, and Applications, 2019
  10. JSA
    54. Run-Time Timing Prediction for System Reconfiguration on Many-core Embedded Systems
    Zheng Li and Shuibing He
    Journal of Systems Architecture, 2019

2018

  1. TPDS
    53. On Cost-Driven Collaborative Data Caching: A New Model Approach
    Yang Wang, Shuibing He, Xiaopeng Fan, Chengzhong Xu, and Xian-He Sun
    IEEE Transactions on Parallel and Distributed Systems, 2018
  2. TC
    52. A Cost-Effective Distribution-Aware Data Replication Scheme for Parallel I/O Systems
    Shuibing He and Xian-He Sun
    IEEE Transactions on Computers, 2018
  3. CMS
    51. Cluster-Based Niching Differential Evolution Algorithm for Optimizing the Stable Structures of Metallic Clusters
    Yuan-Hua Yang, Xian-Bin Xu, Shuibing He, Jin-Bo Wang, and Yu-Hua Wen
    Computational Materials Science, 2018
  4. SOCO
    50. Improving File Locality in Multi-Keyword Top-k Search Based on Clustering
    Lanxiang Chen, Nan Zhang, Kuan-Ching Li, Shuibing He, and Linbing Qiu
    Soft Computing, 2018
  5. DAAC’18
    49. Workload Time Series Prediction in Storage Systems: A Deep Learning Based Approach
    Li Ruan, Yu Bai, Shaoning Li, Shuibing He, and Limin Xiao
    In the 2nd International Industry/University Workshop on Data-center Automation, Analytics, and Control, 2018
  6. NPC’18
    48. KT-Store: A Key-Order and Write-Order Hybrid Key-Value Store with High Write and Range-Query Performance
    Haobo Wang, Yinliang Yue, Shuibing He, and Weiping Wang
    In 15th IFIP International Conference on Network and Parallel Computing, 2018
  7. IPDPS’18
    47. A Migratory Heterogeneity-Aware Data Layout Scheme for Parallel File Systems
    Shuibing He, Xian-He Sun, Yang Wang, and Chengzhong Xu
    In 2018 IEEE International Parallel and Distributed Processing Symposium, 2018
  8. HPSC’18
    46. Timing Prediction for Dynamic Application Migration on Multi-Core Embedded Systems
    Zheng Li, Hao Wu, and Shuibing He
    In IEEE International Conference on High Performance and Smart Computing, 2018

2017

  1. TECS
    45. Fixed-Priority Scheduling for Two-Phase Mixed-Criticality Systems
    Zheng Li and Shuibing He
    ACM Transactions on Embedded Computing Systems, 2017
  2. Computer Science
    44. Performance Model of Sparse Matrix-Vector Multiplication on GPU
    Yang Wang, Bharadwaj Veeravalli, Chen-Khong Tham, Shuibing He, and Chengzhong Xu
    Computer Science, 2017
  3. TAAS
    43. On Service Migrations in the Cloud for Mobile Accesses: A Distributed Approach
    Yang Wang, Bharadwaj Veeravalli, Chen-Khong Tham, Shuibing He, and Chengzhong Xu
    ACM Transactions on Autonomous and Adaptive Systems, 2017
  4. TPDS
    42. Cost-Aware Region-Level Data Placement in Multi-Tiered Parallel I/O Systems
    Shuibing He, Yang Wang, Zheng Li, Xian-He Sun, and Chengzhong Xu
    IEEE Transactions on Parallel and Distributed Systems, 2017
  5. TC
    41. Heterogeneity-Aware Collective I/O for Parallel I/O Systems with Hybrid HDD/SSD Servers
    Shuibing He, Yang Wang, Xian-He Sun, Chuanhe Huang, and Chengzhong Xu
    IEEE Transactions on Computers, 2017
  6. TC
    40. HARL: Optimizing Parallel File Systems with Heterogeneity-Aware Region-Level Data Layout
    Shuibing He, Yang Wang, Xian-He Sun, and Chengzhong Xu
    IEEE Transactions on Computers, 2017
  7. TPDS
    39. Using MinMax-Memory Claims to Improve In-Memory Workflow Computations in the Cloud
    Shuibing He, Yang Wang, Xian-He Sun, and Chengzhong Xu
    IEEE Transactions on Parallel and Distributed Systems, 2017
  8. ISPA’17
    38. MCS-B: An Energy-Efficient Storage System for Astronomical Observation Data Based on Logical Block Replacement Strategy
    Chao Sun, Shanjiang Tang, Zichao Yuan, Ce Yu, Jian Xiao, Jizhou Sun, and Shuibing He
    In 2017 IEEE International Symposium on Parallel and Distributed Processing with Applications and 2017 IEEE International Conference on Ubiquitous Computing and Communications (ISPA/IUCC), 2017
  9. ICPP’17
    37. Data Caching in Next Generation Mobile Cloud Services, Online vs. Off-Line
    Yang Wang, Shuibing He, Xiaopeng Fan, Chengzhong Xu, Joseph Culberson, and Joseph Horton
    In 2017 46th International Conference on Parallel Processing (ICPP), 2017
  10. EUC’17
    36. Prediction Based Run-Time Reconfiguration on Many-Core Embedded Systems
    Zheng Li, Shuibing He, and Li Wang
    In 2017 IEEE International Conference on Embedded and Ubiquitous Computing (EUC), 2017

2016

  1. TPDS
    35. Improving Performance of Parallel I/O Systems through Selective and Layout-Aware SSD Cache
    Shuibing He, Yang Wang, and Xian-He Sun
    IEEE Transactions on Parallel and Distributed Systems, 2016
  2. IJHPCA
    34. Enhancing Hybrid Parallel File System through Performance and Space-Aware Data Layout
    Shuibing He, Yan Liu, Yang Wang, Xian-He Sun, and Chuanhe Huang
    The International Journal of High Performance Computing Applications, 2016
  3. IJES
    33. MGPA: A Multi-Granularity Space Pre-Allocation Algorithm for Object-Based Storage Devices
    Shuibing He, Yuanhua Yang, Xianbin Xu, and Xiaohua Xu
    International Journal of Embedded Systems, 2016
  4. WHUJNS
    32. Capability-Aware Data Placement for Heterogeneous Active Storage Systems
    Xiangyu Li, Shuibing He, Xianbin Xu, and Yang Wang
    Wuhan University Journal of Natural Sciences, 2016
  5. IJDTA
    31. JAS: JVM-Based Active Storage Framework for Object-based Storage Systems
    Xiangyu Li, Shuibing He, Xianbin Xu, and Yang Wang
    International Journal of Database Theory and Application, 2016
  6. IJGDC
    30. Skewed Data Distribution for Active Storage Systems on Hybrid Servers
    Xiangyu Li, Shuibing He, and Xianbin Xu
    International Journal of Grid and Distributed Computing, 2016
  7. ICPADS’16
    29. On Autonomous Service Migrations in the Cloud for Mobile Accesses
    Yang Wang, Shuibing He, Fuji Ren, Lujia Wang, and Chengzhong Xu
    In 22nd International Conference on Parallel and Distributed Systems, 2016
  8. ICDCS’16
    28. On MinMax-Memory Claims for Scientific Workflows in the In-memory Cloud Computing
    Yang Wang, Chengzhong Xu, Shuibing He, and Xian-He Sun
    In 36th International Conference on Distributed Computing Systems, 2016
  9. TPDS
    27. Boosting Parallel File System Performance with Heterogeneity-Aware Selective Data Layout
    Shuibing He, Yang Wang, and Xian-He Sun
    IEEE Transactions on Parallel and Distributed Systems, 2016

2015

  1. ICPP’15
    26. A Heterogeneity-Aware Region-Level Data Layout for Hybrid Parallel File Systems
    Shuibing He, Xian-He Sun, Yang Wang, Antonis Kougkas, and Adnan Haider
    In 2015 44th International Conference on Parallel Processing, 2015
  2. IPDPS’15
    25. HAS: Heterogeneity-Aware Selective Data Layout Scheme for Parallel File Systems on Hybrid Servers
    Shuibing He, Xian-He Sun, and Adnan Haider
    In 2015 IEEE International Parallel and Distributed Processing Symposium, 2015
  3. HiPC’15
    24. IC-Data: Improving Compressed Data Processing in Hadoop
    Adnan Haider, Xi Yang, Ning Liu, Xian-He Sun, and Shuibing He
    In 2015 IEEE 22nd International Conference on High Performance Computing, 2015

2014

  1. ICDCS’14
    23. S4D-Cache: Smart Selective SSD Cache for Parallel I/O Systems
    Shuibing He, Xian-He Sun, and Bo Feng
    In 2014 IEEE 34th International Conference on Distributed Computing Systems, 2014
  2. DISC’14
    22. PSA: A Performance and Space-Aware Data Layout Scheme for Hybrid Parallel File Systems
    Shuibing He, Yan Liu, and Xian-He Sun
    In 2014 International Workshop on Data Intensive Scalable Computing Systems, 2014
  3. PDSW’14
    21. HPIS3: Towards a High-Performance Simulator for Hybrid Parallel I/O and Storage Systems
    Bo Feng, Ning Liu, Shuibing He, and Xian-He Sun
    In 2014 9th Parallel Data Storage Workshop, 2014
  4. ICA3PP’14
    20. Performance-Aware Data Placement in Hybrid Parallel File Systems
    Shuibing He, Xian-He Sun, Bo Feng, and Kun Feng
    In Algorithms and Architectures for Parallel Processing, 2014
  5. AMR
    19. A Statistics-Based Data Placement Strategy for Hybrid Storage
    Yuan Hua Yang, Xian Bin Xu, Shuibing He, and Yu Hua Wen
    Advanced Materials Research, 2014

2013

  1. JCRD
    18. Parallel Acceleration and Performance Optimization for GRAPES Model Based on GPU
    Zhuowei Wang, Xianbin Xu, Wuqing Zhao, Shuibing He, and Yuping Zhang
    Journal of Computer Research and Development, 2013
  2. JC
    17. GPU-Based Parallel Researches on RRTM Module of GRAPES Numerical Prediction System
    Fang Zheng, Xianbin Xu, Dongdong Xiang, Zhuowei Wang, Ming Xu, and Shuibing He
    Journal of Computers, 2013
  3. AMR
    16. WLVT: A Static Wear-Leveling Algorithm with Variable Threshold
    Yuan Hua Yang, Xian Bin Xu, Shuibing He, Fang Zheng, and Yu Ping Zhang
    Advanced Materials Research, 2013
  4. HPDIC’13
    15. BPS: A Performance Metric of I/O System
    Shuibing He, Xian-He Sun, and Yanlong Yin
    In 2013 IEEE International Symposium on Parallel & Distributed Processing, Workshops and Phd Forum, 2013
  5. CLUSTER’13
    14. A Cost-Aware Region-Level Data Placement Scheme for Hybrid Parallel I/O Systems
    Shuibing He, Xian-He Sun, Bo Feng, Xin Huang, and Kun Feng
    In 2013 IEEE International Conference on Cluster Computing, 2013
  6. IEA
    13. Parallel Optimization for Sparse Matrix–Vector on GPU
    Meng Jia Yin, Xian Bin Xu, Hua Chen, Shuibing He, and Jing Hu
    In the International Conference on Information Engineering and Applications: Volume 2, 2013

2012

  1. IJCIS
    12. Oasa: An Active Storage Architecture for Object-based Storage System
    Shuibing He, Xianbin Xu, and Yuanhua Yang
    International Journal of Computational Intelligence Systems, 2012
  2. JTAIT
    11. Optimizing Sparse Matrix-Vector Multiplication Based on GPU
    Mengjia Yin, Tao Zhang, Xianbin Xu, Jin Hu, and Shuibing He
    Journal of Theoretical and Applied Information Technology, 2012

2011

  1. ICCIS’11
    10. Accelerating Biological Sequence Alignment Algorithm on GPU with CUDA
    Fang Zheng, Xianbin Xu, Yuanhua Yang, Shuibing He, and Yuping Zhang
    In 2011 International Conference on Computational and Information Sciences, 2011
  2. BMEI’11
    9. Communication-Aware Task Scheduling for Multi-Core Architectures with Segmented Buses
    Yuping Zhang, Xianbin Xu, Yuanhua Yang, Shuibing He, and Zimian Hao
    In 2011 4th International Conference on Biomedical Engineering and Informatics, 2011

2010

  1. SKG’10
    8. Improve the Performance of Data Grids by Value-Based Replication Strategy
    Wuqing Zhao, Xianbin Xu, Zhuowei Wang, Yuping Zhang, and Shuibing He
    In 2010 Sixth International Conference on Semantics, Knowledge and Grids, 2010
  2. ITA’10
    7. A Dynamic Optimal Replication Strategy in Data Grid Environment
    Wuqing Zhao, Xianbin Xu, Zhuowei Wang, Yuping Zhang, and Shuibing He
    In 2010 International Conference on Internet Technology and Applications, 2010
  3. ICETC’10
    6. Optimizing Sparse Matrix-Vector Multiplication on CUDA
    Zhuowei Wang, Xianbin Xu, Wuqing Zhao, Yuping Zhang, and Shuibing He
    In 2010 2nd International Conference on Education Technology and Computer, 2010

2009

  1. IJDSN
    5. A Rule-Based Prefetching Approach for Object-Based Storage Device
    Shuibing He, Dan Feng, Chunhua Li, and Yanli Yuan
    International Journal of Distributed Sensor Networks, 2009
  2. ACSNS’09
    4. A Rule-Based Prefetching Approach for Object-Based Storage Device
    Shuibing He and Dan Feng
    In the International Symposium on Advances in Computer and Sensor Networks and Systems, 2009

2008

  1. SIGOPS OSR
    3. Design of An Object-Based Storage Device Based on I/O Processor
    Shuibing He and Dan Feng
    ACM SIGOPS Operating Systems Review/The 1st International Workshop on Storage and I/O Virtualization, Performance, Energy, Evaluation and Dependability (SPEED), 2008

2007

  1. NAS’07
    2. An Object-based Storage Controller Based on Switch Fabric
    Shuibing He and Dan Feng
    In 2007 International Conference on Networking, Architecture, and Storage, 2007
  2. SNAPI’07
    1. Implementation and Performance Evaluation of An Object-Based Storage Device
    Shuibing He and Dan Feng
    In Fourth International Workshop on Storage Network Architecture and Parallel I/Os, 2007