Publications | Shuibing He

2026

ACL’26

118. SpiderFlow: Efficient Topology-Aware Scheduling for LLM Training Across Decentralized GPU Clusters

Zihan Chang, Shuibing He, Bo Zhou, Sheng Xiao, Siling Yang, Rui Wang, and Zhe Pan

In Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics, 2026

PDF

2025

TCAD’25

117. SNNcut: An Efficient Partitioning Method for Large-Scale Spiking Neural Networks Using Spike-Sharing

Qinghui Xing, Ouwen Jin, Zhuo Chen, Xin Du, Ming Zhang, Shuiguang Deng, Shuibing He, Ying Li, and Gang Pan

IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, 2025

PDF
arXiv’25

116. Heimdall++: Optimizing GPU Utilization and Pipeline Parallelism for Efficient Single-Pulse Detection

Bingzheng Xia, Zujie Ren, Kuang Ma, Xiaoqian Li, Wenda Li, and Shuibing He

2025

PDF
TNSM’25

115. Analyzing Request Volatility in Cloud-based Machine Learning: Insights from Alibaba’s Machine Learning as a Service Platform

Qiang Zou, Yuhui Deng, Yifeng Zhu, Yi Zhou, Jianghe Cai, Shuibing He, and Lina Ge

IEEE Transactions on Network and Service Management, 2025

PDF
TC’25

114. DepAsync: An Asynchronous SNN Accelerator Based on Core-Dependency

Zhuo Chen, De Ma, Xiaofei Jin, Qinghui Xing, Ouwen Jin, Xin Du, Shuibing He, and Gang Pan

IEEE Transactions on Computers, 2025

PDF
ICPADS’25

113. An Efficient Server-Side Prefetching Scheme to Optimize Performance of Distributed File Systems

Yong Li, Shuibing He, Qian Zhao, Zhan Shi, Yi Qin, Weixu Zong, Peng Xu, and Lingfang Zeng

In Proceedings of the 31st IEEE International Conference on Parallel and Distributed Systems, 2025

PDF
TKDE’25

112. A Comprehensive Survey of Dynamic Graph Neural Networks: Models, Frameworks, Benchmarks, Experiments and Challenges

Zhengzhao Feng, Rui Wang, Tianxing Wang, Mingli Song, Sai Wu, and Shuibing He

IEEE Transactions on Knowledge and Data Engineering, 2025

PDF
arXiv’25

111. CRouting: Reducing Expensive Distance Calls in Graph-Based Approximate Nearest Neighbor Search

Zhenxin Li, Shuibing He, Jiahao Guo, Xuechen Zhang, Xian-He Sun, and Gang Chen

2025

PDF
TKDE’25

110. Efficient Distributed Graph Neural Network Training with Source Chunking and Moving Aggregation

Wenjie Huang, Tongya Zheng, Rui Wang, Tongtian Zhu, Bingde Hu, Shuibing He, Mingli Song, Xinyu Wang, Sai Wu, and Chun Chen

IEEE Transactions on Knowledge and Data Engineering, 2025

PDF
TPDS’25

109. Mapping Large-Scale Spiking Neural Network on Arbitrary Meshed Neuromorphic Hardware

Ouwen Jin, Qinghui Xing, Zhuo Chen, Ming Zhang, De Ma, Ying Li, Xin Du, Shuibing He, Shuiguang Deng, and Gang Pan

IEEE Transactions on Parallel and Distributed Systems, 2025

PDF
arXiv’25

108. Adacc: An Adaptive Framework Unifying Compression and Activation Recomputation for LLM Training

Ping Chen, Zhuohong Deng, Ping Li, Shuibing He, Hongzi Zhu, Yi Zheng, Zhefeng Wang, Baoxing Huai, and Minyi Guo

2025

PDF
VLDB’25

107. Effective and Efficient Distributed Temporal Graph Learning through Hotspot Memory Sharing

Longjiao Zhang, Rui Wang, Tongya Zheng, Ziqi Huang, Wenjie Huang, Xinyu Wang, Can Wang, Mingli Song, Sai Wu, and Shuibing He

In Proceedings of the VLDB Endowment, 2025

PDF
TC’25

106. IMPACT: Importance-Informed Prefetching and Caching for I/O-Bound DNN Training

Weijian Chen, Shuibing He, Ruidong Zhang, Xuechen Zhang, Ping Chen, Siling Yang, Haoyang Qu, and Xuan Zhan

IEEE Transactions on Computers, 2025

PDF
TC’25

105. Advanced Maximal Biclique Enumeration on GPUs Using Bitmaps

Zhe Pan, Shuibing He, Xu Li, Xuechen Zhang, Rui Wang, Yanlong Yin, and Gang Chen

IEEE Transactions on Computers, 2025

PDF
FAST’25-Poster

104. Orchestrating GPU Memory for LLM Training on Heterogeneous Clusters

Yi Zhang, Shuibing He, and Ping Chen

In Proceedings of the 23rd USENIX Conference on File and Storage Technologies (WiPs and Posters), 2025

PDF Slides
TOS’25

103. An Efficient Delta Compression Framework Seamlessly Integrated into Inline Deduplication

Yucheng Zhang, Wenbin Zeng, Hong Jiang, Dan Feng, Zichen Xu, and Shuibing He

ACM Transactions on Storage, 2025

PDF
TOS’25

102. Scalable and High-Performance Large-Scale Dynamic Graph Storage and Processing System

Rui Wang, Weixu Zong, Shuibing He, Yongkun Li, and Yinlong Xu

ACM Transactions on Storage, 2025

PDF
IoT’25

101. Multi-Level and Energy-Efficient Partial Computation Offloading in Heterogeneous Edge Intelligence

Baoyu Xu, Yancheng Ruan, Chenghu Qiu, Shuibing He, Feng Shu, Xiaoyang Kang, and Lihua Zhang

IEEE Internet of Things Journal, 2025

PDF
FAST’25

100. LeapGNN: Accelerating Distributed GNN Training Leveraging Feature-Centric Model Migration

Weijian Chen, Shuibing He, Haoyang Qu, and Xuechen Zhang

In Proceedings of the 23rd USENIX Conference on File and Storage Technologies, 2025

PDF Slides Code
FAST’25

99. IMPRESS: An Importance-Informed Multi-Tier Prefix KV Storage System for Large Language Model Inference

Weijian Chen, Shuibing He, Haoyang Qu, Ruidong Zhang, Siling Yang, Ping Chen, Yi Zheng, Baoxing Huai, and Gang Chen

In Proceedings of the 23rd USENIX Conference on File and Storage Technologies, 2025

PDF Slides
HPCA’25

98. GOPIM: GCN-Oriented Pipeline Optimization for PIM Accelerators

Siling Yang, Shuibing He, Wenjiong Wang, Yanlong Yin, Tong Wu, Weijian Chen, Xuechen Zhang, Xian-He Sun, and Dan Feng

In Proceedings of the 31st IEEE International Symposium on High-Performance Computer Architecture, 2025

PDF Slides

2024

NAS’24

97. IOWA: An I/O-Aware Adaptive Sampling Framework for Deep Learning

Shuang Hu, Weijian Chen, Yanlong Yin, and Shuibing He

In Proceedings of the 17th International Conference on Networking, Architecture, and Storage, 2024

PDF Slides
arXiv’24

96. HopGNN: Boosting Distributed GNN Training Efficiency via Feature-Centric Model Migration

Weijian Chen, Shuibing He, Haoyang Qu, and Xuechen Zhang

2024

PDF
arXiv’24

95. An Asynchronous Multi-core Accelerator for SNN Inference

Zhuo Chen, De Ma, Xiaofei Jin, Qinghui Xing, Ouwen Jin, Xin Du, Shuibing He, and Gang Pan

2024

PDF
TC’24

94. AMBEA: Aggressive Maximal Biclique Enumeration in Large Bipartite Graph Computing

Zhe Pan, Xu Li, Shuibing He, Xuechen Zhang, Rui Wang, Yunjun Gao, Gang Chen, and Xian-He Sun

IEEE Transactions on Computers, 2024

PDF Code
CLUSTER’24

93. FTGraph: A Flexible Tree-based Graph Store on Persistent Memory for Large-Scale Dynamic Graphs

Gan Sun, Jiang Zhou, Bo Li, Xiaoyan Gu, Weiping Wang, and Shuibing He

In Proceedings of the IEEE International Conference on Cluster Computing, 2024

PDF
SC’24

92. Enumeration of Billions of Maximal Bicliques in Bipartite Graphs without Using GPUs

Zhe Pan, Shuibing He, Xu Li, Xuechen Zhang, Yanlong Yin, Rui Wang, Lidan Shou, Mingli Song, Xian-He Sun, and Gang Chen

In Proceedings of the ACM/IEEE International Conference for High Performance Computing, Networking, Storage and Analysis, 2024

PDF Slides Code
arXiv’24

91. Optimizing Large Model Training through Overlapped Activation Recomputation

Ping Chen, Wenjie Zhang, Shuibing He, Yingjie Gu, Zhuwei Peng, Kexin Huang, Xuan Zhan, Weijian Chen, Yi Zheng, Zhefeng Wang, Yanlong Yin, and Gang Chen

2024

PDF
ICPP’24

90. AutoHet: An Automated Heterogeneous ReRAM-Based Accelerator for DNN Inference

Tong Wu, Shuibing He, Jianxin Zhu, Weijian Chen, Siling Yang, Ping Chen, Yanlong Yin, Xuechen Zhang, Xian-He Sun, and Gang Chen

In Proceedings of the 53rd International Conference on Parallel Processing, 2024

PDF Slides
arXiv’24

89. A Comprehensive Survey of Dynamic Graph Neural Networks: Models, Frameworks, Benchmarks, Experiments and Challenges

Zhengzhao Feng, Rui Wang, Tianxing Wang, Mingli Song, Sai Wu, and Shuibing He

2024

PDF
ATC’24

88. Efficient Large Graph Processing with Chunk-Based Graph Representation Model

Rui Wang, Weixu Zong, Shuibing He, Xinyu Chen, Zhenxin Li, and Zheng Dang

In Proceedings of the USENIX Annual Technical Conference, 2024

PDF Slides Code Long Talk
MSST’24

87. Dissecting I/O Burstiness in Machine Learning Cloud Platform: A Case Study on Alibaba’s MLaaS

Qiang Zou, Yuhui Deng, Yifeng Zhu, Yi Zhou, Jianghe Cai, and Shuibing He

In Proceedings of the 38th International Conference on Massive Storage Systems and Technology, 2024

PDF Slides Short Talk
TOCS’24

86. PMAlloc: A Holistic Approach to Improving Persistent Memory Allocation

Zheng Dang, Shuibing He, Xuechen Zhang, Peiyi Hong, Zhenxin Li, Xinyu Chen, Haozhe Song, Xian-He Sun, and Gang Chen

ACM Transactions on Computer Systems, 2024

PDF
EuroSys’24

85. CCL-BTree: A Crash-Consistent Locality-Aware B+-Tree for Reducing XPBuffer-Induced Write Amplification in Persistent Memory

Zhenxin Li, Shuibing He, Zheng Dang, Peiyi Hong, Xuechen Zhang, Rui Wang, and Fei Wu

In Proceedings of the European Conference on Computer Systems, 2024

PDF Slides Code

2023

TPDS

84. APQ: Automated DNN Pruning and Quantization for ReRAM-based Accelerators

Siling Yang, Shuibing He, Hexiao Duan, Weijian Chen, Xuechen Zhang, Tong Wu, and Yanlong Yin

IEEE Transactions on Parallel and Distributed Systems, 2023

PDF
SC’23

83. Efficient Maximal Biclique Enumeration on GPUs

Zhe Pan, Shuibing He, Xu Li, Xuechen Zhang, Rui Wang, and Gang Chen

In the ACM/IEEE International Conference for High Performance Computing, Networking, Storage and Analysis, 2023

PDF Slides Code
ASPLOS’23

82. Mapping Very Large Scale Spiking Neuron Network to Neuromorphic Hardware

Ouwen Jin, Qinghui Xing, Ying Li, Shuiguang Deng, Shuibing He, and Gang Pan

In 28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Volume 3, 2023

PDF Slides Long Talk
HPCA’23

81. iCache: An Importance-Sampling-Informed Cache for Accelerating I/O-Bound DNN Model Training

Weijian Chen, Shuibing He, Yaowen Xu, Xuechen Zhang, Siling Yang, Shuang Hu, Xian-He Sun, and Gang Chen

In 2023 IEEE International Symposium on High-Performance Computer Architecture, 2023

PDF Slides Code
HPCA’23

80. BM-Store: A Transparent and High-Performance Local Storage Architecture for Bare-Metal Clouds Enabling Large-Scale Deployment

Yiquan Chen, Jiexiong Xu, Chengkun Wei, Yijing Wang, Xin Yuan, Yangming Zhang, Xulin Yu, Yi Chen, Zeke Wang, Shuibing He, and Wenzhi Chen

In 2023 IEEE International Symposium on High-Performance Computer Architecture, 2023

PDF Slides
TC

79. HOME: A Holistic GPU Memory Management Framework for Deep Learning

Shuibing He, Ping Chen, Shuaiben Chen, Zheng Li, Siling Yang, Weijian Chen, and Lidan Shou

IEEE Transactions on Computers, 2023

PDF
CPE

68. Accelerating Real-Time Object Detection in High-Resolution Video Surveillance

Yuefeng Wang, Kuang Mao, Tong Chen, Yanlong Yin, Shuibing He, and Gang Chen

Concurrency and Computation: Practice and Experience, 2023

PDF
CLUS

78. Workload Time Series Prediction in Storage Systems: A Deep Learning Based Approach

Li Ruan, Yu Bai, Shaoning Li, Shuibing He, and Limin Xiao

Cluster Computing, 2023

PDF

2022

MICRO’22

77. XPGraph: XPline-Friendly Persistent Memory Graph Stores for Large-Scale Evolving Graphs

Rui Wang, Shuibing He, Weixu Zong, Yongkun Li, and Yinlong Xu

In 2022 55th IEEE/ACM International Symposium on Microarchitecture, 2022

PDF Slides Code Short Talk
ASPLOS’22

76. NVAlloc: Rethinking Heap Metadata Management in Persistent Memory Allocators

Zheng Dang, Shuibing He, Peiyi Hong, Zhenxin Li, Xuechen Zhang, Xian-He Sun, and Gang Chen

In 27th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2022

PDF Slides Code Short Talk Long Talk
TPDS

75. Accelerating Tensor Swapping in GPUs With Self-Tuning Compression

Ping Chen, Shuibing He, Xuechen Zhang, Shuaiben Chen, Peiyi Hong, Yanlong Yin, and Xian-He Sun

IEEE Transactions on Parallel and Distributed Systems, 2022

PDF
NAS’22

74. WAFLASH: Taming Unaligned Writes in Solid-State Disks

Shuibing He, Matthew Myers, Xuehao Duan, Keegan Sanchez, and Xuechen Zhang

In 2022 IEEE International Conference on Networking, Architecture and Storage, 2022

PDF
TPDS

73. PHAST: Hierarchical Concurrent Log-Free Skip List for Persistent Memory

Zhenxin Li, Bing Jiao, Shuibing He, and Weikuan Yu

IEEE Transactions on Parallel and Distributed Systems, 2022

PDF Code
TOS

72. Toward Fast and Scalable Random Walks over Disk-Resident Graphs via Efficient I/O Management

Rui Wang, Yongkun Li, Yinlong Xu, Hong Xie, John C. S. Lui, and Shuibing He

ACM Transactions on Storage, 2022

PDF

2021

CLUSTER’21

71. CSWAP: A Self-Tuning Compression Framework for Accelerating Tensor Swapping in GPUs

Ping Chen, Shuibing He, Xuechen Zhang, Shuaiben Chen, Peiyi Hong, Yanlong Yin, Xian-He Sun, and Gang Chen

In 2021 IEEE International Conference on Cluster Computing, 2021

PDF Slides Code
ICPP’21

70. A Novel Multi-CPU/GPU Collaborative Computing Framework for SGD-Based Matrix Factorization

Yizhi Huang, Yanlong Yin, Yan Liu, Shuibing He, Yang Bai, and Renfa Li

In 50th International Conference on Parallel Processing, 2021

PDF Slides
ICS’21

69. AUTO-PRUNE: Automated DNN Pruning and Mapping for ReRAM-Based Accelerator

Siling Yang, Weijian Chen, Xuechen Zhang, Shuibing He, Yanlong Yin, and Xian-He Sun

In 2021 International Conference on Supercomputing, 2021

PDF Slides

2020

TPDS

67. Sova: A Software-Defined Optimization Framework for Virtual Network Allocations

Zhiyong Ye, Yang Wang, Shuibing He, Chengzhong Xu, and Xian-He Sun

IEEE Transactions on Parallel and Distributed Systems, 2020

PDF
ICS’20

66. Compiler Aided Checkpointing using Crash-Consistent Data Structures in NVMM Systems

Tyler Coy, Shuibing He, Bin Ren, and Xuechen Zhang

In 2020 International Conference on Supercomputing, 2020

PDF
JCSC

65. Application and Storage-Aware Data Placement and Job Scheduling for Hadoop Clusters

Tao Li, Shuibing He, Ping Chen, Siling Yang, Yanlong Yin, and Cheng Xu

Journal of Circuits, Systems and Computers, 2020

PDF
VTC’20

64. Budget Feasible Roadside Unit Allocation Mechanism in Vehicular Ad-Hoc Networks

Xiaohua Xu, Shuibing He, Meng Han, Reza M. Parizi, and Gautam Srivastava

In 2020 IEEE 91st Vehicular Technology Conference, 2020

PDF

2019

TC

63. PRS: A Pattern-Directed Replication Scheme for Heterogeneous Object-Based Storage

Jiang Zhou, Yong Chen, Wei Xie, Dong Dai, Shuibing He, and Weiping Wang

IEEE Transactions on Computers, 2019

PDF
TPDS

62. A Holistic Heterogeneity-Aware Data Placement Scheme for Hybrid Parallel I/O Systems

Shuibing He, Zheng Li, Jiang Zhou, Yanlong Yin, Xiaohua Xu, Yong Chen, and Xian-He Sun

IEEE Transactions on Parallel and Distributed Systems, 2019

PDF
TC

61. Optimizing Parallel I/O Accesses Through Pattern-Directed and Layout-Aware Replication

Shuibing He, Yanlong Yin, Xian-He Sun, Xuechen Zhang, and Zongpeng Li

IEEE Transactions on Computers, 2019

PDF
TPDS

60. A Highly Reliable Metadata Service for Large-Scale Distributed File Systems

Jiang Zhou, Yong Chen, Weiping Wang, Shuibing He, and Dan Meng

IEEE Transactions on Parallel and Distributed Systems, 2019

PDF
CLUSTER’19

59. DP_Greedy: A Two-Phase Caching Algorithm for Mobile Cloud Services

Dong Huang, Xiaopeng Fan, Yang Wang, Shuibing He, and Chengzhong Xu

In 2019 IEEE International Conference on Cluster Computing, 2019

PDF
NAS’19

58. Towards Cluster-Wide Deduplication Based on Ceph

Jinpeng Wang, Yang Wang, Hekang Wang, Kejiang Ye, Chengzhong Xu, Shuibing He, and Lingfang Zeng

In 2019 IEEE International Conference on Networking, Architecture and Storage, 2019

PDF
ICMIC’19

57. Delay Efficient D2D Communications Over 5G Edge-Computing Mobile Networks

Xiaohua Xu, Yuanfang Chen, Yanxiao Zhao, Shuibing He, and Houbing Song

In 11th International Conference on Modelling, Identification and Control, 2019
ICPP’19

56. Integration of Appends and Merges in Log-Structured Merge Trees

Caixin Gong, Shuibing He, Yili Gong, and Yingchun Lei

In 48th International Conference on Parallel Processing, 2019

PDF
WASA’19

55. OWL: Fast Opportunistic Wireless Link Scheduling with SINR Constraints

Shuibing He, Xiaohua Xu, and Patrick Otoo Bobbie

In the 14th International Conference on Wireless Algorithms, Systems, and Applications, 2019

PDF
JSA

54. Run-Time Timing Prediction for System Reconfiguration on Many-core Embedded Systems

Zheng Li and Shuibing He

Journal of Systems Architecture, 2019

PDF

2018

TPDS

53. On Cost-Driven Collaborative Data Caching: A New Model Approach

Yang Wang, Shuibing He, Xiaopeng Fan, Chengzhong Xu, and Xian-He Sun

IEEE Transactions on Parallel and Distributed Systems, 2018

PDF
TC

52. A Cost-Effective Distribution-Aware Data Replication Scheme for Parallel I/O Systems

Shuibing He and Xian-He Sun

IEEE Transactions on Computers, 2018

PDF
CMS

51. Cluster-Based Niching Differential Evolution Algorithm for Optimizing the Stable Structures of Metallic Clusters

Yuan-Hua Yang, Xian-Bin Xu, Shuibing He, Jin-Bo Wang, and Yu-Hua Wen

Computational Materials Science, 2018

PDF
SOCO

50. Improving File Locality in Multi-Keyword Top-k Search Based on Clustering

Lanxiang Chen, Nan Zhang, Kuan-Ching Li, Shuibing He, and Linbing Qiu

Soft Computing, 2018

PDF
DAAC’18

49. Workload Time Series Prediction in Storage Systems: A Deep Learning Based Approach

Li Ruan, Yu Bai, Shaoning Li, Shuibing He, and Limin Xiao

In the 2nd International Industry/University Workshop on Data-center Automation, Analytics, and Control, 2018

PDF
NPC’18

48. KT-Store: A Key-Order and Write-Order Hybrid Key-Value Store with High Write and Range-Query Performance

Haobo Wang, Yinliang Yue, Shuibing He, and Weiping Wang

In 15th IFIP International Conference on Network and Parallel Computing, 2018

PDF
IPDPS’18

47. A Migratory Heterogeneity-Aware Data Layout Scheme for Parallel File Systems

Shuibing He, Xian-He Sun, Yang Wang, and Chengzhong Xu

In 2018 IEEE International Parallel and Distributed Processing Symposium, 2018

PDF
HPSC’18

46. Timing Prediction for Dynamic Application Migration on Multi-Core Embedded Systems

Zheng Li, Hao Wu, and Shuibing He

In IEEE International Conference on High Performance and Smart Computing, 2018

PDF

2017

TECS

45. Fixed-Priority Scheduling for Two-Phase Mixed-Criticality Systems

Zheng Li and Shuibing He

ACM Transactions on Embedded Computing Systems, 2017

PDF
Computer Science

44. Performance Model of Sparse Matrix-Vector Multiplication on GPU

Yang Wang, Bharadwaj Veeravalli, Chen-Khong Tham, Shuibing He, and Chengzhong Xu

Computer Science, 2017
TAAS

43. On Service Migrations in the Cloud for Mobile Accesses: A Distributed Approach

Yang Wang, Bharadwaj Veeravalli, Chen-Khong Tham, Shuibing He, and Chengzhong Xu

ACM Transactions on Autonomous and Adaptive Systems, 2017

PDF
TPDS

42. Cost-Aware Region-Level Data Placement in Multi-Tiered Parallel I/O Systems

Shuibing He, Yang Wang, Zheng Li, Xian-He Sun, and Chengzhong Xu

IEEE Transactions on Parallel and Distributed Systems, 2017

PDF
TC

41. Heterogeneity-Aware Collective I/O for Parallel I/O Systems with Hybrid HDD/SSD Servers

Shuibing He, Yang Wang, Xian-He Sun, Chuanhe Huang, and Chengzhong Xu

IEEE Transactions on Computers, 2017

PDF
TC

40. HARL: Optimizing Parallel File Systems with Heterogeneity-Aware Region-Level Data Layout

Shuibing He, Yang Wang, Xian-He Sun, and Chengzhong Xu

IEEE Transactions on Computers, 2017

PDF
TPDS

39. Using MinMax-Memory Claims to Improve In-Memory Workflow Computations in the Cloud

Shuibing He, Yang Wang, Xian-He Sun, and Chengzhong Xu

IEEE Transactions on Parallel and Distributed Systems, 2017

PDF
ISPA’17

38. MCS-B: An Energy-Efficient Storage System for Astronomical Observation Data Based on Logical Block Replacement Strategy

Chao Sun, Shanjiang Tang, Zichao Yuan, Ce Yu, Jian Xiao, Jizhou Sun, and Shuibing He

In 2017 IEEE International Symposium on Parallel and Distributed Processing with Applications and 2017 IEEE International Conference on Ubiquitous Computing and Communications (ISPA/IUCC), 2017

PDF
ICPP’17

37. Data Caching in Next Generation Mobile Cloud Services, Online vs. Off-Line

Yang Wang, Shuibing He, Xiaopeng Fan, Chengzhong Xu, Joseph Culberson, and Joseph Horton

In 2017 46th International Conference on Parallel Processing (ICPP), 2017

PDF
EUC’17

36. Prediction Based Run-Time Reconfiguration on Many-Core Embedded Systems

Zheng Li, Shuibing He, and Li Wang

In 2017 IEEE International Conference on Embedded and Ubiquitous Computing (EUC), 2017

PDF

2016

TPDS

35. Improving Performance of Parallel I/O Systems through Selective and Layout-Aware SSD Cache

Shuibing He, Yang Wang, and Xian-He Sun

IEEE Transactions on Parallel and Distributed Systems, 2016

PDF
IJHPCA

34. Enhancing Hybrid Parallel File System through Performance and Space-Aware Data Layout

Shuibing He, Yan Liu, Yang Wang, Xian-He Sun, and Chuanhe Huang

The International Journal of High Performance Computing Applications, 2016

PDF
IJES

33. MGPA: A Multi-Granularity Space Pre-Allocation Algorithm for Object-Based Storage Devices

Shuibing He, Yuanhua Yang, Xianbin Xu, and Xiaohua Xu

International Journal of Embedded Systems, 2016

PDF
WHUJNS

32. Capability-Aware Data Placement for Heterogeneous Active Storage Systems

Xiangyu Li, Shuibing He, Xianbin Xu, and Yang Wang

Wuhan University Journal of Natural Sciences, 2016

PDF
IJDTA

31. JAS: JVM-Based Active Storage Framework for Object-based Storage Systems

Xiangyu Li, Shuibing He, Xianbin Xu, and Yang Wang

International Journal of Database Theory and Application, 2016

PDF
IJGDC

30. Skewed Data Distribution for Active Storage Systems on Hybrid Servers

Xiangyu Li, Shuibing He, and Xianbin Xu

International Journal of Grid and Distributed Computing, 2016

PDF
ICPADS’16

29. On Autonomous Service Migrations in the Cloud for Mobile Accesses

Yang Wang, Shuibing He, Fuji Ren, Lujia Wang, and Chengzhong Xu

In 22nd International Conference on Parallel and Distributed Systems, 2016

PDF
ICDCS’16

28. On MinMax-Memory Claims for Scientific Workflows in the In-memory Cloud Computing

Yang Wang, Chengzhong Xu, Shuibing He, and Xian-He Sun

In 36th International Conference on Distributed Computing Systems, 2016

PDF
TPDS

27. Boosting Parallel File System Performance with Heterogeneity-Aware Selective Data Layout

Shuibing He, Yang Wang, and Xian-He Sun

IEEE Transactions on Parallel and Distributed Systems, 2016

PDF

2015

ICPP’15

26. A Heterogeneity-Aware Region-Level Data Layout for Hybrid Parallel File Systems

Shuibing He, Xian-He Sun, Yang Wang, Antonis Kougkas, and Adnan Haider

In 2015 44th International Conference on Parallel Processing, 2015

PDF
IPDPS’15

25. HAS: Heterogeneity-Aware Selective Data Layout Scheme for Parallel File Systems on Hybrid Servers

Shuibing He, Xian-He Sun, and Adnan Haider

In 2015 IEEE International Parallel and Distributed Processing Symposium, 2015

PDF
HiPC’15

24. IC-Data: Improving Compressed Data Processing in Hadoop

Adnan Haider, Xi Yang, Ning Liu, Xian-He Sun, and Shuibing He

In 2015 IEEE 22nd International Conference on High Performance Computing, 2015

PDF

2014

ICDCS’14

23. S4D-Cache: Smart Selective SSD Cache for Parallel I/O Systems

Shuibing He, Xian-He Sun, and Bo Feng

In 2014 IEEE 34th International Conference on Distributed Computing Systems, 2014

PDF
DISC’14

22. PSA: A Performance and Space-Aware Data Layout Scheme for Hybrid Parallel File Systems

Shuibing He, Yan Liu, and Xian-He Sun

In 2014 International Workshop on Data Intensive Scalable Computing Systems, 2014

PDF
PDSW’14

21. HPIS3: Towards a High-Performance Simulator for Hybrid Parallel I/O and Storage Systems

Bo Feng, Ning Liu, Shuibing He, and Xian-He Sun

In 2014 9th Parallel Data Storage Workshop, 2014

PDF
ICA3PP’14

20. Performance-Aware Data Placement in Hybrid Parallel File Systems

Shuibing He, Xian-He Sun, Bo Feng, and Kun Feng

In Algorithms and Architectures for Parallel Processing, 2014

PDF
AMR

19. A Statistics-Based Data Placement Strategy for Hybrid Storage

Yuanhua Yang, Xianbin Xu, Shuibing He, and Yuhua Wen

Advanced Materials Research, 2014

PDF

2013

JCRD

18. Parallel Acceleration and Performance Optimization for GRAPES Model Based on GPU

Zhuowei Wang, Xianbin Xu, Wuqing Zhao, Shuibing He, and Yuping Zhang

Journal of Computer Research and Development, 2013
JC

17. GPU-Based Parallel Researches on RRTM Module of GRAPES Numerical Prediction System

Fang Zheng, Xianbin Xu, Dongdong Xiang, Zhuowei Wang, Ming Xu, and Shuibing He

Journal of Computers, 2013

PDF
AMR

16. WLVT: A Static Wear-Leveling Algorithm with Variable Threshold

Yuanhua Yang, Xianbin Xu, Shuibing He, Fang Zheng, and Yuping Zhang

Advanced Materials Research, 2013
HPDIC’13

15. BPS: A Performance Metric of I/O System

Shuibing He, Xian-He Sun, and Yanlong Yin

In 2013 IEEE International Symposium on Parallel & Distributed Processing, Workshops and PhD Forum, 2013

PDF
CLUSTER’13

14. A Cost-Aware Region-Level Data Placement Scheme for Hybrid Parallel I/O Systems

Shuibing He, Xian-He Sun, Bo Feng, Xin Huang, and Kun Feng

In 2013 IEEE International Conference on Cluster Computing, 2013

PDF
IEA

13. Parallel Optimization for Sparse Matrix–Vector on GPU

Mengjia Yin, Xianbin Xu, Hua Chen, Shuibing He, and Jing Hu

In the International Conference on Information Engineering and Applications: Volume 2, 2013

PDF

2012

IJCIS

12. Oasa: An Active Storage Architecture for Object-based Storage System

Shuibing He, Xianbin Xu, and Yuanhua Yang

International Journal of Computational Intelligence Systems, 2012

PDF
JTAIT

11. Optimizing Sparse Matrix-Vector Multiplication Based on GPU

Mengjia Yin, Tao Zhang, Xianbin Xu, Jin Hu, and Shuibing He

Journal of Theoretical and Applied Information Technology, 2012

2011

ICCIS’11

10. Accelerating Biological Sequence Alignment Algorithm on GPU with CUDA

Fang Zheng, Xianbin Xu, Yuanhua Yang, Shuibing He, and Yuping Zhang

In 2011 International Conference on Computational and Information Sciences, 2011

PDF
BMEI’11

9. Communication-Aware Task Scheduling for Multi-Core Architectures with Segmented Buses

Yuping Zhang, Xianbin Xu, Yuanhua Yang, Shuibing He, and Zimian Hao

In 2011 4th International Conference on Biomedical Engineering and Informatics, 2011

PDF

2010

SKG’10

8. Improve the Performance of Data Grids by Value-Based Replication Strategy

Wuqing Zhao, Xianbin Xu, Zhuowei Wang, Yuping Zhang, and Shuibing He

In 2010 Sixth International Conference on Semantics, Knowledge and Grids, 2010

PDF
ITA’10

7. A Dynamic Optimal Replication Strategy in Data Grid Environment

Wuqing Zhao, Xianbin Xu, Zhuowei Wang, Yuping Zhang, and Shuibing He

In 2010 International Conference on Internet Technology and Applications, 2010

PDF
ICETC’10

6. Optimizing Sparse Matrix-Vector Multiplication on CUDA

Zhuowei Wang, Xianbin Xu, Wuqing Zhao, Yuping Zhang, and Shuibing He

In 2010 2nd International Conference on Education Technology and Computer, 2010

PDF

2009

IJDSN

5. A Rule-Based Prefetching Approach for Object-Based Storage Device

Shuibing He, Dan Feng, Chunhua Li, and Yanli Yuan

International Journal of Distributed Sensor Networks, 2009

PDF
ACSNS’09

4. A Rule-Based Prefetching Approach for Object-Based Storage Device

Shuibing He and Dan Feng

In the International Symposium on Advances in Computer and Sensor Networks and Systems, 2009

PDF

2008

SIGOPS OSR

3. Design of An Object-Based Storage Device Based on I/O Processor

Shuibing He and Dan Feng

ACM SIGOPS Operating Systems Review/The 1st International Workshop on Storage and I/O Virtualization, Performance, Energy, Evaluation and Dependability (SPEED), 2008

PDF

2007

NAS’07

2. An Object-based Storage Controller Based on Switch Fabric

Shuibing He and Dan Feng

In 2007 International Conference on Networking, Architecture, and Storage, 2007

PDF
SNAPI’07

1. Implementation and Performance Evaluation of An Object-Based Storage Device

Shuibing He and Dan Feng

In Fourth International Workshop on Storage Network Architecture and Parallel I/Os, 2007

PDF