2025
2024
-
arXiv’2496. HopGNN: Boosting Distributed GNN Training Efficiency via Feature-Centric Model Migration2024
-
arXiv’24
-
arXiv’24
-
arXiv’2489. A Comprehensive Survey of Dynamic Graph Neural Networks: Models, Frameworks, Benchmarks, Experiments and Challenges2024
-
MSST’2487. Dissecting I/O Burstiness in Machine Learning Cloud Platform: A Case Study on Alibaba’s MLaaSIn Proceedings of the 38th International Conference on Massive Storage Systems and Technology, 2024
2023
-
TPDS84. APQ: Automated DNN Pruning and Quantization for ReRAM-based AcceleratorsIEEE Transactions on Parallel and Distributed Systems, 2023
-
TC79. HOME: A Holistic GPU Memory Management Framework for Deep LearningIEEE Transactions on Computers, 2023
-
CPE68. Accelerating Real-Time Object Detection in High-Resolution Video SurveillanceConcurrency and Computation: Practice and Experience, 2023
-
CLUS78. Workload Time Series Prediction in Storage Systems: A Deep Learning based ApproachCluster Computing, 2023
2022
-
MICRO’2277. XPGraph: XPline-Friendly Persistent Memory Graph Stores for Large-Scale Evolving GraphsIn 2022 55th IEEE/ACM International Symposium on Microarchitecture, 2022
-
ASPLOS’2276. NVAlloc: Rethinking Heap Metadata Management in Persistent Memory AllocatorsIn 27th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2022
-
TPDS75. Accelerating Tensor Swapping in GPUs With Self-Tuning CompressionIEEE Transactions on Parallel and Distributed Systems, 2022
-
NAS’2274. WAFLASH: Taming Unaligned Writes in Solid-State DisksIn 2022 IEEE International Conference on Networking, Architecture and Storage, 2022
-
TOS72. Toward Fast and Scalable Random Walks over Disk-Resident Graphs via Efficient I/O ManagementACM Transactions on Storage, 2022
2021
2020
-
TPDS67. Sova: A Software-Defined Optimization Framework for Virtual Network AllocationsIEEE Transactions on Parallel and Distributed Systems, 2020
-
ICS’2066. Compiler Aided Checkpointing using Crash-Consistent Data Structures in NVMM SystemsIn 2020 International Conference on Supercomputing, 2020
-
JCSC65. Application and Storage-Aware Data Placement and Job Scheduling for Hadoop ClustersJournal of Circuits, Systems and Computers, 2020
-
VTC’2064. Budget Feasible Roadside Unit Allocation Mechanism in Vehicular Ad-Hoc NetworksIn 2020 IEEE 91st vehicular technology conference, 2020
2019
-
TC63. PRS: A Pattern-Directed Replication Scheme for Heterogeneous Object-Based StorageIEEE Transactions on Computers, 2019
-
TPDS62. A Holistic Heterogeneity-Aware Data Placement Scheme for Hybrid Parallel I/O SystemsIEEE Transactions on Parallel and Distributed Systems, 2019
-
TC61. Optimizing Parallel I/O Accesses Through Pattern-Directed and Layout-Aware ReplicationIEEE Transactions on Computers, 2019
-
TPDS60. A Highly Reliable Metadata Service for Large-Scale Distributed File SystemsIEEE Transactions on Parallel and Distributed Systems, 2019
-
CLUSTER’1959. DP_Greedy: A Two-Phase Caching Algorithm for Mobile Cloud ServicesIn 2019 IEEE International Conference on Cluster Computing, 2019
-
NAS’1958. Towards Cluster-Wide Deduplication Based on CephIn 2019 IEEE International Conference on Networking, Architecture and Storage, 2019
-
ICMIC’1957. Delay Efficient D2D Communications Over 5G Edge-Computing Mobile NetworksIn 11th International Conference on Modelling, Identification and Control, 2019
-
ICPP’1956. Integration of Appends and Merges in Log-Structured Merge TreesIn 48th International Conference on Parallel Processing, 2019
-
WASA’1955. OWL: Fast Opportunistic Wireless Link Scheduling with SINR ConstraintsIn the 14th International Conference on Wireless Algorithms, Systems, and Applications, 2019
-
JSA54. Run-Time Timing Prediction for System Reconfiguration on Many-core Embedded SystemsJournal of Systems Architecture, 2019
2018
-
TPDS53. On Cost-Driven Collaborative Data Caching: A New Model ApproachIEEE Transactions on Parallel and Distributed Systems, 2018
-
TC52. A Cost-Effective Distribution-Aware Data Replication Scheme for Parallel I/O SystemsIEEE Transactions on Computers, 2018
-
CMS51. Cluster-Based Niching Differential Evolution Algorithm for Optimizing the Stable Structures of Metallic ClustersComputational Materials Science, 2018
-
SOCO50. Improving File Locality in Multi-Keyword Top-k Search Based on ClusteringSoft Computing, 2018
-
DAAC’1849. Workload Time Series Prediction in Storage Systems: A Deep Learning Based Approachthe 2nd International Industry/University Workshop on Data-center Automation, Analytics, and Control, 2018
-
NPC’1848. KT-Store: A Key-Order and Write-Order Hybrid Key-Value Store with High Write and Range-Query PerformanceIn 15th IFIP International Conference on Network and Parallel Computing, 2018
-
IPDPS’1847. A Migratory Heterogeneity-Aware Data Layout Scheme for Parallel File SystemsIn 2018 IEEE International Parallel and Distributed Processing Symposium, 2018
-
HPSC’1846. Timing Prediction for Dynamic Application Migration on Multi-Core Embedded SystemsIn IEEE International Conference on High Performance and Smart Computing, 2018
2017
-
TECS45. Fixed-Priority Scheduling for Two-Phase Mixed-Criticality SystemsACM Transactions on Embedded Computing Systems, 2017
-
Computer Science44. Performance Model of Sparse Matrix-Vector Multiplication on GPUComputer Science, 2017
-
TAAS43. On Service Migrations in the Cloud for Mobile Accesses: A Distributed ApproachACM Transactions on Autonomous and Adaptive Systems, 2017
-
TPDS42. Cost-Aware Region-Level Data Placement in Multi-Tiered Parallel I/O SystemsIEEE Transactions on Parallel and Distributed Systems, 2017
-
TC41. Heterogeneity-Aware Collective I/O for Parallel I/O Systems with Hybrid HDD/SSD ServersIEEE Transactions on Computers, 2017
-
TC40. HARL: Optimizing Parallel File Systems with Heterogeneity-Aware Region-Level Data LayoutIEEE Transactions on Computers, 2017
-
TPDS39. Using MinMax-Memory Claims to Improve In-Memory Workflow Computations in the CloudIEEE Transactions on Parallel and Distributed Systems, 2017
-
ISPA’1738. MCS-B: An Energy-Efficient Storage System for Astronomical Observation Data Based on Logical Block Replacement StrategyIn 2017 IEEE International Symposium on Parallel and Distributed Processing with Applications and 2017 IEEE International Conference on Ubiquitous Computing and Communications (ISPA/IUCC), 2017
-
ICPP’1737. Data Caching in Next Generation Mobile Cloud Services, Online vs. Off-LineIn 2017 46th International Conference on Parallel Processing (ICPP), 2017
-
EUC’1736. Prediction Based Run-Time Reconfiguration on Many-Core Embedded SystemsIn 2017 IEEE International Conference on Embedded and Ubiquitous Computing (EUC), 2017
2016
-
TPDS35. Improving Performance of Parallel I/O Systems through Selective and Layout-Aware SSD CacheIEEE Transactions on Parallel and Distributed Systems, 2016
-
IJHPCA34. Enhancing Hybrid Parallel File System through Performance and Space-Aware Data LayoutThe International Journal of High Performance Computing Applications, 2016
-
IJES33. MGPA: A Multi-Granularity Space Pre-Allocation Algorithm for Object-Based Storage DevicesInternational Journal of Embedded Systems, 2016
-
WHUJNS32. Capability-Aware Data Placement for Heterogeneous Active Storage SystemsWuhan University Journal of Natural Sciences, 2016
-
IJDTA31. JAS: JVM-Based Active Storage Framework for Object-based Storage SystemsInternational Journal of Database Theory and Application, 2016
-
IJGDC30. Skewed Data Distribution for Active Storage Systems on Hybrid ServersInternational Journal of Grid and Distributed Computing, 2016
-
ICPADS’1629. On Autonomous Service Migrations in the Cloud for Mobile AccessesIn 22nd International Conference on Parallel and Distributed Systems, 2016
-
ICDCS’1628. On MinMax-Memory Claims for Scientific Workflows in the In-memory Cloud ComputingIn 36th International Conference on Distributed Computing Systems, 2016
-
TPDS27. Boosting Parallel File System Performance with Heterogeneity-Aware Selective Data LayoutIEEE Transactions on Parallel and Distributed Systems, 2016
2015
-
ICPP’1526. A Heterogeneity-Aware Region-Level Data Layout for Hybrid Parallel File SystemsIn 2015 44th International Conference on Parallel Processing, 2015
-
IPDPS’1525. HAS: Heterogeneity-Aware Selective Data Layout Scheme for Parallel File Systems on Hybrid ServersIn 2015 IEEE International Parallel and Distributed Processing Symposium, 2015
-
HiPC’1524. IC-Data: Improving Compressed Data Processing in HadoopIn 2015 IEEE 22nd International Conference on High Performance Computing, 2015
2014
-
ICDCS’1423. S4D-Cache: Smart Selective SSD Cache for Parallel I/O SystemsIn 2014 IEEE 34th International Conference on Distributed Computing Systems, 2014
-
DISC’1422. PSA: A Performance and Space-Aware Data Layout Scheme for Hybrid Parallel File SystemsIn 2014 International Workshop on Data Intensive Scalable Computing Systems, 2014
-
PDSW’1421. HPIS3: Towards a High-Performance Simulator for Hybrid Parallel I/O and Storage SystemsIn 2014 9th Parallel Data Storage Workshop, 2014
-
ICA3PP’1420. Performance-Aware Data Placement in Hybrid Parallel File SystemsIn Algorithms and Architectures for Parallel Processing, 2014
-
AMR
2013
-
JCRD18. Parallel Acceleration and Performance Optimization for GRAPES Model Based on GPUIn Journal of Computer Research and Development, 2013
-
JC17. GPU-Based Parallel Researches on RRTM Module of GRAPES Numerical Prediction System.Journal of Computers, 2013
-
AMR16. WLVT: A Static Wear-Leveling Algorithm with Variable ThresholdIn Advanced Materials Research, 2013
-
HPDIC’1315. BPS: A Performance Metric of I/O SystemIn 2013 IEEE International Symposium on Parallel & Distributed Processing, Workshops and Phd Forum, 2013
-
CLUSTER’1314. A Cost-Aware Region-Level Data Placement Scheme for Hybrid Parallel I/O SystemsIn 2013 IEEE International Conference on Cluster Computing, 2013
-
IEA13. Parallel Optimization for Sparse Matrix–Vector on GPUIn the International Conference on Information Engineering and Applications: Volume 2, 2013
2012
-
IJCIS12. Oasa: An Active Storage Architecture for Object-based Storage SystemInternational journal of computational intelligence systems, 2012
-
JTAIT11. Optimizing Sparse Matrix-Vector Multiplication Based on GPUJournal of Theoretical and Applied Information Technology, 2012
2011
-
ICCIS’1110. Accelerating Biological Sequence Alignment Algorithm on GPU with CUDAIn 2011 International Conference on Computational and Information Sciences, 2011
-
BMEI’119. Communication-Aware Task Scheduling for Multi-Core Architectures with Segmented BusesIn 2011 4th International Conference on Biomedical Engineering and Informatics, 2011
2010
-
SKG’108. Improve the Performance of Data Grids by Value-Based Replication StrategyIn 2010 Sixth International Conference on Semantics, Knowledge and Grids, 2010
-
ITA’107. A Dynamic Optimal Replication Strategy in Data Grid EnvironmentIn 2010 International Conference on Internet Technology and Applications, 2010
-
ICETC’106. Optimizing Sparse Matrix-Vector Multiplication on CUDAIn 2010 2nd International Conference on Education Technology and Computer, 2010
2009
-
IJDSN5. A Rule-Based Prefetching Approach for Object-Based Storage DeviceInternational Journal of Distributed Sensor Networks, 2009
-
ACSNS’094. A Rule-Based Prefetching Approach for Object-Based Storage DeviceIn the International Symposium on Advances in Computer and Sensor Networks and Systems, 2009
2008
-
SIGOPS OSR3. Design of An Object-Based Storage Device Based on I/O ProcessorACM SIGOPS Operating Systems Review/The 1st International Workshop on Storage and I/O Virtualization, Performance, Energy, Evaluation and Dependability (SPEED), 2008
2007
-
NAS’072. An Object-based Storage Controller Based on Switch FabricIn 2007 International Conference on Networking, Architecture, and Storage, 2007
-
SNAPI’071. Implementation and Performance Evaluation of An Object-Based Storage DeviceIn Fourth International Workshop on Storage Network Architecture and Parallel I/Os, 2007