电子学报2025,Vol.53Issue(2):314-328,15.DOI:10.12263/DZXB.20240868
探索面向分布式共享缓存架构的高性能数据库
Exploring the High-Performance Database Based on Distributed Shared Cache Architecture
摘要
Abstract
With the widespread adoption of internet of things(IoT)and smart devices,the volume of data generated at the edge has far exceeded the computational and storage capabilities of edge nodes.This creates an urgent need for cloud-edge collaborative processing to meet the real-time analysis demands of large-scale data.With the decoupling of computa-tion,memory,and storage,the shared-cahe architecture has become a critical solution for addressing the processing require-ments of massive edge data.However,there are still several issuses remained in shared-cache architecture.First,in transac-tional processing scenarios,when hotspot cached data frequently migrates between nodes,the log persistence mechanisms of existing databases will generate a large number of log write operations,thereby impacting system performance.Second-ly,the existing cache write-invalidation mechanism can lead to frequent eviction of some hotspot cached data,causing slow-er transactions to fail in retrieving target data from the shared cache in time.This could trigger a large number of cache re-loads,resulting in system performance degradation.To address these issues,this paper proposes a dependency-table-based delayed log flushing mechanism.By consolidating multiple log write operations and deferring them until the log buffer is full or a transaction is committed,the mechanism reduces the frequency of log flushing and the overhead of disk writes.In addition,this paper also introduces a cache delayed invalidation mechanism that incorporates asynchronous replay of invali-dation messages,page visibility determination,and an optimized cache replacement.This approach effectively extends the service time of cached data,improving cache hit rates and overall system performance.Based on these mechanisms,this pa-per implements a high-performance shared-cache database system called EBASE-T.Experimental results show that,com-pared to its pre-optimized version,EBASE-T achieves a 19.5%increase in throughput and a 13.1%reduction in latency.In TPC-C(online transaction processing system benchmarks)tests,EBASE-T demonstrates significant performance advantag-es over most shared-cache database systems.关键词
数据库/分布式共享缓存/事务处理/日志刷写/缓存失效Key words
database/distributed shared cache/transaction processing/log flushing/cache invalidation分类
信息技术与安全科学引用本文复制引用
尚碧筠,魏星,周士俊,冀文冠,董诗琦,屠要峰,董振江..探索面向分布式共享缓存架构的高性能数据库[J].电子学报,2025,53(2):314-328,15.基金项目
国家重点研发计划(No.2021YFB3101101) (No.2021YFB3101101)
江苏省重点研发计划重点项目(No.BE2023025) National Key Research and Development Program of China(No.2021YFB3101101) (No.BE2023025)
Jiangsu Key Research and Development Plan(No.BE2023025) (No.BE2023025)