Penalty- and Locality-aware Memory Allocation in Redis Using Enhanced AET

Cheng Pan; Xiaolin Wang; Yingwei Luo; Zhenlin Wang

doi:10.1145/3447573

ACM Transactions on Storage ◽

10.1145/3447573 ◽

2021 ◽

Vol 17 (2) ◽

pp. 1-45

Author(s):

Cheng Pan ◽

Xiaolin Wang ◽

Yingwei Luo ◽

Zhenlin Wang

Keyword(s):

Large Data ◽

Data Locality ◽

Memory Allocation ◽

Cache Management ◽

Cache Replacement ◽

Time Model ◽

Memory Cache ◽

Management Scheme ◽

Data Volume ◽

Replacement Algorithms

Due to the large data volumes and low-latency requirements of modern web services, an in-memory key-value (KV) cache (e.g., Redis or Memcached) is often an inevitable choice. The in-memory cache holds hot data, reduces request latency, and alleviates the load on background databases. Inheriting from traditional hardware cache design, many existing KV cache systems still use recency-based cache replacement algorithms, e.g., least recently used (LRU) or its approximations. However, the diversity of miss penalties distinguishes a KV cache from a hardware cache. Inadequate consideration of penalty can substantially compromise space utilization and request service time. KV accesses also demonstrate locality, which needs to be coordinated with miss penalty to guide cache management. In this article, we first discuss how to enhance an existing cache model, the Average Eviction Time (AET) model, so that it can model a KV cache. We then apply the model to Redis and propose pRedis, Penalty- and Locality-aware Memory Allocation in Redis, which quantitatively synthesizes data locality and miss penalty to guide memory allocation and replacement in Redis. We also explore the diurnal behavior of a KV store and exploit long-term reuse, replacing the original passive eviction mechanism with an automatic dump/load mechanism to smooth the transition between access peaks and valleys. Our evaluation shows that pRedis effectively reduces the average and tail access latency with minimal time and space overhead. For both real-world and synthetic workloads, our approach delivers an average latency reduction of 14.0% to 52.3% over a state-of-the-art penalty-aware cache management scheme, Hyperbolic Caching (HC), and shows more quantitative predictability of performance. Moreover, we can obtain even lower average latency (1.1% to 5.5%) when dynamically switching policies between pRedis and HC.
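To make the abstract's point about miss penalty concrete, here is a minimal Python sketch contrasting a recency-based victim choice with a penalty-weighted one. It is not the pRedis algorithm (which relies on the enhanced AET model) and is not taken from the article; the priority `penalty * hits / age` is only an illustrative assumption, loosely in the spirit of cost-weighted frequency-over-age schemes, and the key names, field names, and penalty values are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class Entry:
    penalty: float    # assumed cost of refetching after a miss (e.g. ms)
    hits: int         # accesses since the entry was inserted
    inserted_at: int  # logical time of insertion
    last_used: int    # logical time of the most recent access


def lru_victim(cache):
    """Recency only: evict the key that was used longest ago."""
    return min(cache, key=lambda k: cache[k].last_used)


def penalty_aware_victim(cache, now):
    """Evict the key with the lowest penalty-weighted hit rate."""
    def priority(key):
        entry = cache[key]
        age = max(now - entry.inserted_at, 1)  # avoid division by zero
        return entry.penalty * entry.hits / age
    return min(cache, key=priority)


# Two hypothetical keys with equal hit counts but very different miss costs.
cache = {
    "session:42": Entry(penalty=200.0, hits=3, inserted_at=0, last_used=5),
    "thumb:7": Entry(penalty=2.0, hits=3, inserted_at=0, last_used=9),
}
now = 10

print(lru_victim(cache))                 # session:42
print(penalty_aware_victim(cache, now))  # thumb:7
```

Under these assumed numbers, the recency-based choice evicts the entry that is expensive to refetch, while the penalty-weighted choice keeps it and evicts the cheap one instead.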

A Survey on Adaptive Wildcard Rule Cache Management with Cache Replacement Algorithms for Software-Defined Networks

IJARCCE ◽

10.17148/ijarcce.2018.7103 ◽

2018 ◽

Vol 7 (10) ◽

pp. 10-13 ◽

Cited By ~ 1

Author(s):

Kusekar Shrutika Ajaykumar ◽

Prof. H.A. Hingoliwala

Keyword(s):

Cache Management ◽

Cache Replacement ◽

Software Defined Networks ◽

Replacement Algorithms

Efficient Lossless Compression of Multitemporal Hyperspectral Image Data

Journal of Imaging ◽

10.3390/jimaging4120142 ◽

2018 ◽

Vol 4 (12) ◽

pp. 142 ◽

Cited By ~ 6

Author(s):

Hongda Shen ◽

Zhuocheng Jiang ◽

W. Pan

Keyword(s):

Data Compression ◽

Hyperspectral Image ◽

Image Data ◽

Lossless Compression ◽

Large Data ◽

Hyperspectral Data ◽

Compression Performance ◽

Temporal Correlations ◽

Sensing Applications ◽

Data Volume

Hyperspectral imaging (HSI) technology has been used for various remote sensing applications due to its excellent capability of monitoring regions-of-interest over a period of time. However, the large data volume of four-dimensional multitemporal hyperspectral imagery demands massive data compression techniques. While conventional 3D hyperspectral data compression methods exploit only spatial and spectral correlations, we propose a simple yet effective predictive lossless compression algorithm that achieves significant gains in compression efficiency by also taking into account the temporal correlations inherent in multitemporal data. We present an information-theoretic analysis to estimate the potential compression performance gain with varying configurations of context vectors. Extensive simulation results demonstrate the effectiveness of the proposed algorithm. We also provide in-depth discussions on how to construct the context vectors in the prediction model for both multitemporal HSI and conventional 3D HSI data.

Transparent In-memory Cache Management in Apache Spark based on Post-Mortem Analysis

2019 IEEE International Conference on Big Data (Big Data) ◽

10.1109/bigdata47090.2019.9006590 ◽

2019 ◽

Author(s):

Atsuya Nasu ◽

Kenji Yoneo ◽

Masao Okita ◽

Fumihiko Ino

Keyword(s):

Apache Spark ◽

Cache Management ◽

Post Mortem ◽

Memory Cache

Load-aware Adaptive Cache Management Scheme for Enterprise-level Stackable Cryptographic File System

2020 IEEE 22nd International Conference on High Performance Computing and Communications; IEEE 18th International Conference on Smart City; IEEE 6th International Conference on Data Science and Systems (HPCC/SmartCity/DSS) ◽

10.1109/hpcc-smartcity-dss50907.2020.00006 ◽

2020 ◽

Author(s):

Chunhua Xiao ◽

Yanyue Pan ◽

Dandan Xu ◽

Weichen Liu ◽

Shuting Sun ◽

...

Keyword(s):

File System ◽

Cache Management ◽

Management Scheme ◽

Enterprise Level ◽

Adaptive Cache

Microseismicity in the Eastern Alps: Preliminary Results From the Swath-D Network

10.5194/egusphere-egu21-9829 ◽

2021 ◽

Author(s):

Rens Hofman ◽

Joern Kummerow ◽

Simone Cesca ◽

Joachim Wassermann ◽

Thomas Plenefisch ◽

...

Keyword(s):

Large Data ◽

Eastern Alps ◽

Detection Methods ◽

Mountain Range ◽

Network Methods ◽

Data Volume ◽

Geophysical Processes ◽

The Alps ◽

Mountain Belts ◽

Surrounding Areas

The AlpArray seismological experiment is an international and interdisciplinary project to advance our understanding of geophysical processes in the greater Alpine region. The heart of the project consists of a large seismological array that covers the mountain range and its surrounding areas. To understand how the Alps and their neighbouring mountain belts evolved through time, we can only study its current structure and processes. The Eastern Alps are of prime interest since they currently demonstrate the highest crustal deformation rates. A key question is how these surface processes are linked to deeper structures. The Swath-D network is an array of temporary seismological stations complementary to the AlpArray network located in the Eastern Alps. This creates a unique opportunity to investigate high resolution seismicity on a local scale.

In this study, a combination of waveform-based detection methods was used to find small earthquakes in the large data volume of the Swath-D network. Methods were developed to locate the seismic events using semi-automatic picks, and estimate event magnitudes. We present an overview of the methods and workflow, as well as a preliminary overview of the seismicity in the Eastern Alps.

Performance-oriented cache management scheme based on a retention state for energy-harvesting nonvolatile processors

Future Generation Computer Systems ◽

10.1016/j.future.2021.11.010 ◽

2021 ◽

Author(s):

Yan Wang ◽

Henian Fang ◽

Linbo Long ◽

Jinhui Liu

Keyword(s):

Energy Harvesting ◽

Cache Management ◽

Management Scheme

Deep learning predicts short non-coding RNA functions from only raw sequence data

PLoS Computational Biology ◽

10.1371/journal.pcbi.1008415 ◽

2020 ◽

Vol 16 (11) ◽

pp. e1008415

Author(s):

Teresa Maria Rosaria Noviello ◽

Francesco Ceccarelli ◽

Michele Ceccarelli ◽

Luigi Cerulo

Keyword(s):

Secondary Structure ◽

Sequence Data ◽

Computational Cost ◽

Large Data ◽

Sequence Information ◽

Structure Information ◽

Non Coding Rna ◽

A Genome ◽

Data Volume ◽

Biological Functionality

Small non-coding RNAs (ncRNAs) are short non-coding sequences involved in gene regulation in many biological processes and diseases. The lack of a complete understanding of their biological functionality, especially in a genome-wide scenario, has demanded new computational approaches to annotate their roles. It is widely accepted that secondary structure is a determinant of RNA function, and machine-learning-based approaches have been shown to predict RNA function from secondary structure information. Here we show that RNA function can be predicted with good accuracy from a lightweight representation of sequence information, without the need to compute secondary structure features, which is computationally expensive. This finding appears to go against the dogma of secondary structure being a key determinant of function in RNA. Compared to recent secondary-structure-based methods, the proposed solution is more robust to sequence boundary noise and drastically reduces the computational cost, allowing for large data volume annotations. Scripts and datasets to reproduce the results of the experiments in this study are available at: https://github.com/bioinformatics-sannio/ncrna-deep.

A Comparative study between cache replacement algorithms used in the scalable asynchronous cache consistency scheme. (c2006)

10.26756/th.2006.39 ◽

2006 ◽

Author(s):

Lana Turk

Keyword(s):

Comparative Study ◽

Cache Replacement ◽

Cache Consistency ◽

Replacement Algorithms

Large Dataset Classification Using Parallel Processing Concept

JOIV International Journal on Informatics Visualization ◽

10.30630/joiv.4.4.361 ◽

2020 ◽

Vol 4 (4) ◽

pp. 191

Author(s):

Mohammad Aljanabi ◽

Hind Ra'ad Ebraheem ◽

Zahraa Faiz Hussain ◽

Mohd Farhan Md Fudzee ◽

Shahreen Kasim ◽

...

Keyword(s):

Big Data Analytics ◽

Large Data ◽

Business Analytics ◽

Healthcare Applications ◽

Current Increase ◽

High Data ◽

Huge Data ◽

Data Volume ◽

Effective Decision Making ◽

Experimental Findings

Much attention has been paid to large data technologies in the past few years, mainly due to their capability to impact business analytics and data mining practices, as well as the possibility of influencing a range of highly effective decision-making tools. With the current increase in the number of modern applications (including social media and other web-based and healthcare applications) that generate large amounts of data in different forms and volumes, processing such huge data volumes is becoming a challenge for conventional data processing tools. This has resulted in the emergence of big data analytics, which also comes with many challenges. This paper introduces the use of principal component analysis (PCA) for data size reduction, followed by SVM parallelization. The proposed scheme was executed on the Spark platform, and the experimental findings revealed its capability to reduce the classifiers’ classification time without much influence on classification accuracy.

The Performance Impact of Kernel Prefetching on Buffer Cache Replacement Algorithms

IEEE Transactions on Computers ◽

10.1109/tc.2007.1029 ◽

2007 ◽

Vol 56 (7) ◽

pp. 889-908 ◽

Cited By ~ 19

Author(s):

Ali R. Butt ◽

Chris Gniady ◽

Y. Charlie Hu

Keyword(s):

Cache Replacement ◽

Performance Impact ◽

Buffer Cache ◽

Replacement Algorithms
