Parallel trace analysis: project deliverable D4.3
Castañé, Gabriel González
Nikolopoulos, Dimitrios S.
FakultätFakultät für Ingenieurwissenschaften, Informatik und Psychologie
InstitutionInstitut für Organisation und Management von Informationssystemen
Externe KooperationenQueen’s University of Belfast
Dublin City University
Ressourcen- / MedientypBericht, Text
Datum der Erstveröffentlichung2017-04-20
CactoScale provides monitoring and data analysis functionality to CACTOS. This deliverable presents the framework and algorithms used by CactoScale for parallel trace analysis. We describe different CactoScale framework extensions which enable the implementation of parallel correlation analysis of system utilisation metric traces and cloud data logs. We also present the implementation of Lambda Architecture into CactoScale which parallelises several aspects of monitoring and exchanging information in CACTOS. CactoScale trace analysis tackles parallelism on various dimensions. We describe a hierarchical log analysis and anomaly detection framework. The anomaly detection utilises parallel data analysis frameworks such as Spark and mapreduce framework for parallel analysis of workload traces and system logs, coupled with HDFS for in-memory processing of the data. The trace analysis also involves the pre-processing of raw data logs for storage in HDFS. It allows executing anomaly detection algorithms hierarchically, both utilising the compute nodes in situ and the parallel HDFS monitoring facility. This is feasible by pairing the CactoScale agents with in situ analytics modules to cover the cases such as workload spike detection, but also to filter the data that flows to the database for post-processing. An in situ analytic module is a process designed to run locally in a node. This tactic provides the advantage of data locality. The data are pre-processed by the local node before being collected by a remote distributed service for further processing. In this way, the hierarchical design of data analysis allows for an additional level of real-time processing which is much closer to the data source. CactoScale has different features and capabilities for parallel trace analysis which are demonstrated in this deliverable by using different algorithms for anomaly detection. Anomaly detection involves the use of trace analysis algorithms that detects outliers (numerical, textual, or correlation based) in data traces. Detecting outliers can trigger actions in resource management and for this reason we focus in anomaly detection as a use case. We demonstrate a Lightweight Anomaly Detection Tool based on correlation analysis. This tool utilises a monitoring cluster to perform parallel trace analysis using Spark and mapreduce. The online data analysis modules that we demonstrate include a log analysis module and several spike detection methods. Workload spikes are one of the main causes of QoS degradation in cloud applications. The log analysis demonstrates how information on cloud platform can contribute in reducing any false positive alerts.
LizenzCC BY-ND 4.0 International
Electric network topology
Context-aware cloud topology
DDC-SachgruppeDDC 004 / Data processing & computer science
Das könnte Sie auch interessieren:
Groenda, Henning; Stier, Christian; Krzywda, Jakub; Byrne, James; Svorobej, Sergej; Papazachos, Zafeirios; Sheridan, Craig; Whigham, Darren; Östberg, Per-Olov (Universität Ulm, 2017-04-21)The CACTOS project aims to improve the operational efficiency of cloud data centres by supporting data centre operators in the planning and operation of heterogeneous data centres. One major goal of CACTOS is to enable ...
Stier, Christian; Groenda, Henning; Whigham, Darren; Bharbuiya, Sakil; Papazachos, Zafeirios; Hauser, Christopher; Krzywda, Jakub; Östberg, Per-Olov (Universität Ulm, 2017-04-21)Infrastructure as a Service (IaaS) cloud data centres enable customers to run arbitrary software systems on virtualised infrastructure. In contrast to Software or Platform as a Service approaches, customers do not need to ...
Sheridan, Craig; Whigham, Darren; Stewart, Claire; Domaschka, Jörg; Tsitsipas, Athanasios; Hauser, Christopher; Papazachos, Zafeirios; Ali-Eldin, Ahmed; Krzywda, Jakub; Byrne, James (Universität Ulm, 2017-04-20)This document describes the validation of the CACTOS software components. This validation has been completed in two iterations. The first iteration aimed to deliver a current validation of the CACTOS Runtime tools as they ...