Parallel trace analysis: project deliverable D4.3
FakultätenFakultät für Ingenieurwissenschaften, Informatik und Psychologie
InstitutionenInstitut für Organisation und Management von Informationssystemen
Externe KooperationenQueen’s University of Belfast
Dublin City University
LizenzCC BY-ND 4.0 International
CactoScale provides monitoring and data analysis functionality to CACTOS. This deliverable presents the framework and algorithms used by CactoScale for parallel trace analysis. We describe different CactoScale framework extensions which enable the implementation of parallel correlation analysis of system utilisation metric traces and cloud data logs. We also present the implementation of Lambda Architecture into CactoScale which parallelises several aspects of monitoring and exchanging information in CACTOS. CactoScale trace analysis tackles parallelism on various dimensions. We describe a hierarchical log analysis and anomaly detection framework. The anomaly detection utilises parallel data analysis frameworks such as Spark and mapreduce framework for parallel analysis of workload traces and system logs, coupled with HDFS for in-memory processing of the data. The trace analysis also involves the pre-processing of raw data logs for storage in HDFS. It allows executing anomaly detection algorithms hierarchically, both utilising the compute nodes in situ and the parallel HDFS monitoring facility. This is feasible by pairing the CactoScale agents with in situ analytics modules to cover the cases such as workload spike detection, but also to filter the data that flows to the database for post-processing. An in situ analytic module is a process designed to run locally in a node. This tactic provides the advantage of data locality. The data are pre-processed by the local node before being collected by a remote distributed service for further processing. In this way, the hierarchical design of data analysis allows for an additional level of real-time processing which is much closer to the data source. CactoScale has different features and capabilities for parallel trace analysis which are demonstrated in this deliverable by using different algorithms for anomaly detection. Anomaly detection involves the use of trace analysis algorithms that detects outliers (numerical, textual, or correlation based) in data traces. Detecting outliers can trigger actions in resource management and for this reason we focus in anomaly detection as a use case. We demonstrate a Lightweight Anomaly Detection Tool based on correlation analysis. This tool utilises a monitoring cluster to perform parallel trace analysis using Spark and mapreduce. The online data analysis modules that we demonstrate include a log analysis module and several spike detection methods. Workload spikes are one of the main causes of QoS degradation in cloud applications. The log analysis demonstrates how information on cloud platform can contribute in reducing any false positive alerts.
Erstellung / Fertigstellung
CACTOS / Context-Aware Cloud Topology Optimisation and Simulation / EC / FP7 / 610711
EC / FP7
EC / FP7
Normierte SchlagwörterDatenmanagement [GND]
Cloud Computing [GND]
Cloud computing [LCSH]
Electric network topology [LCSH]
SchlagwörterAnalytics; Analysis; Cloud; Framework; Optimisation; Simulation; Cactos Projekt; Data management; Parallel trace; Context-aware cloud topology; Cloud services
DDC-SachgruppeDDC 004 / Data processing & computer science
Das könnte Sie auch interessieren:
BerichtSvorobej, Sergej; Byrne, James; Byrne, Peter J.; Groenda, Henning; Stier, Christian; Domaschka, Jörg; Wesner, Stefan; Krzywda, Jakub; Östberg, Per-Olov (Universität Ulm, 2017-04-21)This deliverable provides supporting documentation for the official deliverable D6.1, the initial release of the CactoSim simulation framework. It presents the reader with the scope of the deliverable, initial requirements ...
BerichtGroenda, Henning; Stier, Christian; Krzywda, Jakub; Byrne, James; Svorobej, Sergej; Castañé, Gabriel González; Papazachos, Zafeirios; Sheridan, Craig; Whigham, Darren; Hauser, Christopher; Tsitsipas, Athanasios; Domaschka, Jörg; Ali-Eldin, Ahmed; Östberg, Per-Olov (Universität Ulm, 2017-04-21)This document is accompanying material for the prototype deliverable D5.2.2. It describes the changes for the second version of the CACTOS Toolkit and provides details on the integration between the tools and toolkits. A ...
Preliminary results from optimisation models validation and experimentation: project deliverable D6.2 BerichtSvorobej, Sergej; Byrne, James; Castañé, Gabriel González; Krzywda, Jakub; Groenda, Henning; Stier, Christian; Domaschka, Jörg; Ahir, Mayur; Byrne, Peter J.; Östberg, Per-Olov (Universität Ulm, 2017-04-21)Since the arrival of cloud computing, a significant amount of research has been and continues to be carried out towards the creation of efficient optimisation strategies for meeting certain optimisation goals such as energy ...