Home
Partners
Research interests
Projects
People
Opportunities (tesi di laurea IT)
Publications
Software
Events
Publications by year:
Publications by type: Conference papers - Journals - Technical Reports - Books - Book Chapters - Phd Thesis - Master Thesis (Tesi di Laurea Magistrale) - Bachelor Thesis (Tesi di Laurea)
Publications by author:
Publications by project:
Free search: Search


2013

L. Aniello, R. Baldoni, L. Querzoni
Adaptive Online Scheduling in Storm

To appear in proceedings of the 7th ACM International Conference on Distributed Event-Based Systems (DEBS), 2013

Abstract [+]

"Today we are witnessing a dramatic shift toward a data-driven economy, where the ability to efficiently and timely analyze huge amounts of data marks the difference between industrial success stories and catastrophic failures. In this scenario Storm, an open source distributed realtime computation system, represents a disruptive technology that is quickly gaining the favor of big players like Twitter and Groupon. A Storm application is modeled as a topology, i.e. a graph where nodes are operators and edges represent data flows among such operators. A key aspect in tuning Storm performance lies in the strategy used to deploy a topology, i.e. how Storm schedules the execution of each topology component on the available computing infrastructure. In this paper we propose two advanced generic schedulers for Storm that provide improved performance for a wide range of application topologies. The first scheduler works offline by analyzing the topology structure and adapting the deployment to it; the second scheduler enhance the previous approach by continuously monitoring system performance and rescheduling the deployment at run-time to improve overall performance. Experimental results show that these algorithms can produce schedules that achieve significantly better performances compared to those produced by Storm's default scheduler."

Downloads:pdf - Paper (accepted version)
bib - BibTeX reference



R. Baldoni, S. Bonomi, A. Cerocchi, L. Querzoni
Virtual Tree: a Robust Architecture for Interval Valid Queries in Dynamic Distributed Systems

Journal of Parallel and Distributed Computing (JPDC), Elsevier (accepted for publication), 2013

Abstract [+]

"This paper studies the problem of answering aggregation queries, satisfying the interval validity semantics, in a distributed system prone to continuous arrival and departure of participants. The interval validity semantics states that the query answer must be calculated considering contributions of at least all processes that remained in the distributed system for the whole query duration. Satisfying this semantics in systems experiencing unbounded churn is impossible due to the lack of connectivity and path stability between processes. This paper presents a novel architecture, namely Virtual Tree, for building and maintaining a structured overlay network with guaranteed connectivity and path stability in settings characterized by bounded churn rate. The architecture includes a simple query answering algorithm that provides interval valid answers. The overlay network generated by the Virtual Tree architecture is a tree-shaped topology with virtual nodes constituted by clusters of processes and virtual links constituted by multiple communication links connecting processes located in adjacent virtual nodes. We formally prove a bound on the churn rate for interval valid queries in a distributed system where communication latencies are bounded by a constant unknown by processes. Finally, we carry out an extensive experimental evaluation that shows the degree of robustness of the overlay network generated by the virtual tree architecture under different churn rates."

Downloads:pdf - Paper (accepted version)
bib - BibTeX reference



R. Baldoni, L. Querzoni, L. Aniello
Input data organization for batch processing in time window based computations

In Proceedings of the 28th ACM Symposium On Applied Computing (SAC), 2013

Abstract [+]

"Applications based on event processing are often designed to continuously evaluate set of events defined by sliding time windows. Solutions employing long-running continuous queries executed in-memory show their limits in applications characterized by a staggering growth of available sources that continuously produce new events at high rates (e.g. intrusion detection systems and algorithmic trading). Problems arise due to the complexities in maintaining large amounts of events in memory for continuous elaboration, and due to the difficulties in managing at run-time the network of elaborating nodes. A batch approach to this kind of computation provides a viable solution for scenarios characterized by non frequent computations of very large time windows. In this paper we propose a model for batch processing in time window event computations that allows the definition of multiple metrics for performance optimization. These metrics specifically take into account the organization of input data to minimize its impact on computation latency. The model is then instantiated on Hadoop, a batch processing engine based on the MapReduce paradigm, and a set of strategies for efficiently arranging input data is described and evaluated."

Downloads:bib - BibTeX reference



R. Baldoni, G. Di Luna, L. Querzoni
Collaborative Detection of Coordinated Port Scans

In Proceedings of the 14th International Conference on Distributed Computing and Networking (ICDCN), 2013

Abstract [+]

"In this paper we analyze the coordinated port scan attack where a single adversary coordinates a Group of Attackers (GoA) in order to obtain information on a set of target networks. Such orchestration aims at avoiding Local Intrusion Detection Systems checks allowing each host of the GoA to send a very few number of probes to hosts of the target network. In order to detect this complex attack we propose a collaborative architecture where each target network deploys local sensors that send alarms to a collaborative layer. This, in turn, correlates this data with the aim of (i) identifying coordinated attacks while (ii) reducing false positive alarms and (iii) correctly separating GoAs that act concurrently on overlapping targets. Locally deployed sensors adopt graph-based clustering techniques over non-established TCP connections to generate alarms. The collaborative layer employs a similarity approach to aggregate alarms and approximated optimization algorithms to separate distinct GoAs. The soundness of our approach is tested on real network traces. Tests show that collaboration among networks domains is mandatory to achieve accurate detection of coordinated attacks and sharp separation between GoAs that execute concurrent attacks on the same targets."

Downloads:pdf - Technical report version - MIDLAB 1/12
bib - BibTeX reference

Top -


DISCLAIMER - The reports contained in this page are included by the contributing authors as a mechanism to ensure timely dissemination of scholarly/technical information on a non-commerical basis. Copyright and all rights therein are maintained by the authors, despite the fact they have offered this information electronically. It is understood that all individuals copying this information will adhere to the terms/ constraints invoked by each author's copyright. Reports may not be copied for commercial redistribution, republication, or dissemination without the explicit permission of the authors. Sections of some of these reports have been published by IEEE, Springer-Verlag, Kluwer etc. and have Copyright. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works, must be obtained from the publisher.