Apache SAMOA

Scalable Advanced Massive Online Analysis

Apache SAMOA is currently undergoing incubation at the Apache Software Foundation.
Latest source release: 0.4.0-incubating
View on GitHub.

Apache SAMOA

Apache Incubator Apache SAMOA is a distributed streaming machine learning (ML) framework that contains a programing abstraction for distributed streaming ML algorithms.

Apache SAMOA enables development of new ML algorithms without directly dealing with the complexity of underlying distributed stream processing engines (DSPEe, such as Apache Storm, Apache Flink, and Apache Samza). Apache SAMOA users can develop distributed streaming ML algorithms once and execute them on multiple DSPEs.


Apache Samoa was presented in Apache Big Data North America, 2016

Apache Samoa was presented in Apache Big Data Europe, 2015


SAMOA Slides

G. De Francisci Morales, A. Bifet. "SAMOA: Scalable Advanced Massive Online Analysis." Journal of Machine Learning Research, 16(Jan):149−153, 2015.


NoSQL matters Conference, Barcelona 2013.

Apache SAMOA Developer's Guide

SAMOA Developer's guide

API Javadoc Reference


Mailing list

Development mailing list dev@samoa.incubator.apache.org
[ subscribe | unsubscribe | archives ]


List of contributors to the SAMOA project.


The use and distribution terms for this software are covered by the Apache License, Version 2.0 (http://www.apache.org/licenses/LICENSE-2.0.html).

Build Apache SAMOA

Getting Started

Apache Incubator

Apache SAMOA is an effort undergoing incubation at The Apache Software Foundation (ASF), sponsored by the Apache Incubator. Incubation is required of all newly accepted projects until a further review indicates that the infrastructure, communications, and decision making process have stabilized in a manner consistent with other successful ASF projects. While incubation status is not necessarily a reflection of the completeness or stability of the code, it does indicate that the project has yet to be fully endorsed by the ASF.

Apache and the Apache feather logo are trademarks of The Apache Software Foundation.