The power of S4, Yahoo’s distributed stream computing platform, in telco?
In October 2010 Yahoo made another internal system open source: S4. S4 is a general-purpose, distributed, scalable, partially fault-tolerant, pluggable platform that allows programmers to easily...
View ArticleHadoop for Real-Time: Spark, Shark, Spark Streaming, Bagel, etc. will be...
The website defines Spark as a MapReduce-like cluster computing framework designed to support low-latency iterative jobs. However it would be easier to say that Spark is Hadoop for real-time. Spark...
View ArticleScaling Machine Learning
There is currently still a vacuum for easy & scalable solutions in the machine learning space. At the moment everybody is talking about Hadoop as the de-facto standard for Big Data. Unfortunately...
View Article
More Pages to Explore .....