Diving into Big Data : Data lakes vs. data streams

Image Credit: siliconangle.com

Image Credit: siliconangle.com

RSMI Note - The following article illustrates the difference between monitoring social media and capturing social data. The RSMI captures data. 

Data lakes and data streams are becoming common analogies in the discussion around analytics and potential enterprise big data strategies. The analogies are appropriate from several aspects beyond just visualizing different approaches to accessing useful information. As in nature, both lakes and streams have their individual characteristics and are each important to the overall ecosystem. The question is not whether one should exist in favor of the other; both provide benefits but the key is in understanding how to best utilize each for the optimum outcome.

Fishing for Trends in Data Lakes

Data lakes as in nature represent a large pool, in this case a body of information that has been built up over time. Man-made lakes are often formed by the construction of a dam that is placed on a river to harness the power of that river in a controlled fashion for use at a later time.

Data lakes operate in much the same way, with a deluge of information being diverted into a large repository where it can be held for a long period of time until it is needed. The lakes are constantly fed by new water flowing into the lake, and in fact, they are dependent on the constant flow of water to keep the environment vibrant otherwise the lake could stagnate. Read More