Why big data storage medium like data lake, data warehouse and data lakehouse matters
Hello all, I have a question:
I was reading a bit about big data in general and often, I came across the terms data lake, data warehouse and data lakehouse.
I know each storage medium have their way of storing data, for example, from what I've read, data lake is more suitable to store unstructured data while data warehouse is more suitable for structured data while data lakehouse involves both.
My question is, why can't we just use SQL databases or NoSQL databases?
6 Replies
because Big Data is truly massive
once you have hundreds of millions of rows, query performance starts to degrade, and you have to find solutions that fit
ah ok, everything revolves about performance issues ?
i would say "latency" instead of "performance"
latency is just 1/performance in this case, but sure
it is
yep I see, thanks !